Remove 2008 Remove Knowledge Discovery Remove Marketing Remove Testing
article thumbnail

ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

Their tests are performed using C4.5-generated note that this variant “performs worse than plain under-sampling based on AUC” when tested on the Adult dataset (Dua & Graff, 2017). Data mining for direct marketing: Problems and solutions. Chawla et al., Pima Indian Diabetes (Smith et al., 1988), E-state data (Hall et al.,