ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

Their tests are performed using C4.5-generated ... This carries the risk of this modification performing worse than simpler approaches like majority under-sampling. Chawla et al. note that this variant “performs worse than plain under-sampling based on AUC” when tested on the Adult dataset (Dua & Graff, 2017).