Remove 2002 Remove Data mining Remove Data-driven Remove Measurement
article thumbnail

ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

Further, imbalanced data exacerbates problems arising from the curse of dimensionality often found in such biological data. This renders measures like classification accuracy meaningless. In their 2002 paper Chawla et al. 2002) have performed a comprehensive evaluation of the impact of SMOTE- based up-sampling.