Remove 2002 Remove Data mining Remove Machine Learning Remove Metrics
article thumbnail

ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

Machine Learning algorithms often need to handle highly-imbalanced datasets. Further, imbalanced data exacerbates problems arising from the curse of dimensionality often found in such biological data. In their 2002 paper Chawla et al. Generation of artificial examples. return synthetic. Chawla et al.,