Remove 2002 Remove Knowledge Discovery Remove Machine Learning Remove Visualization
article thumbnail

ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

Machine Learning algorithms often need to handle highly-imbalanced datasets. In their 2002 paper Chawla et al. Figure 3 shows visual explanation of how SMOTE generates synthetic observations in this case. 2002) have performed a comprehensive evaluation of the impact of SMOTE- based up-sampling. Chawla et al.,