Remove 2001 Remove Knowledge Discovery Remove Machine Learning Remove Visualization
article thumbnail

ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

Machine Learning algorithms often need to handle highly-imbalanced datasets. Figure 3 shows visual explanation of how SMOTE generates synthetic observations in this case. A weighted nearest neighbor algorithm for learning with symbolic features. Machine Learning, 57–78. UCI machine learning repository.