Remove 2002 Remove Knowledge Discovery Remove Machine Learning
article thumbnail

ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

Machine Learning algorithms often need to handle highly-imbalanced datasets. In their 2002 paper Chawla et al. Learning wider regions improves the generalisation of the classifier, as the region of the minority class is not so tightly constrained by the observations in the majority. Generation of artificial examples.