Remove 2002 Remove Data mining Remove Forecasting Remove Machine Learning
article thumbnail

ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

Machine Learning algorithms often need to handle highly-imbalanced datasets. In their 2002 paper Chawla et al. Learning wider regions improves the generalisation of the classifier, as the region of the minority class is not so tightly constrained by the observations in the majority. Generation of artificial examples.