ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

In this article we discuss why fitting models on imbalanced datasets is problematic and how class imbalance is typically addressed. [...] This carries the risk of this modification performing worse than simpler approaches like majority under-sampling. [...] Indeed, in the original paper Chawla et al. [...] References: Chawla, N. [...]