Remove 2008 Remove Knowledge Discovery Remove Modeling Remove Visualization
article thumbnail

ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

In this article we discuss why fitting models on imbalanced datasets is problematic, and how class imbalance is typically addressed. Figure 3 shows visual explanation of how SMOTE generates synthetic observations in this case. Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, 73–79.