ML internals: Synthetic Minority Oversampling (SMOTE) Technique
Domino Data Lab
MAY 20, 2021
In this article we discuss why fitting models on imbalanced datasets is problematic, and how class imbalance is typically addressed. Figure 3 shows visual explanation of how SMOTE generates synthetic observations in this case. References. Banko, M., & Brill, E. Scaling to very very large corpora for natural language disambiguation.
Let's personalize your content