ML internals: Synthetic Minority Oversampling (SMOTE) Technique
Domino Data Lab
MAY 20, 2021
In this article we discuss why fitting models on imbalanced datasets is problematic, and how class imbalance is typically addressed. Insufficient training data in the minority class — In domains where data collection is expensive, a dataset containing 10,000 examples is typically considered to be fairly large. References.
Let's personalize your content