Remove 2002 Remove Knowledge Discovery Remove Machine Learning Remove Metrics
article thumbnail

ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

Machine Learning algorithms often need to handle highly-imbalanced datasets. In their 2002 paper Chawla et al. def get_neigbours(M, k): nn = NearestNeighbors(n_neighbors=k+1, metric="euclidean").fit(M) 2002) have performed a comprehensive evaluation of the impact of SMOTE- based up-sampling. Chawla et al.,