Remove 2009 Remove Knowledge Discovery Remove Machine Learning Remove Visualization
article thumbnail

ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

Machine Learning algorithms often need to handle highly-imbalanced datasets. Figure 3 shows visual explanation of how SMOTE generates synthetic observations in this case. A weighted nearest neighbor algorithm for learning with symbolic features. Machine Learning, 57–78. UCI machine learning repository.

article thumbnail

Explaining black-box models using attribute importance, PDPs, and LIME

Domino Data Lab

The interest in interpretation of machine learning has been rapidly accelerating in the last decade. This can be attributed to the popularity that machine learning algorithms, and more specifically deep learning, has been gaining in various domains. PDPs for the bicycle count prediction model (Molnar, 2009).

Modeling 139