ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

Machine learning algorithms often need to handle highly imbalanced datasets. This carries the risk of this modification performing worse than simpler approaches like majority under-sampling. A weighted nearest neighbor algorithm for learning with symbolic features. Machine Learning, 57–78. Chawla et al.

Explaining black-box models using attribute importance, PDPs, and LIME

Domino Data Lab

Interest in interpreting machine learning models has accelerated rapidly over the last decade. This can be attributed to the popularity that machine learning algorithms, and more specifically deep learning, have been gaining in various domains. PDPs for the bicycle count prediction model (Molnar, 2009).
