ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

Further, imbalanced data exacerbates problems arising from the curse of dimensionality often found in such biological data. Insufficient training data in the minority class: in domains where data collection is expensive, a dataset containing 10,000 examples is typically considered to be fairly large.

Explaining black-box models using attribute importance, PDPs, and LIME

Domino Data Lab

Because of their architecture, intrinsically explainable ANNs can be optimised not just on their prediction performance, but also on their explainability metric. PDPs for the bicycle count prediction model (Molnar, 2009). Courville, Pascal Vincent, Visualizing Higher-Layer Features of a Deep Network, 2009. Ribeiro, M. Guestrin, C.,