Remove 2017 Remove Knowledge Discovery Remove Metrics Remove Testing
article thumbnail

Explaining black-box models using attribute importance, PDPs, and LIME

Domino Data Lab

Because of its architecture, intrinsically explainable ANNs can be optimised not just on its prediction performance, but also on its explainability metric. After forming the X and y variables, we split the data into training and test sets. For sample 23 from the test set, the model is leaning towards a bad credit prediction.

Modeling 139
article thumbnail

ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

Working with highly imbalanced data can be problematic in several aspects: Distorted performance metrics — In a highly imbalanced dataset, say a binary dataset with a class ratio of 98:2, an algorithm that always predicts the majority class and completely ignores the minority class will still be 98% correct. return synthetic. Chawla et al.,