Remove 2009 Remove Knowledge Discovery Remove Risk Remove Visualization
article thumbnail

ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

Figure 3 shows visual explanation of how SMOTE generates synthetic observations in this case. This carries the risk of this modification performing worse than simpler approaches like majority under-sampling. Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, 73–79. Chawla et al.

article thumbnail

Explaining black-box models using attribute importance, PDPs, and LIME

Domino Data Lab

Skater provides a wide range of algorithms that can be used for visual interpretation (e.g. This dataset classifies customers based on a set of attributes into two credit risk groups – good or bad. This is to be expected, as there is no reason for a perfect 50:50 separation of the good vs. bad credit risk.

Modeling 139