Remove smote-oversampling-technique
article thumbnail

ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

We present the inner workings of the SMOTE algorithm and show a simple “from scratch” implementation of SMOTE. We use an artificially constructed imbalance dataset (based on Iris) to generate synthetic observations via our SMOTE implementation, and discuss modifications that help SMOTE handle categorical attributes.

article thumbnail

Credit Card Fraud Detection using XGBoost, SMOTE, and threshold moving

Domino Data Lab

We’ll use a gradient boosting technique via XGBoost to create a model and I’ll walk you through steps you can take to avoid overfitting and build a model that is fit for purpose and ready for production. In this blog post we will show an implementation of an ML-based anomaly detection based on XGBoost. Oversampling.

article thumbnail

Explaining black-box models using attribute importance, PDPs, and LIME

Domino Data Lab

2016) for an example of this technique (LIME). Intrinsic methods – this technique is based on ANNs that have been designed to output an explanation alongside the standard prediction. Skater uses different techniques depending on the type of the model (e.g. See Ribeiro et al. 1 570 0 570 Name: credit, dtype: int64.

Modeling 139