Remove 2017 Remove Knowledge Discovery Remove Measurement Remove Testing
article thumbnail

Changing assignment weights with time-based confounders

The Unofficial Google Data Science Blog

Another reason to use ramp-up is to test if a website's infrastructure can handle deploying a new arm to all of its users. The website wants to make sure they have the infrastructure to handle the feature while testing if engagement increases enough to justify the infrastructure. We offer two examples where this may be the case.

article thumbnail

ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

This renders measures like classification accuracy meaningless. Their tests are performed using C4.5-generated note that this variant “performs worse than plain under-sampling based on AUC” when tested on the Adult dataset (Dua & Graff, 2017). The use of multiple measurements in taxonomic problems. Quinlan, J.

article thumbnail

Explaining black-box models using attribute importance, PDPs, and LIME

Domino Data Lab

After forming the X and y variables, we split the data into training and test sets. but it generally relies on measuring the entropy in the change of predictions given a perturbation of a feature. Next, we pick a sample that we want to get an explanation for, say the first sample from our test dataset (sample id 0). References.

Modeling 139