Remove 2013 Remove Optimization Remove Predictive Modeling Remove Statistics
article thumbnail

Credit Card Fraud Detection using XGBoost, SMOTE, and threshold moving

Domino Data Lab

Rules-based fraud detection (top) vs. classification decision tree-based detection (bottom): The risk scoring in the former model is calculated using policy-based, manually crafted rules and their corresponding weights. Let’s also look at the basic descriptive statistics for all attributes. from sklearn import metrics.

article thumbnail

Data Science at The New York Times

Domino Data Lab

Diving into examples of building and deploying ML models at The New York Times including the descriptive topic modeling-oriented Readerscope (audience insights engine), a prediction model regarding who was likely to subscribe/cancel their subscription, as well as prescriptive example via recommendations of highly curated editorial content.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Deep Learning Illustrated: Building Natural Language Processing Models

Domino Data Lab

Although it’s not perfect, [Note: These are statistical approximations, of course!] GloVe and word2vec differ in their underlying methodology: word2vec uses predictive models, while GloVe is count based. Note: A test set of 19,500 such analogies was developed by Tomas Mikolov and his colleagues in their 2013 word2vec paper.

article thumbnail

What Is Embedded Analytics?

Jet Global

Companies like Tableau (which raised over $250 million when it had its IPO in 2013) demonstrated an unmet need in the market. Advanced Analytics Some apps provide a unique value proposition through the development of advanced (and often proprietary) statistical models. Their dashboards were visually stunning.