article thumbnail

How Big Data Impacts The Finance And Banking Industries

Smart Data Collective

A 2013 survey conducted by the IBM’s Institute of Business Value and the University of Oxford showed that 71% of the financial service firms had already adopted analytics and big data. Big Data can efficiently enhance the ways firms utilize predictive models in the risk management discipline. The Underlying Concept.

Big Data 141
article thumbnail

Why you should care about debugging machine learning models

O'Reilly on Data

If a model is going to be used on all kinds of people, it’s best to ensure the training data has a representative distribution of all kinds of people as well. Interpretable ML models and explainable ML. The debugging techniques we propose should work on almost any kind of ML-based predictive model.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Some stuff that’s always on my mind

DMBS2

The ongoing rise of “edge computing” and the “Internet of Things” fit into the general trend that in 2013 I summarized as appliances, clusters and clouds. However, I stand by my overview opinions of last February , and I delivered on some of its IOUs in a two-part series on persuasion.

article thumbnail

Credit Card Fraud Detection using XGBoost, SMOTE, and threshold moving

Domino Data Lab

The dataset contains transactions made by European credit card holders in September 2013, and has been anonymized – Features V1, V2, …, V28 are results from applying PCA on the raw data. The only intact features are Time and Amount. The class label is titled Class where 0 denotes a genuine transaction and 1 signifies fraud.

article thumbnail

Data Science at The New York Times

Domino Data Lab

Diving into examples of building and deploying ML models at The New York Times including the descriptive topic modeling-oriented Readerscope (audience insights engine), a prediction model regarding who was likely to subscribe/cancel their subscription, as well as prescriptive example via recommendations of highly curated editorial content.

article thumbnail

Deep Learning Illustrated: Building Natural Language Processing Models

Domino Data Lab

When running word2vec, you can choose between two underlying model architectures— skip-gram (SG) or continuous bag of words (CBOW; pronounced see-bo)— either of which will typically produce roughly comparable results despite maximizing probabilities from “opposite” perspectives. Note: Mikolov, T., arXiv:1301.3781].

article thumbnail

What Is Embedded Analytics?

Jet Global

Companies like Tableau (which raised over $250 million when it had its IPO in 2013) demonstrated an unmet need in the market. Predictive analytics use a combination of data sets from multiple sources to find relationships and correlations. Users’ varied needs require a shift in traditional BI thinking.