2012, Data Science and Predictive Modeling

2012

Data Science

Predictive Modeling

Defining data science in 2018

Data Science and Beyond

JULY 22, 2018

I got my first data science job in 2012, the year Harvard Business Review announced data scientist to be the sexiest job of the 21st century. Two years later, I published a post on my then-favourite definition of data science , as the intersection between software engineering and statistics.

Data Science

Data Science Machine Learning Statistics Predictive Modeling

Structural Evolutions in Data

O'Reilly on Data

SEPTEMBER 19, 2023

While data scientists were no longer handling Hadoop-sized workloads, they were trying to build predictive models on a different kind of “large” dataset: so-called “unstructured data.” ” There’s as much Keras, TensorFlow, and Torch today as there was Hadoop back in 2010-2012.

Machine Learning

Machine Learning Testing Modeling Cost-Benefit

Join 52,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

How To Get Promoted In Product Management

MORE WEBINARS

Trending Sources

Analytics Vidhya

The curse of Dimensionality

Domino Data Lab

OCTOBER 7, 2020

There are four properties of high dimensional data: Points move far away from each other in high dimensions. The accuracy of any predictive model approaches 100%. Property 4: The accuracy of any predictive model approaches 100%. There should be no model to accurately predict even and odd rows with random data.

Statistics

Statistics Testing Predictive Modeling Modeling

Webinars

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

How To Get Promoted In Product Management

MORE WEBINARS

Credit Card Fraud Detection using XGBoost, SMOTE, and threshold moving

Domino Data Lab

APRIL 21, 2021

The complete dataset and code used in this blog post is available at try.dominodatalab.com, and all results shown here are fully reproducible, thanks to the Domino reproducibility engine, which is part of the Domino Data Science platform. Knowledge and Data Engineering, IEEE Transactions on, 21, 1263-1284. References. [1]

Statistics

Statistics Machine Learning Modeling Metrics

Using random effects models in prediction problems

The Unofficial Google Data Science Blog

MARCH 31, 2016

We have many routine analyses for which the sparsity pattern is closer to the nested case and lme4 scales very well; however, our prediction models tend to have input data that looks like the simulation on the right. Compact approximations to bayesian predictive distributions." Cambridge University Press, (2012). [4]

Modeling

Modeling Statistics Advertising Testing

Data Leaders Brief

Defining data science in 2018

Structural Evolutions in Data

Webinars

Trending Sources

The curse of Dimensionality

Webinars

Credit Card Fraud Detection using XGBoost, SMOTE, and threshold moving

Using random effects models in prediction problems

Stay Connected