article thumbnail

Defining data science in 2018

Data Science and Beyond

I got my first data science job in 2012, the year Harvard Business Review announced data scientist to be the sexiest job of the 21st century. Two years later, I published a post on my then-favourite definition of data science , as the intersection between software engineering and statistics. Things have changed considerably since 2012.

article thumbnail

The curse of Dimensionality

Domino Data Lab

The accuracy of any predictive model approaches 100%. Property 4: The accuracy of any predictive model approaches 100%. This means models can always be found that predict group characteristic with high accuracy. There should be no model to accurately predict even and odd rows with random data.

article thumbnail

Using random effects models in prediction problems

The Unofficial Google Data Science Blog

Often one tries several different techniques and then either combines the outputs or selects the most accurate, and we believe random effects models are a valuable addition to the usual suite of approaches which includes penalized GLMs, decision trees, etc. Compact approximations to bayesian predictive distributions." ICML, (2005). [3]