article thumbnail

Fundamentals of Data Mining

Data Science 101

Data mining is the process of discovering these patterns among the data and is therefore also known as Knowledge Discovery from Data (KDD). The patterns discovered after this step are interpreted using various visualization and reporting techniques and are made comprehensible for other team members to understand. Deployment.

article thumbnail

Using Empirical Bayes to approximate posteriors for large "black box" estimators

The Unofficial Google Data Science Blog

For an introduction to Empirical Bayes, see the paper [3] by Brad Efron (with more in his book [4]). References [1] Omkar Muralidharan, Amir Najmi "Second Order Calibration: A Simple Way To Get Approximate Posteriors" , Technical Report, Google, 2015. [2] In Figure 2, the red line shows a Gamma prior that leads to a good fit.

KDD 40