Fundamentals of Data Mining

Data Science 101

Raw data on its own makes little sense until the patterns that relate it are identified. Data mining is the process of discovering these patterns in the data and is therefore also known as Knowledge Discovery from Data (KDD). Machine learning provides the technical basis for data mining.

Using Empirical Bayes to approximate posteriors for large "black box" estimators

The Unofficial Google Data Science Blog

But most common machine learning methods don’t give posteriors, and many don’t have explicit probability models. More precisely, our model is that $\theta$ is drawn from a prior that depends on $t$, then $y$ comes from some known parametric family $f_\theta$. Here, our items are query-ad pairs. Calculate posterior quantities of interest.
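
To make the model in this excerpt concrete, here is a minimal sketch of one possible instance. It is not the method from the original post. It assumes a normal prior whose mean is linear in $t$, $\theta \sim N(a + bt, \tau^2)$, and a normal likelihood $y \mid \theta \sim N(\theta, \sigma^2)$ with known $\sigma^2$. The function names `fit_prior` and `posterior`, the linear prior mean, and the simulated data are all illustrative assumptions.

```python
import random


def fit_prior(t, y, noise_var):
    """Estimate the assumed prior theta ~ Normal(a + b*t, tau2) from (t, y) pairs.

    a and b come from ordinary least squares of y on t; tau2 comes from
    moment matching, since the residual variance is roughly tau2 + noise_var.
    """
    n = len(y)
    t_mean = sum(t) / n
    y_mean = sum(y) / n
    sxx = sum((ti - t_mean) ** 2 for ti in t)
    sxy = sum((ti - t_mean) * (yi - y_mean) for ti, yi in zip(t, y))
    b = sxy / sxx
    a = y_mean - b * t_mean
    resid_var = sum((yi - (a + b * ti)) ** 2 for ti, yi in zip(t, y)) / (n - 2)
    tau2 = max(resid_var - noise_var, 0.0)  # clip: a variance cannot be negative
    return a, b, tau2


def posterior(ti, yi, a, b, tau2, noise_var):
    """Normal-normal conjugate update: posterior mean and variance of theta."""
    prior_mean = a + b * ti
    w = tau2 / (tau2 + noise_var)  # weight placed on the observation
    return w * yi + (1 - w) * prior_mean, w * noise_var


# Simulated data (hypothetical, for illustration only).
random.seed(0)
noise_var = 1.0
t = [random.uniform(0.0, 5.0) for _ in range(2000)]
theta = [1.0 + 2.0 * ti + random.gauss(0.0, 0.5) for ti in t]
y = [th + random.gauss(0.0, noise_var ** 0.5) for th in theta]

a, b, tau2 = fit_prior(t, y, noise_var)
post_mean, post_var = posterior(t[0], y[0], a, b, tau2, noise_var)
print(a, b, tau2, post_mean, post_var)
```

The posterior mean shrinks each observation toward the prior mean for its $t$, with more shrinkage when the estimated prior variance is small relative to the noise. A real "black box" setting would likely need a more flexible prior than this linear one.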
