article thumbnail

Experiment design and modeling for long-term studies in ads

The Unofficial Google Data Science Blog

A/B testing is used widely in information technology companies to guide product development and improvements. For questions as disparate as website design and UI, prediction algorithms, or user flows within apps, live traffic tests help developers understand what works well for users and the business, and what doesn’t.

article thumbnail

Density-Based Clustering

Domino Data Lab

Due to its importance in both theory and applications, this algorithm is one of three algorithms awarded the Test of Time Award at the KDD conference in 2014. To test out DBSCAN, I’m going to use a dataset consisting of annual customer data for a wholesale distributor. Application.

Metrics 116
article thumbnail

Using Empirical Bayes to approximate posteriors for large "black box" estimators

The Unofficial Google Data Science Blog

For more on ad CTR estimation, refer to [2]. One way to check $f_theta$ is to gather test data and check whether the model fits the relationship between training and test data. Figure 4 shows the results of such a test. A machine learning system produces an estimated CTR $t_i$ for each query-ad pair.

KDD 40