article thumbnail

Variance and significance in large-scale online services

The Unofficial Google Data Science Blog

There are commercial sites which allow users to search for and purchase goods or book rooms they desire. Well, it turns out that depending on what it cares to measure, an LSOS might not have enough data. That’s more of a business question about the value of the underlying effect than about one’s ability to measure the effect.

article thumbnail

On the Hunt for Patterns: from Hippocrates to Supercomputers

Ontotext

Ever since Hippocrates founded his school of medicine in ancient Greece some 2,500 years ago, writes Hannah Fry in her book Hello World: Being Human in the Age of Algorithms , what has been fundamental to healthcare (as she calls it “the fight to keep us healthy”) was observation, experimentation and the analysis of data. Certainly not!

article thumbnail

Using Empirical Bayes to approximate posteriors for large "black box" estimators

The Unofficial Google Data Science Blog

Posteriors are useful to understand the system, measure accuracy, and make better decisions. Methods like the Poisson bootstrap can help us measure the variability of $t$, but don’t give us posteriors either, particularly since good high-dimensional estimators aren’t unbiased. How exactly should we model $G$?

KDD 40