2005, Statistics, Testing and Uncertainty

Measuring Validity and Reliability of Human Ratings

The Unofficial Google Data Science Blog

JULY 18, 2023

Editor's note : The relationship between reliability and validity are somewhat analogous to that between the notions of statistical uncertainty and representational uncertainty introduced in an earlier post. But for more complicated metrics like xRR, our preference is to bootstrap when measuring uncertainty.

Measurement

Measurement Metrics Uncertainty Slice and Dice

Using random effects models in prediction problems

The Unofficial Google Data Science Blog

MARCH 31, 2016

We often use statistical models to summarize the variation in our data, and random effects models are well suited for this — they are a form of ANOVA after all. In the context of prediction problems, another benefit is that the models produce an estimate of the uncertainty in their predictions: the predictive posterior distribution.

Modeling

Modeling Statistics Advertising Testing

Data Leaders Brief

Measuring Validity and Reliability of Human Ratings

Using random effects models in prediction problems

Webinars

Stay Connected