Measurement, Slice and Dice, Testing and Uncertainty

Measuring Validity and Reliability of Human Ratings

The Unofficial Google Data Science Blog

JULY 18, 2023

Even after we account for disagreement, human ratings may not measure exactly what we want to measure. Researchers and practitioners have been using human-labeled data for many years, trying to understand all sorts of abstract concepts that we could not measure otherwise. That’s the focus of this blog post.

Measurement

Measurement, Metrics, Uncertainty, Slice and Dice

Data scientist as scientist

The Unofficial Google Data Science Blog

OCTOBER 21, 2015

The beliefs of this community are always evolving, and the process of thoughtfully generating, testing, refuting and accepting ideas looks a lot like Science. Note also that this account does not involve ambiguity due to statistical uncertainty. We sliced and diced the experimental data in many, many ways.

Slice and Dice

Slice and Dice, Experimentation, Data-driven, Data Science
