Measuring Validity and Reliability of Human Ratings

The Unofficial Google Data Science Blog

Once we’ve answered that, we will define and use metrics to understand the quality of human-labeled data, along with a measurement framework that we call Cross-replication Reliability, or xRR. Finally, we’ll provide a case study of how xRR can be used to measure improvements in a data-labeling platform.