
Estimating the prevalence of rare events — theory and practice

The Unofficial Google Data Science Blog

The measurement may be biased if our samples are generated by a procedure that samples without replacement, such as reservoir sampling, especially if some items have disproportionate weight, i.e., $p(v_i) \cdot n$ is large. There are many strategies we can use to estimate this quantity, and we will discuss each option in detail.
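
To make the source of this bias concrete, here is a minimal simulation sketch. It is not taken from the original post: the toy population, the function names, and the choice of an exponential-key weighted draw as a stand-in for reservoir sampling are all assumptions made for illustration. One heavy item carries half of the total weight, so $p(v_i) \cdot n = 5$ for $n = 10$. Sampling with replacement can pick that item several times, while sampling without replacement can pick it at most once, which caps the naive sample mean at $1/n$.

```python
import heapq
import math
import random


def sample_with_replacement(weights, n, rng):
    """Draw n indices i.i.d. with probability proportional to weight."""
    return rng.choices(range(len(weights)), weights=weights, k=n)


def sample_without_replacement(weights, n, rng):
    """Draw n distinct indices, weighted, in one pass (assumed reservoir-style scheme).

    Each item gets the key log(u) / w with u uniform on (0, 1]; the n largest
    keys are kept, which matches successive weighted draws without replacement.
    """
    keyed = ((math.log(1.0 - rng.random()) / w, i) for i, w in enumerate(weights))
    return [i for _, i in heapq.nlargest(n, keyed)]


def mean_estimate(sampler, weights, is_bad, n, trials, seed=0):
    """Average the naive estimate (fraction of sampled items that are bad) over many trials."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        idx = sampler(weights, n, rng)
        total += sum(is_bad[i] for i in idx) / n
    return total / trials


# Hypothetical toy population: one heavy "bad" item with half the total weight,
# plus 1000 light "good" items.
weights = [1000.0] + [1.0] * 1000
is_bad = [1] + [0] * 1000
true_prevalence = sum(w * b for w, b in zip(weights, is_bad)) / sum(weights)

n = 10
est_wr = mean_estimate(sample_with_replacement, weights, is_bad, n, trials=500)
est_wor = mean_estimate(sample_without_replacement, weights, is_bad, n, trials=500)

print(f"true weighted prevalence:      {true_prevalence:.3f}")
print(f"with replacement (average):    {est_wr:.3f}")
print(f"without replacement (average): {est_wor:.3f}")
```

In this toy setup the with-replacement average should land near the true weighted prevalence, while the without-replacement average cannot exceed $1/n$, because the heavy item contributes at most one of the $n$ sampled slots. The gap shrinks as $p(v_i) \cdot n$ becomes small for every item.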
