Estimating the prevalence of rare events — theory and practice
The Unofficial Google Data Science Blog
AUGUST 27, 2019
The measurement may be biased if our samples are generated from a procedure that samples without replacement, such as reservoir sampling , especially if some items have disproportionate weight, i.e., $p(v_i) cdot n$ is large. There are many strategies we can use to estimate this quantity, and we will discuss each option in detail.
Let's personalize your content