Remove Data Science Remove Metrics Remove Statistics Remove Uncertainty
article thumbnail

Uncertainties: Statistical, Representational, Interventional

The Unofficial Google Data Science Blog

by AMIR NAJMI & MUKUND SUNDARARAJAN Data science is about decision making under uncertainty. Some of that uncertainty is the result of statistical inference, i.e., using a finite sample of observations for estimation. This kind of decision making must address particular kinds of uncertainty.

article thumbnail

Humans-in-the-loop forecasting: integrating data science and business planning

The Unofficial Google Data Science Blog

by THOMAS OLAVSON Thomas leads a team at Google called "Operations Data Science" that helps Google scale its infrastructure capacity optimally. This classification is based on the purpose, horizon, update frequency and uncertainty of the forecast. Our team does a lot of forecasting.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Towards optimal experimentation in online systems

The Unofficial Google Data Science Blog

the weight given to Likes in our video recommendation algorithm) while $Y$ is a vector of outcome measures such as different metrics of user experience (e.g., Crucially, it takes into account the uncertainty inherent in our experiments. Here, $X$ is a vector of tuning parameters that control the system's operating characteristics (e.g.

article thumbnail

Three Emerging Analytics Products Derived from Value-driven Data Innovation and Insights Discovery in the Enterprise

Rocket-Powered Data Science

I recently saw an informal online survey that asked users which types of data (tabular, text, images, or “other”) are being used in their organization’s analytics applications. This was not a scientific or statistically robust survey, so the results are not necessarily reliable, but they are interesting and provocative.

article thumbnail

Data Science, Past & Future

Domino Data Lab

Paco Nathan presented, “Data Science, Past & Future” , at Rev. At Rev’s “ Data Science, Past & Future” , Paco Nathan covered contextual insight into some common impactful themes over the decades that also provided a “lens” help data scientists, researchers, and leaders consider the future.

article thumbnail

Measuring Validity and Reliability of Human Ratings

The Unofficial Google Data Science Blog

Once we’ve answered that, we will then define and use metrics to understand the quality of human-labeled data, along with a measurement framework that we call Cross-replication Reliability or xRR. Last, we’ll provide a case study of how xRR can be used to measure improvements in a data-labeling platform.

article thumbnail

Estimating the prevalence of rare events — theory and practice

The Unofficial Google Data Science Blog

Of course, any mistakes by the reviewers would propagate to the accuracy of the metrics, and the metrics calculation should take into account human errors. If we could separate bad videos from good videos perfectly, we could simply calculate the metrics directly without sampling. The missing verdicts create two problems.

Metrics 98