Data Science, Metrics, Statistics and Uncertainty

Data Science

Metrics

Statistics

Uncertainty

Uncertainties: Statistical, Representational, Interventional

The Unofficial Google Data Science Blog

DECEMBER 14, 2021

by AMIR NAJMI & MUKUND SUNDARARAJAN Data science is about decision making under uncertainty. Some of that uncertainty is the result of statistical inference, i.e., using a finite sample of observations for estimation. This kind of decision making must address particular kinds of uncertainty.

Uncertainty

Uncertainty Statistics Measurement Cost-Benefit

Humans-in-the-loop forecasting: integrating data science and business planning

The Unofficial Google Data Science Blog

DECEMBER 4, 2019

by THOMAS OLAVSON Thomas leads a team at Google called "Operations Data Science" that helps Google scale its infrastructure capacity optimally. This classification is based on the purpose, horizon, update frequency and uncertainty of the forecast. Our team does a lot of forecasting.

Forecasting

Forecasting Data Science Statistics Uncertainty

Join 52,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

MORE WEBINARS

Trending Sources

Towards optimal experimentation in online systems

The Unofficial Google Data Science Blog

APRIL 23, 2024

the weight given to Likes in our video recommendation algorithm) while $Y$ is a vector of outcome measures such as different metrics of user experience (e.g., Crucially, it takes into account the uncertainty inherent in our experiments. Here, $X$ is a vector of tuning parameters that control the system's operating characteristics (e.g.

Experimentation

Experimentation Optimization Uncertainty Metrics

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

MORE WEBINARS

Three Emerging Analytics Products Derived from Value-driven Data Innovation and Insights Discovery in the Enterprise

Rocket-Powered Data Science

JULY 19, 2023

I recently saw an informal online survey that asked users which types of data (tabular, text, images, or “other”) are being used in their organization’s analytics applications. This was not a scientific or statistically robust survey, so the results are not necessarily reliable, but they are interesting and provocative.

Data-driven

Data-driven Enterprise Analytics Machine Learning

Data Science, Past & Future

Domino Data Lab

JULY 22, 2019

Paco Nathan presented, “Data Science, Past & Future” , at Rev. At Rev’s “ Data Science, Past & Future” , Paco Nathan covered contextual insight into some common impactful themes over the decades that also provided a “lens” help data scientists, researchers, and leaders consider the future.

Data Science

Data Science Machine Learning Data Governance Modeling

Measuring Validity and Reliability of Human Ratings

The Unofficial Google Data Science Blog

JULY 18, 2023

Once we’ve answered that, we will then define and use metrics to understand the quality of human-labeled data, along with a measurement framework that we call Cross-replication Reliability or xRR. Last, we’ll provide a case study of how xRR can be used to measure improvements in a data-labeling platform.

Measurement

Measurement Metrics Uncertainty Slice and Dice

Estimating the prevalence of rare events — theory and practice

The Unofficial Google Data Science Blog

AUGUST 27, 2019

Of course, any mistakes by the reviewers would propagate to the accuracy of the metrics, and the metrics calculation should take into account human errors. If we could separate bad videos from good videos perfectly, we could simply calculate the metrics directly without sampling. The missing verdicts create two problems.

Metrics

Metrics Statistics Uncertainty Optimization

Variance and significance in large-scale online services

The Unofficial Google Data Science Blog

JANUARY 14, 2016

by AMIR NAJMI Running live experiments on large-scale online services (LSOS) is an important aspect of data science. Because individual observations have so little information, statistical significance remains important to assess. We must therefore maintain statistical rigor in quantifying experimental uncertainty.

Experimentation

Experimentation Statistics Metrics Measurement

Predicting Movie Profitability and Risk at the Pre-production Phase

Insight

FEBRUARY 19, 2020

Using variability in machine learning predictions as a proxy for risk can help studio executives and producers decide whether or not to green light a film project Photo by Kyle Smith on Unsplash Originally posted on Toward Data Science. Hollywood is a $10 billion-a-year industry, and movies range from huge hits to box office bombs.

Risk

Risk ROI Modeling Metrics

Fact-based Decision-making

Peter James Thomas

AUGUST 12, 2018

Again see Using BI to drive improvements in data quality for further details. Pertinence and fidelity of metrics developed from Data. Here we get past issues with data itself (or how it is handled and moved around) and instead consider how it is used. There are often compromises to be made in defining metrics.

Metrics

Metrics Statistics Data Quality Measurement

LSOS experiments: how I learned to stop worrying and love the variability

The Unofficial Google Data Science Blog

FEBRUARY 29, 2016

In this post we explore why some standard statistical techniques to reduce variance are often ineffective in this “data-rich, information-poor” realm. Despite a very large number of experimental units, the experiments conducted by LSOS cannot presume statistical significance of all effects they deem practically significant.

Experimentation

Experimentation Metrics Statistics Measurement

Misadventures in experiments for growth

The Unofficial Google Data Science Blog

APRIL 16, 2019

Such decisions involve an actual hypothesis test on specific metrics (e.g. Often, an established product will have an overall evaluation criterion (OEC) that incorporates trade-offs among important metrics and between short- and long-term success. The metrics to measure the impact of the change might not yet be established.

Experimentation

Experimentation Sales Metrics Measurement

Product Management for AI

Domino Data Lab

JUNE 23, 2019

Companies with successful ML projects are often companies that already have an experimental culture in place as well as analytics that enable them to learn from data. Ensure that product managers work on projects that matter to the business and/or are aligned to strategic company metrics. That’s another pattern.

Management

Management Machine Learning Experimentation Metrics

Estimating causal effects using geo experiments

The Unofficial Google Data Science Blog

MAY 31, 2016

This means it is possible to specify exactly in which geos an ad campaign will be served – and to observe the ad spend and the response metric at the geo level. In other words, iROAS is the slope of a curve of the response metric plotted against the underlying advertising spend. They are non-overlapping geo-targetable regions.

Advertising

Advertising Testing Sales Statistics

Data scientist as scientist

The Unofficial Google Data Science Blog

OCTOBER 21, 2015

Our post describes how we arrived at recent changes to design principles for the Google search page, and thus highlights aspects of a data scientist’s role which involve practicing the scientific method. There has been debate as to whether the term “data science” is necessary. Some don’t see the point.

Slice and Dice

Slice and Dice Experimentation Data-driven Data Science

Data Leaders Brief

Uncertainties: Statistical, Representational, Interventional

Humans-in-the-loop forecasting: integrating data science and business planning

Webinars

Trending Sources

Towards optimal experimentation in online systems

Webinars

Three Emerging Analytics Products Derived from Value-driven Data Innovation and Insights Discovery in the Enterprise

Data Science, Past & Future

Measuring Validity and Reliability of Human Ratings

Estimating the prevalence of rare events — theory and practice

Variance and significance in large-scale online services

Predicting Movie Profitability and Risk at the Pre-production Phase

Fact-based Decision-making

LSOS experiments: how I learned to stop worrying and love the variability

Misadventures in experiments for growth

Product Management for AI

Estimating causal effects using geo experiments

Data scientist as scientist

Stay Connected