article thumbnail

Uncertainties: Statistical, Representational, Interventional

The Unofficial Google Data Science Blog

by AMIR NAJMI & MUKUND SUNDARARAJAN Data science is about decision making under uncertainty. Some of that uncertainty is the result of statistical inference, i.e., using a finite sample of observations for estimation. But there are other kinds of uncertainty, at least as important, that are not statistical in nature.

article thumbnail

Regulatory uncertainty overshadows gen AI despite pace of adoption

CIO Business Intelligence

It’s no surprise, then, that according to a June KPMG survey, uncertainty about the regulatory environment was the top barrier to implementing gen AI. So here are some of the strategies organizations are using to deploy gen AI in the face of regulatory uncertainty. We’re still in the pilot phases of evaluating LLMs,” he says.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Measuring Validity and Reliability of Human Ratings

The Unofficial Google Data Science Blog

E ven after we account for disagreement, human ratings may not measure exactly what we want to measure. Researchers and practitioners have been using human-labeled data for many years, trying to understand all sorts of abstract concepts that we could not measure otherwise. That’s the focus of this blog post.

article thumbnail

Business Strategies for Deploying Disruptive Tech: Generative AI and ChatGPT

Rocket-Powered Data Science

Those F’s are: Fragility, Friction, and FUD (Fear, Uncertainty, Doubt). Keep it agile, with short design, develop, test, release, and feedback cycles: keep it lean, and build on incremental changes. Test early and often. Encourage and reward a Culture of Experimentation that learns from failure, “ Test, or get fired!

Strategy 289
article thumbnail

You Can’t Regulate What You Don’t Understand

O'Reilly on Data

If we want prosocial outcomes, we need to design and report on the metrics that explicitly aim for those outcomes and measure the extent to which they have been achieved. And they are stress testing and “ red teaming ” them to uncover vulnerabilities. That is a crucial first step, and we should take it immediately.

Metrics 283
article thumbnail

Towards optimal experimentation in online systems

The Unofficial Google Data Science Blog

the weight given to Likes in our video recommendation algorithm) while $Y$ is a vector of outcome measures such as different metrics of user experience (e.g., Crucially, it takes into account the uncertainty inherent in our experiments. Figure 2: Spreading measurements out makes estimates of model (slope of line) more accurate.

article thumbnail

5 hot IT leadership trends — and 4 going cold

CIO Business Intelligence

He points to a recent observation from GitHub CEO Thomas Dohmke, who noted 40% of computer-generated code was adopted by developers beta testing its Copilot AI automated code-writing system. The test also cut programming time by 55%. “Many people believe this will increase to 80%,” Mehlkopf said. “If

IT 124