2019, Blog, Experimentation and Testing

2019

Blog

Experimentation

Testing

MNIST Expanded: 50,000 New Samples Added

Domino Data Lab

JUNE 13, 2019

Recently, Chhavi Yadav (NYU) and Leon Bottou (Facebook AI Research and NYU) indicated in their paper, “ Cold Case: The Lost MNIST Digits ”, how they reconstructed the MNIST (Modified National Institute of Standards and Technology) dataset and added 50,000 samples to the test set for a total of 60,000 samples. Did they overfit the test set?

Testing

Testing Data Science Experimentation Metadata

Some highlights from 2020

Data Science and Beyond

APRIL 5, 2021

The Australian bushfires of 2019-20 provided me with extra motivation to help nudge Automattic to do more in the fight against climate change. I summarised this work in a post on the company’s blog , and discussed it in an interview with PublishPress. This aligns well with my long-standing interest in causal inference.

Experimentation

Experimentation Testing Publishing Measurement

Join 52,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

MORE WEBINARS

Trending Sources

Towards optimal experimentation in online systems

The Unofficial Google Data Science Blog

APRIL 23, 2024

This blog post discusses such a comprehensive approach that is used at Youtube. To find optimal values of two parameters experimentally, the obvious strategy would be to experiment with and update them in separate, sequential stages. And we can keep repeating this approach, relying on intuition and luck.

Experimentation

Experimentation Optimization Uncertainty Metrics

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

MORE WEBINARS

The DataOps Vendor Landscape, 2021

DataKitchen

APRIL 13, 2021

Read the complete blog below for a more detailed description of the vendors and their capabilities. Testing and Data Observability. It orchestrates complex pipelines, toolchains, and tests across teams, locations, and data centers. Testing and Data Observability. Production Monitoring and Development Testing.

Testing

Testing Machine Learning Consulting Data Quality

Real-Real-World Programming with ChatGPT

O'Reilly on Data

JULY 25, 2023

So far I’ve read a gazillion blog posts about people’s experiences with these AI coding assistance tools. For instance, if I’m reading a paper from 2019, a popular song from that year could start playing. Swift Papers felt like a well-scoped project to test how well AI handles a realistic yet manageable real-world programming task.

Consulting

Consulting Interactive Software IT

Themes and Conferences per Pacoid, Episode 9

Domino Data Lab

MAY 8, 2019

Finale Doshi-Velez, Been Kim (2017-02-28) ; see also the Domino blog article about TCAV. They also require advanced skills in statistics, experimental design, causal inference, and so on – more than most data science teams will have. Other good related papers include: “ Towards A Rigorous Science of Interpretable Machine Learning ”.

Machine Learning

Machine Learning Data Science Modeling Visualization

More VR Misdirection

Perceptual Edge

AUGUST 26, 2019

A serious approach would begin with a thorough understanding of data visualization, which is not Pangilinan’s area of expertise, and would then proceed scientifically by designing and running experimental studies to test its usefulness. Her case is hollow.

Visualization

Visualization Cost-Benefit Interactive Experimentation

The trinity of errors in applying confidence intervals: An exploration using Statsmodels

O'Reilly on Data

DECEMBER 9, 2019

Recall from my previous blog post that all financial models are at the mercy of the Trinity of Errors , namely: errors in model specifications, errors in model parameter estimates, and errors resulting from the failure of a model to adapt to structural changes in its environment. Indeed, we do present a key in this blog post.

Statistics

Statistics Uncertainty Risk Marketing

Omdia Selects DataRobot as Recommended MLOps Vendor

DataRobot

JUNE 2, 2021

They went on to say that investing in MLOps directly answers one of the biggest questions facing AI practitioners in the enterprise: how to move from experimentation to transformation. This blog post explores some of the most interesting findings in the report about the growing importance of MLOps.

Experimentation

Experimentation Machine Learning Reporting Modeling

Smarten Advanced Data Discovery is All the Buzz!

Smarten

MAY 11, 2017

Advanced Data Discovery allows business users to perform early prototyping and to test hypothesis without the skills of a data scientist, ETL or developer. Advanced Data Discovery ensures data democratization by enabling users to drastically reduce the time and cost of analysis and experimentation.

Experimentation

Experimentation Visualization Business Intelligence Predictive Analytics

Data Leaders Brief

MNIST Expanded: 50,000 New Samples Added

Some highlights from 2020

Webinars

Trending Sources

Towards optimal experimentation in online systems

Webinars

The DataOps Vendor Landscape, 2021

Real-Real-World Programming with ChatGPT

Themes and Conferences per Pacoid, Episode 9

More VR Misdirection

The trinity of errors in applying confidence intervals: An exploration using Statsmodels

Omdia Selects DataRobot as Recommended MLOps Vendor

Smarten Advanced Data Discovery is All the Buzz!

Stay Connected