article thumbnail

Towards optimal experimentation in online systems

The Unofficial Google Data Science Blog

To find optimal values of two parameters experimentally, the obvious strategy would be to experiment with and update them in separate, sequential stages. Our experimentation platform supports this kind of grouped-experiments analysis, which allows us to see rough summaries of our designed experiments without much work.

article thumbnail

6 trends framing the state of AI and ML

O'Reilly on Data

Reinforcement learning fell by 5% in 2019; it’s up hugely—1,500+%—since 2017, however. Aggregating artificial intelligence and machine learning topics accounts for nearly 5% of all usage activity on the platform, a touch less than, and growing 50% faster than, the well-established “data science” topic (see Figure 2).

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Changing assignment weights with time-based confounders

The Unofficial Google Data Science Blog

Instead, we focus on the case where an experimenter has decided to run a full traffic ramp-up experiment and wants to use the data from all of the epochs in the analysis. When there are changing assignment weights and time-based confounders, this complication must be considered either in the analysis or the experimental design.

article thumbnail

Themes and Conferences per Pacoid, Episode 11

Domino Data Lab

In other words, using metadata about data science work to generate code. In this case, code gets generated for data preparation, where so much of the “time and labor” in data science work is concentrated. The approach they’ve used applies to other popular data science APIs such as NumPy , Tensorflow , and so on.

Metadata 105
article thumbnail

Themes and Conferences per Pacoid, Episode 9

Domino Data Lab

The lens of reductionism and an overemphasis on engineering becomes an Achilles heel for data science work. Instead, consider a “full stack” tracing from the point of data collection all the way out through inference. Other good related papers include: “ Towards A Rigorous Science of Interpretable Machine Learning ”.

article thumbnail

Is Google Cloud Platform Ready to Run Your Data Analytics Pipeline?

Sanjeev Mohan

I spent the majority of my time helping clients decide which was the right Hadoop platform and which NoSQL / nonrelational data store to pick for specific use cases. Fast forward to early 2017. Then in the middle of 2017, a realization set in that we were one year away from GDPR and needed to focus on data governance.

article thumbnail

Themes and Conferences per Pacoid, Episode 6

Domino Data Lab

In Paco Nathan ‘s latest column, he explores the role of curiosity in data science work as well as Rev 2 , an upcoming summit for data science leaders. Welcome back to our monthly series about data science. and dig into details about where science meets rhetoric in data science.