Remove 2017 Remove Data-driven Remove Experimentation Remove Optimization
article thumbnail

Towards optimal experimentation in online systems

The Unofficial Google Data Science Blog

If the relationship of $X$ to $Y$ can be approximated as quadratic (or any polynomial), the objective and constraints as linear in $Y$, then there is a way to express the optimization as a quadratically constrained quadratic program (QCQP). However, joint optimization is possible by increasing both $x_1$ and $x_2$ at the same time.

article thumbnail

Themes and Conferences per Pacoid, Episode 11

Domino Data Lab

Paco Nathan ‘s latest article covers program synthesis, AutoPandas, model-driven data queries, and more. In other words, using metadata about data science work to generate code. In this case, code gets generated for data preparation, where so much of the “time and labor” in data science work is concentrated.

Metadata 105
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Themes and Conferences per Pacoid, Episode 6

Domino Data Lab

In Paco Nathan ‘s latest column, he explores the role of curiosity in data science work as well as Rev 2 , an upcoming summit for data science leaders. Welcome back to our monthly series about data science. and dig into details about where science meets rhetoric in data science. Introduction. This is not that.

article thumbnail

What Are ChatGPT and Its Friends?

O'Reilly on Data

Many of these go slightly (but not very far) beyond your initial expectations: you can ask it to generate a list of terms for search engine optimization, you can ask it to generate a reading list on topics that you’re interested in. Unlike labels, embeddings are learned from the training data, not produced by humans.

IT 262
article thumbnail

Themes and Conferences per Pacoid, Episode 9

Domino Data Lab

For example, common practices for collecting data to build training datasets tend to throw away valuable information along the way. The lens of reductionism and an overemphasis on engineering becomes an Achilles heel for data science work. Finale Doshi-Velez, Been Kim (2017-02-28) ; see also the Domino blog article about TCAV.

article thumbnail

Topics to watch at the Strata Data Conference in New York 2019

O'Reilly on Data

Machine learning, artificial intelligence, data engineering, and architecture are driving the data space. The Strata Data Conferences helped chronicle the birth of big data, as well as the emergence of data science, streaming, and machine learning (ML) as disruptive phenomena. The term “AI,” meanwhile, is No.

IoT 20
article thumbnail

Why adopt a hybrid, multi-cloud strategy?

Cloudera

of application workloads were still on-premises in enterprise data centers; by the end of 2017, less than half (47.2%) were on-premises. A hybrid, multi-cloud strategy is the best approach to managing these distributed, heterogeneous data ecosystems. Enterprises are moving to the cloud. In 2016, 60.9% Future proof.