article thumbnail

Data Science, Past & Future

Domino Data Lab

Paco Nathan presented, “Data Science, Past & Future” , at Rev. At Rev’s “ Data Science, Past & Future” , Paco Nathan covered contextual insight into some common impactful themes over the decades that also provided a “lens” help data scientists, researchers, and leaders consider the future.

article thumbnail

Towards optimal experimentation in online systems

The Unofficial Google Data Science Blog

Crucially, it takes into account the uncertainty inherent in our experiments. Experiments, Parameters and Models At Youtube, the relationships between system parameters and metrics often seem simple — straight-line models sometimes fit our data well. It is a big picture approach, worthy of your consideration.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

The trinity of errors in financial models: An introductory analysis using TensorFlow Probability

O'Reilly on Data

They trade the markets using quantitative models based on non-financial theories such as information theory, data science, and machine learning. Whether financial models are based on academic theories or empirical data mining strategies, they are all subject to the trinity of modeling errors explained below. Thompson, L.S.

Modeling 133
article thumbnail

Cloudera + Hortonworks, from the Edge to AI

Cloudera

That team delivered the first production cluster in 2006 and continued to improve it in the years that followed. In 2008, I co-founded Cloudera with folks from Google, Facebook, and Yahoo to deliver a big data platform built on Hadoop to the enterprise market. Yahoo quickly recognized the promise of the project.

article thumbnail

Our quest for robust time series forecasting at scale

The Unofficial Google Data Science Blog

Quantification of forecast uncertainty via simulation-based prediction intervals. We conclude with an example of our forecasting routine applied to publicly available Turkish Electricity data. Prediction Intervals A statistical forecasting system should not lack uncertainty quantification.

article thumbnail

Fitting Bayesian structural time series with the bsts R package

The Unofficial Google Data Science Blog

SCOTT Time series data are everywhere, but time series modeling is a fairly specialized area within statistics and data science. Introduction Time series data appear in a surprising number of applications, ranging from business, to the physical and social sciences, to health, medicine, and engineering. Holmes, C.

article thumbnail

Using random effects models in prediction problems

The Unofficial Google Data Science Blog

In the context of prediction problems, another benefit is that the models produce an estimate of the uncertainty in their predictions: the predictive posterior distribution. Data analysis using regression and multilevel/hierarchical models." Cambridge University Press, (2006). [2] bandit problems). ICML, (2005). [3]