article thumbnail

Software commodities are eating interesting data science work

Data Science and Beyond

When I started my PhD in 2009, the plan was to work on sentiment analysis of opinion polls. This got me into applied machine learning using Java and Weka , with which I made some modest contributions to the field. What can one do to remain relevant in such an environment? Read this post to find out. Highlights from my past.

Software 103
article thumbnail

Adding Common Sense to Machine Learning with TensorFlow Lattice

The Unofficial Google Data Science Blog

by TAMAN NARAYAN & SEN ZHAO A data scientist is often in possession of domain knowledge which she cannot easily apply to the structure of the model. On the other hand, sophisticated machine learning models are flexible in their form but not easy to control. On the one hand, basic statistical models (e.g.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Data Science at The New York Times

Domino Data Lab

Chris Wiggins , Chief Data Scientist at The New York Times, presented “Data Science at the New York Times” at Rev. Wiggins also indicated that data science, data engineering, and data analysis are different groups at The New York Times. Data science. Session Summary.

article thumbnail

Explaining black-box models using attribute importance, PDPs, and LIME

Domino Data Lab

The interest in interpretation of machine learning has been rapidly accelerating in the last decade. This can be attributed to the popularity that machine learning algorithms, and more specifically deep learning, has been gaining in various domains. PDPs for the bicycle count prediction model (Molnar, 2009).

Modeling 139
article thumbnail

ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

Machine Learning algorithms often need to handle highly-imbalanced datasets. The dataset and code used in this blog post are available at [link] and all results shown here are fully reproducible, thanks to the Domino reproducibility engine, which is part of the Domino Data Science platform. Machine Learning, 57–78.

article thumbnail

Themes and Conferences per Pacoid, Episode 9

Domino Data Lab

I’ve been out themespotting and this month’s article features several emerging threads adjacent to the interpretability of machine learning models. The lens of reductionism and an overemphasis on engineering becomes an Achilles heel for data science work. Machine learning model interpretability.

article thumbnail

Exploring US Real Estate Values with Python

Domino Data Lab

This post covers data exploration using machine learning and interactive plotting. Models are at the heart of data science. Data exploration is vital to model development and is particularly important at the start of any data science project. Housing Market Bottom in December 2009.