November, 2015

article thumbnail

The hardest parts of data science

Data Science and Beyond

Contrary to common belief, the hardest part of data science isn’t building an accurate model or obtaining good, clean data. It is much harder to define feasible problems and come up with reasonable ways of measuring solutions. This post discusses some examples of these issues and how they can be addressed. The not-so-hard parts Before discussing the hardest parts of data science, it’s worth quickly addressing the two main contenders: model fitting and data collection/cleaning.

article thumbnail

How to get a job at Google — as a data scientist

The Unofficial Google Data Science Blog

by SEAN GERRISH If you are a regular at this blog, thanks for reading. We will continue to bring you posts from the range of data science activities at Google. This post is different. It is for those who are interested enough in our activities to consider joining us. We briefly highlight some of the things we look for in data scientists we hire at Google and give tips on ways to prepare.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Nutanix and Lenovo

Nutanix

There’s no question that enterprise application usability has for the most part been left behind.

article thumbnail

How To Suck At Social Media: An Indispensable Guide For Businesses

Occam's Razor

Facebook, at last count, has 1.5 billion monthly active users. YouTube has 1.2 billion users (watching 6 billion hours of videos!). Instagram has an estimated 400 million users. Those are some big gigantic numbers! I believe that every human with time to spare, and a connection to the web, should be on social media. The benefits are numerous. Facebook allows you to stay close to people you choose to.

B2B 167
article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Migrating a simple web application from MongoDB to Elasticsearch

Data Science and Beyond

Bandcamp Recommender (BCRecommender) is a web application that serves music recommendations from Bandcamp. I recently switched BCRecommender’s data store from MongoDB to Elasticsearch. This has made it possible to offer a richer search experience to users at a similar cost. This post describes the migration process and discusses some of the advantages and disadvantages of using Elasticsearch instead of MongoDB.

IT 52
article thumbnail

Is Your Organization “VDI Ready”?

Nutanix

20

More Trending

article thumbnail

Is Your Organization “VDI Ready”?

Nutanix

20
article thumbnail

Using Empirical Bayes to approximate posteriors for large "black box" estimators

The Unofficial Google Data Science Blog

by OMKAR MURALIDHARAN Many machine learning applications have some kind of regression at their core, so understanding large-scale regression systems is important. But doing this can be hard, for reasons not typically encountered in problems with smaller or less critical regression systems. In this post, we describe the challenges posed by one problem — how to get approximate posteriors — and an approach that we have found useful.

KDD 40