article thumbnail

The curse of Dimensionality

Domino Data Lab

Statistical methods for analyzing this two-dimensional data exist. This statistical test is correct because the data are (presumably) bivariate normal. When there are many variables the Curse of Dimensionality changes the behavior of data and standard statistical methods give the wrong answers. Data Has Properties.

article thumbnail

Deep Learning Illustrated: Building Natural Language Processing Models

Domino Data Lab

Although it’s not perfect, [Note: These are statistical approximations, of course!] At the time—in 2014—the three were colleagues working. GloVe and word2vec differ in their underlying methodology: word2vec uses predictive models, while GloVe is count based. Example 11.6 Pennington, J., GloVe: Global vectors.

article thumbnail

Data Science at The New York Times

Domino Data Lab

Diving into examples of building and deploying ML models at The New York Times including the descriptive topic modeling-oriented Readerscope (audience insights engine), a prediction model regarding who was likely to subscribe/cancel their subscription, as well as prescriptive example via recommendations of highly curated editorial content.