
Topics to watch at the Strata Data Conference in New York 2019

O'Reilly on Data

We used a form of the Term Frequency-Inverse Document Frequency (TF/IDF) technique to identify and rank the top terms in this year’s Strata NY proposal topics, as well as those for 2018, 2017, and 2016. 2) is unchanged from Strata NY 2018, it’s up three places from Strata NY 2017 and eight places relative to 2016.
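The excerpt does not say which form of TF/IDF was used, so the following is only a minimal sketch of the general idea: score each term by how often it appears in a proposal, discounted by how many proposals contain it, then rank terms by their summed score. The function names, the smoothed IDF formula, and the choice to sum scores across proposals are all assumptions made for illustration.

```python
import math
from collections import Counter


def tfidf_scores(docs):
    """Sum TF-IDF scores per term over a list of tokenized documents.

    Hypothetical helper: the smoothing and the summing across documents
    are illustrative choices, not the method used in the article.
    """
    n_docs = len(docs)
    # Document frequency: how many documents contain each term at least once.
    df = Counter(term for doc in docs for term in set(doc))
    scores = Counter()
    for doc in docs:
        counts = Counter(doc)
        for term, count in counts.items():
            tf = count / len(doc)  # relative frequency within this document
            idf = math.log((1 + n_docs) / (1 + df[term])) + 1  # smoothed IDF
            scores[term] += tf * idf
    return scores


def rank_terms(docs):
    """Return terms ordered from highest to lowest summed TF-IDF score."""
    return [term for term, _ in tfidf_scores(docs).most_common()]


if __name__ == "__main__":
    # Toy stand-ins for tokenized proposal topics.
    proposals = [["ml", "data"], ["ml", "data"], ["ml", "kafka"]]
    for rank, term in enumerate(rank_terms(proposals), start=1):
        print(rank, term)
```

Running the same ranking on each year's proposals and comparing a term's position from year to year would give the kind of "up three places" comparison the excerpt describes.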


Deep Learning Illustrated: Building Natural Language Processing Models

Domino Data Lab

GloVe and word2vec differ in their underlying methodology: word2vec uses predictive models, while GloVe is count-based. We waved our finger in the air to select 64, so some experimentation and optimization are warranted at your end if you feel like it. Note: Google Translate has incorporated neural machine translation (NMT) since 2016.
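To make the predictive versus count-based distinction concrete, here is a rough sketch of the raw material each family starts from. It is not either library's implementation: the function names and the window size are hypothetical, and real systems add weighting, subsampling, and an actual training step. A count-based method aggregates co-occurrence statistics over the whole corpus first, while a predictive method sees (center, context) pairs one at a time and learns to predict one from the other.

```python
from collections import Counter


def context_indices(n_tokens, i, window):
    """Indices within `window` positions of i, excluding i itself."""
    lo = max(0, i - window)
    hi = min(n_tokens, i + window + 1)
    return [j for j in range(lo, hi) if j != i]


def cooccurrence_counts(tokens, window=2):
    """Global (word, context) counts: the starting point for count-based methods."""
    counts = Counter()
    for i, center in enumerate(tokens):
        for j in context_indices(len(tokens), i, window):
            counts[(center, tokens[j])] += 1
    return counts


def skipgram_pairs(tokens, window=2):
    """Individual (center, context) examples a predictive model trains on."""
    return [
        (center, tokens[j])
        for i, center in enumerate(tokens)
        for j in context_indices(len(tokens), i, window)
    ]


if __name__ == "__main__":
    tokens = "the cat sat on the mat".split()
    print(cooccurrence_counts(tokens, window=1).most_common(3))
    print(skipgram_pairs(tokens, window=1)[:3])
```

Both functions walk the same context windows; the difference is whether the pairs are summed into a table up front or fed to a model as separate training examples.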