Remove 2001 Remove 2017 Remove Risk Remove Testing
article thumbnail

Themes and Conferences per Pacoid, Episode 8

Domino Data Lab

Also, while surveying the literature two key drivers stood out: Risk management is the thin-edge-of-the-wedge ?for My read of that narrative arc is that some truly weird tensions showed up circa 2001: Arguably, it’s the heyday of DW+BI. A very big mess since circa 2001, and now becoming quite a dangerous mess. a second priority?at

article thumbnail

ML internals: Synthetic Minority Oversampling (SMOTE) Technique

Domino Data Lab

The problem with this approach is that in highly imbalanced sets it can easily lead to a situation where most of the data has to be discarded, and it has been firmly established that when it comes to machine learning data should not be easily thrown out (Banko and Brill, 2001; Halevy et al., Their tests are performed using C4.5-generated

article thumbnail

Themes and Conferences per Pacoid, Episode 12

Domino Data Lab

Consider the following timeline: 2001 – Physics grad students are getting hired in quantity by hedge funds to work on Wall St. The probabilistic nature changes the risks and process required. We face problems—crises—regarding risks involved with data and machine learning in production. Public Health Reports (2017-07-10).