Speed up queries with the cost-based optimizer in Amazon Athena

AWS Big Data

Amazon Athena is a serverless, interactive analytics service built on open-source frameworks, supporting open table and file formats. Athena provides a simplified, flexible way to analyze petabytes of data where it lives. Grouping after joining means a large number of records have to participate in the join before being aggregated.
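
To make the last point concrete, here is a minimal Python sketch of why aggregating before a join can shrink the join input. It is a toy illustration under stated assumptions, not Athena's optimizer or its API: the `sales` and `stores` tables and both function names are hypothetical, and the rewrite is only valid here because the aggregate is a sum keyed on the join column.

```python
from collections import defaultdict

# Hypothetical toy tables: many fact rows, few dimension rows.
sales = [(1, 10), (1, 20), (2, 5), (2, 15), (1, 30), (3, 7)]  # (store_id, amount)
stores = {1: "east", 2: "west", 3: "east"}                    # store_id -> region


def join_then_group(sales, stores):
    """Join every sales row to its store, then sum amounts per region."""
    joined = [(stores[sid], amt) for sid, amt in sales if sid in stores]
    totals = defaultdict(int)
    for region, amt in joined:
        totals[region] += amt
    return dict(totals), len(joined)  # result, rows that went through the join


def group_then_join(sales, stores):
    """Pre-aggregate sales per store_id, then join the smaller partial result."""
    partial = defaultdict(int)
    for sid, amt in sales:
        partial[sid] += amt
    joined = [(stores[sid], amt) for sid, amt in partial.items() if sid in stores]
    totals = defaultdict(int)
    for region, amt in joined:
        totals[region] += amt
    return dict(totals), len(joined)


late_totals, late_rows = join_then_group(sales, stores)
early_totals, early_rows = group_then_join(sales, stores)
print(late_totals == early_totals, late_rows, early_rows)
```

Both strategies return the same per-region totals, but the second sends one row per store through the join instead of one row per sale. Whether such a rewrite pays off on real data depends on table sizes and key cardinality, which is the kind of decision a cost-based optimizer makes from statistics.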

Themes and Conferences per Pacoid, Episode 8

Domino Data Lab

Probably the best one-liner I’ve encountered is the analogy that: DG is to data assets as HR is to people. Also, while surveying the literature two key drivers stood out: Risk management is the thin-edge-of-the-wedge for … Somehow, the gravity of the data has a geological effect that forms data lakes. It’s a mess.

Themes and Conferences per Pacoid, Episode 12

Domino Data Lab

Consider the following timeline: 2001 – Physics grad students are getting hired in quantity by hedge funds to work on Wall St. … to join data science teams, e.g., to support advertising, social networks, gaming, and so on; I hired more than a few. 2018 – Global reckoning about data governance, aka “Oops!