Remove 2001 Remove Metadata Remove Risk Remove Testing
article thumbnail

Speed up queries with the cost-based optimizer in Amazon Athena

AWS Big Data

Starting today, the Athena SQL engine uses a cost-based optimizer (CBO), a new feature that uses table and column statistics stored in the AWS Glue Data Catalog as part of the table’s metadata. Testing on the TPC-DS benchmark showed an 11% improvement in overall query performance when using CBO compared to without it.

article thumbnail

Themes and Conferences per Pacoid, Episode 8

Domino Data Lab

That’s a lot of priorities – especially when you group together closely related items such as data lineage and metadata management which rank nearby. Also, while surveying the literature two key drivers stood out: Risk management is the thin-edge-of-the-wedge ?for Allows metadata repositories to share and exchange. It’s a mess.

article thumbnail

Themes and Conferences per Pacoid, Episode 12

Domino Data Lab

The gist is, leveraging metadata about research datasets, projects, publications, etc., Consider the following timeline: 2001 – Physics grad students are getting hired in quantity by hedge funds to work on Wall St. The probabilistic nature changes the risks and process required. To wit: data science is a team sport.