article thumbnail

Speed up queries with the cost-based optimizer in Amazon Athena

AWS Big Data

Starting today, the Athena SQL engine uses a cost-based optimizer (CBO), a new feature that uses table and column statistics stored in the AWS Glue Data Catalog as part of the table’s metadata. By using these statistics, CBO improves query run plans and boosts the performance of queries run in Athena.

article thumbnail

Themes and Conferences per Pacoid, Episode 8

Domino Data Lab

That’s a lot of priorities – especially when you group together closely related items such as data lineage and metadata management which rank nearby. Also, while surveying the literature two key drivers stood out: Risk management is the thin-edge-of-the-wedge ?for Allows metadata repositories to share and exchange. It’s a mess.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Data Science, Past & Future

Domino Data Lab

What I’m trying to say is this evolution of system architecture, the hardware driving the software layers, and also, the whole landscape with regard to threats and risks, it changes things. You see these drivers involving risk and cost, but also opportunity. I can point to the year 2001. All righty. Where did this happen?

article thumbnail

Themes and Conferences per Pacoid, Episode 12

Domino Data Lab

The gist is, leveraging metadata about research datasets, projects, publications, etc., Consider the following timeline: 2001 – Physics grad students are getting hired in quantity by hedge funds to work on Wall St. The probabilistic nature changes the risks and process required. To wit: data science is a team sport.