Speed up queries with the cost-based optimizer in Amazon Athena

AWS Big Data

Starting today, the Athena SQL engine uses a cost-based optimizer (CBO), a new feature that uses table and column statistics stored in the AWS Glue Data Catalog as part of the table’s metadata. The following graph presents the top 10 queries from the TPC-DS benchmark with the greatest performance improvement.

Themes and Conferences per Pacoid, Episode 8

Domino Data Lab

In particular, here’s my Strata SF talk “Overview of Data Governance” presented in article form. That’s a lot of priorities – especially when you group together closely related items such as data lineage and metadata management which rank nearby. The on-the-ground reality of DG presents an almost overwhelming array of topics.

Data Science, Past & Future

Domino Data Lab

Paco Nathan presented “Data Science, Past & Future” at Rev. I am honored to be able to present here and thrilled to have been involved in Rev. The presentation layer was about, say, web browsers, right, what you could do in a web browser. You see these drivers involving risk and cost, but also opportunity.

Themes and Conferences per Pacoid, Episode 12

Domino Data Lab

Secondly: some key insights discussed at Sci Foo finally clicked for me, after I’d heard them presented a few times elsewhere. The gist is, leveraging metadata about research datasets, projects, publications, etc., [...] The probabilistic nature changes the risks and process required.