Speed up queries with the cost-based optimizer in Amazon Athena

AWS Big Data

Starting today, the Athena SQL engine uses a cost-based optimizer (CBO), a new feature that uses table and column statistics stored in the AWS Glue Data Catalog as part of the table’s metadata. This means a 3 TB benchmark dataset accurately represents customer workloads on 30–50 TB datasets.

Themes and Conferences per Pacoid, Episode 10

Domino Data Lab

She had much to say to leaders of data science teams, coming from perspectives of data engineering at scale. And by “scale” I’m referring to what is arguably the largest, most successful data analytics operation in the cloud of any public firm that isn’t a cloud provider. “Being model-driven is like using GPS.” “If