Use Amazon Athena with Spark SQL for your open-source transactional table formats

AWS Big Data

These formats enable ACID (atomicity, consistency, isolation, durability) transactions, upserts, and deletes, and advanced features such as time travel and snapshots that were previously only available in data warehouses. It will never remove files that are still required by a non-expired snapshot.

Backtesting index rebalancing arbitrage with Amazon EMR and Apache Iceberg

AWS Big Data

Backtesting is a process used in quantitative finance to evaluate trading strategies using historical data. With scalable metadata indexing, Apache Iceberg is able to deliver performant queries to a variety of engines such as Spark and Athena by reducing planning time. This is where the tagging feature in Apache Iceberg comes in handy.

Manage your data warehouse cost allocations with Amazon Redshift Serverless tagging

AWS Big Data

Tags allow you to assign metadata to your AWS resources. Solution overview. Let’s say that your company has two departments: marketing and finance. For Filter by resource type, you can filter by Workgroup, Namespace, Snapshot, and Recovery Point. For more details on tagging, refer to Tagging resources overview.

Benefits of Enterprise Modeling and Data Intelligence Solutions

erwin

This matters because, as he said, “By placing the data and the metadata into a model, which is what the tool does, you gain the abilities for linkages between different objects in the model, linkages that you cannot get on paper or with Visio or PowerPoint.” They’re static snapshots of a diagram at some point in time.

BI Cubed: Data Lineage on OLAP Anyone?

Octopai

How much time has your BI team wasted on finding data and creating metadata management reports? BI groups spend more than 50% of their time and effort manually searching for metadata. It’s a snapshot of data at a specific point in time, at the end of a day, week, month, or year. Business changes. Cube to the rescue.

What Is Data Intelligence?

Alation

It includes intelligence about data, or metadata. The earliest DI use cases leveraged metadata (e.g., popularity rankings reflecting the most used data) to surface assets most useful to others. Again, metadata is key. Data Intelligence and Metadata. Data intelligence is fueled by metadata.