Remove Business Analysis Remove Data Processing Remove Metadata Remove Snapshot
article thumbnail

Build a data lake with Apache Flink on Amazon EMR

AWS Big Data

The AWS Glue Data Catalog provides a uniform repository where disparate systems can store and find metadata to keep track of data in data silos. With unified metadata, both data processing and data consuming applications can access the tables using the same metadata. For metadata read/write, Flink has the catalog interface.