Remove Business Analysis Remove Data Lake Remove Snapshot Remove Workshop
article thumbnail

Build a data lake with Apache Flink on Amazon EMR

AWS Big Data

This post shows you how to integrate Apache Flink in Amazon EMR with the AWS Glue Data Catalog so that you can ingest streaming data in real time and access the data in near-real time for business analysis. We use the AWS Glue Data Catalog to store the metadata such as table schema and table location.