Measure performance of AWS Glue Data Quality for ETL pipelines
AWS Big Data
MARCH 12, 2024
In this post, we provide benchmark results of running increasingly complex data quality rulesets over a predefined test dataset. Dataset details The test dataset contains 104 columns and 1 million rows stored in Parquet format. On the AWS Glue console, under ETL jobs in the navigation pane, choose Visual ETL.
Let's personalize your content