Apache Iceberg optimization: Solving the small files problem in Amazon EMR
AWS Big Data
OCTOBER 3, 2023
For more information on streaming applications on AWS, refer to Real-time Data Streaming and Analytics. To learn more about the available optimize data executors and catalog properties, refer to the README file in the GitHub repo. For our testing, we generated about 58,176 small objects with total size of 2 GB.
Let's personalize your content