Remove Management Remove Optimization Remove Snapshot Remove Testing
article thumbnail

In-place version upgrades for applications on Amazon Managed Service for Apache Flink now supported

AWS Big Data

For existing users of Amazon Managed Service for Apache Flink who are excited about the recent announcement of support for Apache Flink runtime version 1.18, you can now statefully migrate your existing applications that use older versions of Apache Flink to a more recent version, including Apache Flink version 1.18.

article thumbnail

Apache Iceberg optimization: Solving the small files problem in Amazon EMR

AWS Big Data

Systems of this nature generate a huge number of small objects and need attention to compact them to a more optimal size for faster reading, such as 128 MB, 256 MB, or 512 MB. As of this writing, only the optimize-data optimization is supported. For our testing, we generated about 58,176 small objects with total size of 2 GB.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Optimize checkpointing in your Amazon Managed Service for Apache Flink applications with buffer debloating and unaligned checkpoints – Part 2

AWS Big Data

Unaligned checkpoints help, under specific conditions, to reduce checkpointing time for applications suffering temporary backpressure, and can be now enabled in Amazon Managed Service for Apache Flink applications running Apache Flink 1.15.2 When barriers from all upstream partitions have arrived, the sub-task takes a snapshot of its state.

article thumbnail

Real-time cost savings for Amazon Managed Service for Apache Flink

AWS Big Data

When running Apache Flink applications on Amazon Managed Service for Apache Flink , you have the unique benefit of taking advantage of its serverless nature. This means that cost-optimization exercises can happen at any time—they no longer need to happen in the planning phase. per hour, and attached application storage costs $0.10

article thumbnail

Optimize checkpointing in your Amazon Managed Service for Apache Flink applications with buffer debloating and unaligned checkpoints – Part 1

AWS Big Data

Amazon Managed Service for Apache Flink , formerly known as Amazon Kinesis Data Analytics, is the AWS service offering fully managed Apache Flink. Internally, Apache Flink uses clever mechanisms to maintain exactly-once state consistency, while also optimizing for throughput and reduced latency. This is a two-phase operation.

article thumbnail

How Cloudinary transformed their petabyte scale streaming data lake with Apache Iceberg and AWS Analytics

AWS Big Data

Cloudinary is a cloud-based media management platform that provides a comprehensive set of tools and services for managing, optimizing, and delivering images, videos, and other media assets on websites and mobile applications.

article thumbnail

Amazon Managed Service for Apache Flink now supports Apache Flink version 1.18

AWS Big Data

Amazon Managed Service for Apache Flink , which offers a fully managed, serverless experience in running Apache Flink applications, now supports Apache Flink 1.18.1 , the latest version of Apache Flink at the time of writing. By default, the sink writes in batches to optimize throughput. The dependency for Apache Flink 1.18