Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics
AWS Big Data
JULY 20, 2023
Specifically, the system uses Amazon SageMaker Processing jobs to process the data stored in the data lake, employing the AWS SDK for pandas (previously known as AWS Data Wrangler) for data transformation operations, including cleaning, normalization, and feature engineering.
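As a rough illustration of what such a transformation step might look like, here is a minimal sketch in Python. It is not Orca Security's actual code: the column names (`asset_id`, `risk_score`, `open_ports`), the helper names `transform` and `run_job`, and the S3 paths passed to them are all hypothetical. The only AWS SDK for pandas calls used are `wr.s3.read_parquet` and `wr.s3.to_parquet`, and reading plain Parquet from S3 is itself an assumption, since the excerpt does not say how the job reads from the data lake.

```python
import pandas as pd


def transform(df: pd.DataFrame) -> pd.DataFrame:
    """Clean, normalize, and add one engineered feature (hypothetical columns)."""
    out = df.copy()

    # Cleaning: drop rows without a key, then drop duplicate keys.
    out = out.dropna(subset=["asset_id"]).drop_duplicates(subset=["asset_id"])
    # Cleaning: treat a missing port count as zero.
    out["open_ports"] = out["open_ports"].fillna(0).astype(int)

    # Normalization: min-max scale risk_score into [0, 1].
    lo, hi = out["risk_score"].min(), out["risk_score"].max()
    span = hi - lo
    out["risk_score_norm"] = (out["risk_score"] - lo) / span if span else 0.0

    # Feature engineering: a simple boolean flag derived from another column.
    out["is_exposed"] = out["open_ports"] > 0

    return out.reset_index(drop=True)


def run_job(input_path: str, output_path: str) -> None:
    """Hypothetical entry point for a processing job; paths are assumptions."""
    # Imported lazily so transform() can be exercised without AWS access.
    import awswrangler as wr

    df = wr.s3.read_parquet(path=input_path)
    wr.s3.to_parquet(df=transform(df), path=output_path, dataset=True)


# Local check on a tiny in-memory frame (no AWS calls involved).
sample = pd.DataFrame(
    {
        "asset_id": ["a", "b", "b", None],
        "risk_score": [2.0, 10.0, 10.0, 5.0],
        "open_ports": [3, None, None, 1],
    }
)
result = transform(sample)
```

Keeping the transformation in a pure pandas function, separate from the S3 read and write, makes it possible to test the logic locally before running it inside a processing job.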