article thumbnail

Simplifying data processing at Capitec with Amazon Redshift integration for Apache Spark

AWS Big Data

As a result of utilizing the Amazon Redshift integration for Apache Spark, developer productivity increased by a factor of 10, feature generation pipelines were streamlined, and data duplication reduced to zero. These tables are then joined with tables from the Enterprise Data Lake (EDL) at runtime. options(**read_config).option("query",

article thumbnail

DaVita’s technology strategy driven by the ‘power of purpose’

CIO Business Intelligence

We’re looking at a variety of sources of data, putting it in data lakes, and then using that to drive predictive models that really help our doctors and our care teams to stratify our patient’s risk by taking actions at the right time.

article thumbnail

Amazon Kinesis Data Streams: celebrating a decade of real-time data innovation

AWS Big Data

Ten years ago, we launched Amazon Kinesis Data Streams , the first cloud-native serverless streaming data service, to serve as the backbone for companies, to move data across system boundaries, breaking data silos. Another integration launched in 2023 is with Amazon Monitron to power predictive maintenance management.

IoT 55