2022, Data Processing, Metadata and Visualization

2022

Data Processing

Metadata

Visualization

Apache Ozone Powers Data Science in CDP Private Cloud

Cloudera

AUGUST 26, 2021

Before we jump into the data ingestion step, here is a quick overview of how Ozone manages its metadata namespace through volumes, buckets and keys. . If created using the Filesystem interface, the intermediate prefixes ( application-1 & application-1/instance-1 ) are created as directories in the Ozone metadata store. s3 = boto3.resource('s3',

Data Science

Data Science Forecasting Metadata Machine Learning

Create an end-to-end data strategy for Customer 360 on AWS

AWS Big Data

MARCH 26, 2024

In 2022, AWS commissioned a study conducted by the American Productivity and Quality Center (APQC) to quantify the Business Value of Customer 360. Profile aggregation – When you’ve uniquely identified a customer, you can build applications in Managed Service for Apache Flink to consolidate all their metadata, from name to interaction history.

Data Strategy

Data Strategy Strategy Data Warehouse Prescriptive Analytics

Join 52,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

MORE WEBINARS

Trending Sources

Simplify data loading into Type 2 slowly changing dimensions in Amazon Redshift

AWS Big Data

MARCH 9, 2023

SCD2 metadata – rec_eff_dt and rec_exp_dt indicate the state of the record. Register source tables in the AWS Glue Data Catalog We use an AWS Glue crawler to infer metadata from delimited data files like the CSV files used in this post. It is also called the surrogate key and has a unique value that is monotonically increasing.

Slice and Dice

Slice and Dice Data Warehouse Metrics Metadata

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

MORE WEBINARS

Build incremental data pipelines to load transactional data changes using AWS DMS, Delta 2.0, and Amazon EMR Serverless

AWS Big Data

MARCH 3, 2023

Although this post uses an Aurora PostgreSQL database hosted on AWS as the data source, the solution can be extended to ingest data from any of the AWS DMS supported databases hosted on your data centers. Solution overview The following diagram shows the overall architecture of the solution that we implement in this post.

Data Lake

Data Lake Dashboards Metrics Metadata

How smava makes loans transparent and affordable using Amazon Redshift Serverless

AWS Big Data

DECEMBER 21, 2023

After the data lands in Amazon S3, smava uses the AWS Glue Data Catalog and crawlers to automatically catalog the available data, capture the metadata, and provide an interface that allows querying all data assets. smava decided to use Tableau for business intelligence, data visualization, and further analytics.

Data Lake

Data Lake Data Warehouse Data-driven B2B

Summing Up Three Days at Gartner’s Data and Analytics Conference in Orlando, Florida, USA

Andrew White

MARCH 31, 2023

I hosted 25 1-1s in between the meetings and presentations. Data mesh versus data fabric I am not the expert here but in lay terms, I believe both fabric and mesh include a semantic inference engine that consumes active metadata. It seems 2022 was a record year for VC funding overall.

Analytics

Analytics Marketing Visualization Data-driven

Data Leaders Brief

Apache Ozone Powers Data Science in CDP Private Cloud

Create an end-to-end data strategy for Customer 360 on AWS

Webinars

Trending Sources

Simplify data loading into Type 2 slowly changing dimensions in Amazon Redshift

Webinars

Build incremental data pipelines to load transactional data changes using AWS DMS, Delta 2.0, and Amazon EMR Serverless

How smava makes loans transparent and affordable using Amazon Redshift Serverless

Summing Up Three Days at Gartner’s Data and Analytics Conference in Orlando, Florida, USA

Stay Connected