
Data Integrity, the Basis for Reliable Insights

Sisense

We live in a world of data: There’s more of it than ever before, in a ceaselessly expanding array of forms and locations. Dealing with Data is your window into the ways data teams are tackling the challenges of this new world to help their companies and their customers thrive. What is data integrity?


Migrate an existing data lake to a transactional data lake using Apache Iceberg

AWS Big Data

Over the years, data lakes on Amazon Simple Storage Service (Amazon S3) have become the default repository for enterprise data and are a common choice for a large set of users who query data for a variety of analytics and machine learning use cases. Analytics use cases on data lakes are always evolving.
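As a rough illustration of what such a migration can involve, the sketch below uses Apache Iceberg's documented Spark procedures to convert an existing Parquet table in place. The catalog configuration, database, and table names are placeholders for illustration, not details from the article, and the Iceberg Spark runtime is assumed to be available on the cluster.

from pyspark.sql import SparkSession

# Build a Spark session with the Iceberg extensions enabled (assumes the
# Iceberg Spark runtime jar is on the classpath, e.g. a Glue or EMR job
# configured for Iceberg).
spark = (
    SparkSession.builder
    .appName("iceberg-migration-sketch")
    .config("spark.sql.extensions",
            "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions")
    # Wrap the session catalog so existing non-Iceberg tables remain queryable.
    .config("spark.sql.catalog.spark_catalog",
            "org.apache.iceberg.spark.SparkSessionCatalog")
    .config("spark.sql.catalog.spark_catalog.type", "hive")
    .getOrCreate()
)

# Optional dry run: snapshot the source table into a temporary Iceberg table
# to validate queries without modifying the original.
spark.sql("CALL spark_catalog.system.snapshot('analytics_db.sales', 'analytics_db.sales_iceberg_test')")

# In-place migration: rewrites the table's metadata in Iceberg format while
# the existing Parquet data files stay where they are in Amazon S3.
spark.sql("CALL spark_catalog.system.migrate('analytics_db.sales')")

The snapshot-then-migrate sequence is a common pattern because the snapshot lets you verify reads and writes against Iceberg before the original table is converted.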



KGF 2023: Bikes To The Moon, Datastrophies, Abstract Art And A Knowledge Graph Forum To Embrace Them All

Ontotext

So, KGF 2023 proved to be a breath of fresh air for anyone interested in topics like data mesh and data fabric, knowledge graphs, text analysis, large language model (LLM) integrations, retrieval augmented generation (RAG), chatbots, semantic data integration, and ontology building.


Big Data Ingestion: Parameters, Challenges, and Best Practices

datapine

Big data: architecture and patterns. The big data problem is best understood through a layered architecture, in which each layer performs a specific function; the architecture consists of six layers.


How The Cloud Made ‘Data-Driven Culture’ Possible | Part 2: Cloud Adoption

BizAcuity

Quick recap from the previous blog: the cloud is better than on-premises solutions for the following reasons. Cost cutting: renting and sharing resources instead of building your own. IaaS provides a platform for compute, data storage, and networking capabilities. Microsoft's blog paints quite the picture about this issue.


The power of remote engine execution for ETL/ELT data pipelines

IBM Big Data Hub

Unified, governed data can also be put to use for various analytical, operational, and decision-making purposes. This process is known as data integration, one of the key components of a strong data fabric. The remote execution engine is a fantastic technical development that takes data integration to the next level.


How Cargotec uses metadata replication to enable cross-account data sharing

AWS Big Data

This is a guest blog post co-written with Sumesh M R from Cargotec and Tero Karttunen from Knowit Finland. For this, Cargotec built an Amazon Simple Storage Service (Amazon S3) data lake and cataloged the data assets in the AWS Glue Data Catalog. The source code for the application is hosted in the AWS Glue GitHub repository.
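To give a feel for what metadata replication across accounts can look like, here is a minimal sketch (not Cargotec's actual implementation) that copies Glue Data Catalog table definitions from a source account to a target account with boto3. The profile names and database name are placeholders, and the target database is assumed to already exist.

import boto3

# Fields from a Glue "Table" object that create_table's TableInput accepts;
# read-only fields such as CreateTime, UpdateTime, and CatalogId must be dropped.
TABLE_INPUT_KEYS = {
    "Name", "Description", "Owner", "Retention", "StorageDescriptor",
    "PartitionKeys", "ViewOriginalText", "ViewExpandedText", "TableType",
    "Parameters", "TargetTable",
}

# Placeholder AWS CLI profiles for the source and target accounts.
source_glue = boto3.Session(profile_name="source-account").client("glue")
target_glue = boto3.Session(profile_name="target-account").client("glue")

database = "datalake_db"  # placeholder; assumed to exist in both accounts

# Page through all tables in the source catalog and re-create their
# definitions in the target catalog.
paginator = source_glue.get_paginator("get_tables")
for page in paginator.paginate(DatabaseName=database):
    for table in page["TableList"]:
        table_input = {k: v for k, v in table.items() if k in TABLE_INPUT_KEYS}
        target_glue.create_table(DatabaseName=database, TableInput=table_input)
        print(f"Replicated table metadata: {database}.{table['Name']}")

A production setup would also handle already-existing tables (update_table), partitions, and scheduling, which is where an event-driven or scheduled replication job of the kind the article describes comes in.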