Data Integration, Data Transformation, Metadata and Modeling

Data Integration

Data Transformation

Metadata

Modeling

Choosing A Graph Data Model to Best Serve Your Use Case

Ontotext

MARCH 27, 2024

For example, GPS, social media, cell phone handoffs are modeled as graphs while data catalogs, data lineage and MDM tools leverage knowledge graphs for linking metadata with semantics. Knowledge graphs model knowledge of a domain as a graph with a network of entities and relationships.

Modeling

Modeling Metadata Data Quality Enterprise

Build and manage your modern data stack using dbt and AWS Glue through dbt-glue, the new “trusted” dbt adapter

AWS Big Data

NOVEMBER 29, 2023

dbt is an open source, SQL-first templating engine that allows you to write repeatable and extensible data transforms in Python and SQL. dbt is predominantly used by data warehouses (such as Amazon Redshift ) customers who are looking to keep their data transform logic separate from storage and engine.

Data Lake

Data Lake Management Metrics Data Warehouse

Join 52,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

MORE WEBINARS

Trending Sources

Data Preparation and Data Mapping: The Glue Between Data Management and Data Governance to Accelerate Insights and Reduce Risks

erwin

JANUARY 11, 2019

Organizations have spent a lot of time and money trying to harmonize data across diverse platforms , including cleansing, uploading metadata, converting code, defining business glossaries, tracking data transformations and so on. And there’s control of that landscape to facilitate insight and collaboration and limit risk.

Data Governance

Data Governance Risk Metadata Management

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

MORE WEBINARS

How Infomedia built a serverless data pipeline with change data capture using AWS Glue and Apache Hudi

AWS Big Data

MARCH 15, 2023

The API retrieves data at runtime from an Amazon Aurora PostgreSQL-Compatible Edition database for end-user consumption. To populate the database, the Infomedia team developed a data pipeline using Amazon Simple Storage Service (Amazon S3) for data storage, AWS Glue for data transformations, and Apache Hudi for CDC and record-level updates.

Cost-Benefit

Cost-Benefit Data Processing Optimization Data-driven

How healthcare organizations can analyze and create insights using price transparency data

AWS Big Data

OCTOBER 11, 2023

Due to this low complexity, the solution uses AWS serverless services to ingest the data, transform it, and make it available for analytics. The serverless architecture features auto scaling, high availability, and a pay-as-you-go billing model to increase agility and optimize costs.

Visualization

Visualization Dashboards Data-driven Gap analysis

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

AWS Big Data

JULY 20, 2023

This data is then used by various applications for streaming analytics, business intelligence, and reporting. Amazon SageMaker is used to build, train, and deploy a range of ML models. This ensures that the data is suitable for training purposes. Additionally, SageMaker training jobs are employed for training the models.

Data Lake

Data Lake Analytics Snapshot Optimization

Addressing the Three Scalability Challenges in Modern Data Platforms

Cloudera

NOVEMBER 22, 2021

In legacy analytical systems such as enterprise data warehouses, the scalability challenges of a system were primarily associated with computational scalability, i.e., the ability of a data platform to handle larger volumes of data in an agile and cost-efficient way. As a result, alternative data integration technologies (e.g.,

Data Processing

Data Processing Data Warehouse Enterprise Visualization

“You Complete Me,” said Data Lineage to DataOps Observability.

DataKitchen

JANUARY 23, 2023

To capture a more complete picture of the data’s journey, it is important to have a DataOps Observability system in place. Data lineage is static and often lags by weeks or months. Data lineage is often considered static because it is typically based on snapshots of data and metadata taken at a specific time.

Testing

Testing Data Governance Data Quality Data-driven

The importance of data ingestion and integration for enterprise AI

IBM Big Data Hub

JANUARY 9, 2024

Companies still often accept the risk of using internal data when exploring large language models (LLMs) because this contextual data is what enables LLMs to change from general-purpose to domain-specific knowledge. In the generative AI or traditional AI development cycle, data ingestion serves as the entry point.

Enterprise

Enterprise Data Integration Data Quality Contextual Data

The Modern Data Stack Explained: What The Future Holds

Alation

JANUARY 17, 2023

What if, experts asked, you could load raw data into a warehouse, and then empower people to transform it for their own unique needs? Today, data integration platforms like Rivery do just that. By pushing the T to the last step in the process, such products have revolutionized how data is understood and analyzed.

Data Warehouse

Data Warehouse Cost-Benefit Data Transformation Data Science

Biggest Trends in Data Visualization Taking Shape in 2022

Smart Data Collective

OCTOBER 13, 2021

There are countless examples of big data transforming many different industries. There is no disputing the fact that the collection and analysis of massive amounts of unstructured data has been a huge breakthrough. Does Data Virtualization support web data integration?

Visualization

Visualization Cost-Benefit Big Data Prescriptive Analytics

What is Data Mapping?

Jet Global

FEBRUARY 23, 2024

Data mapping is essential for integration, migration, and transformation of different data sets; it allows you to improve your data quality by preventing duplications and redundancies in your data fields. Data mapping is important for several reasons.

Data Warehouse

Data Warehouse Reporting Data Transformation Sales

What Is Embedded Analytics?

Jet Global

MAY 1, 2023

These licensing terms are critical: Perpetual license vs subscription: Subscription is a pay-as-you-go model that provides flexibility as you evaluate a vendor. Pricing model: The pricing scale is dependent on several factors. Some cloud applications can even provide new benchmarks based on customer data.

Analytics

Analytics Cost-Benefit Visualization Dashboards

A Stitch in Time: How Jet Analytics Boosts Microsoft Fabric Time-to-Value

Jet Global

MARCH 14, 2024

Mastering Data: Effectively Manage Your Data Download Now How Jet Analytics Enhances Microsoft Fabric Jet Analytics from insightsoftware is a complete data preparation, automation and modeling solution that enables Microsoft Dynamics customers to accelerate Dynamics ERP-ready BI projects without requiring specialist skills.

Analytics

Analytics Management Reporting Enterprise

Data Leaders Brief

Choosing A Graph Data Model to Best Serve Your Use Case

Build and manage your modern data stack using dbt and AWS Glue through dbt-glue, the new “trusted” dbt adapter

Webinars

Trending Sources

Data Preparation and Data Mapping: The Glue Between Data Management and Data Governance to Accelerate Insights and Reduce Risks

Webinars

How Infomedia built a serverless data pipeline with change data capture using AWS Glue and Apache Hudi

How healthcare organizations can analyze and create insights using price transparency data

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

Addressing the Three Scalability Challenges in Modern Data Platforms

“You Complete Me,” said Data Lineage to DataOps Observability.

The importance of data ingestion and integration for enterprise AI

The Modern Data Stack Explained: What The Future Holds

Biggest Trends in Data Visualization Taking Shape in 2022

What is Data Mapping?

What Is Embedded Analytics?

A Stitch in Time: How Jet Analytics Boosts Microsoft Fabric Time-to-Value

Stay Connected