RDF-Star: Metadata Complexity Simplified

Ontotext

To handle such scenarios you need a transalytical graph database, a database engine that can handle both frequent updates (OLTP workloads) and graph analytics (OLAP). Not Every Graph is a Knowledge Graph: Schemas and Semantic Metadata Matter. Metadata about Relationships Come in Handy. Schemas are powerful.

How Cargotec uses metadata replication to enable cross-account data sharing

AWS Big Data

This is a guest blog post co-written with Sumesh M R from Cargotec and Tero Karttunen from Knowit Finland. They chose AWS Glue as their preferred data integration tool due to its serverless nature, low maintenance, ability to control compute resources in advance, and scale when needed.

Ontotext’s Top 5 Most Popular Blog Posts for 2020

Ontotext

At the end of an unconventional year, we at Ontotext still want to honor our tradition and provide our readers with a round-up of the most popular posts on our blog. In its third generation, Ontotext Platform enables organizations to build, use and evolve knowledge graphs as a hub for data, metadata and content.

Insights from Gartner Data & Analytics Summit Orlando 2023

Alation

Ehtisham Zaidi, Gartner’s VP of data management, and Robert Thanaraj, Gartner’s director of data management, gave an update on the fabric versus mesh debate in light of what they call the “active metadata era” we’re currently in. The active metadata helix. Indeed, automation was on everyone’s minds. We couldn’t agree more.

How Cloudinary transformed their petabyte scale streaming data lake with Apache Iceberg and AWS Analytics

AWS Big Data

In this blog post, we dive into different data aspects and how Cloudinary addresses the two concerns of vendor lock-in and cost-efficient data analytics by using Apache Iceberg, Amazon Simple Storage Service (Amazon S3), Amazon Athena, Amazon EMR, and AWS Glue. Old metadata files are kept for history by default.

A Look Back at the Gartner Data and Analytics Summit

Cloudera

More Businesses Are Taking a Holistic Approach to Data Strategy. One of the more common trends that came up in conversations during the summit was the need to reframe how we approach data strategy, taking a much more holistic view of it than organizations would have in past years.

Run Trino queries 2.7 times faster with Amazon EMR 6.15.0

AWS Big Data

Benchmark setup. In our testing, we used the 3 TB dataset stored in Amazon S3 in compressed Parquet format, with metadata for databases and tables stored in the AWS Glue Data Catalog. This benchmark uses the unmodified TPC-DS data schema and table relationships. As shown in this blog post, our TPC-DS benchmark showed a 2.7