article thumbnail

GraphDB in Action: Putting the Most Reliable RDF Database to Work for Better Human-machine Interaction

Ontotext

In today’s world, we increasingly interact with the environment around us through data. For all these data operations to flow smoothly, data needs to be interoperable, of good quality and easy to integrate. The catalog stores the asset’s metadata in RDF.

article thumbnail

Migrate an existing data lake to a transactional data lake using Apache Iceberg

AWS Big Data

In this post, we show you how you can convert existing data in an Amazon S3 data lake in Apache Parquet format to Apache Iceberg format to support transactions on the data using Jupyter Notebook based interactive sessions over AWS Glue 4.0. Note that the migrate procedure isn’t supported in AWS Glue Data Catalog.

Data Lake 102
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Why Establishing Data Context is the Key to Creating Competitive Advantage

Ontotext

Context: The Key to Making Data Useful It goes without saying that data without meaning can yield incorrect insights, leading to potentially dangerous decisions. Beyond that, and without a way to visualize, connect, and utilize the data, it’s still just a bunch of random information.

article thumbnail

Data governance in the age of generative AI

AWS Big Data

However, enterprise data generated from siloed sources combined with the lack of a data integration strategy creates challenges for provisioning the data for generative AI applications. Data discoverability Unlike structured data, which is managed in well-defined rows and columns, unstructured data is stored as objects.

article thumbnail

Enhance monitoring and debugging for AWS Glue jobs using new job observability metrics, Part 3: Visualization and trend analysis using Amazon QuickSight

AWS Big Data

QuickSight makes it straightforward for business users to visualize data in interactive dashboards and reports. You can slice data by different dimensions like job name, see anomalies, and share reports securely across your organization. All of the resources are defined in a sample AWS Cloud Development Kit (AWS CDK) template.

Metrics 104
article thumbnail

KGF 2023: Bikes To The Moon, Datastrophies, Abstract Art And A Knowledge Graph Forum To Embrace Them All

Ontotext

So, KGF 2023 proved to be a breath of fresh air for anyone interested in topics like data mesh and data fabric , knowledge graphs, text analysis , large language model (LLM) integrations, retrieval augmented generation (RAG), chatbots, semantic data integration , and ontology building.

article thumbnail

Introducing Amazon MWAA support for the Airflow REST API and web server auto scaling

AWS Big Data

In this post, we’re excited to introduce two new features that address common customer challenges and unlock new possibilities for building robust, scalable, and flexible data orchestration solutions using Amazon MWAA. Big Data and ETL Solutions Architect, MWAA and AWS Glue ETL expert. Kamen Sharlandjiev is a Sr. His secret weapon?

Testing 86