article thumbnail

How to establish lineage transparency for your machine learning initiatives

IBM Big Data Hub

Machine learning (ML) has become a critical component of many organizations’ digital transformation strategy. The answer lies in the data used to train these models and how that data is derived. The answer lies in the data used to train these models and how that data is derived.

article thumbnail

Best Practices for Metadata Management

Alation

What Is Metadata? Metadata is information about data. A clothing catalog or dictionary are both examples of metadata repositories. Indeed, a popular online catalog, like Amazon, offers rich metadata around products to guide shoppers: ratings, reviews, and product details are all examples of metadata.

Metadata 105
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

SAP Datasphere Powers Business at the Speed of Data

Rocket-Powered Data Science

We live in a data-rich, insights-rich, and content-rich world. Data collections are the ones and zeroes that encode the actionable insights (patterns, trends, relationships) that we seek to extract from our data through machine learning and data science. Source: [link] I will finish with three quotes.

article thumbnail

Becoming a machine learning company means investing in foundational technologies

O'Reilly on Data

Companies successfully adopt machine learning either by building on existing data products and services, or by modernizing existing models and algorithms. In this post, I share slides and notes from a keynote I gave at the Strata Data Conference in London earlier this year. Use ML to unlock new data types—e.g.,

article thumbnail

NVIDIA RAPIDS in Cloudera Machine Learning

Cloudera

In the previous blog post in this series, we walked through the steps for leveraging Deep Learning in your Cloudera Machine Learning (CML) projects. As a machine learning problem, it is a classification task with tabular data, a perfect fit for RAPIDS. The raw data is in a series of CSV files.

article thumbnail

Maximize your data dividends with active metadata

IBM Big Data Hub

Metadata management performs a critical role within the modern data management stack. It helps blur data silos, and empowers data and analytics teams to better understand the context and quality of data. This, in turn, builds trust in data and the decision-making to follow. Improve data discovery.

article thumbnail

How Cargotec uses metadata replication to enable cross-account data sharing

AWS Big Data

Cargotec captures terabytes of IoT telemetry data from their machinery operated by numerous customers across the globe. This data needs to be ingested into a data lake, transformed, and made available for analytics, machine learning (ML), and visualization.