Data Collection, Data Lake and Interactive

Streaming Edge Data Collection and Global Data Distribution

Cloudera

JUNE 9, 2022

From origin through all points of consumption, both on-prem and in the cloud, all data flows need to be controlled in a simple, secure, universal, scalable, and cost-effective way. Controlling distribution while also allowing the freedom and flexibility to deliver the data to different services is more critical than ever.

Data Collection

Data Collection IoT Data Lake Unstructured Data

Data Cataloging in the Data Lake: Alation + Kylo

Alation

FEBRUARY 20, 2020

More than any other advancement in analytic systems over the last 10 years, Hadoop has disrupted data ecosystems. By dramatically lowering the cost of storing data for analysis, it ushered in an era of massive data collection. You did not have to understand or prepare the data to get it into Hadoop, so people rarely did.

Data Lake

Data Lake Metadata Structured Data Big Data

Webinars

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

How To Get Promoted In Product Management

MORE WEBINARS

Moving Enterprise Data From Anywhere to Any System Made Easy

Cloudera

JUNE 2, 2022

Over the last decade, we have often heard about the proliferation of data-creating sources (mobile applications, laptops, sensors, enterprise apps) in heterogeneous environments (cloud, on-prem, edge), resulting in exponential growth in the data being created.

Enterprise

Enterprise Data Lake Data Collection Data-driven

Create an end-to-end data strategy for Customer 360 on AWS

AWS Big Data

MARCH 26, 2024

Customer 360 (C360) provides a complete and unified view of a customer’s interactions and behavior across all touchpoints and channels. This view is used to identify patterns and trends in customer behavior, which can inform data-driven decisions to improve business outcomes. Then, you transform this data into a concise format.

Data Strategy

Data Strategy Strategy Data Warehouse Prescriptive Analytics

Analyze Elastic IP usage history using Amazon Athena and AWS CloudTrail

AWS Big Data

MAY 15, 2024

Athena is an interactive query service that simplifies data analysis in Amazon Simple Storage Service (Amazon S3) using standard SQL. By extracting detailed information from CloudTrail and querying it using Athena, this solution streamlines the process of data collection, analysis, and reporting of EIP usage within an AWS account.

Snapshot

Snapshot Optimization Data Lake Reporting

Moving Enterprise Data From Anywhere to Any System Made Easy

CIO Business Intelligence

JULY 13, 2022

Enterprise

Enterprise Data Lake Data Collection Data-driven

What is a Data Pipeline?

Jet Global

MAY 9, 2024

The key components of a data pipeline are typically: Data Sources: The origin of the data, such as a relational database, data warehouse, data lake, file, API, or other data store. This can include tasks such as data ingestion, cleansing, filtering, aggregation, or standardization.
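The stages named in this excerpt (a data source, followed by ingestion, cleansing, filtering, and aggregation) can be sketched as a chain of small Python functions. This is only an illustration under stated assumptions: the record fields `region` and `amount`, the function names, and the sample rows are hypothetical and do not come from the original article.

```python
from collections import defaultdict


def ingest(rows):
    """Ingestion: an in-memory list stands in for a database, file, or API."""
    yield from rows


def cleanse(records):
    """Cleansing and standardization: drop rows with no amount, normalize region names."""
    for record in records:
        if record.get("amount") is None:
            continue
        yield {**record, "region": record["region"].strip().lower()}


def filter_valid(records, min_amount=0.0):
    """Filtering: keep only rows whose amount is at least min_amount."""
    for record in records:
        if record["amount"] >= min_amount:
            yield record


def aggregate(records):
    """Aggregation: total amount per region."""
    totals = defaultdict(float)
    for record in records:
        totals[record["region"]] += record["amount"]
    return dict(totals)


def run_pipeline(rows):
    """Chain the stages in order, from source to aggregated result."""
    return aggregate(filter_valid(cleanse(ingest(rows))))


# Hypothetical sample data, for illustration only.
SAMPLE_ROWS = [
    {"region": " East ", "amount": 10.0},
    {"region": "east", "amount": 5.0},
    {"region": "West", "amount": None},
    {"region": "WEST", "amount": 7.5},
    {"region": "west", "amount": -1.0},
]

if __name__ == "__main__":
    print(run_pipeline(SAMPLE_ROWS))
```

Each stage is a generator, so records flow through one at a time and a stage can be swapped or reordered without touching the others.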

Data Lake

Data Lake Data Warehouse Business Intelligence Machine Learning

How HR&A uses Amazon Redshift spatial analytics on Amazon Redshift Serverless to measure digital equity in states across the US

AWS Big Data

DECEMBER 5, 2023

To fill in the gaps in existing data, HR&A creates digital equity surveys to build a more complete picture before developing digital equity plans. HR&A has used Amazon Redshift Serverless and CARTO to process survey findings more efficiently and create custom interactive dashboards to facilitate understanding of the results.

Measurement

Measurement Dashboards Data Warehouse Analytics

The Sprint towards Digital Healthcare

Cloudera

APRIL 20, 2022

However, consider all the data collection, merging, analyzing and storing this simple interaction requires; it’s not so simple. Data needs to be stored for treatment, drug interactions and/or allergies, patient records, compliance, pharmacy, payment and insurance purposes.

Insurance

Insurance Measurement Data Lake Risk

Better, faster decisions: Why businesses thrive on real-time data

CIO Business Intelligence

SEPTEMBER 8, 2022

Most organizations understand the profound impact that data is having on modern business. In Foundry’s 2022 Data & Analytics Study, 88% of IT decision-makers agree that data collection and analysis have the potential to fundamentally change their business models over the next three years.

Cost-Benefit

Cost-Benefit Internet of Things Data-driven Data Lake

Data democratization: How data architecture can drive business decisions and AI initiatives

IBM Big Data Hub

AUGUST 4, 2023

Architecture for data democratization: Data democratization requires a move away from traditional “data at rest” architecture, which is meant for storing static data. Traditionally, data was seen as information to be put on reserve, only called upon during customer interactions or executing a program.

Data Architecture

Data Architecture Data Lake Machine Learning Data Governance

Federated Learning, Machine Learning, Decentralized Data

Cloudera

DECEMBER 8, 2020

Federated Learning is a paradigm in which machine learning models are trained on decentralized data. Instead of collecting data on a single server or data lake, it remains in place — on smartphones, industrial sensing equipment, and other edge devices — and models are trained on-device.
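The idea in this excerpt, training on each device and never moving the raw data, can be illustrated with a small sketch. The excerpt does not say how the on-device models are combined, so this example assumes federated averaging (a size-weighted mean of the locally trained weights), a commonly used choice. The one-parameter model `y = w * x`, the function names, and the client data are all hypothetical.

```python
def local_update(w, data, lr=0.01, epochs=5):
    """On-device step: a few gradient-descent passes for the model y = w * x."""
    for _ in range(epochs):
        grad = sum(2 * (w * x - y) * x for x, y in data) / len(data)
        w -= lr * grad
    return w


def federated_average(client_weights, client_sizes):
    """Server step: average client weights, weighted by each client's sample count."""
    total = sum(client_sizes)
    return sum(w * n for w, n in zip(client_weights, client_sizes)) / total


def train(clients, rounds=50, w=0.0):
    """Only model weights leave the clients; the raw (x, y) pairs stay local."""
    for _ in range(rounds):
        updates = [local_update(w, data) for data in clients]
        w = federated_average(updates, [len(data) for data in clients])
    return w


# Hypothetical per-device datasets, all drawn from y = 3 * x.
CLIENTS = [
    [(1.0, 3.0), (2.0, 6.0)],
    [(3.0, 9.0)],
    [(0.5, 1.5), (1.5, 4.5), (2.5, 7.5)],
]

if __name__ == "__main__":
    print(train(CLIENTS))
```

In this toy setting the shared weight approaches 3, the slope that generated every client's data, even though the server never sees a single data point.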

Machine Learning

Machine Learning Data Lake Reporting Modeling

When will AI usher in a new era of manufacturing?

CIO Business Intelligence

JULY 12, 2023

P&G engineers developed a high-speed data collection system to capture data to use for training AI models. One challenge they faced is that, while production errors are extremely costly and disruptive, they don’t happen often, which means that failure events are underrepresented in the training data.

Manufacturing

Manufacturing Cost-Benefit Data Lake Optimization

What is Data Mesh?

Ontotext

NOVEMBER 16, 2023

In a data mesh, domains are represented by a node, which can be an operational data store (ODS), a data warehouse, or a data lake tailored to the domain’s requirements. The owner also provides infrastructure and mechanisms to permit data producers and consumers to interact.

Metadata

Metadata Data-driven Data Quality Data Architecture

8 tips for unleashing the power of unstructured data

CIO Business Intelligence

NOVEMBER 28, 2023

“The insights derived from this audio data have directly contributed to improving the game’s audio experience, ensuring that players are constantly emotionally engaged in the gameplay and interacting with the environment,” Konoval says. Games are dynamic, and so is the data they generate, Konoval says.

Unstructured Data

Unstructured Data Data-driven Visualization Data Quality

A hybrid approach in healthcare data warehousing with Amazon Redshift

AWS Big Data

FEBRUARY 21, 2023

At the heart of all data warehousing is integration, and this layer contains integrated data from multiple sources built around the enterprise-wide business keys. Although data lakes resemble data vaults, a data vault provides more features of a data warehouse. What is a hybrid model?

Data Warehouse

Data Warehouse Data Lake Cost-Benefit Modeling

Improving Multi-tenancy with Virtual Private Clusters

Cloudera

JUNE 6, 2019

When a mix of batch, interactive, and data serving workloads are added to the mix, the problem becomes nearly intractable. While this approach provides isolation, it creates another significant challenge: duplication of data, metadata, and security policies, or ‘split-brain’ data lake. Cloudera Manager (CM) 6.2

Metadata

Metadata Data Lake Optimization Strategy

The Value is in the Data (Wrangling)

Darkhorse

JULY 6, 2017

So what is data wrangling? Let’s imagine the process of building a data lake. First off, data wrangling is gathering the appropriate data. You’ve got yourself a little data lake, but its waters are brackish. It’s time to start digging into the data content. I hope you enjoy that sort of thing.

Data Lake

Data Lake Sales Machine Learning Visualization

How to Build a Customer Centric Business: The Complete Guide

Alation

AUGUST 2, 2022

Similarly, every touchpoint offers data that can help you improve that customer experience, from the number and duration of support interactions to the intuitiveness of your website. Analyzing this data can build your ability to anticipate a customer’s specific needs. But customers aren’t data; they’re people.

Strategy

Strategy Cost-Benefit Metrics Data Lake

Why We Started the Data Intelligence Project

Alation

JULY 7, 2022

To answer these questions we need to look at how data roles within the job market have evolved, and how academic programs have changed to meet new workforce demands. In the 2010s, the growing scope of the data landscape gave rise to a new profession: the data scientist. Supporting the next data-literate generation.

Metadata

Metadata Data-driven Insurance Statistics

Data Leaders Brief
