Data Lake, Interactive and Strategy

Migrate an existing data lake to a transactional data lake using Apache Iceberg

AWS Big Data

OCTOBER 3, 2023

A data lake is a centralized repository that you can use to store all your structured and unstructured data at any scale. You can store your data as-is, without having to first structure the data and then run different types of analytics for better business insights.

Data Lake

Data Lake Metadata Snapshot Recreation/Entertainment

Data Architecture and Strategy in the AI Era

Cloudera

MARCH 28, 2024

But, even with the backdrop of an AI-dominated future, many organizations still find themselves struggling with everything from managing data volumes and complexity to security concerns to rapidly proliferating data silos and governance challenges.

Data Architecture

Data Architecture Strategy Data Lake Data-driven

Webinars

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

How To Get Promoted In Product Management

MORE WEBINARS

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

AWS Big Data

SEPTEMBER 13, 2023

A modern data architecture is an evolutionary architecture pattern designed to integrate a data lake, data warehouse, and purpose-built stores with a unified governance model. The company wanted the ability to continue processing operational data in the secondary Region in the rare event of primary Region failure.

Data Lake

Data Lake Data Processing Metadata Snapshot

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

AWS Big Data

MARCH 10, 2023

Since the deluge of big data over a decade ago, many organizations have learned to build applications to process and analyze petabytes of data. Data lakes have served as a central repository to store structured and unstructured data at any scale and in various formats.

Data Lake

Data Lake Sales Data Warehouse Snapshot

How to modernize data lakes with a data lakehouse architecture

IBM Big Data Hub

JULY 5, 2023

Data lakes have been around for well over a decade now, supporting the analytic operations of some of the world’s largest corporations. Such data volumes are not easy to move, migrate or modernize. The challenges of a monolithic data lake architecture: Data lakes are, at a high level, single repositories of data at scale.

Data Lake

Data Lake Metadata Cost-Benefit Data Warehouse

Enable business users to analyze large datasets in your data lake with Amazon QuickSight

AWS Big Data

JUNE 23, 2023

Events and many other security data types are stored in Imperva’s Threat Research Multi-Region data lake. Imperva harnesses data to improve their business outcomes. As part of their solution, they are using Amazon QuickSight to unlock insights from their data.

Data Lake

Data Lake Cost-Benefit Dashboards Data Warehouse

Build a real-time GDPR-aligned Apache Iceberg data lake

AWS Big Data

FEBRUARY 24, 2023

Data lakes are a popular choice for today’s organizations to store their data around their business activities. As a best practice of a data lake design, data should be immutable once stored. A data lake built on AWS uses Amazon Simple Storage Service (Amazon S3) as its primary storage environment.

Data Lake

Data Lake Metadata Testing Data Warehouse

Create an end-to-end data strategy for Customer 360 on AWS

AWS Big Data

MARCH 26, 2024

Customer 360 (C360) provides a complete and unified view of a customer’s interactions and behavior across all touchpoints and channels. This view is used to identify patterns and trends in customer behavior, which can inform data-driven decisions to improve business outcomes. Then, you transform this data into a concise format.

Data Strategy

Data Strategy Strategy Data Warehouse Prescriptive Analytics

OCBC Bank Accelerates Its Data Strategy with Cloudera

Cloudera

DECEMBER 14, 2022

OCBC also won a Cloudera Data Impact Award 2022 in the Transformation category for the project. Real-time data analysis for better business and customer solutions. They were also able to develop smarter processes on the platform by introducing chatbots to take over 10% of customer interactions on their website. “We

Data Strategy

Data Strategy Strategy IT Contextual Data

Five Strategies to Accelerate Data Product Development

Cloudera

JULY 26, 2021

With this first article of the two-part series on data product strategies, I am presenting some of the emerging themes in data product development and how they inform the prerequisites and foundational capabilities of an Enterprise data platform that would serve as the backbone for developing successful data product strategies.

Strategy

Strategy Data Science Marketing Unstructured Data

Build and manage your modern data stack using dbt and AWS Glue through dbt-glue, the new “trusted” dbt adapter

AWS Big Data

NOVEMBER 29, 2023

We have seen a strong customer demand to expand its scope to cloud-based data lakes because data lakes are increasingly the enterprise solution for large-scale data initiatives due to their power and capabilities. Let’s say that this company is located in Europe and the data product must comply with the GDPR.

Data Lake

Data Lake Management Metrics Data Warehouse

Interact with Apache Iceberg tables using Amazon Athena and cross account fine-grained permissions using AWS Lake Formation

AWS Big Data

MARCH 23, 2023

Register the S3 path storing the table using Lake Formation. We register the full S3 path in Lake Formation: navigate to the Lake Formation console. In the navigation pane, under Register and ingest, choose Data lake locations. Jack Ye is a software engineer on the Athena Data Lake and Storage team at AWS.

Interactive

Interactive Snapshot Data Lake Software

Creating Data Value With a Decentralized Data Strategy

CIO Business Intelligence

APRIL 6, 2022

For decades organizations chased the Holy Grail of a centralized data warehouse/lake strategy to support business intelligence and advanced analytics. Thinking about that intelligence as having millions of loosely connected decision points at the edge requires a different strategy, and you can’t micromanage it.

Data Strategy

Data Strategy Strategy Internet of Things Data Warehouse

The rise of the data lakehouse: A new era of data value

CIO Business Intelligence

AUGUST 18, 2022

Previously, Walgreens was attempting to perform that task with its data lake but faced two significant obstacles: cost and time. Those challenges are well-known to many organizations as they have sought to obtain analytical knowledge from their vast amounts of data. Lakehouses redeem the failures of some data lakes.

Data Lake

Data Lake Data Warehouse Unstructured Data Business Intelligence

Data governance in the age of generative AI

AWS Big Data

FEBRUARY 29, 2024

Data is your generative AI differentiator, and a successful generative AI implementation depends on a robust data strategy incorporating a comprehensive data governance approach. As part of the transformation, the objects need to be treated to ensure data privacy (for example, PII redaction).

Data Governance

Data Governance Unstructured Data Metadata Data Lake

Your guide to AWS Analytics at AWS re:Invent 2023

AWS Big Data

NOVEMBER 13, 2023

As your company transforms, you need a data foundation for business applications, new technical innovations, and data-driven business initiatives. With new capabilities for self-service and simpler builder experiences, you can democratize data access for line-of-business users, analysts, scientists, and engineers.

Analytics

Analytics Data Lake Data Warehouse Data-driven

Achieve your AI goals with an open data lakehouse approach

IBM Big Data Hub

OCTOBER 4, 2023

Artificial intelligence (AI) is now at the forefront of how enterprises work with data to help reinvent operations, improve customer experiences, and maintain a competitive advantage. It’s no longer a nice-to-have, but an integral part of a successful data strategy. Later this year, watsonx.data will infuse watsonx.ai

Data Lake

Data Lake Metadata Cost-Benefit Data Warehouse

Thermo Fisher transforms its customer experience

CIO Business Intelligence

AUGUST 12, 2022

The rapid growth left the company highly dependent on fragmented, manual processes and disparate data sources and systems. So, we have a lot of disparate systems across our company (ERPs, CRMs, middleware), but our go-to-market strategy for our customers, you have to make that all invisible for them.” Catalyzing change.

IT Data Lake Sales Interactive

Enforce fine-grained access control on Open Table Formats via Amazon EMR integrated with AWS Lake Formation

AWS Big Data

JANUARY 17, 2024

With Amazon EMR 6.15, we launched AWS Lake Formation based fine-grained access controls (FGAC) on Open Table Formats (OTFs), including Apache Hudi, Apache Iceberg, and Delta Lake. Many large enterprise companies seek to use their transactional data lake to gain insights and improve decision-making.

Data Lake

Data Lake Snapshot Big Data Data-driven

Navigating the Chaos of Unruly Data: Solutions for Data Teams

DataKitchen

NOVEMBER 10, 2023

The Perilous State of Today’s Data Environments: Data teams often navigate a labyrinth of chaos within their databases. Extrinsic Control Deficit: Many of these changes stem from tools and processes beyond the immediate control of the data team.

Data Quality

Data Quality Testing Data Lake Data Integration

Empowering data-driven excellence: How the Bluestone Data Platform embraced data mesh for success

AWS Big Data

FEBRUARY 27, 2024

The following are the key components of the Bluestone Data Platform: Data mesh architecture – Bluestone adopted a data mesh architecture, a paradigm that distributes data ownership across different business units. This enables data-driven decision-making across the organization.

Data-driven

Data-driven Data Lake Data Quality Data Governance

Exploring the hyper-competitive future of customer experience

IBM Big Data Hub

JANUARY 19, 2024

The future of customer experience (CX) is more: more data, more technology, more surprising and delighting. It’s also more pressure to retain those customers, whether those interactions happen online or in-store. As such, future CX strategies will be more data-driven than ever before.

Data-driven

Data-driven Consulting Interactive Data Lake

5 Best Practices for Extracting, Analyzing, and Visualizing Data

Smart Data Collective

DECEMBER 13, 2022

It is important to note that data analytics relies on computer tools and software to collect and analyze data so that business choices may be made properly. Data analytics is widely used in business since it allows organizations to better understand their consumers and improve their advertising strategies.

Visualization

Visualization Key Performance Indicator Sales Data Lake

DIY cloud cost management: The strategic case for building your own tools

CIO Business Intelligence

APRIL 25, 2024

This strategy alone — achieved with the DIY approach — was responsible for saving $300,000 when his team dug into expenses surrounding Google Cloud Platform log storage, resulting in his team moving the logs to a lower tier of cloud storage and rightsizing the data they retain to adhere to retention requirements.

Management

Management Optimization Strategy Enterprise

AWS Lake Formation 2022 year in review

AWS Big Data

JANUARY 31, 2023

We have collected some of the key talks and solutions on data governance, data mesh, and modern data architecture published and presented in AWS re:Invent 2022, and a few data lake solutions built by customers and AWS Partners for easy reference. Starting with Amazon EMR release 6.7.0,

Data Lake

Data Lake Data Governance Data Architecture Machine Learning

7 key Microsoft Azure analytics services (plus one extra)

CIO Business Intelligence

JUNE 29, 2022

The recent announcement of the Microsoft Intelligent Data Platform makes that more obvious, though analytics is only one part of that new brand. Here we take a look at Microsoft Azure’s essential analytics services, what they are used for, and how they come together to make a comprehensive stack for your analytics strategy in the cloud.

Data Lake

Data Lake Analytics Data Warehouse Machine Learning

How Amazon Devices scaled and optimized real-time demand and supply forecasts using serverless analytics

AWS Big Data

FEBRUARY 1, 2023

With data volumes exhibiting a double-digit percentage growth rate year on year and the COVID pandemic disrupting global logistics in 2021, it became more critical to scale and generate near-real-time data. This introduces the need for both polling and pushing the data to access and analyze in near-real time.

Optimization

Optimization Forecasting Data Lake Metadata

Chipotle’s recipe for digital transformation: Cloud plus AI

CIO Business Intelligence

OCTOBER 21, 2022

Chipotle IT’s secret sauce: Garner credits Chipotle’s wholly owned business model for enabling him to deploy advanced technologies such as the cloud, analytics, data lake, and AI uniformly to all restaurants because they are all based on the same digital backbone. Chipotle’s digital business in 2022 was $3.5

Digital Transformation

Digital Transformation Data Lake Forecasting Technology

What is a Data Pipeline?

Jet Global

MAY 9, 2024

The key components of a data pipeline are typically: Data Sources: The origin of the data, such as a relational database, data warehouse, data lake, file, API, or other data store. This can include tasks such as data ingestion, cleansing, filtering, aggregation, or standardization.

Data Lake

Data Lake Data Warehouse Business Intelligence Machine Learning

Analyze Elastic IP usage history using Amazon Athena and AWS CloudTrail

AWS Big Data

MAY 15, 2024

Athena is an interactive query service that simplifies data analysis in Amazon Simple Storage Service (Amazon S3) using standard SQL. Check out the GitHub repo to regularly run this analysis as part of your cost-optimization strategy to identify and release inactive EIPs to reduce costs.

Snapshot

Snapshot Optimization Data Lake Reporting

Why the Data Journey Manifesto?

DataKitchen

JUNE 12, 2023

We had been talking about “Agile Analytic Operations,” “DevOps for Data Teams,” and “Lean Manufacturing For Data,” but the concept was hard to get across and communicate. I spent much time de-categorizing DataOps: we are not discussing ETL, Data Lake, or Data Science.

Testing

Testing Data Lake Dashboards Data Science

2020 Data Impact Award Winner Spotlight: Merck KGaA

Cloudera

DECEMBER 11, 2020

Many businesses now need to achieve free and open data access in order to derive value and improve efficiencies as they navigate the ‘new norm’, whether that involves working from a home office or the garden shed. Toolsets and strategies have had to shift to ensure controlled access to data.

Data Lake

Data Lake Cost-Benefit Unstructured Data Data Governance

Real-time streaming data top picks you cannot miss at AWS re:Invent 2023

AWS Big Data

NOVEMBER 8, 2023

Putting your data to work with generative AI – Innovation Talk: Thursday, November 30 | 12:30 – 1:30 PM PST | The Venetian. Join Mai-Lan Tomsen Bukovec, Vice President, Technology at AWS, to learn how you can turn your data lake into a business advantage with generative AI. Reserve your seat now!

Data-driven

Data-driven Data Lake Machine Learning Cost-Benefit

Unleashing the power of Presto: The Uber case study

IBM Big Data Hub

SEPTEMBER 25, 2023

Presto is an open source distributed SQL query engine for data analytics and the data lakehouse, designed for running interactive analytic queries against datasets of all sizes, from gigabytes to petabytes. Uber understood that digital superiority required the capture of all their transactional data, not just a sampling.

OLAP

OLAP Data Lake Data-driven Snapshot

Breaking barriers in geospatial: Amazon Redshift, CARTO, and H3

AWS Big Data

MAY 16, 2024

However, visualizing and analyzing large-scale geospatial data presents a formidable challenge due to the sheer volume and intricacy of information. The need to balance detail and context while maintaining real-time interactivity can lead to issues of scalability and rendering complexity.

Data Warehouse

Data Warehouse Visualization Cost-Benefit Optimization

How HR&A uses Amazon Redshift spatial analytics on Amazon Redshift Serverless to measure digital equity in states across the US

AWS Big Data

DECEMBER 5, 2023

To fill in the gaps in existing data, HR&A creates digital equity surveys to build a more complete picture before developing digital equity plans. HR&A has used Amazon Redshift Serverless and CARTO to process survey findings more efficiently and create custom interactive dashboards to facilitate understanding of the results.

Measurement

Measurement Dashboards Data Warehouse Analytics

How the BMW Group analyses semiconductor demand with AWS Glue

AWS Big Data

APRIL 26, 2023

This multinational production strategy follows an even more international and extensive supplier network. To enable this use case, we used the BMW Group’s cloud-native data platform called the Cloud Data Hub. To learn more about the Cloud Data Hub, refer to BMW Group Uses AWS-Based Data Lake to Unlock the Power of Data.

Forecasting

Forecasting Manufacturing Data Lake Big Data

Running both IT and digital at Alorica

CIO Business Intelligence

JUNE 1, 2022

At the bottom of the pyramid are conversational capabilities that interact like a human. The whole inverted pyramid creates a closed-loop customer interaction. The pandemic accelerated a change to digital interactions that was already happening in the market. What data do you collect from those channels?

IT Interactive Marketing Consulting

Data democratization: How data architecture can drive business decisions and AI initiatives

IBM Big Data Hub

AUGUST 4, 2023

Architecture for data democratization: Data democratization requires a move away from traditional “data at rest” architecture, which is meant for storing static data. Traditionally, data was seen as information to be put on reserve, only called upon during customer interactions or executing a program.

Data Architecture

Data Architecture Data Lake Machine Learning Data Governance

CIOs weigh where to place AI bets — and how to de-risk them

CIO Business Intelligence

MARCH 18, 2024

The CIO has strategies in place to address all three. One such company has built a tool that predicts customer intent and behavior based on previous interactions and other market data. AI tools rely on the data in use in these solutions. Artificial Intelligence, Generative AI, IT Leadership, IT Strategy, Risk Management

Risk

Risk Cost-Benefit Data Processing Testing

Build an Amazon Redshift data warehouse using an Amazon DynamoDB single-table design

AWS Big Data

JUNE 21, 2023

A typical ask for this data may be to identify sales trends as well as sales growth on a yearly, monthly, or even daily basis. A key pillar of AWS’s modern data strategy is the use of purpose-built data stores for specific use cases to achieve performance, cost, and scale. This is achieved by partitioning the data.

Data Warehouse

Data Warehouse Data Lake OLAP Cost-Benefit

Estimating Scope 1 Carbon Footprint with Amazon Athena

AWS Big Data

AUGUST 2, 2023

In this blog, we will walk through how we can apply existing enterprise data to better understand and estimate Scope 1 carbon footprint using Amazon Simple Storage Service (S3) and Amazon Athena, a serverless interactive analytics service that makes it easy to analyze data using standard SQL.

Data Lake

Data Lake Measurement Visualization Data Architecture

Build a decentralized semantic search engine on heterogeneous data stores using autonomous agents

AWS Big Data

MAY 28, 2024

For example, financial analysts currently have to manually read and summarize lengthy regulatory filings and earnings transcripts in order to respond to Q&A on investment strategies. For getting data from Amazon Redshift, we use Anthropic Claude 2.0. For client interaction, we use Agent Tools based on ReAct.

Unstructured Data

Unstructured Data Structured Data Data Warehouse Testing

Interview with: Sankar Narayanan, Chief Practice Officer at Fractal Analytics

Corinium

JUNE 6, 2019

Some of the work is very foundational, such as building an enterprise data lake and migrating it to the cloud, which enables other more direct value-added activities such as self-service. Newer methods can work with large amounts of data and are able to unearth latent interactions.

Insurance

Insurance Analytics Forecasting Deep Learning
