Migrate an existing data lake to a transactional data lake using Apache Iceberg

AWS Big Data

A data lake is a centralized repository that you can use to store all your structured and unstructured data at any scale. You can store your data as-is, without having to structure it first, and run different types of analytics on it to gain better business insights.
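
As a rough illustration of what such a migration can look like in practice, the sketch below converts an existing Parquet table to Apache Iceberg in place using Iceberg's Spark migrate procedure. The catalog configuration, database, and table names are hypothetical placeholders, not details taken from the article.

```python
# Minimal sketch (assumptions, not from the article): a Spark session with the Apache Iceberg
# runtime on the classpath, the session catalog backed by a Hive-compatible metastore
# (for example the AWS Glue Data Catalog), and an existing Parquet table "sales_db.orders".
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .appName("iceberg-in-place-migration-sketch")
    # Let Iceberg wrap the built-in session catalog so existing tables stay visible.
    .config("spark.sql.catalog.spark_catalog", "org.apache.iceberg.spark.SparkSessionCatalog")
    .config("spark.sql.catalog.spark_catalog.type", "hive")
    # Enable Iceberg's SQL extensions (CALL procedures, row-level UPDATE/DELETE/MERGE).
    .config("spark.sql.extensions",
            "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions")
    .enableHiveSupport()
    .getOrCreate()
)

# Iceberg's migrate procedure converts the table in place: the existing Parquet data files
# are left where they are and only Iceberg metadata is written alongside them.
spark.sql("CALL spark_catalog.system.migrate('sales_db.orders')")

# Once migrated, the table supports transactional, row-level operations.
spark.sql("UPDATE sales_db.orders SET status = 'shipped' WHERE order_id = 1001")
```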

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

AWS Big Data

Since the deluge of big data began over a decade ago, many organizations have learned to build applications to process and analyze petabytes of data. Data lakes have served as a central repository to store structured and unstructured data at any scale and in various formats.

Petabyte-scale log analytics with Amazon S3, Amazon OpenSearch Service, and Amazon OpenSearch Ingestion

AWS Big Data

At the same time, organizations need to optimize operational costs to unlock the value of this data for timely insights, and to do so with consistent performance. With this massive data growth, data proliferation across data stores, data warehouses, and data lakes can become equally challenging.

Power enterprise-grade Data Vaults with Amazon Redshift – Part 2

AWS Big Data

Amazon Redshift is a popular cloud data warehouse: a fully managed service that seamlessly integrates with an organization’s Amazon Simple Storage Service (Amazon S3) data lake, real-time streams, machine learning (ML) workflows, transactional workflows, and much more, all while providing up to 7.9x better price-performance than other cloud data warehouses.

DS Smith sets a single-cloud agenda for sustainability

CIO Business Intelligence

Its digital transformation began with an application modernization phase, in which Dickson and her IT teams determined which applications should be hosted in the public cloud and which should remain on a private cloud. In total, the company’s operations rely on 700 applications.

Modernize your ETL platform with AWS Glue Studio: A case study from BMS

AWS Big Data

For the past 5 years, BMS has used a custom framework called Enterprise Data Lake Services (EDLS) to create ETL jobs for business users. BMS’s EDLS platform hosts over 5,000 jobs and is growing at 15% year over year (YoY).
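
For readers unfamiliar with the kind of job such a platform generates, here is a minimal sketch of a PySpark job of the sort AWS Glue Studio produces, assuming a source table in the AWS Glue Data Catalog and an S3 output location; the database, table, and path names are hypothetical placeholders and do not come from the BMS case study.

```python
# Minimal AWS Glue PySpark job sketch (illustrative only; the database, table, and S3 path
# below are hypothetical placeholders, not details from the BMS EDLS platform).
import sys

from awsglue.context import GlueContext
from awsglue.job import Job
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext

# Standard Glue job bootstrap: resolve the job name argument and initialize the job.
args = getResolvedOptions(sys.argv, ["JOB_NAME"])
glue_context = GlueContext(SparkContext.getOrCreate())
job = Job(glue_context)
job.init(args["JOB_NAME"], args)

# Read a source table registered in the AWS Glue Data Catalog.
source = glue_context.create_dynamic_frame.from_catalog(
    database="example_db",
    table_name="raw_orders",
)

# A simple transform step: drop columns that downstream consumers do not need.
cleaned = source.drop_fields(["_corrupt_record", "ingest_tmp"])

# Write the result back to the data lake in Parquet format.
glue_context.write_dynamic_frame.from_options(
    frame=cleaned,
    connection_type="s3",
    connection_options={"path": "s3://example-bucket/curated/orders/"},
    format="parquet",
)

job.commit()
```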

TDC Digital leverages IBM Cloud for transparent billing and improved customer satisfaction

IBM Big Data Hub

Small and midsize enterprises (SMEs) are the fastest-growing segment of the market, drawn by reliability, scalability, integration, flexibility, and improved productivity. As an SME, TDC Digital needed a transparent billing system to predict its expenses and price its services effectively.