Data Leaders Brief

Run interactive workloads on Amazon EMR Serverless from Amazon EMR Studio

AWS Big Data

APRIL 24, 2024

Starting from release 6.14, Amazon EMR Studio supports interactive analytics on Amazon EMR Serverless. EMR Studio is an integrated development environment (IDE) that makes it straightforward for data scientists and data engineers to develop, visualize, and debug analytics applications written in PySpark, Python, and Scala.

Interactive

Interactive Visualization Big Data Management

Streaming Ingestion for Apache Iceberg With Cloudera Stream Processing

Cloudera

MARCH 2, 2023

Recently, we announced enhanced multi-function analytics support in Cloudera Data Platform (CDP) with Apache Iceberg. Iceberg is a high-performance open table format for huge analytic data sets. This enables you to maximize utilization of streaming data at scale. Currently, Iceberg support in CSP is in technical preview mode.

Snapshot

Snapshot Data Processing Metadata Management

Migrate your existing SQL-based ETL workload to an AWS serverless ETL infrastructure using AWS Glue

AWS Big Data

JULY 31, 2023

Customers often use many SQL scripts to select and transform the data in relational databases hosted either in an on-premises environment or on AWS and use custom workflows to manage their ETL. AWS Glue is a serverless data integration and ETL service with the ability to scale on demand.

Sales

Sales Data Warehouse Visualization Testing

Webinars

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Manufacturing Sustainability Surge: Your Guide to Data-Driven Energy Optimization & Decarbonization

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

MORE WEBINARS

Harnessing Streaming Data: Insights at the Speed of Life

Sisense

OCTOBER 15, 2020

Streaming data analytics is expected to grow into a $38.6 As real-time analytics and machine learning stream processing are growing rapidly, they introduce a new set of technological and conceptual challenges. We live in a world of data: There’s more of it than ever before, in a ceaselessly expanding array of forms and locations.

Dashboards

Dashboards IoT Optimization Internet of Things

One Big Cluster Stuck: The Right Tool for the Right Job

Cloudera

JUNE 26, 2023

Over time, using the wrong tool for the job can wreak havoc on environmental health. Here are some tips and tricks of the trade to prevent well-intended yet inappropriate data engineering and data science activities from cluttering or crashing the cluster. Over time, those practices lead to cluster and Impala instability.

Testing

Testing Data Processing Visualization Data Science

Scale AWS Glue jobs by optimizing IP address consumption and expanding network capacity using a private NAT gateway

AWS Big Data

MARCH 19, 2024

For data engineering workloads when AWS Glue is used in such a constrained network configuration, your team may sometimes face hurdles running many jobs simultaneously. When an AWS Glue job runs in your VPC, the job creates an ENI inside the configured VPC for each data connection, and that ENI uses an IP address in the specified VPC.

Optimization

Optimization Data-driven Management Testing

Use Apache Iceberg in a data lake to support incremental data processing

AWS Big Data

MARCH 2, 2023

Apache Iceberg is an open table format for very large analytic datasets, which captures metadata information on the state of datasets as they evolve and change over time. It adds tables to compute engines including Spark, Trino, PrestoDB, Flink, and Hive using a high-performance table format that works just like a SQL table.

Data Lake

Data Lake Data Processing Metadata Snapshot

2021 Data/AI Salary Survey

O'Reilly on Data

SEPTEMBER 15, 2021

Would your job still be there in a year? At the same time, employees were reluctant to look for new jobs, especially if they would require relocating—at least according to the rumor mill. 22% of respondents said they intended to change jobs, roughly what we would have expected. Executive Summary. Executive Summary.

Machine Learning

Machine Learning Statistics Reporting Consulting

Educating Data Analysts at Scale: Cloudera Launches Modern Big Data Analysis with SQL on Coursera

Cloudera

JULY 15, 2019

At a time when machine learning, deep learning, and artificial intelligence capture an outsize share of media attention, jobs requiring SQL skills continue to vastly outnumber jobs requiring those more advanced skills. Educating Data Analysts at Scale. What We Teach.

Big Data

Big Data Deep Learning Data Warehouse Data-driven

What you need to know about product management for AI

O'Reilly on Data

MARCH 31, 2020

We won’t go into the mathematics or engineering of modern machine learning here. AI systems differ from traditional software in many ways, but the biggest difference is that machine learning shifts engineering from a deterministic process to a probabilistic one.

Management

Management Machine Learning Experimentation Metrics

Data architecture strategy for data quality

IBM Big Data Hub

JANUARY 5, 2023

How the right data architecture improves data quality. The right data architecture can help your organization improve data quality because it provides the framework that determines how data is collected, transported, stored, secured, used and shared for business intelligence and data science use cases.

Data Quality

Data Quality Data Architecture Strategy Data Lake

Modernize Your ETL Processes, Discover Better Insights

Sisense

JULY 8, 2020

Product and engineering teams dig into productivity metrics or bug reports to help them better prioritize their resources. Product and engineering teams dig into productivity metrics or bug reports to help them better prioritize their resources. In recent years we’ve seen data become vastly more available to businesses.

Data Warehouse

Data Warehouse Data Lake Data-driven Cost-Benefit

How Fifth Third Bank Implements a Data Mesh with Alation and Snowflake

Alation

JUNE 14, 2023

We didn’t have access to hundreds of data engineers out in the marketplace,” Lavorini points out. So instead of looking toward the job market, Lavorini’s team looked internally at their people and supply chain. “If Every organization wants to better serve its customers, and that goal is often achieved through data. That’s a problem!

Data-driven

Data-driven Finance Uncertainty Digital Transformation

Preprocess and fine-tune LLMs quickly and cost-effectively using Amazon EMR Serverless and Amazon SageMaker

AWS Big Data

FEBRUARY 1, 2024

In general, you can build applications powered by LLMs by incorporating prompt engineering into your code. Prompt engineering is about guiding the model’s output by crafting input prompts, whereas fine-tuning is about training the model on custom datasets to make it better suited for specific tasks or domains.

Modeling

Modeling Metadata Data Processing Unstructured Data

Migrate from Amazon Kinesis Data Analytics for SQL Applications to Amazon Kinesis Data Analytics Studio

AWS Big Data

JUNE 29, 2023

Amazon Kinesis Data Analytics makes it easy to transform and analyze streaming data in real time. In this post, we discuss why AWS recommends moving from Kinesis Data Analytics for SQL Applications to Amazon Kinesis Data Analytics for Apache Flink to take advantage of Apache Flink’s advanced streaming capabilities.

Data Analytics

Data Analytics Analytics IoT Data Lake

Monitor and optimize cost on AWS Glue for Apache Spark

AWS Big Data

APRIL 28, 2023

AWS Glue is a serverless data integration service that makes it simple to discover, prepare, and combine data for analytics, machine learning (ML), and application development. For Usage type , choose the following options: Choose -ETL-DPU-Hour (DPU-Hour) for standard jobs. Choose -ETL-Flex-DPU-Hour (DPU-Hour) for Flex jobs.

Optimization

Optimization Metrics Interactive Data Integration

How Wallapop improved performance of analytics workloads with Amazon Redshift Serverless and data sharing

AWS Big Data

NOVEMBER 14, 2023

Amazon Redshift is a fast, fully managed cloud data warehouse that makes it straightforward and cost-effective to analyze all your data at petabyte scale, using standard SQL and your existing business intelligence (BI) tools. Their cluster was deployed with 8 nodes ra3.4xlarge and concurrency scaling enabled.

Data Warehouse

Data Warehouse Analytics Testing Cost-Benefit

Unlocking the value of data as your differentiator

AWS Big Data

NOVEMBER 29, 2023

While Swami explored many facets of this beneficial relationship in the keynote today, one area that is especially critical for our customers to get right if they want to see success in generative AI is data. There has never been a more exciting time in modern technology. These models need vast amounts of data.

Data Warehouse

Data Warehouse Data Lake Data Integration Dashboards

Create an end-to-end data strategy for Customer 360 on AWS

AWS Big Data

MARCH 26, 2024

We structure it in five pillars that power C360: data collection, unification, analytics, activation, and data governance, along with a solution architecture that you can use for your implementation. This view is used to identify patterns and trends in customer behavior, which can inform data-driven decisions to improve business outcomes.

Data Strategy

Data Strategy Strategy Data Warehouse Prescriptive Analytics

How SikSin improved customer engagement with AWS Data Lab and Amazon Personalize

AWS Big Data

JANUARY 25, 2023

The SikSin Food Service team wanted to view web analytics log data by multiple dimensions, such as customer profiles and places. The AWS Data Lab offers accelerated, joint-engineering engagements between customers and AWS technical resources to create tangible deliverables that accelerate data and analytics modernization initiatives.

Visualization

Visualization Interactive Modeling Machine Learning

How to Build a Successful Metadata Management Framework

Alation

JUNE 28, 2022

With a metadata management framework, your data analysts: Optimize search and findability: Create a single portal using role-based access for rapid data access based on job function and need. Scale effectively: Leverage taxonomies to ensure consistent modeling outcomes when introducing new data sets or changing business demands.

Metadata

Metadata Management Data Governance Machine Learning

Choosing an open table format for your transactional data lake on AWS

AWS Big Data

JUNE 9, 2023

A modern data architecture enables companies to ingest virtually any type of data through automated pipelines into a data lake, which provides highly durable and cost-effective object storage at petabyte or exabyte scale. Apache Iceberg 1.2.0, and Delta Lake 2.3.0. Apache Iceberg 1.2.0, and Delta Lake 2.3.0.

Data Lake

Data Lake Metadata Optimization Statistics

Product Management for AI

Domino Data Lab

JUNE 23, 2019

Companies with successful ML projects are often companies that already have an experimental culture in place as well as analytics that enable them to learn from data. Companies that understand how to apply machine learning will be best positioned to scale and win their respective markets over the next decade. Session Summary.

Management

Management Machine Learning Experimentation Metrics

18 Best KPIs and Metrics for the Agile CEO

Jet Global

JUNE 30, 2021

Depending on the size of the company, a CEO’s role can greatly vary. However, there is one thing that they all have in common. They are the highest-ranking executive in the company and make high-level strategic decisions that chart the course for the company. Financial KPIs for the CEO’s Dashboard. You can read more about financial KPIs here.

Metrics

Metrics KPI Dashboards Advertising

Ultimate Guide to Equity Compensation Management and Software

Jet Global

FEBRUARY 4, 2022

These include: Employee stock options afford an employee the right to purchase a given number of company shares at a predetermined price. Stock appreciation rights ( SARs) give an employee a claim to the company’s share price increase over a given period. Different Forms of Equity Compensation.

Software

Software Management Recreation/Entertainment Reporting

Best Tax KPIs and Metric Examples for 2021 Reporting

Jet Global

NOVEMBER 15, 2021

Organizations should use it in conjunction with other metrics beyond their industry standards–since the standards can vary, especially for the companies that operate on a global scale. Since every organization has its own manner of operation, the KPIs or metrics used for tax will vary from one organization to another. Download Now.

Metrics

Metrics Reporting KPI Risk Management

Efficiently Scaling Your Cap Table: A Look at Equity Management Solutions

Jet Global

JULY 13, 2021

A cap table is critically important for emerging companies, but it can quickly become very complicated. A cap table (short for “capitalization table”) is a list of the ownership shares in a company, along with SAFE, warrants, options, and other convertable securities. A company’s cap table is important for several reasons.

Management

Management Recreation/Entertainment Reporting Software

Workforce Planning Best Practices for 2022

Jet Global

DECEMBER 22, 2021

Companies are struggling to find employees with the right mix of skills and to apply those skills effectively and efficiently. In many parts of the world, labor markets are under stress. As a result, strategic workforce planning is garnering considerable attention today, and workforce planning software is in high demand.

Recreation/Entertainment

Recreation/Entertainment Finance Software Reporting

5 Ways Certent Equity Management Can Help You with Your SPAC

Jet Global

DECEMBER 13, 2021

Why SPACs Matter Right Now. These warrants give investors the right to buy more shares at a pre-set price in the future. You’ve probably heard a lot about special purpose acquisition companies (SPACs) lately. This allows trading as quickly as 30 days after IPOs, which is much faster than the traditional IPO which requires 180 days.

Management

Management Recreation/Entertainment Reporting Insurance

Three Best Practices for ASC 718 Reporting

Jet Global

JUNE 15, 2021

During the dot-com boom, the practice of issuing equity shares as a portion of employee compensation gained tremendous popularity. Unfortunately, at the time that so many tech startups were springing up in the early 2000s, accounting practices related to the expensing of equity-based compensation were not well standardized.

Reporting

Reporting Recreation/Entertainment Finance Cost-Benefit

Leveraging the Benefits of Sales and Operations Planning

Jet Global

DECEMBER 22, 2021

Recoup 50 Percent of Your Time with the Right Financial Reporting and Planning Tools. In companies that deal with physical products, there is generally a clear delineation between supply chain operations and sales functions. contrast, with S&OP, planning processes are collaborative, and the overall goal shifts toward maximizing value.

Sales

Sales Forecasting Recreation/Entertainment Finance

Is Self-Service BI a Hollow Promise or Crucial Capability?

Jet Global

MARCH 23, 2023

‘Self-service’ capabilities like Self-Service BI are the manifestation of this expectation within many technologies. For most, ease of use is no longer enough. Now tools must be simple to use, and flexible enough to cater to a wide range of skills and intricacy of analysis. Put simply, ‘self-service’ relates to true autonomy.

Dashboards

Dashboards Reporting Key Performance Indicator Interactive

Take Your Budgeting, Planning, and Forecasting to the Next Level With the Bizview EBS Interface

Jet Global

FEBRUARY 9, 2022

This applies to collaborative planning, budgeting, and forecasting, which, without the right tools, can be daunting. Having the right tools to save you time and improve output can help any team to work better as they work smarter. You may wish to work smarter but may be impeded by the “what-ifs” of change.

Forecasting

Forecasting Recreation/Entertainment Reporting Software

Four Benefits of Scenario Modeling in Excel

Jet Global

JULY 15, 2021

There’s an old saying in the business world that “All forecasts are wrong.” There’s another adage, often repeated by military leaders, that says “no plan of battle ever survives first contact with the enemy.”. Unfortunately, there are a number of situations which business leaders simply cannot predict.

Modeling

Modeling Recreation/Entertainment Sales Cost-Benefit

Reduce ESPP Reporting Friction with Equity Management Software

Jet Global

JUNE 22, 2021

Let’s look at four ways in which companies can reduce the friction associated with ESPP reporting using the right equity management software and services. To make matters worse, the job often falls to a single person with specialized knowledge of the systems and processes involved with ESPP record-keeping. Automate ESPP Processes.

Software

Software Reporting Management Recreation/Entertainment

Choosing the Right Software for ESEF Compliance and Beyond

Jet Global

MAY 6, 2021

Transition to the European Single Electronic Format (ESEF) is mandated by the European Securities and Markets Authority (ESMA). ESEF is mandatory for all 7,500 listed companies and involves producing their annual report in a web-native XHTML format, rather than the more traditional downloadable PDF. Beyond ESEF Compliance. Management Accounts.

Software

Software Recreation/Entertainment Reporting Finance

Run interactive workloads on Amazon EMR Serverless from Amazon EMR Studio

Streaming Ingestion for Apache Iceberg With Cloudera Stream Processing

Webinars

Trending Sources

Migrate your existing SQL-based ETL workload to an AWS serverless ETL infrastructure using AWS Glue

Webinars

Harnessing Streaming Data: Insights at the Speed of Life

One Big Cluster Stuck: The Right Tool for the Right Job

Scale AWS Glue jobs by optimizing IP address consumption and expanding network capacity using a private NAT gateway

Use Apache Iceberg in a data lake to support incremental data processing

2021 Data/AI Salary Survey

Educating Data Analysts at Scale: Cloudera Launches Modern Big Data Analysis with SQL on Coursera

What you need to know about product management for AI

Data architecture strategy for data quality

Modernize Your ETL Processes, Discover Better Insights

How Fifth Third Bank Implements a Data Mesh with Alation and Snowflake

Preprocess and fine-tune LLMs quickly and cost-effectively using Amazon EMR Serverless and Amazon SageMaker

Migrate from Amazon Kinesis Data Analytics for SQL Applications to Amazon Kinesis Data Analytics Studio

Monitor and optimize cost on AWS Glue for Apache Spark

How Wallapop improved performance of analytics workloads with Amazon Redshift Serverless and data sharing

Unlocking the value of data as your differentiator

Create an end-to-end data strategy for Customer 360 on AWS

How SikSin improved customer engagement with AWS Data Lab and Amazon Personalize

How to Build a Successful Metadata Management Framework

Choosing an open table format for your transactional data lake on AWS

Product Management for AI

18 Best KPIs and Metrics for the Agile CEO

Ultimate Guide to Equity Compensation Management and Software

Best Tax KPIs and Metric Examples for 2021 Reporting

Efficiently Scaling Your Cap Table: A Look at Equity Management Solutions

Workforce Planning Best Practices for 2022

5 Ways Certent Equity Management Can Help You with Your SPAC

Three Best Practices for ASC 718 Reporting

Leveraging the Benefits of Sales and Operations Planning

Is Self-Service BI a Hollow Promise or Crucial Capability?

Take Your Budgeting, Planning, and Forecasting to the Next Level With the Bizview EBS Interface

Four Benefits of Scenario Modeling in Excel

Reduce ESPP Reporting Friction with Equity Management Software

Choosing the Right Software for ESEF Compliance and Beyond

Stay Connected