2012, Data Lake and Reporting - Data Leaders Brief

How BMO improved data security with Amazon Redshift and AWS Lake Formation

AWS Big Data

MARCH 1, 2024

To meet strict cybersecurity requirements, one of the bank’s key challenges is implementing field-level encryption for personally identifiable information (PII), Payment Card Industry (PCI) data, and data classified as high privacy risk (HPR). Only users with the required permissions can access the data in clear text.

Data Lake, Data Warehouse, Management, Risk

Measure performance of AWS Glue Data Quality for ETL pipelines

AWS Big Data

MARCH 12, 2024

In recent years, data lakes have become a mainstream architecture, and data quality validation is critical to improving the reusability and consistency of the data. This post is Part 6 of a six-part series explaining how AWS Glue Data Quality works. Choose Create new report. Choose Apply.

Data Quality, Measurement, Testing, Visualization

Webinars

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

How To Get Promoted In Product Management

Federate Amazon QuickSight access with open-source identity provider Keycloak

AWS Big Data

JUNE 13, 2023

Vamsi Bhadriraju is a Data Architect at AWS. He works closely with enterprise customers to build data lakes and analytical applications on the AWS Cloud. Srikanth Baheti is a Specialized World Wide Principal Solutions Architect for Amazon QuickSight. Raji Sivasubramaniam is a Sr.

Metadata, Dashboards, Business Intelligence, Management

Convergent Evolution

Peter James Thomas

AUGUST 18, 2018

That was the Science, here comes the Technology… A Brief Hydrology of Data Lakes. Overlapping with the above, from around 2012 I also became involved in designing and implementing Big Data architectures, initially for narrow purposes and later as Data Lakes spanning entire enterprises.

Data Lake, Data Warehouse, Data mining, Statistics

How SumUp made digital analytics more accessible using AWS Glue

AWS Big Data

JUNE 6, 2023

Founded in 2012, SumUp is the financial partner for more than 4 million small merchants in over 35 markets worldwide, helping them start, run and grow their business. Unless, of course, the rest of their data also resides in the Google Cloud. The Data Science teams also use this data for churn prediction and CLTV modeling.

Analytics, Data Lake, Testing, Optimization

Simplify external object access in Amazon Redshift using automatic mounting of the AWS Glue Data Catalog

AWS Big Data

JULY 28, 2023

Amazon Redshift now makes it easier for you to run queries in AWS data lakes by automatically mounting the AWS Glue Data Catalog. You no longer have to create an external schema in Amazon Redshift to use the data lake tables cataloged in the Data Catalog. Additional changes to the IAM policy are required.
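
As a rough illustration of the automatic mounting described above, the sketch below builds a query that addresses a data lake table with three-part notation (catalog, database, table) rather than through an external schema. It assumes the mounted catalog is exposed under the name awsdatacatalog. The database and table names (sales_db, orders) and both helper functions are hypothetical, and the code only builds SQL text; it does not connect to Amazon Redshift or touch IAM.

```python
# Sketch only: builds SQL text; it does not connect to Amazon Redshift.
# Assumption: the auto-mounted Glue Data Catalog is named "awsdatacatalog".
MOUNTED_CATALOG = "awsdatacatalog"


def quote_ident(name: str) -> str:
    """Wrap an identifier in double quotes, rejecting empty or quote-bearing names."""
    if not name or '"' in name:
        raise ValueError(f"unsupported identifier: {name!r}")
    return f'"{name}"'


def mounted_table_query(glue_database: str, table: str, limit: int = 10) -> str:
    """Return a SELECT statement that uses catalog.database.table notation."""
    target = f"{MOUNTED_CATALOG}.{quote_ident(glue_database)}.{quote_ident(table)}"
    return f"SELECT * FROM {target} LIMIT {int(limit)};"


if __name__ == "__main__":
    # "sales_db" and "orders" are hypothetical names used for illustration.
    print(mounted_table_query("sales_db", "orders", limit=5))
```

Quoting the database and table names keeps Glue names that contain hyphens usable, which is a design choice of this sketch rather than something the post prescribes.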

Data Lake, Data Governance, Data Warehouse, Modeling

Build efficient ETL pipelines with AWS Step Functions distributed map and redrive feature

AWS Big Data

DECEMBER 18, 2023

Solution overview: One of the common functionalities in data pipelines is extracting data from multiple data sources and exporting it to a data lake or synchronizing it to another database. The other step in the parallel state, which fetches the DynamoDB table, ran successfully.
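
To make the distributed map idea more concrete, here is a minimal sketch of a state machine definition, written as a Python dictionary and serialized to Amazon States Language JSON. It is not the definition used in the post: the choice of an S3 object listing as the item source, the Lambda task, and every name (build_distributed_map, example-bucket, exports/, example-export-fn) are assumptions for illustration. Redrive is an operation on a failed execution, so it does not appear in the definition.

```python
import json


def build_distributed_map(bucket: str, prefix: str, function_name: str,
                          max_concurrency: int = 10) -> dict:
    """Return a state machine definition with a single Distributed Map state."""
    return {
        "Comment": "Sketch: fan out over S3 objects with a Distributed Map state",
        "StartAt": "ProcessObjects",
        "States": {
            "ProcessObjects": {
                "Type": "Map",
                # Assumption: items are the objects listed under an S3 prefix.
                "ItemReader": {
                    "Resource": "arn:aws:states:::s3:listObjectsV2",
                    "Parameters": {"Bucket": bucket, "Prefix": prefix},
                },
                "ItemProcessor": {
                    # DISTRIBUTED mode runs each item in a child workflow execution.
                    "ProcessorConfig": {"Mode": "DISTRIBUTED", "ExecutionType": "STANDARD"},
                    "StartAt": "ExportObject",
                    "States": {
                        "ExportObject": {
                            "Type": "Task",
                            "Resource": "arn:aws:states:::lambda:invoke",
                            # Hypothetical Lambda function that exports one item.
                            "Parameters": {"FunctionName": function_name, "Payload.$": "$"},
                            "End": True,
                        }
                    },
                },
                "MaxConcurrency": max_concurrency,
                "End": True,
            }
        },
    }


if __name__ == "__main__":
    definition = build_distributed_map("example-bucket", "exports/", "example-export-fn")
    print(json.dumps(definition, indent=2))
```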

Metadata, Visualization, Data Lake, Data-driven

Periscope Data Expands to Israel, Empowering Data Teams with Powerful Tools

Sisense

DECEMBER 11, 2019

And he explained that the leap to visual-based data discovery put analytics in the hands of data experts working in different lines of business, rather than in IT, where bottlenecks could previously occur, and reporting could take much longer. From a startup in 2012, it is now valued at $3.2 A true unicorn.

Data Lake, Big Data, Sales, Data-driven

Simplify and speed up Apache Spark applications on Amazon Redshift data with Amazon Redshift integration for Apache Spark

AWS Big Data

APRIL 20, 2023

For sales across multiple markets, the product sales data such as orders, transactions, and shipment data is available on Amazon S3 in the data lake. The data engineering team can use Apache Spark with Amazon EMR or AWS Glue to analyze this data in Amazon S3. enableHiveSupport().getOrCreate()

Data Lake, Data Warehouse, Sales, Data-driven

Themes and Conferences per Pacoid, Episode 8

Domino Data Lab

APRIL 3, 2019

Not that I’m implying anything about current economic conditions vis-a-vis the timing of this report… #justsayin. Somehow, the gravity of the data has a geological effect that forms data lakes. Also, data science workflows begin to create feedback loops from the big data side of the illo above over to the DW side.

Data Governance, Machine Learning, Metadata, Big Data

Themes and Conferences per Pacoid, Episode 12

Domino Data Lab

AUGUST 8, 2019

Once upon a time, circa 2012-ish, data science conferences were replete with talks about an industry hellbent on loading amazing enormous Big Data into some kind of data lake, and applying all kinds of odd astrophysics-ish approaches…for eventual PROFIT! Public Health Reports (2017-07-10). Or something.

Data Science, Machine Learning, Data Governance, Statistics

Why We Started the Data Intelligence Project

Alation

JULY 7, 2022

To answer these questions, we need to look at how data roles within the job market have evolved, and how academic programs have changed to meet new workforce demands. In the 2010s, the growing scope of the data landscape gave rise to a new profession: the data scientist. We’ve made incredible progress.

Metadata, Data-driven, Insurance, Statistics
