article thumbnail

10 Years Later: Who’s the GOAT of Data Catalogs?

Alation

December 2012: Alation forms and goes to work creating the first enterprise data catalog. Later, in its inaugural report on data catalogs, Forrester Research recognizes that “Alation started the MLDC trend.”. June 2021: Snowflake names Alation its Data Governance Partner of the Year. of private SaaS companies.

article thumbnail

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

AWS Big Data

He is passionate about helping customers build modern data architectures on the AWS Cloud. He has helped customers of all sizes implement data management, data warehouse, and data lake solutions. Brian Dolan joined Amazon as a Military Relations Manager in 2012 after his first career as a Naval Aviator.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Take Your SQL Skills To The Next Level With These Popular SQL Books

datapine

Originally published in 2018, the book has a second edition that was released in January of 2022. The new version includes details on the latest SQL features such as techniques for wrangling data, as well as two new chapters, one dedicated to setting up your own system, and another for using PostgreSQL and JSON. Viescas, Douglas J.

article thumbnail

Accelerate HiveQL with Oozie to Spark SQL migration on Amazon EMR

AWS Big Data

Many customers run big data workloads such as extract, transform, and load (ETL) on Apache Hive to create a data warehouse on Hadoop. sql_parameters DATE=2022-07-04::HOUR=00 Any additional or dynamic parameters expected by the SQL files. Amol Guldagad is a Data Analytics Consultant based in India.

article thumbnail

Getting started guide for near-real time operational analytics using Amazon Aurora zero-ETL integration with Amazon Redshift

AWS Big Data

Amazon Aurora zero-ETL integration with Amazon Redshift was announced at AWS re:Invent 2022 and is now available in public preview for Amazon Aurora MySQL-Compatible Edition 3 (compatible with MySQL 8.0) For this illustration, we use a provisioned Aurora database and an Amazon Redshift Serverless data warehouse.

article thumbnail

How to use Netezza Performance Server query data in Amazon Simple Storage Service (S3)

IBM Big Data Hub

In this example, we will demonstrate using current data within a Netezza Performance Server as a Service (NPSaaS) table combined with historical data in Parquet files to determine if flight delays have increased in 2022 due to the impact of the COVID-19 pandemic on the airline travel industry. Prerequisites for the demo.

article thumbnail

How The Explosive Growth Of Data Access Affects Your Engineer’s Team Efficiency

Smart Data Collective

In fact, you may have even heard about IDC’s new Global DataSphere Forecast, 2021-2025 , which projects that global data production and replication will expand at a compound annual growth rate of 23% during the projection period, reaching 181 zettabytes in 2025. zettabytes of data in 2020, a tenfold increase from 6.5

Big Data 104