Fabrics, Meshes & Stacks, oh my! Q&A with Sanjeev Mohan

Alation

He’s a true expert in the field, having worked at Oracle, Scient, BearingPoint, and Booz Allen Hamilton, and on data-focused projects with companies like LVMH, Major League Baseball, Toyota, American Express, Freddie Mac, and many, many others. I recently had the opportunity to connect with Mohan at Snowflake Summit 2022 in Las Vegas.

Supercharge Your Data Lakehouse with Apache Iceberg in Cloudera Data Platform

Cloudera

These tools empower analysts and data scientists to easily collaborate on the same data, with their choice of tools and analytic engines. No more lock-in, unnecessary data transformations, or data movement across tools and clouds just to extract insights out of the data.

Trending Sources

Build incremental data pipelines to load transactional data changes using AWS DMS, Delta 2.0, and Amazon EMR Serverless

AWS Big Data

Data ingestion – Steps 1 and 2 use AWS DMS, which connects to the source database and moves full and incremental data (CDC) to Amazon S3 in Parquet format. Data transformation – Steps 3 and 4 represent an EMR Serverless Spark application (Amazon EMR 6.9 … For Name, enter emr-delta-blog. For Type, choose Spark.
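
As a rough illustration of what "loading incremental changes" involves, the sketch below applies an ordered batch of change records to a keyed snapshot in plain Python. It is not the Spark and Delta code from the AWS post. It assumes each change record carries an operation flag in a field named `Op` with values `I` (insert), `U` (update), and `D` (delete), and that rows are keyed by an `id` column; the function name `apply_cdc` and the sample data are hypothetical.

```python
from typing import Any

Row = dict[str, Any]


def apply_cdc(snapshot: dict[Any, Row], changes: list[Row], key: str = "id") -> dict[Any, Row]:
    """Apply ordered change records to a keyed snapshot and return a new snapshot.

    Assumption: every change record has an "Op" field holding "I", "U", or "D".
    """
    result = dict(snapshot)  # shallow copy so the input snapshot is left untouched
    for change in changes:
        op = change["Op"]
        row = {k: v for k, v in change.items() if k != "Op"}
        pk = row[key]
        if op in ("I", "U"):
            result[pk] = row  # insert a new row or overwrite the existing one
        elif op == "D":
            result.pop(pk, None)  # deleting a missing key is a no-op
        else:
            raise ValueError(f"unknown operation flag: {op!r}")
    return result


# Hypothetical sample data: a two-row snapshot and three ordered changes.
snapshot = {
    1: {"id": 1, "name": "alice"},
    2: {"id": 2, "name": "bob"},
}
changes = [
    {"Op": "U", "id": 1, "name": "alice2"},
    {"Op": "D", "id": 2, "name": "bob"},
    {"Op": "I", "id": 3, "name": "carol"},
]
updated = apply_cdc(snapshot, changes)
```

In a Spark pipeline the same upsert-and-delete logic would typically be expressed as a merge against the target table rather than a Python loop; the loop here only makes the per-record semantics visible.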

Biggest Trends in Data Visualization Taking Shape in 2022

Smart Data Collective

There are countless examples of big data transforming many different industries. There is no disputing the fact that the collection and analysis of massive amounts of unstructured data have been a huge breakthrough. This is something that you can learn more about in just about any technology blog.

Build and manage your modern data stack using dbt and AWS Glue through dbt-glue, the new “trusted” dbt adapter

AWS Big Data

dbt is an open source, SQL-first templating engine that allows you to write repeatable and extensible data transforms in Python and SQL. dbt is predominantly used by data warehouse (such as Amazon Redshift) customers who are looking to keep their data transform logic separate from storage and engine.
