Announcing the AWS Well-Architected Data Analytics Lens

AWS Big Data

Theo Tolv is a Senior Analytics Architect based in Stockholm, Sweden. He’s worked with small and big data for most of his career, and has built applications running on AWS since 2008. In his spare time he likes to tinker with electronics and read space opera. Bruce Ross is a Senior Solutions Architect at AWS in the New York Area.

Simplify and speed up Apache Spark applications on Amazon Redshift data with Amazon Redshift integration for Apache Spark

AWS Big Data

For sales across multiple markets, the product sales data such as orders, transactions, and shipment data is available on Amazon S3 in the data lake. The data engineering team can use Apache Spark with Amazon EMR or AWS Glue to analyze this data in Amazon S3. […] where(col("year") == 2008).groupBy("qtr").sum("qtysold").select(
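
The Spark fragment in the excerpt is cut off, so its full query is not recoverable here. As a rough illustration of the part that is visible (filter to year 2008, group by quarter, sum the quantity sold), the following is a minimal plain-Python sketch. It does not use Spark or Amazon Redshift; the sample rows and the function name `quarterly_qty_sold` are hypothetical, and only the column names `year`, `qtr`, and `qtysold` come from the excerpt.

```python
from collections import defaultdict

# Hypothetical sample rows; only the column names are taken from the excerpt.
sales = [
    {"year": 2008, "qtr": "1", "qtysold": 4},
    {"year": 2008, "qtr": "1", "qtysold": 2},
    {"year": 2008, "qtr": "2", "qtysold": 5},
    {"year": 2007, "qtr": "1", "qtysold": 9},
]


def quarterly_qty_sold(rows, year):
    """Keep rows for one year, group them by quarter, and sum qtysold."""
    totals = defaultdict(int)
    for row in rows:
        if row["year"] == year:          # corresponds to where(col("year") == 2008)
            totals[row["qtr"]] += row["qtysold"]  # groupBy("qtr").sum("qtysold")
    return dict(totals)


result = quarterly_qty_sold(sales, 2008)
print(result)
```

In Spark the same three steps run lazily and in parallel across partitions; the sketch only shows the logical shape of the aggregation.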

Automate deployment of an Amazon QuickSight analysis connecting to an Amazon Redshift data warehouse with an AWS CloudFormation template

AWS Big Data

About the author: Sandeep Bajwa is a Sr. Analytics Specialist based out of Northern Virginia, specialized in the design and implementation of analytics and data lake solutions.

Use your corporate identities for analytics with Amazon EMR and AWS IAM Identity Center

AWS Big Data

Set up EMR Studio: In this step, we demonstrate the actions needed from the data lake administrator to set up EMR Studio enabled for trusted identity propagation and with IAM Identity Center integration. On the Lake Formation console, choose Data lake permissions under Permissions in the navigation pane.

New Thinking, Old Thinking and a Fairytale

Peter James Thomas

A decade later, Gartner had some rather sobering thoughts to offer on the same subject: Gartner predicted that through 2008, about 60% of organizations that outsource customer-facing functions will see client defections and hidden costs that outweigh any potential cost savings. And reduced costs aren’t guaranteed […].

Driving Industry Transformation Through the Use of Data

Cloudera

To accomplish that, the company needed to resolve issues around outdated usage data, limited storage and processing capabilities, siloed operations, and a limited view of customers. MTN leveraged a data lake powered by the EVA (Enterprise Value Analytics) platform and deployed Cloudera CDP to unify data access across its operations.

Exploring new ETL and ELT capabilities for Amazon Redshift from the AWS Glue Studio visual editor

AWS Big Data

In a modern data architecture, unified analytics enable you to access the data you need, whether it’s stored in a data lake or a data warehouse. If we run a similar query from the Amazon Redshift Query Editor, we can see that there are actually five such venues within that time frame.