Remove 2012 Remove Data Lake Remove Data-driven Remove Testing
article thumbnail

How SumUp made digital analytics more accessible using AWS Glue

AWS Big Data

SumUp is a leading global financial technology company driven by the purpose of leveling the playing field for small businesses. Founded in 2012, SumUp is the financial partner for more than 4 million small merchants in over 35 markets worldwide, helping them start, run and grow their business.

article thumbnail

Handle UPSERT data operations using open-source Delta Lake and AWS Glue

AWS Big Data

Many customers need an ACID transaction (atomic, consistent, isolated, durable) data lake that can log change data capture (CDC) from operational data sources. There is also demand for merging real-time data into batch data. Delta Lake framework provides these two capabilities.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Simplify and speed up Apache Spark applications on Amazon Redshift data with Amazon Redshift integration for Apache Spark

AWS Big Data

Customers use Amazon Redshift to run their business-critical analytics on petabytes of structured and semi-structured data. Apache Spark enables you to build applications in a variety of languages, such as Java, Scala, and Python, by accessing the data in your Amazon Redshift data warehouse.

article thumbnail

Use your corporate identities for analytics with Amazon EMR and AWS IAM Identity Center

AWS Big Data

To enable your workforce users for analytics with fine-grained data access controls and audit data access, you might have to create multiple AWS Identity and Access Management (IAM) roles with different data permissions and map the workforce users to one of those roles. We use Okta as the IdP for this demonstration.

article thumbnail

How The Cloud Made ‘Data-Driven Culture’ Possible | Part 1

BizAcuity

Cloud technology and innovation drives data-driven decision making culture in any organization. Cloud washing is storing data on the cloud for use over the internet. Storing data is extremely expensive even with VMs during this time. An efficient big data management and storage solution that AWS quickly took advantage of.

article thumbnail

Build efficient ETL pipelines with AWS Step Functions distributed map and redrive feature

AWS Big Data

AWS Step Functions is a fully managed visual workflow service that enables you to build complex data processing pipelines involving a diverse set of extract, transform, and load (ETL) technologies such as AWS Glue , Amazon EMR , and Amazon Redshift. The data is then cleansed, transformed, and uploaded to Amazon S3 for further processing.

Metadata 117
article thumbnail

Themes and Conferences per Pacoid, Episode 12

Domino Data Lab

Paco Nathan ‘s latest monthly article covers Sci Foo as well as why data science leaders should rethink hiring and training priorities for their data science teams. In this episode I’ll cover themes from Sci Foo and important takeaways that data science teams should be tracking. Introduction. Ever heard of it before?