article thumbnail

What’s hard about AI? Operations!

Timo Elliott

It’s not creating the models or doing the data science—it’s actually making it part of an operational process. Data scientists typically extract data from operational systems and move it to a hyperscalar data lake, then use open source algorithms to create and test their mdoels.

Data Lake 101
article thumbnail

How Novanta’s CIO mobilized its data-driven transformation

CIO Business Intelligence

On investing in capabilities: We’ve set up something called a BI Center of Excellence where we train and have workshops and seminars on a monthly basis that team members across Novanta can join to learn about how they could leverage data marts or data sources to build their own reporting.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Data Modeling 201 for the cloud: designing databases for data warehouses

erwin

The first and most important thing to recognize and understand is the new and radically different target environment that you are now designing a data model for. Star schema: a data modeling and database design paradigm for data warehouses and data lakes. Business Focus. Operational. Operational Tactical.

article thumbnail

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

AWS Big Data

With data becoming the driving force behind many industries today, having a modern data architecture is pivotal for organizations to be successful. In this post, we describe Orca’s journey building a transactional data lake using Amazon Simple Storage Service (Amazon S3), Apache Iceberg, and AWS Analytics.

article thumbnail

Themes and Conferences per Pacoid, Episode 8

Domino Data Lab

Fun fact: in the early 1980s I attended a Systems seminar by some of our department’s grad students and professors, plus their colleagues from a nearby university with bears, who presented about a thing called the “Stanford University Network” workstation. Somehow, the gravity of the data has a geological effect that forms data lakes.

article thumbnail

How Aura from Unity revolutionized their big data pipeline with Amazon Redshift Serverless

AWS Big Data

Amazon Redshift is a recommended service for online analytical processing (OLAP) workloads such as cloud data warehouses, data marts, and other analytical data stores. You can use simple SQL to analyze structured and semi-structured data, operational databases, and data lakes to deliver the best price/performance at any scale.