article thumbnail

Themes and Conferences per Pacoid, Episode 11

Domino Data Lab

Paco Nathan ‘s latest article covers program synthesis, AutoPandas, model-driven data queries, and more. Using ML models to search more effectively brought the search space down to 102—which can run on modest hardware. Model-Driven Data Queries. Introduction. BTW, videos for Rev2 are up: [link]. That’s impressive.

Metadata 105
article thumbnail

Accelerating revenue growth with real-time analytics: Poshmark’s journey

AWS Big Data

Personalized recommendations – User behavior based on clickstream events can be captured up to the last second before enriching it for personalization and sending it to the model to predict the recommendations. Spark Structured Streaming continuous processing is an experimental feature and provides at-least once guarantees.

article thumbnail

Unleashing the power of Presto: The Uber case study

IBM Big Data Hub

Uber’s DNA as an analytics company At its core, Uber’s business model is deceptively simple: connect a customer at point A to their destination at point B. Next, they build model data sets out of the snapshots, cleanse and deduplicate the data, and prepare it for analysis as Parquet files. It lands as raw data in HDFS.

OLAP 87