How Etihad taps data science to optimise airline operations

CIO Business Intelligence

Despite the worldwide chaos, UAE national airline Etihad has managed to generate productivity gains and cost savings from data science insights. Etihad began its data science journey with the Cloudera Data Platform and moved its data to the cloud to set up a data lake. Reem Alaya Lebhar.

Data Modeling 201 for the cloud: designing databases for data warehouses

erwin

The first and most important thing to recognize and understand is the new and radically different target environment that you are now designing a data model for. Star schema: a data modeling and database design paradigm for data warehouses and data lakes. Don’t obstruct the optimizer from seeing it’s a star schema.
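
As a rough illustration of the star schema mentioned above, the following sketch builds one fact table surrounded by two dimension tables and runs a typical star join. It uses Python's built-in sqlite3 module rather than any cloud warehouse, and the table and column names (fact_sales, dim_product, dim_date) are hypothetical, chosen only for this example.

```python
import sqlite3


def build_star_schema(conn):
    """Create a tiny star schema: one fact table, two dimension tables."""
    conn.executescript("""
        CREATE TABLE dim_product (
            product_id INTEGER PRIMARY KEY,
            category   TEXT NOT NULL
        );
        CREATE TABLE dim_date (
            date_id INTEGER PRIMARY KEY,
            year    INTEGER NOT NULL
        );
        -- The fact table holds measures plus one foreign key per dimension.
        CREATE TABLE fact_sales (
            product_id INTEGER NOT NULL REFERENCES dim_product(product_id),
            date_id    INTEGER NOT NULL REFERENCES dim_date(date_id),
            amount     REAL NOT NULL
        );
    """)
    conn.executemany("INSERT INTO dim_product VALUES (?, ?)",
                     [(1, "books"), (2, "games")])
    conn.executemany("INSERT INTO dim_date VALUES (?, ?)",
                     [(10, 2022), (11, 2023)])
    conn.executemany("INSERT INTO fact_sales VALUES (?, ?, ?)",
                     [(1, 10, 5.0), (1, 11, 7.0), (2, 11, 20.0)])


def sales_by_category_and_year(conn):
    """Star join: the fact table joined to each dimension on its key."""
    return conn.execute("""
        SELECT p.category, d.year, SUM(f.amount)
        FROM fact_sales f
        JOIN dim_product p ON p.product_id = f.product_id
        JOIN dim_date d    ON d.date_id = f.date_id
        GROUP BY p.category, d.year
        ORDER BY p.category, d.year
    """).fetchall()


conn = sqlite3.connect(":memory:")
build_star_schema(conn)
rows = sales_by_category_and_year(conn)
```

Keeping the joins as plain equality joins on the dimension keys is one way to leave the star shape visible to a query optimizer; how much that matters depends on the target engine.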

Materialized Views in Hive for Iceberg Table Format

Cloudera

Queries containing joins, filters, projections, group-by, or aggregations without group-by can be transparently rewritten by the Hive optimizer to use one or more eligible materialized views. Materialized views can be partitioned on one or more columns. Rewriting queries to use materialized views can potentially improve performance by orders of magnitude.
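
To make the rewriting idea concrete, here is a minimal sketch of what answering a query from a materialized view means. It is not Hive or Iceberg code: it uses Python's built-in sqlite3 module, stands in for the materialized view with an ordinary precomputed table, and performs the rewrite by hand, whereas the Hive optimizer does it automatically. The table and column names (flights, mv_delay_by_origin) are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE flights (origin TEXT, delay_minutes INTEGER);
    INSERT INTO flights VALUES ('AUH', 10), ('AUH', 30), ('DXB', 5);

    -- Stand-in for a materialized view: the aggregate is computed once
    -- and stored, so later queries can read it instead of the base table.
    CREATE TABLE mv_delay_by_origin AS
        SELECT origin,
               COUNT(*)           AS n_flights,
               SUM(delay_minutes) AS total_delay
        FROM flights
        GROUP BY origin;
""")

# The query as a user would write it, against the base table.
ORIGINAL = """
    SELECT origin, SUM(delay_minutes)
    FROM flights
    GROUP BY origin
    ORDER BY origin
"""

# An equivalent query that reads the precomputed result instead.
REWRITTEN = """
    SELECT origin, total_delay
    FROM mv_delay_by_origin
    ORDER BY origin
"""

original_rows = conn.execute(ORIGINAL).fetchall()
rewritten_rows = conn.execute(REWRITTEN).fetchall()
```

Both queries return the same rows, but the second scans one row per origin instead of one row per flight, which is where the potential speedup comes from on large tables. The stored result is only valid while it reflects the base table, so a real system must also refresh or invalidate it when the data changes.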