Overview

A Data Lakehouse is a modern data architecture that seeks to combine the best features of data lakes and data warehouses. It enables BI and ML on all data, providing the structure and performance of a warehouse with the low cost and flexibility of a lake.

Key Features

  • Transaction Support: ACID transactions for data integrity.
  • Schema Enforcement: Ensuring data quality.
  • BI Support: Direct access for reporting tools.
  • Decoupled Storage and Compute: Independent scaling.
  • Open Formats: Using formats like Parquet or Avro.

Related Terms