Overview

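A data lake is a storage repository that holds data in its raw, native format until it is needed. Because no schema has to be defined before data is written, organizations can store very large volumes of structured, semi-structured, and unstructured data and apply a schema only when the data is read.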
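
The idea of storing data without an upfront schema can be illustrated with a minimal Python sketch. It is an illustration only, not a description of any particular data lake product: the in-memory list `raw_lake`, the helper `read_with_schema`, and the sample records are all hypothetical names and data made up for this example. Records are kept exactly as they arrived, and a schema (field names plus type casts) is applied only at read time.

```python
import json

# Hypothetical raw records, kept exactly as they arrived.
# Nothing checks or enforces a schema when they are written.
raw_lake = [
    '{"user": "ana", "amount": "12.50", "ts": "2024-01-05"}',
    '{"user": "ben", "amount": 7, "country": "DE"}',
]


def read_with_schema(lines, schema):
    """Apply a schema at read time: select fields and cast their types.

    `schema` maps a field name to a casting function. Fields missing
    from a record come back as None; fields not in the schema are ignored.
    """
    rows = []
    for line in lines:
        record = json.loads(line)
        row = {}
        for field, cast in schema.items():
            value = record.get(field)
            row[field] = cast(value) if value is not None else None
        rows.append(row)
    return rows


# Each reader chooses the schema that suits its own analysis.
sales_schema = {"user": str, "amount": float}
sales_rows = read_with_schema(raw_lake, sales_schema)

geo_schema = {"user": str, "country": str}
geo_rows = read_with_schema(raw_lake, geo_schema)
```

Here the same raw records serve two different readers: `sales_rows` sees `amount` as a float in both records even though it was written once as a string and once as a number, while `geo_rows` ignores `amount` and reports a missing `country` as `None`.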

Purpose

Data lakes are used for big data processing, real-time analytics, and training machine learning models.

Related Terms