Overview
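A data lake stores data in its raw, native format until it is needed. Because a schema is applied when the data is read rather than when it is written, organizations can store massive amounts of structured, semi-structured, and unstructured data without having to define a schema first.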
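The idea of storing data first and defining a schema only when it is needed can be sketched in a few lines of Python. This is a minimal illustration, not how any particular data lake product works: a temporary local directory stands in for the lake's storage, and the helper names `store_raw` and `read_with_schema`, the JSON Lines file layout, and the sample records are all assumptions made for the example.

```python
import json
import tempfile
from pathlib import Path


def store_raw(lake_dir: Path, name: str, records: list[dict]) -> Path:
    """Write records as-is, one JSON object per line; no schema is enforced."""
    path = lake_dir / f"{name}.jsonl"
    with path.open("w", encoding="utf-8") as f:
        for record in records:
            f.write(json.dumps(record) + "\n")
    return path


def read_with_schema(path: Path, schema: dict) -> list[dict]:
    """Apply a schema at read time: keep the named fields and cast each one."""
    rows = []
    with path.open(encoding="utf-8") as f:
        for line in f:
            raw = json.loads(line)
            rows.append({
                # A field missing from the raw record becomes None.
                field: cast(raw[field]) if field in raw else None
                for field, cast in schema.items()
            })
    return rows


# A temporary directory stands in for the lake's storage (hypothetical setup).
with tempfile.TemporaryDirectory() as tmp:
    lake = Path(tmp)
    # Records with differing shapes and types are accepted unchanged.
    events = [
        {"user": "a1", "amount": "19.90", "device": "mobile"},
        {"user": "b2", "amount": 5},
    ]
    path = store_raw(lake, "events", events)
    # The reader chooses the schema after the data has already been stored.
    rows = read_with_schema(path, {"user": str, "amount": float})

print(rows)
```

The writer never inspects the records, so the two events can differ in fields and types. Only the reader decides which fields matter and how to interpret them, and a different reader could apply a different schema to the same file.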
Purpose
Data lakes are used for big data processing, real-time analytics, and training machine learning models.