Overview
Apache Hive facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. Hive provides a mechanism to project structure onto this data and query the data using a SQL-like language called HiveQL.
Key Features
- HiveQL: SQL-like query language.
- Metastore: Stores metadata about the tables.
- Execution Engines: Can run on MapReduce, Tez, or Spark.