Overview

Apache Hive facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. Hive provides a mechanism to project structure onto this data and query the data using a SQL-like language called HiveQL.

Key Features

  • HiveQL: SQL-like query language.
  • Metastore: Stores metadata about the tables.
  • Execution Engines: Can run on MapReduce, Tez, or Spark.

Related Terms