TheDeveloperBlog.com

Home | Contact Us

C-Sharp | Java | Python | Swift | GO | WPF | Ruby | Scala | F# | JavaScript | SQL | PHP | Angular | HTML

What is Hive

What is Hive in hadoop Tutorial for beginners and professionals with examples. DDL, DML, UDF, UDAF, UDTF on hive, pig, hbase, hdfs, mapreduce, oozie, zooker, spark, sqoop

<< Back to WHAT

What is HIVE

Hive is a data warehouse system which is used to analyze structured data. It is built on the top of Hadoop. It was developed by Facebook.

Hive provides the functionality of reading, writing, and managing large datasets residing in distributed storage. It runs SQL like queries called HQL (Hive query language) which gets internally converted to MapReduce jobs.

Using Hive, we can skip the requirement of the traditional approach of writing complex MapReduce programs. Hive supports Data Definition Language (DDL), Data Manipulation Language (DML), and User Defined Functions (UDF).

Features of Hive

These are the following features of Hive:

  • Hive is fast and scalable.
  • It provides SQL-like queries (i.e., HQL) that are implicitly transformed to MapReduce or Spark jobs.
  • It is capable of analyzing large datasets stored in HDFS.
  • It allows different storage types such as plain text, RCFile, and HBase.
  • It uses indexing to accelerate queries.
  • It can operate on compressed data stored in the Hadoop ecosystem.
  • It supports user-defined functions (UDFs) where user can provide its functionality.

Limitations of Hive

  • Hive is not capable of handling real-time data.
  • It is not designed for online transaction processing.
  • Hive queries contain high latency.

Differences between Hive and Pig

Hive Pig
Hive is commonly used by Data Analysts. Pig is commonly used by programmers.
It follows SQL-like queries. It follows the data-flow language.
It can handle structured data. It can handle semi-structured data.
It works on server-side of HDFS cluster. It works on client-side of HDFS cluster.
Hive is slower than Pig. Pig is comparatively faster than Hive.
Next TopicHive Architecture




Related Links:


Related Links

Adjectives Ado Ai Android Angular Antonyms Apache Articles Asp Autocad Automata Aws Azure Basic Binary Bitcoin Blockchain C Cassandra Change Coa Computer Control Cpp Create Creating C-Sharp Cyber Daa Data Dbms Deletion Devops Difference Discrete Es6 Ethical Examples Features Firebase Flutter Fs Git Go Hbase History Hive Hiveql How Html Idioms Insertion Installing Ios Java Joomla Js Kafka Kali Laravel Logical Machine Matlab Matrix Mongodb Mysql One Opencv Oracle Ordering Os Pandas Php Pig Pl Postgresql Powershell Prepositions Program Python React Ruby Scala Selecting Selenium Sentence Seo Sharepoint Software Spellings Spotting Spring Sql Sqlite Sqoop Svn Swift Synonyms Talend Testng Types Uml Unity Vbnet Verbal Webdriver What Wpf