Saturday 1 December 2012

Installation Of Hive In Ubuntu


Hive is a data warehouse system for Hadoop that facilitates easy data summarization, ad-hoc queries, and the analysis of large datasets stored in Hadoop compatible file systems. Hive provides a mechanism to project structure onto this data and query the data using a SQL-like language called HiveQL.

The hive-0.8.1 installation is done in below versions of Linux, Java and Hadoop respectively.

UBUNTU 12.04 LTS
JAVA 1.7.0_09
HADOOP 1.1.0

I have hduser as a dedicated hadoop system user. I had installed my Hadoop in /home/hduser/hadoop folder. Now I am going to install hive  in /home/hduser folder. Change the directory to the hduser and execute below commands.

Download the hive from below URL using wget.

Unzip the tar file.
sudo tar xzf  hive-0.8.1.tar.gz

Change the name to hive.
sudo mv hive-0.8.1 hive

Set the HIVE_HOME path.
I had downloaded the hive to the /home/hduser/hive folder
export HIVE_HOME=/home/hduser/hive
export PATH=$PATH:$HIVE_HOME/bin

Type hive in command line.

To list the tables use show tables hive command.

4 comments:

  1. Is there a need to associate zookeeper with hive for security purposes.

    ReplyDelete
  2. Here you can find a list of all releases tarballs, gzips, etc. Useful when one is interested in another version different from 0.8.1

    http://archive.apache.org/dist/hive/

    ReplyDelete
  3. Sekhar can you please help with hive installation on a multimode cluster?

    ReplyDelete
    Replies
    1. Hive is a client of Hadoop. Hive Queries are converted into Map Reduce programs. Install Hive only on node which had NameNode and JobTracker. Make sure that Hadoop path i set.

      Delete