Hive is a data warehouse system for Hadoop that
facilitates easy data summarization, ad-hoc queries, and the analysis of large
datasets stored in Hadoop compatible file systems. Hive provides a mechanism to
project structure onto this data and query the data using a SQL-like language
called HiveQL.
The hive-0.8.1
installation is
done in below versions of Linux, Java and Hadoop respectively.
UBUNTU 12.04 LTS
JAVA 1.7.0_09
HADOOP 1.1.0
I have hduser as a dedicated hadoop system
user. I had installed my Hadoop in /home/hduser/hadoop folder. Now I am going
to install hive in /home/hduser folder.
Change the directory to the hduser and execute below commands.
Download the
hive from below URL using wget.
Unzip the tar file.
sudo
tar xzf hive-0.8.1.tar.gz
Change the name to hive.
sudo
mv hive-0.8.1 hive
Set the HIVE_HOME path.
I had downloaded the hive
to the /home/hduser/hive folder
export
HIVE_HOME=/home/hduser/hive
export
PATH=$PATH:$HIVE_HOME/bin
Type
hive in command line.
To
list the tables use show tables hive command.
Is there a need to associate zookeeper with hive for security purposes.
ReplyDeleteHere you can find a list of all releases tarballs, gzips, etc. Useful when one is interested in another version different from 0.8.1
ReplyDeletehttp://archive.apache.org/dist/hive/
Sekhar can you please help with hive installation on a multimode cluster?
ReplyDeleteHive is a client of Hadoop. Hive Queries are converted into Map Reduce programs. Install Hive only on node which had NameNode and JobTracker. Make sure that Hadoop path i set.
Delete