Académique Documents
Professionnel Documents
Culture Documents
Version 1.x.x
Avro
Chukwa
ZooKeeper
HDFS
MapReduce
Prerequisites
Ubuntu server 12.04
Ubuntu desktop (for monitoring) Python (to add repositories) Oracle (Sun) Java 1.7.0_25
http://www.webupd8.org/2012/01/install-oracle-java-jdk-7-inubuntu-via.html
SSH
Check Setup
Check /etc/hosts
For pseudo-distributed mode: 127.0.0.1 localhost NO 127.0.1.1 entry !! ubuntu
Hadoop Downloads
Hadoop 1.1.2 current stable version Hadoop 2.0.5-alpha YARN version
Differences:
YARN HDFS Federation: multiple, redundant name nodes acting in congress Scalability beyond 4000 nodes
User Permissions
$ sudo addgroup hadoop
$ sudo adduser --ingroup hadoop hduser $ sudo chown -R hduser:hadoop hadoop
Hadoop Configuration
Startup: .bashrc All Hadoop config files are in hadoop/conf Environment: hadoop-env.sh
Site specific:
Core: core-site.xml HDFS: hdfs-site.xml
MapReduce: mapred-site.xml
Fully-distributed mode:
conf/masters
conf/slaves
Format HDFS
Existing data in HDFS will be erased !
$ bin/hadoop namenode -format
Check Daemons
jps
1367 Jps 8695 DataNode 8609 NameNode 6318 SecondaryNameNode 2600 NodeManager 2830 ResourceManager
Web Interface
http://localhost:50030 Jobtracker Cluster status http://localhost:50070 NameNode
File system
Pig (Latin)
Pig Installation
$ cd /usr/local $ sudo wget http://mirror.sdunix.com/apache/pig/stable/pig -0.11.1.tar.gz $ sudo tar -xzvf pig-0.11.1.tar.gz $ sudo mv pig-0.11.1 pig $ sudo chown -R hduser:hadoop pig $ sudo rm pig-0.10.0.tar.gz
HBase
HBase Installation
$ cd /usr/local $ sudo wget http://apache.petsads.us/hbase/stable/hbase0.94.10.tar.gz $ sudo tar -xzvf hbase-0.94.10.tar.gz $ sudo mv hbase-0.94.10 hbase $ sudo chown -R hduser:hadoop hbase $ sudo rm hbase-0.94.10.tar.gz
HBase Commands
Starting (Hadoop needs to be running) $ bin/start-hbase.sh Stopping (Stop before stopping Hadoop)
$ bin/stop-hbase.sh
User Interfaces
http://localhost:60010
http://localhost:60030
Check Daemons
jps
1367 Jps 8695 DataNode 8609 NameNode 6318 SecondaryNameNode 2600 NodeManager 2830 ResourceManager 2251 HRegionServer 3458 HMaster 2312 HQuorumPeer
HBase Shell
Starting a shell $ bin/hbase shell Help
$ help
Monitoring
http://localhost:50030 jobtracker http://localhost:50070 NameNode
http://localhost:60010 Master
http://localhost:60030 RegionServer
Questions ?