hadoop2.6.0集群HA搭建

来源:互联网 发布:台州妇产医院网络咨询 编辑:程序博客网 时间:2024/04/27 06:44

环境配置:

4台虚拟机centOS 6.6

 NNDNZKZKFCJNnode11
11
node211111node3
11
1node4
1

1


NN:namenode

DN:datanode

ZK:zookeeper

ZKFC:ZKFailoverController

JN:journalnode

关于jdk,ssh的安装参考我另一篇部署文档http://blog.csdn.net/rosen_luo/article/details/48543947

zookeeper安装:

解压zookeeper-3.4.6.tar.gz,修改/添加一下zoo.cfg文件配置:

dataDir=/home/hadoop/zookeeper/dataserver.1=node1:2888:3888server.2=node2:2888:3888server.3=node3:2888:3888
在/home/hadoop/zookeeper/data 目录中添加myid文件,其内容为各个机器上配置的编号:1/2/3。
hadoop安装:

类似JDK安装,Hadoop安装只需要将安装包解压到指定位置,然后设置环境变量即可,这里不详说。
1. 修改hadoop-env.sh文件,添加jdk的环境变量。

#hadoop-env.sh文件修改export JAVA_HOME=/usr/lib/java

2. 修改hdfs-site.xml文件

<configuration>  <property>    <name>dfs.nameservices</name>    <value>mycluster</value>  </property>  <property>    <name>dfs.ha.namenodes.mycluster</name>    <value>nn1,nn2</value>  </property>  <property>    <name>dfs.namenode.rpc-address.mycluster.nn1</name>    <value>node1:9000</value>  </property>  <property>    <name>dfs.namenode.http-address.mycluster.nn1</name>    <value>node1:50070</value>  </property>  <property>    <name>dfs.namenode.rpc-address.mycluster.nn2</name>    <value>node2:9000</value>  </property>  <property>    <name>dfs.namenode.http-address.mycluster.nn2</name>    <value>node2:50070</value>  </property>  <property>    <name>dfs.namenode.shared.edits.dir</name>    <value>qjournal://node2:8485;node3:8485;node4:8485/mycluster</value>  </property>  <property>    <name>dfs.client.failover.proxy.provider.mycluster</name>    <value>org.apache.hadoop.hdfs.server.namenode.ha.ConfiguredFailoverProxyProvider</value>  </property>  <property>    <name>dfs.ha.automatic-failover.enabled</name>    <value>true</value>  </property>  <property>    <name>dfs.ha.fencing.methods</name>    <value>sshfence</value>  </property>  <property>    <name>dfs.ha.fencing.ssh.private-key-files</name>    <value>/home/hadoop/.ssh/id_rsa</value>  </property></configuration>

3. 修改core-site.xml文件

 <property>    <name>fs.defaultFS</name>    <value>hdfs://mycluster</value>  </property>  <property>    <name>hadoop.tmp.dir</name>    <value>/home/hadoop/tmp</value>  </property>  <property>    <name>ha.zookeeper.quorum</name>    <value>node1:2181,node2:2181,node3e:2181</value>  </property>
4. 启动集群

#在所有DataNode中启动Journal节点,在NameNode中执行(node1, ${HADOOP_HOME}/sbin/)./hadoop-daemon.sh start journalnode#格式化NameNode,在NameNode节点中执行(node1)hdfs namenode -format#启动node1下的NameNode./hadoop-daemon.sh start namenode#在node2上/bin下执行hdfs namenode -bootstrapStandby#在node1上停止所有服务./stop-dfs.sh#在node1中执行zookeeper中格式化:hdfs zkfc -formatZK#启动hdfs,在NodeNode节点中执行(node1, ${HADOOP_HOME}/sbin/)./start-dfs.sh

启动验证:

#node1NameNodeDFSZKFailoverControllerQuorumPeerMain#node2NameNodeDFSZKFailoverControllerQuorumPeerMainDataNodeDFSZKFailoverControllerJournalNode#node3/node4DataNodeQuorumPeerMainJournalNode


0 0