一、主机规划、存储规划
服务器配置信息:CentOS6.5 最小化安装+development tools组包,其余组件yum安装即可。
二、系统设置如下:
1、服务器信息如下(/etc/hosts文件):
192.168.100.231 db01.chavin.king db01 192.168.100.232 db02.chavin.king db02 192.168.100.233 db03.chavin.king db03 192.168.100.234 db04.chavin.king db04 192.168.100.235 db05.chavin.king db05 192.168.100.236 db06.chavin.king db06 192.168.100.237 db07.chavin.king db07 |
2、创建普通用户及密码:
useradd -g hadoop hadoop echo "dbking588" | passwd --stdin hadoop |
3、配置hadoop账号sudo权限(/etc/sudoers):
chmod u+w /etc/sudoers echo "hadoop ALL=(root)NOPASSWD:ALL" >> /etc/sudoers chmod u-w /etc/sudoers |
4、关闭防火墙并且禁用selinux
5、设置文件打开数量及最大进程数
6、配置集群时间同步服务
cp /etc/ntp.conf /etc/ntp.conf.bak cp /etc/sysconfig/ntpd /etc/sysconfig/ntpd.bak echo "restrict 192.168.100.0 mask 255.255.255.0 nomodify notrap" >> /etc/ntp.conf echo "SYNC_HWCLOCK=yes" >> /etc/sysconfig/ntpd service ntpd restart |
0-59/10 * * * * /opt/scripts/sync_time.sh # cat /opt/scripts/sync_time.sh /sbin/service ntpd stop /usr/sbin/ntpdate db01.chavin.king /sbin/service ntpd start |
三、安装mysql数据库和postgresql数据库
1、安装mysql数据库(mysql-5.6.24-linux-glibc2.5-x86_64.tar.gz)
2、安装cloudera集成postgresql数据库
--需要以以下方式安装postgresql数据库: [root@db01 postgresq-libs]# ll total 6564 -rw-r--r-- 1 root root 2905984 Apr 16 23:58 postgresql-8.4.18-1.el6_4.x86_64.rpm -rw-r--r-- 1 root root 205732 Apr 16 23:58 postgresql-libs-8.4.18-1.el6_4.x86_64.rpm -rw-r--r-- 1 root root 3602880 Apr 16 23:58 postgresql-server-8.4.18-1.el6_4.x86_64.rpm [root@db01 postgresq-libs]# rpm -ivh *.rpm Preparing... ########################################### [100%] 1:postgresql-libs ########################################### [ 33%] 2:postgresql ########################################### [ 67%] 3:postgresql-server ########################################### [100%] |
四、安装CM5
1、软件下载:
安装版本CM 5.3.6
总下载地址:http://archive.cloudera.com/cm5/
cm-5.3.6 bin文件下载地址:http://archive.cloudera.com/cm5/installer/5.3.6/
cm-5.3.6依赖rpm包:http://archive.cloudera.com/cm5/redhat/6/x86_64/cm/5.3.6/
2、配置本地源
1)安装papche服务器:
yum -y install httpd
service httpd start
chkconfig httpd on
cd /var/www/html/
mkdir -p cm5/redhat/6/x86_64/cm/5.3.6/RPMS/x86_64/
--将下载好的cm5依赖包mv到/var/www/html/cm5/redhat/6/x86_64/cm/5.3.6/RPMS/x86_64/目录下:
[root@db01 x86_64]# ll
total 700568
-rw-r--r-- 1 root root 3989520 Apr 16 22:21 cloudera-manager-agent-5.3.6-1.cm536.p0.244.el6.x86_64.rpm
-rw-r--r-- 1 root root 499418684 Apr 16 22:22 cloudera-manager-daemons-5.3.6-1.cm536.p0.244.el6.x86_64.rpm
-rw-r--r-- 1 root root 7852 Apr 16 22:21 cloudera-manager-server-5.3.6-1.cm536.p0.244.el6.x86_64.rpm
-rw-r--r-- 1 root root 9884 Apr 16 22:21 cloudera-manager-server-db-2-5.3.6-1.cm536.p0.244.el6.x86_64.rpm
-rw-r--r-- 1 root root 693024 Apr 16 22:21 enterprise-debuginfo-5.3.6-1.cm536.p0.244.el6.x86_64.rpm
-rw-r--r-- 1 root root 71204325 Apr 16 22:21 jdk-6u31-linux-amd64.rpm
-rw-r--r-- 1 root root 142039186 Apr 16 22:21 oracle-j2sdk1.7-1.7.0+update67-1.x86_64.rpm
--创建repodata相关依赖文件、否则安装cm将默认查找最新版本(/var/www/html/cm5/redhat/6/x86_64/cm/5.3.6目录下):
[root@db01 repo-libs]# ll
total 196
-rw-r--r-- 1 root root 96552 Apr 16 23:21 createrepo-0.9.9-18.el6.noarch.rpm
-rw-r--r-- 1 root root 72520 Apr 16 23:21 deltarpm-3.5-0.5.20090913git.el6.x86_64.rpm
-rw-r--r-- 1 root root 27748 Apr 16 23:21 python-deltarpm-3.5-0.5.20090913git.el6.x86_64.rpm
[root@db01 repo-libs]# rpm -ivh *.repo
[root@db01 repo-libs]# cd /var/www/html/cm5/redhat/6/x86_64/cm/5.3.6/
[root@db01 5.3.6]# createrepo .
2)配置repo文件
[cloudera-manager]
# Packages for Cloudera Manager, Version 5, on RedHat or CentOS 6 x86_64
name=Cloudera Manager
baseurl=http://db01.chavin.king/cm5/redhat/6/x86_64/cm/5.3.6/
enabled=1
gpgcheck=0
3、安装CM5
./cloudera-manager-installer.bin
4、浏览器登录,我这里登录地址为:db01:7180,用户名密码默认admin/admin
五、安装CDH5(parcels包安装)
1、下载parcels软件包(安装版本CDH 5.3.6):http://archive.cloudera.com/cdh5/parcels/5.3.6/
2、上传文件到/opt/cloudera/parcel-repo/目录下:
[root@db01 parcel-repo]# ll
total 725736
-rw-r--r-- 1 root root 743145472 Apr 17 12:23 CDH-5.3.6-1.cdh5.3.6.p0.11-el6.parcel.parcel
-rw-r--r-- 1 root root 41 Apr 17 12:21 CDH-5.3.6-1.cdh5.3.6.p0.11-el6.parcel.sha1
[root@db01 parcel-repo]# mv CDH-5.3.6-1.cdh5.3.6.p0.11-el6.parcel.parcel CDH-5.3.6-1.cdh5.3.6.p0.11-el6.parcel
[root@db01 parcel-repo]# mv CDH-5.3.6-1.cdh5.3.6.p0.11-el6.parcel.sha1 CDH-5.3.6-1.cdh5.3.6.p0.11-el6.parcel.sha
[root@db01 parcel-repo]# ll
total 1473856
-rw-r--r-- 1 root root 1509217191 Apr 17 12:24 CDH-5.3.6-1.cdh5.3.6.p0.11-el6.parcel
-rw-r--r-- 1 root root 41 Apr 17 12:21 CDH-5.3.6-1.cdh5.3.6.p0.11-el6.parcel.sha
3、重启cloudera服务:
[root@db01 parcel-repo]# service cloudera-scm-server status
cloudera-scm-server (pid 22823) is running...
[root@db01 parcel-repo]# service cloudera-scm-server restart
Stopping cloudera-scm-server: [ OK ]
Starting cloudera-scm-server: [ OK ]
4、配置yum源:
将db01上yum文件同步到db02、db03、db04、db05、db06、db07/etc/yum.repo.d/上。
5、向集群中添加主机:
执行如下命令安装依赖包:
#yum -y install cyrus-sasl-gssapi fuse cyrus-sasl-plain libxslt fuse-libs redhat-lsb portmap bind-utils
#yum -y install libxslt fuse-libs
注意:安装以上依赖包才可以正确安装agent服务,否则很可能报错,需要根据实际情况处理。
问题:
解决办法:
# sysctl -w vm.swappiness=0
# echo "vm.swappiness=0" >>/etc/sysctl.conf
6、配置java环境变量
echo "export JAVA_HOME=/usr/java/jdk1.7.0_67-cloudera" >> /etc/profile echo "export PATH=$JAVA_HOME/bin:$JAVA_HOME/jre/bin:$PATH" >>/etc/profile echo "export CLASSPATH=.:$JAVA_HOME/lib:$JAVA_HOME/jre/lib:$CLASSPATH" >> /etc/profile source /etc/profile |
7、版本汇总
Cluster 1 — CDH 5 |
|||
主机 |
|||
db[01-07].chavin.king |
|||
组件 |
版本 |
发行版 |
CDH 版本 |
Bigtop-Tomcat(仅限 CDH 5) |
0.7.0+cdh5.3.6+0 |
1.cdh5.3.6.p0.18 |
CDH 5 |
Crunch(仅限 CDH 5 ) |
0.11.0+cdh5.3.6+31 |
1.cdh5.3.6.p0.18 |
CDH 5 |
Flume NG |
1.5.0+cdh5.3.6+93 |
1.cdh5.3.6.p0.18 |
CDH 5 |
MapReduce 1 |
2.5.0+cdh5.3.6+898 |
1.cdh5.3.6.p0.18 |
CDH 5 |
Hadoop |
2.5.0+cdh5.3.6+898 |
1.cdh5.3.6.p0.18 |
CDH 5 |
HDFS |
2.5.0+cdh5.3.6+898 |
1.cdh5.3.6.p0.18 |
CDH 5 |
HttpFS |
2.5.0+cdh5.3.6+898 |
1.cdh5.3.6.p0.18 |
CDH 5 |
hadoop-kms |
2.5.0+cdh5.3.6+898 |
1.cdh5.3.6.p0.18 |
CDH 5 |
MapReduce 2 |
2.5.0+cdh5.3.6+898 |
1.cdh5.3.6.p0.18 |
CDH 5 |
YARN |
2.5.0+cdh5.3.6+898 |
1.cdh5.3.6.p0.18 |
CDH 5 |
HBase |
0.98.6+cdh5.3.6+115 |
1.cdh5.3.6.p0.18 |
CDH 5 |
Lily HBase Indexer |
1.5+cdh5.3.6+31 |
1.cdh5.3.6.p0.18 |
CDH 5 |
Hive |
0.13.1+cdh5.3.6+397 |
1.cdh5.3.6.p0.18 |
CDH 5 |
HCatalog |
0.13.1+cdh5.3.6+397 |
1.cdh5.3.6.p0.18 |
CDH 5 |
Hue |
3.7.0+cdh5.3.6+203 |
1.cdh5.3.6.p0.18 |
CDH 5 |
Impala |
2.1.5+cdh5.3.6+0 |
1.cdh5.3.6.p0.18 |
CDH 5 |
Kite(仅限 CDH 5 ) |
0.15.0+cdh5.3.6+201 |
1.cdh5.3.6.p0.18 |
CDH 5 |
Llama(仅限 CDH 5 ) |
1.0.0+cdh5.3.6+0 |
1.cdh5.3.6.p0.18 |
CDH 5 |
Mahout |
0.9+cdh5.3.6+25 |
1.cdh5.3.6.p0.18 |
CDH 5 |
Oozie |
4.0.0+cdh5.3.6+349 |
1.cdh5.3.6.p0.18 |
CDH 5 |
Parquet |
1.5.0+cdh5.3.6+69 |
1.cdh5.3.6.p0.18 |
CDH 5 |
Pig |
0.12.0+cdh5.3.6+59 |
1.cdh5.3.6.p0.18 |
CDH 5 |
sentry |
1.4.0+cdh5.3.6+155 |
1.cdh5.3.6.p0.18 |
CDH 5 |
Solr |
4.4.0+cdh5.3.6+352 |
1.cdh5.3.6.p0.18 |
CDH 5 |
spark |
1.2.0+cdh5.3.6+379 |
1.cdh5.3.6.p0.18 |
CDH 5 |
Sqoop |
1.99.4+cdh5.3.6+32 |
1.cdh5.3.6.p0.18 |
CDH 5 |
Sqoop |
1.4.5+cdh5.3.6+78 |
1.cdh5.3.6.p0.18 |
CDH 5 |
Whirr |
0.9.0+cdh5.3.6+19 |
1.cdh5.3.6.p0.18 |
CDH 5 |
ZooKeeper |
3.4.5+cdh5.3.6+91 |
1.cdh5.3.6.p0.18 |
CDH 5 |
Cloudera Manager Management Daemon |
5.3.6 |
1.cm536.p0.244 |
不适用 |
Java 6 |
JAVA_HOME=/usr/java/jdk1.6.0_31 java version "1.6.0_31" Java(TM) SE Runtime Environment (build 1.6.0_31-b04) Java HotSpot(TM) 64-Bit Server VM (build 20.6-b01, mixed mode) |
不可用 |
不适用 |
Java 7 |
JAVA_HOME=/usr/java/jdk1.7.0_67-cloudera java version "1.7.0_67" Java(TM) SE Runtime Environment (build 1.7.0_67-b01) Java HotSpot(TM) 64-Bit Server VM (build 24.65-b04, mixed mode) |
不可用 |
不适用 |
Cloudera Manager Agent |
5.3.6 |
1.cm536.p0.244.el6 |
不适用 |
六、添加cloudera managerment service
图形界面(略),以下同此要求。
七、添加服务组件
1、安装zookeeper
2、安装hdfs
3、安装yarn
4、安装hive
5、安装hbase
进行相关基准测试。