Recompiling the Spark source so that CDH supports Spark SQL

Source: Internet  Published: sql语句创建视图  Editor: 程序博客网  Time: 2024/05/14 15:24

1. Edit the $MAVEN_HOME/bin/mvn file and add the following setting:

MAVEN_OPTS="-Xmx2g -XX:MaxPermSize=512M -XX:ReservedCodeCacheSize=512m"
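As an alternative to editing the mvn script, the same options can be exported in the shell before the build, since the mvn launcher reads MAVEN_OPTS from the environment. A minimal sketch, assuming a bash-like shell; on Java 8 or later the JVM ignores -XX:MaxPermSize and prints a warning, so that flag only matters on Java 7:

```shell
# Export the same JVM options for the current shell session
# instead of editing $MAVEN_HOME/bin/mvn.
export MAVEN_OPTS="-Xmx2g -XX:MaxPermSize=512M -XX:ReservedCodeCacheSize=512m"

# Print the value so it can be checked before the build starts.
echo "MAVEN_OPTS=$MAVEN_OPTS"
```

The export only lasts for the current session, so it has to be repeated (or added to a shell profile) if the build is run later from a new terminal.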

2. Run the mvn command:

mvn -Pyarn -Phadoop-2.6 -Dhadoop.version=2.6.0-cdh5.8.3 -Dscala-2.10.5 -Phive -Phive-thriftserver -DskipTests install

Screenshot of the successful build:



3. Copy the jar:

cp spark-1.6.0/assembly/target/scala-2.10/spark-assembly-1.6.0-hadoop2.6.0-cdh5.8.3.jar /opt/cloudera/parcels/CDH-5.8.3-1.cdh5.8.3.p0.2/jars
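The copy above assumes the build actually produced the assembly jar. A sketch of a guard that could be run before it; check_assembly is a hypothetical helper name, and the jar path is the one used in the cp command:

```shell
# check_assembly (hypothetical helper): succeeds only if the assembly jar
# exists under the given Spark source directory and is non-empty.
check_assembly() {
  assembly_jar="$1/assembly/target/scala-2.10/spark-assembly-1.6.0-hadoop2.6.0-cdh5.8.3.jar"
  if [ -s "$assembly_jar" ]; then
    echo "found: $assembly_jar"
  else
    echo "missing or empty: $assembly_jar" >&2
    return 1
  fi
}
```

Called as `check_assembly spark-1.6.0`, it returns a non-zero status when the jar is absent, so it can be chained with `&&` in front of the cp command.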

4. Update the jar symlinks (in /opt/cloudera/parcels/CDH/lib/spark/lib):

ln -s ../../../jars/spark-assembly-1.6.0-hadoop2.6.0-cdh5.8.3.jar spark-assembly-1.6.0-cdh5.8.3-hadoop2.6.0-cdh5.8.3.jar
ln -s spark-assembly-1.6.0-cdh5.8.3-hadoop2.6.0-cdh5.8.3.jar spark-assembly.jar
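The relative link targets are easy to get wrong, so it may help to rehearse the two-link chain in a throwaway directory first. The sketch below assumes a scratch layout that mirrors the parcel (jars/ and lib/spark/lib/ under one root) and uses an empty placeholder file in place of the real jar; it uses ln -sfn so that an existing link is replaced rather than causing an error, which is a change from the plain ln -s above:

```shell
# Rehearse the symlink chain in a scratch directory (hypothetical layout,
# not the real /opt/cloudera parcel).
ROOT="$(mktemp -d)"
JAR="spark-assembly-1.6.0-hadoop2.6.0-cdh5.8.3.jar"
LINK="spark-assembly-1.6.0-cdh5.8.3-hadoop2.6.0-cdh5.8.3.jar"
LIB="$ROOT/lib/spark/lib"

mkdir -p "$ROOT/jars" "$LIB"
: > "$ROOT/jars/$JAR"   # empty placeholder standing in for the rebuilt assembly

# First link: versioned name -> jar in the shared jars directory.
# The relative target is resolved from the directory that holds the link.
ln -sfn "../../../jars/$JAR" "$LIB/$LINK"

# Second link: spark-assembly.jar -> versioned name.
ln -sfn "$LINK" "$LIB/spark-assembly.jar"

# Resolve the whole chain; it should end at the file under jars/.
RESOLVED="$(readlink -f "$LIB/spark-assembly.jar")"
echo "$RESOLVED"
```

If the printed path ends in jars/ followed by the assembly jar name, the same relative targets should resolve correctly in the real lib directory.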

5. Copy the jar to HDFS:

hdfs dfs -put /opt/cloudera/parcels/CDH/jars/spark-assembly-1.6.0-hadoop2.6.0-cdh5.8.3.jar /user/spark/lib

List the jar to confirm the upload:

[root@cdh1 lib]# hdfs dfs -ls /user/spark/lib
Found 1 items
-rwxr-xr-x   3 hdfs spark  192854141 2016-12-28 13:54 /user/spark/lib/spark-assembly-1.6.0-hadoop2.6.0-cdh5.8.3.jar

6. Copy the spark-sql script:

cp spark-1.6.0/bin/spark-sql /opt/cloudera/parcels/CDH/lib/spark/bin

7. Configure CM (Cloudera Manager):



