spark on yarn
来源:互联网 发布:花生壳 该域名被锁定 编辑:程序博客网 时间:2024/06/10 03:47
安装 hadoop
环境变量:
export HADOOP_HOME=/home/spark/app/hadoop-2.4.1export HADOOP_CONF_DIR=$HADOOP_HOME/etc/hadoopexport YARN_HOME=/home/spark/app/hadoop-2.4.1export YARN_CONF_DIR=$YARN_HOME/etc/hadoopexport PATH=$PATH:$JAVA_HOME/bin:$JRE_HOME/bin:$HADOOP_HOME/sbin:$HADOOP_HOME/bin
配置:
hadoop-env.sh: export JAVA_HOME=/usr/lib/jvm/jdk1.8yarn-env.sh: export JAVA_HOME=/usr/lib/jvm/jdk1.8hdfs-site.xml:<property><name>dfs.support.append</name><value>true</value></property><property><name>dfs.replication</name><value>1</value></property>mapred-site.xml:<property><name>mapreduce.framework.name</name><value>yarn</value></property>yarn-env.sh:<property><name>yarn.nodemanager.aux-services</name><value>mapreduce_shuffle</value></property><property> <name>yarn.nodemanager.aux-services.mapreduce.shuffle.class</name> <value>org.apache.hadoop.mapred.ShuffleHandler</value></property><property> <name>yarn.resourcemanager.webapp.address</name> <value>spark02:8088</value></property>core-site.xml <property><name>fs.defaultFS</name><value>hdfs://spark02:9000</value></property><property><name>hadoop.tmp.dir</name><value>/home/spark/app/hadoop-2.4.1/tmp</value></property>
spark:
环境变量:
export SPARK_HOME=/home/spark/app/spark-2.0.0-bin-hadoop2.4/export PATH=$PATH:$SPARK_HOME/bin:$SPARK_HOME/sbin
配置:
文件spark-env.sh :export JAVA_HOME=/usr/lib/jvm/jdk1.8export SCALA_HOME=/home/spark/app/scala-2.11.11/export HADOOP_CONF_DIR=/home/spark/app/hadoop-2.4.1/etc/hadoop/export SPARK_PID_DIR=/home/spark/app/spark-2.0.0-bin-hadoop2.4/pidtmpexport SPARK_MASTER_IP=spark02export SPARK_MASTER_PORT=7077 export SPARK_MASTER_WEBUI_PORT=8080 export SPARK_WORKER_CORES=1 #cpu核心数量 export SPARK_WORKER_MEMORY=1024m #生产中要改大,默认1G export SPARK_WORKER_PORT=7078 export SPARK_WORKER_WEBUI_PORT=8081 export SPARK_WORKER_INSTANCES=1export YARN_CONF_DIR=/home/spark/app/hadoop-2.4.1/etc/hadoop/export SPARK_LIBARY_PATH=.:$JAVA_HOME/lib:$JAVA_HOME/jre/lib:$HADOOP_HOME/lib/native#not found sc in spack-shellexport SPARK_LOCAL_IP=spark02
hadoop:
bin/start-all.sh
spark:
bin/start-all.sh
master:http://ip:8080worker:http://ip:8081task:http://ip:4040yarn资源管理界面:http://ip:8088hdfs:http://ip:50070启动历时服务器$mr-jobhistory-daemon.sh start historyserver http://ip:19888
test:
./bin/spark-submit –class org.apache.spark.examples.SparkPi –master yarn –deploy-mode cluster –driver-memory 1G –executor-memory 1G –executor-cores 1 lib/spark-examples-1.6.1-hadoop2.6.0.jar 40
阅读全文
0 0
- Spark on Yarn部署
- Spark on Yarn
- spark on yarn
- spark on yarn
- Spark on Yarn简介
- spark on yarn
- Spark on YARN 部署
- spark on yarn 配置
- spark on yarn
- spark on yarn 安装
- Spark on Yarn 图
- Spark on yarn
- 源码-Spark on Yarn
- Launching Spark on YARN
- Spark On Yarn 知识点
- Spark 2.0 On Yarn
- spark on yarn
- Spark Executor on YARN
- 特斯拉机器学习专家谷俊丽加盟小鹏汽车 负责自动驾驶 | 行业
- 三星电子投资中国AI创企深鉴科技 S9或用上AI芯片 | 聚焦
- 机器学习中距离和相似性度量方法
- 文章标题
- C语言:递归
- spark on yarn
- LINUX目录和文件各自的权限说明,以及目录和文件权限之间的关系(应用:配置linux下上传图片的存储目录)
- Python学习手册
- MVC模式与三层架构
- 188. Best Time to Buy and Sell Stock IV
- 特斯拉首次正面回应在华建厂事宜!别高兴太早,独资建厂的特斯拉给不了你白菜价
- ICCV2017 | 一文详解GAN之父Ian Goodfellow 演讲《生成对抗网络的原理与应用》(附完整PPT)
- 资源 | 最新机器学习必备十大入门算法!都在这里了
- 观点 | 转行人士如何在人工智能领域保持一定的竞争力?