Problems Running a Spark Cluster
Source: Internet  Editor: 程序博客网  Time: 2024/06/05 08:01
Spark outputs too many warning messages
WARN Executor: 2 block locks were not released by TID =
Lock release errors occur frequently in executor logs
Cause: If there are any releasedLocks (after calling BlockManager.releaseAllLocksForTask earlier) and spark.storage.exceptionOnPinLeak is enabled (it is not by default) with no exception having been thrown while the task was running, a SparkException is thrown:
[releasedLocks] block locks were not released by TID = [taskId]:
[releasedLocks separated by comma]
Otherwise, if spark.storage.exceptionOnPinLeak is disabled or an exception was thrown by the task, the following WARN message is displayed in the logs instead:
WARN Executor: [releasedLocks] block locks were not released by TID = [taskId]:
[releasedLocks separated by comma]
Note: If there are any releasedLocks, they lead to either a SparkException or a WARN message in the logs.
[jaceklaskowski/mastering-apache-spark-book/spark-executor-taskrunner.adoc]
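The decision described above can be modelled in a few lines of Python. This is only an illustrative sketch of the logic as quoted, not Spark's actual (Scala) implementation: the function name `report_leaked_locks`, its parameters, and the stand-in `SparkException` class are hypothetical, and printing the number of leaked locks at the start of the message is an assumption based on the sample log line ("2 block locks were not released ...").

```python
class SparkException(Exception):
    """Hypothetical stand-in for Spark's exception type (illustration only)."""


def report_leaked_locks(released_locks, task_id,
                        exception_on_pin_leak=False, task_threw=False):
    """Model what happens to locks still held after a task finishes.

    released_locks        -- block ids that had to be force-released
    exception_on_pin_leak -- models spark.storage.exceptionOnPinLeak (off by default)
    task_threw            -- True if the task itself already failed with an exception
    """
    if not released_locks:
        # Nothing leaked: no exception and no WARN line.
        return None

    message = (
        f"{len(released_locks)} block locks were not released "
        f"by TID = {task_id}:\n" + ", ".join(released_locks)
    )

    if exception_on_pin_leak and not task_threw:
        # Strict mode, and the task otherwise succeeded: fail loudly.
        raise SparkException(message)

    # Default path: only a WARN line in the executor log.
    return "WARN Executor: " + message
```

With the default settings, `report_leaked_locks(["rdd_1_0", "rdd_1_1"], 7)` returns the WARN text instead of raising, which matches the behaviour seen in the logs above.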
mapWithState causes block lock warning?
The warning was added by SPARK-12757: Add block-level read/write locks to BlockManager.
[connectedComponents() raises lots of warnings that say "block locks were not released by TID = ..."]
[Lock release errors occur frequently in executor logs]
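Solution: I finally found the fix while debugging the logs.
When trimming Spark's output earlier (see [Spark安装和配置], the Spark installation and configuration post), I had edited $SPARK_HOME/conf/log4j.properties.template so that only WARN messages were printed. Even after changing that setting to ERROR, the level was still automatically reset to WARN, but this time one extra hint appeared: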
Setting default log level to "WARN". To adjust logging level use sc.setLogLevel(newLevel).
That hint pointed to the solution:
Following it, adding sc.setLogLevel('ERROR') to the code solves the problem!
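A minimal PySpark sketch of that fix follows. The helper name `quiet_spark_logs` and the `demo` function are hypothetical, introduced only for illustration; the only Spark calls used are `SparkContext.getOrCreate()`, `sc.setLogLevel(...)`, `parallelize`, `sum`, and `stop`. The set of accepted level names follows the values documented for `setLogLevel`.

```python
# Level names accepted by SparkContext.setLogLevel.
VALID_LEVELS = {"ALL", "DEBUG", "ERROR", "FATAL", "INFO", "OFF", "TRACE", "WARN"}


def quiet_spark_logs(sc, level="ERROR"):
    """Set the log level on an existing SparkContext and return the level used."""
    level = level.upper()
    if level not in VALID_LEVELS:
        raise ValueError(f"unknown log level: {level!r}")
    # Overrides the level chosen at startup, so messages below `level`
    # (for example the block-lock WARN lines) are no longer printed.
    sc.setLogLevel(level)
    return level


def demo():
    """Example usage; requires PySpark to be installed."""
    from pyspark import SparkContext

    sc = SparkContext.getOrCreate()
    quiet_spark_logs(sc, "ERROR")  # call this before running any jobs
    print(sc.parallelize(range(10)).sum())
    sc.stop()
```

Note that this only hides the WARN lines; the underlying lock-release behaviour is unchanged.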
from: http://blog.csdn.net/pipisorry/article/details/52916307