YARN routine maintenance: NM health status is false
Source: Internet  Editor: 程序博客网  Time: 2024/06/01 08:11
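Recently I noticed that the node count shown on the YARN cluster UI had dropped below the usual 2. That was puzzling, because it had never been a problem before.
So I started investigating. The UI showed that the NM on machine 206 had been lost. I restarted the NM on 206, then checked the NM log on 206 and the RM log on 207.
The logs showed nothing wrong: the NM reported that it had registered with machine 207, and 207 reported that it had received the registration from 206. That left me stumped and more than a little annoyed.
After fumbling around for several hours, I noticed something on the 206 NM's UI: NodeHealthyStatus was false, and it also showed "data log bad".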
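The health flag seen on the NM UI is also exposed by the ResourceManager's REST API (the Cluster Nodes endpoint, /ws/v1/cluster/nodes), so it can be checked without clicking through each node. The following is a minimal sketch, not the method used in this post: it only parses a response body, and the sample payload, the node names, and the health report text are made up for illustration. Against a real cluster the payload would come from the RM web address (port 8088 by default).

```python
import json


def unhealthy_nodes(payload):
    """Return (node id, health report) pairs for nodes in state UNHEALTHY.

    `payload` is the decoded JSON body of the RM's /ws/v1/cluster/nodes
    endpoint. The RM may return {"nodes": null} when it knows no nodes,
    so missing or null levels are treated as an empty list.
    """
    nodes = (payload.get("nodes") or {}).get("node") or []
    return [
        (node.get("id"), node.get("healthReport", ""))
        for node in nodes
        if node.get("state") == "UNHEALTHY"
    ]


# Hypothetical sample response: host names, ports and report text are
# illustrative only and were not taken from the cluster in this post.
SAMPLE = json.loads("""
{
  "nodes": {
    "node": [
      {"id": "host206:45454", "state": "UNHEALTHY",
       "healthReport": "1/1 local-dirs are bad: /data/yarn/local"},
      {"id": "host207:45454", "state": "RUNNING", "healthReport": ""}
    ]
  }
}
""")

if __name__ == "__main__":
    for node_id, report in unhealthy_nodes(SAMPLE):
        print(node_id, "->", report)
```

A node whose NM registers successfully but fails the health check shows up here as UNHEALTHY, which matches the symptom above: registration looks fine in both logs, yet the node is not counted as active.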
So I googled the message and found this Stack Overflow question:
https://stackoverflow.com/questions/29131449/why-does-hadoop-report-unhealthy-node-local-dirs-and-log-dirs-are-bad
The most common cause of "local-dirs are bad" is disk utilization on the node exceeding YARN's max-disk-utilization-per-disk-percentage, which defaults to 90.0%.
Either clean up the disk that the unhealthy node is running on, or increase the threshold in yarn-site.xml
<property> <name>yarn.nodemanager.disk-health-checker.max-disk-utilization-per-disk-percentage</name> <value>98.5</value></property>
Avoid disabling the disk check, because your jobs may fail when the disk eventually runs out of space, or if there are permission issues. Refer to the yarn-site.xml Disk Checker section for more details.
I chose to raise the threshold first.
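Before changing the threshold it helps to know how far over the limit each disk actually is. The sketch below approximates the comparison described in the quoted answer: it computes the used percentage of the filesystem behind each directory and compares it with a cutoff. It is an assumption-laden illustration, not YARN's own code: the function names are hypothetical, the strict "greater than" comparison is my reading of the check, and the directories to pass in are whatever yarn.nodemanager.local-dirs and yarn.nodemanager.log-dirs are set to on the node.

```python
import shutil

# Default of max-disk-utilization-per-disk-percentage, per the quoted answer.
DEFAULT_MAX_PERCENT = 90.0


def used_percent(total_bytes, free_bytes):
    """Used share of a filesystem, in percent, derived from free space."""
    return 100.0 - 100.0 * free_bytes / total_bytes


def is_over_threshold(total_bytes, free_bytes, max_percent=DEFAULT_MAX_PERCENT):
    """True when utilization is above the cutoff (assumed strict comparison)."""
    return used_percent(total_bytes, free_bytes) > max_percent


def check_dirs(paths, max_percent=DEFAULT_MAX_PERCENT):
    """Map each existing path to (used percent, over-threshold flag)."""
    results = {}
    for path in paths:
        usage = shutil.disk_usage(path)  # named tuple: total, used, free
        results[path] = (
            used_percent(usage.total, usage.free),
            is_over_threshold(usage.total, usage.free, max_percent),
        )
    return results


if __name__ == "__main__":
    # "/" is a placeholder; substitute the node's local-dirs and log-dirs.
    for path, (percent, over) in check_dirs(["/"]).items():
        status = "OVER" if over else "ok"
        print(f"{path}: {percent:.1f}% used [{status}]")
```

With the cutoff raised to 98.5 as in the property above, a disk at 92% used would no longer be flagged, but it is still close to full, so cleaning up the disk remains the longer-term fix.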