大数据平台监控指标整理
来源:互联网 发布:视频字幕特效软件 编辑:程序博客网 时间:2024/04/30 05:52
hadoop metrics2
监控的内容:
1. yarn
2. jvm
3. rpc
4. rpcdetailed
5. metricssystem
6. mapred
7. dfs
8. ugi
已经提供的:
Source : org.apache.hadoop.metrics2.source.JvmMertics
和org.apache.hadoop.metrics2.source.JvmMetricsInfo
其他相关
FSOpDurations : org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FSOpDurations
fairscheduler-op-durations context :
@Metric("Duration for a continuous scheduling run") MutableRate continuousSchedulingRun; @Metric("Duration to handle a node update") MutableRate nodeUpdateCall; @Metric("Duration for a update thread run") MutableRate updateThreadRun; @Metric("Duration for an update call") MutableRate updateCall; @Metric("Duration for a preempt call") MutableRate preemptCall;
QueueMetrics : org.apache.hadoop.yarn.server.resourcemanager.scheduler.QueueMetrics
yarn context :
@Metric("# of apps submitted") MutableCounterInt appsSubmitted; @Metric("# of running apps") MutableGaugeInt appsRunning; @Metric("# of pending apps") MutableGaugeInt appsPending; @Metric("# of apps completed") MutableCounterInt appsCompleted; @Metric("# of apps killed") MutableCounterInt appsKilled; @Metric("# of apps failed") MutableCounterInt appsFailed; @Metric("Allocated memory in MB") MutableGaugeInt allocatedMB; @Metric("Allocated CPU in virtual cores") MutableGaugeInt allocatedVCores; @Metric("# of allocated containers") MutableGaugeInt allocatedContainers; @Metric("Aggregate # of allocated containers") MutableCounterLong aggregateContainersAllocated; @Metric("Aggregate # of released containers") MutableCounterLong aggregateContainersReleased; @Metric("Available memory in MB") MutableGaugeInt availableMB; @Metric("Available CPU in virtual cores") MutableGaugeInt availableVCores; @Metric("Pending memory allocation in MB") MutableGaugeInt pendingMB; @Metric("Pending CPU allocation in virtual cores") MutableGaugeInt pendingVCores; @Metric("# of pending containers") MutableGaugeInt pendingContainers; @Metric("# of reserved memory in MB") MutableGaugeInt reservedMB; @Metric("Reserved CPU in virtual cores") MutableGaugeInt reservedVCores; @Metric("# of reserved containers") MutableGaugeInt reservedContainers; @Metric("# of active users") MutableGaugeInt activeUsers; @Metric("# of active applications") MutableGaugeInt activeApplications;
FSQueueMetrics : org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FSQueueMetrics
yarn context :
@Metric("Fair share of memory in MB") MutableGaugeInt fairShareMB; @Metric("Fair share of CPU in vcores") MutableGaugeInt fairShareVCores; @Metric("Steady fair share of memory in MB") MutableGaugeInt steadyFairShareMB; @Metric("Steady fair share of CPU in vcores") MutableGaugeInt steadyFairShareVCores; @Metric("Minimum share of memory in MB") MutableGaugeInt minShareMB; @Metric("Minimum share of CPU in vcores") MutableGaugeInt minShareVCores; @Metric("Maximum share of memory in MB") MutableGaugeInt maxShareMB; @Metric("Maximum share of CPU in vcores") MutableGaugeInt maxShareVCores;
MetricsSystemImpl : org.apache.hadoop.metrics2.impl.MetricsSystemImpl
metricssystem context :
@Metric({"Snapshot", "Snapshot stats"}) MutableStat snapshotStat; @Metric({"Publish", "Publishing stats"}) MutableStat publishStat; @Metric("Dropped updates by all sinks") MutableCounterLong droppedPubAll;
Sink : org.apache.hadoop.metrics2.sink.GraphiteSink
和org.apache.hadoop.metrics2.sink.FileSink
以及org.apache.hadoop.metrics2.sink.AbstractGangliaSink
metricsSystem : org.apache.hadoop.metrics2.lib.DefaultMetricsSystem
自我实现:
- Source :
org.apache.hadoop.metrics2.MetricsSource
- Sink :
org.apache.hadoop.metrics2.MetricsSink
- MetricsSystem :
org.apache.hadoop.metrics2.MetricsSystem
使用方式:
$HADOOP_HOME/etc/hadoop/hadoop-metrics2.properties
中配置就可以
spark metrics
可以获取到的内容主要有:
1. master
2. applications
3. worker
4. executor
5. driver
实现的Sink有
ConsoleSink : org.apache.spark.metrics.sink.ConsoleSink
CSVSink : org.apache.spark.metrics.sink.CSVSink
JmxSink : org.apache.spark.metrics.sink.JmxSink
MetricsServlet : org.apache.spark.metrics.sink.MetricsServlet
GraphiteSink : org.apache.spark.metrics.sink.GraphiteSink
Slf4jSink : org.apache.spark.metrics.sink.Slf4jSink
实现的Source有
JvmSource : org.apache.spark.metrics.source.JvmSource
ApplicationSource : org.apache.spark.deploy.master.ApplicationSource
- status
- runtime_ms
- cores
BlockManagerSource : org.apache.spark.storage.BlockManagerSource
- maxMem_MB
- remainingMem_MB
- memUsed_MB
- diskSpaceUsed_MB
DAGSchedulerSource : org.apache.spark.scheduler.DAGSchedulerSource
- failedStages
- runningStages
- waitingStages
- allJobs
- activeJobs
ExecutorAllocationManagerSource : package org.apache.spark.ExecutorAllocationManagerSource
- numberExecutorsToAdd
- numberExecutorsPendingToRemove
- numberAllExecutors
- numberTargetExecutors
- numberMaxNeededExecutors
ExecutorSource : org.apache.spark.executor.ExecutorSource
- activeTasks
- completeTasks
- currentPool_size
- maxPool_size
- read_bytes
- write_bytes
- read_ops
- largeRead_ops
- write_ops
MasterSource : org.apache.spark.deploy.master.MasterSource
- workers
- aliveWorkers
- apps
- waitingApps
MesosClusterSchedulerSource : org.apache.spark.scheduler.cluster.mesos.MesosClusterSchedulerSource
- waitingDrivers
- launchedDrivers
- retryDrivers
WorkerSource : org.apache.spark.deploy.worker.WorkerSource
- executors
- coresUsed
- memUsed_MB
- coresFree
- memFree_MB
0 0
- 大数据平台监控指标整理
- 大数据平台架构收集和整理
- 监控和大库测试指标
- MySQL:性能监控 4 大指标
- 大数据平台任务调度与监控系统
- LR监控指标数据分析(转载)
- 监控指标数据采集和展示
- 大数据环境平台仓库日常跑批整理
- 大数据环境平台日志日常跑批整理
- MySQL 性能监控4大指标——第一部分
- MySQL 性能监控4大指标——第二部分
- MySQL 性能监控4大指标——第一部分
- MySQL 性能监控4大指标——第二部分
- 大数据-游戏运营的指标分析
- 集群监控--监控指标
- 大数据平台监控(一):Ganglia在集群中快速安装方案
- 大数据平台监控(二):Ganglia与Nagios的整合
- 大数据平台监控(二):Ganglia与Nagios的整合
- 随便写写
- 3、使用JaxWs开发Web Service
- C++中const大杂烩
- 1006. Team Rankings
- 图片,文字等控件的上下抖动或左右晃动
- 大数据平台监控指标整理
- 某一整数的质数因子
- AR实时阴影制作
- Spring整合Mybatis
- 约瑟夫环问题
- 提高项目22-成绩处理函数版 (参数)
- Ubantu下的输入法--小巧有用
- 如何在Java IDE中使用selenium
- 1007. To and Fro