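This topic describes the monitoring metrics of E-MapReduce.

When you call the CloudMonitor API, you need the Namespace and Period of the current cloud service. The values are as follows: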

  • Namespace: acs_emr
  • Period: 60 seconds by default; an integer multiple of 60 is also allowed.
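
As a rough illustration of the two values above, the following sketch validates a Period and assembles the fields shared by every query. The helper name `base_params` is hypothetical, and sending Period as a string is an assumption about the request format rather than something this page confirms.

```python
NAMESPACE = "acs_emr"   # Namespace for E-MapReduce, as listed above
DEFAULT_PERIOD = 60     # default Period, in seconds


def base_params(period: int = DEFAULT_PERIOD) -> dict:
    """Return the Namespace/Period fields shared by every query (hypothetical helper)."""
    # Period must be 60 or a positive integer multiple of 60.
    if period <= 0 or period % 60 != 0:
        raise ValueError(f"Period must be a positive multiple of 60, got {period}")
    # Assumption: the Period is passed as a string in the request.
    return {"Namespace": NAMESPACE, "Period": str(period)}
```

Rejecting an invalid Period locally gives a clearer error than a failed remote call would.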

The MetricName and Dimensions values of the current cloud service are listed in the following table.

Metric Unit MetricName Dimensions Statistics
Whether the alink nginx port is normal Count ALINKNginxPortOpen userId、clusterId、role、instanceId Average、Maximum、Minimum
Whether the alink server port is normal Count ALINKServerPortOpen userId、clusterId、role、instanceId Average、Maximum、Minimum
Number of jobs in the active state Count ActiveApplications userId、clusterId、role Average
Number of active users Count ActiveUsers userId、clusterId、role Average
Total number of containers allocated Count AggregateContainersAllocated userId、clusterId、role Average
Total number of containers released Count AggregateContainersReleased userId、clusterId、role Average
Number of allocated containers Count AllocatedContainers userId、clusterId、role Average
Number of completed jobs Count AppsCompleted userId、clusterId、role Average
Number of failed jobs Count AppsFailed userId、clusterId、role Average
Number of killed jobs Count AppsKilled userId、clusterId、role Average
Number of pending jobs Count AppsPending userId、clusterId、role Average
Number of running jobs Count AppsRunning userId、clusterId、role Average
Number of submitted jobs Count AppsSubmitted userId、clusterId、role Average
Memory currently available in the current queue MB AvailableMB userId、clusterId、role Average
Number of VCores available in the current queue Count AvailableVCores userId、clusterId、role Average
Total block capacity Count BlockCapacity userId、clusterId、role Average
Average time of blockChecksum operations ms BlockChecksumOpAvgTime userId、clusterId、role Average
Number of blockChecksum operations Count BlockChecksumOpNumOps userId、clusterId、role Average
Average time of blockReport operations ms BlockReportsAvgTime userId、clusterId、role Average
Number of blockReport operations Count BlockReportsNumOps userId、clusterId、role Average
Number of blocks that failed verification Count BlockVerificationFailures userId、clusterId、role Average
Number of blocks read Count BlocksRead userId、clusterId、role Average
Number of blocks removed Count BlocksRemoved userId、clusterId、role Average
Number of blocks replicated Count BlocksReplicated userId、clusterId、role Average
Total number of blocks Count BlocksTotal userId、clusterId、role Average
Number of uncached blocks Count BlocksUncached userId、clusterId、role Average
Number of blocks verified Count BlocksVerified userId、clusterId、role Average
Number of blocks written Count BlocksWritten userId、clusterId、role Average
Number of bytes read Byte BytesRead userId、clusterId、role Average
Number of bytes written Byte BytesWritten userId、clusterId、role Average
RPC call queue length Count CallQueueLength userId、clusterId、role Average
Remaining HDFS storage space Byte CapacityRemaining userId、clusterId、role Average
Total storage space Byte CapacityTotal userId、clusterId、role Average
Used storage space Byte CapacityUsed userId、clusterId、role Average
Storage space used by non-DFS data Byte CapacityUsedNonDFS userId、clusterId、role Average
Number of completed containers Count ContainersCompleted userId、clusterId、role Average
Number of failed containers Count ContainersFailed userId、clusterId、role Average
Number of containers being initialized Count ContainersIniting userId、clusterId、role Average
Number of killed containers Count ContainersKilled userId、clusterId、role Average
Number of launched containers Count ContainersLaunched userId、clusterId、role Average
Number of running containers Count ContainersRunning userId、clusterId、role Average
Average time of CopyBlock operations ms CopyBlockOpAvgTime userId、clusterId、role Average
Number of CopyBlock operations Count CopyBlockOpNumOps userId、clusterId、role Average
Number of corrupt blocks Count CorruptBlocks userId、clusterId、role Average
DataNodeDfsUsedPercent % DataNodeDfsUsedPercent userId、clusterId、role Average、Maximum、Minimum
Availability of the DataNode HTTP port Count DataNodeHttpPortOpen userId、clusterId、role Average
Availability of the DataNode IPC port Count DataNodeIpcPortOpen userId、clusterId、role Average
Availability of the DataNode data port Count DataNodePortOpen userId、clusterId、role Average
Percentage of remaining disk space on /mnt/disk1 % DiskUsageMntDisk1 userId、clusterId、role Average
Percentage of remaining disk space on the root file system % DiskUsageRootfs userId、clusterId、role Average
Workflows submitted bit/s EmrFlowSubmitted userId、instanceId Average
Number of excess blocks Count ExcessBlocks userId、clusterId、role Average
Number of expired heartbeats Count ExpiredHeartbeats userId、clusterId、role Average
Fetch.LocalTimeMs.max Millisecond Fetch.LocalTimeMs.max userId、clusterId、role Average
Fetch.LocalTimeMs.mean Millisecond Fetch.LocalTimeMs.mean userId、clusterId、role Average
Fetch.LocalTimeMs.stddev Millisecond Fetch.LocalTimeMs.stddev userId、clusterId、role Average
Fetch.RemoteTimeMs.max Millisecond Fetch.RemoteTimeMs.max userId、clusterId、role Average
Fetch.RemoteTimeMs.mean Millisecond Fetch.RemoteTimeMs.mean userId、clusterId、role Average
Fetch.RemoteTimeMs.stddev Millisecond Fetch.RemoteTimeMs.stddev userId、clusterId、role Average
Fetch.RequestQueueTimeMs.max Millisecond Fetch.RequestQueueTimeMs.max userId、clusterId、role Average
Fetch.RequestQueueTimeMs.mean Millisecond Fetch.RequestQueueTimeMs.mean userId、clusterId、role Average
Fetch.RequestQueueTimeMs.stddev Millisecond Fetch.RequestQueueTimeMs.stddev userId、clusterId、role Average
Fetch.ResponseQueueTimeMs.max Millisecond Fetch.ResponseQueueTimeMs.max userId、clusterId、role Average
Fetch.ResponseQueueTimeMs.mean Millisecond Fetch.ResponseQueueTimeMs.mean userId、clusterId、role Average
Fetch.ResponseQueueTimeMs.stddev Millisecond Fetch.ResponseQueueTimeMs.stddev userId、clusterId、role Average
Fetch.ResponseSendTimeMs.max Millisecond Fetch.ResponseSendTimeMs.max userId、clusterId、role Average
Fetch.ResponseSendTimeMs.mean Millisecond Fetch.ResponseSendTimeMs.mean userId、clusterId、role Average
Fetch.ResponseSendTimeMs.stddev Millisecond Fetch.ResponseSendTimeMs.stddev userId、clusterId、role Average
Fetch.ThrottleTimeMs.max Millisecond Fetch.ThrottleTimeMs.max userId、clusterId、role Average
Fetch.ThrottleTimeMs.mean Millisecond Fetch.ThrottleTimeMs.mean userId、clusterId、role Average
Fetch.ThrottleTimeMs.stddev Millisecond Fetch.ThrottleTimeMs.stddev userId、clusterId、role Average
Fetch.TotalTimeMs.max Millisecond Fetch.TotalTimeMs.max userId、clusterId、role Average
Fetch.TotalTimeMs.mean Millisecond Fetch.TotalTimeMs.mean userId、clusterId、role Average
Fetch.TotalTimeMs.stddev Millisecond Fetch.TotalTimeMs.stddev userId、clusterId、role Average
(Flink) Task buffer queue length Count FlinkJobTaskBuffersOutputQueueLength userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Duration of the last JobManager job checkpoint Count FlinkJobmanagerJobLastCheckpointDuration userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Time taken to restore the last JobManager checkpoint Count FlinkJobmanagerJobLastCheckpointRestoreTimestamp userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Size of the last JobManager job checkpoint Count FlinkJobmanagerJobLastCheckpointSize userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Number of completed JobManager job checkpoints Count FlinkJobmanagerJobNumberOfCompletedCheckpoints userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Number of failed JobManager job checkpoints Count FlinkJobmanagerJobNumberOfFailedCheckpoints userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Number of in-progress JobManager job checkpoints Count FlinkJobmanagerJobNumberOfInProgressCheckpoints userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Total number of JobManager job checkpoints Count FlinkJobmanagerJobTotalNumberOfCheckpoints userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) JobManager job uptime Count FlinkJobmanagerJobUptime userId、clusterId、hostname、app_id Average、Maximum、Minimum
(Flink) Number of TaskManagers registered with the JobManager Count FlinkJobmanagerNumRegisteredTaskManagers userId、clusterId、hostname、app_id Average、Maximum、Minimum
(Flink) Number of jobs running on the JobManager Count FlinkJobmanagerNumRunningJobs userId、clusterId、hostname、app_id Average、Maximum、Minimum
(Flink) JobManager CPU load Count FlinkJobmanagerStatusJVMCPULoad userId、clusterId、hostname、app_id Average、Maximum、Minimum
(Flink) Number of JobManager JVM threads Count FlinkJobmanagerStatusJVMThreadsCount userId、clusterId、hostname、app_id Average、Maximum、Minimum
(Flink) Number of available JobManager slots Count FlinkJobmanagerTaskSlotsAvailable userId、clusterId、hostname、app_id Average、Maximum、Minimum
(Flink) Total number of JobManager slots Count FlinkJobmanagerTaskSlotsTotal userId、clusterId、hostname、app_id Average、Maximum、Minimum
(Flink) Task buffer inPool usage Count FlinkTaskmanagerJobTaskBuffersInPoolUsage userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Task input buffer queue length Count FlinkTaskmanagerJobTaskBuffersInputQueueLength userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Task buffer outPool usage Count FlinkTaskmanagerJobTaskBuffersOutPoolUsage userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Current input watermark Millisecond FlinkTaskmanagerJobTaskCurrentInputWatermark userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Job task local buffer traffic per second Byte FlinkTaskmanagerJobTaskNumBuffersInLocalPerSecond userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Job task remote buffer traffic per second Byte FlinkTaskmanagerJobTaskNumBuffersInRemotePerSecond userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Job task outbound buffer traffic per second Byte FlinkTaskmanagerJobTaskNumBuffersOutPerSecond userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Job task local traffic per second Byte FlinkTaskmanagerJobTaskNumBytesInLocalPerSecond userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Job task remote traffic per second Byte FlinkTaskmanagerJobTaskNumBytesInRemotePerSecond userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Job task outbound traffic per second Byte FlinkTaskmanagerJobTaskNumBytesOutPerSecond userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Number of task input records Count FlinkTaskmanagerJobTaskNumRecordsIn userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Number of task input records per second Count FlinkTaskmanagerJobTaskNumRecordsInPerSecond userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Number of task output records Count FlinkTaskmanagerJobTaskNumRecordsOut userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Number of task output records per second Count FlinkTaskmanagerJobTaskNumRecordsOutPerSecond userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Current operator input watermark Millisecond FlinkTaskmanagerJobTaskOperatorCurrentInputWatermark userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) Current operator output watermark Millisecond FlinkTaskmanagerJobTaskOperatorCurrentOutputWatermark userId、clusterId、hostname、app_id、job_id Average、Maximum、Minimum
(Flink) TaskManager CPU load Count FlinkTaskmanagerStatusJVMCPULoad userId、clusterId、hostname、app_id Average、Maximum、Minimum
(Flink) Number of TaskManager JVM threads Count FlinkTaskmanagerStatusJVMThreadsCount userId、clusterId、hostname、app_id Average、Maximum、Minimum
Average DataNode flush duration ns FlushNanosAvgTime userId、clusterId、role Average
Number of DataNode flushes ns FlushNanosNumOps userId、clusterId、role Average
Number of DataNode fsync operations Count FsyncCount userId、clusterId、role Average
Total number of GCs Count GcCount userId、clusterId、role Average
Total GC time ms GcTimeMillis userId、clusterId、role Average
Availability of the HMaster HTTP port Count HMasterHttpPortOpen userId、clusterId、role Average
Availability of the HMaster IPC port Count HMasterIpcPortOpen userId、clusterId、role Average
Availability of the HMaster JMX port Count HMasterJmxPortOpen userId、clusterId、role Average
Availability of the HRegionServer HTTP port Count HRegionServerHttpPortOpen userId、clusterId、role Average
Availability of the HRegionServer IPC port Count HRegionServerIpcPortOpen userId、clusterId、role Average
Availability of the HRegionServer JMX port Count HRegionServerJmxPortOpen userId、clusterId、role Average
Availability of the HiveServer2 service port Count HiveServer2PortOpen userId、clusterId、role Average
Availability of the HiveServer2 web UI port Count HiveServer2WebuiPortOpen userId、clusterId、role Average
Availability of the Hue port Count HuePortOpen userId、clusterId、role Average
Availability of the JobHistory service port Count JobHistoryPortOpen userId、clusterId、role Average
Availability of the JobHistory web port Count JobHistoryWebappPortOpen userId、clusterId、role Average
Availability of the JournalNode HTTP port Count JournalNodeHttpPortOpen userId、clusterId、role Average
Availability of the JournalNode RPC port Count JournalNodeRpcPortOpen userId、clusterId、role Average
Kafka Broker bytes in per second Byte/s Kafka.Broker.BytesInPerSec.OneMinuteRate userId、clusterId、hostname Average
Kafka Broker bytes out per second Byte/s Kafka.Broker.BytesOutPerSec.OneMinuteRate userId、clusterId、hostname Average
Kafka Broker failed Fetch requests per second Count/s Kafka.Broker.FailedFetchRequestsPerSec.OneMinuteRate userId、clusterId、hostname Average
Kafka Broker failed Produce requests per second Count/s Kafka.Broker.FailedProduceRequestsPerSec.OneMinuteRate userId、clusterId、hostname Average
Kafka Broker message version conversions per second caused by Fetch Message/s Kafka.Broker.FetchMessageConversionsPerSec.OneMinuteRate userId、clusterId、hostname Average
Kafka Broker G1 old generation GC count Count Kafka.Broker.G1_Old_Generation.CollectionCount userId、clusterId、hostname Average
Kafka Broker G1 old generation GC time ms Kafka.Broker.G1_Old_Generation.CollectionTime userId、clusterId、hostname Average
Kafka Broker G1 young generation GC count ms Kafka.Broker.G1_Young_Generation.CollectionCount userId、clusterId、hostname Average
Kafka Broker G1 young generation GC time ms Kafka.Broker.G1_Young_Generation.CollectionTime userId、clusterId、hostname Average
Kafka Broker ISR expansion rate Count/s Kafka.Broker.IsrExpandsPerSec.OneMinuteRate userId、clusterId、hostname Average
Kafka Broker ISR shrink rate Count/s Kafka.Broker.IsrShrinksPerSec.OneMinuteRate userId、clusterId、hostname Average
Kafka Broker leader count Count Kafka.Broker.LeaderCount userId、clusterId、hostname Average
Kafka Broker storage size in bytes Byte Kafka.Broker.Log.Size userId、clusterId、hostname Average
Kafka Broker log flush rate Count/s Kafka.Broker.LogFlushRateAndTimeMs.Rate userId、clusterid、hostname Average
Kafka Broker log flush time ms Kafka.Broker.LogFlushRateAndTimeMs.TImeMs userId、clusterid、hostname Average
Kafka Broker messages in per second Message/s Kafka.Broker.MessagesInPerSec.OneMinuteRate userId、clusterId、hostname Average
Kafka Broker network processor thread idle ratio Ratio Kafka.Broker.NetworkProcessorAvgIdlePercent userId、clusterId、hostname Average
Kafka Broker number of offline log directories Count Kafka.Broker.OfflineLogDirectoryCount userId、clusterid、hostname Average
Kafka Broker number of offline replicas Count Kafka.Broker.OfflineReplicaCount userId、clusterId、hostname Average
Kafka Broker number of partitions Count Kafka.Broker.PartitionCount userId、clusterId、hostname Average
Kafka Broker message version conversions per second caused by Produce Message/s Kafka.Broker.ProduceMessageConversionsPerSec.OneMinuteRate userId、clusterId、hostname Average
Kafka Broker replication bytes in per second Byte/s Kafka.Broker.ReplicationBytesInPerSec.OneMinuteRate userId、clusterId、hostname Average
Kafka Broker replication bytes out per second Byte/s Kafka.Broker.ReplicationBytesOutPerSec.OneMinuteRate userId、clusterId、hostname Average
Kafka Broker bytes per Fetch request Byte Kafka.Broker.RequestBytes.99thPercentile.Fetch userId、clusterId、hostname Average
Kafka Broker bytes per Produce request Byte Kafka.Broker.RequestBytes.99thPercentile.Produce userId、clusterId、hostname Average
Kafka Broker request handler thread idle ratio Ratio Kafka.Broker.RequestHandlerAvgIdlePercent userId、clusterId、hostname Average
Kafka Broker RequestQueue usage Ratio Kafka.Broker.RequestQueueUsagePercent userId、clusterId、hostname Average
Kafka Broker Fetch request rate Count/s Kafka.Broker.RequestsPerSec.OneMinuteRate.Fetch userId、clusterId、hostname Average
Kafka Broker FetchConsumer request rate Count/s Kafka.Broker.RequestsPerSec.OneMinuteRate.FetchConsumer userId、clusterId、hostname Average
Kafka Broker FetchFollower request rate Count/s Kafka.Broker.RequestsPerSec.OneMinuteRate.FetchFollower userId、clusterId、hostname Average
Kafka Broker Produce request rate Count/s Kafka.Broker.RequestsPerSec.OneMinuteRate.Produce userId、clusterId、hostname Average
Kafka Broker Fetch requests handled per second Count/s Kafka.Broker.TotalFetchRequestsPerSec.OneMinuteRate userId、clusterid、hostname Average
Kafka Broker Produce requests handled per second Count/s Kafka.Broker.TotalProduceRequestsPerSec.OneMinuteRate userId、clusterId、hostname Average
Kafka Broker processing time per Fetch request ms Kafka.Broker.TotalTimeMs.99thPercentile.Fetch userId、clusterId、hostname Average
Kafka Broker processing time per Produce request ms Kafka.Broker.TotalTimeMs.99thPercentile.Produce userId、clusterId、hostname Average
Kafka Broker number of partitions below min ISR Count Kafka.Broker.UnderMinIsrPartitionCount userId、clusterId、hostname Average
Kafka Broker number of under-replicated partitions Count Kafka.Broker.UnderReplicatedPartitions userId、clusterId、hostname Average
Kafka Broker ZooKeeper client disconnect rate Count/s Kafka.Broker.ZooKeeperDisconnectsPerSec userId、clusterId、hostname Average
Kafka Broker ZooKeeper client session expiration rate Count/s Kafka.Broker.ZooKeeperExpiresPerSec userId、clusterId、hostname Average
Kafka Broker maximum disk usage Ratio Kafka.Broker.disk-usages-max-percent userId、clusterId、hostname Average
Kafka Broker mean disk usage Ratio Kafka.Broker.disk-usages-mean-percent userId、clusterId、hostname Average
Kafka Broker minimum disk usage Ratio Kafka.Broker.disk-usages-min-percent userId、clusterId、hostname Average
Kafka Broker standard deviation of disk usage Ratio Kafka.Broker.disk-usages-stddev-percent userId、clusterId、hostname Average
Kafka Cluster number of active controllers Count Kafka.Cluster.KafkaController.ActiveControllerCount userId、clusterId max
Kafka Cluster total number of partitions Count Kafka.Cluster.KafkaController.GlobalPartitionCount userId、clusterId max
Kafka Cluster total number of topics Count Kafka.Cluster.KafkaController.GlobalTopicCount userId、clusterId max
Kafka Cluster number of offline partitions Count Kafka.Cluster.KafkaController.OfflinePartitionsCount userId、clusterId max
Kafka Topic bytes in per second Byte/s Kafka.Topic.BytesInPerSec.OneMinuteRate userId、clusterId、tag_topic Average
Kafka Topic bytes out per second Byte/s Kafka.Topic.BytesOutPerSec.OneMinuteRate userId、clusterId、tag_topic Average
Kafka Topic failed Fetch requests per second Count/s Kafka.Topic.FailedFetchRequestsPerSec.OneMinuteRate userId、clusterId、tag_topic Average
Kafka Topic failed Produce requests per second Count/s Kafka.Topic.FailedProduceRequestsPerSec.OneMinuteRate userId、clusterId、tag_topic Average
Kafka Topic message version conversions per second caused by Fetch Message/s Kafka.Topic.FetchMessageConversionsPerSec.OneMinuteRate userId、clusterId、tag_topic Average
Kafka Topic storage size in bytes Byte Kafka.Topic.Log.Size userId、clusterId、tag_topic Average
Kafka Topic messages in per second Message/s Kafka.Topic.MessagesInPerSec.OneMinuteRate userId、clusterId、tag_topic Average
Kafka Topic message version conversions per second caused by Produce Message/s Kafka.Topic.ProduceMessageConversionsPerSec.OneMinuteRate userId、clusterId、tag_topic Average
Kafka Topic Fetch requests handled per second Count/s Kafka.Topic.TotalFetchRequestsPerSec.OneMinuteRate userId、clusterId、tag_topic Average
Kafka Topic Produce requests handled per second Count/s Kafka.Topic.TotalProduceRequestsPerSec.OneMinuteRate userId、clusterId、tag_topic Average
KafkaActiveControllerCount Count KafkaActiveControllerCount userId、clusterId、role Average
KafkaBrokerTopicMetricsBytesInPerSec1MinuteRate ByteSecond KafkaBrokerTopicMetricsBytesInPerSec1MinuteRate userId、clusterId、role Average
KafkaBrokerTopicMetricsBytesInPerSecCount ByteSecond KafkaBrokerTopicMetricsBytesInPerSecCount userId、clusterId、role Average
KafkaBrokerTopicMetricsBytesOutPerSec1MinuteRate ByteSecond KafkaBrokerTopicMetricsBytesOutPerSec1MinuteRate userId、clusterId、role Average
KafkaBrokerTopicMetricsBytesOutPerSecCount ByteSecond KafkaBrokerTopicMetricsBytesOutPerSecCount userId、clusterId、role Average
KafkaBrokerTopicMetricsBytesRejectedPerSec1MinuteRate ByteSecond KafkaBrokerTopicMetricsBytesRejectedPerSec1MinuteRate userId、clusterId、role Average
KafkaBrokerTopicMetricsBytesRejectedPerSecCount ByteSecond KafkaBrokerTopicMetricsBytesRejectedPerSecCount userId、clusterId、role Average
KafkaBrokerTopicMetricsFailedFetchRequestsPerSec1MinuteRate CountSecond KafkaBrokerTopicMetricsFailedFetchRequestsPerSec1MinuteRate userId、clusterId、role Average
KafkaBrokerTopicMetricsFailedFetchRequestsPerSecCount CountSecond KafkaBrokerTopicMetricsFailedFetchRequestsPerSecCount userId、clusterId、role Average
KafkaBrokerTopicMetricsFailedProduceRequestsPerSec1MinuteRate CountSecond KafkaBrokerTopicMetricsFailedProduceRequestsPerSec1MinuteRate userId、clusterId、role Average
KafkaBrokerTopicMetricsFailedProduceRequestsPerSecCount CountSecond KafkaBrokerTopicMetricsFailedProduceRequestsPerSecCount userId、clusterId、role Average
KafkaBrokerTopicMetricsMessagesInPerSec1MinuteRate CountSecond KafkaBrokerTopicMetricsMessagesInPerSec1MinuteRate userId、clusterId、role Average
KafkaBrokerTopicMetricsMessagesInPerSecCount CountSecond KafkaBrokerTopicMetricsMessagesInPerSecCount userId、clusterId、role Average
KafkaBrokerTopicMetricsTotalFetchRequestsPerSec1MinuteRate CountSecond KafkaBrokerTopicMetricsTotalFetchRequestsPerSec1MinuteRate userId、clusterId、role Average
KafkaBrokerTopicMetricsTotalFetchRequestsPerSecCount CountSecond KafkaBrokerTopicMetricsTotalFetchRequestsPerSecCount userId、clusterId、role Average
KafkaBrokerTopicMetricsTotalProduceRequestsPerSec1MinuteRate CountSecond KafkaBrokerTopicMetricsTotalProduceRequestsPerSec1MinuteRate userId、clusterId、role Average
KafkaBrokerTopicMetricsTotalProduceRequestsPerSecCount CountSecond KafkaBrokerTopicMetricsTotalProduceRequestsPerSecCount userId、clusterId、role Average
KafkaBytesRejectedPerSec1MinuteRate CountSecond KafkaBytesRejectedPerSec1MinuteRate userId、clusterId、role Average
KafkaCleanerIo1MinuteRate CountMinute KafkaCleanerIo1MinuteRate userId、clusterId、role Average
KafkaCleanerIoCount Count KafkaCleanerIoCount userId、clusterId、role Average
KafkaDiskUsagesMaxPercent % KafkaDiskUsagesMaxPercent userId、clusterId、role Average
KafkaDiskUsagesStddevPercent % KafkaDiskUsagesStddevPercent userId、clusterId、role Average
KafkaGroupMetadataManagerNumGroups Count KafkaGroupMetadataManagerNumGroups userId、clusterId、role Average
KafkaGroupMetadataManagerNumOffsets Count KafkaGroupMetadataManagerNumOffsets userId、clusterId、role Average
KafkaIsrExpandsPerSec1MinuteRate CountSecond KafkaIsrExpandsPerSec1MinuteRate userId、clusterId、role Average
KafkaIsrExpandsPerSecCount CountSecond KafkaIsrExpandsPerSecCount userId、clusterId、role Average
KafkaIsrShrinksPerSec1MinuteRate CountSecond KafkaIsrShrinksPerSec1MinuteRate userId、clusterId、role Average
KafkaIsrShrinksPerSecCount CountSecond KafkaIsrShrinksPerSecCount userId、clusterId、role Average
KafkaLeaderElectionRateAndTimeMs1MinuteRate CountMilliSecond KafkaLeaderElectionRateAndTimeMs1MinuteRate userId、clusterId、role Average
KafkaLeaderElectionRateAndTimeMsCount Count KafkaLeaderElectionRateAndTimeMsCount userId、clusterId、role Average
KafkaLeaderElectionRateAndTimeMsMax Count KafkaLeaderElectionRateAndTimeMsMax userId、clusterId、role Average
KafkaLeaderElectionRateAndTimeMsMean Count KafkaLeaderElectionRateAndTimeMsMean userId、clusterId、role Average
KafkaLeaderElectionRateAndTimeMsMeanRate Count KafkaLeaderElectionRateAndTimeMsMeanRate userId、clusterId、role Average
KafkaLeaderElectionRateAndTimeMsMedian Count KafkaLeaderElectionRateAndTimeMsMedian userId、clusterId、role Average
KafkaLeaderElectionRateAndTimeMsMin Count KafkaLeaderElectionRateAndTimeMsMin userId、clusterId、role Average
KafkaLeaderElectionRateAndTimeMsStddev Count KafkaLeaderElectionRateAndTimeMsStddev userId、clusterId、role Average
KafkaLogCleanerMaxBufferUtilPercent % KafkaLogCleanerMaxBufferUtilPercent userId、clusterId、role Average
KafkaLogCleanerMaxCleanTime Second KafkaLogCleanerMaxCleanTime userId、clusterId、role Average
KafkaLogCleanerMaxDirtyPercent % KafkaLogCleanerMaxDirtyPercent userId、clusterId、role Average
KafkaLogCleanerRecopyPercent % KafkaLogCleanerRecopyPercent userId、clusterId、role Average
KafkaLogFlush1MinuteRate Count KafkaLogFlush1MinuteRate userId、clusterId、role Average
KafkaLogFlushCount Count KafkaLogFlushCount userId、clusterId、role Average
KafkaLogFlushRateAndTimeMsMax Count KafkaLogFlushRateAndTimeMsMax userId、clusterId、role Average
KafkaLogFlushRateAndTimeMsMean Count KafkaLogFlushRateAndTimeMsMean userId、clusterId、role Average
KafkaLogFlushRateAndTimeMsMeanRate Count KafkaLogFlushRateAndTimeMsMeanRate userId、clusterId、role Average
KafkaLogFlushRateAndTimeMsMidean Count KafkaLogFlushRateAndTimeMsMidean userId、clusterId、role Average
KafkaLogFlushRateAndTimeMsMin Count KafkaLogFlushRateAndTimeMsMin userId、clusterId、role Average
KafkaLogFlushRateAndTimeMsStddev Count KafkaLogFlushRateAndTimeMsStddev userId、clusterId、role Average
KafkaNetworkProcessorAvgIdlePercent % KafkaNetworkProcessorAvgIdlePercent userId、clusterId、role Average
KafkaOfflineLogDirectoryCount Count KafkaOfflineLogDirectoryCount userId、clusterId、role Average
KafkaOfflinePartitionsCount Count KafkaOfflinePartitionsCount userId、clusterId、role Average
KafkaOfflineReplicaCount Count KafkaOfflineReplicaCount userId、clusterId、role Average
KafkaPreferredReplicaImbalanceCount Count KafkaPreferredReplicaImbalanceCount userId、clusterId、role Average
KafkaReplicaManagerDiskUsageMax % KafkaReplicaManagerDiskUsageMax userId、clusterId、role Average
KafkaReplicaManagerDiskUsageMean % KafkaReplicaManagerDiskUsageMean userId、clusterId、role Average
KafkaReplicaManagerDiskUsageMin % KafkaReplicaManagerDiskUsageMin userId、clusterId、role Average
KafkaReplicaManagerDiskUsageStddev % KafkaReplicaManagerDiskUsageStddev userId、clusterId、role Average
KafkaReplicaManagerLeaderCount Count KafkaReplicaManagerLeaderCount userId、clusterId、role Average
KafkaReplicaManagerPartitionCount Count KafkaReplicaManagerPartitionCount userId、clusterId、role Average
KafkaReplicaManagerUnderReplicatedPartitions Count KafkaReplicaManagerUnderReplicatedPartitions userId、clusterId、role Average
KafkaRequestHandlerAvgIdlePercent1MinuteRate CountSecond KafkaRequestHandlerAvgIdlePercent1MinuteRate userId、clusterId、role Average
KafkaRequestHandlerAvgIdlePercentCount CountSecond KafkaRequestHandlerAvgIdlePercentCount userId、clusterId、role Average
KafkaRequestQueueSize Count KafkaRequestQueueSize userId、clusterId、role Average
KafkaResponseQueueSize Count KafkaResponseQueueSize userId、clusterId、role Average
KafkaUncleanLeaderElectionsPerSec1MinuteRate CountSecond KafkaUncleanLeaderElectionsPerSec1MinuteRate userId、clusterId、role Average
KafkaUncleanLeaderElectionsPerSecCount CountSecond KafkaUncleanLeaderElectionsPerSecCount userId、clusterId、role Average
KafkaUnderReplicatedPartitions Count KafkaUnderReplicatedPartitions userId、clusterId、role Average
KafkaZooKeeperAuthFailuresPerSec1MinuteRate CountSecond KafkaZooKeeperAuthFailuresPerSec1MinuteRate userId、clusterId、role Average
KafkaZooKeeperAuthFailuresPerSecCount CountSecond KafkaZooKeeperAuthFailuresPerSecCount userId、clusterId、role Average
KafkaZooKeeperDisconnectsPerSec1MinuteRat CountSecond KafkaZooKeeperDisconnectsPerSec1MinuteRat userId、clusterId、role Average
KafkaZooKeeperDisconnectsPerSecCount CountSecond KafkaZooKeeperDisconnectsPerSecCount userId、clusterId、role Average
KafkaZooKeeperExpiresPerSec1MinuteRate CountSecond KafkaZooKeeperExpiresPerSec1MinuteRate userId、clusterId、role Average
KafkaZooKeeperExpiresPerSecCount CountSecond KafkaZooKeeperExpiresPerSecCount userId、clusterId、role Average
KafkaZooKeeperReadOnlyConnectsPerSec1MinuteRate CountSecond KafkaZooKeeperReadOnlyConnectsPerSec1MinuteRate userId、clusterId、role Average
KafkaZooKeeperReadOnlyConnectsPerSecCount CountSecond KafkaZooKeeperReadOnlyConnectsPerSecCount userId、clusterId、role Average
KafkaZooKeeperSaslAuthenticationsPerSec1MinuteRate CountSecond KafkaZooKeeperSaslAuthenticationsPerSec1MinuteRate userId、clusterId、role Average
KafkaZooKeeperSaslAuthenticationsPerSecCount CountSecond KafkaZooKeeperSaslAuthenticationsPerSecCount userId、clusterId、role Average
KafkaZooKeeperSyncConnectsPerSec1MinuteRate CountSecond KafkaZooKeeperSyncConnectsPerSec1MinuteRate userId、clusterId、role Average
KafkaZooKeeperSyncConnectsPerSecCount CountSecond KafkaZooKeeperSyncConnectsPerSecCount userId、clusterId、role Average
Highest HDFS capacity usage percentage among all DataNodes % MaxDFSUsedPercent userId、clusterId、role Average
JVM committed heap memory size MB MemHeapCommittedM userId、clusterId、role Average
Maximum JVM heap memory size MB MemHeapMaxM userId、clusterId、role Average
JVM heap memory used MB MemHeapUsedM userId、clusterId、role Average
MemMaxM MB MemMaxM userId、clusterId、role Average
JVM committed non-heap memory size MB MemNonHeapCommittedM userId、clusterId、role Average
Maximum JVM non-heap memory size MB MemNonHeapMaxM userId、clusterId、role Average
Maximum JVM non-heap memory size MB MemNonHeapMaxM_original userId、clusterId、role Average
JVM non-heap memory used MB MemNonHeapUsedM userId、clusterId、role Average
JVM non-heap memory used MB MemNonHeapUsedM_original userId、clusterId、role Average
Percentage of JVM memory used % MemUsedPercent userId、clusterId、role Average
Availability of the HiveMetaStore port Count MetastorePortOpen userId、clusterId、role Average
Number of missing blocks Count MissingBlocks userId、clusterId、role Average
Active/standby state of the NameNode Count NameNodeActive userId、clusterId、role Average
Availability of the NameNode HTTP port Count NameNodeHttpPortOpen userId、clusterId、role Average
Whether the NameNode is in safe mode Count NameNodeInSafeMode userId、clusterId、role Average
Availability of the NameNode IPC port Count NameNodeIpcPortOpen userId、clusterId、role Average
Share of heap memory used by HDFS file and block metadata Ratio NameNode_HeapUsage_Files_Blocks userId、clusterId、role Average
NetworkProcessorAvgIdlePercent % NetworkProcessorAvgIdlePercent userId、clusterId、role Average
Availability of the NodeManager HTTP port Count NodeManagerHttpPortOpen userId、clusterId、role Average
Number of containers allocated by the NodeManager Count NodeManager_AllocatedContainers userId、clusterId、role Average
Number of active NodeManagers Count NumActiveNMs userId、clusterId Average、Maximum、Minimum
Number of dead DataNodes Count NumDeadDataNode userId、clusterId、role Average
Number of decommissioned NodeManagers Count NumDecommissionedNMs userId、clusterId、role Average
Number of open connections Count NumOpenConnections userId、clusterId、role Average
Number of rebooted NodeManagers Count NumRebootedNMs userId、clusterId、role Average
Number of unhealthy NodeManagers Count NumUnhealthyNMs userId、clusterId、role Average
Availability of the Oozie admin port Count OozieAdminPortOpen userId、clusterId、role Average
Availability of the Oozie HTTP port Count OozieHttpPortOpen userId、clusterId、role Average
Maximum average I/O request processing time across all disks Millisecond PartitionMaxAwait userId、clusterId、role Average
Maximum disk utilization across all disks % PartitionMaxUtilization userId、clusterId、role Average
Number of pending containers Count PendingContainers userId、clusterId、role Average
Number of pending DataNode messages Count PendingDataNodeMessageCount userId、clusterId、role Average
Number of blocks pending deletion Count PendingDeletionBlocks userId、clusterId、role Average
Number of blocks pending replication Count PendingReplicationBlocks userId、clusterId、role Average
(HA clusters only) Number of blocks with postponed replication Count PostponedMisreplicatedBlocks userId、clusterId、role Average
Availability of the PrestoMaster HTTP port Count PrestoMasterHttpPortOpen userId、clusterId、role Average
Availability of the PrestoWorker HTTP port Count PrestoWorkerHttpPortOpen userId、clusterId、role Average
Produce.LocalTimeMs.max Millisecond Produce.LocalTimeMs.max userId、clusterId、role Average
Produce.LocalTimeMs.mean Millisecond Produce.LocalTimeMs.mean userId、clusterId、role Average
Produce.LocalTimeMs.stddev Millisecond Produce.LocalTimeMs.stddev userId、clusterId、role Average
Produce.RemoteTimeMs.max Millisecond Produce.RemoteTimeMs.max userId、clusterId、role Average
Produce.RemoteTimeMs.mean Millisecond Produce.RemoteTimeMs.mean userId、clusterId、role Average
Produce.RemoteTimeMs.stddev Millisecond Produce.RemoteTimeMs.stddev userId、clusterId、role Average
Produce.RequestQueueTimeMs.max Millisecond Produce.RequestQueueTimeMs.max userId、clusterId、role Average
Produce.RequestQueueTimeMs.mean Millisecond Produce.RequestQueueTimeMs.mean userId、clusterId、role Average
Produce.RequestQueueTimeMs.stddev Millisecond Produce.RequestQueueTimeMs.stddev userId、clusterId、role Average
Produce.ResponseQueueTimeMs.max Millisecond Produce.ResponseQueueTimeMs.max userId、clusterId、role Average
Produce.ResponseQueueTimeMs.mean Millisecond Produce.ResponseQueueTimeMs.mean userId、clusterId、role Average
Produce.ResponseQueueTimeMs.stddev Millisecond Produce.ResponseQueueTimeMs.stddev userId、clusterId、role Average
Produce.ResponseSendTimeMs.max Millisecond Produce.ResponseSendTimeMs.max userId、clusterId、role Average
Produce.ResponseSendTimeMs.mean Millisecond Produce.ResponseSendTimeMs.mean userId、clusterId、role Average
Produce.ResponseSendTimeMs.stddev Millisecond Produce.ResponseSendTimeMs.stddev userId、clusterId、role Average
Produce.ThrottleTimeMs.max Millisecond Produce.ThrottleTimeMs.max userId、clusterId、role Average
Produce.ThrottleTimeMs.mean Millisecond Produce.ThrottleTimeMs.mean userId、clusterId、role Average
Produce.ThrottleTimeMs.stddev Millisecond Produce.ThrottleTimeMs.stddev userId、clusterId、role Average
Produce.TotalTimeMs.max Millisecond Produce.TotalTimeMs.max userId、clusterId、role Average
Produce.TotalTimeMs.mean Millisecond Produce.TotalTimeMs.mean userId、clusterId、role Average
Produce.TotalTimeMs.stddev Millisecond Produce.TotalTimeMs.stddev userId、clusterId、role Average
Availability of the WebAppProxy port Count ProxyServerPortOpen userId、clusterId、role Average
Average time of ReadBlock operations ms ReadBlockOpAvgTime userId、clusterId、role Average
Number of ReadBlock operations Count ReadBlockOpNumOps userId、clusterId、role Average
Number of bytes received Byte ReceivedBytes userId、clusterId、role Average
Average time of ReplaceBlock operations ms ReplaceBlockOpAvgTime userId、clusterId、role Average
Number of ReplaceBlock operations Count ReplaceBlockOpNumOps userId、clusterId、role Average
RequestHandlerAvgIdlePercent % RequestHandlerAvgIdlePercent userId、clusterId、role Average
RequestQueueUsagePercent % RequestQueueUsagePercent userId、clusterId、role Average
Number of reserved containers Count ReservedContainers userId、clusterId、role Average
ResourceManager active/standby state Count ResourceManagerActive userId、clusterId、role Average
ResourceManager Admin port availability Count ResourceManagerAdminPortOpen userId、clusterId、role Average
ResourceManager service port availability Count ResourceManagerPortOpen userId、clusterId、role Average
ResourceManager ResourceTracker port availability Count ResourceManagerResourcetrackerPortOpen userId、clusterId、role Average
ResourceManager Scheduler port availability Count ResourceManagerSchedulerPortOpen userId、clusterId、role Average
ResourceManager WebApp port availability Count ResourceManagerWebappPortOpen userId、clusterId、role Average
Number of lost NodeManagers Count RevisedNumLostNMs userId、clusterId、role Average
Number of blocks scheduled for replication Count ScheduledReplicationBlocks userId、clusterId、role Average
Bytes sent Byte SentBytes userId、clusterId、role Average
SparkHistoryServer UI port availability Count SparkHistoryServerUiPortOpen userId、clusterId、role Average
StormNimbus Thrift port availability Count StormNimbusThriftPortOpen userId、clusterId、role Average
Storm UI port availability Count StormUiPortOpen userId、clusterId、role Average
Number of blocked threads Count ThreadsBlocked userId、clusterId、role Average
Number of newly created threads Count ThreadsNew userId、clusterId、role Average
Number of runnable threads Count ThreadsRunnable userId、clusterId、role Average
Number of terminated threads Count ThreadsTerminated userId、clusterId、role Average
Number of threads waiting, up to a specified timeout, for another thread to perform an action Count ThreadsTimedWaiting userId、clusterId、role Average
Number of threads waiting indefinitely for another thread to perform a particular action Count ThreadsWaiting userId、clusterId、role Average
ThriftServer Info port availability Count ThriftServerInfoPortOpen userId、clusterId、role Average
ThriftServer JMX port availability Count ThriftServerJmxPortOpen userId、clusterId、role Average
ThriftServer service port availability Count ThriftServerPortOpen userId、clusterId、role Average
TimelineServer service port availability Count TimelineServerPortOpen userId、clusterId、role Average
TimelineServer WebApp port availability Count TimelineServerWebappPortOpen userId、clusterId、role Average
Percentage of total HDFS capacity used in the cluster % TotalDFSUsedPercent userId、clusterId、role Average
Total number of files in HDFS Count TotalFiles userId、clusterId、role Average
Current total number of connections Count TotalLoad userId、clusterId、role Average
Number of under-replicated blocks Count UnderReplicatedBlocks userId、clusterId、role Average
VVPAppManager port availability Count VVPAppManagerPortOpen userId、clusterId、instanceId、role Average、Maximum、Minimum
VVPGateway port availability Count VVPGatewayPortOpen userId、clusterId、role、instanceId Average、Maximum、Minimum
VVP UI port availability Count VVPUiPortOpen userId、clusterId、role、instanceId Average、Maximum、Minimum
Number of failed disks detected by HDFS Count VolumeFailures userId、clusterId、role Average
Average time of WriteBlock operations ms WriteBlockOpAvgTime userId、clusterId、role Average
Number of WriteBlock operations Count WriteBlockOpNumOps userId、clusterId、role Average
Number of failed applications Count YarnAppsFailed userId、clusterId、role Average、Maximum、Minimum
Number of active applications in the root queue Count YarnRootActiveApplications userId、clusterId、role Average、Maximum、Minimum
Number of active users in the root queue Count YarnRootActiveUsers userId、clusterId、role Average、Maximum、Minimum
Number of allocated containers in the root queue Count YarnRootAllocatedContainers userId、clusterId、role Average、Maximum、Minimum
Allocated memory in the root queue MB YarnRootAllocatedMB userId、clusterId、role Average、Maximum、Minimum
Number of allocated vcores in the root queue Count YarnRootAllocatedVCores userId、clusterId、role Average、Maximum、Minimum
Number of completed applications in the root queue Count YarnRootAppsCompleted userId、clusterId、role Average、Maximum、Minimum
Number of killed applications in the root queue Count YarnRootAppsKilled userId、clusterId、role Average、Maximum、Minimum
Number of pending applications in the root queue Count YarnRootAppsPending userId、clusterId、role Average、Maximum、Minimum
Number of running applications in the root queue Count YarnRootAppsRunning userId、clusterId、role Average、Maximum、Minimum
Number of submitted applications in the root queue Count YarnRootAppsSubmitted userId、clusterId、role Average、Maximum、Minimum
Available memory in the root queue MB YarnRootAvailableMB userId、clusterId、role Average、Maximum、Minimum
Number of available vcores in the root queue Count YarnRootAvailableVCores userId、clusterId、role Average、Maximum、Minimum
Ratio of pending containers in the root queue % YarnRootContainerPendingRatio userId、clusterId、role、instanceId Average、Maximum、Minimum
Percentage of available memory in the root queue % YarnRootMemoryAvailablePercentage userId、clusterId、role、instanceId Average、Maximum、Minimum
Number of pending containers in the root queue Count YarnRootPendingContainers userId、clusterId、role Average、Maximum、Minimum
Pending memory in the root queue MB YarnRootPendingMB userId、clusterId、role Average、Maximum、Minimum
Number of pending vcores in the root queue Count YarnRootPendingVCores userId、clusterId、role Average、Maximum、Minimum
Number of reserved containers in the root queue Count YarnRootReservedContainers userId、clusterId、role Average、Maximum、Minimum
Reserved memory in the root queue MB YarnRootReservedMB userId、clusterId、role Average、Maximum、Minimum
Number of reserved vcores in the root queue Count YarnRootReservedVCores userId、clusterId、role Average、Maximum、Minimum
Percentage of available vcores in the root queue % YarnRootVCoreAvailablePercentage userId、clusterId、role、instanceId Average、Maximum、Minimum
Average ZooKeeper processing latency Millisecond ZKAvgLatency userId、clusterId、role Average
ZooKeeper client listening port availability Count ZKClientPortOpen userId、clusterId、role Average
ZKFC port availability Count ZKFCPortOpen userId、clusterId、role Average
Whether the node is the leader of the ZooKeeper cluster Count ZKIsLeader userId、clusterId、role Average
ZKLeader port availability Count ZKLeaderPortOpen userId、clusterId、role Average
ZKPeer port availability Count ZKPeerPortOpen userId、clusterId、role Average
Maximum number of ZooKeeper file descriptors Count ZkMaxFileDescriptorCount userId、clusterId、role Average
Maximum ZooKeeper processing latency Millisecond ZkMaxLatency userId、clusterId、role Average
Minimum ZooKeeper processing latency Millisecond ZkMinLatency userId、clusterId、role Average
Number of alive ZooKeeper connections Count ZkNumAliveConnections userId、clusterId、role Average
Number of open ZooKeeper file descriptors Count ZkOpenFileFescriptorCount userId、clusterId、role Average
Number of queued ZooKeeper requests Count ZkOutstandingRequests userId、clusterId、role Average
Number of packets received by ZooKeeper Count ZkPacketsReceived userId、clusterId、role Average
Number of packets sent by ZooKeeper Count ZkPacketsSent userId、clusterId、role Average
Number of ZooKeeper watches Count ZkWatchCount userId、clusterId、role Average
Number of ZooKeeper znodes Count ZkZnodeCount userId、clusterId、role Average
ZooKeeperAuthFailuresPerSec Count/Second ZooKeeperAuthFailuresPerSec userId、clusterId、role Average
ZooKeeperDisconnectsPerSec Count/Second ZooKeeperDisconnectsPerSec userId、clusterId、role Average
ZooKeeperExpiresPerSec Count/Second ZooKeeperExpiresPerSec userId、clusterId、role Average
ZooKeeperReadOnlyConnectsPerSec Count/Second ZooKeeperReadOnlyConnectsPerSec userId、clusterId、role Average
ZooKeeperSaslAuthenticationsPerSec Count/Second ZooKeeperSaslAuthenticationsPerSec userId、clusterId、role Average
ZooKeeperSyncConnectsPerSec Count/Second ZooKeeperSyncConnectsPerSec userId、clusterId、role Average
Inbound network rate bit/s bytes_in userId、clusterId、role Average
Outbound network rate bit/s bytes_out userId、clusterId、role Average
Percent of time since boot idle CPU % cpu_aidle userId、clusterId、role Average
CPU idle percentage % cpu_idle userId、clusterId、role Average
Percent CPU interrupt % cpu_intr userId、clusterId、role Average
Percent CPU nice % cpu_nice userId、clusterId、role Average
Percent CPU soft interrupt % cpu_sintr userId、clusterId、role Average
System-mode CPU utilization % cpu_system userId、clusterId、role Average
User-mode CPU utilization % cpu_user userId、clusterId、role Average
Percent CPU wait io % cpu_wio userId、clusterId、role Average
Free disk capacity Byte disk_free userId、clusterId、role Average
Total disk capacity Byte disk_total userId、clusterId、role Average
Number of regions in transition Count hbase_hmaster_assignment_ritCount userId、clusterId、hostname Average
Number of regions in transition for longer than the threshold (60s by default) Count hbase_hmaster_assignment_ritCountOverThreshold userId、clusterId、hostname Average
Longest time a region has been in transition ms hbase_hmaster_assignment_ritOldestAge userId、clusterId、hostname Average
Number of dead region servers Count hbase_hmaster_server_numDeadRegionServers userId、clusterId Average
Number of live region servers Count hbase_hmaster_server_numRegionServers userId、clusterId Average
Number of active region server RPC handlers Count hbase_regionserver_ipc_numActiveHandler userId、clusterId、hostname Average
Region server RPC queue length Count hbase_regionserver_ipc_numCallsInGeneralQueue userId、clusterId、hostname Average
Region server replication queue length Count hbase_regionserver_ipc_numCallsInReplicationQueue userId、clusterId、hostname Average
Number of region server RPC connections Count hbase_regionserver_ipc_numOpenConnections userId、clusterId、hostname Average
99th percentile latency of region server Append operations ms hbase_regionserver_server_Append_99th_percentile userId、clusterId、hostname Average
Average time of region server Append operations ms hbase_regionserver_server_Append_mean userId、clusterId、hostname Average
Number of region server Append operations Count hbase_regionserver_server_Append_num_ops userId、clusterId、hostname Average
99th percentile latency of region server Delete operations ms hbase_regionserver_server_Delete_99th_percentile userId、clusterId、hostname Average
Average time of region server Delete operations ms hbase_regionserver_server_Delete_mean userId、clusterId、hostname Average
Number of region server Delete operations Count hbase_regionserver_server_Delete_num_ops userId、clusterId、hostname Average
99th percentile latency of region server Get operations ms hbase_regionserver_server_Get_99th_percentile userId、clusterId、hostname Average
Average time of region server Get operations ms hbase_regionserver_server_Get_mean userId、clusterId、hostname Average
Number of region server Get operations Count hbase_regionserver_server_Get_num_ops userId、clusterId、hostname Average
99th percentile latency of region server Increment operations ms hbase_regionserver_server_Increment_99th_percentile userId、clusterId、hostname Average
Average time of region server Increment operations ms hbase_regionserver_server_Increment_mean userId、clusterId、hostname Average
Number of region server Increment operations Count hbase_regionserver_server_Increment_num_ops userId、clusterId、hostname Average
99th percentile latency of region server Put operations ms hbase_regionserver_server_Put_99th_percentile userId、clusterId、hostname Average
Average time of region server Put operations ms hbase_regionserver_server_Put_mean userId、clusterId、hostname Average
Number of region server Put operations Count hbase_regionserver_server_Put_num_ops userId、clusterId、hostname Average
Region server compaction queue length Count hbase_regionserver_server_compactionQueueLength userId、clusterId、hostname Average
Region server flush queue length Count hbase_regionserver_server_flushQueueLength userId、clusterId、hostname Average
Number of HLog files managed by the region server Count hbase_regionserver_server_hlogFileCount userId、clusterId、hostname Average
Number of region server write operations flagged to bypass the WAL Count hbase_regionserver_server_mutationsWithoutWALCount userId、clusterId、hostname Average
Number of region server read requests Count hbase_regionserver_server_readRequestCount userId、clusterId、hostname Average
Number of regions hosted by the region server Count hbase_regionserver_server_regionCount userId、clusterId、hostname Average
Number of slow region server Append operations Count hbase_regionserver_server_slowAppendCount userId、clusterId、hostname Average
Number of slow region server Delete operations Count hbase_regionserver_server_slowDeleteCount userId、clusterId、hostname Average
Number of slow region server Get operations Count hbase_regionserver_server_slowGetCount userId、clusterId、hostname Average
Number of slow region server Increment operations Count hbase_regionserver_server_slowIncrementCount userId、clusterId、hostname Average
Number of slow region server Put operations Count hbase_regionserver_server_slowPutCount userId、clusterId、hostname Average
Number of files managed by the region server Count hbase_regionserver_server_storeFileCount userId、clusterId、hostname Average
Size of the files managed by the region server Count hbase_regionserver_server_storeFileSize userId、clusterId、hostname Average
Total number of region server requests Count hbase_regionserver_server_totalRequestCount userId、clusterId、hostname Average
Time that region server updates have been blocked Count hbase_regionserver_server_updatesBlockedTime userId、clusterId、hostname Average
Number of region server write requests Count hbase_regionserver_server_writeRequestCount userId、clusterId、hostname Average
DataNode GC time Count hdfs_datanode_jvm_GcTimeMillis userId、clusterId、hostname Average
DataNode RPC call queue backlog length Count hdfs_datanode_rpc_activity_CallQueueLength userId、clusterId、hostname Average
JournalNode GC time ms hdfs_journalnode_jvm_GcTimeMillis userId、clusterId、hostname Average
JournalNode RPC call queue backlog length Count hdfs_journalnode_rpc_activity_CallQueueLength userId、clusterId、hostname Average
NameNode remaining capacity GB hdfs_namenode_fsnamesystem_CapacityRemainingGB userId、clusterId、hostname Average
NameNode corrupt block count Count hdfs_namenode_fsnamesystem_CorruptBlocks userId、clusterId、hostname Average
Number of DataNode-to-NameNode heartbeat timeouts Count hdfs_namenode_fsnamesystem_ExpiredHeartbeats userId、clusterId、hostname Average
NameNode missing block count Count hdfs_namenode_fsnamesystem_MissingBlocks userId、clusterId、hostname Average
NameNode count of decommissioned DataNodes that are offline Count hdfs_namenode_fsnamesystem_NumDecomDeadDataNodes userId、clusterId、hostname Average
NameNode count of decommissioned DataNodes that are online Count hdfs_namenode_fsnamesystem_NumDecomLiveDataNodes userId、clusterId、hostname Average
NameNode count of DataNodes marked stale because of delayed heartbeats (by default, more than three heartbeat intervals) Count hdfs_namenode_fsnamesystem_StaleDataNodes userId、clusterId、hostname Average
NameNode under-replicated block count Count hdfs_namenode_fsnamesystem_UnderReplicatedBlocks userId、clusterId、hostname Average
NameNode total count of failed storage volumes Count hdfs_namenode_fsnamesystem_VolumeFailuresTotal userId、clusterId、hostname Average
Time of the last NameNode HA transition ms hdfs_namenode_ha_transition_time userId、clusterId、hostname Average
NameNode GC count Count hdfs_namenode_jvm_GcCount userId、clusterId、hostname Average
NameNode GC time ms hdfs_namenode_jvm_GcTimeMillis userId、clusterId、hostname Average
Estimated NameNode memory watermark MB hdfs_namenode_memory_threshold userId、clusterId、hostname Average
NameNode start time ms hdfs_namenode_namenodeinfo_NNStartedTimeInMillis userId、clusterId、hostname Average
NameNode safe mode time ms hdfs_namenode_rpc_SafeModeTime userId、clusterId、hostname Average
NameNode RPC call queue backlog length Count hdfs_namenode_rpc_activity_CallQueueLength userId、clusterId、hostname Average
NameNode RPC client port call queue backlog length Count hdfs_namenode_rpc_client_activity_CallQueueLength userId、clusterId、hostname Average
NameNode safe mode status Count hdfs_namenode_safemode_status userId、clusterId、hostname Average
NameNode status (-1: unknown state or process down, 0: standby, 1: active) Count hdfs_namenode_status userId、clusterId、hostname Average
Active metastore create table requests μs hive_active_calls_api_create_table userId、clusterId、hostname Average
Active metastore drop table requests μs hive_active_calls_api_drop_table userId、clusterId、hostname Average
Metastore alter table requests μs hive_api_alter_table userId、clusterId、hostname Average
Metastore alter table requests (with env context) μs hive_api_alter_table_with_environment_context userId、clusterId、hostname Average
Metastore create table requests μs hive_api_create_table userId、clusterId、hostname Average
Metastore create table requests (with env context) μs hive_api_create_table_with_environment_context userId、clusterId、hostname Average
drop table requests μs hive_api_drop_table userId、clusterId、hostname Average
drop table requests (with env context) μs hive_api_drop_table_with_environment_context userId、clusterId、hostname Average
get all databases requests μs hive_api_get_all_databases userId、clusterId、hostname Average
get all functions requests μs hive_api_get_all_functions userId、clusterId、hostname Average
get database requests μs hive_api_get_database userId、clusterId、hostname Average
get databases requests μs hive_api_get_databases userId、clusterId、hostname Average
get multi table requests μs hive_api_get_multi_table userId、clusterId、hostname Average
get table requests μs hive_api_get_table userId、clusterId、hostname Average
get table objects by name requests μs hive_api_get_table_objects_by_name_req userId、clusterId、hostname Average
get table req requests μs hive_api_get_table_req userId、clusterId、hostname Average
get table statistics requests μs hive_api_get_table_statistics_req userId、clusterId、hostname Average
get tables requests μs hive_api_get_tables userId、clusterId、hostname Average
get tables by type requests μs hive_api_get_tables_by_type userId、clusterId、hostname Average
Pending HiveServer2 (hs2) SQL operations Count hive_metrics_api_hs2_sql_operation_PENDING userId、clusterId、hostname Average
Number of failed Kudu data directories Count kudu_data_dirs_failed userId、clusterId、hostname Average
Number of Kudu data directories in the Full state Count kudu_data_dirs_full userId、clusterId、hostname Average
15-minute load average Count load_fifteen userId、clusterId、role Average
5-minute load average Count load_five userId、clusterId、role Average
1-minute load average Count load_one userId、clusterId、role Average
Amount of buffered memory KB mem_buffers userId、clusterId、role Average
Amount of cached memory KB mem_cached userId、clusterId、role Average
Free memory KB mem_free userId、clusterId、role Average
Amount of shared memory KB mem_shared userId、clusterId、role Average
Total memory KB mem_total userId、clusterId、role Average
Percentage of memory used % mem_used_percent userId、clusterId、role Average
Maximum percent used for all partitions % part_max_used userId、clusterId、role Average
Inbound packet rate Count/Second pkts_in userId、clusterId、role Average
Outbound packet rate Count/Second pkts_out userId、clusterId、role Average
Presto ClusterMemoryManager: total number of queries that leaked cluster memory Count presto_ClusterMemoryManager_NumberOfLeakedQueries userId、clusterId Average
Presto ClusterMemoryManager: total number of queries killed due to OOM Count presto_ClusterMemoryManager_QueriesKilledDueToOutOfMemory userId、clusterId Average
Presto ClusterMemoryPool: number of blocked nodes in the cluster Count presto_ClusterMemoryPool_name_general_BlockedNodes userId、clusterId Average
Presto ClusterMemoryPool: free memory in the cluster memory pool Byte presto_ClusterMemoryPool_name_general_FreeDistributedBytes userId、clusterId Average
Presto ClusterMemoryPool: number of nodes in the cluster memory pool Count presto_ClusterMemoryPool_name_general_Nodes userId、clusterId Average
Presto JVM maximum memory Byte presto_Memory_HeapMemoryUsage_max userId、clusterId、hostname Average
Presto non-heap memory used Byte presto_Memory_NonHeapMemoryUsage_used userId、clusterId、hostname Average
Presto QueryManager: CPU time consumed by query processing per minute Second presto_QueryManager_ConsumedCpuTimeSecs_OneMinute_Count userId、clusterId、hostname Average
Presto QueryManager: failed queries caused by external errors per minute Count presto_QueryManager_ExternalFailures_OneMinute_Count userId、clusterId、hostname Average
Presto QueryManager: failed queries per minute Count presto_QueryManager_FailedQueries_OneMinute_Count userId、clusterId、hostname Average
Presto QueryManager: failed queries caused by internal service errors per minute Count presto_QueryManager_InternalFailures_OneMinute_Count userId、clusterId、hostname Average
Presto total number of queued queries Count presto_QueryManager_QueuedQueries userId、clusterId、hostname Average
Presto QueryManager: average query queue time per minute ms presto_QueryManager_QueuedTime_OneMinute_Avg userId、clusterId、hostname Average
Presto QueryManager: failed queries caused by user errors per minute Count presto_QueryManager_UserErrorFailures_OneMinute_Count userId、clusterId、hostname Average
Number of running processes Count proc_run userId、clusterId、role Average
Total number of processes Count proc_total userId、clusterId、role Average
Number of blocked processes Count procs_blocked userId、clusterId、role Average
Number of processes/threads created Count procs_created userId、clusterId、role Average
Amount of available swap memory KB swap_free userId、clusterId、role Average
Total amount of swap space displayed in KBs KB swap_total userId、clusterId、role Average
ResourceManager: number of live NodeManager nodes Count yarn_cluster_NumActiveNMs userId、clusterId、hostname Average
ResourceManager: number of decommissioned NodeManager nodes Count yarn_cluster_NumDecommissionedNMs userId、clusterId、hostname Average
ResourceManager: number of NodeManager nodes being decommissioned Count yarn_cluster_NumDecommissioningNMs userId、clusterId、hostname Average
ResourceManager: number of lost NodeManager nodes Count yarn_cluster_NumLostNMs userId、clusterId、hostname Average
ResourceManager: number of rebooted NodeManager nodes Count yarn_cluster_NumRebootedNMs userId、clusterId、hostname Average
ResourceManager: number of shut-down NodeManager nodes Count yarn_cluster_NumShutdownNMs userId、clusterId、hostname Average
ResourceManager: number of unhealthy NodeManager nodes Count yarn_cluster_NumUnhealthyNMs userId、clusterId、hostname Average
Number of rebooted nodes in the cluster Count yarn_cluster_rebootedNodes userId、clusterId、hostname Average
Memory reserved for scheduling in the cluster MB yarn_cluster_reservedMB userId、clusterId、hostname Average
Number of virtual cores reserved for scheduling in the cluster Count yarn_cluster_reservedVirtualCores userId、clusterId、hostname Average
Number of shut-down nodes in the cluster Count yarn_cluster_shutdownNodes userId、clusterId、hostname Average
Total cluster memory MB yarn_cluster_totalMB userId、clusterId、hostname Average
Total number of cluster nodes Count yarn_cluster_totalNodes userId、clusterId、hostname Average
Total number of cluster virtual cores Count yarn_cluster_totalVirtualCores userId、clusterId、hostname Average
Number of unhealthy nodes in the cluster Count yarn_cluster_unhealthyNodes userId、clusterId、hostname Average
JobHistory JVM GC count Count yarn_jobhistory_jvm_GcCount userId、clusterId、hostname Average
JobHistory JVM GC time ms yarn_jobhistory_jvm_GcTimeMillis userId、clusterId、hostname Average
NodeManager available virtual cores Count yarn_nodemanager_availableVirtualCores userId、clusterId、hostname Average
NodeManager available memory MB yarn_nodemanager_availMemoryMB userId、clusterId、hostname Average
NodeManager failed disk count Count yarn_nodemanager_BadLocalDirs userId、clusterId、hostname Average
NodeManager completed container count Count yarn_nodemanager_ContainersCompleted userId、clusterId、hostname Average
NodeManager failed container count Count yarn_nodemanager_ContainersFailed userId、clusterId、hostname Average
NodeManager initializing container count Count yarn_nodemanager_ContainersIniting userId、clusterId、hostname Average
NodeManager killed container count Count yarn_nodemanager_ContainersKilled userId、clusterId、hostname Average
NodeManager launched container count Count yarn_nodemanager_ContainersLaunched userId、clusterId、hostname Average
NodeManager running container count Count yarn_nodemanager_ContainersRunning userId、clusterId、hostname Average
NodeManager disk utilization Percent yarn_nodemanager_GoodLocalDirsDiskUtilizationPerc userId、clusterId、hostname Average
NodeManager JVM GC count Count yarn_nodemanager_jvm_GcCount userId、clusterId、hostname Average
NodeManager JVM GC time ms yarn_nodemanager_jvm_GcTimeMillis userId、clusterId、hostname Average
Time of the most recent NodeManager health status update Count yarn_nodemanager_lastHealthUpdate userId、clusterId、hostname Average
Number of containers running on the NodeManager Count yarn_nodemanager_numContainers userId、clusterId、hostname Average
NodeManager shuffle connection count Count yarn_nodemanager_shuffle_ShuffleConnections userId、clusterId、hostname Average
NodeManager failed shuffle output count Count yarn_nodemanager_shuffle_ShuffleOutputsFailed userId、clusterId、hostname Average
NodeManager successful shuffle output count Count yarn_nodemanager_shuffle_ShuffleOutputsOK userId、clusterId、hostname Average
NodeManager memory used MB yarn_nodemanager_usedMemoryMB userId、clusterId、hostname Average
NodeManager virtual cores used Count yarn_nodemanager_usedVirtualCores userId、clusterId、hostname Average
ResourceManager JVM GC count Count yarn_resourcemanager_jvm_GcCount userId、clusterId、hostname Average
ResourceManager JVM GC time ms yarn_resourcemanager_jvm_GcTimeMillis userId、clusterId、hostname Average
ResourceManager capacity scheduler queue: capacity used relative to the whole cluster Percent yarn_resourcemanager_queue_absoluteUsedCapacity userId、clusterId、queueName Average
ResourceManager scheduler queue: cumulative containers allocated Count yarn_resourcemanager_queue_AggregateContainersAllocated userId、clusterId、queueName Average
ResourceManager scheduler queue: cumulative containers released Count yarn_resourcemanager_queue_AggregateContainersReleased userId、clusterId、queueName Average
ResourceManager scheduler queue: allocated containers Count yarn_resourcemanager_queue_AllocatedContainers userId、clusterId、queueName Average
ResourceManager scheduler queue: allocated memory MB yarn_resourcemanager_queue_AllocatedMB userId、clusterId、queueName Average
ResourceManager scheduler queue: allocated virtual cores Count yarn_resourcemanager_queue_AllocatedVCores userId、clusterId、queueName Average
ResourceManager capacity scheduler queue: memory used by application masters MB yarn_resourcemanager_queue_amResourceUsed_memory userId、clusterId、queueName Average
ResourceManager capacity scheduler queue: virtual cores used by application masters Count yarn_resourcemanager_queue_amResourceUsed_vCores userId、clusterId、queueName Average
ResourceManager fair scheduler queue: memory used by application masters MB yarn_resourcemanager_queue_amUsedResources_memory userId、clusterId、queueName Average
ResourceManager fair scheduler queue: virtual cores used by application masters Count yarn_resourcemanager_queue_amUsedResources_vCores userId、clusterId、queueName Average
ResourceManager scheduler queue: completed applications Count yarn_resourcemanager_queue_AppsCompleted userId、clusterId、queueName Average
ResourceManager scheduler queue: failed applications Count yarn_resourcemanager_queue_AppsFailed userId、clusterId、queueName Average
ResourceManager scheduler queue: killed applications Count yarn_resourcemanager_queue_AppsKilled userId、clusterId、queueName Average
ResourceManager scheduler queue: pending applications Count yarn_resourcemanager_queue_AppsPending userId、clusterId、queueName Average
ResourceManager scheduler queue: running applications Count yarn_resourcemanager_queue_AppsRunning userId、clusterId、queueName Average
ResourceManager scheduler queue: submitted applications Count yarn_resourcemanager_queue_AppsSubmitted userId、clusterId、queueName Average
ResourceManager scheduler queue: available memory MB yarn_resourcemanager_queue_AvailableMB userId、clusterId、queueName Average
ResourceManager scheduler queue: available virtual cores Count yarn_resourcemanager_queue_AvailableVCores userId、clusterId、queueName Average
ResourceManager capacity scheduler queue: memory used MB yarn_resourcemanager_queue_memoryUsed userId、clusterId、queueName Average
ResourceManager capacity scheduler queue: running applications Count yarn_resourcemanager_queue_numActiveApplications userId、clusterId、queueName Average
ResourceManager fair scheduler queue: running applications Count yarn_resourcemanager_queue_numActiveApps userId、clusterId、queueName Average
ResourceManager capacity scheduler queue: applications in the scheduler Count yarn_resourcemanager_queue_numApplications userId、clusterId、queueName Average
ResourceManager capacity scheduler queue: containers Count yarn_resourcemanager_queue_numContainers userId、clusterId、queueName Average
ResourceManager capacity scheduler queue: applications pending scheduling Count yarn_resourcemanager_queue_numPendingApplications userId、clusterId、queueName Average
ResourceManager fair scheduler queue: applications pending scheduling Count yarn_resourcemanager_queue_numPendingApps userId、clusterId、queueName Average
ResourceManager scheduler queue: containers pending scheduling Count yarn_resourcemanager_queue_PendingContainers userId、clusterId、queueName Average
ResourceManager scheduler queue: memory pending scheduling MB yarn_resourcemanager_queue_PendingMB userId、clusterId、queueName Average
ResourceManager scheduler queue: virtual cores pending scheduling Count yarn_resourcemanager_queue_PendingVCores userId、clusterId、queueName Average
ResourceManager scheduler queue: reserved containers Count yarn_resourcemanager_queue_ReservedContainers userId、clusterId、queueName Average
ResourceManager scheduler queue: reserved memory MB yarn_resourcemanager_queue_ReservedMB userId、clusterId、queueName Average
ResourceManager fair scheduler queue: reserved memory MB yarn_resourcemanager_queue_reservedResources_memory userId、clusterId、queueName Average
ResourceManager fair scheduler queue: reserved virtual cores Count yarn_resourcemanager_queue_reservedResources_vCores userId、clusterId、queueName Average
ResourceManager scheduler queue: reserved virtual cores Count yarn_resourcemanager_queue_ReservedVCores userId、clusterId、queueName Average
ResourceManager capacity scheduler queue: capacity used relative to the parent queue Percent yarn_resourcemanager_queue_usedCapacity userId、clusterId、queueName Average
ResourceManager fair scheduler queue: memory used MB yarn_resourcemanager_queue_usedResources_memory userId、clusterId、queueName Average
ResourceManager fair scheduler queue: virtual cores used Count yarn_resourcemanager_queue_usedResources_vCores userId、clusterId、queueName Average
ResourceManager capacity scheduler queue: virtual cores used Count yarn_resourcemanager_queue_vCoresUsed userId、clusterId、queueName Average
ResourceManager RPC call queue backlog length Count yarn_resourcemanager_rpc_CallQueueLength userId、clusterId、hostname Average
ResourceManager RPC connection count Count yarn_resourcemanager_rpc_NumOpenConnections userId、clusterId、hostname Average
ResourceManager average RPC queue processing time ms yarn_resourcemanager_rpc_RpcProcessingTimeAvgTime userId、clusterId、hostname Average
Timeline server JVM GC count Count yarn_timelineserver_jvm_GcCount userId、clusterId、hostname Average
Timeline server JVM GC time ms yarn_timelineserver_jvm_GcTimeMillis userId、clusterId、hostname Average
Number of timeline server GetDomain operations Count yarn_timeline_GetDomainOps userId、clusterId、hostname Average
Number of timeline server batch GetDomains operations Count yarn_timeline_GetDomainsOps userId、clusterId、hostname Average
Average time of timeline server batch GetDomains operations ms yarn_timeline_GetDomainsTimeAvgTime userId、clusterId、hostname Average
Average time of timeline server GetDomain operations ms yarn_timeline_GetDomainTimeAvgTime userId、clusterId、hostname Average
Number of timeline server batch GetEntities operations Count yarn_timeline_GetEntitiesOps userId、clusterId、hostname Average
Average time of timeline server batch GetEntities operations ms yarn_timeline_GetEntitiesTimeAvgTime userId、clusterId、hostname Average
Number of timeline server GetEntity operations Count yarn_timeline_GetEntityOps userId、clusterId、hostname Average
Average time of timeline server GetEntity operations ms yarn_timeline_GetEntityTimeAvgTime userId、clusterId、hostname Average
Number of timeline server batch GetEvents operations Count yarn_timeline_GetEventsOps userId、clusterId、hostname Average
Average time of timeline server batch GetEvents operations ms yarn_timeline_GetEventsTimeAvgTime userId、clusterId、hostname Average
Number of timeline server batch PostEntities operations Count yarn_timeline_PostEntitiesOps userId、clusterId、hostname Average
Average time of timeline server batch PostEntities operations ms yarn_timeline_PostEntitiesTimeAvgTime userId、clusterId、hostname Average
Number of timeline server PutDomain operations Count yarn_timeline_PutDomainOps userId、clusterId、hostname Average
Average time of timeline server PutDomain operations ms yarn_timeline_PutDomainTimeAvgTime userId、clusterId、hostname Average
Zeppelin port availability Count ZeppelinPortOpen userId、clusterId、role Average
Average ZooKeeper request latency ms zk_avg_latency userId、clusterId、hostname Average
ZooKeeper node status (-1: node unavailable, 0: follower, 1: leader) Count zk_node_status userId、clusterId、hostname Average
Number of alive ZooKeeper connections Count zk_num_alive_connections userId、clusterId、hostname Average
Number of queued ZooKeeper requests Count zk_outstanding_requests userId、clusterId、hostname Average
Number of synced ZooKeeper followers Count zk_synced_followers userId、clusterId、hostname Average
Number of ZooKeeper znodes Count zk_znode_count userId、clusterId、hostname Average
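
The following is a minimal sketch of how the values in this table might be assembled into query parameters for a CloudMonitor metric query. It only builds the parameter dictionary and does not call any SDK. The helper name `build_metric_query`, the sample dimension values, and the encoding of `Dimensions` as a JSON array string are assumptions for illustration; check the CloudMonitor API reference for the exact request format.

```python
import json

# Namespace for E-MapReduce, as stated at the top of this topic.
NAMESPACE = "acs_emr"


def build_metric_query(metric_name, dimensions, period=60):
    """Build a parameter dict for a metric query (hypothetical helper).

    metric_name: a MetricName from the table, e.g. "yarn_cluster_NumActiveNMs".
    dimensions:  a dict whose keys match the Dimensions column for that metric.
    period:      statistical period in seconds; 60 or an integer multiple of 60.
    """
    if period <= 0 or period % 60 != 0:
        raise ValueError("Period must be a positive integer multiple of 60")
    return {
        "Namespace": NAMESPACE,
        "MetricName": metric_name,
        "Period": str(period),
        # Assumption: dimensions are passed as a JSON array of objects.
        "Dimensions": json.dumps([dimensions], separators=(",", ":")),
    }


# Sample values below are placeholders, not real identifiers.
params = build_metric_query(
    "yarn_cluster_NumActiveNMs",
    {"userId": "1234567890", "clusterId": "C-EXAMPLE", "hostname": "emr-header-1"},
)
print(params)
```

The dimension keys passed in should match the Dimensions column of the chosen metric: for example, queue-level metrics use `queueName`, while most host-level metrics use `hostname` or `role`.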