The page has been translated by Gen AI.

ServiceWatch Metrics

Kubernetes Engine sends metrics to ServiceWatch. The metrics provided as basic monitoring are data collected at 1-minute intervals.

Note
For information on how to check metrics in ServiceWatch, refer to the ServiceWatch guide.

Basic Metrics

The following are basic metrics for the Kubernetes Engine namespace.

Metrics with metric names shown in bold below are key metrics selected among the basic metrics provided by Kubernetes Engine. Key metrics are used to configure service dashboards that are automatically built for each service in ServiceWatch.

For each metric, the user guide describes which statistical value is meaningful when querying that metric, and the statistical value shown in bold among the meaningful statistics is the key statistic. You can query key metrics through key statistics in the service dashboard.

Metric NameDetailed DescriptionUnitMeaningful Statistics
cluster_upCluster upCount
  • Sum
  • Average
  • Maximum
  • Minimum
cluster_node_countCluster node countCount
  • Sum
  • Average
  • Maximum
  • Minimum
cluster_failed_node_countCluster failed node countCount
  • Sum
  • Average
  • Maximum
  • Minimum
cluster_namespace_phase_countCluster namespace phase countCount
  • Sum
  • Average
  • Maximum
  • Minimum
cluster_pod_phase_countCluster pod phase countCount
  • Sum
  • Average
  • Maximum
  • Minimum
node_cpu_allocatableNode CPU allocatable-
  • Sum
  • Average
  • Maximum
  • Minimum
node_cpu_capacityNode CPU capacity-
  • Sum
  • Average
  • Maximum
  • Minimum
node_cpu_usageNode CPU usage-
  • Sum
  • Average
  • Maximum
  • Minimum
node_cpu_utilizationNode CPU utilization-
  • Sum
  • Average
  • Maximum
  • Minimum
node_memory_allocatableNode memory allocatableBytes
  • Sum
  • Average
  • Maximum
  • Minimum
node_memory_capacityNode memory capacityBytes
  • Sum
  • Average
  • Maximum
  • Minimum
node_memory_usageNode memory usageBytes
  • Sum
  • Average
  • Maximum
  • Minimum
node_memory_utilizationNode memory utilization-
  • Sum
  • Average
  • Maximum
  • Minimum
node_network_rx_bytesNode network receive bytesBytes/Second
  • Sum
  • Average
  • Maximum
  • Minimum
node_network_tx_bytesNode network transmit bytesBytes/Second
  • Sum
  • Average
  • Maximum
  • Minimum
node_network_total_bytesNode network total bytesBytes/Second
  • Sum
  • Average
  • Maximum
  • Minimum
node_number_of_running_podsNode number of running podsCount
  • Sum
  • Average
  • Maximum
  • Minimum
namespace_number_of_running_podsNamespace number of running podsCount
  • Sum
  • Average
  • Maximum
  • Minimum
namespace_deployment_pod_countNamespace deployment pod countCount
  • Sum
  • Average
  • Maximum
  • Minimum
namespace_statefulset_pod_countNamespace statefulset pod countCount
  • Sum
  • Average
  • Maximum
  • Minimum
namespace_daemonset_pod_countNamespace daemonset pod countCount
  • Sum
  • Average
  • Maximum
  • Minimum
namespace_job_active_countNamespace job active countCount
  • Sum
  • Average
  • Maximum
  • Minimum
namespace_cronjob_active_countNamespace cronjob active countCount
  • Sum
  • Average
  • Maximum
  • Minimum
pod_cpu_usagePod CPU usage-
  • Sum
  • Average
  • Maximum
  • Minimum
pod_memory_usagePod memory usageBytes
  • Sum
  • Average
  • Maximum
  • Minimum
pod_network_rx_bytesPod network receive bytesBytes/Second
  • Sum
  • Average
  • Maximum
  • Minimum
pod_network_tx_bytesPod network transmit bytesBytes/Second
  • Sum
  • Average
  • Maximum
  • Minimum
pod_network_total_bytesPod network total bytesCount
  • Sum
  • Average
  • Maximum
  • Minimum
container_cpu_usageContainer CPU usage-
  • Sum
  • Average
  • Maximum
  • Minimum
container_cpu_limitContainer CPU limit-
  • Sum
  • Average
  • Maximum
  • Minimum
container_cpu_utilizationContainer CPU utilization-
  • Sum
  • Average
  • Maximum
  • Minimum
container_memory_usageContainer memory usageBytes
  • Sum
  • Average
  • Maximum
  • Minimum
container_memory_limitContainer memory limitBytes
  • Sum
  • Average
  • Maximum
  • Minimum
container_memory_utilizationContainer memory utilization-
  • Sum
  • Average
  • Maximum
  • Minimum
node_gpu_countNode GPU countCount
  • Sum
  • Average
  • Maximum
  • Minimum
gpu_tempGPU temperature-
  • Sum
  • Average
  • Maximum
  • Minimum
gpu_power_usageGPU power usage-
  • Sum
  • Average
  • Maximum
  • Minimum
gpu_utilGPU utilizationPercent
  • Sum
  • Average
  • Maximum
  • Minimum
gpu_sm_clockGPU SM clock-
  • Sum
  • Average
  • Maximum
  • Minimum
gpu_fb_usedGPU FB usageMegabytes
  • Sum
  • Average
  • Maximum
  • Minimum
gpu_tensor_activeGPU tensor active rate-
  • Sum
  • Average
  • Maximum
  • Minimum
pod_gpu_utilPod GPU utilizationPercent
  • Sum
  • Average
  • Maximum
  • Minimum
pod_gpu_tensor_activePod GPU tensor active rate-
  • Sum
  • Average
  • Maximum
  • Minimum
Table. Kubernetes Engine Basic Metrics
Monitoring Metrics
How-to guides