| GPU | GPU Count | cnt | 1m | Number of GPUs |
| GPU | GPU Memory Usage | % | 1m | GPU memory usage rate |
| GPU | GPU Memory Used | bytes | 1m | Amount of GPU memory in use |
| GPU | GPU Temperature | ℃ | 1m | GPU temperature |
| GPU | GPU Usage | % | 1m | Total GPU usage rate, summed across GPUs (for example, 8 GPUs all at 100% report 800%) |
| GPU | GPU Usage [Avg] | % | 1m | Average GPU usage rate (%) |
| GPU | GPU Power Cap | W | 1m | Maximum power capacity of the GPU |
| GPU | GPU Power Usage | W | 1m | Current power usage of the GPU |
| GPU | GPU Memory Usage [Avg] | % | 1m | Average GPU memory usage rate |
| GPU | GPU Count in use | cnt | 1m | Number of GPUs in use by jobs running on the node |
| GPU | Execution State for nvidia-smi | state | 1m | Execution result of the nvidia-smi command |
| CPU | Core Usage [IO Wait] | % | 1m | CPU time spent in wait state (disk wait) |
| CPU | Core Usage [System] | % | 1m | CPU time spent in kernel space |
| CPU | Core Usage [User] | % | 1m | CPU time spent in user space |
| CPU | CPU Cores | cnt | 1m | Number of CPU cores on the host. Unnormalized usage ratios already reflect this value, so their maximum is 100% × the number of cores. |
| CPU | CPU Usage [Active] | % | 1m | CPU time used, excluding idle and IOWait states (unnormalized: 4 cores all at 100% report 400%) |
| CPU | CPU Usage [Idle] | % | 1m | CPU time spent in idle state |
| CPU | CPU Usage [IO Wait] | % | 1m | CPU time spent in wait state (disk wait) |
| CPU | CPU Usage [System] | % | 1m | CPU time used by the kernel (unnormalized: 4 cores all at 100% report 400%) |
| CPU | CPU Usage [User] | % | 1m | CPU time used in user space (unnormalized: 4 cores all at 100% report 400%) |
| CPU | CPU Usage/Core [Active] | % | 1m | CPU time used, excluding idle and IOWait states (normalized by the number of cores: 4 cores all at 100% report 100%) |
| CPU | CPU Usage/Core [Idle] | % | 1m | CPU time spent in idle state |
| CPU | CPU Usage/Core [IO Wait] | % | 1m | CPU time spent in wait state (disk wait) |
| CPU | CPU Usage/Core [System] | % | 1m | CPU time used by the kernel (normalized by the number of cores: 4 cores all at 100% report 100%) |
| CPU | CPU Usage/Core [User] | % | 1m | CPU time used in user space (normalized by the number of cores: 4 cores all at 100% report 100%) |
| Disk | Disk CPU Usage [IO Request] | % | 1m | The ratio of CPU time spent executing I/O requests for the device (device bandwidth utilization). If this value is close to 100%, the device is in a saturated state. |
| Disk | Disk Queue Size [Avg] | num | 1m | The average queue length of requests executed for the device. |
| Disk | Disk Read Bytes | bytes | 1m | The number of bytes read from the device per second. |
| Disk | Disk Read Bytes [Delta Avg] | bytes | 1m | The average of system.diskio.read.bytes_delta for individual disks. |
| Disk | Disk Read Bytes [Delta Max] | bytes | 1m | The maximum of system.diskio.read.bytes_delta for individual disks. |
| Disk | Disk Read Bytes [Delta Min] | bytes | 1m | The minimum of system.diskio.read.bytes_delta for individual disks. |
| Disk | Disk Read Bytes [Delta Sum] | bytes | 1m | The sum of system.diskio.read.bytes_delta for individual disks. |
| Disk | Disk Read Bytes [Delta] | bytes | 1m | The delta value of system.diskio.read.bytes for individual disks. |
| Disk | Disk Read Bytes [Success] | bytes | 1m | The total number of bytes read successfully. On Linux, the sector size is assumed to be 512 bytes, so the value is the number of sectors read multiplied by 512. |
| Disk | Disk Read Requests | cnt | 1m | The number of read requests for the disk device per second. |
| Disk | Disk Read Requests [Delta Avg] | cnt | 1m | The average of system.diskio.read.count_delta for individual disks. |
| Disk | Disk Read Requests [Delta Max] | cnt | 1m | The maximum of system.diskio.read.count_delta for individual disks. |
| Disk | Disk Read Requests [Delta Min] | cnt | 1m | The minimum of system.diskio.read.count_delta for individual disks. |
| Disk | Disk Read Requests [Delta Sum] | cnt | 1m | The sum of system.diskio.read.count_delta for individual disks. |
| Disk | Disk Read Requests [Success Delta] | cnt | 1m | The delta value of system.diskio.read.count for individual disks. |
| Disk | Disk Read Requests [Success] | cnt | 1m | The total number of successful read requests. |
| Disk | Disk Request Size [Avg] | num | 1m | The average size of requests executed for the device (in sectors). |
| Disk | Disk Service Time [Avg] | ms | 1m | The average service time for I/O requests issued to the device (in milliseconds). |
| Disk | Disk Wait Time [Avg] | ms | 1m | The average time spent executing requests for the device. |
| Disk | Disk Wait Time [Read] | ms | 1m | The average disk wait time for read operations. |
| Disk | Disk Wait Time [Write] | ms | 1m | The average disk wait time for write operations. |
| Disk | Disk Write Bytes [Delta Avg] | bytes | 1m | The average of system.diskio.write.bytes_delta for individual disks. |
| Disk | Disk Write Bytes [Delta Max] | bytes | 1m | The maximum of system.diskio.write.bytes_delta for individual disks. |
| Disk | Disk Write Bytes [Delta Min] | bytes | 1m | The minimum of system.diskio.write.bytes_delta for individual disks. |
| Disk | Disk Write Bytes [Delta Sum] | bytes | 1m | The sum of system.diskio.write.bytes_delta for individual disks. |
| Disk | Disk Write Bytes [Delta] | bytes | 1m | The delta value of system.diskio.write.bytes for individual disks. |
| Disk | Disk Write Bytes [Success] | bytes | 1m | The total number of bytes written successfully. On Linux, the sector size is assumed to be 512 bytes, so the value is the number of sectors written multiplied by 512. |
| Disk | Disk Write Requests | cnt | 1m | The number of write requests for the disk device per second. |
| Disk | Disk Write Requests [Delta Avg] | cnt | 1m | The average of system.diskio.write.count_delta for individual disks. |
| Disk | Disk Write Requests [Delta Max] | cnt | 1m | The maximum of system.diskio.write.count_delta for individual disks. |
| Disk | Disk Write Requests [Delta Min] | cnt | 1m | The minimum of system.diskio.write.count_delta for individual disks. |
| Disk | Disk Write Requests [Delta Sum] | cnt | 1m | The sum of system.diskio.write.count_delta for individual disks. |
| Disk | Disk Write Requests [Success Delta] | cnt | 1m | The delta value of system.diskio.write.count for individual disks. |
| Disk | Disk Write Requests [Success] | cnt | 1m | The total number of successful write requests. |
| Disk | Disk Writes Bytes | bytes | 1m | The number of bytes written to the device per second. |
| FileSystem | Filesystem Hang Check | state | 1m | Filesystem (local/NFS) hang check (normal: 1, abnormal: 0). |
| FileSystem | Filesystem Nodes | cnt | 1m | The total number of file nodes in the file system. |
| FileSystem | Filesystem Nodes [Free] | cnt | 1m | The total number of available file nodes in the file system. |
| FileSystem | Filesystem Size [Available] | bytes | 1m | The available disk space (in bytes) that can be used by non-privileged users. |
| FileSystem | Filesystem Size [Free] | bytes | 1m | The free disk space (in bytes), including space reserved for privileged users. |
| FileSystem | Filesystem Size [Total] | bytes | 1m | The total disk space (in bytes). |
| FileSystem | Filesystem Usage | % | 1m | The percentage of used disk space. |
| FileSystem | Filesystem Usage [Avg] | % | 1m | The average of filesystem.used.pct for individual file systems. |
| FileSystem | Filesystem Usage [Inode] | % | 1m | The inode usage rate. |
| FileSystem | Filesystem Usage [Max] | % | 1m | The maximum of filesystem.used.pct for individual file systems. |
| FileSystem | Filesystem Usage [Min] | % | 1m | The minimum of filesystem.used.pct for individual file systems. |
| FileSystem | Filesystem Usage [Total] | % | 1m | - |
| FileSystem | Filesystem Used | bytes | 1m | The used disk space (in bytes). |
| FileSystem | Filesystem Used [Inode] | bytes | 1m | The inode usage. |
| Memory | Memory Free | bytes | 1m | The total available memory (in bytes), excluding memory used by system cache and buffers (see system.memory.actual.free). |
| Memory | Memory Free [Actual] | bytes | 1m | The actual available memory (in bytes); the calculation depends on the OS. On Linux, it is derived from /proc/meminfo. On OSX, it is the sum of free and inactive memory. On Windows, it is the same as system.memory.free. |
| Memory | Memory Free [Swap] | bytes | 1m | The available swap memory. |
| Memory | Memory Total | bytes | 1m | The total memory. |
| Memory | Memory Total [Swap] | bytes | 1m | The total swap memory. |
| Memory | Memory Usage | % | 1m | The percentage of used memory. |
| Memory | Memory Usage [Actual] | % | 1m | The percentage of actual used memory. |
| Memory | Memory Usage [Cache Swap] | % | 1m | The cache swap usage rate. |
| Memory | Memory Usage [Swap] | % | 1m | The percentage of used swap memory. |
| Memory | Memory Used | bytes | 1m | The used memory. |
| Memory | Memory Used [Actual] | bytes | 1m | The actual used memory (in bytes), which is the total memory minus the available memory. The available memory varies depending on the OS (see system.memory.actual.free). |
| Memory | Memory Used [Swap] | bytes | 1m | The used swap memory. |
| Network | Collisions | cnt | 1m | Network collisions. |
| Network | Network In Bytes | bytes | 1m | The number of bytes received. |
| Network | Network In Bytes [Delta Avg] | bytes | 1m | The average of system.network.in.bytes_delta for individual networks. |
| Network | Network In Bytes [Delta Max] | bytes | 1m | The maximum of system.network.in.bytes_delta for individual networks. |
| Network | Network In Bytes [Delta Min] | bytes | 1m | The minimum of system.network.in.bytes_delta for individual networks. |
| Network | Network In Bytes [Delta Sum] | bytes | 1m | The sum of system.network.in.bytes_delta for individual networks. |
| Network | Network In Bytes [Delta] | bytes | 1m | The delta value of the number of bytes received. |
| Network | Network In Dropped | cnt | 1m | The number of packets dropped during reception. |
| Network | Network In Errors | cnt | 1m | The number of errors during reception. |
| Network | Network In Packets | cnt | 1m | The number of packets received. |
| Network | Network In Packets [Delta Avg] | cnt | 1m | The average of system.network.in.packets_delta for individual networks. |
| Network | Network In Packets [Delta Max] | cnt | 1m | The maximum of system.network.in.packets_delta for individual networks. |
| Network | Network In Packets [Delta Min] | cnt | 1m | The minimum of system.network.in.packets_delta for individual networks. |
| Network | Network In Packets [Delta Sum] | cnt | 1m | The sum of system.network.in.packets_delta for individual networks. |
| Network | Network In Packets [Delta] | cnt | 1m | The delta value of the number of packets received. |
| Network | Network Out Bytes | bytes | 1m | The number of bytes sent. |
| Network | Network Out Bytes [Delta Avg] | bytes | 1m | The average of system.network.out.bytes_delta for individual networks. |
| Network | Network Out Bytes [Delta Max] | bytes | 1m | The maximum of system.network.out.bytes_delta for individual networks. |
| Network | Network Out Bytes [Delta Min] | bytes | 1m | The minimum of system.network.out.bytes_delta for individual networks. |
| Network | Network Out Bytes [Delta Sum] | bytes | 1m | The sum of system.network.out.bytes_delta for individual networks. |
| Network | Network Out Bytes [Delta] | bytes | 1m | The delta value of the number of bytes sent. |
| Network | Network Out Dropped | cnt | 1m | The number of packets dropped during transmission. On Darwin and BSD, this value is always 0 because the OS does not report it. |
| Network | Network Out Errors | cnt | 1m | The number of errors during transmission. |
| Network | Network Out Packets | cnt | 1m | The number of packets sent. |
| Network | Network Out Packets [Delta Avg] | cnt | 1m | The average of system.network.out.packets_delta for individual networks. |
| Network | Network Out Packets [Delta Max] | cnt | 1m | The maximum of system.network.out.packets_delta for individual networks. |
| Network | Network Out Packets [Delta Min] | cnt | 1m | The minimum of system.network.out.packets_delta for individual networks. |
| Network | Network Out Packets [Delta Sum] | cnt | 1m | The sum of system.network.out.packets_delta for individual networks. |
| Network | Network Out Packets [Delta] | cnt | 1m | The delta value of the number of packets sent. |
| Network | Open Connections [TCP] | cnt | 1m | The number of open TCP connections. |
| Network | Open Connections [UDP] | cnt | 1m | The number of open UDP connections. |
| Network | Port Usage | % | 1m | The port usage rate. |
| Network | SYN Sent Sockets | cnt | 1m | The number of sockets in the SYN_SENT state (when connecting to a remote host). |
| Process | Kernel PID Max | cnt | 1m | The kernel.pid_max value. |
| Process | Kernel Thread Max | cnt | 1m | The kernel.threads-max value. |
| Process | Process CPU Usage | % | 1m | The percentage of CPU time consumed by the process since the last update. This value is similar to the %CPU value displayed by the top command on Unix systems. |
| Process | Process CPU Usage/Core | % | 1m | The percentage of CPU time used by the process since the last event, normalized by the number of cores (0-100%). |
| Process | Process Memory Usage | % | 1m | The percentage of main memory (RAM) used by the process. |
| Process | Process Memory Used | bytes | 1m | The resident set size, which is the amount of memory used by the process in RAM. On Windows, it is the current working set size. |
| Process | Process PID | PID | 1m | The process ID. |
| Process | Process PPID | PID | 1m | The parent process ID. |
| Process | Processes [Dead] | cnt | 1m | The number of dead processes. |
| Process | Processes [Idle] | cnt | 1m | The number of idle processes. |
| Process | Processes [Running] | cnt | 1m | The number of running processes. |
| Process | Processes [Sleeping] | cnt | 1m | The number of sleeping processes. |
| Process | Processes [Stopped] | cnt | 1m | The number of stopped processes. |
| Process | Processes [Total] | cnt | 1m | The total number of processes. |
| Process | Processes [Unknown] | cnt | 1m | The number of processes whose state is unknown or could not be retrieved. |
| Process | Processes [Zombie] | cnt | 1m | The number of zombie processes. |
| Process | Running Process Usage | % | 1m | The process usage rate. |
| Process | Running Processes | cnt | 1m | The number of running processes. |
| Process | Running Thread Usage | % | 1m | The thread usage rate. |
| Process | Running Threads | cnt | 1m | The total number of threads running in running processes. |
| System | Context Switches | cnt | 1m | The number of context switches per second. |
| System | Load/Core [1 min] | cnt | 1m | The load average over the last 1 minute, normalized by the number of cores. |
| System | Load/Core [15 min] | cnt | 1m | The load average over the last 15 minutes, normalized by the number of cores. |
| System | Load/Core [5 min] | cnt | 1m | The load average over the last 5 minutes, normalized by the number of cores. |
| System | Multipaths [Active] | cnt | 1m | The number of active paths for external storage connections. |
| System | Multipaths [Failed] | cnt | 1m | The number of failed paths for external storage connections. |
| System | Multipaths [Faulty] | cnt | 1m | The number of faulty paths for external storage connections. |
| System | NTP Offset | num | 1m | The measured offset (time difference between the NTP server and the local environment) of the last sample. |
| System | Run Queue Length | num | 1m | The length of the run queue. |
| System | Uptime | ms | 1m | The OS uptime (in milliseconds). |
| Windows | Context Switchies | cnt | 1m | The number of CPU context switches per second. |
| Windows | Disk Read Bytes [Sec] | cnt | 1m | The number of bytes read from the Windows logical disk per second. |
| Windows | Disk Read Time [Avg] | sec | 1m | The average time spent reading data (in seconds). |
| Windows | Disk Transfer Time [Avg] | sec | 1m | The average disk wait time. |
| Windows | Disk Usage | % | 1m | The disk usage rate. |
| Windows | Disk Write Bytes [Sec] | cnt | 1m | The number of bytes written to the Windows logical disk per second. |
| Windows | Disk Write Time [Avg] | sec | 1m | The average time spent writing data (in seconds). |
| Windows | Pagingfile Usage | % | 1m | The paging file usage rate. |
| Windows | Pool Used [Non Paged] | bytes | 1m | The Nonpaged Pool usage of kernel memory. |
| Windows | Pool Used [Paged] | bytes | 1m | The Paged Pool usage of kernel memory. |
| Windows | Process [Running] | cnt | 1m | The number of currently running processes. |
| Windows | Threads [Running] | cnt | 1m | The number of currently running threads. |
| Windows | Threads [Waiting] | cnt | 1m | The number of threads waiting for processor time. |
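
Several descriptions in the table rest on small calculations: unnormalized CPU usage scales with the core count, the `[Success]` byte totals on Linux are derived from sector counts, the `[Delta Avg/Max/Min/Sum]` metrics aggregate per-device deltas, and Memory Used [Actual] is total minus available memory. The Python sketch below illustrates that arithmetic only. All function names and sample values are hypothetical and are not part of the monitoring product or any collector API.

```python
from statistics import mean

# Sector size assumed on Linux by the [Success] byte metrics (per the table).
SECTOR_SIZE_BYTES = 512


def normalize_by_cores(unnormalized_pct: float, cores: int) -> float:
    """Convert an unnormalized usage value (max 100% x cores) to a per-core value (max 100%)."""
    if cores <= 0:
        raise ValueError("cores must be positive")
    return unnormalized_pct / cores


def sectors_to_bytes(sectors: int) -> int:
    """Byte total derived from a sector count, assuming 512-byte sectors."""
    return sectors * SECTOR_SIZE_BYTES


def aggregate_deltas(deltas_by_device: dict) -> dict:
    """Average, maximum, minimum, and sum of per-device delta values."""
    values = list(deltas_by_device.values())
    return {
        "avg": mean(values),
        "max": max(values),
        "min": min(values),
        "sum": sum(values),
    }


def actual_used_bytes(total_bytes: int, available_bytes: int) -> int:
    """Actual used memory: total memory minus available memory."""
    return total_bytes - available_bytes


if __name__ == "__main__":
    # Hypothetical sample values, chosen only to mirror the examples in the table.
    print(normalize_by_cores(400.0, 4))
    print(sectors_to_bytes(8))
    print(aggregate_deltas({"sda": 100, "sdb": 300}))
    print(actual_used_bytes(16_000, 6_000))
```

With these helpers, `normalize_by_cores(400.0, 4)` returns `100.0`, which matches the 4-core example in the CPU rows: the same load reads 400% unnormalized and 100% when normalized by the number of cores.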