Monitoring Metrics

Cloud Monitoring service termination notice

According to Samsung Cloud Platform’s policy, the Cloud Monitoring service is scheduled to be discontinued in September 2026.
Accordingly, after the September 2026 release, resource monitoring of the Samsung Cloud Platform via Cloud Monitoring will no longer be possible.

As an alternative, you can continue monitoring resources with ServiceWatch, which was released in October 2025.
ServiceWatch replaces Cloud Monitoring and provides more modern and powerful features for a seamless monitoring environment.

If you are collecting metrics and logs through the Cloud Monitoring Agent, you need to switch to the ServiceWatch Agent.

For detailed information about ServiceWatch, please refer to ServiceWatch Overview.
For detailed information about the ServiceWatch Agent, please refer to ServiceWatch Agent.

GPU Server Monitoring Metrics

The tables below list the GPU Server monitoring metrics that can be viewed through Cloud Monitoring.

Basic monitoring metrics are provided even without installing the Agent; see Table. GPU Server Basic Monitoring Metrics (Provided by Default). The additional metrics that become available after installing the Agent are listed in Table. Additional monitoring metrics for GPU Server (Agent installation required).

For detailed usage of Cloud Monitoring, refer to the Cloud Monitoring guide.

| Performance Item Name | Explanation | Unit |
|---|---|---|
| Memory Total [Basic] | Bytes of usable memory | bytes |
| Memory Used [Basic] | Bytes of memory currently in use | bytes |
| Memory Swap In [Basic] | Bytes of memory swapped in | bytes |
| Memory Swap Out [Basic] | Bytes of memory swapped out | bytes |
| Memory Free [Basic] | Bytes of unused memory | bytes |
| Disk Read Bytes [Basic] | Bytes read | bytes |
| Disk Read Requests [Basic] | Number of read requests | cnt |
| Disk Write Bytes [Basic] | Bytes written | bytes |
| Disk Write Requests [Basic] | Number of write requests | cnt |
| CPU Usage [Basic] | Average system CPU usage over 1 minute | % |
| Instance State [Basic] | Instance status | state |
| Network In Bytes [Basic] | Bytes received | bytes |
| Network In Dropped [Basic] | Number of dropped incoming packets | cnt |
| Network In Packets [Basic] | Number of received packets | cnt |
| Network Out Bytes [Basic] | Bytes sent | bytes |
| Network Out Dropped [Basic] | Number of dropped outgoing packets | cnt |
| Network Out Packets [Basic] | Number of transmitted packets | cnt |

Table. GPU Server Basic Monitoring Metrics (Provided by Default)
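
The basic metrics above report raw byte counts rather than ratios. As a rough illustration, a utilization percentage could be derived from Memory Total [Basic] and Memory Used [Basic] as sketched below. The function name and the sample values are hypothetical, and this is not necessarily how Cloud Monitoring computes any of its own percentage metrics.

```python
def memory_usage_percent(total_bytes: int, used_bytes: int) -> float:
    """Return used memory as a percentage of total memory."""
    if total_bytes <= 0:
        raise ValueError("total_bytes must be positive")
    return used_bytes / total_bytes * 100.0


# Hypothetical sample values: 16 GiB total, 4 GiB used.
total = 16 * 1024**3
used = 4 * 1024**3
print(f"{memory_usage_percent(total, used):.1f}%")  # 25.0%
```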

| Performance Item Name | Explanation | Unit |
|---|---|---|
| GPU Count | Number of GPUs | cnt |
| GPU Memory Usage | GPU memory usage rate | % |
| GPU Memory Used | GPU memory used | MB |
| GPU Temperature | GPU temperature | |
| GPU Usage | GPU utilization | % |
| GPU Usage [Avg] | Overall average GPU utilization | % |
| GPU Power Cap | Maximum power capacity of the GPU | W |
| GPU Power Usage | Current GPU power usage | W |
| GPU Memory Usage [Avg] | Average GPU memory utilization | % |
| GPU Count in use | Number of GPUs in use by jobs on the node | cnt |
| Execution Status for nvidia-smi | Result of running the nvidia-smi command | status |
| Core Usage [IO Wait] | Proportion of CPU time spent in the wait state (disk wait) | % |
| Core Usage [System] | Proportion of CPU time spent in kernel space | % |
| Core Usage [User] | Proportion of CPU time spent in user space | % |
| CPU Cores | Number of CPU cores on the host | cnt |
| CPU Usage [Active] | Percentage of CPU time used, excluding the Idle and IOWait states | % |
| CPU Usage [Idle] | Proportion of CPU time spent in the idle state | % |
| CPU Usage [IO Wait] | Proportion of CPU time spent in the wait state (disk wait) | % |
| CPU Usage [System] | Percentage of CPU time used by the kernel | % |
| CPU Usage [User] | Percentage of CPU time used in user space | % |
| CPU Usage/Core [Active] | Percentage of CPU time used, excluding the Idle and IOWait states | % |
| CPU Usage/Core [Idle] | Proportion of CPU time spent in the idle state | % |
| CPU Usage/Core [IO Wait] | Proportion of CPU time spent in the wait state (disk wait) | % |
| CPU Usage/Core [System] | Percentage of CPU time used by the kernel | % |
| CPU Usage/Core [User] | Percentage of CPU time used in user space | % |
| Disk CPU Usage [IO Request] | Proportion of CPU time during which I/O requests were issued to the device | % |
| Disk Queue Size [Avg] | Average queue length of the requests issued to the device | num |
| Disk Read Bytes | Number of bytes read from the device per second | bytes |
| Disk Read Bytes [Delta Avg] | Average of system.diskio.read.bytes_delta across individual disks | bytes |
| Disk Read Bytes [Delta Max] | Maximum of system.diskio.read.bytes_delta across individual disks | bytes |
| Disk Read Bytes [Delta Min] | Minimum of system.diskio.read.bytes_delta across individual disks | bytes |
| Disk Read Bytes [Delta Sum] | Sum of system.diskio.read.bytes_delta across individual disks | bytes |
| Disk Read Bytes [Delta] | Delta of system.diskio.read.bytes for each disk | bytes |
| Disk Read Bytes [Success] | Total number of bytes successfully read | bytes |
| Disk Read Requests | Number of read requests to the disk device per second | cnt |
| Disk Read Requests [Delta Avg] | Average of system.diskio.read.count_delta across individual disks | cnt |
| Disk Read Requests [Delta Max] | Maximum of system.diskio.read.count_delta across individual disks | cnt |
| Disk Read Requests [Delta Min] | Minimum of system.diskio.read.count_delta across individual disks | cnt |
| Disk Read Requests [Delta Sum] | Sum of system.diskio.read.count_delta across individual disks | cnt |
| Disk Read Requests [Success Delta] | Delta of system.diskio.read.count for each disk | cnt |
| Disk Read Requests [Success] | Total number of successful reads | cnt |
| Disk Request Size [Avg] | Average size of the requests issued to the device (unit: sectors) | num |
| Disk Service Time [Avg] | Average service time (milliseconds) of I/O requests issued to the device | ms |
| Disk Wait Time [Avg] | Average time for requests issued to the device to be served | ms |
| Disk Wait Time [Read] | Average disk read wait time | ms |
| Disk Wait Time [Write] | Average disk write wait time | ms |
| Disk Write Bytes [Delta Avg] | Average of system.diskio.write.bytes_delta across individual disks | bytes |
| Disk Write Bytes [Delta Max] | Maximum of system.diskio.write.bytes_delta across individual disks | bytes |
| Disk Write Bytes [Delta Min] | Minimum of system.diskio.write.bytes_delta across individual disks | bytes |
| Disk Write Bytes [Delta Sum] | Sum of system.diskio.write.bytes_delta across individual disks | bytes |
| Disk Write Bytes [Delta] | Delta of system.diskio.write.bytes for each disk | bytes |
| Disk Write Bytes [Success] | Total number of bytes successfully written | bytes |
| Disk Write Requests | Number of write requests to the disk device per second | cnt |
| Disk Write Requests [Delta Avg] | Average of system.diskio.write.count_delta across individual disks | cnt |
| Disk Write Requests [Delta Max] | Maximum of system.diskio.write.count_delta across individual disks | cnt |
| Disk Write Requests [Delta Min] | Minimum of system.diskio.write.count_delta across individual disks | cnt |
| Disk Write Requests [Delta Sum] | Sum of system.diskio.write.count_delta across individual disks | cnt |
| Disk Write Requests [Success Delta] | Delta of system.diskio.write.count for each disk | cnt |
| Disk Write Requests [Success] | Total number of successful writes | cnt |
| Disk Writes Bytes | Number of bytes written to the device per second | bytes |
| Filesystem Hang Check | Filesystem (local/NFS) hang check (normal: 1, abnormal: 0) | status |
| Filesystem Nodes | Total number of file nodes in the file system | cnt |
| Filesystem Nodes [Free] | Total number of available file nodes in the file system | cnt |
| Filesystem Size [Available] | Disk space (bytes) available to unprivileged users | bytes |
| Filesystem Size [Free] | Available disk space (bytes) | bytes |
| Filesystem Size [Total] | Total disk space (bytes) | bytes |
| Filesystem Usage | Percentage of disk space used | % |
| Filesystem Usage [Avg] | Average of individual filesystem.used.pct values | % |
| Filesystem Usage [Inode] | Inode usage rate | % |
| Filesystem Usage [Max] | Maximum of individual filesystem.used.pct values | % |
| Filesystem Usage [Min] | Minimum of individual filesystem.used.pct values | % |
| Filesystem Usage [Total] | - | % |
| Filesystem Used | Used disk space (bytes) | bytes |
| Filesystem Used [Inode] | Inode usage | bytes |
| Memory Free | Total amount of available memory (bytes) | bytes |
| Memory Free [Actual] | Actual usable memory (bytes) | bytes |
| Memory Free [Swap] | Available swap memory | bytes |
| Memory Total | Total memory | bytes |
| Memory Total [Swap] | Total swap memory | bytes |
| Memory Usage | Percentage of used memory | % |
| Memory Usage [Actual] | Percentage of memory actually used | % |
| Memory Usage [Cache Swap] | Cached swap usage rate | % |
| Memory Usage [Swap] | Percentage of used swap memory | % |
| Memory Used | Used memory | bytes |
| Memory Used [Actual] | Actual memory used (bytes) | bytes |
| Memory Used [Swap] | Swap memory used | bytes |
| Collisions | Number of network collisions | cnt |
| Network In Bytes | Number of received bytes | bytes |
| Network In Bytes [Delta Avg] | Average of system.network.in.bytes_delta across individual networks | bytes |
| Network In Bytes [Delta Max] | Maximum of system.network.in.bytes_delta across individual networks | bytes |
| Network In Bytes [Delta Min] | Minimum of system.network.in.bytes_delta across individual networks | bytes |
| Network In Bytes [Delta Sum] | Sum of system.network.in.bytes_delta across individual networks | bytes |
| Network In Bytes [Delta] | Delta of the received byte count | bytes |
| Network In Dropped | Number of dropped packets among incoming packets | cnt |
| Network In Errors | Number of errors during reception | cnt |
| Network In Packets | Number of received packets | cnt |
| Network In Packets [Delta Avg] | Average of system.network.in.packets_delta across individual networks | cnt |
| Network In Packets [Delta Max] | Maximum of system.network.in.packets_delta across individual networks | cnt |
| Network In Packets [Delta Min] | Minimum of system.network.in.packets_delta across individual networks | cnt |
| Network In Packets [Delta Sum] | Sum of system.network.in.packets_delta across individual networks | cnt |
| Network In Packets [Delta] | Delta of the received packet count | cnt |
| Network Out Bytes | Number of transmitted bytes | bytes |
| Network Out Bytes [Delta Avg] | Average of system.network.out.bytes_delta across individual networks | bytes |
| Network Out Bytes [Delta Max] | Maximum of system.network.out.bytes_delta across individual networks | bytes |
| Network Out Bytes [Delta Min] | Minimum of system.network.out.bytes_delta across individual networks | bytes |
| Network Out Bytes [Delta Sum] | Sum of system.network.out.bytes_delta across individual networks | bytes |
| Network Out Bytes [Delta] | Delta of the transmitted byte count | bytes |
| Network Out Dropped | Number of dropped packets among outgoing packets | cnt |
| Network Out Errors | Number of errors during transmission | cnt |
| Network Out Packets | Number of transmitted packets | cnt |
| Network Out Packets [Delta Avg] | Average of system.network.out.packets_delta across individual networks | cnt |
| Network Out Packets [Delta Max] | Maximum of system.network.out.packets_delta across individual networks | cnt |
| Network Out Packets [Delta Min] | Minimum of system.network.out.packets_delta across individual networks | cnt |
| Network Out Packets [Delta Sum] | Sum of system.network.out.packets_delta across individual networks | cnt |
| Network Out Packets [Delta] | Delta of the transmitted packet count | cnt |
| Open Connections [TCP] | All open TCP connections | cnt |
| Open Connections [UDP] | All open UDP connections | cnt |
| Port Usage | Usage rate of available ports | % |
| SYN Sent Sockets | Number of sockets in the SYN_SENT state (when connecting from local to remote) | cnt |
| Kernel PID Max | kernel.pid_max value | cnt |
| Kernel Thread Max | kernel.threads-max value | cnt |
| Process CPU Usage | Percentage of CPU time consumed by the process since the last update | % |
| Process CPU Usage/Core | Percentage of CPU time used by the process since the last event | % |
| Process Memory Usage | Proportion of main memory (RAM) occupied by the process | % |
| Process Memory Used | Resident set size: the amount of memory the process occupies in RAM | bytes |
| Process PID | Process PID | PID |
| Process PPID | Parent process PID | PID |
| Processes [Dead] | Number of dead processes | cnt |
| Processes [Idle] | Number of idle processes | cnt |
| Processes [Running] | Number of running processes | cnt |
| Processes [Sleeping] | Number of sleeping processes | cnt |
| Processes [Stopped] | Number of stopped processes | cnt |
| Processes [Total] | Total number of processes | cnt |
| Processes [Unknown] | Number of processes whose state could not be retrieved or is unknown | cnt |
| Processes [Zombie] | Number of zombie processes | cnt |
| Running Process Usage | Process usage rate | % |
| Running Processes | Number of running processes | cnt |
| Running Thread Usage | Thread usage rate | % |
| Running Threads | Total number of threads running in running processes | cnt |
| Context Switches | Context switch count (per second) | cnt |
| Load/Core [1 min] | Load over the last 1 minute divided by the number of cores | cnt |
| Load/Core [15 min] | Load over the last 15 minutes divided by the number of cores | cnt |
| Load/Core [5 min] | Load over the last 5 minutes divided by the number of cores | cnt |
| Multipaths [Active] | Number of external storage connection paths with status active | cnt |
| Multipaths [Failed] | Number of external storage connection paths with status failed | cnt |
| Multipaths [Faulty] | Number of external storage connection paths with status faulty | cnt |
| NTP Offset | Measured offset of the last sample (time difference between the NTP server and the local environment) | num |
| Run Queue Length | Run queue length | num |
| Uptime | OS uptime (milliseconds) | ms |
| Context Switchies | CPU context switch count (per second) | cnt |
| Disk Read Bytes [Sec] | Number of bytes read from a Windows logical disk in 1 second | cnt |
| Disk Read Time [Avg] | Average data read time (seconds) | sec |
| Disk Transfer Time [Avg] | Average disk wait time | sec |
| Disk Usage | Disk usage | % |
| Disk Write Bytes [Sec] | Number of bytes written to a Windows logical disk in 1 second | cnt |
| Disk Write Time [Avg] | Average data write time (seconds) | sec |
| Pagingfile Usage | Paging file usage | % |
| Pool Used [Non Paged] | Nonpaged pool usage in kernel memory | bytes |
| Pool Used [Paged] | Paged pool usage in kernel memory | bytes |
| Process [Running] | Number of currently running processes | cnt |
| Threads [Running] | Number of currently running threads | cnt |
| Threads [Waiting] | Number of threads waiting for processor time | cnt |

Table. Additional monitoring metrics for GPU Server (Agent installation required)
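
Many of the Agent metrics above are described as a delta of a counter for each disk or network, together with Avg, Max, Min, and Sum aggregates across the individual devices. The sketch below illustrates that relationship under the assumption that the underlying counters are cumulative and sampled at two points in time. The function names, device names, and sample values are hypothetical, and counter resets are not handled; it is not the Agent's actual implementation.

```python
def counter_deltas(previous: dict[str, int], current: dict[str, int]) -> dict[str, int]:
    """Per-device change in a cumulative counter between two samples."""
    # Only devices present in both samples have a well-defined delta.
    return {dev: current[dev] - previous[dev] for dev in current if dev in previous}


def aggregate(deltas: dict[str, int]) -> dict[str, float]:
    """Avg, max, min, and sum of the per-device deltas."""
    values = list(deltas.values())
    if not values:
        raise ValueError("no device has two samples")
    return {
        "avg": sum(values) / len(values),
        "max": max(values),
        "min": min(values),
        "sum": sum(values),
    }


# Hypothetical cumulative read-byte counters for two disks.
prev = {"sda": 1_000, "sdb": 5_000}
curr = {"sda": 1_600, "sdb": 5_200}

deltas = counter_deltas(prev, curr)
print(deltas)             # {'sda': 600, 'sdb': 200}
print(aggregate(deltas))  # {'avg': 400.0, 'max': 600, 'min': 200, 'sum': 800}
```

Read this way, a [Delta] metric corresponds to one entry of `deltas`, while [Delta Avg], [Delta Max], [Delta Min], and [Delta Sum] correspond to the aggregated values.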
