Built on proven stability in Korea, the platform provides optimal computing resources conveniently and flexibly according to your purpose of use.
Compute
- 1: Virtual Server
- 1.1: Overview
- 1.1.1: Server Type
- 1.1.2: Monitoring Metrics
- 1.1.3: ServiceWatch Metrics
- 1.2: How-to guides
- 1.2.1: Image
- 1.2.2: Keypair
- 1.2.3: Server Group
- 1.2.4: IP Change
- 1.2.5: Linux NTP Setting
- 1.2.6: Setting up RHEL Repo and WKMS
- 1.2.7: Installing ServiceWatch Agent
- 1.3: API Reference
- 1.4: CLI Reference
- 1.5: Release Note
- 2: Virtual Server Auto-Scaling
- 2.1: Overview
- 2.1.1: Monitoring Metrics
- 2.1.2: ServiceWatch Metrics
- 2.2: How-to guides
- 2.2.1: Launch Configuration
- 2.2.2: Managing Policies
- 2.2.3: Managing Schedules
- 2.2.4: Managing Notifications
- 2.3: API Reference
- 2.4: CLI Reference
- 2.5: Release Note
- 3: GPU Server
- 3.1: Overview
- 3.1.1: Server Type
- 3.1.2: Monitoring Metrics
- 3.1.3: ServiceWatch Metrics
- 3.2: How-to guides
- 3.2.1: Image Management
- 3.2.2: Using Multi-instance GPU in GPU Server
- 3.2.3: Using NVSwitch on GPU Server
- 3.2.4: Keypair Management
- 3.2.5: ServiceWatch Agent Installation
- 3.3: API Reference
- 3.4: CLI Reference
- 3.5: Release Note
- 4: Bare Metal Server
- 4.1: Overview
- 4.1.1: Server Type
- 4.1.2: Monitoring Metrics
- 4.2: How-to guides
- 4.2.1: Installing ServiceWatch Agent
- 4.2.2: Setting up RHEL Repo
- 4.3: API Reference
- 4.4: CLI Reference
- 4.5: Release Note
- 5: Multi-node GPU Cluster
- 5.1: Overview
- 5.1.1: Server Type
- 5.1.2: Monitoring Metrics
- 5.2: How-to guides
- 5.2.1: Cluster Fabric Management
- 5.2.2: Installing ServiceWatch Agent
- 5.2.3: Multi-node GPU Cluster Service Scope and Inspection Guide
- 5.3: Release Note
- 6: Cloud Functions
- 6.1: Overview
- 6.1.1: ServiceWatch Metrics
- 6.2: How-to guides
- 6.2.1: Set Trigger
- 6.2.2: AIOS Connect
- 6.2.3: Blueprint Detailed Guide
- 6.2.4: PrivateLink Service Integration
- 6.2.5: Resource-based Policy Guide
- 6.3: Release Note
- 7: Virtual Server DR
- 7.1: Overview
- 7.2: Release Note
- 8: Block Storage
- 8.1: Overview
- 8.1.1: Monitoring Metrics
- 8.2: How-to guides
- 8.2.1: Connecting to the Server
- 8.2.2: Using Snapshots
- 8.2.3: Move Volume
- 8.3: API Reference
- 8.4: CLI Reference
- 8.5: Release Note
1 - Virtual Server
1.1 - Overview
Service Overview
Virtual Server is a virtual server optimized for cloud computing that lets you freely allocate and use exactly as much infrastructure (CPU, memory, and so on) as you need, when you need it, without purchasing hardware individually. You can use resources with performance optimized for your computing purposes, such as development, testing, and application execution, in a cloud environment.
Key Features
Easy and convenient computing environment configuration: Through the web-based Console, users can self-service everything from Virtual Server provisioning to resource management and cost management. If you need to change the capacity of major resources such as CPU or memory while using a Virtual Server, you can scale up or down easily without operator intervention.
Various service types: Provides virtualized vCore/memory resources according to predefined server types (1 to 128 vCores).
- General Virtual Server: Provides commonly used computing specs (up to 16 vCores, 256 GB)
- High Capacity Virtual Server: Provides large-capacity resources beyond the General Virtual Server specs
Strong security: Protects servers by controlling inbound/outbound traffic to the external internet or other VPCs (Virtual Private Cloud) through the Security Group service. In addition, you can operate computing resources stably with real-time monitoring.
Service Architecture
Provided Functions
Virtual Server provides the following functions.
- Automatic provisioning and management: Provides everything from Virtual Server provisioning to resource management and cost management through the web-based Console. If you need to change the capacity of major resources such as CPU or memory while using a Virtual Server, you can change it immediately with the server type modification function.
- Standard server types and Images: Provides virtualized vCore/memory resources according to standard server types, along with standard OS Images.
- Storage connection: Provides additional storage beyond the OS disk. You can attach Block Storage, File Storage, and Object Storage.
- Network connection: You can connect a Virtual Server's general subnet/IP and a Public NAT IP, and use a local subnet for server-to-server communication. These settings can be modified on the detail page.
- Security Group: Protects servers by controlling inbound/outbound traffic to the external internet or other VPCs through the Security Group service.
- Monitoring: You can check monitoring information for computing resources such as CPU, memory, and disk through the Cloud Monitoring service.
- Backup and recovery: You can back up and recover Virtual Server Images through the Backup service.
- Cost management: You can create, stop, and terminate servers as needed; because billing is based on actual usage time, you can track costs according to usage.
- ServiceWatch integration: You can monitor data through the ServiceWatch service.
Components
Virtual Server provides standard server types and standard OS Images. Select and use them according to the desired service scale.
Image
You can create and manage Images. Main functions are as follows.
- Image creation: You can create an Image from the configuration of a Virtual Server in use, or create one by uploading your own image file to Object Storage.
- Shared Image creation: You can create a Shared Image from an Image with Private visibility.
- Sharing to other Accounts: You can share an Image with another Account.
- For how to create and use Images, refer to the How-to guides > Image document.
Keypair
For more secure OS access, security is strengthened by providing a Keypair instead of ID/password authentication. Main functions are as follows.
- Keypair creation: Creates user credentials for connecting to a Virtual Server.
- Public key import: You can import a public key by uploading a file or entering the key directly.
- For how to create and use Keypairs, refer to the How-to guides > Keypair document.
Server Group
Through Server Group settings, you can place Virtual Servers, and the Block Storage added when creating them, close together or distributed across racks and hosts. Main functions are as follows.
- Server Group creation: You can set Virtual Servers belonging to the same Server Group as Anti-Affinity (distributed placement), Affinity (close placement), or Partition (Virtual Server and Block Storage distributed placement).
- For Server Group creation and usage methods, refer to How-to guides > Server Group document.
OS Image Provided Versions
The OS Images provided by Virtual Server are as follows.
| OS Image Version | EoS Date |
|---|---|
| Alma Linux 8.10 | 2029-05-31 |
| Alma Linux 9.6 | 2025-11-17 |
| Oracle Linux 8.10 | 2029-07-31 |
| Oracle Linux 9.6 | 2025-11-25 |
| RHEL 8.10 | 2029-05-31 |
| RHEL 9.4 | 2026-04-30 |
| RHEL 9.6 | 2027-05-31 |
| Rocky Linux 8.10 | 2029-05-31 |
| Rocky Linux 9.6 | 2025-11-30 |
| Ubuntu 22.04 | 2027-06-30 |
| Ubuntu 24.04 | 2029-06-30 |
| Windows 2016 | 2027-01-12 |
| Windows 2019 | 2029-01-09 |
| Windows 2022 | 2031-10-14 |
- Linux distributions such as Alma Linux and Rocky Linux are provided only in even minor versions, except for the last release of each major version. This policy ensures the stability and consistency of the SCP system. Check the EOS (End of Support) and EOL (End of Life) dates of each operating system and apply new or additional packages individually as needed to maintain a stable environment.
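The EoS dates above can be checked programmatically before provisioning. A minimal sketch, using a hand-copied subset of the table (the `is_supported` helper and dictionary are illustrative, not part of any SCP SDK):

```python
from datetime import date

# EoS dates copied from the table above (subset, for illustration)
EOS_DATES = {
    "Ubuntu 22.04": date(2027, 6, 30),
    "Rocky Linux 9.6": date(2025, 11, 30),
    "Windows 2022": date(2031, 10, 14),
}

def is_supported(image: str, today: date) -> bool:
    """Return True if the OS image has not yet reached its EoS date."""
    return today <= EOS_DATES[image]

print(is_supported("Ubuntu 22.04", date(2026, 1, 1)))    # True
print(is_supported("Rocky Linux 9.6", date(2026, 1, 1)))  # False
```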
Server Type
The server types supported by Virtual Server are as follows. For details on server types, refer to Virtual Server Server Type.
Standard s1v2m4
| Category | Example | Detailed Description |
|---|---|---|
| Server Type | Standard | Classification of provided server types |
| Server Spec | s1 | Server type family and generation |
| Server Spec | v2 | Number of vCores |
| Server Spec | m4 | Memory capacity |
Constraints
- If you create a Virtual Server with Rocky Linux or Oracle Linux, additional settings are required for time synchronization (NTP: Network Time Protocol). Other Images are configured automatically and need no separate settings. For details, refer to Configure Linux NTP.
- If you created a RHEL or Windows Server before August 2025, you need to modify the RHEL Repository and WKMS (Windows Key Management Service) settings. For details, refer to Configure RHEL Repo and WKMS.
Prerequisite Services
The following services must be configured before creating this service. Prepare in advance by referring to the guide provided for each service.
| Service Category | Service | Detailed Description |
|---|---|---|
| Networking | VPC | Service that provides independent virtual network in cloud environment |
| Networking | Security Group | Virtual firewall that controls server traffic |
1.1.1 - Server Type
Virtual Server server type
Virtual Server provides server types suitable for various purposes of use. A server type is a combination of CPU, memory, network bandwidth, and other resources. The host server used by a Virtual Server is determined by the server type selected at creation. Select a server type that matches the requirements of the application you want to run on the Virtual Server.
The server types supported by Virtual Server are as follows.
Standard s1v2m4
| Classification | Example | Detailed Description |
|---|---|---|
| Server Type | Standard | Classification of provided server types |
| Server Specification | s1 | Server type family and generation |
| Server Specification | v2 | Number of vCores |
| Server Specification | m4 | Memory capacity |
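The naming convention described above (series letter, generation, vCore count, memory in GB) can be illustrated with a small parser. This is a hypothetical helper shown only to clarify the convention, not part of any SCP tooling:

```python
import re

def parse_server_type(name: str) -> dict:
    """Parse a server type name such as 's1v2m4' into its parts:
    series letter, generation, vCore count, and memory capacity in GB."""
    m = re.fullmatch(r"([a-z])(\d+)v(\d+)m(\d+)", name)
    if m is None:
        raise ValueError(f"unrecognized server type: {name}")
    series, gen, vcores, mem = m.groups()
    return {
        "series": series,        # 's' = Standard, 'h' = High Capacity
        "generation": int(gen),  # 1 = 1st generation, 2 = 2nd generation
        "vcores": int(vcores),   # number of vCores
        "memory_gb": int(mem),   # memory capacity in GB
    }

print(parse_server_type("s1v2m4"))
print(parse_server_type("h2v128m1536"))
```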
s1 Server Type
The s1 server type of Virtual Server is provided with standard specifications (vCPU, memory) and is suitable for various applications.
- Samsung Cloud Platform v2’s 1st generation: up to 3.3 GHz Intel 3rd generation (Ice Lake) Xeon Gold 6342 Processor
- Supports up to 16 vCPUs and 256 GB of memory
- Up to 12.5 Gbps networking speed
| Classification | Server Type | vCPU | Memory | Network Bandwidth |
|---|---|---|---|---|
| Standard | s1v1m2 | 1 vCore | 2 GB | Up to 10 Gbps |
| Standard | s1v2m4 | 2 vCore | 4 GB | Up to 10 Gbps |
| Standard | s1v2m8 | 2 vCore | 8 GB | Up to 10 Gbps |
| Standard | s1v2m16 | 2 vCore | 16 GB | Up to 10 Gbps |
| Standard | s1v2m24 | 2 vCore | 24 GB | Up to 10 Gbps |
| Standard | s1v2m32 | 2 vCore | 32 GB | Up to 10 Gbps |
| Standard | s1v4m8 | 4 vCore | 8 GB | Up to 10 Gbps |
| Standard | s1v4m16 | 4 vCore | 16 GB | Up to 10 Gbps |
| Standard | s1v4m32 | 4 vCore | 32 GB | Up to 10 Gbps |
| Standard | s1v4m48 | 4 vCore | 48 GB | Up to 10 Gbps |
| Standard | s1v4m64 | 4 vCore | 64 GB | Up to 10 Gbps |
| Standard | s1v6m12 | 6 vCore | 12 GB | Up to 10 Gbps |
| Standard | s1v6m24 | 6 vCore | 24 GB | Up to 10 Gbps |
| Standard | s1v6m48 | 6 vCore | 48 GB | Up to 10 Gbps |
| Standard | s1v6m72 | 6 vCore | 72 GB | Up to 10 Gbps |
| Standard | s1v6m96 | 6 vCore | 96 GB | Up to 10 Gbps |
| Standard | s1v8m16 | 8 vCore | 16 GB | Up to 10 Gbps |
| Standard | s1v8m32 | 8 vCore | 32 GB | Up to 10 Gbps |
| Standard | s1v8m64 | 8 vCore | 64 GB | Up to 10 Gbps |
| Standard | s1v8m96 | 8 vCore | 96 GB | Up to 10 Gbps |
| Standard | s1v8m128 | 8 vCore | 128 GB | Up to 10 Gbps |
| Standard | s1v10m20 | 10 vCore | 20 GB | Up to 10 Gbps |
| Standard | s1v10m40 | 10 vCore | 40 GB | Up to 10 Gbps |
| Standard | s1v10m80 | 10 vCore | 80 GB | Up to 10 Gbps |
| Standard | s1v10m120 | 10 vCore | 120 GB | Up to 10 Gbps |
| Standard | s1v10m160 | 10 vCore | 160 GB | Up to 10 Gbps |
| Standard | s1v12m24 | 12 vCore | 24 GB | Up to 12.5 Gbps |
| Standard | s1v12m48 | 12 vCore | 48 GB | Up to 12.5 Gbps |
| Standard | s1v12m96 | 12 vCore | 96 GB | Up to 12.5 Gbps |
| Standard | s1v12m144 | 12 vCore | 144 GB | Up to 12.5 Gbps |
| Standard | s1v12m192 | 12 vCore | 192 GB | Up to 12.5 Gbps |
| Standard | s1v14m28 | 14 vCore | 28 GB | Up to 12.5 Gbps |
| Standard | s1v14m56 | 14 vCore | 56 GB | Up to 12.5 Gbps |
| Standard | s1v14m112 | 14 vCore | 112 GB | Up to 12.5 Gbps |
| Standard | s1v14m168 | 14 vCore | 168 GB | Up to 12.5 Gbps |
| Standard | s1v14m224 | 14 vCore | 224 GB | Up to 12.5 Gbps |
| Standard | s1v16m32 | 16 vCore | 32 GB | Up to 12.5 Gbps |
| Standard | s1v16m64 | 16 vCore | 64 GB | Up to 12.5 Gbps |
| Standard | s1v16m128 | 16 vCore | 128 GB | Up to 12.5 Gbps |
| Standard | s1v16m192 | 16 vCore | 192 GB | Up to 12.5 Gbps |
| Standard | s1v16m256 | 16 vCore | 256 GB | Up to 12.5 Gbps |
s2 Server Type
The s2 server type of Virtual Server is provided with standard specifications (vCPU, memory) and is suitable for various applications.
- Samsung Cloud Platform v2’s 2nd generation: up to 3.2 GHz Intel 4th generation (Sapphire Rapids) Xeon Gold 6448H Processor
- Supports up to 16 vCPUs and 256 GB of memory
- Up to 12.5 Gbps networking speed
| Classification | Server Type | vCPU | Memory | Network Bandwidth |
|---|---|---|---|---|
| Standard | s2v1m2 | 1 vCore | 2 GB | Up to 10 Gbps |
| Standard | s2v2m4 | 2 vCore | 4 GB | Up to 10 Gbps |
| Standard | s2v2m8 | 2 vCore | 8 GB | Up to 10 Gbps |
| Standard | s2v2m16 | 2 vCore | 16 GB | Up to 10 Gbps |
| Standard | s2v2m24 | 2 vCore | 24 GB | Up to 10 Gbps |
| Standard | s2v2m32 | 2 vCore | 32 GB | Up to 10 Gbps |
| Standard | s2v4m8 | 4 vCore | 8 GB | Up to 10 Gbps |
| Standard | s2v4m16 | 4 vCore | 16 GB | Up to 10 Gbps |
| Standard | s2v4m32 | 4 vCore | 32 GB | Up to 10 Gbps |
| Standard | s2v4m48 | 4 vCore | 48 GB | Up to 10 Gbps |
| Standard | s2v4m64 | 4 vCore | 64 GB | Up to 10 Gbps |
| Standard | s2v6m12 | 6 vCore | 12 GB | Up to 10 Gbps |
| Standard | s2v6m24 | 6 vCore | 24 GB | Up to 10 Gbps |
| Standard | s2v6m48 | 6 vCore | 48 GB | Up to 10 Gbps |
| Standard | s2v6m72 | 6 vCore | 72 GB | Up to 10 Gbps |
| Standard | s2v6m96 | 6 vCore | 96 GB | Up to 10 Gbps |
| Standard | s2v8m16 | 8 vCore | 16 GB | Up to 10 Gbps |
| Standard | s2v8m32 | 8 vCore | 32 GB | Up to 10 Gbps |
| Standard | s2v8m64 | 8 vCore | 64 GB | Up to 10 Gbps |
| Standard | s2v8m96 | 8 vCore | 96 GB | Up to 10 Gbps |
| Standard | s2v8m128 | 8 vCore | 128 GB | Up to 10 Gbps |
| Standard | s2v10m20 | 10 vCore | 20 GB | Up to 10 Gbps |
| Standard | s2v10m40 | 10 vCore | 40 GB | Up to 10 Gbps |
| Standard | s2v10m80 | 10 vCore | 80 GB | Up to 10 Gbps |
| Standard | s2v10m120 | 10 vCore | 120 GB | Up to 10 Gbps |
| Standard | s2v10m160 | 10 vCore | 160 GB | Up to 10 Gbps |
| Standard | s2v12m24 | 12 vCore | 24 GB | Up to 12.5 Gbps |
| Standard | s2v12m48 | 12 vCore | 48 GB | Up to 12.5 Gbps |
| Standard | s2v12m96 | 12 vCore | 96 GB | Up to 12.5 Gbps |
| Standard | s2v12m144 | 12 vCore | 144 GB | Up to 12.5 Gbps |
| Standard | s2v12m192 | 12 vCore | 192 GB | Up to 12.5 Gbps |
| Standard | s2v14m28 | 14 vCore | 28 GB | Up to 12.5 Gbps |
| Standard | s2v14m56 | 14 vCore | 56 GB | Up to 12.5 Gbps |
| Standard | s2v14m112 | 14 vCore | 112 GB | Up to 12.5 Gbps |
| Standard | s2v14m168 | 14 vCore | 168 GB | Up to 12.5 Gbps |
| Standard | s2v14m224 | 14 vCore | 224 GB | Up to 12.5 Gbps |
| Standard | s2v16m32 | 16 vCore | 32 GB | Up to 12.5 Gbps |
| Standard | s2v16m64 | 16 vCore | 64 GB | Up to 12.5 Gbps |
| Standard | s2v16m128 | 16 vCore | 128 GB | Up to 12.5 Gbps |
| Standard | s2v16m192 | 16 vCore | 192 GB | Up to 12.5 Gbps |
| Standard | s2v16m256 | 16 vCore | 256 GB | Up to 12.5 Gbps |
h2 Server Type
The h2 server type of Virtual Server is provided with large-capacity server specifications and is suitable for applications for large-scale data processing.
- Samsung Cloud Platform v2’s 2nd generation: up to 3.2 GHz Intel 4th generation (Sapphire Rapids) Xeon Gold 6448H Processor
- Supports up to 128 vCPUs and 1,536 GB of memory
- Up to 25 Gbps networking speed
| Classification | Server Type | vCPU | Memory | Network Bandwidth |
|---|---|---|---|---|
| High Capacity | h2v24m48 | 24 vCore | 48 GB | Up to 25 Gbps |
| High Capacity | h2v24m96 | 24 vCore | 96 GB | Up to 25 Gbps |
| High Capacity | h2v24m192 | 24 vCore | 192 GB | Up to 25 Gbps |
| High Capacity | h2v24m288 | 24 vCore | 288 GB | Up to 25 Gbps |
| High Capacity | h2v32m64 | 32 vCore | 64 GB | Up to 25 Gbps |
| High Capacity | h2v32m128 | 32 vCore | 128 GB | Up to 25 Gbps |
| High Capacity | h2v32m256 | 32 vCore | 256 GB | Up to 25 Gbps |
| High Capacity | h2v32m384 | 32 vCore | 384 GB | Up to 25 Gbps |
| High Capacity | h2v48m96 | 48 vCore | 96 GB | Up to 25 Gbps |
| High Capacity | h2v48m192 | 48 vCore | 192 GB | Up to 25 Gbps |
| High Capacity | h2v48m384 | 48 vCore | 384 GB | Up to 25 Gbps |
| High Capacity | h2v48m576 | 48 vCore | 576 GB | Up to 25 Gbps |
| High Capacity | h2v64m128 | 64 vCore | 128 GB | Up to 25 Gbps |
| High Capacity | h2v64m256 | 64 vCore | 256 GB | Up to 25 Gbps |
| High Capacity | h2v64m512 | 64 vCore | 512 GB | Up to 25 Gbps |
| High Capacity | h2v64m768 | 64 vCore | 768 GB | Up to 25 Gbps |
| High Capacity | h2v72m144 | 72 vCore | 144 GB | Up to 25 Gbps |
| High Capacity | h2v72m288 | 72 vCore | 288 GB | Up to 25 Gbps |
| High Capacity | h2v72m576 | 72 vCore | 576 GB | Up to 25 Gbps |
| High Capacity | h2v72m864 | 72 vCore | 864 GB | Up to 25 Gbps |
| High Capacity | h2v96m192 | 96 vCore | 192 GB | Up to 25 Gbps |
| High Capacity | h2v96m384 | 96 vCore | 384 GB | Up to 25 Gbps |
| High Capacity | h2v96m768 | 96 vCore | 768 GB | Up to 25 Gbps |
| High Capacity | h2v96m1152 | 96 vCore | 1152 GB | Up to 25 Gbps |
| High Capacity | h2v128m256 | 128 vCore | 256 GB | Up to 25 Gbps |
| High Capacity | h2v128m512 | 128 vCore | 512 GB | Up to 25 Gbps |
| High Capacity | h2v128m1024 | 128 vCore | 1024 GB | Up to 25 Gbps |
| High Capacity | h2v128m1536 | 128 vCore | 1536 GB | Up to 25 Gbps |
1.1.2 - Monitoring Metrics
Virtual Server monitoring metrics
The following tables show the Virtual Server monitoring metrics that can be checked through Cloud Monitoring. For more information on how to use Cloud Monitoring, refer to the Cloud Monitoring guide.
Basic monitoring metrics are available without installing the Agent; see the first table, Virtual Server Monitoring Metrics (Basic). Additional metrics that require installing the Agent are listed in the second table, Virtual Server Additional Monitoring Metrics (Agent Installation Required).
For Windows OS, memory-related metrics can only be retrieved if the Agent is installed.
| Performance Item | Detailed Description | Unit |
|---|---|---|
| Memory Total [Basic] | Total memory (bytes) | bytes |
| Memory Used [Basic] | Currently used memory (bytes) | bytes |
| Memory Swap In [Basic] | Memory swapped in (bytes) | bytes |
| Memory Swap Out [Basic] | Memory swapped out (bytes) | bytes |
| Memory Free [Basic] | Unused memory (bytes) | bytes |
| Disk Read Bytes [Basic] | Read bytes | bytes |
| Disk Read Requests [Basic] | Number of Read Requests | cnt |
| Disk Write Bytes [Basic] | Write bytes | bytes |
| Disk Write Requests [Basic] | Number of Write Requests | cnt |
| CPU Usage [Basic] | 1-minute average system CPU usage rate | % |
| Instance State [Basic] | Instance Status | state |
| Network In Bytes [Basic] | Received bytes | bytes |
| Network In Dropped [Basic] | Receive Packet Drop | cnt |
| Network In Packets [Basic] | Received Packet Count | cnt |
| Network Out Bytes [Basic] | Transmission bytes | bytes |
| Network Out Dropped [Basic] | Transmission Packet Drop | cnt |
| Network Out Packets [Basic] | Transmission packet count | cnt |
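Derived values can be computed from the basic metrics above. For example, the Memory Used and Memory Total metrics (both in bytes) combine into a usage percentage; the helper below is an illustrative sketch, not an SCP API:

```python
def memory_usage_pct(memory_used: int, memory_total: int) -> float:
    """Derive a memory usage percentage from the basic metrics
    Memory Used and Memory Total (both reported in bytes)."""
    if memory_total <= 0:
        raise ValueError("memory_total must be positive")
    return 100.0 * memory_used / memory_total

# e.g. 3 GiB used on a server with 4 GiB of memory
used = 3 * 1024**3
total = 4 * 1024**3
print(f"{memory_usage_pct(used, total):.1f}%")  # 75.0%
```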
The following table shows additional monitoring metrics that require Agent installation.
| Performance Item | Detailed Description | Unit |
|---|---|---|
| Core Usage [IO Wait] | The ratio of CPU time spent in waiting state (disk waiting) | % |
| Core Usage [System] | The ratio of CPU time spent in kernel space | % |
| Core Usage [User] | The ratio of CPU time spent in user space | % |
| CPU Cores | The number of CPU cores on the host | cnt |
| CPU Usage [Active] | Percentage of CPU time in use, excluding Idle and IOWait states | % |
| CPU Usage [Idle] | Percentage of CPU time spent in idle state | % |
| CPU Usage [IO Wait] | Percentage of CPU time spent in waiting state (disk wait) | % |
| CPU Usage [System] | Percentage of CPU time used in kernel space | % |
| CPU Usage [User] | Percentage of CPU time used in user space | % |
| CPU Usage/Core [Active] | Percentage of CPU time in use per core, excluding Idle and IOWait states | % |
| CPU Usage/Core [Idle] | Percentage of CPU time spent in idle state per core | % |
| CPU Usage/Core [IO Wait] | Percentage of CPU time spent in waiting state (disk wait) per core | % |
| CPU Usage/Core [System] | Percentage of CPU time used in kernel space per core | % |
| CPU Usage/Core [User] | Percentage of CPU time used in user space per core | % |
| Disk CPU Usage [IO Request] | Percentage of CPU time spent executing I/O requests for the device | % |
| Disk Queue Size [Avg] | The average queue length of requests executed for the device. | num |
| Disk Read Bytes | The number of bytes read from the device per second. | bytes |
| Disk Read Bytes [Delta Avg] | Average of system.diskio.read.bytes_delta for each Disk | bytes |
| Disk Read Bytes [Delta Max] | Individual Disks’ system.diskio.read.bytes_delta maximum | bytes |
| Disk Read Bytes [Delta Min] | Individual Disks’ system.diskio.read.bytes_delta minimum | bytes |
| Disk Read Bytes [Delta Sum] | Individual Disks’ sum of system.diskio.read.bytes_delta | bytes |
| Disk Read Bytes [Delta] | Individual Disk’s system.diskio.read.bytes value delta | bytes |
| Disk Read Bytes [Success] | Total bytes read successfully | bytes |
| Disk Read Requests | Number of read requests for the disk device during 1 second | cnt |
| Disk Read Requests [Delta Avg] | Individual Disks’ average of system.diskio.read.count_delta | cnt |
| Disk Read Requests [Delta Max] | Individual Disks’ system.diskio.read.count_delta maximum | cnt |
| Disk Read Requests [Delta Min] | Individual Disks’ minimum of system.diskio.read.count_delta | cnt |
| Disk Read Requests [Delta Sum] | The sum of system.diskio.read.count_delta of individual Disks | cnt |
| Disk Read Requests [Success Delta] | Individual Disk’s system.diskio.read.count delta | cnt |
| Disk Read Requests [Success] | Successfully completed total read count | cnt |
| Disk Request Size [Avg] | The average size of requests executed for the device (unit: sector) | num |
| Disk Service Time [Avg] | The average service time (in milliseconds) for the input requests executed on the device. | ms |
| Disk Wait Time [Avg] | The average time spent on requests executed for supported devices. | ms |
| Disk Wait Time [Read] | Average disk read wait time | ms |
| Disk Wait Time [Write] | Average disk write wait time | ms |
| Disk Write Bytes [Delta Avg] | Individual Disks’ average of system.diskio.write.bytes_delta | bytes |
| Disk Write Bytes [Delta Max] | Individual Disks’ system.diskio.write.bytes_delta maximum | bytes |
| Disk Write Bytes [Delta Min] | Individual Disks’ system.diskio.write.bytes_delta minimum | bytes |
| Disk Write Bytes [Delta Sum] | Individual Disks’ sum of system.diskio.write.bytes_delta | bytes |
| Disk Write Bytes [Delta] | Individual Disk’s system.diskio.write.bytes value delta | bytes |
| Disk Write Bytes [Success] | Total bytes written successfully | bytes |
| Disk Write Requests | Number of write requests to the disk device for 1 second | cnt |
| Disk Write Requests [Delta Avg] | Individual Disks’ average of system.diskio.write.count_delta | cnt |
| Disk Write Requests [Delta Max] | Individual Disks’ system.diskio.write.count_delta maximum | cnt |
| Disk Write Requests [Delta Min] | Minimum of system.diskio.write.count_delta for each Disk | cnt |
| Disk Write Requests [Delta Sum] | Sum of system.diskio.write.count_delta of individual Disks | cnt |
| Disk Write Requests [Success Delta] | Individual Disk’s system.diskio.write.count Delta | cnt |
| Disk Write Requests [Success] | Total number of writes completed successfully | cnt |
| Disk Writes Bytes | The number of bytes written to the device per second | bytes |
| Filesystem Hang Check | filesystem(local/NFS) hang check(Normal:1, Abnormal:0) | status |
| Filesystem Nodes | The total number of file nodes in the file system. | cnt |
| Filesystem Nodes [Free] | The total number of available file nodes in the file system. | cnt |
| Filesystem Size [Available] | Disk space (bytes) available for use by unauthorized users | bytes |
| Filesystem Size [Free] | Available disk space(bytes) | bytes |
| Filesystem Size [Total] | Total disk space (bytes) | bytes |
| Filesystem Usage | Used disk space percentage | % |
| Filesystem Usage [Avg] | Average of individual filesystem.used.pct | % |
| Filesystem Usage [Inode] | iNode usage rate | % |
| Filesystem Usage [Max] | Individual filesystem used percentage Max | % |
| Filesystem Usage [Min] | Individual minimum of filesystem.used.pct | % |
| Filesystem Usage [Total] | - | % |
| Filesystem Used | Used Disk Space (bytes) | bytes |
| Filesystem Used [Inode] | iNode usage | bytes |
| Memory Free | Total available memory amount (bytes) | bytes |
| Memory Free [Actual] | Actually available Memory(bytes) | bytes |
| Memory Free [Swap] | Available Swap memory | bytes |
| Memory Total | Total Memory | bytes |
| Memory Total [Swap] | Total Swap memory. | bytes |
| Memory Usage | used Memory percentage | % |
| Memory Usage [Actual] | Actual used Memory percentage | % |
| Memory Usage [Cache Swap] | cached swap usage rate | % |
| Memory Usage [Swap] | Used Swap memory percentage | % |
| Memory Used | Used Memory | bytes |
| Memory Used [Actual] | Actually used Memory(bytes) | bytes |
| Memory Used [Swap] | Used Swap memory | bytes |
| Collisions | Network Collision | cnt |
| Network In Bytes | Received byte count | bytes |
| Network In Bytes [Delta Avg] | Individual Networks’ average of system.network.in.bytes_delta | bytes |
| Network In Bytes [Delta Max] | Individual Network’s system.network.in.bytes_delta maximum | bytes |
| Network In Bytes [Delta Min] | Individual Networks’ system.network.in.bytes_delta minimum values | bytes |
| Network In Bytes [Delta Sum] | Individual networks’ sum of system.network.in.bytes_delta | bytes |
| Network In Bytes [Delta] | Received byte count delta | bytes |
| Network In Dropped | Number of packets dropped among incoming packets | cnt |
| Network In Errors | Number of errors during reception | cnt |
| Network In Packets | Received packet count | cnt |
| Network In Packets [Delta Avg] | Individual Networks’ average of system.network.in.packets_delta | cnt |
| Network In Packets [Delta Max] | Individual Network’s system.network.in.packets_delta maximum value | cnt |
| Network In Packets [Delta Min] | Individual Network’s system.network.in.packets_delta minimum value | cnt |
| Network In Packets [Delta Sum] | The sum of system.network.in.packets_delta of individual Networks | cnt |
| Network In Packets [Delta] | Received packet count delta | cnt |
| Network Out Bytes | Sent byte count | bytes |
| Network Out Bytes [Delta Avg] | Individual Networks’ average of system.network.out.bytes_delta | bytes |
| Network Out Bytes [Delta Max] | Individual Networks’ system.network.out.bytes_delta maximum | bytes |
| Network Out Bytes [Delta Min] | Individual Networks’ system.network.out.bytes_delta minimum value | bytes |
| Network Out Bytes [Delta Sum] | Individual Network’s system.network.out.bytes_delta sum | bytes |
| Network Out Bytes [Delta] | Sent byte count delta | bytes |
| Network Out Dropped | number of packets dropped among outgoing packets | cnt |
| Network Out Errors | Number of errors during transmission | cnt |
| Network Out Packets | Transmitted packet count | cnt |
| Network Out Packets [Delta Avg] | Average of system.network.out.packets_delta for individual Networks | cnt |
| Network Out Packets [Delta Max] | Individual Networks’ system.network.out.packets_delta maximum values | cnt |
| Network Out Packets [Delta Min] | Individual Network’s system.network.out.packets_delta minimum value | cnt |
| Network Out Packets [Delta Sum] | The sum of system.network.out.packets_delta of individual Networks | cnt |
| Network Out Packets [Delta] | Sent packet count delta | cnt |
| Open Connections [TCP] | All open TCP connections | cnt |
| Open Connections [UDP] | All open UDP connections | cnt |
| Port Usage | Accessible port usage rate | % |
| SYN Sent Sockets | Number of sockets in SYN_SENT state (when connecting from local to remote) | cnt |
| Kernel PID Max | kernel.pid_max value | count |
| Kernel Thread Max | kernel threads maximum value | count |
| Process CPU Usage | The percentage of CPU time consumed by the process after the last update | % |
| Process CPU Usage/Core | The percentage of CPU time used by the process since the last event | % |
| Process Memory Usage | Percentage of main memory (RAM) occupied by the process | % |
| Process Memory Used | Resident Set size. The amount of memory a process occupies in RAM | bytes |
| Process PID | Process pid | pid |
| Process PPID | Parent process’s pid | pid |
| Processes [Dead] | number of dead processes | cnt |
| Processes [Idle] | idle Number of Processes | cnt |
| Processes [Running] | running Number of Processes | count |
| Processes [Sleeping] | sleeping processes count | cnt |
| Processes [Stopped] | stopped processes count | cnt |
| Processes [Total] | Total number of processes | cnt |
| Processes [Unknown] | Number of processes whose status is unknown or cannot be determined | cnt |
| Processes [Zombie] | Number of zombie processes | cnt |
| Running Process Usage | Process Usage Rate | % |
| Running Processes | number of running processes | count |
| Running Thread Usage | thread usage rate | % |
| Running Threads | Total number of threads in running processes | cnt |
| Context Switches | number of context switches (per second) | cnt |
| Load/Core [1 min] | 1-minute load average divided by the number of cores | cnt |
| Load/Core [15 min] | 15-minute load average divided by the number of cores | cnt |
| Load/Core [5 min] | 5-minute load average divided by the number of cores | cnt |
| Multipaths [Active] | External storage connection path status = active count | cnt |
| Multipaths [Failed] | External storage connection path status = failed count | cnt |
| Multipaths [Faulty] | External storage connection path status = faulty count | cnt |
| NTP Offset last | Measured offset of the last sample (time difference between the NTP server and the local clock) | num |
| Run Queue Length | Execution Waiting Queue Length | num |
| Uptime | OS operation time(uptime) (milliseconds) | ms |
| Context Switches CPU | Number of context switches (per second) | cnt |
| Disk Read Bytes [Sec] | bytes read from the Windows logical disk in 1 second
| cnt |
| Disk Read Time [Avg] | Data Read Average Time (sec)
| sec |
| Disk Transfer Time [Avg] | Disk average wait time (seconds)
| sec |
| Disk Write Bytes [Sec] | The number of bytes written to the Windows logical disk in 1 second
| cnt |
| Disk Write Time [Avg] | Data write average time (seconds)
| sec |
| Pagingfile Usage | Paging file usage rate
| % |
| Pool Used [Non Paged] | _KERNEL MEMORY among Nonpaged Pool usage
| bytes |
| Pool Used [Paged] | Kernel memory Paged Pool usage among kernel memory
| bytes |
| Process [Running] | The number of processes currently running
| cnt |
| Threads [Running] | The number of threads currently running
| cnt |
| Threads [Waiting] | The number of threads waiting for processor time
| cnt |
1.1.3 - ServiceWatch Metrics
Virtual Server sends metrics to ServiceWatch. The metrics provided by basic monitoring are data collected at 5-minute intervals. When detailed monitoring is enabled, you can view data collected at 1-minute intervals.
For information on how to enable detailed monitoring for Virtual Server, refer to How-to guides > Enable ServiceWatch detailed monitoring.
Basic Metrics
The following are basic metrics for the namespace Virtual Server.
In the table below, metrics with metric names marked in bold are selected as key metrics among the basic metrics provided by Virtual Server. Key metrics are used to configure service dashboards that are automatically built for each service in ServiceWatch. You can also check key metrics on the monitoring tab of the Virtual Server detail page.
For each metric, the table indicates which statistic values are meaningful when querying it; among these, the statistic marked in bold is the key statistic value. The service dashboard and monitoring tab display key metrics using their key statistic values.
| Performance Item (Metric Name) | Detailed Description | Unit | Meaningful Statistics |
|---|---|---|---|
| Instance State | Instance state display | None | |
| CPU Usage | CPU usage | Percent | |
| Disk Read Bytes | Amount read from block device (bytes) | Bytes | |
| Disk Read Requests | Number of read requests from block device | Count | |
| Disk Write Bytes | Amount written to block device (bytes) | Bytes | |
| Disk Write Requests | Number of write requests to block device | Count | |
| Network In Bytes | Amount received on network interface (bytes) | Bytes | |
| Network In Dropped | Number of received packets dropped on network interface | Count | |
| Network In Packets | Number of received packets on network interface | Count | |
| Network Out Bytes | Amount transmitted on network interface (bytes) | Bytes | |
| Network Out Dropped | Number of transmitted packets dropped on network interface | Count | |
| Network Out Packets | Number of transmitted packets on network interface | Count | |
1.2 - How-to guides
Users can create Virtual Server services by entering required information and selecting detailed options through Samsung Cloud Platform Console.
Create Virtual Server
You can create and use Virtual Server service in Samsung Cloud Platform Console.
To create a Virtual Server, follow the procedure below.
Click All Services > Compute > Virtual Server menu. You will be moved to the Service Home page of Virtual Server.
On the Service Home page, click the Create Virtual Server button. You will be moved to the Create Virtual Server page.
On the Create Virtual Server page, enter information required for service creation and select detailed options.
- In the Image and version selection area, select the required information.
- Image (Required): Select the type of Image provided
  - Standard: Samsung Cloud Platform standard provided Image
    - Alma Linux, Oracle Linux, RHEL, Rocky Linux, Ubuntu, Windows
  - Custom: User created Image
  - Kubernetes: Image for Kubernetes
    - RHEL, Ubuntu
  - Marketplace: Image subscribed from Marketplace
- Image Version (Required): Select version of the selected Image
  - Provides the version list of the selected server Image
Table. Virtual Server Image and Version Selection Input Items
- In the Service information input area, enter or select the required information.
- Server Count (Required): Number of servers to create simultaneously
  - Only numbers can be entered; enter a value between 1 and 100
- Service Type > Server Type (Required): Virtual Server server type
  - Standard: Standard specs generally used
- High Capacity: Large capacity server specs larger than Standard
- For details on server types provided by Virtual Server, refer to Virtual Server Server Type
- Service Type > Planned Compute (Required): Resource status with Planned Compute set
  - In Use: Number in use among resources with Planned Compute set
- Set: Number of resources with Planned Compute set
- Coverage Preview: Amount applied as Planned Compute per resource
- Apply for Planned Compute Service: Move to Planned Compute service creation page
- For details, refer to Apply for Planned Compute
- Block Storage (Required): Configure Block Storage used by the server according to purpose
  - Default OS: Area where the OS is installed and used
- Capacity is entered in Units, minimum capacity varies depending on OS Image type
- Alma Linux: Enter value between 2 and 1,536
- Oracle Linux: Enter value between 7 and 1,536
- RHEL: Enter value between 2 and 1,536
- Rocky Linux: Enter value between 2 and 1,536
- Ubuntu: Enter value between 2 and 1,536
- Windows: Enter value between 4 and 1,536
- SSD: High performance general volume
- HDD: General volume
- SSD/HDD_KMS: Additional encryption volume using Samsung Cloud Platform KMS (Key Management Service) encryption key
- Encryption application is only possible at initial creation and cannot be changed after creation
- Performance degradation may occur when using the SSD_KMS disk type
- SSD_Provisioned: SSD volume with configurable IOPS and Throughput
- Capacity is entered in Units, minimum capacity varies depending on OS Image type
- Additional: Use when additional user space is needed outside OS area
- Select Use and then enter storage type and capacity
- To add storage, click + button, to delete, click x button (can add up to 25)
- Capacity is entered in Units, enter value between 1 and 1,536
- 1 Unit is 8GB, so 8 to 12,288GB is created
- SSD: High performance general volume
- HDD: General volume
- SSD/HDD_KMS: Additional encryption volume using Samsung Cloud Platform KMS (Key Management Service) encryption key
- Encryption application is only possible at initial creation and cannot be changed after creation
- Performance degradation may occur when using SSD_KMS disk type
- SSD/HDD_MultiAttach: Volume that can connect to 2 or more servers
- SSD_Provisioned: SSD volume with configurable IOPS and Throughput
- For details by Block Storage type, refer to Create Block Storage
- Delete on termination: When Delete on Termination is selected, the Volume is also terminated when the server is terminated
- Volumes with snapshots are not deleted even when Delete on termination is Use
- Multi attach volumes are only deleted when the server to be deleted is the last remaining server connected to the volume
- Max IOPS: Enter IOPS maximum value within 5,000 to 20,000
- Cannot set when disk type is HDD, HDD_KMS, HDD_MultiAttach
- Server Group (Optional): Set servers belonging to the same Server Group to Anti-Affinity (distributed placement), Affinity (close placement), or Partition (Virtual Server and Block Storage distributed placement)
  - Select Use and then select a Server Group
- Select Create New to create Server Group
- Servers belonging to the same Server Group are placed in Best Effort manner according to selected policy
- Select policy from Anti-Affinity (distributed placement), Affinity (close placement), Partition (Virtual Server and Block Storage distributed placement)
Table. Virtual Server Service Information Input Items

Caution: When using the Partition (Virtual Server and Block Storage distributed placement) policy among Server Group policies, Block Storage Volumes cannot be additionally allocated after Virtual Server creation, so create all necessary Block Storage at the Virtual Server creation stage.

- In the Required information input area, enter or select the required information.
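The Block Storage capacity and IOPS limits described above can be checked with simple arithmetic. The sketch below uses assumed example values (`units=4`, `iops=15000`), not platform defaults:

```shell
# Additional Block Storage capacity is entered in Units; per the guide above,
# 1 Unit = 8 GB, so the allowed 1..1,536 Units maps to 8..12,288 GB.
units=4                       # assumed example value
gb=$(( units * 8 ))
echo "${units} Units = ${gb} GB"

# Max IOPS for an SSD_Provisioned volume must be within 5,000..20,000:
iops=15000                    # assumed example value
if [ "$iops" -ge 5000 ] && [ "$iops" -le 20000 ]; then
  echo "Max IOPS ${iops}: within the allowed range"
fi
```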
- Server Name (Required): Enter name for server identification when the selected server count is 1
  - The hostname is set from the entered server name
  - Enter within 63 characters using English letters, numbers, spaces, and special characters (-, _)
- Network Settings > Create New Network Port (Required): Set the network where the Virtual Server will be installed
  - VPC Name: Select pre-created VPC
- General Subnet: Select pre-created general Subnet
- IP can select Auto Create or user input, if Input is selected, user can directly enter IP
- NAT: Can only use when server count is 1 and Internet Gateway is connected to VPC. Check Use to select NAT IP
- NAT IP: Select NAT IP
- If there is no NAT IP to select, click Create New button to create Public IP
- Click Refresh button to check and select created Public IP
- When Public IP is created, fees are charged according to Public IP fee standard
- Local Subnet (Optional): Select Use for Local Subnet
- Not required for service creation
- Must select pre-created Local Subnet
- IP can select Auto Create or user input, if Input is selected, user can directly enter IP
- Security Group: Settings required to connect to server
- Select: Select pre-created Security Group
- Create New: If there is no Security Group to apply, can create separately in Security Group service
- Can select up to 5
- If Security Group is not set, all connections are blocked by default
- Security Group must be set to allow necessary connections
- Network Settings > Specify Existing Network Port (Required): Set the network where the Virtual Server will be installed
  - VPC: Select pre-created VPC
- General Subnet: Select pre-created general Subnet and Port
- NAT: Can only use when server count is 1 and Internet Gateway is connected to VPC. Check Use to select NAT IP
- NAT IP: Select NAT IP
- If there is no NAT IP to select, click Create New button to create Public IP
- Click Refresh button to check and select created Public IP
- Local Subnet (Optional): Select Use for Local Subnet
- Select pre-created Local Subnet and Port
- Keypair (Required): User authentication method to use when connecting to the server
  - Create New: Create a new one when a new Keypair is needed
- For the new Keypair creation method, refer to Create Keypair
- Default login account list by OS
- Alma Linux: almalinux
- Oracle Linux: cloud-user
- RHEL: cloud-user
- Rocky Linux: rocky
- Ubuntu: ubuntu
- Windows: sysadmin
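The default login account list above determines which user name to use when connecting. The sketch below maps an OS image to its default account and builds an SSH command; the key file name and server IP are placeholders, not real values.

```shell
# Default login account by OS image, per the list above.
os="Ubuntu"                           # assumed example image
case "$os" in
  "Alma Linux")          user="almalinux"  ;;
  "Oracle Linux"|"RHEL") user="cloud-user" ;;
  "Rocky Linux")         user="rocky"      ;;
  "Ubuntu")              user="ubuntu"     ;;
  "Windows")             user="sysadmin"   ;;
esac
# Hypothetical key file name and IP address:
echo "ssh -i my-keypair.pem ${user}@203.0.113.10"
```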
Table. Virtual Server Required Information Input Items

- In the Additional information input area, enter or select the required information.
- Lock (Optional): Set whether to use Lock
  - When Lock is used, operations such as server termination, start, and stop are prevented to protect against accidental changes
- Init Script (Optional): Script to execute when the server starts
  - Depending on the Image type, the init script must be written as a Batch script (Windows) or as a Shell script or cloud-init configuration (Linux)
- Can enter up to 45,000 bytes
- Tag (Optional): Add tags
  - Can add up to 50 per resource
- Click Add Tag button and then enter or select Key, Value values
Table. Virtual Server Additional Information Input Items
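As an illustration of the Init Script option above, the following is a minimal Linux shell init script sketch. It is not a platform-provided template; the log file path and message are arbitrary examples.

```shell
#!/bin/bash
# Minimal example init script (Linux, shell form): runs at server start.
# The log path below is an illustrative example, not a platform default.
set -eu
LOGFILE="${LOGFILE:-./first-boot.log}"
echo "init script ran at $(date -u +%Y-%m-%dT%H:%M:%SZ)" >> "$LOGFILE"
```

Remember that the script body must stay within the 45,000-byte limit noted in the table.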
On the Summary panel, check the created detailed information and estimated billing amount, then click the Create button.
- When creation is complete, check the created resource on the Virtual Server List page.
- When entering the server name, if spaces or the special character (_) are used, the OS hostname is set with those characters changed to a hyphen (-). Keep this in mind when setting the OS hostname.
  - Example: If the server name is 'server name_01', the OS hostname is set to 'server-name-01'.
- If you need to manage server names uniquely, create with different server names (Prefix).
- When creating servers, numbers do not automatically increase based on server name (Prefix), so Virtual Servers with the same name may be created.
- Example: If you first create 2 Virtual Servers using 'test' as server name (Prefix), 'test-1', 'test-2' are created. Even if you create 2 Virtual Servers later using 'test' as Prefix again, 'test-1', 'test-2' are created.
- If you create Virtual Server with Rocky Linux or Oracle Linux, additional settings are required for time synchronization (NTP: Network Time Protocol). For details, refer to Configure Linux NTP.
- If you created a RHEL or Windows Server before July 2025, you need to modify the RHEL Repository and WKMS (Windows Key Management Service) settings. For details, refer to Configure RHEL Repo and WKMS.
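The server-name-to-hostname conversion described in the notes above can be sketched in shell: spaces and underscores in the server name become hyphens in the OS hostname.

```shell
# Hostname derivation per the documented rule: spaces and underscores
# in the server name are replaced with hyphens.
server_name="server name_01"
os_hostname=$(printf '%s' "$server_name" | tr ' ' '-' | tr '_' '-')
echo "$os_hostname"   # → server-name-01
```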
Check Virtual Server Details
Virtual Server service can check and modify overall resource list and detailed information. Virtual Server Detail page is composed of Detail Information, Monitoring, Tags, Operation History tabs.
To check detailed information of Virtual Server service, follow the procedure below.
- Click All Services > Compute > Virtual Server menu. You will be moved to the Service Home page of Virtual Server.
- On the Service Home page, click the Virtual Server menu. You will be moved to the Virtual Server List page.
- On the Virtual Server List page, click the resource to check detailed information for. You will be moved to the Virtual Server Detail page.
- Virtual Server Detail page displays status information and additional function information, and is composed of Detail Information, Monitoring, Tags, Operation History tabs.
- For details on Virtual Server status, refer to Check Virtual Server Detail Information.
Category Detailed Description Virtual Server Status Status of Virtual Server created by user - Build: State where Build command is transmitted
- Building: Build in progress
- Networking: Process in progress during server creation
- Scheduling: Process in progress during server creation
- Block_Device_Mapping: Connecting Block Storage during server creation
- Spawning: State where server creation process is in progress
- Active: Usable state
- Powering_off: State when stop is requested
- Deleting: Server deletion in progress
- Reboot_Started: State where Reboot is in progress
- Error: Error state
- Migrating: State where server is being migrated to another host
- Reboot: State where Reboot command is transmitted
- Rebooting: Restart in progress
- Rebuild: State where Rebuild command is transmitted
- Rebuilding: State when Rebuild is requested
- Rebuild_Spawning: State where Rebuild process is in progress
- Resize: State where Resize command is transmitted
- Resizing: Resize in progress
- Resize_Prep: State when server type modification is requested
- Resize_Migrating: State where server is moving to another host while Resize is in progress
- Resize_Migrated: State where server has completed moving to another host while Resize is in progress
- Resize_Finish: Resize completed
- Revert_Resize: Server Resize or migration failed for some reason. Target server is cleaned and original source server is restarted
- Shutoff: State when Powering off is completed
- Verify_Resize: State where server type confirmation or server type reversion is selectable after Resize_Prep progresses, according to the server type modification request
- Resize_Reverting: State when server type reversion is requested
- Resize_Confirming: State where server Resize request is being confirmed
Server Control Buttons to change server status - Start: Start stopped server
- Stop: Stop running server
- Restart: Restart running server
Create Image Create a user image from the current server image - For detailed image creation method, refer to Create Image
Console Log Check current server console log - Can check console log output from current server. For details, refer to Check Console Log
Create Dump Create current server Dump - Dump file is created inside Virtual Server
- For detailed Dump creation method, refer to Create Dump
Rebuild Delete existing Virtual Server OS area data and settings, and configure by Rebuilding as new server - For details, refer to Perform Rebuild
Terminate Service Button to terminate service Table. Virtual Server Status Information and Additional Functions
Detail Information
On the Virtual Server List page, you can check detailed information of selected resource and modify information if necessary.
| Category | Detailed Description |
|---|---|
| Service | Service name |
| Resource Type | Resource type |
| SRN | Unique resource ID in Samsung Cloud Platform |
| Resource Name | Resource name |
| Resource ID | Unique resource ID in service |
| Creator | User who created service |
| Creation Date | Date and time when service was created |
| Modifier | User who modified service information |
| Modification Date | Date and time when service information was modified |
| Server Name | Server name |
| Server Type | Display vCPU, memory information |
| Image Name | Server OS Image and version |
| Lock | Display whether Lock is used/not used |
| Server Group | Server group name to which server belongs |
| Keypair Name | Server authentication information set by user |
| Planned Compute | Resource status with Planned Compute set |
| LLM Endpoint | URL for using LLM |
| ServiceWatch Detailed Monitoring | Display whether ServiceWatch detailed monitoring is enabled |
- To **enable** ServiceWatch detailed monitoring, click **Modify** button to set
- For details, refer to [Enable ServiceWatch Detailed Monitoring](#servicewatch-세부-모니터링-활성화하기)
- For details on ServiceWatch service, refer to [ServiceWatch Overview](/userguide/management/service_watch/overview/_index.md)
- Not provided for Auto-Scaling Group, Virtual Server created from Marketplace
- VPC, General Subnet, IP and status, Public NAT IP and status, Private NAT IP and status, Virtual IP, Security Group
- If IP change is needed, click **Modify** button to set
- Can only modify when Virtual Server status is **Active**, **Shutoff**
- For Default port, **Default** is displayed next to IP, and cannot detach
- If Security Group change is needed, click **Modify** button to set
- If Virtual IP change is needed, can modify on **Virtual IP Management** tab of **Networking > VPC > Subnet Detail** page
- **Add as New Network Port**: Select General Subnet and IP
- Can select other General Subnet within same VPC
- IP can select auto create or user input, if input is selected, user can directly enter IP
- **Add as Existing Network Port**: Select pre-created General Subnet and port
- Local Subnet, Local Subnet IP, Security Group name, Virtual IP
- If Security Group change is needed, click **Modify** button to set
- **Add as New Network Port**: Select Local Subnet and IP
- Can select other Local Subnet within same VPC
- **IP** can select **Auto Create** or user input, if **Input** is selected, user can directly enter IP
- **Add as Existing Network Port**: Select pre-created Local Subnet and port
- Volume ID, Volume name, Disk type, Capacity, Connection information, Type, Delete on termination, Status
- **Add**: Connect additional Block Storage when needed
- **Modify Delete on termination**: Modify Delete on termination value of selected Block Storage from list
- **More > Disconnect**: Disconnect Block Storage connection of selected Block Storage from list
- Cannot disconnect for OS default Storage
Monitoring
On the Virtual Server List page, you can monitor ServiceWatch metrics of selected resource. On the Monitoring tab, you can view monitoring charts for Virtual Server, and each chart is based on available Service Watch metrics.
| Category | Detailed Description |
|---|---|
| Period Setting Area | Select period to apply to chart |
| Timezone Setting Area | Select timezone to apply to chart |
| Reset Button | Reset all manipulations or settings made on chart |
| Refresh Setting Area | Select refresh period of chart |
| Move to Service Dashboard | Move to ServiceWatch dashboard list screen |
| More | Display additional work items for managing chart |
| Graph Area | Data graph collected during period applied to chart |
- Metrics provided by basic monitoring are data collected at 5-minute intervals.
- For details on Virtual Server ServiceWatch metrics, refer to Virtual Server ServiceWatch Metrics.
Tags
On the Virtual Server List page, you can check tag information of selected resource, and add, modify, or delete tags.
| Category | Detailed Description |
|---|---|
| Tag List | Tag list |
Operation History
On the Virtual Server List page, you can check operation history of selected resource.
| Category | Detailed Description |
|---|---|
| Operation History List | Resource change history |
Control Virtual Server Operation
If operation control of created Virtual Server resource is needed, you can perform tasks on Virtual Server List or Virtual Server Detail page. You can start, stop, and restart running servers.
Start Virtual Server
You can start stopped (Shutoff) Virtual Server. To start Virtual Server, follow the procedure below.
- Click All Services > Compute > Virtual Server menu. You will be moved to the Service Home page of Virtual Server.
- On the Service Home page, click the Virtual Server menu. You will be moved to the Virtual Server List page.
- On the Virtual Server List page, click the resource to start among stopped (Shutoff) servers to move to the Virtual Server Detail page.
- On the Virtual Server List page, you can Start through More button on the right for each resource.
- After selecting multiple servers with checkbox, you can control multiple servers simultaneously through Start button at the top.
- On the Virtual Server Detail page, click Start button at the top to start the server. Check changed server status in Status Display item.
- When Virtual Server start is completed, server status changes from Shutoff to Active.
- For details on Virtual Server status, refer to Check Virtual Server Detail Information.
Stop Virtual Server
You can stop running (Active) Virtual Server. To stop Virtual Server, follow the procedure below.
- Click All Services > Compute > Virtual Server menu. You will be moved to the Service Home page of Virtual Server.
- On the Service Home page, click the Virtual Server menu. You will be moved to the Virtual Server List page.
- On the Virtual Server List page, click the resource to stop among running (Active) servers to move to the Virtual Server Detail page.
- On the Virtual Server List page, you can Stop through More button on the right for each resource.
- After selecting multiple servers with checkbox, you can control multiple servers simultaneously through Stop button at the top.
- On the Virtual Server Detail page, click Stop button at the top to stop the server. Check changed server status in Status Display item.
- When Virtual Server stop is completed, server status changes from Active to Shutoff.
- For details on Virtual Server status, refer to Check Virtual Server Detail Information.
Restart Virtual Server
You can restart created Virtual Server. To restart Virtual Server, follow the procedure below.
- Click All Services > Compute > Virtual Server menu. You will be moved to the Service Home page of Virtual Server.
- On the Service Home page, click the Virtual Server menu. You will be moved to the Virtual Server List page.
- On the Virtual Server List page, click the resource to restart to move to the Virtual Server Detail page.
- On the Virtual Server List page, you can Restart through More button on the right for each resource.
- After selecting multiple servers with checkbox, you can control multiple servers simultaneously through Restart button at the top.
- On the Virtual Server Detail page, click Restart button at the top to restart the server. Check changed server status in Status Display item.
- During Virtual Server restart, server status changes to Rebooting and finally to Active.
- For details on Virtual Server status, refer to Check Virtual Server Detail Information.
Manage Virtual Server Resources
If server control and management functions of created Virtual Server resource are needed, you can perform tasks on Virtual Server List or Virtual Server Detail page.
Create Image
You can create Image of running Virtual Server.
This content guides how to create user Image from running Virtual Server.
- Create user Image by clicking Create Image button on Virtual Server List or Virtual Server Detail page.
- For method of creating Image by uploading user’s owned Image file, refer to Create Image in Image Detail Guide.
To create Virtual Server Image, follow the procedure below.
Click All Services > Compute > Virtual Server menu. You will be moved to the Service Home page of Virtual Server.
On the Service Home page, click the Virtual Server menu. You will be moved to the Virtual Server List page.
On the Virtual Server List page, click the resource to create an Image from. You will be moved to the Virtual Server Detail page.
On the Virtual Server Detail page, click the Create Image button. You will be moved to the Create Image page.
- In the Service information input area, enter the required information.
- Image Name (Required): Name of Image to create
  - Enter within 200 characters using English letters, numbers, spaces, and special characters (-, _)

Table. Image Service Information Input Items
Check input information and click Complete button.
- When creation is complete, check the created resource on All Services > Compute > Virtual Server > Image List page.
- When an Image is created, it is saved to Object Storage used as internal storage, so a storage usage fee is charged for the Image.
- The file system of an Image created from an Active Virtual Server cannot be guaranteed to be consistent, so creating the Image after stopping the server is recommended.
Modify Server Type
You can modify server type of Virtual Server server.
To modify Virtual Server server type, follow the procedure below.
- Click All Services > Compute > Virtual Server menu. You will be moved to the Service Home page of Virtual Server.
- On the Service Home page, click the Virtual Server menu. You will be moved to the Virtual Server List page.
- On the Virtual Server List page, click the resource to control operation. You will be moved to the Virtual Server Detail page.
- On the Virtual Server Detail page, check server status and click server type Modify button. Modify Server Type popup window opens.
- On the Modify Server Type popup window, change server type and then click Confirm button.
- When Virtual Server server type is modified, Virtual Server status changes to Resize execution related status.
- For details on Virtual Server status, refer to Check Virtual Server Detail Information.
Change IP
For method of changing IP, refer to Change IP.
- If IP change is performed, can no longer communicate with that IP, and IP change cannot be cancelled during progress.
- Server is rebooted to apply changed IP.
- If server is in Load Balancer service, must delete existing IP from LB server group and directly add changed IP as member of LB server group.
- For servers using Public NAT/Private NAT, first disable use of Public NAT/Private NAT, and after completing the IP change, set it again.
- Whether Public NAT/Private NAT is used can be changed by clicking Modify button of Public NAT IP/Private NAT IP on Virtual Server Detail page.
Enable ServiceWatch Detailed Monitoring
By default, Virtual Server is linked with ServiceWatch for basic monitoring. If needed, you can enable detailed monitoring to more quickly identify and respond to operational issues. For details on ServiceWatch, refer to ServiceWatch Overview.
To enable Virtual Server ServiceWatch detailed monitoring, follow the procedure below.
- Click All Services > Compute > Virtual Server menu. You will be moved to the Service Home page of Virtual Server.
- On the Service Home page, click the Virtual Server menu. You will be moved to the Virtual Server List page.
- On the Virtual Server List page, click the resource to enable ServiceWatch detailed monitoring. You will be moved to the Virtual Server Detail page.
- On the Virtual Server Detail page, click ServiceWatch detailed monitoring Modify button. You will be moved to Modify ServiceWatch Detailed Monitoring popup window.
- On the Modify ServiceWatch Detailed Monitoring popup window, select Enable, check guide text and then click Confirm button.
- Check ServiceWatch detailed monitoring item on Virtual Server Detail page.
Disable ServiceWatch Detailed Monitoring
To disable Virtual Server ServiceWatch detailed monitoring, follow the procedure below.
- Click All Services > Compute > Virtual Server menu. You will be moved to the Service Home page of Virtual Server.
- On the Service Home page, click the Virtual Server menu. You will be moved to the Virtual Server List page.
- On the Virtual Server List page, click the resource to disable ServiceWatch detailed monitoring. You will be moved to the Virtual Server Detail page.
- On the Virtual Server Detail page, click ServiceWatch detailed monitoring Modify button. You will be moved to Modify ServiceWatch Detailed Monitoring popup window.
- On the Modify ServiceWatch Detailed Monitoring popup window, deselect Enable, check guide text and then click Confirm button.
- Check ServiceWatch detailed monitoring item on Virtual Server Detail page.
Virtual Server Management Additional Functions
For Virtual Server management, you can check the console log, create a Dump, and perform a Rebuild. Follow the procedures below for each task.
Check Console Log
You can check current console log of Virtual Server.
To check Virtual Server console log, follow the procedure below.
- Click All Services > Compute > Virtual Server menu. You will be moved to the Service Home page of Virtual Server.
- On the Service Home page, click the Virtual Server menu. You will be moved to the Virtual Server List page.
- On the Virtual Server List page, click the resource to check console log. You will be moved to the Virtual Server Detail page.
- On the Virtual Server Detail page, click Console Log button. You will be moved to Console Log popup window.
- Check console log output on Console Log popup window.
Create Dump
To create Virtual Server Dump file, follow the procedure below.
- Click All Services > Compute > Virtual Server menu. You will be moved to the Service Home page of Virtual Server.
- On the Service Home page, click the Virtual Server menu. You will be moved to the Virtual Server List page.
- On the Virtual Server List page, click the resource to check detailed information. You will be moved to the Virtual Server Detail page.
- On the Virtual Server Detail page, click Create Dump button.
- Dump file is created inside Virtual Server.
Perform Rebuild
You can delete the existing Virtual Server's OS area data and settings, and reconfigure it as a new server by performing a Rebuild.
To perform Virtual Server Rebuild, follow the procedure below.
- Click All Services > Compute > Virtual Server menu. You will be moved to the Service Home page of Virtual Server.
- On the Service Home page, click the Virtual Server menu. You will be moved to the Virtual Server List page.
- On the Virtual Server List page, click the resource to perform Rebuild. You will be moved to the Virtual Server Detail page.
- On the Virtual Server Detail page, click Rebuild button.
- During Virtual Server Rebuild, server status changes to Rebuilding and when Rebuild is completed, returns to state before Rebuild execution.
- For details on Virtual Server status, refer to Check Virtual Server Detail Information.
Terminate Virtual Server
Terminating unused Virtual Server can reduce operating costs. However, since Virtual Server termination can immediately stop running services, you must fully consider the impact of service interruption before proceeding with termination.
To terminate Virtual Server, follow the procedure below.
- Click All Services > Compute > Virtual Server menu. You will be moved to the Service Home page of Virtual Server.
- On the Service Home page, click the Virtual Server menu. You will be moved to the Virtual Server List page.
- On the Virtual Server List page, select the resource to terminate and click Terminate Service button.
- Termination of connected storage varies depending on the Delete on termination setting; see Termination Constraints.
- When termination is complete, check if resource is terminated on Virtual Server List page.
Termination Constraints
If a Virtual Server cannot be terminated when termination is requested, a popup window explains why. Refer to the cases below.
- When File Storage is connected: First disconnect File Storage connection.
- When LB server group is connected: First disconnect LB server group Pool connection.
- When Lock is set: Change Lock setting to not used and then try again.
- When Backup is connected: First disconnect Backup connection.
- When Auto-Scaling Group connected to Virtual Server is not In Service state: Change state of connected Auto-Scaling Group and then try again.
Deletion of connected storage depends on the Delete on termination setting:
- When Delete on termination is not set: Volume is not deleted even if Virtual Server is terminated.
- When Delete on termination is set: Volume is deleted when Virtual Server is terminated.
- Volumes with snapshots are not deleted even if Delete on termination is set.
- Multi attach volumes are deleted only when the server to be deleted is the last remaining server connected to the volume.
1.2.1 - Image
Users can create an Image within the Virtual Server service by entering the required information and selecting detailed options in the Samsung Cloud Platform Console.
Image Creation
You can create and use the Image service while using the Virtual Server service on the Samsung Cloud Platform Console.
To create an Image, follow the steps below.
Click the All Services > Compute > Virtual Server menu. Go to the Virtual Server’s Service Home page.
Click the Image menu on the Service Home page. Go to the Image List page.
Click the Image Create button on the Image List page. It navigates to the Image Create page.
- In the Service Information Input area, enter or select the required information.

| Category | Required | Detailed description |
|---|---|---|
| Image name | Required | Name of the Image to create. Enter up to 255 characters using English letters, numbers, spaces, and the special characters (-, _) |
| Image file > URL | Required | Enter the URL of the Image file uploaded to Object Storage. The URL can be copied from the Object Storage Details page. The Object Storage Bucket holding the Image file must be in the same zone as the server to be created. Only files with the .qcow2 extension are supported. Upload a secure Image file to minimize security risks |
| OS type | Required | OS type of the uploaded Image file. Select from Alma Linux, CentOS, Oracle Linux, RHEL, Rocky Linux, Ubuntu |
| Minimum Disk | Required | Minimum disk size (GB) for the Image to be created. Enter a value between 0 and 12,288 GB |
| Minimum RAM | Required | Minimum RAM capacity (GB) of the Image to be created. Enter a value between 0 and 2,097,151 GB |
| Visibility | Required | Access permissions for the Image. Private: can be used only within the Account; Shared: can be shared between Accounts |
| Protected | Optional | Whether Image deletion is prohibited. Checking Use prevents accidental deletion of the Image. This setting can be changed after Image creation |

Table. Image service information input items
- In the Additional Information Input area, enter or select the required information.

| Category | Required | Detailed description |
|---|---|---|
| Tag | Optional | Add tags (up to 50 per resource). After clicking the Add Tag button, enter or select Key and Value |

Table. Image additional information input items
- In the Summary panel, check the detailed information and estimated billing amount, then click the Complete button.
- When creation is complete, check the created resources on the Image List page.
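Because the console accepts only .qcow2 image files, it can help to sanity-check a file before uploading it to Object Storage. The sketch below checks the qcow2 magic bytes; the demo.qcow2 file created here is a synthetic placeholder for illustration, not a real disk image.

```shell
# qcow2 files begin with the 4-byte magic "QFI\xfb".
# Create a synthetic header purely for illustration:
printf 'QFI\xfb' > demo.qcow2

# Check the first three bytes before uploading a real image file:
if head -c 3 demo.qcow2 | grep -q 'QFI'; then
    echo "qcow2 magic OK"
else
    echo "not a qcow2 file" >&2
fi
```

A renamed raw or vmdk file would fail this check even though its extension says .qcow2.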
Viewing Image Details
The Image service lets you view the full resource list and detailed information, and edit the information if needed. The Image Details page consists of Detail Information, Tag, and Work History tabs.
To view detailed information of the Image service, follow the steps below.
- Click the All Services > Compute > Virtual Server menu. Go to the Service Home page of Virtual Server.
- Click the Image menu on the Service Home page. Go to the Image List page.
- On the Image List page, click the resource to view detailed information. The Image Details page opens.
- Image Details page displays status information and additional feature information, and consists of Detail Information, Tag, Work History tabs.
| Category | Detailed description |
|---|---|
| Image status | Status of the Image created by the user. Active: available; Queued: the Image has been uploaded and is waiting for processing; Importing: the Image has been uploaded and is being processed |
| Create shared Image | Create an Image to share with another Account. Available only when the Image's Visibility is Private and the Image has snapshot information |
| Share with another Account | Share the Image with another Account. Available when the Image's Visibility is Shared. Displayed only for Images created via Create shared Image or by uploading a qcow2 file |
| Image Delete | Button to delete the Image. A deleted Image cannot be recovered |

Table. Image status information and additional functions
Detailed Information
On the Image List page, you can view detailed information of the selected resource and edit the information if needed.
| Category | Detailed description |
|---|---|
| Service | Service name |
| Resource Type | Resource type |
| SRN | Unique resource ID in Samsung Cloud Platform |
| Resource Name | Image name |
| Resource ID | Unique resource ID of the Image |
| Creator | User who created the Image |
| Creation time | Image creation time |
| Editor | User who modified the Image information |
| Modification Date | Date/time when the Image information was modified |
| Image name | Image name |
| Minimum Disk | Minimum disk capacity of the Image (GB) |
| Minimum RAM | Minimum RAM capacity of the Image (GB) |
| OS type | OS type of the Image |
| OS hash algorithm | OS hash algorithm method |
| Visibility | Access permissions for the Image |
| Protected | Whether Image deletion is prohibited |
| Image size | Image size |
| Image Type | Classification by Image creation method |
| Image file URL | URL of the Image file uploaded to Object Storage when creating the Image |
| Sharing Status | Status of sharing the Image with other Accounts |
Tag
On the Image List page, you can view the tag information of the selected resource, and add, modify, or delete tags.
| Category | Detailed description |
|---|---|
| Tag List | Tag list |
Work History
You can view the operation history of the selected resource on the Image list page.
| Category | Detailed description |
|---|---|
| Work History List | Resource change history |
Image Resource Management
This section describes the control and management functions for created Images.
Create Image for Sharing
Create an Image to share with another Account.
- A shared Image can be created only when the Image's Visibility is Private and the Image has snapshot information.
- A shared Image includes only the OS area disk volume as the imaging target. Connected data volumes are not included in the Image, so if needed, copy the data to a separate volume and use the volume migration function.
To create an image for sharing, follow the steps below.
- Log in to the Account that will share the Image and click the All Services > Compute > Virtual Server menu. Go to the Virtual Server's Service Home page.
- Click the Image menu on the Service Home page. Navigate to the Image List page.
- Click the Image to create a shared Image on the Image List page. You will be taken to the Image Details page.
- Click the Create Shared Image button. A popup window notifying the creation of a shared Image opens.
- After checking the notification content, click the Complete button.
Share Image to another Account
You can share an Image with another Account.
- Only Images created by uploading a .qcow2 file or created via Create Shared Image on the Image Details page can be shared with other Accounts.
- The Image to be shared must have its Visibility set to Shared.
To share the Image with another Account, follow these steps.
Log in to the Account that will share the Image and click the All Services > Compute > Virtual Server menu. Navigate to the Virtual Server's Service Home page.
Click the Image menu on the Service Home page. It navigates to the Image List page.
On the Image List page, click the Image you want to share with another Account. It moves to the Image Details page.
Click the Share to another Account button. A popup window notifying Image sharing opens.
After checking the notification content, click the Confirm button. You will be taken to the Share Image with another Account page.
On the Share Image with another Account page, enter the Share Account ID and click the Complete button. A popup notifying Image sharing opens.
| Category | Required | Detailed description |
|---|---|---|
| Image name | - | Name of the Image to share. Input not allowed |
| Image ID | - | ID of the Image to share. Input not allowed |
| Shared Account ID | Required | Another Account ID to share with. Enter up to 64 characters using English letters, numbers, and the special character (-) |

Table. Image sharing items to another Account

After checking the notification content, click the Confirm button. You can check the information in the Sharing Status of the Image Details page.
- When first requested, the status is Pending, and when approval is completed by the Account to be shared, it changes to Accepted, and if approval is denied, it changes to Rejected.
Receive shared Image from another Account
To receive an Image shared from another Account, follow the steps below.
Log in to the Account receiving the shared Image and click the All Services > Compute > Virtual Server menu. Go to the Service Home page of the Virtual Server.
Click the Image menu on the Service Home page. It navigates to the Image List page.
On the Image List page, click the More > Image Share Request List button. The Image Share Request List popup opens.
In the Image Share Request List popup window, click the Approve or Reject button for the Image to be shared.
| Category | Detailed description |
|---|---|
| Image name | Name of the shared Image |
| OS type | OS type of the shared Image |
| Owner Account ID | Owner Account ID of the shared Image |
| Creation time | Creation time of the shared Image |
| Approval | Approve the shared Image |
| Reject | Reject the shared Image |

Table. Image sharing request list items

After checking the notification content, click the Confirm button. You can check the shared Image in the Image List.
Image Delete
You can delete unused images. However, once an image is deleted it cannot be recovered, so you should fully consider the impact before proceeding with the deletion.
To delete Image, follow the steps below.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Image menu on the Service Home page. Go to the Image list page.
- On the Image list page, select the resource to delete and click the Delete button.
- To delete multiple Images at once, select their checkboxes on the Image List page and click the Delete button at the top of the resource list.
- When deletion is complete, check on the Image List page whether the resource has been deleted.
1.2.2 - Keypair
Users can create a Keypair within the Virtual Server service by entering the required information and selecting detailed options through the Samsung Cloud Platform Console.
Creating a Keypair
You can create and use a Keypair while using the Virtual Server service in the Samsung Cloud Platform Console.
Follow these steps to create a Keypair.
All Services > Compute > Virtual Server – Click to go to the Virtual Server Service Home page.
On the Service Home page, click the Keypair menu to go to the Keypair List page.
On the Keypair List page, click the Create Keypair button to go to the Create Keypair page.
- In the Service Information Input section, enter the required information.
| Item | Required | Description |
|---|---|---|
| Keypair Name | Required | Name of the Keypair to create. Use English letters, numbers, spaces, and special characters (-, _) up to 255 characters |
| Keypair Type | Required | ssh |

Table. Keypair service information input items

- In the Additional Information Input section, enter or select the required information.
| Item | Required | Description |
|---|---|---|
| Tags | Optional | Add tags (up to 50 per resource). Click the Add Tag button and enter/select Key and Value |

Table. Additional Keypair information input items
Review the entered information and click the Create button.
- After creation, the new Keypair will appear on the Keypair List page.
- After creation, you can download the Private Key only once. It cannot be reissued, so ensure you have downloaded it.
- Store the downloaded Private Key in a secure location.
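Since the private key can be downloaded only once, it is worth locking down its file permissions immediately. A minimal sketch, in which the key file name my-keypair.pem, the login user, and the server IP are all placeholders:

```shell
# Placeholder standing in for the downloaded private key file:
touch my-keypair.pem

# Restrict the key so only the owner can read it;
# OpenSSH refuses private keys with looser permissions.
chmod 400 my-keypair.pem

# Then connect to the Virtual Server with the key (placeholder values):
# ssh -i my-keypair.pem ubuntu@203.0.113.10
```

Keep the key out of shared directories and version control; anyone holding it can log in to servers created with this Keypair.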
Viewing Keypair Details
The Keypair service allows you to view and edit the resource list and detailed information. The Keypair Details page consists of Details, Tags, and Activity History tabs.
To view the details of a Keypair, follow these steps.
- All Services > Compute > Virtual Server – Click to go to the Virtual Server Service Home page.
- On the Service Home page, click the Keypair menu to go to the Keypair List page.
- On the Keypair List page, click the resource you want to view details for. You will be taken to the Keypair Details page.
- The Keypair Details page displays status information and additional feature information, organized into Details, Tags, and Activity History tabs.
Details
You can view the detailed information of the selected resource from the Keypair List page, and modify the information if needed.
| Item | Description |
|---|---|
| Service | Service name |
| Resource Type | Resource type |
| SRN | Unique resource ID in Samsung Cloud Platform (for Keypair, refers to Keypair SRN) |
| Resource Name | Keypair name |
| Resource ID | Unique resource ID of the Keypair |
| Creator | User who created the Keypair |
| Creation Time | Timestamp when the Keypair was created |
| Modifier | User who modified the Keypair information |
| Modification Time | Timestamp when the Keypair information was modified |
| Keypair Name | Name of the Keypair |
| Fingerprint | Unique value to identify the key |
| User ID | User ID of the Keypair creator |
| Public Key | Public key information |
Tags
You can view, add, modify, or delete the tags of a selected resource from the Keypair List page.
| Item | Description |
|---|---|
| Tag List | List of tags. You can view the Key and Value of each tag. Up to 50 tags can be added per resource. When entering a tag, you can search the existing Key and Value lists to select them. |
Activity History
You can view the activity history of a selected resource from the Keypair List page.
| Item | Description |
|---|---|
| Activity History List | Resource change history, including operation time, resource ID, resource name, operation details, event topic, operation result, and operator information. |
Managing Keypair Resources
This section describes the control and management functions for Keypair.
Retrieving the Public Key
Follow these steps to retrieve the public key.
All Services > Compute > Virtual Server – Click to go to the Virtual Server Service Home page.
On the Service Home page, click the Keypair menu to go to the Keypair List page.
On the Keypair List page, click the More button at the top and then click Retrieve Public Key. You will be taken to the Retrieve Public Key page.
- In the Required Information Input section, enter or select the required information.
| Item | Required | Description |
|---|---|---|
| Keypair Name | Required | Name of the Keypair to retrieve |
| Keypair Type | Required | ssh |
| Public Key | Required | Enter the public key. File Upload: click the Attach File button to attach a public key file (only .pem files are allowed). Public Key Input: paste the copied public key value (you can copy the public key value from the Keypair Details page) |

Table. Required fields for retrieving a public key
Review the entered information and click the Complete button.
- After completion, the newly created resource will appear on the Keypair List page.
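If you need a public key value to paste into the Public Key Input field, you can generate a keypair locally with the standard ssh-keygen tool and derive its public key. A sketch under the assumption that the file name scp-imported-key is a placeholder of your choosing:

```shell
# Generate a local RSA keypair with no passphrase (placeholder file name):
ssh-keygen -t rsa -b 2048 -f ./scp-imported-key -N "" -q

# Derive the public key from the private key; this also recovers the
# public key value when only the private key file is at hand:
ssh-keygen -y -f ./scp-imported-key > scp-imported-key.pub

# Print the fingerprint, which should match the Fingerprint field
# shown on the Keypair Details page after registration:
ssh-keygen -lf scp-imported-key.pub
```

The contents of scp-imported-key.pub (a single `ssh-rsa AAAA…` line) is what the Public Key Input field expects.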
Deleting a Keypair
You can delete an unused Keypair. However, once a Keypair is deleted, it cannot be recovered, so please review the impact carefully before deletion.
To delete a Keypair, follow these steps.
- All Services > Compute > Virtual Server – Click to go to the Virtual Server Service Home page.
- On the Service Home page, click the Keypair menu to go to the Keypair List page.
- On the Keypair List page, select the resource you want to delete and click the Delete button.
- You can select multiple Keypairs using checkboxes and click the Delete button at the top of the resource list.
- After deletion is complete, verify that the resource has been removed from the Keypair List page.
1.2.3 - Server Group
Users can enter the required information for a Server Group within the Virtual Server service and select detailed options through the Samsung Cloud Platform Console to create the service.
Server Group Create
You can create and use the Server Group service while using the Virtual Server service in the Samsung Cloud Platform Console.
To create a Server Group, follow the steps below.
- Click the All Services > Compute > Virtual Server menu. Go to the Service Home page of Virtual Server.
- Click the Server Group menu on the Service Home page. Go to the Server Group List page.
- On the Server Group List page, click the Server Group Create button. Navigate to the Server Group Create page.
- In the Service Information Input area, enter or select the required information.
| Category | Required | Detailed description |
|---|---|---|
| Server Group name | Required | Name of the Server Group to create. Enter up to 255 characters using English letters, numbers, spaces, and the special characters (-, _) |
| Policy | Required | Placement policy for Virtual Servers belonging to the same Server Group. Anti-Affinity (distributed placement): places servers in the Server Group on different racks and hosts as much as possible. Affinity (proximate placement): places servers in the Server Group close together, within the same rack and host, as much as possible. Anti-Affinity and Affinity placement is best effort and not absolutely guaranteed. Partition (distributed placement of Virtual Server and Block Storage): places Virtual Servers in the Server Group and the Block Storage connected to those servers in different distribution units (Partitions); the Partition number is displayed so that it is clear which Partition each Virtual Server and its associated Block Storage belong to, and Partition numbers are assigned based on the Partition Size (up to 3) set for the Server Group |

Table. Server Group service information input items
- In the Additional Information Input area, enter or select the required information.

| Category | Required | Detailed description |
|---|---|---|
| Tag | Optional | Add tags (up to 50 per resource). After clicking the Add Tag button, enter or select Key and Value |

Table. Server Group additional information input items
- Check the input information and click the Complete button.
- When creation is complete, check the created resources on the Server Group List page.
Viewing Server Group Details
The Server Group service lets you view the full resource list and detailed information, and edit the information if needed. The Server Group Details page consists of Details, Tags, and Work History tabs.
To view detailed information of the Server Group, follow the steps below.
- Click the All Services > Compute > Virtual Server menu. Go to the Service Home page of Virtual Server.
- Click the Server Group menu on the Service Home page. You will be taken to the Server Group List page.
- On the Server Group List page, click the resource to view detailed information. It navigates to the Server Group Details page.
- The Server Group Details page displays status information and additional feature information, and consists of Details, Tags, and Work History tabs.
Detailed Information
On the Server Group List page, you can view detailed information of the selected resource and, if necessary, edit the information.
| Category | Detailed description |
|---|---|
| Service | Service name |
| Resource Type | Resource type |
| SRN | Unique resource ID in Samsung Cloud Platform |
| Resource Name | Server Group name |
| Resource ID | Unique resource ID of the Server Group |
| Creator | User who created the Server Group |
| Creation time | Server Group creation time |
| Server Group name | Server Group name |
| Policy | Anti-Affinity (distributed placement), Affinity (proximate placement), or Partition (distributed placement of Virtual Server and Block Storage) |
| Server Group Member | List of Virtual Servers belonging to the Server Group |
Tag
On the Server Group List page, you can view the tag information of the selected resource, and add, modify, or delete tags.

| Category | Detailed description |
|---|---|
| Tag List | Tag list |
Work History
You can view the operation history of the selected resource on the Server Group List page.
| Category | Detailed description |
|---|---|
| Work History List | Resource change history |
Server Group Delete
You can delete unused Server Groups. However, once a Server Group is deleted it cannot be recovered, so please review the impact thoroughly in advance before proceeding with deletion.
To delete a Server Group, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Go to the Virtual Server's Service Home page.
- On the Service Home page, click the Server Group menu. Navigate to the Server Group List page.
- On the Server Group List page, select the resource to delete and click the Delete button.
- To delete multiple Server Groups at once, select their checkboxes and click the Delete button at the top of the resource list.
- When deletion is complete, check whether the resource has been deleted on the Server Group list page.
1.2.4 - IP Change
You can change the IP of the Virtual Server and add network ports to the Virtual Server to set the IP.
IP Change
You can change the IP of the Virtual Server.
- If you proceed with changing the IP, you will no longer be able to communicate with that IP, and you cannot cancel the IP change while it is in progress.
- The server will be rebooted to apply the changed IP.
- If the server is running the Load Balancer service, you must delete the existing IP from the LB server group and directly add the changed IP as a member of the LB server group.
- Servers using Public NAT/Private NAT must first disable Public NAT/Private NAT, complete the IP change, and then configure it again.
- Whether to use Public NAT/Private NAT can be changed by clicking the Edit button of Public NAT IP/Private NAT IP on the Virtual Server Details page.
To change the IP, follow the steps below.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Virtual Server menu on the Service Home page. Move to the Virtual Server List page.
- On the Virtual Server List page, click the resource whose IP you want to change. Navigate to the Virtual Server Details page.
- On the Virtual Server Details page, click the Edit button of the IP item to change. The IP Edit popup opens.
- In the Edit IP popup window, select the Subnet, then set the IP to change.
- Input: Enter the IP to be changed directly.
- Automatic Generation: Automatically generate the IP and apply it.
- When the settings are complete, click the Confirm button.
- When the popup notifying IP modification opens, click the Confirm button.
Setting IP on the server after adding network ports
If you create a Virtual Server with Ubuntu Linux, after adding a network port on Samsung Cloud Platform, additional IP configuration is required on the server.
As the root user of the Virtual Server's OS, use the ip command to check the assigned network interface name.

```shell
ip a
```
Code block. ip command - network interface check command

- If there is an added interface, a result like the following is displayed.

```
[root@scp-test-vm-01 ~]# ip a
3: ens7: <BROADCAST,MULTICAST> mtu 9000 qdisc noop state DOWN group default qlen 1000
    link/ether fa:16:3e:98:b6:64 brd ff:ff:ff:ff:ff:ff
    altname enp0s7
```
Code block. ip command - network interface check result

Use a text editor (e.g., vim) to open the /etc/netplan/50-cloud-init.yaml file.
Add the following content to the /etc/netplan/50-cloud-init.yaml file and save it.

```yaml
network:
  version: 2
  ethernets:
    ens7:
      match:
        macaddress: "fa:16:3e:98:b6:64"
      dhcp4: true
      set-name: "ens7"
      mtu: 9000
```
Code block. Edit YAML file

Set the IP on the added network device using the netplan command.

```shell
netplan --debug apply
```
Code block. netplan apply

Use the ip command to verify that the IP is set correctly.

```
[root@scp-test-vm-01 ~]# ip a
3: ens7: <BROADCAST,MULTICAST> mtu 9000 qdisc noop state DOWN group default qlen 1000
    link/ether fa:16:3e:98:b6:64 brd ff:ff:ff:ff:ff:ff
    altname enp0s7
    inet 10.10.10.10/24 metric 100 brd 10.10.10.255 scope global dynamic ens7
       valid_lft 43197sec preferred_lft 43197sec
    inet6 fe80::f816:3eff:fe0a:96bf/64 scope link
       valid_lft forever preferred_lft forever
```
Code block. Check IP settings
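If the added port should use a fixed address rather than DHCP, netplan also supports static addressing. A sketch of the same interface stanza with a static address; the address 10.10.10.10/24 and the MAC address are the example values from above and must be adjusted to your subnet:

```yaml
network:
  version: 2
  ethernets:
    ens7:
      match:
        macaddress: "fa:16:3e:98:b6:64"
      set-name: "ens7"
      mtu: 9000
      dhcp4: false            # disable DHCP for this interface
      addresses:
        - 10.10.10.10/24      # static address assigned to the port
```

Apply it the same way with the netplan --debug apply command.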
1.2.5 - Linux NTP Setting
If a user creates a Virtual Server with Rocky Linux or Oracle Linux via the Samsung Cloud Platform Console, additional configuration is required for time synchronization (NTP: Network Time Protocol). For other OS standard Linux images (RHEL, Alma Linux, Ubuntu), NTP is already configured, so no additional setup is needed.
Install NTP Daemon
You can install the chrony daemon to configure NTP. To install the chrony daemon, follow the steps below.
Check whether the chrony package is installed using the dnf command as the root user of the Virtual Server's OS.

```shell
dnf list chrony
```
Code block. dnf command - chrony package installation check command

- If the chrony package is installed, a result like the following is displayed.

```
[root@scp-test-vm-01 ~]# dnf list chrony
Last metadata expiration check: 1:47:29 ago on Wed 19 Feb 2025 05:55:57 PM KST.
Installed Packages
chrony.x86_64    3.5-1.0.1.el8    @anaconda
```
Code block. dnf command - chrony package installation check result

If the chrony package is not installed, use the dnf command to install it.

```shell
dnf install chrony -y
```
Code block. dnf command - chrony package installation command
NTP Daemon Setup
To set up the chrony daemon, follow these steps.
Open the /etc/chrony.conf file using a text editor (e.g., vim).
Add the following content to the /etc/chrony.conf file and save it.

```
server 198.19.0.54 iburst
```
Code block. /etc/chrony.conf edit

Set the chrony daemon to start automatically using the systemctl command.

```shell
systemctl enable chronyd
```
Code block. systemctl command - chrony daemon auto start setting

Restart the chrony daemon using the systemctl command.

```shell
systemctl restart chronyd
```
Code block. systemctl command - restart chrony daemon

Run the chronyc sources command with the -v option (display detailed information) to check the IP address of the configured NTP server and verify whether synchronization is in progress.

```shell
chronyc sources -v
```
Code block. chronyc sources command - NTP synchronization check

- When you run the chronyc sources command, a result like the following is displayed.

```
[root@scp-test-vm-01 ~]# chronyc sources -v
210 Number of sources = 1

  .-- Source mode  '^' = server, '=' = peer, '#' = local clock.
 / .- Source state '*' = current synced, '+' = combined , '-' = not combined,
| /   '?' = unreachable, 'x' = time may be in error, '~' = time too variable.
||                                                 .- xxxx [ yyyy ] +/- zzzz
||      Reachability register (octal) -.           |  xxxx = adjusted offset,
||      Log2(Polling interval) --.      |          |  yyyy = measured offset,
||                                \     |          |  zzzz = estimated error.
||                                 |    |           \
MS Name/IP address         Stratum Poll Reach LastRx Last sample
=========================================================================
^* 198.19.0.54                   2   6   377    52   -129us[ -128us] +/-   14ms
```
Code block. chronyc sources command - NTP synchronization check result

Run the chronyc tracking command to check the synchronization metrics.

```
[root@scp-test-vm-01 ~]# chronyc tracking
Reference ID    : A9FEA9FE (198.19.0.54)
Stratum         : 3
Ref time (UTC)  : Wed Feb 19 18:48:41 2025
System time     : 0.000000039 seconds fast of NTP time
Last offset     : -0.000084246 seconds
RMS offset      : 0.000084246 seconds
Frequency       : 21.667 ppm slow
Residual freq   : +4.723 ppm
Skew            : 0.410 ppm
Root delay      : 0.000564836 seconds
Root dispersion : 0.027399288 seconds
Update interval : 2.0 seconds
Leap status     : Normal
```
Code block. chronyc tracking command - NTP synchronization metrics
1.2.6 - Setting up RHEL Repo and WKMS
- Users who created RHEL or Windows Server instances prior to August 2025 via the Samsung Cloud Platform Console need to modify the RHEL Repository and WKMS (Windows Key Management Service) settings.
- The SCP RHEL Repository is a repository provided by SCP to support user environments such as VPC Private Subnet where external access is restricted.
Because the SCP RHEL Repository synchronizes with each Region's local repository on an internal schedule, switching to an external public mirror site is recommended when you need to apply the latest patches quickly.
RHEL Repository Configuration Guide
In Samsung Cloud Platform, when using RHEL, you can install and download the same packages as the official RHEL Repository by utilizing the RHEL Repository provided by SCP.
SCP provides the latest version of the repository for the given major version by default. To set up the RHEL repository, follow the steps below.
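The repo file name to check follows the OS major version. A minimal sketch deriving it from a typical `/etc/os-release` format; the sample `VERSION_ID` value below is hypothetical, and on a real server you would read `/etc/os-release` directly.

```shell
# Hypothetical sample; on a real server use: os_release=$(cat /etc/os-release)
os_release='VERSION_ID="8.10"'

# Extract the major version and build the SCP repo file path
major=$(echo "$os_release" | awk -F'"' '/VERSION_ID/ {split($2, v, "."); print v[1]}')
echo "/etc/yum.repos.d/scp.rhel${major}.repo"
# → /etc/yum.repos.d/scp.rhel8.repo
```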
As the root user of the Virtual Server’s OS, use the `cat` command to check the `/etc/yum.repos.d/scp.rhel8.repo` or `/etc/yum.repos.d/scp.rhel9.repo` settings.

```
cat /etc/yum.repos.d/scp.rhel8.repo
```
Code block. repo configuration check (RHEL8)

```
cat /etc/yum.repos.d/scp.rhel9.repo
```
Code block. repo configuration check (RHEL9)

When checking the configuration file, the following result is displayed.
```
[rhel-8-baseos]
name=rhel-8-baseos
gpgcheck=0
enabled=1
baseurl=http://scp-rhel8-ip/rhel/8/baseos

[rhel-8-baseos-debug]
name=rhel-8-baseos-debug
gpgcheck=0
enabled=1
baseurl=http://scp-rhel8-ip/rhel/8/baseos-debug

[rhel-8-appstream]
name=rhel-8-appstream
gpgcheck=0
enabled=1
baseurl=http://scp-rhel8-ip/rhel/8/appstream
```
Code block. Check repo settings (RHEL8)

```
[rhel-9-for-x86_64-baseos-rpms]
name=rhel-9-for-x86_64-baseos-rpms
baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/baseos
gpgcheck=0
enabled=1

[rhel-9-for-x86_64-appstream-rpms]
name=rhel-9-for-x86_64-appstream-rpms
baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/appstream
gpgcheck=0
enabled=1

[codeready-builder-for-rhel-9-x86_64-rpms]
name=codeready-builder-for-rhel-9-x86_64-rpms
baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/codeready-builder
gpgcheck=0
enabled=1

[rhel-9-for-x86_64-highavailability-rpms]
name=rhel-9-for-x86_64-highavailability-rpms
baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/ha
gpgcheck=0
enabled=1

[rhel-9-for-x86_64-supplementary-rpms]
name=rhel-9-for-x86_64-supplementary-rpms
baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/supplementary
gpgcheck=0
enabled=1
```
Code block. Check repo settings (RHEL9)
Use a text editor (e.g., vim) to open the `/etc/hosts` file, add the line below, and save.

```
198.19.2.13 scp-rhel8-ip scp-rhel9-ip scp-rhel-ip
```
Code block. /etc/hosts file setting change

Verify the RHEL Repository connection configured on the server using the yum command.
```
yum repolist -v
```
Code block. repository connection settings check

If the RHEL Repository is successfully connected, you can check the Repository list.
```
Repo-id            : rhel-8-appstream
Repo-name          : rhel-8-appstream
Repo-revision      : 1718903734
Repo-updated       : Fri 21 Jun 2024 02:15:34 AM KST
Repo-pkgs          : 38,260
Repo-available-pkgs: 25,799
Repo-size          : 122 G
Repo-baseurl       : http://scp-rhel8-ip/rhel/8/appstream
Repo-expire        : 172,800 second(s) (last: Thu 08 Aug 2024 07:27:57 AM KST)
Repo-filename      : /etc/yum.repos.d/scp.rhel8.repo

Repo-id            : rhel-8-baseos
Repo-name          : rhel-8-baseos
Repo-revision      : 1718029433
Repo-updated       : Mon 10 Jun 2024 11:23:52 PM KST
Repo-pkgs          : 17,487
Repo-available-pkgs: 17,487
Repo-size          : 32 G
Repo-baseurl       : http://scp-rhel8-ip/rhel/8/baseos
Repo-expire        : 172,800 second(s) (last: Thu 08 Aug 2024 07:27:57 AM KST)
Repo-filename      : /etc/yum.repos.d/scp.rhel8.repo

Repo-id            : rhel-8-baseos-debug
Repo-name          : rhel-8-baseos-debug
Repo-revision      : 1717662461
Repo-updated       : Thu 06 Jun 2024 05:27:41 PM KST
Repo-pkgs          : 17,078
Repo-available-pkgs: 17,078
Repo-size          : 100 G
Repo-baseurl       : http://scp-rhel8-ip/rhel/8/baseos-debug
Repo-expire        : 172,800 second(s) (last: Thu 08 Aug 2024 07:27:57 AM KST)
Repo-filename      : /etc/yum.repos.d/scp.rhel8.repo
```
Code block. Repository list check
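The `/etc/hosts` change made earlier can also be applied idempotently from a script, which is convenient when configuring many servers. A minimal sketch; the IP is the example address from this guide, so confirm the address for your Region.

```shell
# Append the SCP repo host entry only if it is not already present
entry='198.19.2.13 scp-rhel8-ip scp-rhel9-ip scp-rhel-ip'

add_hosts_entry() {
  # usage: add_hosts_entry <file>  (run against /etc/hosts as root)
  grep -qxF "$entry" "$1" || printf '%s\n' "$entry" >> "$1"
}

# add_hosts_entry /etc/hosts
```

Running the function twice against the same file leaves only one copy of the entry.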
Windows Key Management Service Configuration Guide
In Samsung Cloud Platform, when using Windows Server, you can activate a genuine product license by using the Key Management Service provided by SCP. Follow the steps below.
Right-click the Windows Start icon and open Windows PowerShell (Administrator), or run cmd from the Windows Run menu.
In Windows PowerShell (Administrator) or cmd, run the command below to register the KMS Server.
```
slmgr /skms 198.19.2.23:1688
```
Code block. WKMS Settings

After executing the KMS Server registration command, check the notification popup indicating successful registration, then click OK.
Figure. WKMS setting check

In Windows PowerShell (Administrator) or cmd, run the command below to perform product activation.
```
slmgr /ato
```
Code block. Windows Server activation settings

After confirming the notification popup that the product activation was successful, click OK.
Figure. Windows Server genuine activation verification

In Windows PowerShell (Administrator) or cmd, run the command below to check whether activation succeeded.
```
slmgr /dlv
```
Code block. Windows Server genuine activation verification

After confirming in the notification popup that product activation was successfully performed, click OK.
Figure. Windows Server genuine activation verification
1.2.7 - Installing ServiceWatch Agent
Users can install the ServiceWatch Agent on a Virtual Server to collect custom metrics and logs.
ServiceWatch Agent
The agents required to collect custom metrics and logs for ServiceWatch on a Virtual Server can be broadly divided into two types: Prometheus Exporter and Open Telemetry Collector.
| Category | Description |
|---|---|
| Prometheus Exporter | Provides metrics of a specific application or service in a format that Prometheus can scrape. |
| Open Telemetry Collector | Acts as a centralized collector that gathers telemetry data such as metrics and logs from distributed systems, processes them (filtering, sampling, etc.), and exports them to multiple backends (e.g., Prometheus, Jaeger, Elasticsearch). |
Prerequisites for Using ServiceWatch Agent
To use the ServiceWatch Agent, please refer to Prerequisite environment configuration for ServiceWatch Agent and prepare the necessary settings.
Installing Prometheus Exporter for Virtual Server (Linux)
Install the Prometheus Exporter on a Linux server following the steps below.
Installing Node Exporter
Install Node Exporter according to the steps below.
Create Node Exporter User
Create a dedicated user to securely isolate the Node Exporter process.
```
sudo useradd --no-create-home --shell /bin/false node_exporter
```

Node Exporter Configuration
Download Node Exporter for installation. The guide uses the following version:
- Download path: /tmp
- Version: 1.7.0

```
cd /tmp
wget https://github.com/prometheus/node_exporter/releases/download/v1.7.0/node_exporter-1.7.0.linux-amd64.tar.gz
```
Code block. Node Exporter download command

Note: The latest Node Exporter version can be found at Node Exporter > Releases > Latest, and specific versions are available at Node Exporter > Releases.
Install the downloaded Node Exporter and set permissions on the executable.
```
cd /tmp
sudo tar -xvf node_exporter-1.7.0.linux-amd64.tar.gz -C /usr/local/bin --strip-components=1 node_exporter-1.7.0.linux-amd64/node_exporter
```
Code block. Node Exporter installation command

```
sudo chown node_exporter:node_exporter /usr/local/bin/node_exporter
```
Code block. Node Exporter permission setting command

Create a service file. Configure Node Exporter to collect memory (meminfo) or block storage (filesystem) metrics.
```
sudo vi /etc/systemd/system/node_exporter.service
```
Code block. Open Node Exporter service file

```
[Unit]
Description=Prometheus Node Exporter (meminfo and filesystem only)
Wants=network-online.target
After=network-online.target

[Service]
User=node_exporter
Group=node_exporter
Type=simple
# Disable default collectors, then enable memory (meminfo) and
# block storage filesystem metrics. systemd does not allow inline
# comments after flags, so they are listed without them.
ExecStart=/usr/local/bin/node_exporter \
  --collector.disable-defaults \
  --collector.meminfo \
  --collector.filesystem
Restart=on-failure

[Install]
WantedBy=multi-user.target
```
Code block. Node Exporter service file content

Note: Collectors can be enabled or disabled using flags.
- `--collector.{name}`: Enables a specific metric collector.
- `--no-collector.{name}`: Disables a specific metric collector.
- To disable all default metrics and enable only specific collectors, use `--collector.disable-defaults --collector.{name} ...`.
Below is a description of the main collectors.
| Collector | Description | Labels |
|---|---|---|
| meminfo | Provides memory statistics | - |
| filesystem | Provides filesystem statistics such as used disk space | `device`: Physical or virtual device path where the filesystem is located (e.g., `/dev/sda1`)<br>`fstype`: Filesystem type (e.g., `ext4`, `xfs`, `nfs`, `tmpfs`)<br>`mountpoint`: Path where the filesystem is mounted on the host OS; serves as an intuitive way to distinguish disks (e.g., `/`, `/var/lib/docker`, `/mnt/data`) |

Table. Description of main Node Exporter collectors

- For detailed information on available metrics and configuration, see the Node Exporter > Collector page.
- Available metrics may vary depending on the version of Node Exporter you use. See the Node Exporter repository.
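The labels above make it possible to compute per-mountpoint usage directly from the scraped text. A hedged sketch using hypothetical sample values; in practice, pipe in `curl -s http://localhost:9100/metrics` instead of the sample variable.

```shell
# Hypothetical sample lines in the filesystem collector's exposition format
metrics='node_filesystem_size_bytes{device="/dev/sda1",fstype="ext4",mountpoint="/"} 1.0e+10
node_filesystem_avail_bytes{device="/dev/sda1",fstype="ext4",mountpoint="/"} 2.5e+09'

# Usage percent per mountpoint = (size - avail) / size * 100
echo "$metrics" | awk '
  match($0, /mountpoint="[^"]*"/) { mp = substr($0, RSTART + 12, RLENGTH - 13) }
  /^node_filesystem_size_bytes/   { size[mp]  = $2 }
  /^node_filesystem_avail_bytes/  { avail[mp] = $2 }
  END { for (m in size) printf "%s used %.0f%%\n", m, (size[m] - avail[m]) / size[m] * 100 }'
# → / used 75%
```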
- Enable and start the service.
Register the Node Exporter service and verify the registered service and configured metrics.

```
sudo systemctl daemon-reload
sudo systemctl enable --now node_exporter
```
Code block. Enable and start Node Exporter service

```
sudo systemctl status node_exporter
```
Code block. Check Node Exporter service status

```
curl http://localhost:9100/metrics | grep node_memory
```
Code block. Verify Node Exporter metrics
See ServiceWatch > Using ServiceWatch Agent for details.
Installing Prometheus Exporter for Virtual Server (Windows)
Install the Prometheus Exporter on a Windows server following the steps below.
Installing Windows Exporter
Install the Windows Exporter according to the steps below.
Windows Exporter Configuration
Download the Windows Exporter installation file.
- Download path: C:\Temp
- Test version: 0.31.3

```
mkdir C:\Temp
Invoke-WebRequest -Uri "https://github.com/prometheus-community/windows_exporter/releases/download/v0.31.3/windows_exporter-0.31.3-amd64.exe" -OutFile "C:\Temp\windows_exporter-0.31.3-amd64.exe"
```
Code block. Windows Exporter download

Note: Windows Exporter versions and installation files can be found at Windows Exporter > Releases.
Test the Windows Exporter execution. By default, Windows Exporter enables all collectors, but to collect only desired metrics, enable the following collectors:
- Memory metrics: memory
- Block storage metrics: logical_disk
- Host name: os

```
cd C:\Temp
.\windows_exporter-0.31.3-amd64.exe --collectors.enabled memory,logical_disk,os
```
Code block. Test Windows Exporter execution
Collectors can be enabled using flags.
- `--collectors.enabled "[defaults]"`: Enables the default provided metrics.
- `--collectors.enabled {name},{name},{name}...`: Enables specific metrics.
Below is a description of the main collectors.
| Collector | Description | Labels |
|---|---|---|
| memory | Provides memory statistics | |
| logical_disk | Collects performance and status metrics of local logical disks (e.g., C:, D: drives) | |
- For detailed information on available metrics and configuration, see the Windows Exporter > Collector page.
- Available metrics may vary depending on the version of Windows Exporter you use. See the Windows Exporter repository.
Register the service and verify.
```
sc.exe create windows_exporter binPath= "C:\Temp\windows_exporter-0.31.3-amd64.exe --collectors.enabled memory,logical_disk,os" DisplayName= "Prometheus Windows Exporter" start= auto
Start-Service windows_exporter
```
Code block. Register Windows Exporter service

```
# Verify service
Get-Service windows_exporter

# Verify metrics
Invoke-WebRequest -Uri "http://localhost:9182/metrics" | Select-String memory
```
Code block. Verify Windows Exporter service

Set the configuration file.
- Use the `--config.file` option to specify a YAML configuration file.

```
.\windows_exporter.exe --config.file=config.yml
# When using an absolute path, wrap it in quotes
.\windows_exporter.exe --config.file="C:\Program Files\windows_exporter\config.yml"
```
Code block. Run Windows Exporter with config file

```
collectors:
  enabled: cpu,net,service
collector:
  service:
    include: windows_exporter
log:
  level: warn
```
Code block. Sample Windows Exporter config
- Refer to the official example config file at Windows Exporter > Example config file.
```
sc.exe create windows_exporter binPath= "C:\Temp\windows_exporter-0.31.3-amd64.exe --config.file=C:\Temp\config.yml" DisplayName= "Prometheus Windows Exporter" start= auto
Start-Service windows_exporter
```
Code block. Register Windows Exporter service with config file

See ServiceWatch > Using ServiceWatch Agent for details.
Node Exporter Metrics
Main Node Exporter metrics
The following are the collector and metric information available through Node Exporter. Collectors can be enabled, and specific metrics can be activated.
| Category | Collector | Metric | Description |
|---|---|---|---|
| Memory | meminfo | node_memory_MemTotal_bytes | Total memory |
| Memory | meminfo | node_memory_MemAvailable_bytes | Available memory (used for determining memory shortage) |
| Memory | meminfo | node_memory_MemFree_bytes | Free memory |
| Memory | meminfo | node_memory_Buffers_bytes | IO buffers |
| Memory | meminfo | node_memory_Cached_bytes | Page cache |
| Memory | meminfo | node_memory_SwapTotal_bytes | Total swap |
| Memory | meminfo | node_memory_SwapFree_bytes | Remaining swap |
| Filesystem | filesystem | node_filesystem_size_bytes | Total filesystem size |
| Filesystem | filesystem | node_filesystem_free_bytes | Total free space |
| Filesystem | filesystem | node_filesystem_avail_bytes | Space actually available to unprivileged users |
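For example, `node_memory_MemTotal_bytes` and `node_memory_MemAvailable_bytes` together give the memory usage ratio. A sketch with hypothetical sample values; in practice, pipe in `curl -s http://localhost:9100/metrics` instead of the sample variable.

```shell
# Hypothetical sample of the two memory metrics (4 GiB total, 1 GiB available)
metrics='node_memory_MemTotal_bytes 4.294967296e+09
node_memory_MemAvailable_bytes 1.073741824e+09'

# Used percent = (MemTotal - MemAvailable) / MemTotal * 100
echo "$metrics" | awk '
  /^node_memory_MemTotal_bytes/     { total = $2 }
  /^node_memory_MemAvailable_bytes/ { avail = $2 }
  END { printf "memory_used_percent=%.0f\n", (total - avail) / total * 100 }'
# → memory_used_percent=75
```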
Node Exporter collector and metric collection settings
Node Exporter enables most collectors by default, but you can enable or disable specific collectors as needed.
Enable specific collectors only
- When you want to use only memory and filesystem collectors:

```
# Enable the memory and filesystem collectors
./node_exporter --collector.meminfo --collector.filesystem
```
Code block. Enable specific Node Exporter collectors

- When you want to disable all default collectors and use only memory and filesystem collectors:

```
# Disable default metrics, then enable only the memory and filesystem collectors
./node_exporter --collector.disable-defaults --collector.meminfo --collector.filesystem
```
Code block. Enable specific Node Exporter collectors (disable defaults)

- Enable filesystem collector for specific mount points:

```
# Enable the filesystem collector for the / (root) and /data mount points
./node_exporter --collector.disable-defaults \
  --collector.filesystem.mount-points-include="/|/data"
```
Code block. Enable filesystem collector for specific mount points

- Enable filesystem collector excluding specific mount points:

```
# Exclude the /boot and /var/log mount points
./node_exporter --collector.disable-defaults \
  --collector.filesystem.mount-points-exclude="/boot|/var/log"
```
Code block. Exclude specific mount points from filesystem collector
Disable specific collectors (no-collector)
When you do not want to use the filesystem collector:
```
./node_exporter --no-collector.filesystem
```

Configure collector as a systemd service (recommended)

```
[Unit]
Description=Node Exporter
After=network-online.target

[Service]
User=nodeexp
# Disable all default metric collectors, then enable only meminfo and
# filesystem (systemd does not allow inline comments after flags)
ExecStart=/usr/local/bin/node_exporter \
  --collector.disable-defaults \
  --collector.meminfo \
  --collector.filesystem

[Install]
WantedBy=multi-user.target
```

Filter specific metrics
Using the Open Telemetry Collector configuration, you can select only the required metrics collected by Node Exporter.
For guidance on pre‑configuring the Open Telemetry Collector for ServiceWatch, see Prerequisite Open Telemetry Collector configuration for ServiceWatch.
Windows Exporter Metrics
Main Windows Exporter metrics
The following are the collector and metric information available through Windows Exporter. Collectors can be enabled, and specific metrics can be activated.
| Category | Collector | Metric | Description |
|---|---|---|---|
| Memory | memory | windows_memory_available_bytes | Available memory |
| Memory | memory | windows_memory_cache_bytes | Cached memory |
| Memory | memory | windows_memory_committed_bytes | Committed memory |
| Memory | memory | windows_memory_commit_limit | Commit limit |
| Memory | memory | windows_memory_pool_paged_bytes | Paged pool |
| Memory | memory | windows_memory_pool_nonpaged_bytes | Non‑paged pool |
| Disk | logical_disk | windows_logical_disk_free_bytes | Free space |
| Disk | logical_disk | windows_logical_disk_size_bytes | Total capacity |
| Disk | logical_disk | windows_logical_disk_read_bytes_total | Total read bytes |
| Disk | logical_disk | windows_logical_disk_write_bytes_total | Total written bytes |
| Disk | logical_disk | windows_logical_disk_read_seconds_total | Read latency |
| Disk | logical_disk | windows_logical_disk_write_seconds_total | Write latency |
| Disk | logical_disk | windows_logical_disk_idle_seconds_total | Idle time |
Windows Exporter collector and metric collection settings
Windows Exporter enables most collectors by default, but you can configure only the desired collectors.
Enable specific collectors only
If you want to use only the memory and logical disk collectors:

```
# The --collectors.enabled option disables defaults and enables only the listed collectors
.\windows_exporter.exe --collectors.enabled="memory,logical_disk"
```

`--collectors.enabled` will collect only the collectors specified in the option.

Configure collector as a service (recommended)
```
# Register windows_exporter as a service
sc.exe create windows_exporter binPath= "C:\Temp\windows_exporter-0.31.3-amd64.exe --config.file=C:\Temp\config.yml" DisplayName= "Prometheus Windows Exporter" start= auto

# Start the service
Start-Service windows_exporter
```

```
# Note this is not an exhaustive list of all configuration values
collectors:
  enabled: logical_disk,memory  # Set collectors to enable
collector:
  service:
    include: "windows_exporter"
  scheduled_task:
    include: /Microsoft/.+
log:
  level: debug
scrape:
  timeout-margin: 0.5
telemetry:
  path: /metrics
web:
  listen-address: ":9182"
```

Filter specific metrics
Using the Open Telemetry Collector configuration, you can select only the required metrics collected by Windows Exporter.
For guidance on pre‑configuring the Open Telemetry Collector for ServiceWatch, see Prerequisite Open Telemetry Collector configuration for ServiceWatch.
1.3 - API Reference
1.4 - CLI Reference
1.5 - Release Note
Virtual Server
- OS Image addition provision
- Standard Image has been added. (Windows Server 2016)
- SSD volume with configurable IOPS and Throughput has been added.
- You can select SSD_Provisioned disk type when creating Block Storage.
- You can set IOPS and Throughput maximum values.
- You can view Virtual Server ServiceWatch metric monitoring graphs on the detail page.
- OS Image addition provision
- Standard Image has been added. (Alma Linux 9.6, Oracle Linux 9.6, RHEL 9.6, Rocky Linux 9.6)
- New Server Group policy addition
- Partition (Virtual Server and Block Storage distributed placement) policy has been added.
- You can collect custom metrics and logs by installing Virtual Server ServiceWatch Agent.
- You can change the server name on the Virtual Server detail page of Samsung Cloud Platform Console.
- When changing the server name, only the information in Samsung Cloud Platform Console is changed, not the OS’s Hostname.
- ServiceWatch service integration provision
- You can monitor data through ServiceWatch service.
- Virtual Server feature addition
- IP, Public NAT IP, Private NAT IP configuration feature has been added.
- LLM Endpoint for using LLM is provided.
- You can select OS Image subscribed from Marketplace when creating Virtual Server.
- 2nd generation server type has been added.
- 2nd generation (s2) server type based on Intel 4th generation (Sapphire Rapids) Processor has been added. For details, refer to Virtual Server Server Type.
- Image sharing method between Accounts has been changed.
- You can share by creating a new qcow2 Image or Image for sharing.
- Virtual Server feature addition
- NAT configuration feature has been added in Virtual Server.
- OS Image addition provision
- Standard Image has been added. (Alma Linux 8.10, Alma Linux 9.4, Oracle Linux 8.10, Oracle Linux 9.4, RHEL 8.10, RHEL 9.4, Rocky Linux 8.10, Rocky Linux 9.4, Ubuntu 24.04)
- Image for Kubernetes has been added. You can create Kubernetes Engine using Image for Kubernetes.
- 2nd generation server type addition
- 2nd generation (h2) server type based on Intel 4th generation (Sapphire Rapids) Processor has been added. For details, refer to Virtual Server Server Type.
- Samsung Cloud Platform common function change
- Common CX changes, such as Account, IAM, Service Home, and tags, have been reflected.
- Virtual Server service official release.
- We have released a virtualization server that allows you to freely allocate and use as much as you need at the necessary time without purchasing infrastructure resources individually.
2 - Virtual Server Auto-Scaling
2.1 - Overview
Service Overview
Virtual Server Auto-Scaling is a service that automatically scales resources up or down according to demand. You can add or terminate the number of servers running the application according to predefined conditions or schedule.
Auto-Scaling Group uses a pre-created Launch Configuration as a pre-configuration template to create servers, and can adjust and manage the number of servers. It adjusts so that the number does not fall below the specified minimum number of servers or exceed the maximum number of servers.
If you register a schedule with Auto-Scaling Group, you can set the number of servers according to the predetermined schedule. If you register a policy, you can increase or decrease the number of servers based on predefined conditions.
Features
Easy and convenient computing environment configuration: Through the web-based Console, users can easily configure the required computing environment themselves via Self Service, from creating Launch Configurations to creating/modifying/deleting Auto-Scaling Groups.
Elastic Resource Usage: You can elastically use computing resources according to the service’s load and usage. Users can schedule resource usage for predictable specific time periods, and can adjust resource usage to prepare for temporary connections from an unspecified large number of users.
Availability Improvement: Virtual Server Auto-Scaling adjusts resources to match variable demand so that the traffic required by the user can always be processed. Through this, users can achieve improved application performance and availability.
Maximizing Cost Reduction Effect: By using resources only as needed according to demand fluctuations, unnecessary costs can be reduced. Through flexible resource usage according to traffic increases or decreases at specific times such as night, weekends, and month-end, the cost reduction effect can be maximized.
Service Architecture Diagram
Provided Features
Virtual Server Auto-Scaling provides the following features.
- Launch Configuration: It is a configuration template used to create a Virtual Server in an Auto-Scaling Group. When creating a Launch Configuration, you set information about the Virtual Server such as image, server type, key pair, block storage, etc.
- Server Count Adjustment: Provides several ways to adjust the number of servers. Using policies, you can add a Virtual Server when the load exceeds a threshold and release the Virtual Server when demand is low, maintaining application availability and reducing costs. You can add and release Virtual Servers according to a schedule, and you can also manually adjust the number of servers in an Auto-Scaling Group as needed.
- Load Balancer integration: You can use a Load Balancer to evenly distribute application traffic across Virtual Servers. Whenever a Virtual Server is added or removed, it is automatically registered with or deregistered from the Load Balancer.
- Network Connection: You can connect the Auto-Scaling Group to a general subnet with automatic IP allocation and assign a Public NAT IP. A local subnet connection is also provided for inter-server communication.
- Security Group applied: Security Group is a virtual logical firewall that controls inbound/outbound traffic generated on a Virtual Server. Inbound rules control incoming traffic to the Virtual Server, and Outbound rules control outgoing traffic from the Virtual Server.
- Monitoring: You can view monitoring information such as CPU, Memory, Disk of Virtual Servers created in the Auto-Scaling Group through the Cloud Monitoring service. Based on the monitoring information, you can use Auto-Scaling policies to set thresholds for load, and when the threshold is exceeded, you can add or remove servers.
Components
Virtual Server Auto-Scaling creates an Auto-Scaling Group from a Launch Configuration and monitors and manages its servers.
Launch Configuration
This is a configuration template used to create a Virtual Server in an Auto-Scaling Group. The main features are as follows.
- Image: Provides OS standard images and user-created custom images. Users can select and use them according to the service they want to configure.
- Keypair: Provides the Keypair method for secure OS access.
- Init script: The user can define a script to be executed when the Virtual Server starts.
- For more details, refer to Creating a Launch Configuration.
Auto-Scaling Group
An Auto-Scaling Group uses a Launch Configuration as a pre-configuration template for server creation, and adjusts and manages the number of servers. The main features are as follows.
- Launch Configuration: It is a configuration template used to create a Virtual Server in an Auto-Scaling Group.
- Server Count Settings: Virtual Server Auto-Scaling provides several ways to adjust the number of servers in an Auto-Scaling Group.
- Fixed Server Count Method: When creating an Auto-Scaling Group, this method keeps the default settings by maintaining the configured number of servers without any added schedules or policies. Refer to Create Auto-Scaling Group and set the Min, Desired, Max server counts.
- Manual Server Count Adjustment Method: In an Auto-Scaling Group, this method increases or decreases the number of servers by modifying the server count to the desired amount. You can choose whether to manually set the desired number of servers. Refer to Modify Server Count.
- Schedule Reservation Method: You can schedule daily, weekly, monthly, or one-time, and set the desired number of servers at a specified time. This is useful when you can predict when to reduce or increase the number of servers. If you use the schedule method, refer to Manage Schedule to add and manage schedules.
- Policy Method: You can use a policy as a way to dynamically adjust servers. When the set threshold based on monitoring metrics is exceeded, it adjusts the number of servers. At this time, you can choose one of three ways to adjust the server count. Increase or decrease the number of servers by a specified number, increase or decrease by a specified ratio, or fix the number of servers to an entered value. When servers start and terminate due to the policy, the monitoring metric CPU usage may temporarily exceed the threshold registered in the policy. However, because this is a temporary moment, a cooldown period is set to avoid judging it as an abnormal situation. If you want to use the policy method, refer to Manage Policies.
- Load Balancer: Whenever a Virtual Server is added or terminated, it automatically connects to and disconnects from the Load Balancer registered in the Auto-Scaling Group.
- For detailed information about the Auto-Scaling Group's Load Balancer, refer to Checking Auto-Scaling Group Details.
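The threshold, adjustment-type, and cooldown behavior described for the policy method can be sketched as a single evaluation step. This is an illustrative sketch only; the function and parameter names are assumptions, not the platform's actual policy engine:

```python
def decide_scale_out(current, metric, threshold, adjustment, mode,
                     minimum, maximum, last_action_time, cooldown, now):
    """Illustrative scale-out evaluation (hypothetical, not platform code).

    mode selects one of the three adjustment types described above:
    "add" (fixed number), "percent" (ratio), or "set" (fixed value).
    """
    # During the cooldown period, temporary metric spikes caused by
    # servers starting or terminating are ignored.
    if now - last_action_time < cooldown:
        return current
    if metric <= threshold:
        return current
    if mode == "add":            # increase by a fixed number of servers
        desired = current + adjustment
    elif mode == "percent":      # increase by a ratio of the current count
        desired = current + max(1, current * adjustment // 100)
    else:                        # "set": fix the count to the entered value
        desired = adjustment
    # The result is always clamped to the group's Min/Max bounds.
    return max(minimum, min(maximum, desired))
```

For example, with 4 servers, an 80% CPU threshold, and a policy that adds 2 servers, a reading of 90% after the cooldown has elapsed yields a target of 6 servers.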
Constraints
The constraints of Virtual Server Auto-Scaling are as follows.
| Category | Description |
|---|---|
| Number of Virtual Servers per Auto-Scaling Group | 50 or fewer |
| Number of policies per Auto-Scaling Group | 12 or fewer |
| Number of schedules per Auto-Scaling Group | 20 or fewer |
| Number of LB server groups and ports per Auto-Scaling Group | 3 or fewer |
- If the Image you are using is a discontinued standard Image, Scale-out will not work. If the Image you are using is a Custom Image, Scale-out continues to work even after that version is no longer provided.
- Before support for the Image you are using ends, we recommend updating the Launch Configuration to the latest version of the Image or a Custom Image.
- For detailed information about the OS Image provided by Virtual Server, see OS Image Provided Versions.
Prerequisite Services
The following services must be configured before creating this service. Refer to the guide for each service and prepare them in advance.
| Service Category | Service | Detailed Description |
|---|---|---|
| Networking | VPC | A service that provides an independent virtual network in a cloud environment |
| Networking | Security Group | Virtual firewall that controls server traffic |
2.1.1 - Monitoring Metrics
Virtual Server Auto-Scaling operates on Virtual Servers and provides both individual Virtual Server monitoring metrics and the monitoring metrics used by Cloud Monitoring-based policies.
Virtual Server monitoring metrics
The following table shows the monitoring metrics of Virtual Server that can be checked through Cloud Monitoring. For detailed usage of Cloud Monitoring, please refer to the Cloud Monitoring guide.
For Windows OS, memory-related metrics are not provided.
| Performance Item | Detailed Description | Unit |
|---|---|---|
| Memory Total [Basic] | Total memory bytes | bytes |
| Memory Used [Basic] | Currently used memory bytes | bytes |
| Memory Swap In [Basic] | Bytes swapped in from disk | bytes |
| Memory Swap Out [Basic] | Bytes swapped out to disk | bytes |
| Memory Free [Basic] | Unused memory bytes | bytes |
| Disk Read Bytes [Basic] | Bytes read from disk | bytes |
| Disk Read Requests [Basic] | Number of read requests | cnt |
| Disk Write Bytes [Basic] | Bytes written to disk | bytes |
| Disk Write Requests [Basic] | Number of write requests | cnt |
| CPU Usage [Basic] | 1-minute average system CPU usage rate | % |
| Instance State [Basic] | Instance status | state |
| Network In Bytes [Basic] | Received bytes | bytes |
| Network In Dropped [Basic] | Number of received packets dropped | cnt |
| Network In Packets [Basic] | Number of received packets | cnt |
| Network Out Bytes [Basic] | Transmitted bytes | bytes |
| Network Out Dropped [Basic] | Number of transmitted packets dropped | cnt |
| Network Out Packets [Basic] | Number of transmitted packets | cnt |
Monitoring metrics provided by Cloud Monitoring-based policies
The following table shows the monitoring metrics provided by the policy of Cloud Monitoring-based Auto-Scaling Group. For more information on policy settings, see Managing Policies.
| Performance Item | Detailed Description | Unit |
|---|---|---|
| CPU Usage [Basic] | 1-minute average system CPU usage rate | % |
| Memory Used [Basic] | Currently used memory bytes | bytes |
| Network In Bytes [Basic] | Received bytes | bytes |
| Network In Packets [Basic] | Number of received packets | cnt |
| Network Out Bytes [Basic] | Transmitted bytes | bytes |
| Network Out Packets [Basic] | Number of transmitted packets | cnt |
2.1.2 - ServiceWatch Metrics
Virtual Server Auto-Scaling is a service offered for Virtual Servers that provides individual Virtual Server monitoring metrics as well as monitoring metrics supplied by ServiceWatch-based policies.
Virtual Server Monitoring Metrics
The basic metrics provided by Virtual Server can be found in ServiceWatch > Virtual Server Basic Metrics.
ServiceWatch monitoring metrics provided by the Auto-Scaling Group policy
The table below shows the ServiceWatch monitoring metrics provided by the Auto-Scaling Group policy. For detailed information on Auto-Scaling Group policy configuration, see Managing Policies.
| Performance Item | Detailed Description | Unit |
|---|---|---|
| CPU Usage | CPU usage | Percent |
| Network In Bytes | Bytes received on the network interface | Bytes |
| Network In Packets | Number of packets received on the network interface | Count |
| Network Out Bytes | Bytes transmitted on the network interface | Bytes |
| Network Out Packets | Number of packets transmitted on the network interface | Count |
2.2 - How-to guides
Users can create an Auto-Scaling Group service by entering the required information and selecting detailed options through the Samsung Cloud Platform Console.
Creating an Auto-Scaling Group
You can create an Auto-Scaling Group service through the Samsung Cloud Platform Console.
To create an Auto-Scaling Group, follow these steps:
Click the All Services > Compute > Virtual Server menu. The Virtual Server Service Home page opens.
Click the Auto-Scaling Group menu. The Auto-Scaling Group List page opens.
On the Auto-Scaling Group List page, click the Create Auto-Scaling Group button. The Create Auto-Scaling Group page opens.
On the Create Auto-Scaling Group page, enter the information required to create the service.
- In the Launch Configuration section, select a Launch Configuration.
- You can create a new Launch Configuration by clicking the Create Launch Configuration button.
- In the Service Information Input section, enter or select the required information.
| Category | Required | Detailed Description |
|---|---|---|
| Auto-Scaling Group Name | Required | Auto-Scaling Group name. Manage servers of the same type and purpose as a group |
| Server Name | Required | Server name within the Auto-Scaling Group. An identifier to distinguish servers created within the Auto-Scaling Group, assigned automatically from the entered server name and a sequence number |
| Number of Servers | Required | Number of servers to create in the Auto-Scaling Group. Enter values between 0 and 20 (Min ≤ Desired ≤ Max). Min: the minimum number of servers the Auto-Scaling Group maintains. Desired: the target number of servers, which is also the initial number created with the group. Max: the maximum number of servers the group can maintain. After creating the Auto-Scaling Group, you can modify the settings using the Modify button; refer to Modifying the Number of Servers |
| Manual Desired Server Count Setting | Optional | Choose whether to change the Desired server count manually. After creating the Auto-Scaling Group, you can modify the setting using the Modify button; refer to Modifying Manual Desired Server Count Setting |
| Network Settings > Network Settings | Required | Network settings for the Auto-Scaling Group. Select the desired VPC and general Subnet; the IP can only be assigned automatically. If you select a local Subnet, choose the desired local Subnet; its IP is also assigned automatically |
| Network Settings > Security Group | Optional | Set a Security Group to allow necessary access. Without a Security Group, the default rule (Any/Deny) blocks all inbound and outbound traffic. Allow SSH traffic for Linux servers and RDP traffic for Windows servers. After creating the Auto-Scaling Group, you can modify the setting using the Modify button; refer to Setting Security Group |
| Load Balancer | Optional | Connect the Auto-Scaling Group to a Load Balancer and register its servers as members of an LB server group. LB server group: select an existing LB server group in the chosen VPC; groups using Weighted Round Robin or Weighted Least Connection load balancing cannot be selected. Port: enter a value between 1 and 65,534. Click the + button to add an LB server group (up to 3 LB server groups and ports can be added). Draining Timeout: if checked as used, set the time to wait before disconnecting a server from the Load Balancer so that remaining sessions can be cleaned up safely; not available when no Load Balancer is used; default 300 seconds, range 1 to 3,600 seconds. After creating the Auto-Scaling Group, the settings can be modified; refer to Using Load Balancer in Auto-Scaling Group |

Table. Auto-Scaling Group Service Information Input Items

- In the Scaling Policy Settings section, set the scaling policy.
- For more information on policy settings, refer to Adding a Policy.
| Category | Required | Detailed Description |
|---|---|---|
| Set Now | Optional | Set the scaling policy now. Click the Add Policy button to display the policy information input items |
| Set Later | Optional | Set the policy on the detailed information page after creating the Auto-Scaling Group |

Table. Auto-Scaling Group Scaling Policy Settings Items
- In the Notification Settings section, set the notification recipient and method.
- For more information on notification settings, refer to Adding a Notification.
| Category | Required | Detailed Description |
|---|---|---|
| Set Now | Optional | Set the notification recipient and method now. Click the Add Notification button to open the Add Notification popup window. Click the Modify button in the notification recipient list to change the notification information |
| Set Later | Optional | Set the notification recipient and method on the detailed information page after creating the Auto-Scaling Group |

Table. Auto-Scaling Group Notification Settings Items
- In the Additional Information Input section, enter or select the required information.
| Category | Required | Detailed Description |
|---|---|---|
| Tag | Optional | Add a tag. Up to 50 tags can be added per resource. Click the Add Tag button, then enter or select the Key and Value |

Table. Auto-Scaling Group Additional Information Input Items
In the Summary panel, review the created details and estimated billing amount, then click the Complete button.
- After creation is complete, you can find the created Auto-Scaling Group on the Auto-Scaling Group list page.
Checking Auto-Scaling Group Details
The Auto-Scaling Group service allows you to view and modify the overall resource list and detailed information. The Auto-Scaling Group details page consists of Details, Policy, Schedule, Virtual Server, Load Balancer, Tag, and Work History tabs.
To check the Auto-Scaling Group details, follow these steps:
- Click the All Services > Compute > Virtual Server menu. The Virtual Server Service Home page opens.
- Click the Auto-Scaling Group menu. The Auto-Scaling Group List page opens.
- On the Auto-Scaling Group List page, click the resource whose details you want to check. The Auto-Scaling Group Details page opens.
- The Auto-Scaling Group details page displays status information and additional feature information, and consists of Details, Policy, Schedule, Virtual Server, Load Balancer, Tag, and Work History tabs.
| Category | Detailed Description |
|---|---|
| Auto-Scaling Group Status | The status of the Auto-Scaling Group created by the user. Creating: creation in progress; In Service: serviceable state; Scale In: Scale In in progress; Scale Out: Scale Out in progress; Cool Down: cool-down wait in progress; Terminating: deletion in progress; Attach to LB: connecting to the Load Balancer; Detach from LB: detaching from the Load Balancer |
| Auto-Scaling Group Deletion | Button to delete the Auto-Scaling Group |

Table. Auto-Scaling Group Status Information and Additional Features
Details
On the Auto-Scaling Group Details page, you can check the detailed information of the selected resource and modify it if necessary.
| Category | Detailed Description |
|---|---|
| Service | Service name |
| Resource Type | Resource type |
| SRN | Unique resource ID in Samsung Cloud Platform |
| Resource Name | Resource name |
| Resource ID | Unique resource ID in the service |
| Creator | User who created the service |
| Creation Time | Time when the service was created |
| Modifier | User who modified the service information |
| Modification Time | Time when the service information was modified |
| Auto-Scaling Group Name | Auto-Scaling Group name |
| Launch Configuration Name | Launch Configuration name selected when creating the Auto-Scaling Group |
| Number of Servers | Current number of servers in the Auto-Scaling Group and the configured Min, Desired, and Max server counts |
| Manual Setting of Desired Server Number | Whether manual setting of the Desired server count is used |
| VPC | VPC information of the Auto-Scaling Group |
| General Subnet | General Subnet and NAT IP usage information of the Auto-Scaling Group |
| Local Subnet | Local Subnet information of the Auto-Scaling Group |
| Security Group | Security Group information of the Auto-Scaling Group |
Policy
On the Auto-Scaling Group Details page, you can check the policy list of the selected resource and add or manage policies if necessary.
| Category | Detailed Description |
|---|---|
| Category | Policy category |
| Policy Name | Policy name |
| Execution Condition | Condition for executing the policy |
| Execution Unit | Method of executing the policy |
| Cool-down | Waiting time (in seconds) after the policy starts or stops a server |
| More > Modify | Modify the policy information |
| More > Activate | Activate the policy |
| More > Deactivate | Deactivate the policy |
For more information on policy management and policy examples, see Managing Policies.
Schedule
On the Auto-Scaling Group Details page, you can check the schedule list of the selected resource and add or manage schedules if necessary.
| Category | Detailed Description |
|---|---|
| Name | Schedule name |
| Min | Minimum server number set in the schedule |
| Desired | Target server number set in the schedule |
| Max | Maximum server number set in the schedule |
| Period | Schedule execution period |
| Date/Day of the Week | Schedule execution date or day of the week |
| Execution Time | Schedule execution time |
| Time Zone | Schedule execution time zone |
| Status | Schedule status |
| More > Modify | Modify the schedule information |
| More > Activate | Activate the schedule |
| More > Deactivate | Deactivate the schedule |
For more information on schedule management, see Adding Schedule and Deleting Schedule.
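The schedule fields above (period, date/day of the week, execution time, status) determine when a schedule fires and applies its Min/Desired/Max counts. The sketch below is illustrative only: the dict layout and field names are assumptions modeled on the table, and time-zone handling is omitted.

```python
from datetime import datetime

def schedule_fires(schedule, now):
    """Return True if a schedule entry fires at `now` (hypothetical model)."""
    if schedule["status"] != "active":
        return False                      # deactivated schedules never fire
    if now.strftime("%H:%M") != schedule["execution_time"]:
        return False
    period = schedule["period"]
    if period == "daily":
        return True
    if period == "weekly":
        return now.strftime("%a") in schedule["days"]
    if period == "monthly":
        return now.day == schedule["day_of_month"]
    # one-time schedules match an exact date
    return now.date().isoformat() == schedule["date"]
```

For example, a weekly schedule set for 02:00 on weekends would fire on a Saturday at 02:00 but not on a Monday.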
Virtual Server
On the Auto-Scaling Group Details page, you can check the Virtual Server list of the selected resource.
| Category | Detailed Description |
|---|---|
| Server Name | Name of the server created in the Auto-Scaling Group |
| IP | IP assigned to the server |
| Creation Time | Date and time when the server was created |
| Status | Server status |
| Load Balancer Connection Status | Load Balancer connection status |
Load Balancer
On the Auto-Scaling Group Details page, you can check the Load Balancer list of the selected resource and add or manage Load Balancers if necessary.
| Category | Detailed Description |
|---|---|
| Draining Timeout | Draining Timeout usage |
| Load Balancer | Load Balancer usage |
| Load Balancer > Load Balancer Name | Name of the Load Balancer connected to the Auto-Scaling Group |
| Load Balancer > LB Server Group | Load Balancer's LB server group |
| Load Balancer > Port | Port registered as a member of the LB server group |
- The LB server group member information set in the Auto-Scaling Group can also be checked in the Connected Resources of the LB Server Group.
- If manual connection/disconnection between the server and Load Balancer is required, refer to Adding an LB Server Group Member.
Notification
On the Auto-Scaling Group Details page, you can check the notification recipient information and notification method for the selected resource.
| Category | Detailed Description |
|---|---|
| Notification Recipient | Name and email of the notification recipient |
| Server Creation | Whether to send a notification when a server creation event occurs |
| Server Termination | Whether to send a notification when a server termination event occurs |
| Policy Execution Condition | Whether to send a notification when the policy execution condition is met |
| Status | Notification activation status |
| More > Edit | Edit the notification information |
| More > Activate | Activate the notification |
| More > Deactivate | Deactivate the notification |
For more information on notification settings, refer to Managing Notifications.
Tag
On the Auto-Scaling Group Details page, you can check the tag information of the selected resource and add, modify, or delete tags.
| Category | Detailed Description |
|---|---|
| Tag List | List of tags attached to the resource |
Work History
On the Auto-Scaling Group Details page, you can check the work history of the selected resource.
| Category | Detailed Description |
|---|---|
| Work History List | Resource change history |
Managing Auto-Scaling Group Resources
If you need to manage the created Auto-Scaling Group, you can perform tasks on the Auto-Scaling Group Details page.
Modifying Launch Configuration
You can modify the Launch Configuration of the Auto-Scaling Group.
To modify the Launch Configuration of the Auto-Scaling Group, follow these steps:
Click All Services > Compute > Virtual Server. The Virtual Server Service Home page opens.
Click Auto-Scaling Group. The Auto-Scaling Group List page opens.
On the Auto-Scaling Group List page, click the resource for which you want to modify the Launch Configuration. The Auto-Scaling Group Details page opens.
Click the Modify button next to the Launch Configuration name. The Modify Launch Configuration popup window opens, where you can view the list of available Launch Configurations.
| Category | Detailed Description |
|---|---|
| Launch Configuration Name | Launch Configuration name |
| Image | Launch Configuration OS image |
| Server Type | Launch Configuration server type |
| Block Storage | Launch Configuration Block Storage settings |
| Auto-Scaling Group Count | Number of Auto-Scaling Groups to which the Launch Configuration is applied |
| Detailed View | Button to view detailed Launch Configuration information |

Table. Launch Configuration List Items

In the Modify Launch Configuration popup window, select the Launch Configuration you want to apply and click OK. The Launch Configuration Modification Notification popup window opens. Check the message and click OK.
Modifying Server Count
You can modify the server count of the Auto-Scaling Group.
To modify the server count of the Auto-Scaling Group, follow these steps:
- Click All Services > Compute > Virtual Server. The Virtual Server Service Home page opens.
- Click Auto-Scaling Group. The Auto-Scaling Group List page opens.
- On the Auto-Scaling Group List page, click the resource for which you want to modify the server count. The Auto-Scaling Group Details page opens.
- Click the Edit Server Count button. The Edit Server Count popup window opens.
- In the Edit Server Count popup window, enter the required items and click the Confirm button.
| Category | Required | Detailed Description |
|---|---|---|
| Server Count > Min | Required | Modify the minimum number of servers that the Auto-Scaling Group will maintain |
| Server Count > Desired | Required | Modify the target server count of the Auto-Scaling Group. If Manual Desired Server Count Setting is Not Used, the Desired server count cannot be modified here; refer to Modifying Desired Server Count Manual Setting |
| Server Count > Max | Required | Modify the maximum number of servers that the Auto-Scaling Group can maintain |

Table. Auto-Scaling Group Server Count Modification Items
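The relationship among the three counts (Min ≤ Desired ≤ Max, with the 0 lower bound stated at group creation) can be expressed as a simple check. This helper is hypothetical, for illustration only, and is not part of any platform SDK:

```python
def validate_server_counts(minimum, desired, maximum):
    """Enforce the Min <= Desired <= Max rule for an Auto-Scaling Group.

    Hypothetical helper; the exact upper limit depends on the group's
    configured constraints, so only the ordering and 0 floor are checked.
    """
    if not (0 <= minimum <= desired <= maximum):
        raise ValueError(
            f"counts must satisfy 0 <= Min({minimum}) <= "
            f"Desired({desired}) <= Max({maximum})")
    return True
```

A request such as Min=4, Desired=3, Max=5 would be rejected before any server count change is attempted.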
Canceling a Virtual Server Created in an Auto-Scaling Group
A Virtual Server created in an Auto-Scaling Group can be canceled by reducing the desired number of servers.
To cancel a Virtual Server created in an Auto-Scaling Group, follow these steps:
- Click All Services > Compute > Virtual Server. You will be taken to the Virtual Server’s Service Home page.
- Click Auto-Scaling Group. You will be taken to the Auto-Scaling Group List page.
- On the Auto-Scaling Group List page, click the resource you want to cancel. You will be taken to the Auto-Scaling Group Details page.
- Click the Edit button in the Server Count section. The Edit Server Count popup window will open.
- In the Edit Server Count popup window, reduce the Desired count and click the Confirm button. The Desired server count will be adjusted, and the Virtual Server will be canceled.
Modifying Desired Server Count Manual Setting
You can change the Desired server count manual setting of the Auto-Scaling Group.
To modify the Desired server count manual setting of the Auto-Scaling Group, follow these steps:
- Click the All Services > Compute > Virtual Server menu. The Virtual Server Service Home page opens.
- Click the Auto-Scaling Group menu. The Auto-Scaling Group List page opens.
- On the Auto-Scaling Group List page, click the resource for which you want to change the Desired server count manual setting. The Auto-Scaling Group Details page opens.
- Click the Edit button for the server count. The Desired Server Count Manual Setting popup window opens.
- In the Desired Server Count Manual Setting popup window, select whether to use it and click the Confirm button.
Setting Security Group
You can set the Security Group for the Auto-Scaling Group.
To set the Security Group for the Auto-Scaling Group, follow these steps:
Click the All Services > Compute > Virtual Server menu. The Virtual Server Service Home page opens.
Click the Auto-Scaling Group menu. The Auto-Scaling Group List page opens.
On the Auto-Scaling Group List page, click the resource for which you want to set the Security Group. The Auto-Scaling Group Details page opens.
Click the Edit button for the Security Group. The Security Group Modification popup window opens, where you can view the list of available Security Groups.
Classification Detailed Description Security Group Name Security Group name Table. Security Group List ItemsIn the Security Group Modification popup window, select the Security Group and click the Confirm button. The Security Group Modification Notification popup window opens. Check the message in the notification popup window and click the Confirm button.
Managing Additional Auto-Scaling Group Information
You can set which Load Balancer the Auto-Scaling Group uses and select its LB server groups. An Auto-Scaling Group that is currently using a Load Balancer can also be changed to not use one.
Modifying Load Balancer Draining Timeout
You can set the Load Balancer Draining Timeout for the Auto-Scaling Group.
Draining Timeout is the time to wait before disconnecting the server from the Load Balancer.
- You can set the Draining Timeout to safely clean up sessions, as there may be remaining sessions connected to the server.
- If the Load Balancer is Not Used, the Draining Timeout cannot be set.
- The default value is 300 seconds, and you can set it to a minimum of 1 second and a maximum of 3,600 seconds.
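The Draining Timeout rules above (requires a Load Balancer, 300-second default, 1 to 3,600-second range) can be sketched as a small helper. The function is hypothetical, not a platform API:

```python
def drain_wait_seconds(use_load_balancer, use_draining, timeout=None):
    """Return the seconds to wait before detaching a server from the LB.

    Hypothetical helper mirroring the rules described above.
    """
    if not use_load_balancer:
        # Draining Timeout cannot be set when no Load Balancer is used.
        raise ValueError("Draining Timeout requires a Load Balancer")
    if not use_draining:
        return 0  # detach immediately; open sessions may be cut off
    timeout = 300 if timeout is None else timeout  # default 300 seconds
    if not 1 <= timeout <= 3600:
        raise ValueError("Draining Timeout must be between 1 and 3600 seconds")
    return timeout
```

While this wait elapses during a detach, the Auto-Scaling Group stays in the Detach from LB state described earlier.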
To set the Load Balancer Draining Timeout for the Auto-Scaling Group, follow these steps:
- Click the All Services > Compute > Virtual Server menu. The Virtual Server Service Home page opens.
- Click the Auto-Scaling Group menu. The Auto-Scaling Group List page opens.
- On the Auto-Scaling Group List page, click the resource for which you want to set the Load Balancer Draining Timeout. The Auto-Scaling Group Details page opens.
- Click the Load Balancer tab. The Load Balancer list page opens.
- Click the Edit button for the Draining Timeout. The Draining Timeout Modification popup window opens.
- In the Draining Timeout Modification popup window, select whether to use the Draining Timeout and enter the Draining Timeout time (in seconds).
- In the Draining Timeout Modification popup window, check the input values and click the Confirm button. The Draining Timeout Modification Notification popup window opens. Check the message in the notification popup window and click the Confirm button.
Using Load Balancer
You can modify the Load Balancer settings for the Auto-Scaling Group.
- When a server in the Auto-Scaling Group is created, it is automatically registered as a member of the selected Load Balancer's LB server group, and when the server is terminated, it is removed from the LB server group.
- If Draining Timeout is in use, the server is removed from the LB server group after waiting for the Draining Timeout (in seconds).
- When the Load Balancer settings are modified, members are detached from the LB server group while the group waits in the Detach from LB state; during Scale In, members are removed while the group waits in the Scale In state.
To set the Load Balancer for the Auto-Scaling Group, follow these steps:
- Click the All Services > Compute > Virtual Server menu. The Virtual Server Service Home page opens.
- Click the Auto-Scaling Group menu. The Auto-Scaling Group List page opens.
- On the Auto-Scaling Group List page, click the resource for which you want to set the Load Balancer. The Auto-Scaling Group Details page opens.
- Click the Load Balancer tab. The Load Balancer list page opens.
- Click the Edit button for the Load Balancer. The Load Balancer Modification popup window opens.
- In the Load Balancer Modification popup window, select whether to use it. If you select Use, you can select the Load Balancer.
| Category | Detailed Description |
|---|---|
| LB Server Group | LB server group name. Select an LB server group created in the selected VPC. LB server groups using Weighted Round Robin or Weighted Least Connection load balancing cannot be selected |
| Port | LB server group port information. Enter the port required to register LB server group members, between 1 and 65,534 |

Table. Load Balancer List Items

- You can add an LB server group by clicking the + button (up to 3 can be added) and remove an added one by clicking the X button.
- Check the Load Balancer list and click the Confirm button. The Load Balancer Modification Notification popup window opens. Check the message in the notification popup window and click the Confirm button.

Caution
- Be cautious when detaching/attaching servers from Load Balancer, as it may affect the service.
- If Draining Timeout is in use, setting Load Balancer to not in use or removing some connected Load Balancers using the X button will not immediately detach the server. The server will be detached from Load Balancer after waiting for the Draining Timeout (seconds). At this time, Auto-Scaling Group will be in Detach from LB state.
- The LB server group member information set in Auto-Scaling Group can also be checked in LB server group connected resources.
- If manual connection/disconnection between server and Load Balancer is required, refer to Registering LB server group members.
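The input limits described for this screen (up to 3 LB server groups, ports between 1 and 65,534) can be checked with a small helper. This is an illustrative, hypothetical function, not console or SDK code:

```python
def validate_lb_targets(targets):
    """Validate (LB server group, port) pairs for an Auto-Scaling Group.

    Hypothetical helper mirroring the console limits described above.
    """
    if len(targets) > 3:
        raise ValueError("up to 3 LB server groups and ports can be added")
    for group, port in targets:
        if not 1 <= port <= 65534:
            raise ValueError(
                f"port {port} for {group} must be between 1 and 65534")
    return True
```

For example, registering a fourth LB server group, or a port of 0 or 65,535, would be rejected.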
Not using Load Balancer
You can change the Auto-Scaling Group's Load Balancer setting to Not Used. To do so, follow these steps:
- Be cautious when detaching/attaching servers from Load Balancer, as it may affect the service.
- If Draining Timeout is in use, setting Load Balancer to not in use or removing some connected Load Balancers using the X button will not immediately detach the server. The server will be detached from Load Balancer after waiting for the Draining Timeout (seconds). At this time, Auto-Scaling Group will be in Detach from LB state.
- Click the All Services > Compute > Virtual Server menu. The Virtual Server Service Home page opens.
- Click the Auto-Scaling Group menu. The Auto-Scaling Group List page opens.
- On the Auto-Scaling Group List page, click the resource for which you want to set the Load Balancer. The Auto-Scaling Group Details page opens.
- Click the Load Balancer tab. The Load Balancer list page opens.
- Click the Modify button for the Load Balancer. The Load Balancer Modification popup window opens.
- In the Load Balancer Modification popup window, deselect Use; the Load Balancer will no longer be used.
- Confirm that Use is deselected and click the OK button. The Load Balancer Modification Notification popup window opens. Check the message in the notification popup window and click the OK button.
Deleting Auto-Scaling Group
Deleting unused Auto-Scaling Groups can reduce operating costs. However, deleting an Auto-Scaling Group may immediately stop the service in operation, so you must consider the impact of service termination before proceeding with the deletion.
To delete an Auto-Scaling Group, follow the procedure below.
- Click All Services > Compute > Virtual Server menu. Move to the Service Home page of Virtual Server.
- Click Auto-Scaling Group menu. Move to the Auto-Scaling Group list page.
- Click the resource to delete in the Auto-Scaling Group list page. Move to the Auto-Scaling Group details page.
- Click the Delete Auto-Scaling Group button.
- After deletion is complete, check if the resource has been deleted in the Auto-Scaling Group list page.
2.2.1 - Launch Configuration
To create an Auto-Scaling Group, you need to create a Launch Configuration in advance.
Creating a Launch Configuration
You can create a Launch Configuration service on the Samsung Cloud Platform Console and use it.
To create a Launch Configuration, follow these steps:
Click All Services > Compute > Virtual Server menu. It moves to the Service Home page of Virtual Server.
Click the Launch Configuration menu. It moves to the Launch Configuration list page.
Click the Create Launch Configuration button on the Launch Configuration list page. It moves to the Create Launch Configuration page.
Select the required information in the Image and Version Selection section of the Create Launch Configuration page and click the Next button.
Note: The images that can be selected in the Launch Configuration are listed in Virtual Server OS Image Provided Version.
Enter the required information in the Service Information Input section of the Create Launch Configuration page.
**Launch Configuration Name** (Required): The name of the Launch Configuration
- A name to distinguish the Launch Configuration
**Service Type > Server Type** (Required): The server type of the Launch Configuration
- Standard: Standard specifications commonly used
- High Capacity: Server specifications with higher capacity than Standard
- Refer to Virtual Server Server Type for the available server types
**Block Storage** (Required): Block Storage settings according to the purpose of the Launch Configuration
- Basic OS: The area where the OS is installed and used
- The capacity is entered in Units, and the minimum capacity varies depending on the OS image type
- Alma Linux: Enter a value between 2 and 1,536
- Oracle Linux: Enter a value between 5 and 1,536
- RHEL: Enter a value between 2 and 1,536
- Rocky Linux: Enter a value between 2 and 1,536
- Ubuntu: Enter a value between 1 and 1,536
- Windows: Enter a value between 4 and 1,536
- SSD: High-performance general volume
- HDD: General volume
- SSD/HDD_KMS: Additional encrypted volume using Samsung Cloud Platform KMS (Key Management System) encryption key
- Encryption can only be applied when created, and cannot be changed after creation
- Using the SSD_KMS disk type may cause performance degradation
- The capacity is entered in Units, and the minimum capacity varies depending on the OS image type
- Add: Additional user space outside the OS area
- Select Use and enter the storage type and capacity
- Click the + button to add storage, and click the x button to delete (up to 25 can be added)
- Capacity is entered in Units, and enter a value between 1 and 1,536
- 1 Unit is 8GB, so 8 to 12,288GB is created
- SSD: High-performance general volume
- HDD: General volume
- SSD/HDD_KMS: Additional encrypted volume using Samsung Cloud Platform KMS (Key Management System) encryption key
- Encryption can only be applied when created, and cannot be changed after creation
- Using the SSD_KMS disk type may cause performance degradation
- SSD_MultiAttach: Volume that can be connected to two or more servers
- For more information on Block Storage types, refer to Creating Block Storage
**Keypair** (Required): The authentication method for the Launch Configuration
- Server authentication information used to access servers created by an Auto-Scaling Group with this Launch Configuration
- Create New: Create a new Keypair if needed
- Refer to Creating a Keypair for how to create a new Keypair
- Default access account list by OS
- Alma Linux: almalinux
- RHEL: cloud-user
- Rocky Linux: rocky
- Ubuntu: ubuntu
- Windows: sysadmin
Table. Launch Configuration Service Information Input Items
Enter the information in the Additional Information Input section of the Create Launch Configuration page and click the Next button.
**Init Script** (Optional): Script that runs when a server starts from the Launch Configuration
- Enter within 45,000 bytes
- The Init Script must be a batch script for Windows or a shell script or cloud-init for Linux, depending on the selected image.
**Tag** (Optional): Add tags
- Up to 50 tags can be added per resource
- Click the Add Tag button, enter the Key and Value, or select them
Table. Launch Configuration Additional Information Input Items
Check the input information and estimated cost on the Create Information Confirmation page, and click the Complete button.
- After creation is complete, check the created Launch Configuration on the Launch Configuration list page.
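The Init Script field above is limited to 45,000 bytes. Since the limit applies to bytes rather than characters, a simple pre-check before pasting a script into the console can be useful. The sketch below is purely illustrative; the helper name and sample script are not part of the platform:

```python
# Hypothetical pre-check for the Launch Configuration Init Script size limit.
# The 45,000-byte limit comes from the table above; the sample script is made up.

INIT_SCRIPT_MAX_BYTES = 45_000

def init_script_fits(script: str) -> bool:
    """Return True if the script fits within the console's byte limit.

    The limit applies to bytes, not characters, so multi-byte
    UTF-8 characters count for more than one each.
    """
    return len(script.encode("utf-8")) <= INIT_SCRIPT_MAX_BYTES

sample = "#!/bin/bash\necho 'server started' >> /var/log/init.log\n"
print(init_script_fits(sample))  # a short shell script easily fits
```

This matters mostly for scripts containing multi-byte text, where the byte count can be several times the character count.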
Checking Launch Configuration Details
The Launch Configuration service allows you to check the overall resource list and detailed information, and modify it. The Launch Configuration details page consists of Details, Tags, and Work History tabs.
To check the Launch Configuration details, follow these steps:
- Click All Services > Compute > Virtual Server menu. It moves to the Service Home page of Virtual Server.
- Click the Launch Configuration menu. It moves to the Launch Configuration list page.
- Click the resource to check the details on the Launch Configuration list page. It moves to the Launch Configuration details page.
- The top of the Launch Configuration details page displays status information and additional feature information, and consists of Details, Tags, and Work History tabs.
| Category | Description |
|---|---|
| Launch Configuration Status | The status of the Launch Configuration created by the user |
| Launch Configuration Deletion | Button to delete the Launch Configuration |
Details
You can check and modify the detailed information of the selected resource on the Launch Configuration list page.
| Category | Description |
|---|---|
| Service | Service category |
| Resource Type | Service name |
| SRN | Unique resource ID in Samsung Cloud Platform |
| Resource Name | Resource name |
| Resource ID | Unique resource ID in the service |
| Creator | The user who created the service |
| Creation Time | The time when the service was created |
| Modifier | The user who modified the service information |
| Modification Time | The time when the service information was modified |
| Launch Configuration Name | Launch Configuration name |
| Image | The image name selected when creating the Launch Configuration |
| Number of Auto-Scaling Groups | The number of Auto-Scaling Groups using the Launch Configuration |
| Server Type | The server type set in the Launch Configuration |
| Block Storage | Block Storage information set in the Launch Configuration |
| Keypair | Server authentication information set in the Launch Configuration |
| Init Script | Init Script set in the Launch Configuration |
Tags
You can check the tag information of the selected resource on the Launch Configuration list page, and add, change, or delete it.
| Category | Description |
|---|---|
| Tag List | Tag list |
Work History
You can check the work history of the selected resource on the Launch Configuration list page.
| Category | Description |
|---|---|
| Work History List | Resource change history |
Deleting a Launch Configuration
You can reduce operating costs by deleting unused Launch Configurations. However, deleting a Launch Configuration may immediately stop the operating service, so you should consider the impact of stopping the service before proceeding with the deletion.
To delete a Launch Configuration, follow these steps:
- Click All Services > Compute > Virtual Server menu. It moves to the Service Home page of Virtual Server.
- Click the Launch Configuration menu. It moves to the Launch Configuration list page.
- Click the resource to delete on the Launch Configuration list page. It moves to the Launch Configuration details page.
- Click the Delete Launch Configuration button.
- After deletion is complete, check that the resource has been deleted on the Launch Configuration list page.
2.2.2 - Managing Policies
The number of servers in an Auto-Scaling Group can be adjusted dynamically based on monitoring metrics. When a metric exceeds the set threshold, the number of servers is adjusted in one of three ways: increase or decrease the server count by a specified number, increase or decrease it by a specified ratio, or fix it to a specified value. When a server is started or terminated by a policy, a monitoring metric such as CPU usage may temporarily exceed the policy threshold; because this spike is momentary, a cooldown time is set so that it is not judged as an abnormal situation. You can add and manage policies for an Auto-Scaling Group created in the Samsung Cloud Platform Console.
Adding a Policy
You can add a policy to an Auto-Scaling Group. To add a policy to an Auto-Scaling Group, follow these steps:
Click the All Services > Compute > Virtual Server menu. Move to the Service Home page of Virtual Server.
Click the Auto-Scaling Group menu. Move to the Auto-Scaling Group List page.
On the Auto-Scaling Group List page, click the resource to view detailed information. Move to the Auto-Scaling Group Details page.
Click the Policy Tab. Move to the Policy Tab page.
Click the Add Policy button. The Add Policy popup window opens.
**Classification** (Required): Policy classification
- Scale In: Server reduction
- Scale Out: Server increase
**Policy Name** (Required): A name to distinguish the policy
**Execution Condition** (Required): Condition for executing the policy
- Statistic: Method of calculating the metric type
- Average: Average of servers in the Auto-Scaling Group
- Min: Minimum value among servers in the Auto-Scaling Group
- Max: Maximum value among servers in the Auto-Scaling Group
- Metric Type: CPU Usage, Memory Usage, Network In(bytes), Network Out(bytes), Network In(Packets), Network Out(Packets)
- Note: Memory usage policy is not available for Windows servers
- Operator: >=, >, <=, <
- Threshold: Threshold for the metric type
- Period: Continuous time to trigger the execution condition (N minutes)
**Execution Unit** (Required): Method of executing the policy
- Policy Type: Select the type of policy to execute.
- Increase or decrease the number of servers by a specified number: Increase or decrease the target value
- Increase or decrease the number of servers by a specified ratio: Increase or decrease the target value ratio
- Fix the number of servers to a specified value: Fix the target value
- Target Value: Number or ratio to execute the selected Policy Type
**Cooldown** (Required): Time to wait (in seconds) after a server is started or terminated by a policy
- The default value is 300 seconds, and it can be set between 60 seconds and 3,600 seconds.
Table. Add Policy Popup Items
Note: Policy > Cooldown Setting
- When a server is started or terminated by a policy, the Auto-Scaling Group waits for the set cooldown time. A monitoring metric such as CPU usage may temporarily exceed the policy threshold right after scaling, but because this spike is momentary and not a genuine condition for adjusting the number of servers, policies are not evaluated again until the cooldown time has passed.
Guide: Policy execution operates within the set minimum and maximum number of servers.
- Even if a policy would increase, decrease, or fix the number of servers beyond the minimum or maximum, the result stays within the configured minimum and maximum.
- Example: If the minimum number of servers is 3, even if the number of servers is fixed to 1, the number of servers will not decrease to 1, but will be maintained at the minimum number of servers, which is 3.
In the Add Policy popup window, enter the required values and click the Confirm button. The added policy can be checked in the Policy List.
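The cooldown and minimum/maximum behavior described in the notes above can be sketched roughly as follows. This is an illustrative sketch only, not the platform's actual implementation; all names and structure are assumptions:

```python
import time

class ScalingGroup:
    """Illustrative sketch of cooldown waiting and min/max clamping.

    Not the platform's implementation; names are assumptions
    made for this example.
    """

    def __init__(self, minimum, maximum, desired, cooldown_s=300):
        self.minimum, self.maximum = minimum, maximum
        self.desired = desired
        self.cooldown_s = cooldown_s
        self.last_scaled_at = float("-inf")

    def try_scale(self, target, now=None):
        now = time.monotonic() if now is None else now
        # During the cooldown window, policy results are ignored.
        if now - self.last_scaled_at < self.cooldown_s:
            return self.desired
        # The result is always clamped to the configured min/max.
        clamped = max(self.minimum, min(self.maximum, target))
        if clamped != self.desired:
            self.desired = clamped
            self.last_scaled_at = now
        return self.desired

g = ScalingGroup(minimum=3, maximum=10, desired=5)
print(g.try_scale(1, now=0.0))    # fix to 1 requested, clamped to minimum 3
print(g.try_scale(8, now=10.0))   # still cooling down: stays at 3
print(g.try_scale(8, now=400.0))  # cooldown over: scales to 8
```

The clamping step mirrors the Guide above: fixing the count to 1 when the minimum is 3 leaves the group at 3 servers.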
Policy Creation Example
The following is an explanation of the policy example. Refer to it when creating a policy.
| Classification | Execution Condition | Execution Unit | Cooldown |
|---|---|---|---|
| Scale Out | Average CPU Usage >= 60% for 1 minute | Increase the number of servers by a specified number, 1 server | 300 seconds |
- If the average CPU usage of the servers in the Auto-Scaling Group is 60% or higher for 1 minute, 1 server is added.
- When a server is added, the cooldown time is 300 seconds. During the cooldown time, no additional servers are added or terminated due to the policy.
- After the cooldown time ends, the policy execution condition is checked again.
| Classification | Execution Condition | Execution Unit | Cooldown |
|---|---|---|---|
| Scale In | Min CPU Usage <= 5% for 1 minute | Decrease the number of servers by a specified ratio, 50% | 300 seconds |
- If the minimum CPU usage of the servers in the Auto-Scaling Group is 5% or lower for 1 minute, 50% of the current number of servers are terminated.
- When a server is terminated, the cooldown time is 300 seconds. During the cooldown time, no additional servers are added or terminated due to the policy.
- After the cooldown time ends, the policy execution condition is checked again.
| Classification | Execution Condition | Execution Unit | Cooldown |
|---|---|---|---|
| Scale Out | Max CPU Usage >= 90% for 1 minute | Fix the number of servers to a specified value, 5 servers | 300 seconds |
- If the maximum CPU usage of the servers in the Auto-Scaling Group is 90% or higher for 1 minute, the number of servers is fixed to 5.
- During the server creation, the cooldown time is 300 seconds. During the cooldown time, no additional servers are added or terminated due to the policy.
- After the cooldown time ends, the policy execution condition is checked again.
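The three execution-unit types in the examples above reduce to a small calculation. The sketch below is illustrative; the function name is an assumption, and rounding behavior for the ratio case is also an assumption rather than documented behavior:

```python
import math

def apply_policy(current, policy_type, target, scale_out, minimum, maximum):
    """Illustrative calculation for the three policy types above.

    policy_type: 'count' (by a specified number), 'ratio' (by a
    specified ratio, in percent), or 'fixed' (fix to a value).
    Rounding for 'ratio' is an assumption made for this sketch.
    """
    if policy_type == "count":
        new = current + int(target) if scale_out else current - int(target)
    elif policy_type == "ratio":
        delta = math.ceil(current * target / 100)
        new = current + delta if scale_out else current - delta
    elif policy_type == "fixed":
        new = int(target)
    else:
        raise ValueError(f"unknown policy type: {policy_type}")
    return max(minimum, min(maximum, new))  # always clamped to min/max

# Scale Out by 1 server (first example table)
print(apply_policy(4, "count", 1, scale_out=True, minimum=1, maximum=10))    # 5
# Scale In by 50% (second example table): 4 servers -> 2 servers
print(apply_policy(4, "ratio", 50, scale_out=False, minimum=1, maximum=10))  # 2
# Fix to 1 server, but a minimum of 3 still applies (see Guide above)
print(apply_policy(2, "fixed", 1, scale_out=True, minimum=3, maximum=10))    # 3
```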
Modifying a Policy
You can modify a policy of an Auto-Scaling Group. To modify a policy of an Auto-Scaling Group, follow these steps:
Click the All Services > Compute > Virtual Server menu. It moves to the Service Home page of Virtual Server.
Click the Auto-Scaling Group menu. It moves to the Auto-Scaling Group list page.
In the Auto-Scaling Group list page, click on the resource to check the detailed information. It moves to the Auto-Scaling Group details page.
Click on the Policy tab. It moves to the Policy tab page.
Click on the More > Edit button of the policy to be modified. The Policy modification popup opens.
**Classification** (Required): Policy classification
- Scale In: Server count decrease
- Scale Out: Server count increase
**Policy Name** (Required): A name to distinguish the policy
**Execution Condition** (Required): Condition for executing the policy
- Statistic: Method of calculating Metric Type
- Average: Average of servers in Auto-Scaling Group
- Min: Minimum value among servers in Auto-Scaling Group
- Max: Maximum value among servers in Auto-Scaling Group
- Metric Type: CPU Usage, Memory Usage, Network In(bytes), Network Out(bytes), Network In(Packets), Network Out(Packets)
- Note: Memory usage policy cannot be set for Windows servers
- Operator: >=, >, <=, <
- Threshold: Threshold corresponding to Metric Type
- Period: Continuous time (N minutes) to trigger the execution condition
**Execution Unit** (Required): Method of executing the policy
- Policy Type: Select the type of policy to be executed.
- Increase or decrease the server count by a specified number: Increase or decrease the server count by the target value
- Increase or decrease the server count by a specified ratio: Increase or decrease the server count by the target value ratio
- Fix the server count to the input value: Fix the server count to the target value
- Target Value: The number or ratio of the selected Policy Type to be executed
**Cooldown** (Required): Waiting time (in seconds) when a server is started or terminated due to a policy
- The default value is 300 seconds, and it can be set between 60 seconds and 3,600 seconds.
Table. Policy Modification Popup Items
Enter the required values in the Policy modification popup window and click the Confirm button.
Policy Addition and Modification Restrictions
There are restrictions when adding or modifying policies, depending on the policy classification, execution condition, and execution condition range. Refer to the examples of restrictions below and add or modify policies accordingly.
Example 1 - Check for duplicate registration of policy classification and execution condition
Duplicate registration is not allowed when the policy classification (Scale Out or Scale In) and execution condition (Metric type) are the same.
| Policy Classification | Policy Name | Execution Condition (Statistic) | Execution Condition (Metric Type) | Execution Condition Range |
|---|---|---|---|---|
| Scale Out | ScaleOutPolicy | Average | CPU Usage | >= 60% |
If a policy is already registered as shown above, it is not possible to add or modify a policy with the same classification (Scale Out) and execution condition (Metric type = CPU Usage).
Example 2 - Check the execution condition range for the same execution condition (Metric type) and execution condition (Statistic)
When the policy distinction (Scale Out or Scale In) is different, the execution condition range (Comparison operator + Threshold) cannot be duplicated for the same execution condition (Metric type) and execution condition (Statistic).
| Policy Distinction | Policy Name | Execution Condition (Statistic) | Execution Condition (Metric type) | Execution Condition Range |
|---|---|---|---|---|
| Scale Out | ScaleOutPolicy | Average | CPU Usage | >= 60% |
If a policy is registered as above, the following policy cannot be added or modified: because a Scale Out policy is already registered for average CPU Usage >= 60%, a Scale In policy for average CPU Usage <= 60% cannot be registered, since the two execution condition ranges overlap at 60%.
| Policy Distinction | Policy Name | Execution Condition (Statistic) | Execution Condition (Metric type) | Execution Condition Range |
|---|---|---|---|---|
| Scale In | AddUpdatePolicy | Average | CPU Usage | <= 60% |
If a policy is already registered as shown above, it is not possible to add or modify a policy with the same execution condition (Metric type = CPU Usage) and execution condition (Statistic = Average), and an execution condition range that overlaps with the existing policy.
Example 3 - Check the execution condition range for the same execution condition (Metric type) and execution condition (Statistic)
When the policy distinction (Scale Out or Scale In) is different, the execution condition range (Comparison operator + Threshold) cannot be duplicated for the same execution condition (Metric type) and execution condition (Statistic).
| Policy Distinction | Policy Name | Execution Condition (Statistic) | Execution Condition (Metric type) | Execution Condition Range |
|---|---|---|---|---|
| Scale In | ScaleInPolicy | Average | CPU Usage | <= 10% |
If a policy is registered as above, the following policies cannot be added or modified: because a Scale In policy is already registered for average CPU Usage <= 10%, Scale Out policies with ranges such as < 60%, <= 60%, >= 10%, or > 9% cannot be registered, since those ranges overlap with the existing one.
| Policy Distinction | Policy Name | Execution Condition (Statistic) | Execution Condition (Metric type) | Execution Condition Range |
|---|---|---|---|---|
| Scale Out | AddUpdatePolicy1 | Average | CPU Usage | < 60% |
| Scale Out | AddUpdatePolicy2 | Average | CPU Usage | <= 60% |
| Scale Out | AddUpdatePolicy3 | Average | CPU Usage | >= 10% |
| Scale Out | AddUpdatePolicy4 | Average | CPU Usage | > 9% |
Example 4 - Registration is possible when the execution condition range does not overlap
When the policy distinction (Scale Out or Scale In) is different, it is possible to register even if the execution condition (Statistic) is different or the execution condition range (Comparison operator + Threshold) does not overlap for the same execution condition (Metric type).
| Policy Distinction | Policy Name | Execution Condition (Statistic) | Execution Condition (Metric type) | Execution Condition Range |
|---|---|---|---|---|
| Scale Out | ScaleOutPolicy | Average | CPU Usage | >= 60% |
In the case where a policy is registered as above, it is possible to add or modify a policy as follows. If the execution condition range does not overlap or the execution condition (Statistic) is different, registration is possible.
| Policy Distinction | Policy Name | Execution Condition (Statistic) | Execution Condition (Metric type) | Execution Condition Range |
|---|---|---|---|---|
| Scale In | AddUpdatePolicy1 | Average | CPU Usage | <= 10% |
| Scale In | AddUpdatePolicy2 | Min | CPU Usage | <= 60% |
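The duplicate-range restrictions in Examples 1 through 4 come down to checking whether two (operator, threshold) conditions can be satisfied by the same metric value. A rough sketch of such an overlap check (illustrative only; the function names are assumptions):

```python
def satisfies(x, op, threshold):
    """Evaluate a single execution-condition range such as '>= 60'."""
    return {">=": x >= threshold, ">": x > threshold,
            "<=": x <= threshold, "<": x < threshold}[op]

def ranges_overlap(cond_a, cond_b):
    """True if some metric value satisfies both conditions.

    Each condition is (operator, threshold) describing a half-line,
    so probing the two thresholds, their midpoint, and points just
    beyond them is enough to detect any overlap.
    """
    (op_a, ta), (op_b, tb) = cond_a, cond_b
    probes = {ta, tb, (ta + tb) / 2, ta - 1, ta + 1, tb - 1, tb + 1}
    return any(satisfies(x, op_a, ta) and satisfies(x, op_b, tb)
               for x in probes)

# Example 2: Scale Out '>= 60' blocks Scale In '<= 60' (they share 60)
print(ranges_overlap((">=", 60), ("<=", 60)))   # True
# Example 4: '>= 60' and '<= 10' share no value, so both may be registered
print(ranges_overlap((">=", 60), ("<=", 10)))   # False
```

As Example 4 notes, a differing Statistic (e.g., Average vs. Min) also permits registration regardless of range; this sketch covers only the range comparison.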
Deleting a Policy
It is possible to delete a policy from an Auto-Scaling Group. To delete a policy, follow the procedure below.
- Click the All Services > Compute > Virtual Server menu. Move to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. Move to the Auto-Scaling Group List page.
- On the Auto-Scaling Group List page, click the resource to check the detailed information. Move to the Auto-Scaling Group Details page.
- Click the Policy Tab. Move to the Policy Tab page.
- Select the policy to delete and click the Delete button. The Policy Delete Confirmation popup window opens.
- Confirm the Policy Delete Confirmation popup window and click the Confirm button.
2.2.3 - Managing Schedules
You can set the desired number of servers at a fixed time on a daily, weekly, monthly, or one-time schedule. This is useful when you can predict when the number of servers should increase or decrease.
Add schedule
You can add a schedule to the Auto-Scaling Group. To add a schedule to the Auto-Scaling Group, follow these steps.
- Click All Services > Compute > Virtual Server menu. It moves to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. It moves to the Auto-Scaling Group list page.
- On the Auto-Scaling Group list page, click the resource to check the detailed information. It moves to the Auto-Scaling Group details page.
- Click the Schedule Tab. It moves to the Schedule Tab page.
- Click the Add Schedule button. The Add Schedule popup window opens.
**Schedule Name** (Required): A name to distinguish the schedule
**Server count selection** (Required): Select which server count the schedule adjusts
- Min: The minimum number of servers that the Auto-Scaling Group will maintain
- Desired: The target number of servers within the Auto-Scaling Group
- Max: The maximum number of servers that the Auto-Scaling Group can maintain
**Enter the number of servers** (Required): Enter the value for the selected server count
- Min value: Enter a value between 0 and 50 (Min ≤ Desired ≤ Max)
- Desired value: Enter a value between 0 and 50 (Min ≤ Desired ≤ Max)
- Max value: Enter a value between 0 and 50 (Min ≤ Desired ≤ Max)
**Period** (Required): Schedule execution period
- Daily: You can set the start date, end date, or permanent setting for daily schedule execution, as well as the time and time zone
- Weekly: You can set start date and end date, permanent settings, and time and time zone settings. You can also select the day of the week for weekly schedule execution
- Monthly: You can set start date and end date, permanent settings, and time and time zone settings. You can also enter the date for monthly schedule execution
- Once: You can set time and time zone settings. You can also set the date for one-time schedule execution
**Start Date** (Optional): Schedule start date
- Cannot be set to a date before the current date. The default is the current date.
**End Date** (Optional): Schedule end date
- Cannot be set to a date before the current date. The default is the current date + 7 days.
**Permanent** (Optional): If set, the schedule end date becomes 9999-12-31
**Time** (Required): Schedule execution time
- Can be set in 30-minute increments. A time earlier than the current date and time cannot be set
**Time Zone** (Required): Time zone for the schedule execution time (e.g., Asia/Seoul (GMT +09:00))
**Day of the week** (Required): If the period is Weekly, select the day of the week to run the schedule
**Date** (Required):
- If the period is Monthly, enter the date(s) to run the schedule
- Enter one or more values between -31 and 31, excluding 0 (example: 3,4,5)
- If the period is Once, set the date to run the schedule
- Cannot be set to a date before the current date. The default is the current date.
Table. Schedule Add Popup Items
- In the Add Schedule popup window, enter the required values and click the OK button.
- Check the message in the Add Schedule Confirmation popup window, then click the Confirm button.
If you select monthly for the schedule cycle, you must enter the schedule execution date, which is Date. Please refer to the following contents to register the schedule.
- If you enter a number greater than 0, it means the date of the month.
- Example: If you enter 1, it will be August 1, September 1, …, December 1
- If you enter a number less than 0, it will be calculated from the last day of the month.
- Entering -1 means the last day of the month.
- Example: August 31, September 30, …, December 31
- If -2 is entered, it means the day before the last day of the month.
- Example: August 30, September 29, …, December 30
- Since the last day of the month varies (31st, 30th, 29th, or 28th), negative numbers that count back from the last day are used, as shown above, to handle schedules that must run on or near the last day of each month.
- When the schedule is executed, if the minimum number of servers set in the schedule is greater than the desired number of servers, or the maximum number of servers is less than the desired number of servers, the desired number of servers is also modified.
- If there are schedules with overlapping execution times, they may not run normally. Please try to avoid overlapping execution times if possible.
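The positive/negative Date rules above can be expressed as a small date calculation. The sketch below is illustrative; the function name is an assumption:

```python
import calendar
from datetime import date

def resolve_run_day(year, month, date_value):
    """Resolve a monthly-schedule Date value to a concrete date.

    Positive values mean the day of the month; negative values count
    back from the last day (-1 = last day), as described above.
    """
    if date_value == 0 or not -31 <= date_value <= 31:
        raise ValueError("enter a value between -31 and 31, excluding 0")
    last = calendar.monthrange(year, month)[1]  # number of days in the month
    day = date_value if date_value > 0 else last + 1 + date_value
    return date(year, month, day)

print(resolve_run_day(2024, 9, 1))    # 2024-09-01
print(resolve_run_day(2024, 9, -1))   # 2024-09-30 (last day of September)
print(resolve_run_day(2024, 12, -2))  # 2024-12-30 (day before the last day)
```

Because the last day is computed per month, -1 resolves to the 31st, 30th, 29th, or 28th as appropriate.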
Modify Schedule
You can modify the schedule of the Auto-Scaling Group. To modify the schedule of the Auto-Scaling Group, follow these steps.
Click the All Services > Compute > Virtual Server menu. It moves to the Service Home page of Virtual Server.
Click the Auto-Scaling Group menu. It moves to the Auto-Scaling Group list page.
On the Auto-Scaling Group list page, click the resource to check the detailed information. It moves to the Auto-Scaling Group details page.
Click the Schedule Tab. It moves to the Schedule Tab page.
Click the More > Edit button of the schedule you want to modify. The Edit Schedule popup window will open.
**Schedule Name** (Required): A name to distinguish the schedule
**Server count selection** (Required): Select which server count the schedule adjusts
- Min: The minimum number of servers that the Auto-Scaling Group will maintain
- Desired: The target number of servers within the Auto-Scaling Group
- Max: The maximum number of servers that the Auto-Scaling Group can maintain
**Enter the number of servers** (Required): Enter the value for the selected server count
- Min value: Enter a value between 0 and 50 (Min ≤ Desired ≤ Max)
- Desired value: Enter a value between 0 and 50 (Min ≤ Desired ≤ Max)
- Max value: Enter a value between 0 and 50 (Min ≤ Desired ≤ Max)
**Period** (Required): Schedule execution period
- Daily: You can set the start date, end date, or permanent setting for daily schedule execution, as well as the time and time zone
- Weekly: You can set start date and end date, permanent settings, and time and time zone. You can also select the day of the week for weekly schedule execution
- Monthly: You can set start date and end date, permanent settings, and time and time zone. You can also enter the date for monthly schedule execution
- Once: You can set time and time zone. You can also set the date for one-time schedule execution
**Start Date** (Optional): Schedule start date
- Cannot be set to a date before the current date. The default is the current date.
**End Date** (Optional): Schedule end date
- Cannot be set to a date before the current date. The default is the current date + 7 days.
**Permanent** (Optional): If set, the schedule end date becomes 9999-12-31
**Time** (Required): Schedule execution time
- Can be set in 30-minute increments. A time earlier than the current date and time cannot be set
**Time Zone** (Required): Time zone for the schedule execution time (e.g., Asia/Seoul (GMT +09:00))
**Day of the week** (Required): If the period is Weekly, select the day of the week to run the schedule
**Date** (Required):
- If the period is Monthly, enter the date(s) to run the schedule
- Enter one or more values between -31 and 31, excluding 0 (example: 3,4,5)
- If the period is Once, set the date to run the schedule
- Cannot be set to a date before the current date. The default is the current date.
Table. Schedule Modification Popup Items
In the Modify Schedule popup window, enter the required values and click the Confirm button.
Check the message in the Schedule Modification Confirmation popup window and click the Confirm button.
Delete Schedule
You can delete the schedule of the Auto-Scaling Group. To delete the schedule of the Auto-Scaling Group, follow the next procedure.
- Click All Services > Compute > Virtual Server menu. It moves to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. It moves to the Auto-Scaling Group list page.
- On the Auto-Scaling Group list page, click the resource to check the detailed information. It moves to the Auto-Scaling Group details page.
- Click the Schedule Tab. It moves to the Schedule Tab page.
- Select the schedule to be deleted and click the Delete button. The Schedule Deletion Confirmation popup window will open.
- Check the Schedule Deletion Confirmation popup window and click the Confirm button.
2.2.4 - Managing Notifications
You can specify the notification recipient to send a notification message via E-mail or SMS for a specific situation.
- Notification method (E-mail or SMS) can be set by selecting Notification target as Service > Virtual Server Auto-Scaling on the Notification settings page.
- For more information about modifying alert settings, see Modifying Alert Settings.
Add Alert
You can add notifications to the Auto-Scaling Group. To add notifications to the Auto-Scaling Group, follow these steps.
- Click the All Services > Compute > Virtual Server menu. It moves to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. It moves to the Auto-Scaling Group list page.
- Click the resource to add notification information on the Auto-Scaling Group list page. It moves to the Auto-Scaling Group details page.
- Click the Notification Tab. You will be taken to the Notification Tab page.
- Click the Add Notification button. The Add Notification popup window opens.
- In the Add Notification popup window, enter the required values and click the Confirm button.
| Category | Detailed Description |
|---|---|
| Notification Point | Point at which an Auto-Scaling Group notification is sent: Server creation, Server termination, Server creation failure, Server termination failure, When policy execution conditions are met. Multiple selections are possible. |
| Notification Recipient | User who receives the notification. Click the Add Notification Recipient button to select a user. Only Samsung Cloud Platform users can be selected as recipients. |

Table. Notification Items
- Check the message in the Add Notification Confirmation popup window, then click the Confirm button.
Modify Notification
You can modify the notification information of the Auto-Scaling Group. To modify notification information, follow the procedure below.
- Click the All Services > Compute > Virtual Server menu. You will be taken to the Virtual Server Service Home page.
- Click the Auto-Scaling Group menu. You will be taken to the Auto-Scaling Group list page.
- On the Auto-Scaling Group list page, click the resource whose notification information you want to modify. You will be taken to the Auto-Scaling Group details page.
- Click the Notification tab. You will be taken to the Notification tab page.
- In the notification list, click the More > Edit button for the notification you want to modify. The Edit Notification popup window opens.
- In the Edit Notification popup window, modify the notification information and click the Confirm button.
| Category | Detailed Description |
|---|---|
| Notification Point | Point at which an Auto-Scaling Group notification is sent: Server creation, Server termination, Server creation failure, Server termination failure, When policy execution conditions are met. Multiple selections are possible. |

Table. Notification Modification Items
- Check the message in the Notification Modification Confirmation popup window, then click the Confirm button.
Delete Notification
You can delete a notification from the Auto-Scaling Group. To delete a notification, follow the procedure below.
- Click the All Services > Compute > Virtual Server menu. You will be taken to the Virtual Server Service Home page.
- Click the Auto-Scaling Group menu. You will be taken to the Auto-Scaling Group list page.
- On the Auto-Scaling Group list page, click the resource whose notification you want to delete. You will be taken to the Auto-Scaling Group details page.
- Click the Notification tab. You will be taken to the Notification tab page.
- Select the notification to delete from the notification list, then click the Delete button. The Delete Notification Confirmation popup window opens.
- Check the message in the Delete Notification Confirmation popup window and click the Confirm button.
2.3 - API Reference
2.4 - CLI Reference
2.5 - Release Note
Virtual Server Auto-Scaling
- Added notification feature to Virtual Server Auto-Scaling.
- You can add notification settings in the Auto-Scaling Group creation or detail screen.
- You can set the scaling policy when creating an Auto-Scaling Group.
- Added Metric Type of Auto-Scaling Group policy.
- Added: Memory Usage, Network In(bytes), Network Out(bytes), Network In(Packets), Network Out(Packets)
- You can set the Draining Timeout when connecting to the Load Balancer.
- In an Auto-Scaling Group, up to 50 Virtual Server instances can be connected, and up to 3 LB server groups and ports can be connected.
- Virtual Server Auto-Scaling feature change
- Released in conjunction with the Load Balancer service launched in February 2025.
- NAT setting feature has been added to Auto-Scaling Group.
- Samsung Cloud Platform common feature changes
- Common CX changes were applied to Account, IAM, Service Home, tags, and more.
- Virtual Server Auto-Scaling creates and manages Auto-Scaling Group through Launch Configuration and checks and manages the server.
- It provides a schedule method that can set the desired number of servers at a fixed time and a policy method that adjusts the number of servers based on CPU usage rate.
3 - GPU Server
3.1 - Overview
Service Overview
GPU Server is a virtualized computing service that allows you to freely allocate and use as much infrastructure resources provided by the server such as CPU, GPU, and memory as needed at the desired time without having to purchase them individually. It is suitable for tasks that require fast computing speed such as AI model experimentation, prediction, and inference in a cloud environment, and allows you to flexibly select and use resources with optimized performance according to task type and scale. GPU Server provides the following features:
Provided Features
- GPU Server Management: Users can directly manage creation, deletion, and changes as a Self Service from GPU Server provisioning to monitoring and billing through a web-based Console.
- Provisioning by GPU Quantity: You can configure a virtual server by freely selecting the quantity of H100/A100 GPUs according to project purpose and scale.
- High Performance GPU Provision: Provides high-performance GPU servers at the physical server level using the Pass-through method.
- Storage Connection: Provides additional connected storage besides OS disks. You can connect and use Block Storage, File Storage, and Object Storage.
- Strong Security Application: Protects servers safely by controlling Inbound/Outbound traffic exchanged with external internet or other VPCs (Virtual Private Cloud) through the Security Group service.
- Monitoring: You can check monitoring information such as CPU, Memory, Disk, and GPU status corresponding to computing resources through the Cloud Monitoring service.
- Network Setting Management: The server’s subnet/IP can easily be changed from the values set at initial creation. Also provides management functionality that lets you enable or terminate NAT IP use as needed.
- Key Pair Method: Provides a Key Pair method instead of ID/PW access for secure OS access.
- Image Management: You can create and manage Custom Images and provides sharing functionality between projects.
- ServiceWatch Service Integration: You can monitor data through the ServiceWatch service.
Components
GPU Server provides GPU, NVSwitch, and NVLink on top of virtualized computing resources.
- NVSwitch can be activated and used only for instance types that allocate 8 GPUs to a single GPU Server.
Specifications by GPU Type
A GPU (Graphics Processing Unit) performs the calculations needed to render the images that make up the computer screen. Because it is specialized for parallel processing, it can process large amounts of data quickly, making it well suited to large-scale parallel workloads such as artificial intelligence (AI) and data analysis.
The following are the specifications of GPU types provided by the GPU Server service.
| Item | A100 Type | H100 Type |
|---|---|---|
| Service Provision Method | Pass-through | Pass-through |
| GPU Architecture | NVIDIA Ampere | NVIDIA Hopper |
| GPU Memory | 80 GB | 80 GB |
| GPU Transistors | 54 billion 7N TSMC | 80 billion 4N TSMC |
| FP16 Tensor Core (Dense) | 312 TFLOPs | 989 TFLOPs |
| FP8 Tensor Core (Dense) | Not supported | 1,979 TFLOPs |
| FP4 Tensor Core (Dense) | Not supported | Not supported |
| GPU Memory Bandwidth | 2,039 GB/s HBM2e | 3,352 GB/s HBM3 |
| NVLink Performance | NVLink 3 | NVLink 4 |
| NVLink Signaling Rate | 25 GB/s (x12) | 25 GB/s (x18) |
| NVSwitch GPU-to-GPU Bandwidth | 600 GB/s | 900 GB/s |
| Total NVSwitch Aggregate Bandwidth | 4.8 TB/s | 7.2 TB/s |
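The two NVSwitch rows in the table are consistent with each other: the aggregate figure is the per-GPU GPU-to-GPU bandwidth multiplied by the eight GPUs of a fully populated server. A quick shell check using the numbers from the table above:

```shell
# Aggregate NVSwitch bandwidth = per-GPU GPU-to-GPU bandwidth x 8 GPUs
a100_total=$((600 * 8))   # GB/s
h100_total=$((900 * 8))   # GB/s
echo "A100: ${a100_total} GB/s (= 4.8 TB/s)"
echo "H100: ${h100_total} GB/s (= 7.2 TB/s)"
```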
Server Type
The server types provided by GPU Server are as follows. For a detailed description of the server types provided by GPU Server, see GPU Server Server Types.
| Category | Server Type | CPU vCore | Memory (GB) | GPU Quantity |
|---|---|---|---|---|
| GPU-A100-1 | g1v16a1 | 16 | 234 | 1 |
| GPU-A100-1 | g1v32a2 | 32 | 468 | 2 |
| GPU-A100-1 | g1v64a4 | 64 | 936 | 4 |
| GPU-A100-1 | g1v128a8 | 128 | 1872 | 8 |
| GPU-H100-2 | g2v12h1 | 12 | 234 | 1 |
| GPU-H100-2 | g2v24h2 | 24 | 468 | 2 |
| GPU-H100-2 | g2v48h4 | 48 | 936 | 4 |
| GPU-H100-2 | g2v96h8 | 96 | 1872 | 8 |
OS and GPU Driver Version
The operating systems (OS) supported by GPU Server are as follows:
| OS | OS Version | GPU Driver Version |
|---|---|---|
| Ubuntu | 22.04 | 535.183.06 |
| Ubuntu | 24.04 | 570.195.03 |
| RHEL | 8.10 | 535.183.06 |
Prerequisite Services
The following services must be prepared before creating this service. Refer to the user guide for each service in advance.
| Service Category | Service | Description |
|---|---|---|
| Networking | VPC | Service that provides independent virtual networks in cloud environment |
| Networking | Security Group | Virtual firewall that controls server traffic |
3.1.1 - Server Type
GPU Server Server Type
GPU Server is classified according to the GPU Type provided, and the GPU used in the GPU Server is determined by the server type selected when creating the GPU Server. Please select the server type according to the specifications of the application you want to run on the GPU Server.
The server types supported by the GPU Server are as follows.
GPU-H100-2 g2v12h1
| Category | Example | Detailed Description |
|---|---|---|
| Server Type | GPU-H100-2 | Provided server type classification |
| Server specifications | g2 | Provided server type classification and generation |
| Server specifications | v12 | Number of vCores |
| Server specifications | h1 | GPU type and quantity |
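The naming scheme in the table can be read off mechanically: generation prefix, then `v` plus the vCore count, then the GPU type letter and quantity. A small shell sketch (the variable names are illustrative, not part of the platform):

```shell
# Split a server-type name like "g2v12h1" into its parts following the
# naming scheme above: generation, vCore count, GPU type and quantity.
name="g2v12h1"
gen="${name%%v*}"             # "g2"  : server type and generation
rest="${name#*v}"             # "12h1"
vcores="${rest%%[a-z]*}"      # "12"  : number of vCores
gpu="${rest#"$vcores"}"       # "h1"  : GPU type (h = H100) and quantity
echo "generation=$gen vcores=$vcores gpu=$gpu"
```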
g1 server type
The g1 server type is a GPU Server that uses NVIDIA A100 Tensor Core GPU, suitable for high-performance applications.
- Provides up to 8 NVIDIA A100 Tensor Core GPUs
- Equipped with 6,912 CUDA cores and 432 Tensor cores per GPU
- Supports up to 128 vCPUs and 1,920 GB of memory
- Maximum 40 Gbps networking speed
- 600 GB/s peer-to-peer GPU communication via NVIDIA NVSwitch
| Category | Server Type | GPU | CPU | Memory | GPU Memory | Network Bandwidth |
|---|---|---|---|---|---|---|
| GPU-A100-1 | g1v16a1 | 1 | 16 vCore | 234 GB | 80 GB | up to 20 Gbps |
| GPU-A100-1 | g1v32a2 | 2 | 32 vCore | 468 GB | 160 GB | up to 20 Gbps |
| GPU-A100-1 | g1v64a4 | 4 | 64 vCore | 936 GB | 320 GB | up to 40 Gbps |
| GPU-A100-1 | g1v128a8 | 8 | 128 vCore | 1872 GB | 640 GB | up to 40 Gbps |
g2 server type
The g2 server type is a GPU Server that uses NVIDIA H100 Tensor Core GPU, suitable for high-performance applications.
- Up to 8 NVIDIA H100 Tensor Core GPUs provided
- Equipped with 16,896 CUDA cores and 528 Tensor cores per GPU
- Supports up to 96 vCPUs and 1,920 GB of memory
- Maximum networking speed of 40 Gbps
- 900 GB/s peer-to-peer GPU communication via NVIDIA NVSwitch
| Category | Server Type | GPU | CPU | Memory | GPU Memory | Network Bandwidth |
|---|---|---|---|---|---|---|
| GPU-H100-2 | g2v12h1 | 1 | 12 vCore | 234 GB | 80 GB | up to 20 Gbps |
| GPU-H100-2 | g2v24h2 | 2 | 24 vCore | 468 GB | 160 GB | up to 20 Gbps |
| GPU-H100-2 | g2v48h4 | 4 | 48 vCore | 936 GB | 320 GB | up to 40 Gbps |
| GPU-H100-2 | g2v96h8 | 8 | 96 vCore | 1872 GB | 640 GB | up to 40 Gbps |
3.1.2 - Monitoring Metrics
GPU Server Monitoring Metrics
The following table shows the monitoring metrics of the GPU Server that can be checked through Cloud Monitoring.
Basic monitoring metrics are provided even without installing an Agent; see Table. GPU Server Monitoring Metrics (Basic) below. Additional metrics that can be collected by installing an Agent are listed in Table. GPU Server Additional Monitoring Metrics (Agent Installation Required) below.
For detailed Cloud Monitoring usage, please refer to the Cloud Monitoring guide.
| Performance Item Name | Description | Unit |
|---|---|---|
| Memory Total [Basic] | Total available memory in bytes | bytes |
| Memory Used [Basic] | Currently used memory in bytes | bytes |
| Memory Swap In [Basic] | Memory swapped in, in bytes | bytes |
| Memory Swap Out [Basic] | Memory swapped out, in bytes | bytes |
| Memory Free [Basic] | Unused memory in bytes | bytes |
| Disk Read Bytes [Basic] | Read bytes | bytes |
| Disk Read Requests [Basic] | Number of read requests | cnt |
| Disk Write Bytes [Basic] | Written bytes | bytes |
| Disk Write Requests [Basic] | Number of write requests | cnt |
| CPU Usage [Basic] | Average system CPU usage over 1 minute | % |
| Instance State [Basic] | Instance state | state |
| Network In Bytes [Basic] | Received bytes | bytes |
| Network In Dropped [Basic] | Dropped received packets | cnt |
| Network In Packets [Basic] | Received packets | cnt |
| Network Out Bytes [Basic] | Sent bytes | bytes |
| Network Out Dropped [Basic] | Dropped sent packets | cnt |
| Network Out Packets [Basic] | Sent packets | cnt |

Table. GPU Server Monitoring Metrics (Basic)

| Performance Item Name | Description | Unit |
|---|---|---|
| GPU Count | Number of GPUs | cnt |
| GPU Memory Usage | GPU memory usage rate | % |
| GPU Memory Used | Used GPU memory | MB |
| GPU Temperature | GPU temperature | ℃ |
| GPU Usage | GPU utilization | % |
| GPU Usage [Avg] | Average GPU usage rate | % |
| GPU Power Cap | Maximum power capacity of the GPU | W |
| GPU Power Usage | Current power usage of the GPU | W |
| GPU Memory Usage [Avg] | Average GPU memory usage rate | % |
| GPU Count in use | Number of GPUs in use by jobs on the node | cnt |
| Execution Status for nvidia-smi | Execution result of the nvidia-smi command | status |
| Core Usage [IO Wait] | CPU time spent in IO wait state | % |
| Core Usage [System] | CPU time spent in system space | % |
| Core Usage [User] | CPU time spent in user space | % |
| CPU Cores | Number of CPU cores on the host | cnt |
| CPU Usage [Active] | CPU time used, excluding idle and IO wait states | % |
| CPU Usage [Idle] | CPU time spent in idle state | % |
| CPU Usage [IO Wait] | CPU time spent in IO wait state | % |
| CPU Usage [System] | CPU time used by the kernel | % |
| CPU Usage [User] | CPU time used by user space | % |
| CPU Usage/Core [Active] | CPU time used per core, excluding idle and IO wait states | % |
| CPU Usage/Core [Idle] | CPU time spent in idle state per core | % |
| CPU Usage/Core [IO Wait] | CPU time spent in IO wait state per core | % |
| CPU Usage/Core [System] | CPU time used by the kernel per core | % |
| CPU Usage/Core [User] | CPU time used by user space per core | % |
| Disk CPU Usage [IO Request] | CPU time spent on IO requests | % |
| Disk Queue Size [Avg] | Average queue length of requests | num |
| Disk Read Bytes | Bytes read from the device per second | bytes |
| Disk Read Bytes [Delta Avg] | Average delta of bytes read from the device | bytes |
| Disk Read Bytes [Delta Max] | Maximum delta of bytes read from the device | bytes |
| Disk Read Bytes [Delta Min] | Minimum delta of bytes read from the device | bytes |
| Disk Read Bytes [Delta Sum] | Sum of delta of bytes read from the device | bytes |
| Disk Read Bytes [Delta] | Delta of bytes read from the device | bytes |
| Disk Read Bytes [Success] | Total bytes successfully read | bytes |
| Disk Read Requests | Number of read requests to the device per second | cnt |
| Disk Read Requests [Delta Avg] | Average delta of read requests to the device | cnt |
| Disk Read Requests [Delta Max] | Maximum delta of read requests to the device | cnt |
| Disk Read Requests [Delta Min] | Minimum delta of read requests to the device | cnt |
| Disk Read Requests [Delta Sum] | Sum of delta of read requests to the device | cnt |
| Disk Read Requests [Success Delta] | Delta of successful read requests to the device | cnt |
| Disk Read Requests [Success] | Total successful read requests | cnt |
| Disk Request Size [Avg] | Average size of requests to the device | num |
| Disk Service Time [Avg] | Average service time of requests to the device | ms |
| Disk Wait Time [Avg] | Average wait time of requests to the device | ms |
| Disk Wait Time [Read] | Average read wait time of the device | ms |
| Disk Wait Time [Write] | Average write wait time of the device | ms |
| Disk Write Bytes [Delta Avg] | Average delta of bytes written to the device | bytes |
| Disk Write Bytes [Delta Max] | Maximum delta of bytes written to the device | bytes |
| Disk Write Bytes [Delta Min] | Minimum delta of bytes written to the device | bytes |
| Disk Write Bytes [Delta Sum] | Sum of delta of bytes written to the device | bytes |
| Disk Write Bytes [Delta] | Delta of bytes written to the device | bytes |
| Disk Write Bytes [Success] | Total bytes successfully written | bytes |
| Disk Write Requests | Number of write requests to the device per second | cnt |
| Disk Write Requests [Delta Avg] | Average delta of write requests to the device | cnt |
| Disk Write Requests [Delta Max] | Maximum delta of write requests to the device | cnt |
| Disk Write Requests [Delta Min] | Minimum delta of write requests to the device | cnt |
| Disk Write Requests [Delta Sum] | Sum of delta of write requests to the device | cnt |
| Disk Write Requests [Success Delta] | Delta of successful write requests to the device | cnt |
| Disk Write Requests [Success] | Total successful write requests | cnt |
| Disk Write Bytes | Bytes written to the device per second | bytes |
| Filesystem Hang Check | Filesystem hang check (normal: 1, abnormal: 0) | status |
| Filesystem Nodes | Total number of filesystem nodes | cnt |
| Filesystem Nodes [Free] | Total number of available filesystem nodes | cnt |
| Filesystem Size [Available] | Available disk space in bytes | bytes |
| Filesystem Size [Free] | Free disk space in bytes | bytes |
| Filesystem Size [Total] | Total disk space in bytes | bytes |
| Filesystem Usage | Disk space usage rate | % |
| Filesystem Usage [Avg] | Average disk space usage rate | % |
| Filesystem Usage [Inode] | Inode usage rate | % |
| Filesystem Usage [Max] | Maximum disk space usage rate | % |
| Filesystem Usage [Min] | Minimum disk space usage rate | % |
| Filesystem Usage [Total] | Total disk space usage rate | % |
| Filesystem Used | Used disk space in bytes | bytes |
| Filesystem Used [Inode] | Used inode space in bytes | bytes |
| Memory Free | Total available memory in bytes | bytes |
| Memory Free [Actual] | Actual available memory in bytes | bytes |
| Memory Free [Swap] | Available swap memory in bytes | bytes |
| Memory Total | Total memory in bytes | bytes |
| Memory Total [Swap] | Total swap memory in bytes | bytes |
| Memory Usage | Memory usage rate | % |
| Memory Usage [Actual] | Actual memory usage rate | % |
| Memory Usage [Cache Swap] | Cache swap usage rate | % |
| Memory Usage [Swap] | Swap memory usage rate | % |
| Memory Used | Used memory in bytes | bytes |
| Memory Used [Actual] | Actual used memory in bytes | bytes |
| Memory Used [Swap] | Used swap memory in bytes | bytes |
| Collisions | Network collisions | cnt |
| Network In Bytes | Received bytes | bytes |
| Network In Bytes [Delta Avg] | Average delta of received bytes | bytes |
| Network In Bytes [Delta Max] | Maximum delta of received bytes | bytes |
| Network In Bytes [Delta Min] | Minimum delta of received bytes | bytes |
| Network In Bytes [Delta Sum] | Sum of delta of received bytes | bytes |
| Network In Bytes [Delta] | Delta of received bytes | bytes |
| Network In Dropped | Dropped received packets | cnt |
| Network In Errors | Received errors | cnt |
| Network In Packets | Received packets | cnt |
| Network In Packets [Delta Avg] | Average delta of received packets | cnt |
| Network In Packets [Delta Max] | Maximum delta of received packets | cnt |
| Network In Packets [Delta Min] | Minimum delta of received packets | cnt |
| Network In Packets [Delta Sum] | Sum of delta of received packets | cnt |
| Network In Packets [Delta] | Delta of received packets | cnt |
| Network Out Bytes | Sent bytes | bytes |
| Network Out Bytes [Delta Avg] | Average delta of sent bytes | bytes |
| Network Out Bytes [Delta Max] | Maximum delta of sent bytes | bytes |
| Network Out Bytes [Delta Min] | Minimum delta of sent bytes | bytes |
| Network Out Bytes [Delta Sum] | Sum of delta of sent bytes | bytes |
| Network Out Bytes [Delta] | Delta of sent bytes | bytes |
| Network Out Dropped | Dropped sent packets | cnt |
| Network Out Errors | Sent errors | cnt |
| Network Out Packets | Sent packets | cnt |
| Network Out Packets [Delta Avg] | Average delta of sent packets | cnt |
| Network Out Packets [Delta Max] | Maximum delta of sent packets | cnt |
| Network Out Packets [Delta Min] | Minimum delta of sent packets | cnt |
| Network Out Packets [Delta Sum] | Sum of delta of sent packets | cnt |
| Network Out Packets [Delta] | Delta of sent packets | cnt |
| Open Connections [TCP] | Open TCP connections | cnt |
| Open Connections [UDP] | Open UDP connections | cnt |
| Port Usage | Port usage rate | % |
| SYN Sent Sockets | Number of sockets in SYN_SENT state | cnt |
| Kernel PID Max | Maximum PID value | cnt |
| Kernel Thread Max | Maximum thread value | cnt |
| Process CPU Usage | CPU time used by the process | % |
| Process CPU Usage/Core | CPU time used by the process per core | % |
| Process Memory Usage | Resident Set size | % |
| Process Memory Used | Used memory by the process | bytes |
| Process PID | Process ID | PID |
| Process PPID | Parent process ID | PID |
| Processes [Dead] | Number of dead processes | cnt |
| Processes [Idle] | Number of idle processes | cnt |
| Processes [Running] | Number of running processes | cnt |
| Processes [Sleeping] | Number of sleeping processes | cnt |
| Processes [Stopped] | Number of stopped processes | cnt |
| Processes [Total] | Total number of processes | cnt |
| Processes [Unknown] | Number of unknown processes | cnt |
| Processes [Zombie] | Number of zombie processes | cnt |
| Running Process Usage | Process usage rate | % |
| Running Processes | Number of running processes | cnt |
| Running Thread Usage | Thread usage rate | % |
| Running Threads | Number of running threads | cnt |
| Context Switches | Context switches per second | cnt |
| Load/Core [1 min] | Load per core over 1 minute | cnt |
| Load/Core [15 min] | Load per core over 15 minutes | cnt |
| Load/Core [5 min] | Load per core over 5 minutes | cnt |
| Multipaths [Active] | Number of active multipath connections | cnt |
| Multipaths [Failed] | Number of failed multipath connections | cnt |
| Multipaths [Faulty] | Number of faulty multipath connections | cnt |
| NTP Offset | Measured offset from the NTP server | num |
| Run Queue Length | Run queue length | num |
| Uptime | System uptime in milliseconds | ms |
| Context Switches | Context switches per second | cnt |
| Disk Read Bytes [Sec] | Bytes read from the device per second | cnt |
| Disk Read Time [Avg] | Average read time from the device | sec |
| Disk Transfer Time [Avg] | Average disk transfer time | sec |
| Disk Usage | Disk usage rate | % |
| Disk Write Bytes [Sec] | Bytes written to the device per second | cnt |
| Disk Write Time [Avg] | Average write time to the device | sec |
| Pagingfile Usage | Paging file usage rate | % |
| Pool Used [Non Paged] | Non-paged pool usage | bytes |
| Pool Used [Paged] | Paged pool usage | bytes |
| Process [Running] | Number of running processes | cnt |
| Threads [Running] | Number of running threads | cnt |
| Threads [Waiting] | Number of waiting threads | cnt |

Table. GPU Server Additional Monitoring Metrics (Agent Installation Required)
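Several of the Agent-collected GPU metrics above (GPU Usage, GPU Memory Used, GPU Temperature, GPU Power Usage) correspond to fields you can also query manually with `nvidia-smi` on the server. A sketch follows; the query flags are standard `nvidia-smi` options, while the sample output line and the threshold are illustrative:

```shell
# Real query (flags from the nvidia-smi manual); run this on the GPU Server:
#   nvidia-smi --query-gpu=index,utilization.gpu,memory.used,temperature.gpu,power.draw \
#              --format=csv,noheader,nounits
# Illustrative sample output line for one GPU:
sample="0, 87, 40960, 65, 310.50"
# Extract the GPU utilization field (2nd column) and apply an example threshold
util=$(printf '%s\n' "$sample" | awk -F', ' '{print $2}')
if [ "$util" -gt 80 ]; then
  echo "GPU 0 utilization high: ${util}%"
fi
```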
3.1.3 - ServiceWatch Metrics
GPU Server sends metrics to ServiceWatch. The metrics provided by basic monitoring are data collected at 5-minute intervals. When detailed monitoring is enabled, you can view data collected at 1-minute intervals.
- GPU Server’s basic monitoring and detailed monitoring are provided with the same metrics as Virtual Server, and the namespace is also provided as Virtual Server.
- GPU-related metrics are provided through ServiceWatch Agent. For information on how to collect metrics using ServiceWatch Agent, refer to the ServiceWatch Agent guide.
For information on how to enable detailed monitoring for GPU Server, refer to How-to guides > Enable ServiceWatch detailed monitoring.
Basic Metrics
The following are basic metrics for the namespace Virtual Server.
In the table below, metrics with metric names marked in bold are selected as key metrics among the basic metrics provided by Virtual Server. Key metrics are used to configure service dashboards that are automatically built for each service in ServiceWatch.
For each metric, the user guide indicates which statistic values are meaningful when querying it; among these, the statistic marked in bold is the key statistic value. In the service dashboard, key metrics are displayed using their key statistic values.
| Performance Item | Detailed Description | Unit | Meaningful Statistics |
|---|---|---|---|
| Instance State | Instance state display | None | |
| CPU Usage | CPU usage | Percent | |
| Disk Read Bytes | Amount read from block device (bytes) | Bytes | |
| Disk Read Requests | Number of read requests from block device | Count | |
| Disk Write Bytes | Amount written to block device (bytes) | Bytes | |
| Disk Write Requests | Number of write requests to block device | Count | |
| Network In Bytes | Amount received on network interface (bytes) | Bytes | |
| Network In Dropped | Number of received packets dropped on network interface | Count | |
| Network In Packets | Number of received packets on network interface | Count | |
| Network Out Bytes | Amount transmitted on network interface (bytes) | Bytes | |
| Network Out Dropped | Number of transmitted packets dropped on network interface | Count | |
| Network Out Packets | Number of transmitted packets on network interface | Count | |
3.2 - How-to guides
The user can enter the required information of the GPU Server through the Samsung Cloud Platform Console, select detailed options, and create the service.
Create GPU Server
You can create and use GPU Server services from the Samsung Cloud Platform Console.
If you want to create a GPU Server, follow the steps below.
- Click the All Services > Compute > GPU Server menu. You will be taken to the GPU Server Service Home page.
- On the Service Home page, click the Create GPU Server button. You will be taken to the Create GPU Server page.
- On the Create GPU Server page, enter the information required to create the service and select detailed options.
- Select the required information in the Image and Version Selection area.
| Category | Required | Detailed Description |
|---|---|---|
| Image | Required | Select the provided image type: Ubuntu |
| Image version | Required | Select the version of the chosen image; a list of offered server image versions is provided |

Table. GPU Server image and version selection input items
- Enter or select the required information in the Service Information Input area.
- Server count (Required): Number of GPU Servers to create simultaneously
  - Only numbers can be entered; input a value between 1 and 100
- Service Type > Server Type (Required): GPU Server server type
  - Indicates the server specifications by GPU type; select a server that includes 1, 2, 4, or 8 GPUs
  - For details on the server types provided by GPU Server, refer to GPU Server Server Type
- Service Type > Planned Compute (Optional): Status of resources with Planned Compute set
  - In Use: Number of resources with Planned Compute set that are currently in use
  - Configured: Number of resources with Planned Compute set
  - Coverage Preview: Amount applied per resource by Planned Compute
  - Planned Compute Service Application: Go to the Planned Compute service application page
  - For more details, refer to Apply for Planned Compute
- Block Storage (Required): Set the Block Storage used by the GPU Server according to its purpose
  - Basic: Area where the OS is installed and used
    - Capacity is entered in Units (minimum capacity varies depending on the OS image type)
      - RHEL: values between 3 and 1,536 can be entered
      - Ubuntu: values between 3 and 1,536 can be entered
    - SSD: high-performance general volume
    - HDD: general volume
    - SSD/HDD_KMS: encrypted volume using a Samsung Cloud Platform KMS (Key Management System) encryption key
      - Encryption can only be applied at initial creation (cannot be changed after creation)
      - Performance degradation may occur when using the SSD_KMS disk type
  - Additional: Used when additional user space is needed outside the OS area
    - After selecting Use, enter the storage type and capacity
    - To add storage, click the + button (up to 25 can be added); to delete, click the x button
    - Capacity is entered in Units, with values between 1 and 1,536
    - Since 1 Unit equals 8 GB, 8 to 12,288 GB can be created
    - SSD: high-performance general volume
    - HDD: general volume
    - SSD/HDD_KMS: encrypted volume using a Samsung Cloud Platform KMS (Key Management System) encryption key
      - Encryption can only be applied at initial creation (cannot be changed after creation)
      - Performance degradation may occur when using the SSD_KMS disk type
    - For details on each Block Storage type, refer to Create Block Storage
  - Delete on termination: If set to Enabled, the volume is terminated together with the server
    - Volumes with snapshots are not deleted even if Delete on termination is set to Enabled
    - A multi-attach volume is deleted only when the server being terminated is the last remaining server attached to the volume

Table. GPU Server Service Configuration Items
- Enter or select the required information in the Required Information Input area.
Category RequiredDetailed description Server Name Required Enter a name to distinguish the server when the selected server count is 1 - Set hostname with the entered server name
- Use English letters, numbers, spaces, and special characters (
-_) within 63 characters
Server name Prefix Required Enter Prefix to distinguish each server generated when the selected number of servers is 2 or more - Automatically generated as user input value (prefix) + ‘
-#’ format
- Enter within 59 characters using English letters, numbers, spaces, and special characters (
-,_)
Network Settings > Create New Network Port Required Set the network where the GPU Server will be installed - Select a pre-created VPC.
- General Subnet: Select a pre-created General Subnet
- IP can be set to auto-generate or user input; if input is selected, the user can directly enter the IP
- NAT: Can be used only if there is one server and the VPC has an Internet Gateway attached. Use checked allows selection of NAT IP
- NAT IP: Select NAT IP
- If there is no NAT IP to select, click the Create New button to generate a Public IP
- Refresh button click to view and select the created Public IP
- Creating a Public IP incurs charges according to the Public IP pricing policy
- Local Subnet (Optional): Select Local Subnet Use
- Not a required element for creating the service
- A pre-created Local Subnet must be selected
- IP can be set to auto-generate or user input; input selected allows user to directly enter IP
- Security Group: Settings required to access the server
- Select: Choose a pre-created Security Group
- Create New: If there is no applicable Security Group, it can be created separately in the Security Group service
- Up to 5 can be selected
- If no Security Group is set, all access is blocked by default
- A Security Group must be set to allow required access
Network Settings > Existing Network Port Specification Required Set the network where the GPU Server will be installed - Select a pre-created VPC
- General Subnet: Select a pre-created General Subnet and Port
- NAT: Can be used only if there is one server and the VPC has an Internet Gateway attached; checking use allows selection of a NAT IP.
- NAT IP: Select NAT IP
- If there is no NAT IP to select, click the New Creation button to generate a Public IP
- Click the Refresh button to view and select the created Public IP
- Local Subnet (Optional): Select Use of Local Subnet
- Select a pre-created Local Subnet and Port
Keypair Required User authentication method to use when connecting to the server - New Creation: If a new Keypair is needed, create a new one
- Refer to Create Keypair for how to create a new Keypair
- List of default login accounts by OS
- RHEL: cloud-user
- Ubuntu: ubuntu
Table. GPU Server required information input items
- Additional Information Input: Enter or select the required information in the area.
Category Required Detailed description Lock Optional Set whether to use Lock - Using Lock prevents actions such as server termination, start, and stop from being executed, preventing malfunctions caused by mistakes
Init script Optional Script executed when the server starts - Depending on the image type, the Init script must be written as a Batch script for Windows, or as a Shell script or cloud-init directives for Linux
- Up to 45,000 bytes can be entered
Tag Optional Add Tag - Up to 50 can be added per resource
- After clicking the Add Tag button, enter or select Key, Value values
Table. GPU Server Additional Information Input Items
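As a concrete reference for the Init script item above, here is a minimal sketch of a Linux Shell-script init script. The log path and the commented-out package install are illustrative assumptions, not platform requirements:

```shell
#!/bin/bash
# Hypothetical first-boot init script for a Linux image.
# Keep the whole script under the 45,000-byte limit noted above.
LOG=/tmp/init-script.log               # illustrative log path
echo "init started: $(date)" >> "$LOG"
# dnf install -y htop                  # example provisioning step (commented out)
echo "init finished" >> "$LOG"
```

For Windows images, the same role is played by a Batch script; Linux images also accept cloud-init user-data, as noted in the table above.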
- Image and Version Selection: Select the required information in the area.
- Summary: Check the detailed information and estimated billing amount shown in the panel, and click the Complete button.
- When creation is complete, check the created resources on the GPU Server list page.
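Once the server reaches the Active state, you can connect over SSH with the downloaded keypair and the image's default login account listed above. A sketch in which the keypair file name and server IP are placeholders:

```shell
# Placeholder keypair file and server IP; substitute your own values.
chmod 400 my-keypair.pem                         # ssh rejects world-readable keys
ssh -i my-keypair.pem ubuntu@203.0.113.10        # Ubuntu image default account
# ssh -i my-keypair.pem cloud-user@203.0.113.10  # RHEL image default account
```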
Check GPU Server Detailed Information
In the GPU Server service, you can view and edit the full resource list and detailed information. The GPU Server Details page consists of Detail Information, Tags, and Job History tabs.
To view detailed information about the GPU Server service, follow the steps below.
- Click the All Services > Compute > GPU Server menu. You will be taken to the GPU Server Service Home page.
- Click the GPU Server menu on the Service Home page. You will be taken to the GPU Server list page.
- On the GPU Server list page, click the resource whose detailed information you want to view. You will be taken to the GPU Server Details page.
- The GPU Server Details page displays status information and additional feature information, and consists of Details, Tags, and Job History tabs.
- For detailed information about GPU Server statuses, refer to Check GPU Server detailed information.
Category Detailed description GPU Server status Status of the GPU Server created by the user - Build: State where the Build command has been delivered
- Building: Build in progress
- Networking: Configuring the network during server creation
- Scheduling: Selecting a host during server creation
- Block_Device_Mapping: Connecting Block Storage during server creation
- Spawning: State where the server creation process is ongoing
- Active: Usable state
- Powering_off: State when stop request is made
- Deleting: Server deletion in progress
- Reboot_Started: Reboot in progress state
- Error: Error state
- Migrating: State where server is migrating to another host
- Reboot: State where Reboot command has been delivered
- Rebooting: Restart in progress
- Rebuild: State where Rebuild command has been delivered
- Rebuilding: State when Rebuild is requested
- Rebuild_Spawning: State where Rebuild process is ongoing
- Resize: State where Resize command has been delivered
- Resizing: Resize in progress
- Resize_Prep: State when server type modification is requested
- Resize_Migrating: State where server is moving to another host while Resize is in progress
- Resize_Migrated: State where server has completed moving to another host while Resize is in progress
- Resize_Finish: Resize completed
- Revert_Resize: Resize or migration of the server failed for some reason. The target server is cleaned up and the original server restarts
- Shutoff: State when Powering off is completed
- Verify_Resize: After Resize_Prep due to a server type modification request, state where the server type change can be confirmed or reverted
- Resize_Reverting: State when server type revert is requested
- Resize_Confirming: State where server’s Resize request is being confirmed
Server Control Button to change server status - Start: Start a stopped server
- Stop: Stop a running server
- Restart: Restart a running server
Image Generation Create a user custom image using the current server's image
Console Log View the current server's console log - You can check the console log output from the current server. For more details, refer to Check Console Log.
Dump creation Create a dump of the current server - The dump file is created inside the GPU Server
- For the detailed dump creation method, refer to [Create Dump](#dump-생성하기)
Rebuild Rebuild the current server - For details, refer to [Perform Rebuild](#rebuild-수행하기).
Table. GPU Server status information and additional features
Detailed Information
On the GPU Server list page, you can view the detailed information of the selected resource and, if necessary, edit the information.
| Category | Detailed description |
|---|---|
| Service | Service name |
| Resource Type | Resource Type |
| SRN | Unique resource ID in Samsung Cloud Platform |
| Resource Name | Resource Name |
| Resource ID | Unique resource ID in the service |
| Creator | User who created the service |
| Creation time | Service creation time |
| Editor | User who edited the service information |
| Modification Date | Date the service information was modified |
| Server name | Server name |
| Server Type | vCPU, Memory, and GPU information |
| Image Name | Service OS image and version |
| Lock | Lock usage status |
| Server Group | Server group the server belongs to |
| Keypair name | Server authentication information set by the user |
| Planned Compute | Resource status with Planned Compute set |
| Network | Network information of the GPU Server |
| Local Subnet | Local Subnet information of the GPU Server |
| Block Storage | Information of the Block Storage connected to the server |
Tags
On the GPU Server list page, you can view the tag information of the selected resource, and add, modify, or delete tags.
| Category | Detailed description |
|---|---|
| Tag List | Tag list |
Work History
You can view the job history of the selected resource on the GPU Server List page.
| Category | Detailed description |
|---|---|
| Work History List | Resource change history |
GPU Server Operation Control
If you need to control the operation of created GPU Server resources, you can perform the task on the GPU Server list or GPU Server Details page. You can start a stopped server, and stop or restart a running server.
GPU Server Start
You can start a GPU Server in the Shutoff state. To start the GPU Server, follow the steps below.
- Click the All Services > Compute > GPU Server menu. You will be taken to the GPU Server Service Home page.
- Click the GPU Server menu on the Service Home page. You will be taken to the GPU Server list page.
- On the GPU Server list page, click the server to start among the servers in the Shutoff state. You will be taken to the GPU Server Details page.
- On the GPU Server list page, you can also start each resource via the More button on the right.
- After selecting multiple servers with the checkboxes, you can control them simultaneously with the Start button at the top.
- On the GPU Server Details page, click the Start button at the top to start the server. Check the changed server status in the status display item.
- When the GPU Server start is completed, the server status changes from Shutoff to Active.
- For detailed information about GPU Server statuses, refer to Check GPU Server detailed information.
GPU Server Stop
You can stop a GPU Server that is in the Active state. To stop the GPU Server, follow the steps below.
- Click the All Services > Compute > GPU Server menu. You will be taken to the GPU Server Service Home page.
- Click the GPU Server menu on the Service Home page. You will be taken to the GPU Server list page.
- On the GPU Server list page, click the server to stop among the servers in the Active state. You will be taken to the GPU Server Details page.
- On the GPU Server list page, you can also stop each resource via the More button on the right.
- After selecting multiple servers with the checkboxes, you can control them simultaneously with the Stop button at the top.
- On the GPU Server Details page, click the Stop button at the top to stop the server. Check the changed server status in the status display item.
- When the GPU Server stop is completed, the server status changes from Active to Shutoff.
- For detailed information about GPU Server statuses, refer to Check GPU Server detailed information.
GPU Server Restart
You can restart a created GPU Server. To restart the GPU Server, follow the steps below.
- Click the All Services > Compute > GPU Server menu. You will be taken to the GPU Server Service Home page.
- Click the GPU Server menu on the Service Home page. You will be taken to the GPU Server list page.
- On the GPU Server list page, click the server to restart. You will be taken to the GPU Server Details page.
- On the GPU Server list page, you can also restart each resource via the More button on the right.
- After selecting multiple servers with the checkboxes, you can control them simultaneously with the Restart button at the top.
- On the GPU Server Details page, click the Restart button at the top to restart the server. Check the changed server status in the status display item.
- During the GPU Server restart, the server status goes through Rebooting and finally changes to Active.
- For detailed information about GPU Server statuses, refer to Check GPU Server detailed information.
GPU Server Resource Management
If you need server control and management functions for the generated GPU Server resources, you can perform the work on the GPU Server Resource List or GPU Server Details page.
Create Image
You can create a user custom image from a running GPU Server.
- On the GPU Server list or GPU Server Details page, click the Image Generation button to create a user custom image.
To create an image of the GPU Server, follow the steps below.
- Click the All Services > Compute > GPU Server menu. You will be taken to the GPU Server Service Home page.
- Click the GPU Server menu on the Service Home page. You will be taken to the GPU Server list page.
- On the GPU Server list page, click the resource from which to create an image. You will be taken to the GPU Server Details page.
- On the GPU Server Details page, click the Image Generation button. You will be taken to the Image Generation page.
- In the Service Information Input area, enter the required information.
Category Required Detailed description Image Name Required Name of the image to be created - Enter within 200 characters using English letters, numbers, spaces, and special characters (-, _)
Table. Image Service Information Input Items
- Check the input information and click the Complete button.
- When creation is complete, check the created resources on the All Services > Compute > GPU Server > Image List page.
- If you create an Image, the generated Image is stored in the Object Storage used as internal storage. Therefore, Object Storage usage fees are charged.
- Since the integrity of the file system cannot be guaranteed for an image created from a GPU Server in the Active state, it is recommended to create the image after shutting down the server.
Enable ServiceWatch Detailed Monitoring
By default, the GPU Server is integrated with ServiceWatch basic monitoring through the Virtual Server namespace. You can enable detailed monitoring as needed to more quickly identify and address operational issues. For more information about ServiceWatch, see the ServiceWatch Overview (/userguide/management/service_watch/overview/).
To enable detailed monitoring of ServiceWatch on the GPU Server, follow these steps.
- Click the All Services > Compute > GPU Server menu. You will be taken to the GPU Server Service Home page.
- Click the GPU Server menu on the Service Home page. You will be taken to the GPU Server list page.
- On the GPU Server list page, click the resource for which to enable ServiceWatch detailed monitoring. You will be taken to the GPU Server Details page.
- On the GPU Server Details page, click the ServiceWatch detailed monitoring Edit button. The ServiceWatch Detailed Monitoring Edit popup window opens.
- In the ServiceWatch Detailed Monitoring Edit popup window, select Enable, check the guidance text, and click the Confirm button.
- On the GPU Server Details page, check the ServiceWatch detailed monitoring items.
Disable ServiceWatch Detailed Monitoring
To disable the detailed monitoring of ServiceWatch on the GPU Server, follow the steps below.
- Click the All Services > Compute > GPU Server menu. You will be taken to the GPU Server Service Home page.
- Click the GPU Server menu on the Service Home page. You will be taken to the GPU Server list page.
- On the GPU Server list page, click the resource for which to disable ServiceWatch detailed monitoring. You will be taken to the GPU Server Details page.
- On the GPU Server Details page, click the ServiceWatch detailed monitoring Edit button. The ServiceWatch Detailed Monitoring Edit popup window opens.
- In the ServiceWatch Detailed Monitoring Edit popup window, deselect Enable, check the guidance text, and click the Confirm button.
- On the GPU Server Details page, check the ServiceWatch detailed monitoring items.
Additional GPU Server Management Features
For GPU Server management, you can view console logs, create dumps, and perform rebuilds. Follow the steps below for each task.
Check console log
You can view the current console log of the GPU Server.
To check the console logs of the GPU Server, follow the steps below.
- Click the All Services > Compute > GPU Server menu. You will be taken to the GPU Server Service Home page.
- Click the GPU Server menu on the Service Home page. You will be taken to the GPU Server list page.
- On the GPU Server list page, click the resource whose console log you want to view. You will be taken to the GPU Server Details page.
- On the GPU Server Details page, click the Console Log button. The Console Log popup window opens.
- Check the console log displayed in the Console Log popup window.
Create Dump
To create a Dump file of the GPU Server, follow the steps below.
- Click the All Services > Compute > GPU Server menu. You will be taken to the GPU Server Service Home page.
- Click the GPU Server menu on the Service Home page. You will be taken to the GPU Server list page.
- On the GPU Server list page, click the resource for which to create a dump. You will be taken to the GPU Server Details page.
- On the GPU Server Details page, click the Create Dump button.
- The dump file is created inside the GPU Server.
Perform Rebuild
You can delete all data and settings of an existing GPU Server and rebuild it as a new server.
To perform a Rebuild of the GPU Server, follow the steps below.
- Click the All Services > Compute > GPU Server menu. You will be taken to the GPU Server Service Home page.
- Click the GPU Server menu on the Service Home page. You will be taken to the GPU Server list page.
- On the GPU Server list page, click the resource on which to perform the Rebuild. You will be taken to the GPU Server Details page.
- On the GPU Server Details page, click the Rebuild button.
- During the GPU Server Rebuild, the server status changes to Rebuilding; when the Rebuild is completed, the status returns to the state before the Rebuild.
- For detailed information about GPU Server statuses, refer to Check GPU Server detailed information.
GPU Server Termination
Terminating an unused GPU Server can reduce operating costs. However, terminating a GPU Server may immediately stop the service currently running on it, so proceed with the termination only after fully considering the impact of the service interruption.
To terminate the GPU Server, follow the steps below.
- Click the All Services > Compute > GPU Server menu. You will be taken to the GPU Server Service Home page.
- Click the GPU Server menu on the Service Home page. You will be taken to the GPU Server list page.
- On the GPU Server list page, select the resource to terminate and click the Cancel Service button.
- Whether connected storage is deleted on termination depends on the Delete on termination setting; refer to Termination Constraints.
- When termination is completed, check on the GPU Server list page whether the resource has been terminated.
Termination Constraints
If the termination request for the GPU Server cannot be processed, a popup window will explain why. Refer to the cases below.
- If File Storage is connected, disconnect the File Storage connection first.
- If an LB Pool is connected, disconnect the LB Pool connection first.
- If Lock is set, change the Lock setting to unused and try again.
Whether attached storage is deleted on termination depends on the Delete on termination setting.
- If Delete on termination is not set: the volume is not deleted even if you terminate the GPU Server.
- If Delete on termination is set: the volume is deleted when you terminate the GPU Server.
- Volumes with a Snapshot are not deleted even if Delete on termination is set.
- A multi-attach volume is deleted only when the server being deleted is the last remaining server attached to the volume.
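The volume-deletion rules above can be summarized as a small decision function. This is an illustrative sketch of the documented behavior, not platform code:

```shell
# Succeeds (exit 0) when a volume would be deleted on server termination:
# Delete on termination must be set, the volume must have no snapshot,
# and for multi-attach volumes this must be the last attached server.
volume_deleted_on_termination() {
  local delete_on_term=$1 has_snapshot=$2 attached_servers=$3
  [ "$delete_on_term" = "true" ] &&
    [ "$has_snapshot" = "false" ] &&
    [ "$attached_servers" -le 1 ]
}

volume_deleted_on_termination true false 1 && echo "deleted"  # last server, no snapshot
volume_deleted_on_termination true true  1 || echo "kept"     # snapshot protects the volume
volume_deleted_on_termination true false 3 || echo "kept"     # other servers still attached
```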
3.2.1 - Image Management
Users can enter the required information for the Image service within the GPU Server service and select detailed options through the Samsung Cloud Platform Console to create the respective service.
Image Generation
You can create an image of a running GPU Server. To create an image of a GPU Server, please refer to Create Image.
Check Image Detailed Information
In the Image service, you can view and edit the full resource list and detailed information. The Image Details page consists of Detail Information, Tag, and Work History tabs.
To view detailed information of the Image service, follow the steps below.
- Click the All Services > Compute > GPU Server menu. You will be taken to the GPU Server Service Home page.
- Click the Image menu on the Service Home page. You will be taken to the Image list page.
- On the Image list page, click the resource whose detailed information you want to view. You will be taken to the Image Details page.
- The Image Details page displays status information and additional feature information, and consists of Detail Information, Tag, and Work History tabs.
Category Detailed description Image Status Status of the user-created Image - Active: Available state
- Queued: When creating the image, the image has been uploaded and is waiting to be processed
- Importing: When creating the image, the image has been uploaded and is being processed
Share to another Account Share the Image with another Account - The Image's Visibility must be Shared to share it with another Account
Delete Image Button to delete the Image - A deleted Image cannot be recovered
Table. GPU Server Image status information and additional features
Detailed Information
On the Image list page, you can view the detailed information of the selected resource and edit the information if necessary.
| Category | Detailed description |
|---|---|
| Service | Service Name |
| Resource Type | Resource Type |
| SRN | Unique resource ID in Samsung Cloud Platform |
| Resource Name | Image Name |
| Resource ID | Image ID |
| Creator | User who created the Image |
| Creation date/time | Date/time when the image was created |
| Editor | User who edited the Image |
| Edit date/time | Date and time the image was edited |
| Image name | Image name |
| Minimum Disk | Minimum disk capacity of the image (GB) |
| Minimum RAM | Minimum RAM capacity of the image (GB) |
| OS type | OS type of the image |
| OS hash algorithm | OS hash algorithm method |
| Visibility | Access permissions for the image |
| Protected | Whether image deletion is prohibited |
| Image File URL | Image file URL uploaded when creating the image |
| Sharing Status | Status of sharing the image with other Accounts |
Tag
On the Image list page, you can view the tag information of the selected resource, and you can add, modify, or delete it.
| Category | Detailed description |
|---|---|
| Tag List | Tag list |
Work History
You can view the operation history of the selected resource on the Image list page.
| Category | Detailed description |
|---|---|
| Work History List | Resource change history |
Image Resource Management
This section describes the control and management functions for created images.
Share to another account
To share an Image with another Account, follow the steps below.
- Log in to the Account sharing the Image and click the All Services > Compute > GPU Server menu. You will be taken to the GPU Server Service Home page.
- Click the Image menu on the Service Home page. You will be taken to the Image list page.
- On the Image list page, click the Image to share. You will be taken to the Image Details page.
- Click the Share to another Account button. You will be taken to the Share image to another Account page.
- The Share to another Account feature allows you to share the Image with another Account. To share the Image, its Visibility must be Shared.
- On the Share image to another Account page, enter the required information and click the Complete button.
Category Required Detailed description Image Name - Name of the image to share - Input not allowed
Image ID - ID of the image to share - Input not allowed
Shared Account ID Required Enter the other Account ID to share with - Enter within 64 characters using English letters, numbers, and special characters (-)
Table. Required input items for sharing images to another Account
- You can check the sharing status information on the Image Details page.
- At the initial request, the status is Pending; when approval is completed in the shared account, it changes to Accepted.
Receive sharing from another account
To receive an Image shared from another Account, follow the steps below.
- Log in to the Account receiving the share and click the All Services > Compute > GPU Server menu. You will be taken to the GPU Server Service Home page.
- Click the Image menu on the Service Home page. You will be taken to the Image list page.
- On the Image list page, click the Get Image Share button. The Get Image Share popup window opens.
- In the Get Image Share popup window, enter the resource ID of the Image to receive, and click the Confirm button.
- When image sharing is completed, you can check the shared Image in the Image list.
Delete Image
You can delete unused Images. However, a deleted Image cannot be recovered, so fully consider the impact before proceeding with the deletion.
To delete an Image, follow the steps below.
- Click the All Services > Compute > GPU Server menu. You will be taken to the GPU Server Service Home page.
- Click the Image menu on the Service Home page. You will be taken to the Image list page.
- On the Image list page, select the resource to delete and click the Delete button.
- To delete multiple Images, select their checkboxes on the Image list page and click the Delete button at the top of the resource list.
- When deletion is complete, check on the Image list page whether the resource has been deleted.
3.2.2 - Using Multi-instance GPU in GPU Server
After creating a GPU Server, you can enable the MIG (Multi-instance GPU) feature on the GPU Server’s VM (Guest OS) and create an instance to use it.
Multi-instance GPU (NVIDIA A100) Overview
Multi-instance GPU (MIG) is a feature of the NVIDIA A100, based on the NVIDIA Ampere architecture, that securely partitions the GPU into up to 7 independent GPU instances for running CUDA (Compute Unified Device Architecture) applications. The NVIDIA A100 provides independent GPU resources to multiple users by allocating computing resources in a way optimized for GPU usage while utilizing high-bandwidth memory (HBM) and cache. Users can maximize GPU utilization by running workloads that individually do not reach the GPU's maximum computing capacity in parallel.
Using Multi-instance GPU Feature
To use the multi-instance GPU feature, you must create a GPU Server service on the Samsung Cloud Platform and then create a VM Instance (GuestOS) with an A100 GPU assigned. After completing the GPU Server creation, follow the MIG application and release procedures below.
- The system requirements for using the MIG feature are as follows (refer to NVIDIA - Supported GPUs).
- CUDA Toolkit 11 and NVIDIA driver 450.80.02 or later
- A Linux distribution operating system that supports CUDA Toolkit 11
- When operating a container or Kubernetes service, the requirements for using the MIG feature are as follows.
- NVIDIA Container Toolkit (nvidia-docker2) v2.5.0 or later
- NVIDIA K8s Device Plugin v0.7.0 or later
- NVIDIA gpu-feature-discovery v0.2.0 or later
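Once the container requirements above are met, a MIG device can be handed to a container by its "GPU:GPU-instance" index via the NVIDIA Container Toolkit. A hedged sketch; the CUDA image tag and device index are placeholders, and a MIG-enabled host is assumed:

```shell
# Run a container on MIG device 0 of GPU 0; requires the NVIDIA Container
# Toolkit (nvidia-docker2) on the host. The image tag is a placeholder.
docker run --rm --gpus '"device=0:0"' \
  nvidia/cuda:11.4.3-base-ubuntu20.04 nvidia-smi -L
```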
MIG Application and Usage
To activate MIG and create an instance to assign a task, follow these steps.
MIG Activation
Check the GPU status on the VM Instance (GuestOS) before applying MIG.
- Check that MIG mode is in the Disabled state.
```
$ nvidia-smi
Mon Sep 27 08:37:08 2021
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 470.57.02    Driver Version: 470.57.02    CUDA Version: 11.4     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA A100-SXM...  Off  | 00000000:05:00.0 Off |                    0 |
| N/A   32C    P0    59W / 400W |      0MiB / 81251MiB |      0%      Default |
|                               |                      |             Disabled |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+
```

Code block. nvidia-smi command - Check GPU inactive state (1)

```
$ nvidia-smi -L
GPU 0: NVIDIA A100-SXM-80GB (UUID: GPU-c956838f-494a-92b2-6818-56eb28fe25e0)
```

Code block. nvidia-smi command - Check GPU inactive state (2)
In the VM Instance(GuestOS), enable MIG for each GPU and reboot the VM Instance.
```
$ nvidia-smi -i 0 -mig 1
Enabled MIG mode for GPU 00000000:05:00.0
All done.
# reboot
```

Code block. nvidia-smi command - MIG activation
If the GPU monitoring agent displays the following warning message, stop the nvsm and dcgm services before enabling MIG.
Warning: MIG mode is in pending enable state for GPU 00000000:05:00.0: In use by another client. 00000000:05:00.0 is currently being used by one or more other processes (e.g. CUDA application or a monitoring application such as another instance of nvidia-smi).
```
# systemctl stop nvsm
# systemctl stop dcgm
```
- After completing the MIG work, restart the nvsm and dcgm services.
- Check the GPU status after applying MIG in the VM Instance (GuestOS).
- Check that MIG mode is in the Enabled state.
```
$ nvidia-smi
Mon Sep 27 09:44:33 2021
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 470.57.02    Driver Version: 470.57.02    CUDA Version: 11.4     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA A100-SXM...  Off  | 00000000:05:00.0 Off |                   On |
| N/A   32C    P0    59W / 400W |      0MiB / 81251MiB |      0%      Default |
|                               |                      |              Enabled |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| MIG devices:                                                                |
+-----------------------------------------------------------------------------+
| GPU  GI  CI  MIG |          Memory-Usage |        Vol|       Shared         |
|      ID  ID  Dev |            BAR1-Usage | SM     Unc| CE ENC DEC OFA JPG   |
|                  |                       |        ECC|                      |
|=============================================================================|
|  No MIG devices found                                                       |
+-----------------------------------------------------------------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+
```

Code block. nvidia-smi command - Check GPU activation status (1)

```
$ nvidia-smi -L
GPU 0: NVIDIA A100-SXM-80GB (UUID: GPU-c956838f-494a-92b2-6818-56eb28fe25e0)
```

Code block. nvidia-smi command - Check GPU activation status (2)
GPU Instance Creation
After activating MIG and checking the status, you can create a GPU Instance.
Check the list of MIG GPU instance profiles that can be created.
```shell
$ nvidia-smi mig -i [GPU ID] -lgip
```
Code block. nvidia-smi command - MIG GPU Instance profile list check

```shell
$ nvidia-smi mig -i 0 -lgip
+-----------------------------------------------------------------------------+
| GPU instance profiles:                                                      |
| GPU   Name            ID    Instances   Memory     P2P    SM    DEC   ENC   |
|                             Free/Total   GiB              CE    JPEG  OFA   |
|=============================================================================|
|   0  MIG 1g.10gb      19     7/7         9.50      No     14     0     0    |
|                                                            1     0     0    |
+-----------------------------------------------------------------------------+
|   0  MIG 1g.10gb+me   20     1/1         9.50      No     14     0     0    |
|                                                            1     1     1    |
+-----------------------------------------------------------------------------+
|   0  MIG 2g.20gb      14     3/3        19.50      No     28     1     0    |
|                                                            2     0     0    |
+-----------------------------------------------------------------------------+
|   0  MIG 3g.40gb       9     2/2        39.50      No     42     2     0    |
|                                                            3     0     0    |
+-----------------------------------------------------------------------------+
|   0  MIG 4g.40gb       5     1/1        39.50      No     56     2     0    |
|                                                            4     0     0    |
+-----------------------------------------------------------------------------+
|   0  MIG 7g.80gb       0     1/1        79.25      No     98     0     0    |
|                                                            7     1     1    |
+-----------------------------------------------------------------------------+
```
Code block. MIG GPU Instance profile list example
| Profile Name | Fraction of Memory | Fraction of SMs | Hardware Units | L2 Cache Size | Number of Instances Available |
|---|---|---|---|---|---|
| MIG 1g.10gb | 1/8 | 1/7 | 0 NVDECs /0 JPEG /0 OFA | 1/8 | 7 |
| MIG 1g.10gb+me | 1/8 | 1/7 | 1 NVDEC /1 JPEG /1 OFA | 1/8 | 1 (A single 1g profile can include media extensions) |
| MIG 2g.20gb | 2/8 | 2/7 | 1 NVDEC /0 JPEG /0 OFA | 2/8 | 3 |
| MIG 3g.40gb | 4/8 | 3/7 | 2 NVDECs /0 JPEG /0 OFA | 4/8 | 2 |
| MIG 4g.40gb | 4/8 | 4/7 | 2 NVDECs /0 JPEG /0 OFA | 4/8 | 1 |
| MIG 7g.80gb | Full | 7/7 | 5 NVDECs /1 JPEG /1 OFA | Full | 1 |
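The profile names encode the memory slice: on an 80 GB A100, memory is divided into eighths (about 10 GB per slice) and SMs into sevenths, so a `2g.20gb` profile takes two memory slices. A quick arithmetic sketch of this slicing, using values from the table above:

```shell
# MIG slicing arithmetic on an 80 GB A100 (values from the profile table above)
total_mem_gb=80
mem_slices=8        # memory is divided into eighths

# Memory per 1g slice; an Ng profile takes N such slices
per_slice_gb=$((total_mem_gb / mem_slices))
echo "1g slice: ${per_slice_gb} GB"          # prints "1g slice: 10 GB"
echo "2g profile: $((2 * per_slice_gb)) GB"  # prints "2g profile: 20 GB"
```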
- Create the MIG GPU Instance, then check that it was created.
GPU Instance creation
```shell
$ nvidia-smi mig -i [GPU ID] -cgi [Profile ID]
```
Code block. nvidia-smi command - GPU Instance creation

```shell
$ nvidia-smi mig -i 0 -cgi 0
Successfully created GPU instance ID 0 on GPU 0 using profile MIG 7g.80gb (ID 0)
```
Code block. nvidia-smi command - GPU Instance creation example

GPU Instance check
```shell
$ nvidia-smi mig -i [GPU ID] -lgi
```
Code block. nvidia-smi command - GPU Instance check

```shell
$ nvidia-smi mig -i 0 -lgi
+--------------------------------------------------------+
| GPU instances:                                         |
| GPU   Name          Profile  Instance   Placement      |
|                       ID       ID       Start:Size     |
|========================================================|
|   0  MIG 7g.80gb       0        0          0:8         |
+--------------------------------------------------------+
```
Code block. nvidia-smi command - GPU Instance check example
Compute Instance Creation
If you have created a GPU Instance, you can create a Compute Instance.
Check the list of MIG Compute Instance profiles that can be created.
```shell
$ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -lcip
```
Code block. nvidia-smi command - MIG Compute Instance profile check

```shell
$ nvidia-smi mig -i 0 -gi 0 -lcip
+---------------------------------------------------------------------------------+
| Compute instance profiles:                                                      |
| GPU   GPU    Name              Profile   Instances   Exclusive       Shared     |
|      Instance                    ID     Free/Total      SM       DEC  ENC  OFA  |
|        ID                                                        CE   JPEG      |
|=================================================================================|
|   0     0    MIG 1c.7g.80gb       0        7/7          14        5    0    1   |
|                                                                   7    1        |
+---------------------------------------------------------------------------------+
|   0     0    MIG 2c.7g.80gb       1        3/3          28        5    0    1   |
|                                                                   7    1        |
+---------------------------------------------------------------------------------+
|   0     0    MIG 3c.7g.80gb       2        2/2          42        5    0    1   |
|                                                                   7    1        |
+---------------------------------------------------------------------------------+
|   0     0    MIG 4c.7g.80gb       3        1/1          56        5    0    1   |
|                                                                   7    1        |
+---------------------------------------------------------------------------------+
|   0     0    MIG 7g.80gb          4*       1/1          98        5    0    1   |
|                                                                   7    1        |
+---------------------------------------------------------------------------------+
```
Code block. MIG Compute Instance profile list example

Create and check the MIG Compute Instance.
- MIG Compute Instance creation

```shell
$ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -cci [Compute Profile ID]
```
Code block. nvidia-smi command - MIG Compute Instance creation

```shell
$ nvidia-smi mig -i 0 -gi 0 -cci 4
Successfully created compute instance ID 0 on GPU instance ID 0 using profile MIG 7g.80gb (ID 4)
```
Code block. nvidia-smi command - MIG Compute Instance creation example

- MIG Compute Instance check

```shell
$ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -lci
```
Code block. nvidia-smi command - MIG Compute Instance check

```shell
$ nvidia-smi mig -i 0 -gi 0 -lci
+-----------------------------------------------------------------+
| Compute instances:                                              |
| GPU   GPU    Name           Profile   Instance    Placement     |
|      Instance                  ID        ID       Start:Size    |
|        ID                                                       |
|=================================================================|
|   0     0    MIG 7g.80gb       4         0           0:7        |
+-----------------------------------------------------------------+
```
Code block. MIG Compute Instance check example

```shell
$ nvidia-smi -L
GPU 0: NVIDIA A100-SXM-80GB (UUID: GPU-c956838f-494a-92b2-6818-56eb28fe25e0)
  MIG 7g.80gb Device 0: (UUID: MIG-53e20040-758b-5ecb-948e-c626d03a9a32)
```
Code block. nvidia-smi command - Check GPU status (1)

```shell
$ nvidia-smi
Mon Sep 27 09:52:17 2021
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 470.57.02    Driver Version: 470.57.02    CUDA Version: 11.4     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA A100-SXM...  Off  | 00000000:05:00.0 Off |                   On |
| N/A   32C    P0    49W / 400W |      0MiB / 81251MiB |     N/A      Default |
|                               |                      |              Enabled |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| MIG devices:                                                                |
+-----------------------------------------------------------------------------+
| GPU  GI  CI  MIG |         Memory-Usage |        Vol|      Shared           |
|      ID  ID  Dev |           BAR1-Usage | SM     Unc| CE ENC DEC OFA JPG    |
|                  |                      |        ECC|                       |
|==================+======================+===========+=======================|
|   0   0   0   0  |      0MiB / 81251MiB | 98      0 |  7   0   5   1   1    |
|                  |      1MiB / 13107... |           |                       |
+------------------+----------------------+-----------+-----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+
```
Code block. nvidia-smi command - Check GPU status (2)
Using MIG
- Use the MIG instance to run jobs.
- Job execution example: you can run a job as follows.
```shell
$ docker run --gpus '"device=[GPU ID]:[MIG ID]"' --rm nvcr.io/nvidia/cuda nvidia-smi
```
Code block. Job execution example

```shell
# Example image: the NGC TensorFlow container (tag assumed from the release banner below)
$ docker run --gpus '"device=0:0"' --rm -it --network=host --shm-size=1g --ipc=host \
    -v /root/.ssh/:/root/.ssh nvcr.io/nvidia/tensorflow:21.08-tf1-py3

================
== TensorFlow ==
================

NVIDIA Release 21.08-tf1 (build 26012104)
TensorFlow Version 1.15.5

Container image Copyright (c) 2021, NVIDIA CORPORATION. All rights reserved.
...

# Python process execution
root@d622a93c9281:/workspace# python /workspace/nvidia-examples/cnn/resnet.py --num_iter 100
...
PY 3.8.10 (default, Jun  2 2021, 10:49:15)
[GCC 9.4.0]
TF 1.15.5
...
```
Code block. Job result
- Check the GPU utilization while the job process is running.
- When the job runs, the process is assigned to the MIG device and utilization increases. You can check GPU utilization as follows.
```shell
$ nvidia-smi
```
Code block. nvidia-smi command - Check GPU utilization

```shell
+-----------------------------------------------------------------------------+
| MIG devices:                                                                |
+-----------------------------------------------------------------------------+
| GPU  GI  CI  MIG |         Memory-Usage |        Vol|      Shared           |
|      ID  ID  Dev |           BAR1-Usage | SM     Unc| CE ENC DEC OFA JPG    |
|                  |                      |        ECC|                       |
|==================+======================+===========+=======================|
|   0   0   0   0  |  66562MiB / 81251MiB | 98      0 |  7   0   5   1   1    |
|                  |      5MiB / 13107... |           |                       |
+------------------+----------------------+-----------+-----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0    0    0      17483      C   python                          66559MiB |
+-----------------------------------------------------------------------------+
```
Code block. Example of checking GPU utilization
MIG Instance deletion and release
To delete a MIG instance and release the MIG, follow these procedures.
Compute Instance deletion
- Delete the Compute Instance.
```shell
$ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -dci
$ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -ci [Compute Instance ID] -dci
```
Code block. nvidia-smi command - Compute Instance deletion

```shell
$ nvidia-smi mig -i 0 -gi 0 -lci
+-----------------------------------------------------------------+
| Compute instances:                                              |
| GPU   GPU    Name           Profile   Instance    Placement     |
|      Instance                  ID        ID       Start:Size    |
|        ID                                                       |
|=================================================================|
|   0     0    MIG 7g.80gb       4         0           0:7        |
+-----------------------------------------------------------------+
```
Code block. MIG Compute Instance check example

```shell
$ nvidia-smi mig -i 0 -gi 0 -dci
Successfully destroyed compute instance ID 0 from GPU instance ID 0
```
Code block. Compute Instance deletion example

```shell
$ nvidia-smi mig -i 0 -gi 0 -lci
No compute instances found: Not found
```
Code block. Compute Instance deletion confirmation
GPU Instance deletion
- Delete the GPU Instance.
```shell
$ nvidia-smi mig -i [GPU ID] -dgi
$ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -dgi
```
Code block. nvidia-smi command - GPU Instance deletion

```shell
$ nvidia-smi mig -i 0 -lgi
+--------------------------------------------------------+
| GPU instances:                                         |
| GPU   Name          Profile  Instance   Placement      |
|                       ID       ID       Start:Size     |
|========================================================|
|   0  MIG 7g.80gb       0        0          0:8         |
+--------------------------------------------------------+
```
Code block. nvidia-smi command - GPU Instance check example

```shell
$ nvidia-smi mig -i 0 -dgi
Successfully destroyed GPU instance ID 0 from GPU 0
```
Code block. nvidia-smi command - GPU Instance deletion example

```shell
$ nvidia-smi mig -i 0 -lgi
No GPU instances found: Not found
```
Code block. nvidia-smi command - GPU Instance deletion confirmation
MIG Deactivation
- Disable MIG, then reboot.
```shell
$ nvidia-smi -mig 0
Disabled MIG Mode for GPU 00000000:05:00.0
All done.
```
Code block. nvidia-smi command - MIG disable

```shell
$ nvidia-smi
Mon Sep 30 05:18:28 2021
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 470.57.02    Driver Version: 470.57.02    CUDA Version: 11.4     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA A100-SXM...  Off  | 00000000:05:00.0 Off |                    0 |
| N/A   33C    P0    60W / 400W |      0MiB / 81251MiB |      0%      Default |
|                               |                      |             Disabled |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| MIG devices:                                                                |
+-----------------------------------------------------------------------------+
| GPU  GI  CI  MIG |         Memory-Usage |        Vol|      Shared           |
|      ID  ID  Dev |           BAR1-Usage | SM     Unc| CE ENC DEC OFA JPG    |
|                  |                      |        ECC|                       |
|==================+======================+===========+=======================|
|  No MIG devices found                                                       |
+-----------------------------------------------------------------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+
```
Code block. nvidia-smi command - Check GPU status
3.2.3 - Using NVSwitch on GPU Server
After creating a GPU Server, you can enable the NVSwitch feature in the GPU Server's VM (Guest OS) to use fast P2P (GPU-to-GPU) communication between GPUs.
Exploring NVIDIA NVSwitch for Multi GPU
The NVIDIA A100 GPU server is a multi-GPU system based on the NVIDIA Ampere architecture, with eight A100 80 GB GPUs installed on the baseboard. The GPUs on the baseboard are connected to six NVSwitches via NVLink ports, so communication between GPUs on the baseboard uses the full 600 GBps of bandwidth. As a result, the eight GPUs installed on the A100 GPU server can be connected and operated as if they were one, maximizing GPU-to-GPU throughput.
- NVLink (25 GBps) 12-lane, 8-GPU configuration diagram
- NVSwitch (600 GBps) 6-unit, 8-GPU configuration diagram
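The figures above line up arithmetically: each NVLink lane carries 25 GBps per direction, and 12 lanes per GPU yield the quoted 600 GBps of total bidirectional bandwidth. A quick sanity check of that arithmetic:

```shell
# Per-GPU NVLink bandwidth on A100 (sketch based on the figures above):
# 12 NVLink lanes x 25 GBps per direction x 2 directions = 600 GBps total
lanes=12
per_lane_gbps=25
total=$((lanes * per_lane_gbps * 2))
echo "${total} GBps"   # prints "600 GBps"
```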
Create GPU NVSwitch
To use the GPU NVSwitch feature, create a GPU Server service on the Samsung Cloud Platform, create a VM Instance (Guest OS) with 8 A100 GPUs assigned, and activate Fabric Manager.
- NVSwitch can only be activated and used for products with 8 A100 GPUs assigned to a single GPU server (g1v128a8 (vCPU 128 | Memory 1920G | A100(80GB)*8)).
- Currently, GPU Server created with Windows OS does not support NVSwitch (Fabricmanager).
NVSwitch Installation and Operation Check (Fabric Manager Activation)
To operate NVSwitch, install Fabric Manager on the GPU server following the procedure below.
Install the NVIDIA GPU Driver (470 series) on the GPU server.
```shell
$ add-apt-repository ppa:graphics-drivers/ppa
$ apt-get update
$ apt-get install nvidia-driver-470-server
```
Code block. NVIDIA GPU Driver installation

Install and run NVIDIA Fabric Manager (470 version) on the GPU server (for NVSwitch).
```shell
$ apt-get install cuda-drivers-fabricmanager-470
$ systemctl enable nvidia-fabricmanager
$ systemctl start nvidia-fabricmanager
```
Code block. NVIDIA Fabric Manager installation and startup

Check the status of NVIDIA Fabric Manager running on the GPU server.
- Normal operation is indicated by active (running).
```shell
$ systemctl status nvidia-fabricmanager
```
Code block. Check NVIDIA Fabric Manager operation status
- Check the NVSwitch operation status on the GPU server.
- Normal operation is indicated by NV12 in the topology matrix.
```shell
$ nvidia-smi topo --matrix
```
Code block. NVSwitch operation status check
3.2.4 - Keypair Management
Through the Samsung Cloud Platform Console, users can enter the required information for a Keypair within the GPU Server service, select detailed options, and create the Keypair.
Keypair Creation
You can create and use a Keypair while using the GPU Server service on the Samsung Cloud Platform Console.
To create a keypair, follow these steps.
- Click the All Services > Compute > GPU Server menu to go to the GPU Server Service Home page.
- On the Service Home page, click the Keypair menu to go to the Keypair List page.
- On the Keypair List page, click the Create Keypair button to go to the Create Keypair page.
- Enter the required information in the Service Information area.
| Category | Required | Detailed description |
|---|---|---|
| Keypair name | Required | Name of the Keypair to create; enter up to 255 characters using English letters, numbers, spaces, and the special characters `-`, `_` |
| Keypair type | Required | ssh |

Table. Keypair Service Information input items
- Enter or select the required information in the Additional Information area.
| Category | Required | Detailed description |
|---|---|---|
| Tag | Optional | Add tags; up to 50 can be added per resource. After clicking the Add Tag button, enter or select Key and Value values |

Table. Keypair Additional Information input items

Caution
- After creation is complete, you can download the Key only once. Reissuance is not possible, so make sure it has been downloaded.
- Save the downloaded Private Key in a safe place.
- Check the input information and click the Complete button.
- When creation is complete, check the created resource on the Keypair List page.
Check Keypair Detailed Information
The Keypair service lets you view and edit the full resource list and detailed information. The Keypair Details page consists of the Details, Tags, and Activity History tabs.
To view detailed information about the Keypair, follow the steps below.
- Click the All Services > Compute > GPU Server menu to go to the GPU Server Service Home page.
- On the Service Home page, click the Keypair menu to go to the Keypair List page.
- On the Keypair List page, click the resource whose details you want to view to go to the Keypair Details page.
- The Keypair Details page displays status information and additional feature information, and consists of the Details, Tags, and Activity History tabs.
Detailed Information
On the Keypair List page, you can view detailed information of the selected resource and, if necessary, edit the information.
| Category | Detailed description |
|---|---|
| Service | Service Name |
| Resource Type | Resource Type |
| SRN | Unique resource ID in Samsung Cloud Platform |
| Resource Name | Keypair Name |
| Resource ID | Keypair’s unique resource ID |
| Creator | User who created the Keypair |
| Creation time | Time when the keypair was created |
| Editor | User who modified the Keypair information |
| Modification Date/Time | Timestamp of Keypair information modification |
| Keypair name | Keypair name |
| Fingerprint | Unique value for identifying a Key |
| User ID | User ID of the user who created the Keypair |
| Public Key | Public Key Information |
Tag
The Keypair List page lets you view the tag information of the selected resource and add, modify, or delete tags.
| Category | Detailed description |
|---|---|
| Tag List | List of tags attached to the resource |
Activity History
On the Keypair list page, you can view the operation history of the selected resource.
| Category | Detailed description |
|---|---|
| Work History List | Resource change history |
Keypair Resource Management
This section describes the control and management functions of the Keypair.
Import Public Key
To import a public key, follow the steps below.
Click the All Services > Compute > GPU Server menu. Go to the GPU Server’s Service Home page.
On the Service Home page, click the Keypair menu to go to the Keypair List page.
On the Keypair List page, click the More button at the top, then click the Import Public Key button to go to the Import Public Key page.
- Enter or select the required information in the Required Information area.
| Category | Required | Detailed description |
|---|---|---|
| Keypair name | Required | Name of the Keypair to create |
| Keypair type | Required | ssh |
| Public Key | Required | Load File: click the Attach File button to select and attach the public key file; only files with the .pem extension can be attached. Enter Public Key: paste the copied public key value; the value can be copied from the Keypair Details page |

Table. Required input items for importing a public key
Check the entered information and click the Complete button.
- Once creation is complete, check the created resource on the Keypair List page.
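If you only have the downloaded private key on hand, you can derive the matching public key locally before pasting it into the Enter Public Key field; a sketch using OpenSSH's ssh-keygen (the file name is a hypothetical placeholder):

```shell
# Derive the public key from a downloaded private key (hypothetical file name)
ssh-keygen -y -f my-keypair.pem > my-keypair.pub

# Print the value to paste into the Enter Public Key field
cat my-keypair.pub
```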
Delete Keypair
You can delete unused Keypairs. However, once a Keypair is deleted it cannot be recovered, so please review the impact thoroughly in advance before proceeding with deletion.
To delete a keypair, follow the steps below.
- Click the All Services > Compute > GPU Server menu to go to the GPU Server Service Home page.
- On the Service Home page, click the Keypair menu to go to the Keypair List page.
- On the Keypair list page, select the resource to delete and click the Delete button.
- To delete several at once, select multiple Keypair checkboxes on the Keypair List page and click the Delete button at the top of the resource list.
- After deletion is complete, check on the Keypair List page whether the resource has been deleted.
3.2.5 - ServiceWatch Agent Installation
Users can install ServiceWatch Agent on GPU Server to collect custom metrics and logs.
ServiceWatch Agent
Two main types of Agent must be installed to collect ServiceWatch custom metrics and logs on GPU Server: Prometheus Exporter and OpenTelemetry Collector.
| Category | Detailed Description |
|---|---|
| Prometheus Exporter | Provides metrics of specific applications or services in a format that Prometheus can scrape |
| OpenTelemetry Collector | A centralized collector that gathers telemetry data such as metrics and logs from distributed systems, processes it (filtering, sampling, etc.), and sends it to multiple backends (e.g., Prometheus, Jaeger, Elasticsearch) |
If Kubernetes Engine is configured on GPU Server, please check GPU metrics through the metrics provided by Kubernetes Engine.
- If you install DCGM Exporter on a GPU Server with Kubernetes Engine configured, it may not work properly.
Prerequisites for Using ServiceWatch Agent
Step 1. Check NVIDIA Driver installation
Install Prometheus Exporter for GPU Metrics (for Ubuntu)
Install the Prometheus Exporter to collect GPU Server metrics in the following order.
Check NVIDIA Driver Installation
- Check the installed NVIDIA Driver.

```shell
nvidia-smi --query-gpu driver_version --format csv
```

Example output:

```
driver_version
535.183.06
...
535.183.06
```
Install NVSwitch Configuration and Query (NSCQ) Library
Install cuda-keyring.

```shell
wget https://developer.download.nvidia.com/compute/cuda/repos/<distro>/<arch>/cuda-keyring_1.1-1_all.deb
sudo dpkg -i cuda-keyring_1.1-1_all.deb
apt update
```

Check the available libnvidia-nscq versions.

```shell
apt-cache policy libnvidia-nscq-535
```

Example output:

```
libnvidia-nscq-535:
  Installed: (none)
  Candidate: 535.247.01-1
  Version table:
     535.247.01-1 600
        600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64  Packages
     ...
     535.183.06-1 600    # Install the version that matches the Driver
        600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64  Packages
     ...
```

Install libnvidia-nscq.

```shell
apt install libnvidia-nscq-535=535.183.06-1
```

Notice: You must install the same version as the NVIDIA Driver version.
- Example) driver version: 535.183.06, libnvidia-nscq version: 535.183.06-1
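The version-matching rule above can be scripted; this is a minimal sketch that derives the matching apt package spec from the driver version (the version string here is an example value; on a real server, take it from the nvidia-smi output):

```shell
# Example driver version; on the server, obtain it from:
#   nvidia-smi --query-gpu driver_version --format csv
DRIVER_VERSION="535.183.06"
BRANCH=${DRIVER_VERSION%%.*}                       # driver branch, e.g. 535
PKG="libnvidia-nscq-${BRANCH}=${DRIVER_VERSION}-1" # apt name=version spec
echo "$PKG"
# apt install "$PKG"   # run this step on the GPU Server itself
```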
Install NVSwitch Device Monitoring API (NVSDM) Library
- Check the available NVSDM library versions.

```shell
apt-cache policy libnvsdm
```

Example output:

```
libnvsdm:
  Installed: (none)
  Candidate: 580.105.08-1
  Version table:
     580.105.08-1 600
        600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64  Packages
     580.95.05-1 600
        600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64  Packages
     580.82.07-1 600
        600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64  Packages
     580.65.06-1 600
        600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64  Packages
```

- Install the NVSDM library.

```shell
apt install libnvsdm=580.105.08-1
```
Install NVIDIA DCGM (for Ubuntu)
Install DCGM Exporter in the following order.
- Install DCGM (datacenter-gpu-manager)
- Install datacenter-gpu-manager-exporter
- Enable and start the DCGM service
- For metrics that can be collected with the GPU DCGM Exporter and configuration methods, refer to DCGM Exporter Metrics.
Install DCGM (datacenter-gpu-manager)
The package name datacenter-gpu-manager-4-cuda12 refers to a specific build of NVIDIA's Data Center GPU Manager (DCGM), a package for managing and monitoring NVIDIA data center GPUs: cuda12 indicates a build for CUDA 12, and datacenter-gpu-manager-4 indicates DCGM version 4.x. The tool provides various functions including GPU status monitoring, diagnostics, an alert system, and power/clock management.
- Check the CUDA version.

```shell
nvidia-smi | grep CUDA
```

Example output:

```
| NVIDIA-SMI 535.183.06    Driver Version: 535.183.06    CUDA Version: 12.2 |
```

- Set the CUDA version.

```shell
CUDA_VERSION=12
```

- Install datacenter-gpu-manager-cuda.

```shell
apt install datacenter-gpu-manager-4-cuda${CUDA_VERSION}
```
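The version check and variable assignment above can also be chained non-interactively. This sketch extracts the CUDA major version from a captured nvidia-smi banner line (the banner text below is a sample; on a real server, capture it with `nvidia-smi | grep CUDA`):

```shell
# Sample banner line; on the server: line=$(nvidia-smi | grep CUDA)
line='| NVIDIA-SMI 535.183.06    Driver Version: 535.183.06    CUDA Version: 12.2 |'
# Keep only the CUDA major version (e.g. 12) for the package name.
CUDA_VERSION=$(printf '%s\n' "$line" | sed -E 's/.*CUDA Version: ([0-9]+)\..*/\1/')
echo "datacenter-gpu-manager-4-cuda${CUDA_VERSION}"
# apt install "datacenter-gpu-manager-4-cuda${CUDA_VERSION}"   # run on the server
```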
Install datacenter-gpu-manager-exporter
This is a tool that collects various GPU metrics such as GPU usage, memory usage, temperature, and power consumption based on NVIDIA Data Center GPU Manager (DCGM) and exposes them for use in monitoring systems like Prometheus.
- Install datacenter-gpu-manager-exporter.

```shell
apt install datacenter-gpu-manager-exporter
```

- Check the DCGM Exporter configuration file.

```shell
cat /usr/lib/systemd/system/nvidia-dcgm-exporter.service | grep ExecStart
```

Example output:

```
ExecStart=/usr/bin/dcgm-exporter -f /etc/dcgm-exporter/default-counters.csv
```

- Check the configuration provided at DCGM Exporter installation: remove `#` for necessary metrics and add `#` for unnecessary metrics.

```shell
vi /etc/dcgm-exporter/default-counters.csv
```

```
## Example ##
...
DCGM_FI_PROF_PIPE_TENSOR_ACTIVE, gauge, Ratio of cycles the tensor (HMMA) pipe is active.
DCGM_FI_PROF_DRAM_ACTIVE, gauge, Ratio of cycles the device memory interface is active sending or receiving data.
# DCGM_FI_PROF_PIPE_FP64_ACTIVE, gauge, Ratio of cycles the fp64 pipes are active.
# DCGM_FI_PROF_PIPE_FP32_ACTIVE, gauge, Ratio of cycles the fp32 pipes are active.
...
```

Reference: Step 1. Check NVIDIA Driver installation
Caution: Metric collection through ServiceWatch Agent is classified as custom metrics and, unlike the basic metrics collected from each service, incurs fees. Remove or disable unnecessary metric collection to avoid excessive charges.
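Instead of editing the file in vi, the same comment toggling can be scripted with sed. This is a sketch on a temporary copy containing two sample lines (the real file is /etc/dcgm-exporter/default-counters.csv):

```shell
# Work on a throwaway copy with two sample counter lines.
tmp=$(mktemp)
cat > "$tmp" <<'EOF'
DCGM_FI_DEV_GPU_UTIL, gauge, GPU utilization (in %).
# DCGM_FI_PROF_PIPE_FP64_ACTIVE, gauge, Ratio of cycles the fp64 pipes are active.
EOF
# Enable a metric: strip the leading '# ' from its line.
sed -i 's/^# \(DCGM_FI_PROF_PIPE_FP64_ACTIVE\)/\1/' "$tmp"
# Disable a metric: prepend '# ' to its line.
sed -i 's/^\(DCGM_FI_DEV_GPU_UTIL\)/# \1/' "$tmp"
cat "$tmp"
```

Restart nvidia-dcgm-exporter after changing the real file so the new metric set takes effect.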
Enable and Start DCGM Service
Enable and start the nvidia-dcgm service.

```shell
systemctl enable --now nvidia-dcgm
```

Enable and start the nvidia-dcgm-exporter service.

```shell
systemctl enable --now nvidia-dcgm-exporter
```
For details, refer to ServiceWatch > Use ServiceWatch Agent.
Install Prometheus Exporter for GPU Metrics (for RHEL)
Install the Prometheus Exporter to collect GPU Server metrics in the following order.
Check NVIDIA Driver Installation (for RHEL)
- Check the installed NVIDIA Driver.

```shell
nvidia-smi --query-gpu driver_version --format csv
```

Example output:

```
driver_version
535.183.06
...
535.183.06
```
Install NVSwitch Configuration and Query (NSCQ) Library (for RHEL)
NVSwitch Configuration and Query (NSCQ) Library is required for Hopper or earlier Generation GPUs.
- For RHEL, check if libnvidia-nscq is installed and then install it.

Check the libnvidia-nscq package.

```shell
rpm -qa | grep libnvidia-nscq
```

Example output:

```
libnvidia-nscq-535-535.183.06-1.x86_64
```

Add the CUDA repository to DNF.

```shell
dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel8/x86_64/cuda-rhel8.repo
```

Reset the NVIDIA Driver module status.

```shell
dnf module reset nvidia-driver
```

Example output:

```
Updating Subscription Management repositories.
Last metadata expiration check: 0:03:15 ago on Wed 19 Nov 2025 01:23:48 AM EST.
Dependencies resolved.
=============================================
 Package    Architecture    Version    Repository    Size
=============================================
Disabling module profiles:
 nvidia-driver/default
 nvidia-driver/fm
Resetting modules:
 nvidia-driver

Transaction Summary
=============================================
Is this ok [y/N]: y
```

Enable the NVIDIA Driver module.

```shell
dnf module enable nvidia-driver:535-open
```

Example output:

```
Updating Subscription Management repositories.
Last metadata expiration check: 0:04:22 ago on Wed 19 Nov 2025 01:23:48 AM EST.
Dependencies resolved.
=============================================
 Package    Architecture    Version    Repository    Size
=============================================
Enabling module streams:
 nvidia-driver    535-open

Transaction Summary
=============================================
Is this ok [y/N]: y
```

Check the libnvidia-nscq module list.

```shell
dnf list libnvidia-nscq-535 --showduplicates
```

Install libnvidia-nscq.

```shell
dnf install libnvidia-nscq-535-535.183.06-1
```
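As on Ubuntu, the installed package version must match the NVIDIA driver version. A sketch of deriving the dnf package spec (note the `-` separator in the dnf spec, versus `=` for apt; the driver version shown is an example):

```shell
DRIVER_VERSION="535.183.06"                        # example; take from nvidia-smi
BRANCH=${DRIVER_VERSION%%.*}                       # driver branch, e.g. 535
PKG="libnvidia-nscq-${BRANCH}-${DRIVER_VERSION}-1" # dnf name-version spec
echo "$PKG"
# dnf install "$PKG"   # run this step on the GPU Server itself
```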
Install NVSwitch Device Monitoring API (NVSDM) Library (for RHEL)
Check the NVSDM library module list.

```shell
dnf list libnvsdm --showduplicates
```

Example output:

```
Available Packages
libnvsdm.x86_64    580.65.06-1     cuda-rhel8-x86_64
libnvsdm.x86_64    580.82.07-1     cuda-rhel8-x86_64
libnvsdm.x86_64    580.95.05-1     cuda-rhel8-x86_64
libnvsdm.x86_64    580.105.08-1    cuda-rhel8-x86_64
```

Install libnvsdm.

```shell
dnf install libnvsdm-580.105.08-1
```

Example output:

```
Updating Subscription Management repositories.
Last metadata expiration check: 0:08:18 ago on Wed 19 Nov 2025 01:05:28 AM EST.
Dependencies resolved.
========================================================================
 Package             Architecture  Version       Repository                     Size
========================================================================
Installing:
 libnvsdm            x86_64        580.105.08-1  cuda-rhel8-x86_64              675 k
Installing dependencies:
 infiniband-diags    x86_64        48.0-1.el8    rhel-8-for-x86_64-baseos-rpms  323 k
 libibumad           x86_64        48.0-1.el8    rhel-8-for-x86_64-baseos-rpms   34 k

Transaction Summary
========================================================================
Install  3 Packages

Total download size: 1.0 M
Installed size: 3.2 M
Is this ok [y/N]: y
```
Install NVIDIA DCGM (for RHEL)
Install DCGM Exporter in the following order.
- Install DCGM (datacenter-gpu-manager)
- Install datacenter-gpu-manager-exporter
- Enable and start the DCGM service
- For metrics that can be collected with the GPU DCGM Exporter and configuration methods, refer to DCGM Exporter Metrics.
Install DCGM (datacenter-gpu-manager) (for RHEL)
The package name datacenter-gpu-manager-4-cuda12 refers to a specific build of NVIDIA's Data Center GPU Manager (DCGM), a package for managing and monitoring NVIDIA data center GPUs: cuda12 indicates a build for CUDA 12, and datacenter-gpu-manager-4 indicates DCGM version 4.x. The tool provides various functions including GPU status monitoring, diagnostics, an alert system, and power/clock management.
- Add the CUDA repository to DNF.

```shell
dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel8/x86_64/cuda-rhel8.repo
```

- Check the CUDA version.

```shell
nvidia-smi | grep CUDA
```

Example output:

```
| NVIDIA-SMI 535.183.06    Driver Version: 535.183.06    CUDA Version: 12.2 |
```

- Set the CUDA version.

```shell
CUDA_VERSION=12
```

- Check the datacenter-gpu-manager-cuda module list.

```shell
dnf list datacenter-gpu-manager-4-cuda${CUDA_VERSION} --showduplicates
```

Example output:

```
Updating Subscription Management repositories.
Unable to read consumer identity
This system is not registered with an entitlement server. You can use subscription-manager to register.
Last metadata expiration check: 0:00:34 ago on Wed 19 Nov 2025 12:26:56 AM EST.
Available Packages
datacenter-gpu-manager-4-cuda12.x86_64    1:4.0.0-1    cuda-rhel8-x86_64
datacenter-gpu-manager-4-cuda12.x86_64    1:4.1.0-1    cuda-rhel8-x86_64
datacenter-gpu-manager-4-cuda12.x86_64    1:4.1.1-1    cuda-rhel8-x86_64
datacenter-gpu-manager-4-cuda12.x86_64    1:4.2.0-1    cuda-rhel8-x86_64
datacenter-gpu-manager-4-cuda12.x86_64    1:4.2.2-1    cuda-rhel8-x86_64
datacenter-gpu-manager-4-cuda12.x86_64    1:4.2.3-1    cuda-rhel8-x86_64
datacenter-gpu-manager-4-cuda12.x86_64    1:4.2.3-2    cuda-rhel8-x86_64
datacenter-gpu-manager-4-cuda12.x86_64    1:4.3.0-1    cuda-rhel8-x86_64
datacenter-gpu-manager-4-cuda12.x86_64    1:4.3.1-1    cuda-rhel8-x86_64
datacenter-gpu-manager-4-cuda12.x86_64    1:4.4.0-1    cuda-rhel8-x86_64
datacenter-gpu-manager-4-cuda12.x86_64    1:4.4.1-1    cuda-rhel8-x86_64
datacenter-gpu-manager-4-cuda12.x86_64    1:4.4.2-1    cuda-rhel8-x86_64
```

- Install datacenter-gpu-manager-cuda.

```shell
dnf install datacenter-gpu-manager-4-cuda${CUDA_VERSION}
```

Example output:

```
Updating Subscription Management repositories.
Unable to read consumer identity
This system is not registered with an entitlement server. You can use subscription-manager to register.
Last metadata expiration check: 0:07:12 ago on Wed 19 Nov 2025 12:26:56 AM EST.
Dependencies resolved.
===================================================================================================
 Package                                      Architecture  Version    Repository         Size
===================================================================================================
Installing:
 datacenter-gpu-manager-4-cuda12              x86_64        1:4.4.2-1  cuda-rhel8-x86_64  554 M
Installing dependencies:
 datacenter-gpu-manager-4-core                x86_64        1:4.4.2-1  cuda-rhel8-x86_64  9.9 M
Installing weak dependencies:
 datacenter-gpu-manager-4-proprietary         x86_64        1:4.4.2-1  cuda-rhel8-x86_64  5.3 M
 datacenter-gpu-manager-4-proprietary-cuda12  x86_64        1:4.4.2-1  cuda-rhel8-x86_64  289 M

Transaction Summary
====================================================================================================
Install  4 Packages
...
Is this ok [y/N]: y
```
Install datacenter-gpu-manager-exporter (for RHEL)
This is a tool that collects various GPU metrics such as GPU usage, memory usage, temperature, and power consumption based on NVIDIA Data Center GPU Manager (DCGM) and exposes them for use in monitoring systems like Prometheus.
Add the CUDA repository to DNF. (If you have already run this command, proceed to the next step.)

```shell
dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel8/x86_64/cuda-rhel8.repo
```

Check the CUDA version and set it. (If you have already done this, proceed to the next step.)

```shell
nvidia-smi | grep CUDA
```

Example output:

```
| NVIDIA-SMI 535.183.06    Driver Version: 535.183.06    CUDA Version: 12.2 |
```

```shell
CUDA_VERSION=12
```

Check the datacenter-gpu-manager-exporter module list.

```shell
dnf list datacenter-gpu-manager-exporter --showduplicates
```

Example output:

```
Updating Subscription Management repositories.
Unable to read consumer identity
This system is not registered with an entitlement server. You can use subscription-manager to register.
Last metadata expiration check: 0:02:11 ago on Wed 19 Nov 2025 12:26:56 AM EST.
Available Packages
datacenter-gpu-manager-exporter.x86_64    4.0.1-1    cuda-rhel8-x86_64
datacenter-gpu-manager-exporter.x86_64    4.1.0-1    cuda-rhel8-x86_64
datacenter-gpu-manager-exporter.x86_64    4.1.1-1    cuda-rhel8-x86_64
datacenter-gpu-manager-exporter.x86_64    4.1.3-1    cuda-rhel8-x86_64
datacenter-gpu-manager-exporter.x86_64    4.5.0-1    cuda-rhel8-x86_64
datacenter-gpu-manager-exporter.x86_64    4.5.1-1    cuda-rhel8-x86_64
datacenter-gpu-manager-exporter.x86_64    4.5.2-1    cuda-rhel8-x86_64
datacenter-gpu-manager-exporter.x86_64    4.6.0-1    cuda-rhel8-x86_64
```

Install datacenter-gpu-manager-exporter. dcgm-exporter 4.5.x requires glibc 2.34 or higher, which RHEL 8 does not provide, so specify version 4.1.3-1 to install.

```shell
dnf install datacenter-gpu-manager-exporter-4.1.3-1
```

Example output:

```
Updating Subscription Management repositories.
Unable to read consumer identity
This system is not registered with an entitlement server. You can use subscription-manager to register.
Last metadata expiration check: 0:07:12 ago on Wed 19 Nov 2025 12:26:56 AM EST.
Dependencies resolved.
====================================================================================================
 Package                          Architecture  Version  Repository         Size
====================================================================================================
Installing:
 datacenter-gpu-manager-exporter  x86_64        4.1.3-1  cuda-rhel8-x86_64  26 M
...
Is this ok [y/N]: y
```

Check the DCGM Exporter configuration file.

```shell
cat /usr/lib/systemd/system/nvidia-dcgm-exporter.service | grep ExecStart
```

Example output:

```
ExecStart=/usr/bin/dcgm-exporter -f /etc/dcgm-exporter/default-counters.csv
```

Check the configuration provided at DCGM Exporter installation: remove `#` for necessary metrics and add `#` for unnecessary metrics.

```shell
vi /etc/dcgm-exporter/default-counters.csv
```

```
## Example ##
...
DCGM_FI_PROF_PIPE_TENSOR_ACTIVE, gauge, Ratio of cycles the tensor (HMMA) pipe is active.
DCGM_FI_PROF_DRAM_ACTIVE, gauge, Ratio of cycles the device memory interface is active sending or receiving data.
# DCGM_FI_PROF_PIPE_FP64_ACTIVE, gauge, Ratio of cycles the fp64 pipes are active.
# DCGM_FI_PROF_PIPE_FP32_ACTIVE, gauge, Ratio of cycles the fp32 pipes are active.
...
```

Reference: For metrics that can be collected with the GPU DCGM Exporter and configuration methods, refer to DCGM Exporter Metrics.
Caution: Metric collection through ServiceWatch Agent is classified as custom metrics and, unlike the basic metrics collected from each service, incurs fees. Remove or disable unnecessary metric collection to avoid excessive charges.
Enable and Start DCGM Service (for RHEL)
Enable and start the nvidia-dcgm service.

```shell
systemctl enable --now nvidia-dcgm
```

Enable and start the nvidia-dcgm-exporter service.

```shell
systemctl enable --now nvidia-dcgm-exporter
```
For details, refer to ServiceWatch > Use ServiceWatch Agent.
DCGM Exporter Metrics
DCGM Exporter Key Metrics
The key GPU metrics provided by DCGM Exporter are as follows.
| Category | DCGM Field | Prometheus Metric Type | Summary |
|---|---|---|---|
| Clocks | DCGM_FI_DEV_SM_CLOCK | gauge | SM clock frequency (in MHz) |
| Clocks | DCGM_FI_DEV_MEM_CLOCK | gauge | Memory clock frequency (in MHz) |
| Temperature | DCGM_FI_DEV_GPU_TEMP | gauge | GPU temperature (in C) |
| Power | DCGM_FI_DEV_POWER_USAGE | gauge | Power draw (in W) |
| Utilization | DCGM_FI_DEV_GPU_UTIL | gauge | GPU utilization (in %) |
| Utilization | DCGM_FI_DEV_MEM_COPY_UTIL | gauge | Memory utilization (in %) |
| Memory Usage | DCGM_FI_DEV_FB_FREE | gauge | Frame buffer memory free (in MiB) |
| Memory Usage | DCGM_FI_DEV_FB_USED | gauge | Frame buffer memory used (in MiB) |
| NVLink | DCGM_FI_DEV_NVLINK_BANDWIDTH_TOTAL (8 GPU only) | counter | Total number of NVLink bandwidth counters for all lanes |
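Once the exporter is running, these fields appear as plain-text Prometheus metrics. The sketch below parses a hypothetical scrape sample; on a real server, the data would come from the exporter's metrics endpoint (dcgm-exporter listens on port 9400 by default):

```shell
# Hypothetical scrape sample; on the server: scrape=$(curl -s localhost:9400/metrics)
scrape='DCGM_FI_DEV_SM_CLOCK{gpu="0"} 1410
DCGM_FI_DEV_GPU_TEMP{gpu="0"} 36
DCGM_FI_DEV_POWER_USAGE{gpu="0"} 71.5'
# Pull out the GPU temperature value (second whitespace-separated field).
TEMP=$(printf '%s\n' "$scrape" | awk '/^DCGM_FI_DEV_GPU_TEMP/{print $2}')
echo "$TEMP"
```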
DCGM Exporter Metrics Collection Configuration
For the metrics configured by default in DCGM Exporter, refer to DCGM Exporter > Default Metrics.
- For additional metrics to configure beyond the default settings, remove `#` from the corresponding line in default-counters.csv.
- For metrics you do not want to collect among the default configured metrics, add `#` or delete the item.
```
# Format
# If line starts with a '#' it is considered a comment
# DCGM FIELD, Prometheus metric type, help message

# Clocks
DCGM_FI_DEV_SM_CLOCK, gauge, SM clock frequency (in MHz).
DCGM_FI_DEV_MEM_CLOCK, gauge, Memory clock frequency (in MHz).

# Temperature
DCGM_FI_DEV_MEMORY_TEMP, gauge, Memory temperature (in C).
DCGM_FI_DEV_GPU_TEMP, gauge, GPU temperature (in C).

# Power
DCGM_FI_DEV_POWER_USAGE, gauge, Power draw (in W).
DCGM_FI_DEV_TOTAL_ENERGY_CONSUMPTION, counter, Total energy consumption since boot (in mJ).

# PCIE
# DCGM_FI_PROF_PCIE_TX_BYTES, counter, Total number of bytes transmitted through PCIe TX via NVML.
# DCGM_FI_PROF_PCIE_RX_BYTES, counter, Total number of bytes received through PCIe RX via NVML.
...
```

3.3 - API Reference
3.4 - CLI Reference
3.5 - Release Note
GPU Server
- ServiceWatch service integration is provided
- You can monitor data through the ServiceWatch service.
- You can select a RHEL image when creating a GPU Server.
- Keypair management feature has been added.
- You can create a keypair to use, or retrieve a public key and apply it.
- GPU Server features have been added
- IP, Public NAT IP, and Private NAT IP configuration has been added.
- An LLM Endpoint is provided for LLM usage.
- The method of sharing images between accounts has been changed.
- You can create a new shared Image and share it.
- GPU Server usage guides have been added
- The guides Using Multi-instance GPU on GPU Server and Using NVSwitch on GPU Server have been added.
- A GPU Server RHEL OS and GPU driver version have been added.
- GPU Server features have been added
- A NAT setting feature has been added to GPU Server.
- Samsung Cloud Platform common feature changes
- Common CX changes to Account, IAM, Service Home, tags, and more have been applied.
- GPU Server service has been officially launched.
- We have launched a virtualized computing service that lets you allocate and use server infrastructure resources such as CPU, GPU, and memory as needed, when needed, without purchasing them individually.
4 - Bare Metal Server
4.1 - Overview
Service Overview
Bare Metal Server is a high-performance cloud computing service that does not use virtualization technology and lets you allocate and use physically separated computing resources such as CPU and memory individually. Because it is not affected by other cloud users, you can reliably operate performance-sensitive services.
Features
Easy and convenient computing environment setup: Through the web-based console, you can easily use everything from Bare Metal Server provisioning to resource management and cost management. You can receive a server with standard specs (CPU, Memory, Disk) allocated exclusively and use it immediately.
Providing High-Performance Computing Environment: We provide servers suitable for workloads that require large capacity and high performance, such as real-time systems, HPC (High Performance Computing), and servers with heavy I/O demands, in a physically isolated environment.
Efficient Service Provision: We ensure performance and stability through optimal server selection and in-house testing. Customers can select the optimal resources for their service environment through the various specifications of Bare Metal Servers offered by Samsung Cloud Platform.
Service Diagram
Provided Features
Bare Metal Server provides the following features.
- Auto Provisioning and Management: Through the web-based console, you can easily use everything from Bare Metal Server provisioning to resource management and cost management.
- Providing various types of server types and OS images: Provides CPU/Memory/Disk resources of standard server types, and offers various standard OS images.
- Storage Connection: Provides additional connected storage besides the OS disk. You can connect and use Block Storage, File Storage, and Object Storage.
- Network Connection: You can configure the Bare Metal Server's general subnet/IP settings and a Public NAT IP. A local subnet connection is provided for communication between servers. These settings can be modified on the detail page.
- Monitoring: You can view monitoring information such as CPU, Memory, Disk, which are computing resources, through Cloud Monitoring. To use the Cloud Monitoring service of Bare Metal Server, you need to install the Agent. Please be sure to install the Agent for stable Bare Metal Server service usage. For more details, refer to Bare Metal Server Monitoring Metrics.
- Backup and Recovery: Bare Metal Server’s Filesystem backup and recovery can be used through the Backup service.
- Efficient Cost Management: You can easily create/terminate servers as needed, and because billing is based on actual usage time, you can use it cost-effectively in various unpredictable situations.
- Local Disk Partition Creation: You can create and use up to 10 local disk partitions.
- Terraform Support: Provides an IaC environment through Terraform.
Components
Bare Metal Server provides various OS standard images and standard server types. Users can select and use them according to the scale of the service they want to configure.
OS Image Provided Version
The OS images supported by Bare Metal Server are as follows.
| OS Image Version | EoS Date |
|---|---|
| Oracle Linux 9.6 | 2032-06-30 |
| RHEL 8.10 | 2029-05-31 |
| RHEL 9.4 | 2026-04-30 |
| RHEL 9.6 | 2027-05-31 |
| Rocky Linux 8.10 | 2029-05-31 |
| Rocky Linux 9.6 | 2025-11-30 |
| Ubuntu 22.04 | 2027-06-30 |
| Ubuntu 24.04 | 2029-06-30 |
| Windows 2019 | 2029-01-09 |
| Windows 2022 | 2031-10-14 |
Server Type
The server types supported by Bare Metal Server are as follows. For more details about the server types supported by Bare Metal Server, see Bare Metal Server Server Type.
s3v16m64_metal
| Category | Example | Detailed description |
|---|---|---|
| Server Generation | s3 | Provided server classification and generation |
| CPU vCore | v16 | vCore count |
| Memory | m64 | Memory capacity |
Preceding Service
This is a list of services that must be pre-configured before creating the service. Please refer to the guide provided for each service for details and prepare in advance.
| Service Category | Service | Detailed Description |
|---|---|---|
| Networking | VPC | A service that provides an independent virtual network in a cloud environment |
4.1.1 - Server Type
Bare Metal Server Server Types
Bare Metal Server provides server types according to usage purposes. Server types are configured with various combinations of CPU, Memory, etc. The server used for Bare Metal Server is determined by the server type selected when creating Bare Metal Server. Please select a server type according to the specifications of the application you want to run on Bare Metal Server.
The server types supported by Bare Metal Server follow the format below.
s3v16m64_metal
| Category | Example | Description |
|---|---|---|
| Server Generation | s3 | Server category and generation provided |
| CPU vCore | v16 | Number of vCores |
| Memory | m64 | Memory capacity |
s4/h4 Server Types
The Bare Metal Server s4 server type provides standard specifications (vCPU, Memory) and, because its resources are physically isolated, is suitable for high-performance applications. The h4 server type provides high-capacity server specifications and is suitable for high-performance, large-scale data processing applications.
- Supports 5 types of vCPU in total (16, 32, 64, 96, 128 vCore)
- Intel 6th generation (Granite Rapids) Processor
- Supports up to 64 physical cores, 128 vCPUs, and 2,048 GB of memory
- Provides up to 2 x 1.92 TB Internal Disks (for OS)
| Server Type | Physical CPU | vCPU | Memory | CPU Type | Internal Disk (OS) |
|---|---|---|---|---|---|
| s4v16m64_metal | 8 Core | 16 vCore | 64 GB | Intel Xeon 6507P up to 4.3GHz | 480 GB * 2EA |
| s4v16m128_metal | 8 Core | 16 vCore | 128 GB | Intel Xeon 6507P up to 4.3GHz | 480 GB * 2EA |
| s4v16m256_metal | 8 Core | 16 vCore | 256 GB | Intel Xeon 6507P up to 4.3GHz | 480 GB * 2EA |
| h4v32m128_metal | 16 Core | 32 vCore | 128 GB | Intel Xeon 6517P up to 4.0GHz | 960 GB * 2EA |
| h4v32m256_metal | 16 Core | 32 vCore | 256 GB | Intel Xeon 6517P up to 4.0GHz | 960 GB * 2EA |
| h4v32m512_metal | 16 Core | 32 vCore | 512 GB | Intel Xeon 6517P up to 4.0GHz | 960 GB * 2EA |
| h4v64m256_metal | 32 Core | 64 vCore | 256 GB | Intel Xeon 6737P up to 4.0GHz | 1.92 TB * 2EA |
| h4v64m512_metal | 32 Core | 64 vCore | 512 GB | Intel Xeon 6737P up to 4.0GHz | 1.92 TB * 2EA |
| h4v64m1024_metal | 32 Core | 64 vCore | 1024 GB | Intel Xeon 6737P up to 4.0GHz | 1.92 TB * 2EA |
| h4v96m512_metal | 48 Core | 96 vCore | 512 GB | Intel Xeon 6520P up to 3.4GHz | 1.92 TB * 2EA |
| h4v96m768_metal | 48 Core | 96 vCore | 768 GB | Intel Xeon 6520P up to 3.4GHz | 1.92 TB * 2EA |
| h4v96m2048_metal | 48 Core | 96 vCore | 2048 GB | Intel Xeon 6520P up to 3.4GHz | 1.92 TB * 2EA |
| h4v128m512_metal | 64 Core | 128 vCore | 512 GB | Intel Xeon 6737P up to 4.0GHz | 1.92 TB * 2EA |
| h4v128m1024_metal | 64 Core | 128 vCore | 1024 GB | Intel Xeon 6737P up to 4.0GHz | 1.92 TB * 2EA |
| h4v128m2048_metal | 64 Core | 128 vCore | 2048 GB | Intel Xeon 6737P up to 4.0GHz | 1.92 TB * 2EA |
s3/h3 Server Types
The Bare Metal Server s3 server type provides standard specifications (vCPU, Memory) and, because its resources are physically isolated, is suitable for high-performance applications. The h3 server type provides high-capacity server specifications and is suitable for high-performance, large-scale data processing applications.
- Supports 5 types of vCPU in total (16, 32, 64, 96, 128 vCore)
- Intel 4th generation (Sapphire Rapids) Processor
- Supports up to 64 physical cores, 128 vCPUs, and 2,048 GB of memory
- Provides up to 2 x 1.92 TB Internal Disks (for OS)
| Server Type | Physical CPU | vCPU | Memory | CPU Type | Internal Disk (OS) |
|---|---|---|---|---|---|
| s3v16m64_metal | 8 Core | 16 vCore | 64 GB | Intel Xeon Gold 6434 up to 4.1GHz | 480 GB * 2EA |
| s3v16m128_metal | 8 Core | 16 vCore | 128 GB | Intel Xeon Gold 6434 up to 4.1GHz | 480 GB * 2EA |
| s3v16m256_metal | 8 Core | 16 vCore | 256 GB | Intel Xeon Gold 6434 up to 4.1GHz | 480 GB * 2EA |
| h3v32m128_metal | 16 Core | 32 vCore | 128 GB | Intel Xeon Gold 6444Y up to 4.0GHz | 960 GB * 2EA |
| h3v32m256_metal | 16 Core | 32 vCore | 256 GB | Intel Xeon Gold 6444Y up to 4.0GHz | 960 GB * 2EA |
| h3v32m512_metal | 16 Core | 32 vCore | 512 GB | Intel Xeon Gold 6444Y up to 4.0GHz | 960 GB * 2EA |
| h3v64m256_metal | 32 Core | 64 vCore | 256 GB | Intel Xeon Gold 6448H up to 3.2GHz | 1.92 TB * 2EA |
| h3v64m512_metal | 32 Core | 64 vCore | 512 GB | Intel Xeon Gold 6448H up to 3.2GHz | 1.92 TB * 2EA |
| h3v64m1024_metal | 32 Core | 64 vCore | 1024 GB | Intel Xeon Gold 6448H up to 3.2GHz | 1.92 TB * 2EA |
| h3v96m384_metal | 48 Core | 96 vCore | 384 GB | Intel Xeon Gold 6442Y up to 3.3GHz | 1.92 TB * 2EA |
| h3v96m768_metal | 48 Core | 96 vCore | 768 GB | Intel Xeon Gold 6442Y up to 3.3GHz | 1.92 TB * 2EA |
| h3v96m1536_metal | 48 Core | 96 vCore | 1536 GB | Intel Xeon Gold 6442Y up to 3.3GHz | 1.92 TB * 2EA |
| h3v128m512_metal | 64 Core | 128 vCore | 512 GB | Intel Xeon Gold 6448H up to 3.2GHz | 1.92 TB * 2EA |
| h3v128m1024_metal | 64 Core | 128 vCore | 1024 GB | Intel Xeon Gold 6448H up to 3.2GHz | 1.92 TB * 2EA |
| h3v128m2048_metal | 64 Core | 128 vCore | 2048 GB | Intel Xeon Gold 6448H up to 3.2GHz | 1.92 TB * 2EA |
s2/h2 Server Types
The Bare Metal Server s2 server type provides standard specifications (vCPU, Memory) and, because its resources are physically isolated, is suitable for high-performance applications. The h2 server type provides high-capacity server specifications and is suitable for high-performance, large-scale data processing applications.
- Supports 5 types of vCPU in total (16, 24, 32, 72, 96 vCore)
- Intel 3rd generation (Ice Lake) Processor
- Supports up to 48 physical cores, 96 vCPUs, and 1,024 GB of memory
- Provides up to 2 x 1.92 TB Internal Disks (for OS)
| Server Type | Physical CPU | vCPU | Memory | CPU Type | Internal Disk (OS) |
|---|---|---|---|---|---|
| s2v16m64_metal | 8 Core | 16 vCore | 64 GB | Intel Xeon Gold 6334 up to 3.6GHz | 480 GB * 2EA |
| s2v16m128_metal | 8 Core | 16 vCore | 128 GB | Intel Xeon Gold 6334 up to 3.6GHz | 480 GB * 2EA |
| s2v16m256_metal | 8 Core | 16 vCore | 256 GB | Intel Xeon Gold 6334 up to 3.6GHz | 480 GB * 2EA |
| h2v24m96_metal | 12 Core | 24 vCore | 96 GB | Intel Xeon Gold 5317 up to 3.4GHz | 960 GB * 2EA |
| h2v24m192_metal | 12 Core | 24 vCore | 192 GB | Intel Xeon Gold 5317 up to 3.4GHz | 960 GB * 2EA |
| h2v24m384_metal | 12 Core | 24 vCore | 384 GB | Intel Xeon Gold 5317 up to 3.4GHz | 960 GB * 2EA |
| h2v32m128_metal | 16 Core | 32 vCore | 128 GB | Intel Xeon Gold 6346 up to 3.6GHz | 960 GB * 2EA |
| h2v32m256_metal | 16 Core | 32 vCore | 256 GB | Intel Xeon Gold 6346 up to 3.6GHz | 960 GB * 2EA |
| h2v32m512_metal | 16 Core | 32 vCore | 512 GB | Intel Xeon Gold 6346 up to 3.6GHz | 960 GB * 2EA |
| h2v72m256_metal | 36 Core | 72 vCore | 256 GB | Intel Xeon Gold 6354 up to 3.6GHz | 1.92 TB * 2EA |
| h2v72m512_metal | 36 Core | 72 vCore | 512 GB | Intel Xeon Gold 6354 up to 3.6GHz | 1.92 TB * 2EA |
| h2v72m1024_metal | 36 Core | 72 vCore | 1024 GB | Intel Xeon Gold 6354 up to 3.6GHz | 1.92 TB * 2EA |
| h2v96m384_metal | 48 Core | 96 vCore | 384 GB | Intel Xeon Gold 6342 up to 3.3GHz | 1.92 TB * 2EA |
| h2v96m768_metal | 48 Core | 96 vCore | 768 GB | Intel Xeon Gold 6342 up to 3.3GHz | 1.92 TB * 2EA |
4.1.2 - Monitoring Metrics
Bare Metal Server Monitoring Metrics
The following table shows the monitoring metrics for Bare Metal Server that can be checked through Cloud Monitoring.
| Performance Item | Detailed Description | Unit |
|---|---|---|
| Core Usage [IO Wait] | The ratio of CPU time spent in a waiting state (disk wait) | % |
| Core Usage [System] | The ratio of CPU time spent in kernel space | % |
| Core Usage [User] | The ratio of CPU time spent in user space | % |
| CPU Cores | The number of CPU cores on the host | cnt |
| CPU Usage [Active] | The percentage of CPU time used, excluding idle and IOWait states | % |
| CPU Usage [Idle] | The ratio of CPU time spent in an idle state | % |
| CPU Usage [IO Wait] | The ratio of CPU time spent in a waiting state (disk wait) | % |
| CPU Usage [System] | The percentage of CPU time used by the kernel | % |
| CPU Usage [User] | The percentage of CPU time used by the user area | % |
| CPU Usage/Core [Active] | The percentage of CPU time used, excluding idle and IOWait states | % |
| CPU Usage/Core [Idle] | The ratio of CPU time spent in an idle state | % |
| CPU Usage/Core [IO Wait] | The ratio of CPU time spent in a waiting state (disk wait) | % |
| CPU Usage/Core [System] | The percentage of CPU time used by the kernel | % |
| CPU Usage/Core [User] | The percentage of CPU time used by the user area | % |
| Disk CPU Usage [IO Request] | The ratio of CPU time spent executing I/O requests for the device | % |
| Disk Queue Size [Avg] | The average queue length of requests executed for the device | num |
| Disk Read Bytes | The number of bytes read from the device per second | bytes |
| Disk Read Bytes [Delta Avg] | The average of system.diskio.read.bytes_delta for individual disks | bytes |
| Disk Read Bytes [Delta Max] | The maximum of system.diskio.read.bytes_delta for individual disks | bytes |
| Disk Read Bytes [Delta Min] | The minimum of system.diskio.read.bytes_delta for individual disks | bytes |
| Disk Read Bytes [Delta Sum] | The sum of system.diskio.read.bytes_delta for individual disks | bytes |
| Disk Read Bytes [Delta] | The delta of system.diskio.read.bytes for individual disks | bytes |
| Disk Read Bytes [Success] | The total number of bytes read successfully | bytes |
| Disk Read Requests | The number of read requests for the disk device per second | cnt |
| Disk Read Requests [Delta Avg] | The average of system.diskio.read.count_delta for individual disks | cnt |
| Disk Read Requests [Delta Max] | The maximum of system.diskio.read.count_delta for individual disks | cnt |
| Disk Read Requests [Delta Min] | The minimum of system.diskio.read.count_delta for individual disks | cnt |
| Disk Read Requests [Delta Sum] | The sum of system.diskio.read.count_delta for individual disks | cnt |
| Disk Read Requests [Success Delta] | The delta of system.diskio.read.count for individual disks | cnt |
| Disk Read Requests [Success] | The total number of successful reads | cnt |
| Disk Request Size [Avg] | The average size of requests executed for the device (in sectors) | num |
| Disk Service Time [Avg] | The average service time for I/O requests executed for the device (in milliseconds) | ms |
| Disk Wait Time [Avg] | The average time spent waiting for I/O requests executed for the device | ms |
| Disk Wait Time [Read] | The average disk read wait time | ms |
| Disk Wait Time [Write] | The average disk write wait time | ms |
| Disk Write Bytes [Delta Avg] | The average of system.diskio.write.bytes_delta for individual disks | bytes |
| Disk Write Bytes [Delta Max] | The maximum of system.diskio.write.bytes_delta for individual disks | bytes |
| Disk Write Bytes [Delta Min] | The minimum of system.diskio.write.bytes_delta for individual disks | bytes |
| Disk Write Bytes [Delta Sum] | The sum of system.diskio.write.bytes_delta for individual disks | bytes |
| Disk Write Bytes [Delta] | The delta of system.diskio.write.bytes for individual disks | bytes |
| Disk Write Bytes [Success] | The total number of bytes written successfully | bytes |
| Disk Write Requests | The number of write requests for the disk device per second | cnt |
| Disk Write Requests [Delta Avg] | The average of system.diskio.write.count_delta for individual disks | cnt |
| Disk Write Requests [Delta Max] | The maximum of system.diskio.write.count_delta for individual disks | cnt |
| Disk Write Requests [Delta Min] | The minimum of system.diskio.write.count_delta for individual disks | cnt |
| Disk Write Requests [Delta Sum] | The sum of system.diskio.write.count_delta for individual disks | cnt |
| Disk Write Requests [Success Delta] | The delta of system.diskio.write.count for individual disks | cnt |
| Disk Write Requests [Success] | The total number of successful writes | cnt |
| Disk Write Bytes | The number of bytes written to the device per second | bytes |
| Filesystem Hang Check | Filesystem (local/NFS) hang check (normal: 1, abnormal: 0) | status |
| Filesystem Nodes | The total number of file nodes in the filesystem | cnt |
| Filesystem Nodes [Free] | The total number of available file nodes in the filesystem | cnt |
| Filesystem Size [Available] | The available disk space for non-privileged users (in bytes) | bytes |
| Filesystem Size [Free] | The available disk space (in bytes) | bytes |
| Filesystem Size [Total] | The total disk space (in bytes) | bytes |
| Filesystem Usage | The percentage of used disk space | % |
| Filesystem Usage [Avg] | The average of filesystem.used.pct for individual filesystems | % |
| Filesystem Usage [Inode] | The inode usage rate | % |
| Filesystem Usage [Max] | The maximum of filesystem.used.pct for individual filesystems | % |
| Filesystem Usage [Min] | The minimum of filesystem.used.pct for individual filesystems | % |
| Filesystem Usage [Total] | The total filesystem usage | % |
| Filesystem Used | The used disk space (in bytes) | bytes |
| Filesystem Used [Inode] | The inode usage | bytes |
| Memory Free | The total available memory (in bytes), excluding system cache and buffer memory | bytes |
| Memory Free [Actual] | The actual available memory (in bytes) | bytes |
| Memory Free [Swap] | The available swap memory | bytes |
| Memory Total | The total memory | bytes |
| Memory Total [Swap] | The total swap memory | bytes |
| Memory Usage | The percentage of used memory | % |
| Memory Usage [Actual] | The percentage of actual used memory | % |
| Memory Usage [Cache Swap] | The cache swap usage rate | % |
| Memory Usage [Swap] | The percentage of used swap memory | % |
| Memory Used | The used memory | bytes |
| Memory Used [Actual] | The actual used memory (in bytes), calculated by subtracting the actual available memory from the total memory | bytes |
| Memory Used [Swap] | The used swap memory | bytes |
| Collisions | Network collisions | cnt |
| Network In Bytes | The number of received bytes | bytes |
| Network In Bytes [Delta Avg] | The average of system.network.in.bytes_delta for individual networks | bytes |
| Network In Bytes [Delta Max] | The maximum of system.network.in.bytes_delta for individual networks | bytes |
| Network In Bytes [Delta Min] | The minimum of system.network.in.bytes_delta for individual networks | bytes |
| Network In Bytes [Delta Sum] | The sum of system.network.in.bytes_delta for individual networks | bytes |
| Network In Bytes [Delta] | The delta of received bytes | bytes |
| Network In Dropped | The number of dropped incoming packets | cnt |
| Network In Errors | The number of errors during reception | cnt |
| Network In Packets | The number of received packets | cnt |
| Network In Packets [Delta Avg] | The average of system.network.in.packets_delta for individual networks | cnt |
| Network In Packets [Delta Max] | The maximum of system.network.in.packets_delta for individual networks | cnt |
| Network In Packets [Delta Min] | The minimum of system.network.in.packets_delta for individual networks | cnt |
| Network In Packets [Delta Sum] | The sum of system.network.in.packets_delta for individual networks | cnt |
| Network In Packets [Delta] | The delta of received packets | cnt |
| Network Out Bytes | The number of transmitted bytes | bytes |
| Network Out Bytes [Delta Avg] | The average of system.network.out.bytes_delta for individual networks | bytes |
| Network Out Bytes [Delta Max] | The maximum of system.network.out.bytes_delta for individual networks | bytes |
| Network Out Bytes [Delta Min] | The minimum of system.network.out.bytes_delta for individual networks | bytes |
| Network Out Bytes [Delta Sum] | The sum of system.network.out.bytes_delta for individual networks | bytes |
| Network Out Bytes [Delta] | The delta of transmitted bytes | bytes |
| Network Out Dropped | The number of dropped outgoing packets | cnt |
| Network Out Errors | The number of errors during transmission | cnt |
| Network Out Packets | The number of transmitted packets | cnt |
| Network Out Packets [Delta Avg] | The average of system.network.out.packets_delta for individual networks | cnt |
| Network Out Packets [Delta Max] | The maximum of system.network.out.packets_delta for individual networks | cnt |
| Network Out Packets [Delta Min] | The minimum of system.network.out.packets_delta for individual networks | cnt |
| Network Out Packets [Delta Sum] | The sum of system.network.out.packets_delta for individual networks | cnt |
| Network Out Packets [Delta] | The delta of transmitted packets | cnt |
| Open Connections [TCP] | The number of open TCP connections | cnt |
| Open Connections [UDP] | The number of open UDP connections | cnt |
| Port Usage | The port usage rate | % |
| SYN Sent Sockets | The number of sockets in the SYN_SENT state (when connecting from local to remote) | cnt |
| Kernel PID Max | The value of kernel.pid_max | cnt |
| Kernel Thread Max | The value of kernel.threads-max | cnt |
| Process CPU Usage | The percentage of CPU time consumed by the process since the last update | % |
| Process CPU Usage/Core | The percentage of CPU time used by the process since the last event | % |
| Process Memory Usage | The percentage of main memory (RAM) used by the process | % |
| Process Memory Used | The resident set size, which is the amount of memory used by the process in RAM | bytes |
| Process PID | The process ID | pid |
| Process PPID | The parent process ID | pid |
| Processes [Dead] | The number of dead processes | cnt |
| Processes [Idle] | The number of idle processes | cnt |
| Processes [Running] | The number of running processes | cnt |
| Processes [Sleeping] | The number of sleeping processes | cnt |
| Processes [Stopped] | The number of stopped processes | cnt |
| Processes [Total] | The total number of processes | cnt |
| Processes [Unknown] | The number of processes with unknown or unsearchable status | cnt |
| Processes [Zombie] | The number of zombie processes | cnt |
| Running Process Usage | The process usage rate | % |
| Running Processes | The number of running processes | cnt |
| Running Thread Usage | The thread usage rate | % |
| Running Threads | The total number of threads running in running processes | cnt |
| Context Switches | The number of context switches (per second) | cnt |
| Load/Core [1 min] | The load averaged over the last 1 minute, divided by the number of cores | cnt |
| Load/Core [15 min] | The load averaged over the last 15 minutes, divided by the number of cores | cnt |
| Load/Core [5 min] | The load averaged over the last 5 minutes, divided by the number of cores | cnt |
| Multipaths [Active] | The number of external storage connection paths with status = active | cnt |
| Multipaths [Failed] | The number of external storage connection paths with status = failed | cnt |
| Multipaths [Faulty] | The number of external storage connection paths with status = faulty | cnt |
| NTP Offset | The measured offset (time difference between the NTP server and the local environment) of the last sample | num |
| Run Queue Length | The length of the run queue | num |
| Uptime | The OS uptime (in milliseconds) | ms |
| Context Switches | The number of CPU context switches (per second) | cnt |
| Disk Read Bytes [Sec] | The number of bytes read from the Windows logical disk per second | bytes |
| Disk Read Time [Avg] | The average data read time (in seconds) | sec |
| Disk Transfer Time [Avg] | The average disk wait time | sec |
| Disk Usage | The disk usage rate | % |
| Disk Write Bytes [Sec] | The number of bytes written to the Windows logical disk per second | bytes |
| Disk Write Time [Avg] | The average data write time (in seconds) | sec |
| Pagingfile Usage | The paging file usage rate | % |
| Pool Used [Non Paged] | The non-paged pool usage in kernel memory | bytes |
| Pool Used [Paged] | The paged pool usage in kernel memory | bytes |
| Process [Running] | The number of currently running processes | cnt |
| Threads [Running] | The number of currently running threads | cnt |
| Threads [Waiting] | The number of threads waiting for processor time | cnt |
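Several of the metrics above are straightforward derivations of raw OS counters; for example, the Load/Core metrics divide the load average by the number of cores. The following stdlib-only Python sketch reproduces that calculation (the agent's actual implementation is not documented here, so this is illustrative):

```python
import os

def load_per_core():
    """Return the 1/5/15-minute load averages divided by the core
    count, matching the Load/Core [1 min / 5 min / 15 min] metrics."""
    cores = os.cpu_count() or 1        # guard against None on odd platforms
    return tuple(load / cores for load in os.getloadavg())

one_min, five_min, fifteen_min = load_per_core()
print(f"Load/Core [1 min]: {one_min:.2f}")
```

Note that `os.getloadavg()` is available on Unix-like systems only; the Windows-specific metrics in the table (e.g. Pagingfile Usage) come from different OS counters.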
4.2 - How-to guides
Using the Samsung Cloud Platform Console, you can enter the required information for a Bare Metal Server, select detailed options, and create the service.
Bare Metal Server Create
You can create and use the Bare Metal Server service from the Samsung Cloud Platform Console.
To create a Bare Metal Server, follow the steps below.
- Click the All Services > Compute > Bare Metal Server menu to go to the Bare Metal Server Service Home page.
- Click the Bare Metal Server Create button on the Service Home page to navigate to the Bare Metal Server Create page.
- On the Bare Metal Server Create page, enter the information required to create the service and select detailed options.
- In the Image and Version Selection area, select the required information.
| Category | Required or not | Detailed description |
|---|---|---|
| Image | Required | Select the type of image provided: RHEL, Rocky Linux, Ubuntu, or Windows |
| Image Version | Required | Select a version of the chosen image; a list of versions of the provided server images is shown |
Table. Bare Metal Server Image and Version Input Items
- In the Service Information Input area, enter or select the required information.
| Category | Required or not | Detailed description |
|---|---|---|
| Server count | Required | Number of Bare Metal Servers to create simultaneously; only numbers from 1 to 5 can be entered |
| Service Type > Server Type | Required | Bare Metal Server server type; select the desired vCPU, Memory, and Disk specifications. For details on the server types provided, refer to Bare Metal Server Server Type |
| Service Type > Planned Compute | Required | Status of resources with Planned Compute set. In Use: number of resources with Planned Compute set that are in use; Configured: number of resources with Planned Compute set; Coverage Preview: amount applied by Planned Compute per resource; Apply for Planned Compute Service: go to the Planned Compute application page. For details, refer to Apply for Planned Compute |
| Automation Account | Required | Automatically creates an account that provides automation functions after Bare Metal Server creation. The account is used only for inter-system interface purposes, its password is encrypted and cannot be accessed outside the system, and deleting the account restricts network changes and some automation functions |
Table. Bare Metal Server Service Information Input Items
- In the Required Information Input area, enter or select the required information.
| Category | Required or not | Detailed description |
|---|---|---|
| Administrator Account | Required | Set the administrator account and password used when connecting to the server. For RHEL and Ubuntu OS, the account is fixed to root. For Windows OS, enter 5 to 20 characters using lowercase letters and numbers; the name Administrator is not allowed |
| Server Name | Required | Name that identifies the Bare Metal Server when the selected server count is 1; the hostname is set to the entered name. Start with a lowercase letter and use lowercase letters, numbers, and hyphens (-) within 3 to 15 characters; must not end with a hyphen (-) |
| Server Name Prefix | Required | Prefix that identifies each Bare Metal Server when the selected server count is 2 or more; names are automatically generated as the entered prefix plus '-###'. Start with a lowercase letter and use lowercase letters, numbers, and hyphens (-) within 3 to 15 characters; must not end with a hyphen (-) |
| Network Settings | Required | Set the network where the Bare Metal Server will be installed. Select a pre-created VPC. General Subnet: select a pre-created general Subnet; the IP can be auto-generated or entered directly. NAT: available only for a single server in a VPC with an Internet Gateway attached; when checked, select a NAT IP, or click Create New to generate a Public IP (charged according to the Public IP pricing policy) and Refresh to view and select it. Local Subnet (optional): select a pre-created local Subnet; the IP can be auto-generated or entered directly |
Table. Bare Metal Server Required Information Input Items
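The server-name rules above (start with a lowercase letter; lowercase letters, numbers, and hyphens only; 3 to 15 characters; must not end with a hyphen) map directly onto a regular expression. A hypothetical client-side validator in Python, useful for pre-checking names before submitting them to the console or API:

```python
import re

# Encodes the console's stated naming rules; the function name and
# regex are illustrative, not part of the platform's API.
SERVER_NAME_RE = re.compile(r"^[a-z][a-z0-9-]{1,13}[a-z0-9]$")

def is_valid_server_name(name):
    """True if name starts with a lowercase letter, uses only lowercase
    letters, digits, and '-', is 3-15 chars, and does not end with '-'."""
    return bool(SERVER_NAME_RE.match(name))

print(is_valid_server_name("bm-server01"))  # True
print(is_valid_server_name("bm-"))          # False: ends with a hyphen
print(is_valid_server_name("1server"))      # False: must start with a letter
```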
Control traffic to the Bare Metal Server with a firewall or similar means; a Security Group is not provided.
The firewall of the Bare Metal Server can only be used for traffic control between the Bare Metal Server and the Virtual Server. To use the Bare Metal Server’s firewall, follow the steps below.
- Separate the VPC of the Bare Metal Server: Separate them so that the Bare Metal Server and Virtual Server do not use the same VPC.
- Create Transit Gateway: Please create the Transit Gateway.
- The integration between the VPC of Virtual Server and the VPC of Bare Metal Server uses a Transit Gateway.
- When creating a Transit Gateway integration in the VPC of a Bare Metal Server, you must also create the Bare Metal Server’s firewall.
- Firewall Rule registration: Register a rule in the Firewall of the Bare Metal Server.
- Bare Metal Server Creation on the page Additional Information Input area, enter or select the required information.
Local disk partition (optional): set whether to use local disk partitions.
- Up to 10 partitions can be created, including the root partition
- Up to 90% of the total capacity can be used
- After checking Use, partition information can be set
- Root partition settings
  - Partition type: flat or lvm
  - Partition name: enter the partition name
    - Can be entered only when the partition type is lvm
    - Enter up to 15 characters, starting with a letter; letters, numbers, and the special character (-) are allowed
  - Partition size: enter at least 50 GB
  - Filesystem type: select according to the image used
    - RHEL, Rocky Linux: xfs, ext4
    - Ubuntu: ext4, xfs, btrfs
    - SLES: btrfs, xfs, ext4
  - Mount point: start with the special character (/) and enter up to 15 characters, including letters, numbers, and the special character (-)
    - If the filesystem type is swap, entry is not allowed
  - Available capacity: 90% of the default disk capacity provided for the selected server
    - When setting partition sizes, the remaining capacity is calculated and displayed automatically
    - The total size of all partitions cannot exceed the available capacity
- Additional partition settings
  - Partition type: flat or lvm
  - Partition name: enter the partition name
    - Can be entered only when the partition type is lvm
    - Enter up to 15 characters, starting with a letter; letters, numbers, and the special character (-) are allowed
  - Partition size: enter at least 1 GB
  - Filesystem type: select according to the image used
    - RHEL, Rocky Linux: xfs, ext4, swap
    - Ubuntu: ext4, xfs, btrfs, swap
    - SLES: btrfs, xfs, ext4, swap
  - Mount point: start with the special character (/) and enter up to 15 characters, including letters, numbers, and the special character (-)
    - If the filesystem type is swap, entry is not allowed
  - Available capacity: 90% of the default disk capacity provided for the selected server
    - When setting partition sizes, the remaining capacity is calculated and displayed automatically
    - The total size of all partitions cannot exceed the available capacity
Placement Group (optional): servers in the same Placement group are distributed across different racks.
- Provides distributed placement for up to 2 servers in the same Placement group
- To distribute 3 or more servers, add additional Placement groups
- Can be set only at initial creation and cannot be modified afterward
- When the last server in a Placement group is terminated, that Placement group is deleted automatically
Lock (optional): using Lock prevents the server from being terminated, started, or stopped by mistake.
Hyper Threading (optional): sets logical cores to operate at twice the number of physical cores.
- Uncheck the box to turn off Hyper Threading
- Cannot be changed after server creation
Init Script (optional): a script that runs when the server starts.
- The Init Script type depends on the image type
  - Windows: select Batch Script
  - Linux: select Shell Script
Table. Bare Metal Server Additional Information Input Items
- In the Summary panel, check the detailed information and the estimated billing amount, then click the Complete button.
- When creation is complete, check the created resources on the Bare Metal Server List page.
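The capacity rules above (at most 90% of the default disk may be allocated, and the partition total cannot exceed that limit) can be sketched as a quick pre-check before filling in the Console form. This is a hypothetical helper, not an SCP tool; the disk size and partition sizes are example values.

```shell
#!/usr/bin/bash
# Hypothetical pre-check for a planned partition layout:
# at most 90% of the default disk capacity may be split across partitions.
DISK_GB=1000                        # default disk capacity of the chosen server type (example)
AVAILABLE_GB=$(( DISK_GB * 90 / 100 ))
PARTS_GB="50 200 300"               # planned sizes: root (>= 50 GB) plus additional partitions

total=0
for p in $PARTS_GB; do
  total=$(( total + p ))
done

echo "available: ${AVAILABLE_GB} GB, requested: ${total} GB"
if [ "$total" -le "$AVAILABLE_GB" ]; then
  echo "layout fits"
else
  echo "layout exceeds the 90% limit"
fi
```

The same remaining-capacity arithmetic is what the creation form displays automatically as you enter sizes.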
View Bare Metal Server Details
In the Bare Metal Server service, you can view and edit the full resource list and detailed information. The Bare Metal Server Details page consists of the Details, Tags, and Activity History tabs.
To view the detailed information of a Bare Metal Server, follow the steps below.
- Click the All Services > Compute > Bare Metal Server menu. You are taken to the Bare Metal Server Service Home page.
- Click the Bare Metal Server menu on the Service Home page. You are taken to the Bare Metal Server List page.
- On the Bare Metal Server List page, click the resource whose details you want to view. You are taken to the Bare Metal Server Details page.
- The Bare Metal Server Details page displays status information and additional functions, and consists of the Details, Tags, and Activity History tabs.
Bare Metal Server status: the status of the Bare Metal Server created by the user
- Creating: the server is being created
- Running: creation is complete and the server is usable
- Editing: the IP is being changed
- Unknown: error state
- Starting: the server is starting
- Stopping: the server is stopping
- Stopped: the server has stopped
- Terminating: termination is in progress
- Terminated: termination is complete
Server Control: buttons to change the server status
- Start: starts a stopped server
- Stop: stops a running server
Service Termination: button to terminate the service
Table. Bare Metal Server status information and additional functions
Details
On the Bare Metal Server Details page, you can view the detailed information of the selected resource and, if necessary, edit it.
| Category | Detailed description |
|---|---|
| Service | Service name |
| Resource Type | Resource type |
| SRN | Unique resource ID in Samsung Cloud Platform |
| Resource Name | Resource name |
| Resource ID | Unique resource ID in the service |
| Creator | User who created the service |
| Creation time | Service creation time |
| Editor | User who edited the service information |
| Modification DateTime | Date and time when the service information was modified |
| Server name | Server name |
| Image/Version | OS image and version of the server |
| Server Type | vCPU and memory information |
| Planned Compute | Whether Planned Compute is set for the resource |
| Lock | Whether Lock is in use |
| Hyper Threading | Whether Hyper Threading is in use |
| Network | Network information of the Bare Metal Server |
| Local Subnet | Local Subnet information of the Bare Metal Server |
| Block Storage | Block Storage connected to the server |
| Init Script | The Init Script entered when creating the server |
Tags
On the Bare Metal Server Details page, you can view the tag information of the selected resource and add, modify, or delete tags.
| Category | Detailed description |
|---|---|
| Tag List | List of tags |
Activity History
On the Bare Metal Server Details page, you can view the activity history of the selected resource.
| Category | Detailed description |
|---|---|
| Activity History List | Resource change history |
Bare Metal Server Resource Management
If you need server control and management functions for created Bare Metal Server resources, you can perform these tasks on the Bare Metal Server List or Bare Metal Server Details page.
Bare Metal Server Operation Control
You can start, stop, and restart a Bare Metal Server.
To control the operation of a Bare Metal Server, follow the steps below.
- Click the All Services > Compute > Bare Metal Server menu. You are taken to the Bare Metal Server Service Home page.
- Click the Bare Metal Server menu on the Service Home page. You are taken to the Bare Metal Server List page.
- On the Bare Metal Server List page, you can select multiple servers and control them at once using the Start and Stop buttons at the top of the table.
- You can also start and stop a server on the Bare Metal Server Details page.
- On the Bare Metal Server List page, click the resource you want to control. You are taken to the Bare Metal Server Details page.
- Check the server status and complete the change using the Server Control buttons.
- Start: starts the stopped server.
- Stop: stops the running server.
When a Bare Metal Server is stopped, the server's power is turned off.
- Because this may affect applications or storage in use, we recommend shutting down the OS first and then stopping the server.
- After shutting down the OS, be sure to also stop the server in the Console.
- If a start request for a Bare Metal Server cannot be processed, refer to the following.
- If Lock is set: change the Lock setting to disabled, then try again.
- If the Bare Metal Server's status is not Stopped: change the status to Stopped, then try again.
- If a stop request for a Bare Metal Server cannot be processed, refer to the following.
- If Lock is set: change the Lock setting to disabled, then try again.
- If the Bare Metal Server's status is not Running: change the status to Running, then try again.
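The start/stop preconditions above can be summarized in a small sketch (a hypothetical helper, not part of any SCP CLI): Start is accepted only in the Stopped state, Stop only in the Running state, and neither while Lock is enabled.

```shell
#!/usr/bin/bash
# Hypothetical helper mirroring the Console rules: which control action is
# accepted for a given server status and Lock setting.
allowed_action() {
  local status=$1 lock=$2
  if [ "$lock" = "locked" ]; then
    echo "none (disable Lock first)"
    return
  fi
  case "$status" in
    Stopped) echo "Start" ;;
    Running) echo "Stop" ;;
    *)       echo "none (wait for ${status} to finish)" ;;
  esac
}

allowed_action Stopped unlocked    # -> Start
allowed_action Running locked      # -> none (disable Lock first)
```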
Add Block Storage
You can add Block Storage to a Bare Metal Server.
To add Block Storage, follow the steps below.
- Click the All Services > Compute > Bare Metal Server menu. You are taken to the Bare Metal Server Service Home page.
- Click the Bare Metal Server menu on the Service Home page. You are taken to the Bare Metal Server List page.
- On the Bare Metal Server List page, click the server to which you want to add Block Storage. You are taken to the Bare Metal Server Details page.
- Click the Add button in the Block Storage item on the Bare Metal Server Details page.
- When the popup confirming the Block Storage addition opens, click the Confirm button. You are taken to the Block Storage (BM) Creation page.
- On the Block Storage (BM) Creation page, enter the information required to create the service and create the Block Storage.
- For details about creating Block Storage (BM), refer to Block Storage (BM) Create.
- After adding the Block Storage, navigate to the Bare Metal Server Details page and verify that the Block Storage has been added.
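Besides checking the Details page, the attached volume typically appears in the OS as an additional block device. A minimal sketch for spotting it by comparing device lists (the device names here are hypothetical examples, not output from a real attach):

```shell
#!/usr/bin/bash
# Print device names present in the "after" list but not in the "before" list.
# In practice the lists would come from:  ls /sys/block
new_devices() {
  for d in $2; do
    case " $1 " in
      *" $d "*) ;;        # device already existed before the attach
      *) echo "$d" ;;     # device is new
    esac
  done
}

before="sda nvme0n1"       # devices before the attach (example)
after="sda nvme0n1 sdb"    # devices after the attach (example)
new_devices "$before" "$after"    # -> sdb
```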
Bare Metal Server Termination
Terminating an unused Bare Metal Server reduces operating costs. However, terminating a Bare Metal Server immediately stops any running service, so consider the impact of the service interruption sufficiently before proceeding.
To terminate a Bare Metal Server, follow the steps below.
- Click the All Services > Compute > Bare Metal Server menu. You are taken to the Bare Metal Server Service Home page.
- Click the Bare Metal Server menu on the Service Home page. You are taken to the Bare Metal Server List page.
- On the Bare Metal Server List page, select the resource to terminate and click the Cancel Service button.
- You can select multiple resources and terminate them at once.
- You can also terminate by clicking the Service Termination button on the Bare Metal Server Details page of the resource.
- When termination is complete, check on the Bare Metal Server List page that the resource has been terminated.
Termination Constraints
If a termination request for a Bare Metal Server cannot be processed, a popup window explains the reason. Refer to the cases below.
If Block Storage (BM) is connected (when terminating 2 or more servers at once): disconnect the Block Storage (BM) first.
- For details, refer to Block Storage (BM) Cancel.
If File Storage is connected: disconnect the File Storage first.
- For details, refer to File Storage Cancel.
If Lock is set: change the Lock setting to disabled, then try again.
If a Backup Agent or Load Balancer is connected to the server: disconnect that resource first.
If resource management tasks for a Bare Metal Server are in progress on the same account: try again after those tasks are completed.
If the Bare Metal Server's status is not Running or Stopped: change the status to Running or Stopped, then try again.
If a server that cannot be terminated at the same time is included: select only the resources that can be terminated and try again.
Local Subnet Setup
After creating a Bare Metal Server, if you add a local Subnet on the Bare Metal Server Details page, you must configure the network settings of the local Subnet yourself.
First Connection (kr-west)
If no local Subnet is connected to the Bare Metal Server and you are adding the first connection, follow the guide below.
This guide applies to kr-west (Korea West) when adding the first local Subnet connection to the server.
- For kr-south (Korea South), refer to the First Connection (kr-south) chapter.
Linux - Setting up Subnet on Ubuntu
To add a local Subnet and configure the network on the Ubuntu operating system, follow the steps below.
On the Bare Metal Server Details page, check the Interface Name.
View the network configuration.

```
[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml
network:
  ethernets:
    ens2f1:
      match:
        macaddress: 68:05:ca:d4:09:91
      mtu: 1500
      set-name: ens2f1
    ens4f1:
      match:
        macaddress: 68:05:ca:d4:09:01
      mtu: 1500
      set-name: ens4f1
```

Code block. Network configuration file lookup

After adding a new VLAN, set the IP for the Bonding configuration.
- Change the ID and IP in the example code to the assigned ID and IP.

```
[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml
network:
  bonds:
    bond-mgt:
      interfaces:
        - ens2f1    # Interface Name confirmed on the Bare Metal Server Details page
        - ens4f1    # Interface Name confirmed on the Bare Metal Server Details page
      mtu: 1500
      parameters:
        mii-monitor-interval: 100
        mode: active-backup
        transmit-hash-policy: layer2
  ethernets:
    ens2f1:
      match:
        macaddress: 68:05:ca:d4:09:91
      mtu: 1500
      set-name: ens2f1
    ens4f1:
      match:
        macaddress: 68:05:ca:d4:09:01
      mtu: 1500
      set-name: ens4f1
  vlans:
    bond-mgt.20:            # replace 20 with the VLAN ID confirmed in the SCP Console
      addresses:
        - 192.168.0.10/24   # set to the local Subnet IP confirmed in the SCP Console
      id: 20                # set to the VLAN ID confirmed in the SCP Console
      link: bond-mgt
      mtu: 1500
```

Code block. IP settings

Apply the changes to the system.

```
# netplan apply
```

Code block. Apply changes

Check the interface status.

```
# ip a
# bash /usr/local/bin/ip.sh
```

Code block. Interface lookup (either command)
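The vlan stanza added to the netplan file always follows the same pattern, so generating it from the Console values can avoid indentation mistakes. A sketch using the example values from this guide (bond-mgt, VLAN 20, 192.168.0.10/24); this helper is hypothetical, not an SCP tool:

```shell
#!/usr/bin/bash
# Emit a netplan "vlans:" stanza for a bond, VLAN ID, and IP (sketch only;
# substitute the VLAN ID and local Subnet IP confirmed in the SCP Console).
vlan_stanza() {
  local bond=$1 vid=$2 ip=$3
  cat <<EOF
  vlans:
    ${bond}.${vid}:
      addresses:
        - ${ip}
      id: ${vid}
      link: ${bond}
      mtu: 1500
EOF
}

vlan_stanza bond-mgt 20 192.168.0.10/24
```

The output can be pasted under the `network:` key of /etc/netplan/50-cloud-init.yaml before running `netplan apply`.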
Linux – Setting up Subnet on CentOS/Red Hat
After adding a local Subnet on the CentOS/Red Hat operating system, follow the steps below to configure the network.
On the Bare Metal Server Details page, check the Interface Name.
Modify the following command and execute it.

```
#!/usr/bin/bash
IP_ADDR="10.1.1.3/24"    # Set to the local Subnet IP confirmed in the Console.
VLAN_ID="7"              # Set to the VLAN ID confirmed in the Console.
BOND_NAME="bond-mgt"
BOND_IF_name1="ens2f1"   # Interface Name confirmed on the Bare Metal Server Details page.
BOND_IF_name2="ens4f0"   # Interface Name confirmed on the Bare Metal Server Details page.

# Delete Connection
nmcli con down "Bond ${BOND_NAME}"
nmcli con del "Bond ${BOND_NAME}"
nmcli con down "System ${BOND_IF_name1}"
nmcli con down "System ${BOND_IF_name2}"
nmcli con del "System ${BOND_IF_name1}"
nmcli con del "System ${BOND_IF_name2}"

# Create Bonding
nmcli con add con-name ${BOND_NAME} type bond ifname ${BOND_NAME}
nmcli con mod ${BOND_NAME} con-name "Bond ${BOND_NAME}"
nmcli con mod "Bond ${BOND_NAME}" ipv4.method disabled
nmcli con mod "Bond ${BOND_NAME}" ipv6.method ignore
nmcli con mod "Bond ${BOND_NAME}" connection.autoconnect yes
nmcli con mod "Bond ${BOND_NAME}" +bond.options mode=active-backup \
  +bond.options xmit_hash_policy=layer2 \
  +bond.options miimon=100 \
  +bond.options num_grat_arp=1 \
  +bond.options downdelay=0 \
  +bond.options updelay=0

# Assign bond-slave
nmcli con add type bond-slave ifname ${BOND_IF_name1} con-name "${BOND_IF_name1}" master ${BOND_NAME}
nmcli con mod ${BOND_IF_name1} con-name "System ${BOND_IF_name1}"
nmcli con add type bond-slave ifname ${BOND_IF_name2} con-name "${BOND_IF_name2}" master ${BOND_NAME}
nmcli con mod ${BOND_IF_name2} con-name "System ${BOND_IF_name2}"

# Connection UP
nmcli con up "Bond ${BOND_NAME}"

# Add VLAN
nmcli con add type vlan ifname "${BOND_NAME}.${VLAN_ID}" con-name "${BOND_NAME}.${VLAN_ID}" id ${VLAN_ID} dev ${BOND_NAME}
nmcli con mod ${BOND_NAME}.${VLAN_ID} con-name "Vlan ${BOND_NAME}.${VLAN_ID}"
nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv4.addresses ${IP_ADDR}
nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv4.method manual
nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv6.method ignore
nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" connection.autoconnect yes
nmcli con up "Vlan ${BOND_NAME}.${VLAN_ID}"
nmcli device reapply ${BOND_NAME}.${VLAN_ID}
```

Code block. IP configuration script

Check the interface status.

```
# ip a
# bash /usr/local/bin/ip.sh
```

Code block. Interface lookup (either command)
Setting up Subnet on Windows
After adding a local Subnet on the Windows operating system, follow the steps below to configure the network.
Right-click the Windows Start icon, then run Windows PowerShell (Administrator).
Check the Interface Name on the Bare Metal Server Details page.
Run ncpa.cpl from the Windows Run menu.
Check whether the interface is activated, and activate it if necessary.
- Activate the Interface Name confirmed on the Bare Metal Server Details page.
Create a Teaming.

```
PS C:\> New-NetLbfoTeam -Name "bond-mgt" -TeamMembers ens2f1,ens4f1
PS C:\> Set-NetLbfoTeam -Name "bond-mgt" -LoadBalancingAlgorithm Dynamic
```

Code block. Teaming creation

After adding a new VLAN, set the IP.
- Enter the VLAN ID and local Subnet IP confirmed on the Bare Metal Server Details page.

```
PS C:\> Add-NetLbfoTeamNIC -Team bond_bond-mgt -VlanID 20 -Name bond-mgt.20
PS C:\> Get-NetAdapter
PS C:\> netsh interface ip set address bond-mgt.20 static "192.168.0.10/24"
```

Code block. Windows IP settings

- Run ncpa.cpl from the Windows Run menu to check the interface status.
First Connection (kr-south)
If no local Subnet is connected to the Bare Metal Server and you are adding the first connection, follow the guide below.
This guide applies to kr-south (Korea South) when adding the first local Subnet connection to the server.
- For kr-west (Korea West), refer to the First Connection (kr-west) chapter.
Linux - Setting up Subnet on Ubuntu
To add a local Subnet and configure the network on the Ubuntu operating system, follow the steps below.
After adding a new VLAN, set the IP.
- Change the ID and IP in the example code to the assigned ID and IP.

```
[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml
network:
  bonds:
    bond-mgt:
      interfaces:
        - ens2f1
        - ens4f1
      mtu: 1500
      parameters:
        mii-monitor-interval: 100
        mode: active-backup
        transmit-hash-policy: layer2
  ethernets:
    ens2f1:
      match:
        macaddress: 68:05:ca:d4:09:91
      mtu: 1500
      set-name: ens2f1
    ens4f1:
      match:
        macaddress: 68:05:ca:d4:09:01
      mtu: 1500
      set-name: ens4f1
  vlans:
    bond-mgt.20:
      addresses:
        - 192.168.0.10/24
      id: 20
      link: bond-mgt
      mtu: 1500
    bond-mgt.21:            # replace 21 with the VLAN ID confirmed in the Console
      addresses:
        - 192.168.0.20/24   # set to the local Subnet IP confirmed in the Console
      id: 21                # set to the VLAN ID confirmed in the Console
      link: bond-mgt
      mtu: 1500
```

Code block. VLAN addition and IP settings

Apply the changes to the system.

```
# netplan apply
```

Code block. Apply changes

Check the interface status.

```
# ip a
# bash /usr/local/bin/ip.sh
```

Code block. Interface lookup (either command)
Linux – Setting up Subnet on CentOS/Red Hat
After adding a local Subnet on the CentOS/Red Hat operating system, follow the steps below to configure the network.
- Check the Bond Name for the local Subnet.

```
# sh /usr/local/bin/ip.sh
```

Code block. Bonding check

- Modify the following command and execute it.

```
#!/usr/bin/bash
IP_ADDR="10.1.1.3/24"    # Set to the local Subnet IP confirmed in the Console.
VLAN_ID="7"              # Set to the VLAN ID confirmed in the Console.
BOND_NAME="bond-mgt"     # Set to the Bond Name confirmed in step 1.

# add vlan
nmcli con add type vlan ifname "${BOND_NAME}.${VLAN_ID}" con-name "${BOND_NAME}.${VLAN_ID}" id ${VLAN_ID} dev ${BOND_NAME}
nmcli con mod ${BOND_NAME}.${VLAN_ID} con-name "Vlan ${BOND_NAME}.${VLAN_ID}"
nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv4.addresses ${IP_ADDR}
nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv4.method manual
nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv6.method ignore
nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" connection.autoconnect yes
nmcli con up "Vlan ${BOND_NAME}.${VLAN_ID}"
nmcli device reapply ${BOND_NAME}.${VLAN_ID}
```

Code block. IP configuration script

- Check the interface status.

```
# ip a
# bash /usr/local/bin/ip.sh
```

Code block. Interface lookup (either command)
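Before plugging values into the script above, the IP/prefix string can be sanity-checked so a missing prefix length is caught early. A sketch only (a hypothetical helper doing a pattern check, not full octet-range validation):

```shell
#!/usr/bin/bash
# Hypothetical pre-check: does the value look like "a.b.c.d/prefix"
# with a prefix length of 0-32?
valid_cidr() {
  printf '%s' "$1" | grep -Eq '^([0-9]{1,3}\.){3}[0-9]{1,3}/([0-9]|[12][0-9]|3[0-2])$'
}

valid_cidr "10.1.1.3/24" && echo "ok"
valid_cidr "10.1.1.3" || echo "missing prefix length"
```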
Setting up Subnet on Windows
After adding a local Subnet on the Windows operating system, follow these steps to configure the network.
Windows Start icon, right-click, then run the Windows PowerShell (Administrator) program.
Check the Teaming name for local Subnet.
Color modePS C:\> Get-NetAdapterPS C:\> Get-NetAdapterCode block. Windows interface check After adding a new VLAN, set the IP.
- Enter the Teaming name confirmed in step 2, and the Vlan ID and Local Subnet IP confirmed in the Console.Color mode
PS C:\> Add-NetLbfoTeamNIC -Team bond_bond-mgt -VlanID 20 -Name bond-mgt.20 PS C:\> Get-NetAdapter PS C:\> netsh interface ip set address bond-mgt.20 static “192.168.0.10/24”PS C:\> Add-NetLbfoTeamNIC -Team bond_bond-mgt -VlanID 20 -Name bond-mgt.20 PS C:\> Get-NetAdapter PS C:\> netsh interface ip set address bond-mgt.20 static “192.168.0.10/24”Code block. Create Teaming
- Enter the Teaming name confirmed in step 2, and the Vlan ID and Local Subnet IP confirmed in the Console.
In the Windows Start menu, run ncpa.cpl to check the interface status.
Add second connection (kr-west, kr-south)
If a local Subnet is already connected to the Bare Metal Server, follow the guide below to add a second connection.
Because Bonding was already created when connecting the first local Subnet, there is no Bonding creation step for the second connection.
Refer to the details below.
Linux - Setting up Subnet on Ubuntu
To add a local Subnet and configure the network on the Ubuntu operating system, follow the steps below.
After adding a new VLAN, set the IP.
- Change the ID and IP in the example code to the assigned ID and IP.

```
[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml
network:
  bonds:
    bond-mgt:
      interfaces:
        - ens2f1
        - ens4f1
      mtu: 1500
      parameters:
        mii-monitor-interval: 100
        mode: active-backup
        transmit-hash-policy: layer2
  ethernets:
    ens2f1:
      match:
        macaddress: 68:05:ca:d4:09:91
      mtu: 1500
      set-name: ens2f1
    ens4f1:
      match:
        macaddress: 68:05:ca:d4:09:01
      mtu: 1500
      set-name: ens4f1
  vlans:
    bond-mgt.20:
      addresses:
        - 192.168.0.10/24
      id: 20
      link: bond-mgt
      mtu: 1500
    bond-mgt.21:            # replace 21 with the VLAN ID confirmed in the Console
      addresses:
        - 192.168.0.20/24   # set to the local Subnet IP confirmed in the Console
      id: 21                # set to the VLAN ID confirmed in the Console
      link: bond-mgt
      mtu: 1500
```

Code block. VLAN addition and IP settings

Apply the changes to the system.

```
# netplan apply
```

Code block. Apply changes

Check the interface status.

```
# ip a
# bash /usr/local/bin/ip.sh
```

Code block. Interface lookup (either command)
Linux – Setting up Subnet on CentOS/Red Hat
After adding a local Subnet on the CentOS/Red Hat operating system, follow the steps below to configure the network.
- Check the Bond Name for the local Subnet.

```
# sh /usr/local/bin/ip.sh
```

Code block. Bonding check

- Modify the following command and execute it.

```
#!/usr/bin/bash
IP_ADDR="10.1.1.3/24"    # Set to the local Subnet IP confirmed in the Console.
VLAN_ID="7"              # Set to the VLAN ID confirmed in the Console.
BOND_NAME="bond-mgt"     # Set to the Bond Name confirmed in step 1.

# add vlan
nmcli con add type vlan ifname "${BOND_NAME}.${VLAN_ID}" con-name "${BOND_NAME}.${VLAN_ID}" id ${VLAN_ID} dev ${BOND_NAME}
nmcli con mod ${BOND_NAME}.${VLAN_ID} con-name "Vlan ${BOND_NAME}.${VLAN_ID}"
nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv4.addresses ${IP_ADDR}
nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv4.method manual
nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv6.method ignore
nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" connection.autoconnect yes
nmcli con up "Vlan ${BOND_NAME}.${VLAN_ID}"
nmcli device reapply ${BOND_NAME}.${VLAN_ID}
```

Code block. IP configuration script

- Check the interface status.

```
# ip a
# bash /usr/local/bin/ip.sh
```

Code block. Interface lookup (either command)
Setting up Subnet on Windows
After adding a local Subnet on the Windows operating system, follow the steps below to configure the network.
Right-click the Windows Start icon, then run Windows PowerShell (Administrator).
Check the Teaming name for the local Subnet.

```
PS C:\> Get-NetAdapter
```

Code block. Windows interface check

After adding a new VLAN, set the IP.
- Enter the Teaming name confirmed in step 2, and the VLAN ID and local Subnet IP confirmed in the Console.

```
PS C:\> Add-NetLbfoTeamNIC -Team bond_bond-mgt -VlanID 20 -Name bond-mgt.20
PS C:\> Get-NetAdapter
PS C:\> netsh interface ip set address bond-mgt.20 static "192.168.0.10/24"
```

Code block. Windows IP settings

Run ncpa.cpl from the Windows Run menu to check the interface status.
IP Change
The IP can be changed for migration, server replacement, and similar purposes.
- Once an IP change is in progress, communication over that IP is no longer possible, and the change cannot be canceled.
- For a server registered with the Load Balancer service, you must delete the existing IP from the LB server group and add the changed IP as a member of the LB server group.
- Servers using Public NAT or Private NAT must disable and reconfigure Public NAT or Private NAT after the IP change.
- If you are using Public NAT or Private NAT, first disable it, complete the IP change, and then configure it again.
- Public NAT or Private NAT usage can be changed by clicking the Edit button of the Public NAT IP or Private NAT item on the Bare Metal Server Details page.
If you want to change the IP, follow the steps below.
Click the All Services > Compute > Bare Metal Server menu to navigate to the Bare Metal Server Service Home page.
Click the Bare Metal Server menu on the Service Home page. You will be taken to the Bare Metal Server list page.
On the Bare Metal Server list page, click the server whose IP you want to change. The Bare Metal Server Details page opens.
Click the Edit button next to the IP item on the Bare Metal Server Details page.
When the popup notifying you of the IP modification opens, click the Confirm button. The IP Change popup opens.
In the IP Change popup window, proceed with the Step 1, Step 2, and Step 3 tasks in order.
Guide
- When changing the IP, the detailed configuration method for each IP change step varies depending on the subnet of the IP to be changed. Be sure to refer to the matching example below and proceed step by step.
- When changing to an IP that uses the same subnet: see Change to an IP in the same Subnet.
- When changing to an IP that uses a different subnet: see Change to an IP in a different Subnet.
- When each step completes successfully, the task status in the upper right corner is displayed as Completed, and you can proceed to the next step.
- When performing the final check in Step 3, it is recommended to restart the server first and then proceed with the inspection.
After confirming that all tasks have been completed successfully, click the Confirm button.
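Before requesting allocation in Step 1, it can help to confirm locally that the IP you intend to enter actually falls inside the target subnet. A minimal bash sketch; the helper names and example addresses are illustrative only and are not part of the console or the official procedure:

```shell
#!/usr/bin/env bash
# Illustrative helper: check whether an IPv4 address falls inside a CIDR subnet.

ip_to_int() {
  # Convert dotted-quad IPv4 to a 32-bit integer.
  local IFS=. a b c d
  read -r a b c d <<< "$1"
  echo $(( (a << 24) | (b << 16) | (c << 8) | d ))
}

ip_in_subnet() {
  # Return success if $1 is inside the subnet $2 (e.g. 172.17.34.0/24).
  local ip=$1 cidr=$2
  local net=${cidr%/*} prefix=${cidr#*/}
  local mask=$(( (0xFFFFFFFF << (32 - prefix)) & 0xFFFFFFFF ))
  (( ( $(ip_to_int "$ip") & mask ) == ( $(ip_to_int "$net") & mask ) ))
}

# Example: is 172.17.34.150 inside 172.17.34.0/24?
if ip_in_subnet 172.17.34.150 172.17.34.0/24; then
  echo "in subnet"
else
  echo "outside subnet"
fi
```

The console performs its own validation; this is only a quick local sanity check when planning the change.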
Change to an IP in the same Subnet

This section explains how to set the IP for each operating system when the IP to be changed uses the same subnet.

Linux – CentOS/RedHat operating system
Step 1
Follow the procedure below to complete the Step 1 tasks.
- Select the Subnet to change to.
- Enter the IP to change to.
- Click the IP Allocation Request button.
- When the popup confirming the IP change opens, click the Confirm button.
- If the task completes successfully, the task status in the upper right corner is displayed as Completed.

Caution
If you proceed with the IP allocation request in Step 1, you cannot cancel or restore the IP change.
Step 2
Follow the procedure below to complete the Step 2 tasks.

Connect to the IP change target server using the NAT IP to perform the IP change operation.

Notice
To prevent a situation where communication becomes impossible during the operation, it is recommended to connect via another Virtual Server or Bare Metal Server created in the same subnet.

Enter the IP assigned in Step 1 and set the IP to be changed on the server.
- In the following example, replace 172.17.34.150 with the assigned IP.
- After checking the information of the interface you want to change on the server, enter it instead of the example bond-srv.9.

```shell
# nmcli con mod "Vlan bond-srv.9" ipv4.addresses 172.17.34.150/24
# nmcli con mod "Vlan bond-srv.9" ipv4.method manual
# nmcli device reapply bond-srv.9
```
Code block. IP settings to change

Guide
- When you set the IP, the terminal session will be disconnected.
- After completing the Step 2 task, once the task status changes to Completed, you can reconnect to the terminal.

When all tasks are completed, select the task completion checkbox of Step 2 in the IP Change popup window.
- If the task completes successfully, the task status in the upper right corner is displayed as Completed.

Guide
- If the task status of Step 2 has changed to Completed but there is still an issue with terminal connection, go to the All Services > Management > Support Center Contact menu and submit an inquiry.
Step 3
Follow the procedure below to complete the Step 3 tasks.

Connect to the IP change target server using the NAT IP and check the communication status.
- Use the following command to check whether any pre-change configuration remains and whether the IP has been changed correctly. If you can connect normally to the IP change target server, the changed IP is communicating normally.

```shell
# bash /usr/local/bin/ip.sh
```
Code block. Communication status check

Reference
The NAT IP does not change.

Once all tasks are completed, restart the server and then perform a final check.

Reference
It is recommended to perform the final check after restarting the server.

If there are no issues in the final inspection results, select the task completion checkbox of Step 3 in the IP Change popup window.
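As a rough checklist for the final inspection after the restart, the following commands confirm that only the new IP remains and that basic routing still works. This is an illustrative fragment to run on the target server (output varies per environment; the gateway placeholder must be filled in), not an official check script:

```shell
# Run on the IP change target server after the restart (illustrative fragment).
ip -4 addr show          # only the changed IP should appear on the interface
ip route                 # the default route should still point to the subnet gateway
ping -c 3 <gateway-ip>   # replace <gateway-ip> with your subnet gateway IP
```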
Linux – Ubuntu Operating System
Step 1
Follow the procedure below to complete the Step 1 tasks.
- Select the Subnet to change to.
- Enter the IP to change to.
- Click the IP Allocation Request button.
- When the popup confirming the IP change opens, click the Confirm button.
- If the task completes successfully, the task status in the upper right corner is displayed as Completed.

Caution
If you proceed with the IP allocation request in Step 1, you cannot cancel or restore the IP change.
Step 2
Follow the procedure below to complete the Step 2 tasks.

Connect to the IP change target server using the NAT IP to perform the IP change operation.

Guide
To prevent a situation where communication becomes impossible during the operation, it is recommended to connect via another Virtual Server or Bare Metal Server created in the same subnet.

Enter the IP assigned in Step 1 and set the IP to be changed on the server.
- In the following example, replace 172.17.34.150/24 with the assigned IP.
- After checking the information of the interface you want to change on the server, enter it instead of the example bond-srv.9.

```shell
[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml
network:
  bonds:
    ...................... omitted
  ethernets:
    ...................... omitted
  vlans:
    bond-srv.9:
      addresses:
      - 172.17.34.150/24   # Enter the IP assigned in Step 1.
      gateway4: 172.17.34.2
      id: 9
      link: bond-srv
      mtu: 1500
    bond-srv.350:
      addresses:
      - 172.16.87.150/24
      routes:
      - to: 172.17.87.0/24
        via: 172.16.87.1
      id: 350
      link: bond-srv
```
Code block. IP settings to change

Use the netplan apply command to apply the changes to the system.

```shell
[root@localhost ~]# netplan apply
```
Code block. Run netplan apply

Notice
- When you set the IP, the terminal session will be disconnected.
- After completing the Step 2 task, once the task status changes to Completed, you can reconnect to the terminal.

When all tasks are completed, select the task completion checkbox of Step 2 in the IP Change popup window.
- If the task completes successfully, the task status in the upper right corner is displayed as Completed.

Notice
- If the task status of Step 2 has changed to Completed but there is an issue with terminal access, go to the All Services > Management > Support Center Contact menu and submit an inquiry.
Step 3
Follow the procedure below to complete the Step 3 tasks.

Connect to the IP change target server using the NAT IP and check the communication status.
- Use the following command to check whether any pre-change configuration remains and whether the IP has been changed correctly. If you can connect normally to the IP change target server, the changed IP is communicating normally.

```shell
# bash /usr/local/bin/ip.sh
```
Code block. Communication status check

Reference
The NAT IP does not change.

Once all tasks are completed, restart the server and then perform a final check.

Reference
It is recommended to perform the final check after restarting the server.

If there are no issues in the final inspection results, select the task completion checkbox of Step 3 in the IP Change popup window.
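On Ubuntu, netplan also ships a `netplan try` mode that applies the configuration and automatically rolls it back after a timeout unless confirmed. This is a general netplan feature, not part of the official procedure above; note that in this IP change scenario the session is expected to disconnect, so an unconfirmed rollback would undo the intended change. It is mentioned here only as an option when rehearsing the edit:

```shell
# Apply the edited netplan config with an automatic rollback window.
netplan try --timeout 60
# Press ENTER within the timeout to keep the change; otherwise it reverts.
```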
Windows Operating System
Step 1
Follow the procedure below to complete the Step 1 tasks.
- Select the Subnet to change to.
- Enter the IP to change to.
- Click the IP Allocation Request button.
- When the popup confirming the IP change opens, click the Confirm button.
- If the task completes successfully, the task status in the upper right corner is displayed as Completed.

Caution
If you proceed with the IP allocation request in Step 1, you cannot cancel or restore the IP change.
Step 2
Follow the procedure below to complete the Step 2 tasks.

Connect to the IP change target server using the NAT IP to perform the IP change operation.

Guide
To prevent a situation where communication becomes impossible during the operation, it is recommended to connect via another Virtual Server or Bare Metal Server created in the same subnet.

Right-click the Windows Start icon, then run Windows PowerShell (Administrator).

Enter the IP assigned in Step 1 and set the IP to be changed on the server.
- In the following example, replace 172.17.34.150 with the assigned IP.

```shell
PS C:\> netsh interface ip set address "bond-srv.20" static 172.17.34.150 255.255.255.0
```
Code block. IP settings to change

Notice
- When you set the IP, the terminal session will be disconnected.
- After completing the Step 2 task, once the task status changes to Completed, you can reconnect to the terminal.

When all tasks are completed, select the task completion checkbox of Step 2 in the IP Change popup window.
- If the task completes successfully, the task status in the upper right corner is displayed as Completed.

Notice
- If the task status of Step 2 has changed to Completed but there is still an issue with terminal access, go to the All Services > Management > Support Center Contact Us menu and submit an inquiry.
Step 3
Follow the procedure below to complete the Step 3 tasks.

Connect to the IP change target server using the NAT IP and check the communication status.
- Use the following command to check whether any pre-change configuration remains and whether the IP has been changed correctly. If you can connect normally to the IP change target server, the changed IP is communicating normally.

```shell
PS C:\> Get-NetIPAddress | Format-Table
```
Code block. Communication status check

Reference
The NAT IP does not change.

Once all tasks are completed, restart the server and then perform a final check.

Reference
It is recommended to perform the final check after restarting the server.

If there are no issues in the final inspection results, select the task completion checkbox of Step 3 in the IP Change popup window.
Change to an IP in a different Subnet

This section explains how to set the IP for each operating system when the IP to be changed uses a different subnet.

Linux – CentOS/RedHat operating system
Step 1
Follow the procedure below to complete the Step 1 tasks.
- Select the Subnet to change to.
- Enter the IP to change to.
- Click the IP Allocation Request button.
- When the popup confirming the IP change opens, click the Confirm button.
- If the task completes successfully, the Check Vlan ID and Check Default Gateway information is displayed, and the task status in the upper right corner is shown as Completed.

Caution
If you proceed with the IP allocation request in Step 1, you cannot cancel or restore the IP change.
Step 2
Follow the procedure below to complete the Step 2 tasks.

Connect to the IP change target server using the NAT IP to perform the IP change operation.

Guide
To prevent a situation where communication becomes impossible during the operation, it is recommended to connect via another Virtual Server or Bare Metal Server created in the same subnet.

To add the new IP to the server, add a new VLAN and set the IP.
- Add VLAN: create the interface for the Vlan ID confirmed in Step 1. In the following example, replace 20 with the assigned ID.
- IP setting: enter the IP assigned in Step 1. In the following example, replace 192.168.0.10/24 with the assigned IP.

```shell
# nmcli conn add type vlan ifname "bond-srv.20" con-name "bond-srv.20" id 20 dev bond-srv
# nmcli con mod bond-srv.20 con-name "Vlan bond-srv.20"
# nmcli con mod "Vlan bond-srv.20" ipv4.addresses 192.168.0.10/24
# nmcli con mod "Vlan bond-srv.20" ipv4.method manual
# nmcli con mod "Vlan bond-srv.20" ipv6.method "ignore"
# nmcli con up "Vlan bond-srv.20"
```
Code block. IP settings to change

Set the Default Gateway on the new VLAN.
- Default Gateway setting: enter the Default Gateway IP assigned in Step 1. In the following example, replace 192.168.0.1 with the assigned Default Gateway IP.

```shell
# nmcli con mod "Vlan bond-srv.20" ipv4.gateway 192.168.0.1
# nmcli device reapply bond-srv.20
```
Code block. Default Gateway setting

Guide
- When you set the Default Gateway on the new VLAN, the terminal session will be disconnected.
- After completing the Step 2 task, once the task status changes to Completed, you can reconnect to the terminal.

When all tasks are completed, select the task completion checkbox of Step 2 in the IP Change popup window.
- If the task completes successfully, the task status in the upper right corner is displayed as Completed.

Guide
- If the task status of Step 2 has changed to Completed but there is still an issue with terminal access, go to the All Services > Management > Support Center Contact menu and submit an inquiry.
Step 3
Follow the procedure below to complete the Step 3 tasks.

Connect to the IP change target server using the NAT IP.

After checking the Default Gateway IP of the existing (pre-change) interface, delete it.
- In the following example, replace 192.168.10.1 with the verified IP.

```shell
# ip route del default via 192.168.10.1
```
Code block. Delete the Default Gateway IP of the existing interface

Connect to the IP change target server using the NAT IP and check the communication status.
- Use the following command to check whether any pre-change configuration remains and whether the IP has been changed correctly. If you can connect normally to the IP change target server, the changed IP is communicating normally.

```shell
# netstat -nr
# bash /usr/local/bin/ip.sh
```
Code block. Communication status check

Reference
The NAT IP does not change.

After checking the VLAN information of the existing IP, delete it from the server.
- In the following example, replace 30 with the verified ID.

```shell
# nmcli con delete "Vlan bond-srv.30"
```
Code block. Delete the VLAN information of the existing IP

Once all tasks are completed, restart the server and then perform a final check.

Reference
It is recommended to perform the final check after restarting the server.

If there are no issues in the final inspection results, select the task completion checkbox of Step 3 in the IP Change popup window.
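Across Steps 2 and 3 above, the order of operations matters: the new VLAN, IP, and gateway must be up before the old gateway and VLAN are removed, or the server loses its default route. Condensed as a commented sequence for reference only; the interface names, VLAN IDs, and addresses are the example values from the steps above and must be replaced with your own:

```shell
# Condensed order of operations for a different-subnet IP change (example values).
nmcli conn add type vlan ifname "bond-srv.20" con-name "Vlan bond-srv.20" id 20 dev bond-srv  # 1. add new VLAN
nmcli con mod "Vlan bond-srv.20" ipv4.addresses 192.168.0.10/24 ipv4.method manual            # 2. assign new IP
nmcli con mod "Vlan bond-srv.20" ipv4.gateway 192.168.0.1                                     # 3. set new gateway
nmcli device reapply bond-srv.20                                                              #    session drops here
ip route del default via 192.168.10.1                                                         # 4. remove old gateway
nmcli con delete "Vlan bond-srv.30"                                                           # 5. remove old VLAN last
```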
Linux – Ubuntu operating system
Step 1
Follow the procedure below to complete the Step 1 tasks.
- Select the Subnet to change to.
- Enter the IP to change to.
- Click the IP Allocation Request button.
- When the popup confirming the IP change opens, click the Confirm button.
- If the task completes successfully, the Check Vlan ID and Check Default Gateway information is displayed, and the task status in the upper right corner is shown as Completed.

Caution
If you proceed with the IP allocation request in Step 1, you cannot cancel or restore the IP change.
Step 2
Follow the procedure below to complete the Step 2 tasks.

Connect to the IP change target server using the NAT IP to perform the IP change operation.

Guide
To prevent a situation where communication becomes impossible during the operation, it is recommended to connect via another Virtual Server or Bare Metal Server created in the same subnet.

To add the IP to be changed to the server, add a new VLAN and set the IP and Default Gateway.
- In the following example, the bond-srv.20 block at the bottom is the part that is newly added.
- In the following example, enter the assigned ID and IP instead of the example ID and IP.

```shell
[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml
network:
  bonds:
    ...................... omitted
  ethernets:
    ...................... omitted
  vlans:
    bond-srv.9:
      addresses:
      - 172.17.34.150/24
      gateway4: 172.17.34.2
      id: 9
      link: bond-srv
      mtu: 1500
    bond-srv.350:
      addresses:
      - 172.16.87.150/24
      routes:
      - to: 172.17.87.0/24
        via: 172.16.87.1
      id: 350
      link: bond-srv
    bond-srv.20:              # Create the interface for the Vlan ID confirmed in Step 1.
      addresses:
      - 192.168.0.10/24       # Enter the IP assigned in Step 1.
      gateway4: 192.168.0.1   # Enter the Default Gateway IP assigned in Step 1.
      id: 20
      link: bond-srv
      mtu: 1500
```
Code block. IP settings to change

Use the netplan apply command to apply the changes to the system.

```shell
[root@localhost ~]# netplan apply
```
Code block. Run netplan apply

Notice
- When you set a new Default Gateway, the terminal session will be disconnected.
- After completing the Step 2 task, once the task status changes to Completed, you can reconnect to the terminal.

When all tasks are completed, select the task completion checkbox of Step 2 in the IP Change popup window.
- If the task completes successfully, the task status in the upper right corner is displayed as Completed.

Notice
- If the task status of Step 2 has changed to Completed but there is still an issue with terminal access, go to the All Services > Management > Support Center Contact menu and submit an inquiry.
Step 3
Follow the procedure below to complete the Step 3 tasks.

Connect to the IP change target server using the NAT IP.

After checking the Default Gateway IP of the existing (pre-change) interface, delete it.
- In the following example, the line marked "Delete this line" is the part that gets deleted.

```shell
[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml
network:
  bonds:
    ...................... omitted
  ethernets:
    ...................... omitted
  vlans:
    bond-srv.9:
      addresses:
      - 172.17.34.150/24
      gateway4: 172.17.34.2   # Delete this line
      id: 9
      link: bond-srv
      mtu: 1500
    bond-srv.350:
      addresses:
      - 172.16.87.150/24
      routes:
      - to: 172.17.87.0/24
        via: 172.16.87.1
      id: 350
      link: bond-srv
    bond-srv.20:
      addresses:
      - 192.168.0.10/24
      gateway4: 192.168.0.1
      id: 20
      link: bond-srv
      mtu: 1500
```
Code block. Delete the Default Gateway IP of the existing interface

Connect to the IP change target server using the NAT IP and check the communication status.
- Use the following command to check whether any pre-change configuration remains and whether the IP has been changed correctly. If you can connect normally to the IP change target server, the changed IP is communicating normally.

```shell
# netstat -nr
# bash /usr/local/bin/ip.sh
```
Code block. Communication status check

Reference
The NAT IP does not change.

Delete the existing IP.
- In the following example, the lines marked "Delete this line" are the part that gets deleted.

```shell
[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml
network:
  bonds:
    ...................... omitted
  ethernets:
    ...................... omitted
  vlans:
    bond-srv.9:             # Delete this line
      addresses:            # Delete this line
      - 172.17.34.150/24    # Delete this line
      gateway4: 172.17.34.2 # Delete this line
      id: 9                 # Delete this line
      link: bond-srv        # Delete this line
      mtu: 1500             # Delete this line
    bond-srv.350:
      addresses:
      - 172.16.87.150/24
      routes:
      - to: 172.17.87.0/24
        via: 172.16.87.1
      id: 350
      link: bond-srv
    bond-srv.20:
      addresses:
      - 192.168.0.10/24
      gateway4: 192.168.0.1
      id: 20
      link: bond-srv
      mtu: 1500
```
Code block. Delete the existing IP

Apply the modified items to the system.

```shell
[root@localhost ~]# netplan apply
[root@localhost ~]# ip link delete bond-srv.9   # Additional action when the VLAN is deleted
```
Code block. Apply changes

When all tasks are completed, restart the server and then conduct a final check.
- If there are no issues in the final inspection results, select the task completion checkbox of Step 3 in the IP Change popup window.
Windows Operating System
Step 1
Follow the procedure below to complete the Step 1 tasks.
- Select the Subnet to change to.
- Enter the IP to change to.
- Click the IP Allocation Request button.
- When the popup confirming the IP change opens, click the Confirm button.
- If the task completes successfully, the Check Vlan ID and Check Default Gateway information is displayed, and the task status in the upper right corner is shown as Completed.

Caution
If you proceed with the IP allocation request in Step 1, you cannot cancel or restore the IP change.
Step 2
Follow the procedure below to complete the Step 2 tasks.

Connect to the IP change target server using the NAT IP to perform the IP change operation.

Guide
To prevent a situation where communication becomes impossible during the operation, it is recommended to connect via another Virtual Server or Bare Metal Server created in the same subnet.

Right-click the Windows Start icon, then run Windows PowerShell (Administrator).

Add a new VLAN and set the IP and Default Gateway.
- Add VLAN: create the interface for the Vlan ID confirmed in Step 1. In the following example, replace 20 with the assigned ID.
- IP setting: enter the IP assigned in Step 1. In the following example, replace 46 with the ifIndex confirmed by Get-NetAdapter, and replace 192.168.0.10 with the assigned IP.
- Default Gateway setting: enter the Default Gateway IP assigned in Step 1. In the following example, replace 192.168.0.1 with the assigned Default Gateway IP.

```shell
PS C:\> Add-NetLbfoTeamNIC -Team bond_bond-srv -VlanID 20 -Name bond-srv.20 -Confirm:$false
PS C:\> Get-NetAdapter
PS C:\> New-NetIPAddress -InterfaceIndex 46 -IPAddress 192.168.0.10 -PrefixLength 24 -DefaultGateway 192.168.0.1
```
Code block. IP settings to change

Guide
- When you set a new Default Gateway, the terminal session will be disconnected.
- After completing the Step 2 task, once the task status changes to Completed, you can reconnect to the terminal.

When all tasks are completed, select the task completion checkbox of Step 2 in the IP Change popup window.
- If the task completes successfully, the task status in the upper right corner is displayed as Completed.

Guide
- If the task status of Step 2 has changed to Completed but there is still an issue with terminal access, go to the All Services > Management > Support Center Contact menu and submit an inquiry.
Step 3
Follow the next procedure and proceed with Step 3 work.
Connect to the IP change target server using NAT IP.
Run the interface index (ifindex) to check the existing Default Gateway IP.
Color modePS C:\> Get-NetAdapter Name InterfaceDescription ifIndex Status MacAddress LinkSpeed ---- -------------------- ------- ------ ---------- --------- bond-srv.9 Microsoft Network Adapter Multiple...#4 30 Up 40-A6-B7-27-96-D5 50 Gbps bond-srv Microsoft Network Adapter Multiple...#3 19 Up 40-A6-B7-27-96-D5 50 Gbps bond-iscsi Microsoft Network Adapter Multiple...#2 18 Up 40-A6-B7-27-96-D4 50 Gbps bond-backup Microsoft Network Adapter Multiplexo... 22 Up 68-05-CA-C9-EB-88 20 Gbps eno2 Intel(R) Ethernet Connection X722 fo... 12 Disabled 38-68-DD-36-A0-59 1 Gbps ens3f0 Intel(R) Ethernet Network Adapter XX... 11 Up 40-A6-B7-27-96-D4 25 Gbps ………………………………………….. omittedPS C:\> Get-NetAdapter Name InterfaceDescription ifIndex Status MacAddress LinkSpeed ---- -------------------- ------- ------ ---------- --------- bond-srv.9 Microsoft Network Adapter Multiple...#4 30 Up 40-A6-B7-27-96-D5 50 Gbps bond-srv Microsoft Network Adapter Multiple...#3 19 Up 40-A6-B7-27-96-D5 50 Gbps bond-iscsi Microsoft Network Adapter Multiple...#2 18 Up 40-A6-B7-27-96-D4 50 Gbps bond-backup Microsoft Network Adapter Multiplexo... 22 Up 68-05-CA-C9-EB-88 20 Gbps eno2 Intel(R) Ethernet Connection X722 fo... 12 Disabled 38-68-DD-36-A0-59 1 Gbps ens3f0 Intel(R) Ethernet Network Adapter XX... 11 Up 40-A6-B7-27-96-D4 25 Gbps ………………………………………….. omittedCode block. 
Get-NetAdapter execution Color modePS C:\> get-netroute -ifindex 30 ifIndex DestinationPrefix NextHop RouteMetric PolicyStore ------- ----------------- ------- ----------- ----------- 30 255.255.255.255/32 0.0.0.0 256 ActiveStore 30 224.0.0.0/4 0.0.0.0 256 ActiveStore 30 172.17.35.0/24 172.17.35.1 256 ActiveStore 30 172.17.34.255/32 0.0.0.0 256 ActiveStore 30 172.17.34.14/32 0.0.0.0 256 ActiveStore 30 172.17.34.0/24 0.0.0.0 256 ActiveStore 30 0.0.0.0/0 172.17.34.1 1 ActiveStorePS C:\> get-netroute -ifindex 30 ifIndex DestinationPrefix NextHop RouteMetric PolicyStore ------- ----------------- ------- ----------- ----------- 30 255.255.255.255/32 0.0.0.0 256 ActiveStore 30 224.0.0.0/4 0.0.0.0 256 ActiveStore 30 172.17.35.0/24 172.17.35.1 256 ActiveStore 30 172.17.34.255/32 0.0.0.0 256 ActiveStore 30 172.17.34.14/32 0.0.0.0 256 ActiveStore 30 172.17.34.0/24 0.0.0.0 256 ActiveStore 30 0.0.0.0/0 172.17.34.1 1 ActiveStoreCode block. -ifindex execution Delete the existing Default Gateway IP.
- In the following example, replace
30with the ifindex obtained via Get-NetAdapter, and replace172.17.34.1with the IP you verified.Color modePS C:\> Remove-NetRoute -ifIndex 30 -DestinationPrefix 0.0.0.0/0 -NextHop 172.17.34.1 -Confirm:$falsePS C:\> Remove-NetRoute -ifIndex 30 -DestinationPrefix 0.0.0.0/0 -NextHop 172.17.34.1 -Confirm:$falseCode block. Delete Default Gateway IP Color modePS C:\> get-netroute -ifindex 30 ifIndex DestinationPrefix NextHop RouteMetric PolicyStore ------- ----------------- ------- ----------- ----------- 30 255.255.255.255/32 0.0.0.0 256 ActiveStore 30 224.0.0.0/4 0.0.0.0 256 ActiveStore 30 172.17.34.255/32 0.0.0.0 256 ActiveStore 30 172.17.34.14/32 0.0.0.0 256 ActiveStore 30 172.17.34.0/24 0.0.0.0 256 ActiveStorePS C:\> get-netroute -ifindex 30 ifIndex DestinationPrefix NextHop RouteMetric PolicyStore ------- ----------------- ------- ----------- ----------- 30 255.255.255.255/32 0.0.0.0 256 ActiveStore 30 224.0.0.0/4 0.0.0.0 256 ActiveStore 30 172.17.34.255/32 0.0.0.0 256 ActiveStore 30 172.17.34.14/32 0.0.0.0 256 ActiveStore 30 172.17.34.0/24 0.0.0.0 256 ActiveStoreCode block. -ifindex execution Notice- If you delete the existing Default Gateway, the terminal session will be disconnected.
- After completing the Step 2 task, once the task status changes to Completed, you can reconnect to the terminal.
Check the communication status by connecting to the IP change target server with a NAT IP.
- Use the following commands to check that no stale configuration remains and that the change was applied correctly. If you can connect normally to the target server whose IP was changed, the changed IP is communicating normally.

```
PS C:\> netstat -nr | findstr Default
PS C:\> Get-NetIPAddress | Format-Table
```

Code block. Communication status check

Reference: The NAT IP does not change.
Check the existing IP’s VLAN information in the Team information.
```
PS C:\> Get-NetLbfoTeam

Name                   : bond_bond-srv
Members                : {ens6f1, ens3f1}
TeamNics               : {bond-srv, bond-srv.9}
TeamingMode            : SwitchIndependent
LoadBalancingAlgorithm : Dynamic
Status                 : Up

Name                   : bond_bond-iscsi
Members                : {ens6f0, ens3f0}
TeamNics               : bond-iscsi
TeamingMode            : SwitchIndependent
LoadBalancingAlgorithm : Dynamic
Status                 : Up

Name                   : bond_bond-backup
Members                : {ens2f0, ens4f1}
TeamNics               : bond-backup
TeamingMode            : SwitchIndependent
LoadBalancingAlgorithm : Dynamic
Status                 : Up

PS C:\> Get-NetAdapter

Name        InterfaceDescription                    ifIndex Status   MacAddress        LinkSpeed
----        --------------------                    ------- ------   ----------        ---------
bond-srv.9  Microsoft Network Adapter Multiple...#4 30      Up       40-A6-B7-27-96-D5 50 Gbps
bond-srv    Microsoft Network Adapter Multiple...#3 19      Up       40-A6-B7-27-96-D5 50 Gbps
bond-iscsi  Microsoft Network Adapter Multiple...#2 18      Up       40-A6-B7-27-96-D4 50 Gbps
bond-backup Microsoft Network Adapter Multiplexo... 22      Up       68-05-CA-C9-EB-88 20 Gbps
eno2        Intel(R) Ethernet Connection X722 fo... 12      Disabled 38-68-DD-36-A0-59 1 Gbps
ens3f0      Intel(R) Ethernet Network Adapter XX... 11      Up       40-A6-B7-27-96-D4 25 Gbps
(omitted)
```

Code block. Run Get-NetLbfoTeam

Delete the existing IP’s VLAN information from the server.
- In the following example, enter the verified VLAN ID instead of `30`.

```
PS C:\> Remove-NetLbfoTeamNic -Team bond_bond-srv -VlanID 30 -Confirm:$false
PS C:\> Get-NetAdapter

Name        InterfaceDescription                    ifIndex Status   MacAddress        LinkSpeed
----        --------------------                    ------- ------   ----------        ---------
bond-srv    Microsoft Network Adapter Multiple...#3 19      Up       40-A6-B7-27-96-D5 50 Gbps
bond-iscsi  Microsoft Network Adapter Multiple...#2 18      Up       40-A6-B7-27-96-D4 50 Gbps
bond-backup Microsoft Network Adapter Multiplexo... 22      Up       68-05-CA-C9-EB-88 20 Gbps
eno2        Intel(R) Ethernet Connection X722 fo... 12      Disabled 38-68-DD-36-A0-59 1 Gbps
ens3f0      Intel(R) Ethernet Network Adapter XX... 11      Up       40-A6-B7-27-96-D4 25 Gbps
(omitted)
```

Code block. Run Get-NetLbfoTeam
Once all tasks are completed, restart the server and then perform a final check.
Reference: It is recommended to perform the final check after restarting the server. If there are no issues in the final inspection results, select the work completion checkbox of Step 3 in the IP Change popup window.
4.2.1 - Installing ServiceWatch Agent
Users can install the ServiceWatch Agent on Bare Metal Server to collect custom metrics and logs.
ServiceWatch Agent
The agents that need to be installed on Bare Metal Server for ServiceWatch’s custom metric and log collection can be broadly divided into two types: Prometheus Exporter and OpenTelemetry Collector.
| Category | Description |
|---|---|
| Prometheus Exporter | Provides metrics of specific applications or services in a format that Prometheus can scrape |
| OpenTelemetry Collector | Serves as a centralized collector that gathers telemetry data such as metrics and logs from distributed systems, processes them (filtering, sampling, etc.), and sends them to multiple backends (e.g., Prometheus, Jaeger, Elasticsearch) |
Prerequisites for Using ServiceWatch Agent
To use the ServiceWatch Agent, please prepare the prerequisites by referring to Pre-environment Setup for ServiceWatch Agent.
4.2.2 - Setting up RHEL Repo
The Samsung Cloud Platform Console provides the SCP RHEL Repository to support user environments where external access is restricted, such as VPC Private Subnets. By using the SCP RHEL Repository, you can install and download the same packages as the official RHEL Repository.
- If you created a RHEL server through the Samsung Cloud Platform Console before August 2025, you must modify the RHEL Repository settings.
- Because the SCP RHEL Repository synchronizes with each Region's Local Repository on an internal schedule, switching to an external public mirror site is recommended if you need to apply the latest patches quickly.
- Samsung Cloud Platform provides the latest repository for the specified major version.
RHEL Repository Configuration Guide
When using RHEL, you can configure the SCP RHEL Repository to install and download the same packages as the official RHEL Repository. To set up the RHEL repository, follow these steps.
On the Bare Metal Server, as the OS root user, use the cat command to check the `/etc/yum.repos.d/scp.rhel8.repo` or `/etc/yum.repos.d/scp.rhel9.repo` configuration.

```
cat /etc/yum.repos.d/scp.rhel8.repo
```

Code block. Check repo configuration (RHEL 8)

```
cat /etc/yum.repos.d/scp.rhel9.repo
```

Code block. Check repo configuration (RHEL 9)

- When checking the configuration file, the following result is displayed.

```
[rhel-8-baseos]
name=rhel-8-baseos
gpgcheck=0
enabled=1
baseurl=http://scp-rhel8-ip/rhel/8/baseos

[rhel-8-baseos-debug]
name=rhel-8-baseos-debug
gpgcheck=0
enabled=1
baseurl=http://scp-rhel8-ip/rhel/8/baseos-debug

[rhel-8-appstream]
name=rhel-8-appstream
gpgcheck=0
enabled=1
baseurl=http://scp-rhel8-ip/rhel/8/appstream
```

Code block. Check repo configuration (RHEL 8)

```
[rhel-9-for-x86_64-baseos-rpms]
name=rhel-9-for-x86_64-baseos-rpms
baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/baseos
gpgcheck=0
enabled=1

[rhel-9-for-x86_64-appstream-rpms]
name=rhel-9-for-x86_64-appstream-rpms
baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/appstream
gpgcheck=0
enabled=1

[codeready-builder-for-rhel-9-x86_64-rpms]
name=codeready-builder-for-rhel-9-x86_64-rpms
baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/codeready-builder
gpgcheck=0
enabled=1

[rhel-9-for-x86_64-highavailability-rpms]
name=rhel-9-for-x86_64-highavailability-rpms
baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/ha
gpgcheck=0
enabled=1

[rhel-9-for-x86_64-supplementary-rpms]
name=rhel-9-for-x86_64-supplementary-rpms
baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/supplementary
gpgcheck=0
enabled=1
```

Code block. Check repo configuration (RHEL 9)
Open the `/etc/hosts` file using a text editor (e.g., vim).

Edit the `/etc/hosts` file with the following content and save it.

```
198.19.2.13 scp-rhel8-ip scp-rhel9-ip scp-rhel-ip
```

Code block. Change /etc/hosts file settings

Use the yum command to verify the RHEL Repository connection configured on the server.
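The hosts-file edit can also be scripted idempotently. A minimal sketch, assuming the IP from the guide; the file name `hosts.sample` is a stand-in for `/etc/hosts` so you can rehearse the change before applying it as root:

```shell
# Append the SCP RHEL Repository host mapping only if it is missing.
# hosts.sample stands in for /etc/hosts; point HOSTS_FILE at the real
# file (as root) once you have verified the result.
HOSTS_FILE="hosts.sample"
ENTRY="198.19.2.13 scp-rhel8-ip scp-rhel9-ip scp-rhel-ip"

touch "$HOSTS_FILE"
if ! grep -q "scp-rhel8-ip" "$HOSTS_FILE"; then
    echo "$ENTRY" >> "$HOSTS_FILE"
fi

# Show the resulting entry.
grep "scp-rhel" "$HOSTS_FILE"
```

Because the append is guarded by the grep check, re-running the snippet leaves a single entry.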
```
yum repolist -v
```

Code block. Check repository connection settings

- If the RHEL repository is successfully connected, you can view the repository list.

```
Repo-id            : rhel-8-appstream
Repo-name          : rhel-8-appstream
Repo-revision      : 1718903734
Repo-updated       : Fri 21 Jun 2024 02:15:34 AM KST
Repo-pkgs          : 38,260
Repo-available-pkgs: 25,799
Repo-size          : 122 G
Repo-baseurl       : http://scp-rhel8-ip/rhel/8/appstream
Repo-expire        : 172,800 second(s) (last: Thu 08 Aug 2024 07:27:57 AM KST)
Repo-filename      : /etc/yum.repos.d/scp.rhel8.repo

Repo-id            : rhel-8-baseos
Repo-name          : rhel-8-baseos
Repo-revision      : 1718029433
Repo-updated       : Mon 10 Jun 2024 11:23:52 PM KST
Repo-pkgs          : 17,487
Repo-available-pkgs: 17,487
Repo-size          : 32 G
Repo-baseurl       : http://scp-rhel8-ip/rhel/8/baseos
Repo-expire        : 172,800 second(s) (last: Thu 08 Aug 2024 07:27:57 AM KST)
Repo-filename      : /etc/yum.repos.d/scp.rhel8.repo

Repo-id            : rhel-8-baseos-debug
Repo-name          : rhel-8-baseos-debug
Repo-revision      : 1717662461
Repo-updated       : Thu 06 Jun 2024 05:27:41 PM KST
Repo-pkgs          : 17,078
Repo-available-pkgs: 17,078
Repo-size          : 100 G
Repo-baseurl       : http://scp-rhel8-ip/rhel/8/baseos-debug
Repo-expire        : 172,800 second(s) (last: Thu 08 Aug 2024 07:27:57 AM KST)
Repo-filename      : /etc/yum.repos.d/scp.rhel8.repo
```

Code block. Check repository list
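Before contacting the repository with `yum repolist -v`, you can also sanity-check the repo file itself with a pure text scan. The sketch below is illustrative: it writes a small sample file mirroring the RHEL 8 configuration shown above (the `.sample` file name is not part of the product) and prints each repo id with its baseurl, all of which should point at the scp-rhel8-ip host:

```shell
# Write a sample repo file mirroring the RHEL 8 configuration above,
# then print each repo id together with its baseurl.
cat > scp.rhel8.repo.sample <<'EOF'
[rhel-8-baseos]
name=rhel-8-baseos
gpgcheck=0
enabled=1
baseurl=http://scp-rhel8-ip/rhel/8/baseos
[rhel-8-appstream]
name=rhel-8-appstream
gpgcheck=0
enabled=1
baseurl=http://scp-rhel8-ip/rhel/8/appstream
EOF

# For each [section], remember the id; on each baseurl line, print both.
awk -F= '/^\[/ {id = $0} /^baseurl/ {print id, $2}' scp.rhel8.repo.sample
```

Run the same awk line against `/etc/yum.repos.d/scp.rhel8.repo` on the server to review your real configuration at a glance.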
4.3 - API Reference
4.4 - CLI Reference
4.5 - Release Note
Bare Metal Server
- BM 4th generation based on Intel 6th generation (Granite Rapids) Processor has been released.
- Bare Metal Server s4 and h4 server types have been added.
- For details on s4/h4 server types, please refer to s3/h3 server types.
- Local disk partition feature provided
- You can now create and use up to 10 local disk partitions.
- You can release multiple resources simultaneously from the Bare Metal Server list.
- You can change the IP of a general Subnet.
- OS Images have been added.
- RHEL 8.10, Ubuntu 24.04
- Bare Metal Server features added
- Distributes servers belonging to the same Placement Group across different racks.
- OS Images added (RHEL 9.4, Rocky Linux 8.6, Rocky Linux 9.4)
- 3rd generation (s3/h3) server types based on Intel 4th generation (Sapphire Rapids) Processor added. For details, please refer to Bare Metal Server Server Types.
- Samsung Cloud Platform common feature changes
- Common CX changes for Account, IAM and Service Home, tags, etc. have been reflected.
- Bare Metal Server service has been officially released.
- Bare Metal Server service has been released, which allows customers to exclusively use physical servers without virtualization.
5 - Multi-node GPU Cluster
5.1 - Overview
Service Overview
Multi-node GPU Cluster is a service that provides physical GPU servers without virtualization for large-scale high-performance AI calculations. It can cluster multiple GPUs using two or more bare metal servers with GPUs, and can be used conveniently with Samsung Cloud Platform’s high-performance storage and networking services.
Provided Features
Multi-node GPU Cluster provides the following functions.
- Auto Provisioning and Management: Through the web-based Console, you can easily use the standard GPU Bare Metal model server with 8 GPU cards, from provisioning to resource and cost management.
- Network Connection: Two or more Bare Metal Servers can be clustered through high-speed interconnects to use many GPUs together, and by configuring a GPU Direct RDMA (Remote Direct Memory Access) environment, data can move directly between GPU memories, enabling high-speed AI/machine learning computation.
- Storage Connection: Provides various attachable storages in addition to the OS disk. High-performance SSD-based NAS File Storage connected over a high-speed network, as well as Block Storage and Object Storage, can be used together.
- Network Setting Management: The server's Subnet/IP can easily be changed from its initially set values. NAT IP can be enabled or disabled as needed.
- Monitoring: You can check the monitoring information of computing resources such as CPU, GPU, Memory, Disk, etc. through Cloud Monitoring. To use the Cloud Monitoring service for Multi-node GPU Cluster, you need to install the Agent. Please install the Agent for stable service use. For more information, please refer to Multi-node GPU Cluster Monitoring Metrics.
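As a rough readiness check for the GPU Direct RDMA environment mentioned above, you can look for the relevant kernel modules on a node. This is a hedged sketch, not an official verification procedure: the module names (`nvidia_peermem`, `ib_core`) assume a recent NVIDIA driver with an InfiniBand/RDMA stack, and the `rdma` tool comes from iproute2.

```shell
# Read-only check: report whether the kernel modules commonly involved
# in GPU Direct RDMA are loaded. Missing modules are reported, not fatal.
for mod in nvidia nvidia_peermem ib_core; do
    if lsmod 2>/dev/null | grep -q "^${mod}[[:space:]]"; then
        echo "module ${mod}: loaded"
    else
        echo "module ${mod}: not found"
    fi
done

# RDMA link state, if the iproute2 rdma tool is installed.
rdma link show 2>/dev/null || echo "rdma tool not available"
```

If a module is reported as not found, check the GPU driver and RDMA stack installation before relying on GPU Direct RDMA performance.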
Component
Multi-node GPU Cluster provides GPU as a Bare Metal Server type with standard images and server types, and NVSwitch and NVLink are provided.
GPU (H100)
A GPU (Graphics Processing Unit) is specialized in parallel computation that can process large amounts of data quickly, enabling large-scale parallel processing in fields such as artificial intelligence (AI) and data analysis.
The following are the specifications of the GPU Type provided by the Multi-node GPU Cluster service.
| Classification | H100 Type |
|---|---|
| Product Provisioning Method | Bare Metal |
| GPU Architecture | NVIDIA Hopper |
| GPU Memory | 80GB |
| GPU Transistors | 80 billion (TSMC 4N process) |
| GPU Tensor Performance(based on FP16) | 989.4 TFLOPs, 1,978.9 TFLOPs* |
| GPU Memory Bandwidth | 3,352 GB/sec HBM3 |
| GPU CUDA Cores | 16,896 Cores |
| GPU Tensor Cores | 528(4th Generation) |
| NVLink performance | NVLink 4 |
| Total NVLink bandwidth | 900 GB/s |
| NVLink Signaling Rate | 25 Gbps (x18) |
| NVSwitch performance | NVSwitch 3 |
| NVSwitch GPU bandwidth | 900 GB/s |
| Total NVSwitch Aggregate Bandwidth | 7.2TB/s |
* With Sparsity
OS and GPU Driver Version
The operating systems (OS) supported by Multi-node GPU Cluster are as follows.
| OS | OS version | GPU driver version |
|---|---|---|
| Ubuntu | 22.04 | 535.86.10, 535.183.06 |
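To confirm a node is running one of the supported driver versions, you can query the driver with `nvidia-smi`. A minimal sketch, guarded for machines where `nvidia-smi` is not yet on the PATH; the supported-version list is taken from the table above:

```shell
# Compare the installed NVIDIA driver version against the versions
# supported on Ubuntu 22.04 (from the table above).
SUPPORTED="535.86.10 535.183.06"

if command -v nvidia-smi >/dev/null 2>&1; then
    version=$(nvidia-smi --query-gpu=driver_version --format=csv,noheader | head -n 1)
    case " $SUPPORTED " in
        *" $version "*) echo "driver $version: supported" ;;
        *)              echo "driver $version: not in the supported list" ;;
    esac
else
    echo "nvidia-smi not found; install the NVIDIA GPU driver first"
fi
```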
Server Type
The server types provided by Multi-node GPU Cluster are as follows. For a detailed description of the server types provided by Multi-node GPU Cluster, please refer to Multi-node GPU Cluster server type.
g2c96h8_metal
| Classification | Example | Detailed Description |
|---|---|---|
| Server Generation | g2 | Provided server generation |
| CPU | c96 | Number of cores |
| GPU | h8 | GPU type and quantity |
Preceding Service
This is a list of services that must be pre-configured before creating this service. Please refer to the guide provided for each service and prepare in advance for more details.
| Service Category | Service | Detailed Description |
|---|---|---|
| Networking | VPC | A service that provides an independent virtual network in a cloud environment |
5.1.1 - Server Type
Multi-node GPU Cluster Server Type
Multi-node GPU Cluster is divided based on the provided GPU Type, and the GPU used in the Multi-node GPU Cluster is determined by the server type selected when creating a GPU Node. Please select the server type according to the specifications of the application you want to run in the Multi-node GPU Cluster.
The server types supported by Multi-node GPU Cluster are in the following format:
g2c96h8_metal
| Classification | Example | Detailed Description |
|---|---|---|
| Server Generation | g2 | Provided server generation |
| CPU | c96 | Number of cores |
| GPU | h8 | GPU type and quantity |
g2 Server Type
The g2 server type is a GPU Bare Metal Server using NVIDIA H100 Tensor Core GPU, suitable for large-scale high-performance AI computing.
- Provides up to 8 NVIDIA H100 Tensor Core GPUs
- Each GPU has 16,896 CUDA cores and 528 Tensor cores
- Supports up to 96 vCPUs and 1,920 GB of memory
- Supports up to 100 Gbps networking speed
- 900 GB/s GPU P2P communication via NVIDIA NVSwitch
| Server Type | GPU | GPU Memory | CPU(Core) | Memory | Disk | GPU P2P |
|---|---|---|---|---|---|---|
| g2c96h8_metal | H100 | 640 GB | 96 vCore | 2 TB | SSD(OS) 960 GB * 2, NVMeSSD 3.84 TB * 4 | 900GB/s NVSwitch |
5.1.2 - Monitoring Metrics
Multi-node GPU Cluster monitoring metrics
The following table shows the monitoring metrics of Multi-node GPU Cluster that can be checked through Cloud Monitoring.
Multi-node GPU Cluster [Cluster]
| Performance Item | Detailed Description | Unit |
|---|---|---|
| Memory Total [Basic] | Total memory bytes | bytes |
| Memory Used [Basic] | Currently used memory bytes | bytes |
| Memory Swap In [Basic] | Swapped-in memory bytes | bytes |
| Memory Swap Out [Basic] | Swapped-out memory bytes | bytes |
| Memory Free [Basic] | Unused memory bytes | bytes |
| Disk Read Bytes [Basic] | Read bytes | bytes |
| Disk Read Requests [Basic] | Number of Read Requests | cnt |
| Disk Write Bytes [Basic] | Write bytes | bytes |
| Disk Write Requests [Basic] | Write Request Count | cnt |
| CPU Usage [Basic] | 1-minute average system CPU usage rate | % |
| Instance State [Basic] | Instance Status | state |
| Network In Bytes [Basic] | Received bytes | bytes |
| Network In Dropped [Basic] | Receive Packet Drop | cnt |
| Network In Packets [Basic] | Received Packet Count | cnt |
| Network Out Bytes [Basic] | Transmission bytes | bytes |
| Network Out Dropped [Basic] | Transmission Packet Drop | cnt |
| Network Out Packets [Basic] | Transmission Packet Count | cnt |
| Performance Item | Detailed Description | Unit |
|---|---|---|
| Cluster GPU Count | Sum of GPU counts in the cluster | cnt |
| Cluster GPU Count In Use | Number of GPUs used by jobs in the cluster | cnt |
| Cluster GPU Usage | Average GPU utilization in the cluster | % |
| Cluster GPU Memory Usage [Avg] | Average GPU memory utilization in the cluster | % |
Multi-node GPU Cluster [Node]
| Performance Item | Detailed Description | Unit |
|---|---|---|
| Memory Total [Basic] | Total memory bytes | bytes |
| Memory Used [Basic] | Currently used memory bytes | bytes |
| Memory Swap In [Basic] | Swapped-in memory bytes | bytes |
| Memory Swap Out [Basic] | Swapped-out memory bytes | bytes |
| Memory Free [Basic] | Unused memory bytes | bytes |
| Disk Read Bytes [Basic] | Read bytes | bytes |
| Disk Read Requests [Basic] | Read request count | cnt |
| Disk Write Bytes [Basic] | Write bytes | bytes |
| Disk Write Requests [Basic] | Write Request Count | cnt |
| CPU Usage [Basic] | 1-minute average system CPU usage rate | % |
| Instance State [Basic] | Instance Status | state |
| Network In Bytes [Basic] | Received bytes | bytes |
| Network In Dropped [Basic] | Received Packet Drop | cnt |
| Network In Packets [Basic] | Received Packet Count | cnt |
| Network Out Bytes [Basic] | Transmission bytes | bytes |
| Network Out Dropped [Basic] | Transmission Packet Drop | cnt |
| Network Out Packets [Basic] | Transmission packet count | cnt |
| Performance Item | Detailed Description | Unit |
|---|---|---|
| GPU Count | GPU count | cnt |
| GPU Temperature | GPU temperature | ℃ |
| GPU Usage | GPU utilization rate | % |
| GPU Usage [Avg] | Overall average GPU usage rate | % |
| GPU Power Cap | Maximum power capacity of the GPU | W |
| GPU Power Usage | Current power usage of the GPU | W |
| GPU Memory Usage [Avg] | Average GPU memory utilization | % |
| GPU Count in use | Number of GPUs on the node running jobs | cnt |
| Execution Status for nvidia-smi | nvidia-smi command execution result | status |
| Core Usage [IO Wait] | The ratio of CPU time spent in waiting state (disk waiting) | % |
| Core Usage [System] | The ratio of CPU time spent in kernel space | % |
| Core Usage [User] | The ratio of CPU time spent in user space | % |
| CPU Cores | The number of CPU cores on the host. Unnormalized CPU usage ratios can reach 100% multiplied by the number of cores. | cnt |
| CPU Usage [Active] | Percentage of CPU time used excluding Idle and IOWait states (if all 4 cores are used at 100%: 400%) | % |
| CPU Usage [Idle] | The ratio of CPU time spent in idle state. | % |
| CPU Usage [IO Wait] | The percentage of CPU time spent in waiting state (disk waiting) | % |
| CPU Usage [System] | Percentage of CPU time used by the kernel (in case of using all 4 cores 100%: 400%) | % |
| CPU Usage [User] | Percentage of CPU time used in the user area. (In case of using all 4 cores 100%, 400%) | % |
| CPU Usage/Core [Active] | Percentage of CPU time used excluding Idle and IOWait states (normalized value by number of cores, 100% if all 4 cores are used at 100%) | % |
| CPU Usage/Core [Idle] | The ratio of CPU time spent in idle state. | % |
| CPU Usage/Core [IO Wait] | The ratio of CPU time spent in waiting state (disk waiting) | % |
| CPU Usage/Core [System] | Percentage of CPU time used by the kernel (normalized value by number of cores, 100% if all 4 cores are used at 100%) | % |
| CPU Usage/Core [User] | Percentage of CPU time used in the user area. (normalized value by number of cores, 100% if all 4 cores are used at 100%) | % |
| Disk CPU Usage [IO Request] | The ratio of CPU time spent executing input/output requests for the device (device bandwidth utilization). If this value is close to 100%, the device is in a saturated state. | % |
| Disk Queue Size [Avg] | The average queue length of requests executed for the device. | num |
| Disk Read Bytes | The number of bytes read from the device per second. | bytes |
| Disk Read Bytes [Delta Avg] | Average of system.diskio.read.bytes_delta for each disk | bytes |
| Disk Read Bytes [Delta Max] | Individual disks’ system.diskio.read.bytes_delta maximum | bytes |
| Disk Read Bytes [Delta Min] | Individual disks’ minimum system.diskio.read.bytes_delta | bytes |
| Disk Read Bytes [Delta Sum] | The sum of system.diskio.read.bytes_delta of individual disks | bytes |
| Disk Read Bytes [Delta] | Delta value of system.diskio.read.bytes for each disk | bytes |
| Disk Read Bytes [Success] | The total number of bytes read successfully. On Linux, a sector size of 512 bytes is assumed, so this is the number of sectors read multiplied by 512 | bytes |
| Disk Read Requests | The number of read requests for the disk device in 1 second | cnt |
| Disk Read Requests [Delta Avg] | Average of system.diskio.read.count_delta for each disk | cnt |
| Disk Read Requests [Delta Max] | Maximum of system.diskio.read.count_delta for individual disks | cnt |
| Disk Read Requests [Delta Min] | Minimum of system.diskio.read.count_delta for each disk | cnt |
| Disk Read Requests [Delta Sum] | Sum of system.diskio.read.count_delta of individual disks | cnt |
| Disk Read Requests [Success Delta] | Individual disk’s system.diskio.read.count delta | cnt |
| Disk Read Requests [Success] | Total number of successful read completions | cnt |
| Disk Request Size [Avg] | The average size of requests executed for the device (unit: sector) | num |
| Disk Service Time [Avg] | The average service time (in milliseconds) for input requests executed on the device. | ms |
| Disk Wait Time [Avg] | The average time spent on requests executed for supported devices. | ms |
| Disk Wait Time [Read] | Disk Average Wait Time | ms |
| Disk Wait Time [Write] | Disk Average Wait Time | ms |
| Disk Write Bytes [Delta Avg] | Average of system.diskio.write.bytes_delta for each disk | bytes |
| Disk Write Bytes [Delta Max] | Maximum of system.diskio.write.bytes_delta for each disk | bytes |
| Disk Write Bytes [Delta Min] | Individual disks’ minimum system.diskio.write.bytes_delta | bytes |
| Disk Write Bytes [Delta Sum] | The sum of system.diskio.write.bytes_delta of individual disks | bytes |
| Disk Write Bytes [Delta] | Delta value of system.diskio.write.bytes for each disk | bytes |
| Disk Write Bytes [Success] | The total number of bytes written successfully. On Linux, a sector size of 512 bytes is assumed, so this is the number of sectors written multiplied by 512 | bytes |
| Disk Write Requests | The number of write requests to the disk device for 1 second | cnt |
| Disk Write Requests [Delta Avg] | Average of system.diskio.write.count_delta of individual disks | cnt |
| Disk Write Requests [Delta Max] | Maximum of system.diskio.write.count_delta for each disk | cnt |
| Disk Write Requests [Delta Min] | Minimum of system.diskio.write.count_delta for individual disks | cnt |
| Disk Write Requests [Delta Sum] | Sum of system.diskio.write.count_delta of individual disks | cnt |
| Disk Write Requests [Success Delta] | Individual disk’s system.diskio.write.count delta | cnt |
| Disk Write Requests [Success] | Total number of writes completed successfully | cnt |
| Disk Writes Bytes | The number of bytes written to the device per second. | bytes |
| Filesystem Hang Check | filesystem(local/NFS) hang check (normal:1, abnormal:0) | status |
| Filesystem Nodes | The total number of file nodes in the file system. | cnt |
| Filesystem Nodes [Free] | The total number of available file nodes in the file system. | cnt |
| Filesystem Size [Available] | This is the disk space (in bytes) that can be used by unauthorized users. | bytes |
| Filesystem Size [Free] | Available disk space (bytes) | bytes |
| Filesystem Size [Total] | Total Disk Space (bytes) | bytes |
| Filesystem Usage | Used Disk Space Percentage | % |
| Filesystem Usage [Avg] | Average of individual filesystem.used.pct | % |
| Filesystem Usage [Inode] | Inode usage rate | % |
| Filesystem Usage [Max] | Maximum value among individual filesystem usage percentages | % |
| Filesystem Usage [Min] | Minimum of individual filesystem used percentages | % |
| Filesystem Usage [Total] | - | % |
| Filesystem Used | Used Disk Space (bytes) | bytes |
| Filesystem Used [Inode] | Inode usage | bytes |
| Memory Free | The total amount of available memory (bytes). It does not include memory used by system cache and buffers (see system.memory.actual.free). | bytes |
| Memory Free [Actual] | Actual available memory (bytes). The calculation method varies depending on the OS, and in Linux, it is either MemAvailable from /proc/meminfo or calculated from available memory, cache, and buffer if meminfo is not available. On OSX, it is the sum of available memory and inactive memory. On Windows, it is the same as system.memory.free. | bytes |
| Memory Free [Swap] | Available swap memory. | bytes |
| Memory Total | Total Memory | bytes |
| Memory Total [Swap] | Total swap memory. | bytes |
| Memory Usage | Used memory percentage | % |
| Memory Usage [Actual] | Percentage of memory actually used | % |
| Memory Usage [Cache Swap] | Cache swap usage rate | % |
| Memory Usage [Swap] | Used swap memory percentage | % |
| Memory Used | Used Memory | bytes |
| Memory Used [Actual] | Actual used memory (bytes). The value subtracted from the total memory by the used memory. The available memory is calculated differently depending on the OS (refer to system.actual.free) | bytes |
| Memory Used [Swap] | Used swap memory. | bytes |
| Collisions | Network Collisions | cnt |
| Network In Bytes | Received byte count | bytes |
| Network In Bytes [Delta Avg] | Average of system.network.in.bytes_delta for each network | bytes |
| Network In Bytes [Delta Max] | Maximum of system.network.in.bytes_delta for each network | bytes |
| Network In Bytes [Delta Min] | Minimum of system.network.in.bytes_delta for each network | bytes |
| Network In Bytes [Delta Sum] | Sum of each network’s system.network.in.bytes_delta | bytes |
| Network In Bytes [Delta] | Received byte count delta | bytes |
| Network In Dropped | The number of packets deleted among incoming packets | cnt |
| Network In Errors | Number of errors during reception | cnt |
| Network In Packets | Received packet count | cnt |
| Network In Packets [Delta Avg] | Average of system.network.in.packets_delta for each network | cnt |
| Network In Packets [Delta Max] | Individual networks’ system.network.in.packets_delta maximum | cnt |
| Network In Packets [Delta Min] | Minimum of system.network.in.packets_delta for each network | cnt |
| Network In Packets [Delta Sum] | Sum of system.network.in.packets_delta of individual networks | cnt |
| Network In Packets [Delta] | Received packet count delta | cnt |
| Network Out Bytes | Transmitted byte count | bytes |
| Network Out Bytes [Delta Avg] | Average of system.network.out.bytes_delta for each network | bytes |
| Network Out Bytes [Delta Max] | Individual networks’ system.network.out.bytes_delta maximum | bytes |
| Network Out Bytes [Delta Min] | Minimum of system.network.out.bytes_delta for each network | bytes |
| Network Out Bytes [Delta Sum] | The sum of system.network.out.bytes_delta of individual networks | bytes |
| Network Out Bytes [Delta] | Transmitted byte count delta | bytes |
| Network Out Dropped | Number of packets dropped among outgoing packets. This value is not reported by the operating system, so it is always 0 in Darwin and BSD | cnt |
| Network Out Errors | Number of errors during transmission | cnt |
| Network Out Packets | Number of transmitted packets | cnt |
| Network Out Packets [Delta Avg] | Average of system.network.out.packets_delta for each network | cnt |
| Network Out Packets [Delta Max] | Maximum of system.network.out.packets_delta for each network | cnt |
| Network Out Packets [Delta Min] | Minimum of system.network.out.packets_delta for each network | cnt |
| Network Out Packets [Delta Sum] | Sum of system.network.out.packets_delta for each network | cnt |
| Network Out Packets [Delta] | Number of transmitted packets delta | cnt |
| Open Connections [TCP] | All open TCP connections | cnt |
| Open Connections [UDP] | All open UDP connections | cnt |
| Port Usage | Port usage available for connection | % |
| SYN Sent Sockets | Number of sockets in SYN_SENT state (when connecting from local to remote) | cnt |
| Kernel PID Max | kernel.pid_max value | cnt |
| Kernel Thread Max | kernel threads-max value | cnt |
| Process CPU Usage | Percentage of CPU time consumed by the process after the last update. This value is similar to the %CPU value of the process displayed by the top command on Unix systems | % |
| Process CPU Usage/Core | Percentage of CPU time used by the process since the last event, normalized by the number of cores, with a value between 0~100% | % |
| Process Memory Usage | Percentage of main memory (RAM) occupied by the process | % |
| Process Memory Used | Resident Set size. The amount of memory a process occupies in RAM. In Windows, it is the current working set size | bytes |
| Process PID | Process PID | PID |
| Process PPID | Parent process’s pid | PID |
| Processes [Dead] | Number of dead processes | cnt |
| Processes [Idle] | Number of idle processes | cnt |
| Processes [Running] | Number of running processes | cnt |
| Processes [Sleeping] | Number of sleeping processes | cnt |
| Processes [Stopped] | Number of stopped processes | cnt |
| Processes [Total] | Total number of processes | cnt |
| Processes [Unknown] | Number of processes whose state is unknown or cannot be retrieved | cnt |
| Processes [Zombie] | Number of zombie processes | cnt |
| Running Process Usage | Process usage rate | % |
| Running Processes | Number of running processes | cnt |
| Running Thread Usage | Thread usage rate | % |
| Running Threads | Number of threads running in running processes | cnt |
| Instance Status | Instance status | state |
| Context Switches | Context switch count per second | cnt |
| Load/Core [1 min] | Load for the last 1 minute divided by the number of cores | cnt |
| Load/Core [15 min] | Load for the last 15 minutes divided by the number of cores | cnt |
| Load/Core [5 min] | Load for the last 5 minutes divided by the number of cores | cnt |
| Multipaths [Active] | External storage connection path status = active count | cnt |
| Multipaths [Failed] | External storage connection path status = failed count | cnt |
| Multipaths [Faulty] | External storage connection path status = faulty count | cnt |
| NTP Offset | Measured offset of the last sample (time difference between the NTP server and the local environment) | num |
| Run Queue Length | Length of the run queue (processes waiting for execution) | num |
| Uptime | OS uptime in milliseconds | ms |
| Context Switches | CPU context switch count per second | cnt |
| Disk Read Bytes [Sec] | Number of bytes read from the Windows logical disk per second | cnt |
| Disk Read Time [Avg] | Average data read time in seconds | sec |
| Disk Transfer Time [Avg] | Average disk wait time | sec |
| Disk Usage | Disk usage rate | % |
| Disk Write Bytes [Sec] | Number of bytes written to the Windows logical disk per second | cnt |
| Disk Write Time [Avg] | Average data write time in seconds | sec |
| Pagingfile Usage | Paging file usage rate | % |
| Pool Used [Non Paged] | Non-paged pool usage in kernel memory | bytes |
| Pool Used [Paged] | Paged pool usage in kernel memory | bytes |
| Process [Running] | Number of processes currently running | cnt |
| Threads [Running] | Number of threads currently running | cnt |
| Threads [Waiting] | Number of threads waiting for processor time | cnt |
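As a rough illustration (not the actual ServiceWatch agent code) of how the per-network [Delta Avg/Max/Min/Sum] metrics above relate to the individual per-interface delta samples, the following sketch aggregates hypothetical interface values:

```python
# Illustrative sketch (not the actual ServiceWatch agent code): how per-network
# delta samples roll up into the [Delta Avg/Max/Min/Sum] metrics listed above.

def aggregate_deltas(deltas_by_interface: dict) -> dict:
    """Aggregate per-interface delta samples into avg/max/min/sum metrics."""
    values = list(deltas_by_interface.values())
    return {
        "delta_sum": sum(values),
        "delta_max": max(values),
        "delta_min": min(values),
        "delta_avg": sum(values) / len(values),
    }

# Hypothetical system.network.in.bytes_delta samples per interface
print(aggregate_deltas({"eth0": 1200, "eth1": 800}))
```

The interface names and sample values are assumptions for illustration only.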
5.2 - How-to guides
The user can enter the required information for the Multi-node GPU Cluster service through the Samsung Cloud Platform Console, select detailed options, and create the service.
Multi-node GPU Cluster Getting Started
You can create and use a Multi-node GPU Cluster service in the Samsung Cloud Platform Console.
This service consists of GPU Node and Cluster Fabric services.
GPU Node Creation
To create a Multi-node GPU Cluster, follow the steps below.
- Click the All Services > Compute > Multi-node GPU Cluster menu. You will be taken to the Service Home page of Multi-node GPU Cluster.
- Click the GPU Node creation button on the Service Home page. You will be taken to the GPU Node creation page.
- On the GPU Node creation page, enter the information required to create the service and select detailed options.
- In the Image and Version Selection area, select the required information.
| Category | Required | Detailed description |
|---|---|---|
| Image | Required | Select the provided image type - Ubuntu |
| Image Version | Required | Select the version of the chosen image - Provides a list of versions of the provided server images |

Table. GPU Node image and version selection items
- In the Enter Service Information area, enter or select the required information.
| Category | Required | Detailed description |
|---|---|---|
| Number of servers | Required | Number of GPU Node servers to create simultaneously - Only numbers can be entered, and the minimum number of servers to create is 2 - Creating 2 or more at once is possible only during the initial setup; afterwards, expansion is possible one server at a time |
| Service Type > Server Type | Required | GPU Node server type - Select the desired CPU, Memory, GPU, and Disk specifications - For detailed information about the server types provided by GPU Node, refer to Multi-node GPU Cluster Server Type |
| Service Type > Planned Compute | Required | Status of resources with Planned Compute set - In Use: Number of resources with Planned Compute that are currently in use - Configured: Number of resources with Planned Compute set - Coverage Preview: Amount applied per resource by Planned Compute - Planned Compute Service Application: Go to the Planned Compute service application page - For details, refer to Apply for Planned Compute |
Table. GPU Node service information input items
- In the Required Information Input area, enter or select the required information.
| Category | Required | Detailed description |
|---|---|---|
| Administrator Account | Required | Set the administrator account and password to be used when connecting to the server - For Ubuntu OS, the account is fixed as root |
| Server Name Prefix | Required | Enter a prefix to distinguish each GPU Node created when the number of selected servers is 2 or more - Server names are automatically generated in the format of the user-entered prefix + '-###' - Must start with a lowercase English letter and use only lowercase letters, numbers, and hyphens (-), within 3 to 11 characters - Must not end with a hyphen (-) |
| Network Settings | Required | Set the network where the GPU Node will be installed - VPC Name: Select a pre-created VPC - General Subnet Name: Select a pre-created general Subnet - IP can be set to auto-generate or user input; if input is selected, the user enters the IP directly - NAT: Can be used only when there is 1 server and the VPC has an Internet Gateway attached. Checking 'use' allows selection of a NAT IP. (At first creation, only 2 or more servers can be created, so modify this on the resource detail page) - NAT IP: Select a NAT IP - If there is no NAT IP to select, click the Create New button to generate a Public IP - Click the Refresh button to view and select the created Public IP - Creating a Public IP incurs charges according to the Public IP pricing policy |
Table. GPU Node required information entry items
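The Server Name Prefix rules above (starts with a lowercase letter; lowercase letters, numbers, and hyphens only; 3 to 11 characters; must not end with a hyphen) can be checked client-side before submission. The following is an illustrative sketch, not the console's actual validator:

```python
# Illustrative client-side check (not the console's actual validator) for the
# Server Name Prefix rules described above.
import re

# 3-11 characters: a lowercase letter first, then lowercase letters, digits,
# or hyphens, ending with a lowercase letter or digit (i.e., not a hyphen).
PREFIX_RE = re.compile(r"^[a-z][a-z0-9-]{1,9}[a-z0-9]$")

def is_valid_prefix(prefix: str) -> bool:
    return PREFIX_RE.match(prefix) is not None

print(is_valid_prefix("gpu-node"))   # valid prefix
print(is_valid_prefix("node-"))      # invalid: ends with a hyphen
print(is_valid_prefix("Gpu"))        # invalid: uppercase not allowed
```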
- In the Cluster Selection area, create or select a Cluster Fabric.

| Category | Required | Detailed description |
|---|---|---|
| Cluster Fabric | Required | Set a group of GPU Node servers to which GPU Direct RDMA can be applied together - Optimal GPU performance and speed can be secured only within the same Cluster Fabric - When creating a new Cluster Fabric, select New Input > Node pool, then enter the name of the Cluster Fabric to create - To add to an existing Cluster Fabric, select Existing Input > Node pool, then select the already created Cluster Fabric |
Table. GPU Node Cluster Fabric selection items
- In the Additional Information Input area, enter or select the required information.
| Category | Required | Detailed description |
|---|---|---|
| Lock | Optional | Using Lock prevents accidental actions that could terminate/start/stop the server |
| Init Script | Optional | Script to run when the server starts - The Init Script must be selected according to the image type - For Linux: Select Shell Script or cloud-init |
| Tag | Optional | Add tags - Up to 50 can be added per resource - After clicking the Add Tag button, enter or select Key and Value |
Table. GPU Node additional information input items
- Check the detailed information and estimated billing amount in the Summary panel, and click the Complete button.
- Once creation is complete, check the created resources on the GPU Node List page.
- When the service is created, the GPU MIG/ECC settings are reset. To ensure the correct settings are applied, perform a one-time reboot after the initial creation, verify that the settings have been applied, and then use the server.
- For detailed information about GPU MIG/ECC settings reset, please refer to GPU MIG/ECC Settings Reset Checklist Guide.
Checking GPU Node Detailed Information
The Multi-node GPU Cluster service allows you to view and edit the full list of GPU Node resources and detailed information.
The GPU Node Details page consists of the Details, Tags, and Job History tabs.
To view detailed information of the GPU Node, follow the steps below.
Click the All Services > Compute > Multi-node GPU Cluster > GPU Node menu. You will be taken to the Service Home page of Multi-node GPU Cluster.
Click the GPU Node menu on the Service Home page. You will be taken to the GPU Node List page.
- Resource items other than required columns can be added via the Settings button.
| Category | Required | Detailed description |
|---|---|---|
| Resource ID | Optional | ID of the GPU Node created by the user |
| Cluster Fabric name | Required | Cluster Fabric name created by the user |
| Server name | Required | Name of the GPU Node created by the user |
| Server Type | Required | Server type of the GPU Node - The user can check the number of cores, memory capacity, and GPU type and count of the created resource |
| Image | Required | Image version of the GPU Node created by the user |
| IP | Required | IP of the GPU Node created by the user |
| Status | Required | Status of the GPU Node created by the user |
| Creation Time | Optional | GPU Node creation time |

Table. GPU Node resource list items
Click a resource on the GPU Node List page to view its detailed information. You will be taken to the GPU Node Details page.
- At the top of the GPU Node Details page, status information and descriptions of additional features are displayed.
Category Detailed description GPU Node status Status of GPU Node created by the user - Creating: State where the server is being created
- Running: State where creation is complete and the server is usable
- Editing: State where the IP is being changed
- Unknown: Error state
- Starting: State where the server is starting
- Stopping: State where the server is stopping
- Stopped: State where the server has stopped
- Terminating: State where termination is in progress
- Terminated: State where termination is complete
Server Control Button to change the server status - Start: Start a stopped server
- Stop: Stop a running server
Service Cancellation Button to cancel the service
Table. GPU Node status information and additional features
Detailed Information
On the Details tab of the GPU Node Details page, you can view the detailed information of the selected resource and, if necessary, edit the information.
| Category | Detailed description |
|---|---|
| Service | Service Name |
| Resource Type | Resource Type |
| SRN | Unique resource ID in Samsung Cloud Platform |
| Resource Name | Resource Name |
| Resource ID | Unique resource ID in the service |
| Creator | User who created the service |
| Creation time | Service creation time |
| Modifier | User who edited the service information |
| Modification date/time | Date and time when the service information was modified |
| Server name | Server name |
| Node pool | A group of nodes that can be grouped together with the same Cluster Fabric |
| Cluster Fabric name | Cluster Fabric name created by the user |
| Image/Version | Server’s OS image and version |
| Server Type | CPU, Memory, and GPU information display |
| Planned Compute | Resource status with Planned Compute set |
| Lock | Displays lock usage status |
| Network | GPU Node network information |
| Block Storage | Block Storage information connected to the server |
| Init Script | View the Init Script content entered when creating the server |
Tag
On the Tags tab of the GPU Node Details page, you can view the tag information of the selected resource and add, modify, or delete tags.
| Category | Detailed description |
|---|---|
| Tag List | Tag list |
Job History
On the Job History tab of the GPU Node Details page, you can view the job history of the selected resource.
| Category | Detailed description |
|---|---|
| Job History List | Resource change history |
GPU Node Operation Control
If you need server control and management functions for the created GPU Node resources, you can perform these tasks on the GPU Node List or GPU Node Details page. You can start and stop GPU Node resources.
Start GPU Node
You can start a stopped GPU Node. To start the GPU Node, follow the steps below.
- Click the All Services > Compute > Multi-node GPU Cluster menu. You will be taken to the Service Home page of Multi-node GPU Cluster.
- Click the GPU Node menu on the Service Home page. You will be taken to the GPU Node List page.
- On the GPU Node List page, after selecting individual or multiple servers with the checkboxes, you can start them via the More button at the top.
- Click the resource on the GPU Node List page. You will be taken to the GPU Node Details page.
- On the GPU Node Details page, click the Start button at the top to start the server.
- Check the server status and complete the status change.
Stop GPU Node
You can stop a running GPU Node. To stop the GPU Node, follow the steps below.
- Click the All Services > Compute > Multi-node GPU Cluster menu. You will be taken to the Service Home page of Multi-node GPU Cluster.
- Click the GPU Node menu on the Service Home page. You will be taken to the GPU Node List page.
- On the GPU Node List page, you can stop individual or multiple servers by selecting the checkboxes and then clicking the Stop button at the top.
- Click the resource on the GPU Node List page. You will be taken to the GPU Node Details page.
- On the GPU Node Details page, click the Stop button at the top to stop the server.
- Check the server status and complete the status change.
GPU Node Cancel
You can cancel unused GPU Nodes to reduce operating costs. However, canceling the service may immediately stop the running service, so carefully consider the impact of a service interruption before proceeding with the cancellation.
To cancel the GPU Node, follow the steps below.
- Click the All Services > Compute > Multi-node GPU Cluster menu. You will be taken to the Service Home page of the Multi-node GPU Cluster.
- Click the Cluster Fabric menu on the Service Home page. You will be taken to the Cluster Fabric List page.
- On the Cluster Fabric List page, select the resources to cancel and click the Cancel Service button.
- Resources using the same Cluster Fabric can be terminated simultaneously.
- Once the termination is complete, check on the GPU Node List page whether the resources have been terminated.
GPU Node termination is not possible in the following cases.
- When Block Storage (BM) is connected: Please disconnect the Block Storage (BM) connection first.
- If File Storage is connected: Please disconnect the File Storage first.
- When Lock is set: Please change the Lock setting to unused and try again.
- If the server that cannot be terminated simultaneously is included: Please re-select only the resources that can be terminated.
- If the Cluster Fabric of the server you want to terminate is different: Select only resources that use the same Cluster Fabric.
5.2.1 - Cluster Fabric Management
Cluster Fabric is a service that helps manage servers (GPU Nodes) included in a GPU Cluster. Using Cluster Fabric, you can move servers between GPU Clusters in the same Node pool and optimize the performance and speed of GPUs within the same GPU Cluster.
Creating Cluster Fabric
Cluster Fabric can be created together with a GPU Node, and it cannot be created or deleted separately. When all GPU Nodes within a Cluster Fabric are terminated, the Cluster Fabric is automatically deleted. If you haven’t created a GPU Node, please create one first. For more information, refer to Creating a GPU Node.
Checking Cluster Fabric Details
You can check the created Cluster Fabric list and details, and move servers on the Cluster Fabric List page and Cluster Fabric Details page.
Click the All Services > Compute > Multi-node GPU Cluster menu. You will be taken to the Service Home page of the Multi-node GPU Cluster.
Click the Cluster Fabric menu on the Service Home page. You will be taken to the Cluster Fabric List page.
- On the Cluster Fabric List page, you can view the list of resources of the GPU Cluster created by the user.
- Resource items other than required columns can be added through the Settings button.
| Category | Required | Description |
|---|---|---|
| Resource ID | Optional | Cluster Fabric ID created by the user |
| Cluster Fabric Name | Required | Cluster Fabric name created by the user |
| Node Pool | Optional | A collection of nodes that can be bundled into the same Cluster Fabric |
| Number of Servers | Optional | Number of GPU Nodes |
| Server Type | Optional | Server type of the GPU Node - The user can check the number of cores, memory capacity, and GPU type and count of the created resource |
| Status | Optional | Status of the Cluster Fabric created by the user |
| Creation Time | Optional | Time when the Cluster Fabric was created |

Table. Cluster Fabric resource list items
Click a resource on the Cluster Fabric List page to check its details. You will be taken to the Cluster Fabric Details page.
- At the top of the Cluster Fabric Details page, status information and additional feature descriptions are displayed.
Category Description Cluster Fabric Status Status of the Cluster Fabric created by the user - Creating: Cluster creation in progress
- Active: Creation completed and available
- Editing: IP change in progress
- Deleting: Termination in progress
- Deleted: Termination completed
Add Target Server Function to move a server from another cluster to this cluster
Table. Cluster Fabric status information and additional features
Details
On the Details tab of the Cluster Fabric Details page, you can check the details of the selected resource and bring in servers from other clusters.
| Category | Description |
|---|---|
| Service | Service category |
| Resource Type | Service name |
| SRN | Unique resource ID in Samsung Cloud Platform |
| Resource Name | Resource name |
| Resource ID | Unique resource ID in the service |
| Creator | User who created the service |
| Creation Time | Time when the service was created |
| Modifier | User who modified the service information |
| Modification Time | Time when the service information was modified |
| Cluster Fabric Name | Cluster Fabric name created by the user |
| Node Pool | A collection of nodes that can be bundled into the same Cluster Fabric |
| Target Server | List of GPU Nodes bound to the Cluster Fabric |
Bringing in Cluster Fabric Servers
Using the Add Target Server feature on the Cluster Fabric Details page, you can bring in servers from other clusters and add them to the selected cluster.
- Click the All Services > Compute > Multi-node GPU Cluster menu. You will be taken to the Service Home page of the Multi-node GPU Cluster.
- Click the Cluster Fabric menu on the Service Home page. You will be taken to the Cluster Fabric List page.
- Click a resource on the Cluster Fabric List page to check its details. You will be taken to the Cluster Fabric Details page.
- Click the Add button on the right side of Target Server on the Details tab.
- The Add Target Server popup window opens.
- Select a cluster in the Cluster Fabric field.
- The GPU Nodes bound to the selected cluster are retrieved, and you can select the GPU Node to bring in.
- The selected GPU Node is listed at the bottom by its name.
- Click the Confirm button to complete, or click the Cancel button to cancel the task.
- Check that the added GPU Node appears in the Target Server list.
Terminating Cluster Fabric
When all GPU Nodes within a Cluster Fabric are terminated, the Cluster Fabric is automatically deleted. For more information, refer to Terminating a GPU Node.
5.2.2 - Installing ServiceWatch Agent
Users can install ServiceWatch Agent on the GPU Node of Multi-node GPU Cluster to collect custom metrics and logs.
ServiceWatch Agent
The agents that need to be installed to collect ServiceWatch custom metrics and logs on a GPU Node of the Multi-node GPU Cluster fall into two main types: Prometheus Exporter and OpenTelemetry Collector.
| Item | Description |
|---|---|
| Prometheus Exporter | Provides metrics of specific applications or services in a format that Prometheus can scrape |
| OpenTelemetry Collector | Acts as a central collector that collects telemetry data such as metrics and logs from distributed systems, processes it (filtering, sampling, etc.), and sends it to multiple backends (e.g., Prometheus, Jaeger, Elasticsearch) |
If Kubernetes Engine is configured on the GPU Node, please check GPU metrics through the metrics provided by Kubernetes Engine.
- If DCGM Exporter is installed on a GPU Node where Kubernetes Engine is configured, it may not operate normally.
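As a rough illustration of how the two agent types fit together, the following minimal OpenTelemetry Collector configuration sketch scrapes a Prometheus Exporter on the node and forwards the metrics to a backend. The exporter port and backend endpoint are placeholders, not Samsung Cloud Platform values; consult the ServiceWatch agent documentation for the actual settings.

```yaml
# Minimal illustrative OpenTelemetry Collector config (all endpoints are placeholders).
receivers:
  prometheus:
    config:
      scrape_configs:
        - job_name: dcgm-exporter          # a Prometheus Exporter running on the GPU Node
          scrape_interval: 30s
          static_configs:
            - targets: ["localhost:9400"]  # hypothetical exporter port
exporters:
  prometheusremotewrite:
    endpoint: https://metrics.example.com/api/v1/write  # placeholder backend
service:
  pipelines:
    metrics:
      receivers: [prometheus]
      exporters: [prometheusremotewrite]
```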
Pre-settings for Using ServiceWatch Agent
To use ServiceWatch Agent, please prepare pre-settings by referring to Pre-environment Setup for ServiceWatch Agent.
5.2.3 - Multi-node GPU Cluster Service Scope and Inspection Guide
Multi-node GPU Cluster service scope
In the event of an IaaS HW level issue with the Multi-node GPU Cluster service, technical support is available through the Support Center's Contact Us. However, risks caused by changes such as OS kernel updates or application installation are the responsibility of the user, and technical support may be difficult in those cases; please be cautious when performing system updates or similar tasks.
IaaS HW level problem
- A server-internal HW fault event message appears on the IPMI (iLO) HW monitoring console
- A GPU HW operation error is confirmed by the nvidia-smi command
- HW error messages occur during InfiniBand HCA card or InfiniBand Switch inspection
IaaS HW Inspection Guide
After applying for the Multi-node GPU Cluster service, it is recommended to check the IaaS HW level according to the inspection guide.
OS Kernel and Package Holding
- If you do not want package versions to update automatically, it is recommended to block package updates using the apt-mark command.
- It is recommended to block updates of the Linux kernel and IB-related package versions.
To proceed with OS Kernel and Package holding, follow the procedure below.
- Use the following command to check the version of the kernel and IB-related packages.Color mode
root@bm-dev-001:~# dpkg -l | egrep -i "kernel | mlnx" root@bm-dev-001:~# dpkg -l | egrep -i "kernel | nvidia" root@bm-dev-001:~# dpkg -l | egrep -i "kernel | linux-image" ii crash 7.2.8-1ubuntu1.20.04.1 amd64 kernel debugging utility, allowing gdb like syntax ii dkms 2.8.1-5ubuntu2 all Dynamic Kernel Module Support Framework ii dmeventd 2:1.02.167-1ubuntu1 amd64 Linux Kernel Device Mapper event daemon ii dmsetup 2:1.02.167-1ubuntu1 amd64 Linux Kernel Device Mapper userspace library ii iser-dkms 5.4-OFED.5.4.3.0.1.1 all DKMS support fo iser kernel modules ii isert-dkms 5.4-OFED.5.4.3.0.1.1 all DKMS support fo isert kernel modules ii kernel-mft-dkms 4.17.2-12 all DKMS support for kernel-mft kernel modules ii kmod 27-1ubuntu2 amd64 tools for managing Linux kernel modules ii knem 1.1.4.90mlnx1-OFED.5.1.2.5.0.1 amd64 userspace tools for the KNEM kernel module ii knem-dkms 1.1.4.90mlnx1-OFED.5.1.2.5.0.1 all DKMS support for mlnx-ofed kernel modules ii libaio1:amd64 0.3.112-5 amd64 Linux kernel AIO access library - shared library ii libdevmapper-event1.02.1:amd64 2:1.02.167-1ubuntu1 amd64 Linux Kernel Device Mapper event support library ii libdevmapper1.02.1:amd64 2:1.02.167-1ubuntu1 amd64 Linux Kernel Device Mapper userspace library ii libdrm-amdgpu1:amd64 2.4.107-8ubuntu1~20.04.2 amd64 Userspace interface to amdgpu-specific kernel DRM services -- runtime ii libdrm-common 2.4.107-8ubuntu1~20.04.2 all Userspace interface to kernel DRM services -- common files ii libdrm-intel1:amd64 2.4.107-8ubuntu1~20.04.2 amd64 Userspace interface to intel-specific kernel DRM services -- runtime ii libdrm-nouveau2:amd64 2.4.107-8ubuntu1~20.04.2 amd64 Userspace interface to nouveau-specific kernel DRM services -- runtime ii libdrm-radeon1:amd64 2.4.107-8ubuntu1~20.04.2 amd64 Userspace interface to radeon-specific kernel DRM services -- runtime ii libdrm2:amd64 2.4.107-8ubuntu1~20.04.2 amd64 Userspace interface to kernel DRM services -- runtime ii linux-firmware 1.187.29 all Firmware for 
Linux kernel drivers hi linux-generic 5.4.0.105.109 amd64 Complete Generic Linux kernel and headers ii linux-headers-5.4.0-104 5.4.0-104.118 all Header files related to Linux kernel version 5.4.0 ii linux-headers-5.4.0-104-generic 5.4.0-104.118 amd64 Linux kernel headers for version 5.4.0 on 64 bit x86 SMP ii linux-headers-5.4.0-105 5.4.0-105.119 all Header files related to Linux kernel version 5.4.0 ii linux-headers-5.4.0-105-generic 5.4.0-105.119 amd64 Linux kernel headers for version 5.4.0 on 64 bit x86 SMP hi linux-headers-generic 5.4.0.105.109 amd64 Generic Linux kernel headers ii linux-image-5.4.0-104-generic 5.4.0-104.118 amd64 Signed kernel image generic ii linux-image-5.4.0-105-generic 5.4.0-105.119 amd64 Signed kernel image generic hi linux-image-generic 5.4.0.105.109 amd64 Generic Linux kernel image ii linux-libc-dev:amd64 5.4.0-105.119 amd64 Linux Kernel Headers for development ii linux-modules-5.4.0-104-generic 5.4.0-104.118 amd64 Linux kernel extra modules for version 5.4.0 on 64 bit x86 SMP ii linux-modules-5.4.0-105-generic 5.4.0-105.119 amd64 Linux kernel extra modules for version 5.4.0 on 64 bit x86 SMP ii linux-modules-extra-5.4.0-104-generic 5.4.0-104.118 amd64 Linux kernel extra modules for version 5.4.0 on 64 bit x86 SMP ii linux-modules-extra-5.4.0-105-generic 5.4.0-105.119 amd64 Linux kernel extra modules for version 5.4.0 on 64 bit x86 SMP ii mlnx-ofed-kernel-dkms 5.4-OFED.5.4.3.0.3.1 all DKMS support for mlnx-ofed kernel modules ii mlnx-ofed-kernel-utils 5.4-OFED.5.4.3.0.3.1 amd64 Userspace tools to restart and tune mlnx-ofed kernel modules ii mlnx-tools 5.2.0-0.54303 amd64 Userspace tools to restart and tune MLNX_OFED kernel modules ii nvidia-kernel-common-470 470.103.01-0ubuntu0.20.04.1 amd64 Shared files used with the kernel module ii nvidia-kernel-source-470 470.103.01-0ubuntu0.20.04.1 amd64 NVIDIA kernel source package ii nvidia-peer-memory 1.2-0 all nvidia peer memory kernel module. 
root@bm-dev-001:~# dpkg -l | egrep -i "kernel|mlnx"
root@bm-dev-001:~# dpkg -l | egrep -i "kernel|nvidia"
root@bm-dev-001:~# dpkg -l | egrep -i "kernel|linux-image"
ii  crash                                  7.2.8-1ubuntu1.20.04.1           amd64  kernel debugging utility, allowing gdb like syntax
ii  dkms                                   2.8.1-5ubuntu2                   all    Dynamic Kernel Module Support Framework
ii  dmeventd                               2:1.02.167-1ubuntu1              amd64  Linux Kernel Device Mapper event daemon
ii  dmsetup                                2:1.02.167-1ubuntu1              amd64  Linux Kernel Device Mapper userspace library
ii  iser-dkms                              5.4-OFED.5.4.3.0.1.1             all    DKMS support for iser kernel modules
ii  isert-dkms                             5.4-OFED.5.4.3.0.1.1             all    DKMS support for isert kernel modules
ii  kernel-mft-dkms                        4.17.2-12                        all    DKMS support for kernel-mft kernel modules
ii  kmod                                   27-1ubuntu2                      amd64  tools for managing Linux kernel modules
ii  knem                                   1.1.4.90mlnx1-OFED.5.1.2.5.0.1   amd64  userspace tools for the KNEM kernel module
ii  knem-dkms                              1.1.4.90mlnx1-OFED.5.1.2.5.0.1   all    DKMS support for mlnx-ofed kernel modules
ii  libaio1:amd64                          0.3.112-5                        amd64  Linux kernel AIO access library - shared library
ii  libdevmapper-event1.02.1:amd64         2:1.02.167-1ubuntu1              amd64  Linux Kernel Device Mapper event support library
ii  libdevmapper1.02.1:amd64               2:1.02.167-1ubuntu1              amd64  Linux Kernel Device Mapper userspace library
ii  libdrm-amdgpu1:amd64                   2.4.107-8ubuntu1~20.04.2         amd64  Userspace interface to amdgpu-specific kernel DRM services -- runtime
ii  libdrm-common                          2.4.107-8ubuntu1~20.04.2         all    Userspace interface to kernel DRM services -- common files
ii  libdrm-intel1:amd64                    2.4.107-8ubuntu1~20.04.2         amd64  Userspace interface to intel-specific kernel DRM services -- runtime
ii  libdrm-nouveau2:amd64                  2.4.107-8ubuntu1~20.04.2         amd64  Userspace interface to nouveau-specific kernel DRM services -- runtime
ii  libdrm-radeon1:amd64                   2.4.107-8ubuntu1~20.04.2         amd64  Userspace interface to radeon-specific kernel DRM services -- runtime
ii  libdrm2:amd64                          2.4.107-8ubuntu1~20.04.2         amd64  Userspace interface to kernel DRM services -- runtime
ii  linux-firmware                         1.187.29                         all    Firmware for Linux kernel drivers
hi  linux-generic                          5.4.0.105.109                    amd64  Complete Generic Linux kernel and headers
ii  linux-headers-5.4.0-104                5.4.0-104.118                    all    Header files related to Linux kernel version 5.4.0
ii  linux-headers-5.4.0-104-generic        5.4.0-104.118                    amd64  Linux kernel headers for version 5.4.0 on 64 bit x86 SMP
ii  linux-headers-5.4.0-105                5.4.0-105.119                    all    Header files related to Linux kernel version 5.4.0
ii  linux-headers-5.4.0-105-generic        5.4.0-105.119                    amd64  Linux kernel headers for version 5.4.0 on 64 bit x86 SMP
hi  linux-headers-generic                  5.4.0.105.109                    amd64  Generic Linux kernel headers
ii  linux-image-5.4.0-104-generic          5.4.0-104.118                    amd64  Signed kernel image generic
ii  linux-image-5.4.0-105-generic          5.4.0-105.119                    amd64  Signed kernel image generic
hi  linux-image-generic                    5.4.0.105.109                    amd64  Generic Linux kernel image
ii  linux-libc-dev:amd64                   5.4.0-105.119                    amd64  Linux Kernel Headers for development
ii  linux-modules-5.4.0-104-generic        5.4.0-104.118                    amd64  Linux kernel extra modules for version 5.4.0 on 64 bit x86 SMP
ii  linux-modules-5.4.0-105-generic        5.4.0-105.119                    amd64  Linux kernel extra modules for version 5.4.0 on 64 bit x86 SMP
ii  linux-modules-extra-5.4.0-104-generic  5.4.0-104.118                    amd64  Linux kernel extra modules for version 5.4.0 on 64 bit x86 SMP
ii  linux-modules-extra-5.4.0-105-generic  5.4.0-105.119                    amd64  Linux kernel extra modules for version 5.4.0 on 64 bit x86 SMP
ii  mlnx-ofed-kernel-dkms                  5.4-OFED.5.4.3.0.3.1             all    DKMS support for mlnx-ofed kernel modules
ii  mlnx-ofed-kernel-utils                 5.4-OFED.5.4.3.0.3.1             amd64  Userspace tools to restart and tune mlnx-ofed kernel modules
ii  mlnx-tools                             5.2.0-0.54303                    amd64  Userspace tools to restart and tune MLNX_OFED kernel modules
ii  nvidia-kernel-common-470               470.103.01-0ubuntu0.20.04.1      amd64  Shared files used with the kernel module
ii  nvidia-kernel-source-470               470.103.01-0ubuntu0.20.04.1      amd64  NVIDIA kernel source package
ii  nvidia-peer-memory                     1.2-0                            all    nvidia peer memory kernel module
ii  nvidia-peer-memory-dkms                1.2-0                            all    DKMS support for nvidia-peer-memory kernel modules
ii  rsyslog                                8.2001.0-1ubuntu1.1              amd64  reliable system and kernel logging daemon
ii  srp-dkms                               5.4-OFED.5.4.3.0.1.1             all    DKMS support for srp kernel modules
Code block. Kernel, IB related package version check

Use the apt-mark command to hold package updates.
# apt-mark hold <package name>
Code block. Package update hold
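The hold step can be scripted. The sketch below is illustrative, not part of the official procedure: the package names in HOLD_PKGS are examples taken from the listing above and should be adjusted to your server, and the function only prints the apt-mark commands so they can be reviewed before being executed.

```shell
# Sketch: emit one `apt-mark hold` command per package to pin.
# HOLD_PKGS is an example list; adjust it to match the packages on your server.
HOLD_PKGS="linux-image-generic linux-headers-generic mlnx-ofed-kernel-dkms nvidia-peer-memory-dkms"

hold_packages() {
  # Prints one command per package; pipe the output to `sh` (or remove the
  # echo) on the server to actually apply the holds.
  for pkg in $1; do
    echo "apt-mark hold $pkg"
  done
}

hold_packages "$HOLD_PKGS"
```

Running the printed commands marks each package as held, so `apt upgrade` skips it until `apt-mark unhold` is issued.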
Intel E810 Driver Update
Check the version of the Intel E810 driver and update it to the recommended version.
- Server manufacturer's recommended Intel E810 driver version: 1.15.4
- Driver download: ice-1.15.4.tar.gz

Update the driver as follows. In the commands below, x.x.x is the version number of the driver tar file.
- Move the driver tar file to the desired directory.
Example: /home/username/ice or /usr/local/src/ice
- Untar the archive file.
tar zxf ice-x.x.x.tar.gz
Code block. Unzip file
- Change to the driver src directory.
cd ice-x.x.x/src/
Code block. Directory change
- Compile and install the driver module.
make install
Code block. Driver module compile
- After the update is complete, check the driver version.
lsmod | grep ice
modinfo ice | grep version
Code block. Version check
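The version check above can be automated with a small helper. This is a sketch, not part of the official procedure: it compares the loaded ice driver version against the recommended minimum using version sort, and only consults modinfo when the module is actually present on the machine.

```shell
# Sketch: compare the loaded ice driver version with the recommended minimum.
REQUIRED="1.15.4"

# version_ok INSTALLED REQUIRED -> true when INSTALLED >= REQUIRED in version order
version_ok() {
  [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n1)" = "$2" ]
}

# modinfo only yields a value on a server with the ice module installed.
installed="$(modinfo ice 2>/dev/null | awk '/^version:/ {print $2}')"
if [ -z "$installed" ]; then
  echo "ice module not found on this machine"
elif version_ok "$installed" "$REQUIRED"; then
  echo "ice $installed OK (>= $REQUIRED)"
else
  echo "ice $installed is older than $REQUIRED; update the driver"
fi
```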
NVIDIA driver check
To check the NVIDIA driver (nvidia-smi topo, IB nv_peer_mem status) and inspect the IaaS HW level, follow the procedure below.
Check the GPU driver and HW status.
user@bm-dev-001:~$ nvidia-smi topo -m
        GPU0  GPU1  GPU2  GPU3  GPU4  GPU5  GPU6  GPU7  mlx5_0  mlx5_1  mlx5_2  mlx5_3  CPU Affinity  NUMA Affinity
GPU0     X    NV12  NV12  NV12  NV12  NV12  NV12  NV12  SYS     PXB     SYS     SYS     48-63         3
GPU1    NV12   X    NV12  NV12  NV12  NV12  NV12  NV12  SYS     PXB     SYS     SYS     48-63         3
GPU2    NV12  NV12   X    NV12  NV12  NV12  NV12  NV12  PXB     SYS     SYS     SYS     16-31         1
GPU3    NV12  NV12  NV12   X    NV12  NV12  NV12  NV12  PXB     SYS     SYS     SYS     16-31         1
GPU4    NV12  NV12  NV12  NV12   X    NV12  NV12  NV12  SYS     SYS     SYS     PXB     112-127       7
GPU5    NV12  NV12  NV12  NV12  NV12   X    NV12  NV12  SYS     SYS     SYS     PXB     112-127       7
GPU6    NV12  NV12  NV12  NV12  NV12  NV12   X    NV12  SYS     SYS     PXB     SYS     80-95         5
GPU7    NV12  NV12  NV12  NV12  NV12  NV12  NV12   X    SYS     SYS     PXB     SYS     80-95         5
mlx5_0  SYS   SYS   PXB   PXB   SYS   SYS   SYS   SYS    X      SYS     SYS     SYS
mlx5_1  PXB   PXB   SYS   SYS   SYS   SYS   SYS   SYS   SYS      X      SYS     SYS
mlx5_2  SYS   SYS   SYS   SYS   SYS   SYS   PXB   PXB   SYS     SYS      X      SYS
mlx5_3  SYS   SYS   SYS   SYS   PXB   PXB   SYS   SYS   SYS     SYS     SYS      X

Legend:
  X    = Self
  SYS  = Connection traversing PCIe as well as the SMP interconnect between NUMA nodes (e.g., QPI/UPI)
  NODE = Connection traversing PCIe as well as the interconnect between PCIe Host Bridges within a NUMA node
  PHB  = Connection traversing PCIe as well as a PCIe Host Bridge (typically the CPU)
  PXB  = Connection traversing multiple PCIe bridges (without traversing the PCIe Host Bridge)
  PIX  = Connection traversing at most a single PCIe bridge
  NV#  = Connection traversing a bonded set of # NVLinks
Code block. GPU driver and HW status check

Check the NVSwitch HW status.
user@bm-dev-001:~$ nvidia-smi nvlink --status
GPU 0: NVIDIA A100-SXM4-80GB (UUID: GPU-2c0d1d6b-e348-55fc-44cf-cd65a954b36c)
    Link 0: 25 GB/s   Link 1: 25 GB/s   Link 2: 25 GB/s    Link 3: 25 GB/s
    Link 4: 25 GB/s   Link 5: 25 GB/s   Link 6: 25 GB/s    Link 7: 25 GB/s
    Link 8: 25 GB/s   Link 9: 25 GB/s   Link 10: 25 GB/s   Link 11: 25 GB/s
GPU 1: NVIDIA A100-SXM4-80GB (UUID: GPU-96f429d8-893a-a9ea-deca-feffd90669e9)
    Link 0: 25 GB/s   Link 1: 25 GB/s   Link 2: 25 GB/s    Link 3: 25 GB/s
    Link 4: 25 GB/s   Link 5: 25 GB/s   Link 6: 25 GB/s    Link 7: 25 GB/s
    Link 8: 25 GB/s   Link 9: 25 GB/s   Link 10: 25 GB/s   Link 11: 25 GB/s
GPU 2: NVIDIA A100-SXM4-80GB (UUID: GPU-2e601952-b442-b757-a035-725cd320f589)
    Link 0: 25 GB/s   Link 1: 25 GB/s   Link 2: 25 GB/s    Link 3: 25 GB/s
    Link 4: 25 GB/s   Link 5: 25 GB/s   Link 6: 25 GB/s    Link 7: 25 GB/s
    Link 8: 25 GB/s   Link 9: 25 GB/s   Link 10: 25 GB/s   Link 11: 25 GB/s
GPU 3: NVIDIA A100-SXM4-80GB (UUID: GPU-bcbfd885-a9f8-ec8c-045b-c521472b4fed)
    Link 0: 25 GB/s   Link 1: 25 GB/s   Link 2: 25 GB/s    Link 3: 25 GB/s
    Link 4: 25 GB/s   Link 5: 25 GB/s   Link 6: 25 GB/s    Link 7: 25 GB/s
    Link 8: 25 GB/s   Link 9: 25 GB/s   Link 10: 25 GB/s   Link 11: 25 GB/s
GPU 4: NVIDIA A100-SXM4-80GB (UUID: GPU-30273090-2d78-fc7a-a360-ec5f871dd488)
    Link 0: 25 GB/s   Link 1: 25 GB/s   Link 2: 25 GB/s    Link 3: 25 GB/s
    Link 4: 25 GB/s   Link 5: 25 GB/s   Link 6: 25 GB/s    Link 7: 25 GB/s
    Link 8: 25 GB/s   Link 9: 25 GB/s   Link 10: 25 GB/s   Link 11: 25 GB/s
GPU 5: NVIDIA A100-SXM4-80GB (UUID: GPU-5ce7ef61-56dd-fb18-aa7c-be610c8d51c3)
    Link 0: 25 GB/s   Link 1: 25 GB/s   Link 2: 25 GB/s    Link 3: 25 GB/s
    Link 4: 25 GB/s   Link 5: 25 GB/s   Link 6: 25 GB/s    Link 7: 25 GB/s
    Link 8: 25 GB/s   Link 9: 25 GB/s   Link 10: 25 GB/s   Link 11: 25 GB/s
GPU 6: NVIDIA A100-SXM4-80GB (UUID: GPU-740a527b-b286-8b85-35eb-b6b41c0bb6d7)
    Link 0: 25 GB/s   Link 1: 25 GB/s   Link 2: 25 GB/s    Link 3: 25 GB/s
    Link 4: 25 GB/s   Link 5: 25 GB/s   Link 6: 25 GB/s    Link 7: 25 GB/s
    Link 8: 25 GB/s   Link 9: 25 GB/s   Link 10: 25 GB/s   Link 11: 25 GB/s
GPU 7: NVIDIA A100-SXM4-80GB (UUID: GPU-1fb6de95-60f6-dbf2-ffca-f7680577e37c)
    Link 0: 25 GB/s   Link 1: 25 GB/s   Link 2: 25 GB/s    Link 3: 25 GB/s
    Link 4: 25 GB/s   Link 5: 25 GB/s   Link 6: 25 GB/s    Link 7: 25 GB/s
    Link 8: 25 GB/s   Link 9: 25 GB/s   Link 10: 25 GB/s   Link 11: 25 GB/s
Code block. NVSwitch HW status check

Check the InfiniBand(IB) HCA card HW status and Link.
user@bm-dev-001:~$ ibdev2netdev -v
cat: /sys/class/infiniband/mlx5_0/device/vpd: Permission denied
0000:45:00.0 mlx5_0 (MT4123 - ) fw 20.29.1016 port 1 (ACTIVE) ==> ibs18 (Down)
cat: /sys/class/infiniband/mlx5_1/device/vpd: Permission denied
0000:0e:00.0 mlx5_1 (MT4123 - ) fw 20.29.1016 port 1 (ACTIVE) ==> ibs17 (Down)
cat: /sys/class/infiniband/mlx5_2/device/vpd: Permission denied
0000:c5:00.0 mlx5_2 (MT4123 - ) fw 20.29.1016 port 1 (ACTIVE) ==> ibs20 (Down)
cat: /sys/class/infiniband/mlx5_3/device/vpd: Permission denied
0000:85:00.0 mlx5_3 (MT4123 - ) fw 20.29.1016 port 1 (ACTIVE) ==> ibs19 (Down)
root@bm-dev-001:~# ibstat
CA 'mlx5_0'
    CA type: MT4123
    Number of ports: 1
    Firmware version: 20.29.1016
    Hardware version: 0
    Node GUID: 0x88e9a4ffff5060ac
    System image GUID: 0x88e9a4ffff5060ac
    Port 1:
        State: Active
        Physical state: LinkUp
        Rate: 200
        Base lid: 8
        LMC: 0
        SM lid: 1
        Capability mask: 0x2651e848
        Port GUID: 0x88e9a4ffff5060ac
        Link layer: InfiniBand
CA 'mlx5_1'
    CA type: MT4123
    Number of ports: 1
    Firmware version: 20.29.1016
    Hardware version: 0
    Node GUID: 0x88e9a4ffff504080
    System image GUID: 0x88e9a4ffff504080
    Port 1:
        State: Active
        Physical state: LinkUp
        Rate: 200
        Base lid: 5
        LMC: 0
        SM lid: 1
        Capability mask: 0x2651e848
        Port GUID: 0x88e9a4ffff504080
        Link layer: InfiniBand
CA 'mlx5_2'
    CA type: MT4123
    Number of ports: 1
    Firmware version: 20.29.1016
    Hardware version: 0
    Node GUID: 0x88e9a4ffff505038
    System image GUID: 0x88e9a4ffff505038
    Port 1:
        State: Active
        Physical state: LinkUp
        Rate: 200
        Base lid: 2
        LMC: 0
        SM lid: 1
        Capability mask: 0x2651e848
        Port GUID: 0x88e9a4ffff505038
        Link layer: InfiniBand
CA 'mlx5_3'
    CA type: MT4123
    Number of ports: 1
    Firmware version: 20.29.1016
    Hardware version: 0
    Node GUID: 0x88e9a4ffff504094
    System image GUID: 0x88e9a4ffff504094
    Port 1:
        State: Active
        Physical state: LinkUp
        Rate: 200
        Base lid: 7
        LMC: 0
        SM lid: 1
        Capability mask: 0x2651e848
        Port GUID: 0x88e9a4ffff504094
        Link layer: InfiniBand
Code block. InfiniBand(IB) HCA card HW status and Link check
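The healthy state described above (every port Active and LinkUp) can be verified mechanically. The following is a sketch that expects ibstat-style text on stdin; the embedded sample stands in for real output so the snippet runs anywhere, and on a real server you would pipe `ibstat` into the function instead.

```shell
# Sketch: fail when any IB port in ibstat-style input is not Active/LinkUp.
check_ib_ports() {
  awk '/State:/          && $2 != "Active" {bad++}
       /Physical state:/ && $3 != "LinkUp" {bad++}
       END {
         if (bad > 0) { print bad " port(s) not healthy"; exit 1 }
         print "all ports Active/LinkUp"
       }'
}

# Sample input mirroring the ibstat output above; on a real server use:
#   ibstat | check_ib_ports
check_ib_ports <<'EOF'
CA 'mlx5_0'
    Port 1:
        State: Active
        Physical state: LinkUp
CA 'mlx5_1'
    Port 1:
        State: Active
        Physical state: LinkUp
EOF
```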
IB bandwidth communication check
To check the IB bandwidth communication status (ib_send_bw) and inspect the IaaS HW level, follow these steps.
Check the name of the IB HCA interface.
user@bm-dev-001:~$ ibdev2netdev
mlx5_0 port 1 ==> ibs18 (Down)
mlx5_1 port 1 ==> ibs17 (Down)
mlx5_2 port 1 ==> ibs20 (Down)
mlx5_3 port 1 ==> ibs19 (Down)
Code block. Check the name of the IB HCA interface

Check the HCA interfaces that can communicate with IB Switch#1.
mlx5_0 port 1 ==> ibs18 (Down)
mlx5_2 port 1 ==> ibs20 (Down)
Code block. HCA interface check

Check the HCA interfaces that can communicate with IB Switch#2.
mlx5_1 port 1 ==> ibs17 (Down)
mlx5_3 port 1 ==> ibs19 (Down)
Code block. HCA interface check

Run the SERVER-side command to check the communication status.
The CLIENT-side command is entered second for mutual communication.
user@bm-dev-001:~$ ib_send_bw -d mlx5_3 -i 1 -F

************************************
* Waiting for client to connect... *
************************************
---------------------------------------------------------------------------------------
                    Send BW Test
 Dual-port       : OFF
 Device          : mlx5_3
 Number of qps   : 1
 Transport type  : IB
 Connection type : RC
 Using SRQ       : OFF
 PCIe relax order: ON
 ibv_wr* API     : ON
 RX depth        : 512
 CQ Moderation   : 1
 Mtu             : 4096[B]
 Link type       : IB
 Max inline data : 0[B]
 rdma_cm QPs     : OFF
 Data ex. method : Ethernet
---------------------------------------------------------------------------------------
 local address:  LID 0x07 QPN 0x002e PSN 0xa86622
 remote address: LID 0x0a QPN 0x002d PSN 0xfc58dd
---------------------------------------------------------------------------------------
 #bytes     #iterations    BW peak[MB/sec]    BW average[MB/sec]    MsgRate[Mpps]
 65536      1000             0.00               19827.40             0.317238
---------------------------------------------------------------------------------------
Code block. Communication status check
Run the CLIENT-side command to check the communication status. The SERVER-side command must be entered first for mutual communication.
root@bm-dev-003:~# ib_send_bw -d mlx5_3 -i 1 -F <SERVER Side IP>
---------------------------------------------------------------------------------------
                    Send BW Test
 Dual-port       : OFF
 Device          : mlx5_3
 Number of qps   : 1
 Transport type  : IB
 Connection type : RC
 Using SRQ       : OFF
 PCIe relax order: ON
 ibv_wr* API     : ON
 TX depth        : 128
 CQ Moderation   : 1
 Mtu             : 4096[B]
 Link type       : IB
 Max inline data : 0[B]
 rdma_cm QPs     : OFF
 Data ex. method : Ethernet
---------------------------------------------------------------------------------------
 local address:  LID 0x0a QPN 0x002a PSN 0x98a48e
 remote address: LID 0x07 QPN 0x002c PSN 0xe68304
---------------------------------------------------------------------------------------
 #bytes     #iterations    BW peak[MB/sec]    BW average[MB/sec]    MsgRate[Mpps]
 65536      1000           19008.49           19006.37              0.304102
---------------------------------------------------------------------------------------
Code block. Communication status check
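The "BW average[MB/sec]" column of the result line can be checked against a minimum expectation. This is a sketch: the 18000 MB/sec threshold is an example value rather than an official acceptance criterion, and the sample line is copied from the output above.

```shell
# Sketch: compare the "BW average[MB/sec]" field (4th column) of an ib_send_bw
# result line against a minimum value. The threshold is an example figure.
check_bw() {  # check_bw "<result line>" <min MB/sec>
  echo "$1" | awk -v m="$2" '{exit ($4 >= m) ? 0 : 1}'
}

line="65536      1000             0.00               19827.40             0.317238"
if check_bw "$line" 18000; then
  echo "bandwidth OK"
else
  echo "bandwidth below threshold"
fi
```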
Check IB Service Related Kernel Modules
Check the IB service-related kernel modules (lsmod) to inspect the IaaS HW level.
user@bm-dev-001:~$ lsmod | grep nv_peer_mem
nv_peer_mem            16384  0
ib_core               315392  9 rdma_cm,ib_ipoib,nv_peer_mem,iw_cm,ib_umad,rdma_ucm,ib_uverbs,mlx5_ib,ib_cm
nvidia              35315712  156 nvidia_uvm,nv_peer_mem,nvidia_modeset

user@bm-dev-001:~$ service nv_peer_mem status
nv_peer_mem.service - LSB: Activates/Deactivates nv_peer_mem to \ start at boot time.
     Loaded: loaded (/etc/init.d/nv_peer_mem; generated)
     Active: active (exited) since Mon 2023-03-13 16:21:33 KST; 2 days ago
       Docs: man:systemd-sysv-generator(8)
    Process: 4913 ExecStart=/etc/init.d/nv_peer_mem start (code=exited, status=0/SUCCESS)

user@bm-dev-001:~$ lsmod | grep ib
libiscsi_tcp           32768  1 iscsi_tcp
libiscsi               57344  2 libiscsi_tcp,iscsi_tcp
scsi_transport_iscsi  110592  4 libiscsi_tcp,iscsi_tcp,libiscsi
ib_ipoib              131072  0
ib_cm                  57344  2 rdma_cm,ib_ipoib
ib_umad                24576  8
mlx5_ib               380928  0
ib_uverbs             135168  18 rdma_ucm,mlx5_ib
ib_core               315392  9 rdma_cm,ib_ipoib,nv_peer_mem,iw_cm,ib_umad,rdma_ucm,ib_uverbs,mlx5_ib,ib_cm
libcrc32c              16384  2 btrfs,raid456
mlx5_core            1458176  1 mlx5_ib
auxiliary              16384  2 mlx5_ib,mlx5_core
mlx_compat             65536  12 rdma_cm,ib_ipoib,mlxdevm,iw_cm,auxiliary,ib_umad,ib_core,rdma_ucm,ib_uverbs,mlx5_ib,ib_cm,mlx5_core
Code block. IB service-related kernel module check

Storage Physical Disk Resources and Multi-Path Check
Check the storage physical disk resources and multipath configuration to inspect the IaaS HW level.
root@bm-dev-002:/tmp# fdisk -l
root@bm-dev-002:/tmp# multipath -ll
Code block. Storage physical disk and multipath check

Checking the Service Network after a New Multi-node GPU Cluster Deployment
Use the following command to check whether the MII Status of the bonding and slave interfaces is up.
Command
root@mngc-001:~# cat /proc/net/bonding/bond-srv
Ethernet Channel Bonding Driver: v5.15.0-25-generic
Code block. Service network check command

Result
Bonding Mode: fault-tolerance (active-backup)
Primary Slave: None
Currently Active Slave: ens9f0
MII Status: up
MII Polling Interval (ms): 100
Up Delay (ms): 0
Down Delay (ms): 0
Peer Notification Delay (ms): 0

Slave Interface: ens9f0
MII Status: up
Speed: 100000 Mbps
Duplex: full
Link Failure Count: 0
Permanent HW addr: 30:3e:a7:02:35:70
Slave queue ID: 0

Slave Interface: ens11f0
MII Status: up
Speed: 100000 Mbps
Duplex: full
Link Failure Count: 0
Permanent HW addr: 30:3e:a7:02:2f:e8
Slave queue ID: 0
Code block. Service network check command result
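This check can be automated by counting the MII Status lines. A sketch, assuming the file layout shown above; the sample heredoc stands in for /proc/net/bonding/bond-srv so the snippet runs anywhere.

```shell
# Sketch: verify every "MII Status:" line (bond plus each slave) reports "up".
check_bonding() {
  awk '/MII Status:/ {total++; if ($3 == "up") up++}
       END {
         printf "%d/%d MII Status up\n", up, total
         exit (total > 0 && up == total) ? 0 : 1
       }'
}

# On a real node: check_bonding < /proc/net/bonding/bond-srv
check_bonding <<'EOF'
Bonding Mode: fault-tolerance (active-backup)
MII Status: up
Slave Interface: ens9f0
MII Status: up
Slave Interface: ens11f0
MII Status: up
EOF
```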
Checking the Time Server and Time Synchronization after a New Multi-node GPU Cluster Deployment
The OS image has the chrony daemon installed and configured to synchronize with the SCP NTP server. Use the following command to check that there is a line marked ^* in the MS Name column.
Command
root@mngc-001:~# chronyc sources -v
Code block. chrony synchronization check command

Result
MS Name/IP address         Stratum Poll Reach LastRx Last sample
===============================================================================
^+ 198.19.1.53                   4   10   377   1040    -16us[  -37us] +/- 9982us
^* 198.19.1.54                   4   10   377    312   -367us[ -388us] +/-   13ms
Code block. chrony synchronization check result
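Checking for the ^* marker can likewise be scripted. A sketch: on a real node you would pipe the output of `chronyc sources` into the function; the printf sample here is illustrative.

```shell
# Sketch: succeed only when the chronyc output contains a "^*" (synchronized) source.
check_chrony_sync() {
  if grep -q '^\^\*'; then
    echo "synchronized with an NTP source"
  else
    echo "no synchronized (^*) source found"
    return 1
  fi
}

# On a real node: chronyc sources | check_chrony_sync
printf '^+ 198.19.1.53  4 10 377 1040\n^* 198.19.1.54  4 10 377 312\n' | check_chrony_sync
```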
GPU MIG/ECC Setting Initialization Check Guide
When a Multi-node GPU Cluster product is provisioned, the GPU MIG/ECC settings are initialized. However, to make sure the exact setting values are applied, restart the server once at the beginning and then follow this inspection guide to confirm that the settings have taken effect.
- MIG: Multi-Instance GPU
- ECC: Error Correction Code
MIG Setup Initialization
Refer to the following for how to check and initialize MIG settings.
Use the following command to check whether the status value of MIG M. is Disabled.

Command
root@bm-dev-001:~# nvidia-smi
Code block. MIG setting check command

Result
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 470.129.06    Driver Version: 470.129.06    CUDA Version: 11.4               |
|----------------------------------+-----------------------------+------------------------|
| GPU  Name        Persistence-M   | Bus-Id        Disp.A        | Volatile Uncorr. ECC   |
| Fan  Temp  Perf  Pwr:Usage/Cap   |         Memory-Usage        | GPU-Util  Compute M.   |
|                                  |                             |               MIG M.   |
|==================================+=============================+========================|
|   0  NVIDIA A100-SXM...  Off     | 00000000:03:00.0     Off    |                  Off   |
| N/A  29C   P0    57W / 400W      |      0MiB / 81251MiB        |      0%      Default   |
|                                  |                             |             Disabled   |
+----------------------------------+-----------------------------+------------------------+
|   0  NVIDIA A100-SXM...  Off     | 00000000:0C:00.0     Off    |                  Off   |
| N/A  30C   P0    58W / 400W      |      0MiB / 81251MiB        |     18%      Default   |
|                                  |                             |             Disabled   |
+-----------------------------------------------------------------------------------------+
Code block. MIG setting check result

If the status value of MIG M. is not Disabled, use the following commands to initialize MIG.
root@bm-dev-001:~# nvidia-smi -mig 0
root@bm-dev-001:~# nvidia-smi --gpu-reset
Code block. MIG M. status value initialization
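The MIG state can also be read in a script-friendly form. This sketch assumes the `--query-gpu` CSV interface of nvidia-smi (`nvidia-smi --query-gpu=index,mig.mode.current --format=csv,noheader`); the printf sample stands in for real output so the snippet runs without a GPU.

```shell
# Sketch: report GPUs whose current MIG mode is Enabled, given CSV lines of
# "index, mig.mode.current" (as produced by nvidia-smi --query-gpu).
list_mig_enabled() {
  awk -F', ' '$2 == "Enabled" {print "GPU " $1 " has MIG enabled"; bad++}
              END {exit (bad > 0) ? 1 : 0}'
}

# On a real node:
#   nvidia-smi --query-gpu=index,mig.mode.current --format=csv,noheader | list_mig_enabled
if printf '0, Disabled\n1, Disabled\n' | list_mig_enabled; then
  echo "MIG disabled on all GPUs"
fi
```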
ECC Setting Initialization
Refer to the following for how to check and initialize the ECC settings.
Use the following command to check whether the status value of Volatile Uncorr. ECC is Off.

Command
root@bm-dev-001:~# nvidia-smi
Code block. ECC setting check command

Result
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 470.129.06    Driver Version: 470.129.06    CUDA Version: 11.4               |
|----------------------------------+-----------------------------+------------------------|
| GPU  Name        Persistence-M   | Bus-Id        Disp.A        | Volatile Uncorr. ECC   |
| Fan  Temp  Perf  Pwr:Usage/Cap   |         Memory-Usage        | GPU-Util  Compute M.   |
|                                  |                             |               MIG M.   |
|==================================+=============================+========================|
|   0  NVIDIA A100-SXM...  Off     | 00000000:03:00.0     Off    |                  Off   |
| N/A  29C   P0    57W / 400W      |      0MiB / 81251MiB        |      0%      Default   |
|                                  |                             |             Disabled   |
+----------------------------------+-----------------------------+------------------------+
|   0  NVIDIA A100-SXM...  Off     | 00000000:0C:00.0     Off    |                  Off   |
| N/A  30C   P0    61W / 400W      |      0MiB / 81251MiB        |     18%      Default   |
|                                  |                             |             Disabled   |
+-----------------------------------------------------------------------------------------+
Code block. ECC setting check result

If the status value of Volatile Uncorr. ECC is On*, reboot the server.
If the status value of Volatile Uncorr. ECC is neither On* nor Off, use the following command to initialize ECC. After initialization, reboot and check that the status value is Off.
root@bm-dev-001:~# nvidia-smi --ecc-config=0
Code block. ECC initialization

5.3 - Release Note
Multi-node GPU Cluster
- You can cancel multiple resources at the same time from the GPU Node list.
- The nodes must use the same DataSet and Cluster Fabric.
- It has been linked with Cloud Monitoring.
- You can check major performance items in real-time in Cloud Monitoring.
- Multi-node GPU Cluster service has been launched.
- Provides a service that offers physical GPU servers without virtualization for large-scale high-performance AI computing.
6 - Cloud Functions
6.1 - Overview
Service Overview
Cloud Functions is a serverless FaaS (Function as a Service) computing service that lets you run applications as functions without provisioning servers. Users do not need to manage servers or containers for scaling and can focus on writing and deploying code for application development.
Features
- Easy and convenient development environment: Developers can easily create Function resources connected to events in various environments using a Code Editor suitable for the chosen runtime, and can write and call code easily.
- Serverless Computing: You can use a serverless type of code execution service for development in the Samsung Cloud Platform environment. The resources required to call and execute function-type applications are allocated and managed by Samsung Cloud Platform according to the scale of execution.
- Efficient Cost Management: Function calls are charged only for the actual application run time, aggregated from usage (total number of calls, total call time). Functions with low usage are scaled to zero by the Cloud Functions Scaler, preventing idle resource consumption and enabling efficient cost management.
Service Composition Diagram
Provided Features
Cloud Functions provides the following features.
- Code Writing Environment: Runtime-optimized Function creation, Code writing and editing
- For supported runtimes, refer to Component > Runtime
- Function execution, environment management, and monitoring: endpoint definition, token management, access control settings, trigger settings, definition and modification of the operating environment and variables, Deploy/Test invocation and output checking, service deployment, and progress monitoring/logging
- Serverless Computing: all elements required for code writing and deployment are managed by Samsung Cloud Platform, with automatic scale adjustment according to deployment
- Sample Code Provided: Provides various sample codes through Blueprint, allowing for easy and quick start
Component
Runtime
Cloud Functions currently supports the following Runtime, and more supported Runtime will be added continuously.
| Runtime | Version |
|---|---|
| Go | 1.21, 1.23 |
| Java | 17 |
| Node.js | 18, 20 |
| PHP | 8.1 |
| Python | 3.9, 3.10, 3.11 |
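Regardless of the chosen runtime, a function is written as a handler that the platform invokes: for example, `handle_request(params)` in Python and `exports.handleRequest` in Node.js, as the blueprint examples in the How-to guides show. A minimal Python handler looks like this:

```python
import json

def handle_request(params):
    # User writing area (Function details): receive the call
    # parameters and return a status code with a response body.
    return {
        'statusCode': 200,
        'body': json.dumps('Hello Serverless World!')
    }
```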
Regional Provision Status
Cloud Functions service is available in the following environments.
| Region | Availability |
|---|---|
| Korea West 1 (kr-west1) | Provided |
| Korea East 1 (kr-east1) | Provided |
| Korea South 1 (kr-south1) | Not provided |
| Korea South 2 (kr-south2) | Not provided |
| Korea South 3 (kr-south3) | Not provided |
Preceding Service
The following services can optionally be configured before creating this service. For more information, refer to the guide provided for each service and prepare in advance.
| Service Category | Service | Detailed Description |
|---|---|---|
| Application Service | API Gateway | A service that easily manages and monitors APIs |
6.1.1 - ServiceWatch Metrics
Cloud Functions sends metrics to ServiceWatch. The metrics provided by default monitoring are collected at a 1-minute interval.
Basic Indicators
The following are the basic metrics for the Cloud Functions namespace.
| Performance Item | Detailed Description | Unit | Meaningful Statistics |
|---|---|---|---|
6.2 - How-to guides
The user can enter the required information for Cloud Functions through the Samsung Cloud Platform Console, select detailed options, and create the service.
Create Cloud Functions
- Click the All Services > Compute > Cloud Functions menu. Navigate to the Service Home page of Cloud Functions.
- Click the Create Cloud Functions button on the Service Home page. Navigate to the Create Cloud Functions page.
- On the Create Cloud Functions page, enter the information required to create the service.
| Category | Required | Detailed description |
|---|---|---|
| Function name | Required | Enter the name of the Function to create. It must start with a lowercase English letter and may contain lowercase English letters, numbers, and the special character (-), within 3 to 64 characters. |
| Runtime | Required | Select the Runtime creation method. Create new: create a new Runtime. Start with Blueprint: create using the Runtime source code provided by the service. |
| Runtime & Version | Required | Select the Runtime and Version. When Create new is selected, refer to the Runtime & Version list in Component > Runtime; for the Java runtime, UI code editing is not supported, but you can run the function by fetching a Jar file from Object Storage. When Start with Blueprint is selected, you can view source code examples by clicking the View Source Code button for the selected Runtime & Version; refer to the Blueprint Detailed Guide for details on Blueprint settings. |

Table. Cloud Functions Service Information Input Items
- In the Summary panel, check the detailed information and estimated billing amount, then click the Complete button.
- When creation is complete, check the created resource on the Cloud Functions list page.
Cloud Functions Check Detailed Information
The Cloud Functions Details page consists of the Detail Information, Monitoring, Log, Code, Configuration, Trigger, Tag, and Work History tabs.
To view detailed information about the Cloud Functions service, follow these steps.
- Click the All Services > Compute > Cloud Functions menu. Go to the Service Home page of Cloud Functions.
- Click the Function menu on the Service Home page. Navigate to the Function List page.
- On the Function list page, click the resource whose detailed information you want to view. Go to the Function detail page.
- The Function Details page displays status information and additional feature information.
| Category | Detailed description |
|---|---|
| Cloud Functions status | Cloud Functions status information. Ready: green icon, normal function calls are possible. Not Ready: gray icon, normal function calls are not possible. Deploying: yellow icon, the function is being created or changed; triggered by function creation and modification, editing code in the Code tab, inspecting a jar file in the Code tab, adding and modifying triggers in the Trigger tab, and modifying the Configuration tab. Running: blue icon, normal function calls are possible and the cold start prevention policy is applied. |
| Service cancellation | Button to cancel the service |

Table. Cloud Functions status information and additional features
Detailed Information
On the Detail Information tab, you can view detailed information of the selected resource and, if necessary, edit it.
| Category | Detailed description |
|---|---|
| Service | Service name |
| Resource Type | Resource Type |
| SRN | Unique resource ID in Samsung Cloud Platform |
| Resource Name | Resource name |
| Resource ID | Unique resource ID of the service |
| Creator | User who created the service |
| Creation time | Date/time the service was created |
| Editor | User who modified the service |
| Modification Date and Time | Date and time the service was modified |
| Function name | Name of Cloud Function |
| Runtime | Runtime types and versions |
| LLM Endpoint | Click User Guide to view LLM Endpoint information and usage instructions |
Monitoring
On the Monitoring tab, you can view the Cloud Functions usage information of the selected resource.
| Category | Detailed description |
|---|---|
| Number of calls | Average number of times the function was called per unit time (count) |
| Execution Time | Average execution time of the function per unit time (seconds) |
| Memory usage | Average memory usage during function execution per unit time (KB) |
| Current task count | Average number of tasks created per unit time to process simultaneous calls (count) |
| Successful call count | Average number of calls per unit time in which the runtime code ran normally and returned a response code (count) |
| Failed call count | Average number of calls per unit time that resulted in an error (count) |
Log
On the Log tab, you can view the Cloud Functions logs of the selected resource.
| Category | Detailed description |
|---|---|
| Unit period | Select the period for which to view Cloud Functions log information |
| Log Message | Log messages are displayed in order, starting from the most recent |
Code
On the Code tab, you can view and edit the Cloud Functions code of the selected resource.
The way to view and edit source code changes depending on the runtime used.
- Inline Editor: Node.js, Python, PHP, Go
- Run compressed file (.jar/.zip): Java
| Category | Detailed description |
|---|---|
| Source Code | Inline Editor Mode |
| Code Information | Display code information |
| Edit | After clicking the Edit button, you can edit the code in the inline editor |
| Category | Detailed description |
|---|---|
| Source code | Compressed file (.jar/.zip) execution method |
| Code Information | Displays compressed file information |
| Edit | The Jar file can be changed |
- For the Java Runtime, UI code editing is not provided; you must select a compressed file (.jar/.zip) from a bucket in the Object Storage service.
- If an Object Storage service authentication key has not been generated, Import from Object Storage cannot be executed, so generate the authentication key in advance.
- For detailed information on creating authentication keys, refer to Create Authentication Key.
- The access control of the Object Storage bucket must be changed to the Allow state for the Cloud Functions service.
- For detailed information about Object Storage access control, refer to Allow Cloud Functions Service Access.
Configuration
On the Configuration tab, you can view the Cloud Functions configuration of the selected resource.
| Category | Detailed description |
|---|---|
| General Configuration | Cloud Function memory and timeout settings |
| Environment Variable | Set runtime environment variables |
| Function URL | Issue an HTTPS URL address that can be used to access the function |
| Private connection configuration | Can be used in conjunction with the PrivateLink service |
- The memory allocated in General Configuration proportionally determines the number of CPU cores that are automatically assigned.
- Setting the minimum execution count in General Configuration to 1 or more prevents cold starts, but incurs continuous costs.
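Once a Function URL is issued and activated, the function can be called like any HTTPS endpoint. Below is a minimal sketch using Python's standard library; the URL and payload are placeholders for illustration, not values from this guide:

```python
import json
import urllib.request

def build_invoke_request(function_url, payload):
    # Build a POST request for a Function URL issued in the
    # Configuration tab; `function_url` is a placeholder value.
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        function_url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

# Example (not executed here):
# req = build_invoke_request("https://<function-url>", {"testKey": "cloud-001"})
# with urllib.request.urlopen(req) as resp:
#     print(resp.read().decode("utf-8"))
```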
Trigger
On the Trigger tab, you can view and configure the trigger information of the selected resource. If you set a trigger, the Function can be executed automatically when an event occurs.
| Category | Detailed description |
|---|---|
| Cronjob | Use a Cronjob as a trigger |
| API Gateway | Use API Gateway as a trigger |
- Triggers cannot be edited while the function is in the Deploying state.
- For trigger settings, refer to Trigger Setup.
Tag
In the Tag tab, you can view the resource’s tag information, and add, modify, or delete it.
| Category | Detailed description |
|---|---|
| Tag List | Tag list |
Work History
You can check the work history of resources on the Work History page.
| Category | Detailed description |
|---|---|
| Work History List | Resource change history |
Java Runtime Code Change
If you are using the Java Runtime, you cannot modify the code directly; you must select and change a compressed file (.jar/.zip) from a bucket in the Object Storage service.
To change the compressed file, follow the steps below.
- Click the All Services > Compute > Cloud Functions menu. Go to the Service Home page of Cloud Functions.
- Click the Function menu on the Service Home page. Navigate to the Function list page.
- On the Function List page, click the resource whose compressed file you want to change. Navigate to the Function Details page.
- Click the Edit button on the Code tab of the Function Details page. Navigate to the Function Code Edit page.
- Click the Import from Object Storage button. The Import from Object Storage popup opens.
| Category | Detailed description |
|---|---|
| Java Runtime | Java Runtime information |
| Handler Information | Handler information |
| Compressed file (.jar/.zip) | Set the compressed file to modify |
- In Object Storage URL, enter the URL of the Object Storage file from which to retrieve the compressed file, then click the Confirm button. A notification popup opens.
- The URL can be found on the Folder List tab of the detail page of the Object Storage bucket, under File Information > Private URL.
- Click the Confirm button. The name of the imported archive file is displayed in Archive file name (.jar/.zip) on the Function Code Edit page.
- Click the Save button.
- If an authentication key has not been generated, Import from Object Storage cannot be executed.
- The compressed file cannot be changed if the URL does not exist or if the file meets any of the following conditions.
- When using an unsupported extension
- If there is a harmful file in the compressed file
- If it exceeds the supported size
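Of the conditions above, the extension rule can be checked locally before importing; the supported archive formats are .jar and .zip. A small sketch:

```python
def is_supported_archive(filename):
    # Cloud Functions accepts only .jar and .zip archives for the
    # Java runtime; other extensions are rejected on import.
    return filename.lower().endswith((".jar", ".zip"))
```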
Cancel Cloud Functions
To cancel the Cloud Functions service, follow the steps below.
- Click the All Services > Compute > Cloud Functions menu. Navigate to the Service Home page of Cloud Functions.
- Click the Function menu on the Service Home page. Go to the Function List page.
- On the Function list page, click the resource to be canceled, then click the Cancel Service button.
- When cancellation is complete, confirm that the resource has been removed from the Function List page.
6.2.1 - Set Trigger
Set up trigger
- By default, all triggers can be added in Cloud Functions.
- When an event is triggered from a specific product, the event must be delivered to Cloud Functions.
Cronjob Trigger Setup
To set up a Cronjob trigger, follow these steps.
- Click the All Services > Compute > Cloud Functions menu. Navigate to the Service Home page of Cloud Functions.
- Click the Function menu on the Service Home page. Go to the Function List page.
- On the Function List page, click the resource for which to set a trigger. Navigate to the Function Details page.
- After clicking the Trigger tab, click the Add Trigger button. The Add Trigger popup opens.
- In the Add Trigger popup window, select Cronjob as the Trigger Type. The required information input area appears at the bottom.
| Category | Detailed description |
|---|---|
| Cronjob Settings | Set the trigger's repeat frequency. Can be set in minutes, hours, days, months, and weekdays |
| Timezone setting | Set the trigger's reference time zone |

Table. Cronjob Trigger Required Information Items

- After entering the required information, click the Confirm button.
- When the pop-up window notifying addition opens, click the Confirm button.
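The Cronjob settings consist of the five schedule fields listed above (minute, hour, day, month, weekday). As an illustration, here is a small helper that labels the fields of a schedule expression; the exact expression syntax accepted by the console may differ:

```python
def parse_cron_fields(expr):
    # Split a five-field schedule expression into named fields:
    # minute, hour, day, month, weekday.
    names = ["minute", "hour", "day", "month", "weekday"]
    fields = expr.split()
    if len(fields) != len(names):
        raise ValueError("expected 5 fields: minute hour day month weekday")
    return dict(zip(names, fields))

# parse_cron_fields("0 9 * * 1") describes a run at 09:00 every Monday
# (in the trigger's configured time zone).
```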
API Gateway Trigger Setup
To set up an API Gateway trigger, follow these steps.
- Click the All Services > Compute > Cloud Functions menu. Go to the Service Home page of Cloud Functions.
- Click the Function menu on the Service Home page. Go to the Function List page.
- On the Function List page, click the resource for which to set a trigger. Go to the Function Details page.
- After clicking the Trigger tab, click the Add Trigger button. The Add Trigger popup window opens.
- In the Add Trigger popup window, select API Gateway as the Trigger Type. The required information input area appears at the bottom.
| Category | Detailed description |
|---|---|
| API name | Select an API. You can select an existing API or create a new one |
| Stage | Select the deployment target. You can select an existing stage or create a new one |

Table. API Gateway Trigger Required Information Items

- After entering the required information, click the Confirm button.
- When the popup notifying addition opens, click the Confirm button.
Setting up Multi Trigger
You can connect multiple triggers to a single function and use them.
Edit Trigger
To modify the added trigger, follow the steps below.
- Click the All Services > Compute > Cloud Functions menu. Go to the Service Home page of Cloud Functions.
- Click the Function menu on the Service Home page. Navigate to the Function List page.
- On the Function List page, click the resource whose trigger you want to edit. Navigate to the Function Details page.
- After clicking the Trigger tab, click the Edit button of the trigger whose settings you want to modify in the trigger list. The Edit Trigger popup window opens.
- In the Edit Trigger popup window, modify the setting values, then click the Confirm button.
- The setting values vary depending on the type of trigger (see Cronjob Trigger, API Gateway Trigger).
- When the popup notifying the edit opens, click Confirm.
Delete Trigger
To delete the trigger, follow the steps below.
- Click the All Services > Compute > Cloud Functions menu. Navigate to the Service Home page of Cloud Functions.
- Click the Function menu on the Service Home page. Navigate to the Function List page.
- On the Function List page, click the resource whose trigger you want to delete. Navigate to the Function Details page.
- In the Trigger tab’s trigger list, after selecting the trigger to delete, click the Delete button.
- Click the Confirm button when the popup notifying trigger deletion opens.
6.2.2 - AIOS Connect
AIOS Linking
You can use LLM by linking Cloud Functions with AIOS.
AIOS LLM Private Endpoint
The URL of the AIOS LLM private endpoint is as follows.
- Samsung Cloud Platform for Samsung: https://aios.private.kr-west1.s.samsungsdscloud.com
- Samsung Cloud Platform for Enterprise: https://aios.private.kr-west1.e.samsungsdscloud.com
Refer to the following for details on the regional availability of the AIOS service and the models it provides.
- Availability of the AIOS service by region: refer to AIOS Service - Regional Availability
- LLM models provided by the AIOS service: refer to AIOS Service - LLM Provided Models
Blueprint Change Source Code
To integrate Cloud Functions with AIOS, you need to change the URL address in the Blueprint to match the LLM Endpoint used in each region. To change the Blueprint source code, follow the steps below.
- Click the All Services > Compute > Cloud Functions menu. Go to the Service Home page of Cloud Functions.
- Click the Function menu on the Service Home page. Navigate to the Function list page.
- On the Function List page, click the resource to be called via URL. Navigate to the Function Detail page.
- After clicking the Code tab, click the Edit button. Navigate to the Function Code Edit page.
- After modifying the Blueprint source code for the Python, Node.js, or Go Runtime, click the Save button.
Python source code
```python
import json
import requests

def handle_request(params):
    # User writing area (Function details)
    url = "{AIOS LLM private endpoint}/{API}"  # Destination URL
    data = {
        "model": "openai/gpt-oss-120b",
        "prompt": "Write a haiku about recursion in programming.",
        "temperature": 0,
        "max_tokens": 100,
        "stream": False
    }
    try:
        response = requests.post(url, json=data, verify=True)
        return {
            'statusCode': response.status_code,
            'body': json.dumps(response.text)
        }
    except requests.exceptions.RequestException as e:
        return str(e)
```

Node.js source code
```javascript
const request = require('request');
/**
 * @description User writing area (Function details)
 */
exports.handleRequest = async function (params) {
    return await sendRequest(params);
};

async function sendRequest(req) {
    return new Promise((resolve, reject) => {
        const url = "{AIOS LLM private endpoint}/{API}";
        const data = {
            model: 'openai/gpt-oss-120b',
            prompt: 'Write a haiku about recursion in programming.',
            temperature: 0,
            max_tokens: 100,
            stream: false
        };
        const options = {
            uri: url,
            method: 'POST',
            body: data,
            json: true,
            strictSSL: false,
            rejectUnauthorized: false
        };
        request(options, (error, response, body) => {
            if (error) {
                reject(error);
            } else {
                resolve({
                    statusCode: response.statusCode,
                    body: JSON.stringify(body)
                });
            }
        });
    });
}
```

GO source code
```go
package gofunction

import (
    "bytes"
    "encoding/json"
    "io/ioutil"
    "net/http"
)

type PostData struct {
    Model       string `json:"model"`
    Prompt      string `json:"prompt"`
    Temperature int    `json:"temperature"`
    MaxTokens   int    `json:"max_tokens"`
    Stream      bool   `json:"stream"`
}

func HandleRequest(r *http.Request) (string, error) {
    url := "{AIOS LLM private endpoint}/{API}"
    data := PostData{
        Model:       "openai/gpt-oss-120b",
        Prompt:      "Write a haiku about recursion in programming.",
        Temperature: 0,
        MaxTokens:   100,
        Stream:      false,
    }
    jsonData, err := json.Marshal(data)
    if err != nil {
        panic(err)
    }
    req, err := http.NewRequest("POST", url, bytes.NewBuffer(jsonData))
    if err != nil {
        panic(err)
    }
    req.Header.Set("Content-Type", "application/json")
    client := &http.Client{}
    resp, err := client.Do(req)
    if err != nil {
        panic(err)
    }
    defer resp.Body.Close()
    // Read the response body
    body, err := ioutil.ReadAll(resp.Body)
    if err != nil {
        panic(err)
    }
    return string(body), nil
}
```
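The blueprint code above returns the endpoint's raw response text. If the endpoint returns an OpenAI-style completions payload (an assumption for illustration; check the actual AIOS response schema), the generated text could be extracted like this:

```python
import json

def extract_completion_text(response_body):
    # Assumes an OpenAI-style completions response with a
    # "choices" list; adjust if the AIOS schema differs.
    payload = json.loads(response_body)
    return payload["choices"][0]["text"]
```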
6.2.3 - Blueprint Detailed Guide
Blueprint Overview
When creating Cloud Functions, you can set the Blueprint to utilize the Runtime source code provided by Cloud Functions. Refer to the following for Blueprint items provided by Cloud Functions.
| Category | Detailed Description | Remarks |
|---|---|---|
| Hello World | When the function is called, it responds with Hello Serverless World! | |
| Execution after timeout | Demonstrates that code placed after the function call time is exceeded does not execute. | PHP, Python not supported |
| HTTP request body | Parses the request body. | PHP not supported |
| Send HTTP requests | Send HTTP requests from Cloud functions. | PHP not supported |
| Print logs | Outputs the user’s Samsung Cloud Platform Console Request to the log. | PHP not supported |
| Throw a custom error | Enter the error logic directly to handle the error. | |
| Using Environment Variable | Configure environment variables within the Cloud function and execute. |
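For the Using Environment Variable blueprint, environment variables set in the Configuration tab are read inside the function at runtime. A minimal Python sketch, using a hypothetical variable name `GREETING_TARGET` (not from this guide):

```python
import json
import os

def handle_request(params):
    # User writing area (Function details):
    # read an environment variable set in the Configuration tab.
    # "GREETING_TARGET" is a hypothetical name for illustration.
    target = os.environ.get("GREETING_TARGET", "Serverless World")
    return {
        'statusCode': 200,
        'body': json.dumps(f"Hello {target}!")
    }
```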
Hello World
Hello World explains how to set up the response and gives a function call example (using the function URL).
Hello World Setting
To set Hello World, follow the steps below.
- Click the All Services > Compute > Cloud Functions menu. Navigate to the Service Home page of Cloud Functions.
- Click the Function menu on the Service Home page. Go to the Function list page.
- On the Function list page, click the resource to be called via URL. Navigate to the Function detail page.
- Click the Configuration tab, then click the Edit button of the Function URL item. The Edit Function URL popup opens.
- In the Edit Function URL popup window, set Activation Status to Enabled, then click the Confirm button.

| Category | Detailed description |
|---|---|
| Activation status | Set whether to use the function URL |
| Authentication Type | Select whether to use IAM authentication when requesting the function URL |
| Access Control | Manage access by adding allowed IPs. After setting to Use, public access IPs can be entered and added |

Table. Function URL setting items

- After moving to the Code tab, click the Edit button. Navigate to the Function Code Edit page.
- After adding the processing logic for success and failure cases, click the Save button.
- Node.js source code

```javascript
exports.handleRequest = async function (params) {
    /**
     * @description User writing area (Function details)
     */
    const response = {
        statusCode: 200,
        body: JSON.stringify('Hello Serverless World!'),
    };
    return response;
};
```

- Python source code

```python
import json

def handle_request(params):
    # User writing area (Function details)
    return {
        'statusCode': 200,
        'body': json.dumps('Hello Serverless World!')
    }
```

- PHP source code

```php
<?php
function handle_request() {
    # User writing area (Function details)
    $res = array(
        'statusCode' => 200,
        'body' => 'Hello Serverless World!',
    );
    return $res;
}
?>
```
Check function call
After calling the function URL in the Configuration tab of the Function Details page, verify the response.
Hello Serverless World!
# Execution after timeout
Explains the Execution after timeout setting and a function call example (using the function URL).
## Execution after timeout Setting
To set Execution after timeout, follow the steps below.
1. Click the **All Services > Compute > Cloud Functions** menu. Go to the **Service Home** page of Cloud Functions.
2. Click the **Function** menu on the **Service Home** page. Navigate to the **Function List** page.
3. On the **Function List** page, click the resource for which to set a trigger. The **Function Details** page opens.
4. After clicking the **Trigger** tab, click the **Add Trigger** button. The **Add Trigger** popup opens.
5. In the **Add Trigger** popup window, select the **Trigger Type** item, enter the required information displayed at the bottom, and click the **Confirm** button.
* Required information varies depending on the trigger type.
| Trigger Type | Input Item |
|---|---|
| API Gateway | API name: select an existing API or create a new one. Stage: select an existing stage or create a new one |
| Cronjob | Enter the trigger's repeat frequency (minute, hour, day, month, day of week), referring to the example. Timezone setting: select the reference time zone to apply |

Table. Required input items when adding a trigger
6. After moving to the **Code** tab, click the **Edit** button. You will be taken to the **Function Code Edit** page.
7. After adding the processing logic for success and failure cases, click the **Save** button.
* Node.js source code

```javascript
exports.handleRequest = async function (params) {
    /**
     * @description User writing area (Function details)
     */
    console.log("Hello world 3");
    await delay(3000);
    const response = {
        statusCode: 200,
        body: JSON.stringify('Hello Serverless World!'),
    };
    return response;
};

const delay = (ms) => {
    return new Promise(resolve => {
        setTimeout(resolve, ms)
    })
}
```

Execution after timeout - Node.js source code

## Check function call
Call the **function URL** in the **Configuration** tab of the **Function Details** page and, after a certain amount of time, check the response.

Hello Serverless World!
# HTTP request body
Explains the Request Body parsing settings and function call example (using function URL).
## Setting HTTP request body
To set the HTTP request body, follow these steps.
1. Click the **All Services > Compute > Cloud Functions** menu. Go to the **Service Home** page of Cloud Functions.
2. Click the **Function** menu on the **Service Home** page. Navigate to the **Function** list page.
3. On the **Function List** page, click the resource for which to set a trigger. The **Function Details** page opens.
4. After clicking the **Trigger** tab, click the **Add Trigger** button. The **Add Trigger** popup opens.
5. In the **Add Trigger** popup window, select the **Trigger Type** item, enter the required information displayed at the bottom, and click the **Confirm** button.
* Required information varies depending on the trigger type.
| Trigger Type | Input Item |
|---|---|
| API Gateway | API name: select an existing API or create a new one. Stage: select an existing stage or create a new one |
| Cronjob | Enter the trigger's repeat frequency (minute, hour, day, month, day of week), referring to the example. Timezone setting: select the time zone to apply |

Table. Required input items when adding a trigger
6. After moving to the **Code** tab, click the **Edit** button. You will be taken to the **Function Code Edit** page.
7. After adding processing logic for success and failure cases, click the **Save** button.
* Node.js source code

```javascript
exports.handleRequest = async function (params) {
    /**
     * @description User writing area (Function details)
     */
    const response = {
        statusCode: 200,
        body: JSON.stringify(params.body),
    };
    return response;
};
```

HTTP request body - Node.js source code

* Python source code

```python
import json

def handle_request(params):
    # User writing area (Function details)
    return {
        'statusCode': 200,
        'body': json.dumps(params.json)
    }
```

HTTP request body - Python source code

## Check function call
After calling the **function URL** in the **Configuration** tab of the **Function Details** page, check the request Body value and the response Body value.
* Request Body value

```json
{
    "testKey": "cloud-001",
    "testNames": [
        {
            "name": "Son"
        },
        {
            "name": "Kim"
        }
    ],
    "testCode": "test"
}
```

Request Body Value

* Response Body value

```json
{
    "testKey": "cloud-001",
    "testNames": [
        {
            "name": "Son"
        },
        {
            "name": "Kim"
        }
    ],
    "testCode": "test"
}
```

Response Body Value

# Send HTTP requests
Explains the HTTP request settings and function call example (using function URL).
## Send HTTP requests Setup
To configure Send HTTP requests, follow the steps below.
1. Click the **All Services > Compute > Cloud Functions** menu. Go to the **Service Home** page of Cloud Functions.
2. Click the **Function** menu on the **Service Home** page. Go to the **Function List** page.
3. On the **Function List** page, click the resource for which to set a trigger. Navigate to the **Function Details** page.
4. After clicking the **Trigger** tab, click the **Add Trigger** button. The **Add Trigger** popup opens.
5. In the **Add Trigger** popup window, select the **Trigger Type** item, enter the required information displayed at the bottom, and click the **Confirm** button.
   * Required information varies depending on the type of trigger.

| Trigger Type | Input Item |
|---|---|
| API Gateway | API name: select an existing API or create a new one. Stage: select an existing stage or create a new one |
| Cronjob | Enter the trigger's repeat frequency (minute, hour, day, month, day of week), referring to the example. Timezone setting: select the reference time zone to apply |

Table. Required input items when adding a trigger
6. After moving to the **Code** tab, click the **Edit** button. You will be taken to the **Function Code Edit** page.
7. After adding the processing logic for success and failure cases, click the **Save** button.
* Node.js source codeColor modeconst request = require('request');
/**
* @description User writing area (Function details)
*/
exports.handleRequest = async function (params) {
return await sendRequest(params);
};
async function sendRequest(req) {
return new Promise((resolve, reject) => {
// Port 80 and Port 443 are available
url = "https://example.com"; // Destination URL
const options = {
uri: url,
method:'GET',
json: true,
strictSSL: false,
rejectUnauthorized: false
}
request(options, (error, response, body) => {
if (error) {
reject(error);
} else {
resolve({
statusCode: response.statusCode,
body: JSON.stringify(body)
});
}
});
});
}
Send HTTP requests - Node.js source code

* Python source code

import json
import requests
def handle_request(params):
# User writing area (Function details)
# Port 80 and Port 443 are available
url = "https://example.com" # Destination URL
try:
response = requests.get(url, verify=True)
return {
'statusCode': response.status_code,
'body': json.dumps(response.text)
}
except requests.exceptions.RequestException as e:
return str(e)
Send HTTP requests - Python source code

## Check function call
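Once deployed, the function URL can be exercised from any HTTP client. The handlers above return the status code together with a JSON-stringified body, which a caller can unwrap as below (a minimal sketch; the payload shape follows the example handlers, not a fixed platform contract):

```python
import json

def unwrap_response(resp):
    # resp mirrors the example handlers' return value:
    # a statusCode plus a JSON-stringified body
    return resp["statusCode"], json.loads(resp["body"])

status, body = unwrap_response(
    {"statusCode": 200, "body": json.dumps({"ok": True})}
)
print(status, body)  # → 200 {'ok': True}
```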
On the **Function Details** page's **Configuration** tab, call the **function URL** and check the response.

<!doctype html>
<html>
<head>
<title>Example Domain</title>
<meta charset="utf-8" />
<meta http-equiv="Content-type" content="text/html; charset=utf-8" />
<meta name="viewport" content="width=device-width, initial-scale=1" />
<style type="text/css">
body {
background-color: #f0f0f2;
margin: 0;
padding: 0;
font-family: -apple-system, system-ui, BlinkMacSystemFont, "Segoe UI", "Open Sans", "Helvetica Neue", Helvetica, Arial, sans-serif;
}
div {
width: 600px;
margin: 5em auto;
padding: 2em;
background-color: #fdfdff;
border-radius: 0.5em;
box-shadow: 2px 3px 7px 2px rgba(0,0,0,0.02);
}
a:link, a:visited {
color: #38488f;
text-decoration: none;
}
@media (max-width: 700px) {
div {
margin: 0 auto;
width: auto;
}
}
</style>
</head>
<body>
<div>
<h1>Example Domain</h1>
<p>This domain is for use in illustrative examples in documents. You may use this
domain in literature without prior coordination or asking for permission.</p>
<p><a href="https://www.iana.org/domains/example">More information...</a></p>
</div>
</body>
</html>
Check function call response

# Print logs
Explains the log output settings and a function call example (using the function URL).
## Print logs Setup
To configure Print logs, follow the steps below.
1. Click the **All Services > Compute > Cloud Functions** menu. You will be taken to the **Service Home** page of Cloud Functions.
2. Click the **Function** menu on the **Service Home** page. You will be taken to the **Function List** page.
3. Click the resource to set the trigger on the **Function List** page. You will be taken to the **Function Details** page.
4. Click the **Trigger** tab, then click the **Add Trigger** button. The **Add Trigger** popup opens.
5. In the **Add Trigger** popup, select the **Trigger Type**, enter the required information displayed at the bottom, and click the **Confirm** button.
   * Required information varies depending on the trigger type.

   | Trigger Type | Input Item |
   |---|---|
   | API Gateway | - API name: select an existing API or create a new one<br>- Stage: select an existing stage or create a new one |
   | Cronjob | - Repeat frequency: refer to the example and enter the trigger’s repeat frequency (minute, hour, day, month, day of week)<br>- Timezone setting: select the reference time zone to apply |

   Table. Required input items when adding a trigger
6. After moving to the **Code** tab, click the **Edit** button. You will be taken to the **Function Code Edit** page.
7. After adding the processing logic for success and failure cases, click the **Save** button.
* Node.js source code

const winston = require('winston');
// Log module setting
const logger = winston.createLogger({
format: winston.format.combine(
winston.format.timestamp(),
winston.format.printf(info => info.timestamp + ' ' + info.level + ': ' + info.message)
),
transports: [
new winston.transports.Console()
]
});
exports.handleRequest = async function (params) {
/**
* @description User writing area (Function details)
*/
const response = {
statusCode: 200,
body: JSON.stringify(params.body),
};
logger.info(JSON.stringify(response, null, 2));
return response;
};
Print logs - Node.js source code

* Python source code

import json
import logging
# Log module setting
logging.basicConfig(level=logging.INFO)
def handle_request(params):
# User writing area (Function details)
response = {
'statusCode': 200,
'body': json.dumps(params.json)
}
logging.info(response)
return response
Print logs - Python source code

## Check function call
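The Python example above logs with the default `basicConfig` format, which emits bare messages. If timestamped entries similar to the Log tab output are preferred, a format string can be supplied; a local sketch (the exact layout shown in the Log tab is platform-defined, and the in-memory stream is only for inspecting the format locally):

```python
import io
import json
import logging

# Route records through an in-memory stream so the format can be inspected
stream = io.StringIO()
logging.basicConfig(
    stream=stream,
    level=logging.INFO,
    format="[%(asctime)s] %(levelname)s %(message)s",
    datefmt="%Y-%m-%d %H:%M:%S",
    force=True,  # replace any handlers configured earlier
)

def handle_request(params):
    # User writing area (Function details)
    response = {"statusCode": 200, "body": json.dumps(params)}
    logging.info(response)
    return response

handle_request({"name": "Son"})
print(stream.getvalue())
```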
On the **Function Details** page's **Configuration** tab, call the **function URL**, then check the log in the **Log** tab.

[2023-09-07 12:06:23] "host": "scf-xxxxxxxxxxxxxxxxxxxxx",
[2023-09-07 12:06:23] "ce-id": "xxxxxxxxxxxxxxxxxxxxx",
[2023-09-07 12:06:23] "ce-source": "xxxxxxxxxxxxxxxxxxxxx",
Check function call response

# Throw a custom error
Explains the Throw a custom error settings and a function call example (using the function URL).
## Throw a custom error Setup
To configure Throw a custom error, follow the steps below.
1. Click the **All Services > Compute > Cloud Functions** menu. You will be taken to the **Service Home** page of Cloud Functions.
2. Click the **Function** menu on the **Service Home** page. You will be taken to the **Function List** page.
3. Click the resource to set the trigger on the **Function List** page. You will be taken to the **Function Details** page.
4. Click the **Trigger** tab, then click the **Add Trigger** button. The **Add Trigger** popup opens.
5. In the **Add Trigger** popup, select the **Trigger Type**, enter the required information displayed at the bottom, and click the **Confirm** button.
   * Required information varies depending on the trigger type.

   | Trigger Type | Input Item |
   |---|---|
   | API Gateway | - API name: select an existing API or create a new one<br>- Stage: select an existing stage or create a new one |
   | Cronjob | - Repeat frequency: refer to the example and enter the trigger’s repeat frequency (minute, hour, day, month, day of week)<br>- Timezone setting: select the reference time zone to apply |

   Table. Required input items when adding a trigger
6. After moving to the **Code** tab, click the **Edit** button. You will be taken to the **Function Code Edit** page.
7. After adding the processing logic for success and failure cases, click the **Save** button.
* Node.js source code

class CustomError extends Error {
constructor(message) {
super(message);
this.name = 'CustomError';
}
}
exports.handleRequest = async function (params) {
/**
* @description User writing area (Function details)
*/
throw new CustomError('This is a custom error!');
};
Throw a custom error - Node.js source code

* Python source code

class CustomError(Exception):
def __init__(self, message):
self.message = message
def handle_request(params):
raise CustomError('This is a custom error!')
Throw a custom error - Python source code

* PHP source code

<?php
class CustomError extends Exception {
public function __construct($message) {
parent::__construct($message);
$this->message = $message;
}
}
function handle_request() {
throw new CustomError('This is a custom error!');
}
?>
Throw a custom error - PHP source code

## Check function call
On the **Function Details** page's **Configuration** tab, call the **function URL**, then check for errors in the **Log** tab.
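Before deploying, the error path can also be exercised locally by wrapping the handler so the custom error surfaces as a structured response. This is a local testing sketch only, not platform behavior; `safe_invoke` is a helper introduced here:

```python
class CustomError(Exception):
    def __init__(self, message):
        super().__init__(message)
        self.message = message

def handle_request(params):
    # User writing area (Function details)
    raise CustomError('This is a custom error!')

def safe_invoke(handler, params):
    # Convert an uncaught custom error into an error-shaped response
    try:
        return handler(params)
    except CustomError as e:
        return {'statusCode': 500,
                'body': {'errorType': type(e).__name__,
                         'errorMessage': e.message}}

print(safe_invoke(handle_request, {}))
```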
# Using Environment Variable
Explains the environment variable settings and a function call example (using the function URL).
## Using Environment Variable Setup
To configure Using Environment Variable, follow the steps below.
1. Click the **All Services > Compute > Cloud Functions** menu. You will be taken to the **Service Home** page of Cloud Functions.
2. Click the **Function** menu on the **Service Home** page. You will be taken to the **Function List** page.
3. Click the resource to set the trigger on the **Function List** page. You will be taken to the **Function Details** page.
4. Click the **Trigger** tab, then click the **Add Trigger** button. The **Add Trigger** popup opens.
5. In the **Add Trigger** popup, select the **Trigger Type**, enter the required information displayed at the bottom, and click the **Confirm** button.
   * Required information varies depending on the trigger type.

   | Trigger Type | Input Item |
   |---|---|
   | API Gateway | - API name: select an existing API or create a new one<br>- Stage: select an existing stage or create a new one |
   | Cronjob | - Repeat frequency: refer to the example and enter the trigger’s repeat frequency (minute, hour, day, month, day of week)<br>- Timezone setting: select the reference time zone to apply |

   Table. Required input items when adding a trigger
6. After moving to the **Code** tab, click the **Edit** button. You will be taken to the **Function Code Edit** page.
7. After adding the processing logic for success and failure cases, click the **Save** button.
* Node.js source code

exports.handleRequest = async function (params) {
/**
* @description User writing area (Function details)
*/
return process.env.test;
};
Using Environment Variable - Node.js source code

* Python source code

import json
import os
def handle_request(params):
# User writing area (Function details)
return os.environ.get("test")
Using Environment Variable - Python source code

* PHP source code

<?php
function handle_request($params) {
    // User writing area (Function details)
    return getenv("test");
}
?>
Using Environment Variable - PHP source code

9. After moving to the **Configuration** tab, click the **Edit** button in the **Environment Variable** area. The **Edit Environment Variable** popup window opens.
10. After entering the environment variable information, click the **Confirm** button.

   | Category | Detailed description |
   |---|---|
   | Name | Enter the key value |
   | Value | Enter the value |
   Table. Environment Variable Input Items

## Check function call
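When checking values in the Log tab, it helps to make the handler defensive about unset variables; a sketch with defaults and type conversion (the variable names `test` and `REQUEST_TIMEOUT` are illustrative, not platform-defined):

```python
import os

def read_config():
    # Environment variables are always strings; fall back to a default
    # when a variable was not set in the Edit Environment Variable popup
    name = os.environ.get("test", "default-value")
    timeout = int(os.environ.get("REQUEST_TIMEOUT", "5"))
    return {"test": name, "timeout": timeout}

print(read_config())
```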
On the **Function Details** page's **Configuration** tab, call the **Function URL**, then check the environment variable values in the **Log** tab.

6.2.4 - PrivateLink Service Integration
By linking Cloud Functions and PrivateLink services, you can connect VPCs within the Samsung Cloud Platform and VPCs to services without external internet.
The data uses only the internal network, which enhances security, and does not require a public IP, NAT, VPN, internet gateway, etc.
PrivateLink Service Integration
You can expose the function via PrivateLink Service so that it can be accessed privately from another VPC.
To integrate the PrivateLink service, follow the steps below.
- Click the All Services > Compute > Cloud Functions menu. You will be taken to the Service Home page of Cloud Functions.
- Click the Cloud Functions menu on the Service Home page. You will be taken to the Function List page.
- Click the resource to associate with PrivateLink on the Function List page. You will be taken to the Function Details page.
- Click the Configuration tab on the Function Details page.
- Click the Edit button of PrivateLink Service in Private connection configuration. The Edit PrivateLink Service popup window opens.
- In the Edit PrivateLink Service popup, check the Use item of Activation Status and click the Confirm button. The PrivateLink Service information is displayed in the Configuration tab’s Private connection configuration.
| Category | Detailed description |
|---|---|
| Private URL | PrivateLink Service URL information |
| PrivateLink Service ID | PrivateLink Service ID information |
| Request Endpoint Management | List of PrivateLink Endpoints that requested connection to the PrivateLink Service |
Create PrivateLink Endpoint
Create an entry point in the user’s VPC to access the PrivateLink Service.
To create a PrivateLink Endpoint, follow these steps.
- Click the All Services > Compute > Cloud Functions menu. You will be taken to the Service Home page of Cloud Functions.
- Click the Cloud Functions menu on the Service Home page. You will be taken to the Function List page.
- Click the resource to associate with PrivateLink on the Function List page. You will be taken to the Function Details page.
- Click the Configuration tab on the Function Details page.
- Click the Add button of PrivateLink Endpoint in Private connection configuration. The Add PrivateLink Endpoint popup opens.
- In the Add PrivateLink Endpoint popup, enter the PrivateLink Service ID and Alias, then click the Confirm button.
- When the popup notifying creation opens, click the Confirm button. The PrivateLink Endpoint information is displayed in the Configuration tab’s Private connection configuration.
| Category | Detailed description |
|---|---|
| PrivateLink Endpoint ID | PrivateLink Endpoint ID information |
| PrivateLink Service ID | PrivateLink Service ID information |
| Alias | Alias information |
| Status | Approval status of the PrivateLink Endpoint |
6.2.5 - Resource-Based Policy Guide
Overview of Resource-Based Policies
A resource-based policy for Cloud Functions is a policy attached to a resource that can allow or deny (Effect) specific actions (Action) for a given principal (Principal). You can directly define the principal that can execute (Invoke) a function using resource-based policies.
You can allow function calls by defining the following in a resource-based policy.
- User of the specified Samsung Cloud Platform account
- Specified source IP address range or CIDR block
Resource-based policies are defined as JSON policy documents attached to a resource, which control whether a specified security principal (typically an IAM role or group) can invoke it.
| Category | Explanation | Example |
|---|---|---|
| Principal | Specifies the function caller | A specific Object Storage bucket, API Gateway, other Samsung Cloud Platform accounts, etc. |
| Action | Defines the allowed functionality | Mostly scf:InvokeFunction |
| Condition | Restricts to allow only in specific situations | Allow only requests from a bucket with a specific SRN |
- Cloud Functions’ resource-based policies leverage the rules of IAM’s resource-based policies.
- Refer to the JSON Mode Usage Guide for how to create or modify policies using JSON.
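When editing in JSON mode, a policy document can be syntax-checked before saving; a minimal local sketch (the required key set is taken from the policy examples in this guide, not a formal schema):

```python
import json

# Top-level keys that every policy example in this guide carries
REQUIRED_TOP_LEVEL = {"Statement", "Version"}

def check_policy(text):
    # Return the parsed policy, or raise ValueError with a short reason
    try:
        policy = json.loads(text)
    except json.JSONDecodeError as e:
        raise ValueError(f"not valid JSON: {e}")
    missing = REQUIRED_TOP_LEVEL - policy.keys()
    if missing:
        raise ValueError(f"missing keys: {sorted(missing)}")
    return policy

doc = '{"Statement": [], "Version": "2024-07-01"}'
print(check_policy(doc)["Version"])  # → 2024-07-01
```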
Resource-based policy usage scenario
The primary use cases for resource-based policies are as follows.
Resource-based policy scenario
The resource-based policy scenarios used when a Cloud Functions function runs are as follows.
| Category | Explanation | Reference example |
|---|---|---|
| Function URL - Authentication Type None | Required when generating a URL for a function and invoking it | Function URL (Auth Type None) example |
| Function URL - Authentication Type IAM | Required when invoking the function URL with IAM authentication | Function URL (Auth Type IAM) example |
| API Gateway trigger | Required when API Gateway calls the function to handle external API requests | API Gateway trigger example |
| PrivateLink connection | Define a PrivateLink Service connection to allow private access to a function from another VPC | PrivateLink connection example |
Additional user usage scenarios
These scenarios are not automatically registered as resource-based policies for Cloud Functions, but users can add and use them as needed. The scenarios that users can add and utilize are as follows.
- Cross-Account Access
- When an IAM user in account A wants to invoke a function in account B, add account A to the function policy of account B.
- Hybrid Access Control
- You can configure it so that access is allowed only when both conditions are met—a specific user and a specific IP range—rather than restricting just the account or IP alone.
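A hybrid policy of this kind could be sketched as follows. This is illustrative only: the structure mirrors the Function URL IAM template below, and the `IpAddress`/`scp:SourceIp` condition key is an assumption for the sketch, not a documented platform key.

```json
{
    "Statement": [
        {
            "Action": ["scf:InvokeFunctionUrl"],
            "Condition": {
                "IpAddress": {
                    "scp:SourceIp": ["203.0.113.0/24"]
                }
            },
            "Effect": "Allow",
            "Principal": {
                "scp": ["srn:{{Environment}}::{{AccountID}}:::iam:user/{{UserId}}"]
            },
            "Resource": ["{{CloudFunctionSrn}}"],
            "Sid": "AllowSpecificUserFromSpecificRange"
        }
    ],
    "Version": "2024-07-01"
}
```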
Resource-based policy management for Cloud Functions
To view and configure resource-based policies for Cloud Functions, follow these steps.
- Click the All Services > Compute > Cloud Functions menu. You will be taken to the Service Home page of Cloud Functions.
- Click the Function menu on the Service Home page. You will be taken to the Function List page.
- Click the resource for which you want to set a policy on the Function List page. You will be taken to the Function Details page.
- Click the Configuration tab on the Function Details page.
- Click the Edit button for the Resource-based policy permissions item. The Resource Policy edit popup window opens.
- In the Resource Policy edit popup, select a Policy Template and then write the policy.
- For policy examples by template, refer to the Resource-based policy example.
- When you have finished writing, click the Confirm button.
- When you click the Delete button, the registered policy is deleted.
Resource-based policy example
Users can define additional resource-based policies as needed or modify existing policies for use.
- For some features, a resource‑based policy (or credential) must be registered to use them in Cloud Functions.
- For the resource-based policy examples described in this guide, Cloud Functions automatically registers the example policy when each feature is enabled or linked.
Function URL - Authentication Type None
A policy that permits public calls when the Principal is *.
Policy Template

{
    "Statement": [
        {
            "Action": ["scf:InvokeFunctionUrl"],
            "Condition": {
                "StringEquals": {
                    "scf:CloudFunctionAuthType": ["NONE"]
                }
            },
            "Effect": "Allow",
            "Principal": "*",
            "Resource": ["{{CloudFunctionSrn}}"],
            "Sid": "InvokeFunctionURLAllowPublicAccess"
        }
    ],
    "Version": "2024-07-01"
}

Policy example

{
    "Statement": [
        {
            "Action": ["scf:InvokeFunctionUrl"],
            "Condition": {
                "StringEquals": {
                    "scf:CloudFunctionAuthType": ["NONE"]
                }
            },
            "Effect": "Allow",
            "Principal": "*",
            "Resource": ["srn:e::accountID:kr-west1::scf:cloud-function/functionsID"],
            "Sid": "InvokeFunctionURLAllowPublicAccess"
        }
    ],
    "Version": "2024-07-01"
}

Function URL - Authentication Type IAM
This is a policy that permits specific users to invoke the public URL.
Policy Template

{
    "Statement": [
        {
            "Action": ["scf:InvokeFunctionUrl"],
            "Condition": {
                "StringEquals": {
                    "scf:CloudFunctionAuthType": ["SCP_IAM"]
                }
            },
            "Effect": "Allow",
            "Principal": {
                "scp": ["srn:{{Environment}}::{{AccountID}}:::iam:user/{{UserId}}"]
            },
            "Resource": ["{{CloudFunctionSrn}}"],
            "Sid": "Statement1"
        }
    ],
    "Version": "2024-07-01"
}

Policy example
{
    "Statement": [
        {
            "Action": ["scf:InvokeFunctionUrl"],
            "Condition": {
                "StringEquals": {
                    "scf:CloudFunctionAuthType": ["SCP_IAM"]
                }
            },
            "Effect": "Allow",
            "Principal": "*",
            "Resource": ["srn:e::accountID:kr-west1::scf:cloud-function/functionsID"],
            "Sid": "accountID-iam-invokefunctionurl"
        }
    ],
    "Version": "2024-07-01"
}

API Gateway trigger
A policy that allows the API Gateway service principal to invoke the function.
Policy Template

{
    "Statement": [
        {
            "Action": ["scf:InvokeFunction"],
            "Condition": {
                "SrnLike": {
                    "scp:RequestAttribute/body['x-scf-request-obj-srn']": ["{{ApiGatewayMethodSrn}}"]
                }
            },
            "Effect": "Allow",
            "Principal": {
                "Service": ["apigateway.samsungsdscloud.com"]
            },
            "Resource": ["{{CloudFunctionSrn}}"],
            "Sid": "Statement1"
        }
    ],
    "Version": "2024-07-01"
}

Policy example
{
    "Statement": [
        {
            "Action": ["scf:InvokeFunction"],
            "Condition": {
                "SrnLike": {
                    "scp:RequestAttribute/body['x-scf-request-obj-srn']": ["srn:e::accountID:kr-west1::apigateway:method/MethodID/*/GET/test"]
                }
            },
            "Effect": "Allow",
            "Principal": {
                "Service": ["apigateway.samsungsdscloud.com"]
            },
            "Resource": ["srn:e::accountID:kr-west1::scf:cloud-function/functionID"],
            "Sid": "999e9a9999de4d4683c9e10c74ee999z"
        }
    ],
    "Version": "2024-07-01"
}

PrivateLink connection
This is a policy that permits function calls via a PrivateLink Endpoint for specific users.
Policy Template

{
    "Statement": [
        {
            "Action": ["scf:InvokeFunction"],
            "Condition": {
                "StringEquals": {
                    "scf:CloudFunctionPrivatelinkServiceAuthType": ["SCP_IAM"]
                }
            },
            "Effect": "Allow",
            "Principal": {
                "scp": ["srn:{{Environment}}::{{AccountID}}:::iam:user/{{UserId}}"]
            },
            "Resource": ["{{CloudFunctionSrn}}"],
            "Sid": "Statement1"
        }
    ],
    "Version": "2024-07-01"
}

Policy example
{
    "Statement": [
        {
            "Action": ["scf:InvokeFunction"],
            "Condition": {
                "StringEquals": {
                    "scf:CloudFunctionAuthType": ["SCP_IAM"]
                }
            },
            "Effect": "Allow",
            "Principal": {
                "scp": ["srn:e::accountID:::iam:user/userID"]
            },
            "Resource": ["srn:e::accountID:kr-west1::scf:cloud-function/functionID"],
"Sid": "rrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr-privatelink-invokefunction"
        }
    ],
    "Version": "2024-07-01"
}

6.3 - Release Note
Cloud Functions
- You can use functions in conjunction with the AIOS service.
- Cloud Functions can be linked with AIOS to utilize LLM.
- You can use functions in conjunction with the PrivateLink service.
- Through Private connection (PrivateLink), you can internally connect Samsung Cloud Platform VPCs to each other, and VPCs to services, without going through the Internet.
- The feature to upload Java Runtime executable files has been added.
- You can configure a Java Runtime executable archive file (.jar/.zip) fetched from Object Storage.
- Cloud Functions service has been officially launched.
- It is a serverless computing-based FaaS (Function as a Service) that easily runs function-style applications without the need for server provisioning.
7 - Virtual Server DR
In the event of a system disruption due to various disasters and risk factors, the Virtual Server and its connected Block Storage can be replicated in another region to restore normal operations in a short period of time.
7.1 - Overview
Service Overview
Virtual Server DR is a service that can quickly recover the system by replicating Virtual Server and connected Block Storage in a different region from the currently used region. Even in the event of various disasters and unexpected situations that interrupt the system, Virtual Server DR can be used to quickly recover to a normal operating state.
- Virtual Server DR service can be configured through a partner solution sold in the Samsung Cloud Platform’s Marketplace.
- For more information about using Marketplace, please refer to Marketplace.
- When purchasing and using services sold on the Marketplace, a separate contract is concluded with the Marketplace software supplier, and a separate tax invoice is issued.
- If you have applied for a partner solution product for Virtual Server DR on the Marketplace, the application information will be sent to the person in charge by email. Please coordinate the product details and schedule with the person in charge. The software installation and cost will be charged based on the confirmed date.
- Services sold in the Samsung Cloud Platform’s Marketplace are services sold by individual sellers, and SamsungSDS is an intermediary of electronic commerce and is not a party to the electronic commerce. Therefore, SamsungSDS does not guarantee or take responsibility for the service information and transactions sold by individual sellers.
Features
- Easy DR Environment Configuration: You can easily configure a Virtual Server for DR configuration through partner solutions in Samsung Cloud Platform’s Marketplace.
- Various Environment Configuration: Using partner solutions, you can configure various environments such as physical to virtual environment (P2V), virtual to virtual environment (V2V), and support multiple operating systems (Windows, Linux).
Service Composition Diagram
Provided Features
For the main features, refer to the product catalog details page of the partner solution sold in the Samsung Cloud Platform’s Marketplace.
Preceding Service
This is a list of services that must be pre-configured before creating this service. For details, refer to the guide provided for each service and prepare in advance.
| Service Category | Service | Detailed Description |
|---|---|---|
| Networking | VPC | A service that provides an independent virtual network in a cloud environment |
| Networking | Security Group | A virtual firewall that controls the server’s traffic |
| Compute | Virtual Server | Cloud computing-optimized virtual server |
7.2 - Release Note
Virtual Server DR
- Virtual Server DR service has been officially released.
- The system can be restored to normal operating conditions in a short period of time when it is interrupted by various disasters and risk factors.
8 - Block Storage
8.1 - Overview
Service Overview
Block Storage is a high-performance storage that stores data in block units arranged in a certain size and array.
It is suitable for large-capacity, high-performance requirements such as databases and mail servers, and users can directly assign volumes to servers for use.
Key Features
- Large-capacity Volume Provision: Volumes for OS configuration are created with at least the minimum capacity per image and can be expanded up to 12TB, and volumes for data storage other than OS can be created and expanded with capacities from a minimum of 8GB to a maximum of 12TB. Capacity expansion is performed stably in an online state.
- High Performance Based on Full SSD: Provides high durability and availability based on redundant controllers and RAID disk arrays. Because full-SSD disks are provided by default, it is suitable for high-speed data processing such as database workloads.
- Snapshot Backup: Recovery of changed and deleted data is possible through the image snapshot function. Users can select a snapshot created at the desired time point from the list to perform recovery.
Service Architecture
Provided Features
Block Storage provides the following features:
- Volume Name: Users can set or modify names for each volume.
- Capacity: Volumes can be created with capacities from a minimum of 8GB to a maximum of 12TB, and can be expanded during use. OS basic volumes can be created with at least the minimum capacity per image.
- Connected Server: You can connect or disconnect by selecting a Virtual Server.
- Multi Attach: A volume can be connected to 2 or more servers with no limit on the number of connected servers per volume; a Virtual Server can connect up to 26 volumes.
- Encryption: AES-256 encryption is applied by default to all Block Storage volumes. For the HDD/SSD_KMS disk type, in-transit encryption is additionally provided between the instance and the Block Storage connected to it.
- Snapshot: Recovery of changed and deleted data is possible through the image snapshot function. Users select a snapshot created at the desired time point from the list to recover.
- Volume Transfer: You can transfer volumes to another Account through the volume transfer function.
- Monitoring: You can check monitoring information such as IOPS, Latency, Throughput, etc. through the Cloud Monitoring service.
Components
You can create a volume by entering capacity according to your service scale and performance requirements and selecting a disk type. When using the snapshot function, you can recover data to the desired time point.
Volume
Volume is the basic creation unit of the Block Storage service and is used as data storage space. Users create a volume by selecting a name, capacity, and disk type, then connect it to a Virtual Server for use.
The volume name creation rules are as follows:
- Enter up to 255 characters using English letters, numbers, spaces, and special characters (-, _).

Snapshot
Snapshot is an image backup of a volume at a specific point in time. Users can check the snapshot name and creation date/time from the snapshot list and select the snapshot they want to recover, and can recover changed or deleted data through that snapshot.
Notes for using snapshots are as follows:
- Snapshot creation date/time is based on Asia/Seoul(GMT +09:00).
- You can recover a Block Storage volume to the latest snapshot by selecting the snapshot recovery button.
- When selecting a specific snapshot from the snapshot list, you can recover by creating a new snapshot-based volume.
- Snapshots are charged according to the size of the original Block Storage, so please delete unnecessary snapshots.
Prerequisite Services
This is a list of services that must be pre-configured before creating this service. Please prepare in advance by referring to the guides provided for each service.
| Service Category | Service | Description |
|---|---|---|
| Compute | Virtual Server | Virtual server optimized for cloud computing |
8.1.1 - Monitoring Metrics
Block Storage monitoring metrics
The table below shows the monitoring metrics of Block Storage that can be checked through Cloud Monitoring. For detailed usage of Cloud Monitoring, refer to the Cloud Monitoring guide.
| Performance Item Name | Description | Unit |
|---|---|---|
| Volume Total | Total Bytes | bytes |
| IOPS [Read] | iops(reading) | iops |
| IOPS [Write] | iops(writing) | iops |
| Latency Time [Read] | Delay Time (Read) | usec |
| Latency Time [Write] | Delay Time (Write) | usec |
| Throughput [Read] | Throughput (Read) | bytes/s |
| Throughput [Write] | Throughput (Write) | bytes/s |
8.2 - How-to guides
Users can create the Block Storage service by entering required information through the Samsung Cloud Platform Console and selecting detailed options.
Creating Block Storage
You can create and use the Block Storage service through the Samsung Cloud Platform Console.
To create Block Storage, follow these steps:
- Click the All Services > Compute > Virtual Server menu. This will take you to the Virtual Server Service Home page.
- Click the Block Storage menu. This will take you to the Block Storage List page.
- Click the Create Service button on the Block Storage page. This will take you to the Create Block Storage page.
- Enter the required information for creating the service on the Create Block Storage page and select detailed options.
| Item | Required | Description |
|---|---|---|
| Volume Name | Required | Volume name<br>- Enter up to 255 characters using English letters, numbers, spaces, and special characters (-, _) |
| Snapshot | Optional | Select a snapshot to use when creating the volume<br>- After checking the Use item, you can select a snapshot<br>- When creating a service through a snapshot recovery volume, the recovery snapshot name is provided<br>- If not selected, an empty volume is created |
| Disk Type | Required | Select the disk type<br>- SSD_Provisioned: SSD volume where IOPS and Throughput can be set<br>- SSD/HDD: General SSD/HDD volume<br>- SSD/HDD_KMS: Volume with additional encryption<br>- SSD/HDD_MultiAttach: Volume that can be connected to 2 or more servers<br>- Cannot be modified after service creation<br>- When creating a service through a snapshot recovery volume, it is set the same as the original and cannot be modified |
| Capacity | Optional | Set the capacity<br>- Can be created within 8~12,288GB<br>- Enter the number of units provided in 8GB increments<br>- When creating a service through a snapshot recovery volume, enter a capacity equal to or larger than the original |
| Max IOPS | Required | Enter a maximum IOPS value between 5,000~20,000<br>- Can only be set when Disk Type is SSD_Provisioned |
| Max Throughput | Required | Enter a maximum Throughput value between 250~1,000<br>- Can only be set when Disk Type is SSD_Provisioned |

Table. Block Storage service information input items
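The creation limits described above can be sketched as a small validation helper. This is illustrative shell only, not part of any official Samsung Cloud Platform CLI: capacity must be a multiple of 8GB between 8GB and 12TB (12,288GB), and for SSD_Provisioned volumes Max IOPS must be within 5,000~20,000 and Max Throughput within 250~1,000.

```shell
# Illustrative only: a sketch of the volume creation limits described above.
# Not an official Samsung Cloud Platform tool.

# Capacity: a multiple of 8GB, between 8GB and 12TB (12,288GB).
valid_capacity() {
  [ "$1" -ge 8 ] && [ "$1" -le 12288 ] && [ $(($1 % 8)) -eq 0 ]
}

# SSD_Provisioned settings: Max IOPS 5,000~20,000, Max Throughput 250~1,000.
valid_provisioned() {
  [ "$1" -ge 5000 ] && [ "$1" -le 20000 ] \
    && [ "$2" -ge 250 ] && [ "$2" -le 1000 ]
}

valid_capacity 24 && echo "24GB: valid"
valid_capacity 12 || echo "12GB: not an 8GB increment"
valid_provisioned 10000 500 && echo "IOPS 10000 / Throughput 500: valid"
```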
Review the detailed information and estimated charges in the Summary panel, then click the Create button.
- Once creation is complete, verify the created resource on the Block Storage List page.
- AES-256 algorithm encryption is applied by default to all volumes of Block Storage.
- Windows-based Virtual Server cannot use MultiAttach disks. Please use a separate replication method or solution.
- When the volume is HDD/SSD_KMS disk type, it additionally provides transfer encryption between the instance and the Block Storage section connected to the instance.
Viewing Block Storage Details
Block Storage service allows you to view and modify the complete resource list and detailed information. The Block Storage Details page consists of Details, Snapshot List, Tags, and Operation History tabs.
To view detailed information of Block Storage service, follow these steps:
- Click the All Services > Compute > Virtual Server menu. This will take you to the Virtual Server Service Home page.
- Click the Block Storage menu. This will take you to the Block Storage List page.
- Click the resource for which you want to view detailed information on the Block Storage List page. This will take you to the Block Storage Details page.
- The Block Storage Details page displays status information and additional feature information, and consists of Details, Snapshot List, Tags, and Operation History tabs.
| Item | Description |
|---|---|
| Volume Status | Status of the volume<br>- Creating: Being created<br>- Downloading: Being created (OS image being applied)<br>- Available: Creation complete, can connect to a server<br>- Reserved: Waiting for server connection<br>- Attaching: Connecting to a server<br>- Detaching: Disconnecting from a server<br>- In Use: Server connection complete<br>- Deleting: Service being terminated<br>- Awaiting Transfer: Waiting for volume transfer<br>- Extending: Capacity being expanded<br>- Error Extending: Abnormal state during capacity expansion<br>- Backing Up: Volume being backed up<br>- Restoring Backup: Volume backup being recovered<br>- Error Backing Up: Abnormal state during volume backup<br>- Error Restoring: Abnormal state during volume backup recovery<br>- Error Deleting: Abnormal state during deletion<br>- Error Managing: Abnormal state<br>- Error: Abnormal state<br>- Maintenance: Temporary maintenance state<br>- Reverting: Recovering from a snapshot |
| Volume Transfer | Transfer the volume to another Account<br>- For details on volume transfer, see Volume Transfer |
| Create Snapshot | Immediately create a snapshot of the current point in time<br>- For details on snapshot creation, see Creating a Snapshot |
| Snapshot Recovery | Recover the volume to the latest snapshot in Available state<br>- For details on snapshot recovery, see Recovering a Snapshot |
| Service Termination | Button to terminate the service |

Table. Status information and additional features
Details
On the Block Storage List page, you can view detailed information of the selected resource and modify information if necessary.
| Item | Description |
|---|---|
| Service | Service group |
| Resource Type | Resource type |
| SRN | Unique resource ID in Samsung Cloud Platform |
| Resource Name | Resource name |
| Resource ID | Unique resource ID of the service |
| Creator | User who created the service |
| Creation Date | Date and time when the service was created |
| Modifier | User who modified the service |
| Modification Date | Date and time when the service was modified |
| Volume Name | Volume name |
| Volume ID | Unique volume ID |
| Disk Type | Disk type |
| Type | Classification according to volume creation method and usage |
| Capacity | Volume capacity |
| Max IOPS | Maximum IOPS value (shown only for the SSD_Provisioned disk type) |
| Max Throughput | Maximum Throughput value (shown only for the SSD_Provisioned disk type) |
| Connected Server | Connected Virtual Server |
Snapshot List
On the Block Storage List page, you can view snapshots of the selected resource.
| Item | Description |
|---|---|
| Snapshot Name | Snapshot name |
| Description | Snapshot description |
| Volume Capacity | Capacity of the original Block Storage volume that is the snapshot target |
| Creation Date | Snapshot creation date/time |
| Status | Status of the snapshot |
| Additional Features > More | Snapshot management buttons |
| Delete | Delete the selected snapshot from the snapshot list |
- Snapshots can affect volume capacity management. Please delete unnecessary snapshots after use.
- Snapshot recovery is possible when not connected to a server.
- Snapshot creation date/time is based on Asia/Seoul (GMT +09:00).
- When clicking the snapshot recovery button, the volume is recovered to the latest snapshot in Available status.
- When selecting create recovery volume on the snapshot list page, a new snapshot-based volume is created without changing the existing volume.
- Snapshots containing the name vProtect are temporary snapshots created when using the Backup service and are not charged.
Tags
On the Block Storage List page, you can view tag information of the selected resource, and add, modify, or delete tags.
| Item | Description |
|---|---|
| Tag List | Tag list |
Operation History
On the Block Storage List page, you can view the operation history of the selected resource.
| Item | Description |
|---|---|
| Operation History List | Resource change history |
Managing Block Storage Resources
If you need to modify settings of created Block Storage or add or delete connected servers, you can perform tasks on the Block Storage Details page.
Modifying Volume Name
You can modify the name of a volume. To modify the volume name, follow these steps:
- Click the All Services > Compute > Virtual Server menu. This will take you to the Virtual Server Service Home page.
- Click the Block Storage menu. This will take you to the Block Storage List page.
- Click the resource for which you want to modify the volume name on the Block Storage List page. This will take you to the Block Storage Details page.
- Click the Modify button of Volume Name. The Modify Volume Name popup window opens.
- Enter the volume name and click the Confirm button.
- Enter up to 255 characters using English letters, numbers, spaces, and special characters (-, _).

Expanding Capacity
You can expand the capacity of a volume. To expand capacity, follow these steps:
- Click the All Services > Compute > Virtual Server menu. This will take you to the Virtual Server Service Home page.
- Click the Block Storage menu. This will take you to the Block Storage List page.
- Click the resource for which you want to expand capacity on the Block Storage List page. This will take you to the Block Storage Details page.
- Click the Modify button of Capacity. The Modify Capacity popup window opens.
- Enter the capacity and click the Confirm button.
- Capacity reduction is not provided.
- After capacity expansion, recovery to snapshots before expansion is not possible.
- Only recovery by creating a new volume is possible with snapshots created before capacity expansion.
- Can expand to a capacity larger than the existing capacity, within 8~12,288GB.
- Enter the number of Units provided in 8GB increments.
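Note that expanding a volume in the console grows the block device; the partition and filesystem inside the OS still need to be grown before the new space is usable. The following is a hedged Linux sketch only, using this guide's example device names: `growpart` comes from the cloud-utils package, `resize2fs` applies to ext4 (xfs would use `xfs_growfs` instead).

```shell
# Hypothetical helper: grow the partition and filesystem after expanding a
# Block Storage volume in the console. Guarded so nothing runs against a
# device that does not exist.
grow_data_volume() {
  disk="$1"   # e.g. /dev/vdb
  part="$2"   # partition number, e.g. 1
  mnt="$3"    # mount point, e.g. /data
  if [ ! -b "$disk" ]; then
    echo "no such block device: $disk" >&2
    return 1
  fi
  growpart "$disk" "$part"      # extend the partition to the new disk size
  resize2fs "${disk}${part}"    # grow an ext4 filesystem online
  # for xfs, use: xfs_growfs "$mnt"
  df -h "$mnt"                  # verify the new size
}

# Example on a real server (run as root, after expanding in the console):
# grow_data_volume /dev/vdb 1 /data
```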
Modifying Connected Server
You can connect or disconnect servers. To modify connected server, follow these steps:
- Click the All Services > Compute > Virtual Server menu. This will take you to the Virtual Server Service Home page.
- Click the Block Storage menu. This will take you to the Block Storage List page.
- Click the resource for which you want to modify connected server on the Block Storage List page. This will take you to the Block Storage Details page.
- When adding Virtual Server connection, click the Add button in the Connected Server item. The Add Connected Server popup window opens.
- Select the Virtual Server you want to connect and click the Confirm button.
- When disconnecting Virtual Server connection, click the Disconnect button in the Connected Server item.
- Be sure to perform disconnection work (Umount, Disk Offline) from the server before disconnecting.
- You can connect Virtual Servers created in the same location as Block Storage.
- Virtual Servers using Partition with Server Group policy cannot be connected.
- For the HDD/SSD_MultiAttach disk type, a volume can be connected to 2 or more Virtual Servers with no limit on the number of connections.
- Windows-based Virtual Servers cannot use MultiAttach disks and must use a separate replication method or solution.
- A Virtual Server can connect up to 26 volumes, including the OS basic volume.
- For the OS basic volume, the connected server cannot be modified and the service cannot be terminated.
- After adding a connected server, the volume can be used after performing the connection work (Mount, Disk Online) on the server. For details on server connection, see Connecting to the Server.
Terminating Block Storage
You can reduce operating costs by terminating unused Block Storage. However, since terminating a service may immediately stop the operating service, you should proceed with termination after fully considering the impact of service interruption.
- Be careful as data cannot be recovered after termination.
- Block Storage volumes cannot be terminated in the following cases.
- While connected to server
- OS basic volume
- Connected to Custom Image of Virtual Server
- When volume status is not Available, Error, Error Extending, Error Restoring, Error Managing
- When terminating by selecting 2 or more volumes, only volumes that can be terminated will be terminated.
To terminate Block Storage, follow these steps:
- Click the All Services > Compute > Virtual Server menu. This will take you to the Virtual Server Service Home page.
- Click the Block Storage menu. This will take you to the Block Storage List page.
- Select the resource to terminate on the Block Storage List page and click the Terminate Service button.
- When termination is complete, verify that the resource has been terminated on the Block Storage List page.
8.2.1 - Connecting to the Server
When using a volume on a server, connection or disconnection work is required.
From the Block Storage Details page, add the connected server, then connect to the server and perform the connection work (Mount, Disk Online). After use, perform the disconnection work (Umount, Disk Offline) and then remove the connected server.
Connecting to the Server (Mount, Disk Online)
To use the volume added to the connection server, you must connect to the server and perform the connection work (Mount, Disk Online). Follow the procedure below.
Linux Operating System
- Server OS: LINUX
- Mount location: /data
- Volume capacity: 24 GB
- File system: ext3, ext4, xfs, etc.
- Additional allocated disk: /dev/vdb
- Click the All Services > Compute > Virtual Server menu. Move to the Service Home page of Virtual Server.
- Click the Block Storage menu. Move to the Block Storage List page.
- On the Block Storage List page, click the resource to be used by the connection server. Move to the Block Storage Details page.
- Check the server in the Connection Server section and connect to it.
- Refer to the procedure below to connect (Mount) the volume.
Switch to root privileges:

```
$ sudo -i
```

Check the disk:

```
# lsblk
NAME    MAJ:MIN RM  SIZE RO TYPE MOUNTPOINT
vda     252:0    0   24G  0 disk
├─vda1  252:1    0 23.9G  0 part [SWAP]
└─vda14 252:14   0    4M  0 part /
└─vda15 252:15   0  106M  0 part /boot/efi
vdb     252:16   0   24G  0 disk
```

Create a partition:

```
# fdisk /dev/vdb
Command (m for help): n
Partition type:
   p   primary (0 primary, 0 extended, 4 free)
   e   extended
Select (default p): p
Partition number (1-4, default 1): 1
First sector (2048-50331646, default 2048):
Last sector, +/-sectors or +/-size{K,M,G,T,P} (2048-50331646, default 50331646):

Created a new partition 1 of type 'Linux' and of size 24 GiB.

Command (m for help): w
The partition table has been altered!
Calling ioctl() to re-read partition table.
Syncing disks.
```

Set the partition format (e.g., ext4):

```
# lsblk
NAME    MAJ:MIN RM  SIZE RO TYPE MOUNTPOINT
vda     252:0    0   24G  0 disk
├─vda1  252:1    0 23.9G  0 part [SWAP]
└─vda14 252:14   0    4M  0 part /
└─vda15 252:15   0  106M  0 part /boot/efi
vdb     252:16   0   24G  0 disk
└─vdb1  252:17   0   24G  0 part
# mkfs.ext4 /dev/vdb1
mke2fs 1.46.5 (30-Dec-2021)
...
Writing superblocks and filesystem accounting information: done
```

Mount the volume:

```
# mkdir /data
# mount /dev/vdb1 /data
# lsblk
NAME    MAJ:MIN RM  SIZE RO TYPE MOUNTPOINT
vda     252:0    0   24G  0 disk
├─vda1  252:1    0 23.9G  0 part [SWAP]
└─vda14 252:14   0    4M  0 part /
└─vda15 252:15   0  106M  0 part /boot/efi
vdb     252:16   0   24G  0 disk
└─vdb1  252:17   0   24G  0 part /data
# vi /etc/fstab
(add) /dev/vdb1 /data ext4 defaults 0 0
```
| Item | Description |
|---|---|
| cat /etc/fstab | File system information file |
| df -h | Check the total disk usage of mounted disks |
| fdisk -l | Check partition information |
| Command | Description |
|---|---|
| m | Check the usage of the fdisk command |
| n | Create a new partition |
| p | Check the changed partition information |
| t | Change the system ID of the partition |
| w | Save the partition information and exit the fdisk settings |
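The `/etc/fstab` line added in the mount step above can be composed like this. A minimal sketch using the guide's example values (`/dev/vdb1`, `/data`, ext4); `fstab_entry` is a hypothetical helper, not a system command.

```shell
# Build the fstab entry used in the mount step above (example values from
# this guide: device /dev/vdb1, mount point /data, filesystem ext4).
fstab_entry() {
  # args: device, mount point, filesystem type
  printf '%s %s %s defaults 0 0\n' "$1" "$2" "$3"
}

fstab_entry /dev/vdb1 /data ext4

# On a real server, append it so the mount persists across reboots:
# fstab_entry /dev/vdb1 /data ext4 >> /etc/fstab
```

On servers where device names may change, it is generally safer to reference the filesystem by `UUID=` (as reported by `blkid`) instead of the device path.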
Windows Operating System
- Click the All Services > Compute > Virtual Server menu. Move to the Service Home page of Virtual Server.
- Click the Block Storage menu. Move to the Block Storage List page.
- On the Block Storage List page, click the resource to be used by the connection server. Move to the Block Storage Details page.
- Check the server in the Connection Server section and connect to it.
- Refer to the procedure below to connect (Disk Online) the volume.
- Right-click the Windows start icon and run Computer Management.
- In the Computer Management tree, select Storage > Disk Management.
- Check the disk.
- Bring the disk online.
- Initialize the disk.
- Format the partition.
- Check the volume.
Disconnecting from the Server (Umount, Disk Offline)
Connect to the server and perform the disconnection work (Umount, Disk Offline), and then remove the connection server from the console.
Follow the procedure below.
- If you disconnect the server from the console without performing the disconnection work (Umount, Disk Offline) on the server, a server status error (Hang) may occur.
- Be sure to perform the OS work first.
- For the OS basic volume, connection server modification and service termination are not allowed.
Linux Operating System
- Click the All Services > Compute > Virtual Server menu. Move to the Service Home page of Virtual Server.
- Click the Block Storage menu. Move to the Block Storage List page.
- On the Block Storage List page, click the resource to be disconnected from the connection server. Move to the Block Storage Details page.
- Check the server in the Connection Server section and connect to it.
- Refer to the procedure below to disconnect (Umount) the volume.
- Umount the volume:

```
# umount /data
# lsblk
NAME    MAJ:MIN RM  SIZE RO TYPE MOUNTPOINT
vda     252:0    0   24G  0 disk
├─vda1  252:1    0 23.9G  0 part [SWAP]
└─vda14 252:14   0    4M  0 part /
└─vda15 252:15   0  106M  0 part /boot/efi
vdb     252:16   0   24G  0 disk
└─vdb1  252:17   0   24G  0 part
# vi /etc/fstab
(delete) /dev/vdb1 /data ext4 defaults 0 0
```
Windows Operating System
- Click the All Services > Compute > Virtual Server menu. This will take you to the Virtual Server Service Home page.
- Click the Block Storage menu. This will take you to the Block Storage List page.
- On the Block Storage List page, click the resource to disconnect from the connected server. This will take you to the Block Storage Details page.
- Check the server in the Connected Server section and connect to it.
- Unmount the file system.
- Refer to the procedure below to disconnect (Disk Offline) the volume.
- Right-click the Windows start icon and run Computer Management.
- In the Computer Management tree, select Storage > Disk Management.
- Right-click the disk to be removed and run Offline.
- Check the disk status.
8.2.2 - Using Snapshots
You can create, edit, delete, or recover snapshots of created Block Storage volumes. These actions are performed on the Block Storage Details page and the Snapshot List page.
Create Snapshot
You can create a snapshot of the current point in time. To create a snapshot, follow these steps.
- Click the All Services > Compute > Virtual Server menu. This will take you to the Virtual Server Service Home page.
- Click the Block Storage menu. This will take you to the Block Storage List page.
- Click the resource for which you want to create a snapshot on the Block Storage List page. This will take you to the Block Storage Details page.
- Click the Create Snapshot button. The Create Snapshot popup window opens.
- Enter the snapshot name and description, then click the Confirm button. A snapshot of the current point in time is created.
- Click the Snapshot List button. This will take you to the Block Storage Snapshot List page.
- Verify the created snapshot.
Edit Snapshot
You can edit snapshot information. To edit the snapshot name or description, follow the steps below.
- Click the All Services > Compute > Virtual Server menu. This will take you to the Virtual Server Service Home page.
- Click the Block Storage menu. This will take you to the Block Storage List page.
- Click the resource whose snapshot information you want to edit on the Block Storage List page. This will take you to the Block Storage Details page.
- Click the Snapshot List button. This will take you to the Block Storage Snapshot List page.
- Find the snapshot to edit, then click the More button.
- Click the Edit button. The Edit Snapshot popup window opens.
- Enter the snapshot name or description and click the Confirm button.
Recover Snapshot
You can restore a Block Storage volume to the latest snapshot in Available state. To perform snapshot restoration, follow the steps below.
- Click the All Services > Compute > Virtual Server menu. This will take you to the Virtual Server Service Home page.
- Click the Block Storage menu. This will take you to the Block Storage List page.
- Click the resource to recover from a snapshot on the Block Storage List page. This will take you to the Block Storage Details page.
- If a server has been added in the Connected Server item, connect to the server and perform the disconnection work (Umount, Disk Offline).
- For details on server disconnection, see Disconnecting from the Server.
- Click the Disconnect button in the Connected Server item on the Block Storage Details page. The connected server is removed.
- For details on disconnecting the connected server, see Modifying Connected Server.
- Click the Snapshot List button. This will take you to the Block Storage Snapshot List page.
- Check the latest snapshot in Available state. The volume will be recovered with that snapshot.
- Click the Snapshot Recovery button. The Snapshot Recovery popup window opens.
- After checking the snapshot name and creation date/time, click the Confirm button.
- When recovery starts, the volume status becomes Reverting; when recovery completes, it becomes Available.
- Click the Details button. This will take you to the Block Storage Details page.
- Click the Add button in the Connected Server item and reconnect the Virtual Server.
- For details on adding a connected server, see Modifying Connected Server.
- After connecting to the added server, perform the connection work (Mount, Disk Online) according to the operating system.
- For details on server connection, see Connecting to the Server.
- Snapshot recovery is possible only when the volume is not connected to a server.
- To recover using a snapshot other than the latest one, create a recovery volume instead.
- Recovery is not possible in the following cases:
- When the Block Storage volume is not in Available state
- When a server is connected to the Block Storage volume
- When there are no recoverable snapshots
- When the latest snapshot changes while the recovery is being created
- When the latest snapshot is not in Available state
- When the snapshot's volume size differs from the Block Storage volume size (when the volume has been expanded)
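The conditions above can be summarized as a small check. This is an illustrative shell sketch only; the status names are the console's volume and snapshot statuses, and `can_recover` is a hypothetical helper, not an official API.

```shell
# Illustrative sketch of the snapshot recovery preconditions listed above.
# Args: volume status, number of connected servers, latest snapshot status.
can_recover() {
  [ "$1" = "Available" ] && [ "$2" -eq 0 ] && [ "$3" = "Available" ]
}

can_recover Available 0 Available && echo "recovery possible"
can_recover "In Use" 1 Available || echo "disconnect the server first"
```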
Create Snapshot Recovery Volume
You can create a new volume from a snapshot. To create a snapshot recovery volume, follow these steps.
- Click the All Services > Compute > Virtual Server menu. This will take you to the Virtual Server Service Home page.
- Click the Block Storage menu. This will take you to the Block Storage List page.
- Click the resource for which you want to create a snapshot recovery volume on the Block Storage List page. This will take you to the Block Storage Details page.
- Click the Snapshot List button. This will take you to the Block Storage Snapshot List page.
- After checking the snapshot name, description, and creation date/time, click the More button of the snapshot you want to recover.
- Click Create Recovery Volume. The Create Snapshot Recovery Volume popup window opens.
- Click the Confirm button. This will take you to the Create Block Storage page.
- Enter the required information for creating the service on the Create Block Storage page and select detailed options.
- Enter the volume name and capacity. The capacity must be equal to or larger than the original volume.
- The disk type is set the same as the original and cannot be modified.
| Item | Required | Description |
|---|---|---|
| Volume Name | Required | Volume name<br>- Enter up to 255 characters using English letters, numbers, spaces, and special characters (-, _) |
| Disk Type | Required | Select the disk type<br>- HDD: Standard volume<br>- SSD: High-performance standard volume<br>- HDD/SSD_KMS: Volume that additionally provides in-transit encryption between the instance and Block Storage<br>- HDD/SSD_MultiAttach: Volume that can be connected to 2 or more servers<br>- Cannot be modified after service creation<br>- When creating a service through snapshot recovery volume creation, it is set the same as the original and cannot be modified |
| Capacity | Optional | Set the capacity<br>- Can be created within 8~12,288GB<br>- Enter the number of units provided in 8GB increments<br>- When creating a service through snapshot recovery volume creation, enter a capacity equal to or larger than the original |
| Recovery Snapshot Name | Optional | Name of the recovery snapshot used when creating the volume<br>- The recovery snapshot name is provided when creating a service through snapshot recovery volume creation |

Table. Block Storage service information input items
- Review the detailed information and estimated charges in the Summary panel, then click the Complete button.
- Once creation is complete, verify the created resource on the Block Storage List page.
Delete Snapshot
You can delete snapshots that are no longer needed. To delete a snapshot, follow these steps.
- Click the All Services > Compute > Virtual Server menu. This will take you to the Virtual Server Service Home page.
- Click the Block Storage menu. This will take you to the Block Storage List page.
- Click the resource whose snapshot you want to delete on the Block Storage List page. This will take you to the Block Storage Details page.
- Click the Snapshot List button. This will take you to the Block Storage Snapshot List page.
- After checking the snapshot name, description, and creation date/time, click the More button of the snapshot you want to delete.
- Click the Delete button. The snapshot is removed from the Snapshot List page.
8.2.3 - Volume Transfer
You can transfer a volume to a different Account; once transferred, the volume is removed from its original location. Volume transfer can be performed from the Block Storage List or the Block Storage Details page.
Transferring a Volume
You can transfer a volume to a different account within the same region. To transfer a volume, follow these steps.
- Click the All Services > Compute > Virtual Server menu. This will take you to the Virtual Server Service Home page.
- Click the Block Storage menu. This will take you to the Block Storage List page.
- Select the resource to transfer on the Block Storage List page, then click the More > Volume Transfer button at the top left of the list.
- Alternatively, click the Volume Transfer button at the top of the Block Storage Details page of the resource to transfer.
- When the Volume Transfer popup window appears, check the name of the volume to transfer and click the Confirm button.
- When the transfer completion popup window opens, click the Confirm button. The Volume Transfer ID and Approval Key information is downloaded as a text file.
- The volume changes to Awaiting Transfer status.
- Volume transfer is possible within the same region.
- Volume transfer is only possible when the volume is in Available state. If it is In Use, disconnect all connected servers first.
Canceling a Volume Transfer
A volume transfer can be canceled after it is created. To cancel a volume transfer, follow these steps.
- Click the All Services > Compute > Virtual Server menu. This will take you to the Virtual Server Service Home page.
- Click the Block Storage menu. This will take you to the Block Storage List page.
- Click the resource whose volume transfer you want to cancel on the Block Storage List page. This will take you to the Block Storage Details page.
- You can cancel when the volume is in Awaiting Transfer state.
- Click the Cancel Volume Transfer button. The Cancel Volume Transfer popup window opens.
- Check the name of the volume whose transfer you want to cancel and click the Confirm button.
- The volume changes to Available status.
Receiving a Volume Transfer
You can receive volumes from other accounts within the same region. To receive a volume, follow these steps.
- Click the All Services > Compute > Virtual Server menu. This will take you to the Virtual Server Service Home page.
- Click the Block Storage menu. This will take you to the Block Storage List page.
- On the Block Storage List page, click the More > Receive Volume Transfer button at the top left of the list. The Receive Volume Transfer popup window opens.
- Enter the Volume Transfer ID and Approval Key provided when the volume transfer was created.
- The volume is created on the Block Storage List page.
- It may take some time for the change to be reflected.
- The transferred volume is removed from the account that created the volume transfer.
8.3 - API Reference
8.4 - CLI Reference
8.5 - Release Note
Block Storage
- The snapshot is charged based on the size of the original Block Storage.
- It has been linked with Cloud Monitoring.
- You can check IOPS, Latency, Throughput information in Cloud Monitoring.
- Block Storage feature change
- The HDD disk type has been added, and you can select the added type (HDD, HDD_MultiAttach, HDD_KMS) according to the purpose.
- Samsung Cloud Platform common feature changes
- Common CX changes have been applied to Account, IAM, Service Home, tags, and other shared features.
- SSD_KMS disk type has been added.
- When SSD_KMS is selected, encryption through the KMS (Key Management Service) encryption key is added.
- Released a high-performance storage service suitable for handling large-scale data and database workloads.