Leveraging the country’s top reliability, we conveniently and elastically provide optimal computing resources tailored to each use case.
This is the multi-page printable view of this section. Click here to print.
Compute
- 1: Virtual Server
- 1.1: Overview
- 1.1.1: Server Type
- 1.1.2: Monitoring Metrics
- 1.1.3: ServiceWatch Metrics
- 1.2: How-to guides
- 1.2.1: Image
- 1.2.2: Keypair
- 1.2.3: Server Group
- 1.2.4: Change IP
- 1.2.5: Configure Linux NTP
- 1.2.6: Configure RHEL Repo and WKMS
- 1.2.7: Install ServiceWatch Agent
- 1.3: API Reference
- 1.4: CLI Reference
- 1.5: Release Note
- 2: Virtual Server Auto-Scaling
- 2.1: Overview
- 2.1.1: Monitoring Metrics
- 2.1.2: ServiceWatch Metrics
- 2.2: How-to guides
- 2.2.1: Launch Configuration
- 2.2.2: Manage Policy
- 2.2.3: Manage Schedule
- 2.2.4: Manage Notification
- 2.3: API Reference
- 2.4: CLI Reference
- 2.5: Release Note
- 3: GPU Server
- 3.1: Overview
- 3.1.1: Server type
- 3.1.2: Monitoring Metrics
- 3.1.3: ServiceWatch Metrics
- 3.2: How-to guides
- 3.2.1: Manage Image
- 3.2.2: Manage Keypair
- 3.2.3: Use Multi-instance GPU on GPU Server
- 3.2.4: Use NVSwitch on GPU Server
- 3.2.5: Install ServiceWatch Agent
- 3.3: API Reference
- 3.4: CLI Reference
- 3.5: Release Note
- 4: Bare Metal Server
- 4.1: Overview
- 4.1.1: Server Type
- 4.1.2: Monitoring Metrics
- 4.2: How-to guides
- 4.2.1: Install ServiceWatch Agent
- 4.2.2: Setting up RHEL Repo and WKMS
- 4.3: API Reference
- 4.4: CLI Reference
- 4.5: Release Note
- 5: Multi-node GPU Cluster
- 5.1: Overview
- 5.1.1: Server type
- 5.1.2: Monitoring Metrics
- 5.2: How-to guides
- 5.2.1: Manage Cluster Fabric
- 5.2.2: Install ServiceWatch Agent
- 5.2.3: Multi-node GPU Cluster Service Scope and Inspection Guide
- 5.3: Release Note
- 6: Cloud Functions
- 6.1: Overview
- 6.1.1: ServiceWatch Metrics
- 6.2: How-to guides
- 6.2.1: Configure Trigger
- 6.2.2: Blueprint Detailed Guide
- 6.2.3: Integrate PrivateLink Service
- 6.2.4: Resource-based Policy Guide
- 6.3: API Reference
- 6.4: CLI Reference
- 6.5: Release Note
- 7: Virtual Server DR
- 7.1: Overview
- 7.2: Release Note
- 8: Block Storage
- 8.1: Overview
- 8.1.1: Monitoring Metrics
- 8.2: How-to guides
- 8.2.1: Connect to Server
- 8.2.2: Use Snapshot
- 8.2.3: Move Volume
- 8.3: API Reference
- 8.4: CLI Reference
- 8.5: Release Note
1 - Virtual Server
1.1 - Overview
Service Overview
Virtual Server is a cloud computing-optimized virtual server that lets you freely allocate the amount you need at the required time without having to purchase infrastructure resources such as CPU and memory individually. In a cloud environment, you can use resources with optimized performance according to your computing purpose, such as development, testing, and application execution.
Features
Easy and convenient computing environment setup: Through a web-based Console, users can easily perform self-service provisioning of Virtual Servers, as well as resource and cost management. If you need to change the capacity of major resources such as CPU or Memory while using a Virtual Server, you can easily scale up or down without operator intervention.
Providing various types of services: Provides virtualized vCore/Memory resources according to predefined server types (1~128 vCore).
- General Virtual Server: Provides commonly used computing specs (up to 16 vCore, 256 GB)
- High-capacity Virtual Server: Provided when resources larger than the standard Virtual Server spec are needed.
Strong Security Implementation: By using the Security Group service, you can control inbound/outbound traffic communicating with the external Internet or other VPCs (Virtual Private Cloud) to securely protect the server. Additionally, real-time monitoring enables stable operation of computing resources.
Service Architecture Diagram
Provided features
Virtual Server provides the following features.
- Auto Provisioning and Management: Provides Virtual Server provisioning, resource management, and cost management functions through a web-based Console. If you need to change the capacity of major resources such as CPU or Memory while using Virtual Server, you can modify the server type immediately using the server type modification feature.
- Standard Server Types and Image Provision: Provides virtualized vCore/Memory resources according to standard server types, and offers standard OS images.
- Storage Connection: Provides additional attached storage beyond the OS disk. Block Storage, File Storage, and Object Storage can be attached and used.
- Network Connection: You can connect the standard subnet/IP settings of the Virtual Server and the Public NAT IP. Provides a local subnet connection for inter-server communication. This can be modified on the detail page.
- Security Group Application: Use the Security Group service to control inbound and outbound traffic communicating with external internet or other VPCs, thereby securely protecting the server.
- Monitoring: You can view monitoring information for computing resources such as CPU, Memory, and Disk through the Cloud Monitoring service.
- Backup and Recovery: You can back up and restore the Virtual Server Image using the Backup service.
- Cost Management: You can create, stop, or terminate servers as needed, and since billing is based on actual usage time, you can monitor costs according to consumption.
- ServiceWatch Service Integration: You can monitor data using the ServiceWatch service.
Component
Virtual Server provides standard server types and standard OS images. Users can select and use them according to the desired service scale.
Image
You can create and manage images. The main features are as follows.
- Image creation: You can create an Image from the configuration of a Virtual Server you are using, and you can also create an Image by uploading your Image file to Object Storage.
- Create Shared Image: You can create an Image with Visibility set to Private as a Shared Image that can be shared.
- Share with another Account: You can share the Image with another Account.
- Refer to the How-to guides > Image document for how to create and use images.
Keypair
To ensure a more secure OS login, we strengthen security by providing a Key Pair instead of the ID/Password entry method. The main features are as follows.
- Keypair creation: Generate a user credential to connect to the Virtual Server.
- Retrieve Public Key: You can load a file or manually enter the public key to retrieve it.
- Refer to the How-to guides > Keypair document for creating and using keypairs.
Server Group
Through Server Group settings, you can position the Block Storage added when creating a Virtual Server close to or distributed across racks and hosts. The main features are as follows.
- Server Group Creation: You can set Virtual Servers belonging to the same Server Group as Anti-Affinity(Distributed placement), Affinity(Proximate placement), or Partition(Virtual Server and Block Storage distributed placement).
- Refer to the How-to guides > Server Group document for how to create and use Server Groups.
OS Image provided version
The OS images provided by Virtual Server are as follows
| OS Image version | EoS Date |
|---|---|
| Alma Linux 8.10 | 2029-05-31 |
| Alma Linux 9.6 | 2025-11-17 |
| Oracle Linux 8.10 | 2029-07-31 |
| Oracle Linux 9.6 | 2025-11-25 |
| RHEL 8.10 | 2029-05-31 |
| RHEL 9.4 | 2026-04-30 |
| RHEL 9.6 | 2027-05-31 |
| Rocky Linux 8.10 | 2029-05-31 |
| Rocky Linux 9.6 | 2025-11-30 |
| Ubuntu 22.04 | 2027-06-30 |
| Ubuntu 24.04 | 2029-06-30 |
| Windows 2019 | 2029-01-09 |
| Windows 2022 | 2031-10-14 |
| Windows 2016 | 2027-01-12 |
- Linux operating systems such as Alma Linux and Rocky Linux provide only even Minor versions, except for the final release of a Major version. This policy ensures the stability and consistency of the SCP system. We recommend checking the EOS (End of Support) and EOL (End of Life) dates for the operating system, and, if necessary, applying new or additional individual packages to maintain a stable environment.
Server type
The server types supported by Virtual Server are as follows. For detailed information about server types, see Virtual Server Server Types.
Standard s1v2m4
Category | example | Detailed description |
|---|---|---|
| Server type | Standard | Provided server type classifications
|
| Server specifications | s1 | Provided server type classification and generation
|
| Server specifications | v2 | Number of vCores
|
| Server specifications | m4 | Memory capacity
|
Constraints
- When creating a Virtual Server with Rocky Linux or Oracle Linux, additional configuration is required for time synchronization (NTP: Network Time Protocol). For other images, it is set automatically and no separate configuration is needed.
For more details, refer to Linux NTP Setup. - If you created RHEL and Windows Server before August 2025, you need to modify the RHEL Repository and WKMS (Windows Key Management Service) settings.
For more details, see RHEL Repo and WKMS Configuration.
Preliminary Service
This is a list of services that need to be pre-configured before creating the service. Please refer to the guide provided for each service and prepare in advance.
| Service Category | service | Detailed description |
|---|---|---|
| Networking | VPC | A service that provides an isolated virtual network in a cloud environment |
| Networking | Security Group | Virtual firewall that controls server traffic |
1.1.1 - Server Type
Virtual Server server type
Virtual Server provides server types that match the intended use. Server types consist of various combinations such as CPU, Memory, and Network Bandwidth. The host server used for a Virtual Server is determined by the server type selected when creating the Virtual Server. Please choose a server type based on the specifications of the application you plan to run on the Virtual Server.
The server types supported by Virtual Server are as follows.
Standard s1v2m4
Category | example | Detailed description |
|---|---|---|
| Server type | Standard | Provided server type classifications
|
| Server specifications | s1 | Provided server type classification and generation
|
| Server specifications | v2 | Number of vCores
|
| Server specifications | m4 | Memory capacity
|
s1 server type
The s1 server type of Virtual Server is offered with standard specifications (vCPU, Memory) and is suitable for various applications.
- First generation of Samsung Cloud Platform v2: Intel 3rd generation (Ice Lake) Xeon Gold 6342 Processor with up to 3.3 GHz
- Supports up to 16 vCPUs and 256 GB of memory
- Maximum networking speed of 12.5 Gbps
| Category | Server type | vCPU | Memory | Network Bandwidth |
|---|---|---|---|---|
| Standard | s1v1m2 | 1 vCore | 2 GB | Up to 10 Gbps |
| Standard | s1v2m4 | 2 vCore | 4 GB | Up to 10 Gbps |
| Standard | s1v2m8 | 2 vCore | 8 GB | Up to 10 Gbps |
| Standard | s1v2m16 | 2 vCore | 16 GB | Up to 10 Gbps |
| Standard | s1v2m24 | 2 vCore | 24 GB | Up to 10 Gbps |
| Standard | s1v2m32 | 2 vCore | 32 GB | Up to 10 Gbps |
| Standard | s1v4m8 | 4 vCore | 8 GB | Up to 10 Gbps |
| Standard | s1v4m16 | 4 vCore | 16 GB | Up to 10 Gbps |
| Standard | s1v4m32 | 4 vCore | 32 GB | Up to 10 Gbps |
| Standard | s1v4m48 | 4 vCore | 48 GB | Up to 10 Gbps |
| Standard | s1v4m64 | 4 vCore | 64 GB | Up to 10 Gbps |
| Standard | s1v6m12 | 6 vCore | 12 GB | Up to 10 Gbps |
| Standard | s1v6m24 | 6 vCore | 24 GB | Up to 10 Gbps |
| Standard | s1v6m48 | 6 vCore | 48 GB | Up to 10 Gbps |
| Standard | s1v6m72 | 6 vCore | 72 GB | Up to 10 Gbps |
| Standard | s1v6m96 | 6 vCore | 96 GB | Up to 10 Gbps |
| Standard | s1v8m16 | 8 vCore | 16 GB | Up to 10 Gbps |
| Standard | s1v8m32 | 8 vCore | 32 GB | Up to 10 Gbps |
| Standard | s1v8m64 | 8 vCore | 64 GB | Up to 10 Gbps |
| Standard | s1v8m96 | 8 vCore | 96 GB | Up to 10 Gbps |
| Standard | s1v8m128 | 8 vCore | 128 GB | Up to 10 Gbps |
| Standard | s1v10m20 | 10 vCore | 20 GB | Up to 10 Gbps |
| Standard | s1v10m40 | 10 vCore | 40 GB | Up to 10 Gbps |
| Standard | s1v10m80 | 10 vCore | 80 GB | Up to 10 Gbps |
| Standard | s1v10m120 | 10 vCore | 120 GB | Up to 10 Gbps |
| Standard | s1v10m160 | 10 vCore | 160 GB | Up to 10 Gbps |
| Standard | s1v12m24 | 12 vCore | 24 GB | Up to 12.5 Gbps |
| Standard | s1v12m48 | 12 vCore | 48 GB | Up to 12.5 Gbps |
| Standard | s1v12m96 | 12 vCore | 96 GB | Up to 12.5 Gbps |
| Standard | s1v12m144 | 12 vCore | 144 GB | Up to 12.5 Gbps |
| Standard | s1v12m192 | 12 vCore | 192 GB | Up to 12.5 Gbps |
| Standard | s1v14m28 | 14 vCore | 28 GB | Up to 12.5 Gbps |
| Standard | s1v14m56 | 14 vCore | 56 GB | Up to 12.5 Gbps |
| Standard | s1v14m112 | 14 vCore | 112 GB | Up to 12.5 Gbps |
| Standard | s1v14m168 | 14 vCore | 168 GB | Up to 12.5 Gbps |
| Standard | s1v14m224 | 14 vCore | 224 GB | Up to 12.5 Gbps |
| Standard | s1v16m32 | 16 vCore | 32 GB | Up to 12.5 Gbps |
| Standard | s1v16m64 | 16 vCore | 64 GB | Up to 12.5 Gbps |
| Standard | s1v16m128 | 16 vCore | 128 GB | Up to 12.5 Gbps |
| Standard | s1v16m192 | 16 vCore | 192 GB | Up to 12.5 Gbps |
| Standard | s1v16m256 | 16 vCore | 256 GB | Up to 12.5 Gbps |
s2 server type
The Virtual Server s2 server type is offered with standard specifications (vCPU, Memory) and is suitable for various applications.
- Samsung Cloud Platform v2, 2nd generation: Intel 4th‑generation (Sapphire Rapids) Xeon Gold 6448H Processor up to 3.2 GHz
- Supports up to 16 vCPUs and 256 GB of memory
- Maximum networking speed of 12.5 Gbps
| Category | Server type | CPU vCore | Memory | Network Bandwidth(Gbps) |
|---|---|---|---|---|
| Standard | s2v1m2 | 1 vCore | 2 GB | Up to 10 Gbps |
| Standard | s2v2m4 | 2 vCore | 4 GB | Up to 10 Gbps |
| Standard | s2v2m8 | 2 vCore | 8 GB | Up to 10 Gbps |
| Standard | s2v2m16 | 2 vCore | 16 GB | Up to 10 Gbps |
| Standard | s2v2m24 | 2 vCore | 24 GB | Up to 10 Gbps |
| Standard | s2v2m32 | 2 vCore | 32 GB | Up to 10 Gbps |
| Standard | s2v4m8 | 4 vCore | 8 GB | Up to 10 Gbps |
| Standard | s2v4m16 | 4 vCore | 16 GB | Up to 10 Gbps |
| Standard | s2v4m32 | 4 vCore | 32 GB | Up to 10 Gbps |
| Standard | s2v4m48 | 4 vCore | 48 GB | Up to 10 Gbps |
| Standard | s2v4m64 | 4 vCore | 64 GB | Up to 10 Gbps |
| Standard | s2v6m12 | 6 vCore | 12 GB | Up to 10 Gbps |
| Standard | s2v6m24 | 6 vCore | 24 GB | Up to 10 Gbps |
| Standard | s2v6m48 | 6 vCore | 48 GB | Up to 10 Gbps |
| Standard | s2v6m72 | 6 vCore | 72 GB | Up to 10 Gbps |
| Standard | s2v6m96 | 6 vCore | 96 GB | Up to 10 Gbps |
| Standard | s2v8m16 | 8 vCore | 16 GB | Up to 10 Gbps |
| Standard | s2v8m32 | 8 vCore | 32 GB | Up to 10 Gbps |
| Standard | s2v8m64 | 8 vCore | 64 GB | Up to 10 Gbps |
| Standard | s2v8m96 | 8 vCore | 96 GB | Up to 10 Gbps |
| Standard | s2v8m128 | 8 vCore | 128 GB | Up to 10 Gbps |
| Standard | s2v10m20 | 10 vCore | 20 GB | Up to 10 Gbps |
| Standard | s2v10m40 | 10 vCore | 40 GB | Up to 10 Gbps |
| Standard | s2v10m80 | 10 vCore | 80 GB | Up to 10 Gbps |
| Standard | s2v10m120 | 10 vCore | 120 GB | Up to 10 Gbps |
| Standard | s2v10m160 | 10 vCore | 160 GB | Up to 10 Gbps |
| Standard | s2v12m24 | 12 vCore | 24 GB | Up to 12.5 Gbps |
| Standard | s2v12m48 | 12 vCore | 48 GB | Up to 12.5 Gbps |
| Standard | s2v12m96 | 12 vCore | 96 GB | Up to 12.5 Gbps |
| Standard | s2v12m144 | 12 vCore | 144 GB | Up to 12.5 Gbps |
| Standard | s2v12m192 | 12 vCore | 192 GB | Up to 12.5 Gbps |
| Standard | s2v14m28 | 14 vCore | 28 GB | Up to 12.5 Gbps |
| Standard | s2v14m56 | 14 vCore | 56 GB | Up to 12.5 Gbps |
| Standard | s2v14m112 | 14 vCore | 112 GB | Up to 12.5 Gbps |
| Standard | s2v14m168 | 14 vCore | 168 GB | Up to 12.5 Gbps |
| Standard | s2v14m224 | 14 vCore | 224 GB | Up to 12.5 Gbps |
| Standard | s2v16m32 | 16 vCore | 32 GB | Up to 12.5 Gbps |
| Standard | s2v16m64 | 16 vCore | 64 GB | Up to 12.5 Gbps |
| Standard | s2v16m128 | 16 vCore | 128 GB | Up to 12.5 Gbps |
| Standard | s2v16m192 | 16 vCore | 192 GB | Up to 12.5 Gbps |
| Standard | s2v16m256 | 16 vCore | 256 GB | Up to 12.5 Gbps |
h2 Server Type
The h2 server type of Virtual Server is offered with high-capacity specifications and is suitable for applications that require large-scale data processing.
- Second generation of Samsung Cloud Platform v2: Intel 4th‑generation (Sapphire Rapids) Xeon Gold 6448H processor up to 3.2 GHz
- Supports up to 128 vCPUs and 1,536 GB of memory
- Networking speed up to 25 Gbps
| Category | Server type | vCPU | Memory | Network Bandwidth |
|---|---|---|---|---|
| High Capacity | h2v24m48 | 24 vCore | 48 GB | Up to 25 Gbps |
| High Capacity | h2v24m96 | 24 vCore | 96 GB | Maximum 25 Gbps |
| High Capacity | h2v24m192 | 24 vCore | 192 GB | Maximum 25 Gbps |
| High Capacity | h2v24m288 | 24 vCore | 288 GB | Maximum 25 Gbps |
| High Capacity | h2v32m64 | 32 vCore | 64 GB | Maximum 25 Gbps |
| High Capacity | h2v32m128 | 32 vCore | 128 GB | Maximum 25 Gbps |
| High Capacity | h2v32m256 | 32 vCore | 256 GB | Maximum 25 Gbps |
| High Capacity | h2v32m384 | 32 vCore | 384 GB | Maximum 25 Gbps |
| High Capacity | h2v48m96 | 48 vCore | 96 GB | Maximum 25 Gbps |
| High Capacity | h2v48m192 | 48 vCore | 192 GB | Maximum 25 Gbps |
| High Capacity | h2v48m384 | 48 vCore | 384 GB | Maximum 25 Gbps |
| High Capacity | h2v48m576 | 48 vCore | 576 GB | Maximum 25 Gbps |
| High Capacity | h2v64m128 | 64 vCore | 128 GB | Maximum 25 Gbps |
| High Capacity | h2v64m256 | 64 vCore | 256 GB | Up to 25 Gbps |
| High Capacity | h2v64m512 | 64 vCore | 512 GB | Maximum 25 Gbps |
| High Capacity | h2v64m768 | 64 vCore | 768 GB | Maximum 25 Gbps |
| High Capacity | h2v72m144 | 72 vCore | 144 GB | Up to 25 Gbps |
| High Capacity | h2v72m288 | 72 vCore | 288 GB | Maximum 25 Gbps |
| High Capacity | h2v72m576 | 72 vCore | 576 GB | Maximum 25 Gbps |
| High Capacity | h2v72m864 | 72 vCore | 864 GB | Maximum 25 Gbps |
| High Capacity | h2v96m192 | 96 vCore | 192 GB | Maximum 25 Gbps |
| High Capacity | h2v96m384 | 96 vCore | 384 GB | Maximum 25 Gbps |
| High Capacity | h2v96m768 | 96 vCore | 768 GB | Maximum 25 Gbps |
| High Capacity | h2v96m1152 | 96 vCore | 1152 GB | Maximum 25 Gbps |
| High Capacity | h2v128m256 | 128 vCore | 256 GB | Maximum 25 Gbps |
| High Capacity | h2v128m512 | 128 vCore | 512 GB | Maximum 25 Gbps |
| High Capacity | h2v128m1024 | 128 vCore | 1024 GB | Up to 25 Gbps |
| High Capacity | h2v128m1536 | 128 vCore | 1536 GB | Maximum 25 Gbps |
1.1.2 - Monitoring Metrics
According to Samsung Cloud Platform’s policy, the Cloud Monitoring service is scheduled to be discontinued in September 2026.
Starting after the September 2026 release, resource monitoring of the Samsung Cloud Platform via Cloud Monitoring will no longer be possible.
With the new alternative service, you can continuously perform resource monitoring by leveraging ServiceWatch released in October 2025.
ServiceWatch provides more modern and powerful features, replacing Cloud Monitoring to deliver a seamless monitoring environment.
Detailed information about ServiceWatch can be found in the ServiceWatch Overview.
Virtual Server Monitoring Metrics
The table below shows the monitoring metrics of Virtual Server that can be viewed through Cloud Monitoring. For detailed usage of Cloud Monitoring, refer to the Cloud Monitoring guide.
Provides basic monitoring metrics even without installing an agent, as shown below table. Please check the Virtual Server monitoring metrics (default). Additionally, the metrics that can be viewed by installing the Agent are in the table below. Virtual Server additional monitoring metrics (Agent installation required)** Please refer to it.
For Windows OS, memory-related metrics can only be viewed after installing the Agent.
| Performance items | Detailed description | unit |
|---|---|---|
| Memory Total [Basic] | bytes of usable memory | bytes |
| Memory Used [Basic] | Current memory usage in bytes | bytes |
| Memory Swap In [Basic] | bytes of the replaced memory | bytes |
| Memory Swap Out [Basic] | bytes of the replaced memory | bytes |
| Memory Free [Basic] | bytes of unused memory | bytes |
| Disk Read Bytes [Basic] | Read bytes | bytes |
| Disk Read Requests [Basic] | Number of read requests | cnt |
| Disk Write Bytes [Basic] | write bytes | bytes |
| Disk Write Requests [Basic] | Number of write requests | cnt |
| CPU Usage [Basic] | Average system CPU usage over 1 minute | % |
| Instance State [Basic] | Instance status | state |
| Network In Bytes [Basic] | Received bytes | bytes |
| Network In Dropped [Basic] | Incoming packet drop | cnt |
| Network In Packets [Basic] | Number of received packets | cnt |
| Network Out Bytes [Basic] | sent bytes | bytes |
| Network Out Dropped [Basic] | Transmit packet drop | cnt |
| Network Out Packets [Basic] | Number of transmitted packets | cnt |
| Performance items | Detailed description | unit |
|---|---|---|
| Core Usage [IO Wait] | Ratio of CPU time spent in wait state (disk wait) | % |
| Core Usage [System] | Proportion of CPU time spent in kernel space | % |
| Core Usage [User] | Proportion of CPU time spent in user space | % |
| CPU Cores | Number of CPU cores on the host | cnt |
| CPU Usage [Active] | Percentage of CPU time used other than Idle and IOWait states | % |
| CPU Usage [Idle] | It is the proportion of CPU time spent in idle state. | % |
| CPU Usage [IO Wait] | This is the proportion of CPU time spent in a waiting state (disk wait). | % |
| CPU Usage [System] | Percentage of CPU time used by the kernel | % |
| CPU Usage [User] | Percentage of CPU time used in user space | % |
| CPU Usage/Core [Active] | Percentage of CPU time used other than Idle and IOWait states | % |
| CPU Usage/Core [Idle] | It is the proportion of CPU time spent in idle state. | % |
| CPU Usage/Core [IO Wait] | This is the proportion of CPU time spent in a waiting state (disk wait). | % |
| CPU Usage/Core [System] | Percentage of CPU time used by the kernel | % |
| CPU Usage/Core [User] | Percentage of CPU time used in user space | % |
| DiskCPU Usage [IO Request] | Proportion of CPU time during which I/O requests to the device were executed | % |
| Disk Queue Size [Avg] | The average queue length of requests executed for the device. | num |
| Disk Read Bytes | The number of bytes read per second from the device. | bytes |
| Disk Read Bytes [Delta Avg] | Average of system.diskio.read.bytes_delta for individual disks | bytes |
| Disk Read Bytes [Delta Max] | Maximum system.diskio.read.bytes_delta of individual disks | bytes |
| Disk Read Bytes [Delta Min] | Minimum of system.diskio.read.bytes_delta for individual disks | bytes |
| Disk Read Bytes [Delta Sum] | Sum of system.diskio.read.bytes_delta of individual disks | bytes |
| Disk Read Bytes [Delta] | Delta of the system.diskio.read.bytes value for each Disk | bytes |
| Disk Read Bytes [Success] | Total bytes successfully read | bytes |
| Disk Read Requests | Number of read requests to the disk device per second | cnt |
| Disk Read Requests [Delta Avg] | Average of system.diskio.read.count_delta for individual disks | cnt |
| Disk Read Requests [Delta Max] | Maximum system.diskio.read.count_delta for individual disks | cnt |
| Disk Read Requests [Delta Min] | Minimum of system.diskio.read.count_delta for individual disks | cnt |
| Disk Read Requests [Delta Sum] | Sum of the system.diskio.read.count_delta of individual disks | cnt |
| Disk Read Requests [Success Delta] | Delta of system.diskio.read.count for each Disk | cnt |
| Disk Read Requests [Success] | Total number of successful reads | cnt |
| Disk Request Size [Avg] | It is the average size of requests executed on the device (unit: sectors). | num |
| Disk Service Time [Avg] | Average service time (milliseconds) of input requests executed on the device. | ms |
| Disk Wait Time [Avg] | Average time taken for requests executed on the supported device. | ms |
| Disk Wait Time [Read] | Average disk wait time | ms |
| Disk Wait Time [Write] | Average disk wait time | ms |
| Disk Write Bytes [Delta Avg] | Average of system.diskio.write.bytes_delta for each disk | bytes |
| Disk Write Bytes [Delta Max] | Maximum system.diskio.write.bytes_delta of individual disks | bytes |
| Disk Write Bytes [Delta Min] | Minimum of system.diskio.write.bytes_delta for individual disks | bytes |
| Disk Write Bytes [Delta Sum] | Sum of system.diskio.write.bytes_delta for individual disks | bytes |
| Disk Write Bytes [Delta] | Delta of the system.diskio.write.bytes value for each Disk | bytes |
| Disk Write Bytes [Success] | Total number of bytes successfully written | bytes |
| Disk Write Requests | Number of write requests to the disk device per second | cnt |
| Disk Write Requests [Delta Avg] | Average of system.diskio.write.count_delta for individual disks | cnt |
| Disk Write Requests [Delta Max] | Maximum system.diskio.write.count_delta of individual disks | cnt |
| Disk Write Requests [Delta Min] | Minimum of system.diskio.write.count_delta for individual disks | cnt |
| Disk Write Requests [Delta Sum] | Sum of system.diskio.write.count_delta for individual disks | cnt |
| Disk Write Requests [Success Delta] | Delta of system.diskio.write.count for each Disk | cnt |
| Disk Write Requests [Success] | Total number of successful writes | cnt |
| Disk Writes Bytes | Bytes per second written to the device | bytes |
| Filesystem Hang Check | filesystem(local/NFS) hang check(normal:1, abnormal:0) | status |
| Filesystem Nodes | Total number of file nodes in the file system. | cnt |
| Filesystem Nodes [Free] | It is the total number of available file nodes in the file system. | cnt |
| Filesystem Size [Available] | Disk space (bytes) available to unauthorized users | bytes |
| Filesystem Size [Free] | Available disk space (bytes) | bytes |
| Filesystem Size [Total] | Total disk space (bytes) | bytes |
| Filesystem Usage | Used disk space percentage | % |
| Filesystem Usage [Avg] | Average of individual filesystem.used.pct | % |
| Filesystem Usage [Inode] | iNode usage rate | % |
| Filesystem Usage [Max] | Maximum among individual filesystem.used.pct | % |
| Filesystem Usage [Min] | Min among individual filesystem.used.pct | % |
| Filesystem Usage [Total] | - | % |
| Filesystem Used | Used disk space (bytes) | bytes |
| Filesystem Used [Inode] | iNode usage | bytes |
| Memory Free | Total available memory (bytes) | bytes |
| Memory Free [Actual] | Actual usable Memory (bytes) | bytes |
| Memory Free [Swap] | Available Swap memory | bytes |
| Memory Total | Total Memory | bytes |
| Memory Total [Swap] | Total Swap memory. | bytes |
| Memory Usage | Percentage of used memory | % |
| Memory Usage [Actual] | Percentage of memory actually used | % |
| Memory Usage [Cache Swap] | cached swap usage | % |
| Memory Usage [Swap] | Percentage of used Swap memory | % |
| Memory Used | Used Memory | bytes |
| Memory Used [Actual] | Actual memory used (bytes) | bytes |
| Memory Used [Swap] | Used Swap memory | bytes |
| Collisions | Network collision | cnt |
| Network In Bytes | Number of received bytes | bytes |
| Network In Bytes [Delta Avg] | Average of system.network.in.bytes_delta for individual networks | bytes |
| Network In Bytes [Delta Max] | Maximum system.network.in.bytes_delta for each Network | bytes |
| Network In Bytes [Delta Min] | Minimum of system.network.in.bytes_delta for each network | bytes |
| Network In Bytes [Delta Sum] | Sum of system.network.in.bytes_delta for individual networks | bytes |
| Network In Bytes [Delta] | Delta of received byte count | bytes |
| Network In Dropped | Number of deleted packets among incoming packets | cnt |
| Network In Errors | Number of errors during reception | cnt |
| Network In Packets | Number of received packets | cnt |
| Network In Packets [Delta Avg] | Average of system.network.in.packets_delta for individual Networks | cnt |
| Network In Packets [Delta Max] | Maximum of system.network.in.packets_delta for each Network | cnt |
| Network In Packets [Delta Min] | Minimum of system.network.in.packets_delta for each Network | cnt |
| Network In Packets [Delta Sum] | Sum of system.network.in.packets_delta for individual Networks | cnt |
| Network In Packets [Delta] | Delta of received packet count | cnt |
| Network Out Bytes | Number of transmitted bytes | bytes |
| Network Out Bytes [Delta Avg] | Average of system.network.out.bytes_delta for individual networks | bytes |
| Network Out Bytes [Delta Max] | Maximum system.network.out.bytes_delta for each Network | bytes |
| Network Out Bytes [Delta Min] | Minimum of system.network.out.bytes_delta for individual Networks | bytes |
| Network Out Bytes [Delta Sum] | Sum of system.network.out.bytes_delta for individual Networks | bytes |
| Network Out Bytes [Delta] | Delta of transmitted byte count | bytes |
| Network Out Dropped | Number of deleted packets among outgoing packets | cnt |
| Network Out Errors | Number of errors during transmission | cnt |
| Network Out Packets | Number of transmitted packets | cnt |
| Network Out Packets [Delta Avg] | Average of system.network.out.packets_delta for each Network | cnt |
| Network Out Packets [Delta Max] | Maximum system.network.out.packets_delta for each Network | cnt |
| Network Out Packets [Delta Min] | Minimum of system.network.out.packets_delta for each Network | cnt |
| Network Out Packets [Delta Sum] | Sum of system.network.out.packets_delta for individual networks | cnt |
| Network Out Packets [Delta] | Delta of transmitted packet count | cnt |
| Open Connections [TCP] | All open TCP connections | cnt |
| Open Connections [UDP] | All open UDP connections | cnt |
| Port Usage | Connectable port utilization | % |
| SYN Sent Sockets | Number of sockets in SYN_SENT state (when connecting from local to remote) | cnt |
| Kernel PID Max | kernel.pid_max value | cnt |
| Kernel Thread Max | kernel.threads-max value | cnt |
| Process CPU Usage | Percentage of CPU time consumed by the process since the last update | % |
| Process CPU Usage/Core | Percentage of CPU time used by the process since the last event | % |
| Process Memory Usage | Proportion of main memory (RAM) occupied by a process | % |
| Process Memory Used | Resident Set size. The amount of memory a process occupies in RAM. | bytes |
| Process PID | process pid | pid |
| Process PPID | Parent process PID | pid |
| Processes [Dead] | Number of deadProcesses | cnt |
| Processes [Idle] | idle Processes count | cnt |
| Processes [Running] | running Processes count | cnt |
| Processes [Sleeping] | sleeping processes count | cnt |
| Processes [Stopped] | stopped processes count | cnt |
| Processes [Total] | Total number of processes | cnt |
| Processes [Unknown] | Number of processes whose status cannot be retrieved or is unknown | cnt |
| Processes [Zombie] | Number of zombie processes | cnt |
| Running Process Usage | process usage | % |
| Running Processes | Number of running processes | cnt |
| Running Thread Usage | Thread usage rate | % |
| Running Threads | Total number of threads running in running processes | cnt |
| Context Switches | context switch count (per second) | cnt |
| Load/Core [1 min] | The load over the last 1 minute divided by the number of cores | cnt |
| Load/Core [15 min] | The load over the last 15 minutes divided by the number of cores | cnt |
| Load/Core [5 min] | The load over the last 5 minutes divided by the number of cores | cnt |
| Multipaths [Active] | External storage connection path status = active count | cnt |
| Multipaths [Failed] | External storage connection path status = failed count | cnt |
| Multipaths [Faulty] | External storage connection path status = faulty count | cnt |
| NTP Offset last | sample’s measured offset (time difference between NTP server and local environment) | num |
| Run Queue Length | Execution queue length | num |
| Uptime | OS uptime (milliseconds) | ms |
| Context Switchies CPU | context switch count (per second) | cnt |
| Disk Read Bytes [Sec] | Number of bytes read in one second from a Windows logical disk
| cnt |
| Disk Read Time [Avg] | Data read average time (seconds)
| sec |
| Disk Transfer Time [Avg] | Disk average wait time (seconds)
| sec |
| Disk Write Bytes [Sec] | Number of bytes written in one second on a Windows logical disk
| cnt |
| Disk Write Time [Avg] | Average data write time (seconds)
| sec |
| Pagingfile Usage | paging file usage
| % |
| Pool Used [Non Paged] | Nonpaged Pool usage in kernel memory
| bytes |
| Pool Used [Paged] | Paged Pool usage in kernel memory
| bytes |
| Process [Running] | Number of currently running processes
| cnt |
| Threads [Running] | Number of currently running threads
| cnt |
| Threads [Waiting] | Number of threads waiting for processor time
| cnt |
1.1.3 - ServiceWatch Metrics
Virtual Server sends metrics to ServiceWatch. The metrics provided by default monitoring are data collected at 5‑minute intervals. If detailed monitoring is enabled, you can view data collected at 1‑minute intervals.
See How-to guides > ServiceWatch Enable Detailed Monitoring for instructions on enabling detailed monitoring of Virtual Server.
Basic Metrics
The following are the basic metrics for the Virtual Server namespace.
The indicators whose names are displayed in bold below are the indicators selected as key metrics among the default metrics provided by Virtual Server. The key metrics are used to build service dashboards that are automatically generated for each service in ServiceWatch. They can also be viewed on the Monitoring tab of the Virtual Server detail page.
Each metric provides guidance in the user guide on which statistical value is meaningful to query, and among the meaningful statistics, the values shown in bold text are the primary statistics. In the service dashboard or monitoring tab, you can view primary metrics using these primary statistics.
| Performance item (indicator name) | Detailed description | unit | meaningful statistics | |
|---|---|---|---|---|
| Instance State | Instance status display
| None |
| |
| CPU Usage | CPU usage | Percent |
| |
| Disk Read Bytes | Bytes read from block device (bytes) | Bytes |
| |
| Disk Read Requests | Number of read requests on a block device | Count |
| |
| Disk Write Bytes | Write capacity (bytes) on block device | Bytes |
| |
| Disk Write Requests | Number of write requests on block device | Count |
| |
| Network In Bytes | Received bytes on the network interface | Bytes |
| |
| Network In Dropped | Number of packet drops received on the network interface | Count |
| |
| Network In Packets | Number of packets received on the network interface | Count |
| |
| Network Out Bytes | Data transmitted on the network interface (bytes) | Bytes |
| |
| Network Out Dropped | Number of packet drops transmitted from the network interface | Count |
| |
| Network Out Packets | Number of packets transmitted on the network interface | Count |
|
1.2 - How-to guides
Users can create the service by entering the required information for a Virtual Server and selecting detailed options through the Samsung Cloud Platform Console.
Create Virtual Server
You can create and use a Virtual Server service in the Samsung Cloud Platform Console.
To create a Virtual Server, follow these steps.
Click the All Services > Compute > Virtual Server menu. 1. Navigate to the Service Home page of the Virtual Server.
On the Service Home page, click the Create Virtual Server button. 2. Navigate to the Create Virtual Server page.
On the Virtual Server Creation page, enter the information required to create the service and select detailed options.
- Select the required information in the Image and version selection area.
Category required statusDetailed description Image Required Select the provided Image type - Standard: Samsung Cloud Platform standard provided Image
- Alma Linux, Oracle Linux, RHEL, Rocky Linux, Ubuntu, Windows
- Custom: User-created Image
- Kubernetes: Image for Kubernetes
- RHEL, Ubuntu
- Marketplace: Image subscribed from Marketplace
Image version Required Select version of the selected Image - Provide version list of the server Image
Table. Virtual Server Image and version selection input items - Standard: Samsung Cloud Platform standard provided Image
- In the Service Information Input area, enter or select the required information.
Category required statusDetailed description Number of servers Required Number of servers to create concurrently - Only numeric input is allowed, and enter a value between 1~100
Service Type > Server Type Required Virtual Server server type - Standard: standard specifications commonly used
- High Capacity: large‑capacity server specifications beyond Standard
- For detailed information about the server types provided by Virtual Server, see Virtual Server 서버 타입
Service Type > Planned Compute Required Planned Compute-configured resource status - In Use: Number of resources with Planned Compute that are currently in use
- Configured: Number of resources with Planned Compute configured
- Coverage Preview: Amount applied per resource by Planned Compute
- Apply for Planned Compute Service: Go to the Planned Compute service creation page
- For more details, refer to Planned Compute Apply
Block Storage Essential Block Storage settings used by the server according to purpose - Basic OS: Area where the OS is installed and used
- Capacity is entered in Units, and the minimum size varies by OS Image type
- Alma Linux: Enter a value between 2~1,536
- Oracle Linux: Enter a value between 7~1,536
- RHEL: Enter a value between 2~1,536
- Rocky Linux: Enter a value between 2~1,536
- Ubuntu: Enter a value between 2~1,536
- Windows: Enter a value between 4~1,536
- SSD: High‑performance general volume
- HDD: General volume
- SSD/HDD_KMS: Additional encrypted volume using Samsung Cloud Platform KMS (Key Management Service) encryption keys
- Encryption can be applied only at initial creation and cannot be changed afterward
- Using the SSD_KMS disk type may cause performance degradation
- SSD_Provisioned: SSD volume with configurable IOPS and Throughput
- Capacity is entered in Units, and the minimum size varies by OS Image type
- Additional: Used when extra user space beyond the OS area is needed
- After selecting Use, enter the storage type and capacity
- To add storage, click the + button; to delete, click the x button (up to 25 can be added)
- Capacity is entered in Units, with a value between 1~1,536
- 1 Unit equals 8 GB, so 8~12,288 GB are created
- SSD: High‑performance general volume
- HDD: General volume
- SSD/HDD_KMS: Additional encrypted volume using Samsung Cloud Platform KMS (Key Management Service) encryption keys
- Encryption can be applied only at initial creation and cannot be changed afterward
- Using the SSD_KMS disk type may cause performance degradation
- SSD/HDD_MultiAttach: Volume that can be attached to two or more servers
- SSD_Provisioned: SSD volume with configurable IOPS and Throughput
- For details on each Block Storage type, see Block Storage 생성하기
- Delete on termination: When Delete on Termination is selected, the volume is terminated together with the server
- A volume with an existing snapshot is not deleted even when Delete on termination is enabled
- A multi‑attach volume can be deleted only when the server being removed is the last remaining server attached to the volume
- Max IOPS: Enter a maximum IOPS value between 5,000~20,000
- Not configurable for disk types HDD, HDD_KMS, HDD_MultiAttach
- Max Throughput: Enter a maximum Throughput value between 250~1,000
- Not configurable for disk types HDD, HDD_KMS, HDD_MultiAttach
Server Group Select After selecting Use for setting Anti-Affinity (distributed placement), Affinity (proximate placement), Partition (distributed placement of Virtual Server and Block Storage) for servers belonging to the same Server Group - , select the Server Group
- , choose Create New to create the Server Group
Servers belonging to the same Server Group are placed in a Best Effort manner according to the selected policy The policy can be selected among Anti-Affinity (distributed placement), Affinity (proximate placement), Partition (distributed placement of Virtual Server and Block Storage) Table. Virtual Server service information input itemsCautionWhen using the Partition (distributed placement of Virtual Server and Block Storage) policy among Server Group policies, you cannot allocate additional Block Storage Volumes after creating a Virtual Server, so create all required Block Storage during the Virtual Server creation step. - Required Information Input area, please enter or select the required information.
Category required statusDetailed description Server name Essential Enter a name to distinguish the server when the selected number of servers is 1 - Set the hostname to the entered server name
- Enter using English letters, numbers, spaces, and special characters (
-,_) within 63 characters
Network Settings > Create New Network Port Required Set the network where the Virtual Server will be installed - VPC Name: Select a pre‑created VPC
- General Subnet: Select a pre‑created general Subnet
- IP can be set to auto‑generate or user‑provided; if input is selected, the user can manually enter the IP
- NAT: Available only when there is a single server and the VPC is attached to an Internet Gateway. Checking Use allows selection of a NAT IP
- NAT IP: Select a NAT IP
- If no NAT IP is available, click the Create New button to generate a Public IP
- Refresh button to view and select the created Public IP
- Creating a Public IP incurs charges according to the Public IP pricing policy
- Local Subnet (optional): Select Use for a local Subnet
- It is not a required element for creating the service
- A pre‑created local Subnet must be selected
- IP can be set to auto‑generate or user‑provided; if input is selected, the user can manually enter the IP
- Security Group: Settings required to access the server
- Select: Choose a pre‑created Security Group
- Create New: If no applicable Security Group exists, it can be created separately in the Security Group service
- Up to 5 can be selected
- If no Security Group is set, all connections are blocked by default
- A Security Group must be configured to allow required connections
Network Settings > Specify Existing Network Port Required Set the network where the Virtual Server will be installed - VPC: Select a pre‑created VPC
- General Subnet: Select a pre‑created general Subnet and port
- NAT: Available only when there is a single server and the VPC is attached to an Internet Gateway. When enabled, you can select a NAT IP
- NAT IP: Select a NAT IP
- If no NAT IP is available to select, click the Create New button to generate a Public IP
- Refresh button to view and select the created Public IP
- Local Subnet (Optional): Choose Use for the local Subnet
- Select a pre‑created local Subnet and port
Keypair Required User authentication method to use when connecting to the server - Create new: Create a new one if a new keypair is required
- Refer to Keypair 생성하기
- Default login account list by OS
- Alma Linux: almalinux
- Oracle Linux: cloud-user
- RHEL: cloud-user
- Rocky Linux: rocky
- Ubuntu: ubuntu
- Windows: sysadmin
Table: Required input fields for Virtual Server - Additional Information Input area, please enter or select the required information.
Category required statusDetailed description Lock Select Lock usage setting - Using the lock prevents actions such as server termination, start, stop, etc., from being executed, preventing misoperations due to mistakes
Init script Select the script that runs when the server starts - The init script must be written as a Batch script for Windows, a Shell script or cloud‑init for Linux, depending on the Image type.
- Up to 45,000 bytes can be entered
tag Select Add Tag - Up to 50 can be added per resource
- After clicking the Add Tag button, enter or select Key, Value values
Table. Virtual Server additional information input fields
- Select the required information in the Image and version selection area.
Summary Check the detailed information and estimated charges generated in the panel, and click the Create button.
- Once creation is complete, check the created resources on the Virtual Server List page.
- When entering the server name, if spaces and special characters (
_) are used, the OS hostname will have spaces and special characters (_) changed to the special character (-) and be set. * Refer to this when setting the OS hostname.- Example: If the server name is ‘server name_01’, the OS hostname is set to ‘server-name-01’.
- If you need to manage server names uniquely, specify a different server name (Prefix) when creating them.
- When creating a server, because the numbering does not automatically increment based on the server name (Prefix), a Virtual Server with the same name can be created.
- Example: If you first create two Virtual Servers using the server name (Prefix) ’test’, ’test-1’ and ’test-2’ will be created. * After that, even if you create two Virtual Servers again using the prefix ’test’, ’test-1’ and ’test-2’ will be created.
- If you create a Virtual Server with Rocky Linux or Oracle Linux, additional configuration is required for time synchronization (NTP: Network Time Protocol). * For more details, please refer to Linux NTP 설정하기.
- If RHEL and Windows Server were created before July 2025, you need to modify the RHEL Repository and WKMS (Windows Key Management Service) settings. For detailed information, refer to RHEL Repo 및 WKMS 설정하기.
Check Virtual Server detailed information
The Virtual Server service allows you to view and edit the entire resource list and detailed information. The Virtual Server Details page includes the Details, Monitoring, Tags, Activity Log tabs.
To view detailed information about the Virtual Server service, follow these steps.
- All Services > Compute > Virtual Server menu, click it. 1. Navigate to the Service Home page of the Virtual Server.
- On the Service Home page, click the Virtual Server menu. 2. Go to the Virtual Server List page.
- On the Virtual Server List page, click the resource to view detailed information. 3. Navigate to the Virtual Server Details page.
- Virtual Server Details page displays status information and additional feature information, and consists of Details, Monitoring, Tags, Activity Log tabs.
- For detailed information about Virtual Server Add-on Features, please refer to Virtual Server Management Add-on Features.
Category Detailed description Virtual Server status Status of a user-created Virtual Server - Build: State where a Build command has been received
- Building: Build in progress
- Networking: Server creation networking process
- Scheduling: Server creation scheduling process
- Block_Device_Mapping: Attaching Block Storage during server creation
- Spawning: Server creation process is ongoing
- Active: Available state
- Powering_off: State when a shutdown request is made
- Deleting: Server deletion in progress
- Reboot_Started: Reboot in progress
- Error: Error state
- Migrating: Server is being migrated to another host
- Reboot: Reboot command has been received
- Rebooting: Restart in progress
- Rebuild: Rebuild command has been received
- Rebuilding: State when a Rebuild request is made
- Rebuild_Spawning: Rebuild process is ongoing
- Resize: Resize command has been received
- Resizing: Resize in progress
- Resize_Prep: State when a server type change is requested
- Resize_Migrating: Server is being moved to another host while resizing
- Resize_Migrated: Server has completed moving to another host during resize
- Resize_Finish: Resize completed
- Revert_Resize: Resize or migration of the server failed for some reason. The target server is cleaned up and the original server is restarted
- Shutoff: State when powering off is completed
- Verity_ Resize: After a server type change request and Resize_Prep, the server type is confirmed or can be reverted
- Resize_Reverting: State when a server type revert is requested
- Resize_Confirming: State while confirming the server’s Resize request
Server control Buttons to change server status - Start: Start a stopped server
- Stop: Stop a running server
- Restart: Restart a running server
Image generation Create a user Image from the current server’s Image - For detailed Image creation instructions, see Create Image
Console log View console logs of the current server - You can check the console logs output by the current server. For more details, see 콘솔 로그 확인하기
Create dump Create a dump of the current server - Dump file is created inside the Virtual Server
- Refer to Dump 생성하기 for detailed dump creation instructions
Rebuild The OS area data and settings of the existing Virtual Server are deleted, and it is rebuilt on a new server - For details, see Rebuild 수행하기
Service termination Button to cancel the service Table. Virtual Server status information and additional functions
Detailed Information
Virtual Server List page lets you view detailed information of the selected resource and, if needed, modify the information.
| Category | Detailed description |
|---|---|
| service | Service name |
| Resource Type | Resource Type |
| SRN | Unique resource ID in Samsung Cloud Platform
|
| Resource name | Resource name
|
| Resource ID | Unique resource ID in the service |
| Constructor | User who created the service |
| Creation date and time | Service creation timestamp |
| Modifier | User who edited the service information |
| Modification timestamp | Date and time the service information was modified |
| Server name | Server name
|
| Server type | vCPU, memory information display
|
| Image name | Server OS Image and version
|
| Lock | Show lock usage status
|
| Server group | Server group name to which the server belongs
|
| Keypair name | Server authentication information set by the user
|
| Planned Compute | Resource status with Planned Compute configured
|
| ServiceWatch detailed monitoring | ServiceWatch detailed monitoring activation status display
|
| Network | Network information of the Virtual Server
|
| Local Subnet | Local Subnet information of the Virtual Server
|
| Block Storage | Server-connected Block Storage information
|
Monitoring
You can monitor the ServiceWatch metrics of the resources selected on the Virtual Server List page. In the Monitoring tab, you can view monitoring charts for the Virtual Server, and each chart is based on the available Service Watch metrics.
| Category | Detailed description |
|---|---|
| Period setting area | Select the period applied to the chart
|
| Time Zone Settings Section | Select time zone for the chart |
| Reset button | Reset all manipulations and settings made on the chart |
| Refresh Settings Area | Select chart refresh interval
|
| Go to the service dashboard | Navigate to the ServiceWatch dashboard list screen |
| More | Display additional tasks for managing charts
|
| graph area | Data graph collected over the chart’s applied period
|
- The metrics provided by basic monitoring are data collected at 5‑minute intervals.
- For detailed information about the ServiceWatch metrics of Virtual Server, refer to Virtual Server ServiceWatch metrics.
Tag
Virtual Server List page lets you view the tag information of the selected resource, and you can add, modify, or delete it.
| Category | Detailed description |
|---|---|
| Tag list | Tag list
|
Job History
Virtual Server List page allows you to view the operation history of the selected resource.
| Category | Detailed description |
|---|---|
| Task History List | Resource Change History
|
Control Virtual Server Operation
If you need to control the operation of a created Virtual Server resource, you can perform the task from the Virtual Server List or Virtual Server Details page. You can start, stop, and restart a running server.
Getting Started with Virtual Server
You can start a Virtual Server that is Shutoff. To start the Virtual Server, follow these steps.
- Click the All Services > Compute > Virtual Server menu. 1. Navigate to the Service Home page of the Virtual Server.
- On the Service Home page, click the Virtual Server menu. 2. Go to the Virtual Server List page.
- Virtual Server List page, click the resource to start among the shutoff servers, and navigate to the Virtual Server Details page.
- On the Virtual Server List page, you can Start each resource via the right More button.
- After selecting multiple servers with checkboxes, you can control multiple servers simultaneously using the Start button at the top.
- Virtual Server Details page, click the Start button at the top to start the server. 4. Check the status of the modified server in the Status Indicator item.
- When the Virtual Server start is complete, the server status changes from Shutoff to Active.
- For detailed information about the Virtual Server status, please refer to Virtual Server Detailed Information.
Stop Virtual Server
You can stop a Virtual Server that is running (Active). To stop the Virtual Server, follow the steps below.
- Click the All Services > Compute > Virtual Server menu. 1. Navigate to the Service Home page of the Virtual Server.
- On the Service Home page, click the Virtual Server menu. 2. Go to the Virtual Server List page.
- On the Virtual Server List page, click the resource to stop among the active servers, and proceed to the Virtual Server Details page.
- On the Virtual Server List page, you can stop each resource via the right More button.
- After selecting multiple servers with checkboxes, you can control multiple servers simultaneously using the Stop button at the top.
- On the Virtual Server Details page, click the Stop button at the top to start the server. 4. Check the status of the modified server in the Status Indicator item.
- When the Virtual Server shutdown is complete, the server status changes from Active to Shutoff.
- For detailed information about the Virtual Server status, please refer to Virtual Server Detailed Information.
Restart Virtual Server
You can restart the created Virtual Server. To restart the Virtual Server, follow these steps.
- Click the All Services > Compute > Virtual Server menu. 1. Navigate to the Service Home page of the Virtual Server.
- On the Service Home page, click the Virtual Server menu. 2. Go to the Virtual Server List page.
- On the Virtual Server List page, click the resource to restart, and go to the Virtual Server Details page.
- On the Virtual Server list page, you can restart each resource via the right More button.
- After selecting multiple servers with checkboxes, you can control multiple servers simultaneously using the Restart button at the top.
- On the Virtual Server Details page, click the Restart button at the top to start the server. 4. Check the status of the changed server in the Status Display section.
- During a Virtual Server restart, the server status goes through Rebooting and finally changes to Active.
- For detailed information about the Virtual Server status, please refer to Virtual Server Detailed Information.
Managing Virtual Server Resources
If you need server control and management functions for the created Virtual Server resources, you can perform tasks on the Virtual Server List or Virtual Server Details page.
Create Image
You can create an image of a running Virtual Server.
This content explains how to create a user image on a running Virtual Server.
- On the Virtual Server list or Virtual Server details page, click the Create Image button to create a user Image.
- Refer to Image Detailed Guide: Creating an Image for the method of creating an Image by uploading an Image file owned by the user.
To create an Image of a Virtual Server, follow these steps.
Click the All Services > Compute > Virtual Server menu. 1. Navigate to the Service Home page of Virtual Server.
On the Service Home page, click the Virtual Server menu. 2. Navigate to the Virtual Server List page.
On the Virtual Server List page, click the resource to create an Image. 3. Navigate to the Virtual Server Details page.
On the Virtual Server Details page, click the Create Image button. 4. Go to the Image creation page.
- Enter the required information in the Service Information Input area.
Category required statusDetailed description Image name Required the name of the Image to be created - English letters, numbers, spaces, and special characters (
-,_) within 200 characters
Table. Image service information input fields - English letters, numbers, spaces, and special characters (
- Enter the required information in the Service Information Input area.
Check the input information and click the Complete button.
- When creation is complete, check the created resources on the All Services > Compute > Virtual Server > Image List page.
- When an Image is created, the generated Image is stored in the Object Storage used as internal storage. * Therefore, a usage fee will be charged for image storage.
- Since the file system of an Image created from an Active Virtual Server cannot be guaranteed to be intact, it is recommended to stop the server before creating the Image.
Edit Server Type
You can modify the server type of a Virtual Server.
To modify the server type of a Virtual Server, follow these steps.
- Click the All Services > Compute > Virtual Server menu. 1. Navigate to the Service Home page of the Virtual Server.
- On the Service Home page, click the Virtual Server menu. 2. Go to the Virtual Server List page.
- On the Virtual Server List page, click the resource you want to control. 3. Go to the Virtual Server Details page.
- On the Virtual Server Details page, check the server status and click the Edit button for the server type. 4. Edit Server Type popup window opens.
- Edit Server Type In the popup window, after changing the server type, click the Confirm button.
- If you modify the Virtual Server’s server type, the Virtual Server status changes to a state related to performing a resize.
- For detailed information about the Virtual Server status, please refer to Virtual Server Detailed Information.
Service Architecture Diagram
Provided features
Virtual Server Auto-Scaling provides the following features.
- Launch Configuration: It is a configuration template used to create a Virtual Server in an Auto-Scaling Group. When creating a Launch Configuration, you set information about the Virtual Server such as the image, server type, Key Pair, Block Storage, etc.
- Server Count Adjustment: Provides several ways to adjust the number of servers. By using policies, you can add a Virtual Server when load exceeds a threshold and release the Virtual Server when demand is low, maintaining application availability while reducing costs. You can also add and release Virtual Servers according to a predefined schedule, and manually adjust the number of servers in an Auto-Scaling Group as needed.
- Load Balancer Integration: You can use a Load Balancer to evenly distribute application traffic across Virtual Servers. Whenever a Virtual Server is added or removed, it is automatically registered with or deregistered from the Load Balancer.
- Network Connection: You can connect the Auto-Scaling Group’s standard subnet, automatic IP allocation, and Public NAT IP. Provides a local subnet connection for inter-server communication.
- Security Group Application: Security Group is a virtual logical firewall that controls inbound/outbound traffic generated on a Virtual Server. Inbound rules control incoming traffic to the Virtual Server, and outbound rules control outgoing traffic from the Virtual Server.
- Monitoring: You can view monitoring data such as CPU, Memory, and Disk of Virtual Servers created in an Auto-Scaling Group via the ServiceWatch service. Based on the monitoring data, you can set load thresholds using Auto-Scaling policies, and when thresholds are exceeded, you can add or remove servers.
Component
Virtual Server Auto-Scaling creates an Auto-Scaling Group through a Launch Configuration and monitors and manages the servers.
Launch Configuration
This is a Configuration template used to create a Virtual Server in an Auto-Scaling Group. The main features are as follows.
- Image: Provides OS standard images and Custom images created by the user. Users can select and use them according to the service they wish to configure.
- Keypair: Provides the Keypair method for secure OS access.
- Init script: Users can define a script to be executed when the Virtual Server starts.
- For more details, refer to Launch Configuration Create.
Auto-Scaling Group
Launch Configuration is used as a pre-configuration template for creating servers. By creating an Auto-Scaling Group, you can adjust and manage the number of servers. The main features are as follows.
- Launch Configuration: A Configuration template used to create a Virtual Server in an Auto-Scaling Group.
- Server Count Setting: Virtual Server Auto-Scaling provides several ways to adjust the number of servers in an Auto-Scaling Group.
- Fixed Server Count Method: When creating an Auto-Scaling Group, this method keeps the default settings using the configured number of servers without any additional schedules or policies. Refer to Create Auto-Scaling Group to set the Min, Desired, and Max server counts.
- Server Count Manual Adjustment Method: In an Auto-Scaling Group, you can increase or decrease the number of servers by modifying the server count to the desired amount. You can choose whether to manually set the desired server count. Please refer to 서버 수 수정하기.
- Schedule reservation method: You can schedule daily, weekly, monthly, or one-time, and set the desired number of servers at the specified time. This is useful when you can predict when to scale the number of servers up or down. If you use the schedule method, please refer to Manage Schedules to add and manage schedules.
- Policy Mode: You can use a policy to dynamically adjust servers. When a monitoring metric exceeds a configured threshold, the number of servers is adjusted. At this time, you can choose one of three methods to adjust the server count: increase or decrease by a specified number, increase or decrease by a specified percentage, or fix the server count to a given value. When servers are started or terminated due to a policy, the monitoring metric (CPU utilization) may temporarily exceed the threshold registered in the policy. However, because this is a brief moment, a cooldown period is set to avoid treating it as an abnormal situation. If you want to use the policy mode, see Managing Policies.
- Load Balancer: Automatically connects and disconnects to the Load Balancer registered in the Auto-Scaling Group whenever a Virtual Server is added or terminated.
- Refer to Auto-Scaling Group Detailed Information for detailed information about the Load Balancer of an Auto-Scaling Group.
Constraints
The constraints of Virtual Server Auto-Scaling are as follows.
| Category | Explanation |
|---|---|
| Number of Virtual Servers per Auto-Scaling Group | 50 or fewer |
| Number of policies per Auto-Scaling Group | 12 or fewer |
| Number of schedules per Auto-Scaling Group | 20 or fewer |
| Number of LB server groups and ports per Auto-Scaling Group | 3 or fewer |
- If the Image you are using is a discontinued standard Image, Scale-out will not work.
If the Image you are using is Custom Image, Scale out will continue to operate correctly even after that version is discontinued. - We recommend replacing the Launch Configuration with the latest version of the Image or a Custom Image before the current Image reaches end of support.
- For detailed information about the OS Image provided by Virtual Server, see OS Image Provided Versions.
Prior Service
This is a list of services that must be pre-configured before creating the service. Please refer to the guide provided for each service and prepare in advance.
| Service Category | service | Detailed description |
|---|---|---|
| Networking | VPC | A service that provides an isolated virtual network in a cloud environment |
| Networking | Security Group | Virtual firewall that controls server traffic |
2.1.1 - Monitoring Metrics
According to Samsung Cloud Platform’s policy, the Cloud Monitoring service is scheduled to be discontinued in September 2026.
Accordingly, after the September 2026 release, resource monitoring of the Samsung Cloud Platform via Cloud Monitoring will no longer be possible.
With the new alternative service, you can continuously perform resource monitoring by using ServiceWatch, released in October 2025.
ServiceWatch provides more modern and powerful features, replacing Cloud Monitoring to deliver a smooth monitoring environment.
Detailed information about ServiceWatch is available in the ServiceWatch Overview.
Virtual Server Auto-Scaling is a service provided for Virtual Servers that offers individual Virtual Server monitoring metrics and monitoring metrics provided by policies based on Cloud Monitoring.
Virtual Server Monitoring Metrics
The table below shows the monitoring metrics of Virtual Server that can be viewed through Cloud Monitoring. For detailed usage of Cloud Monitoring, refer to the Cloud Monitoring guide.
Memory-related metrics are not provided for Windows OS.
| Performance items | Detailed description | unit |
|---|---|---|
| Memory Total [Basic] | bytes of usable memory | bytes |
| Memory Used [Basic] | bytes of currently used memory | bytes |
| Memory Swap In [Basic] | bytes of the replaced memory | bytes |
| Memory Swap Out [Basic] | bytes of the replaced memory | bytes |
| Memory Free [Basic] | bytes of unused memory | bytes |
| Disk Read Bytes [Basic] | Read bytes | bytes |
| Disk Read Requests [Basic] | Number of read requests | cnt |
| Disk Write Bytes [Basic] | write bytes | bytes |
| Disk Write Requests [Basic] | Number of write requests | cnt |
| CPU Usage [Basic] | Average system CPU usage over 1 minute | % |
| Instance State [Basic] | Instance status | state |
| Network In Bytes [Basic] | Received bytes | bytes |
| Network In Dropped [Basic] | Incoming packet drop | cnt |
| Network In Packets [Basic] | Number of received packets | cnt |
| Network Out Bytes [Basic] | sent bytes | bytes |
| Network Out Dropped [Basic] | Transmit packet drop | cnt |
| Network Out Packets [Basic] | Number of transmitted packets | cnt |
Monitoring metrics provided by Cloud Monitoring-based policies
The table below shows the monitoring metrics provided by the policy of a Cloud Monitoring‑based Auto‑Scaling Group. For detailed information on policy settings based on Cloud Monitoring, see 정책 관리하기.
| Performance items | Detailed description | unit |
|---|---|---|
| CPU Usage [Basic] | Average system CPU usage over 1 minute | % |
| Memory Used [Basic] | bytes of currently used memory | bytes |
| Network In Bytes [Basic] | Received bytes | bytes |
| Network In Packets [Basic] | Number of received packets | cnt |
| Network Out Bytes [Basic] | sent bytes | bytes |
| Network Out Packets [Basic] | Number of transmitted packets | cnt |
2.1.2 - ServiceWatch Metrics
Virtual Server Auto-Scaling is a service provided for Virtual Servers that offers individual Virtual Server monitoring metrics and monitoring metrics supplied by ServiceWatch-based policies.
Virtual Server Monitoring Metrics
The basic metrics provided by Virtual Server can be found in ServiceWatch > Virtual Server Basic Metrics.
ServiceWatch monitoring metrics provided by the Auto-Scaling Group policy
The table below shows the ServiceWatch monitoring metrics provided by the Auto-Scaling Group policy. For detailed information on configuring Auto-Scaling Group policies, see Managing Policies.
| Performance items | Detailed description | unit |
|---|---|---|
| CPU Usage | CPU usage | Percent |
| Network In Bytes | Received bytes on the network interface | Bytes |
| Network In Packets | Number of packets received on the network interface | Count |
| Network Out Bytes | Data transmitted on the network interface (bytes) | Bytes |
| Network Out Packets | Number of packets transmitted on the network interface | Count |
2.2 - How-to guides
Users can create the service by entering the required information for an Auto-Scaling Group and selecting detailed options through the Samsung Cloud Platform Console.
Create Auto-Scaling Group
You can create and use the Auto-Scaling Group service from the Samsung Cloud Platform Console.
To create an Auto-Scaling Group, follow these steps.
Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
Click the Auto-Scaling Group menu. You will be taken to the Auto-Scaling Group List page.
On the Auto-Scaling Group List page, click the Create Auto-Scaling Group button. You will be taken to the Create Auto-Scaling Group page.
On the Auto-Scaling Group Creation page, enter the information required to create the service.
- In the Launch Configuration area, select the Launch Configuration.
- Launch Configuration creation button can be clicked to create a new Launch Configuration.
- Enter or select the required information in the Service Information Input area.
Category Whether requiredDetailed description Auto-Scaling Group name Required Auto-Scaling Group name - Manage servers of the same type and purpose as a group
Server name Required Server name to be created within the Auto-Scaling Group - The server name is automatically assigned as a combination of the entered server name and a sequence, serving as an identifier to distinguish servers created within the Auto-Scaling Group.
Number of servers Required Number of servers to create in the Auto-Scaling Group - Enter a value between 0 and 20 (Min≤Desired≤Max)
- Min: Set the minimum number of servers the Auto-Scaling Group should maintain
- Desired: Set the target number of servers within the Auto-Scaling Group; also indicates the number of servers initially created when the Auto-Scaling Group is created
- Max: Set the maximum number of servers the Auto-Scaling Group can maintain
- After creating the Auto-Scaling Group, you can adjust the settings using the Edit button. For details, see Modify Server Count
Desired server count manual setting Selection Select whether to manually change the Desired number of servers - Even after creating the Auto-Scaling Group, configure it via the Edit button. For details, refer to Manually edit Desired server count
Network Settings > Network Settings Required Network settings for Auto-Scaling Group - Select the desired VPC and standard Subnet
- IP can only be auto-generated.
- When you select local Subnet, you can choose the desired local Subnet, and IP can only be auto-generated
Network Settings > Security Group Selection To allow required connections, you need to configure a Security Group - If you do not configure a Security Group, all inbound and outbound traffic will be blocked according to the default rule (Any/Deny)
- For Linux servers, allow SSH traffic
- For Windows servers, allow RDP traffic
- Even after creating an Auto-Scaling Group, configure it using the modify button. For details, refer to Configure Security Group
Load Balancer Selection Connect the Auto-Scaling Group to a Load Balancer - Register the Auto-Scaling Group’s servers as members of the LB server group
- LB server group: Select a LB server group that exists in the chosen VPC
- Port: Enter a value between 1 and 65,534
- Press the + button to add an LB server group (up to a total of three LB server groups and ports are allowed)
- LB server groups that use Weighted Round Robin or Weighted Least Connection load balancing cannot be selected
- Draining Timeout value: After checking Draining Timeout as Enabled, you can set the Draining Timeout value
- Draining Timeout: The waiting period before removing a server from the Load Balancer
- Since sessions may remain connected to the server, setting a Draining Timeout and waiting allows safe session termination
- If the Load Balancer is unused, Draining Timeout cannot be set
- The default is 300 seconds; you can enter a value from 1 second up to 3,600 seconds.
- Draining Timeout: The waiting period before removing a server from the Load Balancer
- Changes can be made after creating the Auto-Scaling Group; for details, see Using Auto-Scaling Group Load Balancer
Table. Auto-Scaling Group service information input items - In the Scaling policy setting area, set the scaling policy.
- For detailed information on policy settings, refer to Add Policy.
Category RequiredDetailed description Current setting Selection Set scaling policy now - Add Policy button click shows policy information input fields
Set later Selection After creating an Auto-Scaling Group, set policies on the detail information page Table. Auto-Scaling Group scaling policy configuration items
- For detailed information on policy settings, refer to Add Policy.
- In the Notification Settings area, configure the notification recipients and notification method.
- For detailed information on notification settings, see Add Notification.
Category Required?Detailed description Current setting Selection Set the notification recipients and notification method now - Click the Add Notification button to open the Add Notification popup
- For detailed information about notification settings, see the details
- In the notification recipients list, click the Edit button to modify the notification information
Set later Selection After creating an Auto-Scaling Group, set the notification recipients and notification method on the details page. Table. Auto-Scaling Group notification configuration items
- For detailed information on notification settings, see Add Notification.
- In the Additional Information Input area, enter or select the required information.
Category Required statusDetailed description Status check Selection Check the status of Virtual Server and Load Balancer and replace servers that are Unhealthy - Virtual Server status check: Enabled is the default and cannot be changed
- Load Balancer status check: Enabled only when a Load Balancer is connected in the service information input area
- Grace period: Set the time to defer status checks until the newly added server operates normally
- Enter 0 to disable the status check grace period
tag Selection Add Tag - Up to 50 per resource can be added
- After clicking the Add Tag button, enter or select Key, Value values
Table. Auto-Scaling Group additional information input fieldsCautionA server that the user changes to Stop is not considered Unhealthy, so even when using the health check feature, the server will not be replaced.
- In the Launch Configuration area, select the Launch Configuration.
Summary Check the detailed information and estimated billing amount generated in the panel, and click the Create button.
- When creation is complete, check the created Auto-Scaling Group on the Auto-Scaling Group list page.
Check detailed information of Auto-Scaling Group
Auto-Scaling Group service allows you to view and edit the full resource list and detailed information. The Auto-Scaling Group Details page is composed of Details, Policies, Schedule, Virtual Server, Load Balancer, Tags, Operation History tabs.
To view detailed information about the Auto-Scaling Group, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. You will be taken to the Auto-Scaling Group List page.
- On the Auto-Scaling Group List page, click the resource to view detailed information. You will be taken to the Auto-Scaling Group Details page.
- Auto-Scaling Group Details page displays status information and additional feature information, and consists of Details, Policies, Schedule, Virtual Server, Load Balancer, Tags, Activity History tabs.
Category Detailed description Auto-Scaling Group status Status of the Auto-Scaling Group created by the user - Creating: Creating Auto-Scaling Group
- In Service: Serviceable state
- Scale In: Scale In in progress
- Scale Out: Scale Out in progress
- Cool Down: Cooling down
- Terminating: Terminating Auto-Scaling Group
- Attach to LB: Attaching to Load Balancer
- Detach from LB: Detaching from Load Balancer
Delete Auto-Scaling Group Button to delete Auto-Scaling Group Table. Auto-Scaling Group status information and additional features
- Auto-Scaling Group Details page displays status information and additional feature information, and consists of Details, Policies, Schedule, Virtual Server, Load Balancer, Tags, Activity History tabs.
Detailed Information
Auto-Scaling Group List page lets you view detailed information of the selected resource and modify it if necessary.
| Category | Detailed description |
|---|---|
| service | Service name |
| Resource Type | Resource Type |
| SRN | Unique resource ID in Samsung Cloud Platform
|
| Resource name | Resource name
|
| Resource ID | Unique resource ID in the service |
| constructor | User who created the service |
| Creation date and time | Service creation date and time |
| editor | User who edited the service information |
| Modification date | Date and time the service information was modified |
| Auto-Scaling Group name | Auto-Scaling Group name |
| Launch Configuration name | When creating an Auto-Scaling Group, the selected Launch Configuration name
|
| Number of servers | Current number of instances in the Auto-Scaling Group and the configured Min, Desired, Max instance counts
|
| Manually set the desired number of servers | Desired server count manual setting enable/disable
|
| VPC | VPC information of the Auto-Scaling Group |
| General Subnet | General Subnet and NAT IP usage information for the Auto-Scaling Group |
| Local Subnet | Local Subnet of Auto-Scaling Group |
| Security Group | Auto-Scaling Group’s Security Group
|
| Status check | Whether to use the replacement function for Virtual Server and Load Balancer in Unhealthy state
|
Policy
On the Auto-Scaling Group List page, you can view the policy list of the selected resource and, if needed, add or manage policies.
| Category | Detailed description |
|---|---|
| Category | Policy Type
|
| Policy Name | Name for policy-specific categorization |
| Execution conditions | Conditions to trigger the policy
|
| execution unit | Method of executing the policy
|
| Cool down | The waiting time (seconds) when a server is started or terminated due to a policy
|
| More > Edit | Modify the policy information
|
| More > Enable | Enable this policy
|
| More > Disable | Disable the policy
|
Schedule
On the Auto-Scaling Group List page, you can view the schedule list of the selected resource and, if necessary, add or manage schedules.
| Category | Detailed description |
|---|---|
| Name | Schedule name |
| Min | Minimum number of servers set in the schedule |
| Desired | Number of target servers set in the schedule |
| Max | Maximum number of servers set in the schedule |
| period | Schedule execution frequency
|
| Date/Day of Week | Schedule execution date or day of week
|
| Execution time | Schedule execution time |
| time zone | Schedule execution time window |
| status | Schedule status |
| More > Edit | Edit the schedule information |
| More > Activate | Activate this schedule
|
| More > Disable | Deactivate this schedule
|
| Add schedule | Add a new schedule |
| Delete | Delete the selected schedule from the list |
Virtual Server
Auto-Scaling Group List You can view the Virtual Server list of the selected resource on the page.
| Category | Detailed description |
|---|---|
| Server name | If you click the Server Name of the server created in the Auto-Scaling Group
|
| IP | IP assigned to the server |
| Creation date and time | The date and time the server was created |
| status | Result of status check for Virtual Server and Load Balancer
|
Load Balancer
On the Auto-Scaling Group List page, you can view the Load Balancer list of the selected resource, and, if needed, add or manage Load Balancers.
| Category | Detailed description |
|---|---|
| Draining Timeout | Draining Timeout usage
|
| Load Balancer | Load Balancer usage
|
| Load Balancer > Load Balancer name | Load Balancer name to attach to the Auto-Scaling Group |
| Load Balancer > LB server group | LB server group of the Load Balancer
|
| Load Balancer > Port | Port registered as a member in the LB server group |
- The LB server group member information configured in the Auto-Scaling Group can also be viewed in LB Server Group Connected Resources.
- Also, if manual connection/disconnection between the server and the Load Balancer is required, see Add LB Server Group Member.
Notification
Auto-Scaling Group List page allows you to view the notification recipients and notification methods for the selected resource.
| Category | Detailed description |
|---|---|
| Notification recipient | Name of the notification recipient |
| Notification recipient’s email | |
| Create server | Whether to send notifications when a server creation-related alert occurs
|
| Server termination | Whether to send a notification when a server termination-related alert occurs
|
| When the policy execution condition is satisfied | Whether to generate a notification when the policy execution condition is satisfied |
| status | Notification activation status
|
| More > Edit | Edit the notification information |
| Show more > Enable | Enable this notification information
|
| More > Disable | Disable this notification information
|
| Add notification | Add a new notification |
| Delete | Delete the selected notification from the list |
tag
Auto-Scaling Group List page lets you view the tag information of selected resources, and add, modify, or delete them.
| Category | Detailed description |
|---|---|
| Tag list | Tag list
|
Work History
You can view the operation history of the selected resource on the Auto-Scaling Group List page.
| Category | Detailed description |
|---|---|
| Task History List | Resource Change History
|
Auto-Scaling Group Managing Resources
If you need to manage a created Auto-Scaling Group, you can perform actions on the Auto-Scaling Group Details page.
Edit Launch Configuration
You can modify the Launch Configuration of an Auto-Scaling Group.
To modify the Launch Configuration of an Auto-Scaling Group, follow these steps.
Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
Click the Auto-Scaling Group menu. You will be taken to the Auto-Scaling Group list page.
On the Auto-Scaling Group List page, click the resource you want to edit the Launch Configuration for. You will be taken to the Auto-Scaling Group Details page.
Launch Configuration name. Click the Edit button. The Launch Configuration Edit popup opens. You can view the list of selectable Launch Configurations.
CategoryDetailed description Launch Configuration name Launch Configuration name image Launch Configuration OS Image Server type Launch Configuration server type Block Storage Launch Configuration Block Storage Settings Number of Auto-Scaling Groups Number of Auto-Scailg Groups with Launch Configuration applied View Details Button to view Launch Configuration details Table. Launch Configuration list itemsIn the Launch Configuration Edit popup, select the Launch Configuration to modify, then click the Confirm button. The Launch Configuration Edit Notification popup opens. Check the message in the Launch Configuration Edit Notification popup and click the Confirm button.
Edit server count
You can modify the number of servers in the Auto-Scaling Group.
To modify the number of servers in an Auto-Scaling Group, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. You will be taken to the Auto-Scaling Group list page.
- On the Auto-Scaling Group List page, click the resource to edit the server count. You will be taken to the Auto-Scaling Group Details page.
- Click the Edit button for server count. The Edit Server Count popup opens.
- Modify Server Count In the popup window, after entering the required fields, click the Confirm button.
Category RequiredDetailed description Number of servers > Min Required Minimum number of servers to modify - Set the number of servers that the Auto-Scaling Group should maintain at a minimum
Number of servers > Desired Required Target server count to modify - Set the target server count within the Auto-Scaling Group
- Desired server count manual setting is disabled; you cannot modify the Desired server count. To modify the Desired server count, see Desired server count manual setting modification
Number of servers > Max Required Target number of servers to modify - Set the maximum number of servers the Auto-Scailg Group can maintain
Table. Auto-Scailg Group server count modification items
Terminate Virtual Server Created by Auto-Scaling Group
Virtual Servers created in an Auto-Scaling Group can be terminated by clicking the terminate button on the Virtual Server’s detail page or by modifying the server count.
- Be careful, as data cannot be recovered after terminating the service.
- To cancel the service, first disconnect the File Storage and disable the Lock. If the File Storage is connected or a Lock is set, you cannot cancel the Virtual Server.
- If you terminate a Virtual Server created in an Auto-Scaling Group, the associated Load Balancer is automatically detached.
- When a Virtual Server is terminated, the status of the attached Storage is as follows.
- If Delete on termination is not set: the volume will not be deleted even if the Virtual Server is terminated.
- When Delete on termination is set: terminating the Virtual Server deletes the volume. However, if a Snapshot exists, the volume is not deleted even when Delete on termination is set.
- Multi attach volume: It can be deleted only when the server you are deleting is the last remaining server attached to the volume.
Cancel on the Virtual Server detail page
To cancel a Virtual Server on the Virtual Server detail page, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- On the Service Home page, click the Virtual Server menu. You will be taken to the Virtual Server List page.
- On the Virtual Server List page, select the Virtual Server created by the Auto-Scaling Group. Navigate to the Virtual Server Details page.
- Click the Cancel Service button on the Virtual Server Details page. A popup confirming the server termination will appear.
- Click the Confirm button. Server termination is complete.
Cancel by adjusting the server count
To cancel a Virtual Server by modifying the server count, follow the steps below.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. Navigate to the Auto-Scaling Group List page.
- On the Auto-Scaling Group List page, click the resource to terminate the Virtual Server. You will be taken to the Auto-Scaling Group Details page.
- Click the Edit button of the Server Count item. The Server Count Edit popup opens.
- Server Count Edit In the popup window, after reducing the Desired number, click the Confirm button. As the Desired server count is adjusted, the Virtual Server will be terminated.
- If Desired server count manual setting is set to unused, you cannot modify the Desired server count. To modify the Desired server count, see Desired server count manual setting modification.
- When a Virtual Server is terminated, the Desired server count remains unchanged, and scale‑out proceeds according to the Desired check batch.
Modify manual setting of desired server count
You can manually change the Desired server count of the Auto-Scaling Group.
To modify the manual setting of the Desired server count for an Auto-Scaling Group, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. You will be taken to the Auto-Scaling Group List page.
- On the Auto-Scaling Group List page, click the resource to manually change the Desired server count. You will be taken to the Auto-Scaling Group Details page.
- Click the Edit button for the server count. The Desired server count manual setting popup opens.
- Desired server count manual setting After selecting whether to use it in the popup window, click the Confirm button.
Configure Security Group
You can configure the Security Group of an Auto-Scaling Group.
Follow these steps to configure the Security Group of an Auto-Scaling Group.
Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
Click the Auto-Scaling Group menu. You will be taken to the Auto-Scaling Group List page.
On the Auto-Scaling Group List page, click the resource to configure the Security Group. You will be taken to the Auto-Scaling Group Details page.
Click the Edit button for Security Group. Security Group Edit popup opens. You can view the list of selectable Security Groups.
Category Detailed description Security Group name Security Group name Table. Security Group list itemsSecurity Group edit In the popup window, after selecting the Security Group, click the Confirm button. Security Group edit notification popup opens. Check the message of the Security Group edit notification popup and click the Confirm button.
Change status check
You can configure it to check the status of Virtual Server and Load Balancer and automatically replace servers that are in an Unhealthy state.
- The Load Balancer status check feature can only be used when a Load Balancer is in use.
- A server that the user changes to Stop is not considered Unhealthy, so even if you use the health check feature, the server will not be replaced.
- Click the All Services > Compute > Virtual Server menu. Go to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. Go to the Auto-Scaling Group list page.
- On the Auto-Scaling Group List page, click the resource for which you want to change the Load Balancer health check function. You will be taken to the Auto-Scaling Group Details page.
- Click the Edit button of Status Check. Status Check Edit popup window opens.
- Status Check Edit In the popup, choose whether to enable Load Balancer Status Check.
- After entering the Grace period, click the Confirm button.
- If you do not want to use the grace period, set the grace period to 0.
Manage Additional Information for Auto-Scaling Group
You can set the Load Balancer of an Auto-Scaling Group to enabled and select the LB server group. For an Auto-Scaling Group that is using a Load Balancer, you can change it to disabled.
Modify Load Balancer Draining Timeout
You can set the Load Balancer Draining Timeout of an Auto-Scaling Group.
The Draining Timeout is the time to wait before detaching a server from the Load Balancer.
- Because sessions may remain connected to the server, setting a Draining Timeout and waiting allows you to clean up sessions more safely.
- If the Load Balancer is unused, Draining Timeout cannot be set.
- The default is 300 seconds, and it can be set from a minimum of 1 second up to a maximum of 3,600 seconds.
To set the Load Balancer Draining Timeout of an Auto-Scaling Group, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. You will be taken to the Auto-Scaling Group List page.
- Auto-Scaling Group List page, click the resource to set the Load Balancer Draining Timeout. You will be taken to the Auto-Scaling Group Details page.
- Click the Load Balancer tab. You will be taken to the Load Balancer list page.
- Click the Edit button for Draining Timeout. The Draining Timeout Edit popup opens.
- Draining Timeout Edit In the popup window, select whether to use Draining Timeout, and enter the Draining Timeout duration (seconds).
- Draining Timeout Edit After verifying the input values in the popup window, click the Confirm button. Draining Timeout Edit Notification popup window opens. Check the message in the notification popup and click the Confirm button.
Using Load Balancer
You can modify the Load Balancer of an Auto-Scaling Group. To configure the Load Balancer of an Auto-Scaling Group, follow these steps.
- When a server in the Auto-Scaling Group is created, it is automatically added as a member of the selected Load Balancer’s LB server group, and when the server is terminated, it is removed from the LB server group’s members.
- If Draining Timeout is used, after waiting for the Draining Timeout (seconds), the server is removed from the LB server group’s members.
- When a Member is separated due to Load Balancer modification, it waits in the Detach from LB state; when a Member is separated due to Scale In, it waits in the Scale In state.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. Navigate to the Auto-Scaling Group List page.
- On the Auto-Scaling Group List page, click the resource that will use the Load Balancer. Navigate to the Auto-Scaling Group Details page.
- Click the Load Balancer tab. You will be taken to the Load Balancer list page.
- Click the Edit button of the Load Balancer. The Load Balancer Edit popup opens.
- Load Balancer Edit Please select whether to use it in the popup. Selecting Use allows you to choose a Load Balancer.
Category Detailed description LB server group LB server group name - Select the LB server group that is created in the selected VPC
- LB server groups that use Weighted Round Robin or Weighted Least Connection load balancing cannot be selected
Port Port information of the LB server group - When registering as a member of the LB server group, enter the required port information in the registration details.
- Enter a value between 1 and 65,534
Table. Load Balancer list items- Click the + button to add an LB server group. Up to three are allowed. Click the X button to remove the added Load Balancer.
- Check the Load Balancer list and click the Confirm button. Load Balancer Modification Notification popup opens. Verify the message in the notification popup and click the Confirm button.
- Please be careful, as separating or connecting servers from the Load Balancer may affect the service.
- If Draining Timeout is in use, setting the Load Balancer to unused or removing some of the attached Load Balancers with the X button will not detach immediately. After waiting for the Draining Timeout (seconds), the server is detached from the Load Balancer. At that time, the Auto-Scaling Group remains in the Detach from LB state.
- The LB server group member information configured in the Auto-Scaling Group can also be viewed in LB Server Group Connected Resources.
- Also, if manual connection/disconnection between the server and Load Balancer is required, please refer to Register LB Server Group Member.
Not Using Load Balancer
You can modify the Auto-Scaling Group’s Load Balancer to be unused. To set the Auto-Scaling Group’s Load Balancer to unused, follow these steps.
- Please be careful as there may be service impact when separating or connecting servers from the Load Balancer.
- If Draining Timeout is in use, setting the Load Balancer to unused or removing some of the attached Load Balancers with the X button will not detach immediately. After waiting for the Draining Timeout (seconds), the server is detached from the Load Balancer. At that time, the Auto-Scaling Group remains in the Detach from LB state.
- Click the All Services > Compute > Virtual Server menu. Go to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. You will be taken to the Auto-Scaling Group List page.
- On the Auto-Scaling Group List page, click the resource to change the Load Balancer to unused. You will be taken to the Auto-Scaling Group Details page.
- Click the Load Balancer tab. You will be taken to the Load Balancer list page.
- Click the Edit button of the Load Balancer. The Load Balancer Edit popup opens.
- Load Balancer edit Choose whether to use it in the popup window. If you deselect Use, Load Balancer will no longer be used.
- Use deselection, and click the Confirm button. Load Balancer Modification Notification popup opens. Verify the message in the notification popup and click the Confirm button.
Delete Auto-Scaling Group
Deleting an unused Auto-Scaling Group can reduce operational costs. However, terminating an Auto-Scaling Group may cause the running service to stop immediately, so you should thoroughly consider the impact of service interruption before proceeding with the termination.
To terminate an Auto-Scaling Group, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. Navigate to the Auto-Scaling Group List page.
- On the Auto-Scaling Group List page, click the resource to terminate. Navigate to the Auto-Scaling Group Details page.
- Click the Auto-Scaling Group Delete button.
- After the deletion is complete, verify that the resource has been removed on the Auto-Scaling Group List page.
2.2.1 - Launch Configuration
To create an Auto-Scaling Group, a Launch Configuration must be created in advance.
Create Launch Configuration
You can create and use the Launch Configuration service in the Samsung Cloud Platform Console.
To create a Launch Configuration, follow these steps.
Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
Click the Launch Configuration menu. Navigate to the Launch Configuration List page.
On the Launch Configuration list page, click the Create Launch Configuration button. You will be taken to the Create Launch Configuration page.
On the Launch Configurationp Creation page, in the Image and Version Selection area, select the required information and click the Next button.
ReferencePlease refer to the server type in Virtual Server OS Image Provisioning Versions for images selectable in Launch Configuration.Enter the required information in the Service Information Input area of the Launch Configurationp Creation page.
Category Required statusDetailed description Launch Configuration name Required Launch Configuration name - Name used to distinguish the Launch Configuration
Service Type > Server Type Required Launch Configuration server type - Standard: Standard specifications commonly used
- High Capacity: Large-scale server specifications beyond Standard
- Refer to Virtual Server Server Type for selectable server types
Block Storage Required Configure Block Storage according to the purpose of the Launch Configuration - Basic OS: The area where the OS is installed and used
- Enter the size in Units, and the minimum size varies depending on the OS image type
- Alma Linux: Enter a value between 2 and 1,536
- Oracle Linux: Enter a value between 5 and 1,536
- RHEL: Enter a value between 2 and 1,536
- Rocky Linux: Enter a value between 2 and 1,536
- Ubuntu: Enter a value between 1 and 1,536
- Windows: Enter a value between 4 and 1,536
- SSD: High‑performance general volume
- HDD: General volume
- SSD/HDD_KMS: Additional encrypted volume using Samsung Cloud Platform KMS (Key Management System) encryption keys
- Encryption can be applied only at initial creation and cannot be changed afterward
- Performance degradation occurs when using the SSD_KMS disk type
- Enter the size in Units, and the minimum size varies depending on the OS image type
- Additional: Use when extra space beyond the OS area is needed
- After selecting Use, enter the storage type and size
- Click the + button to add storage, or the x button to delete (up to 25 can be added)
- Enter the size in Units, with a value between 1 and 1,536
- Since 1 Unit equals 8 GB, this creates 8 GB to 12,288 GB
- SSD: High‑performance general volume
- HDD: General volume
- SSD/HDD_KMS: Additional encrypted volume using Samsung Cloud Platform KMS (Key Management System) encryption keys
- Encryption can be applied only at initial creation and cannot be changed afterward
- Performance degradation occurs when using the SSD_KMS disk type
- SSD_MultiAttach: Volume that can be attached to more than one server
- For details on each Block Storage type, see Create Block Storage
Keypair Required Select the user authentication method for the Launch Configuration - Server authentication information for accessing the servers created by generating an Auto-Scaling Group with a Launch Configuration
- New creation: Create a new one if a new Keypair is required
- Refer to Create Keypair for how to create a new Keypair
- Default login account list by OS
- Alma Linux: almalinux
- RHEL: cloud-user
- Rocky Linux: rocky
- Ubuntu: ubuntu
- Windows: sysadmin
File Storage Settings Selection Scale-out/Scale-in can be set to automatically connect File Storage - Volume: After clicking the Select button, choose the Volume to use in the Select File Storage popup
- Up to 5 can be selected
- The selected Volume can be deleted by clicking the X button
Table. Launch Configuration service information input itemsOn the Launch Configuration Creation page, after entering information in the Additional Information Input area, click the Next button.
Category RequiredDetailed description Init Script Selection Script that runs when the server of an Auto Scaling Group using a Launch Configuration starts - Enter within 45,000 bytes
- The init script must be a batch script for Windows, a shell script for Linux, or cloud‑init, depending on the selected image.
tag Selection Add Tag - Up to 50 can be added per resource
- After clicking the Add Tag button, enter or select Key, Value values
Table. Launch Configuration additional information input fieldsVerify creation information Check the information you entered on the page and the estimated amount, and click the Create button.
- When creation is complete, verify the created Launch Configuration on the Launch Configuration List page.
View Launch Configuration Details
The Launch Configuration service allows you to view and edit the full list of resources and detailed information. Launch Configuration Details page consists of Details, Tags, Activity Log tabs.
To view detailed information of the Launch Configuration, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Launch Configuration menu. Navigate to the Launch Configuration List page.
- On the Launch Configuration List page, click the resource you want to view detailed information for. You will be taken to the Launch Configuration Details page.
- Launch Configuration Details At the top of the page, status information and additional feature information are displayed, and it consists of Details, Tags, Activity History tabs.
Category Detailed description Launch Configuration status User-created Launch Configuration status - Active: Available
Delete Launch Configuration Delete Launch Configuration button Table. Launch Configuration status information and additional features
- Launch Configuration Details At the top of the page, status information and additional feature information are displayed, and it consists of Details, Tags, Activity History tabs.
Detailed information
Launch Configuration list page allows you to view detailed information of the selected resource and edit the information if needed.
| Category | Detailed description |
|---|---|
| service | Service name |
| Resource Type | Resource Type |
| SRN | Unique resource ID in Samsung Cloud Platform
|
| Resource name | Resource name
|
| Resource ID | Unique resource ID in the service |
| constructor | User who created the service |
| Creation date and time | Service creation date and time |
| editor | User who edited the service information |
| Modification date | Date and time the service information was modified |
| Launch Configuration name | Launch Configuration name |
| image | When creating a Launch Configuration, the selected image name
|
| Number of Auto-Scaling Groups | Number of Auto Scaling Groups using a Launch Configuration |
| Server type | Server type set in Launch Configuration |
| Block Storage | Block Storage information per server configured in the Launch Configuration
|
| Keypair | Server authentication information set in the Launch Configuration
|
| Init Script | The Init Script set in the Launch Configuration
|
| File Storage Settings | File Storage Volume name set in the Launch Configuration
|
tag
Launch Configuration List page lets you view the tag information of the selected resource, and you can add, modify, or delete it.
| Category | Detailed description |
|---|---|
| Tag list | Tag list
|
Work History
You can view the operation history of the selected resource on the Launch Configuration List page.
| Category | Detailed description |
|---|---|
| Task History List | Resource Change History
|
Delete Launch Configuration
Deleting unused Launch Configurations can reduce operational costs. However, terminating a Launch Configuration may cause running services to stop immediately, so you should carefully consider the impact of service interruption before proceeding with the termination.
To delete a Launch Configuration, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Launch Configuration menu. Go to the Launch Configuration List page.
- On the Launch Configuration List page, click the resource to be terminated. Navigate to the Launch Configuration Details page.
- Launch Configuration Delete Click the button.
- After deletion is complete, verify on the Launch Configuration List page that the resource has been removed.
2.2.2 - Manage Policy
You can dynamically adjust the number of servers in an Auto-Scaling Group based on monitoring metrics. When the metric exceeds the threshold you set, the server count is adjusted. At that time, you can choose one of three methods to adjust the server count: increase or decrease by a specified number, increase or decrease by a specified percentage, or fix the server count to a given value. When servers are launched or terminated due to a policy, the monitoring metric, such as CPU utilization, may temporarily exceed the threshold registered in the policy. However, because this is a brief moment, a cooldown period is set to avoid treating it as an abnormal condition. You can add and manage policies for an Auto-Scaling Group created in the Samsung Cloud Platform Console.
Add Policy
You can add policies to an Auto-Scaling Group. To add a policy to an Auto-Scaling Group, follow these steps.
Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
Click the Auto-Scaling Group menu. You will be taken to the Auto-Scaling Group List page.
Auto-Scaling Group List page, click the resource you want to view detailed information for. You will be taken to the Auto-Scaling Group Details page.
Click the Policy Tab. You will be taken to the Policy Tab page.
Click the Add Policy button. The Add Policy popup window opens.
Category RequiredDetailed description Category Required Policy Category - Scale In: return server count
- Scale Out: increase server count
Policy Name Required Name for policy-specific categorization Execution conditions Required Conditions to execute the policy - Statistic: How to calculate the Metric Type
- Average: Average of servers in the Auto-Scaling Group
- Min: Minimum value among servers in the Auto-Scaling Group
- Max: Maximum value among servers in the Auto-Scaling Group
- Metric Type: CPU Usage, Network In(bytes), Network Out(bytes), Network In(Packets), Network Out(Packets)
- Operator:
>=><=<
- Threshold: Threshold corresponding to the Metric Type
- Period: Continuous duration required to trigger the condition (the condition must be continuously satisfied for N minutes for the policy to execute)
execution unit Required Policy execution method - Policy Type: Select the type of policy to execute.
- Increase or decrease the number of servers by a specified count: Increase or decrease the server count to the Target Value
- Increase or decrease the number of servers by a specified ratio: Increase or decrease by the Target Value ratio
- Fix the number of servers to the entered value: Fix the server count according to the Target Value
- Target Value: The number or ratio to apply for the selected Policy Type
cooldown Required The waiting time (seconds) when a server is started or terminated due to a policy - The default is 300 seconds, and it can be set between a minimum of 60 seconds and a maximum of 3,600 seconds.
Table. Add Policy Popup ItemsReferencePolicy > Cooldown Settings
- When a server is started or terminated due to a policy, it waits for the configured cooldown period. Temporarily, the monitoring metric CPU utilization may exceed the threshold defined in the policy. However, because this is a transient moment rather than a condition for adjusting the number of servers, it is not considered an abnormal situation, and the system waits by applying the cooldown time.
guidePolicy execution operates within the configured Min/Max server count range.
- Even if you input values outside the Min/Max server count range—such as increasing, returning, or fixing the number of servers—it operates within the configured Min/Max server count.
- Example: When the minimum number of servers is 3, even if you set the server count fixed to 1, the server count does not drop to 1 and remains at the minimum of 3.
Add Policy After entering the required values in the popup window, click the Confirm button. The added policy can be viewed in the Policy List.
Policy creation example
The following is an explanation of the policy example. Please refer to it when creating a policy.
| Category | Execution Conditions | execution unit | cooldown |
|---|---|---|---|
| Scale Out | Average CPU Usage >= 60% occurs for 1 minute | Increase the server count by the specified number, incrementing by one unit. | 300 seconds |
- If the average CPU usage of the servers in the Auto-Scaling Group exceeds 60% for one minute, a server is added one at a time.
- When a server is added, the cooldown period is 300 seconds, and during this time, no server addition or removal due to policy occurs.
- After the cooldown period ends, the policy execution conditions are checked again.
| Category | Execution Conditions | execution unit | cooldown |
|---|---|---|---|
| Scale In | Min CPU Usage <= 5% occurs for 1 minute | Scale the number of servers up or down by the specified ratio, returning 50%. | 300 seconds |
- If the minimum CPU usage of servers in the Auto-Scaling Group remains below 5% for 1 minute, 50% of the current servers will be terminated.
- When a server is terminated, the cooldown period is 300 seconds, and during this time, no server additions or removals occur due to policy.
- After the cooldown period ends, the policy execution conditions are checked again.
| Category | Execution Conditions | Execution Unit | cooldown |
|---|---|---|---|
| Scale Out | Max CPU Usage >= 90% occurs for 1 minute | Fix the number of servers to 5 based on the entered value | 300 seconds |
- If the maximum CPU usage among the servers in the Auto-Scaling Group exceeds 90% for 1 minute and the current number of servers is less than 5, servers will be created up to a total of 5.
- During server creation, the cooldown period is 300 seconds, and no server additions or removals due to policy occur during the cooldown period.
- After the cooldown period ends, the policy execution conditions are checked again.
Edit Policy
You can modify the policies of an Auto-Scaling Group. To modify the policies of an Auto-Scaling Group, follow these steps.
Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
Click the Auto-Scaling Group menu. Navigate to the Auto-Scaling Group List page.
Auto-Scaling Group List page, click the resource to view detailed information. You will be taken to the Auto-Scaling Group Details page.
Click the Policy Tab. You will be taken to the Policy Tab page.
Click the More > Edit button for the policy you want to modify. The Policy Edit Popup opens.
Category RequiredDetailed description Category Required Policy Classification - Scale In: Return server count
- Scale Out: Increase server count
Policy Name Required Name for policy-specific categorization Execution conditions Required Conditions to execute the policy - Statistic: Method to calculate the Metric Type
- Average: Average of servers in the Auto-Scaling Group
- Min: Minimum value among servers in the Auto-Scaling Group
- Max: Maximum value among servers in the Auto-Scaling Group
- Metric Type: CPU Usage, Network In(bytes), Network Out(bytes), Network In(Packets), Network Out(Packets)
- Operator:
>=><=<
- Threshold: Threshold corresponding to the Metric Type
- Period: Continuous duration (in minutes) that triggers the execution condition
execution unit Required Policy execution method - Policy Type: Select the type of policy to execute.
- Increase or decrease the number of servers by a specified count: Increase or reduce the server count to the Target Value
- Increase or decrease the number of servers by a specified ratio: Increase or reduce by the Target Value ratio
- Fix the number of servers to the entered value: Fix the server count according to the Target Value
- Target Value: The number or ratio to execute the selected Policy Type
Cool down Required The waiting time (seconds) when a server is started or terminated due to a policy - The default is 300 seconds, and it can be set from a minimum of 1 second up to a maximum of 3,600 seconds
Table. Policy edit popup itemsEdit Policy In the popup window, after entering the required values, click the Confirm button.
Policy addition and modification constraints
When adding or modifying a policy, constraints exist based on the policy classification, execution conditions, and the scope of those conditions. Below is an example of constraints for the policy. Refer to the constraint examples to add or modify the policy.
Example 1 - Need to check for duplicate registration of policy classification/execution conditions
Policy classification (Scale Out or Scale In) and execution condition (Metric type) cannot be registered redundantly with the same values.
| Policy classification | Policy Name | Execution Condition(Statistic) | Execution condition(Metric type) | Execution condition range |
|---|---|---|---|---|
Scale Out | ScaleOutPolicy | Average | CPU Usage | >= 60% |
If a policy is registered as above, you cannot add a policy with the category (Scale Out) execution condition (Metric type=CPU Usage) or modify it to that condition.
Example 2 - Need to verify the execution condition range for execution conditions (Metric type) and execution conditions (Statistic) according to policy classification
When the policy type (Scale Out or Scale In) differs, duplicate registration of the execution condition range (Comparison operator + Threshold) is not allowed for the same execution condition (Metric type) and execution condition (Statistic).
| Policy classification | Policy Name | Execution condition(Statistic) | Execution condition(Metric type) | Execution condition range |
|---|---|---|---|---|
Scale Out | ScaleOutPolicy | Average | CPU Usage | >= 60% |
If a policy is registered as shown above, you cannot add a policy as below or modify it under the following conditions.
If the average CPU Usage is 60% or higher, a Scale Out policy is already registered; therefore, registering a Scale In policy when the average CPU Usage is 60% or lower would duplicate the 60% execution condition and cannot be added.
| Policy Classification | Policy Name | Execution Condition(Statistic) | Execution condition(Metric type) | Execution condition range |
|---|---|---|---|---|
Scale In | AddUpdatePolicy | Average | CPU Usage | <= 60% |
Example 3 - Need to verify the execution condition range for execution conditions (Metric type) and execution conditions (Statistic) according to policy classification
When the policy classification (Scale Out or Scale In) differs, duplicate registration of the execution condition range (Comparison operator + Threshold) is not allowed for the same execution condition (Metric type) and execution condition (Statistic).
| Policy classification | Policy Name | Execution condition(Statistic) | Execution condition (Metric type) | Execution condition range |
|---|---|---|---|---|
Scale In | ScaleInPolicy | Average | CPU Usage | <= 10% |
If a policy is registered as shown above, you cannot add a policy as below or modify it under the following conditions.
When the average CPU Usage is 10% or less, a Scale In policy is already registered; therefore, if the average CPU Usage is less than 60% / 60% or less / 10% or more / exceeds 9%, you cannot register a Scale Out policy because the execution condition ranges would overlap.
| Policy classification | Policy Name | Execution Condition(Statistic) | Execution condition(Metric type) | Execution condition range |
|---|---|---|---|---|
Scale Out | AddUpdatePolicy1 | Average | CPU Usage | < 60% |
Scale Out | AddUpdatePolicy2 | Average | CPU Usage | <= 60% |
Scale Out | AddUpdatePolicy3 | Average | CPU Usage | >= 10% |
Scale Out | AddUpdatePolicy4 | Average | CPU Usage | > 9% |
Example 4 - Execution conditions (Metric type) and execution conditions (Statistic) can be registered according to the execution condition range based on policy classification
If the policy type (Scale Out or Scale In) differs, you can register when the execution condition (Metric type) is the same but the execution condition (Statistic) is different, or when the execution condition range (Comparison operator + Threshold) does not overlap.
| Policy classification | Policy Name | Execution Condition(Statistic) | Execution condition (Metric type) | Execution Condition Range |
|---|---|---|---|---|
Scale Out | ScaleOutPolicy | Average | CPU Usage | >= 60% |
If a policy is already registered as shown above, you can add a policy as below or modify it with the following conditions. Registration is possible when the execution condition ranges do not overlap, or when the execution condition (Statistic) differs.
| Policy classification | Policy Name | Execution Condition(Statistic) | Execution condition (Metric type) | Execution condition range |
|---|---|---|---|---|
Scale In | AddUpdatePolicy1 | Average | CPU Usage | <= 10% |
Scale In | AddUpdatePolicy2 | Min | CPU Usage | <= 60% |
Delete Policy
You can delete a policy of an Auto-Scaling Group. To delete a policy of an Auto-Scaling Group, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. Navigate to the Auto-Scaling Group List page.
- Auto-Scaling Group List page: click the resource you want to view details for. You will be taken to the Auto-Scaling Group Details page.
- Click the Policy Tab. You will be taken to the Policy Tab page.
- Select the policy to delete and click the Delete button. The Policy Deletion Confirmation popup opens.
- Policy Deletion Confirmation Check the popup window and click the Confirm button.
2.2.3 - Manage Schedule
You can schedule reservations daily, weekly, monthly, or one-time, and set the desired number of servers at the specified time. This is useful when you can predict when you need to decrease or increase the number of servers.
Add schedule
You can add a schedule to an Auto-Scaling Group. To add a schedule to an Auto-Scaling Group, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. Navigate to the Auto-Scaling Group List page.
- Auto-Scaling Group List page, click the resource you want to view detailed information for. You will be taken to the Auto-Scaling Group Details page.
- Click the Schedule Tab. You will be taken to the Schedule Tab page.
- Click the Add Schedule button. The Add Schedule popup window opens.
Category requiredDetailed description Schedule name Required Name to distinguish by schedule Select server count Required Select the number of servers to adjust when executing the schedule - Min: The minimum number of servers the Auto-Scailg Group should maintain
- Desired: The target number of servers within the Auto-Scailg Group
- Max: The maximum number of servers the Auto-Scailg Group can maintain
Enter the number of servers Required Enter the value for the selected number of servers - Min value: Enter a value between 0 and 50. (Min≤Desired≤Max)
- Desired value: Enter a value between 0 and 50. (Min≤Desired≤Max)
- Max value: Enter a value between 0 and 50. (Min≤Desired≤Max)
Period Required Schedule execution frequency - Daily: You can set the Start date, End date, and Permanent options for a daily schedule. You can also set the Time and Time zone
- Weekly: You can set the Start date, End date, Permanent options, and the Time and Time zone. You can also select the Day of week on which the weekly schedule will run.
- Monthly: You can set the Start date, End date, Permanent options, and the Time and Time zone. You can also enter the Date on which the monthly schedule will run
- One-time: You can set the Time and Time zone. You can also set the Date on which the one-time schedule will run
Start date Selection Set schedule start date - Cannot set a date earlier than the current date. The default is the current date.
End date Selection Set schedule end date - Cannot set a date earlier than the current date. The default is the current date plus 7 days.
permanent Selection When configuring permanently, set the schedule end date to 9999-12-31 time Required Schedule execution time setting - Can be set in 30‑minute increments. Times earlier than the current date or current time cannot be set
time zone Required Schedule execution time zone (example: Asia/Seoul (GMT +09:00)) day of the week Required When Period is set to Weekly, select the Day of the week to execute the schedule Date Required - When Period is set to Monthly, enter the Date on which the schedule will run
- Enter one or more values from -31 to 31, excluding 0. (Example: 3,4,5)
- When Period is set to Once, set the Date on which the schedule will run
- Setting a date earlier than the current date is not allowed. The default is the current date.
Table. Schedule addition popup items - Add Schedule After entering the required values in the popup window, click the Confirm button.
- Add Schedule Confirmation After checking the message in the popup window, click the Confirm button.
If you select the schedule frequency as monthly, you must enter the schedule execution date, Date. Refer to the information below to register the schedule.
- If you input a number greater than 0, it represents the day of the month.
- Example: If you enter 1, you get August 1, September 1, …, December 1
- If you enter a number less than 0, the calculation starts from the end of each month.
- Entering -1 means the last day of each month.
- Example: August 31, September 30, …, December 31
- If you enter -2, it means the day before the last day of each month.
- Example: August 30, September 29, …, December 30
- Since the last day of each month varies—31, 30, 29, or 28 days—we allow calculation from the end of the month using negative numbers, as shown above, to handle schedules that need to run on the month’s final day.
- Entering -1 means the last day of each month.
- When the schedule runs, if the Min server count set in the schedule is greater than the Desired server count, or if the Max server count is less than the Desired server count, the Desired server count is also adjusted.
- If there are schedules with overlapping execution times, they may not run correctly. Please try to avoid overlapping execution times as much as possible.
Modify schedule
You can modify the schedule of an Auto-Scaling Group. To modify the schedule of an Auto-Scaling Group, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. Navigate to the Auto-Scaling Group List page.
- On the Auto-Scaling Group List page, click the resource for which you want to view detailed information. You will be taken to the Auto-Scaling Group Details page.
- Click the Schedule Tab. You will be taken to the Schedule Tab page.
- Click the More > Edit button of the schedule you want to modify. The Edit Schedule popup window will open.
Category requiredDetailed description Schedule name Required Name to distinguish by schedule Select number of servers Required Select the number of servers to adjust when executing the schedule - Min: The minimum number of servers the Auto-Scailg Group should maintain
- Desired: The target number of servers within the Auto-Scailg Group
- Max: The maximum number of servers the Auto-Scailg Group can maintain
Enter number of servers Required Enter the value for the selected number of servers - Min value: Enter a value between 0 and 50. (Min≤Desired≤Max)
- Desired value: Enter a value between 0 and 50. (Min≤Desired≤Max)
- Max value: Enter a value between 0 and 50. (Min≤Desired≤Max)
period Required Schedule execution frequency - Daily: You can set the Start Date and End Date, and Permanent setting for the daily schedule. You can also set the Time and Time Zone
- Weekly: You can set the Start Date and End Date, Permanent setting, and the Time and Time Zone. You can also select the Day of Week for the Weekly schedule to run.
- Monthly: You can set the Start Date and End Date, Permanent setting, and the Time and Time Zone. You can also enter the Date for the Monthly schedule to run.
- Once: You can set the Time and Time Zone. You can also set the Date for the Once schedule to run
Start date Selection Set schedule start date - Cannot set a date earlier than the current date. The default is the current date.
End date Selection Set schedule end date - Cannot set a date earlier than the current date. The default is the current date plus 7 days.
Permanent Selection When configuring permanently, set the schedule end date to 9999-12-31. time Required Schedule execution time setting - Can be set in 30‑minute increments. Times earlier than the current date or current time cannot be set
time zone Required Schedule execution time zone (example: Asia/Seoul (GMT +09:00)) day of the week Required When Period is weekly, select the day of the week to run the schedule. Date Required - When Period is set to Monthly, enter the Date on which the schedule will run
- Enter one or more values from -31 to 31, excluding 0. (Example: 3,4,5)
- When Period is set to Once, set the Date on which the schedule will run
- Cannot set a date earlier than the current date. The default is the current date.
Table. Schedule edit popup items - Schedule Edit After entering the required values in the popup window, click the Confirm button.
- Schedule Modification Confirmation After checking the message in the popup window, click the Confirm button.
Delete schedule
You can delete the schedule of an Auto-Scaling Group. To delete the schedule of an Auto-Scaling Group, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. You will be taken to the Auto-Scaling Group List page.
- On the Auto-Scaling Group List page, click the resource for which you want to view detailed information. You will be taken to the Auto-Scaling Group Details page.
- Click the Schedule Tab. You will be taken to the Schedule Tab page.
- Select the schedule to delete and click the Delete button. Schedule Deletion Confirmation popup will open.
- Schedule Deletion Confirmation Check the popup window and click the Confirm button.
2.2.4 - Manage Notification
You can designate a notification recipient and send alert messages via E‑mail or SMS for specific situations.
- The notification method (E-mail or SMS) can be set on the Notification Settings page by selecting Notification Target as Service > Virtual Server Auto-Scaling.
- For details on editing notification settings, refer to Edit Notification Settings.
Add notification
You can add notifications to an Auto-Scaling Group. To add notifications to an Auto-Scaling Group, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. You will be taken to the Auto-Scaling Group List page.
- On the Auto-Scaling Group List page, click the resource to which you want to add notification information. You will be taken to the Auto-Scaling Group Details page.
- Click the Notification Tab. You will be taken to the Notification Tab page.
- Click the Add Notification button. The Add Notification popup opens.
- Add Notification After entering the required values in the popup window, click the Confirm button.
Category Detailed description Notification timing Notification timing when an Auto-Scaling Group alert occurs - Server creation, Server termination, Server creation failure, Server termination failure, When policy execution conditions are met
- Multiple selection allowed
Notification recipient User who will receive the notification when an alert occurs - Click the Add Notification Recipient button to select a user
- Only Samsung Cloud Platform users can be selected as recipients
Table. Notification items
- Add Notification Confirmation After checking the message in the popup window, click the Confirm button.
Edit Notification
You can modify the notification settings of an Auto-Scaling Group. To modify the notification settings of an Auto-Scaling Group, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. Navigate to the Auto-Scaling Group List page.
- On the Auto-Scaling Group List page, click the resource to edit the notification information. You will be taken to the Auto-Scaling Group Details page.
- Click the Notification Tab. You will be taken to the Notification Tab page.
- In the notification list, click the More > Edit button for the notification you want to modify. The Edit Notification popup window will open.
- Edit Notification After editing the notification information in the popup window, click the Confirm button.
Category Detailed description Notification timing Notification timing when an Auto-Scaling Group alert occurs - Server creation, Server termination, Server creation failure, Server termination failure, When policy execution conditions are met
- Multiple selection allowed
Table. Notification edit items - Edit Notification Confirmation After checking the message in the popup window, click the Confirm button.
Delete Notification
You can delete notifications for an Auto-Scaling Group. To delete notifications for an Auto-Scaling Group, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Auto-Scaling Group menu. You will be taken to the Auto-Scaling Group List page.
- On the Auto-Scaling Group List page, click the resource to edit the notification information. You will be taken to the Auto-Scaling Group Details page.
- Click the Notification Tab. You will be taken to the Notification Tab page.
- In the notification list, select the notification you want to delete, then click the Delete button. The Delete Notification Confirmation popup will open.
- Delete Notification Confirmation Review the popup window and click the Confirm button.
2.3 - API Reference
2.4 - CLI Reference
2.5 - Release Note
Virtual Server Auto-Scaling
- The Virtual Server and Load Balancer status check feature has been added.
- Check the status of Virtual Server and Load Balancer and automatically replace servers that are in an Unhealthy state.
- You can attach File Storage to a Launch Configuration.
- When creating a Launch Configuration, you can attach up to five File Storage Volumes.
- You can monitor data by integrating with the ServiceWatch service.
- The termination feature for Virtual Servers created in an Auto-Scaling Group has been improved.
- When a Virtual Server is terminated, the Load Balancer is automatically detached.
- The Desired server count remains unchanged, and scale-out is performed according to the Desired check batch.
- Added a notification feature to Virtual Server Auto-Scaling.
- You can add notification settings in the creation or detail screen of an Auto-Scaling Group.
- You can set scaling policies when creating an Auto-Scaling Group.
- Added a Metric Type to the Auto-Scaling Group policy.
- Add: Memory Usage, Network In(bytes), Network Out(bytes), Network In(Packets), Network Out(Packets)
- You can set the Draining Timeout when connecting to a Load Balancer.
- In an Auto-Scaling Group, up to 50 Virtual Servers can be connected, and up to 3 LB server groups and ports can be attached.
- Virtual Server Auto-Scaling feature change
- It will be released in conjunction with the Load Balancer service launching in February 2025.
- The NAT configuration feature has been added to the Auto-Scaling Group.
- Samsung Cloud Platform Common Feature Changes
- Account, IAM, Service Home, tags, and other common CX changes have been applied.
- Virtual Server Auto-Scaling creates an Auto-Scaling Group through a Launch Configuration and monitors and manages the servers.
- Provides a scheduling method that allows setting the desired number of servers at a specified time, and a policy method that adjusts the number of servers based on CPU utilization.
3 - GPU Server
3.1 - Overview
Service Overview
GPU Server is a virtualized computing service that lets you freely allocate and use infrastructure resources such as CPU, GPU, and memory provided by the server, without having to purchase them individually, and allocate as much as needed at the required time. It is suitable for tasks that require fast computation speed, such as AI model experimentation, prediction, and inference in a cloud environment, and you can flexibly select and use resources with optimized performance based on the type and scale of the work. The GPU Server provides the following features.
Provided Features
- GPU Server Management: Through a web-based console, users can directly Self Service create, delete, and modify GPU Server provisioning, monitoring, and billing.
- Product offerings by GPU quantity: Depending on the project’s purpose and scale, you can freely select the number of H100/A100 GPUs to configure a virtual server.
- High‑Performance GPU Provision: We provide a high‑performance GPU server at physical‑server level using a pass‑through method.
- Storage Connection: Provides additional attached storage besides the OS disk. * You can connect and use Block Storage, File Storage, and Object Storage.
- Strong Security Application: Use the Security Group service to control inbound/outbound traffic exchanged with the external internet or other VPC(Virtual Private Cloud), securely protecting the server.
- Monitoring: You can view monitoring information such as the status of computing resources—including CPU, Memory, Disk, and GPU—through the Cloud Monitoring service.
- Network Configuration Management: The server’s subnet/IP can be easily changed from the values set at initial creation. * NAT IP provides a management feature that lets you enable or disable it as needed.
- Key Pair method: To ensure a secure OS access method, we provide a Key Pair method instead of ID/PW login.
- Image Management: You can create and manage Custom Images, and it provides sharing functionality between projects.
- ServiceWatch Service Integration Offering: You can monitor data through the ServiceWatch service.
Component
GPU Server provides GPUs, NVSwitch, and NVLink on top of virtualized computing resources.
- NVSwitch can only be enabled and used for instance types that allocate eight GPUs on a single GPU server.
Specifications by GPU Type
GPU (Graphic Processing Unit) performs the calculations needed to create images that compose the computer screen, and because it is specialized for parallel processing, it can handle large amounts of data quickly, processing large-scale parallel operations such as artificial intelligence (AI) and data analysis.
The following are the specifications of the GPU Types offered by the GPU Server service.
| Category | A100 Type | H100 Type | B300 Type |
|---|---|---|---|
| GPU Architecture | NVIDIA Ampere | NVIDIA Hopper | NVIDIA Blackwell Ultra |
| GPU Memory | 80 GiB | 80 GiB | 268 GiB |
| GPU Transistors | 54 billion 7N TSMC | 80 billion 4N TSMC | 208 billion 4NP TSMC |
| FP16 Tensor Core (Dense) | 312 TFLOPs | 989 TFLOPs | 2.25 PFLOPs |
| FP8 Tensor Core (Dense) | Not supported | 1,979 TFLOPs | 4.5 PFLOPs |
| FP4 Tensor Core (Dense) | Unsupported | Not supported | 13.5 PFLOPs |
| GPU Memory Bandwidth | 2,039 GB/s HBM2e | 3,352 GB/s HBM3 | 8 TB/s HBM3e |
| NVLink performance | NVLink 3 | NVLink 4 | NVLink 5 |
| NVLink Signaling Rate | 25 GB/s (x12) | 25 GB/s (x18) | 50 GB/s (x18) |
| NVSwitch GPU-to-GPU bandwidth | 600 GB/s | 900 GB/s | 1.8 TB/s |
| Total NVSwitch aggregate bandwidth | 4.8 TB/s | 7.2 TB/s | 14.4 TB/s |
Server type
The server types offered by the GPU Server are as follows. For detailed information about the server types provided by GPU Server, refer to GPU Server 서버 타입.
| Category | Server type | CPU vCore | Memory(GB) | Number of GPUs |
|---|---|---|---|---|
| GPU-A100-1 | g1v16a1 | 16 | 234 | 1 |
| GPU-A100-1 | g1v32a2 | 32 | 468 | 2 |
| GPU-A100-1 | g1v64a4 | 64 | 936 | 4 |
| GPU-A100-1 | g1v128a8 | 128 | 1,872 | 8 |
| GPU-H100-2 | g2v12h1 | 12 | 234 | 1 |
| GPU-H100-2 | g2v24h2 | 24 | 468 | 2 |
| GPU-H100-2 | g2v48h4 | 48 | 936 | 4 |
| GPU-H100-2 | g2v96h8 | 96 | 1,872 | 8 |
| GPU-B300-3 | g3v16b1 | 16 | 480 | 1 |
| GPU-B300-3 | g3v32b2 | 32 | 960 | 2 |
| GPU-B300-3 | g3v64b4 | 64 | 1,920 | 4 |
| GPU-B300-3 | g3v128b8 | 128 | 3,840 | 8 |
OS and GPU driver version
The operating systems (OS) supported by the GPU Server are as follows. Note that GPUs of type B300 are supported only from a specific GPU version onward, so please be careful when selecting images.
| OS | OS version | GPU driver version | Server type classification |
|---|---|---|---|
| Ubuntu | 24.04 | 580.126.20 | GPU-B300-3, GPU-H100-2, GPU-A100-1 |
| Ubuntu | 24.04 | 570.195.03 | GPU-H100-2, GPU-A100-1 |
| Ubuntu | 22.04 | 535.183.06 | GPU-H100-2, GPU-A100-1 |
| RHEL | 9.6 | 580.126.20 | GPU-B300-3, GPU-H100-2, GPU-A100-1 |
| RHEL | 8.10 | 580.126.20 | GPU-B300-3, GPU-H100-2, GPU-A100-1 |
| RHEL | 8.10 | 535.183.06 | GPU-H100-2, GPU-A100-1 |
Preceding Service
This is a service that must be installed in advance before creating this service. Please prepare by referring to the user guide provided in advance.
| Service Category | service | Detailed description |
|---|---|---|
| Networking | VPC | A service that provides an isolated virtual network in a cloud environment |
| Networking | Security Group | Virtual firewall that controls server traffic |
3.1.1 - Server type
GPU Server server type
GPU servers are categorized by the GPU type they provide, and the GPU used in a GPU server is determined by the server type selected when creating the GPU server. Please select the server type based on the specifications of the application you want to run on the GPU server.
The server types supported by the GPU Server are as follows.
GPU-H100-2 g2v12h1
Category | example | Detailed description |
|---|---|---|
| Server type | GPU-H100-2 | Provided server type classification
|
| Server specifications | g2 | Provided server type classification and generation
|
| Server specifications | v12 | Number of vCores
|
| Server specifications | h1 | GPU type and quantity
|
g1 server type
The g1 server type is a GPU Server that uses the NVIDIA A100 Tensor Core GPU, suitable for high-performance applications.
- Up to 8 NVIDIA A100 Tensor Core GPUs provided
- Equipped with 6,912 CUDA cores and 432 Tensor cores per GPU
- Supports up to 128 vCPUs and 1,920 GB of memory
- Networking speed up to 40 Gbps
- 600 GB/s GPU and NVIDIA NVSwitch P2P communication
| Category | Server type | GPU | CPU | Memory | GPU Memory | Network Bandwidth |
|---|---|---|---|---|---|---|
| GPU-A100-1 | g1v16a1 | 1 | 16 vCore | 234 GB | 80 GiB | Maximum 20 Gbps |
| GPU-A100-1 | g1v32a2 | 2 | 32 vCore | 468 GB | 160 GiB | Up to 20 Gbps |
| GPU-A100-1 | g1v64a4 | 4 | 64 vCore | 936 GB | 320 GiB | Maximum 40 Gbps |
| GPU-A100-1 | g1v128a8 | 8 | 128 vCore | 1,872 GB | 640 GiB | Maximum 40 Gbps |
g2 server type
The g2 server type is a GPU Server that uses the NVIDIA H100 Tensor Core GPU, making it suitable for high-performance applications.
- Up to 8 NVIDIA H100 Tensor Core GPUs provided
- Equipped with 16,896 CUDA cores and 528 Tensor cores per GPU
- Supports up to 96 vCPUs and 1,920 GB of memory
- Networking speed up to 40 Gbps
- 900 GB/s GPU and NVIDIA NVSwitch P2P communication
| Category | Server type | GPU | CPU | Memory | GPU Memory | Network Bandwidth |
|---|---|---|---|---|---|---|
| GPU-H100-2 | g2v12h1 | 1 | 12 vCore | 234 GB | 80 GiB | Up to 20 Gbps |
| GPU-H100-2 | g2v24h2 | 2 | 24 vCore | 468 GB | 160 GiB | Up to 20 Gbps |
| GPU-H100-2 | g2v48h4 | 4 | 48 vCore | 936 GB | 320 GiB | Maximum 40 Gbps |
| GPU-H100-2 | g2v96h8 | 8 | 96 vCore | 1,872 GB | 640 GiB | Maximum 40 Gbps |
g3 server type
The g3 server type is a GPU Server that uses the NVIDIA B300 Tensor Core GPU, making it suitable for high-performance applications.
- Up to 8 NVIDIA B300 Tensor Core GPUs provided
- Equipped with 20,480 CUDA cores and 640 Tensor cores per GPU
- Supports up to 128 vCPUs and 3,840 GB of memory.
- Maximum 40 Gbps networking speed
- 1.8 TB/s GPU and NVIDIA NVSwitch P2P communication
| Category | Server type | GPU | CPU | Memory | GPU Memory | Network Bandwidth |
|---|---|---|---|---|---|---|
| GPU-B300-3 | g3v16b1 | 1 | 16 vCore | 480 GB | 268 GiB | Up to 20 Gbps |
| GPU-B300-3 | g3v32b2 | 2 | 32 vCore | 960 GB | 536 GiB | Up to 20 Gbps |
| GPU-B300-3 | g3v64b4 | 4 | 64 vCore | 1,920 GB | 1,072 GiB | Up to 40 Gbps |
| GPU-B300-3 | g3v128b8 | 8 | 128 vCore | 3,840 GB | 2,144 GiB | Maximum 40 Gbps |
3.1.2 - Monitoring Metrics
According to Samsung Cloud Platform’s policy, the Cloud Monitoring service is scheduled to be discontinued in September 2026.
Accordingly, after the September 2026 release, resource monitoring of the Samsung Cloud Platform via Cloud Monitoring will no longer be possible.
With the new alternative service, you can continuously perform resource monitoring by leveraging ServiceWatch released in October 2025.
ServiceWatch provides more modern and powerful features, replacing Cloud Monitoring to deliver a seamless monitoring environment.
If you are collecting metrics and logs through the Cloud Monitoring Agent, you need to switch to the ServiceWatch Agent.
For detailed information about ServiceWatch, please refer to ServiceWatch Overview.
Detailed information about ServiceWatch Agent can be found in the ServiceWatch Agent.
GPU Server Monitoring Metrics
The table below shows the monitoring metrics of the GPU server that can be viewed through Cloud Monitoring.
Even without installing the Agent, it provides basic monitoring metrics and the table below. Please check the GPU Server monitoring metrics (provided by default). Additionally, the metrics that can be viewed by installing the Agent are in the table. Please refer to the additional monitoring metrics for GPU Server (Agent installation required).
For detailed usage of Cloud Monitoring, refer to the Cloud Monitoring guide.
| Performance Item Name | Explanation | unit |
|---|---|---|
| Memory Total [Basic] | bytes of usable memory | bytes |
| Memory Used [Basic] | bytes of currently used memory | bytes |
| Memory Swap In [Basic] | bytes of the replaced memory | bytes |
| Memory Swap Out [Basic] | bytes of the replaced memory | bytes |
| Memory Free [Basic] | bytes of unused memory | bytes |
| Disk Read Bytes [Basic] | Read bytes | bytes |
| Disk Read Requests [Basic] | Number of read requests | cnt |
| Disk Write Bytes [Basic] | write bytes | bytes |
| Disk Write Requests [Basic] | Number of write requests | cnt |
| CPU Usage [Basic] | Average system CPU usage over 1 minute | % |
| Instance State [Basic] | Instance status | state |
| Network In Bytes [Basic] | Received bytes | bytes |
| Network In Dropped [Basic] | Incoming packet drop | cnt |
| Network In Packets [Basic] | Number of received packets | cnt |
| Network Out Bytes [Basic] | sent bytes | bytes |
| Network Out Dropped [Basic] | Transmit packet drop | cnt |
| Network Out Packets [Basic] | Number of transmitted packets | cnt |
| Performance Item Name | Explanation | unit |
|---|---|---|
| GPU Count | Number of GPUs | cnt |
| GPU Memory Usage | Memory usage rate | % |
| GPU Memory Used | Memory usage | MB |
| GPU Temperature | GPU temperature | ℃ |
| GPU Usage | utilization | % |
| GPU Usage [Avg] | Overall average GPU utilization (%) | % |
| GPU Power Cap | Maximum power capacity of the GPU | W |
| GPU Power Usage | Current GPU power usage | W |
| GPU Memory Usage [Avg] | GPU Memory Uti. AVG | % |
| GPU Count in use | Number of GPUs in use by jobs on the node | cnt |
| Execution Status for nvidia-smi | Result of running the nvidia-smi command | status |
| Core Usage [IO Wait] | Ratio of CPU time spent in wait state (disk wait) | % |
| Core Usage [System] | Proportion of CPU time spent in kernel space | % |
| Core Usage [User] | Proportion of CPU time spent in user space | % |
| CPU Cores | Number of CPU cores on the host | cnt |
| CPU Usage [Active] | Percentage of CPU time used, excluding Idle and IOWait states | % |
| CPU Usage [Idle] | It is the proportion of CPU time spent in idle state. | % |
| CPU Usage [IO Wait] | The proportion of CPU time spent in a waiting state (disk wait). | % |
| CPU Usage [System] | Percentage of CPU time used by the kernel | % |
| CPU Usage [User] | Percentage of CPU time used in user space. | % |
| CPU Usage/Core [Active] | Percentage of CPU time used other than Idle and IOWait states | % |
| CPU Usage/Core [Idle] | It is the proportion of CPU time spent in idle state. | % |
| CPU Usage/Core [IO Wait] | This is the proportion of CPU time spent in a waiting state (disk wait). | % |
| CPU Usage/Core [System] | Percentage of CPU time used by the kernel | % |
| CPU Usage/Core [User] | Percentage of CPU time used in user space. | % |
| Disk CPU Usage [IO Request] | Proportion of CPU time during which I/O requests to the device were executed | % |
| Disk Queue Size [Avg] | The average queue length of requests executed for the device. | num |
| Disk Read Bytes | The number of bytes read per second from the device. | bytes |
| Disk Read Bytes [Delta Avg] | Average of system.diskio.read.bytes_delta for individual disks | bytes |
| Disk Read Bytes [Delta Max] | Maximum system.diskio.read.bytes_delta of individual disks | bytes |
| Disk Read Bytes [Delta Min] | Minimum system.diskio.read.bytes_delta of individual disks | bytes |
| Disk Read Bytes [Delta Sum] | Sum of the system.diskio.read.bytes_delta of individual disks | bytes |
| Disk Read Bytes [Delta] | Delta of the system.diskio.read.bytes value for each disk | bytes |
| Disk Read Bytes [Success] | Total number of bytes successfully read. | bytes |
| Disk Read Requests | Number of read requests to the disk device per second | cnt |
| Disk Read Requests [Delta Avg] | Average of the system.diskio.read.count_delta for individual disks | cnt |
| Disk Read Requests [Delta Max] | Maximum system.diskio.read.count_delta for individual disks | cnt |
| Disk Read Requests [Delta Min] | Minimum of system.diskio.read.count_delta for individual disks | cnt |
| Disk Read Requests [Delta Sum] | Sum of system.diskio.read.count_delta for individual disks | cnt |
| Disk Read Requests [Success Delta] | Delta of system.diskio.read.count for each disk | cnt |
| Disk Read Requests [Success] | Total number of successful reads | cnt |
| Disk Request Size [Avg] | The average size of requests executed on the device (unit: sectors). | num |
| Disk Service Time [Avg] | Average service time (milliseconds) of input requests executed on the device. | ms |
| Disk Wait Time [Avg] | Average time taken for requests executed on the supported device. | ms |
| Disk Wait Time [Read] | Average disk wait time | ms |
| Disk Wait Time [Write] | Average disk wait time | ms |
| Disk Write Bytes [Delta Avg] | Average of system.diskio.write.bytes_delta for each disk | bytes |
| Disk Write Bytes [Delta Max] | Maximum system.diskio.write.bytes_delta of individual disks | bytes |
| Disk Write Bytes [Delta Min] | Minimum of system.diskio.write.bytes_delta for individual disks | bytes |
| Disk Write Bytes [Delta Sum] | Sum of system.diskio.write.bytes_delta for individual disks | bytes |
| Disk Write Bytes [Delta] | Delta of the system.diskio.write.bytes value for each disk | bytes |
| Disk Write Bytes [Success] | Total number of bytes successfully written. | bytes |
| Disk Write Requests | Number of write requests to the disk device per second | cnt |
| Disk Write Requests [Delta Avg] | Average of system.diskio.write.count_delta for individual disks | cnt |
| Disk Write Requests [Delta Max] | Maximum system.diskio.write.count_delta for individual disks | cnt |
| Disk Write Requests [Delta Min] | Minimum of system.diskio.write.count_delta for individual disks | cnt |
| Disk Write Requests [Delta Sum] | Sum of the system.diskio.write.count_delta of individual disks | cnt |
| Disk Write Requests [Success Delta] | Delta of system.diskio.write.count for each disk | cnt |
| Disk Write Requests [Success] | Total number of successful writes | cnt |
| Disk Writes Bytes | It is the number of bytes per second written to the device. | bytes |
| Filesystem Hang Check | filesystem (local/NFS) hang check (normal:1, abnormal:0) | status |
| Filesystem Nodes | It is the total number of file nodes in the file system. | cnt |
| Filesystem Nodes [Free] | It is the total number of available file nodes in the file system. | cnt |
| Filesystem Size [Available] | Disk space (bytes) available to unauthorized users | bytes |
| Filesystem Size [Free] | Available disk space (bytes) | bytes |
| Filesystem Size [Total] | Total disk space (bytes) | bytes |
| Filesystem Usage | Used disk space percentage | % |
| Filesystem Usage [Avg] | Average of individual filesystem.used.pct values | % |
| Filesystem Usage [Inode] | inode usage | % |
| Filesystem Usage [Max] | Maximum among individual filesystem.used.pct | % |
| Filesystem Usage [Min] | minimum among individual filesystem.used.pct | % |
| Filesystem Usage [Total] | - | % |
| Filesystem Used | Used disk space (bytes) | bytes |
| Filesystem Used [Inode] | inode usage | bytes |
| Memory Free | Total amount of available memory (bytes). | bytes |
| Memory Free [Actual] | Actual usable memory (bytes). | bytes |
| Memory Free [Swap] | Available swap memory. | bytes |
| Memory Total | total memory | bytes |
| Memory Total [Swap] | Total swap memory. | bytes |
| Memory Usage | Percentage of used memory | % |
| Memory Usage [Actual] | Percentage of memory actually used | % |
| Memory Usage [Cache Swap] | cached swap usage rate | % |
| Memory Usage [Swap] | Percentage of used swap memory | % |
| Memory Used | used memory | bytes |
| Memory Used [Actual] | Actual memory used (bytes). | bytes |
| Memory Used [Swap] | Swap memory used. | bytes |
| Collisions | Network collision | cnt |
| Network In Bytes | Number of received bytes | bytes |
| Network In Bytes [Delta Avg] | Average of system.network.in.bytes_delta for individual networks | bytes |
| Network In Bytes [Delta Max] | Maximum of system.network.in.bytes_delta for each network | bytes |
| Network In Bytes [Delta Min] | Minimum system.network.in.bytes_delta for each network | bytes |
| Network In Bytes [Delta Sum] | Sum of system.network.in.bytes_delta for individual networks | bytes |
| Network In Bytes [Delta] | Delta of received byte count | bytes |
| Network In Dropped | Number of deleted packets among incoming packets | cnt |
| Network In Errors | Number of errors during reception | cnt |
| Network In Packets | Number of received packets | cnt |
| Network In Packets [Delta Avg] | Average of system.network.in.packets_delta for each network | cnt |
| Network In Packets [Delta Max] | Maximum of system.network.in.packets_delta for each network | cnt |
| Network In Packets [Delta Min] | Minimum of system.network.in.packets_delta for individual networks | cnt |
| Network In Packets [Delta Sum] | Sum of system.network.in.packets_delta for individual networks | cnt |
| Network In Packets [Delta] | Delta of received packet count | cnt |
| Network Out Bytes | Number of transmitted bytes | bytes |
| Network Out Bytes [Delta Avg] | Average of system.network.out.bytes_delta for each network | bytes |
| Network Out Bytes [Delta Max] | Maximum of system.network.out.bytes_delta for individual networks | bytes |
| Network Out Bytes [Delta Min] | Minimum of system.network.out.bytes_delta for individual networks | bytes |
| Network Out Bytes [Delta Sum] | Sum of system.network.out.bytes_delta for individual networks | bytes |
| Network Out Bytes [Delta] | Delta of transmitted byte count | bytes |
| Network Out Dropped | Number of deleted packets among outgoing packets. | cnt |
| Network Out Errors | Number of errors during transmission | cnt |
| Network Out Packets | Number of transmitted packets | cnt |
| Network Out Packets [Delta Avg] | Average of system.network.out.packets_delta for each network | cnt |
| Network Out Packets [Delta Max] | Maximum of system.network.out.packets_delta for each network | cnt |
| Network Out Packets [Delta Min] | Minimum of system.network.out.packets_delta for each network | cnt |
| Network Out Packets [Delta Sum] | Sum of system.network.out.packets_delta for individual networks | cnt |
| Network Out Packets [Delta] | Delta of transmitted packet count | cnt |
| Open Connections [TCP] | All open TCP connections | cnt |
| Open Connections [UDP] | All open UDP connections | cnt |
| Port Usage | Available port usage rate | % |
| SYN Sent Sockets | Number of sockets in SYN_SENT state (when connecting from local to remote) | cnt |
| Kernel PID Max | kernel.pid_max value | cnt |
| Kernel Thread Max | kernel.threads-max value | cnt |
| Process CPU Usage | Percentage of CPU time consumed by the process since the last update. | % |
| Process CPU Usage/Core | Percentage of CPU time used by the process since the last event. | % |
| Process Memory Usage | Proportion of main memory (RAM) occupied by a process | % |
| Process Memory Used | Resident Set size. The amount of memory a process occupies in RAM. | bytes |
| Process PID | process pid | PID |
| Process PPID | parent process PID | PID |
| Processes [Dead] | Number of dead processes | cnt |
| Processes [Idle] | Number of idle processes | cnt |
| Processes [Running] | Number of running processes | cnt |
| Processes [Sleeping] | Number of sleeping processes | cnt |
| Processes [Stopped] | stopped processes count | cnt |
| Processes [Total] | Total number of processes | cnt |
| Processes [Unknown] | Number of processes with an unsearchable or unknown status | cnt |
| Processes [Zombie] | Zombie processes count | cnt |
| Running Process Usage | process usage rate | % |
| Running Processes | Number of running processes | cnt |
| Running Thread Usage | Thread usage rate | % |
| Running Threads | Total number of threads running in running processes | cnt |
| Context Switches | context switch count (per second) | cnt |
| Load/Core [1 min] | The load over the last 1 minute divided by the number of cores | cnt |
| Load/Core [15 min] | The load over the last 15 minutes divided by the number of cores | cnt |
| Load/Core [5 min] | The load over the last 5 minutes divided by the number of cores | cnt |
| Multipaths [Active] | External storage connection path status = active count | cnt |
| Multipaths [Failed] | External storage connection path status = failed count | cnt |
| Multipaths [Faulty] | External storage connection path status = faulty count | cnt |
| NTP Offset | measured offset of the last sample (the time difference between the NTP server and the local environment) | num |
| Run Queue Length | Execution queue length | num |
| Uptime | OS uptime (milliseconds). | ms |
| Context Switchies | CPU context switch count (per second) | cnt |
| Disk Read Bytes [Sec] | Number of bytes read from a Windows logical disk in 1 second | cnt |
| Disk Read Time [Avg] | Average data read time (seconds) | sec |
| Disk Transfer Time [Avg] | Disk average wait time | sec |
| Disk Usage | Disk usage | % |
| Disk Write Bytes [Sec] | Number of bytes written in one second on a Windows logical disk | cnt |
| Disk Write Time [Avg] | Average data write time (seconds) | sec |
| Pagingfile Usage | Paging file usage | % |
| Pool Used [Non Paged] | Nonpaged Pool usage in kernel memory | bytes |
| Pool Used [Paged] | Paged Pool usage in kernel memory | bytes |
| Process [Running] | Number of currently running processes | cnt |
| Threads [Running] | Number of currently running threads | cnt |
| Threads [Waiting] | Number of threads waiting for processor time | cnt |
3.1.3 - ServiceWatch Metrics
The GPU Server sends metrics to ServiceWatch. The metrics provided by default monitoring are data collected at 5‑minute intervals. If detailed monitoring is enabled, you can view data collected at 1‑minute intervals.
- The basic and detailed monitoring of the GPU Server are provided with the same metrics as the Virtual Server, and the namespace is also provided as Virtual Server.
- GPU-related metrics are provided through ServiceWatch Agent, and for instructions on collecting metrics using ServiceWatch Agent, refer to the ServiceWatch Agent guide.
Refer to How-to guides > ServiceWatch Detailed Monitoring Activation for how to enable detailed monitoring of the GPU Server.
Basic Metrics
The following are the basic metrics for the Virtual Server namespace.
The indicators whose names are displayed in bold below are the key indicators selected among the basic indicators provided by Virtual Server. The key metrics are used to build service dashboards that are automatically created for each service in ServiceWatch.
Each metric indicates through the user guide which statistical value is meaningful to view for that metric, and among the meaningful statistics, the statistical values shown in bold text are the primary statistics. In the service dashboard, you can view primary metrics using the primary statistical values.
| Performance items | Detailed description | unit | meaningful statistics |
|---|---|---|---|
| Instance State | Instance status display
| None |
|
| CPU Usage | CPU usage | Percent |
|
| Disk Read Bytes | Bytes read from block device (bytes) | Bytes |
|
| Disk Read Requests | Number of read requests on a block device | Count |
|
| Disk Write Bytes | Write capacity (bytes) on block device | Bytes |
|
| Disk Write Requests | Number of write requests on block device | Count |
|
| Network In Bytes | Received bytes (capacity) on the network interface | Bytes |
|
| Network In Dropped | Number of packet drops received on the network interface | Count |
|
| Network In Packets | Number of packets received on the network interface | Count |
|
| Network Out Bytes | Data transmitted on the network interface (bytes) | Bytes |
|
| Network Out Dropped | Number of packet drops transmitted from the network interface | Count |
|
| Network Out Packets | Number of packets transmitted on the network interface | Count |
|
3.2 - How-to guides
Users can create the service by entering the required GPU Server information and selecting detailed options through the Samsung Cloud Platform Console.
Creating a GPU Server
You can create and use a GPU Server service from the Samsung Cloud Platform Console.
To create a GPU server, follow the steps below.
Click the All Services > Compute > GPU Server menu. You will be taken to the Service Home page of GPU Server.
On the Service Home page, click the GPU Server Creation button. You will be taken to the GPU Server Creation page.
On the GPU Server Creation page, enter the information required to create the service and select detailed options.
- Select the required information in the Image and Version Selection area.
Category Required statusDetailed description Image Required Select the type of Image provided - Standard: Samsung Cloud Platform standard provided Image
- RHEL, Ubuntu
- Custom: User-created Image
- Kubernetes: Image for Kubernetes
- Ubuntu
Image version Required Select the version of the chosen Image - Provides a list of versions for the offered server Image
- For detailed information about the provided server image, see OS and GPU driver versions
Table. GPU Server image and version selection input fields - Standard: Samsung Cloud Platform standard provided Image
- Service Information Input area, enter or select the required information.
Category Required statusDetailed description Number of servers Required Number of GPU Server servers to create concurrently - Only numeric input is allowed, enter a value between 1 and 100
Service Type > Server Type Required GPU Server server type - Indicates the specifications of a GPU-type server, allowing selection of servers with 1, 2, 4, or 8 GPUs
- For detailed information about the server types provided by GPU Server, refer to GPU Server Server Types
Service Type > Planned Compute Selection Resource status with Planned Compute configured - In Use: Number of resources with Planned Compute that are currently in use
- Configured: Number of resources with Planned Compute configured
- Coverage Preview: Amount applied per resource by Planned Compute
- Apply for Planned Compute Service: Navigate to the Planned Compute service application page
- For more details, refer to Apply for Planned Compute
Block Storage Required Configure the Block Storage used by the GPU Server according to its purpose - Basic: The area where the OS is installed and used
- Capacity can be entered in units (the minimum capacity varies depending on the OS image type)
- RHEL: Values between 3 and 1,536 can be entered
- Ubuntu: Values between 3 and 1,536 can be entered
- SSD: High‑performance general volume
- HDD: General volume
- SSD/HDD_KMS: Additional encrypted volume using Samsung Cloud Platform KMS (Key Management System) encryption keys
- Encryption can be applied only at initial creation (cannot be changed after creation)
- Performance degradation occurs when using the SSD_KMS disk type
- SSD_Provisioned: SSD volume with configurable IOPS and throughput
- Capacity can be entered in units (the minimum capacity varies depending on the OS image type)
- Additional: Used when the user needs extra space beyond the OS area
- After selecting Use, enter the storage type and capacity
- To add storage, click the + button (up to 25 can be added); to delete, click the x button
- Capacity can be entered in units, with values between 1 and 1,536
- Since 1 Unit equals 8 GB, this creates 8 GB to 12,288 GB
- SSD: High‑performance general volume
- HDD: General volume
- SSD/HDD_KMS: Additional encrypted volume using Samsung Cloud Platform KMS (Key Management System) encryption keys
- Encryption can be applied only at initial creation (cannot be changed after creation)
- Performance degradation may occur when using the SSD_KMS disk type
- HDD/SSD_MultiAttach: Volume that can be attached to two or more servers
- SSD_Provisioned: SSD volume with configurable IOPS and throughput
- For details on each Block Storage type, refer to Create Block Storage
- Delete on termination: If Delete on Termination is set to Enabled, the volume is terminated together with the server
- Volumes with existing snapshots are not deleted even if Delete on termination is set to Enabled
- A multi‑attach volume can be deleted only when the server being removed is the last remaining server attached to the volume
Max IOPS Required Enter a maximum IOPS value between 5,000~20,000 - Can be set only when disk type is SSD_Provisioned
Max Throughput Required Enter the maximum Throughput value between 250~1,000 - Disk type can be set only when it is SSD_Provisioned
Table. GPU Server Service Configuration Items - In the Required Information Input area, enter or select the necessary information.
Category required or notDetailed description Server name Required Enter a name to distinguish the server when the selected number of servers is 1 - Set the hostname using the entered server name
- Enter within 63 characters using letters, numbers, spaces, and special characters (
-_)
Server name Prefix Required Enter a Prefix to distinguish each server generated when the selected number of servers is 2 or more - Automatically generated in the form of user input value (prefix) + ‘
-#’
- Enter within 59 characters using letters, numbers, spaces, and special characters (
-,_)
Network Settings > Create New Network Port Required Configure the network where the GPU Server will be installed - Select a pre‑created VPC.
- General Subnet: Select a pre‑created general Subnet
- IP can be set to auto‑generate or manual entry; if manual is chosen, the user can directly input the IP
- NAT: Available only when there is a single server and the VPC is attached to an Internet Gateway. Checking Use enables selection of a NAT IP
- NAT IP: Select a NAT IP
- If no NAT IP is available, click the Create New button to generate a Public IP
- Click the Refresh button to view and select the created Public IP
- Creating a Public IP incurs charges according to the Public IP pricing policy
- Local Subnet (optional): Choose Use for a local Subnet
- It is not a required element for creating the service
- A pre‑created local Subnet must be selected
- IP can be set to auto‑generate or manual entry; selecting Manual allows the user to input the IP directly
- Security Group: Settings required to access the server
- Select: Choose a pre‑created Security Group
- Create New: If no applicable Security Group exists, you can create one separately in the Security Group service
- Up to 5 can be selected
- If no Security Group is set, all connections are blocked by default
- You must configure a Security Group to allow required connections
Network Settings > Existing Network Port Assignment Required Set the network where the GPU server will be installed - Select a pre-created VPC
- General Subnet: Select a pre-created general Subnet and Port
- NAT: Available only when there is a single server and the VPC is connected to an Internet Gateway. When you check to use it, you can select a NAT IP.
- NAT IP: Select a NAT IP
- If there is no NAT IP to select, click the Create New button to generate a Public IP
- Click the Refresh button to view and select the created Public IP
- Local Subnet (optional): Select Use for the local Subnet
- Select a pre-created local Subnet and Port
Keypair Required User authentication methods to use when connecting to the server - New creation: Create a new keypair if one is needed
- Refer to Keypair 생성하기 for how to create a new keypair
- Default login accounts by OS
- RHEL: cloud-user
- Ubuntu: ubuntu
Table. Required input fields for GPU Server - Enter or select the required information in the Additional Information Input area.
Category RequiredDetailed description Lock Selection Lock usage setting - When Lock is enabled, it prevents actions such as server termination, start, and stop from being executed, thereby avoiding malfunctions caused by mistakes
Init script Selection Script executed when the server starts - The init script must be written as a Batch script for Windows, a Shell script for Linux, or cloud‑init, depending on the image type.
- Up to 45,000 bytes can be entered
tag Selection Add Tag - Up to 50 can be added per resource
- After clicking the Add Tag button, enter or select Key, Value values
Table. GPU Server additional information input fields
- Select the required information in the Image and Version Selection area.
Summary Check the detailed information and estimated billing amount generated in the panel, and click the Create button.
- Once creation is complete, check the created resources on the GPU Server List page.
Check GPU Server detailed information
GPU Server service allows you to view and edit the full resource list and detailed information. GPU Server Details page consists of Details, Tags, Job History tabs.
To view detailed information about the GPU Server service, follow these steps.
- Click the All Services > Compute > GPU Server menu. You will be taken to the Service Home page of GPU Server.
- On the Service Home page, click the GPU Server menu. You will be taken to the GPU Server List page.
- On the GPU Server List page, click the resource you want to view details for. You will be taken to the GPU Server Details page.
- GPU Server Details page displays status information and additional feature information, and is composed of Details, Tags, Activity History tabs.
- GPU Server Additional Features for detailed information, please refer to GPU Server Management Additional Features.
Category Detailed description GPU Server status Status of user-created GPU Server - Build: State where the Build command has been received
- Building: Build in progress
- Networking: Server creation networking process
- Scheduling: Server creation scheduling process
- Block_Device_Mapping: Connecting Block Storage during server creation
- Spawning: Server creation process is ongoing
- Active: Available state
- Powering_off: State when a stop request is made
- Deleting: Server deletion in progress
- Reboot_Started: Reboot in progress
- Error: Error state
- Migrating: Server migrating to another host
- Reboot: State where the Reboot command has been received
- Rebooting: Rebooting
- Rebuild: State where the Rebuild command has been received
- Rebuilding: State during Rebuild request
- Rebuild_Spawning: Rebuild process is ongoing
- Resize: State where the Resize command has been received
- Resizing: Resizing in progress
- Resize_Prep: State when a server type change is requested
- Resize_Migrating: Server is migrating to another host while resizing
- Resize_Migrated: Server has completed migration to another host during resizing
- Resize_Finish: Resize completed
- Revert_Resize: Resize or migration of the server failed for some reason. The target server is cleaned up and the original server is restarted
- Shutoff: State when powering off is completed
- Verity_Resize: After Resize_Prep following a server type change request, the server type is confirmed or can be reverted
- Resize_Reverting: State when a server type revert is requested
- Resize_Confirming: State confirming the server’s Resize request
Server control Buttons to change server status - Start: start a stopped server
- Stop: stop a running server
- Restart: restart a running server
Image generation Create a custom user image from the current server image Console log View console logs of the current server - You can view the console logs output by the current server. For more details, see Check console logs.
Create dump Generate a dump of the current server - The dump file is created inside the GPU Server
- For detailed dump creation instructions, refer to Create Dump
Rebuild All data and settings of the existing server are deleted, and a new server is configured - For detailed information, see Execute Rebuild.
Service termination Cancel service button Table. GPU Server status information and additional features
Detailed Information
GPU Server List page lets you view detailed information of the selected resource and edit it if needed.
| Category | Detailed description |
|---|---|
| service | Service name |
| Resource Type | Resource Type |
| SRN | Unique resource ID in Samsung Cloud Platform
|
| Resource name | Resource Name
|
| Resource ID | Unique resource ID in the service |
| constructor | User who created the service |
| Creation date and time | Service creation date and time |
| editor | User who edited the service information |
| Modification date | Date and time the service information was modified |
| Server name | Server name |
| Server type | vCPU, memory, GPU information display
|
| image name | Service OS image and version |
| Lock | Indicates whether Lock is used or not
|
| Keypair name | Server authentication information set by the user |
| Planned Compute | Resource status with Planned Compute configured
|
| LLM Endpoint | URL for using LLM
|
| ServiceWatch Detailed Monitoring | When enabled, data monitoring is possible in the ServiceWatch service
|
| Network | Network information of the GPU Server
|
| Local Subnet | Local Subnet information of the GPU Server
|
| Block Storage | Information of Block Storage attached to the server
|
tag
GPU Server List page allows you to view the tag information of the selected resource, and you can add, modify, or delete it.
| Category | Detailed description |
|---|---|
| Tag list | Tag list
|
Job History
You can view the job history of the selected resource on the GPU Server List page.
| Category | Detailed description |
|---|---|
| Task History List | Resource Change History
|
Control GPU Server Operation
If you need to control the operation of a created GPU Server resource, you can perform the task on the GPU Server List or GPU Server Details page. You can start, stop, and restart a running server.
Getting Started with GPU Server
You can start a shutoff GPU Server. To start a GPU Server, follow the steps below.
- Click the All Services > Compute > GPU Server menu. You will be taken to the Service Home page of the GPU Server.
- On the Service Home page, click the GPU Server menu. You will be taken to the GPU Server List page.
- On the GPU Server List page, click the resource you want to start among the shutoff servers, and navigate to the GPU Server Details page.
- GPU Server List page allows you to Start each resource via the right More button.
- After selecting multiple servers with checkboxes, you can control multiple servers simultaneously using the Start button at the top.
- GPU Server Details page, click the Start button at the top to start the server. In the Status Display section, verify the updated server status.
- When the GPU Server startup is complete, the server status changes from Shutoff to Active.
- For detailed information about the GPU Server status, refer to GPU Server detailed information.
Stopping the GPU Server
You can stop a running (Active) GPU Server. To stop the GPU Server, follow the steps below.
- Click the All Services > Compute > GPU Server menu. You will be taken to the Service Home page of GPU Server.
- On the Service Home page, click the GPU Server menu. You will be taken to the GPU Server List page.
- On the GPU Server List page, click the resource to stop among the servers that are running (Active), and navigate to the GPU Server Details page.
- On the GPU Server List page, you can Stop each resource via the right More button.
- After selecting multiple servers with checkboxes, you can control multiple servers simultaneously using the Stop button at the top.
- GPU Server Details page, click the Stop button at the top to start the server. Check the changed server status in the Status Display item.
- When the GPU server shutdown is complete, the server status changes from Active to Shutoff.
- For detailed information about the GPU Server status, please refer to GPU Server Detailed Information.
Restart GPU Server
You can restart the created GPU Server. To restart the GPU Server, follow the steps below.
- Click the All Services > Compute > GPU Server menu. You will be taken to the Service Home page of GPU Server.
- On the Service Home page, click the GPU Server menu. You will be taken to the GPU Server List page.
- On the GPU Server List page, click the resource to restart and go to the GPU Server Details page.
- On the GPU Server list page, you can restart each resource using the right More button.
- After selecting multiple servers with checkboxes, you can control multiple servers simultaneously using the Restart button at the top.
- GPU Server Details on the page, click the Restart button at the top to start the server. Check the updated server status in the Status Display field.
- During a GPU Server restart, the server status goes through Rebooting and finally changes to Active.
- For detailed information about the GPU Server status, refer to GPU Server detailed information.
GPU Server resource management
If you need server control and management functions for the created GPU Server resources, you can perform the tasks on the GPU Server Resource List or GPU Server Details page.
Create Image
You can create an image of a running GPU server.
This guide explains how to create a custom user image from the image of a running GPU server.
- On the GPU Server List or GPU Server Details page, click the Create Image button to create a user Custom Image.
To create an image of the GPU Server, follow these steps.
Click the All Services > Compute > GPU Server menu. Navigate to the Service Home page of the GPU Server.
On the Service Home page, click the GPU Server menu. You will be taken to the GPU Server List page.
On the GPU Server List page, click the resource to create an Image. You will be taken to the GPU Server Details page.
Click the Create Image button on the GPU Server Details page. You will be taken to the Image Creation page.
- Enter the required information in the Service Information Input area.
Category Required statusDetailed description image name Required Enter the name of the image to be created - using English letters, numbers, spaces, and special characters (
-_) within 200 characters
Table. Image service information input fields - using English letters, numbers, spaces, and special characters (
- Enter the required information in the Service Information Input area.
Check the input information and click the Create button.
- When creation is complete, check the created resources on the All Services > Compute > GPU Server > Image List page.
- When you create an Image, the generated Image is stored in the Object Storage used as internal storage. Therefore, Object Storage usage fees are applied.
- Since the file system of an image created from an active GPU server cannot be guaranteed to be intact, it is recommended to stop the server before creating the image.
Enabling detailed monitoring for ServiceWatch
By default, the GPU Server is linked to the basic monitoring of the ServiceWatch and Virtual Server namespaces. You can enable detailed monitoring as needed to identify operational issues more quickly and take action. For detailed information about ServiceWatch, see ServiceWatch Overview.
To enable detailed ServiceWatch monitoring on the GPU Server, follow these steps.
- Click the All Services > Compute > GPU Server menu. You will be taken to the Service Home page of GPU Server.
- On the Service Home page, click the GPU Server menu. You will be taken to the GPU Server List page.
- On the GPU Server List page, click the resource to enable ServiceWatch detailed monitoring. You will be taken to the GPU Server Details page.
- On the GPU Server Details page, click the ServiceWatch detailed monitoring Edit button. You will be taken to the ServiceWatch Detailed Monitoring Edit popup.
- ServiceWatch Detailed Monitoring Edit In the popup window, select Enable, review the instructions, and click the Confirm button.
- On the GPU Server Details page, check the ServiceWatch detailed monitoring items.
Disable detailed monitoring of ServiceWatch
To disable detailed monitoring of ServiceWatch on the GPU Server, follow these steps.
- Click the All Services > Compute > GPU Server menu. You will be taken to the Service Home page of GPU Server.
- On the Service Home page, click the GPU Server menu. You will be taken to the GPU Server List page.
- GPU Server List page, click the resource to disable ServiceWatch detailed monitoring. Navigate to the GPU Server Details page.
- GPU Server Details page, click the ServiceWatch detailed monitoring Edit button. You will be taken to the ServiceWatch detailed monitoring Edit popup.
- ServiceWatch Detailed Monitoring Edit In the popup window, after deselecting Enabled, review the guidance message and click the Confirm button.
- On the GPU Server Details page, view the ServiceWatch detailed monitoring items.
GPU Server Management Additional Features
GPU Server can view Console logs, generate Dumps, and perform Rebuilds for server management. To view Console logs, generate Dumps, or Rebuild a GPU Server, follow the steps below.
Check console log
You can view the current console log of the GPU Server.
To view the console logs of the GPU server, follow these steps.
- Click the All Services > Compute > GPU Server menu. You will be taken to the GPU Server’s Service Home page.
- On the Service Home page, click the GPU Server menu. You will be taken to the GPU Server List page.
- On the GPU Server List page, click the resource to view the console log. You will be taken to the GPU Server Details page.
- On the GPU Server Details page, click the Console Log button. You will be taken to the Console Log popup.
- Console Log Verify the console log displayed in the popup window.
Create Dump
To create a dump file on the GPU Server, follow these steps.
- Click the All Services > Compute > GPU Server menu. You will be taken to the Service Home page of GPU Server.
- On the Service Home page, click the GPU Server menu. You will be taken to the GPU Server List page.
- GPU Server List page, click the resource to view detailed information. You will be taken to the GPU Server Details page.
- On the GPU Server Details page, click the Create Dump button.
- The dump file is created inside the GPU server.
Perform Rebuild
You can delete all data and settings of the existing GPU Server and rebuild it on a new server.
Follow these steps to perform a Rebuild of the GPU Server.
- Click the All Services > Compute > GPU Server menu. You will be taken to the Service Home page of GPU Server.
- On the Service Home page, click the GPU Server menu. You will be taken to the GPU Server List page.
- GPU Server List page, click the resource to perform Rebuild. GPU Server Details page will be displayed.
- On the GPU Server Details page, click the Rebuild button.
- During a GPU Server Rebuild, the server status changes to Rebuilding, and when the Rebuild is complete, it returns to its state before the Rebuild.
- For detailed information about the GPU Server status, refer to Check GPU Server details.
Terminate GPU Server
If you terminate an unused GPU Server, you can reduce operating costs. However, terminating a GPU Server may cause the running service to stop immediately, so you should carefully consider the impact of service interruption before proceeding with the termination.
To cancel the GPU Server, follow the steps below.
- Click the All Services > Compute > GPU Server menu. Go to the GPU Server’s Service Home page.
- On the Service Home page, click the GPU Server menu. You will be taken to the GPU Server List page.
- On the GPU Server List page, select the resource to cancel and click the Cancel Service button.
- The termination of attached storage depends on the Delete on termination setting, so refer to Termination constraints.
- After termination is complete, check on the GPU Server List page whether the resources have been terminated.
Cancellation constraints
When a GPU Server termination request cannot be processed, a popup will provide guidance. Please refer to the cases below.
- If File Storage is connected please disconnect the File Storage connection first.
- When the LB Pool is connected, please disconnect the LB Pool connection first.
- If Lock is set please change the Lock setting to disabled and try again.
Termination of attached storage depends on the Delete on termination setting.
- Whether the volume is deleted also depends on the Delete on termination setting.
- Delete on termination If not set: Even if you terminate the GPU Server, the volume will not be deleted.
- When Delete on termination is set: If you terminate the GPU Server, the associated volume will be deleted.
- Volumes that have snapshots will not be deleted even if Delete on termination is set.
- A Multi‑attach volume can be deleted only when the server being deleted is the last remaining server attached to the volume.
3.2.1 - Manage Image
Users can create the service by entering the required information for the Image service within the GPU Server service and selecting detailed options through the Samsung Cloud Platform Console.
Create Image
You can create an image of a running GPU Server. To create an image of a GPU Server, please refer to Image Creation.
Check Image detailed information
Image service allows you to view and edit the full resource list and detailed information. The Image Details page consists of Detailed Information, Tags, Operation History tabs.
To view detailed information of the Image service, follow these steps.
- Click the All Services > Compute > GPU Server menu. You will be taken to the Service Home page of GPU Server.
- On the Service Home page, click the Image menu. You will be taken to the Image list page.
- On the Image List page, click the resource to view detailed information. You will be taken to the Image Detail page.
- Image Details page displays status information and additional feature information, and consists of Details, Tags, Activity Log tabs.
Category Detailed description Image status Status of user-created Image - Active: Available state
- Queued: Image has been uploaded and is waiting for processing after creation
- Importing: Image has been uploaded and is currently being processed after creation
Share with another account Image can be shared with another Account - The Image’s Visibility must be set to Shared in order to be shared with another Account
Delete image Button to delete the Image - Once the Image is deleted, it cannot be restored
Table. GPU Server Image status information and additional features
- Image Details page displays status information and additional feature information, and consists of Details, Tags, Activity Log tabs.
Detailed Information
Image list page lets you view detailed information of the selected resource and modify it if necessary.
| Category | Detailed description |
|---|---|
| service | Service name |
| Resource Type | Resource Type |
| SRN | Unique resource ID in Samsung Cloud Platform
|
| Resource name | Image name |
| Resource ID | Image ID |
| constructor | User who created the Image |
| Creation date and time | Image creation timestamp |
| editor | User who edited the Image |
| Modification date | Image modification timestamp |
| image name | Image name |
| Minimum disk | Minimum disk capacity (GB) of the Image
|
| Minimum RAM | Minimum RAM size (GB) of the Image |
| OS type | OS type of the image |
| OS hash algorithm | OS hash algorithm method |
| Visibility | Display access permissions for the image
|
| Protected | Select whether image deletion is prohibited
|
| image file URL | Image file URL uploaded when generating an image
|
| Sharing status | Current status of sharing images with another Account
|
tag
On the Image List page, you can view the tag information of the selected resource and add, modify, or delete it.
| Category | Detailed description |
|---|---|
| Tag list | Tag list
|
Job History
You can view the operation history of the selected resource on the Image List page.
| Category | Detailed description |
|---|---|
| Task History List | Resource Change History
|
Image Resource Management
Describes the control and management functions of the generated Image.
Share to another Account
To share an Image with another Account, follow the steps below.
- Log in to the account to be shared and click the All Services > Compute > GPU Server menu. Go to the GPU Server’s Service Home page.
- On the Service Home page, click the Image menu. You will be taken to the Image List page.
- On the Image List page, click the Image you want to control. You will be taken to the Image Details page.
- Click the Share to another Account button. Navigate to the Share image to another Account page.
- Share with another Account feature allows you to share an Image with another Account. To share an Image with another Account, the Image’s Visibility must be Shared.
- Share image to another Account page, enter the required information, and click the Done button.
Category RequiredDetailed description image name - Name of the image to share - Input not allowed
Image ID - Shareable image ID - Input not allowed
Shared Account ID Required Enter another Account ID to share - English letters, numbers, special characters
-within 64 characters
Table. Required input fields for sharing images to another Account - You can view the information in the sharing status of the Image Details page.
- When the request is first made, the status is Pending, and it changes to Accepted once approval is completed by the account receiving the share.
Receive sharing from another Account
To receive an Image shared from another Account, follow these steps.
- Log in to the account to be shared and click the All Services > Compute > GPU Server menu. Navigate to the GPU Server’s Service Home page.
- On the Service Home page, click the Image menu. You will be taken to the Image List page.
- On the Image List page, click the Receive Image Share button. You will be taken to the Receive Image Share popup.
- Receive Image Sharing In the popup window, enter the Image’s resource ID you want to receive, and click the Confirm button.
- When image sharing is complete, you can view the shared Image in the Image list.
Delete Image
You can delete unused Images. However, since an Image cannot be recovered after deletion, you should carefully consider the impact before performing the deletion.
To delete the Image, follow these steps.
- Click the All Services > Compute > GPU Server menu. Go to the GPU Server’s Service Home page.
- On the Service Home page, click the Image menu. You will be taken to the Image List page.
- Image list page, select the resource to delete, and click the Delete button.
- On the Image List page, select multiple Image check boxes and click the Delete button at the top of the resource list.
- After deletion is complete, verify on the Image list page that the resource has been removed.
3.2.2 - Manage Keypair
Users can create the service by entering the required Keypair information within the GPU Server service and selecting detailed options through the Samsung Cloud Platform Console.
Create a Keypair
You can create and use the Keypair service while using the GPU Server service in the Samsung Cloud Platform Console.
To create a keypair, follow these steps.
- Click the All Services > Compute > GPU Server menu. You will be taken to the Service Home page of GPU Server.
- On the Service Home page, click the Keypair menu. You will be taken to the Keypair List page.
- On the Keypair List page, click the Keypair Create button. You will be taken to the Keypair Create page.
- Enter the required information in the Service Information Input area.
Category RequiredDetailed description Keypair name Required Enter the name of the Keypair to create - using English letters, numbers, spaces, and special characters (
-,_) within 255 characters
Keypair type Required ssh Table. Keypair service information input fields - using English letters, numbers, spaces, and special characters (
- Additional Information Input area, please enter or select the required information.
Category Required statusDetailed description tag Selection Add Tag - Up to 50 can be added per resource
- After clicking the Add Tag button, enter or select Key, Value values
Table. Keypair additional information input fieldsCaution- After creation is complete, you can download the Key only once. Since reissuance is not possible, make sure it has been downloaded.
- Store the downloaded Private Key in a safe place.
- Enter the required information in the Service Information Input area.
- Check the input information and click the Create button.
- After creation is complete, check the created resources on the Keypair List page.
View detailed information of the Keypair
The Keypair service allows you to view and edit the full resource list and detailed information. Keypair Details page consists of Details, Tags, Activity Log tabs.
To view detailed information about a keypair, follow these steps.
- Click the All Services > Compute > GPU Server menu. You will be taken to the Service Home page of GPU Server.
- On the Service Home page, click the Keypair menu. You will be taken to the Keypair List page.
- On the Keypair List page, click the resource to view its details. You will be taken to the Keypair Details page.
- Keypair Details page displays status information and additional feature information, and consists of Details, Tags, Activity Log tabs.
Detailed Information
Keypair List page allows you to view detailed information of the selected resource and edit the information if needed.
| Category | Detailed description |
|---|---|
| service | Service name |
| Resource Type | Resource Type |
| SRN | Unique resource ID in Samsung Cloud Platform
|
| Resource name | Keypair name |
| Resource ID | Keypair’s unique resource ID |
| constructor | User who created the keypair |
| Creation date and time | Keypair creation timestamp |
| editor | User who modified the keypair information |
| Modification date | Date and time the keypair information was modified |
| Keypair name | Keypair name |
| Fingerprint | A unique value for identifying the key |
| User ID | User ID of the keypair creator |
| public key | Public key information |
tag
On the Keypair List page, you can view the tag information of the selected resource, and add, modify, or delete it.
| Category | Detailed description |
|---|---|
| Tag list | Tag list
|
Job History
On the Keypair List page, you can view the operation history of the selected resource.
| Category | Detailed description |
|---|---|
| Task History List | Resource Change History
|
Keypair Resource Management
Describes the control and management functions of a keypair.
Get public key
To retrieve the public key, follow these steps.
Click the All Services > Compute > GPU Server menu. You will be taken to the Service Home page of GPU Server.
On the Service Home page, click the Keypair menu. You will be taken to the Keypair List page.
On the Keypair List page, click the More button at the top and then click the Import Public Key button. You will be taken to the Import Public Key page.
- Enter or select the required information in the Required Information Input area.
Category RequiredDetailed description Keypair name Required Name of the Keypair to create Keypair type Required ssh public key Required Enter public key - Load file: Select the Attach file button to attach the public key file
- Only files with the following extension (.pem) can be attached
- Enter public key: Paste the copied public key value
- The public key value can be copied from the Keypair Details page
Table. Required input fields for retrieving the public key - Load file: Select the Attach file button to attach the public key file
- Enter or select the required information in the Required Information Input area.
Review the entered information and click the Complete button.
- Once creation is complete, check the created resources on the Keypair List page.
Delete Keypair
You can delete unused Keypairs. However, once a Keypair is deleted it cannot be recovered, so please review the impact thoroughly beforehand before proceeding with deletion.
To delete a keypair, follow these steps.
- Click the All Services > Compute > GPU Server menu. Go to the Service Home page of GPU Server.
- On the Service Home page, click the Keypair menu. You will be taken to the Keypair List page.
- On the Keypair List page, select the resource to delete, and click the Delete button.
- On the Keypair List page, select multiple Keypair check boxes and click the Delete button at the top of the resource list.
- After deletion is complete, check the Keypair List page to confirm that the resource has been removed.
3.2.3 - Use Multi-instance GPU on GPU Server
After creating a GPU Server, you can enable the MIG (Multi-instance GPU) feature on the GPU Server’s VM (Guest OS) and create an instance for use.
NVIDIA Multi-instance GPU Introduction
NVIDIA Multi-instance GPU (hereafter referred to as MIG) supports safely partitioning a GPU into GPU instances and running CUDA applications starting with the NVIDIA Ampere architecture. Through this, multiple users can each utilize different GPU resources to achieve optimal GPU utilization. This feature is especially useful for workloads that do not fully utilize the GPU’s computing capacity, and users can run multiple workloads in parallel to maximize utilization.
Using Multi-instance GPU feature
To use the MIG feature, create an NVIDIA GPU Server on the Samsung Cloud Platform, then enable and disable MIG. The order of applying and removing MIG is as follows.
- MIG can be used on Samsung Cloud Platform’s next-generation GPU Server or MNGC (Multi-node GPU Cluster).
- For system requirements to use MIG, refer to the NVIDIA Multi-Instance GPU User Guide.
Applying and Using MIG
After activating the MIG and creating an Instance to assign tasks, the tasks proceed in the following order.
Activate MIG
Check the GPU status on the VM Instance (GuestOS) before applying MIG.
- Check whether MIG mode is Disabled.Color mode
$ nvidia-smi Mon Sep 27 08:37:08 2021 +-----------------------------------------------------------------------------+ | NVIDIA-SMI 470.57.02 Driver Version: 470.57.02 CUDA Version: 11.4 | | -------------------------------+----------------------+---------------------- | | GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. | | | | MIG M. | | ===============================+======================+====================== | | 0 NVDIA A100-SXM... Off | 00000000:05:00.0 Off | 0 | | N/A 32C P0 59W / 400W | 0MiB / 81251MiB | 0% Default | | | | Disabled | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | | ============================================================================= | | No running processes found | +-----------------------------------------------------------------------------+$ nvidia-smi Mon Sep 27 08:37:08 2021 +-----------------------------------------------------------------------------+ | NVIDIA-SMI 470.57.02 Driver Version: 470.57.02 CUDA Version: 11.4 | | -------------------------------+----------------------+---------------------- | | GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. | | | | MIG M. | | ===============================+======================+====================== | | 0 NVDIA A100-SXM... Off | 00000000:05:00.0 Off | 0 | | N/A 32C P0 59W / 400W | 0MiB / 81251MiB | 0% Default | | | | Disabled | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | | ============================================================================= | | No running processes found | +-----------------------------------------------------------------------------+Code block. nvidia-smi command - Check GPU disabled status (1) Color mode$ nvidia-smi –L GPU 0: NVIDIA A100-SXM-80GB (UUID: GPU-c956838f-494a-92b2-6818-56eb28fe25e0)$ nvidia-smi –L GPU 0: NVIDIA A100-SXM-80GB (UUID: GPU-c956838f-494a-92b2-6818-56eb28fe25e0)Code block. nvidia-smi command - Check GPU disabled status (2)
- Check whether MIG mode is Disabled.
Enable MIG (Enable) for each GPU on the VM Instance (GuestOS) and reboot the VM Instance.
Color mode$ nvidia-smi –I 0 –mig 1 Enabled MIG mode for GPU 00000000:05:00.0 All done. # reboot$ nvidia-smi –I 0 –mig 1 Enabled MIG mode for GPU 00000000:05:00.0 All done. # rebootCode block. nvidia-smi command - enable MIG
When using a GPU and configuring MIG, you may encounter the following warning message. If the warning appears, check whether any programs are running on the GPU.
Warning: MIG mode is in pending enable state for GPU 00000000:05:00.0: In use by another client. 00000000:05:00.0 is currently being used by one or more other processes (e.g. CUDA application or a monitoring application such as another instance of nvidia-smi).
- Check the GPU status after applying MIG on the VM Instance(GuestOS).
- Check whether MIG mode is Enabled.Color mode
$ nvidia-smi Mon Sep 27 09:44:33 2021 +-----------------------------------------------------------------------------+ | NVIDIA-SMI 470.57.02 Driver Version: 470.57.02 CUDA Version: 11.4 | | -------------------------------+----------------------+---------------------- | | GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. | | | | MIG M. | | ===============================+======================+====================== | | 0 NVDIA A100-SXM... Off | 00000000:05:00.0 Off | On | | N/A 32C P0 59W / 400W | 0MiB / 81251MiB | 0% Default | | | | Enabled | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | MIG devices: | +-----------------------------------------------------------------------------+ | GPU GI CI MIG | Memory-Usage | Vol | Shared | | ID ID Dev | BAR1-Usage | SM Unc | CE ENC DEC OFA JPG | | | | ECC | | | ============================================================================= | | No MIG devices found | +-----------------------------------------------------------------------------+ +-----------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | | ============================================================================= | | No running processes found | +-----------------------------------------------------------------------------+$ nvidia-smi Mon Sep 27 09:44:33 2021 +-----------------------------------------------------------------------------+ | NVIDIA-SMI 470.57.02 Driver Version: 470.57.02 CUDA Version: 11.4 | | -------------------------------+----------------------+---------------------- | | GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. | | | | MIG M. | | ===============================+======================+====================== | | 0 NVDIA A100-SXM... Off | 00000000:05:00.0 Off | On | | N/A 32C P0 59W / 400W | 0MiB / 81251MiB | 0% Default | | | | Enabled | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | MIG devices: | +-----------------------------------------------------------------------------+ | GPU GI CI MIG | Memory-Usage | Vol | Shared | | ID ID Dev | BAR1-Usage | SM Unc | CE ENC DEC OFA JPG | | | | ECC | | | ============================================================================= | | No MIG devices found | +-----------------------------------------------------------------------------+ +-----------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | | ============================================================================= | | No running processes found | +-----------------------------------------------------------------------------+Code block. nvidia-smi command - Check GPU activation status (1) Color mode$ nvidia-smi –L GPU 0: NVIDIA A100-SXM-80GB (UUID: GPU-c956838f-494a-92b2-6818-56eb28fe25e0)$ nvidia-smi –L GPU 0: NVIDIA A100-SXM-80GB (UUID: GPU-c956838f-494a-92b2-6818-56eb28fe25e0)Code block. nvidia-smi command - Check GPU activation status (2)
- Check whether MIG mode is Enabled.
GPU Instance creation
If you have enabled MIG and verified its status, you can create a GPU Instance.
- Check the list of MIG GPU Instance profiles that can be created.Color mode
$ nvidia-smi mig -i [GPU ID] -lgip$ nvidia-smi mig -i [GPU ID] -lgipCode block. nvidia-smi command - view MIG GPU Instance profile list
$ nvidia-smi mig -i 0 -lgip
+-----------------------------------------------------------------------------+
| GPU instance profiles: |
| GPU Name ID Instances Memory P2P SM DEC ENC |
| Free/Total GiB CE JPEG OFA |
| ============================================================================= |
| 0 MIG 1g.10gb 19 7/7 9.50 No 14 0 0 |
| 1 0 0 |
+-----------------------------------------------------------------------------+
| 0 MIG 1g.10gb+me 20 1/1 9.50 No 14 0 0 |
| 1 1 1 |
+-----------------------------------------------------------------------------+
| 0 MIG 2g.20gb 14 3/3 19.50 No 28 1 0 |
| 2 0 0 |
+-----------------------------------------------------------------------------+
| 0 MIG 3g.40gb 9 2/2 39.50 No 42 2 0 |
| 3 0 0 |
+-----------------------------------------------------------------------------+
| 0 MIG 4g.40gb 5 1/1 39.50 No 56 2 0 |
| 4 0 0 |
+-----------------------------------------------------------------------------+
| 0 MIG 7g.80gb 0 1/1 79.25 No 98 0 0 |
| 7 1 1 |
+-----------------------------------------------------------------------------+$ nvidia-smi mig -i 0 -lgip
+-----------------------------------------------------------------------------+
| GPU instance profiles: |
| GPU Name ID Instances Memory P2P SM DEC ENC |
| Free/Total GiB CE JPEG OFA |
| ============================================================================= |
| 0 MIG 1g.10gb 19 7/7 9.50 No 14 0 0 |
| 1 0 0 |
+-----------------------------------------------------------------------------+
| 0 MIG 1g.10gb+me 20 1/1 9.50 No 14 0 0 |
| 1 1 1 |
+-----------------------------------------------------------------------------+
| 0 MIG 2g.20gb 14 3/3 19.50 No 28 1 0 |
| 2 0 0 |
+-----------------------------------------------------------------------------+
| 0 MIG 3g.40gb 9 2/2 39.50 No 42 2 0 |
| 3 0 0 |
+-----------------------------------------------------------------------------+
| 0 MIG 4g.40gb 5 1/1 39.50 No 56 2 0 |
| 4 0 0 |
+-----------------------------------------------------------------------------+
| 0 MIG 7g.80gb 0 1/1 79.25 No 98 0 0 |
| 7 1 1 |
+-----------------------------------------------------------------------------+- After creating a MIG GPU Instance, check it.
Create GPU Instance
Color mode$ nvidia-smi mig -i [GPU ID] -cgi [Profile ID]$ nvidia-smi mig -i [GPU ID] -cgi [Profile ID]code block. nvidia-smi command - GPU Instance creation Color mode$ nvidia-smi mig -i 0 -cgi 0 Successfully created GPU instance ID 0 on GPU 0 using profile MIG 7g.80gb (ID 0)$ nvidia-smi mig -i 0 -cgi 0 Successfully created GPU instance ID 0 on GPU 0 using profile MIG 7g.80gb (ID 0)Code block. nvidia-smi command - Example of creating a GPU Instance Check GPU Instance
Color mode$ nvidia-smi mig -i [GPU ID] -lgi$ nvidia-smi mig -i [GPU ID] -lgicode block. nvidia-smi command - check GPU Instance Color mode$ nvidia-smi mig -i 0 -lgi +--------------------------------------------------------+ | GPU instances: | | GPU Name Profile Instance Placement | | ID ID Start:Size | | ======================================================== | | 0 MIG 7g.80gb 0 0 0:8 | +--------------------------------------------------------+$ nvidia-smi mig -i 0 -lgi +--------------------------------------------------------+ | GPU instances: | | GPU Name Profile Instance Placement | | ID ID Start:Size | | ======================================================== | | 0 MIG 7g.80gb 0 0 0:8 | +--------------------------------------------------------+Code block. nvidia-smi command - example of checking GPU Instance
Compute Instance creation
If you have created a GPU Instance, you can create a Compute Instance.
Check the MIG Compute Instance profiles you can create.
Color mode$ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -lcip$ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -lcipCode block. nvidia-smi command - Check MIG Compute Instance profile Color mode$ nvidia-smi mig -i 0 -gi 0 -lcip +---------------------------------------------------------------------------------+ | Compute instance profiles: | | GPU GPU Name Profile Instances Exclusive Shared | | GPU Instance ID Free/Total SM DEC ENC OFA | | ID CE JPEG | | ================================================================================= | | 0 0 MIG 1c.7g.80gb 0 7/5 14 5 0 1 | | 7 1 | +---------------------------------------------------------------------------------+ | 0 0 MIG 2c.7g.80gb 1 3/3 28 5 0 1 | | 7 1 | +---------------------------------------------------------------------------------+ | 0 0 MIG 3c.7g.80gb 2 2/2 42 5 0 1 | | 7 1 | +---------------------------------------------------------------------------------+ | 0 0 MIG 4c.7g.80gb 3 1/1 56 5 0 1 | | 7 1 | +---------------------------------------------------------------------------------+ | 0 0 MIG 7g.80gb 4* 1/1 98 5 0 1 | | 7 1 | +---------------------------------------------------------------------------------+$ nvidia-smi mig -i 0 -gi 0 -lcip +---------------------------------------------------------------------------------+ | Compute instance profiles: | | GPU GPU Name Profile Instances Exclusive Shared | | GPU Instance ID Free/Total SM DEC ENC OFA | | ID CE JPEG | | ================================================================================= | | 0 0 MIG 1c.7g.80gb 0 7/5 14 5 0 1 | | 7 1 | +---------------------------------------------------------------------------------+ | 0 0 MIG 2c.7g.80gb 1 3/3 28 5 0 1 | | 7 1 | +---------------------------------------------------------------------------------+ | 0 0 MIG 3c.7g.80gb 2 2/2 42 5 0 1 | | 7 1 | +---------------------------------------------------------------------------------+ | 0 0 MIG 4c.7g.80gb 3 1/1 56 5 0 1 | | 7 1 | +---------------------------------------------------------------------------------+ | 0 0 MIG 7g.80gb 4* 1/1 98 5 0 1 | | 7 1 | +---------------------------------------------------------------------------------+Code block. Example of MIG Compute Instance profile list Create and verify a MIG Compute Instance.
- MIG Compute Instance creationColor mode
$ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -cci [Compute Profile ID]$ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -cci [Compute Profile ID]code block. nvidia-smi command - Create MIG Compute Instance Color mode$ nvidia-smi mig -i 0 -gi 0 -cci 4 Successfully created compute instance ID 0 on GPU instance ID 0 using profile MIG 7g.80gb(ID 4)$ nvidia-smi mig -i 0 -gi 0 -cci 4 Successfully created compute instance ID 0 on GPU instance ID 0 using profile MIG 7g.80gb(ID 4)Code block. nvidia-smi command - Example of creating a MIG Compute Instance - Check MIG Compute InstanceColor mode
$ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] –lci$ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] –lcicode block. nvidia-smi command - check MIG Compute Instance Color mode$ nvidia-smi mig -i 0 -gi 0 –lci +-----------------------------------------------------------------+ | Compute instance profiles: | | GPU GPU Name Profile Instances Placement | | GPU Instance ID ID Start:Size | | ID | | ================================================================= | | 0 0 MIG 7g.80gb 4 0 0:7 | +-----------------------------------------------------------------+$ nvidia-smi mig -i 0 -gi 0 –lci +-----------------------------------------------------------------+ | Compute instance profiles: | | GPU GPU Name Profile Instances Placement | | GPU Instance ID ID Start:Size | | ID | | ================================================================= | | 0 0 MIG 7g.80gb 4 0 0:7 | +-----------------------------------------------------------------+Code block. Example of checking MIG Compute Instance Color mode$ nvidia-smi –L GPU 0: NVIDIA A100-SXM-80GB (UUID: GPU-c956838f-494a-92b2-6818-56eb28fe25e0) MIG 7g.80gb Device 0: (UUID: MIG-53e20040-758b-5ecb-948e-c626d03a9a32)$ nvidia-smi –L GPU 0: NVIDIA A100-SXM-80GB (UUID: GPU-c956838f-494a-92b2-6818-56eb28fe25e0) MIG 7g.80gb Device 0: (UUID: MIG-53e20040-758b-5ecb-948e-c626d03a9a32)Code block. nvidia-smi command - Check GPU status (1) Color mode$ nvidia-smi Mon Sep 27 09:52:17 2021 +-----------------------------------------------------------------------------+ | NVIDIA-SMI 470.57.02 Driver Version: 470.57.02 CUDA Version: 11.4 | | -------------------------------+----------------------+---------------------- | | GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. | | | | MIG M. | | ===============================+======================+====================== | | 0 NVDIA A100-SXM... Off | 00000000:05:00.0 Off | On | | N/A 32C P0 49W / 400W | 0MiB / 81251MiB | N/A Default | | | | Enabled | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | MIG devices: | +-----------------------------------------------------------------------------+ | GPU GI CI MIG | Memory-Usage | Vol | Shared | | ID ID Dev | BAR1-Usage | SM Unc | CE ENC DEC OFA JPG | | | | ECC | | | ============================================================================= | | 0 0 0 0 | 0MiB / 81251MiB | 98 0 | 7 0 5 1 1 | | | 1MiB / 13107... | | | +-----------------------------------------------------------------------------+ +-----------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | | ============================================================================= | | No running processes found | +-----------------------------------------------------------------------------+$ nvidia-smi Mon Sep 27 09:52:17 2021 +-----------------------------------------------------------------------------+ | NVIDIA-SMI 470.57.02 Driver Version: 470.57.02 CUDA Version: 11.4 | | -------------------------------+----------------------+---------------------- | | GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. | | | | MIG M. | | ===============================+======================+====================== | | 0 NVDIA A100-SXM... Off | 00000000:05:00.0 Off | On | | N/A 32C P0 49W / 400W | 0MiB / 81251MiB | N/A Default | | | | Enabled | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | MIG devices: | +-----------------------------------------------------------------------------+ | GPU GI CI MIG | Memory-Usage | Vol | Shared | | ID ID Dev | BAR1-Usage | SM Unc | CE ENC DEC OFA JPG | | | | ECC | | | ============================================================================= | | 0 0 0 0 | 0MiB / 81251MiB | 98 0 | 7 0 5 1 1 | | | 1MiB / 13107... | | | +-----------------------------------------------------------------------------+ +-----------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | | ============================================================================= | | No running processes found | +-----------------------------------------------------------------------------+Code block. nvidia-smi command - Check GPU status (2)
- MIG Compute Instance creation
Using MIG
- Use the MIG Instance to perform the Job.
- Example of task executionColor mode
$ docker run --gpus '"device=[GPU ID]:[MIG ID]"' -rm nvcr.io/nvidia/cuda nvidia-smi$ docker run --gpus '"device=[GPU ID]:[MIG ID]"' -rm nvcr.io/nvidia/cuda nvidia-smiCode block. Task execution example - You can see an example of the work performed as follows.Color mode
$ docker run --gpus '"device=0:0"' -rm -it --network=host --shm-size=1g --ipc=host -v /root/.ssh/:/root/.ssh ================ == TensorFlow == ================ NVIDIA Release 21.08-tf1 (build 26012104) TensorFlow Version 1.15.5 Container image Copyright (c) 2021, NVIDIA CORPORATION. All right reserved. ... # Run Python process root@d622a93c9281:/workspace# python /workspace/nvidia-examples/cnn/resnet.py --num_iter 100 ... PY 3.8.10 (default, Jun 2 2021, 10:49:15) [GCC 9.4.0] TF 1.15.5 ...$ docker run --gpus '"device=0:0"' -rm -it --network=host --shm-size=1g --ipc=host -v /root/.ssh/:/root/.ssh ================ == TensorFlow == ================ NVIDIA Release 21.08-tf1 (build 26012104) TensorFlow Version 1.15.5 Container image Copyright (c) 2021, NVIDIA CORPORATION. All right reserved. ... # Run Python process root@d622a93c9281:/workspace# python /workspace/nvidia-examples/cnn/resnet.py --num_iter 100 ... PY 3.8.10 (default, Jun 2 2021, 10:49:15) [GCC 9.4.0] TF 1.15.5 ...Code block. Operation result
- Example of task execution
- Check the GPU usage. (Create JOB process)
- When the job runs, you can see that a process is allocated to the MIG device and its utilization increases.Color mode
$ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -lcip$ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -lcipcode block. nvidia-smi command - check GPU utilization - You can check the GPU usage as shown below.Color mode
+-----------------------------------------------------------------------------+ | MIG devices: | +-----------------------------------------------------------------------------+ | GPU GI CI MIG | Memory-Usage | Vol | Shared | | ID ID Dev | BAR1-Usage | SM Unc | CE ENC DEC OFA JPG | | | | ECC | | | ============================================================================= | | 0 0 0 0 | 66562MiB / 81251MiB | 98 0 | 7 0 5 1 1 | | | 5MiB / 13107... | | | +-----------------------------------------------------------------------------+ +-----------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | | ============================================================================= | | 0 0 0 17483 C python 66559MiB | +-----------------------------------------------------------------------------++-----------------------------------------------------------------------------+ | MIG devices: | +-----------------------------------------------------------------------------+ | GPU GI CI MIG | Memory-Usage | Vol | Shared | | ID ID Dev | BAR1-Usage | SM Unc | CE ENC DEC OFA JPG | | | | ECC | | | ============================================================================= | | 0 0 0 0 | 66562MiB / 81251MiB | 98 0 | 7 0 5 1 1 | | | 5MiB / 13107... | | | +-----------------------------------------------------------------------------+ +-----------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | | ============================================================================= | | 0 0 0 17483 C python 66559MiB | +-----------------------------------------------------------------------------+Code block. Example of checking GPU utilization.
- When the job runs, you can see that a process is allocated to the MIG device and its utilization increases.
Delete and release MIG Instance
Follow these steps to delete the MIG instance and detach the MIG.
Compute Instance Delete
- Delete the Compute Instance.Color mode
$ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] –dci $ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -ci [Compute Instance] –dci$ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] –dci $ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -ci [Compute Instance] –dciCode block. nvidia-smi command - Delete Compute Instance Color mode$ nvidia-smi mig -i 0 -gi 0 –lci +-----------------------------------------------------------------+ | Compute instance profiles: | | GPU GPU Name Profile Instances Placement | | GPU Instance ID ID Start:Size | | ID | | ================================================================= | | 0 0 MIG 7g.80gb 4 0 0:7 | +-----------------------------------------------------------------+$ nvidia-smi mig -i 0 -gi 0 –lci +-----------------------------------------------------------------+ | Compute instance profiles: | | GPU GPU Name Profile Instances Placement | | GPU Instance ID ID Start:Size | | ID | | ================================================================= | | 0 0 MIG 7g.80gb 4 0 0:7 | +-----------------------------------------------------------------+code block. Example of checking MIG Compute Instance Color mode$ nvidia-smi mig -i 0 -gi 0 –dci Successfully destroyed compute instance ID 0 from GPU instance ID 0$ nvidia-smi mig -i 0 -gi 0 –dci Successfully destroyed compute instance ID 0 from GPU instance ID 0Code block. Compute Instance deletion example Color mode$ nvidia-smi mig -i 0 -gi 0 –lci No compute instances found: Not found$ nvidia-smi mig -i 0 -gi 0 –lci No compute instances found: Not foundCode block. Confirm Compute Instance deletion
Delete GPU Instance
- Delete the GPU Instance.Color mode
$ nvidia-smi mig -i [GPU ID] –dgi $ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] –dgi$ nvidia-smi mig -i [GPU ID] –dgi $ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] –dgiCode block. nvidia-smi command - Delete GPU Instance Color mode$ nvidia-smi mig -i 0 -lgi +--------------------------------------------------------+ | GPU instances: | | GPU Name Profile Instance Placement | | ID ID Start:Size | | ======================================================== | | 0 MIG 7g.80gb 0 0 0:8 | +--------------------------------------------------------+$ nvidia-smi mig -i 0 -lgi +--------------------------------------------------------+ | GPU instances: | | GPU Name Profile Instance Placement | | ID ID Start:Size | | ======================================================== | | 0 MIG 7g.80gb 0 0 0:8 | +--------------------------------------------------------+Code block. nvidia-smi command - Example of checking GPU Instance Color mode$ nvidia-smi mig -i 0 -dgi Successfully destroyed GPU instance ID 0 from GPU 0$ nvidia-smi mig -i 0 -dgi Successfully destroyed GPU instance ID 0 from GPU 0Code block. nvidia-smi command - GPU Instance deletion example Color mode$ nvidia-smi mig -i 0 -lgi No GPU instances found: Not found$ nvidia-smi mig -i 0 -lgi No GPU instances found: Not foundcode block. nvidia-smi command - example of deleting a GPU Instance
Disable MIG feature (deactivation)
- After disabling MIG (Disable), reboot.Color mode
$ nvidia-smi -mig 0 Disabled MIG Mode for GPU 00000000:05:00.0 All done.$ nvidia-smi -mig 0 Disabled MIG Mode for GPU 00000000:05:00.0 All done.Code block. nvidia-smi command - disable MIG Color mode$ nvidia-smi Mon Sep 30 05:18:28 2021 +-----------------------------------------------------------------------------+ | NVIDIA-SMI 470.57.02 Driver Version: 470.57.02 CUDA Version: 11.4 | | -------------------------------+----------------------+---------------------- | | GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. | | | | MIG M. | | ===============================+======================+====================== | | 0 NVDIA A100-SXM... Off | 00000000:05:00.0 Off | 0 | | N/A 33C P0 60W / 400W | 0MiB / 81251MiB | 0% Default | | | | Disabled | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | MIG devices: | +-----------------------------------------------------------------------------+ | GPU GI CI MIG | Memory-Usage | Vol | Shared | | ID ID Dev | BAR1-Usage | SM Unc | CE ENC DEC OFA JPG | | | | ECC | | | ============================================================================= | | No MIG devices found | +-----------------------------------------------------------------------------+ +-----------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | | ============================================================================= | | No running processes found | +-----------------------------------------------------------------------------+$ nvidia-smi Mon Sep 30 05:18:28 2021 +-----------------------------------------------------------------------------+ | NVIDIA-SMI 470.57.02 Driver Version: 470.57.02 CUDA Version: 11.4 | | -------------------------------+----------------------+---------------------- | | GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. | | | | MIG M. | | ===============================+======================+====================== | | 0 NVDIA A100-SXM... Off | 00000000:05:00.0 Off | 0 | | N/A 33C P0 60W / 400W | 0MiB / 81251MiB | 0% Default | | | | Disabled | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | MIG devices: | +-----------------------------------------------------------------------------+ | GPU GI CI MIG | Memory-Usage | Vol | Shared | | ID ID Dev | BAR1-Usage | SM Unc | CE ENC DEC OFA JPG | | | | ECC | | | ============================================================================= | | No MIG devices found | +-----------------------------------------------------------------------------+ +-----------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | | ============================================================================= | | No running processes found | +-----------------------------------------------------------------------------+Code block. nvidia-smi command - check GPU status
3.2.4 - Use NVSwitch on GPU Server
After creating a GPU Server, you can enable the NVSwitch feature on the GPU Server’s VM (Guest OS) and use fast GPU-to-GPU P2P communication.
Exploring NVIDIA NVSwitch for Multi GPU
NVLink expands I/O by directly connecting multiple GPUs within a server both bidirectionally and GPU-to-GPU. Using NVSwitch, you can connect all GPUs in a server with full NVLink bandwidth.
Checking NVSwitch operation
Check the NVIDIA Fabric Manager, NVIDIA NVLink topology, and NVIDIA NVLink Status on the GPU server.
NVIDIA Fabric Manager operating status
Verify that active (running) is displayed when operating normally.
~$ systemctl status nvidia-fabricmanager
nvidia-fabricmanager.service - NVIDIA fabric manager service
Loaded: loaded (/lib/systemd/system/nvidia-fabricmanager.service; enabled; vendor preset: enabled)
Active: active (running) since Mon 2026-02-02 16:23:27 KST; 32min ago
Main PID: 2191 (nv-fabricmanage)
Tasks: 18 (limit: 629145)
Memory: 18.0M
CPU: 33.461s
CGroup: /system.slice/nvidia-fabricmanager.service
└─2191 /usr/bin/nv-fabricmanager -c /usr/share/nvidia/nvswitch/fabricmanager.cfgnvidia-fabricmanager.service - NVIDIA fabric manager service
Loaded: loaded (/lib/systemd/system/nvidia-fabricmanager.service; enabled; vendor preset: enabled)
Active: active (running) since Mon 2026-02-02 16:23:27 KST; 32min ago
Main PID: 2191 (nv-fabricmanage)
Tasks: 18 (limit: 629145)
Memory: 18.0M
CPU: 33.461s
CGroup: /system.slice/nvidia-fabricmanager.service
└─2191 /usr/bin/nv-fabricmanager -c /usr/share/nvidia/nvswitch/fabricmanager.cfgCheck NVIDIA NVLink topology
Check the NVIDIA NVLink topology.
~$ nvidia-smi topo -m
nvidia-smi topo -m
GPU0 GPU1 GPU2 GPU3 GPU4 GPU5 GPU6 GPU7 CPU Affinity NUMA Affinity GPU NUMA ID
GPU0 X NV12 NV12 NV12 NV12 NV12 NV12 NV12 0-127 0-7 N/A
GPU1 NV12 X NV12 NV12 NV12 NV12 NV12 NV12 0-127 0-7 N/A
GPU2 NV12 NV12 X NV12 NV12 NV12 NV12 NV12 0-127 0-7 N/A
GPU3 NV12 NV12 NV12 X NV12 NV12 NV12 NV12 0-127 0-7 N/A
GPU4 NV12 NV12 NV12 NV12 X NV12 NV12 NV12 0-127 0-7 N/A
GPU5 NV12 NV12 NV12 NV12 NV12 X NV12 NV12 0-127 0-7 N/A
GPU6 NV12 NV12 NV12 NV12 NV12 NV12 X NV12 0-127 0-7 N/A
GPU7 NV12 NV12 NV12 NV12 NV12 NV12 NV12 X 0-127 0-7 N/A
Legend:
X = Self
SYS = Connection traversing PCIe as well as the SMP interconnect between NUMA nodes (e.g., QPI/UPI)
NODE = Connection traversing PCIe as well as the interconnect between PCIe Host Bridges within a NUMA node
PHB = Connection traversing PCIe as well as a PCIe Host Bridge (typically the CPU)
PXB = Connection traversing multiple PCIe bridges (without traversing the PCIe Host Bridge)
PIX = Connection traversing at most a single PCIe bridge
NV# = Connection traversing a bonded set of # NVLinksnvidia-smi topo -m
GPU0 GPU1 GPU2 GPU3 GPU4 GPU5 GPU6 GPU7 CPU Affinity NUMA Affinity GPU NUMA ID
GPU0 X NV12 NV12 NV12 NV12 NV12 NV12 NV12 0-127 0-7 N/A
GPU1 NV12 X NV12 NV12 NV12 NV12 NV12 NV12 0-127 0-7 N/A
GPU2 NV12 NV12 X NV12 NV12 NV12 NV12 NV12 0-127 0-7 N/A
GPU3 NV12 NV12 NV12 X NV12 NV12 NV12 NV12 0-127 0-7 N/A
GPU4 NV12 NV12 NV12 NV12 X NV12 NV12 NV12 0-127 0-7 N/A
GPU5 NV12 NV12 NV12 NV12 NV12 X NV12 NV12 0-127 0-7 N/A
GPU6 NV12 NV12 NV12 NV12 NV12 NV12 X NV12 0-127 0-7 N/A
GPU7 NV12 NV12 NV12 NV12 NV12 NV12 NV12 X 0-127 0-7 N/A
Legend:
X = Self
SYS = Connection traversing PCIe as well as the SMP interconnect between NUMA nodes (e.g., QPI/UPI)
NODE = Connection traversing PCIe as well as the interconnect between PCIe Host Bridges within a NUMA node
PHB = Connection traversing PCIe as well as a PCIe Host Bridge (typically the CPU)
PXB = Connection traversing multiple PCIe bridges (without traversing the PCIe Host Bridge)
PIX = Connection traversing at most a single PCIe bridge
NV# = Connection traversing a bonded set of # NVLinksCheck NVIDIA NVLink Status
Check the NVIDIA NVLink Status.
~$ nvidia-smi topo -m
GPU 1: NVIDIA A100-SXM4-80GB (UUID: GPU-64a2f685-bb12-c4af-105c-0726ece9c8d7)
Link 0: 25 GB/s
Link 1: 25 GB/s
Link 2: 25 GB/s
Link 3: 25 GB/s
Link 4: 25 GB/s
Link 5: 25 GB/s
Link 6: 25 GB/s
Link 7: 25 GB/s
Link 8: 25 GB/s
Link 9: 25 GB/s
Link 10: 25 GB/s
Link 11: 25 GB/s
GPU 2: NVIDIA A100-SXM4-80GB (UUID: GPU-2269851b-71cd-f6c7-50c5-ba1525cf3ce8)
Link 0: 25 GB/s
Link 1: 25 GB/s
Link 2: 25 GB/s
Link 3: 25 GB/s
Link 4: 25 GB/s
Link 5: 25 GB/s
Link 6: 25 GB/s
Link 7: 25 GB/s
Link 8: 25 GB/s
Link 9: 25 GB/s
Link 10: 25 GB/s
Link 11: 25 GB/s
GPU 3: NVIDIA A100-SXM4-80GB (UUID: GPU-4c397bbf-95fc-5c29-918a-a429cbe45a7a)
Link 0: 25 GB/s
Link 1: 25 GB/s
Link 2: 25 GB/s
Link 3: 25 GB/s
Link 4: 25 GB/s
Link 5: 25 GB/s
Link 6: 25 GB/s
Link 7: 25 GB/s
Link 8: 25 GB/s
Link 9: 25 GB/s
Link 10: 25 GB/s
Link 11: 25 GB/s
GPU 4: NVIDIA A100-SXM4-80GB (UUID: GPU-0e350204-9fb6-2cbe-538e-8f7849658eb8)
Link 0: 25 GB/s
Link 1: 25 GB/s
Link 2: 25 GB/s
Link 3: 25 GB/s
Link 4: 25 GB/s
Link 5: 25 GB/s
Link 6: 25 GB/s
Link 7: 25 GB/s
Link 8: 25 GB/s
Link 9: 25 GB/s
Link 10: 25 GB/s
Link 11: 25 GB/s
GPU 5: NVIDIA A100-SXM4-80GB (UUID: GPU-45f0c453-4760-edd4-3af9-25c5ea7473a5)
Link 0: 25 GB/s
Link 1: 25 GB/s
Link 2: 25 GB/s
Link 3: 25 GB/s
Link 4: 25 GB/s
Link 5: 25 GB/s
Link 6: 25 GB/s
Link 7: 25 GB/s
Link 8: 25 GB/s
Link 9: 25 GB/s
Link 10: 25 GB/s
Link 11: 25 GB/s
GPU 6: NVIDIA A100-SXM4-80GB (UUID: GPU-38409794-bb34-430e-3c50-90b42cb2bb72)
Link 0: 25 GB/s
Link 1: 25 GB/s
Link 2: 25 GB/s
Link 3: 25 GB/s
Link 4: 25 GB/s
Link 5: 25 GB/s
Link 6: 25 GB/s
Link 7: 25 GB/s
Link 8: 25 GB/s
Link 9: 25 GB/s
Link 10: 25 GB/s
Link 11: 25 GB/s
GPU 7: NVIDIA A100-SXM4-80GB (UUID: GPU-3fb478aa-801b-eb64-55c2-0ffc3f2ce404)
Link 0: 25 GB/s
Link 1: 25 GB/s
Link 2: 25 GB/s
Link 3: 25 GB/s
Link 4: 25 GB/s
Link 5: 25 GB/s
Link 6: 25 GB/s
Link 7: 25 GB/s
Link 8: 25 GB/s
Link 9: 25 GB/s
Link 10: 25 GB/s
Link 11: 25 GB/sGPU 1: NVIDIA A100-SXM4-80GB (UUID: GPU-64a2f685-bb12-c4af-105c-0726ece9c8d7)
Link 0: 25 GB/s
Link 1: 25 GB/s
Link 2: 25 GB/s
Link 3: 25 GB/s
Link 4: 25 GB/s
Link 5: 25 GB/s
Link 6: 25 GB/s
Link 7: 25 GB/s
Link 8: 25 GB/s
Link 9: 25 GB/s
Link 10: 25 GB/s
Link 11: 25 GB/s
GPU 2: NVIDIA A100-SXM4-80GB (UUID: GPU-2269851b-71cd-f6c7-50c5-ba1525cf3ce8)
Link 0: 25 GB/s
Link 1: 25 GB/s
Link 2: 25 GB/s
Link 3: 25 GB/s
Link 4: 25 GB/s
Link 5: 25 GB/s
Link 6: 25 GB/s
Link 7: 25 GB/s
Link 8: 25 GB/s
Link 9: 25 GB/s
Link 10: 25 GB/s
Link 11: 25 GB/s
GPU 3: NVIDIA A100-SXM4-80GB (UUID: GPU-4c397bbf-95fc-5c29-918a-a429cbe45a7a)
Link 0: 25 GB/s
Link 1: 25 GB/s
Link 2: 25 GB/s
Link 3: 25 GB/s
Link 4: 25 GB/s
Link 5: 25 GB/s
Link 6: 25 GB/s
Link 7: 25 GB/s
Link 8: 25 GB/s
Link 9: 25 GB/s
Link 10: 25 GB/s
Link 11: 25 GB/s
GPU 4: NVIDIA A100-SXM4-80GB (UUID: GPU-0e350204-9fb6-2cbe-538e-8f7849658eb8)
Link 0: 25 GB/s
Link 1: 25 GB/s
Link 2: 25 GB/s
Link 3: 25 GB/s
Link 4: 25 GB/s
Link 5: 25 GB/s
Link 6: 25 GB/s
Link 7: 25 GB/s
Link 8: 25 GB/s
Link 9: 25 GB/s
Link 10: 25 GB/s
Link 11: 25 GB/s
GPU 5: NVIDIA A100-SXM4-80GB (UUID: GPU-45f0c453-4760-edd4-3af9-25c5ea7473a5)
Link 0: 25 GB/s
Link 1: 25 GB/s
Link 2: 25 GB/s
Link 3: 25 GB/s
Link 4: 25 GB/s
Link 5: 25 GB/s
Link 6: 25 GB/s
Link 7: 25 GB/s
Link 8: 25 GB/s
Link 9: 25 GB/s
Link 10: 25 GB/s
Link 11: 25 GB/s
GPU 6: NVIDIA A100-SXM4-80GB (UUID: GPU-38409794-bb34-430e-3c50-90b42cb2bb72)
Link 0: 25 GB/s
Link 1: 25 GB/s
Link 2: 25 GB/s
Link 3: 25 GB/s
Link 4: 25 GB/s
Link 5: 25 GB/s
Link 6: 25 GB/s
Link 7: 25 GB/s
Link 8: 25 GB/s
Link 9: 25 GB/s
Link 10: 25 GB/s
Link 11: 25 GB/s
GPU 7: NVIDIA A100-SXM4-80GB (UUID: GPU-3fb478aa-801b-eb64-55c2-0ffc3f2ce404)
Link 0: 25 GB/s
Link 1: 25 GB/s
Link 2: 25 GB/s
Link 3: 25 GB/s
Link 4: 25 GB/s
Link 5: 25 GB/s
Link 6: 25 GB/s
Link 7: 25 GB/s
Link 8: 25 GB/s
Link 9: 25 GB/s
Link 10: 25 GB/s
Link 11: 25 GB/s3.2.5 - Install ServiceWatch Agent
Users can install the ServiceWatch Agent on a GPU server to collect custom metrics and logs.
ServiceWatch Agent
There are two main types of agents that need to be installed on a GPU server to collect custom metrics and logs for ServiceWatch. It is a Prometheus Exporter and Open Telemetry Collector.
| Category | Detailed description | |
|---|---|---|
| Prometheus Exporter | Provide metrics of a specific application or service in a format that Prometheus can scrape
| |
| Open Telemetry Collector | Acts as a centralized collector that gathers telemetry data such as metrics and logs from distributed systems, processes (filtering, sampling, etc.) it, and exports it to multiple backends (e.g., Prometheus, Jaeger, Elasticsearch, etc.)
|
If you have configured a Kubernetes Engine on a GPU server, please check GPU metrics using the metrics provided by the Kubernetes Engine.
- If you install the DCGM Exporter on a GPU server configured with Kubernetes Engine, it may not operate correctly.
Pre-configuration for Using ServiceWatch Agent
To utilize the ServiceWatch Agent, please refer to Prerequisite Settings for ServiceWatch Agent and prepare the prerequisite settings.
Install Prometheus Exporter for GPU metrics (for Ubuntu)
Install the Prometheus Exporter for collecting metrics from the GPU server according to the steps below.
Verify NVDIA Driver Installation
- Check the installed NVDIA Driver.Color mode
nvidia-smi --query-gpu driver_version --format csvnvidia-smi --query-gpu driver_version --format csvCode block. NVDIA Driver version check command Color modedriver_version 535.183.06 ... 535.183.06driver_version 535.183.06 ... 535.183.06Code block. NVDIA Driver version check example
NVSwitch Configuration and Query (NSCQ) Library Installation
Install cuda-keyring.
Color modewget https://developer.download.nvidia.com/compute/cuda/repos/<distro>/<arch>/cuda-keyring_1.1-1_all.debwget https://developer.download.nvidia.com/compute/cuda/repos/<distro>/<arch>/cuda-keyring_1.1-1_all.debcode block. NSCQ library download command Color modesudo dpkg -i cuda-keyring_1.1-1_all.deb apt updatesudo dpkg -i cuda-keyring_1.1-1_all.deb apt updateCode block. NSCQ library installation command Color modenvidia-smi --query-gpu driver_version --format csvnvidia-smi --query-gpu driver_version --format csvCode block. NVDIA Driver version check command Color modedriver_version 535.183.06 ... 535.183.06driver_version 535.183.06 ... 535.183.06code block. Example of checking NVDIA driver version Install libnvidia-nscq.
Color modeapt-cache policy libnvidia-nscq-535apt-cache policy libnvidia-nscq-535Code block. NSCQ library apt-cache command Color modelibnvidia-nscq-535: Installed: (none) Candidate: 535.247.01-1 Version table: 535.247.01-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages ... 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 535.216.01-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 535.183.06-1 600 # Install the version that matches the driver 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 535.183.01-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 535.54.03-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packageslibnvidia-nscq-535: Installed: (none) Candidate: 535.247.01-1 Version table: 535.247.01-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages ... 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 535.216.01-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 535.183.06-1 600 # Install the version that matches the driver 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 535.183.01-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 535.54.03-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 PackagesCode block. NSCQ library apt-cache command result Color modeapt install libnvidia-nscq-535=535.183.06-1apt install libnvidia-nscq-535=535.183.06-1Code block. NSCQ library installation command
It must be installed with the same version as the NVDIA Driver.
- Example) driver version: 535.183.06, libnvdia-nscq version: 535.183.06-1
NVSwitch Device Monitoring API(NVSDM) Library Installation
- Install the NVSDM library.Color mode
apt-cache policy libnvsdmapt-cache policy libnvsdmCode block. NVSDM library apt-cache command Color modelibnvsdm: Installed: (none) Candidate: 580.105.08-1 Version table: 580.105.08-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 580.95.05-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 580.82.07-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 580.65.06-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packageslibnvsdm: Installed: (none) Candidate: 580.105.08-1 Version table: 580.105.08-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 580.95.05-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 580.82.07-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 580.65.06-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 PackagesCode block. NVSDM library apt-cache command result Color modeapt install libnvsdm=580.105.08-1apt install libnvsdm=580.105.08-1Code block. Install NVSDM library
Install NVIDIA DCGM (for Ubuntu)
Install the DCGM Exporter according to the steps below.
- DCGM(datacenter-gpu-manager) Installation
- datacenter-gpu-manager-exporter installation
- DCGM Service Activation and Start
DCGM(datacenter-gpu-manager) Installation
refers to a specific version of NVIDIA’s Data Center GPU Manager (DCGM) tool, which is a package for managing and monitoring NVIDIA data center GPUs. In particular, cuda12 indicates that this management tool is installed for the CUDA 12 version, and datacenter-gpu-manager-4 refers to the 4.x version of DCGM. This tool provides a variety of features, including GPU status monitoring, diagnostics, alert system, and power/clock management.
- Check the CUDA version.Color mode
nvidia-smi | grep CUDAnvidia-smi | grep CUDACode block. Check CUDA version Color mode| NVIDIA-SMI 535.183.06 Driver Version: 535.183.06 CUDA Version: 12.2 || NVIDIA-SMI 535.183.06 Driver Version: 535.183.06 CUDA Version: 12.2 |Code block. Example of CUDA version check result Color modeCUDA_VERSION=12CUDA_VERSION=12Code block. CUDA version setting command - Install datacenter-gpu-manager-cuda.Color mode
apt install datacenter-gpu-manager-4-cuda${CUDA_VERSION}apt install datacenter-gpu-manager-4-cuda${CUDA_VERSION}Code block. datacenter-gpu-manager-cuda installation command
Install datacenter-gpu-manager-exporter
It is a tool that, based on NVIDIA Data Center GPU Manager (DCGM), collects various GPU metrics such as GPU usage, memory usage, temperature, and power consumption, and exposes them for use in monitoring systems like Prometheus.
- Install datacenter-gpu-manager-exporter.Color mode
apt install datacenter-gpu-manager-exporterapt install datacenter-gpu-manager-exporterCode block. datacenter-gpu-manager-exporter installation command - Check the DCGM Exporter configuration file.Color mode
cat /usr/lib/systemd/system/nvidia-dcgm-exporter.service | grep ExecStartcat /usr/lib/systemd/system/nvidia-dcgm-exporter.service | grep ExecStartCode block. Command to check the datacenter-gpu-manager-exporter configuration file Color modeExecStart=/usr/bin/dcgm-exporter -f /etc/dcgm-exporter/default-counters.csvExecStart=/usr/bin/dcgm-exporter -f /etc/dcgm-exporter/default-counters.csvcode block. Example of verification result for the datacenter-gpu-manager-exporter configuration file - When installing the DCGM Exporter, review the provided configuration and remove
#for the required metrics, and add#for the unnecessary metrics.Color modevi /etc/dcgm-exporter/default-counters.csv ## Example ## ... DCGM_FI_PROF_PIPE_TENSOR_ACTIVE, gauge, Ratio of cycles the tensor (HMMA) pipe is active. DCGM_FI_PROF_DRAM_ACTIVE, gauge, Ratio of cycles the device memory interface is active sending or receiving data. # DCGM_FI_PROF_PIPE_FP64_ACTIVE, gauge, Ratio of cycles the fp64 pipes are active. # DCGM_FI_PROF_PIPE_FP32_ACTIVE, gauge, Ratio of cycles the fp32 pipes are active. ...vi /etc/dcgm-exporter/default-counters.csv ## Example ## ... DCGM_FI_PROF_PIPE_TENSOR_ACTIVE, gauge, Ratio of cycles the tensor (HMMA) pipe is active. DCGM_FI_PROF_DRAM_ACTIVE, gauge, Ratio of cycles the device memory interface is active sending or receiving data. # DCGM_FI_PROF_PIPE_FP64_ACTIVE, gauge, Ratio of cycles the fp64 pipes are active. # DCGM_FI_PROF_PIPE_FP32_ACTIVE, gauge, Ratio of cycles the fp32 pipes are active. ...Code block. Example of datacenter-gpu-manager-exporter metric configuration
Enable and start DCGM service
Enable and start the nvdia-dcgm service.
Color modesystemctl enable --now nvidia-dcgmsystemctl enable --now nvidia-dcgmCode block. nvdia-dcgm service activation and start command Enable and start the nvdia-dcgm-exporter service.
Color modesystemctl enable --now nvidia-dcgm-exportersystemctl enable --now nvidia-dcgm-exporterCode block. nvdia-dcgm-exporter service activation and start command
For more details, refer to ServiceWatch > ServiceWatch Agent 사용하기.
Install Prometheus Exporter for GPU metrics (for RHEL)
Install the ServiceWatch Agent to collect metrics from the GPU server according to the steps below.
NVDIA Driver installation verification (for RHEL)
- Check the installed NVDIA Driver.Color mode
nvidia-smi --query-gpu driver_version --format csvnvidia-smi --query-gpu driver_version --format csvCode block. NVDIA Driver version check command Color modedriver_version 535.183.06 ... 535.183.06driver_version 535.183.06 ... 535.183.06Code block. Example of checking NVDIA Driver version
NVSwitch Configuration and Query (NSCQ) Library installation (for RHEL)
The NVSwitch Configuration and Query (NSCQ) Library is required for Hopper or earlier generation GPUs.
- For RHEL, verify that libnvdia-nscq is installed and install it if necessary.
Checking the libnvdia-nscq package.
Color moderpm -qa | grep libnvidia-nscq libnvidia-nscq-535-535.183.06-1.x86_64rpm -qa | grep libnvidia-nscq libnvidia-nscq-535-535.183.06-1.x86_64Code block. Check NSCQ library package Add the CUDA Repository to DNF.
Color modednf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel8/x86_64/cuda-rhel8.repodnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel8/x86_64/cuda-rhel8.repoCode block. Add DNF Repository NVDIA Driver status reset
Color modednf module reset nvidia-driverdnf module reset nvidia-driverCode block. Initialize the state of the NVIDIA Driver DNF module Color modeUpdating Subscription Management repositories. Last metadata expiration check: 0:03:15 ago on Wed 19 Nov 2025 01:23:48 AM EST. Dependencies resolved. ============================================= Package Architecture Version Repository Size ============================================= Disabling module profiles: nvidia-driver/default nvidia-driver/fm Resetting modules: nvidia-driver Transaction Summary ============================================= Is this ok [y/N]: yUpdating Subscription Management repositories. Last metadata expiration check: 0:03:15 ago on Wed 19 Nov 2025 01:23:48 AM EST. Dependencies resolved. ============================================= Package Architecture Version Repository Size ============================================= Disabling module profiles: nvidia-driver/default nvidia-driver/fm Resetting modules: nvidia-driver Transaction Summary ============================================= Is this ok [y/N]: yCode block. Example of the status initialization result of the NVIDIA Driver DNF module. Enable the NVDIA Driver module.
Color modednf module enable nvidia-driver:535-opendnf module enable nvidia-driver:535-opencode block. NVDIA Driver module activation Color modeUpdating Subscription Management repositories. Last metadata expiration check: 0:04:22 ago on Wed 19 Nov 2025 01:23:48 AM EST. Dependencies resolved. ============================================= Package Architecture Version Repository Size ============================================= Enabling module streams: nvidia-driver 535-open Transaction Summary ============================================= Is this ok [y/N]: yUpdating Subscription Management repositories. Last metadata expiration check: 0:04:22 ago on Wed 19 Nov 2025 01:23:48 AM EST. Dependencies resolved. ============================================= Package Architecture Version Repository Size ============================================= Enabling module streams: nvidia-driver 535-open Transaction Summary ============================================= Is this ok [y/N]: yCode block. Example of NVDIA Driver module activation result Check the libnvdia-nscq module list.
Color modednf list libnvidia-nscq-535 --showduplicatesdnf list libnvidia-nscq-535 --showduplicatesCode block. Check libnvdia-nscq module list Install libnvdia-nscq.
Color modednf install libnvidia-nscq-535-535.183.06-1dnf install libnvidia-nscq-535-535.183.06-1code block. libnvdia-nscq installation command
NVSwitch Device Monitoring API(NVSDM) Library Installation (for RHEL)
Check the NVSDM library module list.
Color modednf list libnvsdm --showduplicatesdnf list libnvsdm --showduplicatesCode block. Check NVSDM library module list Color modelibnvsdm: Installed: (none) Candidate: 580.105.08-1 Version table: 580.105.08-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 580.95.05-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 580.82.07-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 580.65.06-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packageslibnvsdm: Installed: (none) Candidate: 580.105.08-1 Version table: 580.105.08-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 580.95.05-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 580.82.07-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 Packages 580.65.06-1 600 600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64 PackagesCode block. Example of NVSDM library module list verification results. Install libnvsdm.
Color modednf install libnvsdm-580.105.08-1dnf install libnvsdm-580.105.08-1Code block. Install NVSDM library Color modeUpdating Subscription Management repositories. Last metadata expiration check: 0:08:18 ago on Wed 19 Nov 2025 01:05:28 AM EST. Dependencies resolved. ========================================================================= Package Architecture Version Repository Size ========================================================================= Installing: libnvsdm x86_64 580.105.08-1 cuda-rhel8-x86_64 675 k Installing dependencies: infiniband-diags x86_64 48.0-1.el8 rhel-8-for-x86_64-baseos-rpms 323 k libibumad x86_64 48.0-1.el8 rhel-8-for-x86_64-baseos-rpms 34 k Transaction Summary ========================================================================= Install 3 Packages Total download size: 1.0 M Installed size: 3.2 M Is this ok [y/N]: yUpdating Subscription Management repositories. Last metadata expiration check: 0:08:18 ago on Wed 19 Nov 2025 01:05:28 AM EST. Dependencies resolved. ========================================================================= Package Architecture Version Repository Size ========================================================================= Installing: libnvsdm x86_64 580.105.08-1 cuda-rhel8-x86_64 675 k Installing dependencies: infiniband-diags x86_64 48.0-1.el8 rhel-8-for-x86_64-baseos-rpms 323 k libibumad x86_64 48.0-1.el8 rhel-8-for-x86_64-baseos-rpms 34 k Transaction Summary ========================================================================= Install 3 Packages Total download size: 1.0 M Installed size: 3.2 M Is this ok [y/N]: yCode block. Example of the NVSDM library installation command result
Install NVIDIA DCGM (for RHEL)
Install Node Exporter according to the steps below.
- DCGM(datacenter-gpu-manager) Installation
- datacenter-gpu-manager-exporter installation
- DCGM Service Activation and Start
DCGM(datacenter-gpu-manager) Installation (for RHEL)
refers to a specific version of NVIDIA’s Data Center GPU Manager (DCGM) tool, which is a package for managing and monitoring NVIDIA data center GPUs. In particular, cuda12 indicates that this management tool is installed for the CUDA 12 version, and datacenter-gpu-manager-4 refers to the 4.x version of DCGM. This tool provides various features, including GPU status monitoring, diagnostics, alert system, and power/clock management.
- Add the CUDA repository to DNF.Color mode
dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel8/x86_64/cuda-rhel8.repodnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel8/x86_64/cuda-rhel8.repocode block. Add DNF Repository - Check the CUDA version.Color mode
nvidia-smi | grep CUDAnvidia-smi | grep CUDACode block. Check CUDA version Color mode| NVIDIA-SMI 535.183.06 Driver Version: 535.183.06 CUDA Version: 12.2 || NVIDIA-SMI 535.183.06 Driver Version: 535.183.06 CUDA Version: 12.2 |Code block. Example of CUDA version check result Color modeCUDA_VERSION=12CUDA_VERSION=12Code block. CUDA version setting command - Check the list of datacenter-gpu-manager-cuda modules.Color mode
dnf list datacenter-gpu-manager-4-cuda${CUDA_VERSION} --showduplicatesdnf list datacenter-gpu-manager-4-cuda${CUDA_VERSION} --showduplicatesCode block. Check the datacenter-gpu-manager-cuda module list Color modeUpdating Subscription Management repositories. Unable to read consumer identity This system is not registered with an entitlement server. You can use subscription-manager to register. Last metadata expiration check: 0:00:34 ago on Wed 19 Nov 2025 12:26:56 AM EST. Available Packages datacenter-gpu-manager-4-cuda12.x86_64 1:4.0.0-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.1.0-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.1.1-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.2.0-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.2.2-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.2.3-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.2.3-2 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.3.0-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.3.1-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.4.0-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.4.1-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.4.2-1 cuda-rhel8-x86_64Updating Subscription Management repositories. Unable to read consumer identity This system is not registered with an entitlement server. You can use subscription-manager to register. Last metadata expiration check: 0:00:34 ago on Wed 19 Nov 2025 12:26:56 AM EST. Available Packages datacenter-gpu-manager-4-cuda12.x86_64 1:4.0.0-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.1.0-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.1.1-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.2.0-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.2.2-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.2.3-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.2.3-2 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.3.0-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.3.1-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.4.0-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.4.1-1 cuda-rhel8-x86_64 datacenter-gpu-manager-4-cuda12.x86_64 1:4.4.2-1 cuda-rhel8-x86_64Code block. Example of checking the list of datacenter-gpu-manager-cuda modules. - Install datacenter-gpu-manager-cuda.Color mode
dnf install datacenter-gpu-manager-4-cuda${CUDA_VERSION}dnf install datacenter-gpu-manager-4-cuda${CUDA_VERSION}Code block. Install datacenter-gpu-manager-cuda Color modeUpdating Subscription Management repositories. Unable to read consumer identity This system is not registered with an entitlement server. You can use subscription-manager to register. Last metadata expiration check: 0:07:12 ago on Wed 19 Nov 2025 12:26:56 AM EST. Dependencies resolved. =================================================================================================== Package Architecture Version Repository Size =================================================================================================== Installing: datacenter-gpu-manager-4-cuda12 x86_64 1:4.4.2-1 cuda-rhel8-x86_64 554 M Installing dependencies: datacenter-gpu-manager-4-core x86_64 1:4.4.2-1 cuda-rhel8-x86_64 9.9 M Installing weak dependencies: datacenter-gpu-manager-4-proprietary x86_64 1:4.4.2-1 cuda-rhel8-x86_64 5.3 M datacenter-gpu-manager-4-proprietary-cuda12 x86_64 1:4.4.2-1 cuda-rhel8-x86_64 289 M Transaction Summary ==================================================================================================== Install 4 Packages ... Is this ok [y/N]: yUpdating Subscription Management repositories. Unable to read consumer identity This system is not registered with an entitlement server. You can use subscription-manager to register. Last metadata expiration check: 0:07:12 ago on Wed 19 Nov 2025 12:26:56 AM EST. Dependencies resolved. =================================================================================================== Package Architecture Version Repository Size =================================================================================================== Installing: datacenter-gpu-manager-4-cuda12 x86_64 1:4.4.2-1 cuda-rhel8-x86_64 554 M Installing dependencies: datacenter-gpu-manager-4-core x86_64 1:4.4.2-1 cuda-rhel8-x86_64 9.9 M Installing weak dependencies: datacenter-gpu-manager-4-proprietary x86_64 1:4.4.2-1 cuda-rhel8-x86_64 5.3 M datacenter-gpu-manager-4-proprietary-cuda12 x86_64 1:4.4.2-1 cuda-rhel8-x86_64 289 M Transaction Summary ==================================================================================================== Install 4 Packages ... Is this ok [y/N]: yCode block. Example of datacenter-gpu-manager-cuda installation result
datacenter-gpu-manager-exporter installation (for RHEL)
It is a tool that, based on NVIDIA Data Center GPU Manager (DCGM), collects various GPU metrics such as GPU usage, memory usage, temperature, and power consumption, and exposes them for use in monitoring systems like Prometheus.
Add the CUDA Repository to DNF. 1. (If you have already performed this command, proceed to the next step.)
Color modednf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel8/x86_64/cuda-rhel8.repodnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel8/x86_64/cuda-rhel8.repoCode block. Add DNF Repository Check the CUDA version. 2. (If you have already performed this command, proceed to the next step.)
Color modenvidia-smi | grep CUDAnvidia-smi | grep CUDACode block. Check CUDA version Color mode| NVIDIA-SMI 535.183.06 Driver Version: 535.183.06 CUDA Version: 12.2 || NVIDIA-SMI 535.183.06 Driver Version: 535.183.06 CUDA Version: 12.2 |Code block. Example of CUDA version check result Color modeCUDA_VERSION=12CUDA_VERSION=12Code block. CUDA version setting command Check the datacenter-gpu-manager-exporter module list.
Color modednf list datacenter-gpu-manager-exporter --showduplicatesdnf list datacenter-gpu-manager-exporter --showduplicatesCode block. Check the list of datacenter-gpu-manager-exporter modules Color modeUpdating Subscription Management repositories. Unable to read consumer identity This system is not registered with an entitlement server. You can use subscription-manager to register. Last metadata expiration check: 0:02:11 ago on Wed 19 Nov 2025 12:26:56 AM EST. Available Packages datacenter-gpu-manager-exporter.x86_64 4.0.1-1 cuda-rhel8-x86_64 datacenter-gpu-manager-exporter.x86_64 4.1.0-1 cuda-rhel8-x86_64 datacenter-gpu-manager-exporter.x86_64 4.1.1-1 cuda-rhel8-x86_64 datacenter-gpu-manager-exporter.x86_64 4.1.3-1 cuda-rhel8-x86_64 datacenter-gpu-manager-exporter.x86_64 4.5.0-1 cuda-rhel8-x86_64 datacenter-gpu-manager-exporter.x86_64 4.5.1-1 cuda-rhel8-x86_64 datacenter-gpu-manager-exporter.x86_64 4.5.2-1 cuda-rhel8-x86_64 datacenter-gpu-manager-exporter.x86_64 4.6.0-1 cuda-rhel8-x86_64Updating Subscription Management repositories. Unable to read consumer identity This system is not registered with an entitlement server. You can use subscription-manager to register. Last metadata expiration check: 0:02:11 ago on Wed 19 Nov 2025 12:26:56 AM EST. Available Packages datacenter-gpu-manager-exporter.x86_64 4.0.1-1 cuda-rhel8-x86_64 datacenter-gpu-manager-exporter.x86_64 4.1.0-1 cuda-rhel8-x86_64 datacenter-gpu-manager-exporter.x86_64 4.1.1-1 cuda-rhel8-x86_64 datacenter-gpu-manager-exporter.x86_64 4.1.3-1 cuda-rhel8-x86_64 datacenter-gpu-manager-exporter.x86_64 4.5.0-1 cuda-rhel8-x86_64 datacenter-gpu-manager-exporter.x86_64 4.5.1-1 cuda-rhel8-x86_64 datacenter-gpu-manager-exporter.x86_64 4.5.2-1 cuda-rhel8-x86_64 datacenter-gpu-manager-exporter.x86_64 4.6.0-1 cuda-rhel8-x86_64code block. Example of checking the module list of datacenter-gpu-manager-exporter Install datacenter-gpu-manager-cuda. dcgm-exporter 4.5.X requires glibc 2.34 or newer, but because RHEL 9 provides glibc 2.34, we install it by specifying version 4.1.3-1.
Color modednf install datacenter-gpu-manager-exporter-4.1.3-1dnf install datacenter-gpu-manager-exporter-4.1.3-1Code block. Install datacenter-gpu-manager-cuda Color modeUpdating Subscription Management repositories. Unable to read consumer identity This system is not registered with an entitlement server. You can use subscription-manager to register. Last metadata expiration check: 0:07:12 ago on Wed 19 Nov 2025 12:26:56 AM EST. Dependencies resolved. ==================================================================================================== Package Architecture Version Repository Size ==================================================================================================== Installing: datacenter-gpu-manager-exporter x86_64 4.1.3-1 cuda-rhel8-x86_64 26 M Is this ok [y/N]: yUpdating Subscription Management repositories. Unable to read consumer identity This system is not registered with an entitlement server. You can use subscription-manager to register. Last metadata expiration check: 0:07:12 ago on Wed 19 Nov 2025 12:26:56 AM EST. Dependencies resolved. ==================================================================================================== Package Architecture Version Repository Size ==================================================================================================== Installing: datacenter-gpu-manager-exporter x86_64 4.1.3-1 cuda-rhel8-x86_64 26 M Is this ok [y/N]: yCode block. Example of datacenter-gpu-manager-cuda installation result Color modecat /usr/lib/systemd/system/nvidia-dcgm-exporter.service | grep ExecStartcat /usr/lib/systemd/system/nvidia-dcgm-exporter.service | grep ExecStartCode block. datacenter-gpu-manager-exporter configuration file Color modeExecStart=/usr/bin/dcgm-exporter -f /etc/dcgm-exporter/default-counters.csvExecStart=/usr/bin/dcgm-exporter -f /etc/dcgm-exporter/default-counters.csvCode block. Example of the result of checking the datacenter-gpu-manager-exporter configuration file. When installing the DCGM Exporter, review the provided configuration and remove
#for required metrics, and add#for unnecessary metrics.Color modevi /etc/dcgm-exporter/default-counters.csv ## Example ## ... DCGM_FI_PROF_PIPE_TENSOR_ACTIVE, gauge, Ratio of cycles the tensor (HMMA) pipe is active. DCGM_FI_PROF_DRAM_ACTIVE, gauge, Ratio of cycles the device memory interface is active sending or receiving data. # DCGM_FI_PROF_PIPE_FP64_ACTIVE, gauge, Ratio of cycles the fp64 pipes are active. # DCGM_FI_PROF_PIPE_FP32_ACTIVE, gauge, Ratio of cycles the fp32 pipes are active. ...vi /etc/dcgm-exporter/default-counters.csv ## Example ## ... DCGM_FI_PROF_PIPE_TENSOR_ACTIVE, gauge, Ratio of cycles the tensor (HMMA) pipe is active. DCGM_FI_PROF_DRAM_ACTIVE, gauge, Ratio of cycles the device memory interface is active sending or receiving data. # DCGM_FI_PROF_PIPE_FP64_ACTIVE, gauge, Ratio of cycles the fp64 pipes are active. # DCGM_FI_PROF_PIPE_FP32_ACTIVE, gauge, Ratio of cycles the fp32 pipes are active. ...Code block. Example of datacenter-gpu-manager-exporter metric configuration
Enable and start DCGM service (for RHEL)
Activating and starting the nvdia-dcgm service.
Color modesystemctl enable --now nvidia-dcgmsystemctl enable --now nvidia-dcgmCode block. nvdia-dcgm service activation and start command Enable and start the nvdia-dcgm-exporter service.
Color modesystemctl enable --now nvidia-dcgm-exportersystemctl enable --now nvidia-dcgm-exporterCode block. nvdia-dcgm-exporter service activation and start command
For more details, refer to ServiceWatch > Using ServiceWatch Agent.
DCGM Exporter metric
DCGM Exporter Key Metrics
Among the metrics provided by the DCGM Exporter, the main GPU metrics are as follows.
| Category | DCGM Field | Prometheus Metric Type | Summary | |
|---|---|---|---|---|
| Clocks | DCGM_FI_DEV_SM_CLOCK | gauge | SM clock frequency (in MHz) | |
| Clocks | DCGM_FI_DEV_MEM_CLOCK | gauge | Memory clock frequency (in MHz) | |
| Temperature | DCGM_FI_DEV_GPU_TEMP | gauge | GPU temperature (in C) | |
| Power | DCGM_FI_DEV_POWER_USAGE | gauge | Power draw (in W) | |
| Utilization | DCGM_FI_DEV_GPU_UTIL | gauge | GPU utilization (in %) | |
| Utilization | DCGM_FI_DEV_MEM_COPY_UTIL | gauge | Memory utilization (in %) | |
| Memory Usage | DCGM_FI_DEV_FB_FREE | gauge | Frame buffer memory free (in MiB) | |
| Memory Usage | DCGM_FI_DEV_FB_USED | gauge | Frame buffer memory used (in MiB) | |
| Nvlink | DCGM_FI_DEV_NVLINK_BANDWIDTH_TOTAL(8 GPU only) | counter | Total number of NVLink bandwidth counters for all lanes |
DCGM Exporter metric collection configuration
Please refer to the default metrics of DCGM Exporter at DCGM Exporter > 기본 지표.
- Besides the default configuration, remove
#from default-counters.csv for any additional metrics you want to set. - For default metrics you do not wish to collect, add
#or delete the corresponding entry.
# Format
# If line starts with a '#' it is considered a comment
# DCGM FIELD, Prometheus metric type, help message
# Clocks
DCGM_FI_DEV_SM_CLOCK, gauge, SM clock frequency (in MHz).
DCGM_FI_DEV_MEM_CLOCK, gauge, Memory clock frequency (in MHz).
# Temperature
DCGM_FI_DEV_MEMORY_TEMP, gauge, Memory temperature (in C).
DCGM_FI_DEV_GPU_TEMP, gauge, GPU temperature (in C).
# Power
DCGM_FI_DEV_POWER_USAGE, gauge, Power draw (in W).
DCGM_FI_DEV_TOTAL_ENERGY_CONSUMPTION, counter, Total energy consumption since boot (in mJ).
# PCIE
# DCGM_FI_PROF_PCIE_TX_BYTES, counter, Total number of bytes transmitted through PCIe TX via NVML.
# DCGM_FI_PROF_PCIE_RX_BYTES, counter, Total number of bytes received through PCIe RX via NVML.
...# Format
# If line starts with a '#' it is considered a comment
# DCGM FIELD, Prometheus metric type, help message
# Clocks
DCGM_FI_DEV_SM_CLOCK, gauge, SM clock frequency (in MHz).
DCGM_FI_DEV_MEM_CLOCK, gauge, Memory clock frequency (in MHz).
# Temperature
DCGM_FI_DEV_MEMORY_TEMP, gauge, Memory temperature (in C).
DCGM_FI_DEV_GPU_TEMP, gauge, GPU temperature (in C).
# Power
DCGM_FI_DEV_POWER_USAGE, gauge, Power draw (in W).
DCGM_FI_DEV_TOTAL_ENERGY_CONSUMPTION, counter, Total energy consumption since boot (in mJ).
# PCIE
# DCGM_FI_PROF_PCIE_TX_BYTES, counter, Total number of bytes transmitted through PCIe TX via NVML.
# DCGM_FI_PROF_PCIE_RX_BYTES, counter, Total number of bytes received through PCIe RX via NVML.
...3.3 - API Reference
3.4 - CLI Reference
3.5 - Release Note
GPU Server
- Add Kubernetes image
- When creating a GPU Server, you can select the Kubernetes image (Ubuntu).
- An SSD volume with configurable IOPS and throughput has been added.
- When creating Block Storage, you can select the SSD_Provisioned disk type.
- You can set the maximum IOPS and Throughput values.
- Provision of ServiceWatch service integration
- You can monitor data through the ServiceWatch service.
- When creating a GPU server, you can select a RHEL image.
- Keypair management feature has been added.
- You can generate a keypair for use, or retrieve a public key and apply it.
- Add GPU Server feature
- The IP, Public NAT IP, and Private NAT IP configuration features have been added.
- An LLM Endpoint is provided for LLM usage.
- The method for sharing images between accounts has changed.
- You can create a new Image for sharing and share it.
- Add GPU Server usage guide
- Added the guides Using Multi-instance GPU on GPU Server and Using NVSwitch on GPU Server.
- GPU Server RHEL OS and GPU driver versions have been added.
- Add GPU Server feature
- NAT configuration feature has been added to the GPU Server.
- Samsung Cloud Platform Common Feature Changes
- Account, IAM, Service Home, tags, and other common CX changes have been applied.
- We have officially launched the GPU Server service.
- We have launched a virtualization computing service that lets you allocate and use infrastructure resources such as CPU, GPU, and memory provided by the server as needed, without having to purchase them individually.
4 - Bare Metal Server
4.1 - Overview
Service Overview
Bare Metal Server does not use virtualization technology and is a high-performance cloud computing service that can allocate and use physically separated computing resources such as CPU and memory exclusively. It is not affected by other cloud users, allowing you to reliably operate performance-sensitive services.
Features
Easy and convenient computing environment setup: Through the web-based Console, you can easily handle everything from Bare Metal Server provisioning to resource management and cost management. You can receive a server with standard specs (CPU, Memory, Disk) allocated exclusively and use it immediately.
Providing High-Performance Computing Environment: We provide servers suitable for workloads that require large capacity and high performance—such as real-time (Real-Time) systems, HPC (High Performance Computing), and servers that demand excessive I/O usage—in a physically isolated environment.
Efficient Service Delivery: Ensure performance and stability through optimal server selection and in-house testing. Customers can choose the optimal resources that fit their service environment from the various specifications of Bare Metal Servers offered by Samsung Cloud Platform.
Service Architecture Diagram
Provided features
Bare Metal Server provides the following features.
- Auto Provisioning (Auto Provisioning) and Management: Through the web-based Console, you can easily provision Bare Metal Servers, manage resources, and control costs.
- Providing various types of server types and OS images: Provides CPU, Memory, and Disk resources of standard server types, and offers a variety of standard OS images.
- Storage Connection: Provides additional attached storage beyond the OS disk. You can connect and use Block Storage, File Storage, and Object Storage.
- Network Connection: You can connect the standard subnet/IP settings of a Bare Metal Server and a Public NAT IP. Provides a local subnet connection for inter-server communication. This can be modified on the detail page.
- Monitoring: You can view monitoring information such as CPU, Memory, and Disk, which are computing resources, through Cloud Monitoring. To use the Cloud Monitoring service for Bare Metal Server, you must install the Agent. Please be sure to install the Agent for stable Bare Metal Server service usage. For more details, see Bare Metal Server Monitoring Metrics.
- Backup and Recovery: You can back up and restore the Bare Metal Server’s filesystem using the Backup service.
- Efficient Cost Management: You can easily create or terminate servers as needed, and because billing is based on actual usage time, you can use it cost‑effectively in various unpredictable situations.
- Local disk partition creation You can create and use up to 10 local disk partitions.
- Terraform Provisioning: Provides an IaC environment using Terraform.
Components
Bare Metal Server provides various standard OS images and standard server types. Users can select and use them according to the scale of the service they want to configure.
OS Image provided version
The OS images supported by Bare Metal Server are as follows
| OS Image version | EoS Date |
|---|---|
| Oracle Linux 9.6 | 2032-06-30 |
| RHEL 8.10 | 2029-05-31 |
| RHEL 9.4 | 2026-04-30 |
| RHEL 9.6 | 2027-05-31 |
| Rocky Linux 8.10 | 2029-05-31 |
| Rocky Linux 9.6 | 2025-11-30 |
| Ubuntu 22.04 | 2027-06-30 |
| Ubuntu 24.04 | 2029-06-30 |
| Windows 2019 | 2029-01-09 |
| Windows 2022 | 2031-10-14 |
- Oracle Linux, RHEL, and Rocky Linux only provide even-numbered minor versions. Check the OS Image’s EOS (End of Support)/EOL (End of Life) Date.
- To ensure stable OS operation, apply new or additional individual packages yourself.
Server type
The server types supported by Bare Metal Server are as follows. For detailed information about the server types supported by Bare Metal Server, see Bare Metal Server Server Types.
s3v16m64_metal
| Category | example | Detailed description |
|---|---|---|
| Server generation | s3 | Provided server categories and generations
|
| CPU vCore | v16 | Number of vCores
|
| Memory | m64 | Memory capacity
|
Preceding Service
This is a list of services that must be pre-configured before creating the service. Please refer to the guide provided for each service and prepare in advance.
| Service Category | service | Detailed description |
|---|---|---|
| Networking | VPC | A service that provides an isolated virtual network in a cloud environment |
4.1.1 - Server Type
Bare Metal Server server type
Bare Metal Server provides server types that suit the intended use. Server types are composed of various combinations such as CPU and Memory. The server used for a Bare Metal Server is determined by the server type selected when creating the Bare Metal Server. Please select a server type based on the specifications of the application you intend to run on the Bare Metal Server.
The server types supported by Bare Metal Server are as follows.
s3v16m64_metal
Category | example | Detailed description |
|---|---|---|
| Server generation | s3 | Provided server categories and generations
|
| CPU vCore | v16 | Number of vCores
|
| Memory | m64 | Memory Capacity
|
s4/h4 server type
The Bare Metal Server s4 server type is offered with standard specifications (vCPU, Memory) and, because it receives physically isolated resources, it is suitable for high-performance applications. Additionally, the Bare Metal Server h4 server type is offered with high-capacity specifications and is suitable for high-performance applications that require large-scale data processing.
- Supports a total of five vCPU options (16, 32, 64, 96, 128 vCore)
- Intel 6th Generation (Granite Rapids) Processor
- Supports up to 64 physical cores, 128 vCPUs, and 2,048 GB of memory
- Provides two internal disks up to 1.92 TB each (for OS use)
| Server type | Physical CPU | vCPU | Memory | CPU Type | Internal Disk(OS) | ||
|---|---|---|---|---|---|---|---|
| s4v16m64_metal | 8 Core | 16 vCore | 64 GB | Intel Xeon 6507P up to 4.3 GHz | 480GB * 2EA | ||
| s4v16m128_metal | 8 Core | 16 vCore | 128 GB | Intel Xeon 6507P up to 4.3 GHz | 480GB * 2EA | ||
| s4v16m256_metal | 8 Core | 16 vCore | 256 GB | Intel Xeon 6507P with a maximum frequency of 4.3 GHz | 480GB * 2EA | ||
| h4v32m128_metal | 16 Core | 32 vCore | 128 GB | Intel Xeon 6517P up to 4.0 GHz | 960GB * 2EA | ||
| h4v32m256_metal | 16 Core | 32 vCore | 256 GB | Intel Xeon 6517P up to 4.0 GHz | 960GB * 2EA | ||
| h4v32m512_metal | 16 Core | 32 vCore | 512 GB | Intel Xeon 6517P up to 4.0 GHz | 960GB * 2EA | ||
| h4v64m256_metal | 32 Core | 64 vCore | 256 GB | Intel Xeon 6737P up to 4.0 GHz | 1.92TB * 2EA | ||
| h4v64m512_metal | 32 Core | 64 vCore | 512 GB | Intel Xeon 6737P up to 4.0 GHz | 1.92TB * 2EA | ||
| h4v64m1024_metal | 32 Core | 64 vCore | 1024 GB | Intel Xeon 6737P up to 4.0 GHz | 1.92TB * 2EA | ||
| h4v96m512_metal | 48 Core | 96 vCore | 512 GB | Intel Xeon 6520P up to 3.4 GHz | 1.92TB * 2EA | ||
| h4v96m768_metal | 48 Core | 96 vCore | 768 GB | Intel Xeon 6520P with a maximum frequency of 3.4 GHz | 1.92TB * 2EA | ||
| h4v96m2048_metal | 48 Core | 96 vCore | 2048 GB | Intel Xeon 6520P up to 3.4 GHz | 1.92TB * 2EA | ||
| h4v128m512_metal | 64 Core | 128 vCore | 512 GB | Intel Xeon 6737P up to 4.0 GHz | 1.92TB * 2EA | ||
| h4v128m1024_metal | 64 Core | 128 vCore | 1024 GB | Intel Xeon 6737P up to 4.0 GHz | 1.92TB * 2EA | ||
| h4v128m2048_metal | 64 Core | 128 vCore | 2048 GB | Intel Xeon 6737P up to 4.0 GHz | 1.92TB * 2EA |
s3/h3 server type
The Bare Metal Server s3 server type is offered with standard specifications (vCPU, Memory) and is suitable for high‑performance applications because it receives physically isolated resources. Additionally, the Bare Metal Server h3 server type is offered with high-capacity specifications and is suitable for high-performance applications that require large-scale data processing.
- Supports a total of five vCPU options (16, 32, 64, 96, 128 vCore)
- Intel 4th Generation (Sapphire Rapids) Processor
- Supports up to 64 physical cores, 128 vCPUs, and 2,048 GB of memory.
- Provides two internal disks up to 1.92 TB (for OS use)
| Server type | Physical CPU | vCPU | Memory | CPU Type | Internal Disk(OS) | ||
|---|---|---|---|---|---|---|---|
| s3v16m64_metal | 8 Core | 16 vCore | 64 GB | Intel Xeon Gold 6434 up to 4.1 GHz | 480 GB * 2EA | ||
| s3v16m128_metal | 8 Core | 16 vCore | 128 GB | Intel Xeon Gold 6434 up to 4.1 GHz | 480 GB * 2EA | ||
| s3v16m256_metal | 8 Core | 16 vCore | 256 GB | Intel Xeon Gold 6434 up to 4.1 GHz | 480 GB * 2EA | ||
| h3v32m128_metal | 16 Core | 32 vCore | 128 GB | Intel Xeon Gold 6444Y up to 4.0 GHz | 960 GB * 2EA | ||
| h3v32m256_metal | 16 Core | 32 vCore | 256 GB | Intel Xeon Gold 6444Y up to 4.0 GHz | 960 GB * 2EA | ||
| h3v32m512_metal | 16 Core | 32 vCore | 512 GB | Intel Xeon Gold 6444Y up to 4.0 GHz | 960 GB * 2EA | ||
| h3v64m256_metal | 32 Core | 64 vCore | 256 GB | Intel Xeon Gold 6448H up to 3.2 GHz | 1.92 TB * 2EA | ||
| h3v64m512_metal | 32 Core | 64 vCore | 512 GB | Intel Xeon Gold 6448H up to 3.2 GHz | 1.92 TB * 2EA | ||
| h3v64m1024_metal | 32 Core | 64 vCore | 1024 GB | Intel Xeon Gold 6448H up to 3.2 GHz | 1.92 TB * 2EA | ||
| h3v96m384_metal | 48 Core | 96 vCore | 384 GB | Intel Xeon Gold 6442Y, up to 3.3 GHz | 1.92 TB * 2EA | ||
| h3v96m768_metal | 48 Core | 96 vCore | 768 GB | Intel Xeon Gold 6442Y up to 3.3 GHz | 1.92 TB * 2EA | ||
| h3v96m1536_metal | 48 Core | 96 vCore | 1536 GB | Intel Xeon Gold 6442Y, up to 3.3 GHz | 1.92 TB * 2EA | ||
| h3v128m512_metal | 64 Core | 128 vCore | 512 GB | Intel Xeon Gold 6448H up to 3.2 GHz | 1.92 TB * 2EA | ||
| h3v128m1024_metal | 64 Core | 128 vCore | 1024 GB | Intel Xeon Gold 6448H up to 3.2 GHz | 1.92 TB * 2EA | ||
| h3v128m2048_metal | 64 Core | 128 vCore | 2048 GB | Intel Xeon Gold 6448H up to 3.2 GHz | 1.92 TB * 2EA |
s2/h2 server type
The Bare Metal Server s2 server type is offered with standard specifications (vCPU, Memory) and, because it receives physically isolated resources, it is suitable for high-performance applications.
Additionally, the Bare Metal Server h2 server type is offered with high-capacity specifications and is suitable for high-performance applications that require large-scale data processing.
- Supports five vCPU options (16, 24, 32, 72, 96 vCore)
- Intel 3rd‑Generation (Ice Lake) Processor
- Supports up to 48 physical cores, 96 vCPUs, and 1,024 GB of memory.
- Provides two internal disks up to 1.92 TB for OS use
| Server type | Physical CPU | vCPU | Memory | CPU Type | Internal Disk(OS) | |
|---|---|---|---|---|---|---|
| s2v16m64_metal | 8 Core | 16 vCore | 64 GB | Intel Xeon Gold 6334 with a maximum frequency of 3.6 GHz | 480 GB * 2EA | |
| s2v16m128_metal | 8 Core | 16 vCore | 128 GB | Intel Xeon Gold 6334 with a maximum frequency of 3.6 GHz | 480 GB * 2EA | |
| s2v16m256_metal | 8 Core | 16 vCore | 256 GB | Intel Xeon Gold 6334 with a maximum frequency of 3.6 GHz | 480 GB * 2EA | |
| h2v24m96_metal | 12 Core | 24 vCore | 96 GB | Intel Xeon Gold 5317 up to 3.4 GHz | 960 GB * 2EA | |
| h2v24m192_metal | 12 Core | 24 vCore | 192 GB | Intel Xeon Gold 5317 up to 3.4 GHz | 960 GB * 2EA | |
| h2v24m384_metal | 12 Core | 24 vCore | 384 GB | Intel Xeon Gold 5317 up to 3.4 GHz | 960 GB * 2EA | |
| h2v32m128_metal | 16 Core | 32 vCore | 128 GB | Intel Xeon Gold 6346 with a maximum frequency of 3.6 GHz | 960 GB * 2EA | |
| h2v32m256_metal | 16 Core | 32 vCore | 256 GB | Intel Xeon Gold 6346 up to 3.6 GHz | 960 GB * 2EA | |
| h2v32m512_metal | 16 Core | 32 vCore | 512 GB | Intel Xeon Gold 6346 up to 3.6 GHz | 960 GB * 2EA | |
| h2v72m256_metal | 36 Core | 72 vCore | 256 GB | Intel Xeon Gold 6354, up to 3.6 GHz | 1.92 TB * 2EA | |
| h2v72m512_metal | 36 Core | 72 vCore | 512 GB | Intel Xeon Gold 6354 with a maximum frequency of 3.6 GHz | 1.92 TB * 2EA | |
| h2v72m1024_metal | 36 Core | 72 vCore | 1024 GB | Intel Xeon Gold 6354 with a maximum frequency of 3.6 GHz | 1.92 TB * 2EA | |
| h2v96m384_metal | 48 Core | 96 vCore | 384 GB | Intel Xeon Gold 6342 up to 3.3 GHz | 1.92 TB * 2EA | |
| h2v96m768_metal | 48 Core | 96 vCore | 768 GB | Intel Xeon Gold 6342 up to 3.3 GHz | 1.92 TB * 2EA |
4.1.2 - Monitoring Metrics
According to Samsung Cloud Platform’s policy, the Cloud Monitoring service is scheduled to be discontinued in September 2026.
Accordingly, after the September 2026 release, resource monitoring of the Samsung Cloud Platform via Cloud Monitoring will no longer be possible.
With the new alternative service, you can continuously perform resource monitoring by leveraging ServiceWatch released in October 2025.
ServiceWatch provides more modern and powerful features, replacing Cloud Monitoring to deliver a seamless monitoring environment.
If you are collecting metrics and logs through the Cloud Monitoring Agent, you need to switch to the ServiceWatch Agent.
For detailed information about ServiceWatch, please refer to ServiceWatch Overview.
Detailed information about ServiceWatch Agent can be found in the ServiceWatch Agent.
Bare Metal Server Monitoring Metrics
The table below shows the monitoring metrics of Bare Metal Server that can be viewed through Cloud Monitoring.
| Performance items | Detailed description | unit |
|---|---|---|
| Core Usage [IO Wait] | Ratio of CPU time spent in wait state (disk wait) | % |
| Core Usage [System] | Proportion of CPU time spent in kernel space | % |
| Core Usage [User] | Proportion of CPU time spent in user space | % |
| CPU Cores | Number of CPU cores on the host | cnt |
| CPU Usage [Active] | Percentage of CPU time used, excluding Idle and IOWait states | % |
| CPU Usage [Idle] | It is the proportion of CPU time spent in idle state. | % |
| CPU Usage [IO Wait] | Ratio of CPU time spent in wait state (disk wait) | % |
| CPU Usage [System] | Percentage of CPU time used by the kernel | % |
| CPU Usage [User] | Percentage of CPU time used in user space | % |
| CPU Usage/Core [Active] | Percentage of CPU time used other than Idle and IOWait states | % |
| CPU Usage/Core [Idle] | Ratio of CPU time spent in idle state | % |
| CPU Usage/Core [IO Wait] | Ratio of CPU time spent in wait state (disk wait) | % |
| CPU Usage/Core [System] | Percentage of CPU time used by the kernel | % |
| CPU Usage/Core [User] | Percentage of CPU time used in user space | % |
| Disk CPU Usage [IO Request] | Proportion of CPU time during which I/O requests to the device were executed | % |
| Disk Queue Size [Avg] | Average queue length of requests executed for the device | num |
| Disk Read Bytes | Bytes per second read from the device | bytes |
| Disk Read Bytes [Delta Avg] | Average of system.diskio.read.bytes_delta for individual disks | bytes |
| Disk Read Bytes [Delta Max] | Maximum system.diskio.read.bytes_delta of individual disks | bytes |
| Disk Read Bytes [Delta Min] | minimum system.diskio.read.bytes_delta of individual disks | bytes |
| Disk Read Bytes [Delta Sum] | Sum of system.diskio.read.bytes_delta of individual disks | bytes |
| Disk Read Bytes [Delta] | Delta of the system.diskio.read.bytes value for each Disk | bytes |
| Disk Read Bytes [Success] | Total bytes successfully read | bytes |
| Disk Read Requests | Number of read requests to the disk device per second | cnt |
| Disk Read Requests [Delta Avg] | Average of system.diskio.read.count_delta for individual disks | cnt |
| Disk Read Requests [Delta Max] | Maximum system.diskio.read.count_delta for individual disks | cnt |
| Disk Read Requests [Delta Min] | Minimum of individual Disks’ system.diskio.read.count_delta | cnt |
| Disk Read Requests [Delta Sum] | Sum of system.diskio.read.count_delta for individual disks | cnt |
| Disk Read Requests [Success Delta] | Delta of system.diskio.read.count for each Disk | cnt |
| Disk Read Requests [Success] | Total number of successful reads | cnt |
| Disk Request Size [Avg] | Average size of requests executed on the device (unit: sectors) | num |
| Disk Service Time [Avg] | Average service time (ms) of input requests executed on the device | ms |
| Disk Wait Time [Avg] | Average time taken for requests executed on the device to be supported | ms |
| Disk Wait Time [Read] | Average disk wait time | ms |
| Disk Wait Time [Write] | Disk average wait time | ms |
| Disk Write Bytes [Delta Avg] | Average of system.diskio.write.bytes_delta for individual disks | bytes |
| Disk Write Bytes [Delta Max] | Maximum system.diskio.write.bytes_delta of individual disks | bytes |
| Disk Write Bytes [Delta Min] | Minimum of system.diskio.write.bytes_delta for individual disks | bytes |
| Disk Write Bytes [Delta Sum] | Sum of system.diskio.write.bytes_delta of individual disks | bytes |
| Disk Write Bytes [Delta] | individual Disk’s system.diskio.write.bytes value delta | bytes |
| Disk Write Bytes [Success] | Total bytes successfully written | bytes |
| Disk Write Requests | Number of write requests to the disk device per second | cnt |
| Disk Write Requests [Delta Avg] | Average of system.diskio.write.count_delta for individual disks | cnt |
| Disk Write Requests [Delta Max] | Maximum system.diskio.write.count_delta of individual disks | cnt |
| Disk Write Requests [Delta Min] | Minimum of system.diskio.write.count_delta for individual disks | cnt |
| Disk Write Requests [Delta Sum] | Sum of system.diskio.write.count_delta of individual disks | cnt |
| Disk Write Requests [Success Delta] | Delta of system.diskio.write.count for each Disk | cnt |
| Disk Write Requests [Success] | Total number of successful writes | cnt |
| Disk Writes Bytes | Bytes per second written to the device | bytes |
| Filesystem Hang Check | filesystem(local/NFS) hang check (normal:1, abnormal:0) | status |
| Filesystem Nodes | Total number of file nodes in the file system | cnt |
| Filesystem Nodes [Free] | Total number of available file nodes in the file system | cnt |
| Filesystem Size [Available] | Disk space (bytes) available to unauthorized users | bytes |
| Filesystem Size [Free] | Available disk space (bytes) | bytes |
| Filesystem Size [Total] | Total disk space (bytes) | bytes |
| Filesystem Usage | Used disk space percentage | % |
| Filesystem Usage [Avg] | Average of individual filesystem.used.pct | % |
| Filesystem Usage [Inode] | inode usage | % |
| Filesystem Usage [Max] | Maximum among individual filesystem.used.pct | % |
| Filesystem Usage [Min] | minimum among individual filesystem.used.pct | % |
| Filesystem Usage [Total] | Total filesystem usage | % |
| Filesystem Used | Used disk space (bytes) | bytes |
| Filesystem Used [Inode] | inode usage | bytes |
| Memory Free | Total amount of available memory (bytes). Memory used by system cache and buffers is not included. | bytes |
| Memory Free [Actual] | Actual usable memory (bytes). | bytes |
| Memory Free [Swap] | Available swap memory. | bytes |
| Memory Total | total memory | bytes |
| Memory Total [Swap] | Total swap memory. | bytes |
| Memory Usage | Percentage of used memory | % |
| Memory Usage [Actual] | Percentage of memory actually used | % |
| Memory Usage [Cache Swap] | cached swap usage | % |
| Memory Usage [Swap] | Percentage of used swap memory | % |
| Memory Used | used memory | bytes |
| Memory Used [Actual] | Actual used memory (bytes). The value obtained by subtracting used memory from total memory. | bytes |
| Memory Used [Swap] | Used swap memory | bytes |
| Collisions | Network collision | cnt |
| Network In Bytes | Number of received bytes | bytes |
| Network In Bytes [Delta Avg] | Average of system.network.in.bytes_delta for each network | bytes |
| Network In Bytes [Delta Max] | Maximum of system.network.in.bytes_delta for each network | bytes |
| Network In Bytes [Delta Min] | Minimum of system.network.in.bytes_delta for each network | bytes |
| Network In Bytes [Delta Sum] | Sum of system.network.in.bytes_delta for individual networks | bytes |
| Network In Bytes [Delta] | Delta of received byte count | bytes |
| Network In Dropped | Number of deleted packets among incoming packets | cnt |
| Network In Errors | Number of errors during reception | cnt |
| Network In Packets | Number of received packets | cnt |
| Network In Packets [Delta Avg] | Average of system.network.in.packets_delta for each network | cnt |
| Network In Packets [Delta Max] | Maximum of system.network.in.packets_delta for each network | cnt |
| Network In Packets [Delta Min] | Minimum of system.network.in.packets_delta for each network | cnt |
| Network In Packets [Delta Sum] | Sum of system.network.in.packets_delta for individual networks | cnt |
| Network In Packets [Delta] | Delta of received packet count | cnt |
| Network Out Bytes | Number of transmitted bytes | bytes |
| Network Out Bytes [Delta Avg] | Average of system.network.out.bytes_delta for each network | bytes |
| Network Out Bytes [Delta Max] | Maximum of system.network.out.bytes_delta for individual networks | bytes |
| Network Out Bytes [Delta Min] | Minimum of system.network.out.bytes_delta for individual networks | bytes |
| Network Out Bytes [Delta Sum] | Sum of system.network.out.bytes_delta for individual networks | bytes |
| Network Out Bytes [Delta] | Delta of transmitted byte count | bytes |
| Network Out Dropped | Number of deleted packets among outgoing packets | cnt |
| Network Out Errors | Number of errors during transmission | cnt |
| Network Out Packets | Number of transmitted packets | cnt |
| Network Out Packets [Delta Avg] | Average of system.network.out.packets_delta for each network | cnt |
| Network Out Packets [Delta Max] | Maximum of system.network.out.packets_delta for each network | cnt |
| Network Out Packets [Delta Min] | Minimum of system.network.out.packets_delta for each network | cnt |
| Network Out Packets [Delta Sum] | Sum of system.network.out.packets_delta for individual networks | cnt |
| Network Out Packets [Delta] | Delta of transmitted packet count | cnt |
| Open Connections [TCP] | All open TCP connections | cnt |
| Open Connections [UDP] | All open UDP connections | cnt |
| Port Usage | Available port usage rate | % |
| SYN Sent Sockets | Number of sockets in SYN_SENT state (when connecting from local to remote) | cnt |
| Kernel PID Max | kernel.pid_max value | cnt |
| Kernel Thread Max | kernel.threads-max value | cnt |
| Process CPU Usage | Percentage of CPU time consumed by the process since the last update | % |
| Process CPU Usage/Core | Percentage of CPU time used by the process since the last event | % |
| Process Memory Usage | The proportion of main memory (RAM) occupied by a process | % |
| Process Memory Used | Resident Set size. The amount of memory a process occupies in RAM. | bytes |
| Process PID | process pid | pid |
| Process PPID | Parent process PID | pid |
| Processes [Dead] | Number of dead processes | cnt |
| Processes [Idle] | Number of idle processes | cnt |
| Processes [Running] | Number of running processes | cnt |
| Processes [Sleeping] | sleeping processes count | cnt |
| Processes [Stopped] | number of stopped processes | cnt |
| Processes [Total] | Total number of processes | cnt |
| Processes [Unknown] | Number of processes with an unknown or unsearchable status | cnt |
| Processes [Zombie] | zombie processes count | cnt |
| Running Process Usage | process usage | % |
| Running Processes | Number of running processes | cnt |
| Running Thread Usage | Thread usage | % |
| Running Threads | Total number of threads running in running processes | cnt |
| Context Switches | context switch count (per second) | cnt |
| Load/Core [1 min] | The load over the last 1 minute divided by the number of cores | cnt |
| Load/Core [15 min] | The load over the last 15 minutes divided by the number of cores | cnt |
| Load/Core [5 min] | The load over the last 5 minutes divided by the number of cores | cnt |
| Multipaths [Active] | External storage connection path status = active count | cnt |
| Multipaths [Failed] | External storage connection path status = failed count | cnt |
| Multipaths [Faulty] | External storage connection path status = faulty count | cnt |
| NTP Offset | measured offset of the last sample (the time difference between the NTP server and the local environment) | num |
| Run Queue Length | Execution queue length | num |
| Uptime | OS uptime (uptime). (milliseconds) | ms |
| Context Switchies | CPU context switch count (per second) | cnt |
| Disk Read Bytes [Sec] | Bytes read per second from the Windows logical disk | cnt |
| Disk Read Time [Avg] | Average data read time (seconds) | sec |
| Disk Transfer Time [Avg] | Disk average wait time | sec |
| Disk Usage | Disk usage | % |
| Disk Write Bytes [Sec] | Number of bytes written in one second on a Windows logical disk | cnt |
| Disk Write Time [Avg] | Average data write time (seconds) | sec |
| Pagingfile Usage | Paging file usage | % |
| Pool Used [Non Paged] | Nonpaged Pool usage in kernel memory | bytes |
| Pool Used [Paged] | Paged Pool usage in kernel memory | bytes |
| Process [Running] | Number of currently running processes | cnt |
| Threads [Running] | Current number of running threads | cnt |
| Threads [Waiting] | Number of threads waiting for processor time | cnt |
4.2 - How-to guides
Users can create the service by entering the required information for a Bare Metal Server and selecting detailed options through the Samsung Cloud Platform Console.
Create Bare Metal Server
You can create and use the Bare Metal Server service from the Samsung Cloud Platform Console.
To create a Bare Metal Server, follow these steps.
- Click the All Services > Compute > Bare Metal Server menu. Navigate to the Service Home page of the Bare Metal Server.
- On the Service Home page, click the Create Bare Metal Server button. You will be taken to the Create Bare Metal Server page.
- On the Bare Metal Server Creation page, enter the information required to create the service and select detailed options.
- Select the required information in the Image and Version Selection area.
Category Required statusDetailed description image Required Select the type of image provided - RHEL
- Rocky Linux
- Ubuntu
- Windows
Image version Required Select version of the chosen image - Provide a list of versions for the supplied server image
Table. Bare Metal Server image and version input fields - Enter or select the required information in the Service Information Input area.
Category Required statusDetailed description Number of servers Required Number of Bare Metal Server instances to create simultaneously - Only numeric input is allowed, and it must be between 1 and 5
Service Type > Server Type Required Bare Metal Server server type - Select the desired vCPU, Memory, Disk specifications
- For detailed information about the server types provided by Bare Metal Server, refer to Bare Metal Server Server Type
Service Type > Planned Compute Required Status of resources with Planned Compute configured - In Use: Number of resources with Planned Compute that are currently in use
- Configured: Number of resources with Planned Compute configured
- Coverage Preview: Amount applied per resource by Planned Compute
- Create Planned Compute Service: Go to the Planned Compute application page
- For more details, refer to Planned Compute Apply
Automation account Required Automatically create an account to provide automation features after creating a Bare Metal Server - This account is used only for inter-system interfaces
- The password is encrypted and cannot be accessed from outside the system
- If the account is deleted, network changes and some automation functions may be restricted
Table. Bare Metal Server Service Information Input Items - In the Required Information Input area, enter or select the necessary information.
Category RequiredDetailed description Administrator account Required Set the administrator account and password to be used when connecting to the server - RHEL and Ubuntu OS are provided with a fixed root account
- For Windows OS, enter 5 to 20 characters using lowercase letters and numbers
AdministratorNot allowed
Server name Required When the selected number of servers is 1, enter a name to distinguish the Bare Metal Server - Set the hostname to the entered server name
- Start with a lowercase English letter, and use lowercase letters, numbers, and special characters (
-) to enter between 3 and 15 characters
- Must not end with a special character (
-)
Server name Prefix Required Enter a prefix to distinguish each Bare Metal Server generated when the selected number of servers is 2 or more - Automatically generated in the form of user input value (prefix) + ‘
-###’
- Must start with a lowercase English letter and be 3 to 15 characters long, using lowercase letters, numbers, and special characters (
-)
- Must not end with a special character (
-)
Network Settings Required Set the network where the Bare Metal Server will be installed - Select a pre‑created VPC
- General Subnet: Select a pre‑created general Subnet
- IP can be set to auto‑generate or user input, and if input is selected, the user enters the IP manually
- NAT: Available only when there is a single server and the VPC is attached to an Internet Gateway
- Checking the option allows selection of a NAT IP
- NAT IP: Select a NAT IP
- If no NAT IP is available, click the Create New button to generate a Public IP
- Refresh button to view and select the created Public IP
- Creating a Public IP incurs charges according to the Public IP pricing policy
- Local Subnet (optional): Choose to use a local Subnet
- It is not a required element for creating the service
- Select a pre‑created local Subnet
- IP can be set to auto‑generate or user input, and if input is selected, the user enters the IP manually
Table. Required information input fields for Bare Metal Server
- Select the required information in the Image and Version Selection area.
Please use a firewall, etc., to control traffic access for the Bare Metal Server. Security Groups are not provided.
Bare Metal Server’s firewall can only be used to control traffic between the Bare Metal Server and Virtual Server. To use the Bare Metal Server’s firewall, follow the steps below.
- Separate the VPC of Bare Metal Server: Separate them so that Bare Metal Server and Virtual Server do not use the same VPC.
- Create Transit Gateway: Create a Transit Gateway.
- Integration between the Virtual Server’s VPC and the Bare Metal Server’s VPC uses a Transit Gateway.
- When creating a Transit Gateway integration in the VPC of a Bare Metal Server, you must also create the Bare Metal Server’s firewall.
- Firewall Rule registration: Add a rule to the Bare Metal Server’s firewall.
- On the Bare Metal Server Creation page, in the Additional Information Input area, enter or select the required information.
Category RequiredDetailed description Local disk partition Selection Set usage of local disk partitions - Up to 10 partitions can be created, including the root partition
- Up to 90% of total capacity can be used
- After checking Use, partition information can be set
- Set root partition information
- Partition type: flat or lvm selectable
- Partition name: enter partition name
- Partition type can be entered only when it is lvm
- Enter up to 15 characters starting with a letter and containing letters, numbers, and special characters (
-_)
- Partition size: enter at least 50 GB
- Filesystem type: select according to the image used
- For RHEL, Rocky Linux: xfs, ext4
- For Ubuntu: ext4, xfs, btrfs
- For SLES: btrfs, xfs, ext4
- Mount point: start with special character
/and include letters, numbers, and special characters (-_) up to 15 characters- Filesystem type cannot be entered when it is swap
- Available capacity: 90% of the default disk capacity provided when selecting a server
- When setting Partition size, the remaining capacity is automatically calculated and displayed
- The total size of partition disks cannot exceed the available capacity
- Set additional partition information
- Partition type: flat or lvm selectable
- Partition name: enter partition name
- Partition type can be entered only when it is lvm
- Enter up to 15 characters starting with a letter and containing letters, numbers, and special characters (
-_)
- Partition size: enter at least 1 GB
- Filesystem type: select according to the image used>
- For RHEL, Rocky Linux: xfs, ext4, swap
- For Ubuntu: ext4, xfs, btrfs, swap
- For SLES: btrfs, xfs, ext4, swap
- Mount point: start with special character
/and include letters, numbers, and special characters (-_) up to 15 characters- Filesystem type cannot be entered when it is swap
- Available capacity: 90% of the default disk capacity provided when selecting a server
- When setting Partition size, the remaining capacity is automatically calculated and displayed
- The total size of partition disks cannot exceed the available capacity
Placement Group Selection Servers belonging to the same Placement group are distributed across different racks - Provides distributed placement for up to two servers belonging to the same Placement group
- For distributed placement of three or more servers, add additional Placement groups
- Applicable only at initial creation and cannot be modified afterward
- If you terminate the last server in a Placement group, that Placement group is automatically deleted
Lock Selection Using a lock prevents accidental actions that could terminate, start, or stop the server. Hyper Threading Selection Set the logical core count to operate at twice the number of physical cores - To turn off Hyper-Threading, clear the checkbox
- Cannot be changed after server creation
Init Script Selection Script to run when the server starts - The Init Script must be selected differently depending on the image type
- For Windows: select Batch Script
- For Linux: Shell Script
Table. Bare Metal Server additional information entry fields - Summary Check the detailed information and estimated billing amount generated in the panel, and click the Create button.
- Once creation is complete, check the created resources on the Bare Metal Server List page.
Bare Metal Server Check detailed information
Bare Metal Server service allows you to view and edit the full resource list and detailed information. The Bare Metal Server Details page consists of Details, Tags, Activity Log tabs.
To view detailed information about the Bare Metal Server, follow these steps.
- All Services > Compute > Bare Metal Server Click the menu. Navigate to the Service Home page of Bare Metal Server.
- On the Service Home page, click the Bare Metal Server menu. You will be taken to the Bare Metal Server List page.
- Bare Metal Server List page, click the resource to view detailed information. You will be taken to the Bare Metal Server Details page.
- The Bare Metal Server Details page displays status information and additional feature information, and consists of Details, Tags, Activity Log tabs.
Category Detailed description Bare Metal Server status Status of user-created Bare Metal Server - Creating: state while the server is being created
- Running:: state after creation, ready for use
- Editing:: state while IP is being changed
- Unknown: error state
- Starting: state while the server is starting
- Stopping: state while the server is stopping
- Stopped: state after the server has stopped
- Terminating: state while termination is in progress
- Terminated: state after termination is complete
Server control Button to change server status - Start: Start a stopped server
- Stop: Stop a running server
Service termination Cancel service button Table. Bare Metal Server status information and additional features
- The Bare Metal Server Details page displays status information and additional feature information, and consists of Details, Tags, Activity Log tabs.
Detailed Information
Bare Metal Server List page lets you view detailed information of the selected resource and modify it if necessary.
| Category | Detailed description |
|---|---|
| service | Service name |
| Resource Type | Resource Type |
| SRN | Unique resource ID in Samsung Cloud Platform
|
| Resource name | Resource Name
|
| Resource ID | Unique resource ID in the service |
| constructor | User who created the service |
| Creation date and time | Service creation date and time |
| editor | User who edited the service information |
| Modification date | Date and time the service information was modified |
| Server name | Server name |
| Image/Version | Server OS image and version |
| Server type | Display vCPU and memory information |
| Planned Compute | Resource status with Planned Compute configured
|
| Lock | Display whether Lock is enabled or disabled
|
| Hyper Threading | Indicates whether Hyper Threading is enabled/disabled
|
| Network | Network information of the Bare Metal Server
|
| Local Subnet | Local Subnet information of the Bare Metal Server
|
| Block Storage | Block Storage information connected to the server
|
| Init Script | View the Init Script content entered when creating the server |
tag
On the Bare Metal Server List page, you can view the tag information of the selected resource, and add, modify, or delete it.
| Category | Detailed description |
|---|---|
| Tag list | Tag list
|
Job History
Bare Metal Server List page allows you to view the operation history of the selected resource.
| Category | Detailed description |
|---|---|
| Task History List | Resource Change History
|
Bare Metal Server Resource Management
If you need server control and management functions for the created Bare Metal Server resources, you can perform the tasks on the Bare Metal Server List or Bare Metal Server Details page.
Bare Metal Server Operation Control
You can start, stop, and restart a running Bare Metal Server.
To control the operation of a Bare Metal Server, follow these steps.
- Click the All Services > Compute > Bare Metal Server menu. You will be taken to the Service Home page of the Bare Metal Server.
- On the Service Home page, click the Bare Metal Server menu. You will be taken to the Bare Metal Server List page.
- Bare Metal Server List page allows you to select multiple servers and control them simultaneously using the Start and Stop buttons at the top of the table.
- Bare Metal Server Details page also allows you to start and stop the server.
- On the Bare Metal Server List page, click the resource to control its operation and navigate to the Bare Metal Server Details page.
- Check the server status and complete the changes using each Server Management button.
- Start: Starts a stopped server.
- Stop: Stops the running server.
When a Bare Metal Server is stopped, the server’s power is turned off.
- Since it may affect running applications or storage, we recommend shutting down the OS before stopping.
- After shutting down the OS, be sure to also stop it in the Console.
- If a Bare Metal Server cannot be started when a start request is made, see the following.
- When lock is set: Change the lock setting to disabled, then try again.
- If the Bare Metal Server’s status is not Stopped: Change the Bare Metal Server’s status to Stopped, then try again.
- If a stop request for a Bare Metal Server cannot be fulfilled, see the following.
- When Lock is set: Change the Lock setting to disabled, then try again.
- If the Bare Metal Server’s status is not Running: Change the Bare Metal Server’s status to Running, then try again.
Add Block Storage
You can add Block Storage to a Bare Metal Server.
To add Block Storage, follow the steps below.
- Click the All Services > Compute > Bare Metal Server menu. Navigate to the Service Home page of the Bare Metal Server.
- Click the Bare Metal Server menu on the Service Home page. You will be taken to the Bare Metal Server List page.
- On the Bare Metal Server List page, click the server to which you want to add Block Storage. You will be taken to the Bare Metal Server Details page.
- On the Bare Metal Server Details page, click the Add button in the Block Storage section.
- When the popup confirming the addition of Block Storage opens, click the Confirm button. You will be taken to the Block Storage (BM) Creation page.
- On the Block Storage(BM) Creation page, enter the information required to create the service and create a Block Storage.
- For detailed information on creating Block Storage (BM), please refer to Create Block Storage(BM).
- Go to the Bare Metal Server Details page where Block Storage was added and verify that Block Storage has been added.
Terminate Bare Metal Server
Terminating an unused Bare Metal Server can reduce operating costs. However, terminating a Bare Metal Server may cause the running service to stop immediately, so you should proceed with termination only after fully considering the impact of service interruption.
To terminate a Bare Metal Server, follow the steps below.
- Click the All Services > Compute > Bare Metal Server menu. Navigate to the Service Home page of the Bare Metal Server.
- Click the Bare Metal Server menu on the Service Home page. You will be taken to the Bare Metal Server List page.
- Bare Metal Server List page, select the resource to terminate, and click the Terminate Service button.
- You can select multiple resources and delete them simultaneously.
- You can also delete it by clicking the Service Termination button on the Bare metal Server Details page of the resource to be terminated.
- When termination is complete, verify on the Bare Metal Server list page that the resource has been terminated.
Cancellation constraints
If termination is not possible when requesting cancellation of a Bare Metal Server, a popup will inform you. Please refer to the cases below.
When Block Storage(BM) is attached (simultaneous termination of two or more servers): Please detach the Block Storage(BM) first.
- For detailed information on how to cancel, see Block Storage(BM) Cancellation.
When File Storage is connected: First disconnect the File Storage connection.
- For detailed information on how to cancel, refer to File Storage Cancellation.
When Lock is set: Change the Lock setting to disabled, then try again.
If there are resources connected to Backup Agent or Load Balancer: First disconnect the connections of those resources.
If resource management tasks for the Bare Metal Server are in progress on the same account: After the Bare Metal Server resource management task is completed, please try again.
If the Bare Metal Server’s status is not Running or Stopped: Change the Bare Metal Server’s status to Running or Stopped, then try again.
If the selection includes a server that cannot be terminated simultaneously: Select only resources that can be terminated and try again.
Configure local Subnet
After completing the creation of a Bare Metal Server, when adding a local Subnet on the Bare Metal Server Details page, the user must directly configure the network settings of the local Subnet.
first connection(kr-west)
There is no local Subnet currently connected to the Bare Metal Server, and if you are adding the first connection, follow the guide below.
This guide applies to kr-west (Korea West) when adding the first local Subnet connection to the server.
- kr-south(South Korea) guide can be found in the First Connection (kr-south) chapter.
Linux - Configuring Subnet on Ubuntu
To add a local Subnet and configure the network on an Ubuntu operating system, follow these steps.
On the Bare Metal Server Details page, check the Interface Name.
Retrieve the network configuration information.
Color mode[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml network: ethernets: ens2f1: match: macaddress: 68:05:ca:d4:09:91 mtu: 1500 set-name: ens2f1 ens4f1: match: macaddress: 68:05:ca:d4:09:01 mtu: 1500 set-name: ens4f1[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml network: ethernets: ens2f1: match: macaddress: 68:05:ca:d4:09:91 mtu: 1500 set-name: ens2f1 ens4f1: match: macaddress: 68:05:ca:d4:09:01 mtu: 1500 set-name: ens4f1Code block. View network configuration file After adding a new VLAN, set the IP for the bonding configuration.
- Replace the ID and IP in the example code with the assigned ID and IP.Color mode
[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml network: bonds: bond-mgt: interfaces: - ens2f1 // **Bare Metal Server Details** page: enter the Interface Name you verified. - ens4f1 // **Bare Metal Server Details** page: enter the Interface Name you verified. mtu: 1500 parameters: mii-monitor-interval: 100 mode: active-backup transmit-hash-policy: layer2 ethernets: ens2f1: match: macaddress: 68:05:ca:d4:09:91 mtu: 1500 set-name: ens2f1 ens4f1: match: macaddress: 68:05:ca:d4:09:01 mtu: 1500 set-name: ens4f1 vlans: bond-mgt.20: // Enter the Vlan ID you verified in the SCP Console instead of 20. addresses: - 192.168.0.10/24 // Set it to the local Subnet IP confirmed in the SCP Console. id: 20 // Set it to the VLAN ID verified in the SCP Console. link: bond-mgt mtu: 1500[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml network: bonds: bond-mgt: interfaces: - ens2f1 // **Bare Metal Server Details** page: enter the Interface Name you verified. - ens4f1 // **Bare Metal Server Details** page: enter the Interface Name you verified. mtu: 1500 parameters: mii-monitor-interval: 100 mode: active-backup transmit-hash-policy: layer2 ethernets: ens2f1: match: macaddress: 68:05:ca:d4:09:91 mtu: 1500 set-name: ens2f1 ens4f1: match: macaddress: 68:05:ca:d4:09:01 mtu: 1500 set-name: ens4f1 vlans: bond-mgt.20: // Enter the Vlan ID you verified in the SCP Console instead of 20. addresses: - 192.168.0.10/24 // Set it to the local Subnet IP confirmed in the SCP Console. id: 20 // Set it to the VLAN ID verified in the SCP Console. link: bond-mgt mtu: 1500Code block. IP configuration
- Replace the ID and IP in the example code with the assigned ID and IP.
Apply the changes to the system.
Color mode# netplan apply# netplan applyCode block. Apply changes Check the interface status.
Color mode# ip a or # bash /usr/local/bin/ip.sh# ip a or # bash /usr/local/bin/ip.shCode block. Interface lookup
Linux – Configuring Subnet on CentOS/Red Hat
After adding a local Subnet on a CentOS/Red Hat operating system, follow these steps to configure the network.
On the Bare Metal Server Details page, check the Interface Name.
Modify the following command and execute it.
Color mode#!/usr/bin/bash IP_ADDR="10.1.1.3/24" // Set the local Subnet IP as verified in the Console. VLAN_ID="7" // Set the Vlan ID you verified in the console. BOND_NAME="bond-mgt" BOND_IF_name1="ens2f1" // **Bare Metal Server Details** Enter the Interface Name as shown on the page. BOND_IF_name2="ens4f0" // **Bare Metal Server Details** page, enter the Interface Name you verified. # Delete Connection nmcli con down "Bond ${BOND_NAME}" nmcli con del "Bond ${BOND_NAME}" nmcli con down "System ${BOND_IF_name1}" nmcli con down "System ${BOND_IF_name2}" nmcli con del "System ${BOND_IF_name1}" nmcli con del "System ${BOND_IF_name2}" # Create Bonding nmcli con add con-name ${BOND_NAME} type bond ifname ${BOND_NAME} nmcli conn mod ${BOND_NAME} con-name "Bond ${BOND_NAME}" nmcli conn mod "Bond ${BOND_NAME}" ipv4.method disabled nmcli conn mod "Bond ${BOND_NAME}" ipv6.method ignore nmcli conn mod "Bond ${BOND_NAME}" connect.autoconnect yes nmcli conn mod "Bond ${BOND_NAME}" +bond.options mode=active-backup \ +bond.options xmit_hash_policy=layer2 +bond.options miimon=100 +bond.options num_grat_arp=1 \ +bond.options downdelay=0 +bond.options updelay=0 # Assign bond-slave nmcli conn add type bond-slave ifname ${BOND_IF_name1} con-name "${BOND_IF_name1}" master ${BOND_NAME} nmcli conn mod ${BOND_IF_name1} con-name "System ${BOND_IF_name1}" nmcli conn add type bond-slave ifname ${BOND_IF_name2} con-name "${BOND_IF_name2}" master ${BOND_NAME} nmcli conn mod ${BOND_IF_name2} con-name "System ${BOND_IF_name2}" # Connection UP nmcli conn up "Bond ${BOND_NAME}" # add vlan nmcli conn add type vlan ifname "${BOND_NAME}.${VLAN_ID}" con-name "${BOND_NAME}.${VLAN_ID}" id ${VLAN_ID} dev ${BOND_NAME} nmcli con mod ${BOND_NAME}.${VLAN_ID} con-name "Vlan ${BOND_NAME}.${VLAN_ID}" nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv4.addresses ${IP_ADDR} nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv4.method manual nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv6.method "ignore" nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" connect.autoconnect yes nmcli con up "Vlan ${BOND_NAME}.${VLAN_ID}" nmcli device reapply ${BOND_NAME}.${VLAN_ID}#!/usr/bin/bash IP_ADDR="10.1.1.3/24" // Set the local Subnet IP as verified in the Console. VLAN_ID="7" // Set the Vlan ID you verified in the console. BOND_NAME="bond-mgt" BOND_IF_name1="ens2f1" // **Bare Metal Server Details** Enter the Interface Name as shown on the page. BOND_IF_name2="ens4f0" // **Bare Metal Server Details** page, enter the Interface Name you verified. # Delete Connection nmcli con down "Bond ${BOND_NAME}" nmcli con del "Bond ${BOND_NAME}" nmcli con down "System ${BOND_IF_name1}" nmcli con down "System ${BOND_IF_name2}" nmcli con del "System ${BOND_IF_name1}" nmcli con del "System ${BOND_IF_name2}" # Create Bonding nmcli con add con-name ${BOND_NAME} type bond ifname ${BOND_NAME} nmcli conn mod ${BOND_NAME} con-name "Bond ${BOND_NAME}" nmcli conn mod "Bond ${BOND_NAME}" ipv4.method disabled nmcli conn mod "Bond ${BOND_NAME}" ipv6.method ignore nmcli conn mod "Bond ${BOND_NAME}" connect.autoconnect yes nmcli conn mod "Bond ${BOND_NAME}" +bond.options mode=active-backup \ +bond.options xmit_hash_policy=layer2 +bond.options miimon=100 +bond.options num_grat_arp=1 \ +bond.options downdelay=0 +bond.options updelay=0 # Assign bond-slave nmcli conn add type bond-slave ifname ${BOND_IF_name1} con-name "${BOND_IF_name1}" master ${BOND_NAME} nmcli conn mod ${BOND_IF_name1} con-name "System ${BOND_IF_name1}" nmcli conn add type bond-slave ifname ${BOND_IF_name2} con-name "${BOND_IF_name2}" master ${BOND_NAME} nmcli conn mod ${BOND_IF_name2} con-name "System ${BOND_IF_name2}" # Connection UP nmcli conn up "Bond ${BOND_NAME}" # add vlan nmcli conn add type vlan ifname "${BOND_NAME}.${VLAN_ID}" con-name "${BOND_NAME}.${VLAN_ID}" id ${VLAN_ID} dev ${BOND_NAME} nmcli con mod ${BOND_NAME}.${VLAN_ID} con-name "Vlan ${BOND_NAME}.${VLAN_ID}" nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv4.addresses ${IP_ADDR} nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv4.method manual nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv6.method "ignore" nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" connect.autoconnect yes nmcli con up "Vlan ${BOND_NAME}.${VLAN_ID}" nmcli device reapply ${BOND_NAME}.${VLAN_ID}Code block. IP configuration script Check the interface status.
Color mode# ip a or # bash /usr/local/bin/ip.sh# ip a or # bash /usr/local/bin/ip.shCode block. Interface lookup
Configuring Subnet on Windows
After adding a local Subnet on the Windows operating system, follow these steps to configure the network.
Windows Start icon, right-click it, then run the Windows PowerShell(Administrator) program.
On the Bare Metal Server Details page, check the Interface Name.
Run ncpa.cpl from the Windows Run menu.
Check whether the interface is enabled, and enable it if necessary.
- Activate the Interface Name confirmed on the Bare Metal Server Details page.
- Activate the Interface Name confirmed on the Bare Metal Server Details page.
Create a teaming.
Color modePS C:\> New-NetLbfoTeam –Name “bond-mgt” –TeamMembers ens2f1,ens4f1 PS C:\> Set-NetLbfoTeam –Name “bond-mgt” –LoadBalancingAlgorithm DynamicPS C:\> New-NetLbfoTeam –Name “bond-mgt” –TeamMembers ens2f1,ens4f1 PS C:\> Set-NetLbfoTeam –Name “bond-mgt” –LoadBalancingAlgorithm DynamicCode block. Create Teaming After adding a new VLAN, configure the IP.
- Enter the VLAN ID and local Subnet IP verified on the Bare Metal Server Details page.Color mode
PS C:\> Add-NetLbfoTeamNIC -Team bond_bond-mgt -VlanID 20 -Name bond-mgt.20 PS C:\> Get-NetAdapter PS C:\> netsh interface ip set address bond-mgt.20 static “192.168.0.10/24”PS C:\> Add-NetLbfoTeamNIC -Team bond_bond-mgt -VlanID 20 -Name bond-mgt.20 PS C:\> Get-NetAdapter PS C:\> netsh interface ip set address bond-mgt.20 static “192.168.0.10/24”Code block. Windows IP configuration
- Enter the VLAN ID and local Subnet IP verified on the Bare Metal Server Details page.
In the Windows Run menu, execute ncpa.cpl to check the interface status.
First connection(kr-south)
If there is no local Subnet already connected to the Bare Metal Server and you are adding the first connection, follow the guide below.
This guide is for kr-south(Korea region) when adding the first local Subnet connection to the server.
- The guide for kr-west(Korea West) is in the First link (kr-west) chapter.
Linux - Configuring Subnet on Ubuntu
To add a local Subnet and configure the network on an Ubuntu operating system, follow these steps.
After adding a new VLAN, set the IP.
- Change the ID and IP in the example code to the assigned ID and IP.Color mode
[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml network: bonds: bond-mgt: interfaces: - ens2f1 - ens4f1 mtu: 1500 parameters: mii-monitor-interval: 100 mode: active-backup transmit-hash-policy: layer2 ethernets: ens2f1: match: macaddress: 68:05:ca:d4:09:91 mtu: 1500 set-name: ens2f1 ens4f1: match: macaddress: 68:05:ca:d4:09:01 mtu: 1500 set-name: ens4f1 vlans: bond-mgt.20: addresses: - 192.168.0.10/24 id: 20 link: bond-mgt mtu: 1500 vlans: bond-mgt.21: // Enter the VLAN ID you verified in the console instead of 21. addresses: - 192.168.0.20/24 // Set it to the local Subnet IP you verified in the Console. id: 21 // Set to the VLAN ID verified in the console. link: bond-mgt mtu: 1500[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml network: bonds: bond-mgt: interfaces: - ens2f1 - ens4f1 mtu: 1500 parameters: mii-monitor-interval: 100 mode: active-backup transmit-hash-policy: layer2 ethernets: ens2f1: match: macaddress: 68:05:ca:d4:09:91 mtu: 1500 set-name: ens2f1 ens4f1: match: macaddress: 68:05:ca:d4:09:01 mtu: 1500 set-name: ens4f1 vlans: bond-mgt.20: addresses: - 192.168.0.10/24 id: 20 link: bond-mgt mtu: 1500 vlans: bond-mgt.21: // Enter the VLAN ID you verified in the console instead of 21. addresses: - 192.168.0.20/24 // Set it to the local Subnet IP you verified in the Console. id: 21 // Set to the VLAN ID verified in the console. link: bond-mgt mtu: 1500Code block. Add VLAN and configure IP
- Change the ID and IP in the example code to the assigned ID and IP.
Apply the changes to the system.
Color mode# netplan apply# netplan applyCode block. Reflect changes. Check the interface status.
Color mode# ip a or # bash /usr/local/bin/ip.sh# ip a or # bash /usr/local/bin/ip.shCode block. Interface lookup
Linux – Configuring Subnet on CentOS/Red Hat
After adding a local Subnet on a CentOS/Red Hat operating system, follow these steps to configure the network.
- Check the Bond Name for the local Subnet.Color mode
# sh /usr/local/bin/ip.sh# sh /usr/local/bin/ip.shCode block. Verify bonding - Modify the following command and execute it.Color mode
#!/usr/bin/bash IP_ADDR="10.1.1.3/24" // Set the local Subnet IP as observed in the console. VLAN_ID="7" // Set the Vlan ID you verified in the console. BOND_NAME="bond-mgt" // Set the Bond Name identified in step 1. # add vlan nmcli conn add type vlan ifname "${BOND_NAME}.${VLAN_ID}" con-name "${BOND_NAME}.${VLAN_ID}" id ${VLAN_ID} dev ${BOND_NAME} nmcli con mod ${BOND_NAME}.${VLAN_ID} con-name "Vlan ${BOND_NAME}.${VLAN_ID}" nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv4.addresses ${IP_ADDR} nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv4.method manual nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv6.method "ignore" nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" connect.autoconnect yes nmcli con up "Vlan ${BOND_NAME}.${VLAN_ID}" nmcli device reapply ${BOND_NAME}.${VLAN_ID}#!/usr/bin/bash IP_ADDR="10.1.1.3/24" // Set the local Subnet IP as observed in the console. VLAN_ID="7" // Set the Vlan ID you verified in the console. BOND_NAME="bond-mgt" // Set the Bond Name identified in step 1. # add vlan nmcli conn add type vlan ifname "${BOND_NAME}.${VLAN_ID}" con-name "${BOND_NAME}.${VLAN_ID}" id ${VLAN_ID} dev ${BOND_NAME} nmcli con mod ${BOND_NAME}.${VLAN_ID} con-name "Vlan ${BOND_NAME}.${VLAN_ID}" nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv4.addresses ${IP_ADDR} nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv4.method manual nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv6.method "ignore" nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" connect.autoconnect yes nmcli con up "Vlan ${BOND_NAME}.${VLAN_ID}" nmcli device reapply ${BOND_NAME}.${VLAN_ID}Code block. IP configuration script - Check the interface status.Color mode
# ip a or # bash /usr/local/bin/ip.sh# ip a or # bash /usr/local/bin/ip.shCode block. Interface lookup
Configuring Subnet on Windows
After adding a local Subnet on the Windows operating system, follow these steps to configure the network.
Windows Start icon, right-click it, then run the Windows PowerShell(Administrator) program.
Check the Teaming name for the local Subnet.
Color modePS C:\> Get-NetAdapterPS C:\> Get-NetAdapterCode block. Check Windows interface After adding a new VLAN, configure the IP.
- Enter the Teaming name verified in step 2, and the Vlan ID and local Subnet IP verified in the Console.Color mode
PS C:\> Add-NetLbfoTeamNIC -Team bond_bond-mgt -VlanID 20 -Name bond-mgt.20 PS C:\> Get-NetAdapter PS C:\> netsh interface ip set address bond-mgt.20 static “192.168.0.10/24”PS C:\> Add-NetLbfoTeamNIC -Team bond_bond-mgt -VlanID 20 -Name bond-mgt.20 PS C:\> Get-NetAdapter PS C:\> netsh interface ip set address bond-mgt.20 static “192.168.0.10/24”code block. Create Teaming
- Enter the Teaming name verified in step 2, and the Vlan ID and local Subnet IP verified in the Console.
Run ncpa.cpl from the Windows Run menu to check the interface status.
Add second connection (kr-west, kr-south)
If there is a local subnet connected to the Bare Metal Server, the guide for the second additional connection is below.
Since a Bonding was already created when connecting the first local Subnet, there is no need to create a Bonding when connecting the second local Subnet.
Please refer to the details below.
Linux - Setting up Subnet on Ubuntu
To add a local Subnet and configure the network on an Ubuntu operating system, follow these steps.
After adding a new VLAN, configure the IP.
- Change the ID and IP in the example code to the assigned ID and IP.Color mode
[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml network: bonds: bond-mgt: interfaces: - ens2f1 - ens4f1 mtu: 1500 parameters: mii-monitor-interval: 100 mode: active-backup transmit-hash-policy: layer2 ethernets: ens2f1: match: macaddress: 68:05:ca:d4:09:91 mtu: 1500 set-name: ens2f1 ens4f1: match: macaddress: 68:05:ca:d4:09:01 mtu: 1500 set-name: ens4f1 vlans: bond-mgt.20: addresses: - 192.168.0.10/24 id: 20 link: bond-mgt mtu: 1500 vlans: bond-mgt.21: // Enter the VLAN ID you verified on the console instead of 21. addresses: - 192.168.0.20/24 // Set it to the local Subnet IP verified in the console. id: 21 // Set it to the VLAN ID verified in the console. link: bond-mgt mtu: 1500[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml network: bonds: bond-mgt: interfaces: - ens2f1 - ens4f1 mtu: 1500 parameters: mii-monitor-interval: 100 mode: active-backup transmit-hash-policy: layer2 ethernets: ens2f1: match: macaddress: 68:05:ca:d4:09:91 mtu: 1500 set-name: ens2f1 ens4f1: match: macaddress: 68:05:ca:d4:09:01 mtu: 1500 set-name: ens4f1 vlans: bond-mgt.20: addresses: - 192.168.0.10/24 id: 20 link: bond-mgt mtu: 1500 vlans: bond-mgt.21: // Enter the VLAN ID you verified on the console instead of 21. addresses: - 192.168.0.20/24 // Set it to the local Subnet IP verified in the console. id: 21 // Set it to the VLAN ID verified in the console. link: bond-mgt mtu: 1500Code block. Add VLAN and configure IP
- Change the ID and IP in the example code to the assigned ID and IP.
Apply the changes to the system.
Color mode# netplan apply# netplan applyCode block. Reflect changes Check the interface status.
Color mode# ip a or # bash /usr/local/bin/ip.sh# ip a or # bash /usr/local/bin/ip.shCode block. Interface lookup
Linux – Setting Subnet on CentOS/Red Hat
After adding a local Subnet on a CentOS/Red Hat operating system, follow these steps to configure the network.
- Verify the Bond Name for the local Subnet.Color mode
# sh /usr/local/bin/ip.sh# sh /usr/local/bin/ip.shCode block. Verify bonding - Modify the following command and execute it.Color mode
#!/usr/bin/bash IP_ADDR="10.1.1.3/24" // Set the local Subnet IP as observed in the console. VLAN_ID="7" // Set the Vlan ID as observed in the console. BOND_NAME="bond-mgt" // Set the Bond Name confirmed in step 1. # add vlan nmcli conn add type vlan ifname "${BOND_NAME}.${VLAN_ID}" con-name "${BOND_NAME}.${VLAN_ID}" id ${VLAN_ID} dev ${BOND_NAME} nmcli con mod ${BOND_NAME}.${VLAN_ID} con-name "Vlan ${BOND_NAME}.${VLAN_ID}" nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv4.addresses ${IP_ADDR} nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv4.method manual nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv6.method "ignore" nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" connect.autoconnect yes nmcli con up "Vlan ${BOND_NAME}.${VLAN_ID}" nmcli device reapply ${BOND_NAME}.${VLAN_ID}#!/usr/bin/bash IP_ADDR="10.1.1.3/24" // Set the local Subnet IP as observed in the console. VLAN_ID="7" // Set the Vlan ID as observed in the console. BOND_NAME="bond-mgt" // Set the Bond Name confirmed in step 1. # add vlan nmcli conn add type vlan ifname "${BOND_NAME}.${VLAN_ID}" con-name "${BOND_NAME}.${VLAN_ID}" id ${VLAN_ID} dev ${BOND_NAME} nmcli con mod ${BOND_NAME}.${VLAN_ID} con-name "Vlan ${BOND_NAME}.${VLAN_ID}" nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv4.addresses ${IP_ADDR} nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv4.method manual nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" ipv6.method "ignore" nmcli con mod "Vlan ${BOND_NAME}.${VLAN_ID}" connect.autoconnect yes nmcli con up "Vlan ${BOND_NAME}.${VLAN_ID}" nmcli device reapply ${BOND_NAME}.${VLAN_ID}Code block. IP configuration script - Check the interface status.Color mode
# ip a or # bash /usr/local/bin/ip.sh# ip a or # bash /usr/local/bin/ip.shCode block. Interface lookup
Configuring Subnet on Windows
After adding a local Subnet on the Windows operating system, follow these steps to configure the network.
Windows Start icon, right‑click it, then launch the Windows PowerShell(Administrator) program.
Check the Teaming name for the local Subnet.
Color modePS C:\> Get-NetAdapterPS C:\> Get-NetAdapterCode block. Check Windows interface After adding a new Vlan, configure the IP.
- Enter the Teaming name confirmed in step 2, the Vlan ID, and the local Subnet IP confirmed in the Console.Color mode
PS C:\> Add-NetLbfoTeamNIC -Team bond_bond-mgt -VlanID 20 -Name bond-mgt.20 PS C:\> Get-NetAdapter PS C:\> netsh interface ip set address bond-mgt.20 static “192.168.0.10/24”PS C:\> Add-NetLbfoTeamNIC -Team bond_bond-mgt -VlanID 20 -Name bond-mgt.20 PS C:\> Get-NetAdapter PS C:\> netsh interface ip set address bond-mgt.20 static “192.168.0.10/24”Code block. Create Teaming
- Enter the Teaming name confirmed in step 2, the Vlan ID, and the local Subnet IP confirmed in the Console.
Run ncpa.cpl from the Windows Run menu to check the interface status.
Change IP
The IP address may be changed to carry out migration, server replacement, and similar operations.
- If you proceed with changing the IP, you will no longer be able to communicate using that IP, and you cannot cancel the IP change while it is in progress.
- If the server is running the Load Balancer service, you must delete the old IP from the LB server group and directly add the new IP as a member of the LB server group.
- Servers using Public NAT and Privat NAT must disable and then reconfigure Public NAT and Privat NAT after an IP change.
- If you are using Public NAT and Privat NAT, first disable Public NAT and Privat NAT, complete the IP change, and then reconfigure.
- Whether to use Public NAT and Privat NAT can be changed by clicking the Edit button for Public NAT IP and Privat NAT on the Bare Metal Server Details page.
To change the IP, follow the steps below.
Click the All Services > Compute > Bare Metal Server menu. Navigate to the Service Home page of the Bare Metal Server.
Click the Bare Metal Server menu on the Service Home page. You will be taken to the Bare Metal Server List page.
Bare Metal Server List page, click the server whose IP you want to change. You will be taken to the Bare Metal Server Details page.
On the Bare Metal Server Details page, click the Edit button next to the IP item.
When the popup notifying IP modification opens, click the Confirm button. The IP Change popup opens.
In the IP Change popup, complete Step 1, Step 2, and Step 3 in order.
information- When changing an IP, the detailed configuration method for the IP change step varies depending on the subnet of the IP to be changed. Be sure to refer to the following examples and perform the work for each step.
- When changing to an IP that uses the same subnet: Change to an IP in the same Subnet See
- When changing to an IP that uses a different subnet: see Changing to an IP in a different Subnet
- When each step is completed successfully, the task status in the upper right corner is shown as Completed, and you can proceed to the next step.
- When performing the final check of Step 3, it is recommended to restart the server before proceeding with the inspection.
- When changing an IP, the detailed configuration method for the IP change step varies depending on the subnet of the IP to be changed. Be sure to refer to the following examples and perform the work for each step.
After confirming that all tasks have completed successfully, click the Confirm button.
Change to an IP in the same Subnet
This document explains how to configure IP settings per operating system when the IP to be changed uses the same subnet.
Linux – centos/redhat operating system
Step 1
Follow the procedure below to carry out Step 1.
- Select the Subnet to modify.
- Enter the IP address to change.
- Click the IP Allocation Request button.
- If a popup window notifying you of an IP change opens, click the Confirm button.
- When the task completes successfully, the task status in the upper right corner is displayed as Completed.CautionIf you proceed with the IP allocation request for Step 1, you cannot cancel or revert the IP change.
- When the task completes successfully, the task status in the upper right corner is displayed as Completed.
Step 2
Follow the procedure below to carry out Step 2.
For the IP change operation, connect to the target server using the NAT IP.
guideTo prevent situations where communication becomes impossible during operation, we recommend connecting via another Virtual Server or Bare Metal Server created in the same subnet.Enter the IP assigned in Step 1 and configure the new IP on the server.
- In the following example, replace
172.17.34.150with the assigned IP address. - After checking the information of the Interface you want to change on the server, replace it with
bond-srv.9in the following example.Color mode# nmcli con mod "Vlan bond-srv.9" ipv4.addresses 172.17.34.150/24 # nmcli con mod "Vlan bond-srv.9" ipv4.method manual # nmcli device reapply bond-srv.9# nmcli con mod "Vlan bond-srv.9" ipv4.addresses 172.17.34.150/24 # nmcli con mod "Vlan bond-srv.9" ipv4.method manual # nmcli device reapply bond-srv.9Code block. IP settings to change information- Setting the IP disconnects the terminal session.
- Step 2 After completing the task, if the task status changes to Completed, you can reconnect to the terminal.
- In the following example, replace
When all tasks are completed, select the Step 2 completion checkbox in the IP Change popup window.
- When the task completes successfully, the task status in the upper right corner is displayed as Completed.information
- If the task status of Step 2 has changed to Completed but you still have issues with terminal access, go to the All Services > Management > Support Center Contact Us menu and submit an inquiry.
- When the task completes successfully, the task status in the upper right corner is displayed as Completed.
Step 3
Follow the procedure below to carry out Step 3 work.
Connect to the server subject to IP change using the NAT IP and check the communication status.
- Use the following command to verify whether any pre‑change configuration remains and whether the change was applied correctly. If you can connect successfully to the target server whose IP was changed, the new IP is communicating properly.Color mode
# bash /usr/local/bin/ip.sh# bash /usr/local/bin/ip.shCode block. Communication status check ReferenceThe NAT IP does not change.
- Use the following command to verify whether any pre‑change configuration remains and whether the change was applied correctly. If you can connect successfully to the target server whose IP was changed, the new IP is communicating properly.
After all tasks are completed, restart the server and perform a final check.
ReferenceIt is recommended to perform the final check after restarting the server.If there are no issues in the final inspection results, select the work completion checkbox for Step 3 in the IP Change popup window.
Linux – Ubuntu operating system
Step 1
Follow the procedure below to carry out Step 1.
- Select the Subnet to modify.
- Enter the IP address to change.
- IP Allocation Request Click the button.
- When a popup indicating IP change verification opens, click the Confirm button.
- When the task completes successfully, the task status in the upper right corner is displayed as Completed.CautionIf you proceed with the IP allocation request of Step 1, you cannot cancel or revert the IP change.
- When the task completes successfully, the task status in the upper right corner is displayed as Completed.
Step 2
Follow the procedure below to perform Step 2.
To perform the IP change, connect to the target server using its NAT IP.
InformationTo prevent situations where communication becomes impossible during operation, we recommend connecting via another Virtual Server or Bare Metal Server created in the same subnet.Enter the IP assigned in Step 1 and configure the new IP on the server.
- In the following example, replace
172.17.34.150/24with the assigned IP address. - After checking the information of the Interface you want to change on the server, replace it with the following example
bond-srv.9.Color mode[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml network: bonds: ...................... omitted ethernets: ...................... omitted vlans: bond-srv.9: addresses: - 172.17.34.150/24 # Enter the IP assigned in Step1. gateway4: 172.17.34.2 id: 9 link: bond-srv mtu: 1500 bond-srv.350: addresses: - 172.16.87.150/24 routes: - to: 172.17.87.0/24 via: 172.16.87.1 - to: 172.17.87.0/24 via: 172.16.87.1 id: 350 link: bond-srv[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml network: bonds: ...................... omitted ethernets: ...................... omitted vlans: bond-srv.9: addresses: - 172.17.34.150/24 # Enter the IP assigned in Step1. gateway4: 172.17.34.2 id: 9 link: bond-srv mtu: 1500 bond-srv.350: addresses: - 172.16.87.150/24 routes: - to: 172.17.87.0/24 via: 172.16.87.1 - to: 172.17.87.0/24 via: 172.16.87.1 id: 350 link: bond-srvCode block. Change IP setting
- In the following example, replace
Use the Netplan apply command to apply the changes to the system.
Color mode[root@localhost ~]# netplan apply[root@localhost ~]# netplan applyCode block. Run Netplan apply information- Setting the IP disconnects the terminal session.
- Step 2 After completing the task, if the task status changes to Completed, you can reconnect to the terminal.
When all tasks are completed, select the task completion checkbox for Step 2 in the IP change popup window.
- When the task completes successfully, the task status in the upper right corner is displayed as Completed.information
- If the task status of Step 2 has changed to Completed but you still experience issues with terminal access, go to the All Services > Management > Support Center Contact Us menu and submit an inquiry.
- When the task completes successfully, the task status in the upper right corner is displayed as Completed.
Step 3
Follow the procedure below to carry out Step 3.
Connect to the server subject to IP change using the NAT IP and check the communication status.
- Use the following command to verify whether any pre‑change configuration remains and whether the change was applied correctly. If you can connect successfully to the target server whose IP was changed, the new IP is communicating properly.Color mode
# bash /usr/local/bin/ip.sh# bash /usr/local/bin/ip.shCode block. Communication status check referenceThe NAT IP does not change.
- Use the following command to verify whether any pre‑change configuration remains and whether the change was applied correctly. If you can connect successfully to the target server whose IP was changed, the new IP is communicating properly.
After all tasks are completed, restart the server and then perform a final check.
ReferenceIt is recommended to perform the final check after restarting the server.If the final inspection results show no issues, select the work completion checkbox for Step 3 in the IP Change popup window.
Windows operating system
Step 1
Follow the procedure below to perform Step 1 work.
- Select the Subnet to modify.
- Enter the IP address to change.
- Click the IP Allocation Request button.
- When the pop-up that notifies you of an IP change appears, click the Confirm button.
- When the task completes successfully, the task status in the upper right corner is displayed as Completed.CautionIf you proceed with the IP allocation request of Step 1, you cannot cancel or revert the IP change.
- When the task completes successfully, the task status in the upper right corner is displayed as Completed.
Step 2
Follow the procedure below to perform Step 2.
To perform the IP change, connect to the target server using its NAT IP.
InformationTo prevent communication failures during operation, we recommend connecting through another Virtual Server or Bare Metal Server created in the same subnet.Right-click the Windows Start icon, then run Windows PowerShell(Administrator).
Enter the assigned IP from Step 1 and set the IP to be changed on the server.
- In the following example, replace
172.17.34.150with the assigned IP address.Color modePS C:\> netsh interface ip set address "bond-srv.20" static 172.17.34.150 255.255.255.0PS C:\> netsh interface ip set address "bond-srv.20" static 172.17.34.150 255.255.255.0code block. Change IP setting information- Setting the IP disconnects the terminal session.
- Step 2 After completing the task, if the task status changes to Completed, you can reconnect to the terminal.
- In the following example, replace
When all tasks are completed, select the task completion checkbox for Step 2 in the IP change popup window.
- When the task completes successfully, the task status in the upper right corner is displayed as Completed.Information
- If the task status of Step 2 has changed to Completed but you still have issues with terminal access, go to the All Services > Management > Support Center Contact Us menu and submit an inquiry.
- When the task completes successfully, the task status in the upper right corner is displayed as Completed.
Step 3
Follow the procedure below to carry out Step 3.
Connect to the server targeted for IP change using its NAT IP and verify the communication status.
- Use the following command to verify whether any pre‑change configuration remains and to confirm that the change was applied correctly. If you can connect successfully to the target server whose IP was changed, the new IP is communicating properly.Color mode
PS C:\> Get-NetIPAddress | Format-TablePS C:\> Get-NetIPAddress | Format-TableCode block. Communication status check ReferenceThe NAT IP does not change.
- Use the following command to verify whether any pre‑change configuration remains and to confirm that the change was applied correctly. If you can connect successfully to the target server whose IP was changed, the new IP is communicating properly.
After all tasks are completed, restart the server and then perform a final check.
ReferenceIt is recommended to perform the final check after restarting the server.If there are no issues in the final inspection results, select the work completion checkbox for Step 3 in the IP Change popup window.
Change to an IP of a different Subnet
This explains how to configure IP settings for each operating system when the IP to be changed uses a different subnet.
Linux – centos/redhat operating system
Step 1
Follow the procedure below to carry out Step 1.
- Select the Subnet to modify.
- Enter the IP address to change.
- Click the IP Allocation Request button.
- If a popup window notifying you of an IP change opens, click the Confirm button.
- When the operation completes successfully, Vlan ID verification, Default Gateway verification information is displayed, and the task status in the upper right corner shows Completed.CautionIf you proceed with the IP allocation request for Step 1, you cannot cancel or revert the IP change.
- When the operation completes successfully, Vlan ID verification, Default Gateway verification information is displayed, and the task status in the upper right corner shows Completed.
Step 2
Follow the steps below to carry out Step 2.
For the IP change operation, connect to the target server using the NAT IP.
informationTo prevent situations where communication becomes impossible during operation, we recommend connecting via another Virtual Server or Bare Metal Server created in the same subnet.To add the IP to the server, add a new VLAN and configure the IP.
- Add VLAN: Create the interface for the Vlan ID identified in Step 1. In the following example, replace
20with the assigned ID. - IP configuration: Enter the IP assigned in Step 1. In the following example, replace
192.168.0.10/24with the assigned IP.Color mode# nmcli conn add type vlan ifname "bond-srv.20" con-name "bond-srv.20" id 20 dev bond-srv # nmcli con mod bond-srv.20 con-name "Vlan bond-srv.20" # nmcli con mod "Vlan bond-srv.20" ipv4.addresses 192.168.0.10/24 # nmcli con mod "Vlan bond-srv.20" ipv4.method manual # nmcli con mod "Vlan bond-srv.20" ipv6.method "ignore" # nmcli con up "Vlan bond-srv.20"# nmcli conn add type vlan ifname "bond-srv.20" con-name "bond-srv.20" id 20 dev bond-srv # nmcli con mod bond-srv.20 con-name "Vlan bond-srv.20" # nmcli con mod "Vlan bond-srv.20" ipv4.addresses 192.168.0.10/24 # nmcli con mod "Vlan bond-srv.20" ipv4.method manual # nmcli con mod "Vlan bond-srv.20" ipv6.method "ignore" # nmcli con up "Vlan bond-srv.20"Code block. IP settings to change
- Add VLAN: Create the interface for the Vlan ID identified in Step 1. In the following example, replace
Configure the default gateway for the new VLAN.
- Default gateway setting: Enter the Default gateway IP assigned in Step 1. In the following example, replace
192.168.0.1with the assigned Default gateway IP.Color mode# nmcli con mod "Vlan bond-srv.20" ipv4.gateway 192.168.0.1 # nmcli device reapply bond-srv.20# nmcli con mod "Vlan bond-srv.20" ipv4.gateway 192.168.0.1 # nmcli device reapply bond-srv.20Code block. Change IP settings Info- Setting a Default Gateway on a new VLAN disconnects the terminal session.
- Step 2 After completing the task, if the task status changes to Completed, you can reconnect to the terminal.
- Default gateway setting: Enter the Default gateway IP assigned in Step 1. In the following example, replace
When all tasks are completed, select the task completion checkbox of Step 2 in the IP change popup window.
- When the task completes successfully, the task status in the upper right corner is displayed as Completed.information
- If the task status of Step 2 has changed to Completed but you still experience issues with terminal access, go to the All Services > Management > Support Center’s Contact Us menu and make an inquiry.
- When the task completes successfully, the task status in the upper right corner is displayed as Completed.
Step 3
Follow the procedure below to proceed with Step 3.
Connect to the server to be changed using the NAT IP.
After verifying the Default Gateway IP of the existing (pre-change) interface, delete it.
- In the example below, replace
192.168.10.1with the IP you have verified.Color mode# ip route del default via 192.168.10.1# ip route del default via 192.168.10.1Code block. Delete the Default Gateway IP of the existing interface.
- In the example below, replace
Connect to the server subject to IP change using the NAT IP and check the communication status.
- Use the following command to verify whether any pre‑change configuration remains and whether the change was applied correctly. If you can connect successfully to the target server whose IP was changed, the new IP is communicating properly.Color mode
# netstat –nr # bash /usr/local/bin/ip.sh# netstat –nr # bash /usr/local/bin/ip.shCode block. Communication status check ReferenceThe NAT IP does not change.
- Use the following command to verify whether any pre‑change configuration remains and whether the change was applied correctly. If you can connect successfully to the target server whose IP was changed, the new IP is communicating properly.
Verify the VLAN information of the existing IP and delete it from the server.
- Enter the verified ID in place of
30in the following example.Color mode# nmcli con delete "Vlan bond-srv.30"# nmcli con delete "Vlan bond-srv.30"Code block. Delete the VLAN information of the existing IP.
- Enter the verified ID in place of
After all tasks are completed, restart the server and then perform a final check.
ReferenceIt is recommended to perform the final check after restarting the server.If the final inspection results show no issues, select the work completion checkbox for Step 3 in the IP Change popup window.
Linux – Ubuntu operating system
Step 1
Follow the procedure below to perform Step 1.
- Select the Subnet to modify.
- Enter the IP address to change.
- Click the IP Allocation Request button.
- When a popup notifying you of an IP change opens, click the Confirm button.
- When the operation completes successfully, Vlan ID verification, Default Gateway verification information is displayed, and the operation status in the upper right corner shows Completed.CautionIf you proceed with the IP allocation request of Step 1, you cannot cancel or revert the IP change.
- When the operation completes successfully, Vlan ID verification, Default Gateway verification information is displayed, and the operation status in the upper right corner shows Completed.
Step 2
Follow the procedure below to perform Step 2.
To perform the IP change, connect to the target server using its NAT IP.
InformationTo prevent situations where communication becomes impossible during operation, we recommend connecting via another Virtual Server or Bare Metal Server created in the same subnet.To add the IP to the server, add a new VLAN and configure the IP and Default Gateway.
- In the following example, this is the part where content is added at the bottom of the Step 1 task description.
- In the following example, enter the assigned ID and IP instead of ID and IP.Color mode
[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml network: bonds: ...................... omitted ethernets: ...................... omitted vlans: bond-srv.9: addresses: - 172.17.34.150/24 gateway4: 172.17.34.2 id: 9 link: bond-srv mtu: 1500 bond-srv.350: addresses: - 172.16.87.150/24 routes: - to: 172.17.87.0/24 via: 172.16.87.1 - to: 172.17.87.0/24 via: 172.16.87.1 id: 350 link: bond-srv # Create the interface for the Vlan ID identified in Step1. # Enter the IP assigned in Step1. # Enter the Default gateway IP assigned in Step1. bond-srv.20: addresses: - 192.168.0.10/24 gateway4: 192.168.0.1 id: 20 link: bond-srv mtu: 1500[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml network: bonds: ...................... omitted ethernets: ...................... omitted vlans: bond-srv.9: addresses: - 172.17.34.150/24 gateway4: 172.17.34.2 id: 9 link: bond-srv mtu: 1500 bond-srv.350: addresses: - 172.16.87.150/24 routes: - to: 172.17.87.0/24 via: 172.16.87.1 - to: 172.17.87.0/24 via: 172.16.87.1 id: 350 link: bond-srv # Create the interface for the Vlan ID identified in Step1. # Enter the IP assigned in Step1. # Enter the Default gateway IP assigned in Step1. bond-srv.20: addresses: - 192.168.0.10/24 gateway4: 192.168.0.1 id: 20 link: bond-srv mtu: 1500Code block. Change IP settings
Use the Netplan apply command to apply the changes to the system.
Color mode[root@localhost ~]# netplan apply[root@localhost ~]# netplan applyCode block. Run Netplan apply Info- Setting a new Default Gateway disconnects the terminal session.
- Step 2 After completing the task, if the task status changes to Completed, you can reconnect to the terminal.
When all tasks are completed, select the task completion checkbox of Step 2 in the IP change popup window.
- When the task completes successfully, the task status in the upper right corner is displayed as Completed.information
- If the task status of Step 2 has changed to Completed but you still experience issues with terminal access, go to the All Services > Management > Support Center’s Contact Us menu and make an inquiry.
- When the task completes successfully, the task status in the upper right corner is displayed as Completed.
Step 3
Follow the procedure below to proceed with Step 3.
Connect to the server to be changed using the NAT IP.
Check the Default Gateway IP of the existing (pre‑change) interface, then delete it.
- In the following example, the Delete this line row is the part that gets deleted.Color mode
[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml network: bonds: ...................... omitted ethernets: ...................... omitted vlans: bond-srv.9: addresses: - 172.17.34.150/24 gateway4: 172.17.34.2 #Delete this line id: 9 link: bond-srv mtu: 1500 bond-srv.350: addresses: - 172.16.87.150/24 routes: - to: 172.17.87.0/24 via: 172.16.87.1 - to: 172.17.87.0/24 via: 172.16.87.1 id: 350 link: bond-srv bond-srv.20: addresses: - 192.168.0.10/24 gateway4: 192.168.0.1 id: 20 link: bond-srv mtu: 1500[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml network: bonds: ...................... omitted ethernets: ...................... omitted vlans: bond-srv.9: addresses: - 172.17.34.150/24 gateway4: 172.17.34.2 #Delete this line id: 9 link: bond-srv mtu: 1500 bond-srv.350: addresses: - 172.16.87.150/24 routes: - to: 172.17.87.0/24 via: 172.16.87.1 - to: 172.17.87.0/24 via: 172.16.87.1 id: 350 link: bond-srv bond-srv.20: addresses: - 192.168.0.10/24 gateway4: 192.168.0.1 id: 20 link: bond-srv mtu: 1500Code block. Delete the Default Gateway IP of the existing interface.
- In the following example, the Delete this line row is the part that gets deleted.
Connect to the IP change target server using the NAT IP and check the communication status.
- Use the following command to verify whether any pre‑change configuration remains and that the change was applied correctly. If you can connect successfully to the target server whose IP was changed, the new IP is communicating properly.Color mode
# netstat –nr # bash /usr/local/bin/ip.sh# netstat –nr # bash /usr/local/bin/ip.shCode block. Communication status check ReferenceThe NAT IP does not change.
- Use the following command to verify whether any pre‑change configuration remains and that the change was applied correctly. If you can connect successfully to the target server whose IP was changed, the new IP is communicating properly.
Delete the existing IP.
- In the following example, the Delete this line row is the part that gets deleted.Color mode
[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml network: bonds: ...................... omitted ethernets: ...................... omitted vlans: bond-srv.9: #Delete this line addresses: #Delete this line - 172.17.34.150/24 # Delete this line gateway4: 172.17.34.2 #Delete this line id: 9 # Delete this line link: bond-srv #Delete this line mtu: 1500 #Delete this line bond-srv.350: addresses: - 172.16.87.150/24 routes: - to: 172.17.87.0/24 via: 172.16.87.1 - to: 172.17.87.0/24 via: 172.16.87.1 id: 350 link: bond-srv bond-srv.20: addresses: - 192.168.0.10/24 gateway4: 192.168.0.1 id: 20 link: bond-srv mtu: 1[root@localhost ~]# vi /etc/netplan/50-cloud-init.yaml network: bonds: ...................... omitted ethernets: ...................... omitted vlans: bond-srv.9: #Delete this line addresses: #Delete this line - 172.17.34.150/24 # Delete this line gateway4: 172.17.34.2 #Delete this line id: 9 # Delete this line link: bond-srv #Delete this line mtu: 1500 #Delete this line bond-srv.350: addresses: - 172.16.87.150/24 routes: - to: 172.17.87.0/24 via: 172.16.87.1 - to: 172.17.87.0/24 via: 172.16.87.1 id: 350 link: bond-srv bond-srv.20: addresses: - 192.168.0.10/24 gateway4: 192.168.0.1 id: 20 link: bond-srv mtu: 1Code block. Delete existing IP
- In the following example, the Delete this line row is the part that gets deleted.
Apply the changes to the system.
Color mode[root@localhost ~]# netplan apply [root@localhost ~]# ip link delete bond-srv.9 # Additional steps when deleting VLAN[root@localhost ~]# netplan apply [root@localhost ~]# ip link delete bond-srv.9 # Additional steps when deleting VLANCode block. Apply changes After all tasks are completed, restart the server and then perform a final check.
- If there are no issues in the final inspection results, select the work completion checkbox for Step 3 in the IP change popup window.
Windows operating system
Step 1
Follow the procedure below to carry out Step 1.
- Select the Subnet to modify.
- Enter the IP address to change.
- Click the IP Allocation Request button.
- If a popup notifying IP change verification opens, click the Confirm button.
- When the operation completes successfully, Vlan ID verification, Default Gateway verification information is displayed, and the operation status in the upper right corner shows Completed.CautionIf you proceed with the IP allocation request for Step 1, you cannot cancel or revert the IP change.
- When the operation completes successfully, Vlan ID verification, Default Gateway verification information is displayed, and the operation status in the upper right corner shows Completed.
Step 2
Follow the procedure below to carry out Step 2.
To perform the IP change, connect to the target server using its NAT IP.
informationTo prevent situations where communication becomes impossible during operation, we recommend connecting via another Virtual Server or Bare Metal Server created in the same subnet.Windows Start icon, right-click it, then run Windows PowerShell(Administrator).
Add a VLAN and configure the IP and default gateway.
- Add VLAN: Create the interface for the Vlan ID identified in Step 1. In the following example, replace
20with the assigned ID. - IP configuration: Enter the IP assigned in Step 1. In the following example, replace
46with the ifindex obtained via Get-NetAdapter, and replace 192.168.0.10 with the assigned IP. - Default gateway setting: Enter the Default gateway IP assigned in Step 1. In the following example, replace
192.168.0.1with the assigned Default gateway IP.Color modePS C:\> Add-NetLbfoTeamNIC -Team bond_bond-srv -VlanID 20 -Name bond-srv.20 -Confirm:$false PS C:\> Get-NetAdapter PS C:\> New-NetIPAddress -InterfaceIndex 46 -IPAddress 192.168.0.10 -PrefixLength 24 –defaultgateway 192.168.0.1PS C:\> Add-NetLbfoTeamNIC -Team bond_bond-srv -VlanID 20 -Name bond-srv.20 -Confirm:$false PS C:\> Get-NetAdapter PS C:\> New-NetIPAddress -InterfaceIndex 46 -IPAddress 192.168.0.10 -PrefixLength 24 –defaultgateway 192.168.0.1code block. Change IP settings information- Setting a new Default Gateway disconnects the terminal session.
- Step 2 After completing the task, if the task status changes to Completed, you can reconnect to the terminal.
- Add VLAN: Create the interface for the Vlan ID identified in Step 1. In the following example, replace
When all tasks are completed, select the task completion checkbox of Step 2 in the IP change popup window.
- When the task completes successfully, the task status in the upper right corner is displayed as Completed.information
- If the task status of Step 2 has changed to Completed but you still experience issues with terminal access, go to the All Services > Management > Support Center Contact Us menu and submit an inquiry.
- When the task completes successfully, the task status in the upper right corner is displayed as Completed.
Step 3
Follow the procedure below to carry out Step 3.
Connect to the IP-change target server using the NAT IP.
Run the interface index (ifindex) to check the existing Default Gateway IP.
Color modePS C:\> Get-NetAdapter Name InterfaceDescription ifIndex Status MacAddress LinkSpeed ---- -------------------- ------- ------ ---------- --------- bond-srv.9 Microsoft Network Adapter Multiple...#4 30 Up 40-A6-B7-27-96-D5 50 Gbps bond-srv Microsoft Network Adapter Multiple...#3 19 Up 40-A6-B7-27-96-D5 50 Gbps bond-iscsi Microsoft Network Adapter Multiple...#2 18 Up 40-A6-B7-27-96-D4 50 Gbps bond-backup Microsoft Network Adapter Multiplexo... 22 Up 68-05-CA-C9-EB-88 20 Gbps eno2 Intel(R) Ethernet Connection X722 fo... 12 Disabled 38-68-DD-36-A0-59 1 Gbps ens3f0 Intel(R) Ethernet Network Adapter XX... 11 Up 40-A6-B7-27-96-D4 25 Gbps ………………………………………….. omittedPS C:\> Get-NetAdapter Name InterfaceDescription ifIndex Status MacAddress LinkSpeed ---- -------------------- ------- ------ ---------- --------- bond-srv.9 Microsoft Network Adapter Multiple...#4 30 Up 40-A6-B7-27-96-D5 50 Gbps bond-srv Microsoft Network Adapter Multiple...#3 19 Up 40-A6-B7-27-96-D5 50 Gbps bond-iscsi Microsoft Network Adapter Multiple...#2 18 Up 40-A6-B7-27-96-D4 50 Gbps bond-backup Microsoft Network Adapter Multiplexo... 22 Up 68-05-CA-C9-EB-88 20 Gbps eno2 Intel(R) Ethernet Connection X722 fo... 12 Disabled 38-68-DD-36-A0-59 1 Gbps ens3f0 Intel(R) Ethernet Network Adapter XX... 11 Up 40-A6-B7-27-96-D4 25 Gbps ………………………………………….. omittedcode block. Run Get-NetAdapter Color modePS C:\> get-netroute -ifindex 30 ifIndex DestinationPrefix NextHop RouteMetric PolicyStore ------- ----------------- ------- ----------- ----------- 30 255.255.255.255/32 0.0.0.0 256 ActiveStore 30 224.0.0.0/4 0.0.0.0 256 ActiveStore 30 172.17.35.0/24 172.17.35.1 256 ActiveStore 30 172.17.34.255/32 0.0.0.0 256 ActiveStore 30 172.17.34.14/32 0.0.0.0 256 ActiveStore 30 172.17.34.0/24 0.0.0.0 256 ActiveStore 30 0.0.0.0/0 172.17.34.1 1 ActiveStorePS C:\> get-netroute -ifindex 30 ifIndex DestinationPrefix NextHop RouteMetric PolicyStore ------- ----------------- ------- ----------- ----------- 30 255.255.255.255/32 0.0.0.0 256 ActiveStore 30 224.0.0.0/4 0.0.0.0 256 ActiveStore 30 172.17.35.0/24 172.17.35.1 256 ActiveStore 30 172.17.34.255/32 0.0.0.0 256 ActiveStore 30 172.17.34.14/32 0.0.0.0 256 ActiveStore 30 172.17.34.0/24 0.0.0.0 256 ActiveStore 30 0.0.0.0/0 172.17.34.1 1 ActiveStorecode block. Execute -ifindex Delete the existing Default Gateway IP.
- In the following example, replace
30with the ifindex obtained using Get-NetAdapter, and replace172.17.34.1with the IP you verified.Color modePS C:\> Remove-NetRoute -ifIndex 30 -DestinationPrefix 0.0.0.0/0 -NextHop 172.17.34.1 -Confirm:$falsePS C:\> Remove-NetRoute -ifIndex 30 -DestinationPrefix 0.0.0.0/0 -NextHop 172.17.34.1 -Confirm:$falseCode block. Delete Default Gateway IP Color modePS C:\> get-netroute -ifindex 30 ifIndex DestinationPrefix NextHop RouteMetric PolicyStore ------- ----------------- ------- ----------- ----------- 30 255.255.255.255/32 0.0.0.0 256 ActiveStore 30 224.0.0.0/4 0.0.0.0 256 ActiveStore 30 172.17.34.255/32 0.0.0.0 256 ActiveStore 30 172.17.34.14/32 0.0.0.0 256 ActiveStore 30 172.17.34.0/24 0.0.0.0 256 ActiveStorePS C:\> get-netroute -ifindex 30 ifIndex DestinationPrefix NextHop RouteMetric PolicyStore ------- ----------------- ------- ----------- ----------- 30 255.255.255.255/32 0.0.0.0 256 ActiveStore 30 224.0.0.0/4 0.0.0.0 256 ActiveStore 30 172.17.34.255/32 0.0.0.0 256 ActiveStore 30 172.17.34.14/32 0.0.0.0 256 ActiveStore 30 172.17.34.0/24 0.0.0.0 256 ActiveStorecode block. -ifindex execution Information- If you delete the existing Default Gateway, the terminal session will be disconnected.
- Step 2 After completing the task, if the task status changes to Completed, you can reconnect to the terminal.
- In the following example, replace
Connect to the server targeted for IP change using its NAT IP and verify the communication status.
- Use the following command to verify whether any pre‑change configuration remains and whether the change was applied correctly. If you can connect successfully to the target server whose IP was changed, the new IP is communicating properly.Color mode
PS C:\> netstat –nr | findstr Default PS C:\> Get-NetIPAddress | Format-TablePS C:\> netstat –nr | findstr Default PS C:\> Get-NetIPAddress | Format-TableCode block. Communication status check ReferenceThe NAT IP does not change.
- Use the following command to verify whether any pre‑change configuration remains and whether the change was applied correctly. If you can connect successfully to the target server whose IP was changed, the new IP is communicating properly.
Check the VLAN information of the existing IP in the Team information.
Color modePS C:\> Get-NetLbfoTeam Name : bond_bond-srv Members : {ens6f1, ens3f1} TeamNics : {bond-srv, bond-srv.9} TeamingMode : SwitchIndependent LoadBalancingAlgorithm : Dynamic Status : Up Name : bond_bond-iscsi Members : {ens6f0, ens3f0} TeamNics : bond-iscsi TeamingMode : SwitchIndependent LoadBalancingAlgorithm : Dynamic Status : Up Name : bond_bond-backup Members : {ens2f0, ens4f1} TeamNics : bond-backup TeamingMode : SwitchIndependent LoadBalancingAlgorithm : Dynamic Status : Up PS C:\> Get-NetAdapter Name InterfaceDescription ifIndex Status MacAddress LinkSpeed ---- -------------------- ------- ------ ---------- --------- bond-srv.9 Microsoft Network Adapter Multiple...#4 30 Up 40-A6-B7-27-96-D5 50 Gbps bond-srv Microsoft Network Adapter Multiple...#3 19 Up 40-A6-B7-27-96-D5 50 Gbps bond-iscsi Microsoft Network Adapter Multiple...#2 18 Up 40-A6-B7-27-96-D4 50 Gbps bond-backup Microsoft Network Adapter Multiplexo... 22 Up 68-05-CA-C9-EB-88 20 Gbps eno2 Intel(R) Ethernet Connection X722 fo... 12 Disabled 38-68-DD-36-A0-59 1 Gbps ens3f0 Intel(R) Ethernet Network Adapter XX... 11 Up 40-A6-B7-27-96-D4 25 Gbps ………………………………………….. omittedPS C:\> Get-NetLbfoTeam Name : bond_bond-srv Members : {ens6f1, ens3f1} TeamNics : {bond-srv, bond-srv.9} TeamingMode : SwitchIndependent LoadBalancingAlgorithm : Dynamic Status : Up Name : bond_bond-iscsi Members : {ens6f0, ens3f0} TeamNics : bond-iscsi TeamingMode : SwitchIndependent LoadBalancingAlgorithm : Dynamic Status : Up Name : bond_bond-backup Members : {ens2f0, ens4f1} TeamNics : bond-backup TeamingMode : SwitchIndependent LoadBalancingAlgorithm : Dynamic Status : Up PS C:\> Get-NetAdapter Name InterfaceDescription ifIndex Status MacAddress LinkSpeed ---- -------------------- ------- ------ ---------- --------- bond-srv.9 Microsoft Network Adapter Multiple...#4 30 Up 40-A6-B7-27-96-D5 50 Gbps bond-srv Microsoft Network Adapter Multiple...#3 19 Up 40-A6-B7-27-96-D5 50 Gbps bond-iscsi Microsoft Network Adapter Multiple...#2 18 Up 40-A6-B7-27-96-D4 50 Gbps bond-backup Microsoft Network Adapter Multiplexo... 22 Up 68-05-CA-C9-EB-88 20 Gbps eno2 Intel(R) Ethernet Connection X722 fo... 12 Disabled 38-68-DD-36-A0-59 1 Gbps ens3f0 Intel(R) Ethernet Network Adapter XX... 11 Up 40-A6-B7-27-96-D4 25 Gbps ………………………………………….. omittedcode block. Run Get-NetLbfoTeam Delete the Vlan information of the existing IP from the server.
- In the following example, replace
30with the ID you verified.Color modePS C:\> Remove-NetLbfoTeamNic -Team bond_bond-srv -VlanID 30 -Confirm:$false PS C:\> Get-NetAdapter Name InterfaceDescription ifIndex Status MacAddress LinkSpeed ---- -------------------- ------- ------ ---------- --------- bond-srv Microsoft Network Adapter Multiple...#3 19 Up 40-A6-B7-27-96-D5 50 Gbps bond-iscsi Microsoft Network Adapter Multiple...#2 18 Up 40-A6-B7-27-96-D4 50 Gbps bond-backup Microsoft Network Adapter Multiplexo... 22 Up 68-05-CA-C9-EB-88 20 Gbps eno2 Intel(R) Ethernet Connection X722 fo... 12 Disabled 38-68-DD-36-A0-59 1 Gbps ens3f0 Intel(R) Ethernet Network Adapter XX... 11 Up 40-A6-B7-27-96-D4 25 Gbps ………………………………………….. omittedPS C:\> Remove-NetLbfoTeamNic -Team bond_bond-srv -VlanID 30 -Confirm:$false PS C:\> Get-NetAdapter Name InterfaceDescription ifIndex Status MacAddress LinkSpeed ---- -------------------- ------- ------ ---------- --------- bond-srv Microsoft Network Adapter Multiple...#3 19 Up 40-A6-B7-27-96-D5 50 Gbps bond-iscsi Microsoft Network Adapter Multiple...#2 18 Up 40-A6-B7-27-96-D4 50 Gbps bond-backup Microsoft Network Adapter Multiplexo... 22 Up 68-05-CA-C9-EB-88 20 Gbps eno2 Intel(R) Ethernet Connection X722 fo... 12 Disabled 38-68-DD-36-A0-59 1 Gbps ens3f0 Intel(R) Ethernet Network Adapter XX... 11 Up 40-A6-B7-27-96-D4 25 Gbps ………………………………………….. omittedcode block. Run Get-NetLbfoTeam
- In the following example, replace
After all tasks are completed, restart the server and then perform a final check.
ReferenceIt is recommended to perform the final check after restarting the server.If there are no issues in the final inspection results, select the Step 3 work completion checkbox in the IP Change popup window.
Configuring RHEL Repository
The Samsung Cloud Platform Console provides the SCP RHEL Repository to support user environments where external access is restricted, such as VPC Private Subnets.
You can use the RHEL Repository to install and download the same packages as the official RHEL Repository.
To configure the RHEL Repository, please refer to RHEL Repository Configuration Guide.
- If a user created RHEL before August 2025 through the Samsung Cloud Platform Console, they must modify the RHEL Repository settings.
- Since the SCP RHEL Repository synchronizes with each region’s local repository according to an internal schedule, it is recommended to switch to an external public mirror site to apply the latest patches quickly.
- Samsung Cloud Platform provides the latest repository for the specified major version.
4.2.1 - Install ServiceWatch Agent
Users can install the ServiceWatch Agent on a Bare Metal Server to collect custom metrics and logs.
ServiceWatch Agent
On a Bare Metal Server, the agents that need to be installed to collect ServiceWatch’s custom metrics and logs can be divided into two main types. It is a Prometheus Exporter and Open Telemetry Collector.
| Category | Detailed description | |
|---|---|---|
| Prometheus Exporter | Provide metrics of a specific application or service in a format that Prometheus can scrape
| |
| Open Telemetry Collector | Acts as a centralized collector that gathers telemetry data such as metrics and logs from distributed systems, processes (filtering, sampling, etc.) it, and exports it to multiple backends (e.g., Prometheus, Jaeger, Elasticsearch, etc.)
|
Pre-configuration for Using ServiceWatch Agent
To use the ServiceWatch Agent, please refer to Prerequisite Settings for ServiceWatch Agent and prepare the prerequisite settings.
4.2.2 - Setting up RHEL Repo and WKMS
The Samsung Cloud Platform Console provides the SCP RHEL Repository to support user environments where external access is restricted, such as VPC Private Subnet. You can use the SCP RHEL Repository to install and download the same packages as the official RHEL Repository.
- If a user created RHEL before August 2025 through the Samsung Cloud Platform Console, they must modify the RHEL Repository settings.
- Since the SCP RHEL Repository synchronizes with each Region Local Repository according to an internal schedule, it is recommended to switch to an external public mirror site to apply the latest patches quickly.
- Samsung Cloud Platform provides the latest repository for the specified major version.
RHEL Repository Configuration Guide
When using RHEL, you can configure the SCP RHEL Repository to install and download the same packages as the official RHEL Repository. To set up the RHEL Repository, follow these steps.
On the Virtual Server, as the OS root user, use the cat command to check the
/etc/yum.repos.d/scp.rhel8.repoor/etc/yum.repos.d/scp.rhel9.repoconfiguration.Color modecat /etc/yum.repos.d/scp.rhel8.repocat /etc/yum.repos.d/scp.rhel8.repoCode block. Verify repo configuration (RHEL8) Color modecat /etc/yum.repos.d/scp.rhel9.repocat /etc/yum.repos.d/scp.rhel9.repoCode block. Verify repo configuration (RHEL9) - When checking the configuration file, the following result is displayed.Color mode
[rhel-8-baseos] name=rhel-8-baseos gpgcheck=0 enabled=1 baseurl=http://scp-rhel8-ip/rhel/8/baseos [rhel-8-baseos-debug] name=rhel-8-baseos-debug gpgcheck=0 enabled=1 baseurl=http://scp-rhel8-ip/rhel/8/baseos-debug [rhel-8-appstream] name=rhel-8-appstream gpgcheck=0 enabled=1 baseurl=http://scp-rhel8-ip/rhel/8/appstream[rhel-8-baseos] name=rhel-8-baseos gpgcheck=0 enabled=1 baseurl=http://scp-rhel8-ip/rhel/8/baseos [rhel-8-baseos-debug] name=rhel-8-baseos-debug gpgcheck=0 enabled=1 baseurl=http://scp-rhel8-ip/rhel/8/baseos-debug [rhel-8-appstream] name=rhel-8-appstream gpgcheck=0 enabled=1 baseurl=http://scp-rhel8-ip/rhel/8/appstreamCode block. Check repo configuration (RHEL8) Color mode[rhel-9-for-x86_64-baseos-rpms] name=rhel-9-for-x86_64-baseos-rpms baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/baseos gpgcheck=0 enabled=1 [rhel-9-for-x86_64-appstream-rpms] name=rhel-9-for-x86_64-appstream-rpms baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/appstream gpgcheck=0 enabled=1 [codeready-builder-for-rhel-9-x86_64-rpms] name=codeready-builder-for-rhel-9-x86_64-rpms baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/codeready-builder gpgcheck=0 enabled=1 [rhel-9-for-x86_64-highavailability-rpms] name=rhel-9-for-x86_64-highavailability-rpms baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/ha gpgcheck=0 enabled=1 [rhel-9-for-x86_64-supplementary-rpms] name=rhel-9-for-x86_64-supplementary-rpms baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/supplementary gpgcheck=0 enabled=1[rhel-9-for-x86_64-baseos-rpms] name=rhel-9-for-x86_64-baseos-rpms baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/baseos gpgcheck=0 enabled=1 [rhel-9-for-x86_64-appstream-rpms] name=rhel-9-for-x86_64-appstream-rpms baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/appstream gpgcheck=0 enabled=1 [codeready-builder-for-rhel-9-x86_64-rpms] name=codeready-builder-for-rhel-9-x86_64-rpms baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/codeready-builder gpgcheck=0 enabled=1 [rhel-9-for-x86_64-highavailability-rpms] name=rhel-9-for-x86_64-highavailability-rpms baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/ha gpgcheck=0 enabled=1 [rhel-9-for-x86_64-supplementary-rpms] name=rhel-9-for-x86_64-supplementary-rpms baseurl=http://scp-rhel9-ip/rhel/$releasever/x86_64/supplementary gpgcheck=0 enabled=1Code block. Verify repo configuration (RHEL9)
- When checking the configuration file, the following result is displayed.
Use a text editor (e.g., vim) to open the
/etc/hostsfile.Modify the
/etc/hostsfile with the following content and save it.Color mode198.19.2.13 scp-rhel8-ip scp-rhel9-ip scp-rhel-ip198.19.2.13 scp-rhel8-ip scp-rhel9-ip scp-rhel-ipcode block. Change /etc/hosts file settings Use the yum command to verify the RHEL Repository connection configured on the server.
Color modeyum repolist –vyum repolist –vCode block. Verify repository connection settings - If the RHEL repository is successfully connected, you can view the repository list.Color mode
Repo-id : rhel-8-appstream Repo-name : rhel-8-appstream Repo-revision : 1718903734 Repo-updated : Fri 21 Jun 2024 02:15:34 AM KST Repo-pkgs : 38,260 Repo-available-pkgs: 25,799 Repo-size : 122 G Repo-baseurl : http://scp-rhel8-ip/rhel/8/appstream Repo-expire : 172,800 second(s) (last: Thu 08 Aug 2024 07:27:57 AM KST) Repo-filename : /etc/yum.repos.d/scp.rhel8.repo Repo-id : rhel-8-baseos Repo-name : rhel-8-baseos Repo-revision : 1718029433 Repo-updated : Mon 10 Jun 2024 11:23:52 PM KST Repo-pkgs : 17,487 Repo-available-pkgs: 17,487 Repo-size : 32 G Repo-baseurl : http://scp-rhel8-ip/rhel/8/baseos Repo-expire : 172,800 second(s) (last: Thu 08 Aug 2024 07:27:57 AM KST) Repo-filename : /etc/yum.repos.d/scp.rhel8.repo Repo-id : rhel-8-baseos-debug Repo-name : rhel-8-baseos-debug Repo-revision : 1717662461 Repo-updated : Thu 06 Jun 2024 05:27:41 PM KST Repo-pkgs : 17,078 Repo-available-pkgs: 17,078 Repo-size : 100 G Repo-baseurl : http://scp-rhel8-ip/rhel/8/baseos-debug Repo-expire : 172,800 second(s) (last: Thu 08 Aug 2024 07:27:57 AM KST) Repo-filename : /etc/yum.repos.d/scp.rhel8.repoRepo-id : rhel-8-appstream Repo-name : rhel-8-appstream Repo-revision : 1718903734 Repo-updated : Fri 21 Jun 2024 02:15:34 AM KST Repo-pkgs : 38,260 Repo-available-pkgs: 25,799 Repo-size : 122 G Repo-baseurl : http://scp-rhel8-ip/rhel/8/appstream Repo-expire : 172,800 second(s) (last: Thu 08 Aug 2024 07:27:57 AM KST) Repo-filename : /etc/yum.repos.d/scp.rhel8.repo Repo-id : rhel-8-baseos Repo-name : rhel-8-baseos Repo-revision : 1718029433 Repo-updated : Mon 10 Jun 2024 11:23:52 PM KST Repo-pkgs : 17,487 Repo-available-pkgs: 17,487 Repo-size : 32 G Repo-baseurl : http://scp-rhel8-ip/rhel/8/baseos Repo-expire : 172,800 second(s) (last: Thu 08 Aug 2024 07:27:57 AM KST) Repo-filename : /etc/yum.repos.d/scp.rhel8.repo Repo-id : rhel-8-baseos-debug Repo-name : rhel-8-baseos-debug Repo-revision : 1717662461 Repo-updated : Thu 06 Jun 2024 05:27:41 PM KST Repo-pkgs : 17,078 Repo-available-pkgs: 17,078 Repo-size : 100 G Repo-baseurl : http://scp-rhel8-ip/rhel/8/baseos-debug Repo-expire : 172,800 second(s) (last: Thu 08 Aug 2024 07:27:57 AM KST) Repo-filename : /etc/yum.repos.d/scp.rhel8.repoCode block. Check repository list
- If the RHEL repository is successfully connected, you can view the repository list.
Windows Key Management Service Configuration Guide
When using Windows Server on Samsung Cloud Platform, you can authenticate genuine licenses by utilizing the Key Management Service provided by Samsung Cloud Platform. To authenticate genuine copies using the Key Management Service, follow these steps.
Right-click the Windows Start icon, then run cmd from Windows PowerShell (Administrator) or the Windows Run menu.
Run the following command in Windows PowerShell(Administrator) or cmd to register the KMS Server.
Color modeslmgr /skms 198.19.2.23:1688slmgr /skms 198.19.2.23:1688code block. WKMS configuration Run the KMS Server registration command, verify the notification popup indicating successful registration, and then click the OK button.
Run the following command in Windows PowerShell(Administrator) or cmd to activate Windows.
Color modeslmgr /atoslmgr /atoCode block. Windows Server activation settings After confirming the notification popup indicating that activation was successful, click the OK button.
In Windows PowerShell(Administrator) or cmd, run the following command to verify that the product is properly activated.
Color modeslmgr /dlvslmgr /dlvCode block. Verify Windows Server activation After confirming the notification popup indicating that product activation was successful, click the OK button.
4.3 - API Reference
4.4 - CLI Reference
4.5 - Release Note
Bare Metal Server
- The 4th‑generation BM based on the Intel 6th‑generation (Granite Rapids) Processor has been released.
- Bare Metal Server s4 and h4 server types have been added.
- Detailed information about the s4/h4 server type can be found in s3/h3 server type.
- Provides local disk partition functionality
- Now you can create and use up to 10 Local disk partitions.
- You can terminate multiple resources simultaneously from the Bare Metal Server list.
- You can change the IP of a regular Subnet.
- OS Image has been added.
- RHEL 8.10, Ubuntu 24.04
- Add Bare Metal Server feature
- Distribute servers belonging to the same Placement group across different racks.
- Additional OS images provided (RHEL 9.4, Rocky Linux 8.6, Rocky Linux 9.4)
- Add a 3rd‑generation (s3/h3) server type based on Intel 4th‑generation (Sapphire Rapids) Processor. For more information, refer to Bare Metal Server 서버 타입.
- Samsung Cloud Platform Common Feature Changes
- Account, IAM, Service Home, tags, and other common CX changes have been applied.
- We have officially launched the Bare Metal Server service.
- We have launched a Bare Metal Server service that lets customers use a physical server without virtualization, exclusively for their own use.
5 - Multi-node GPU Cluster
5.1 - Overview
Service Overview
Multi-node GPU Cluster is a service that provides physical GPU servers without virtualization for large-scale high-performance AI computation. You can use two or more Bare Metal Servers equipped with GPUs to cluster multiple GPUs, and conveniently operate GPU servers in conjunction with Samsung Cloud Platform’s high‑performance storage and networking services.
Provided Features
The Multi-node GPU Cluster provides the following features.
- Auto Provisioning and Management: Through the web-based Console, you can easily provision servers of the standard GPU Bare Metal model equipped with 8 GPUs and manage resources and costs.
- Network Connection: You can cluster multiple GPUs on two or more Bare Metal Servers via high‑speed interconnects, and by configuring a GPU Direct RDMA (Remote Direct Memory Access) environment, you can directly process data I/O between GPU memories, enabling high‑speed AI/Machine Learning computation.
- Storage Connection: Provides various additional attached storage besides the OS disk. * High-performance SSD NAS File Storage, Block Storage, and Object Storage directly integrated with a high-speed network can also be used together.
- Network Configuration Management: The server’s subnet/IP can be easily changed from the values set at initial creation. * NAT IP provides a management feature that allows you to enable or disable it as needed.
- Monitoring: You can view monitoring information for computing resources such as CPU, GPU, Memory, and Disk through Cloud Monitoring. * To use the Cloud Monitoring service of a Multi-node GPU Cluster, you need to install the Agent. * Please install the Agent to ensure stable service. * For more details, please refer to Multi-node GPU Cluster Monitoring Metrics.
- Terraform Provision: Provides an IaC environment via Terraform.
Component
Multi-node GPU Cluster provides GPUs as a Bare Metal Sever type with standard images and server types. NVSwitch and NVLink are provided.
Specifications by GPU Type
GPU (Graphic Processing Unit) is specialized for parallel operations that process large amounts of data quickly, enabling large-scale parallel computation in fields such as artificial intelligence (AI) and data analysis.
The following are the specifications of GPU types offered by the Multi-node GPU Cluster service.
| Category | H100 Type | B300 Type |
|---|---|---|
| GPU Architecture | NVIDIA Hopper | NVIDIA Blackwell Ultra |
| GPU Memory | 80 GiB | 268 GiB |
| GPU Transistors | 80 billion 4N TSMC | 208 billion 4NP TSMC |
| FP16 Tensor Core (Dense) | 989 TFLOPs | 2.25 PFLOPs |
| FP8 Tensor Core (Dense) | 1979 TFLOPs | 4.5 PFLOPs |
| FP4 Tensor Core (Dense) | Not supported | 13.5 PFLOPs |
| GPU Memory Bandwidth | 3,352 GB/s HBM3 | 8 TB/s HBM3e |
| NVLink performance | NVLink 4 | NVLink 5 |
| NVLink Signaling Rate | 25 GB/s (x18) | 50 GB/s (x18) |
| NVSwitch GPU-to-GPU bandwidth | 900 GB/s | 1.8 TB/s |
| Total NVSwitch aggregate bandwidth | 7.2 TB/s | 14.4 TB/s |
OS and GPU driver version
The operating systems (OS) supported by the Multi-node GPU Cluster are as follows.
| OS | OS version | GPU driver version |
|---|---|---|
| Ubuntu | 22.04 | 535.86.10, 535.183.06 |
| Ubuntu | 24.04 | 580.105.08 |
Server type
The format of server types provided by the Multi-node GPU Cluster is as follows.
- Example: when the server type is g2c96h8_metal
| Category | example | Detailed description |
|---|---|---|
| Server generation | g2 | Provided server generation
|
| CPU | c96 | Number of cores
|
| GPU | h8 | GPU type and quantity
|
Preceding Service
This is a list of services that must be pre-configured before creating the service. Please refer to the guide provided for each service and prepare in advance.
| Service Category | service | Detailed description |
|---|---|---|
| Networking | VPC | A service that provides an isolated virtual network in a cloud environment |
5.1.1 - Server type
Multi-node GPU Cluster server type
Multi-node GPU Cluster is categorized based on the GPU Type it provides, and the GPU used in a Multi-node GPU Cluster is determined by the server type selected when creating a GPU Node. Select the server type based on the specifications of the application you want to run on a multi-node GPU cluster.
The server types supported by the Multi-node GPU Cluster are as follows.
- Example: when the server type is g2c96h8_metal.
Category example Detailed description Server generation g2 Provided server generation - g2
- g means GPU server specification
- 2 means generation
CPU c96 Number of cores - c96: Allocated cores are physical cores
GPU h8 GPU type and quantity - h8: h means GPU type, and 8 means GPU quantity
Table. Multi-node GPU Cluster server type format - g2
g2 server type
The g2 server type is a GPU Bare Metal Server that uses NVIDIA H100 SXM GPUs, suitable for large-scale high-performance AI computation.
- 8 NVIDIA Hopper Architecture-based H100 GPUs provided
- Provides 1,979 TFLOPS FP8 Tensor Core performance per GPU, 989 TFLOPS FP16 Tensor Core performance.
- Supports up to 96 vCPUs and 2,048 GB of memory
- Supports up to 1,600 Gb/s NVIDIA InfiniBand RDMA network.
- Service network up to 100 Gbps
- 900 GB/s GPU P2P communication via NVSwitch within a node
| Server type | GPU | GPU Memory | CPU(Core) | Memory | Disk | GPU P2P |
|---|---|---|---|---|---|---|
| g2c96h8_metal | H100 | 640 GiB | 96 vCore | 2 TB | SSD (OS) 960 GB * 2, NVMeSSD 3.84 TB * 4 | 900 GB/s NVSwitch |
g3 server type
The g3 server type is a GPU Bare Metal Server that uses NVIDIA B300 SXM GPUs, suitable not only for large-scale high-performance AI computation but also for LLM inference and AI deployment for generative AI.
- 8 NVIDIA Blackwell Ultra Architecture-based B300 GPUs provided
- Provides 13.5 PFLOPS FP4 Tensor Core and 4.5 PFLOPS FP8 Tensor Core performance per GPU.
- Supports up to 128 vCPUs and 4,096 GB of memory
- Supports up to 6,400 Gb/s NVIDIA InfiniBand RDMA network
- Service network up to 100 Gbps
- 1.8 TB/s GPU P2P communication via NVSwitch within a node
| Server type | GPU | GPU Memory | CPU(Core) | Memory | Disk | GPU P2P |
|---|---|---|---|---|---|---|
| g3c128b8_metal | B300 | 2.1 TiB | 128 vCore | 4 TB | SSD (OS) 960 GB * 2, NVMeSSD 3.84 TB * 4 | 1.8 TB/s NVSwitch |
5.1.2 - Monitoring Metrics
According to Samsung Cloud Platform’s policy, the Cloud Monitoring service is scheduled to be discontinued in September 2026.
Accordingly, from after the September 2026 release, resource monitoring of the Samsung Cloud Platform via Cloud Monitoring will no longer be possible.
With a new alternative service, you can continuously perform resource monitoring by leveraging ServiceWatch released in October 2025.
ServiceWatch provides more modern and powerful features, replacing Cloud Monitoring to deliver a seamless monitoring environment.
If you are collecting metrics and logs through the Cloud Monitoring Agent, you need to switch to the ServiceWatch Agent.
For more details about ServiceWatch, please refer to ServiceWatch Overview.
Detailed information about ServiceWatch Agent: please refer to the ServiceWatch Agent
Multi-node GPU Cluster Monitoring Metrics
The table below shows the monitoring metrics of a Multi-node GPU Cluster that can be viewed through Cloud Monitoring.
Multi-node GPU Cluster [Cluster]
| Performance items | Detailed description | unit |
|---|---|---|
| Memory Total [Basic] | bytes of usable memory | bytes |
| Memory Used [Basic] | Current memory usage in bytes | bytes |
| Memory Swap In [Basic] | bytes of the replaced memory | bytes |
| Memory Swap Out [Basic] | bytes of the replaced memory | bytes |
| Memory Free [Basic] | bytes of unused memory | bytes |
| Disk Read Bytes [Basic] | Read bytes | bytes |
| Disk Read Requests [Basic] | Number of read requests | cnt |
| Disk Write Bytes [Basic] | write bytes | bytes |
| Disk Write Requests [Basic] | Number of write requests | cnt |
| CPU Usage [Basic] | Average system CPU usage over 1 minute | % |
| Instance State [Basic] | Instance status | state |
| Network In Bytes [Basic] | Received bytes | bytes |
| Network In Dropped [Basic] | Incoming packet drop | cnt |
| Network In Packets [Basic] | Number of received packets | cnt |
| Network Out Bytes [Basic] | sent bytes | bytes |
| Network Out Dropped [Basic] | Transmit packet drop | cnt |
| Network Out Packets [Basic] | Number of transmitted packets | cnt |
| Performance items | Detailed description | unit |
|---|---|---|
| Cluster GPU Count | GPU Count SUM in Cluster
| cnt |
| Cluster GPU Count In Use | Number of GPUs being used by jobs within the cluster
| cnt |
| Cluster GPU Usage | GPU Utilization AVG within the cluster
| % |
| Cluster GPU Memory Usage [Avg] | Cluster GPU Memory Utilization AVG
| % |
Multi-node GPU Cluster [Node]
| Performance items | Detailed description | unit |
|---|---|---|
| Memory Total [Basic] | bytes of usable memory | bytes |
| Memory Used [Basic] | Current memory usage in bytes | bytes |
| Memory Swap In [Basic] | bytes of the replaced memory | bytes |
| Memory Swap Out [Basic] | bytes of the replaced memory | bytes |
| Memory Free [Basic] | bytes of unused memory | bytes |
| Disk Read Bytes [Basic] | Read bytes | bytes |
| Disk Read Requests [Basic] | Number of read requests | cnt |
| Disk Write Bytes [Basic] | write bytes | bytes |
| Disk Write Requests [Basic] | Number of write requests | cnt |
| CPU Usage [Basic] | Average system CPU usage over 1 minute | % |
| Instance State [Basic] | Instance status | state |
| Network In Bytes [Basic] | Received bytes | bytes |
| Network In Dropped [Basic] | Incoming packet drop | cnt |
| Network In Packets [Basic] | Number of received packets | cnt |
| Network Out Bytes [Basic] | sent bytes | bytes |
| Network Out Dropped [Basic] | Transmit packet drop | cnt |
| Network Out Packets [Basic] | Number of transmitted packets | cnt |
| Performance items | Detailed description | unit |
|---|---|---|
| GPU Count | Number of GPUs | cnt |
| GPU Temperature | GPU temperature | ℃ |
| GPU Usage | utilization | % |
| GPU Usage [Avg] | Overall average GPU utilization (%) | % |
| GPU Power Cap | Maximum power capacity of the GPU | W |
| GPU Power Usage | Current GPU power usage | W |
| GPU Memory Usage [Avg] | GPU Memory Uti. AVG | % |
| GPU Count in use | Number of GPUs in use by jobs on the node | cnt |
| Execution Status for nvidia-smi | Result of running the nvidia-smi command | status |
| Core Usage [IO Wait] | Ratio of CPU time spent in wait state (disk wait) | % |
| Core Usage [System] | Proportion of CPU time spent in kernel space | % |
| Core Usage [User] | Proportion of CPU time spent in user space | % |
| CPU Cores | The number of CPU cores on the host. The maximum value of the unnormalized ratio is 100%* of a core. The unnormalized ratio already incorporates this value, and its maximum is 100%* of a core. | cnt |
| CPU Usage [Active] | Percentage of CPU time used excluding Idle and IOWait states (when all 4 cores are used at 100%: 400%) | % |
| CPU Usage [Idle] | It is the proportion of CPU time spent in idle state. | % |
| CPU Usage [IO Wait] | This is the proportion of CPU time spent in a waiting state (disk wait). | % |
| CPU Usage [System] | Percentage of CPU time used by the kernel (when all 4 cores are used at 100%: 400%) | % |
| CPU Usage [User] | Percentage of CPU time used in user space. (If all 4 cores are used at 100%, it is 400%) | % |
| CPU Usage/Core [Active] | Percentage of CPU time used excluding Idle and IOWait states (value normalized by the number of cores; 100% when all four cores are fully utilized) | % |
| CPU Usage/Core [Idle] | It is the proportion of CPU time spent in idle state. | % |
| CPU Usage/Core [IO Wait] | This is the proportion of CPU time spent in a waiting state (disk wait). | % |
| CPU Usage/Core [System] | Percentage of CPU time used by the kernel (value normalized by the number of cores; 100% when all 4 cores are fully utilized) | % |
| CPU Usage/Core [User] | Percentage of CPU time used in user space. (Value normalized by the number of cores; 100% when all 4 cores are fully utilized) | % |
| Disk CPU Usage [IO Request] | It is the proportion of CPU time during which I/O requests for the device were executed (device bandwidth utilization). If this value approaches 100%, the device becomes saturated. | % |
| Disk Queue Size [Avg] | The average queue length of requests executed for the device. | num |
| Disk Read Bytes | The number of bytes read per second from the device. | bytes |
| Disk Read Bytes [Delta Avg] | Average of system.diskio.read.bytes_delta for individual disks | bytes |
| Disk Read Bytes [Delta Max] | Maximum system.diskio.read.bytes_delta of individual disks | bytes |
| Disk Read Bytes [Delta Min] | Minimum system.diskio.read.bytes_delta of individual disks | bytes |
| Disk Read Bytes [Delta Sum] | Sum of the system.diskio.read.bytes_delta of individual disks | bytes |
| Disk Read Bytes [Delta] | Delta of the system.diskio.read.bytes value for each disk | bytes |
| Disk Read Bytes [Success] | Total number of bytes successfully read. On Linux, assuming a sector size of 512, it is the number of sectors read multiplied by 512. | bytes |
| Disk Read Requests | Number of read requests to the disk device per second | cnt |
| Disk Read Requests [Delta Avg] | Average of the system.diskio.read.count_delta for individual disks | cnt |
| Disk Read Requests [Delta Max] | Maximum system.diskio.read.count_delta for individual disks | cnt |
| Disk Read Requests [Delta Min] | Minimum of system.diskio.read.count_delta for individual disks | cnt |
| Disk Read Requests [Delta Sum] | Sum of system.diskio.read.count_delta of individual disks | cnt |
| Disk Read Requests [Success Delta] | Delta of system.diskio.read.count for each disk | cnt |
| Disk Read Requests [Success] | Total number of successful reads | cnt |
| Disk Request Size [Avg] | Average size of requests executed on the device (unit: sectors). | num |
| Disk Service Time [Avg] | Average service time (ms) of input requests executed on the device. | ms |
| Disk Wait Time [Avg] | Average time taken for requests executed on the supported device. | ms |
| Disk Wait Time [Read] | Average disk wait time | ms |
| Disk Wait Time [Write] | Average disk wait time | ms |
| Disk Write Bytes [Delta Avg] | Average of system.diskio.write.bytes_delta for each disk | bytes |
| Disk Write Bytes [Delta Max] | Maximum system.diskio.write.bytes_delta of individual disks | bytes |
| Disk Write Bytes [Delta Min] | Minimum of system.diskio.write.bytes_delta for individual disks | bytes |
| Disk Write Bytes [Delta Sum] | Sum of the system.diskio.write.bytes_delta of individual disks | bytes |
| Disk Write Bytes [Delta] | Delta of the system.diskio.write.bytes value for each disk | bytes |
| Disk Write Bytes [Success] | Total number of bytes successfully written. On Linux, assuming a sector size of 512, it is the number of sectors written multiplied by 512. | bytes |
| Disk Write Requests | Number of write requests to the disk device per second | cnt |
| Disk Write Requests [Delta Avg] | Average of system.diskio.write.count_delta for individual disks | cnt |
| Disk Write Requests [Delta Max] | Maximum system.diskio.write.count_delta for individual disks | cnt |
| Disk Write Requests [Delta Min] | Minimum of system.diskio.write.count_delta for individual disks | cnt |
| Disk Write Requests [Delta Sum] | Sum of the system.diskio.write.count_delta of individual disks | cnt |
| Disk Write Requests [Success Delta] | Delta of system.diskio.write.count for each disk | cnt |
| Disk Write Requests [Success] | Total number of successful writes | cnt |
| Disk Writes Bytes | It is the number of bytes per second written to the device. | bytes |
| Filesystem Hang Check | filesystem (local/NFS) hang check (normal:1, abnormal:0) | status |
| Filesystem Nodes | It is the total number of file nodes in the file system. | cnt |
| Filesystem Nodes [Free] | It is the total number of available file nodes in the file system. | cnt |
| Filesystem Size [Available] | Disk space (bytes) that unauthorized users can use. | bytes |
| Filesystem Size [Free] | Available disk space (bytes) | bytes |
| Filesystem Size [Total] | Total disk space (bytes) | bytes |
| Filesystem Usage | Used disk space percentage | % |
| Filesystem Usage [Avg] | Average of individual filesystem.used.pct | % |
| Filesystem Usage [Inode] | inode usage | % |
| Filesystem Usage [Max] | Maximum among individual filesystem.used.pct | % |
| Filesystem Usage [Min] | minimum of individual filesystem.used.pct | % |
| Filesystem Usage [Total] | - | % |
| Filesystem Used | Used disk space (bytes) | bytes |
| Filesystem Used [Inode] | inode usage | bytes |
| Memory Free | Total amount of available memory (bytes). Memory used by system cache and buffers is not included (see system.memory.actual.free). | bytes |
| Memory Free [Actual] | Actual usable memory (bytes). The calculation method varies by OS: on Linux, it is MemAvailable from /proc/meminfo, or if meminfo cannot be used, it is calculated from available memory plus cache and buffers. On OSX, it is the sum of usable memory and inactive memory. On Windows, it corresponds to a value such as system.memory.free. | bytes |
| Memory Free [Swap] | Available swap memory. | bytes |
| Memory Total | total memory | bytes |
| Memory Total [Swap] | Total swap memory. | bytes |
| Memory Usage | Percentage of used memory
| % |
| Memory Usage [Actual] | Percentage of memory actually used
| % |
| Memory Usage [Cache Swap] | Cached swap usage rate | % |
| Memory Usage [Swap] | Percentage of used swap memory | % |
| Memory Used | used memory | bytes |
| Memory Used [Actual] | Actual used memory (bytes). The value obtained by subtracting used memory from total memory. Available memory is calculated differently for each OS (see system.actual.free). | bytes |
| Memory Used [Swap] | Used swap memory. | bytes |
| Collisions | Network collision | cnt |
| Network In Bytes | Number of received bytes | bytes |
| Network In Bytes [Delta Avg] | Average of system.network.in.bytes_delta for each network | bytes |
| Network In Bytes [Delta Max] | Maximum of system.network.in.bytes_delta for each network | bytes |
| Network In Bytes [Delta Min] | Minimum system.network.in.bytes_delta for each network | bytes |
| Network In Bytes [Delta Sum] | Sum of system.network.in.bytes_delta for individual networks | bytes |
| Network In Bytes [Delta] | Delta of received byte count | bytes |
| Network In Dropped | Number of deleted packets among incoming packets | cnt |
| Network In Errors | Number of errors during reception | cnt |
| Network In Packets | Number of received packets | cnt |
| Network In Packets [Delta Avg] | Average of system.network.in.packets_delta for individual networks | cnt |
| Network In Packets [Delta Max] | Maximum of system.network.in.packets_delta for each network | cnt |
| Network In Packets [Delta Min] | Minimum of system.network.in.packets_delta for each network | cnt |
| Network In Packets [Delta Sum] | Sum of system.network.in.packets_delta for individual networks | cnt |
| Network In Packets [Delta] | Delta of received packet count | cnt |
| Network Out Bytes | Number of transmitted bytes | bytes |
| Network Out Bytes [Delta Avg] | Average of system.network.out.bytes_delta for each network | bytes |
| Network Out Bytes [Delta Max] | Maximum system.network.out.bytes_delta of individual networks | bytes |
| Network Out Bytes [Delta Min] | Minimum of system.network.out.bytes_delta for individual networks | bytes |
| Network Out Bytes [Delta Sum] | Sum of system.network.out.bytes_delta for individual networks | bytes |
| Network Out Bytes [Delta] | Delta of transmitted byte count | bytes |
| Network Out Dropped | Number of deleted packets among outgoing packets. This value is not reported by the operating system, so it is always 0 on Darwin and BSD. | cnt |
| Network Out Errors | Number of errors during transmission | cnt |
| Network Out Packets | Number of transmitted packets | cnt |
| Network Out Packets [Delta Avg] | Average of system.network.out.packets_delta for each network | cnt |
| Network Out Packets [Delta Max] | Maximum of system.network.out.packets_delta for each network | cnt |
| Network Out Packets [Delta Min] | Minimum of system.network.out.packets_delta for each network | cnt |
| Network Out Packets [Delta Sum] | Sum of system.network.out.packets_delta for individual networks | cnt |
| Network Out Packets [Delta] | Delta of transmitted packet count | cnt |
| Open Connections [TCP] | All open TCP connections | cnt |
| Open Connections [UDP] | All open UDP connections | cnt |
| Port Usage | Available port usage rate | % |
| SYN Sent Sockets | Number of sockets in SYN_SENT state (when connecting from local to remote) | cnt |
| Kernel PID Max | kernel.pid_max value | cnt |
| Kernel Thread Max | kernel.threads-max value | cnt |
| Process CPU Usage | Percentage of CPU time consumed by the process since the last update. This value is similar to the %CPU value shown for the process by the top command on Unix systems. | % |
| Process CPU Usage/Core | The percentage of CPU time used by the process since the last event. Normalized by the number of cores, with values ranging from 0 to 100%. | % |
| Process Memory Usage | Proportion of main memory (RAM) occupied by a process | % |
| Process Memory Used | Resident Set size. The amount of memory a process occupies in RAM. In Windows, the current working set size. | bytes |
| Process PID | process pid | PID |
| Process PPID | parent process PID | PID |
| Processes [Dead] | Number of dead processes | cnt |
| Processes [Idle] | Number of idle processes | cnt |
| Processes [Running] | Number of running processes | cnt |
| Processes [Sleeping] | Number of sleeping processes | cnt |
| Processes [Stopped] | stopped processes count | cnt |
| Processes [Total] | Total number of processes | cnt |
| Processes [Unknown] | Number of processes with an unknown or unsearchable status | cnt |
| Processes [Zombie] | Number of zombie processes | cnt |
| Running Process Usage | process usage | % |
| Running Processes | Number of running processes | cnt |
| Running Thread Usage | Thread usage rate | % |
| Running Threads | Total number of threads running in running processes | cnt |
| Instance Status | Instance status | state |
| Context Switches | context switch count (per second) | cnt |
| Load/Core [1 min] | The load over the last 1 minute divided by the number of cores | cnt |
| Load/Core [15 min] | The load over the last 15 minutes divided by the number of cores | cnt |
| Load/Core [5 min] | The load over the last 5 minutes divided by the number of cores | cnt |
| Multipaths [Active] | External storage connection path status = active count | cnt |
| Multipaths [Failed] | External storage connection path status = failed count | cnt |
| Multipaths [Faulty] | External storage connection path status = faulty count | cnt |
| NTP Offset | measured offset of the last sample (the time difference between the NTP server and the local environment) | num |
| Run Queue Length | Execution queue length | num |
| Uptime | OS uptime(uptime). (milliseconds) | ms |
| Context Switchies | CPU context switch count (per second) | cnt |
| Disk Read Bytes [Sec] | Number of bytes read from a Windows logical disk in 1 second | cnt |
| Disk Read Time [Avg] | Average data read time (seconds) | sec |
| Disk Transfer Time [Avg] | Disk average wait time | sec |
| Disk Usage | Disk usage | % |
| Disk Write Bytes [Sec] | Number of bytes written in one second on a Windows logical disk | cnt |
| Disk Write Time [Avg] | Average data write time (seconds) | sec |
| Pagingfile Usage | Paging file usage | % |
| Pool Used [Non Paged] | Nonpaged Pool usage in kernel memory | bytes |
| Pool Used [Paged] | Paged Pool usage in kernel memory | bytes |
| Process [Running] | Number of currently running processes | cnt |
| Threads [Running] | Number of currently running threads | cnt |
| Threads [Waiting] | Number of threads waiting for processor time | cnt |
5.2 - How-to guides
Users can create the service by entering the required information for the Multi-node GPU Cluster service and selecting detailed options through the Samsung Cloud Platform Console.
Multi-node GPU Cluster Getting Started
You can create and use a Multi-node GPU Cluster service in the Samsung Cloud Platform Console.
This service consists of a GPU Node and a Cluster Fabric service.
Create GPU Node
Multi-node GPU Cluster To create a Multi-node GPU Cluster, follow the steps below.
- All Services > Compute > Multi-node GPU Cluster menu, click it. Navigate to the Service Home page of Multi-node GPU Cluster.
- On the Service Home page, click the Create GPU Node button. You will be taken to the Create GPU Node page.
- On the GPU Node Creation page, enter the information required to create the service and select detailed options.
- Select the required information in the Image and Version Selection area.
Category required or notDetailed description image Required Select the type of image provided - Ubuntu
Image version Required Select version of the chosen image - Provide a list of versions for the supplied server image
Table. GPU Node image and version selection options - In the Service Information Input area, enter or select the required information.
Category RequiredDetailed description Number of servers Required Number of GPU Node servers to create simultaneously - Only numeric input is allowed, and the minimum number of servers to create is 2.
- Only during the initial setup can you create 2 or more, and subsequent expansions can be done one at a time.
Service Type > Server Type Required GPU Node server type - Select the desired CPU, Memory, GPU, and Disk specifications
- For detailed information about the server types provided by GPU Node, refer to Multi-node GPU Cluster Server Type
Service Type > Planned Compute Required Status of resources with Planned Compute configured - In Use: Number of resources with Planned Compute that are currently in use
- Configured: Number of resources with Planned Compute configured
- Coverage Preview: Amount applied per resource by Planned Compute
- Apply for Planned Compute Service: Navigate to the Planned Compute service application page
- For more details, refer to Apply for Planned Compute
Table. GPU Node Service Information Input Items - In the Required Information Input area, enter or select the necessary information.
Category RequiredDetailed description Administrator account Required Set the administrator account and password to be used when connecting to the server - Ubuntu OS is provided with root fixed
Server name Prefix Required Enter a Prefix to distinguish each GPU Node generated when the selected number of servers is 2 or more - Automatically generated in the form of user input value (prefix) + ‘
-###’
- Must start with a lowercase English letter and be entered using lowercase letters, numbers, and special characters (
-) within 3 to 11 characters
- Must not end with a special character (
-)
Network Settings Required Set the network where the GPU Node will be installed - VPC name: select a pre‑created VPC
- General Subnet name: select a pre‑created general Subnet
- IP can be auto‑generated or manually entered; if manual input is chosen, the user enters the IP directly
- NAT: usable only when there is a single server and the VPC is attached to an Internet Gateway. Check to use, and you can select a NAT IP (initially it can be created only with two or more servers, so modify it on the resource detail page)
- NAT IP: select a NAT IP
- If no NAT IP is available, click the Create new button to generate a Public IP
- Click the Refresh button to view and select the created Public IP
- Creating a Public IP incurs charges according to the Public IP pricing
Table. Required input fields for GPU Node - In the Cluster Selection area, create or select a Cluster Fabric.
Category Required statusDetailed description Cluster Fabric Required Configure a group of GPU Node servers that can apply GPU Direct RDMA together - Optimal GPU performance and speed can be achieved only within the same Cluster Fabric
- When creating a new Cluster Fabric, *New Input > select Node pool, then enter the name of the Cluster Fabric to create
- To add to an existing Cluster Fabric, Existing Input > select Node pool, then select the previously created Cluster Fabric
Table. GPU Node Cluster Fabric selection items - Additional Information Input area, enter or select the required information.
Category Required statusDetailed description Lock Selection Using a lock prevents accidental actions that could terminate, start, or stop the server. Init Script Selection Script to run when the server starts - The Init Script must be selected differently depending on the image type
- For Linux: Choose Shell Script or cloud-init
tag Selection Add Tag - Up to 50 can be added per resource
- After clicking the Add Tag button, enter or select Key, Value values
Table. GPU Node additional information input fields - The Init Script must be selected differently depending on the image type
- Select the required information in the Image and Version Selection area.
- Summary Check the detailed information and estimated billing amount generated in the panel, and click the Create button.
- When creation is complete, check the created resources on the GPU Node List page.
- When creating a service, the GPU MIG/ECC settings are reset. However, to ensure the correct settings are applied, perform an initial reboot, verify that the settings have taken effect, and then proceed.
- For detailed information on resetting GPU MIG/ECC settings, refer to the GPU MIG/ECC Reset Settings Checklist Guide.
Check GPU Node details
The Multi-node GPU Cluster service allows you to view and edit the full resource list and detailed information of GPU nodes.
GPU Node Details page consists of Details, Tags, Job History tabs.
To view detailed information about the GPU Node, follow these steps.
All Services > Compute > Multi-node GPU Cluster > GPU Node menu, click it. Navigate to the Service Home page of Multi-node GPU Cluster.
On the Service Home page, click the GPU Node menu. You will be taken to the GPU Node List page.
- Resource items beyond the required columns can be added via the Settings button.
Category Required statusDetailed description Resource ID Selection User-created GPU Node ID Cluster Fabric name Required User-created Cluster Fabric name Server name Required User-created GPU Node name Server type Required Server type of the GPU Node - Users can view the number of cores, memory capacity, and GPU type and quantity of the resources they created
image Required User-created GPU Node image version IP Required IP of the GPU node created by the user status Required Status of the user-created GPU node Creation date and time Selection GPU Node creation timestamp Table. GPU Node resource list items
- Resource items beyond the required columns can be added via the Settings button.
On the GPU Node List page, click the resource for which you want to view detailed information. You will be taken to the GPU Node Details page.
- GPU Server Details At the top of the page, status information and descriptions of additional features are displayed.
Category Detailed description GPU Node status Status of user-created GPU Node - Creating: server is being created
- Running:: creation completed and available
- Editing:: IP is being changed
- Unknown: error state
- Starting: server is starting
- Stopping: server is stopping
- Stopped: server stopped
- Terminating: terminating
- Terminated: termination completed
Server control Button to change server status - Start: Start a stopped server
- Stop: Stop a running server
Service termination Cancel service button Table. GPU Node status information and additional features
- GPU Server Details At the top of the page, status information and descriptions of additional features are displayed.
Detailed Information
On the GPU Node List page’s Details Tab, you can view the detailed information of the selected resource and edit the information if necessary.
| Category | Detailed description |
|---|---|
| service | Service name |
| Resource Type | Resource Type |
| SRN | Unique resource ID in Samsung Cloud Platform
|
| Resource name | Resource Name
|
| Resource ID | Unique resource ID in the service |
| constructor | User who created the service |
| Creation date and time | Service creation date and time |
| editor | User who edited the service information |
| Modification date | Date and time the service information was modified |
| Server name | Server name |
| Node pool | A collection of nodes that can be grouped into the same Cluster Fabric |
| Cluster Fabric name | User-created Cluster Fabric name |
| Image/Version | Server OS image and version |
| Server type | CPU, memory, GPU, information display |
| Planned Compute | Resource status with Planned Compute configured
|
| Lock | Display whether Lock is enabled or disabled
|
| Network | GPU Node network information
|
| Block Storage | Block Storage information attached to the server
|
| Init Script | View the Init Script content entered when creating the server |
tag
On the GPU Node List page’s Tag Tab, you can view the selected resource’s tag information, and add, modify, or delete it.
| Category | Detailed description |
|---|---|
| Tag list | Tag list
|
Job History
On the GPU Node List page’s Job History Tab, you can view the job history of the selected resource.
| Category | Detailed description |
|---|---|
| Task History List | Resource Change History
|
Control GPU Node Operation
If you need server control and management functions for the created GPU Node resources, you can perform tasks on the GPU Node List or GPU Node Details page. You can start and stop the resources of a running GPU node.
Getting Started with GPU Node
You can start a GPU Node that is stopped (Stopped). To start a GPU Node, follow the steps below.
- Click the All Services > Compute > Multi-node GPU Cluster menu. Go to the Service Home page of Multi-node GPU Cluster.
- On the Service Home page, click the GPU Node menu. You will be taken to the GPU Node List page.
- On the GPU Node List page, after selecting individual or multiple servers with the checkboxes, you can Start using the More button at the top.
- On the GPU Node List page, click the resource. Navigate to the GPU Node Detail page.
- On the GPU Node Details page, click the Start button at the top to start the server.
- Check the server status and complete the status change.
Stopping GPU Node
You can stop a GPU Node that is running (Active). To stop a GPU Node, follow the steps below.
- Click the All Services > Compute > Multi-node GPU Cluster menu. Go to the Service Home page of Multi-node GPU Cluster.
- On the Service Home page, click the GPU Node menu. You will be taken to the GPU Node List page.
- GPU Node List page allows you to control individual or multiple servers by selecting the checkboxes and using the Stop button at the top.
- On the GPU Node List page, click the resource. You will be taken to the GPU Node Details page.
- On the GPU Node Details page, click the Stop button at the top to stop the server.
- Check the server status and complete the status change.
Terminate GPU Node
You can terminate unused GPU Nodes to reduce operating costs. However, terminating a service may cause the running service to stop immediately, so you should carefully consider the impact of service interruption before proceeding with termination.
To cancel a GPU Node, follow the steps below.
- Click the All Services > Compute > Multi-node GPU Server menu. Navigate to the Service Home page of the Multi-node GPU Cluster.
- On the Service Home page, click the Cluster Fabric menu. You will be taken to the Cluster Fabric list page.
- Cluster Fabric List page, select the resources to terminate, and click the Service Termination button.
- Resources that use the same Cluster Fabric can be terminated simultaneously.
- When termination is complete, check on the GPU Node List page whether the resources have been terminated.
The cases where a GPU Node cannot be terminated are as follows.
- When Block Storage(BM) is connected: Please disconnect the Block Storage(BM) connection first.
- If File Storage is connected: Please disconnect the File Storage first.
- When Lock is set: Please change the Lock setting to disabled and try again.
- If the selection includes a server that cannot be terminated simultaneously: Please re-select only resources that can be terminated.
- If the server you want to terminate has a different Cluster Fabric: Select only resources that use the same Cluster Fabric.
5.2.1 - Manage Cluster Fabric
Cluster Fabric is a service that helps manage the servers (GPU Node) included in a GPU Cluster. By using Cluster Fabric, you can move servers between GPU Clusters in the same Node pool and optimize GPU performance and speed within the same GPU Cluster.
Creating Cluster Fabric
Cluster Fabric can be created together with a GPU Node, and it cannot be created or deleted separately. If all GPU Nodes within a Cluster Fabric are terminated, the Cluster Fabric is automatically deleted.
If you have not created a GPU Node, please create a GPU Node first. For more information, see GPU Node 생성하기.
Check Cluster Fabric details
- Cluster Fabric can be created together when a GPU node is created, and it cannot be created or deleted independently.
- If all GPU nodes in the Cluster Fabric are terminated, the Cluster Fabric is automatically deleted.
- If you have not created a GPU Node, please create a GPU Node first. For more details, refer to GPU Node 생성하기.
On the Cluster Fabric List page and the Cluster Fabric Details page, you can view the generated Cluster Fabric list and details and move the server.
Click the All Services > Compute > Multi-node GPU Server menu. Go to the Service Home page of the Multi-node GPU Cluster.
On the Service Home page, click the Cluster Fabric menu. You will be taken to the Cluster Fabric List page.
- On the Cluster Fabric List page, you can view the resource list of GPU clusters created by the user.
- Resource items beyond the required columns can be added via the Settings button.
Category RequiredDetailed description Resource ID Selection User-created Cluster Fabric ID Cluster Fabric name Required User-created Cluster Fabric name Node pool Selection A collection of nodes that can be grouped into the same Cluster Fabric Number of servers Selection Number of GPU Nodes Server type Selection Server type of GPU Node - Users can view the number of cores, memory capacity, and GPU type and count of the resources they created
status Selection Status of the user-created Cluster Fabric Creation date and time Select Cluster Fabric creation timestamp Table. Cluster Fabric resource list items
On the Cluster Fabric List page, click the resource to view detailed information. You will be taken to the Cluster Fabric Details page.
- Cluster Fabric Details At the top of the page, status information and descriptions of additional features are displayed.
Category Detailed description Cluster Fabric status Status of the user-created Cluster Fabric - Creating: State while the cluster is being created
- Active: State when creation is complete and the cluster is usable
- Editing: State while the IP is being changed
- Deleting: State while being terminated
- Deleted: State after termination is complete
Add target server A feature that allows moving a server from another cluster to the target cluster. Table. Cluster Fabric status information and additional features
- Cluster Fabric Details At the top of the page, status information and descriptions of additional features are displayed.
Detailed Information
On the Cluster Fabric List page’s Details Tab, you can view detailed information of the selected resource and retrieve servers from another cluster.
| Category | Detailed description |
|---|---|
| service | Service name |
| Resource Type | Resource Type |
| SRN | Unique resource ID in Samsung Cloud Platform
|
| Resource name | Resource name
|
| Resource ID | Unique resource ID in the service |
| constructor | User who created the service |
| Creation date and time | Service creation date and time |
| editor | User who edited the service information |
| Modification date | Date and time the service information was modified |
| Cluster Fabric name | User-created Cluster Fabric name |
| Node pool | A set of nodes that can be grouped into the same Cluster Fabric |
| target server | GPU Node list bound to Cluster Fabric
|
Import Cluster Fabric Server
Cluster Fabric Details page’s add target server feature allows you to import servers from another cluster and add them to the selected cluster.
- Click the All Services > Compute > Multi-node GPU Server menu. Navigate to the Service Home page of the Multi-node GPU Cluster.
- On the Service Home page, click the Cluster Fabric menu. You will be taken to the Cluster Fabric list page.
- On the Cluster Fabric List page, click the resource to view detailed information. You will be taken to the Cluster Fabric Details page.
- In the target server of the Details tab, click the Add button on the right.
- The add target server popup window opens.
- Select a cluster from Cluster Fabric.
- GPU nodes associated with the selected cluster are listed; select the GPU node you want to retrieve.
- The selected GPU Node’s name is displayed at the bottom.
- Press the Confirm button to complete.
- Pressing the Cancel button cancels the operation.
- Verify that the GPU node added on the target server is displayed.
- The add target server popup window opens.
Terminate Cluster Fabric
If all GPU Nodes in the Cluster Fabric are terminated, the Cluster Fabric is automatically deleted. For more information, see Terminate GPU Node.
5.2.2 - Install ServiceWatch Agent
Users can install the ServiceWatch Agent on GPU nodes of a Multi-node GPU Cluster to collect custom metrics and logs.
ServiceWatch Agent
In a Multi-node GPU Cluster, the agents that need to be installed on GPU nodes to collect ServiceWatch custom metrics and logs can be divided into two main types. It is a Prometheus Exporter and Open Telemetry Collector.
| Category | Detailed description | |
|---|---|---|
| Prometheus Exporter | Provide metrics of a specific application or service in a format that Prometheus can scrape
| |
| Open Telemetry Collector | Acts as a centralized collector that gathers telemetry data such as metrics and logs from distributed systems, processes (filtering, sampling, etc.) it, and exports it to multiple backends (e.g., Prometheus, Jaeger, Elasticsearch, etc.)
|
If you have configured a Kubernetes Engine on a GPU node, please view the GPU metrics using the metrics provided by the Kubernetes Engine.
- If you install the DCGM Exporter on a GPU node where Kubernetes Engine is configured, it may not operate correctly.
Pre-configuration for Using ServiceWatch Agent
To use the ServiceWatch Agent, please refer to Prerequisite Settings for ServiceWatch Agent and prepare the prerequisite settings.
5.2.3 - Multi-node GPU Cluster Service Scope and Inspection Guide
Multi-node GPU Cluster Service Scope
If an IaaS hardware-level issue occurs with the Multi-node GPU Cluster service, you can receive technical support through Contact Us in the Support Center. However, the risks associated with changes such as OS kernel updates or application installations are the user’s responsibility, so technical support is limited; please be mindful when performing tasks such as system updates.
IaaS hardware level issue
- HW fault event messages generated within the server by the IPMI hardware monitoring console.
- GPU HW operation error observed in the nvdia-smi command
- HW error messages that occur during inspection of InfiniBand HCA cards or InfiniBand Switches
IaaS HW Inspection Guide
After applying for the Multi-node GPU Cluster service, it is recommended to check the IaaS HW level according to the inspection guide.
Intel E810 driver update
Check the version of the Intel E810 driver and, if necessary, refer to the following procedure to perform an update.
- Server manufacturer Intel E810 driver minimum recommended version: 1.15.4 or later
- Driver download: Intel Network Adapter Driver for 800 Series Devices under Linux
You can use the lspci command to verify whether an E810 NIC device is present.
If the E810 NIC is valid, the PCIe device is identified as follows using the E810-C information (if there is no E810 NIC device, this operation is not performed).
<div class="code-block-buttons">
<button class="code-block-download">
<div class="code-block-download-icon">
<svg width="14" height="16" viewBox="0 0 14 16" fill="none" xmlns="http://www.w3.org/2000/svg"><path d="M13.999 14.4353v-2.998C13.999 11.0232 13.6602 10.6853 13.2461 10.6853S12.5 11.0232 12.5 11.4373v2.2519H1.5V11.4373C1.5 11.0232 1.16211 10.6853.748047 10.6853.333984 10.6853 976563e-9 11.0232 976562e-9 11.4373v2.998C976562e-9 14.8494.333984 15.1892.748047 15.1892H13.2461c.414099999999999.0.7529-.3398.7529-.7539z" fill="#5135ff"/><path d="M1.41169 6.21654c.25876-.32345.73073-.37589 1.05417-.11713l3.78463 3.0277V1.56104c0-.41422.33578-.750005.75-.750005.41421.0.75.335785.75.750005V9.12208L11.5288 6.09941C11.8523 5.84065 12.3242 5.89309 12.583 6.21654 12.8418 6.53999 12.7893 7.01196 12.4659 7.27071L6.99734 11.6455 1.52882 7.27071c-.32345-.25875-.37589-.73072-.11713-1.05417z" fill="#5135ff"/></svg></div>
Sample Code Download
</button>
<button class="code-block-copy">
<div class="code-block-copy-icon">
<svg width="14" height="14" viewBox="0 0 14 14" fill="none" xmlns="http://www.w3.org/2000/svg"><path fill-rule="evenodd" clip-rule="evenodd" d="M10 4.00012V1c0-.552285-.44772-1-1-1H1C.447715.0.0.447715.0 1V9c0 .55228.447715 1 1 1H3.99988v3.0001C3.99988 13.5515 4.44849 14 4.99988 14H10.0002l3.9997-3.9995V5c0-.55139-.448600000000001-.99988-1-.99988H10zM1.4 1.4V8.6H3.99988V5c0-.55139.44861-.99988 1-.99988H8.6V1.4H1.4zM5.3999 12.6H9.08295V9.78301C9.08295 9.39641 9.39635 9.08301 9.78295 9.08301H12.5999V5.40015h-7.2V12.6zm6.1375-2.117-1.0545 1.0544V10.483h1.0545z" fill="currentcolor"/></svg></div>
Copy Code
</button>
</div>
lspci | grep E810
0000:6a:00.0 Ethernet controller: Intel Corporation Ethernet Controller E810-C for QSFP (rev 02) 0000:6a:00.1 Ethernet controller: Intel Corporation Ethernet Controller E810-C for QSFP (rev 02)lspci | grep E810
0000:6a:00.0 Ethernet controller: Intel Corporation Ethernet Controller E810-C for QSFP (rev 02) 0000:6a:00.1 Ethernet controller: Intel Corporation Ethernet Controller E810-C for QSFP (rev 02)</div>
To update the driver, follow the steps below.
Move the base driver tar file to the desired directory.
Color mode/usr/local/src/usr/local/srcDirectory navigation example Untar / unzip the Archiver file.
- x.x.x is the version number of the driver tar file.Color mode
tar zxf ice-x.x.x.tar.gztar zxf ice-x.x.x.tar.gzuntar example
- x.x.x is the version number of the driver tar file.
Change the driver to the src directory.
- x.x.x is the version number of the driver tar file.Color mode
cd ice-x.x.x/src/cd ice-x.x.x/src/Example of changing to the src directory
- x.x.x is the version number of the driver tar file.
Compile the driver module.
Color modemake installmake installDriver module compilation example After the update is complete, check the version.
Color modelsmod | grep ice modinfo ice | grep versionlsmod | grep ice modinfo ice | grep versionVersion check example
Check NVIDIA driver
To check the NVIDIA driver (nvidia-smi topo, IB nv_peer_mem status) and inspect the IaaS hardware level, follow these steps.
Check the GPU driver status.
Color mode~$ nvidia-smi~$ nvidia-smiExample code for checking GPU driver status Color modeThu Jan 29 14:48:31 2026 +---------------------------------------------------------------------------------------+ | NVIDIA-SMI 535.183.06 Driver Version: 535.183.06 CUDA Version: 12.2 | |-----------------------------------------+----------------------+----------------------+ | GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. | | | | MIG M. | | =========================================+======================+====================== | | 0 NVIDIA A100-SXM4-80GB On | 00000000:00:05.0 Off | On | | N/A 36C P0 52W / 400W | 0MiB / 81920MiB | N/A Default | | | | Enabled | +-----------------------------------------+----------------------+----------------------+ | 1 NVIDIA A100-SXM4-80GB On | 00000000:00:06.0 Off | 0 | | N/A 36C P0 61W / 400W | 0MiB / 81920MiB | 0% Default | | | | Disabled | +-----------------------------------------+----------------------+----------------------+ | 2 NVIDIA A100-SXM4-80GB On | 00000000:00:07.0 Off | 0 | | N/A 36C P0 64W / 400W | 0MiB / 81920MiB | 0% Default | | | | Disabled | +-----------------------------------------+----------------------+----------------------+ | 3 NVIDIA A100-SXM4-80GB On | 00000000:00:08.0 Off | 0 | | N/A 40C P0 64W / 400W | 0MiB / 81920MiB | 0% Default | | | | Disabled | +-----------------------------------------+----------------------+----------------------+ | 4 NVIDIA A100-SXM4-80GB On | 00000000:00:09.0 Off | 0 | | N/A 36C P0 63W / 400W | 0MiB / 81920MiB | 0% Default | | | | Disabled | +-----------------------------------------+----------------------+----------------------+ | 5 NVIDIA A100-SXM4-80GB On | 00000000:00:0A.0 Off | 0 | | N/A 40C P0 64W / 400W | 0MiB / 81920MiB | 0% Default | | | | Disabled | +-----------------------------------------+----------------------+----------------------+ | 6 NVIDIA A100-SXM4-80GB On | 00000000:00:0B.0 Off | 0 | | N/A 39C P0 65W / 400W | 0MiB / 81920MiB | 0% Default | | | | Disabled | +-----------------------------------------+----------------------+----------------------+ | 7 NVIDIA A100-SXM4-80GB On | 00000000:00:0C.0 Off | 0 | | N/A 39C P0 60W / 400W | 0MiB / 81920MiB | 0% Default | | | | Disabled | +-----------------------------------------+----------------------+----------------------+ +---------------------------------------------------------------------------------------+ | MIG devices: | +------------------+--------------------------------+-----------+-----------------------+ | GPU GI CI MIG | Memory-Usage | Vol | Shared | | ID ID Dev | BAR1-Usage | SM Unc | CE ENC DEC OFA JPG | | | | ECC | | | ==================+================================+===========+======================= | | No MIG devices found | +---------------------------------------------------------------------------------------+ +---------------------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | | ======================================================================================= | | No running processes found | +---------------------------------------------------------------------------------------+Thu Jan 29 14:48:31 2026 +---------------------------------------------------------------------------------------+ | NVIDIA-SMI 535.183.06 Driver Version: 535.183.06 CUDA Version: 12.2 | |-----------------------------------------+----------------------+----------------------+ | GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. | | | | MIG M. | | =========================================+======================+====================== | | 0 NVIDIA A100-SXM4-80GB On | 00000000:00:05.0 Off | On | | N/A 36C P0 52W / 400W | 0MiB / 81920MiB | N/A Default | | | | Enabled | +-----------------------------------------+----------------------+----------------------+ | 1 NVIDIA A100-SXM4-80GB On | 00000000:00:06.0 Off | 0 | | N/A 36C P0 61W / 400W | 0MiB / 81920MiB | 0% Default | | | | Disabled | +-----------------------------------------+----------------------+----------------------+ | 2 NVIDIA A100-SXM4-80GB On | 00000000:00:07.0 Off | 0 | | N/A 36C P0 64W / 400W | 0MiB / 81920MiB | 0% Default | | | | Disabled | +-----------------------------------------+----------------------+----------------------+ | 3 NVIDIA A100-SXM4-80GB On | 00000000:00:08.0 Off | 0 | | N/A 40C P0 64W / 400W | 0MiB / 81920MiB | 0% Default | | | | Disabled | +-----------------------------------------+----------------------+----------------------+ | 4 NVIDIA A100-SXM4-80GB On | 00000000:00:09.0 Off | 0 | | N/A 36C P0 63W / 400W | 0MiB / 81920MiB | 0% Default | | | | Disabled | +-----------------------------------------+----------------------+----------------------+ | 5 NVIDIA A100-SXM4-80GB On | 00000000:00:0A.0 Off | 0 | | N/A 40C P0 64W / 400W | 0MiB / 81920MiB | 0% Default | | | | Disabled | +-----------------------------------------+----------------------+----------------------+ | 6 NVIDIA A100-SXM4-80GB On | 00000000:00:0B.0 Off | 0 | | N/A 39C P0 65W / 400W | 0MiB / 81920MiB | 0% Default | | | | Disabled | +-----------------------------------------+----------------------+----------------------+ | 7 NVIDIA A100-SXM4-80GB On | 00000000:00:0C.0 Off | 0 | | N/A 39C P0 60W / 400W | 0MiB / 81920MiB | 0% Default | | | | Disabled | +-----------------------------------------+----------------------+----------------------+ +---------------------------------------------------------------------------------------+ | MIG devices: | +------------------+--------------------------------+-----------+-----------------------+ | GPU GI CI MIG | Memory-Usage | Vol | Shared | | ID ID Dev | BAR1-Usage | SM Unc | CE ENC DEC OFA JPG | | | | ECC | | | ==================+================================+===========+======================= | | No MIG devices found | +---------------------------------------------------------------------------------------+ +---------------------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | | ======================================================================================= | | No running processes found | +---------------------------------------------------------------------------------------+GPU driver status example Check the NVSwitch and NVLink hardware status.
Check NVSwitch status
Color mode~$ nvidia-smi nvlink --status~$ nvidia-smi nvlink --statusNVSwitch status check example Color modeGPU 1: NVIDIA A100-SXM4-80GB (UUID: GPU-64a2f685-bb12-c4af-105c-0726ece9c8d7) Link 0: 25 GB/s Link 1: 25 GB/s Link 2: 25 GB/s Link 3: 25 GB/s Link 4: 25 GB/s Link 5: 25 GB/s Link 6: 25 GB/s Link 7: 25 GB/s Link 8: 25 GB/s Link 9: 25 GB/s Link 10: 25 GB/s Link 11: 25 GB/s GPU 2: NVIDIA A100-SXM4-80GB (UUID: GPU-2269851b-71cd-f6c7-50c5-ba1525cf3ce8) Link 0: 25 GB/s Link 1: 25 GB/s Link 2: 25 GB/s Link 3: 25 GB/s Link 4: 25 GB/s Link 5: 25 GB/s Link 6: 25 GB/s Link 7: 25 GB/s Link 8: 25 GB/s Link 9: 25 GB/s Link 10: 25 GB/s Link 11: 25 GB/s GPU 3: NVIDIA A100-SXM4-80GB (UUID: GPU-4c397bbf-95fc-5c29-918a-a429cbe45a7a) Link 0: 25 GB/s Link 1: 25 GB/s Link 2: 25 GB/s Link 3: 25 GB/s Link 4: 25 GB/s Link 5: 25 GB/s Link 6: 25 GB/s Link 7: 25 GB/s Link 8: 25 GB/s Link 9: 25 GB/s Link 10: 25 GB/s Link 11: 25 GB/s GPU 4: NVIDIA A100-SXM4-80GB (UUID: GPU-0e350204-9fb6-2cbe-538e-8f7849658eb8) Link 0: 25 GB/s Link 1: 25 GB/s Link 2: 25 GB/s Link 3: 25 GB/s Link 4: 25 GB/s Link 5: 25 GB/s Link 6: 25 GB/s Link 7: 25 GB/s Link 8: 25 GB/s Link 9: 25 GB/s Link 10: 25 GB/s Link 11: 25 GB/s GPU 5: NVIDIA A100-SXM4-80GB (UUID: GPU-45f0c453-4760-edd4-3af9-25c5ea7473a5) Link 0: 25 GB/s Link 1: 25 GB/s Link 2: 25 GB/s Link 3: 25 GB/s Link 4: 25 GB/s Link 5: 25 GB/s Link 6: 25 GB/s Link 7: 25 GB/s Link 8: 25 GB/s Link 9: 25 GB/s Link 10: 25 GB/s Link 11: 25 GB/s GPU 6: NVIDIA A100-SXM4-80GB (UUID: GPU-38409794-bb34-430e-3c50-90b42cb2bb72) Link 0: 25 GB/s Link 1: 25 GB/s Link 2: 25 GB/s Link 3: 25 GB/s Link 4: 25 GB/s Link 5: 25 GB/s Link 6: 25 GB/s Link 7: 25 GB/s Link 8: 25 GB/s Link 9: 25 GB/s Link 10: 25 GB/s Link 11: 25 GB/s GPU 7: NVIDIA A100-SXM4-80GB (UUID: GPU-3fb478aa-801b-eb64-55c2-0ffc3f2ce404) Link 0: 25 GB/s Link 1: 25 GB/s Link 2: 25 GB/s Link 3: 25 GB/s Link 4: 25 GB/s Link 5: 25 GB/s Link 6: 25 GB/s Link 7: 25 GB/s Link 8: 25 GB/s Link 9: 25 GB/s Link 10: 25 GB/s Link 11: 25 GB/sGPU 1: NVIDIA A100-SXM4-80GB (UUID: GPU-64a2f685-bb12-c4af-105c-0726ece9c8d7) Link 0: 25 GB/s Link 1: 25 GB/s Link 2: 25 GB/s Link 3: 25 GB/s Link 4: 25 GB/s Link 5: 25 GB/s Link 6: 25 GB/s Link 7: 25 GB/s Link 8: 25 GB/s Link 9: 25 GB/s Link 10: 25 GB/s Link 11: 25 GB/s GPU 2: NVIDIA A100-SXM4-80GB (UUID: GPU-2269851b-71cd-f6c7-50c5-ba1525cf3ce8) Link 0: 25 GB/s Link 1: 25 GB/s Link 2: 25 GB/s Link 3: 25 GB/s Link 4: 25 GB/s Link 5: 25 GB/s Link 6: 25 GB/s Link 7: 25 GB/s Link 8: 25 GB/s Link 9: 25 GB/s Link 10: 25 GB/s Link 11: 25 GB/s GPU 3: NVIDIA A100-SXM4-80GB (UUID: GPU-4c397bbf-95fc-5c29-918a-a429cbe45a7a) Link 0: 25 GB/s Link 1: 25 GB/s Link 2: 25 GB/s Link 3: 25 GB/s Link 4: 25 GB/s Link 5: 25 GB/s Link 6: 25 GB/s Link 7: 25 GB/s Link 8: 25 GB/s Link 9: 25 GB/s Link 10: 25 GB/s Link 11: 25 GB/s GPU 4: NVIDIA A100-SXM4-80GB (UUID: GPU-0e350204-9fb6-2cbe-538e-8f7849658eb8) Link 0: 25 GB/s Link 1: 25 GB/s Link 2: 25 GB/s Link 3: 25 GB/s Link 4: 25 GB/s Link 5: 25 GB/s Link 6: 25 GB/s Link 7: 25 GB/s Link 8: 25 GB/s Link 9: 25 GB/s Link 10: 25 GB/s Link 11: 25 GB/s GPU 5: NVIDIA A100-SXM4-80GB (UUID: GPU-45f0c453-4760-edd4-3af9-25c5ea7473a5) Link 0: 25 GB/s Link 1: 25 GB/s Link 2: 25 GB/s Link 3: 25 GB/s Link 4: 25 GB/s Link 5: 25 GB/s Link 6: 25 GB/s Link 7: 25 GB/s Link 8: 25 GB/s Link 9: 25 GB/s Link 10: 25 GB/s Link 11: 25 GB/s GPU 6: NVIDIA A100-SXM4-80GB (UUID: GPU-38409794-bb34-430e-3c50-90b42cb2bb72) Link 0: 25 GB/s Link 1: 25 GB/s Link 2: 25 GB/s Link 3: 25 GB/s Link 4: 25 GB/s Link 5: 25 GB/s Link 6: 25 GB/s Link 7: 25 GB/s Link 8: 25 GB/s Link 9: 25 GB/s Link 10: 25 GB/s Link 11: 25 GB/s GPU 7: NVIDIA A100-SXM4-80GB (UUID: GPU-3fb478aa-801b-eb64-55c2-0ffc3f2ce404) Link 0: 25 GB/s Link 1: 25 GB/s Link 2: 25 GB/s Link 3: 25 GB/s Link 4: 25 GB/s Link 5: 25 GB/s Link 6: 25 GB/s Link 7: 25 GB/s Link 8: 25 GB/s Link 9: 25 GB/s Link 10: 25 GB/s Link 11: 25 GB/sNVSwitch status example Check NVLinks hardware status
Color mode~$ nvidia-smi topo -m~$ nvidia-smi topo -mExample code for checking NVLink hardware status Color modeGPU0 GPU1 GPU2 GPU3 GPU4 GPU5 GPU6 GPU7 CPU Affinity NUMA Affinity GPU NUMA ID GPU0 X NV12 NV12 NV12 NV12 NV12 NV12 NV12 0-127 0-7 N/A GPU1 NV12 X NV12 NV12 NV12 NV12 NV12 NV12 0-127 0-7 N/A GPU2 NV12 NV12 X NV12 NV12 NV12 NV12 NV12 0-127 0-7 N/A GPU3 NV12 NV12 NV12 X NV12 NV12 NV12 NV12 0-127 0-7 N/A GPU4 NV12 NV12 NV12 NV12 X NV12 NV12 NV12 0-127 0-7 N/A GPU5 NV12 NV12 NV12 NV12 NV12 X NV12 NV12 0-127 0-7 N/A GPU6 NV12 NV12 NV12 NV12 NV12 NV12 X NV12 0-127 0-7 N/A GPU7 NV12 NV12 NV12 NV12 NV12 NV12 NV12 X 0-127 0-7 N/A Legend: X = Self SYS = Connection traversing PCIe as well as the SMP interconnect between NUMA nodes (e.g., QPI/UPI) NODE = Connection traversing PCIe as well as the interconnect between PCIe Host Bridges within a NUMA node PHB = Connection traversing PCIe as well as a PCIe Host Bridge (typically the CPU) PXB = Connection traversing multiple PCIe bridges (without traversing the PCIe Host Bridge) PIX = Connection traversing at most a single PCIe bridge NV# = Connection traversing a bonded set of # NVLinksGPU0 GPU1 GPU2 GPU3 GPU4 GPU5 GPU6 GPU7 CPU Affinity NUMA Affinity GPU NUMA ID GPU0 X NV12 NV12 NV12 NV12 NV12 NV12 NV12 0-127 0-7 N/A GPU1 NV12 X NV12 NV12 NV12 NV12 NV12 NV12 0-127 0-7 N/A GPU2 NV12 NV12 X NV12 NV12 NV12 NV12 NV12 0-127 0-7 N/A GPU3 NV12 NV12 NV12 X NV12 NV12 NV12 NV12 0-127 0-7 N/A GPU4 NV12 NV12 NV12 NV12 X NV12 NV12 NV12 0-127 0-7 N/A GPU5 NV12 NV12 NV12 NV12 NV12 X NV12 NV12 0-127 0-7 N/A GPU6 NV12 NV12 NV12 NV12 NV12 NV12 X NV12 0-127 0-7 N/A GPU7 NV12 NV12 NV12 NV12 NV12 NV12 NV12 X 0-127 0-7 N/A Legend: X = Self SYS = Connection traversing PCIe as well as the SMP interconnect between NUMA nodes (e.g., QPI/UPI) NODE = Connection traversing PCIe as well as the interconnect between PCIe Host Bridges within a NUMA node PHB = Connection traversing PCIe as well as a PCIe Host Bridge (typically the CPU) PXB = Connection traversing multiple PCIe bridges (without traversing the PCIe Host Bridge) PIX = Connection traversing at most a single PCIe bridge NV# = Connection traversing a bonded set of # NVLinksNVLink HW status check code example
Check the InfiniBand (IB) HCA card hardware status and link.
Color modeuser@bm-dev-001:~$ ibdev2netdev -vuser@bm-dev-001:~$ ibdev2netdev -vHW status check command example Color modecat: /sys/class/infiniband/mlx5_0/device/vpd: Permission denied 0000:45:00.0 mlx5_0 (MT4123 - ) fw 20.29.1016 port 1 (ACTIVE) ==> ibs18 (Down) cat: /sys/class/infiniband/mlx5_1/device/vpd: Permission denied 0000:0e:00.0 mlx5_1 (MT4123 - ) fw 20.29.1016 port 1 (ACTIVE) ==> ibs17 (Down) cat: /sys/class/infiniband/mlx5_2/device/vpd: Permission denied 0000:c5:00.0 mlx5_2 (MT4123 - ) fw 20.29.1016 port 1 (ACTIVE) ==> ibs20 (Down) cat: /sys/class/infiniband/mlx5_3/device/vpd: Permission denied 0000:85:00.0 mlx5_3 (MT4123 - ) fw 20.29.1016 port 1 (ACTIVE) ==> ibs19 (Down) user@bm-dev-001:~$cat: /sys/class/infiniband/mlx5_0/device/vpd: Permission denied 0000:45:00.0 mlx5_0 (MT4123 - ) fw 20.29.1016 port 1 (ACTIVE) ==> ibs18 (Down) cat: /sys/class/infiniband/mlx5_1/device/vpd: Permission denied 0000:0e:00.0 mlx5_1 (MT4123 - ) fw 20.29.1016 port 1 (ACTIVE) ==> ibs17 (Down) cat: /sys/class/infiniband/mlx5_2/device/vpd: Permission denied 0000:c5:00.0 mlx5_2 (MT4123 - ) fw 20.29.1016 port 1 (ACTIVE) ==> ibs20 (Down) cat: /sys/class/infiniband/mlx5_3/device/vpd: Permission denied 0000:85:00.0 mlx5_3 (MT4123 - ) fw 20.29.1016 port 1 (ACTIVE) ==> ibs19 (Down) user@bm-dev-001:~$Example of HW status check result Color moderoot@bm-dev-001:~# ibstatroot@bm-dev-001:~# ibstatExample of link verification command Color modeCA 'mlx5_0' CA type: MT4123 Number of ports: 1 Firmware version: 20.29.1016 Hardware version: 0 Node GUID: 0x88e9a4ffff5060ac System image GUID: 0x88e9a4ffff5060ac Port 1: State: Active Physical state: LinkUp Rate: 200 Base lid: 8 LMC: 0 SM lid: 1 Capability mask: 0x2651e848 Port GUID: 0x88e9a4ffff5060ac Link layer: InfiniBand CA 'mlx5_1' CA type: MT4123 Number of ports: 1 Firmware version: 20.29.1016 Hardware version: 0 Node GUID: 0x88e9a4ffff504080 System image GUID: 0x88e9a4ffff504080 Port 1: State: Active Physical state: LinkUp Rate: 200 Base lid: 5 LMC: 0 SM lid: 1 Capability mask: 0x2651e848 Port GUID: 0x88e9a4ffff504080 Link layer: InfiniBand CA 'mlx5_2' CA type: MT4123 Number of ports: 1 Firmware version: 20.29.1016 Hardware version: 0 Node GUID: 0x88e9a4ffff505038 System image GUID: 0x88e9a4ffff505038 Port 1: State: Active Physical state: LinkUp Rate: 200 Base lid: 2 LMC: 0 SM lid: 1 Capability mask: 0x2651e848 Port GUID: 0x88e9a4ffff505038 Link layer: InfiniBand CA 'mlx5_3' CA type: MT4123 Number of ports: 1 Firmware version: 20.29.1016 Hardware version: 0 Node GUID: 0x88e9a4ffff504094 System image GUID: 0x88e9a4ffff504094 Port 1: State: Active Physical state: LinkUp Rate: 200 Base lid: 7 LMC: 0 SM lid: 1 Capability mask: 0x2651e848 Port GUID: 0x88e9a4ffff504094 Link layer: InfiniBandCA 'mlx5_0' CA type: MT4123 Number of ports: 1 Firmware version: 20.29.1016 Hardware version: 0 Node GUID: 0x88e9a4ffff5060ac System image GUID: 0x88e9a4ffff5060ac Port 1: State: Active Physical state: LinkUp Rate: 200 Base lid: 8 LMC: 0 SM lid: 1 Capability mask: 0x2651e848 Port GUID: 0x88e9a4ffff5060ac Link layer: InfiniBand CA 'mlx5_1' CA type: MT4123 Number of ports: 1 Firmware version: 20.29.1016 Hardware version: 0 Node GUID: 0x88e9a4ffff504080 System image GUID: 0x88e9a4ffff504080 Port 1: State: Active Physical state: LinkUp Rate: 200 Base lid: 5 LMC: 0 SM lid: 1 Capability mask: 0x2651e848 Port GUID: 0x88e9a4ffff504080 Link layer: InfiniBand CA 'mlx5_2' CA type: MT4123 Number of ports: 1 Firmware version: 20.29.1016 Hardware version: 0 Node GUID: 0x88e9a4ffff505038 System image GUID: 0x88e9a4ffff505038 Port 1: State: Active Physical state: LinkUp Rate: 200 Base lid: 2 LMC: 0 SM lid: 1 Capability mask: 0x2651e848 Port GUID: 0x88e9a4ffff505038 Link layer: InfiniBand CA 'mlx5_3' CA type: MT4123 Number of ports: 1 Firmware version: 20.29.1016 Hardware version: 0 Node GUID: 0x88e9a4ffff504094 System image GUID: 0x88e9a4ffff504094 Port 1: State: Active Physical state: LinkUp Rate: 200 Base lid: 7 LMC: 0 SM lid: 1 Capability mask: 0x2651e848 Port GUID: 0x88e9a4ffff504094 Link layer: InfiniBandLink verification result example
Check IB bandwidth communication
Check the IB bandwidth communication status (ib_send_bw) and follow the steps below to inspect the IaaS hardware level.
Check the name of the IB HCA interface.
- In the following example, IB ports:
mlx5_0,mlx5_4,mlx_5_5,mlx5_8Color mode~$ ibdev2netdev -v~$ ibdev2netdev -vExample of checking IB HCA interface name Color mode0000:1a:00.0 mlx5_0 (MT4129 - 1028SN ) Nvidia ConnectX-7 Single Port Infiniband NDR OSFP Adapter fw 28.38.1002 port 1 (ACTIVE) ==> ibp26s0 (Down) 0000:1b:00.0 mlx5_1 (MT4123 - 1028SN ) Mellanox ConnectX-6 Single Port VPI HDR QSFP Adapter fw 20.38.1002 port 1 (ACTIVE) ==> bond-nas (Up) 0000:3c:00.0 mlx5_2 (MT4129 - 1028SN ) Nvidia ConnectX-7 Single Port Infiniband NDR OSFP Adapter fw 28.38.1002 port 1 (DOWN ) ==> ibp60s0 (Down) 0000:4d:00.0 mlx5_3 (MT4129 - 1028SN ) Nvidia ConnectX-7 Single Port Infiniband NDR OSFP Adapter fw 28.38.1002 port 1 (DOWN ) ==> ibp77s0 (Down) 0000:5e:00.0 mlx5_4 (MT4129 - 1028SN ) Nvidia ConnectX-7 Single Port Infiniband NDR OSFP Adapter fw 28.38.1002 port 1 (ACTIVE) ==> ibp94s0 (Down) 0000:9c:00.0 mlx5_5 (MT4129 - 1028SN ) Nvidia ConnectX-7 Single Port Infiniband NDR OSFP Adapter fw 28.38.1002 port 1 (ACTIVE) ==> ibp156s0 (Down) 0000:9d:00.0 mlx5_6 (MT4123 - 1028SN ) Mellanox ConnectX-6 Single Port VPI HDR QSFP Adapter fw 20.38.1002 port 1 (ACTIVE) ==> bond-nas (Up) 0000:bc:00.0 mlx5_7 (MT4129 - 1028SN ) Nvidia ConnectX-7 Single Port Infiniband NDR OSFP Adapter fw 28.38.1002 port 1 (DOWN ) ==> ibp188s0 (Down) 0000:cc:00.0 mlx5_8 (MT4129 - 1028SN ) Nvidia ConnectX-7 Single Port Infiniband NDR OSFP Adapter fw 28.38.1002 port 1 (ACTIVE) ==> ibp204s0 (Down) 0000:dc:00.0 mlx5_9 (MT4129 - 1028SN ) Nvidia ConnectX-7 Single Port Infiniband NDR OSFP Adapter fw 28.38.1002 port 1 (DOWN ) ==> ibp220s0 (Down)0000:1a:00.0 mlx5_0 (MT4129 - 1028SN ) Nvidia ConnectX-7 Single Port Infiniband NDR OSFP Adapter fw 28.38.1002 port 1 (ACTIVE) ==> ibp26s0 (Down) 0000:1b:00.0 mlx5_1 (MT4123 - 1028SN ) Mellanox ConnectX-6 Single Port VPI HDR QSFP Adapter fw 20.38.1002 port 1 (ACTIVE) ==> bond-nas (Up) 0000:3c:00.0 mlx5_2 (MT4129 - 1028SN ) Nvidia ConnectX-7 Single Port Infiniband NDR OSFP Adapter fw 28.38.1002 port 1 (DOWN ) ==> ibp60s0 (Down) 0000:4d:00.0 mlx5_3 (MT4129 - 1028SN ) Nvidia ConnectX-7 Single Port Infiniband NDR OSFP Adapter fw 28.38.1002 port 1 (DOWN ) ==> ibp77s0 (Down) 0000:5e:00.0 mlx5_4 (MT4129 - 1028SN ) Nvidia ConnectX-7 Single Port Infiniband NDR OSFP Adapter fw 28.38.1002 port 1 (ACTIVE) ==> ibp94s0 (Down) 0000:9c:00.0 mlx5_5 (MT4129 - 1028SN ) Nvidia ConnectX-7 Single Port Infiniband NDR OSFP Adapter fw 28.38.1002 port 1 (ACTIVE) ==> ibp156s0 (Down) 0000:9d:00.0 mlx5_6 (MT4123 - 1028SN ) Mellanox ConnectX-6 Single Port VPI HDR QSFP Adapter fw 20.38.1002 port 1 (ACTIVE) ==> bond-nas (Up) 0000:bc:00.0 mlx5_7 (MT4129 - 1028SN ) Nvidia ConnectX-7 Single Port Infiniband NDR OSFP Adapter fw 28.38.1002 port 1 (DOWN ) ==> ibp188s0 (Down) 0000:cc:00.0 mlx5_8 (MT4129 - 1028SN ) Nvidia ConnectX-7 Single Port Infiniband NDR OSFP Adapter fw 28.38.1002 port 1 (ACTIVE) ==> ibp204s0 (Down) 0000:dc:00.0 mlx5_9 (MT4129 - 1028SN ) Nvidia ConnectX-7 Single Port Infiniband NDR OSFP Adapter fw 28.38.1002 port 1 (DOWN ) ==> ibp220s0 (Down)Example of IB HCA interface name verification result
- In the following example, IB ports:
Use the SERVER Side command to check the communication status.
Color mode~$ ib_send_bw -d mlx5_0 -i 1 –F~$ ib_send_bw -d mlx5_0 -i 1 –FSERVER Side command example Color mode************************************ * Waiting for client to connect... * * ************************************ --------------------------------------------------------------------------------------- Send BW Test Dual-port : OFF Device : mlx5_0 Number of qps : 1 Transport type : IB Connection type : RC Using SRQ : OFF PCIe relax order: ON ibv_wr* API : ON TX depth : 128 CQ Moderation : 100 Mtu : 4096[B] Link type : IB Max inline data : 0[B] rdma_cm QPs : OFF Data ex. method : Ethernet --------------------------------------------------------------------------------------- local address: LID 0x0f QPN 0x6d95 PSN 0xb974a remote address: LID 0x01 QPN 0x6dd2 PSN 0xc8a18c --------------------------------------------------------------------------------------- #bytes #iterations BW peak[MB/sec] BW average[MB/sec] MsgRate[Mpps] 65536 1000 0.00 19827.40 0.317238 ---------------------------------------------------------------------------------------************************************ * Waiting for client to connect... * * ************************************ --------------------------------------------------------------------------------------- Send BW Test Dual-port : OFF Device : mlx5_0 Number of qps : 1 Transport type : IB Connection type : RC Using SRQ : OFF PCIe relax order: ON ibv_wr* API : ON TX depth : 128 CQ Moderation : 100 Mtu : 4096[B] Link type : IB Max inline data : 0[B] rdma_cm QPs : OFF Data ex. method : Ethernet --------------------------------------------------------------------------------------- local address: LID 0x0f QPN 0x6d95 PSN 0xb974a remote address: LID 0x01 QPN 0x6dd2 PSN 0xc8a18c --------------------------------------------------------------------------------------- #bytes #iterations BW peak[MB/sec] BW average[MB/sec] MsgRate[Mpps] 65536 1000 0.00 19827.40 0.317238 ---------------------------------------------------------------------------------------Example of mutual communication status check result Use the CLIENT Side command to check the communication status.
Color mode~$ ib_send_bw -d mlx5_0 -i 1 -F <SERVER Side IP>~$ ib_send_bw -d mlx5_0 -i 1 -F <SERVER Side IP>CLIENT Side command example Color mode--------------------------------------------------------------------------------------- Send BW Test Dual-port : OFF Device : mlx5_0 Number of qps : 1 Transport type : IB Connection type : RC Using SRQ : OFF PCIe relax order: ON ibv_wr* API : ON RX depth : 512 CQ Moderation : 100 Mtu : 4096[B] Link type : IB Max inline data : 0[B] rdma_cm QPs : OFF Data ex. method : Ethernet --------------------------------------------------------------------------------------- local address: LID 0x01 QPN 0x6dd2 PSN 0xc8a18c remote address: LID 0x0f QPN 0x6d95 PSN 0xb974a --------------------------------------------------------------------------------------- #bytes #iterations BW peak[MB/sec] BW average[MB/sec] MsgRate[Mpps] 65536 1000 19008.49 19006.37 0.304102 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ Send BW Test Dual-port : OFF Device : mlx5_0 Number of qps : 1 Transport type : IB Connection type : RC Using SRQ : OFF PCIe relax order: ON ibv_wr* API : ON RX depth : 512 CQ Moderation : 100 Mtu : 4096[B] Link type : IB Max inline data : 0[B] rdma_cm QPs : OFF Data ex. method : Ethernet --------------------------------------------------------------------------------------- local address: LID 0x01 QPN 0x6dd2 PSN 0xc8a18c remote address: LID 0x0f QPN 0x6d95 PSN 0xb974a --------------------------------------------------------------------------------------- #bytes #iterations BW peak[MB/sec] BW average[MB/sec] MsgRate[Mpps] 65536 1000 19008.49 19006.37 0.304102 ---------------------------------------------------------------------------------------Example of mutual communication status check result
Check IB service related kernel modules
Check the relevant kernel modules for the IB service (lsmod) to inspect the IaaS hardware level.
Check IB service-related kernel module - nvidia_peermem
Color mode~$ lsmod | grep nvidia_peermem~$ lsmod | grep nvidia_peermemExample command to check nvidia_peermem Color modenvidia_peermem 16384 0 ib_core 425984 9 rdma_cm,ib_ipoib,nvidia_peermem,iw_cm,ib_umad,rdma_ucm,ib_uverbs,mlx5_ib,ib_cm nvidia 56524800 451 nvidia_uvm,nvidia_peermem,nvidia_modesetnvidia_peermem 16384 0 ib_core 425984 9 rdma_cm,ib_ipoib,nvidia_peermem,iw_cm,ib_umad,rdma_ucm,ib_uverbs,mlx5_ib,ib_cm nvidia 56524800 451 nvidia_uvm,nvidia_peermem,nvidia_modesetExample of nvidia_peermem check result Check IB service related kernel modules - IB Card (HCA) driver
Color mode~$ lsmod|egrep 'ib_|_ib|mlx'~$ lsmod|egrep 'ib_|_ib|mlx'Example command to check the IB Card (HCA) driver Color modemlx5_ib 393216 0 ib_uverbs 163840 2 irdma,mlx5_ib ib_core 393216 3 irdma,ib_uverbs,mlx5_ib mlx5_core 1593344 1 mlx5_ib mlxfw 32768 1 mlx5_core psample 20480 1 mlx5_core tls 114688 1 mlx5_core pci_hyperv_intf 16384 1 mlx5_core ib_ipoib 139264 0 ib_cm 131072 2 rdma_cm,ib_ipoib ib_umad 40960 0 mlx5_ib 454656 0 ib_uverbs 135168 2 rdma_ucm,mlx5_ib ib_core 434176 9 rdma_cm,ib_ipoib,nvidia_peermem,iw_cm,ib_umad,rdma_ucm,ib_uverbs,mlx5_ib,ib_cm libcrc32c 16384 5 nf_conntrack,nf_nat,btrfs,nf_tables,raid456 mlx5_core 2064384 1 mlx5_ib mlx_compat 69632 11 rdma_cm,ib_ipoib,mlxdevm,iw_cm,ib_umad,ib_core,rdma_ucm,ib_uverbs,mlx5_ib,ib_cm,mlx5_coremlx5_ib 393216 0 ib_uverbs 163840 2 irdma,mlx5_ib ib_core 393216 3 irdma,ib_uverbs,mlx5_ib mlx5_core 1593344 1 mlx5_ib mlxfw 32768 1 mlx5_core psample 20480 1 mlx5_core tls 114688 1 mlx5_core pci_hyperv_intf 16384 1 mlx5_core ib_ipoib 139264 0 ib_cm 131072 2 rdma_cm,ib_ipoib ib_umad 40960 0 mlx5_ib 454656 0 ib_uverbs 135168 2 rdma_ucm,mlx5_ib ib_core 434176 9 rdma_cm,ib_ipoib,nvidia_peermem,iw_cm,ib_umad,rdma_ucm,ib_uverbs,mlx5_ib,ib_cm libcrc32c 16384 5 nf_conntrack,nf_nat,btrfs,nf_tables,raid456 mlx5_core 2064384 1 mlx5_ib mlx_compat 69632 11 rdma_cm,ib_ipoib,mlxdevm,iw_cm,ib_umad,ib_core,rdma_ucm,ib_uverbs,mlx5_ib,ib_cm,mlx5_coreExample of IB Card (HCA) driver verification result
Check storage physical disk resources and Multi-Path
Verify the storage physical disk resources and Multi-Path to assess the IaaS hardware level.
Storage Physical Disk Resource Check Results
Color moderoot@bm-dev-002:/tmp# fdisk –lroot@bm-dev-002:/tmp# fdisk –lExample of storage physical disk resource check result Multi-Path verification result
Color moderoot@bm-dev-002:/tmp# multipath –llroot@bm-dev-002:/tmp# multipath –llExample of Multi-Path verification result
Check Service Network after new deployment of Multi-node GPU Cluster
Use the following command to verify that the MII Status of Bonding and Slave Interface is up.
Service Network check command
Color mode~$ cat /proc/net/bonding/bond-srv~$ cat /proc/net/bonding/bond-srvExample command to check Service Network Service Network check result
Color modeEthernet Channel Bonding Driver: v5.15.0-25-generic Bonding Mode: fault-tolerance (active-backup) Primary Slave: None Currently Active Slave: ens9f0 MII Status: up MII Polling Interval (ms): 100 Up Delay (ms): 0 Down Delay (ms): 0 Peer Notification Delay (ms): 0 Slave Interface: ens9f0 MII Status: up Speed: 100000 Mbps Duplex: full Link Failure Count: 0 Permanent HW addr: 30:3e:a7:02:35:70 Slave queue ID: 0 Slave Interface: ens11f0 MII Status: up Speed: 100000 Mbps Duplex: full Link Failure Count: 0 Permanent HW addr: 30:3e:a7:02:2f:e8 Slave queue ID: 0Ethernet Channel Bonding Driver: v5.15.0-25-generic Bonding Mode: fault-tolerance (active-backup) Primary Slave: None Currently Active Slave: ens9f0 MII Status: up MII Polling Interval (ms): 100 Up Delay (ms): 0 Down Delay (ms): 0 Peer Notification Delay (ms): 0 Slave Interface: ens9f0 MII Status: up Speed: 100000 Mbps Duplex: full Link Failure Count: 0 Permanent HW addr: 30:3e:a7:02:35:70 Slave queue ID: 0 Slave Interface: ens11f0 MII Status: up Speed: 100000 Mbps Duplex: full Link Failure Count: 0 Permanent HW addr: 30:3e:a7:02:2f:e8 Slave queue ID: 0Example of Service Network verification result
Multi-node GPU Cluster: Verify time synchronization with the Time Server after new deployment
The OS image includes the installation of the chrony daemon and configuration for SCP NTP server synchronization. Use the following command to verify whether a line marked with ^* exists in the MS Name column.
Command to check the chrony daemon source and synchronization status
Color mode~$ chronyc sources -V~$ chronyc sources -VExample of status check command Result of checking chrony daemon status
Color modeMS Name/IP address Stratum Poll Reach LastRx Last sample =============================================================================== ^* 198.19.0.54 4 10 377 1040 -16us[ -37us] +/- 9982usMS Name/IP address Stratum Poll Reach LastRx Last sample =============================================================================== ^* 198.19.0.54 4 10 377 1040 -16us[ -37us] +/- 9982usExample of checking the chrony daemon status
5.3 - Release Note
Multi-node GPU Cluster
- The Ubuntu image version 24.04 has been added.
- The B300 GPU server type has been added.
- Provides an IaC environment using Terraform.
- You can terminate multiple resources simultaneously from the GPU Node list.
- It must be a node that uses the same DataSet and Cluster Fabric.
- Integrated with Cloud Monitoring.
- You can view key performance metrics in real time with Cloud Monitoring.
- We have launched the Multi-node GPU Cluster service.
- We provide a service that offers physical GPU servers without virtualization for large-scale, high-performance AI computing.
6 - Cloud Functions
6.1 - Overview
Service Overview
Cloud Functions is a serverless computing-based FaaS (Function as a Service) that lets you easily run function‑based applications without provisioning servers. Users don’t have to hassle with managing servers or containers for scaling, and can focus on writing code and deploying applications.
Features
- Easy and Convenient Development Environment: Developers can easily create Function resources that connect to events across multiple environments using a Code Editor suitable for the selected runtime, and can write and invoke code with ease.
- Serverless Computing: You can use a serverless code execution service for development in the Samsung Cloud Platform environment. * The resources required to invoke and run function-based applications are allocated and managed by the Samsung Cloud Platform according to the scale of execution.
- Efficient Cost Management: The invoked Function aggregates usage (total number of calls, total execution time) and is billed for the time actually used to run the application. * Functions with low usage can have the Cloud Functions scaler adjusted to a scale-to-zero state, so they do not consume resources, enabling efficient cost management.
Service diagram
Provided Features
Cloud Functions provides the following features.
- Code Development Environment: Create Runtime-optimized functions, write and edit code
- Refer to 구성 요소 > Runtime for supported runtimes.
- Function execution, environment management, monitoring: endpoint definition, Token management, access control configuration, trigger configuration, etc., runtime environment/variable definition and modification, artifact invocation/testing for Deploy/Test, service deployment, progress status monitoring/logging
- Serverless Computing: All elements required for code writing and deployment are managed by Samsung Cloud Platform, with automatic scaling adjustments based on deployment.
- Sample Code Provided: By providing various sample codes through Blueprint, you can start easily and quickly
Component
Runtime
Cloud Functions currently supports the following runtimes. Additional continuously supported runtimes will also be added.
| Runtime | Version | Deprecation date |
|---|---|---|
| GO | 1.21 | 2026.7.30 |
| GO | 1.23 | 2026.7.30 |
| GO | 1.25 | - |
| java | 17 | - |
| Node.js | 18 | 2026.7.30 |
| Node.js | 20 | 2026.7.30 |
| Node.js | 22 | - |
| Node.js | 24 | - |
| PHP | 8.1 | 2026.7.30 |
| PHP | 8.4 | - |
| PHP | 8.5 | - |
| Python | 3.9 | 2026.7.30 |
| Python | 3.10 | - |
| Python | 3.11 | - |
| Python | 3.14 | - |
Runtime scheduled for deprecation
- For Runtime versions slated for deprecation, technical support is expected to end within 60 days (End of Technical Support) (detailed schedule will be announced on the console).
- When technical support for a Runtime version ends, security patches and updates will no longer be applied to that Runtime version.
- Already created functions can be used continuously without a call time limit, but they cannot be modified.
- We do not guarantee bugs, errors, defects, or vulnerabilities that arise in Runtime versions for which technical support has ended.
- We recommend migrating to and using an alternative version of a function that is scheduled for deprecation to ensure safe usage.
Provision status by region
The Cloud Functions service is available in the environments below.
| Region | Whether provided |
|---|---|
| Korea West 1 (kr-west1) | Provide |
| Korea East 1 (kr-east1) | Provided |
| South Korea 1 (kr-south1) | Not provided |
| South Korea South 2 (kr-south2) | Not provided |
| South Korea 3 (kr-south3) | Not provided |
Preceding Service
This is a list of services that can be configured as optional before creating the service. Please refer to the guide provided for each service and prepare in advance.
| Service Category | service | Detailed description |
|---|---|---|
| Application Service | API Gateway | A service that easily manages and monitors APIs |
6.1.1 - ServiceWatch Metrics
Cloud Functions sends metrics to ServiceWatch. The metrics provided by default monitoring are data collected at a 1‑minute interval.
Basic Metrics
The following are the basic metrics for the Cloud Functions namespace.
The metrics whose names are displayed in bold below are the metrics selected as key metrics among the default metrics provided by Cloud Functions. The key metrics are used to build service dashboards that are automatically created per service in ServiceWatch. You can also view the key metrics on the Monitoring tab of the Cloud Functions detail page.
Each metric guides users, through the user guide, on which statistical values are meaningful when querying that metric, and among the meaningful statistics, the values displayed in bold are the primary statistics. In the service dashboard or monitoring tab, you can view key metrics using these primary statistics.
| Performance items | Detailed description | unit | meaningful statistics |
|---|---|---|---|
| Invocations | Average number of times the function is called per unit time | Count |
|
| Success Calls | Average number of times the runtime code operates correctly and returns a response code per unit time during a function call. | Count |
|
| Error Calls | Average number of calls per unit time that encounter errors during function invocation, including runtime errors due to response timeouts and logic errors. | Count |
|
| Memory Usage | Average memory usage per unit time while the function is executing | Kilobytes |
|
| Active Operations | When a function is called multiple times simultaneously, the average number of tasks generated per unit time for concurrent processing. | Count |
|
6.2 - How-to guides
Users can create the service by entering the required information for Cloud Functions and selecting detailed options through the Samsung Cloud Platform Console.
Creating Cloud Functions
All Services > Compute > Cloud Functions Click the menu. 1. Go to the Service Home page of Cloud Functions.
On the Service Home page, click the Create Cloud Functions button. 2. Go to the Create Cloud Functions page.
On the Create Cloud Functions page, enter the information required to create the service.
Category RequiredDetailed description Function name Required Enter the name of the Funtion to create - Start with a lowercase English letter and use lowercase English letters, numbers, and special characters (
-) to enter between 3 and 64 characters
Runtime Required Select Runtime creation method - New: Create a new Runtime
- Start with Blueprint: Write using the Runtime source code provided by the service
Runtime & Vesion Essential Select Runtime and Version - When Create New is selected
- Refer to the Runtime & Version list in Components > Runtime
- For the Java runtime, UI code editing is not supported, but you can import a JAR file from Object Storage and execute it
- When Start with Blueprint is selected
- You can view a source code example by clicking the View Source Code button for that Runtime & Version
- Refer to the Blueprint Detailed Guide for more information on Blueprint settings
- If the Runtime version has reached End of Technical Support (EoTS), it cannot be modified after creation
Table. Cloud Functions service information input fields- Start with a lowercase English letter and use lowercase English letters, numbers, and special characters (
Summary Check the detailed information and estimated charges generated in the panel, and click the Create button.
- When creation is complete, check the created resources on the Cloud Functions list page.
View Cloud Functions details
Cloud Functions Details page consists of Details, Monitoring, Logs, Code, Configuration, Triggers, Tags, Job History tabs.
To view detailed information about the Cloud Functions service, follow these steps.
- All Services > Compute > Cloud Functions Click the menu. 1. Navigate to the Service Home page of Cloud Functions.
- On the Service Home page, click the Function menu. 2. Function list Go to the page.
- On the Function list page, click the resource to view detailed information. 3. Navigate to the Function Details page.
- Function Details page displays status information and additional feature information, and consists of Details, Monitoring, Logs, Code, Configuration, Triggers, Tags, Job History tabs.
Category Detailed description Cloud Functions status Cloud Functions status information - Ready: green icon, a state where normal function invocations are possible
- Not Ready: gray icon, a state where normal function invocations are not possible
- Deploying: yellow icon, a state where the function is being created or updated, which triggers the next action
- Function creation and modification
- Code tab: edit code in the editor
- Code tab: inspect jar file
- Trigger tab: add and modify
- Configuration tab: modify
- Running: blue icon, a state where normal function invocations are possible and a cold‑start prevention policy is applied
Service cancellation Cancel service button Table. Cloud Functions status information and additional features
- Function Details page displays status information and additional feature information, and consists of Details, Monitoring, Logs, Code, Configuration, Triggers, Tags, Job History tabs.
Detailed Information
Function list page allows you to view detailed information of the selected resource and edit the information if needed.
| Category | Detailed description |
|---|---|
| service | Service name |
| Resource Type | Resource Type |
| SRN | Unique resource ID in Samsung Cloud Platform |
| Resource name | Resource name
|
| Resource ID | Service’s unique resource ID |
| Constructor | User who created the service |
| Creation date and time | Service creation timestamp |
| Modifier | User who modified the service |
| Modification date | Date and time of service modification |
| Function name | Name of the Cloud Function |
| Runtime | Runtime types and versions
|
- Even after technical support for the Runtime version ends, functions that have already been created can continue to be used without any call time limit. * However, security patches and updates for that Runtime version will not be applied.
- Bugs, errors, defects, or vulnerabilities that occur in Runtime versions that are no longer supported are not covered.
- If support for a Runtime has ended, the user must create a replacement Runtime version and then manually delete the function of the previous version.
- To use this function safely, create a new instance using the Lastest or Stable version.
Monitoring
Function List page lets you view the Cloud Functions usage information for the selected resource.
| Category | Detailed description |
|---|---|
| Number of calls | Average number of times the function is called per unit time (instances) |
| execution time | Average execution time (seconds) of the function per unit time |
| Memory usage | Average memory usage (KB) during the function execution per unit time |
| Current number of tasks | When the function is called multiple times simultaneously, the average number of tasks (count) generated per unit time for concurrent processing. |
| Successful call count | Average number of times (cases) the runtime code operated correctly and returned a response code per unit time during a function call. |
| Failed call count | Average number of calls with errors per unit time during function invocation
|
log
Function list page allows you to view the Cloud Functions logs of the selected resource.
| Category | Detailed description |
|---|---|
| unit period | Select the period to view Cloud Functions log information
|
| log message | Functions are displayed in order, starting with the most recent occurrence. |
code
Function List page lets you view and edit the Cloud Functions code of the selected resource.
The way to view and edit source code varies depending on the runtime used.
- Inline Editor: Node.js, Python, PHP, Go
- Compressed file (.jar/.zip) execution: Java
| Category | Detailed description |
|---|---|
| source code | Inline editor method |
| code information | Display code information |
| Edit | After clicking the Edit button, you can modify the code in the inline editor. |
| Category | Detailed description |
|---|---|
| source code | Execution method for compressed files (.jar/.zip) |
| code information | Display compressed file information
|
| Edit | Jar file can be modified
|
- If technical support for the runtime version has ended, you cannot modify the code. * Also, because security patches and updates are not applied, create and use the function anew with the Latest or Stable version to ensure safe usage.
- In the case of Java Runtime, it does not provide a UI code editing feature, and you must select a compressed file (.jar/.zip) from a bucket in the Object Storage service.
- If a user does not have an authentication key generated for the Object Storage service, they cannot execute Import from Object Storage, so they must create an authentication key in advance.
- For detailed information on creating an authentication key, see Creating an Authentication Key.
- The Object Storage bucket for the Cloud Functions service must have its access control set to allow.
- For detailed information on Object Storage access control, see Allowing Cloud Functions Service Access.
Configuration
On the Function list page, you can view the Cloud Functions configuration of the selected resource.
| Category | Detailed description |
|---|---|
| General configuration | Memory and timeout settings of Cloud Function
|
| function URL | Issue an HTTPS URL address that can access the function
|
| environment variable | Set runtime environment variables
|
| Private connection configuration | Can be used in conjunction with PrivateLink Service
|
| Permission | Add and manage resource policies for IAM-based functions
|
- If technical support for the Runtime version has been discontinued, configuration items cannot be modified. * Also, because security patches and updates are not applied, create and use the function anew with the Latest or Stable version to ensure safe usage.
- If access control is disabled, the registered access information is deleted, making function access control impossible, which can expose the system to security attacks such as external scanning, hacking, etc.
- CPU cores proportional to the memory allocation of General configuration are automatically assigned.
- If the minimum number of executions of General configuration is 1 or more, Cold Start is prevented, but continuous costs are incurred.
Trigger
On the Function List page, you can view and configure the trigger information of the selected resource. By setting a trigger, you can automatically execute the Function when an event occurs.
| Category | Detailed description |
|---|---|
| Cronjob | Use Cronjob as a trigger
|
| API Gateway | Use API Gateway as a trigger
|
- If technical support for the Runtime version has ended, you cannot modify the trigger item. * Also, because security patches and updates are not applied, create and use the function anew with the Latest or Stable version to ensure safe usage.
- If the Cronjob trigger is called before the function’s timeout, the function will execute concurrently, increasing both the execution count and the total time. * Therefore, be cautious because continuous additional costs can lead to high expenses.
- If the status is Deploying, it cannot be modified.
- Refer to Setting up triggers for trigger configuration.
Tag
In the Tag tab, you can view the resource’s tag information, and add, modify, or delete it.
| Category | Detailed description |
|---|---|
| Tag list | Tag list
|
Job History
Job History page allows you to view the resource’s job history.
| Category | Detailed description |
|---|---|
| Task History List | Resource Change History
|
Changing Java Runtime code
If you are using Java Runtime, you cannot modify the code directly, so you must select and replace the archive file (.jar/.zip) in the bucket of the Object Storage service.
To modify a compressed file, follow these steps.
All Services > Compute > Cloud Functions menu, click. 1. Navigate to the Service Home page of Cloud Functions.
On the Service Home page, click the Function menu. 2. Go to the Function list page.
On the Function List page, click the resource to change the compressed file in the code. 3. Go to the Function Details page.
Click the Edit button on the Code tab of the Function Details page. 4. Edit Function code Navigate to the page.
Click the Import from Object Storage button. 5. Import from Object Storage The popup window opens.
Category Detailed description Java Runtime Java Runtime Information Handler information Handler information - Execution Class: Automatically entered when setting the archive file (.jar/.zip)
- Execution Method: Automatically entered when setting the archive file (.jar/.zip)
Compressed file (.jar/.zip) Set the archive file to modify - Archive file name (.jar/.zip): Displays the name of the archive file. Import from Object Storage after configuration, it is entered automatically
- Import from Object Storage: Configure the Object Storage to retrieve the archive file (.jar/.zip)
Table. Cloud Functions Details - Function Code Modification ItemsEnter the URL information of the Object Storage to retrieve the compressed file in Object Storage URL, then click the Confirm button. 6. The notification popup opens.
- URL information can be found in the Folder List tab of the detailed page of the Object Storage to retrieve, under the File Information > Private URL item.
Click the Confirm button. 7. On the Function code edit page, the name of the imported compressed file is displayed in the Compressed file name (.jar/.zip).
Click the Save button.
- Users without a generated authentication key cannot execute Import from Object Storage.
- If the URL does not exist or the archive file matches any of the following, it cannot be changed.
- When using an unsupported file extension
- If there are harmful files inside the compressed file.
- If the size exceeds the supported limit
Terminate Cloud Functions
To cancel the Cloud Functions service, follow these steps.
- Click the All Services > Compute > Cloud Functions menu. 1. Navigate to the Service Home page of Cloud Functions.
- On the Service Home page, click the Function menu. 2. Navigate to the Function list page.
- On the Function List page, click the resource you want to terminate and then click the Terminate Service button.
- When the termination is complete, check on the Function list page whether the resource has been terminated.
6.2.1 - Configure Trigger
Configure Trigger
- By default, all triggers can be added in Cloud Functions.
- If it is triggered for a specific product, it should be passed to Cloud Functions.
Setting up Cronjob trigger
To set up a Cronjob trigger, follow these steps.
- Click the All Services > Compute > Cloud Functions menu. Navigate to the Service Home page of Cloud Functions.
- On the Service Home page, click the Function menu. You will be taken to the Function list page.
- Function List page, click the resource for which you want to set a trigger. You will be taken to the Function Details page.
- After clicking the Trigger tab, click the Add Trigger button. Set it. The Add Trigger popup window opens.
- Add Trigger In the popup, select Cronjob from Trigger Type. A required information input area appears at the bottom.
Category Detailed description Cronjob configuration Set the trigger’s repeat frequency - Can be set in minutes, hours, days, months, weekdays
Timezone setting Set the trigger’s reference time zone Table. Cronjob Trigger Required Information Items - After entering the required information, click the Confirm button.
- When the popup notifying an addition opens, click the Confirm button.
Configure API Gateway Trigger
To set up an API Gateway trigger, follow these steps.
- Click the All Services > Compute > Cloud Functions menu. Navigate to the Service Home page of Cloud Functions.
- On the Service Home page, click the Function menu. You will be taken to the Function List page.
- Click the resource to set the trigger on the Function List page. Go to the Function Details page.
- Click the Trigger tab, then click the Add Trigger button. Set it. The Add Trigger popup opens.
- In the Add Trigger popup, select API Gateway under Trigger Type. A required information input area appears at the bottom.
Category Detailed description API name Select API - You can select an existing API or create a new one
Stage Select deployment target - You can select an existing stage or create a new one
Table. API Gateway Trigger Required Information Items - After entering the required information, click the Confirm button.
- When the popup notifying the addition opens, click the Confirm button.
Configure Multi-Trigger
You can attach multiple triggers to a single function.
Modify Trigger
To modify the added trigger, follow these steps.
- Click the All Services > Compute > Cloud Functions menu. Navigate to the Service Home page of Cloud Functions.
- On the Service Home page, click the Function menu. You will be taken to the Function list page.
- On the Function List page, click the resource to edit the trigger. You will be taken to the Function Details page.
- Click the Trigger tab, then in the trigger list, click the Edit button of the trigger whose settings you want to modify. The Edit Trigger popup window opens.
- Edit Trigger After modifying the settings in the popup window, click the Confirm button.
- The configuration values differ depending on the trigger type (refer to Cronjob Trigger, API Gateway Trigger).
- When the edit notification popup appears, click Confirm.
Delete Trigger
To delete a trigger, follow these steps.
- Click the All Services > Compute > Cloud Functions menu. Navigate to the Service Home page of Cloud Functions.
- On the Service Home page, click the Function menu. You will be taken to the Function List page.
- Function List page, click the resource for which you want to set a trigger. You will be taken to the Function Details page.
- In the Trigger tab’s trigger list, select the trigger you want to delete, then click the Delete button.
- When the popup notifying you of trigger deletion opens, click the Confirm button.
6.2.2 - Blueprint Detailed Guide
Blueprint Overview
When creating Cloud Functions, you can set a Blueprint to utilize the Runtime source code provided by Cloud Functions. Refer to the following for the Blueprint items provided by Cloud Functions.
| Category | Detailed description | Remarks |
|---|---|---|
| Hello World | When the function is invoked, it responds with Hello Serverless World! | |
| Execution after timeout | It outputs code that should run after the function call timeout but does not execute. | PHP, Python not supported |
| HTTP request body | Parse the request body. | PHP not supported |
| Send HTTP requests | The Cloud function sends an HTTP request. | PHP not supported |
| Print logs | Logs the user’s Samsung Cloud Platform Console request. | PHP not supported |
| Throw a custom error | Enter the error logic directly to handle the error. | |
| Using Environment Variable | Configure environment variables within the Cloud function and execute it. |
Hello World
Hello World Explains the response-receiving configuration and a function call example (using the function URL).
Hello World Setup
To set up Hello World, follow these steps.
Click the All Services > Compute > Cloud Functions menu. Go to the Service Home page of Cloud Functions.
On the Service Home page, click the Function menu. You will be taken to the Function List page.
Function List page, click the resource to be called via URL. You will be taken to the Function Detail page.
After clicking the Configuration tab, click the Edit button for the Function URL item. The Edit Function URL popup window opens.
In the Function URL Edit popup, set Activation status to Enabled, then click the Confirm button.
Category Detailed description Enable status Configure the use of the function URL Authentication type Select whether to use IAM authentication for requests to the function URL Access control Add accessible IPs to enable management - Set to Use, then you can input and add a public access IP
Table. Required input fields when adding a triggerAfter navigating to the Code tab, click the Edit button. You will be taken to the Function Code Edit page.
After adding the handling logic for success and failure cases, click the Save button.
- Node.js source codeColor mode
exports.handleRequest = async function (params) { /** * @description User writing area (Function details) */ const response = { statusCode: 200, body: JSON.stringify('Hello Serverless World!'), }; return response; };exports.handleRequest = async function (params) { /** * @description User writing area (Function details) */ const response = { statusCode: 200, body: JSON.stringify('Hello Serverless World!'), }; return response; };Hello World - Node.js source code - Python source codeColor mode
import json def handle_request(params): # User writing area (Function details) return { 'statusCode': 200, 'body': json.dumps('Hello Serverless World!') }import json def handle_request(params): # User writing area (Function details) return { 'statusCode': 200, 'body': json.dumps('Hello Serverless World!') }Hello World - Python source code - PHP source codeColor mode
<?php function handle_request() { # User writing area (Function details) $res = array( 'statusCode' => 200, 'body' => 'Hello Serverless World!', ); return $res; } ?><?php function handle_request() { # User writing area (Function details) $res = array( 'statusCode' => 200, 'body' => 'Hello Serverless World!', ); return $res; } ?>Hello World - PHP source code
- Node.js source code
Check function call
On the Function Details page, in the Configuration tab, invoke the function URL and then verify the response.
Hello Serverless World!
Execution after timeout
Describes configuring execution after timeout (Execution after timeout) and provides an example of invoking the function (using the function URL).
Configure execution after timeout
To set Execution after timeout, follow these steps.
- Click the All Services > Compute > Cloud Functions menu. Navigate to the Service Home page of Cloud Functions.
- On the Service Home page, click the Function menu. You will be taken to the Function List page.
- On the Function List page, click the resource for which you want to set a trigger. You will be taken to the Function Details page.
- After clicking the Trigger tab, click the Add Trigger button. The Add Trigger popup window opens.
- Add Trigger In the popup window, after selecting the Trigger Type item, enter the required information displayed at the bottom and click the OK button.
- Required information varies depending on the trigger type.
Trigger Types Input field API Gateway - API name: Select an existing API or create a new one
- Stage: Select an existing stage or create a new one
Cronjob - Refer to the example and enter the trigger’s repeat frequency(minute, hour, day, month, day of week)
- Timezone setting: select the reference time zone to apply
Table. Required input fields when adding a trigger
- Required information varies depending on the trigger type.
- After moving to the Code tab, click the Edit button. You will be taken to the Function Code Edit page.
- After adding the handling logic for success and failure cases, click the Save button.
- Node.js source codeColor mode
exports.handleRequest = async function (params) { /** * @description User writing area (Function details) */ console.log("Hello world 3"); await delay(3000); const response = { statusCode: 200, body: JSON.stringify('Hello Serverless World!'), }; return response; }; const delay = (ms) => { return new Promise(resolve=>{ setTimeout(resolve,ms) }) }exports.handleRequest = async function (params) { /** * @description User writing area (Function details) */ console.log("Hello world 3"); await delay(3000); const response = { statusCode: 200, body: JSON.stringify('Hello Serverless World!'), }; return response; }; const delay = (ms) => { return new Promise(resolve=>{ setTimeout(resolve,ms) }) }Execution after timeout - Node.js source code
- Node.js source code
Check function call
On the Function Detail page’s Configuration tab, invoke the function URL and, after a brief period, check the response.
Hello Serverless World!
HTTP request body
Explains the configuration for parsing the Request Body and an example of calling the function (using the function URL).
Setting HTTP request body
To set the HTTP request body, follow these steps.
- Click the All Services > Compute > Cloud Functions menu. Navigate to the Service Home page of Cloud Functions.
- On the Service Home page, click the Function menu. You will be taken to the Function List page.
- Function List page, click the resource to set the trigger. You will be taken to the Function Details page.
- After clicking the Trigger tab, click the Add Trigger button. The Add Trigger popup window opens.
- Add Trigger In the popup window, select the Trigger Type option, then fill in the required information shown at the bottom and click the OK button.
- Required information varies depending on the trigger type.
Trigger Types Input field API Gateway - API name: Select an existing API or create a new one
- Stage: Select an existing stage or create a new one
Cronjob - Refer to the example and enter the trigger’s repeat frequency (minutes, hours, day, month, day of week)
- Timezone setting: select the reference time zone to apply
Table. Required input fields when adding a trigger
- Required information varies depending on the trigger type.
- After moving to the Code tab, click the Edit button. You will be taken to the Function Code Edit page.
- After adding the handling logic for success and failure cases, click the Save button.
- Node.js source codeColor mode
exports.handleRequest = async function (params) { /** * @description User writing area (Function details) */ const response = { statusCode: 200, body: JSON.stringify(params.body), }; return response; };exports.handleRequest = async function (params) { /** * @description User writing area (Function details) */ const response = { statusCode: 200, body: JSON.stringify(params.body), }; return response; };Execution after timeout - Node.js source code - Python source codeColor mode
import json def handle_request(params): # User writing area (Function details) return { 'statusCode': 200, 'body': json.dumps(params.json) }import json def handle_request(params): # User writing area (Function details) return { 'statusCode': 200, 'body': json.dumps(params.json) }Execution after timeout - Python source code
- Node.js source code
Check function call
In the Configuration tab of the Function Details page, after calling the Function URL, check the Body data, request Body value, and response Body value.
Request Body value
Color mode{ "testKey" :"cloud-001", "testNames": [ { "name": "Son" }, { "name": "Kim" } ], "testCode":"test" }{ "testKey" :"cloud-001", "testNames": [ { "name": "Son" }, { "name": "Kim" } ], "testCode":"test" }Request Body value Response Body value
Color mode{ "testKey" :"cloud-001", "testNames": [ { "name": "Son" }, { "name": "Kim" } ], "testCode":"test" }{ "testKey" :"cloud-001", "testNames": [ { "name": "Son" }, { "name": "Kim" } ], "testCode":"test" }Response Body value
Send HTTP requests
Explains the HTTP request configuration and an example of calling a function (using the function URL).
Send HTTP requests Configure
To configure Send HTTP requests, follow these steps.
- Click the All Services > Compute > Cloud Functions menu. Navigate to the Service Home page of Cloud Functions.
- On the Service Home page, click the Function menu. You will be taken to the Function List page.
- Click the resource to set the trigger on the Function List page. Go to the Function Details page.
- After clicking the Trigger tab, click the Add Trigger button. The Add Trigger popup window opens.
- Add Trigger In the popup window, after selecting the Trigger Type item, enter the required information displayed at the bottom and click the OK button.
- Required information varies depending on the trigger type.
Trigger Types Input field API Gateway - API name: Select an existing API or create a new one
- Stage: Select an existing stage or create a new one
Cronjob - Refer to the example and enter the trigger’s repeat frequency (minutes, hours, day, month, day of week)
- Timezone setting: select the reference time zone to apply
Table. Required input items when adding a trigger
- Required information varies depending on the trigger type.
- After moving to the Code tab, click the Edit button. You will be taken to the Function Code Edit page.
- After adding the handling logic for success and failure cases, click the Save button.
- Node.js source codeColor mode
const request = require('request'); /** * @description User writing area (Function details) */ exports.handleRequest = async function (params) { return await sendRequest(params); }; async function sendRequest(req) { return new Promise((resolve, reject) => { // Port 80 and Port 443 are available url = "https://example.com"; // Destination URL const options = { uri: url, method:'GET', json: true, strictSSL: false, rejectUnauthorized: false } request(options, (error, response, body) => { if (error) { reject(error); } else { resolve({ statusCode: response.statusCode, body: JSON.stringify(body) }); } }); }); }const request = require('request'); /** * @description User writing area (Function details) */ exports.handleRequest = async function (params) { return await sendRequest(params); }; async function sendRequest(req) { return new Promise((resolve, reject) => { // Port 80 and Port 443 are available url = "https://example.com"; // Destination URL const options = { uri: url, method:'GET', json: true, strictSSL: false, rejectUnauthorized: false } request(options, (error, response, body) => { if (error) { reject(error); } else { resolve({ statusCode: response.statusCode, body: JSON.stringify(body) }); } }); }); }Send HTTP requests - Node.js source code - Python source codeColor mode
import json import requests def handle_request(params): # User writing area (Function details) # Port 80 and Port 443 are available url = "https://example.com" # Destination URL try: response = requests.get(url, verify=True) return { 'statusCode': response.status_code, 'body': json.dumps(response.text) } except requests.exceptions.RequestException as e: return str(e)import json import requests def handle_request(params): # User writing area (Function details) # Port 80 and Port 443 are available url = "https://example.com" # Destination URL try: response = requests.get(url, verify=True) return { 'statusCode': response.status_code, 'body': json.dumps(response.text) } except requests.exceptions.RequestException as e: return str(e)Send HTTP requests - Python source code
- Node.js source code
Check Function Call
On the Function Details page, after invoking the function URL in the Configuration tab, verify the response.
<!doctype html>
<html>
<head>
<title>Example Domain</title>
<meta charset="utf-8" />
<meta http-equiv="Content-type" content="text/html; charset=utf-8" />
<meta name="viewport" content="width=device-width, initial-scale=1" />
<style type="text/css">
body {
background-color: #f0f0f2;
margin: 0;
padding: 0;
font-family: -apple-system, system-ui, BlinkMacSystemFont, "Segoe UI", "Open Sans", "Helvetica Neue", Helvetica, Arial, sans-serif;
}
div {
width: 600px;
margin: 5em auto;
padding: 2em;
background-color: #fdfdff;
border-radius: 0.5em;
box-shadow: 2px 3px 7px 2px rgba(0,0,0,0.02);
}
a:link, a:visited {
color: #38488f;
text-decoration: none;
}
@media (max-width: 700px) {
div {
margin: 0 auto;
width: auto;
}
}
</style>
</head>
<body>
<div>
<h1>Example Domain</h1>
<p>This domain is for use in illustrative examples in documents. You may use this
domain in literature without prior coordination or asking for permission.</p>
<p><a href="https://www.iana.org/domains/example">More information...</a></p>
</div>
</body>
</html><!doctype html>
<html>
<head>
<title>Example Domain</title>
<meta charset="utf-8" />
<meta http-equiv="Content-type" content="text/html; charset=utf-8" />
<meta name="viewport" content="width=device-width, initial-scale=1" />
<style type="text/css">
body {
background-color: #f0f0f2;
margin: 0;
padding: 0;
font-family: -apple-system, system-ui, BlinkMacSystemFont, "Segoe UI", "Open Sans", "Helvetica Neue", Helvetica, Arial, sans-serif;
}
div {
width: 600px;
margin: 5em auto;
padding: 2em;
background-color: #fdfdff;
border-radius: 0.5em;
box-shadow: 2px 3px 7px 2px rgba(0,0,0,0.02);
}
a:link, a:visited {
color: #38488f;
text-decoration: none;
}
@media (max-width: 700px) {
div {
margin: 0 auto;
width: auto;
}
}
</style>
</head>
<body>
<div>
<h1>Example Domain</h1>
<p>This domain is for use in illustrative examples in documents. You may use this
domain in literature without prior coordination or asking for permission.</p>
<p><a href="https://www.iana.org/domains/example">More information...</a></p>
</div>
</body>
</html>Print logs
This explains how to configure log output and an example of calling a function (using the function URL).
Configure Print logs
Print logs To set up response receiving, follow these steps.
- Click the All Services > Compute > Cloud Functions menu. Navigate to the Service Home page of Cloud Functions.
- On the Service Home page, click the Function menu. You will be taken to the Function List page.
- Function List page, click the resource to set the trigger. Function Details page will be displayed.
- After clicking the Trigger tab, click the Add Trigger button. The Add Trigger popup window opens.
- In the Add Trigger popup, select the Trigger Type item, then enter the required information displayed at the bottom and click the OK button.
- Required information varies depending on the trigger type.
Trigger Types Input field API Gateway - API name: Select an existing API or create a new one
- Stage: Select an existing stage or create a new one
Cronjob - Refer to the example and enter the trigger’s repeat frequency(minutes, hours, day, month, day of week)
- Timezone setting: select the reference time zone to apply
Table. Required input fields when adding a trigger
- Required information varies depending on the trigger type.
- After moving to the Code tab, click the Edit button. You will be taken to the Function Code Edit page.
- After adding the handling logic for success and failure cases, click the Save button.
- Node.js source codeColor mode
const winston = require('winston'); // Log module setting const logger = winston.createLogger({ format: winston.format.combine( winston.format.timestamp(), winston.format.printf(info => info.timestamp + ' ' + info.level + ': ' + info.message) ), transports: [ new winston.transports.Console() ] }); exports.handleRequest = async function (params) { /** * @description User writing area (Function details) */ const response = { statusCode: 200, body: JSON.stringify(params.body), }; logger.info(JSON.stringify(response, null, 2)); return response; };const winston = require('winston'); // Log module setting const logger = winston.createLogger({ format: winston.format.combine( winston.format.timestamp(), winston.format.printf(info => info.timestamp + ' ' + info.level + ': ' + info.message) ), transports: [ new winston.transports.Console() ] }); exports.handleRequest = async function (params) { /** * @description User writing area (Function details) */ const response = { statusCode: 200, body: JSON.stringify(params.body), }; logger.info(JSON.stringify(response, null, 2)); return response; };Print logs - Node.js source code - Python source codeColor mode
import json import logging # Log module setting logging.basicConfig(level=logging.INFO) def handle_request(params): # User writing area (Function details) response = { 'statusCode': 200, 'body': json.dumps(params.json) } logging.info(response) return responseimport json import logging # Log module setting logging.basicConfig(level=logging.INFO) def handle_request(params): # User writing area (Function details) response = { 'statusCode': 200, 'body': json.dumps(params.json) } logging.info(response) return responsePrint logs - Python source code
- Node.js source code
Check Function Call
After calling the function URL in the Configuration tab of the Function Details page, check the log in the Log tab.
[2023-09-07] 12:06:23] "host": "scf-xxxxxxxxxxxxxxxxxxxxx",
[2023-09-07] 12:06:23] "ce-id": "xxxxxxxxxxxxxxxxxxxxx",
[2023-09-07] 12:06:23] "ce-source": "xxxxxxxxxxxxxxxxxxxxx",[2023-09-07] 12:06:23] "host": "scf-xxxxxxxxxxxxxxxxxxxxx",
[2023-09-07] 12:06:23] "ce-id": "xxxxxxxxxxxxxxxxxxxxx",
[2023-09-07] 12:06:23] "ce-source": "xxxxxxxxxxxxxxxxxxxxx",Throw a custom error
Explains setting up a custom error (Throw a custom error) and an example of calling a function (using a function URL).
Configure Throw a custom error
To configure Throw a custom error, follow these steps.
- Click the All Services > Compute > Cloud Functions menu. Navigate to the Service Home page of Cloud Functions.
- On the Service Home page, click the Function menu. You will be taken to the Function List page.
- Click the resource to set the trigger on the Function List page. Go to the Function Details page.
- After clicking the Trigger tab, click the Add Trigger button. The Add Trigger popup window opens.
- Add Trigger In the popup window, after selecting the Trigger Type item, enter the required information displayed at the bottom and click the OK button.
- Required information varies depending on the trigger type.
Trigger Types Input field API Gateway - API name: Select an existing API or create a new one
- Stage: Select an existing stage or create a new one
Cronjob - Refer to the example and enter the trigger’s repeat frequency (minutes, hours, day, month, day of week)
- Timezone setting: select the reference time zone to apply
Table. Required input fields when adding a trigger
- Required information varies depending on the trigger type.
- After moving to the Code tab, click the Edit button. You will be taken to the Function Code Edit page.
- After adding the handling logic for success and failure cases, click the Save button.
- Node.js source codeColor mode
class CustomError extends Error { constructor(message) { super(message); this.name = 'CustomError'; } } exports.handleRequest = async function (params) { /** * @description User writing area (Function details) */ throw new CustomError('This is a custom error!'); };class CustomError extends Error { constructor(message) { super(message); this.name = 'CustomError'; } } exports.handleRequest = async function (params) { /** * @description User writing area (Function details) */ throw new CustomError('This is a custom error!'); };Throw a custom error - Node.js source code - Python source codeColor mode
class CustomError(Exception): def __init__(self, message): self.message = message def handle_request(parmas): raise CustomError('This is a custom error!')class CustomError(Exception): def __init__(self, message): self.message = message def handle_request(parmas): raise CustomError('This is a custom error!')Throw a custom error - Python source code - PHP source codeColor mode
<?php class CustomError extends Exception { public function __construct($message) { parent::__construct($message); $this->message = $message; } } function handle_request() { throw new CustomError('This is a custom error!'); } ?><?php class CustomError extends Exception { public function __construct($message) { parent::__construct($message); $this->message = $message; } } function handle_request() { throw new CustomError('This is a custom error!'); } ?>Throw a custom error - PHP source code
- Node.js source code
Check Function Call
On the Function Details page, after calling the Function URL in the Configuration tab, verify whether an error occurred in the Log tab.
Using Environment Variable
Explains the use of environment variables (Using Environment Variable) settings and a function call example (using function URL).
Using Environment Variable Configure
To configure Using Environment Variable, follow these steps.
- Click the All Services > Compute > Cloud Functions menu. Navigate to the Service Home page of Cloud Functions.
- On the Service Home page, click the Function menu. You will be taken to the Function List page.
- Click the resource to set the trigger on the Function List page. Go to the Function Details page.
- After clicking the Trigger tab, click the Add Trigger button. The Add Trigger popup window opens.
- In the Add Trigger popup, select the Trigger Type item, then enter the required information displayed at the bottom and click the OK button.
- Required information varies depending on the trigger type.
Trigger Types Input field API Gateway - API name: Select an existing API or create a new one
- Stage: Select an existing stage or create a new one
Cronjob - Refer to the example and enter the trigger’s repeat frequency(minutes, hours, days, months, day of week)
- Timezone setting: select the reference time zone to apply
Table. Required input fields when adding a trigger
- Required information varies depending on the trigger type.
- After moving to the Code tab, click the Edit button. You will be taken to the Function Code Edit page.
- After adding the handling logic for success and failure cases, click the Save button.
- Node.js source codeColor mode
exports.handleRequest = async function (params) { /** * @description User writing area (Function details) */ return process.env.test; };exports.handleRequest = async function (params) { /** * @description User writing area (Function details) */ return process.env.test; };Using Environment Variable - Node.js source code - Python source codeColor mode
import json import os def handle_request(params): # User writing area (Function details) return os.environ.get("test")import json import os def handle_request(params): # User writing area (Function details) return os.environ.get("test")Using Environment Variable - Python source code - PHP source codeColor mode
import json def handle_request(params): # User writing area (Function details) return os.environ.get("test")import json def handle_request(params): # User writing area (Function details) return os.environ.get("test")Using Environment Variable - PHP source code
- Node.js source code
- After moving to the Configuration tab, click the Edit button in the Environment Variables area. The Edit Environment Variables popup will open.
- After entering the environment variable information, click the Confirm button.
Category Detailed description Name Enter the key value value Enter the value Table. Environment Variable Input Items
Check function call
On the Function Details page, after calling the function URL in the Configuration tab, check the environment variable value in the Log tab.
6.2.3 - Integrate PrivateLink Service
By integrating Cloud Functions with the PrivateLink service, you can connect VPCs within the Samsung Cloud Platform to other VPCs, and VPCs to services, without using the external internet.
The data uses only the internal network, providing high security, and does not require public IP, NAT, VPN, or an internet gateway.
Enable PrivateLink Service
To connect the PrivateLink Service, you must first enable the service.
To enable the PrivateLink service, follow these steps.
- All Services > Compute > Cloud Functions Click the menu. 1. Navigate to the Service Home page of Cloud Functions.
- On the Service Home page, click the Function menu. 2. Go to the Function list page.
- On the Function List page, click the resource to associate with PrivateLink. 3. Function Details page.
- On the Function Details page, click the Configuration tab.
- In Private connection configuration, click the Edit button of PrivateLink Service. 5. PrivateLink Service Edit The popup window opens.
- PrivateLink Service Edit In the popup window, after checking the Use item of Activation Status, click the Confirm button. 6. Configuration tab’s Private connection configuration displays PrivateLink Service information.
| Category | Detailed description |
|---|---|
| Private URL | PrivateLink Service URL information |
| PrivateLink Service ID | PrivateLink Service ID information |
| Request Endpoint Management | List of PrivateLink Endpoints that requested a PrivateLink Service connection
|
Integrating PrivateLink Service
You can expose the function for private access from another VPC by integrating with PrivateLink Service.
To integrate the PrivateLink service, review the following tasks.
- Register the domain for the PrivateLink Endpoint IP address and the Private URL address to invoke the issued Private URL.
192.168.0.13 abc123.scf.private.kr-west1.qa2.samsungsdscloud.com - When invoking the PrivateLink Service, verify IAM authentication based on the credentials of the Endpoint creator required for the Endpoint.
Create PrivateLink Endpoint
Create an entry point to access the PrivateLink Service of the user VPC.
To create a PrivateLink Endpoint, follow these steps.
- Click the All Services > Compute > Cloud Functions menu. 1. Go to the Service Home page of Cloud Functions.
- On the Service Home page, click the Function menu. 2. Go to the Function list page.
- On the Function list page, click the resource to associate with PrivateLink. 3. Function Details Go to the page.
- On the Function Details page, click the Configuration tab.
- Click the Add button in Private connection configuration of PrivateLink Endpoint. 5. Add PrivateLink Endpoint The popup window opens.
- Add PrivateLink Service in the popup window, after entering the PrivateLink Service ID and Alias information, click the Confirm button.
- When the popup indicating creation opens, click the Confirm button. 7. Configuration tab’s Private connection configuration displays PrivateLink Endpoint information.
| Category | Detailed description |
|---|---|
| PrivateLink Endpoint ID | PrivateLink Endpoint ID information |
| PrivateLink Service ID | PrivateLink Service ID information |
| Alias | hostalias information that can be used instead of an IP address for accessing a PrivateLink Endpoint |
| status | Approval status of PrivateLink Endpoint
|
Integrating APIGW Private EPS
To connect the SCF Endpoint and the APIGW Private Endpoint, you must specify the Private URL in the SCF Endpoint Alias instead of the APIGW EPS resource path.
- Private URL example:
181b6126ef6d4e4b81370df5.apigw.private.kr-west1.s.samsungsdscloud.com/get/resourcepath
To integrate APIGW Private EPS, refer to the following code.
const request = require('request');
/**
* @description User writing area (Function details)
*/
exports.handleRequest = async function (params) {
return await sendRequest(params);
};
async function sendRequest(req) {
return new Promise((resolve, reject) => {
// Port 80 and Port 443 are available
url = "https://{alias}/{resource_path}"; // Destination URL
/**
{alias} is the alias name entered when creating an Endpoint within the function
{resoure_path} is the resource path (/get/resourcepath) specified in the Private URL of APIGW EPS
*/
const options = {
uri: url,
method:'GET',
json: true,
strictSSL: false,
rejectUnauthorized: false
}
request(options, (error, response, body) => {
if (error) {
reject(error);
} else {
resolve({
statusCode: response.statusCode,
body: JSON.stringify(body)
});
}
});
});
}const request = require('request');
/**
* @description User writing area (Function details)
*/
exports.handleRequest = async function (params) {
return await sendRequest(params);
};
async function sendRequest(req) {
return new Promise((resolve, reject) => {
// Port 80 and Port 443 are available
url = "https://{alias}/{resource_path}"; // Destination URL
/**
{alias} is the alias name entered when creating an Endpoint within the function
{resoure_path} is the resource path (/get/resourcepath) specified in the Private URL of APIGW EPS
*/
const options = {
uri: url,
method:'GET',
json: true,
strictSSL: false,
rejectUnauthorized: false
}
request(options, (error, response, body) => {
if (error) {
reject(error);
} else {
resolve({
statusCode: response.statusCode,
body: JSON.stringify(body)
});
}
});
});
}6.2.4 - Resource-based Policy Guide
Resource-based policy overview
The resource-based policy (Resource-based Policy) of Cloud Functions is a policy granted to a resource that can decide to allow or deny (Effect) an action (Action) on a specific resource for a principal (Principal). You can directly define the principal that can invoke a function by using resource-based policies.
You can allow function calls by defining the following in a resource-based policy.
- User of the specified Samsung Cloud Platform account
- Specified source IP address range or CIDR block
A source policy is defined as a JSON policy document attached to the API, which controls whether the specified security principal (typically an IAM role or group) can call the API.
| Category | description | example |
|---|---|---|
| Principal | Specify the caller of the function | Specific object storage bucket, API Gateway, other Samsung Cloud Platform accounts, etc. |
| Task(Action) | Define the allowed functions | Mostly scf:InvokeFunction |
| Condition(Condition) | Restrict to allow only in specific situations | Allow only requests originating from a bucket with a specific SRN. |
- Cloud Functions’ resource-based policies leverage the rules of IAM’s resource-based policies.
- For instructions on creating or modifying policies using JSON, refer to the JSON Mode Utilization Guide.
Resource-based policy usage scenario
The primary use cases for resource-based policies are as follows.
Resource-based policy scenario
The resource-based policy scenarios used when a Cloud Functions function runs are as follows.
| Category | description | Reference example |
|---|---|---|
| Function URL - Authentication Type None | It is required when generating a function URL for invocation.
| Function URL (Auth Type None) Example |
| Function URL - Authentication Type IAM |
| Function URL (authentication type IAM) example |
| API Gateway trigger | It is required when API Gateway calls Lambda to handle external API requests.
| API Gateway Trigger Example |
| PrivateLink connection | You can connect a PrivateLink Service to define the function for private access from another VPC.
| PrivateLink connection example |
User addition usage scenario
Although it is not automatically registered as a resource-based policy for Cloud Functions, users can add and use it as needed. The scenarios that users can add and utilize are as follows.
- Cross-Account Access
- If an IAM user in account A wants to invoke a Lambda in account B, register account A in the function policy of account B.
- Hybrid Access Control
- It can be configured so that access is allowed only when both conditions are met—a specific user and a specific IP range—rather than merely restricting by account or IP alone.
Resource-based policy management for Cloud Functions
To view and configure resource-based policies for Cloud Functions, follow these steps.
- Click the All Services > Compute > Cloud Functions menu. Navigate to the Service Home page of Cloud Functions.
- On the Service Home page, click the Function menu. You will be taken to the Function list page.
- On the Function List page, click the resource for which you want to set a policy. You will be taken to the Function Details page.
- Click the Configuration tab on the Function Details page.
- Click the Edit button of the Resource-based policy permission item. The Resource policy edit popup window opens.
- In the Resource Policy edit popup, after selecting the Policy Template, write the policy.
- For policy examples by policy template, refer to Resource-based policy examples.
- When the writing is complete, click the Confirm button.
- Click the Delete button to delete the registered policy.
Example of resource-based policy
Users can define additional resource-based policies as needed or modify existing policies for use.
- For some features, a resource‑based policy (or credential) must be registered to use them in Cloud Functions.
- In the resource-based policy examples described in this guide, Cloud Functions automatically registers the example resource-based policies when each feature is enabled or linked.
Function URL - Authentication Type None
Principal is /* a policy that allows public calls.
Policy Template
{
"Statement": [
{
"Action": ["scf:InvokeFunctionUrl"],
"Condition": {
"StringEquals": {
"scf:CloudFunctionAuthType": ["NONE"]
}
},
"Effect": "Allow"
"Principal": "*"
"Resource": ["{{CloudFunctionSrn}}"],
"Sid": "InvokeFunctionURLAllowPublicAccess"
}
],
"Version": "2024-07-01"
}{
"Statement": [
{
"Action": ["scf:InvokeFunctionUrl"],
"Condition": {
"StringEquals": {
"scf:CloudFunctionAuthType": ["NONE"]
}
},
"Effect": "Allow"
"Principal": "*"
"Resource": ["{{CloudFunctionSrn}}"],
"Sid": "InvokeFunctionURLAllowPublicAccess"
}
],
"Version": "2024-07-01"
}Policy example
{
"Statement": [
{
"Action": ["scf:InvokeFunctionUrl"],
"Condition": {
"StringEquals": {
"scf:CloudFunctionAuthType": ["NONE"]
}
},
"Effect": "Allow"
"Principal": "*"
"Resource": ["srn:e::accountID:kr-west1::scf:cloud-function/functionsID"],
"Sid": "InvokeFunctionURLAllowPublicAccess"
}
],
"Version": "2024-07-01"
}{
"Statement": [
{
"Action": ["scf:InvokeFunctionUrl"],
"Condition": {
"StringEquals": {
"scf:CloudFunctionAuthType": ["NONE"]
}
},
"Effect": "Allow"
"Principal": "*"
"Resource": ["srn:e::accountID:kr-west1::scf:cloud-function/functionsID"],
"Sid": "InvokeFunctionURLAllowPublicAccess"
}
],
"Version": "2024-07-01"
}Function URL - Authentication Type IAM
This policy permits a specific user to invoke a public URL.
Policy Template
{
"Statement": [
{
"Action": ["scf:InvokeFunctionUrl"],
"Condition": {
"StringEquals": {
"scf:CloudFunctionAuthType": ["SCP_IAM"]
}
},
"Effect": "Allow"
"Principal": {
"scp": ["srn:{{Environment}}::{{AccountID}}:::iam:user/{{UserId}}"]
},
"Resource": ["{{CloudFunctionSrn}}"],
"Sid": "Statement1"
}
],
"Version": "2024-07-01"
}{
"Statement": [
{
"Action": ["scf:InvokeFunctionUrl"],
"Condition": {
"StringEquals": {
"scf:CloudFunctionAuthType": ["SCP_IAM"]
}
},
"Effect": "Allow"
"Principal": {
"scp": ["srn:{{Environment}}::{{AccountID}}:::iam:user/{{UserId}}"]
},
"Resource": ["{{CloudFunctionSrn}}"],
"Sid": "Statement1"
}
],
"Version": "2024-07-01"
}Policy Example
{
"Statement": [
{
"Action": ["scf:InvokeFunctionUrl"],
"Condition": {
"StringEquals": {
"scf:CloudFunctionAuthType": ["SCP_IAM"]
}
},
"Effect": "Allow"
"Principal": "*",
"Resource": ["srn:e::accountID:kr-west1::scf:cloud-function/functionsID"],
"Sid": "accountID-iam-invokefunctionurl"
}
],
"Version": "2024-07-01"
}{
"Statement": [
{
"Action": ["scf:InvokeFunctionUrl"],
"Condition": {
"StringEquals": {
"scf:CloudFunctionAuthType": ["SCP_IAM"]
}
},
"Effect": "Allow"
"Principal": "*",
"Resource": ["srn:e::accountID:kr-west1::scf:cloud-function/functionsID"],
"Sid": "accountID-iam-invokefunctionurl"
}
],
"Version": "2024-07-01"
}API Gateway trigger
Principal is a policy that permits public calls with a * principal.
Policy Template
{
"Statement": [
{
"Action": ["scf:InvokeFunction"],
"Condition": {
"SrnLike": {
"scp:RequestAttribute/body['x-scf-request-obj-srn']": ["{{ApiGatewayMethodSrn}}"]
}
},
"Effect": "Allow",
"Principal": {
"Service": ["apigateway.samsungsdscloud.com"]
},
"Resource": ["{{CloudFunctionSrn}}"],
"Sid": "Statement1"
}
],
"Version": "2024-07-01"
}{
"Statement": [
{
"Action": ["scf:InvokeFunction"],
"Condition": {
"SrnLike": {
"scp:RequestAttribute/body['x-scf-request-obj-srn']": ["{{ApiGatewayMethodSrn}}"]
}
},
"Effect": "Allow",
"Principal": {
"Service": ["apigateway.samsungsdscloud.com"]
},
"Resource": ["{{CloudFunctionSrn}}"],
"Sid": "Statement1"
}
],
"Version": "2024-07-01"
}Policy Example
{
"Statement": [
{
"Action": [
scf:InvokeFunction
],
"Condition": {
"SrnLike": {
"scp:RequestAttribute/body['x-scf-request-obj-srn']": [
srn:e::accountID:kr-west1::apigateway:method/MethodID/*/GET/test
]
}
},
"Effect": "Allow"
"Principal": {
"Service": [
"apigateway.samsungsdscloud.com"
]
},
"Resource": [
srn:e::accountID:kr-west1::scf:cloud-function/functionID
],
"Sid": "999e9a9999de4d4683c9e10c74ee999z"
}
],
"Version": "2024-07-01"
}{
"Statement": [
{
"Action": [
scf:InvokeFunction
],
"Condition": {
"SrnLike": {
"scp:RequestAttribute/body['x-scf-request-obj-srn']": [
srn:e::accountID:kr-west1::apigateway:method/MethodID/*/GET/test
]
}
},
"Effect": "Allow"
"Principal": {
"Service": [
"apigateway.samsungsdscloud.com"
]
},
"Resource": [
srn:e::accountID:kr-west1::scf:cloud-function/functionID
],
"Sid": "999e9a9999de4d4683c9e10c74ee999z"
}
],
"Version": "2024-07-01"
}PrivateLink connection
This is a policy that allows function calls through a Privatelink Endpoint for specific users.
Policy Template
{
"Statement": [
{
"Action": ["scf:InvokeFunction"],
"Condition": {
"StringEquals": {
"scf:CloudFunctionPrivatelinkServiceAuthType": ["SCP_IAM"]
}
},
"Effect": "Allow"
"Principal": {
"scp": ["srn:{{Environment}}::{{AccountID}}:::iam:user/{{UserId}}"]
},
"Resource": ["{{CloudFunctionSrn}}"],
"Sid": "Statement1"
}
],
"Version": "2024-07-01"
}{
"Statement": [
{
"Action": ["scf:InvokeFunction"],
"Condition": {
"StringEquals": {
"scf:CloudFunctionPrivatelinkServiceAuthType": ["SCP_IAM"]
}
},
"Effect": "Allow"
"Principal": {
"scp": ["srn:{{Environment}}::{{AccountID}}:::iam:user/{{UserId}}"]
},
"Resource": ["{{CloudFunctionSrn}}"],
"Sid": "Statement1"
}
],
"Version": "2024-07-01"
}Policy Example
{
"Statement": [
{
"Action": [
scf:InvokeFunction
],
"Condition": {
"StringEquals": {
"scf:CloudFunctionAuthType": [
SCP_IAM
]
}
},
"Effect": "Allow",
"Principal": {
"scp": [
srn:e::accountID:::iam:user/userID
]
},
"Resource": [
srn:e::accountID:kr-west1::scf:cloud-function/functionID
],
"Sid": "rrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr-privatelink-invokefunction"
}
],
"Version": "2024-07-01"
}{
"Statement": [
{
"Action": [
scf:InvokeFunction
],
"Condition": {
"StringEquals": {
"scf:CloudFunctionAuthType": [
SCP_IAM
]
}
},
"Effect": "Allow",
"Principal": {
"scp": [
srn:e::accountID:::iam:user/userID
]
},
"Resource": [
srn:e::accountID:kr-west1::scf:cloud-function/functionID
],
"Sid": "rrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr-privatelink-invokefunction"
}
],
"Version": "2024-07-01"
}6.3 - API Reference
6.4 - CLI Reference
6.5 - Release Note
Cloud Functions
- Technical support end-of-life information (EoTS) for the Runtime version is provided.
- When technical support ends, security patches and updates for that Runtime version will no longer be applied.
- Already created functions can be used continuously without any call time limit. * However, we do not guarantee bugs, errors, defects, or vulnerabilities that occur in Runtime versions for which technical support has ended.
- Technical support will be discontinued. The Runtime versions are as follows (see the full supported Runtime versions at Rutime version).
- GO: 1.21, 1.23
- Node.js: 18, 20
- PHP 8.1
- Python: 3.9
- Including runtime of additionally excluded functions
- The AIOS service integration has been stopped.
- Since the AIOS service has been discontinued, the AIOS integration service has been removed from Cloud Functions.
- You can set a resource-based policy on a function.
- Resource-based policies are policies applied to the function itself that allow external access.
- You can use resource-based policies to allow or deny actions on specific resources to a specific principal (Principal).
- You can use functions in conjunction with the AIOS service.
- You can integrate Cloud Functions with AIOS to leverage LLMs.
- You can use functions in conjunction with the PrivateLink service.
- Through a Private connection (PrivateLink), you can establish internal connections between VPCs and between a VPC and services on the Samsung Cloud Platform without traversing the Internet.
- The feature to upload Java Runtime executable files has been added.
- You can import and configure a Java Runtime executable archive file (.jar/.zip) in Object Storage.
- We have officially launched the Cloud Functions service.
- It is a serverless computing‑based FaaS (Function as a Service) that lets you easily run function‑style applications without server provisioning.
7 - Virtual Server DR
When the system is disrupted due to various disaster situations and risk factors, you can replicate the Block Storage attached to a Virtual Server in another region to restore normal operation quickly.
7.1 - Overview
Service Overview
Virtual Server DR is a service that quickly restores the system by replicating the Virtual Server and its attached Block Storage to a region different from the one currently in use. Even if the system is disrupted due to various disaster situations or unexpected events, you can quickly restore normal operation by using Virtual Server DR.
- The Virtual Server DR service can be configured using partner solutions sold on the Samsung Cloud Platform Marketplace.
- For detailed information on using the Marketplace, please refer to Marketplace.
- When you purchase and use a service sold on the Marketplace, a delegated tax invoice is issued in accordance with a separate agreement with the Marketplace software supplier.
- If you request a partner solution product for Virtual Server DR from the Marketplace, the request information will be emailed to the responsible person. Coordinate product details and schedule with the person in charge. Software installation and charges will be billed based on the confirmed date.
- The services sold in the Marketplace of Samsung Cloud Platform are offered by individual sellers, and Samsung SDS serves as an e‑commerce intermediary, not a party to the sales. Accordingly, Samsung SDS does not guarantee or assume responsibility for the service information or transactions conducted by individual sellers.
Features
- Easy DR Environment Setup: You can easily configure a Virtual Server for DR setup through partner solutions in the Marketplace of Samsung Cloud Platform.
- Various Environment Configurations: Using partner solutions, you can configure various environments such as physical to virtual (P2V) and virtual to virtual (V2V) environments, and it supports multiple operating systems (Windows, Linux).
Service Architecture Diagram
Provided features
For the main features, please refer to the Product Catalog Details page of the partner solution sold on the Samsung Cloud Platform Marketplace.
Preliminary Service
This is a list of services that must be pre-configured before creating the service. Please refer to the guide provided for each service and prepare accordingly.
| Service Category | service | Detailed description |
|---|---|---|
| Networking | VPC | A service that provides an isolated virtual network in a cloud environment |
| Networking | Security Group | Virtual firewall that controls server traffic |
| Compute | Virtual Server | Virtual server optimized for cloud computing |
7.2 - Release Note
Virtual Server DR
- We have officially launched the Virtual Server DR service.
- When the system is interrupted by various disaster scenarios and risk factors, it can be restored to normal operation in a short time.
8 - Block Storage
8.1 - Overview
Service Overview
Block Storage is a high-performance storage that stores data in block units arranged in a fixed size and layout.
Suitable for large-scale, high-performance requirements such as databases and mail servers, and users can directly allocate volumes on the server for use.
Features
- Large Volume Provisioning: OS configuration volumes are created with at least the minimum size per image and can be expanded up to 12 TB, and data storage volumes outside the OS can be created and expanded from a minimum of 8 GB to a maximum of 12 TB. Capacity expansion is performed reliably while online.
- High-Performance Full SSD Offering: It provides high durability and availability based on redundant controllers and Disk Array RAID. Since a Full SSD disk is provided by default, it is suitable for high-speed data processing tasks such as database workloads.
- Snapshot Backup: Data that has been modified or deleted can be recovered using the image snapshot feature. The user selects a snapshot created at the desired point in time from the list and performs the recovery.
Service Architecture Diagram
Provided features
Block Storage provides the following features.
- Volume Name: Users can set or modify the name for each volume.
- Capacity: Volumes can be created with sizes from a minimum of 8 GB up to a maximum of 12 TB, and can be expanded while in use. The OS default volume can be created with at least the minimum size for each image.
- Connection Server: You can select a Virtual Server to connect or disconnect.
- Multi-Server Connection (Multi Attach): Connect to two or more servers, with no limit on the number of servers per volume, and a Virtual Server can connect up to 26 volumes.
- Encryption: All volumes of Block Storage have AES-256 algorithm encryption applied by default, and when a volume is of HDD/SSD_KMS disk type, additional in‑transit encryption is provided for the segment between the instance and the Block Storage attached to the instance.
- Snapshot: The image snapshot feature allows recovery of modified or deleted data. Users select a snapshot created at the desired point in time from the list to restore it.
- Volume Transfer: You can transfer a volume to another Account using the volume transfer feature.
- Monitoring: IOPS, Latency, Throughput, etc. monitoring information can be viewed through the Cloud Monitoring service.
Component
You can create a volume by entering the capacity and selecting the disk type according to the user’s service scale and performance requirements. When using the snapshot feature, you can restore data to the desired point in time.
Volume
A volume (Volume) is the basic creation unit of the Block Storage service and is used as data storage space. Users select a name, capacity, and disk type to create a volume, then attach it to a Virtual Server for use.
The volume name generation rules are as follows.
-, _) within 255 characters.snapshot
A snapshot (Snapshot) is an image backup of a volume at a specific point in time. Users can view the snapshot name and creation timestamp in the snapshot list to select the snapshot they want to restore, and through that snapshot they can recover data that was changed or deleted.
The following considerations apply when using snapshots.
- The snapshot creation time is based on Asia/Seoul (GMT +09:00).
- Select the snapshot recovery button to restore the Block Storage volume to the latest snapshot.
- If you select a specific snapshot from the snapshot list, you can recover by creating a new volume based on the snapshot.
- Snapshots are billed based on the size of the original Block Storage, so please delete any unnecessary snapshots.
Preliminary Service
This is a list of services that must be pre-configured before creating the service. Please refer to the guide provided for each service for details and prepare in advance.
| Service Category | service | Detailed description |
|---|---|---|
| Compute | Virtual Server | Virtual server optimized for cloud computing |
8.1.1 - Monitoring Metrics
According to Samsung Cloud Platform’s policy, the Cloud Monitoring service is scheduled to be discontinued in September 2026.
Accordingly, after the September 2026 release, resource monitoring of the Samsung Cloud Platform via Cloud Monitoring will no longer be possible.
With the new alternative service, you can continuously perform resource monitoring by leveraging ServiceWatch released in October 2025.
ServiceWatch provides more modern and powerful features, replacing Cloud Monitoring to deliver a seamless monitoring environment.
Block Storage planned to be integrated with ServiceWatch from the July 2026 release onward is.
Detailed information about ServiceWatch is available in the ServiceWatch Overview.
Block Storage Monitoring Metrics
The table below shows the monitoring metrics for Block Storage that can be viewed in Cloud Monitoring. For detailed usage of Cloud Monitoring, refer to the Cloud Monitoring guide.
| Performance Item Name | Explanation | unit |
|---|---|---|
| Volume Total | Total byte count | bytes |
| IOPS [Read] | iops(read) | iops |
| IOPS [Write] | iops(write) | iops |
| Latency Time [Read] | Read latency | usec |
| Latency Time [write] | Latency (write) | usec |
| Throughput [Read] | Throughput (read) | bytes/s |
| Throughput [Write] | Throughput (write) | bytes/s |
8.2 - How-to guides
Users can create the service by entering the required information for Block Storage and selecting detailed options through the Samsung Cloud Platform Console.
Create Block Storage
You can create and use the Block Storage service in the Samsung Cloud Platform Console.
To create a Block Storage, follow these steps.
Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
Click the Block Storage menu. Go to the Block Storage List page.
On the Block Storage page, click the Create Service button. You will be taken to the Block Storage Creation page.
On the Block Storage Creation page, enter the information required to create the service and select detailed options.
Category Required or notDetailed description Volume name Required Volume name - English letters, numbers, spaces, and special characters(
-,_) using to input up to 255 characters
snapshot Selection Select the snapshot to use when creating a volume from a snapshot - After checking the Use option, snapshot selection is possible
- When creating a service via snapshot recovery volume creation, provide the recovery snapshot name
- If not selected, an empty volume is created
Disk type Required Select Disk Type - SSD_Provisioned: SSD volume with configurable IOPS and throughput
- SSD/HDD: Standard SSD/HDD volume
- SSD/HDD_KMS: Additional encrypted volume
- SSD/HDD_MultiAttach: Volume that can be attached to more than one server
- Cannot be modified after service creation
- When creating a service via snapshot recovery volume creation, it is set identical to the original and cannot be modified
capacity Selection Capacity setting - Can be created within 8~12,228GB
- Enter the number of units provided in 8GB increments
- Enter a capacity that is equal to or larger than the original when creating the service via snapshot recovery volume creation
Max IOPS Required Enter the maximum IOPS value between 5,000 and 20,000 - Can be set only when disk type is SSD_Provisioned
Max Throughput Required Enter the maximum Throughput value between 250 and 1,000 - Can be set only when the disk type is SSD_Provisioned
Table. Block Storage Service Information Input Items- English letters, numbers, spaces, and special characters(
Check the detailed information and estimated billing amount generated in the Summary panel, and click the Create button.
- When creation is complete, check the created resource on the Block Storage List page.
- All volumes of Block Storage are encrypted by default with the AES-256 algorithm.
- Windows-based Virtual Servers cannot use MultiAttach disks. Use a separate replication method or solution.
- If the volume is of HDD/SSD_KMS disk type, we additionally provide in‑transit encryption for the block storage segment connected to the instance.
View Block Storage Details
The Block Storage service allows you to view and edit the full resource list and detailed information. Block Storage Detail page consists of Detail Information, Snapshot List, Tags, Operation History tabs.
Follow these steps to view detailed information about the Block Storage service.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Block Storage menu. Navigate to the Block Storage List page.
- On the Block Storage List page, click the resource for which you want to view detailed information. You will be taken to the Block Storage Details page.
- Block Storage Details page displays status information and additional feature information, and consists of Details, Snapshot List, Tags, Activity History tabs.
Category Detailed description Volume status Volume status - Creating: In progress
- Downloading: In progress (applying OS image)
- Available: Created, server connection available
- Reserved: Waiting for server connection
- Attaching: Connecting to server
- Detaching: Server connection detached
- In Use: Server connection established
- Deleting: Terminating service
- Awaiting Transfer: Waiting for volume transfer
- Extending: Expanding capacity
- Error Extending: Abnormal state during capacity expansion
- Backing Up: Backing up volume
- Restoring Backup: Restoring volume backup
- Error Backing Up: Abnormal state during volume backup
- Error Restoring: Abnormal state during volume backup restoration
- Error Deleting: Abnormal state during deletion
- Error Managing: Abnormal state
- Error: Abnormal state
- Maintenance: Temporary maintenance state
- Reverting: Restoring snapshot
Previous volume Move the volume to another Account - For detailed information about moving volumes, see 볼륨 이전
Create snapshot Immediately create a snapshot at the time of creation - For detailed information on snapshot creation, see Creating a Snapshot
Snapshot recovery Recover the volume using the latest snapshot in Available state - For detailed information on snapshot recovery, see 스냅샷 복구하기
Service termination Cancel service button Table. Status Information and Additional Features
- Block Storage Details page displays status information and additional feature information, and consists of Details, Snapshot List, Tags, Activity History tabs.
Detailed information
Block Storage List page lets you view detailed information of the selected resource and edit the information if needed.
| Category | Detailed description |
|---|---|
| service | service group |
| Resource Type | Resource Type |
| SRN | Unique resource ID in Samsung Cloud Platform
|
| Resource name | Resource Name
|
| Resource ID | Service’s unique resource ID |
| constructor | User who created the service |
| Creation date and time | Service creation date and time |
| editor | User who modified the service |
| Modification date | Date and time the service was modified |
| Volume name | Volume name
|
| Volume ID | Volume unique ID |
| Disk type | Disk type |
| type | Classification by volume creation method and usage |
| capacity | Volume capacity
|
| Max IOPS | Required |
| Max Throughput | Required |
| Connection server | Connected Virtual Server
|
Snapshot List
Block Storage List page allows you to view the snapshot of the selected resource.
| Category | Detailed description |
|---|---|
| Snapshot name | Snapshot name |
| Explanation | Snapshot description |
| Volume capacity | Snapshot target Block Storage original volume capacity
|
| Creation timestamp | Snapshot creation time |
| status | Snapshot status
|
| Additional features > More | Snapshot management button
|
| Delete | Select the snapshots to delete from the snapshot list and delete them all at once. |
Snapshots can affect volume capacity management. Delete unnecessary snapshots after use.
- Snapshot recovery can be performed while the server is not connected.
- The snapshot creation time is based on Asia/Seoul (GMT +09:00).
- When the snapshot recovery button is clicked, the volume is restored to the latest snapshot in the Available state.
- When you select Create recovery volume on the snapshot list page, a new volume based on the snapshot is created without modifying the existing volume.
- Snapshots that contain vProtect are temporary snapshots created when using the Backup service and are not charged.
tag
Block Storage List page lets you view the tag information of the selected resource, and add, modify, or delete it.
| Category | Detailed description |
|---|---|
| Tag list | Tag list
|
Work History
On the Block Storage List page, you can view the operation history of the selected resource.
| Category | Detailed description |
|---|---|
| Task History List | Resource Change History
|
Managing Block Storage Resources
If you need to modify the settings of a created Block Storage or add or remove connected servers, you can perform the tasks on the Block Storage Details page.
Edit volume name
You can rename a volume. To rename a volume, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Block Storage menu. Navigate to the Block Storage list page.
- On the Block Storage List page, click the resource whose volume name you want to edit. You will be taken to the Block Storage Details page.
- Click the Edit button of the Volume name. The Edit Volume Name popup window opens.
- Enter the volume name and click the Confirm button.
-, _)Increase capacity
You can expand the volume capacity. To expand the capacity, follow the steps below.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Block Storage menu. Go to the Block Storage List page.
- Block Storage List page, click the resource to expand capacity. You will be taken to the Block Storage Details page.
- Click the Capacity Edit button. The Capacity Edit popup window opens.
- Enter the capacity and click the Confirm button.
- Size reduction is not provided.
- After expanding the capacity, you cannot restore from snapshots taken before the expansion.
- Snapshots created before the capacity expansion can only be used to recover by creating a new volume.
- Expansion to a larger capacity than the current one is possible within the range of 8 to 12,228 GB.
- Enter the number of Units provided in 8GB increments.
Modify connection server
You can connect to or disconnect from the server. To modify the connected server, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Block Storage menu. Go to the Block Storage List page.
- Block Storage List page, click the resource to edit the attached server. You will be taken to the Block Storage Details page.
- When adding a Virtual Server connection, click the Add button in the Connection Server field. The Add Connection Server popup window will open.
- After selecting the Virtual Server you want to connect to, click the Confirm button.
- If you are disconnecting the Virtual Server, click the Connection Server item’s Disconnect button.
- Be sure to perform the disconnect operation (Umount, Disk Offline) on the server before proceeding with the release.
- You can connect a Virtual Server created in the same location as the Block Storage.
- A Virtual Server that uses a Partition with a Server Group policy cannot be connected.
- HDD/SSD_MultiAttach disk type can be attached to two or more Virtual Servers, and there is no limit on the number of connections.
- A Windows-based Virtual Server cannot use MultiAttach disks and must use a separate replication method or solution.
- A Virtual Server can attach up to 26 volumes, including the OS by default.
- The OS default volume cannot be modified on the connected server, and the service cannot be terminated.
- When adding a connected server, it can be used after performing the connection tasks (Mount, Disk Online) on the server. For detailed information about server connections, refer to Connecting to Server.
Cancel Block Storage
You can reduce operating costs by terminating unused Block Storage. However, terminating the service may cause the running service to stop immediately, so you should carefully consider the impact of service interruption before proceeding with the termination.
- Please note that data cannot be recovered after termination.
- In the following cases, you cannot delete a Block Storage volume.
- Connecting to server
- OS default volume
- Connect Custom image to Virtual Server
- When the volume status is not Available, Error, Error Extending, Error Restoring, or Error Managing
- If you select two or more volumes to cancel, only the volumes that can be canceled will be canceled.
To cancel Block Storage, follow the steps below.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Block Storage menu. Go to the Block Storage List page.
- On the Block Storage List page, select the resource to terminate, and click the Terminate Service button.
- After termination is complete, verify on the Block Storage List page that the resource has been terminated.
8.2.1 - Connect to Server
When using a volume on the server, a connect or disconnect operation is required.
On the Block Storage Details page, add a connected server, then access the server to perform connection operations (Mount, Disk Online). After use, perform disconnection operations (Umount, Disk Offline) and then remove the connected server.
Connect to Server (Mount, Disk Online)
To use the volume added to the connected server, you must log into the server and perform the connection tasks (Mount, Disk Online). Follow the steps below.
Linux operating system
- Server OS: LINUX
- Mount location: /data
- Volume size: 24 GB
- File system: ext3, ext4, xfs etc
- Additional allocated disk: /dev/vdb
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Block Storage menu. Go to the Block Storage List page.
- On the Block Storage List page, click the resource to be used on the connected server. Navigate to the Block Storage Details page.
- After verifying the server in the Connection Server item, connect.
- Refer to the steps below to mount the volume (Mount).
Switch to root privileges
$ sudo -iCheck Disk
# lsblk NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT vda 252:0 0 24G 0 disk ├─vda1 252:1 0 23.9G 0 part [SWAP] └─vda14 252:14 0 4M 0 part / └─vda15 252:15 0 106M 0 part /boot/efi vdb 252:16 0 24G 0 diskCreate partition
# fdisk /dev/vdb Command (m for help): n Partition type: p primary (0 primary, 0 extended, 4 free) e extended Select (default p): p Partition number (1-4, default 1): 1 First sector (2048-50331646, default 2048): Last sector, +/-sectors or +/-size{K,M,G,T,P} (2048-50331646, default 50331646): Created a new partition 1 of type 'Linux' and of size 24 GiB. Command (m for help): w The partition table has been altered! Calling ioctl() to re-read partition table. Syncing disks.Partition format setting (example: ext4)
# lsblk NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT vda 252:0 0 24G 0 disk ├─vda1 252:1 0 23.9G 0 part [SWAP] └─vda14 252:14 0 4M 0 part / └─vda15 252:15 0 106M 0 part /boot/efi vdb 252:16 0 24G 0 disk └─vdb1 252:17 0 24G 0 part # mkfs.ext4 /dev/vdb1 mke2fs 1.46.5 (30-Dec-2021) ... Writing superblocks and filesystem accounting information: doneVolume Mount
# mkdir /data # mount /dev/vdb1 /data # lsblk NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT vda 252:0 0 24G 0 disk ├─vda1 252:1 0 23.9G 0 part [SWAP] └─vda14 252:14 0 4M 0 part / └─vda15 252:15 0 106M 0 part /boot/efi vdb 252:16 0 24G 0 disk └─vdb1 252:17 0 24G 0 part /data # vi /etc/fstab (추가) /dev/vdb1 /data ext4 defaults 0 0
| Item | Explanation |
|---|---|
| cat /etc/fstab | File system information file
|
| df -h | Check the total disk usage of mounted disks |
| fdisk -l | Check partition information
|
| command | Explanation |
|---|---|
| m | Check the usage of the fdisk command |
| n | Create a new partition |
| p | Verify the updated partition information |
| t | Change the system ID of the partition |
| w | Save partition information and exit fdisk. |
Windows operating system
- Click the All Services > Compute > Virtual Server menu. Navigate to the Virtual Server Service Home page.
- Click the Block Storage menu. You will be taken to the Block Storage List page.
- On the Block Storage List page, click the resource to be used on the connected server. You will be taken to the Block Storage Details page.
- Connection Server item, verify the server and then connect.
- Refer to the steps below to connect the volume (Disk Online).
After right-clicking the Windows Start icon, run
Computer Management.In the Computer Management tree structure, select
Storage > Disk ManagementCheck Disk
Disk Online
Disk initialization
Partition format
Check volume
Disconnect server (Umount, Disk Offline)
Access the server, perform the disconnect operations (Umount, Disk Offline), and then disconnect the server from the Console.
Follow the next procedure.
- If you disconnect the server from the console without performing the disconnect operations (Umount, Disk Offlline) on the server, a server state error (Hang) may occur.
- Be sure to perform the OS tasks first.
- For the OS default volume, you cannot modify the connected server or cancel the service.
Linux operating system
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Block Storage menu. Navigate to the Block Storage List page.
- On the Block Storage List page, click the resource to detach the attached server. You will be taken to the Block Storage Details page.
- After verifying the server in the Connection Server item, connect.
- Refer to the steps below to unmount the volume(Umount).
- Volume Umount
# umount /dev/vdb1 /data
# lsblk
NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT
vda 252:0 0 24G 0 disk
├─vda1 252:1 0 23.9G 0 part [SWAP]
└─vda14 252:14 0 4M 0 part /
└─vda15 252:15 0 106M 0 part /boot/efi
vdb 252:16 0 24G 0 disk
└─vdb1 252:17 0 24G 0 part
# vi /etc/fstab
(삭제) /dev/vdb1 /data ext4 defaults 0 0
Windows operating system
Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
Click the Block Storage menu. Go to the Block Storage List page.
Block Storage List page, click the resource to detach the connected server. Navigate to the Block Storage Details page.
In the Connection Server field, verify the server before connecting.
Unmount the mounted file system.
Please follow the procedure below to detach the volume (Disk Offline).
Right-click the Windows Start icon, then run
Computer Management.In the Computer Management tree structure, select
Storage > Disk ManagementRight-click the disk to be removed, then run
OfflineCheck disk status
8.2.2 - Use Snapshot
You can create, delete, or restore snapshots of a created Block Storage using snapshots. You can perform these actions on the Block Storage Details page and the Snapshot List page.
Creating a Snapshot
You can create a snapshot of the current point in time. To create a snapshot, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Block Storage menu. Go to the Block Storage List page.
- On the Block Storage List page, click the resource for which you want to create a snapshot. You will be taken to the Block Storage Details page.
- Click the Create Snapshot button. The Create Snapshot popup window opens.
- Enter the Snapshot name and description, then click the Confirm button. This creates a snapshot of the current point in time.
- Click the Snapshot List button. You will be taken to the Block Storage Snapshot List page.
- Check the generated snapshot.
Edit Snapshot
You can edit snapshot information. To modify the snapshot name or description, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Go to the Service Home page of Virtual Server.
- Click the Block Storage menu. Navigate to the Block Storage List page.
- Block Storage List page, click the resource to edit the snapshot information. You will be taken to the Block Storage Details page.
- Click the Snapshot List button. You will be taken to the Block Storage Snapshot List page.
- After confirming the snapshot to edit, click the More button.
- Click the Edit button. The Snapshot Edit popup window opens.
- Enter the Snapshot name or Description, and click the Confirm button.
Recover snapshot
Block Storage volumes can be restored to the latest snapshot while in the Available state. To restore a snapshot, follow the steps below.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Block Storage menu. Go to the Block Storage List page.
- On the Block Storage List page, click the resource to restore from a snapshot. You will be taken to the Block Storage Details page.
- If there are servers added in the Connected Server item, connect to the server and perform the disconnect operations (Umount, Disk Offline).
- For detailed information about disconnecting the server, see Disconnecting the Server.
- On the Block Storage Details page, click the Disconnect button in the Connected Server item to remove the server. The connected server will be removed.
- For detailed information on disconnecting the connection server, refer to Modify Connection Server.
- Click the Snapshot List button. You will be taken to the Block Storage Snapshot List page.
- Check the latest snapshot in the Available state. The volume will be restored from that snapshot.
- Click the Snapshot Recovery button. The Snapshot Recovery popup window opens.
- After verifying the Snapshot name and Creation timestamp, click the Confirm button.
- When recovery starts, the status becomes Reverting, and when it completes, it becomes Available.
- Detailed Information page button, click it. It navigates to the Block Storage Detailed Page.
- Click the Add button of the Connection Server. Reconnect the Virtual Server.
- For detailed information about adding a connection server, refer to Edit Connection Server.
- After connecting to the added server, perform the connection tasks (Mount, Disk Online) according to the operating system.
- For detailed information about server connection, please refer to Connect to Server.
- Snapshot recovery can be performed while the server is not connected.
- If you want to restore from a snapshot that is not the latest, you can recover by creating a recovery volume.
- Recovery is not possible in the following situations.
- When a Block Storage volume is not in the Available state.
- If a server is attached to a Block Storage volume
- If there are no recoverable snapshots.
- If the latest snapshot changes while creating a recovery.
- If the latest snapshot is not in the Available state.
- When the snapshot’s volume size differs from the Block Storage volume size (in case the volume was expanded)
Create a snapshot recovery volume
You can create a volume using a snapshot. To create a snapshot recovery volume, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Go to the Service Home page of Virtual Server.
- Click the Block Storage menu. Go to the Block Storage List page.
- On the Block Storage List page, click the resource to create a snapshot recovery volume. You will be taken to the Block Storage Details page.
- Click the Snapshot List button. You will be taken to the Block Storage Snapshot List page.
- After checking the Snapshot Name, Description and Creation Time, click the More button of the snapshot you want to restore.
- Click Create recovery volume. The Create snapshot recovery volume popup opens.
- Click the Confirm button. Go to the Create Block Storage page.
- On the Block Storage Creation page, enter the information required to create the service and select detailed options.
- Enter the volume name and size. You can specify a size that is greater than or equal to the original volume.
- The disk type is set to match the original and cannot be modified.
Category RequiredDetailed description Volume name Required Volume name - Enter up to 255 characters using English letters, numbers, spaces, and special characters (
-,_)
Disk type Required Select Disk Type - SSD_Provisioned: SSD volume with configurable IOPS and Throughput
- SSD/HDD: Standard SSD/HDD volume
- SSD/HDD_KMS: Additional encrypted volume
- SSD/HDD_MultiAttach: Volume that can be attached to more than one server
- Cannot be modified after service creation
- SSD_Provisioned: SSD volume with configurable IOPS and Throughput
- When creating the service via snapshot recovery volume creation, it is set identical to the original and cannot be modified
capacity Selection Capacity setting - Can be created within 8~12,228 GB
- Enter the number of units provided in 8 GB increments
- Enter a capacity equal to or larger than the original when creating the service via snapshot recovery volume creation
Max IOPS Required Enter the maximum IOPS value between 5,000 and 20,000 - Can be configured only when Disk type is SSD_Provisioned
Max Throughput Required Enter the maximum Throughput value between 250 and 1,000 - Disk type can be set only when it is SSD_Provisioned
Recovery snapshot name Selection Name of the recovery snapshot used when creating the volume - Provide the recovery snapshot name when creating a service via snapshot recovery volume creation
Table. Block Storage Service Information Input Items - Enter up to 255 characters using English letters, numbers, spaces, and special characters (
- Summary Check the detailed information and estimated billing amount generated in the panel, and click the Create button.
- When creation is complete, check the created resource on the Block Storage List page.
Delete Snapshot
You can select a snapshot to delete. To delete a snapshot, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Block Storage menu. You will be taken to the Block Storage list page.
- On the Block Storage List page, click the resource to delete the snapshot. You will be taken to the Block Storage Details page.
- Click the Snapshot List button. You will be taken to the Block Storage Snapshot List page.
- After checking Snapshot Name, Description and Creation Time, click the More button of the snapshot you want to delete.
- Click the Delete button. The snapshot will be removed from the Snapshot List page.
8.2.3 - Move Volume
You can transfer a volume to another account, and when you do, the volume is removed from its original location. You can perform volume transfer on the Block Storage List or Block Storage Details page.
Previous volume
You can transfer a volume to another Account within the same region. To transfer a volume, follow the steps below.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Block Storage menu. Go to the Block Storage List page.
- On the Block Storage List page, after selecting the resource to migrate, click the More > Migrate Volume button at the top left of the list.
- Or click the Volume Migration button at the top of the Block Storage Details page of the resource to be migrated.
- When the popup indicating volume transfer opens, verify the volume name you want to transfer, then click the Confirm button.
- When the pop‑up for previous completion opens, click the Confirm button. The Volume Transfer ID and Approval Key information will be downloaded as a text file.
- The volume changes to Awaiting Transfer state.
- Volume migration is possible within the same region.
- Volume migration is possible only when the volume is in Available state. If it is in In Use state, disconnect all attached servers.
Cancel previous volume
You can cancel after creating a volume migration. To cancel a volume migration, follow the steps below.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Block Storage menu. Go to the Block Storage List page.
- Block Storage List page, click the resource to cancel the volume migration. You will be taken to the Block Storage Details page.
- If the volume is in Awaiting Transfer status, it can be canceled.
- Click the Cancel previous volume button. The Cancel previous volume popup will open.
- After confirming the volume name for which you want to cancel the volume transfer, click the Confirm button.
- The volume changes to Available state.
Get previous volume
You can receive a volume from another Account within the same region. To receive a volume, follow these steps.
- Click the All Services > Compute > Virtual Server menu. Navigate to the Service Home page of Virtual Server.
- Click the Block Storage menu. Navigate to the Block Storage List page.
- On the Block Storage List page, click the More > Receive Volume Transfer button at the top left of the list. The Receive Volume Transfer popup opens.
- Volume Migration: Enter the Volume Migration ID and Approval Key provided when creating it.
- A volume is created on the Block Storage list page.
- It takes time for the changes to be applied.
- The transferred volume is removed from the account that initiated the volume transfer.
8.3 - API Reference
8.4 - CLI Reference
8.5 - Release Note
Block Storage
- An SSD volume with configurable IOPS and throughput has been added.
- When creating Block Storage, you can select the SSD_Provisioned disk type.
- You can set the IOPS and Throughput range.
- IOPS: 5,000 ~ 20,000
- Throughput: 250 ~ 1,000
- During the preview period, there is no additional charge; charges will be applied later.
- Snapshots are billed based on the size of the original Block Storage.
- Integrated with Cloud Monitoring.
- You can view IOPS, Latency, and Throughput information in Cloud Monitoring.
- Block Storage feature change
- HDD disk types have been added, and you can select the added type (HDD, HDD_MultiAttach, HDD_KMS) according to your purpose.
- Samsung Cloud Platform common feature changes
- Account, IAM, Service Home, tags, and other common CX changes have been applied.
- Added the SSD_KMS disk type.
- When SSD_KMS is selected, encryption using a KMS (Key Management Service) encryption key is added.
- We have launched a high‑performance storage service suitable for handling large‑scale data and database workloads.
- We have launched a high-performance storage service suitable for handling large-scale data and database workloads.















