The page has been translated by Gen AI.

Overview

Service Overview

GPU Server is a virtualized computing service that lets you allocate infrastructure resources such as CPU, GPU, and memory on demand, in the amount you need and at the time you need them, without purchasing each resource individually. It suits workloads that require fast computation in a cloud environment, such as AI model experimentation, prediction, and inference, and you can flexibly select performance-optimized resources based on the type and scale of the work. GPU Server provides the following features.

Provided Features

  • GPU Server Management: Through a web-based console, users can handle everything from GPU Server provisioning to monitoring and billing in a self-service manner, including creating, modifying, and deleting servers.
  • Product offerings by GPU quantity: Depending on the project’s purpose and scale, you can freely select the number of H100/A100 GPUs to configure a virtual server.
  • High-Performance GPU Provision: GPUs are attached to the virtual server using a pass-through method, providing performance on par with a physical server.
  • Storage Connection: In addition to the OS disk, you can attach additional storage. Block Storage, File Storage, and Object Storage can be connected and used.
  • Strong Security: The Security Group service controls inbound and outbound traffic between the server and the external internet or other VPCs (Virtual Private Clouds), keeping the server secure.
  • Monitoring: You can view monitoring information, such as the status of computing resources including CPU, memory, disk, and GPU, through the Cloud Monitoring service.
  • Network Configuration Management: The server’s subnet/IP can be easily changed from the values set at initial creation. NAT IP can be enabled or disabled as needed.
  • Key Pair method: For secure OS access, Key Pair authentication is provided instead of ID/password login.
  • Image Management: You can create and manage Custom Images and share them between projects.
  • ServiceWatch Service Integration: You can monitor data through the ServiceWatch service.

Component

GPU Server provides GPUs, NVSwitch, and NVLink on top of virtualized computing resources.

Caution
  • NVSwitch can only be enabled and used for instance types that allocate eight GPUs on a single GPU server.

Specifications by GPU Type

A GPU (Graphics Processing Unit) performs the calculations needed to render the images that make up the computer screen. Because it is specialized for parallel processing, it can also handle large volumes of data quickly, which makes it well suited to large-scale parallel workloads such as artificial intelligence (AI) and data analysis.

The following are the specifications of the GPU Types offered by the GPU Server service.

| Category | A100 Type | H100 Type | B300 Type |
|---|---|---|---|
| GPU Architecture | NVIDIA Ampere | NVIDIA Hopper | NVIDIA Blackwell Ultra |
| GPU Memory | 80 GiB | 80 GiB | 268 GiB |
| GPU Transistors | 54 billion 7N TSMC | 80 billion 4N TSMC | 208 billion 4NP TSMC |
| FP16 Tensor Core (Dense) | 312 TFLOPs | 989 TFLOPs | 2.25 PFLOPs |
| FP8 Tensor Core (Dense) | Not supported | 1,979 TFLOPs | 4.5 PFLOPs |
| FP4 Tensor Core (Dense) | Not supported | Not supported | 13.5 PFLOPs |
| GPU Memory Bandwidth | 2,039 GB/s HBM2e | 3,352 GB/s HBM3 | 8 TB/s HBM3e |
| NVLink performance | NVLink 3 | NVLink 4 | NVLink 5 |
| NVLink Signaling Rate | 25 GB/s (x12) | 25 GB/s (x18) | 50 GB/s (x18) |
| NVSwitch GPU-to-GPU bandwidth | 600 GB/s | 900 GB/s | 1.8 TB/s |
| Total NVSwitch aggregate bandwidth | 4.8 TB/s | 7.2 TB/s | 14.4 TB/s |
Table. Specifications by GPU Type
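
The three NVLink/NVSwitch rows in the table are related by simple arithmetic, which the following minimal sketch reproduces. It assumes that the signaling rate is quoted per link and per direction, that the GPU-to-GPU figure is bidirectional, and that the aggregate figure covers the eight GPUs of an NVSwitch-enabled server; the function and variable names are illustrative only and are not part of any GPU Server API.

```python
# Figures copied from the specification table.
# Assumption: the signaling rate is per link, per direction.
NVLINK = {
    "A100": {"rate_gb_s": 25, "links": 12},
    "H100": {"rate_gb_s": 25, "links": 18},
    "B300": {"rate_gb_s": 50, "links": 18},
}
GPUS_PER_NVSWITCH_SERVER = 8  # NVSwitch applies to eight-GPU server types only


def gpu_to_gpu_bandwidth_gb_s(gpu_type: str) -> int:
    """Bidirectional NVLink bandwidth of one GPU, in GB/s."""
    spec = NVLINK[gpu_type]
    return spec["rate_gb_s"] * spec["links"] * 2  # x2 for both directions


def aggregate_bandwidth_tb_s(gpu_type: str) -> float:
    """Total bandwidth across all eight GPUs, in TB/s (1 TB = 1000 GB)."""
    return gpu_to_gpu_bandwidth_gb_s(gpu_type) * GPUS_PER_NVSWITCH_SERVER / 1000


for name in NVLINK:
    print(name, gpu_to_gpu_bandwidth_gb_s(name), "GB/s per GPU,",
          aggregate_bandwidth_tb_s(name), "TB/s aggregate")
```

Under these assumptions the computed values match the table: 600, 900, and 1,800 GB/s per GPU, and 4.8, 7.2, and 14.4 TB/s in aggregate.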

Server type

The server types offered by GPU Server are as follows. For detailed information about each server type, refer to GPU Server Server Type.

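| Category | Server type | CPU vCore | Memory (GB) | Number of GPUs |
|---|---|---|---|---|
| GPU-A100-1 | g1v16a1 | 16 | 234 | 1 |
| GPU-A100-1 | g1v32a2 | 32 | 468 | 2 |
| GPU-A100-1 | g1v64a4 | 64 | 936 | 4 |
| GPU-A100-1 | g1v128a8 | 128 | 1,872 | 8 |
| GPU-H100-2 | g2v12h1 | 12 | 234 | 1 |
| GPU-H100-2 | g2v24h2 | 24 | 468 | 2 |
| GPU-H100-2 | g2v48h4 | 48 | 936 | 4 |
| GPU-H100-2 | g2v96h8 | 96 | 1,872 | 8 |
| GPU-B300-3 | g3v16b1 | 16 | 480 | 1 |
| GPU-B300-3 | g3v32b2 | 32 | 960 | 2 |
| GPU-B300-3 | g3v64b4 | 64 | 1,920 | 4 |
| GPU-B300-3 | g3v128b8 | 128 | 3,840 | 8 |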
Table. GPU Server server type
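
As a rough illustration of choosing a server type by the scale of the work, the sketch below encodes the table rows and picks the smallest listed type that satisfies a GPU count and memory requirement. The `ServerType` tuple and the `smallest_fit` and `nvswitch_capable` helpers are hypothetical names introduced for this example; they are not part of the GPU Server console or any API.

```python
from typing import NamedTuple, Optional


class ServerType(NamedTuple):
    category: str
    name: str
    vcores: int
    memory_gb: int
    gpus: int


# Rows copied from the server type table.
SERVER_TYPES = [
    ServerType("GPU-A100-1", "g1v16a1", 16, 234, 1),
    ServerType("GPU-A100-1", "g1v32a2", 32, 468, 2),
    ServerType("GPU-A100-1", "g1v64a4", 64, 936, 4),
    ServerType("GPU-A100-1", "g1v128a8", 128, 1872, 8),
    ServerType("GPU-H100-2", "g2v12h1", 12, 234, 1),
    ServerType("GPU-H100-2", "g2v24h2", 24, 468, 2),
    ServerType("GPU-H100-2", "g2v48h4", 48, 936, 4),
    ServerType("GPU-H100-2", "g2v96h8", 96, 1872, 8),
    ServerType("GPU-B300-3", "g3v16b1", 16, 480, 1),
    ServerType("GPU-B300-3", "g3v32b2", 32, 960, 2),
    ServerType("GPU-B300-3", "g3v64b4", 64, 1920, 4),
    ServerType("GPU-B300-3", "g3v128b8", 128, 3840, 8),
]


def smallest_fit(gpu_model: str, min_gpus: int,
                 min_memory_gb: int = 0) -> Optional[ServerType]:
    """Return the smallest listed server type meeting the requirements."""
    candidates = [
        s for s in SERVER_TYPES
        if gpu_model in s.category
        and s.gpus >= min_gpus
        and s.memory_gb >= min_memory_gb
    ]
    # Fewest GPUs first; None when no listed type is large enough.
    return min(candidates, key=lambda s: s.gpus, default=None)


def nvswitch_capable(server: ServerType) -> bool:
    """NVSwitch is only available on eight-GPU server types."""
    return server.gpus == 8


choice = smallest_fit("H100", min_gpus=3)
print(choice.name if choice else "no matching server type")
```

Because the table offers only 1, 2, 4, or 8 GPUs per category, a request for three H100 GPUs resolves to the four-GPU type, and only the eight-GPU types pass the NVSwitch check.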

OS and GPU driver version

The operating systems (OS) supported by GPU Server are as follows. Note that B300-type GPUs are supported only from a specific GPU driver version onward, so take care when selecting an image.

| OS | OS version | GPU driver version | Server type classification |
|---|---|---|---|
| Ubuntu | 24.04 | 580.126.20 | GPU-B300-3, GPU-H100-2, GPU-A100-1 |
| Ubuntu | 24.04 | 570.195.03 | GPU-H100-2, GPU-A100-1 |
| Ubuntu | 22.04 | 535.183.06 | GPU-H100-2, GPU-A100-1 |
| RHEL | 9.6 | 580.126.20 | GPU-B300-3, GPU-H100-2, GPU-A100-1 |
| RHEL | 8.10 | 580.126.20 | GPU-B300-3, GPU-H100-2, GPU-A100-1 |
| RHEL | 8.10 | 535.183.06 | GPU-H100-2, GPU-A100-1 |
Table. GPU Server OS and GPU driver version
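
The image constraint can be checked mechanically. The following is a minimal sketch that encodes the table rows and filters them by server type classification; the `IMAGES` list and the `images_for` helper are hypothetical names used only for illustration, not an actual GPU Server interface.

```python
# Rows copied from the OS and GPU driver version table:
# (OS, OS version, GPU driver version, supported server type classifications)
IMAGES = [
    ("Ubuntu", "24.04", "580.126.20", {"GPU-B300-3", "GPU-H100-2", "GPU-A100-1"}),
    ("Ubuntu", "24.04", "570.195.03", {"GPU-H100-2", "GPU-A100-1"}),
    ("Ubuntu", "22.04", "535.183.06", {"GPU-H100-2", "GPU-A100-1"}),
    ("RHEL", "9.6", "580.126.20", {"GPU-B300-3", "GPU-H100-2", "GPU-A100-1"}),
    ("RHEL", "8.10", "580.126.20", {"GPU-B300-3", "GPU-H100-2", "GPU-A100-1"}),
    ("RHEL", "8.10", "535.183.06", {"GPU-H100-2", "GPU-A100-1"}),
]


def images_for(category: str) -> list:
    """List (OS, OS version, driver version) rows usable by a classification."""
    return [
        (os_name, os_version, driver)
        for os_name, os_version, driver, categories in IMAGES
        if category in categories
    ]


for row in images_for("GPU-B300-3"):
    print(*row)
```

For `GPU-B300-3` the filter leaves only the rows with driver version 580.126.20, while `GPU-H100-2` and `GPU-A100-1` match every row in the table.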

Prerequisite Services

The following services must be set up before you create a GPU Server. Prepare them in advance by referring to the user guide for each service.

| Service Category | Service | Detailed description |
|---|---|---|
| Networking | VPC | A service that provides an isolated virtual network in a cloud environment |
| Networking | Security Group | A virtual firewall that controls server traffic |
Table. GPU Server Prerequisite Services