The page has been translated by Gen AI.

Overview

Service Overview

GPU Server is a virtualized computing service that allows you to freely allocate and use as much infrastructure resources provided by the server such as CPU, GPU, and memory as needed at the desired time without having to purchase them individually. It is suitable for tasks that require fast computing speed such as AI model experimentation, prediction, and inference in a cloud environment, and allows you to flexibly select and use resources with optimized performance according to task type and scale. GPU Server provides the following features:

Provided Features

  • GPU Server Management: Users can directly manage creation, deletion, and changes as a Self Service from GPU Server provisioning to monitoring and billing through a web-based Console.
  • Provisioning by GPU Quantity: You can configure a virtual server by freely selecting the quantity of H100/A100 GPUs according to project purpose and scale.
  • High Performance GPU Provision: Provides high-performance GPU servers at the physical server level using the Pass-through method.
  • Storage Connection: Provides additional connected storage besides OS disks. You can connect and use Block Storage, File Storage, and Object Storage.
  • Strong Security Application: Protects servers safely by controlling Inbound/Outbound traffic exchanged with external internet or other VPCs (Virtual Private Cloud) through the Security Group service.
  • Monitoring: You can check monitoring information such as CPU, Memory, Disk, and GPU status corresponding to computing resources through the Cloud Monitoring service.
  • Network Setting Management: The server’s subnet/IP can be easily changed from the values set at initial creation. Provides management functionality that allows you to set use/terminate NAT IP as needed.
  • Key Pair Method: Provides a Key Pair method instead of ID/PW access for secure OS access.
  • Image Management: You can create and manage Custom Images and provides sharing functionality between projects.
  • ServiceWatch Service Integration Provision: You can monitor data through the ServiceWatch service.

Components

GPU Server provides GPU, NVSwitch, and NVLink on top of virtualized computing resources.

Warning
  • NVSwitch can be activated and used only for instance types that allocate 8 GPUs to a single GPU Server.

Specifications by GPU Type

GPU (Graphic Processing Unit) plays the role of performing calculations necessary to create images that make up the computer screen and is specialized for parallel processing, enabling it to process large amounts of data quickly, handling large-scale parallel operations such as artificial intelligence (AI) and data analysis.

The following are the specifications of GPU types provided by the GPU Server service.

ItemA100 TypeH100 Type
Service Provision MethodPass-throughPass-through
GPU ArchitectureNVIDIA AmpereNVIDIA Hopper
GPU Memory80 GB80 GB
GPU Transistors54 billion 7N TSMC80 billion 4N TSMC
FP16 Tensor Core (Dense)312 TFLOPs989 TFLOPs
FP8 Tensor Core (Dense)Not supported1,979 TFLOPs
FP4 Tensor Core (Dense)Not supportedNot supported
GPU Memory Bandwidth2,039 GB/s HBM2e3,352 GB/s HBM3
NVLink PerformanceNVLink 3NVLink 4
NVLink Signaling Rate25 GB/s (x12)25 GB/s (x18)
NVSwitch GPU-to-GPU Bandwidth600 GB/s900 GB/s
Total NVSwitch Aggregate Bandwidth4.8 TB/s7.2 TB/s
Table. GPU Type Specifications

Server Type

The server types provided by GPU Server are as follows. For a detailed description of the server types provided by GPU Server, see GPU Server Server Types.

ItemServer TypeCPU vCoreMemory(GB)GPU Quantity
GPU-A100-1g1v16a1162341
GPU-A100-1g1v32a2324682
GPU-A100-1g1v64a4649364
GPU-A100-1g1v128a812818728
GPU-H100-2g2v12h1122341
GPU-H100-2g2v24h2244682
GPU-H100-2g2v48h4489364
GPU-H100-2g2v96h89618728
Table. GPU Server Server Types

OS and GPU Driver Version

The operating systems (OS) supported by GPU Server are as follows:

OSOS VersionGPU Driver Version
Ubuntu22.04535.183.06
Ubuntu24.04570.195.03
RHEL8.10535.183.06
Table. GPU Server OS and GPU Driver Version

Prerequisite Services

This is a service that must be pre-installed before creating this service. Please prepare by referring to the user guide provided in advance.

Service CategoryServiceDescription
NetworkingVPCService that provides independent virtual networks in cloud environment
NetworkingSecurity GroupVirtual firewall that controls server traffic
Table. GPU Server prerequisite services
Release Note
Server Type