The page has been translated by Gen AI.

Server type

Multi-node GPU Cluster server type

Multi-node GPU Cluster server types are categorized by the GPU type they provide, and the GPU used in a Multi-node GPU Cluster is determined by the server type selected when creating a GPU Node. Select the server type based on the requirements of the application you want to run on the Multi-node GPU Cluster.

Server type names supported by the Multi-node GPU Cluster follow the format below.

Example: server type g2c96h8_metal

Category	Example	Detailed description
Server generation	g2	Provided server generation. g2: g means GPU server specification, and 2 means the generation
CPU	c96	Number of cores. c96: c means CPU, and 96 means the number of cores; allocated cores are physical cores
GPU	h8	GPU type and quantity. h8: h means the GPU type, and 8 means the GPU quantity

Table. Multi-node GPU Cluster server type format
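
The naming format above can be illustrated with a small parser. This is a minimal sketch, not an official tool: the regular expression is inferred from the format table and the two server type names on this page, the `ServerType` class and `parse_server_type` function are hypothetical names introduced here, and the trailing `_metal` part is not described in the table, so it is kept as an opaque suffix.

```python
import re
from typing import NamedTuple


class ServerType(NamedTuple):
    """Fields of a server type name (hypothetical helper type)."""
    generation: int   # "2" in "g2"
    cpu_cores: int    # "96" in "c96"
    gpu_code: str     # "h" in "h8"
    gpu_count: int    # "8" in "h8"
    suffix: str       # "metal"; not described in the format table


# Assumed pattern: g<generation>c<cores><gpu letter><gpu count>_<suffix>
_PATTERN = re.compile(r"g(\d+)c(\d+)([a-z])(\d+)_(\w+)")


def parse_server_type(name: str) -> ServerType:
    """Split a server type name into its documented components."""
    match = _PATTERN.fullmatch(name)
    if match is None:
        raise ValueError(f"unrecognized server type name: {name!r}")
    generation, cores, gpu_code, gpu_count, suffix = match.groups()
    return ServerType(int(generation), int(cores), gpu_code, int(gpu_count), suffix)


if __name__ == "__main__":
    for name in ("g2c96h8_metal", "g3c128b8_metal"):
        print(name, "->", parse_server_type(name))
```

Names that do not match the assumed pattern raise `ValueError` rather than being guessed at, since the format of any server type not listed on this page is unknown.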

g2 server type

The g2 server type is a GPU Bare Metal Server that uses NVIDIA H100 SXM GPUs, suitable for large-scale high-performance AI computation.

Provides 8 NVIDIA Hopper architecture-based H100 GPUs
Provides 1,979 TFLOPS of FP8 and 989 TFLOPS of FP16 Tensor Core performance per GPU
Supports up to 96 vCPUs and 2,048 GB of memory
Supports up to 1,600 Gbps NVIDIA InfiniBand RDMA network
Supports up to 100 Gbps service network
Supports 900 GB/s GPU P2P communication via NVSwitch within a node

Server type	GPU	GPU Memory	CPU (Core)	Memory	Disk	GPU P2P
g2c96h8_metal	H100	640 GiB	96 vCore	2 TB	SSD (OS) 960 GB x 2, NVMe SSD 3.84 TB x 4	900 GB/s NVSwitch

Table. Multi-node GPU Cluster server type specifications > H100 server type

g3 server type

The g3 server type is a GPU Bare Metal Server that uses NVIDIA B300 SXM GPUs, suitable for large-scale high-performance AI computation as well as LLM inference and deployment of generative AI services.

Provides 8 NVIDIA Blackwell Ultra architecture-based B300 GPUs
Provides 13.5 PFLOPS of FP4 and 4.5 PFLOPS of FP8 Tensor Core performance per GPU
Supports up to 128 vCPUs and 4,096 GB of memory
Supports up to 6,400 Gbps NVIDIA InfiniBand RDMA network
Supports up to 100 Gbps service network
Supports 1.8 TB/s GPU P2P communication via NVSwitch within a node

Server type	GPU	GPU Memory	CPU (Core)	Memory	Disk	GPU P2P
g3c128b8_metal	B300	2.1 TiB	128 vCore	4 TB	SSD (OS) 960 GB x 2, NVMe SSD 3.84 TB x 4	1.8 TB/s NVSwitch

Table. Multi-node GPU Cluster server type specifications > B300 server type
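
The per-GPU figures listed for the g2 and g3 server types can be turned into rough per-node numbers with simple arithmetic. The sketch below is illustrative only: `NodeSpec`, `SPECS`, and the two helper functions are hypothetical names, the values are copied from this page, the results are peak figures rather than measured throughput, and dividing the InfiniBand bandwidth evenly across the GPUs in a node is an assumption that this page does not state.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NodeSpec:
    """Per-node figures copied from the specification lists on this page."""
    gpu: str
    gpu_count: int
    fp8_tflops_per_gpu: float  # peak FP8 Tensor Core TFLOPS per GPU
    infiniband_gbps: int       # "up to" InfiniBand RDMA bandwidth per node


SPECS = {
    "g2c96h8_metal": NodeSpec("H100", 8, 1979.0, 1600),
    "g3c128b8_metal": NodeSpec("B300", 8, 4500.0, 6400),  # 4.5 PFLOPS = 4,500 TFLOPS
}


def node_fp8_pflops(spec: NodeSpec) -> float:
    """Peak FP8 PFLOPS per node: per-GPU TFLOPS x GPU count / 1000."""
    return spec.gpu_count * spec.fp8_tflops_per_gpu / 1000


def infiniband_gbps_per_gpu(spec: NodeSpec) -> float:
    """InfiniBand bandwidth per GPU, ASSUMING an even split across GPUs."""
    return spec.infiniband_gbps / spec.gpu_count


if __name__ == "__main__":
    for name, spec in SPECS.items():
        print(
            f"{name}: {node_fp8_pflops(spec):.3f} PFLOPS FP8 per node, "
            f"{infiniband_gbps_per_gpu(spec):.0f} Gbps InfiniBand per GPU"
        )
```

Under these assumptions, a g2 node peaks at roughly 15.8 PFLOPS of FP8 and a g3 node at 36 PFLOPS, which may help when estimating how many nodes a workload needs.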
