The page has been translated by Gen AI.

Server type

Multi-node GPU Cluster server type

Multi-node GPU Cluster is categorized based on the GPU Type it provides, and the GPU used in a Multi-node GPU Cluster is determined by the server type selected when creating a GPU Node. Select the server type based on the specifications of the application you want to run on a multi-node GPU cluster.

The server types supported by the Multi-node GPU Cluster are as follows.

  • Example: when the server type is g2c96h8_metal.
    CategoryexampleDetailed description
    Server generationg2Provided server generation
    • g2
      • g means GPU server specification
      • 2 means generation
    CPUc96Number of cores
    • c96: Allocated cores are physical cores
    GPUh8GPU type and quantity
    • h8: h means GPU type, and 8 means GPU quantity
    Table. Multi-node GPU Cluster server type format

g2 server type

The g2 server type is a GPU Bare Metal Server that uses NVIDIA H100 SXM GPUs, suitable for large-scale high-performance AI computation.

  • 8 NVIDIA Hopper Architecture-based H100 GPUs provided
  • Provides 1,979 TFLOPS FP8 Tensor Core performance per GPU, 989 TFLOPS FP16 Tensor Core performance.
  • Supports up to 96 vCPUs and 2,048 GB of memory
  • Supports up to 1,600 Gb/s NVIDIA InfiniBand RDMA network.
  • Service network up to 100 Gbps
  • 900 GB/s GPU P2P communication via NVSwitch within a node
Server typeGPUGPU MemoryCPU(Core)MemoryDiskGPU P2P
g2c96h8_metalH100640 GiB96 vCore2 TBSSD (OS) 960 GB * 2, NVMeSSD 3.84 TB * 4900 GB/s NVSwitch
Table. Multi-node GPU Cluster server type specifications > H100 server type

g3 server type

The g3 server type is a GPU Bare Metal Server that uses NVIDIA B300 SXM GPUs, suitable not only for large-scale high-performance AI computation but also for LLM inference and AI deployment for generative AI.

  • 8 NVIDIA Blackwell Ultra Architecture-based B300 GPUs provided
  • Provides 13.5 PFLOPS FP4 Tensor Core and 4.5 PFLOPS FP8 Tensor Core performance per GPU.
  • Supports up to 128 vCPUs and 4,096 GB of memory
  • Supports up to 6,400 Gb/s NVIDIA InfiniBand RDMA network
  • Service network up to 100 Gbps
  • 1.8 TB/s GPU P2P communication via NVSwitch within a node
Server typeGPUGPU MemoryCPU(Core)MemoryDiskGPU P2P
g3c128b8_metalB3002.1 TiB128 vCore4 TBSSD (OS) 960 GB * 2, NVMeSSD 3.84 TB * 41.8 TB/s NVSwitch
Table. Multi-node GPU Cluster server type specifications > B300 server type
Overview
Monitoring Metrics