The page has been translated by Gen AI.

Manage Cluster Fabric

Cluster Fabric is a service that helps manage the servers (GPU Node) included in a GPU Cluster. By using Cluster Fabric, you can move servers between GPU Clusters in the same Node pool and optimize GPU performance and speed within the same GPU Cluster.

Creating Cluster Fabric

Cluster Fabric can be created together with a GPU Node, and it cannot be created or deleted separately. If all GPU Nodes within a Cluster Fabric are terminated, the Cluster Fabric is automatically deleted.
If you have not created a GPU Node, please create a GPU Node first. For more information, see GPU Node 생성하기.

Check Cluster Fabric details

Notice
  • Cluster Fabric can be created together when a GPU node is created, and it cannot be created or deleted independently.
  • If all GPU nodes in the Cluster Fabric are terminated, the Cluster Fabric is automatically deleted.
  • If you have not created a GPU Node, please create a GPU Node first. For more details, refer to GPU Node 생성하기.

On the Cluster Fabric List page and the Cluster Fabric Details page, you can view the generated Cluster Fabric list and details and move the server.

  1. Click the All Services > Compute > Multi-node GPU Server menu. Go to the Service Home page of the Multi-node GPU Cluster.

  2. On the Service Home page, click the Cluster Fabric menu. You will be taken to the Cluster Fabric List page.

    • On the Cluster Fabric List page, you can view the resource list of GPU clusters created by the user.
    • Resource items beyond the required columns can be added via the Settings button.
      Category
      Required
      Detailed description
      Resource IDSelectionUser-created Cluster Fabric ID
      Cluster Fabric nameRequiredUser-created Cluster Fabric name
      Node poolSelectionA collection of nodes that can be grouped into the same Cluster Fabric
      Number of serversSelectionNumber of GPU Nodes
      Server typeSelectionServer type of GPU Node
      • Users can view the number of cores, memory capacity, and GPU type and count of the resources they created
      statusSelectionStatus of the user-created Cluster Fabric
      Creation date and timeSelectCluster Fabric creation timestamp
      Table. Cluster Fabric resource list items
  3. On the Cluster Fabric List page, click the resource to view detailed information. You will be taken to the Cluster Fabric Details page.

    • Cluster Fabric Details At the top of the page, status information and descriptions of additional features are displayed.
      CategoryDetailed description
      Cluster Fabric statusStatus of the user-created Cluster Fabric
      • Creating: State while the cluster is being created
      • Active: State when creation is complete and the cluster is usable
      • Editing: State while the IP is being changed
      • Deleting: State while being terminated
      • Deleted: State after termination is complete
      Add target serverA feature that allows moving a server from another cluster to the target cluster.
      Table. Cluster Fabric status information and additional features

Detailed Information

On the Cluster Fabric List page’s Details Tab, you can view detailed information of the selected resource and retrieve servers from another cluster.

CategoryDetailed description
serviceService name
Resource TypeResource Type
SRNUnique resource ID in Samsung Cloud Platform
  • In Cluster Fabric, it refers to the Cluster Fabric SRN
Resource nameResource name
  • In the Cluster Fabric service, it refers to the Cluster Fabric name
Resource IDUnique resource ID in the service
constructorUser who created the service
Creation date and timeService creation date and time
editorUser who edited the service information
Modification dateDate and time the service information was modified
Cluster Fabric nameUser-created Cluster Fabric name
Node poolA set of nodes that can be grouped into the same Cluster Fabric
target serverGPU Node list bound to Cluster Fabric
  • Server name, server type, IP, status
Table. Cluster Fabric detailed information tab items

Import Cluster Fabric Server

Cluster Fabric Details page’s add target server feature allows you to import servers from another cluster and add them to the selected cluster.

  1. Click the All Services > Compute > Multi-node GPU Server menu. Navigate to the Service Home page of the Multi-node GPU Cluster.
  2. On the Service Home page, click the Cluster Fabric menu. You will be taken to the Cluster Fabric list page.
  3. On the Cluster Fabric List page, click the resource to view detailed information. You will be taken to the Cluster Fabric Details page.
  4. In the target server of the Details tab, click the Add button on the right.
    • The add target server popup window opens.
      • Select a cluster from Cluster Fabric.
      • GPU nodes associated with the selected cluster are listed; select the GPU node you want to retrieve.
      • The selected GPU Node’s name is displayed at the bottom.
      • Press the Confirm button to complete.
      • Pressing the Cancel button cancels the operation.
    • Verify that the GPU node added on the target server is displayed.

Terminate Cluster Fabric

If all GPU Nodes in the Cluster Fabric are terminated, the Cluster Fabric is automatically deleted. For more information, see Terminate GPU Node.

How-to guides
Install ServiceWatch Agent