Manage Cluster Fabric
Cluster Fabric is a service for managing the servers (GPU Nodes) that belong to a GPU Cluster. With Cluster Fabric, you can move servers between GPU Clusters in the same Node pool and optimize GPU performance and speed within a GPU Cluster.
Creating Cluster Fabric
A Cluster Fabric is created together with a GPU Node; it cannot be created or deleted on its own. If all GPU Nodes within a Cluster Fabric are terminated, the Cluster Fabric is automatically deleted.
If you have not created a GPU Node yet, create one first. For more information, see Create a GPU Node.
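The lifecycle rule above (a Cluster Fabric appears with a GPU Node and disappears when its last GPU Node is terminated) can be sketched as a small in-memory model. This is an illustration only: `ClusterFabric`, `FabricRegistry`, and their methods are hypothetical names chosen for this example and are not part of any Samsung Cloud Platform API.

```python
from dataclasses import dataclass, field
from typing import Dict, Set


@dataclass
class ClusterFabric:
    """Hypothetical in-memory model of a Cluster Fabric (not a platform object)."""
    name: str
    node_pool: str
    gpu_nodes: Set[str] = field(default_factory=set)


class FabricRegistry:
    """Tracks fabrics and applies the documented lifecycle rule."""

    def __init__(self) -> None:
        self.fabrics: Dict[str, ClusterFabric] = {}

    def create_gpu_node(self, fabric_name: str, node_pool: str, node_name: str) -> ClusterFabric:
        # A fabric is never created on its own; it only appears
        # as a side effect of creating a GPU Node.
        fabric = self.fabrics.setdefault(
            fabric_name, ClusterFabric(fabric_name, node_pool)
        )
        fabric.gpu_nodes.add(node_name)
        return fabric

    def terminate_gpu_node(self, fabric_name: str, node_name: str) -> None:
        fabric = self.fabrics[fabric_name]
        fabric.gpu_nodes.discard(node_name)
        # When the last GPU Node is terminated, the fabric is deleted automatically.
        if not fabric.gpu_nodes:
            del self.fabrics[fabric_name]
```

In this sketch there is deliberately no `delete_fabric` method, mirroring the fact that a Cluster Fabric cannot be deleted separately.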
Check Cluster Fabric details
On the Cluster Fabric List page and the Cluster Fabric Details page, you can view the list and details of the Cluster Fabrics that have been created and move servers between them.
Click the All Services > Compute > Multi-node GPU Server menu. You will be taken to the Service Home page of the Multi-node GPU Cluster.
On the Service Home page, click the Cluster Fabric menu. You will be taken to the Cluster Fabric List page.
- On the Cluster Fabric List page, you can view the resource list of GPU clusters created by the user.
- Resource items beyond the required columns can be added via the Settings button.
| Category | Required | Detailed description |
|---|---|---|
| Resource ID | Optional | User-created Cluster Fabric ID |
| Cluster Fabric name | Required | User-created Cluster Fabric name |
| Node pool | Optional | A collection of nodes that can be grouped into the same Cluster Fabric |
| Number of servers | Optional | Number of GPU Nodes |
| Server type | Optional | Server type of the GPU Node. Users can view the number of cores, memory capacity, and GPU type and count of the resources they created |
| Status | Optional | Status of the user-created Cluster Fabric |
| Creation date and time | Optional | Cluster Fabric creation timestamp |

Table. Cluster Fabric resource list items
On the Cluster Fabric List page, click the resource to view detailed information. You will be taken to the Cluster Fabric Details page.
- At the top of the Cluster Fabric Details page, status information and descriptions of additional features are displayed.
| Category | Detailed description |
|---|---|
| Cluster Fabric status | Status of the user-created Cluster Fabric. Creating: state while the cluster is being created; Active: state when creation is complete and the cluster is usable; Editing: state while the IP is being changed; Deleting: state while being terminated; Deleted: state after termination is complete |
| Add target server | A feature that allows moving a server from another cluster to the target cluster |

Table. Cluster Fabric status information and additional features
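The status values listed above can be represented in a script as an enumeration, for example when checking a status label copied from the console. This is a minimal sketch: `FabricStatus`, `parse_status`, and `is_usable` are hypothetical helper names, and treating only Active as ready for use is an assumption based on the description of that state.

```python
from enum import Enum


class FabricStatus(Enum):
    """Cluster Fabric states as listed in the status table."""
    CREATING = "Creating"   # the cluster is being created
    ACTIVE = "Active"       # creation is complete and the cluster is usable
    EDITING = "Editing"     # the IP is being changed
    DELETING = "Deleting"   # the cluster is being terminated
    DELETED = "Deleted"     # termination is complete


def parse_status(label: str) -> FabricStatus:
    """Map a status label to the enum, ignoring case and surrounding spaces."""
    return FabricStatus(label.strip().capitalize())


def is_usable(status: FabricStatus) -> bool:
    # Assumption: only Active counts as usable, since it is the only
    # state the table describes that way.
    return status is FabricStatus.ACTIVE
```

An unknown label raises `ValueError`, which makes a misspelled or unexpected status visible instead of silently passing.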
Detailed Information
On the Details tab of the Cluster Fabric Details page, you can view detailed information about the selected resource and import servers from another cluster.
| Category | Detailed description |
|---|---|
| Service | Service name |
| Resource Type | Resource type |
| SRN | Unique resource ID in Samsung Cloud Platform |
| Resource name | Resource name |
| Resource ID | Unique resource ID in the service |
| Creator | User who created the service |
| Creation date and time | Date and time the service was created |
| Editor | User who edited the service information |
| Modification date and time | Date and time the service information was modified |
| Cluster Fabric name | User-created Cluster Fabric name |
| Node pool | A set of nodes that can be grouped into the same Cluster Fabric |
| Target server | List of GPU Nodes bound to the Cluster Fabric |
Import Cluster Fabric Server
The Add target server feature on the Cluster Fabric Details page lets you import servers from another cluster and add them to the selected cluster.
- Click the All Services > Compute > Multi-node GPU Server menu. Navigate to the Service Home page of the Multi-node GPU Cluster.
- On the Service Home page, click the Cluster Fabric menu. You will be taken to the Cluster Fabric list page.
- On the Cluster Fabric List page, click the resource to view detailed information. You will be taken to the Cluster Fabric Details page.
- In the Target server item of the Details tab, click the Add button on the right.
- The Add target server popup window opens.
- In Cluster Fabric, select the cluster to import from.
- The GPU Nodes associated with the selected cluster are listed; select the GPU Node you want to import.
- The name of the selected GPU Node is displayed at the bottom.
- Click the Confirm button to complete the operation.
- Clicking the Cancel button cancels the operation.
- Verify that the added GPU Node is displayed in the Target server item.
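The effect of the steps above can be sketched as a small in-memory model: a GPU Node leaves the source cluster and joins the target cluster, and both must be in the same Node pool. This is an illustration only. `Fabric` and `add_target_server` are hypothetical names for this example, not a platform API, and the error handling shown is an assumption about how an invalid request could be rejected.

```python
from dataclasses import dataclass, field
from typing import Set


@dataclass
class Fabric:
    """Hypothetical in-memory stand-in for a Cluster Fabric."""
    name: str
    node_pool: str
    gpu_nodes: Set[str] = field(default_factory=set)


def add_target_server(source: Fabric, target: Fabric, node: str) -> None:
    """Move one GPU Node from the source fabric into the target fabric."""
    # Servers can only be moved between clusters in the same Node pool.
    if source.node_pool != target.node_pool:
        raise ValueError(
            f"{source.name} and {target.name} are in different Node pools"
        )
    # The node must currently belong to the cluster selected in the popup.
    if node not in source.gpu_nodes:
        raise ValueError(f"{node} is not bound to {source.name}")
    source.gpu_nodes.remove(node)
    target.gpu_nodes.add(node)
```

After a successful call, the node appears only in the target's node set, which corresponds to the final verification step in the list.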
Terminate Cluster Fabric
If all GPU Nodes in the Cluster Fabric are terminated, the Cluster Fabric is automatically deleted. For more information, see Terminate GPU Node.