The page has been translated by Gen AI.

Use Multi-instance GPU on GPU Server

After creating a GPU Server, you can enable the MIG (Multi-instance GPU) feature on the GPU Server’s VM (Guest OS) and create an instance for use.

NVIDIA Multi-instance GPU Introduction

NVIDIA Multi-instance GPU (hereafter referred to as MIG) supports safely partitioning a GPU into GPU instances and running CUDA applications starting with the NVIDIA Ampere architecture. Through this, multiple users can each utilize different GPU resources to achieve optimal GPU utilization. This feature is especially useful for workloads that do not fully utilize the GPU’s computing capacity, and users can run multiple workloads in parallel to maximize utilization.

Using Multi-instance GPU feature

To use the MIG feature, create an NVIDIA GPU Server on the Samsung Cloud Platform, then enable and disable MIG. The order of applying and removing MIG is as follows.

MIG application order
Enable MIG → Create GPU Instance → Create Compute Instance → Use MIG
MIG release order
Delete Compute Instance → Delete GPU Instance → Disable MIG feature (deactivate)
Reference
  • MIG can be used on Samsung Cloud Platform’s next-generation GPU Server or MNGC (Multi-node GPU Cluster).
  • For system requirements to use MIG, refer to the NVIDIA Multi-Instance GPU User Guide.

Applying and Using MIG

After activating the MIG and creating an Instance to assign tasks, the tasks proceed in the following order.

MIG application order
Enable MIG → Create GPU Instance → Create Compute Instance → Use MIG
Note
The example of applying MIG is explained based on an A100 GPU server.

Activate MIG

  1. Check the GPU status on the VM Instance (GuestOS) before applying MIG.

    • Check whether MIG mode is Disabled.
      Color mode
      $ nvidia-smi
      Mon Sep 27 08:37:08 2021
      +-----------------------------------------------------------------------------+
      | NVIDIA-SMI 470.57.02    Driver Version: 470.57.02    CUDA Version: 11.4 |
      | -------------------------------+----------------------+---------------------- |
      | GPU  Name        Persistence-M | Bus-Id        Disp.A | Volatile Uncorr. ECC |
      | Fan  Temp  Perf  Pwr:Usage/Cap | Memory-Usage | GPU-Util  Compute M. |
      |  |  | MIG M. |
      | ===============================+======================+====================== |
      | 0  NVDIA A100-SXM...  Off | 00000000:05:00.0 Off | 0 |
      | N/A   32C   P0    59W / 400W | 0MiB / 81251MiB | 0%      Default |
      |  |  | Disabled |
      +-------------------------------+----------------------+----------------------+
      
      +-----------------------------------------------------------------------------+
      | Processes: |
      | GPU   GI   CI       PID   Type   Process name                   GPU Memory |
      | ID   ID                                                   Usage |
      | ============================================================================= |
      | No running processes found |
      +-----------------------------------------------------------------------------+
      $ nvidia-smi
      Mon Sep 27 08:37:08 2021
      +-----------------------------------------------------------------------------+
      | NVIDIA-SMI 470.57.02    Driver Version: 470.57.02    CUDA Version: 11.4 |
      | -------------------------------+----------------------+---------------------- |
      | GPU  Name        Persistence-M | Bus-Id        Disp.A | Volatile Uncorr. ECC |
      | Fan  Temp  Perf  Pwr:Usage/Cap | Memory-Usage | GPU-Util  Compute M. |
      |  |  | MIG M. |
      | ===============================+======================+====================== |
      | 0  NVDIA A100-SXM...  Off | 00000000:05:00.0 Off | 0 |
      | N/A   32C   P0    59W / 400W | 0MiB / 81251MiB | 0%      Default |
      |  |  | Disabled |
      +-------------------------------+----------------------+----------------------+
      
      +-----------------------------------------------------------------------------+
      | Processes: |
      | GPU   GI   CI       PID   Type   Process name                   GPU Memory |
      | ID   ID                                                   Usage |
      | ============================================================================= |
      | No running processes found |
      +-----------------------------------------------------------------------------+
      Code block. nvidia-smi command - Check GPU disabled status (1)
      Color mode
      $ nvidia-smi –L
      GPU 0: NVIDIA A100-SXM-80GB (UUID: GPU-c956838f-494a-92b2-6818-56eb28fe25e0)
      $ nvidia-smi –L
      GPU 0: NVIDIA A100-SXM-80GB (UUID: GPU-c956838f-494a-92b2-6818-56eb28fe25e0)
      Code block. nvidia-smi command - Check GPU disabled status (2)
  2. Enable MIG (Enable) for each GPU on the VM Instance (GuestOS) and reboot the VM Instance.

    Color mode
    $ nvidia-smi –I 0 –mig 1
    Enabled MIG mode for GPU 00000000:05:00.0
    All done.
    
    # reboot
    $ nvidia-smi –I 0 –mig 1
    Enabled MIG mode for GPU 00000000:05:00.0
    All done.
    
    # reboot
    Code block. nvidia-smi command - enable MIG

Reference

When using a GPU and configuring MIG, you may encounter the following warning message. If the warning appears, check whether any programs are running on the GPU.

Warning: MIG mode is in pending enable state for GPU 00000000:05:00.0: In use by another client. 00000000:05:00.0 is currently being used by one or more other processes (e.g. CUDA application or a monitoring application such as another instance of nvidia-smi).
  1. Check the GPU status after applying MIG on the VM Instance(GuestOS).
    • Check whether MIG mode is Enabled.
      Color mode
      $ nvidia-smi
      Mon Sep 27 09:44:33 2021
      +-----------------------------------------------------------------------------+
      | NVIDIA-SMI 470.57.02    Driver Version: 470.57.02    CUDA Version: 11.4 |
      | -------------------------------+----------------------+---------------------- |
      | GPU  Name        Persistence-M | Bus-Id        Disp.A | Volatile Uncorr. ECC |
      | Fan  Temp  Perf  Pwr:Usage/Cap | Memory-Usage | GPU-Util  Compute M. |
      |  |  | MIG M. |
      | ===============================+======================+====================== |
      | 0  NVDIA A100-SXM...  Off | 00000000:05:00.0 Off | On |
      | N/A   32C   P0    59W / 400W | 0MiB / 81251MiB | 0%      Default |
      |  |  | Enabled |
      +-------------------------------+----------------------+----------------------+
      +-----------------------------------------------------------------------------+
      | MIG devices: |
      +-----------------------------------------------------------------------------+
      | GPU  GI  CI  MIG | Memory-Usage | Vol | Shared |
      | ID  ID  Dev | BAR1-Usage | SM     Unc | CE  ENC  DEC  OFA  JPG |
      |  |  | ECC |  |
      | ============================================================================= |
      | No MIG devices found |
      +-----------------------------------------------------------------------------+
      +-----------------------------------------------------------------------------+
      | Processes: |
      | GPU   GI   CI       PID   Type   Process name                   GPU Memory |
      | ID   ID                                                   Usage |
      | ============================================================================= |
      | No running processes found |
      +-----------------------------------------------------------------------------+
      $ nvidia-smi
      Mon Sep 27 09:44:33 2021
      +-----------------------------------------------------------------------------+
      | NVIDIA-SMI 470.57.02    Driver Version: 470.57.02    CUDA Version: 11.4 |
      | -------------------------------+----------------------+---------------------- |
      | GPU  Name        Persistence-M | Bus-Id        Disp.A | Volatile Uncorr. ECC |
      | Fan  Temp  Perf  Pwr:Usage/Cap | Memory-Usage | GPU-Util  Compute M. |
      |  |  | MIG M. |
      | ===============================+======================+====================== |
      | 0  NVDIA A100-SXM...  Off | 00000000:05:00.0 Off | On |
      | N/A   32C   P0    59W / 400W | 0MiB / 81251MiB | 0%      Default |
      |  |  | Enabled |
      +-------------------------------+----------------------+----------------------+
      +-----------------------------------------------------------------------------+
      | MIG devices: |
      +-----------------------------------------------------------------------------+
      | GPU  GI  CI  MIG | Memory-Usage | Vol | Shared |
      | ID  ID  Dev | BAR1-Usage | SM     Unc | CE  ENC  DEC  OFA  JPG |
      |  |  | ECC |  |
      | ============================================================================= |
      | No MIG devices found |
      +-----------------------------------------------------------------------------+
      +-----------------------------------------------------------------------------+
      | Processes: |
      | GPU   GI   CI       PID   Type   Process name                   GPU Memory |
      | ID   ID                                                   Usage |
      | ============================================================================= |
      | No running processes found |
      +-----------------------------------------------------------------------------+
      Code block. nvidia-smi command - Check GPU activation status (1)
      Color mode
      $ nvidia-smi –L
      GPU 0: NVIDIA A100-SXM-80GB (UUID: GPU-c956838f-494a-92b2-6818-56eb28fe25e0)
      $ nvidia-smi –L
      GPU 0: NVIDIA A100-SXM-80GB (UUID: GPU-c956838f-494a-92b2-6818-56eb28fe25e0)
      Code block. nvidia-smi command - Check GPU activation status (2)

GPU Instance creation

If you have enabled MIG and verified its status, you can create a GPU Instance.

  1. Check the list of MIG GPU Instance profiles that can be created.
    Color mode
    $ nvidia-smi mig -i [GPU ID] -lgip
    $ nvidia-smi mig -i [GPU ID] -lgip
    Code block. nvidia-smi command - view MIG GPU Instance profile list
Color mode
$ nvidia-smi mig -i 0 -lgip
+-----------------------------------------------------------------------------+
| GPU instance profiles: |
| GPU   Name             ID    Instances   Memory     P2P    SM    DEC   ENC |
| Free/Total   GiB              CE    JPEG  OFA |
| ============================================================================= |
| 0 MIG 1g.10gb        19    7/7         9.50       No     14     0     0 |
| 1     0     0 |
+-----------------------------------------------------------------------------+
| 0 MIG 1g.10gb+me     20    1/1         9.50       No     14     0     0 |
| 1     1     1 |
+-----------------------------------------------------------------------------+
| 0 MIG 2g.20gb        14    3/3         19.50      No     28     1     0 |
| 2     0     0 |
+-----------------------------------------------------------------------------+
| 0 MIG 3g.40gb         9    2/2         39.50      No     42     2     0 |
| 3     0     0 |
+-----------------------------------------------------------------------------+
| 0 MIG 4g.40gb         5    1/1         39.50      No     56     2     0 |
| 4     0     0 |
+-----------------------------------------------------------------------------+
| 0 MIG 7g.80gb         0    1/1         79.25      No     98     0     0 |
| 7     1     1 |
+-----------------------------------------------------------------------------+
$ nvidia-smi mig -i 0 -lgip
+-----------------------------------------------------------------------------+
| GPU instance profiles: |
| GPU   Name             ID    Instances   Memory     P2P    SM    DEC   ENC |
| Free/Total   GiB              CE    JPEG  OFA |
| ============================================================================= |
| 0 MIG 1g.10gb        19    7/7         9.50       No     14     0     0 |
| 1     0     0 |
+-----------------------------------------------------------------------------+
| 0 MIG 1g.10gb+me     20    1/1         9.50       No     14     0     0 |
| 1     1     1 |
+-----------------------------------------------------------------------------+
| 0 MIG 2g.20gb        14    3/3         19.50      No     28     1     0 |
| 2     0     0 |
+-----------------------------------------------------------------------------+
| 0 MIG 3g.40gb         9    2/2         39.50      No     42     2     0 |
| 3     0     0 |
+-----------------------------------------------------------------------------+
| 0 MIG 4g.40gb         5    1/1         39.50      No     56     2     0 |
| 4     0     0 |
+-----------------------------------------------------------------------------+
| 0 MIG 7g.80gb         0    1/1         79.25      No     98     0     0 |
| 7     1     1 |
+-----------------------------------------------------------------------------+
Code block. MIG GPU Instance profile list
Reference
Refer to the NVIDIA Multi-Instance GPU User Guide for GPU Instance profiles.
  1. After creating a MIG GPU Instance, check it.
    • Create GPU Instance

      Color mode
      $ nvidia-smi mig -i [GPU ID] -cgi [Profile ID]
      $ nvidia-smi mig -i [GPU ID] -cgi [Profile ID]
      code block. nvidia-smi command - GPU Instance creation
      Color mode
      $ nvidia-smi mig -i 0 -cgi 0
      Successfully created GPU instance ID 0 on GPU 0 using profile MIG 7g.80gb (ID 0)
      $ nvidia-smi mig -i 0 -cgi 0
      Successfully created GPU instance ID 0 on GPU 0 using profile MIG 7g.80gb (ID 0)
      Code block. nvidia-smi command - Example of creating a GPU Instance

    • Check GPU Instance

      Color mode
      $ nvidia-smi mig -i [GPU ID] -lgi
      $ nvidia-smi mig -i [GPU ID] -lgi
      code block. nvidia-smi command - check GPU Instance
      Color mode
      $ nvidia-smi mig -i 0 -lgi
      +--------------------------------------------------------+
      | GPU instances: |
      | GPU   Name               Profile  Instance  Placement |
      | ID       ID      Start:Size |
      | ======================================================== |
      | 0  MIG 7g.80gb            0        0         0:8 |
      +--------------------------------------------------------+
      $ nvidia-smi mig -i 0 -lgi
      +--------------------------------------------------------+
      | GPU instances: |
      | GPU   Name               Profile  Instance  Placement |
      | ID       ID      Start:Size |
      | ======================================================== |
      | 0  MIG 7g.80gb            0        0         0:8 |
      +--------------------------------------------------------+
      Code block. nvidia-smi command - example of checking GPU Instance

Compute Instance creation

If you have created a GPU Instance, you can create a Compute Instance.

  1. Check the MIG Compute Instance profiles you can create.

    Color mode
    $ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -lcip
    $ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -lcip
    Code block. nvidia-smi command - Check MIG Compute Instance profile
    Color mode
    $ nvidia-smi mig -i 0 -gi 0 -lcip
    +---------------------------------------------------------------------------------+
    | Compute instance profiles: |
    | GPU     GPU     Name            Profile  Instances   Exclusive      Shared |
    | GPU   Instance                     ID    Free/Total     SM       DEC  ENC  OFA |
    | ID                                                       CE   JPEG |
    | ================================================================================= |
    | 0      0      MIG 1c.7g.80gb     0      7/5           14       5    0    1 |
    | 7    1 |
    +---------------------------------------------------------------------------------+
    | 0      0      MIG 2c.7g.80gb     1      3/3           28       5    0    1 |
    | 7    1 |
    +---------------------------------------------------------------------------------+
    | 0      0      MIG 3c.7g.80gb     2      2/2           42       5    0    1 |
    | 7    1 |
    +---------------------------------------------------------------------------------+
    | 0      0      MIG 4c.7g.80gb     3      1/1           56       5    0    1 |
    | 7    1 |
    +---------------------------------------------------------------------------------+
    | 0      0      MIG 7g.80gb        4*     1/1           98       5    0    1 |
    | 7    1 |
    +---------------------------------------------------------------------------------+
    $ nvidia-smi mig -i 0 -gi 0 -lcip
    +---------------------------------------------------------------------------------+
    | Compute instance profiles: |
    | GPU     GPU     Name            Profile  Instances   Exclusive      Shared |
    | GPU   Instance                     ID    Free/Total     SM       DEC  ENC  OFA |
    | ID                                                       CE   JPEG |
    | ================================================================================= |
    | 0      0      MIG 1c.7g.80gb     0      7/5           14       5    0    1 |
    | 7    1 |
    +---------------------------------------------------------------------------------+
    | 0      0      MIG 2c.7g.80gb     1      3/3           28       5    0    1 |
    | 7    1 |
    +---------------------------------------------------------------------------------+
    | 0      0      MIG 3c.7g.80gb     2      2/2           42       5    0    1 |
    | 7    1 |
    +---------------------------------------------------------------------------------+
    | 0      0      MIG 4c.7g.80gb     3      1/1           56       5    0    1 |
    | 7    1 |
    +---------------------------------------------------------------------------------+
    | 0      0      MIG 7g.80gb        4*     1/1           98       5    0    1 |
    | 7    1 |
    +---------------------------------------------------------------------------------+
    Code block. Example of MIG Compute Instance profile list

  2. Create and verify a MIG Compute Instance.

    • MIG Compute Instance creation
      Color mode
      $ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -cci [Compute Profile ID]
      $ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -cci [Compute Profile ID]
      code block. nvidia-smi command - Create MIG Compute Instance
      Color mode
      $ nvidia-smi mig -i 0 -gi 0 -cci 4
      Successfully created compute instance ID 0 on GPU instance ID 0 using profile MIG 7g.80gb(ID 4)
      $ nvidia-smi mig -i 0 -gi 0 -cci 4
      Successfully created compute instance ID 0 on GPU instance ID 0 using profile MIG 7g.80gb(ID 4)
      Code block. nvidia-smi command - Example of creating a MIG Compute Instance
    • Check MIG Compute Instance
      Color mode
      $ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] –lci
      $ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] –lci
      code block. nvidia-smi command - check MIG Compute Instance
      Color mode
      $ nvidia-smi mig -i 0 -gi 0 –lci
      +-----------------------------------------------------------------+
      | Compute instance profiles: |
      | GPU     GPU     Name            Profile  Instances   Placement |
      | GPU   Instance                     ID      ID        Start:Size |
      | ID |
      | ================================================================= |
      | 0      0      MIG 7g.80gb         4       0            0:7 |
      +-----------------------------------------------------------------+
      $ nvidia-smi mig -i 0 -gi 0 –lci
      +-----------------------------------------------------------------+
      | Compute instance profiles: |
      | GPU     GPU     Name            Profile  Instances   Placement |
      | GPU   Instance                     ID      ID        Start:Size |
      | ID |
      | ================================================================= |
      | 0      0      MIG 7g.80gb         4       0            0:7 |
      +-----------------------------------------------------------------+
      Code block. Example of checking MIG Compute Instance
      Color mode
      $ nvidia-smi –L
      GPU 0: NVIDIA A100-SXM-80GB (UUID: GPU-c956838f-494a-92b2-6818-56eb28fe25e0)
        MIG 7g.80gb     Device  0: (UUID: MIG-53e20040-758b-5ecb-948e-c626d03a9a32)
      $ nvidia-smi –L
      GPU 0: NVIDIA A100-SXM-80GB (UUID: GPU-c956838f-494a-92b2-6818-56eb28fe25e0)
        MIG 7g.80gb     Device  0: (UUID: MIG-53e20040-758b-5ecb-948e-c626d03a9a32)
      Code block. nvidia-smi command - Check GPU status (1)
      Color mode
      $ nvidia-smi
      Mon Sep 27 09:52:17 2021
      +-----------------------------------------------------------------------------+
      | NVIDIA-SMI 470.57.02    Driver Version: 470.57.02    CUDA Version: 11.4 |
      | -------------------------------+----------------------+---------------------- |
      | GPU  Name        Persistence-M | Bus-Id        Disp.A | Volatile Uncorr. ECC |
      | Fan  Temp  Perf  Pwr:Usage/Cap | Memory-Usage | GPU-Util  Compute M. |
      |  |  | MIG M. |
      | ===============================+======================+====================== |
      | 0  NVDIA A100-SXM...  Off | 00000000:05:00.0 Off | On |
      | N/A   32C   P0    49W / 400W | 0MiB / 81251MiB | N/A      Default |
      |  |  | Enabled |
      +-------------------------------+----------------------+----------------------+
      
      +-----------------------------------------------------------------------------+
      | MIG devices: |
      +-----------------------------------------------------------------------------+
      | GPU  GI  CI  MIG | Memory-Usage | Vol | Shared |
      | ID  ID  Dev | BAR1-Usage | SM     Unc | CE  ENC  DEC  OFA  JPG |
      |  |  | ECC |  |
      | ============================================================================= |
      | 0    0   0    0 | 0MiB / 81251MiB | 98      0 | 7   0    5    1    1 |
      |  | 1MiB / 13107... |  |  |
      +-----------------------------------------------------------------------------+
      +-----------------------------------------------------------------------------+
      | Processes: |
      | GPU   GI   CI       PID   Type   Process name                   GPU Memory |
      | ID   ID                                                   Usage |
      | ============================================================================= |
      | No running processes found |
      +-----------------------------------------------------------------------------+
      $ nvidia-smi
      Mon Sep 27 09:52:17 2021
      +-----------------------------------------------------------------------------+
      | NVIDIA-SMI 470.57.02    Driver Version: 470.57.02    CUDA Version: 11.4 |
      | -------------------------------+----------------------+---------------------- |
      | GPU  Name        Persistence-M | Bus-Id        Disp.A | Volatile Uncorr. ECC |
      | Fan  Temp  Perf  Pwr:Usage/Cap | Memory-Usage | GPU-Util  Compute M. |
      |  |  | MIG M. |
      | ===============================+======================+====================== |
      | 0  NVDIA A100-SXM...  Off | 00000000:05:00.0 Off | On |
      | N/A   32C   P0    49W / 400W | 0MiB / 81251MiB | N/A      Default |
      |  |  | Enabled |
      +-------------------------------+----------------------+----------------------+
      
      +-----------------------------------------------------------------------------+
      | MIG devices: |
      +-----------------------------------------------------------------------------+
      | GPU  GI  CI  MIG | Memory-Usage | Vol | Shared |
      | ID  ID  Dev | BAR1-Usage | SM     Unc | CE  ENC  DEC  OFA  JPG |
      |  |  | ECC |  |
      | ============================================================================= |
      | 0    0   0    0 | 0MiB / 81251MiB | 98      0 | 7   0    5    1    1 |
      |  | 1MiB / 13107... |  |  |
      +-----------------------------------------------------------------------------+
      +-----------------------------------------------------------------------------+
      | Processes: |
      | GPU   GI   CI       PID   Type   Process name                   GPU Memory |
      | ID   ID                                                   Usage |
      | ============================================================================= |
      | No running processes found |
      +-----------------------------------------------------------------------------+
      Code block. nvidia-smi command - Check GPU status (2)

Using MIG

  1. Use the MIG Instance to perform the Job.
    • Example of task execution
      Color mode
      $ docker run --gpus '"device=[GPU ID]:[MIG ID]"' -rm nvcr.io/nvidia/cuda nvidia-smi
      $ docker run --gpus '"device=[GPU ID]:[MIG ID]"' -rm nvcr.io/nvidia/cuda nvidia-smi
      Code block. Task execution example
    • You can see an example of the work performed as follows.
      Color mode
      $ docker run --gpus '"device=0:0"' -rm -it --network=host --shm-size=1g --ipc=host -v /root/.ssh/:/root/.ssh
      
      ================
      == TensorFlow ==
      ================
      
      NVIDIA Release 21.08-tf1 (build 26012104)
      TensorFlow Version 1.15.5
      
      Container image Copyright (c) 2021, NVIDIA CORPORATION. All right reserved.
      ...
      
      # Run Python process
      root@d622a93c9281:/workspace# python /workspace/nvidia-examples/cnn/resnet.py --num_iter 100
      ...
      PY 3.8.10 (default, Jun 2 2021, 10:49:15)
      [GCC 9.4.0]
      TF 1.15.5
      ...
      $ docker run --gpus '"device=0:0"' -rm -it --network=host --shm-size=1g --ipc=host -v /root/.ssh/:/root/.ssh
      
      ================
      == TensorFlow ==
      ================
      
      NVIDIA Release 21.08-tf1 (build 26012104)
      TensorFlow Version 1.15.5
      
      Container image Copyright (c) 2021, NVIDIA CORPORATION. All right reserved.
      ...
      
      # Run Python process
      root@d622a93c9281:/workspace# python /workspace/nvidia-examples/cnn/resnet.py --num_iter 100
      ...
      PY 3.8.10 (default, Jun 2 2021, 10:49:15)
      [GCC 9.4.0]
      TF 1.15.5
      ...
      Code block. Operation result
  2. Check the GPU usage. (Create JOB process)
    • When the job runs, you can see that a process is allocated to the MIG device and its utilization increases.
      Color mode
      $ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -lcip
      $ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -lcip
      code block. nvidia-smi command - check GPU utilization
    • You can check the GPU usage as shown below.
      Color mode
      +-----------------------------------------------------------------------------+
      | MIG devices: |
      +-----------------------------------------------------------------------------+
      | GPU  GI  CI  MIG | Memory-Usage | Vol | Shared |
      | ID  ID  Dev | BAR1-Usage | SM     Unc | CE  ENC  DEC  OFA  JPG |
      |  |  | ECC |  |
      | ============================================================================= |
      | 0    0   0    0 | 66562MiB / 81251MiB | 98      0 | 7   0    5    1    1 |
      |  | 5MiB / 13107... |  |  |
      +-----------------------------------------------------------------------------+
      +-----------------------------------------------------------------------------+
      | Processes: |
      | GPU   GI   CI       PID   Type   Process name                   GPU Memory |
      | ID   ID                                                   Usage |
      | ============================================================================= |
      | 0     0    0     17483      C   python                           66559MiB |
      +-----------------------------------------------------------------------------+
      +-----------------------------------------------------------------------------+
      | MIG devices: |
      +-----------------------------------------------------------------------------+
      | GPU  GI  CI  MIG | Memory-Usage | Vol | Shared |
      | ID  ID  Dev | BAR1-Usage | SM     Unc | CE  ENC  DEC  OFA  JPG |
      |  |  | ECC |  |
      | ============================================================================= |
      | 0    0   0    0 | 66562MiB / 81251MiB | 98      0 | 7   0    5    1    1 |
      |  | 5MiB / 13107... |  |  |
      +-----------------------------------------------------------------------------+
      +-----------------------------------------------------------------------------+
      | Processes: |
      | GPU   GI   CI       PID   Type   Process name                   GPU Memory |
      | ID   ID                                                   Usage |
      | ============================================================================= |
      | 0     0    0     17483      C   python                           66559MiB |
      +-----------------------------------------------------------------------------+
      Code block. Example of checking GPU utilization.

Delete and release MIG Instance

Follow these steps to delete the MIG instance and detach the MIG.

MIG release order
Delete Compute Instance → Delete GPU Instance → Disable MIG feature (deactivate)

Compute Instance Delete

  • Delete the Compute Instance.
    Color mode
    $ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] –dci
    $ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -ci [Compute Instance] –dci
    $ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] –dci
    $ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] -ci [Compute Instance] –dci
    Code block. nvidia-smi command - Delete Compute Instance
    Color mode
    $ nvidia-smi mig -i 0 -gi 0 –lci
    +-----------------------------------------------------------------+
    | Compute instance profiles: |
    | GPU     GPU     Name            Profile  Instances   Placement |
    | GPU   Instance                     ID      ID        Start:Size |
    | ID |
    | ================================================================= |
    | 0      0      MIG 7g.80gb         4       0            0:7 |
    +-----------------------------------------------------------------+
    $ nvidia-smi mig -i 0 -gi 0 –lci
    +-----------------------------------------------------------------+
    | Compute instance profiles: |
    | GPU     GPU     Name            Profile  Instances   Placement |
    | GPU   Instance                     ID      ID        Start:Size |
    | ID |
    | ================================================================= |
    | 0      0      MIG 7g.80gb         4       0            0:7 |
    +-----------------------------------------------------------------+
    code block. Example of checking MIG Compute Instance
    Color mode
    $ nvidia-smi mig -i 0 -gi 0 –dci
    Successfully destroyed compute instance ID  0 from GPU instance ID  0
    $ nvidia-smi mig -i 0 -gi 0 –dci
    Successfully destroyed compute instance ID  0 from GPU instance ID  0
    Code block. Compute Instance deletion example
    Color mode
    $ nvidia-smi mig -i 0 -gi 0 –lci
    No compute instances found: Not found
    $ nvidia-smi mig -i 0 -gi 0 –lci
    No compute instances found: Not found
    Code block. Confirm Compute Instance deletion

Delete GPU Instance

  • Delete the GPU Instance.
    Color mode
    $ nvidia-smi mig -i [GPU ID] –dgi
    $ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] –dgi
    $ nvidia-smi mig -i [GPU ID] –dgi
    $ nvidia-smi mig -i [GPU ID] -gi [GPU Instance ID] –dgi
    Code block. nvidia-smi command - Delete GPU Instance
    Color mode
    $ nvidia-smi mig -i 0 -lgi
    +--------------------------------------------------------+
    | GPU instances: |
    | GPU   Name               Profile  Instance  Placement |
    | ID       ID      Start:Size |
    | ======================================================== |
    | 0  MIG 7g.80gb            0        0         0:8 |
    +--------------------------------------------------------+
    $ nvidia-smi mig -i 0 -lgi
    +--------------------------------------------------------+
    | GPU instances: |
    | GPU   Name               Profile  Instance  Placement |
    | ID       ID      Start:Size |
    | ======================================================== |
    | 0  MIG 7g.80gb            0        0         0:8 |
    +--------------------------------------------------------+
    Code block. nvidia-smi command - Example of checking GPU Instance
    Color mode
    $ nvidia-smi mig -i 0 -dgi
    Successfully destroyed GPU instance ID  0 from GPU  0
    $ nvidia-smi mig -i 0 -dgi
    Successfully destroyed GPU instance ID  0 from GPU  0
    Code block. nvidia-smi command - GPU Instance deletion example
    Color mode
    $ nvidia-smi mig -i 0 -lgi
    No GPU instances found: Not found
    $ nvidia-smi mig -i 0 -lgi
    No GPU instances found: Not found
    code block. nvidia-smi command - example of deleting a GPU Instance

Disable MIG feature (deactivation)

  • After disabling MIG (Disable), reboot.
    Color mode
    $ nvidia-smi -mig 0
    Disabled MIG Mode for GPU 00000000:05:00.0
    
    All done.
    $ nvidia-smi -mig 0
    Disabled MIG Mode for GPU 00000000:05:00.0
    
    All done.
    Code block. nvidia-smi command - disable MIG
    Color mode
    $ nvidia-smi
    Mon Sep 30 05:18:28 2021
    +-----------------------------------------------------------------------------+
    | NVIDIA-SMI 470.57.02    Driver Version: 470.57.02    CUDA Version: 11.4 |
    | -------------------------------+----------------------+---------------------- |
    | GPU  Name        Persistence-M | Bus-Id        Disp.A | Volatile Uncorr. ECC |
    | Fan  Temp  Perf  Pwr:Usage/Cap | Memory-Usage | GPU-Util  Compute M. |
    |  |  | MIG M. |
    | ===============================+======================+====================== |
    | 0  NVDIA A100-SXM...  Off | 00000000:05:00.0 Off | 0 |
    | N/A   33C   P0    60W / 400W | 0MiB / 81251MiB | 0%      Default |
    |  |  | Disabled |
    +-------------------------------+----------------------+----------------------+
    +-----------------------------------------------------------------------------+
    | MIG devices: |
    +-----------------------------------------------------------------------------+
    | GPU  GI  CI  MIG | Memory-Usage | Vol | Shared |
    | ID  ID  Dev | BAR1-Usage | SM     Unc | CE  ENC  DEC  OFA  JPG |
    |  |  | ECC |  |
    | ============================================================================= |
    | No MIG devices found |
    +-----------------------------------------------------------------------------+
    +-----------------------------------------------------------------------------+
    | Processes: |
    | GPU   GI   CI       PID   Type   Process name                   GPU Memory |
    | ID   ID                                                   Usage |
    | ============================================================================= |
    | No running processes found |
    +-----------------------------------------------------------------------------+
    $ nvidia-smi
    Mon Sep 30 05:18:28 2021
    +-----------------------------------------------------------------------------+
    | NVIDIA-SMI 470.57.02    Driver Version: 470.57.02    CUDA Version: 11.4 |
    | -------------------------------+----------------------+---------------------- |
    | GPU  Name        Persistence-M | Bus-Id        Disp.A | Volatile Uncorr. ECC |
    | Fan  Temp  Perf  Pwr:Usage/Cap | Memory-Usage | GPU-Util  Compute M. |
    |  |  | MIG M. |
    | ===============================+======================+====================== |
    | 0  NVDIA A100-SXM...  Off | 00000000:05:00.0 Off | 0 |
    | N/A   33C   P0    60W / 400W | 0MiB / 81251MiB | 0%      Default |
    |  |  | Disabled |
    +-------------------------------+----------------------+----------------------+
    +-----------------------------------------------------------------------------+
    | MIG devices: |
    +-----------------------------------------------------------------------------+
    | GPU  GI  CI  MIG | Memory-Usage | Vol | Shared |
    | ID  ID  Dev | BAR1-Usage | SM     Unc | CE  ENC  DEC  OFA  JPG |
    |  |  | ECC |  |
    | ============================================================================= |
    | No MIG devices found |
    +-----------------------------------------------------------------------------+
    +-----------------------------------------------------------------------------+
    | Processes: |
    | GPU   GI   CI       PID   Type   Process name                   GPU Memory |
    | ID   ID                                                   Usage |
    | ============================================================================= |
    | No running processes found |
    +-----------------------------------------------------------------------------+
    Code block. nvidia-smi command - check GPU status
Manage Keypair
Use NVSwitch on GPU Server