Use GPUs with Discrete Device Assignment in clustered VMs

06/23/2025
Applies to: ✅ Windows Server 2025, ✅ Azure Local 2311.2 and later

You can include graphics processing units (GPUs) in your clusters to provide GPU acceleration to workloads running in clustered VMs. GPU acceleration can be provided via Discrete Device Assignment (DDA), which allows you to dedicate one or more physical GPUs to a VM, or through GPU Partitioning. Clustered VMs can take advantage of GPU acceleration, and clustering capabilities such as high availability via failover.

In this article, you'll learn how to use GPUs with clustered VMs to provide GPU acceleration to workloads using Discrete Device Assignment. This article guides you through preparing the cluster, assigning a GPU to a cluster VM, and failing over that VM using Windows Admin Center and PowerShell.

Tip

Live migration of virtual machines (VMs) using GPUs provided by DDA isn't currently supported, but VMs can be automatically restarted and placed where GPU resources are available if there's a failure. Looking to use Live Migration in clustered VMs? Consider using GPU partitioning. GPU partitioning allows you to share a fraction of the GPU instead of the entire GPU. To learn more about when to use GPU partition and support for live migration, see Partition and assign GPUs to a virtual machine.

Prerequisites

There are several requirements and things to consider before you begin to use GPUs with clustered VMs:

You need Azure Local 2311.2 and later.
Review how to manage GPUs in Azure Local 2311.2 and later, see Prepare GPUs for Azure Local.

You need a Windows Server Failover cluster running Windows Server 2025 or later.

You must have a familiarity with Failover clustering and Hyper-V.
You must install the same make and model of the GPUs across all the servers in your cluster.
Review and follow the instructions from your GPU manufacturer to install the necessary drivers and software on each server in the cluster.
Depending on your hardware vendor, you might also need to configure any GPU licensing requirements.
You need a machine with Windows Admin Center installed. This machine could be one of your cluster nodes.

Create a VM to assign the GPU to. Prepare that VM for DDA by setting its cache behavior, stop action, and memory-mapped I/O (MMIO) properties according to the instructions in Deploy graphics devices using Discrete Device Assignment.
Prepare the GPUs in each server by installing security mitigation drivers on each server, disabling the GPUs, and dismounting them from the host. To learn more about this process, see Deploy graphics devices by using Discrete Device Assignment.

Follow the steps in Plan for deploying devices by using Discrete Device Assignment to prepare GPU devices in the cluster.
Make sure your device has enough MMIO space allocated within the VM. For more information, see MMIO Space.
Create a VM to assign the GPU to. Prepare that VM for DDA by setting its cache behavior, stop action, and memory-mapped I/O (MMIO) properties according to the instructions in Deploy graphics devices using Discrete Device Assignment.
Prepare the GPUs in each server by installing security mitigation drivers on each server, disabling the GPUs, and dismounting them from the host. To learn more about this process, see Deploy graphics devices by using Discrete Device Assignment.

Note

Your system must be supported Azure Local solution with GPU support. To browse options, visit the Azure Local Catalog.

Prepare the cluster

When the prerequisites are complete, you can prepare the cluster to use GPUs with clustered VMs.

Preparing the cluster involves creating a resource pool that contains the GPUs that are available for assignment to VMs. The cluster uses this pool to determine VM placement for any started or moved VMs that are assigned to the GPU resource pool.

Windows Admin Center
PowerShell

Using Windows Admin Center, follow these steps to prepare the cluster to use GPUs with clustered VMs.

To prepare the cluster and assign a VM to a GPU resource pool:

Launch Windows Admin Center and make sure the GPUs extension is already installed.
Select Cluster Manager from the top dropdown menu and connect to your cluster.
From the Settings menu, select Extensions > GPUs.
On the Tools menu, under Extensions, select GPUs to open the tool.
On tool's main page, select the GPU pools tab, and then select Create GPU pool.
On the New GPU pool page, specify the following and then select Save:
1. Server name
2. GPU pool name
3. GPUs that you want to add to the pool
After the process completes, you'll receive a success prompt that shows the name of the new GPU pool and the host server.

Follow these steps to prepare the cluster to use GPUs with clustered VMs using PowerShell.

Create a new empty resource pool on each server containing the clustered GPU resources. Make sure to provide the same pool name on each server.

In PowerShell, run the following cmdlet as an administrator:
```
 New-VMResourcePool -ResourcePoolType PciExpress -Name "GpuChildPool"
```
Add the dismounted GPUs from each server to the resource pool that you created in the previous step.

In PowerShell, run the following commands:
```
 $gpu = Get-VMHostAssignableDevice

 Add-VMHostAssignableDevice -HostAssignableDevice $gpu -ResourcePoolName "GpuChildPool"
```

You now have a cluster-wide resource pool (named GpuChildPool) that is populated with assignable GPUs. The cluster uses this pool to determine VM placement for any started or moved VMs that are assigned to the GPU resource pool.

Assign a VM to a GPU resource pool

You can now assign a VM to a GPU resource pool. You can assign one or more VMs to a clustered GPU resource pool, and remove a VM from a clustered GPU resource pool.

Windows Admin Center
PowerShell

Follow these steps to assign an existing VM to a GPU resource pool using Windows Admin Center.

Note

You also need to install drivers from your GPU manufacturer inside the VM so that apps in the VM can take advantage of the GPU assigned to them.

On the Assign VM to GPU pool page, specify the following, then select Assign:
1. Server name
2. GPU pool name
3. Virtual machine that you want to assign the GPU to from the GPU pool.
You can also define advanced setting values for memory-mapped IO (MMIO) spaces to determine resource requirements for a single GPU.

After the process completes, you'll receive a confirmation prompt that shows you successfully assigned the GPU from the GPU resource pool to the VM, which displays under Assigned VMs.

To unassign a VM from a GPU resource pool:

On the GPU pools tab, select the GPU that you want to unassign, and then select Unassign VM.
On the Unassign VM from GPU pool page, in the Virtual machines list box, specify the name of the VM, and then select Unassign.

After the process completes, you receive a success prompt that the VM has been unassigned from the GPU pool, and under Assignment status the GPU shows Available (Not assigned).

Follow these steps to assign an existing VM to a GPU resource pool using PowerShell.

Configure the cluster VM resource’s default offline action as force-shutdown rather than save. Make sure to replace <vmname> with the name of the VM that you want to assign to the GPU resource pool.

In PowerShell, run the following cmdlet:
```
 Get-ClusterResource -name <vmname> | Set-ClusterParameter -Name "OfflineAction" -Value 3
```
Assign the resource pool that you created earlier to the VM. Assigning the resource pool declares to the cluster that the VM requires an assigned device from the GpuChildPool pool when it's started or moved.

In PowerShell, run the following cmdlet:
```
 Get-ClusterResource -name <vmname> | Add-VMAssignableDevice -ResourcePoolName "GpuChildPool"
```
Note

To add more than one GPU to the VM, first check that the resource pool contains multiple assignable GPUs. Then, run the previous command again for each GPU you want to add.

You can also remove an assigned GPU from a VM. To do so, in PowerShell, run the following command. Make sure to replace <vmname> with the name of the VM that you want to assign to the GPU resource pool.
```
 Add-VMAssignableDevice -VMName $vm -ResourcePoolName "GpuChildPool"

 $vm | Remove-VMAssignableDevice
```

When you start the VM, the cluster ensures that the VM is placed on a server with available GPU resources from this cluster-wide pool. The cluster also assigns the GPU to the VM through DDA, which allows the GPU to be accessed from workloads inside the VM.

Fail over a VM with an assigned GPU

To test the cluster’s ability to fail over your GPU workload, perform a drain operation on the server where the VM is running with an assigned GPU. Performing a drain operation on the server causes the cluster to restart the VM on another server in the cluster, as long as another server has sufficient available resources in the pool that you created.

To drain the server, follow the instructions in Failover cluster maintenance procedures. The cluster restarts the VM on another server in the cluster, as long as another server has sufficient available GPU resources in the pool that you created.

For more information on using GPUs with your clustered VMs, see:

For more information on using GPUs with your VMs and GPU partitioning, see:

Additional resources

Training

Learning path

Run high-performance computing (HPC) applications on Azure - Training

Azure HPC is a purpose-built cloud capability for HPC & AI workload, using leading-edge processors and HPC-class InfiniBand interconnect, to deliver the best application performance, scalability, and value. Azure HPC enables users to unlock innovation, productivity, and business agility, through a highly available range of HPC & AI technologies that can be dynamically allocated as your business and technical needs change. This learning path is a series of modules that help you get started on Azure HPC - you

Certification

Microsoft Certified: Azure Virtual Desktop Specialty - Certifications

Plan, deliver, manage, and monitor virtual desktop experiences and remote apps on Microsoft Azure for any device.

Share via