更换 Azure Stack Hub 中的物理磁盘Replace a physical disk in Azure Stack Hub

本文介绍更换 Azure Stack Hub 中的物理磁盘的一般过程。This article describes the general process to replace a physical disk in Azure Stack Hub. 如果物理磁盘发生故障,应尽快更换。If a physical disk fails, you should replace it as soon as possible.

Note

更换物理数据驱动器需要将缩放单位节点提前置于维护模式(清空)状态。Replacing a physical data drive does not require the scale unit node to be put into maintenance mode (drain) upfront. 此外,在更换物理驱动器后,不需要使用 Azure Stack Hub 管理员门户来修复缩放单位节点。Also after the physical drive has been replaced the scale unit node doesn't need to be repaired using the Azure Stack Hub administrator portal. 以下文章详细介绍了何时需要修复:更换 Azure Stack Hub 缩放单元节点上的硬件组件The following article has more information when a repair is required Replace a hardware component on an Azure Stack Hub scale unit node.

可以将此过程用于具有热插拔磁盘的部署。You can use this procedure for deployments that have hot-swappable disks.

实际的磁盘更换步骤因原始设备制造商 (OEM) 硬件供应商而异。Actual disk replacement steps will vary based on your original equipment manufacturer (OEM) hardware vendor. 请参阅供应商的现场可更换单元 (FRU) 文档来了解特定于你的系统的详细步骤。See your vendor's field replaceable unit (FRU) documentation for detailed steps that are specific to your system.

查看磁盘警报信息Review disk alert information

磁盘发生故障时,你将收到一个警报,告知你与物理磁盘的连接已丢失。When a disk fails, you receive an alert that tells you that connectivity has been lost to a physical disk.

在 Azure Stack Hub 管理中显示物理磁盘连接丢失的警报

如果打开该警报,警报说明包含必须更换之磁盘的缩放单元节点和确切物理插槽位置。If you open the alert, the alert description contains the scale unit node and the exact physical slot location for the disk that you must replace. Azure Stack Hub 会使用 LED 指示器功能来进一步帮助你确定发生故障的磁盘。Azure Stack Hub further helps you to identify the failed disk by using LED indicator capabilities.

更换物理磁盘Replace the physical disk

请按照 OEM 硬件供应商的 FRU 说明进行实际磁盘更换。Follow your OEM hardware vendor's FRU instructions for actual disk replacement.

Note

每次更换一个缩放单元节点的磁盘。Replace disks for one scale unit node at a time. 等待虚拟磁盘修复作业完成,然后移到下一个缩放单元节点。Wait for the virtual disk repair jobs to complete before moving on to the next scale unit node.

为了防止在集成系统中使用不支持的磁盘,系统会阻止供应商不支持的磁盘。To prevent the use of an unsupported disk in an integrated system, the system blocks disks that aren't supported by your vendor. 如果尝试使用不支持的磁盘,则会有新警报通知你因为不支持该磁盘的型号或固件,所以已将该磁盘隔离。If you try to use an unsupported disk, a new alert tells you a disk has been quarantined because of an unsupported model or firmware.

更换磁盘后,Azure Stack Hub 会自动发现新磁盘,并启动虚拟磁盘修复过程。After you replace the disk, Azure Stack Hub automatically discovers the new disk and starts the virtual disk repair process.

使用 Azure Stack Hub PowerShell 查看虚拟磁盘修复的状态Check the status of virtual disk repair using Azure Stack Hub PowerShell

更换磁盘后,可以使用 Azure Stack Hub PowerShell 监视虚拟磁盘运行状况和修复作业进度。After you replace the disk, you can monitor the virtual disk health status and repair job progress by using Azure Stack Hub PowerShell.

  1. 检查是否已安装 Azure Stack Hub PowerShell。Check that you have Azure Stack Hub PowerShell installed. 有关详细信息,请参阅安装适用于 Azure Stack Hub 的 PowerShellFor more information, see Install PowerShell for Azure Stack Hub.

  2. 以操作员身份使用 PowerShell 连接到 Azure Stack Hub。Connect to Azure Stack Hub with PowerShell as an operator. 有关详细信息,请参阅以操作员身份使用 PowerShell 连接到 Azure Stack HubFor more information, see Connect to Azure Stack Hub with PowerShell as an operator.

  3. 运行以下 cmdlet 以验证虚拟磁盘运行状况并修复状态:Run the following cmdlets to verify the virtual disk health and repair status:

    $scaleunit=Get-AzsScaleUnit
    $StorageSubSystem=Get-AzsStorageSubSystem -ScaleUnit $scaleunit.Name
    Get-AzsVolume -StorageSubSystem $StorageSubSystem.Name -ScaleUnit $scaleunit.name | Select-Object VolumeLabel, OperationalStatus, RepairStatus
    

    PowerShell 中的 Azure Stack Hub 卷运行状况

  4. 验证 Azure Stack Hub 系统状态。Validate Azure Stack Hub system state. 有关说明,请参阅验证 Azure Stack Hub 系统状态For instructions, see Validate Azure Stack Hub system state.

  5. 或者,可以运行以下命令来验证更换的物理磁盘的状态。Optionally, you can run the following command to verify the status of the replaced physical disk.

    $scaleunit=Get-AzsScaleUnit
    $StorageSubSystem=Get-AzsStorageSubSystem -ScaleUnit $scaleunit.Name
    
    Get-AzsDrive -StorageSubSystem $StorageSubSystem.Name -ScaleUnit $scaleunit.name | Sort-Object StorageNode,MediaType,PhysicalLocation | Format-Table Storagenode, Healthstatus, PhysicalLocation, Model, MediaType,  CapacityGB, CanPool, CannotPoolReason
    

    已通过 PowerShell 更换 Azure Stack Hub 中的物理磁盘

使用特权终结点检查虚拟磁盘修复状态Check the status of virtual disk repair using the privileged endpoint

更换磁盘后,可以使用特权终结点监视虚拟磁盘运行状况和修复作业进度。After you replace the disk, you can monitor the virtual disk health status and repair job progress by using the privileged endpoint. 从任何可以通过网络连接到特权终结点的计算机,按照下列步骤进行操作。Follow these steps from any computer that has network connectivity to the privileged endpoint.

  1. 打开 Windows PowerShell 会话并连接到特权终结点。Open a Windows PowerShell session and connect to the privileged endpoint.

        $cred = Get-Credential
        Enter-PSSession -ComputerName <IP_address_of_ERCS>`
          -ConfigurationName PrivilegedEndpoint -Credential $cred
    
  2. 运行以下命令以查看虚拟磁盘运行状况:Run the following command to view virtual disk health:

        Get-VirtualDisk -CimSession s-cluster
    

    Get-VirtualDisk 命令的 Powershell 输出

  3. 运行以下命令以查看当前的存储作业状态:Run the following command to view current storage job status:

        Get-VirtualDisk -CimSession s-cluster | Get-StorageJob
    

    Get-StorageJob 命令的 Powershell 输出

  4. 验证 Azure Stack Hub 系统状态。Validate the Azure Stack Hub system state. 有关说明,请参阅验证 Azure Stack Hub 系统状态For instructions, see Validate Azure Stack Hub system state.

使用特权终结点排查虚拟磁盘修复问题Troubleshoot virtual disk repair using the privileged endpoint

如果虚拟磁盘修复作业出现停滞,请运行以下命令来重新启动作业:If the virtual disk repair job appears stuck, run the following command to restart the job:

Get-VirtualDisk -CimSession s-cluster | Repair-VirtualDisk