管理 Azure Stack Hub 的存储基础结构Manage storage infrastructure for Azure Stack Hub

本文介绍 Azure Stack Hub 存储基础结构资源的运行状况和工作状态。This article describes the health and operational status of Azure Stack Hub storage infrastructure resources. 这些资源包括存储驱动器和卷。These resources include storage drives and volumes. 本主题中的信息可帮助你排查各种问题,例如,无法将驱动器添加到池的问题。The information in this topic helps you troubleshoot various issues, like when a drive can't be added to a pool.

卷状态Volume states

若要确定卷所处的状态,请使用以下 PowerShell 命令:To find out what state volumes are in, use the following PowerShell commands:

$scaleunit_name = (Get-AzsScaleUnit)[0].name

$subsystem_name = (Get-AzsStorageSubSystem -ScaleUnit $scaleunit_name)[0].name

Get-AzsVolume -ScaleUnit $scaleunit_name -StorageSubSystem $subsystem_name | Select-Object VolumeLabel, HealthStatus, OperationalStatus, RepairStatus, Description, Action, TotalCapacityGB, RemainingCapacityGB

以下示例输出显示有一个卷已分离,并且有一个卷已降级/不完整:Here's an example of output showing a detached volume and a degraded/incomplete volume:

VolumeLabelVolumeLabel HealthStatusHealthStatus OperationalStatusOperationalStatus
ObjStore_1ObjStore_1 未知Unknown 已分离Detached
ObjStore_2ObjStore_2 警告Warning {已降级,不完整}{Degraded, Incomplete}

以下部分列出了运行状况和工作状态:The following sections list the health and operational states:

卷运行状况:正常Volume health state: Healthy

操作状态Operational state 说明Description
OKOK 卷处于正常状态。The volume is healthy.
欠佳Suboptimal 数据未均匀写入到各个驱动器。Data isn't written evenly across drives.

操作: 请联系支持人员优化存储池中的驱动器用法。Action: Contact Support to optimize drive usage in the storage pool. 在此之前,请参考 https://aka.ms/azurestacklogfiles 中的指导启动日志文件收集过程。Before you do, start the log file collection process using the guidance from https://aka.ms/azurestacklogfiles. 恢复失败的连接后,可能需要从备份还原数据。You may have to restore from backup after the failed connection is restored.

卷运行状况:警告Volume health state: Warning

如果卷处于“警告”运行状况状态,则表示数据的一个或多个副本不可用,但 Azure Stack Hub 仍可读取至少一个数据副本。When the volume is in a Warning health state, it means that one or more copies of your data are unavailable but Azure Stack Hub can still read at least one copy of your data.

操作状态Operational state 说明Description
运行中In service Azure Stack Hub 正在修复卷(例如,在添加或删除驱动器之后)。Azure Stack Hub is repairing the volume, like after adding or removing a drive. 修复完成后,卷应该会恢复“正常”运行状况。When the repair is complete, the volume should return to the OK health state.

操作: 等待 Azure Stack Hub 完成修复卷,然后检查状态。Action: Wait for Azure Stack Hub to finish repairing the volume and check the status afterward.
不完整Incomplete 由于一个或多个驱动器出现故障或缺失,卷的复原能力下降。The resilience of the volume is reduced because one or more drives failed or are missing. 但是,缺失的驱动器包含数据的最新副本。However, the missing drives contain up-to-date copies of your data.

操作: 重新连接所有缺失的驱动器,更换所有出现故障的驱动器,并使脱机的所有服务器联机。Action: Reconnect any missing drives, replace any failed drives, and bring online any servers that are offline.
已降级Degraded 由于一个或多个驱动器出现故障或缺失,并且驱动器上的数据副本已过时,因此卷的复原能力下降。The resilience of the volume is reduced because of one or more failed or missing drives as well as outdated copies of data on the drives.

操作: 重新连接所有缺失的驱动器,更换所有出现故障的驱动器,并使脱机的所有服务器联机。Action: Reconnect any missing drives, replace any failed drives, and bring online any servers that are offline.

卷运行状况:不正常Volume health state: Unhealthy

如果某个卷处于“不正常”运行状况,该卷上的部分或所有数据当前将不可访问。When a volume is in an Unhealthy health state, some or all of the data on the volume is currently inaccessible.

操作状态Operational state 说明Description
无冗余No redundancy 由于过多的驱动器出现故障,该卷已丢失数据。The volume has lost data because too many drives failed.

操作: 请联系支持人员。Action: Contact Support. 在此之前,请参考 https://aka.ms/azurestacklogfiles 中的指导启动日志文件收集过程。Before you do, start the log file collection process using the guidance from https://aka.ms/azurestacklogfiles.

卷运行状况:未知Volume health state: Unknown

如果虚拟磁盘已分离,卷也有可能处于“未知”运行状况。The volume can also be in the Unknown health state if the virtual disk has become detached.

操作状态Operational state 说明Description
已分离Detached 某个存储设备出现故障,从而可能导致卷不可访问。A storage device failure occurred which may cause the volume to be inaccessible. 某些数据可能已丢失。Some data may be lost.

操作:Action:
1.检查所有存储设备的物理连接和网络连接。1. Check the physical and network connectivity of all storage devices.
2.如果所有设备连接正确,请联系支持人员。2. If all devices are connected correctly, contact Support. 在此之前,请参考 https://aka.ms/azurestacklogfiles 中的指导启动日志文件收集过程。Before you do, start the log file collection process using the guidance from https://aka.ms/azurestacklogfiles. 恢复失败的连接后,可能需要从备份还原数据。You may have to restore from backup after the failed connection is restored.

驱动器状态Drive states

使用以下 PowerShell 命令监视驱动器的状态:Use the following PowerShell commands to monitor the state of drives:

$scaleunit_name = (Get-AzsScaleUnit)[0].name

$subsystem_name = (Get-AzsStorageSubSystem -ScaleUnit $scaleunit_name)[0].name

Get-AzsDrive -ScaleUnit $scaleunit_name -StorageSubSystem $subsystem_name | Select-Object StorageNode, PhysicalLocation, HealthStatus, OperationalStatus, Description, Action, Usage, CanPool, CannotPoolReason, SerialNumber, Model, MediaType, CapacityGB

以下部分描述了驱动器可能处于的运行状况:The following sections describe the health states a drive can be in:

驱动器运行状况:正常Drive health state: Healthy

操作状态Operational state 说明Description
OKOK 卷处于正常状态。The volume is healthy.
运行中In service 驱动器正在执行某些内部保养操作。The drive is doing some internal housekeeping operations. 操作完成后,驱动器应会恢复“正常”运行状况。When the action is complete, the drive should return to the OK health state.

驱动器运行状况:正常Drive health state: Healthy

处于“警告”状态的驱动器可以成功读取和写入数据,但存在问题。A drive in the Warning state can read and write data successfully but has an issue.

操作状态Operational state 说明Description
通信断开Lost communication 与驱动器的连接已断开。Connectivity has been lost to the drive.

操作: 使所有服务器恢复联机。Action: Bring all servers back online. 如果仍未解决问题,请重新连接驱动器。If that doesn't fix it, reconnect the drive. 如果此状态持续出现,请更换驱动器,以确保能够全面复原。If this state persists, replace the drive to ensure full resiliency.
预测性故障Predictive failure 预测驱动器即将发生故障。A failure of the drive is predicted to occur soon.

操作: 请尽快更换驱动器,以确保能够全面复原。Action: Replace the drive as soon as possible to ensure full resiliency.
IO 错误IO error 访问驱动器时发生暂时性错误。There was a temporary error accessing the drive.

操作: 如果此状态持续出现,请更换驱动器,以确保能够全面复原。Action: If this state persists, replace the drive to ensure full resiliency.
暂时性错误Transient error 驱动器出现暂时性错误。There was a temporary error with the drive. 此错误通常表示驱动器无响应,但也可能表示不恰当地从驱动器中删除了存储空间直通的保护分区。This error usually means the drive was unresponsive, but it could also mean that the Storage Spaces Direct protective partition was inappropriately removed from the drive.

操作: 如果此状态持续出现,请更换驱动器,以确保能够全面复原。Action: If this state persists, replace the drive to ensure full resiliency.
异常延迟Abnormal latency 驱动器有时无响应并出现故障迹象。The drive is sometimes unresponsive and is showing signs of failure.

操作: 如果此状态持续出现,请更换驱动器,以确保能够全面复原。Action: If this state persists, replace the drive to ensure full resiliency.
从池中删除Removing from pool Azure Stack Hub 正在从其存储池中删除驱动器。Azure Stack Hub is in the process of removing the drive from its storage pool.

操作: 等待 Azure Stack Hub 完成删除驱动器,然后检查状态。Action: Wait for Azure Stack Hub to finish removing the drive, and check the status afterward.
如果仍旧出现此状态,请联系支持人员。If the status remains, contact Support. 在此之前,请参考 https://aka.ms/azurestacklogfiles 中的指导启动日志文件收集过程。Before you do, start the log file collection process using the guidance from https://aka.ms/azurestacklogfiles.
启动维护模式Starting maintenance mode Azure Stack Hub 正在将驱动器置于维护模式。Azure Stack Hub is in the process of putting the drive in maintenance mode. 此状态是暂时的 - 驱动器应该很快就会处于“维护中模式”状态。This state is temporary-the drive should soon be in the In maintenance mode state.

操作: 等待 Azure Stack Hub 完成该过程,然后检查状态。Action: Wait for Azure Stack Hub to finish the process and check the status afterward.
维护中模式In maintenance mode 驱动器处于维护模式,因此对其的读取和写入操作已停止。The drive is in maintenance mode, halting reads and writes from the drive. 此状态通常表示正在执行 Azure Stack Hub 管理任务,例如,PNU 或 FRU 正在操作驱动器。This state usually means Azure Stack Hub administration tasks such as PNU or FRU are operating the drive. 但是,管理员也可将驱动器置于维护模式。But the admin could also place the drive in maintenance mode.

操作: 等待 Azure Stack Hub 完成管理任务,然后检查状态。Action: Wait for Hub Azure Stack Hub to finish the administration task, and check the status afterward.
如果仍旧出现此状态,请联系支持人员。If the status remains, contact Support. 在此之前,请参考 https://aka.ms/azurestacklogfiles 中的指导启动日志文件收集过程。Before you do, start the log file collection process using the guidance from https://aka.ms/azurestacklogfiles.
停止维护模式Stopping maintenance mode Azure Stack Hub 正在将驱动器恢复联机。Azure Stack Hub is in the process of bringing the drive back online. 此状态是暂时性的 - 驱动器应该很快就会处于另一种状态,最好是“正常运行”状态。This state is temporary - the drive should soon be in another state, ideally Healthy.

操作: 等待 Azure Stack Hub 完成该过程,然后检查状态。Action: Wait for Azure Stack Hub to finish the process and check the status afterward.

驱动器运行状况:不正常Drive health state: Unhealthy

当前无法写入或访问处于“不正常”状态的驱动器。A drive in the Unhealthy state can't currently be written to or accessed.

操作状态Operational state 说明Description
拆分Split 已从池中分离驱动器。The drive has become separated from the pool.

操作: 使用新磁盘替换该驱动器。Action: Replace the drive with a new disk. 如果必须使用此磁盘,请从系统中删除该磁盘,确保该磁盘上没有任何有用的数据,擦除该磁盘,然后重新安装磁盘。If you must use this disk, remove the disk from the system, make sure there's no useful data on the disk, erase the disk, and then reseat the disk.
不可用Not usable 物理磁盘已被隔离,因为它不受解决方案供应商的支持。The physical disk is quarantined because it's not supported by your solution vendor. 仅支持批准用于解决方案并且包含正确磁盘固件的磁盘。Only disks that are approved for the solution and have the correct disk firmware are supported.

操作: 将该驱动器替换为由解决方案批准的制造商提供的且带有认可型号的磁盘。Action: Replace the drive with a disk that has an approved manufacturer and model number for the solution.
已过时的元数据Stale metadata 用于更换的磁盘之前已用过,可能包含未知存储系统中的数据。The replacement disk was previously used and may contain data from an unknown storage system. 该磁盘已被隔离。The disk is quarantined.

操作: 使用新磁盘替换该驱动器。Action: Replace the drive with a new disk. 如果必须使用此磁盘,请从系统中删除该磁盘,确保该磁盘上没有任何有用的数据,擦除该磁盘,然后重新安装磁盘。If you must use this disk, remove the disk from the system, make sure there's no useful data on the disk, erase the disk, and then reseat the disk.
无法识别的元数据Unrecognized metadata 在驱动器上找到了无法识别的元数据,这通常表示驱动器包含来自其他池的元数据。Unrecognized metadata found on the drive, which usually means that the drive has metadata from a different pool on it.

操作: 使用新磁盘替换该驱动器。Action: Replace the drive with a new disk. 如果必须使用此磁盘,请从系统中删除该磁盘,确保该磁盘上没有任何有用的数据,擦除该磁盘,然后重新安装磁盘。If you must use this disk, remove the disk from the system, make sure there's no useful data on the disk, erase the disk, and then reseat the disk.
介质故障Failed media 驱动器出现故障,不再可供存储空间使用。The drive failed and won't be used by Storage Spaces anymore.

操作: 请尽快更换驱动器,以确保能够全面复原。Action: Replace the drive as soon as possible to ensure full resiliency.
设备硬件故障Device hardware failure 此驱动器上出现硬件故障。There was a hardware failure on this drive.

操作: 请尽快更换驱动器,以确保能够全面复原。Action: Replace the drive as soon as possible to ensure full resiliency.
正在更新固件Updating firmware Azure Stack Hub 正在更新驱动器上的固件。Azure Stack Hub is updating the firmware on the drive. 此状态是暂时性的,其持续时间通常小于一分钟,在此期间,池中的其他驱动器会处理所有读取和写入操作。This state is temporary and usually lasts less than a minute and during which time other drives in the pool handle all reads and writes.

操作: 等待 Azure Stack Hub 完成更新,然后检查状态。Action: Wait for Azure Stack Hub to finish the updating and check the status afterward.
正在启动Starting 驱动器正在为操作做好准备。The drive is getting ready for operation. 此状态应该是暂时性的 - 完成后,驱动器应会转换为另一种运行状态。This state should be temporary-once complete, the drive should transition to a different operational state.

操作: 等待 Azure Stack Hub 完成操作,然后检查状态。Action: Wait for Azure Stack Hub to finish the operation and check the status afterward.

驱动器无法入池的原因Reasons a drive can't be pooled

某些驱动器尚未做好加入 Azure Stack Hub 存储池的准备。Some drives just aren't ready to be in Azure Stack Hub storage pool. 通过查看驱动器的 CannotPoolReason 属性,可以确定驱动器为何不符合入池条件的原因。You can find out why a drive isn't eligible for pooling by looking at the CannotPoolReason property of a drive. 下表更具体地描述了每种原因。The following table gives a little more detail on each of the reasons.

ReasonReason 说明Description
硬件不合规Hardware not compliant 使用运行状况服务指定的已批准存储模型列表中不包括该驱动程序。The drive isn't in the list of approved storage models specified by using the Health Service.

操作: 使用新磁盘替换该驱动器。Action: Replace the drive with a new disk.
固件不合规Firmware not compliant 使用运行状况服务指定的已批准固件修订版列表中不包括该物理驱动器上的固件。The firmware on the physical drive isn't in the list of approved firmware revisions by using the Health Service.

操作: 使用新磁盘替换该驱动器。Action: Replace the drive with a new disk.
已由群集使用In use by cluster 该驱动器当前已由故障转移群集使用。The drive is currently used by a Failover Cluster.

操作: 使用新磁盘替换该驱动器。Action: Replace the drive with a new disk.
可移动媒体Removable media 该驱动器分类为可移动驱动器。The drive is classified as a removable drive.

操作: 使用新磁盘替换该驱动器。Action: Replace the drive with a new disk.
不正常Not healthy 该驱动器不处于正常状态,可能需要更换。The drive isn't in a healthy state and might need to be replaced.

操作: 使用新磁盘替换该驱动器。Action: Replace the drive with a new disk.
容量不足Insufficient capacity 某些分区占用了驱动器上的可用空间。There are partitions taking up the free space on the drive.

操作: 使用新磁盘替换该驱动器。Action: Replace the drive with a new disk. 如果必须使用此磁盘,请从系统中删除该磁盘,确保该磁盘上没有任何有用的数据,擦除该磁盘,然后重新安装磁盘。If you must use this disk, remove the disk from the system, make sure there's no useful data on the disk, erase the disk, and then reseat the disk.
正在验证Verification in progress 运行状况服务正在检查是否已批准使用驱动器上的固件。The Health Service is checking to see if the drive or firmware on the drive is approved for use.

操作: 等待 Azure Stack Hub 完成该过程,然后检查状态。Action: Wait for Azure Stack Hub to finish the process, and check the status afterward.
验证失败Verification failed 运行状况服务无法检查是否已批准使用驱动器上的固件。The Health Service couldn't check to see if the drive or firmware on the drive is approved for use.

操作: 请联系支持人员。Action: Contact Support. 在此之前,请参考 https://aka.ms/azurestacklogfiles 中的指导启动日志文件收集过程。Before you do, start the log file collection process using the guidance from https://aka.ms/azurestacklogfiles.
OfflineOffline 驱动器已脱机。The drive is offline.

操作: 请联系支持人员。Action: Contact Support. 在此之前,请参考 https://aka.ms/azurestacklogfiles 中的指导启动日志文件收集过程。Before you do, start the log file collection process using the guidance from https://aka.ms/azurestacklogfiles.