监视 Azure 备份工作负荷Monitoring Azure Backup workloads

Azure 备份根据备份要求和基础结构拓扑(本地或 Azure)提供多个备份解决方案。Azure Backup provides multiple backup solutions based on the backup requirement and infrastructure topology (On-premises vs Azure). 任何备份用户或管理员都应看到所有解决方案中发生的情况,并会在出现重大情况时收到通知。Any backup user or admin should see what's going on across all solutions and can expect to be notified in important scenarios. 本文详细介绍了 Azure 备份服务提供的监视和通知功能。This article details the monitoring and notification capabilities provided by Azure Backup service.

恢复服务保管库中的备份项Backup Items in Recovery Services vault

可通过恢复服务保管库监视所有备份项。You can monitor all your backup items via a Recovery Services vault. 导航到保管库中的“备份项”部分后会打开一个视图,其中提供与保管库关联的每种工作负载的备份项数量。Navigating to the Backup Items section in the vault opens up a view that provides the number of backup items of each workload type associated with the vault. 单击任意行会打开一个详细视图,其中列出了给定工作负载类型的所有备份项,以及有关每个项的上次备份状态、可用的最新还原点等信息。Clicking on any row opens up a detailed view listing all backup items of the given workload type, with information on the last backup status for each item, latest restore point available, and so on.

RS 保管库备份项

备注

对于使用 DPM 备份到 Azure 的项,该列表将显示使用 DPM 服务器保护的所有数据源(包括磁盘和联机)。For items backed up to Azure using DPM, the list will show all the data sources protected (both disk and online) using the DPM server. 如果对保留了备份数据的数据源停止保护,则该数据源仍会在门户中列出。If the protection is stopped for the datasource with backup data retained, the datasource will be still listed in the portal. 可访问数据源的详细信息,查看恢复点是否存在于磁盘、联机或同时存在于这两者中。You can go to the details of the data source to see if the recovery points are present in disk, online or both. 此外,对于联机保护已停止但数据仍保留的数据源,在数据完全删除之前,将继续对联机恢复点进行计费。Also, datasources for which the online protection is stopped but data is retained, billing for the online recovery points continue until the data is completely deleted.

DPM 版本必须是 DPM 1807 (5.1.378.0) 或 DPM 2019(10.19.58.0 或更高版本),才能在恢复服务保管库门户中显示备份项。The DPM version must be DPM 1807 (5.1.378.0) or DPM 2019 ( version 10.19.58.0 or above), for the backup items to be visible in the Recovery Services vault portal.

恢复服务保管库中的 Azure 备份作业Backup Jobs in Recovery Services vault

Azure 备份针对 Azure 备份保护的工作负荷提供内置的监视和警报功能。Azure Backup provides in-built monitoring and alerting capabilities for workloads being protected by Azure Backup. 在恢复服务保管库设置中,“监视”部分提供了内置的作业和警报。In the Recovery Services vault settings, the Monitoring section provides in-built jobs and alerts.

恢复服务保管库内置监视

执行配置备份、备份、还原、删除备份等操作时,会生成作业。Jobs are generated when operations such as configuring backup, back up, restore, delete backup, and so on, are performed.

此处会显示以下 Azure 备份解决方案中的作业:Jobs from the following Azure Backup solutions are shown here:

  • Azure VM 备份Azure VM backup
  • Azure 文件备份Azure File backup
  • Azure 工作负荷备份,例如 SQL 和 SAP HANA 备份Azure workload back up such as SQL and SAP HANA
  • Microsoft Azure 恢复服务 (MARS) 代理Microsoft Azure Recovery Services (MARS) agent

不会显示 System Center Data Protection Manager (SC-DPM) 和 Microsoft Azure 备份服务器 (MABS) 中的作业。Jobs from System Center Data Protection Manager (SC-DPM), Microsoft Azure Backup Server (MABS) aren't displayed.

备注

Azure VM 中的 Azure 工作负荷(例如 SQL 和 SAP HANA 备份)包含大量的备份作业。Azure workloads such as SQL and SAP HANA backups within Azure VMs have huge number of backup jobs. 例如,日志备份可能每隔 15 分钟运行一次。For example, log backups can run for every 15 minutes. 因此,对于此类数据库工作负荷,只会显示用户触发的操作。So for such DB workloads, only user triggered operations are displayed. 不会显示计划的备份操作。Scheduled backup operations aren't displayed.

恢复服务保管库中的备份警报Backup Alerts in Recovery Services vault

警报主要用于通知用户,让他们采取相关的措施。Alerts are primarily scenarios where users are notified so that they can take relevant action. “备份警报”部分显示 Azure 备份服务生成的警报。The Backup Alerts section shows alerts generated by Azure Backup service. 这些警报由服务定义,用户无法以自定义方式创建任何警报。These alerts are defined by the service and user can't custom create any alerts.

警报方案Alert scenarios

以下方案由服务定义为可发出警报的方案。The following scenarios are defined by service as alertable scenarios.

  • 备份/还原失败Backup/Restore failures
  • 备份成功,并显示针对 Microsoft Azure 恢复服务 (MARS) 代理的警告Backup succeeded with warnings for Microsoft Azure Recovery Services (MARS) agent
  • 停止保护并保留数据/停止保护并删除数据Stop protection with retain data/Stop protection with delete data

此处会显示以下 Azure 备份解决方案中的警报Alerts from the following Azure Backup solutions are shown here

  • Azure VM 备份Azure VM backups
  • Azure 文件备份Azure File backups
  • Azure 工作负荷备份,例如 SQL 备份、SAP HANA 备份Azure workload backups such as SQL, SAP HANA
  • Microsoft Azure 恢复服务 (MARS) 代理Microsoft Azure Recovery Services (MARS) agent

备注

此处不会显示 System Center Data Protection Manager (SC-DPM) 和 Microsoft Azure 备份服务器 (MABS) 中的警报。Alerts from System Center Data Protection Manager (SC-DPM), Microsoft Azure Backup Server (MABS) aren't displayed here.

合并的警报Consolidated Alerts

对于 Azure 工作负荷备份解决方案(例如 SQL 和 SAP HANA),系统可以非常频繁地生成日志备份(根据策略,最高可达每 15 分钟 1 次)。For Azure workload backup solutions such as SQL and SAP HANA, log backups can be generated very frequently (up to every 15 minutes according to the policy). 因此,也可能会出现日志备份失败也很频繁(高达每 15 分钟一次)的情况。So it's also possible that the log backup failures are also very frequent (up to every 15 minutes). 在这种情况下,如果每次失败都引发一次警报,最终用户将会不堪其扰。In this scenario, the end user will be overwhelmed if an alert is raised for each failure occurrence. 因此,系统会在第一次失败时发送警报,以后由于同一根本原因而失败时则不会生成警报。So an alert is sent for the first occurrence and if the later failures are because of the same root cause, then further alerts aren't generated. 将在第一个警报中更新失败计数。The first alert is updated with the failure count. 但如果该警报被用户停用,则下一次失败会触发另一警报,系统会将其视为该情况的第一个警报。But if the alert is inactivated by the user, the next occurrence will trigger another alert and this will be treated as the first alert for that occurrence. 这是 Azure 备份针对 SQL 和 SAP HANA 备份执行警报合并的方式。This is how Azure Backup performs alert consolidation for SQL and SAP HANA backups.

未引发警报时生成异常Exceptions when an alert is not raised

在一些例外情况下,失败时不会引发警报。There are few exceptions when an alert isn't raised on a failure. 它们具有以下特点:They are:

  • 用户显式取消了正在运行的作业User explicitly canceled the running job
  • 作业失败,因为另一个备份作业正在进行(在此情况下,无需采取任何措施,因为只需等待前一个作业完成即可)The job fails because another backup job is in progress (nothing to act on here since we just have to wait for the previous job to finish)
  • VM 备份作业失败,因为备份的 Azure VM 不再存在The VM backup job fails because the backed-up Azure VM no longer exists
  • 合并的警报Consolidated Alerts

之所以设计上述异常,是因为我们知道,这些操作的结果(主要是用户触发的操作)会立即在门户/PS/CLI 客户端中显示。The exceptions above are designed from the understanding that the result of these operations (primarily user triggered) shows up immediately on portal/PS/CLI clients. 因此,用户会立即了解相关情况,不需要通知。So the user is immediately aware and doesn't need a notification.

警报类型Alert types

根据警报严重性,可以定义三种类型的警报:Based on alert severity, alerts can be defined in three types:

  • 严重:原则上,发生任何备份或恢复失败(计划的或用户触发的)都会导致生成警报,并显示为严重警报以及破坏性操作(例如删除备份)。Critical: In principle, any backup or recovery failure (scheduled or user triggered) would lead to generation of an alert and would be shown as a Critical alert and also destructive operations such as delete backup.
  • 警告:如果备份操作成功但出现了几条警告,则会将这些警报列为“警告”警报。Warning: If the backup operation succeeds but with few warnings, they're listed as Warning alerts. “警告”警报目前仅适用于 Azure 备份代理备份。Warning alerts are currently available only for Azure Backup Agent backups.
  • 信息性:目前,Azure 备份服务不会生成任何信息性警报。Informational: Currently, no informational alert is generated by Azure Backup service.

备份警报的通知Notification for Backup Alerts

备注

只能通过 Azure 门户配置通知。Configuration of notification can be done only through the Azure portal. 不支持使用 PS/CLI/REST API/Azure 资源管理器模板。PS/CLI/REST API/Azure Resource Manager Template support isn't supported.

一旦引发警报,用户就会收到通知。Once an alert is raised, users are notified. Azure 备份通过电子邮件提供内置通知机制。Azure Backup provides an inbuilt notification mechanism via e-mail. 可以指定在生成警报时接收通知的个人电子邮件地址或通讯组列表。One can specify individual email addresses or distribution lists to be notified when an alert is generated. 还可以选择是要接收每个警报的通知,还是将这些警报分组成按小时摘要,然后接收通知。You can also choose whether to get notified for each individual alert or to group them in an hourly digest and then get notified.

恢复服务保管库内置电子邮件通知

配置通知后,你会收到一封欢迎电子邮件或简介电子邮件。When notification is configured, you'll receive a welcome or introductory email. 由此可以确认,在引发警报时,Azure 备份可向这些地址发送电子邮件。This confirms that Azure Backup can send emails to these addresses when an alert is raised.

如果将频率设置为“每小时摘要”,然后引发了警报,但在一小时内解决了该警报,那么,后续的每小时摘要中不会包括该警报。If the frequency was set to an hourly digest and an alert was raised and resolved within an hour, it won't be a part of the upcoming hourly digest.

备注

  • 如果执行了破坏性操作(例如“停止保护并删除数据”),那么,即使未针对恢复服务保管库配置通知,也会引发警报,并向订阅所有者、管理员和共同管理员发送电子邮件。If a destructive operation such as stop protection with delete data is performed, an alert is raised and an email is sent to subscription owners, admins, and co-admins even if notifications aren't configured for the Recovery Services vault.
  • 若要针对成功的作业配置通知,请使用 Log AnalyticsTo configure notification for successful jobs use Log Analytics.

停用警报Inactivating alerts

若要停用/解决某个活动警报,可以选择与要停用的警报相对应的列表项。To inactivate/resolve an active alert, you can select the list item corresponding to the alert you wish to inactivate. 这将打开一个屏幕,其中会显示有关警报的详细信息,顶部有一个“停用”按钮。This opens up a screen that displays detailed information about the alert, with an Inactivate button on the top. 选择此按钮会将警报的状态更改为“非活动”。Selecting this button will change the status of the alert to Inactive. 还可以通过以下方式停用警报:右键单击与警报对应的列表项并选择“停用”。You may also inactivate an alert by right-clicking on the list item corresponding to that alert and selecting Inactivate.

停用恢复服务保管库警报

Azure 备份的 Azure Monitor 警报(预览版)Azure Monitor alerts for Azure Backup (preview)

Azure 备份还通过 Azure Monitor 提供警报,使用户能够在不同的 Azure 服务(包括备份)中获得一致的警报管理体验。Azure Backup also provides alerts via Azure Monitor, to enable users to have a consistent experience for alert management across different Azure services, including backup. 使用 Azure Monitor 警报可以将警报路由到 Azure 备份支持的任何通知通道,例如电子邮件、Webhook、逻辑应用等。With Azure Monitor alerts, you can route alerts to any notification channel supported by Azure Backup such as email, Webhook, Logic App and so on.

目前,此功能适用于 Azure Databases for PostgreSQL Server、Azure Blob 和 Azure 托管磁盘。Currently, this feature is available for Azure Databases for PostgreSQL Server, Azure Blobs and Azure Managed Disks. 会为以下方案生成警报,并且可以通过导航到备份保管库并单击“警报”菜单项来访问警报:Alerts are generated for the following scenarios and can be accessed by navigating to a Backup vault and clicking on the Alerts menu item:

  • 删除备份数据Delete Backup Data
  • 备份失败(若要获取有关备份失败的警报,需要通过预览门户注册名为 EnableAzureBackupJobFailureAlertsToAzureMonitor 的 AFEC 标志)Backup Failure (to get alerts for Backup Failure, you need to register the AFEC flag named EnableAzureBackupJobFailureAlertsToAzureMonitor via the preview portal)
  • 还原失败(若要获取有关还原失败的警报,需要通过预览门户注册名为 EnableAzureBackupJobFailureAlertsToAzureMonitor 的 AFEC 标志)Restore Failure (to get alerts for Restore Failure, you need to register the AFEC flag named EnableAzureBackupJobFailureAlertsToAzureMonitor via the preview portal)

若要详细了解 Azure Monitor 警报,请参阅 Azure 中的警报概述For more information about Azure Monitor alerts, see Overview of alerts in Azure.

后续步骤Next steps

使用 Azure Monitor 监视 Azure 备份工作负荷Monitor Azure Backup workloads using Azure Monitor