监视 Azure 备份工作负荷Monitoring Azure Backup workloads

Azure 备份根据备份要求和基础结构拓扑(本地或 Azure)提供多个备份解决方案。Azure Backup provides multiple backup solutions based on the backup requirement and infrastructure topology (On-premises vs Azure). 任何备份用户或管理员都会看到所有解决方案中发生的情况,并会在出现重大情况时收到通知。Any backup user or admin should see what is going on across all solutions and can expect to be notified in important scenarios. 本文详细介绍了 Azure 备份服务提供的监视和通知功能。This article details the monitoring and notification capabilities provided by Azure Backup service.

恢复服务保管库中的 Azure 备份作业Backup Jobs in Recovery Services vault

Azure 备份针对 Azure 备份保护的工作负荷提供内置的监视和警报功能。Azure Backup provides in-built monitoring and alerting capabilities for workloads being protected by Azure Backup. 在恢复服务保管库设置中,“监视”部分提供了内置的作业和警报。In the Recovery Services vault settings, the Monitoring section provides in-built jobs and alerts.

恢复服务保管库内置监视

执行配置备份、备份、还原、删除备份等操作时,会生成作业。Jobs are generated when operations such as configuring backup, back up, restore, delete backup, and so on, are performed.

此处会显示以下 Azure 备份解决方案中的作业:Jobs from the following Azure Backup solutions are shown here:

  • Azure VM 备份Azure VM backup
  • Azure 文件备份Azure File backup
  • Azure 工作负荷备份,例如 SQL 和 SAP HANA 备份Azure workload back up such as SQL and SAP HANA
  • Azure 备份代理 (MAB)Azure Backup agent (MAB)

不会显示 System Center Data Protection Manager (SC-DPM) 和 Microsoft Azure 备份服务器 (MABS) 中的作业。Jobs from System Center Data Protection Manager (SC-DPM), Microsoft Azure Backup Server (MABS) are NOT displayed.

备注

Azure VM 中的 Azure 工作负荷(例如 SQL 和 SAP HANA 备份)包含大量的备份作业。Azure workloads such as SQL and SAP HANA backups within Azure VMs have huge number of backup jobs. 例如,日志备份可能每隔 15 分钟运行一次。For example, log backups can run for every 15 minutes. 因此,对于此类数据库工作负荷,只会显示用户触发的操作。So for such DB workloads, only user triggered operations are displayed. 不显示计划的备份操作。Scheduled backup operations are NOT displayed.

恢复服务保管库中的备份警报Backup Alerts in Recovery Services vault

警报主要用于通知用户,让他们采取相关的措施。Alerts are primarily scenarios where users are notified so that they can take relevant action. “备份警报”部分显示 Azure 备份服务生成的警报。The Backup Alerts section shows alerts generated by Azure Backup service. 这些警报由服务定义,用户无法以自定义方式创建任何警报。These alerts are defined by the service and user can't custom create any alerts.

警报方案Alert scenarios

以下方案由服务定义为可发出警报的方案。The following scenarios are defined by service as alertable scenarios.

  • 备份/还原失败Backup/Restore failures
  • 备份成功,但出现 Azure 备份代理 (MAB) 的警告Backup succeeded with warnings for Azure Backup Agent (MAB)
  • 停止保护并保留数据/停止保护并删除数据Stop protection with retain data/Stop protection with delete data

此处会显示以下 Azure 备份解决方案中的警报Alerts from the following Azure Backup solutions are shown here

  • Azure VM 备份Azure VM backups
  • Azure 文件备份Azure File backups
  • Azure 工作负荷备份,例如 SQL 备份、SAP HANA 备份Azure workload backups such as SQL, SAP HANA
  • Azure 备份代理 (MAB)Azure Backup agent (MAB)

备注

此处不会显示 System Center Data Protection Manager (SC-DPM) 和 Microsoft Azure 备份服务器 (MABS) 中的警报。Alerts from System Center Data Protection Manager (SC-DPM), Microsoft Azure Backup Server (MABS) are NOT displayed here.

合并的警报Consolidated Alerts

对于 Azure 工作负荷备份解决方案(例如 SQL 和 SAP HANA),系统可以非常频繁地生成日志备份(根据策略,最高可达每 15 分钟 1 次)。For Azure workload backup solutions such as SQL and SAP HANA, log backups can be generated very frequently (up to every 15 minutes according to the policy). 因此,也可能会出现日志备份失败也很频繁(高达每 15 分钟一次)的情况。So it's also possible that the log backup failures are also very frequent (up to every 15 minutes). 在这种情况下,如果每次失败都引发一次警报,最终用户将会不堪其扰。In this scenario, the end user will be overwhelmed if an alert is raised for each failure occurrence. 因此,系统会在第一次失败时发送警报,以后由于同一根本原因而失败时则不会生成警报。So an alert is sent for the first occurrence and if the later failures are because of the same root cause, then further alerts aren't generated. 将在第一个警报中更新失败计数。The first alert is updated with the failure count. 但如果该警报被用户停用,则下一次失败会触发另一警报,系统会将其视为该情况的第一个警报。But if the alert is inactivated by the user, the next occurrence will trigger another alert and this will be treated as the first alert for that occurrence. 这是 Azure 备份针对 SQL 和 SAP HANA 备份执行警报合并的方式。This is how Azure Backup performs alert consolidation for SQL and SAP HANA backups.

未引发警报时生成异常Exceptions when an alert is not raised

在一些例外情况下,失败时不会引发警报。There are few exceptions when an alert isn't raised on a failure. 它们具有以下特点:They are:

  • 用户显式取消了正在运行的作业User explicitly canceled the running job
  • 作业失败,因为另一个备份作业正在进行(在此情况下,无需采取任何措施,因为只需等待前一个作业完成即可)The job fails because another backup job is in progress (nothing to act on here since we just have to wait for the previous job to finish)
  • VM 备份作业失败,因为备份的 Azure VM 不再存在The VM backup job fails because the backed-up Azure VM no longer exists
  • 合并的警报Consolidated Alerts

之所以设计上述异常,是因为我们知道,这些操作的结果(主要是用户触发的操作)会立即显示在门户/PS/CLI 客户端中。The above exceptions are designed from the understanding that the result of these operations (primarily user triggered) shows up immediately on portal/PS/CLI clients. 因此,用户会立即了解相关情况,不需要通知。So the user is immediately aware and doesn't need a notification.

警报类型Alert types

根据警报严重性,可以定义三种类型的警报:Based on alert severity, alerts can be defined in three types:

  • 严重:原则上,发生任何备份或恢复失败(计划的或用户触发的)都会导致生成警报,并显示为严重警报以及破坏性操作(例如删除备份)。Critical: In principle, any backup or recovery failure (scheduled or user triggered) would lead to generation of an alert and would be shown as a Critical alert and also destructive operations such as delete backup.
  • 警告:如果备份操作成功但出现了几条警告,则会将这些警报列为“警告”警报。Warning: If the backup operation succeeds but with few warnings, they're listed as Warning alerts. “警告”警报目前仅适用于 Azure 备份代理备份。Warning alerts are currently available only for Azure Backup Agent backups.
  • 信息性:目前,Azure 备份服务不会生成任何信息性警报。Informational: Currently, no informational alert is generated by Azure Backup service.

备份警报的通知Notification for Backup Alerts

备注

只能通过 Azure 门户配置通知。Configuration of notification can be done only through Azure Portal. 不支持使用 PS/CLI/REST API/Azure 资源管理器模板。PS/CLI/REST API/Azure Resource Manager Template support is not supported.

一旦引发警报,用户就会收到通知。Once an alert is raised, users are notified. Azure 备份通过电子邮件提供内置通知机制。Azure Backup provides an inbuilt notification mechanism via e-mail. 可以指定在生成警报时接收通知的个人电子邮件地址或通讯组列表。One can specify individual email addresses or distribution lists to be notified when an alert is generated. 还可以选择是要接收每个警报的通知,还是将这些警报分组成按小时摘要,然后接收通知。You can also choose whether to get notified for each individual alert or to group them in an hourly digest and then get notified.

恢复服务保管库内置电子邮件通知

配置通知后,你会收到一封欢迎电子邮件或简介电子邮件。When notification is configured, you'll receive a welcome or introductory email. 由此可以确认,在引发警报时,Azure 备份可向这些地址发送电子邮件。This confirms that Azure Backup can send emails to these addresses when an alert is raised.

如果将频率设置为“每小时摘要”,然后引发了警报,但在一小时内解决了该警报,那么,后续的每小时摘要中不会包括该警报。If the frequency was set to an hourly digest and an alert was raised and resolved within an hour, it won't be a part of the upcoming hourly digest.

备注

  • 如果执行了破坏性操作(例如“停止保护并删除数据”),那么,即使未针对恢复服务保管库配置通知,也会引发警报,并向订阅所有者、管理员和共同管理员发送电子邮件。If a destructive operation such as stop protection with delete data is performed, an alert is raised and an email is sent to subscription owners, admins, and co-admins even if notifications are NOT configured for the Recover Service vault.
  • 若要针对成功的作业配置通知,请使用 Log AnalyticsTo configure notification for successful jobs use Log Analytics.

停用警报Inactivating alerts

若要停用/解决某个活动警报,可以单击与要停用的警报相对应的列表项。To inactivate/resolve an active alert, you can click on the list item corresponding to the alert you wish to inactivate. 这将打开一个屏幕,其中会显示有关警报的详细信息,顶部有一个“停用”按钮。This opens up a screen that displays detailed information about the alert, with an 'Inactivate' button on the top. 单击该按钮会将警报的状态更改为“非活动”。Clicking this button would change the status of the alert to 'Inactive'. 还可以通过以下方式停用警报:右键单击与警报对应的列表项并选择“停用”。You may also inactivate an alert by right-clicking on the list item corresponding to that alert and selecting 'Inactivate'.

停用恢复服务保管库警报

后续步骤Next steps

使用 Azure Monitor 监视 Azure 备份工作负荷Monitor Azure backup workloads using Azure Monitor