Scenario: Apache Ambari UI shows down hosts and services in Azure HDInsight

This article describes troubleshooting steps and possible resolutions for issues when interacting with Azure HDInsight clusters.

Issue

The Apache Ambari UI is accessible, but it shows almost all services as down and all hosts as having lost their heartbeat.

Cause

In most scenarios, the cause is that Ambari server is not running on the active headnode. Check which headnode is the active one and make sure ambari-server is running on it. Don't start ambari-server manually; let the failover controller service start it on the right headnode. Reboot the active headnode to force a failover.

Networking issues can also cause this problem. From each cluster node, check whether you can ping headnodehost. In rare cases, no cluster node can connect to headnodehost:

$>telnet headnodehost 8440
... No route to host
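
The telnet check above can also be scripted so that it is easy to repeat on every cluster node. The following is a minimal sketch, not part of HDInsight or Ambari tooling: the function name `can_reach` and the 5-second timeout are assumptions chosen for illustration, and the host and port are the ones from the telnet example.

```python
import socket
from typing import Tuple


def can_reach(host: str, port: int, timeout: float = 5.0) -> Tuple[bool, str]:
    """Try to open a TCP connection to host:port.

    Returns (True, "connected") on success, otherwise (False, reason).
    The function name and default timeout are illustrative assumptions.
    """
    try:
        # create_connection resolves the name and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True, "connected"
    except socket.gaierror as exc:
        # The host name could not be resolved on this node.
        return False, f"cannot resolve {host}: {exc}"
    except OSError as exc:
        # Covers "No route to host", connection refused, and timeouts.
        return False, f"cannot connect to {host}:{port}: {exc}"


# Example (run on a cluster node):
#   ok, detail = can_reach("headnodehost", 8440)
#   print("OK" if ok else "FAILED", detail)
```

If the call fails on every node, the result matches the rare networking case described above rather than a problem local to a single node.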

Resolution

Usually, rebooting the active headnode mitigates this issue. If it doesn't, contact the HDInsight support team.

Next steps

If you didn't see your problem or are unable to solve your issue, visit the following channel for more support:

  • If you need more help, you can submit a support request from the Azure portal. Select Support from the menu bar or open the Help + support hub.