ALM-27001 DBService Unavailable

Updated at: Dec 31, 2019 GMT+08:00

Description

The alarm module checks the DBService status every 30 seconds. This alarm is generated when the system detects that DBService is unavailable and is cleared when DBService recovers.
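
The generate-and-clear behavior described above can be pictured as a small state machine. The following Python sketch is only an illustration, not the alarm module's actual implementation: the AlarmTracker class, its observe method, and the event names are hypothetical, and only the 30-second interval and alarm ID 27001 come from this page.

```python
from dataclasses import dataclass, field
from typing import List

# Interval between DBService status checks, as stated in the description.
CHECK_INTERVAL_SECONDS = 30


@dataclass
class AlarmTracker:
    """Hypothetical tracker that mirrors the generate/clear rule for one alarm."""

    alarm_id: str = "27001"
    active: bool = False
    events: List[str] = field(default_factory=list)

    def observe(self, available: bool) -> None:
        """Record the result of one periodic status check."""
        if not available and not self.active:
            # Service went from available to unavailable: generate the alarm.
            self.active = True
            self.events.append("generated")
        elif available and self.active:
            # Service recovered: the alarm is cleared automatically.
            self.active = False
            self.events.append("cleared")
        # Any other combination leaves the alarm state unchanged.
```

In this sketch, repeated failed checks while the alarm is already active do not generate it again, which matches the idea of a single alarm that stays in the list until DBService recovers.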

Attribute

Alarm ID    Alarm Severity    Automatically Cleared
27001       Critical          Yes

Parameters

Parameter      Description
ServiceName    Specifies the service for which the alarm is generated.
RoleName       Specifies the role for which the alarm is generated.
HostName       Specifies the host for which the alarm is generated.

Impact on the System

The database service is unavailable and cannot provide data import or query functions for upper-layer services, which results in service exceptions.

Possible Causes

  • The floating IP address does not exist.
  • There is no active DBServer instance.
  • The active and standby DBServer processes are abnormal.

Procedure

  1. Check whether the floating IP address exists in the cluster environment.

    a. On the MRS cluster details page, click Components.

      For MRS 2.0.1 or earlier, log in to MRS Manager and click Services.

    b. Choose DBService > Instance.
    c. Check whether the active instance exists.
      • If yes, go to 1.d.
      • If no, go to 2.a.
    d. Select the active DBServer instance and record its IP address.
    e. Log in to the host that corresponds to the recorded IP address, and run the ifconfig command to check whether the DBService floating IP address exists on the node.
      • If yes, go to 1.f.
      • If no, go to 2.a.
    f. Run the ping command against the DBService floating IP address to check whether the address can be pinged.
      • If yes, go to 1.g.
      • If no, go to 2.a.
    g. Log in to the host that corresponds to the DBService floating IP address, and run the ifconfig interface down command to delete the floating IP address. Here, interface is the name of the network interface that holds the floating IP address.
    h. Choose Components > DBService > More > Restart Service to restart DBService. Check whether DBService is restarted successfully.
      • If yes, go to 1.i.
      • If no, go to 2.a.
    i. Wait about 2 minutes and check whether the alarm is cleared from the alarm list.
      • If yes, no further action is required.
      • If no, go to 3.a.

  2. Check the status of the active DBServer instance.

    a. Select the DBServer instance whose role status is abnormal and record its IP address.
    b. On the Alarms page, check whether alarm ALM-12007 Process Fault is reported for the DBServer instance on the host that corresponds to the IP address.
      • If yes, go to 2.c.
      • If no, go to 4.
    c. Follow the procedure in ALM-12007 Process Fault to handle the alarm.
    d. Wait about 5 minutes and check whether the alarm is cleared from the alarm list.
      • If yes, no further action is required.
      • If no, go to 4.

  3. Check the status of the active and standby DBServers.

    a. Log in to the host that corresponds to the DBService floating IP address, and run the sudo su - root and su - omm commands to switch to user omm. Then run the cd ${BIGDATA_HOME}/FusionInsight/dbservice/ command to go to the DBService installation directory.
    b. Run the sh sbin/status-dbserver.sh command to view the status of the active and standby HA processes of DBService. Check whether the status can be viewed successfully.
      • If yes, go to 3.c.
      • If no, go to 4.
    c. Check whether the active and standby HA processes are normal.
      • If yes, go to 4.
      • If no, go to 3.d.
    d. Choose Components > DBService > More > Restart Service to restart DBService, and check whether DBService is restarted successfully.
      • If yes, go to 3.e.
      • If no, go to 4.
    e. Wait about 2 minutes and check whether the alarm is cleared from the alarm list.
      • If yes, no further action is required.
      • If no, go to 4.

  4. Collect fault information.

    a. On MRS Manager, choose System > Export Log.
    b. Contact the O&M personnel and send the collected log information.
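
Two of the manual checks in the procedure (looking for the floating IP address in the ifconfig output, and confirming that the status script can be run) could be scripted. The following Python sketch shows one possible approach and is not part of the product. The function names are hypothetical, the 192.0.2.x addresses in the comments are documentation placeholders, and treating a non-zero exit code as "status cannot be viewed" is an assumption, because this page does not describe the script's exit codes or output format.

```python
import re
import subprocess
from typing import List, Set, Tuple

# Matches the IPv4 address on "inet" lines in both common ifconfig output styles
# ("inet 192.0.2.10 ..." and "inet addr:192.0.2.10 ..."). Netmask and broadcast
# values on the same line are not captured, and "inet6" lines do not match.
INET_ADDRESS = re.compile(r"\binet\s+(?:addr:)?(\d{1,3}(?:\.\d{1,3}){3})")


def bound_addresses(ifconfig_output: str) -> Set[str]:
    """Return the IPv4 addresses bound to interfaces in the given ifconfig output."""
    return set(INET_ADDRESS.findall(ifconfig_output))


def has_floating_ip(ifconfig_output: str, floating_ip: str) -> bool:
    """Check whether floating_ip is bound on the node (exact match, not a prefix match)."""
    return floating_ip in bound_addresses(ifconfig_output)


def run_status_command(command: List[str], timeout_seconds: float = 60.0) -> Tuple[bool, str]:
    """Run a status command and report whether it completed successfully.

    Assumption: a missing command, a timeout, or a non-zero exit code all mean
    that the status could not be viewed.
    """
    try:
        result = subprocess.run(
            command, capture_output=True, text=True, timeout=timeout_seconds
        )
    except (OSError, subprocess.TimeoutExpired) as exc:
        return False, str(exc)
    return result.returncode == 0, result.stdout + result.stderr
```

On a DBServer host, the output of the ifconfig command could be passed to has_floating_ip together with the DBService floating IP address. From the DBService installation directory, as user omm, run_status_command(["sh", "sbin/status-dbserver.sh"]) would return the script's output for manual inspection. Deciding whether the HA processes are normal is still left to the operator.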

Related Information

N/A
