ha/service-mgmt/sm-db/database
Bin Qian a85ffc695e Shorten rabbit failure recovery delay
In rare cases, when system running slowly with significant scheduling
delay, rabbit disable action timeout continually. As final resort sm
reboots the impacted controller for recovery after failure count reaches
MAX_TRANSITION_FAILURES. As rabbit service disable timeout is set to 60
seconds, this result a significant delay before reboot for recovery.

This change updates MAX_TRANSITION_FAILURES of rabbit service from
16 to 5 to reduce the delay of recovery of rabbit failure.

TCs passed:
    Install a DX system
    Observed service group recovery escalated to reboot after 5 forced
    rabbit disable failure.

Closes-bug: 2016168
Signed-off-by: Bin Qian <bin.qian@windriver.com>
Change-Id: I660a64f0e78b6564456eb26245b672d2549f9a3b
2023-05-09 03:48:48 +00:00
..
Makefile Remove version from sm-db folder 2019-09-26 14:08:15 -05:00
README Remove version from sm-db folder 2019-09-26 14:08:15 -05:00
create_sm_db.sql Shorten rabbit failure recovery delay 2023-05-09 03:48:48 +00:00
create_sm_hb_db.sql Remove version from sm-db folder 2019-09-26 14:08:15 -05:00
sm-patch.sql Remove version from sm-db folder 2019-09-26 14:08:15 -05:00

README

The SM database is generated by the corresponding SQL scripts:
create_sm_db.sql -> sm.db
create_sm_hb_db.sql -> sm.db.hb

Instructions:
1. Update the corresponding SQL script i.e, create_sm_db.sql or create_sm_hb_db.sql:
    Add proper SQL statement(s) (insert, update, delete) to the sql file.