metal/mtce/src
Eric MacDonald 6d0cc6a2a8 Prevent early active monitoring of compute processes in AIO
The commit shown below introduced a main loop audit that
mistakenly registers subfunction processes that are still in
the 'polling' state, waiting for
/var/run/.compute_config_complete, during unlock enable.

Doing so inadvertently changes the process's monitor FSM stage
from 'Poll' to 'Manage' before configuration is complete.

Since config is not complete, the hbsClient has not initialized
its socket interface and is unable to service active monitoring
requests. This leads to quorum failure and watchdog reboot.

commit 537935bb0c
Author: Eric MacDonald <eric.macdonald@windriver.com>
Date:   Mon Jul 9 08:36:22 2018 -0400
Reorder process restart operations to prevent pmond futex deadlock

The Fix: Don't run the audit for processes that are still in
the 'polling' state, waiting for the config complete flag file.
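
The guard described above can be sketched as follows. This is a minimal illustration in Python, not the pmond source: the names Stage, Process and audit are hypothetical, and the sketch assumes that the audit registers any unregistered process and moves it to the 'Manage' stage.

```python
from dataclasses import dataclass
from enum import Enum, auto


class Stage(Enum):
    # Hypothetical stand-ins for the monitor FSM stages named in the commit.
    POLL = auto()    # still waiting for the config complete flag file
    MANAGE = auto()  # process is under active monitoring


@dataclass
class Process:
    name: str
    stage: Stage
    registered: bool = False


def audit(processes):
    """Register unregistered processes, skipping any still in POLL."""
    for proc in processes:
        if proc.stage is Stage.POLL:
            # The fix: leave polling processes alone so they are not
            # moved to MANAGE before configuration is complete.
            continue
        if not proc.registered:
            proc.registered = True
            proc.stage = Stage.MANAGE
    return processes
```

With this guard, a process that is still polling keeps its stage and stays unregistered after the audit runs, while processes in any other stage are handled as before.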

Test Plan:

Provision AIO, verify no quorum failure and inspect logs for
correct behavior.

Change-Id: I179c78309517a34285783ee99bbb3d699915cb83
Closes-Bug: 1804318
Signed-off-by: Eric MacDonald <eric.macdonald@windriver.com>
2018-11-21 10:04:00 -05:00
..
alarm Decouple Guest-server/agent from stx-metal 2018-09-18 17:15:08 -04:00
common Fix out of band issue. In some case array or string is possible to be 2018-11-02 09:36:13 +08:00
fsmon Decouple Guest-server/agent from stx-metal 2018-09-18 17:15:08 -04:00
fsync Decouple Guest-server/agent from stx-metal 2018-09-18 17:15:08 -04:00
heartbeat Fix resource leak issue, memory not free 2018-10-19 04:37:54 +08:00
hostw Add explicate string terminator for protection 2018-11-05 14:56:36 +08:00
hwmon Add explicate string terminator for protection 2018-11-05 14:56:36 +08:00
maintenance Add explicate string terminator for protection 2018-11-05 14:56:36 +08:00
mtclog Decouple Guest-server/agent from stx-metal 2018-09-18 17:15:08 -04:00
pmon Prevent early active monitoring of compute processes in AIO 2018-11-21 10:04:00 -05:00
public Fix out of band issue. In some case array or string is possible to be 2018-11-02 09:36:13 +08:00
rmon Add explicate string terminator for protection 2018-11-05 14:56:36 +08:00
scripts Mtce: Add heartbeat cluster information for SM query 2018-10-05 22:47:17 +00:00
LICENSE Decouple Guest-server/agent from stx-metal 2018-09-18 17:15:08 -04:00
Makefile Decouple Guest-server/agent from stx-metal 2018-09-18 17:15:08 -04:00