Define kernel mismatch alarms: 100.120, 100.121

2 new alarms:

- 100.120 - Controllers running mismatched kernels
  (minor, non-management affecting)

- 100.121 - Host not running the provisioned kernel
  (major, management affecting)

Part of the kernel switchover feature

Task: 48281
Story: 2010731

Test plan:
PASS - AIO-DX - install iso and bootstrap

PASS - AIO-DX - raise and clears both alarms: 100.120, 100.121

Change-Id: Ifb2df5658071d1a2fab42737267c621fc42d7136
Signed-off-by: Kyale, Eliud <Eliud.Kyale@windriver.com>
This commit is contained in:
Kyale, Eliud 2023-06-24 09:47:41 -04:00
parent dbffaed98f
commit 901437c0fe
3 changed files with 44 additions and 1 deletions

View File

@ -68,6 +68,8 @@ ALARM_GROUP_SW_MGMT = "900"
# General Alarm id
FM_ALARM_ID_FS_USAGE = ALARM_GROUP_GENERAL + ".104"
FM_ALARM_ID_IMAGE_CONVERSION = ALARM_GROUP_GENERAL + ".105"
FM_ALARM_ID_CONTROLLERS_KERNEL_MISMATCH = ALARM_GROUP_GENERAL + ".120"
FM_ALARM_ID_PROVISIONED_KERNEL_MISMATCH = ALARM_GROUP_GENERAL + ".121"
# Maintenance Log id
FM_LOG_ID_HOST_DISCOVERED = ALARM_GROUP_MAINTENANCE + ".020"
@ -618,3 +620,13 @@ FM_SERVICE_AFFECTING_INDEX = 11
FM_SUPPRESSION_INDEX = 12
FM_INHIBIT_ALARMS_INDEX = 13
MAX_ALARM_ATTRIBUTES = 14
# Proposed Repair actions
FM_PRA_CONTROLLERS_KERNEL_MISMATCH = ("Modify controllers using "
"'system host-kernel-modify' "
"so that both are running the "
"desired 'standard' or "
"'lowlatency' kernel.")
FM_PRA_PROVISIONED_KERNEL_MISMATCH = ("Retry 'system host-kernel-modify' "
"and if condition persists, "
"contact next level of support.")

View File

@ -493,6 +493,38 @@
Degrade_Affecting_Severity: none
Context: starlingx
100.120:
Type: Alarm
Description: Controllers running mismatched kernels.
Entity_Instance_ID: host=<hostname>.kernel=<kernel>
Severity: minor
Proposed_Repair_Action: "Modify controllers using 'system host-kernel-modify' so that both are running the desired 'standard' or 'lowlatency' kernel."
Maintenance_Action: none
Inhibit_Alarms: False
Alarm_Type: equipment
Probable_Cause: unspecified-reason
Service_Affecting: False
Suppression: False
Management_Affecting_Severity: none
Degrade_Affecting_Severity: none
Context: starlingx
100.121:
Type: Alarm
Description: Host not running the provisioned kernel.
Entity_Instance_ID: host=<hostname>.kernel=<kernel>
Severity: major
Proposed_Repair_Action: "Retry 'system host-kernel-modify' and if condition persists, contact next level of support."
Maintenance_Action: none
Inhibit_Alarms: False
Alarm_Type: equipment
Probable_Cause: unspecified-reason
Service_Affecting: False
Suppression: False
Management_Affecting_Severity: major
Degrade_Affecting_Severity: none
Context: starlingx
100.150:
Type: Alarm
Description:

View File

@ -13,4 +13,3 @@
# fm_api:event_suppression:get: role:reader
# fm_api:event_suppression:modify: rule:admin_in_system_projects