Update events.yaml for specific alarms

The following alarms need clearer text for users.
Additional information was gathered to improve their
'Description' and 'Proposed_Repair_Action' fields.

500.200
500.210
750.002
900.002
900.003
900.004
900.009
900.103
900.203
900.303
900.503

Test plan:
Build and install Debian package.

Story: 2010143
Task: 45785

Signed-off-by: Agustin Carranza <agustin.carranza@windriver.com>
Change-Id: I55f81ffa4159284b730e2572e76dfd9867b348d5
Agustin Carranza 2022-08-10 17:08:46 -03:00
parent 6ed32029db
commit 571b0665ae
1 changed file with 13 additions and 12 deletions

@@ -1400,7 +1400,7 @@
 OR
 system.certificate.k8sRootCA
 Severity: major
-Proposed_Repair_Action: Renew certificate for the entity identified
+Proposed_Repair_Action: Check certificate expiration time. Renew certificate for the entity identified.
 Maintenance_Action:
 Inhibit_Alarms:
 Alarm_Type: operational-violation
@@ -1427,7 +1427,7 @@
 OR
 system.certificate.k8sRootCA
 Severity: critical
-Proposed_Repair_Action: Renew certificate for the entity identified
+Proposed_Repair_Action: Check certificate expiration time. Renew certificate for the entity identified.
 Maintenance_Action:
 Inhibit_Alarms:
 Alarm_Type: operational-violation
@@ -2788,7 +2788,8 @@
 Description: "Application Apply Failure"
 Entity_Instance_ID: k8s_application=<appname>
 Severity: major
-Proposed_Repair_Action: "Retry applying the application. If the issue persists, please check system inventory log for cause."
+Proposed_Repair_Action: "Retry applying the application. Check application is managed by the system application framework.
+If the issue persists, please check system inventory log for cause."
 Maintenance_Action:
 Inhibit_Alarms:
 Alarm_Type: processing-error
@@ -3083,10 +3084,10 @@
 900.002:
 Type: Alarm
-Description: Patch host install failure.
+Description: Patch host install failure. Command "sw-patch host-install" failed.
 Entity_Instance_ID: host=<hostname>
 Severity: major
-Proposed_Repair_Action: Undo patching operation.
+Proposed_Repair_Action: Undo patching operation. Check patch logs on the target host (i.e. /var/log/patching.log)
 Maintenance_Action:
 Inhibit_Alarms:
 Alarm_Type: environmental
@@ -3098,7 +3099,7 @@
 900.003:
 Type: Alarm
-Description: Obsolete patch in system.
+Description: A patch with state 'obsolete' in its metadata has been uploaded.
 Entity_Instance_ID: host=controller
 Severity: warning
 Proposed_Repair_Action: Remove and delete obsolete patches.
@@ -3113,7 +3114,7 @@
 900.004:
 Type: Alarm
-Description: Host version mismatch.
+Description: The upgrade and running software version do not match. Command host-upgrade failed.
 Entity_Instance_ID: host=<hostname>
 Severity: major
 Proposed_Repair_Action: Reinstall host to update applied load.
@@ -3188,7 +3189,7 @@
 900.009:
 Type: Alarm
-Description: Kubernetes root CA update aborted, certificates may not be fully updated.
+Description: Kubernetes root CA update aborted, certificates may not be fully updated. Command "system kube-rootca-update-abort" has been run.
 Entity_Instance_ID: host=controller
 Severity: minor
 Proposed_Repair_Action: Fully update certificates by a new root CA update.
@@ -3233,7 +3234,7 @@
 900.103:
 Type: Alarm
-Description: Software patch auto-apply failed
+Description: Software patch auto-apply failed. Command "sw-manager patch-strategy apply" failed.
 Entity_Instance_ID: orchestration=sw-patch
 Severity: critical
 Proposed_Repair_Action: Attempt to apply software patches manually; if problem persists contact next level of support
@@ -3377,7 +3378,7 @@
 900.203:
 Type: Alarm
-Description: Software upgrade auto-apply failed
+Description: Software upgrade auto-apply failed. Command "sw-manager update-strategy apply" failed
 Entity_Instance_ID: orchestration=sw-upgrade
 Severity: critical
 Proposed_Repair_Action: Attempt to apply software upgrade manually; if problem persists contact next level of support
@@ -3521,7 +3522,7 @@
 900.303:
 Type: Alarm
-Description: Firmware Update auto-apply failed
+Description: Firmware Update auto-apply failed. Command "sw-manager kube-rootca-update-strategy apply" failed.
 Entity_Instance_ID: orchestration=fw-update
 Severity: critical
 Proposed_Repair_Action: Attempt to apply firmware update manually; if problem persists contact next level of support
@@ -3809,7 +3810,7 @@
 900.503:
 Type: Alarm
-Description: Kubernetes rootca update auto-apply failed
+Description: Kubernetes rootca update auto-apply failed. Command "sw-manager kube-upgrade-strategy apply" failed.
 Entity_Instance_ID: orchestration=kube-rootca-update
 Severity: critical
 Proposed_Repair_Action: Attempt to apply kubernetes rootca update manually; if problem persists contact next level of support
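
The test plan only covers building and installing the package. As a rough complement, the sketch below shows one way to check that selected alarm IDs carry non-empty 'Description' and 'Proposed_Repair_Action' fields. It is not part of this commit. The names `SAMPLE`, `parse_events` and `missing_fields` are hypothetical, the excerpt's indentation is assumed, and the line-based parser only understands the flat `ID:` / indented `Key: value` layout seen in the diff; a real check would load events.yaml with a proper YAML parser.

```python
import re

# Hypothetical excerpt: new-side values from the diff, indentation assumed.
SAMPLE = '''\
900.002:
    Type: Alarm
    Description: Patch host install failure. Command "sw-patch host-install" failed.
    Entity_Instance_ID: host=<hostname>
    Severity: major
    Proposed_Repair_Action: Undo patching operation. Check patch logs on the target host (i.e. /var/log/patching.log)
    Maintenance_Action:
900.003:
    Type: Alarm
    Description: A patch with state 'obsolete' in its metadata has been uploaded.
    Entity_Instance_ID: host=controller
    Severity: warning
    Proposed_Repair_Action: Remove and delete obsolete patches.
'''

REQUIRED = ("Description", "Proposed_Repair_Action")
ID_RE = re.compile(r"^(\d{3}\.\d{3}):\s*$")          # e.g. "900.002:"
FIELD_RE = re.compile(r"^\s+([A-Za-z_]+):\s*(.*)$")  # e.g. "    Severity: major"


def parse_events(text):
    """Collect {alarm_id: {field: value}} from the simplified layout."""
    events, current = {}, None
    for line in text.splitlines():
        m = ID_RE.match(line)
        if m:
            current = events.setdefault(m.group(1), {})
            continue
        m = FIELD_RE.match(line)
        if m and current is not None:
            # Continuation lines of multi-line values do not match and are skipped.
            current[m.group(1)] = m.group(2).strip().strip('"')
    return events


def missing_fields(events, alarm_ids):
    """Return {alarm_id: [required fields that are absent or empty]}."""
    problems = {}
    for alarm_id in alarm_ids:
        entry = events.get(alarm_id)
        if entry is None:
            problems[alarm_id] = list(REQUIRED)
            continue
        empty = [field for field in REQUIRED if not entry.get(field)]
        if empty:
            problems[alarm_id] = empty
    return problems


if __name__ == "__main__":
    # 900.004 is not in SAMPLE, so both required fields are reported for it.
    print(missing_fields(parse_events(SAMPLE), ["900.002", "900.003", "900.004"]))
    # → {'900.004': ['Description', 'Proposed_Repair_Action']}
```

An empty result means every requested alarm ID was found with both fields populated.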