Commit Graph

93 Commits

Author SHA1 Message Date
Zuul af05b3dbe4 Merge "Add pxeboot mtcAlive alarm to fault management" 2024-04-08 15:06:12 +00:00
Eric MacDonald aef702fb01 Add pxeboot mtcAlive alarm to fault management
This update introduces a new maintenance group alarm ; 200.003

This new alarm is minor and management affecting if asserted.
It is considered management affecting for the upgrades case because
the pxeboot network is needed to upgrade a node.

The alarm represents a communication/messaging failure between the
active controller mtcAgent process and the mtcClient that runs
on each node.

Test Plan:

PASS: Verify alarm attributes
PASS: - code of 200.003
PASS: - assertion cause text
PASS: - proposed repair action text
PASS: - suppression option
PASS: - does not inhibit other alarms
PASS: - affect of assertion on upgrade healthcheck
PASS: Verify ability to assert and clear
PASS: Verify fm logging for the above assertion and clear

Story: 2010940
Task: 49789
Change-Id: I507d30213674c5b1e24fcfebe15c6a87bad74358
Signed-off-by: Eric MacDonald <eric.macdonald@windriver.com>
2024-04-08 14:25:18 +00:00
Zuul e1f59d28a5 Merge "Create deploy state out of sync alarm" 2024-04-02 11:32:30 +00:00
junfeng-li 3dd9855071 Create deploy state out of sync alarm
This new alarm is raised when the controllers deploy
state is out of sync during the deployment.

Test Plan:

PASS: alarm raised when the deploy state is out of sync
PASS: alarm cleared when the deploy state is in sync

Task: 49732
Story: 2010676
Change-Id: Ibdcc54f02c9e156b2b78313b527cd273a62425f1
Signed-off-by: junfeng-li <junfeng.li@windriver.com>
2024-03-22 13:53:04 +00:00
Andy Ning e0c1825635 Add alarm 250.004 for IPsec certs renewal
This change added alarm 250.004, "IPsec certificates renewal failed".
This alarm will be raised by the ipsec-cert-renew cron job when the
renewal failed, and will be cleared when cron job script is re-run,
either manually or by cron, after the error is fixed.

Test Plan:
PASS: Simulate a failure condition (eg, ipsec-client return non zero),
      run the cron job script, verify the IPsec renewal fails, and
      alarm 250.004 is raised.
PASS: Run the script with IPsec cert not being about to expire, verify
      the script finish successfully and alarm 250.004 is cleared.

Story: 2010940
Task: 49706
Change-Id: Ie4d3970ca32173939c1df55a2e59241ac214b2ae
Signed-off-by: Andy Ning <andy.ning@windriver.com>
2024-03-12 12:23:08 -04:00
Zuul a2fadb04dc Merge "Adding new alarm definition for node taint." 2024-02-27 17:36:54 +00:00
rakshith mr df3fffe426 New alarm 850.002 - used when k8s cluster is unreachable
Configuring new alarm 850.002 which will be used when K8s
periodic audit happens and any of the endpoint health check
fails.

Test Plan:
PASS: Performed tox test locally and packages were built
      successfully
PASS: Verify that k8s orchestrated upgrade will get blocked,
      if it is tried while alarm is set. And k8s orchestrated
      upgrade will complete, if it is tried when alarm is
      cleared.

Story: 2011037
Task: 49535

Change-Id: I335179ea98ef63d7c35c89d82328a52ab2391f5c
Signed-off-by: rakshith mr <rakshith.mr@windriver.com>
2024-02-23 18:59:03 +00:00
Vanathi.Selvaraju 14468b9d21 Adding new alarm definition for node taint.
Currently there is no alarm for node taint.
This new alarm 900.701 describes the attributes
of the node taint.

Test Plan:
PASSED: Verified the details of the alarm
using fm alarm-list.

Partial-Bug: 2046273

Change-Id: I929ddb45b75f1e4b097b84919f703d458d8fa39e
Signed-off-by: Vanathi.Selvaraju <vanathi.selvaraju@windriver.com>
2024-02-09 14:16:43 -05:00
Jagatguru Prasad Mishra 1684514c79 Add LUKS alarm in fm doc
Maintenance service raises an alarm with ID 200.016 if luks-fs-mgr
service is inactive. This change adds the description of 200.016 alarm.

Test Plan:
PASS: build-pkgs -c -p fm-doc
PASS: build-image
PASS: AIO-SX bootstrap with LUKS service status inactive. A critical
      alarm with ID 200.016 should be displayed while listing the alarm
      using 'fm alarm-list'

Story: 2010872
Task: 49125

Depends-On: https://review.opendev.org/c/starlingx/metal/+/901455

Change-Id: Iadee64bffbb37cfd94aa735f7eeb12ba0fa86fbd
Signed-off-by: Jagatguru Prasad Mishra <jagatguruprasad.mishra@windriver.com>
2023-12-14 08:21:44 -05:00
Zhang Rong(Jon) 65a8ff3d66 Add alarm id, definition for system peer and subcloud peer group
The alarm 280.004 is added and will be raised when the system peer
connection failure has been detected, and will be cleared when the
system peer connection has been restored.
The alarm 280.005 is added and will be raised when the a subcloud peer
group is being managed by a remote system with a lower priority, and
will be cleared when the subclouds belonging to the subcloud peer group
have been migrated back to current system.

Test Plan:
 PASS - Verify successful tox test and package build

Story: 2010852
Task: 48492
Change-Id: I3068676933c0446a88bd4290277456cd0962f941
Signed-off-by: Zhang Rong(Jon) <rong.zhang@windriver.com>
2023-10-13 15:56:12 +08:00
Zuul b6727e579b Merge "Change context for 400.001 and 400.002 alarms" 2023-07-24 14:08:40 +00:00
Agustin Carranza 5b94faf575 Change context for 400.001 and 400.002 alarms
400.001 and 400.002 alarms are tagged for openstack but should be
starlingx.

This change tags them to starlingx so the documentation scripts are
able to classify them correctly.

Test plan
PASS: Check the parsing scripts end successfully.

Closes-bug: 2028379

Signed-off-by: Agustin Carranza <agustin.carranza@windriver.com>
Change-Id: I8f5966d5b0a7b82198e4bc2e735fa4536a4cdd0a
2023-07-21 11:09:50 -03:00
Kyale, Eliud 901437c0fe Define kernel mismatch alarms: 100.120, 100.121
2 new alarms:

- 100.120 - Controllers running mismatched kernels
  (minor, non-management affecting)

- 100.121 - Host not running the provisioned kernel
  (major, management affecting)

Part of the kernel switchover feature

Task: 48281
Story: 2010731

Test plan:
PASS - AIO-DX - install iso and bootstrap

PASS - AIO-DX - raise and clears both alarms: 100.120, 100.121

Change-Id: Ifb2df5658071d1a2fab42737267c621fc42d7136
Signed-off-by: Kyale, Eliud <Eliud.Kyale@windriver.com>
2023-06-30 08:02:41 -04:00
Zuul dbffaed98f Merge "Remove 250.002 alarm from events.yaml file" 2023-06-20 20:17:23 +00:00
Agustin Carranza de22b91ed5 Link alarms to current documentation URL
Since the alarm documentation has been automated and the events.yaml
file is taken as source of truth for it, it is required to link
the alarms proposed repair action with a direct link to the
documentation for the users.

This change modifies the mentions of documentation to a proper link,
using Sphinx placeholder that are interpreted by the documentation
language.

Test plan
PASS:
* Build fm-doc package. Check that all parsing checks were run and
package was built successfully.

Closes-bug: 2022104

Change-Id: Iccb34e42ed80634d73cf7549e9230976579deef7
Signed-off-by: Agustin Carranza <agustin.carranza@windriver.com>
2023-06-15 16:04:12 -03:00
Agustin Carranza 063021cd70 Remove 250.002 alarm from events.yaml file
The 250.002 alarm has been deprecated long time ago.
This change deletes it from the alarm list.

Test plan:
PASS:
* Build fm-doc and fm-api packages.
* Check that all parsing checks were run and package was built
  successfully.

Depends-on: https://review.opendev.org/c/starlingx/distcloud/+/886001

Closes-bug: 2024010

Signed-off-by: Agustin Carranza <agustin.carranza@windriver.com>
Change-Id: I0180bd5addc1feae6e3e45edd24c1f50d6622e2c
2023-06-15 15:52:11 -03:00
Agustin Carranza ad5e224dd2 Remove 800.103 and modify documentation reference
Some alarms reference to "System Administration Manual" but this
document does not exist. It was changed to a generic documentation
reference.

The 800.103 alarm has been deprecated so it is deleted from the
events.yaml file.

Test plan
Pass:
* Build fm-doc package. Check that all parsing checks were run and
package was built successfully.

Closes-bug: 2022104

Signed-off-by: Agustin Carranza <agustin.carranza@windriver.com>
Change-Id: I4723d05e77983796a0f64c7242f5c2bcf4699763
2023-06-01 16:38:54 -03:00
Zuul 6172b8ee41 Merge "Add System Config Update orch alarms and events" 2023-05-23 21:55:36 +00:00
Agustin Carranza 6438565a4d Add parsing check when Context field is empty
This change adds a parsing check to ERROR if Context field is Empty.
Until now there had not been a requirement of non empty fields, so in
case this is needed in the future for other key/values, a collection is
created.

Test plan
PASS: * Add/modify an alarm/log in events.yaml file with Context field
        set to <Empty>.
      * Run the checkEventYaml script and check it fails.
PASS: * Check that all the events in events.yaml file have the Context
        field set to a non empty value.
      * Run the checkEventYaml script and check it ends successfully.

Closes-bug: 2020381

Signed-off-by: Agustin Carranza <agustin.carranza@windriver.com>
Change-Id: Ia267886dd49099525751165975fb5d291c0c6f82
2023-05-22 15:07:56 -03:00
Yuxing Jiang 04d6b31d95 Add System Config Update orch alarms and events
System config update alarms are 900.6xx series

The new alarms are originated by a new type of vim strategy
orchestrating configuration update.

The new alarms are similar in numbering and wording as the
kube upgrade auto apply 900.4xx series alarms and logs.

System config update in-progress alarm is 900.010.
System config update aborted alarm is 900.011.

Story: 2010719
Task: 47947

Change-Id: Ieb6e68adf359ac7b0489d15bb33cb4b4a9f3ef3f
Signed-off-by: Yuxing Jiang <Yuxing.Jiang@windriver.com>
2023-05-19 16:57:16 -04:00
Zuul 82bcc2d0a5 Merge "Documentation is missing 900.007 alarm" 2023-05-11 20:31:56 +00:00
Agustin Carranza 888463cf06 Documentation is missing 900.007 alarm
Product Documentation is missing the alarm 900.007 'Kubernetes upgrade
in progress.'
That alarm has the Context field set to none. In order to be included
in stx documentation, it has to be set to Context: starlingx.

Test plan:
PASS: Run documentation generating scripts and check the alarm is now
      included.

Closes-bug: 2019146

Signed-off-by: Agustin Carranza <agustin.carranza@windriver.com>
Change-Id: I4d4867e5299e3fb1eb37c9bcd3e53447d4f08ba5
2023-05-10 15:46:33 -03:00
Enzo Candotti 3ef3df74db Update 260.002 alarm to be non-mgmt-affecting
This commit is intended to update the 260.002 alarm. As the 'severity'
is set to 'minor', it is desired to classify it as
non-management-affecting by adjusting its Management_Affecting_Severity
value to 'none'.

Test Plan:
PASS: Build and install Debian package.

Story: 2010719
Task: 47938

Change-Id: Ie228191ebdda5f2651dab1309b929ae06bc1f7f6
Signed-off-by: Enzo Candotti <enzo.candotti@windriver.com>
2023-05-09 09:34:42 -03:00
Enzo Candotti a891bdcfd0 Add alarm id for resources out of sync
This commit adds a new alarm id and definition for resources
that has INSYNC=False.

The alarm will be raised when a resource is not
synchronized during a process of update. It will be cleared when
the resource is synchronized again.

Test Plan:
 - Verify successful tox test and package build
 - Verify the alarm can be raised using FmClientCli

Story: 2010719
Task: 47910

Change-Id: I24a976ed4beaa8248df25fd97eeee27f5754b969
Signed-off-by: Enzo Candotti <enzo.candotti@windriver.com>
2023-05-05 19:47:38 +00:00
Agustin Carranza dad8caed91 Fix Context value for some alarms
Some documentation generating scripts were introduced in order to avoid
manual intervention every time an alarm/log is changed/added/removed.
Those scripts required a way to know where the alarm/log belongs to.
For that requirement, the field Context was introduced in previous
commits. During that development, it was taken the current
classification at that time in the docs as source of truth, but it was
outdated.

This commits modifies the values that were detected as wrong/outdated.
The scripts also require the value 'none' in the Context field for when
an alarm/log should not be included in the documentation but still be
defined in the events.yaml file. So the Context value is updated for
that case too.

Context incorrectly tagged as openstack and changed to starlingx:
* 900.006

Context incorrectly tagged as starlingx and changed to openstack:
* 100.105
* 100.112
* 100.113
* 300.001
* 300.002

Closes-bug: 2012981

Test plan
PASS: Since the Context field does not have impact in functionality,
      build and install fm-doc package successfully.
      Check the file in the filesystem contains this change.
PASS: Trigger random alarms and check FM functionality.

Signed-off-by: Agustin Carranza <agustin.carranza@windriver.com>
Change-Id: I16f858bbb712349f08b2ceca33152e365b0ed733
2023-04-10 15:25:11 -03:00
Zuul a4601e8026 Merge "Add alarm for Restore in progress" 2023-03-27 14:49:30 +00:00
Joshua Kraitberg 3b430eb604 Add alarm for Restore in progress
Currently, there is no alarm for Restore in progress.
Because of this, the system is shown as healthy,
before restore has been completed.

This new alarm will prevent the system from being healthy
until restore has properly been completed.

TEST PLAN
PASS: On any available system, the following commands can
be triggered at anytime:
* Run "system restore-start" to trigger alarm
* Run "system restore-complete" to clear alarm

Story: 2010117
Task: 47689
Signed-off-by: Joshua Kraitberg <joshua.kraitberg@windriver.com>
Change-Id: I292b5c8083c08b68ac757fe5a650989178eb819f
2023-03-22 10:43:28 -04:00
Zuul 1a61473b14 Merge "Add ceph commands in the 800 series alarm document" 2023-03-17 13:32:25 +00:00
Agustin Carranza 0e1321913b Add ceph commands in the 800 series alarm document
When a 800-Series alarm occurs, users refer to the documentation to
know what kind of error is shown. But sometimes that is not enough
information.
The output of some commands can be useful information and could
save time when solving issues related to the storage alarms.

Closes-bug: 2004601

Test plan
PASS: * Build fm packages and deploy an ISO containing new fm
        packages.
      * Trigger alarms that were modified by this commit,
        (e.g. shutdown a controller).
      * Run fm alarm-list --uuid and copy the uuid of a 800-series
        alarm.
      * Run fm alarm-show <uuid> and check that the field
        has changed.

Signed-off-by: Agustin Carranza <agustin.carranza@windriver.com>
Change-Id: I94e2719b55b4fc14b692439526b5b47204460ac7
2023-03-13 14:13:44 -03:00
Al Bailey bd8857357b Run checkEventYaml as part of zuul linter job
The checkEventYaml script verifies if all contents
are properly populated for the events.yaml file.

This change ensures that check is done by zuul, rather
than during the build.

yaml.load after version 5.1 requires a Loader argument.
The yaml.load in fm-doc are now updated to use safe_load
intead

Test Plan:
  PASS: tox -e linters
  PASS: remove 'context' field from an alarm and observe
  that tox -e linters reports a failure.
  PASS: build-pkgs -p fm-doc

Story: 2010531
Task: 47549
Signed-off-by: Al Bailey <al.bailey@windriver.com>
Change-Id: I369ffe4c74fcaf5fe4a916822fed18a78ead8ff8
2023-02-27 16:16:01 +00:00
Zuul c7e47234e9 Merge "Update debian package versions to use git commits" 2023-02-13 16:51:45 +00:00
Vanathi.Selvaraju 447ed111ae Host compute service failure alarm removal
Removal of stale alarm 270.001(Host compute service failure)
is raised by the vim. This might be an old reference to nova.
It’s likely not in use since stx.

Test Plan:
PASS: Verify with a load without the changes (removal of alarm)
and the event log in platform.log shows an entry for 270.001 alarm.
PASS: Verify with a load with changes of alarm removal and
the event log in platform.log does not show an entry for 270.001 alarm.

Closes-Bug: 2004744

Change-Id: I47a9f5cede2cfade4a16c63a2dc1bcfd563e88cf
Signed-off-by: Vanathi.Selvaraju <vanathi.selvaraju@windriver.com>
2023-02-10 09:32:01 -05:00
Al Bailey 60ab3f6b45 Update debian package versions to use git commits
The Debian packaging has been changed to reflect all the
git commits under the directory, and not just the commits
to the metadata folder.

This ensures that any new code submissions under those
directories will increment the versions.

All packages have a higher version than before the change.

Test Plan:
  PASS: build-pkgs -c -p fm-api
  PASS: build-pkgs -c -p fm-common
  PASS: build-pkgs -c -p fm-doc
  PASS: build-pkgs -c -p fm-mgr
  PASS: build-pkgs -c -p fm-rest-api
  PASS: build-pkgs -c -p python-fmclient

Story: 2010550
Task: 47226

Signed-off-by: Al Bailey <al.bailey@windriver.com>
Change-Id: I65e881ba96512d2eaba25c44332d5ae82efea502
2023-02-09 18:06:57 +00:00
Agustin Carranza d161fe5922 Extend events.yaml schema with usage context field
The events.yaml file contains every alarm and log used by platform and
openstack. There is no way to know which one relates to one or
the other.
In order to know that, it is required an additional field as part of
each record to differential between platform and openstack.

Story: 2010143
Task: 46723

Test plan
PASS: Build the fm-api and fm-doc packages.
      Install fm-api first and then fm-doc.
      No errors are found during build and installation process.

Signed-off-by: Agustin Carranza <agustin.carranza@windriver.com>
Change-Id: I8598afc77d27d107c4f9a108dd46b2ebc79b30a1
2022-11-09 16:14:21 -03:00
Girish Subramanya efa09aa3db Alarm Hostname controller function has in-service failure reported
When compute services remain healthy:
 - listing alarms shall not refer to the below Obsoleted alarm
 - 200.012 alarm hostname controller function has an in-service failure

This update deletes definition of the obsoleted alarm and any references
200.012 is removed in events.yaml file
Also updated any reference to this alarm definition.
Need to also raise a Bug to track the Doc change.

Test Plan:

Verify on a Standard configuration no alarms are listed for hostname
controller in-service failure
Code (removal) changes exercised with fix prior to ansible bootstrap
and host-unlock and verify no unexpected alarms
Regression:

There is no need to test the alarm referred here as they are obsolete

Closes-Bug: 1991531

Signed-off-by: Girish Subramanya <girish.subramanya@windriver.com>

Change-Id: I255af68155c5392ea42244b931516f742fa838c3
2022-10-05 10:30:49 -04:00
Tee Ngo 80d6819635 Add alarm id, definition for subcloud backup
This commit adds a new alarm id and definition for subcloud backup.
The alarm will be raised and cleared by dcmanager upon subcloud backup
failure and successful retry respectively.

Test Plan:
 - Verify successful tox test and package build

Story: 2010116
Task: 46162
Change-Id: I9a34a827e484bb691c8ce0da63e1a7d26735c289
Signed-off-by: Tee Ngo <Tee.Ngo@windriver.com>
2022-09-01 09:07:43 -04:00
Agustin Carranza 571b0665ae Update events.yaml for specific alarms
The following alarms need to be clearer for users.
Some information was gathered in order to improve the
'description' and 'Proposed repair action' fields.

500.200
500.210
750.002
900.002
900.003
900.004
900.009
900.103
900.203
900.303
900.503

Test plan:
Build and install Debian package.

Story: 2010143
Task: 45785

Signed-off-by: Agustin Carranza <agustin.carranza@windriver.com>
Change-Id: I55f81ffa4159284b730e2572e76dfd9867b348d5
2022-08-11 04:26:26 +00:00
Agustin Carranza 6ed32029db Revert "Update events.yaml for specific alarms"
This reverts commit baead557fb.

Reason for revert: Change breaks the build. 270.001 alarm has to remain defined because it is still being referenced.

Change-Id: I136928c90634e05e3026dddc9443cc2c59203320
2022-08-08 17:04:35 +00:00
Agustin Carranza baead557fb Update events.yaml for specific alarms
The following alarms need to be clearer for users. Some information
was gathered in order to improve the 'description' and 'Proposed
repair action' fields.

Alarm that has been deprecated:
270.001

The list of alarms that were modified is the following:
500.210
500.200
750.002
750.006
800.003
800.102
900.002
900.003
900.004
900.009
900.103
900.203
900.303
900.503

Test plan:
There is no need to test the alarms affected by the changes.

Story: 2010143
Task: 45785

Signed-off-by: Agustin Carranza <agustin.carranza@windriver.com>
Change-Id: I57b86548a36da66119cb04779ce1f9147254316c
2022-08-05 14:26:20 +00:00
Gabriel de Araújo Cabral 71cef583e7 Removing alarm 800.102
800.102 alarm is not necessary anymore so it must be removed.

Test Plan: No test needed

Story: 2010104
Task: 45668

Signed-off-by: Gabriel de Araújo Cabral <gabriel.cabral@windriver.com>
Change-Id: I6f47dadb28c78781e95f479739a4a846df8431fc
2022-06-22 10:49:17 -04:00
Al Bailey 0b0d33fd72 Correct alarm IDs in the events.yaml for patching
The 900.002 and 900.003 alarm IDs were reversed.

Now corrected to be what the constants and code
are generating.

 - 900.002 is Patch host install failure
 - 900.003 is Obsolete patch in system

Story: 2009969
Task: 45582
Signed-off-by: Al Bailey <al.bailey@windriver.com>
Change-Id: I0d13cfce39de15b34265012d1e26fd7fedcee81d
2022-06-09 13:29:21 +00:00
Zuul f1c3e852cb Merge "Remove unused constant" 2022-02-18 15:28:27 +00:00
Teresa Ho 6aeafb674d [PTP SyncE] Update PTP alarm description
This commit updates the description and entity instance for the
PTP alarm for SyncE.

Story: 2009130
Task: 44497

Change-Id: Iefae1e75003ff2a45501e45d850f9556256d580f
Signed-off-by: Teresa Ho <teresa.ho@windriver.com>
2022-02-15 08:54:54 -05:00
Vinicius Lopes da Silva 45653becf4 Remove unused constant
Constant FM_ALARM_ID_STORAGE_CEPH_FREE_SPACE was being used only in
legacy code by ceph-manager. The code was updated to remove this
unused constant and its respective alarm.

Test plan:
    PASS: Verify that no alarms are raised related to quotas.
    PASS: Confirm that no quota messages are present in the logs.

Depends-On: https://review.opendev.org/c/starlingx/config/+/827545
Depends-On: https://review.opendev.org/c/starlingx/utilities/+/820933

Closes-Bug: 1959894
Signed-off-by: Daian Cardoso Sganderlla <Daian.CardosoSganderlla@windriver.com>

Change-Id: I606d0e6dca102189a26c8cd4d587a1e772e610f0
2022-02-08 12:43:35 +00:00
Ramon Gazoni Lacerda 707340e521 Add debian package for fm-doc
Add debian packaging infrastructure for fm-doc to build
a debian package.

Test Plan:
PASS: Package installed and ISO built successfully

Story: 2009101
Task: 44088

Signed-off-by: Ramon Gazoni Lacerda <Ramon.GazoniLacerda@windriver.com>
Change-Id: If98aa5c561012e31ab67589e7d62486c58be8e4c
2022-01-20 13:20:18 +00:00
Bin Qian 6105f83a85 Add new alarm for FD limit reached
Add a new alarm for open FD approaching limit (major) or
limit is reached (critical).

Partial-bug: 1952126
Change-Id: Ifaece0e1d7a335f980cfebc3a591a90edbc35742
Signed-off-by: Bin Qian <bin.qian@windriver.com>
2021-11-24 10:43:53 -05:00
Sabeel Ansari bcee98f7db Fix quotes in new alarm doc
Recent commit to add Expiring-Soon and Expired alarm
IDs and description resulted in failure in Openstack
app with following error.

UnicodeEncodeError: 'ascii' codec can't encode character
u'\u2018' in position 12: ordinal not in range(128)

The problem was traced to incorrect quotation marks
in Description field. Tested with a fix of the quotes
and fm-rest-api pod was able to come up successfully.

Story: 2008946
Task: 43049

Signed-off-by: Sabeel Ansari <Sabeel.Ansari@windriver.com>
Change-Id: Iacdf8c18daf412b0ac6d9d8aa89de1c4026356b7
2021-08-31 14:12:00 -04:00
Zuul c5012a7127 Merge "Added an alarm for rootca update abort" 2021-08-30 19:03:15 +00:00
Sabeel Ansari cdf8dc6f6e Add alarm IDs for expiring-soon & expired certificates
Added two new alarm IDs and details for expiring-soon and
expired certificates.

Story: 2008946
Task: 43049

Signed-off-by: Sabeel Ansari <Sabeel.Ansari@windriver.com>
Change-Id: I61cb835c38473cef61238de8cf4a21361e123da4
2021-08-24 11:17:23 -04:00
Andy Ning 5fbfd8b066 Added an alarm for rootca update abort
This change added a KUBE_ROOTCA_UPDATE_ABORTED alarm for kube
rootca update abort.

Story: 2008675
Task: 43068
Signed-off-by: Andy Ning <andy.ning@windriver.com>
Change-Id: I3096b57697d2c6a548f5c54002705cb1797ae97b
2021-08-19 23:33:06 -04:00