StarlingX High Availability/Process Monitoring/Service Management
Go to file
Steven Webster db1eea124d Add admin network support to SM
Add SM support for the DC admin network

This commit adds SM support for the DC admin network.

The admin network is intended to be used between a subcloud
and system controller. Because the (existing) management network
is so embedded in other parts of the StarlingX system, it makes
it prohibitively hard to re-configure this network after initial
installation.  The admin network is intended to be isolated from
the management network, allowing re-configuration of the network
parameters in the case that the physical network between subcloud
and system controller has been changed.

In the case of admin network usage, the management network still
exist but is a private network in the context of a subcloud.

This specific commit provides for admin-ip and admin-interface
services to be added to the SM database and be recognized in
processing similar to the management, cluster-host, oam, etc
networks.

Since there is a requirement for the admin IP subnet information
to be allowed to change at runtime, in-service updating of SM
information relating to the admin-ip service (floating IP), as
well as unicast heartbeating between peers is also added in this
commit.

Testing:

AIO-SX:
    - admin-ip service is enabled when the admin network is
      created.
    - admin-ip service is not enabled when the admin network is
      not created.
    - floating-ip is updated on the admin interface when admin
      addr-pool information is changed.
AIO-DX:
    - admin-ip service is enabled when the admin network is
      created.
    - admin-ip service is not enabled when the admin network
      is not created.
    - floating-ip is updated on the active-controller when the
      admin addr-pool information is changed.
    - When a peer admin interface is down, an alarm is raised.
    - When a peer admin IP is not correct (changed), an alarm
      is raised.
    - Swact between controllers.
    - Inactive controller admin interface goes down
	Result: A 400.005 major communication loss fault is generated
               for the inactive controller entity
    - Inactive controller admin interface comes back up
        Result: The fault is cleared
    - Inactive controller admin IP address is removed/changed
        Result: Two 400.005 major communication loss faults are
                generated for both controller entities
    - Inactive controller admin node IP address is re-applied
        Result: The faults are cleared
    - Active admin interface goes down
        Result: A 400.005 major communication loss fault is generated
                for the inactive controller entity.  A swact is not
                issued.
    - Active admin interface comes back up
        Result: The fault is cleared
    - Active admin node IP address is removed/changed
        Result: Two 400.005 major communication loss faults are
                generated for both controller entities.
                A swact is not issued.
    - Active admin floating IP address is removed/changed
        Result: A 400.001 critical admin-services / admin-ip alarm
                is raised.
                A swact occurs.
                The floating admin IP is applied to the newly active
                controller. Alarms are cleared.
    - After the above test, the newly active controller swacts back
      to the previously active controller.
        Result: No alarms.
                The floating IP is applied to the newly active
		controller.
    - The cable for the management interface on the active controller
      is pulled
        Result: A swact occurs
    - The cable for the OAM interface on the active controller
      is pulled
        Result: A swact occurs
    - The cable for the Admin interface on the active controller
      is pulled
        Result: A swact occurs. 400.005 alarms are raised.
    - The mgmt, cluster-host, oam interfaces are all brought down/up at
      the same time.  The admin interface is also brought down,
      but not brought back up
      back up.
        Result: A swact occurs, with multiple controller-services
	related to the mgmt interface being in degraded state.

Story: 2010319
Task: 47278

Signed-off-by: Steven Webster <steven.webster@windriver.com>
Change-Id: I65df52600f4d5c499dceed32739cab414d36847a
2023-02-14 15:14:28 +00:00
api-ref/source Switch to newer openstackdocstheme and reno versions 2020-06-04 14:27:03 +02:00
devstack Remove sm-watchdog service since NFS is now stable 2022-08-19 19:57:43 +00:00
doc Fix tox-docs failing sphinx 2022-05-31 14:18:44 +00:00
releasenotes Switch to newer openstackdocstheme and reno versions 2020-06-04 14:27:03 +02:00
service-mgmt Add admin network support to SM 2023-02-14 15:14:28 +00:00
service-mgmt-api Merge "Debian: Remove conf files from etc-pmon.d" 2022-09-30 19:10:32 +00:00
service-mgmt-client debian: Simplify sm-client packaging 2022-03-08 19:54:43 +00:00
service-mgmt-tools Add admin network support to SM 2023-02-14 15:14:28 +00:00
stx-ocf-scripts Add debian package for stx-ocf-scripts 2021-11-08 14:10:07 +00:00
.gitignore [Doc] OpenStack API Reference Guide 2018-09-27 10:14:44 -07:00
.gitreview OpenDev Migration Patch 2019-04-19 19:52:24 +00:00
.zuul.yaml Update zuul jobs from python2 to python3 2023-02-07 20:20:57 +00:00
CONTRIBUTORS.wrs StarlingX open source release updates 2018-05-31 07:36:26 -07:00
LICENSE StarlingX open source release updates 2018-05-31 07:36:26 -07:00
README.rst Followup opendev cleanup and test jobs 2019-04-21 14:31:33 -05:00
bindep.txt Update zuul jobs from python2 to python3 2023-02-07 20:20:57 +00:00
centos_build_layer.cfg Build layering, add layer build config file 2019-10-21 10:53:26 +08:00
centos_dev_wheels.inc Add sm-client-wheels to tarball 2019-11-14 10:55:52 -05:00
centos_iso_image.inc Config file changes to add 'stx-ocf-scripts ' after relocation from 'stx-upstream' 2019-09-04 15:59:21 -04:00
centos_pkg_dirs Remove version from sm folder 2019-09-26 14:11:31 -05:00
centos_stable_wheels.inc Add sm-client-wheels to tarball 2019-11-14 10:55:52 -05:00
debian_build_layer.cfg Add debian_build_layer.cfg file 2021-10-05 14:33:19 -04:00
debian_iso_image.inc Debian: fa: update debian_iso_image.inc 2022-11-16 12:01:26 +08:00
debian_pkg_dirs Add debian_pkg_dirs for ha 2021-10-27 18:59:20 +00:00
github_sync.trigger Verify upload to GitHub mirror with a new commit 2020-02-04 11:54:18 -05:00
pylint.rc Update zuul jobs from python2 to python3 2023-02-07 20:20:57 +00:00
test-requirements.txt Fix zuul errors due to changes in dependencies 2021-04-26 11:41:59 -04:00
tox.ini Update zuul jobs from python2 to python3 2023-02-07 20:20:57 +00:00

README.rst

ha

StarlingX Service Management