StarlingX NFVI Orchestration
Saba Touheed Mujawar 471d1001e0 Set timeout for KubeHostUpgradeControlPlaneStep to 420s
The KubeHostUpgradeControlPlaneStep timeout was originally set to
600s to give significant headroom for the control-plane upgrade.
This step was known to run long, but we had limited data, so
we set the value large. The underlying kubeadm
UpgradeManifestTimeout was 5 minutes, so any timeout larger than
300s was ineffective.

This updates KubeHostUpgradeControlPlaneStep timeout
to 420s. This is intentionally engineered to be larger than
the resultant time for sysinv code to reach completion of the
Kubernetes Upgrade control-plane step with retries and
accounting for failure.

The timeout is engineered using the following equation.
This accounts for retries, hitting kubeadm upgrade timeout
each try, and some buffer for the sysinv report callback
mechanism.

nfv_timeout = ImageDownloadTime
              + retries * (UpgradeControlPlaneTimeout + buffer)

Following are the engineered parameters:

ImageDownloadTime = 0s (images are pre-pulled before this step)
UpgradeManifestTimeout = 3 minutes
buffer = 30s
2 retries

Result:
Engineered puppet timeout for upgrade control-plane:
= UpgradeControlPlaneTimeout + buffer = 3*60s + 30s = 210s

Engineered NFV timeout:
= 0s + 2 * (180s + 30s) = 420s
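
The arithmetic above can be checked with a minimal sketch. The
function and parameter names here are hypothetical and chosen only
for illustration; they are not taken from the nfv or sysinv code.

```python
def engineered_timeouts(image_download_s, upgrade_timeout_s, buffer_s, retries):
    """Return (puppet_timeout_s, nfv_timeout_s) for the equation above.

    All names are illustrative assumptions, not identifiers from nfv/sysinv.
    """
    # Per-attempt budget: kubeadm upgrade timeout plus report-callback buffer.
    puppet_timeout_s = upgrade_timeout_s + buffer_s
    # Overall budget: image download time plus every attempt timing out.
    nfv_timeout_s = image_download_s + retries * puppet_timeout_s
    return puppet_timeout_s, nfv_timeout_s


# Engineered parameters from this commit message.
puppet_timeout, nfv_timeout = engineered_timeouts(
    image_download_s=0,        # images are pre-pulled before this step
    upgrade_timeout_s=3 * 60,  # 3 minutes
    buffer_s=30,
    retries=2,
)
print(puppet_timeout, nfv_timeout)  # 210 420
```

If any parameter changes (for example, a third retry), the same
function gives the new value to engineer the step timeout against.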

Test Plan:
PASS: Perform orchestrated k8s upgrade, manually STOP kubeadm process
      during k8s upgrade control-plane step. Check logs to verify
      puppet timeout and also verify sysinv attempts retry mechanism
      before nfv timeout.

Partial-Bug: 2056326

Change-Id: I73ab8ea7cd7fc3816372260983c4b54a02cdcc4c
Signed-off-by: Saba Touheed Mujawar <sabatouheed.mujawar@windriver.com>
2024-03-19 13:44:09 -04:00
api-ref/source NFV API to list current strategy type and state. 2024-03-11 12:03:49 +00:00
devstack Followup opendev cleanup and test jobs 2019-04-21 14:21:35 -05:00
doc Fix tox-docs failing sphinx 2022-06-03 15:20:56 -03:00
mtce-guest Update debian package versions to use git commits 2023-02-09 17:00:49 +00:00
nfv Set timeout for KubeHostUpgradeControlPlaneStep to 420s 2024-03-19 13:44:09 -04:00
nova-api-proxy Update debian package versions to use git commits 2023-02-09 17:00:49 +00:00
releasenotes Switch to newer openstackdocstheme and reno versions 2020-06-04 14:34:31 +02:00
.gitignore [Doc] OpenStack API Reference Guide 2018-09-25 14:01:33 -07:00
.gitreview OpenDev Migration Patch 2019-04-19 19:52:26 +00:00
.yamllint Cleanup tox.ini and zuul files from older releases 2023-01-19 15:42:06 +00:00
.zuul.yaml Fix github mirroring for this repo 2023-04-28 12:38:52 -04:00
CONTRIBUTORS.wrs StarlingX open source release updates 2018-05-31 07:36:51 -07:00
LICENSE StarlingX open source release updates 2018-05-31 07:36:51 -07:00
README.rst Followup opendev cleanup and test jobs 2019-04-21 14:21:35 -05:00
centos_build_layer.cfg Build layering, add layer build config file 2019-10-15 19:12:57 +08:00
centos_dev_docker_images.inc Create the stable/dev wheel and image build files 2019-03-21 10:09:02 -04:00
centos_dev_wheels.inc Add NFV wheels to tarball 2019-11-14 10:52:14 -05:00
centos_guest_image.inc Remove wrs-guest-heartbeat SDK Module 2019-04-17 16:34:14 -04:00
centos_guest_image_rt.inc Remove wrs-guest-heartbeat SDK Module 2019-04-17 16:34:14 -04:00
centos_iso_image.inc Remove wrs-server-group SDK Module from StarlingX 2019-03-27 14:01:52 -04:00
centos_pkg_dirs Remove wrs-guest-heartbeat SDK Module 2019-04-17 16:34:14 -04:00
centos_stable_docker_images.inc Create the stable/dev wheel and image build files 2019-03-21 10:09:02 -04:00
centos_stable_wheels.inc Add NFV wheels to tarball 2019-11-14 10:52:14 -05:00
debian_build_layer.cfg Add debian_build_layer.cfg file 2021-10-05 14:10:32 -04:00
debian_iso_image.inc Deactivate guest related services 2023-01-10 17:26:09 -03:00
debian_pkg_dirs Deactivate guest related services 2023-01-10 17:26:09 -03:00
debian_stable_docker_images.inc Port stx-nova-api-proxy image to stx-debian 2023-01-26 07:39:17 -03:00
debian_stable_wheels.inc Port stx-nova-api-proxy image to stx-debian 2023-01-26 07:39:17 -03:00
requirements.txt Adding py27 for nfv 2018-07-10 10:18:28 -05:00
test-requirements.txt Not require recreate of tox env when running tox 2021-04-06 09:48:36 -05:00
tox.ini Convert yaml.load to yaml.safe_load 2023-01-19 19:57:23 +00:00

README.rst

nfv

StarlingX NFVI Orchestration