StarlingX System Configuration Management
Go to file
Don Penney c138575062 Ceph initialization on AIO is done only in 'controller' manifests
On AIO deployments puppet is run twice with two different manifests:
1. 'controller': to configure controller services
2. 'worker': to configure worker services.

Ceph is configured when 'controller' manifests are applied, there is
no need to run them a second time, when 'worker' set is applied.

Commit adds new puppet classes to encapsulate ceph configuration
based on node personality and adds a check to not apply it a 2nd
time on controllers.

If the ceph manifests are executed a second time then we get into
a racing issue between SM's process monitoring and 'worker' puppet
manifests triggering a restart of ceph-mon as part of reconfiguration

After a reboot on AIO, SM takes control of ceph-mon monitoring
after 'controller' puppet manifests finish applying. As part of this,
SM monitors processes death notification and gets the pid from the
.pid file. And periodically executes '/etc/init.d/ceph status
mon.controller' for a more advanced monitoring.

When the 'worker' manifests are executed, they trigger a restart
of ceph-mon through /etc/init.d/ceph restart that has two steps: 'stop'
in which ceph-mon is stopped, and 'start' in which it is restarted.

In the first step, stopping ceph-mon leads to the death of ceph-mon
process and removal of its PID file. This is promptly detected by
SM which immediately triggers a start of ceph-mon that creates a
new pid file. Problem is that ceph-mon was already in a restart,
and at the end of the 'stop' step the init script cleans up the
new pid file instead of the old.

This leads to controllers swacting a couple of times before the system
gets rid of the rogue process.

Change-Id: I2a0df3bab716a553e71e322e1515bee2bb2f700d
Co-authored-by: Ovidiu Poncea <ovidiu.poncea@windriver.com>
Story: 2002844
Task: 29214
Signed-off-by: Ovidiu Poncea <ovidiu.poncea@windriver.com>
2019-02-10 21:22:41 +02:00
api-ref/source Create DataNetworks modelling in System Configuration 2019-02-04 11:28:21 -05:00
config-gate Change compute node to worker node personality 2018-12-13 14:15:55 -05:00
configutilities Enable pylint zuul check for configutilities 2019-02-01 14:40:50 -06:00
controllerconfig Fix cluster host network when mgmt is tagged 2019-02-08 09:25:40 -05:00
devstack [Devstack] rename sysinv-agent and check sysinv services 2019-01-31 08:42:53 +08:00
doc Clean up and standardize landing pages 2018-12-27 16:49:27 -08:00
kubernetes Merge "Move nova static configs to Armada manifest" 2019-02-08 21:28:32 +00:00
playbooks/tox-puppet-lint Add puppet-lint support 2018-12-24 13:50:20 -06:00
puppet-manifests Ceph initialization on AIO is done only in 'controller' manifests 2019-02-10 21:22:41 +02:00
puppet-modules-wrs Revert "Stardandize makefiles for puppet-modules-wrs" 2019-02-07 13:14:41 +00:00
releasenotes Update config for release notes to include project name 2019-02-05 14:03:49 -08:00
storageconfig Configurable Host HTTP/HTTPS Port Binding 2019-02-06 12:47:00 -06:00
sysinv Ceph initialization on AIO is done only in 'controller' manifests 2019-02-10 21:22:41 +02:00
tmp/patch-scripts/EXAMPLE_SYSINV/scripts StarlingX open source release updates 2018-05-31 07:35:52 -07:00
worker-utils Remove requires kubelet from affine tasks service 2019-02-06 21:15:18 +00:00
workerconfig Configurable Host HTTP/HTTPS Port Binding 2019-02-06 12:47:00 -06:00
.gitignore Fixing up tox -e cover for stx-config 2018-12-20 08:53:15 -06:00
.gitreview Set .gitreview back to use review.openstack.org 2018-09-04 21:12:34 +00:00
.yamllint clear yamllint errors under stx-config 2018-09-12 21:11:57 +08:00
.zuul.yaml Enable pylint zuul check for configutilities 2019-02-01 14:40:50 -06:00
CONTRIBUTORS.wrs StarlingX open source release updates 2018-05-31 07:35:52 -07:00
LICENSE StarlingX open source release updates 2018-05-31 07:35:52 -07:00
README.rst StarlingX open source release updates 2018-05-31 07:35:52 -07:00
centos_helm.inc Add stx-openstack-helm to the containers build 2018-11-14 11:55:29 -05:00
centos_iso_image.inc Change compute node to worker node personality 2018-12-13 14:15:55 -05:00
centos_pike_wheels.inc Rename centos_wheels.inc to centos_pike_wheels.inc 2018-11-14 15:05:28 -06:00
centos_pkg_dirs Build stx-openstack-helm without requiring networking 2019-01-08 13:13:13 -06:00
centos_pkg_dirs_containers Add stx-openstack-helm to the containers build 2018-11-14 11:55:29 -05:00
test-requirements.txt Properly enable hacking checks for flake8 2018-12-18 11:28:56 -06:00
tox.ini Add puppet-lint support 2018-12-24 13:50:20 -06:00

README.rst

stx-config

StarlingX Configuration Management