StarlingX Integration and packaging
Eric MacDonald e8c9676d98 Add network interface monitoring plugin to collectd
This update introduces interface monitoring for oam,
mgmt and infra networks as a collectd plugin.

The interface plugin runs and queries the new maintenance
Link Monitor daemon for Link Model and Information every
10 seconds.

The plugin then manages port and interface alarms based on the
link model, similar to how rmon did in the past.

Severity: Interface and Port levels

Alarm Level  Minor        Major              Critical
-----------  -----  ---------------------    ----------------------------
Interface     N/A   One of lag pair is Up    All Interface ports are Down
     Port     N/A   Physical Link is Down    N/A
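
A minimal sketch of the severity table above, assuming each port's link
state is available as a boolean. The function names and the severity
strings are illustrative only and are not taken from the plugin:

```python
from typing import List, Optional


def port_severity(link_up: bool) -> Optional[str]:
    """Port level: major when the physical link is down, otherwise no alarm."""
    return None if link_up else "major"


def interface_severity(ports_up: List[bool]) -> Optional[str]:
    """Interface level: critical when all ports are down, major when only
    some ports (e.g. one of a lag pair) are up, otherwise no alarm."""
    if not ports_up:
        # Assumption: an interface with no known ports raises no alarm.
        return None
    up_count = sum(ports_up)
    if up_count == 0:
        return "critical"
    if up_count < len(ports_up):
        return "major"
    return None
```

With this sketch a single-port interface whose link is down maps to a
critical interface alarm, since all of its ports are down.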

Degrade support for interface monitoring is added to the mtce
degrade notifier. Any link down condition results in a host
degrade condition, as it did in rmon.

Sample Data: represented as % of total links Up for that network interface
100 or 100% percent used - all links of interface are up.
 50 or  50% percent used - one of lag pair is Up and the other is Down
  0 or   0% percent used - all ports for that network are Down
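
The sample value described above can be sketched as follows. The
function name and the boolean port list are assumptions made for
illustration, not the plugin's actual interface:

```python
from typing import List


def percent_links_up(ports_up: List[bool]) -> float:
    """Return the percentage of an interface's links that are up."""
    if not ports_up:
        # Assumption: report 0 when no ports are known for the interface.
        return 0.0
    return 100.0 * sum(ports_up) / len(ports_up)
```

For a lag pair, both links up gives 100, one up gives 50 and none up
gives 0, matching the three cases listed.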

The plugin documents all of this in its header.

This update also:

1. Adds the new lmond process to syslog-ng config file.
2. Adds the new lmond process to the mtce patch script.
3. Lowers the cpu, df and memory threshold settings by 1.
   rmon thresholds were precise, whereas collectd requires
   that samples cross a threshold, not just meet it.
   For example, to trigger an action at 90% usage the
   threshold needs to be 89.
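
The threshold adjustment in item 3 can be illustrated with a small
sketch. It assumes, as stated above, that collectd acts only when a
sample is strictly greater than the threshold. The helper names are
hypothetical:

```python
def crosses(sample: float, threshold: float) -> bool:
    """True only when the sample exceeds the threshold; a sample that
    merely equals the threshold does not count as crossing it."""
    return sample > threshold


def collectd_threshold(rmon_threshold: int) -> int:
    """Convert an rmon style 'act at N%' setting to a collectd threshold
    by lowering it by 1, as this update does."""
    return rmon_threshold - 1


# A sample of exactly 90 does not cross a threshold of 90 ...
assert not crosses(90, 90)
# ... but it does cross the adjusted threshold of 89.
assert crosses(90, collectd_threshold(90))
```

Note that with the lowered threshold a fractional sample such as 89.5
would also cross it, so the adjustment matches the rmon behaviour
exactly only for whole-number samples.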

Test Plan: (WIP but almost complete)

PASS: Verify interface plugin startup
PASS: Verify interface plugin logging
PASS: Verify interface plugin Link Status Query and response handling
PASS: Verify monitor, sample storage and grafana display
PASS: Verify port and interface alarm matches what rmon produced
PASS: Verify lmon port config from manifest configured plugin
PASS: Verify lmon port config from lmon.conf
PASS: Verify single interface failure handling and recovery
PASS: Verify lagged interface failure handling and recovery
PASS: Verify link loss of lagged interface shared between mgmt and oam (hp380)
PASS: Verify network interface failure handling ; single port
PASS: Verify network interface degrade handling ; lagged interface
PEND: Verify network interface degrade handling ; vlan interface
PASS: Verify HTTP request timeout period and handling
PASS: Verify link status query failure handling - invalid uri (timeout)
PASS: Verify link status query failure handling - missing uri (timeout)
PASS: Verify link status query failure handling - status fail
PASS: Verify link status query failure handling - bad json resp

Change-Id: I2e2dfe6ddfa06a46770245540c7153d330bdf196
Story: 2002823
Task: 28635
Depends-On: https://review.openstack.org/#/c/633264
Signed-off-by: Eric MacDonald <eric.macdonald@windriver.com>
2019-02-06 14:18:14 -05:00
base Scope parameter in ifup-aliases network script 2019-01-07 19:09:53 +00:00
ceph Resolve AIO-SX shutdown hang with CEPH ordering hooks 2019-02-05 17:59:09 -05:00
config Add StarlingX specific restart command for Ceph monitors 2019-01-16 17:05:57 +02:00
config-files Add network interface monitoring plugin to collectd 2019-02-06 14:18:14 -05:00
database build mariadb docker image with galera arbitrator added 2019-01-23 17:11:45 -06:00
devstack Add DevStack plugin 2018-11-01 23:59:31 -05:00
doc Clean up and standardize landing pages 2018-12-27 17:06:20 -08:00
filesystem Remove last patch of iscsi-initiator-utils 2018-12-12 23:09:56 +08:00
grub de-fuzz grub2 patches 2018-09-28 08:10:39 +08:00
kernel Merge "rebase e1000e driver patch from 3.4.1.1 to 3.4.2.1 version" 2019-01-02 06:24:22 +00:00
kubernetes Uprev kubernetes to 1.12.3 2019-01-22 09:39:50 -06:00
ldap Remove hardcoded configure options for openldap 2018-12-04 08:21:31 +00:00
logging Fix: "import" issue for Python 2/3 compatible code 2018-12-26 09:19:59 +08:00
monitoring Add network interface monitoring plugin to collectd 2019-02-06 14:18:14 -05:00
networking Change compute node to worker node personality 2018-12-12 15:09:04 -05:00
python Fix fuzzy issue introduced by Change 597433 2018-11-06 01:56:04 +08:00
releasenotes Update config 2019-01-30 16:40:20 -08:00
security Add low latency per-cpu power management 2019-02-01 14:27:10 -05:00
storage-drivers Add hooks for python wheel generation 2018-10-23 10:35:32 -04:00
tools Merge "change 'compute' to 'worker' in collect utils" 2019-01-10 15:12:24 +00:00
utilities Add network interface monitoring plugin to collectd 2019-02-06 14:18:14 -05:00
virt Add low latency per-cpu power management 2019-02-01 14:27:10 -05:00
.gitignore Add Docker Registry Token Server 2019-01-08 11:42:04 -05:00
.gitreview correct gitreview for master 2018-09-20 23:20:14 +08:00
.zuul.yaml Adding a pylint tox and zuul job for stx-integ 2019-01-08 11:14:46 -06:00
CONTRIBUTORS.wrs StarlingX open source release updates 2018-05-31 07:36:35 -07:00
LICENSE StarlingX open source release updates 2018-05-31 07:36:35 -07:00
README.rst StarlingX open source release updates 2018-05-31 07:36:35 -07:00
centos_guest_image.inc Split image.inc across git repos 2018-08-15 16:45:36 -04:00
centos_guest_image_rt.inc Split image.inc across git repos 2018-08-15 16:45:36 -04:00
centos_iso_image.inc Remove Docker Registry Token Server From Build 2019-01-08 15:24:27 -05:00
centos_master_docker_images.inc build mariadb docker image with galera arbitrator added 2019-01-23 17:11:45 -06:00
centos_pike_docker_images.inc build mariadb docker image with galera arbitrator added 2019-01-23 17:11:45 -06:00
centos_pike_wheels.inc Rename centos_wheels.inc to centos_pike_wheels.inc 2018-11-14 15:09:05 -06:00
centos_pkg_dirs Build registry-token-server without dep 2019-01-18 09:36:15 -05:00
centos_pkg_dirs_installer Relocate anaconda to stx-integ/base/anaconda 2018-08-01 15:37:12 -04:00
centos_pkg_dirs_rt Relocate drbd to stx-integ/kernel/kernel-modules/drbd 2018-08-01 15:31:44 -04:00
pylint.rc Adding a pylint tox and zuul job for stx-integ 2019-01-08 11:14:46 -06:00
test-requirements.txt Add default test framework 2018-06-11 13:45:22 -05:00
tox.ini Adding a pylint tox and zuul job for stx-integ 2019-01-08 11:14:46 -06:00

README.rst

stx-integ

StarlingX Integration