config/sysinv/sysinv/sysinv/sysinv
Jim Gauld acefd544f0 Mitigate memory leak of sessions by disabling sudo for sriov agent
The sriov agent was polling devices via 'sudo ip link show',
and this resulted in a severe memory leak. The usage of 'sudo'
uses the host 'dbus-daemon', and somewhere the host does not
clean up login sessions.

Symptoms:
- gradual run out of memory until system unstable, host spontaneous
  reboot due to delay or OOM
- huge growth of kernel slab
- thousands of /sys/fs/cgroup/systemd/user.slice/user-0.slice
  session-x*.scope files with empty 'tasks', i.e., sessions
  that should have deleted
- huge latency seen with ssh and various systemd commands

The problem is mitigated by disabling 'sudo' for sriov agent, using
a helm override that configures [agent]/root_helper='' .

Testing:
- Verified that we could launch a VM with SR-IOV interface;
  VFs were able to set MAC and VLAN attributes.

Closes-Bug: 1815106

Change-Id: I0c57629c01b7407c99cc7f38b409019ab87af859
Signed-off-by: Jim Gauld <james.gauld@windriver.com>
2019-02-14 15:42:07 -05:00
..
agent Remove nova storage aggregates 2019-01-25 15:38:43 -05:00
api Merge "Store BMC password in Openstack Barbican instead of keyring." 2019-02-13 15:54:41 +00:00
cluster Fix: "import" issue for Python 2/3 compatible code 2018-12-25 08:58:03 +08:00
cmd Fix: others issues for Python 2/3 compatible code 2018-12-19 10:20:56 +08:00
common Update nova helm overrides for PCI alias, passthrough, and SR-IOV 2019-02-13 14:00:51 -05:00
conductor Store BMC password in Openstack Barbican instead of keyring. 2019-02-12 10:12:54 -05:00
db Enable python3.5 sysinv unit test 2019-01-30 08:51:07 +08:00
helm Mitigate memory leak of sessions by disabling sudo for sriov agent 2019-02-14 15:42:07 -05:00
objects Remove un-necessary exception log 2019-02-11 09:20:50 +08:00
openstack Enable python3.5 sysinv unit test 2019-01-30 08:51:07 +08:00
puppet Update nova helm overrides for PCI alias, passthrough, and SR-IOV 2019-02-13 14:00:51 -05:00
tests Merge "Ceph for standard: Add deployment model to System Inventory cluster API" 2019-02-11 16:59:47 +00:00
__init__.py StarlingX open source release updates 2018-05-31 07:35:52 -07:00
netconf.py StarlingX open source release updates 2018-05-31 07:35:52 -07:00
sanity_coverage.py Sysinv tox updates. Prepare for bandit reports and test reports 2018-06-29 13:25:09 -04:00
version.py StarlingX open source release updates 2018-05-31 07:35:52 -07:00