提交 · eaf2c9f89107b9f60cf8db2c919f78b987ff7177 · openeuler / libvirt

25 7月, 2017 1 次提交

Move machineName generation from virsystemd into domain_conf · eaf2c9f8

由 Martin Kletzander 提交于 7月 21, 2017

It is more related to a domain as we might use it even when there is
no systemd and it does not use any dbus/systemd functions. In order
not to use code from conf/ in util/ pass machineName in cgroups code
as a parameter. That also fixes a leak of machineName in the lxc
driver and cleans up and de-duplicates some code.
Signed-off-by: NMartin Kletzander <mkletzan@redhat.com>

eaf2c9f8

09 1月, 2017 1 次提交

lxc: ensure libvirt_lxc and qemu-nbd move into systemd machine slice · 44f79a0b

由 Daniel P. Berrange 提交于 1月 05, 2017

Currently when spawning containers with systemd, the container PID 1
will get moved into the systemd machine slice. Libvirt then manually
moves the libvirt_lxc and qemu-nbd processes into the cgroups associated
with the slice, but skips the systemd controller cgroup. This means that
from systemd's POV, libvirt_lxc and qemu-nbd are still part of the
libvirtd.service unit.

On systemctl daemon-reload, it will notice that libvirt_lxc & qemu-nbd
are in the libvirtd.service unit for the systemd controller, but in the
machine cgroups for resources. Systemd will thus move them back into
the libvirtd.service resource cgroups next time libvirtd is restarted.
This causes libvirtd to kill off the container due to incorrect cgroup
placement.

The solution is to ensure that when moving libvirt_lxc & qemu-nbd, we
also move the systemd cgroup controller placement. Normally this is
not something we ever want todo, but this is a special case as we are
intentionally wanting to move them to a different systemd unit.
Signed-off-by: NDaniel P. Berrange <berrange@redhat.com>

44f79a0b

25 8月, 2016 1 次提交
- P
  
  util: Extract and rename qemuDomainDelCgroupForThread to virCgroupDelThread · c84c2cb3
  由 Peter Krempa 提交于 8月 04, 2016
  
  c84c2cb3
01 3月, 2016 1 次提交

qemu_cgroup: use virCgroupAddTask instead of virCgroupMoveTask · ff16bde1

由 Henning Schild 提交于 2月 26, 2016

qemuProcessSetupEmulator runs at a point in time where there is only
the qemu main thread. Use virCgroupAddTask to put just that one task
into the emulator cgroup. That patch makes virCgroupMoveTask and
virCgroupAddTaskStrController obsolete.
Signed-off-by: NHenning Schild <henning.schild@siemens.com>

ff16bde1

17 2月, 2016 2 次提交

util: cgroup: Allow ignoring EACCES in virCgroup(Allow|Deny)DevicePath · cf113e8d

由 Peter Krempa 提交于 2月 16, 2016

When adding disk images to ACL we may call those functions on NFS
shares. In that case we might get an EACCES, which isn't really relevant
since NFS would not hold a block device. This patch adds a flag that
allows to stop reporting an error on EACCES to avoid spaming logs.

Currently there's no functional change.

cf113e8d

util: cgroup: Drop virCgroup(Allow|Deny)DeviceMajor · 9cd5da71

由 Peter Krempa 提交于 2月 16, 2016

Since commit 47e5b5ae virCgroupAllowDevice allows to pass -1 as either
the minor or major device number and it automatically uses '*' in place
of that. Reuse the new approach through the code and drop the duplicated
functions.

9cd5da71

08 2月, 2016 1 次提交
- P
  cgroup: Prepare for sparse vCPU topologies in virCgroupGetPercpuStats · 7938b533
  由 Peter Krempa 提交于 12月 14, 2015
```
Pass a bitmap of enabled guest vCPUs to virCgroupGetPercpuStats so that
non-continuous vCPU topologies can be used.
```
  7938b533
06 2月, 2016 1 次提交

util: Fix virCgroupNewMachine ATTRIBUTE_NONNULL args · b8c0f186

由 John Ferlan 提交于 2月 06, 2016

Commit id 'c3bd0019' removed arg3, but forgot to adjust the numbers
for NONNULL - caused build failure for coverity

b8c0f186

05 2月, 2016 1 次提交

systemd: Modernize machine naming · c3bd0019

由 Martin Kletzander 提交于 2月 01, 2016

So, systemd-machined has this philosophy that machine names are like
hostnames and hence should follow the same rules. But we always allowed
international characters in domain names. Thus we need to modify the
machine name we are passing to systemd.

In order to change some machine names that we will be passing to systemd,
we also need to call TerminateMachine at the end of a lifetime of a
domain. Even for domains that were started with older libvirt. That
can be achieved thanks to virSystemdGetMachineNameByPID(). And because
we can change machine names, we can get rid of the inconsistent and
pointless escaping of domain names when creating machine names.

So this patch modifies the naming in the following way. It creates the
name as <drivername>-<id>-<name> where invalid hostname characters are
stripped out of the name and if the resulting name is longer, it
truncates it to 64 characters. That way we can start domains we
couldn't start before. Well, at least on systemd.

To make it work all together, the machineName (which is needed only with
systemd) is saved in domain's private data. That way the generation is
moved to the driver and we don't need to pass various unnecessary
arguments to cgroup functions.

The only thing this complicates a bit is the scope generation when
validating a cgroup where we must check both old and new naming, so a
slight modification was needed there.

Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1282846Signed-off-by: NMartin Kletzander <mkletzan@redhat.com>

c3bd0019

27 1月, 2016 1 次提交

lxc: don't try to hide parent cgroups inside container · dc576025

由 Daniel P. Berrange 提交于 1月 22, 2016

On the host when we start a container, it will be
placed in a cgroup path of

   /machine.slice/machine-lxc\x2ddemo.scope

under /sys/fs/cgroup/*

Inside the containers' namespace we need to setup
/sys/fs/cgroup mounts, and currently will bind
mount /machine.slice/machine-lxc\x2ddemo.scope on
the host to appear as / in the container.

While this may sound nice, it confuses applications
dealing with cgroups, because /proc/$PID/cgroup
now does not match the directory in /sys/fs/cgroup

This particularly causes problems for systems and
will make it create repeated path components in
the cgroup for apps run in the container eg

  /machine.slice/machine-lxc\x2ddemo.scope/machine.slice/machine-lxc\x2ddemo.scope/user.slice/user-0.slice/session-61.scope

This also causes any systemd service that uses
sd-notify to fail to start, because when systemd
receives the notification it won't be able to
identify the corresponding unit it came from.
In particular this break rabbitmq-server startup

Future kernels will provide proper cgroup namespacing
which will handle this problem, but until that time
we should not try to play games with hiding parent
cgroups.
Signed-off-by: NDaniel P. Berrange <berrange@redhat.com>

dc576025

19 8月, 2015 1 次提交

util: Add getters for cgroup block device I/O throttling · 89c509a0

由 Martin Kletzander 提交于 8月 03, 2015

Since now they were not needed, but I sense they will be in a short
while.
Signed-off-by: NMartin Kletzander <mkletzan@redhat.com>

89c509a0

22 7月, 2015 1 次提交

cgroup: Drop resource partition from virSystemdMakeScopeName · 88f6c007

由 Peter Krempa 提交于 7月 16, 2015

The scope name, even according to our docs is
"machine-$DRIVER\x2d$VMNAME.scope" virSystemdMakeScopeName would use the
resource partition name instead of "machine-" if it was specified thus
creating invalid scope paths.

This makes libvirt drop cgroups for a VM that uses custom resource
partition upon reconnecting since the detected scope name would not
match the expected name generated by virSystemdMakeScopeName.

The error is exposed by the following log entry:

debug : virCgroupValidateMachineGroup:302 : Name 'machine-qemu\x2dtestvm.scope' for controller 'cpu' does not match 'testvm', 'testvm.libvirt-qemu' or 'machine-test-qemu\x2dtestvm.scope'

for a "/machine/test" resource and "testvm" vm.

Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1238570

88f6c007

10 4月, 2015 2 次提交

cgroup: Use virCgroupNewThread · 0456eda3

由 John Ferlan 提交于 4月 07, 2015

Replace the virCgroupNew{Vcpu|Emulator|IOThread} calls with the common
virCgroupNewThread API
Signed-off-by: NJohn Ferlan <jferlan@redhat.com>

0456eda3

cgroup: Introduce virCgroupNewThread · 2cd3a980

由 John Ferlan 提交于 4月 07, 2015

Create a new common API to replace the virCgroupNew{Vcpu|Emulator|IOThread}
API's using an emum to generate the cgroup name
Signed-off-by: NJohn Ferlan <jferlan@redhat.com>

2cd3a980

08 4月, 2015 1 次提交

vircgroup: Introduce virCgroupControllerAvailable · d65acbde

由 Michal Privoznik 提交于 3月 31, 2015

This new internal API checks if given CGroup controller is
available.  It is going to be needed later when we need to make a
decision whether pin domain memory onto NUMA nodes using cpuset
CGroup controller or using numa_set_membind().
Signed-off-by: NMichal Privoznik <mprivozn@redhat.com>

d65acbde

30 3月, 2015 1 次提交

virCgroupController: Check the enum fits into 'int' · 771e6e5a

由 Michal Privoznik 提交于 3月 27, 2015

Throughout our code, the virCgroupController enum is used in two ways.
First as an index to an array of cgroup controllers:

struct virCgroup {
    char *path;

    struct virCgroupController controllers[VIR_CGROUP_CONTROLLER_LAST];
};

Second way is that when calling virCgroupNew() a bitmask of the enum
items can be passed to selectively detect only some controllers. For
instance:

int
virCgroupNewVcpu(virCgroupPtr domain,
                 int vcpuid,
                 bool create,
                 virCgroupPtr *group)
{
    ...
    controllers = ((1 << VIR_CGROUP_CONTROLLER_CPU) |
                   (1 << VIR_CGROUP_CONTROLLER_CPUACCT) |
                   (1 << VIR_CGROUP_CONTROLLER_CPUSET));

    if (virCgroupNew(-1, name, domain, controllers, group) < 0)
        goto cleanup;
}

Even though it's highly unlikely that so many new controllers will be
invented so that we would overflow when constructing the bitmask, it
doesn't hurt to check at compile time either.
Signed-off-by: NMichal Privoznik <mprivozn@redhat.com>

771e6e5a

20 3月, 2015 1 次提交
- M
  cgroup: Add accessors for cpuset.memory_migrate · ba1dfc5b
  由 Martin Kletzander 提交于 3月 11, 2015
```
Signed-off-by: NMartin Kletzander <mkletzan@redhat.com>
```
  ba1dfc5b
15 1月, 2015 1 次提交

Add support for systemd-machined CreateMachineWithNetwork · 318df5a0

由 Daniel P. Berrange 提交于 11月 11, 2014

systemd-machined introduced a new method CreateMachineWithNetwork
that obsoletes CreateMachine. It expects to be given a list of
VETH/TAP device indexes for the host side device(s) associated
with a container/machine.

This falls back to the old CreateMachine method when the new
one is not supported.

318df5a0

16 12月, 2014 1 次提交

util: Add function virCgroupHasEmptyTasks · 1a80b97d

由 Martin Kletzander 提交于 12月 13, 2014

That function helps checking whether there's a task in that cgroup.
Signed-off-by: NMartin Kletzander <mkletzan@redhat.com>

1a80b97d

02 10月, 2014 1 次提交

qemu: use systemd's TerminateMachine to kill all processes · 4882618e

由 Guido Günther 提交于 9月 25, 2014

If we don't properly clean up all processes in the
machine-<vmname>.scope systemd won't remove the cgroup and subsequent vm
starts fail with

  'CreateMachine: File exists'

Additional processes can e.g. be added via

  echo $PID > /sys/fs/cgroup/systemd/machine.slice/machine-${VMNAME}.scope/tasks

but there are other cases like

  http://bugs.debian.org/761521

Invoke TerminateMachine to be on the safe side since systemd tracks the
cgroup anyway. This is a noop if all processes have terminated already.

4882618e

16 9月, 2014 1 次提交

vircgroup: Introduce virCgroupNewIOThread · 3abb95ca

由 John Ferlan 提交于 9月 03, 2014

Add virCgroupNewIOThread() to mimic virCgroupNewVcpu() except the naming
scheme with use "iothread" rather than "vcpu".

3abb95ca

23 7月, 2014 1 次提交

lxc: allow to keep or drop capabilities · 47e5b5ae

由 Cédric Bosdonnat 提交于 7月 18, 2014

Added <capabilities> in the <features> section of LXC domains
configuration. This section can contain elements named after the
capabilities like:

  <mknod state="on"/>, keep CAP_MKNOD capability
  <sys_chroot state="off"/> drop CAP_SYS_CHROOT capability

Users can restrict or give more capabilities than the default using
this mechanism.

47e5b5ae

08 7月, 2014 1 次提交

util: cgroup: Add helper to convert device mode to string · a48f4451

由 Peter Krempa 提交于 6月 20, 2014

Cgroups code uses VIR_CGROUP_DEVICE_* flags to specify the mode but in
the end it needs to be converted to a string. Add a helper to do it and
use it in the cgroup code before introducing it into the rest of the
code.

a48f4451

09 4月, 2014 1 次提交

Extend virCgroupGetPercpuStats to fill in vcputime too · 897808e7

由 Ján Tomko 提交于 4月 03, 2014

Currently, virCgroupGetPercpuStats is only used by the LXC driver,
filling out the CPUTIME stats. qemuDomainGetPercpuStats does this
and also filles out VCPUTIME stats.

Extend virCgroupGetPercpuStats to also report VCPUTIME stats if
nvcpupids is non-zero. In the LXC driver, we don't have cpupids.
In the QEMU driver, there is at least one cpupid for a running domain,
so the behavior shouldn't change for QEMU either.

Also rename getSumVcpuPercpuStats to virCgroupGetPercpuVcpuSum.

897808e7

24 2月, 2014 1 次提交

Ensure systemd cgroup ownership is delegated to container with userns · 6fb42d7c

由 Richard Weinberger 提交于 2月 24, 2014

This function is needed for user namespaces, where we need to chmod()
the cgroup to the initial uid/gid such that systemd is allowed to
use the cgroup.
Signed-off-by: NRichard Weinberger <richard@nod.at>
Signed-off-by: NDaniel P. Berrange <berrange@redhat.com>

6fb42d7c

20 2月, 2014 3 次提交
- T
  
  Implement domainGetCPUStats for lxc driver. · 4b3b2f6c
  由 Thorsten Behrens 提交于 2月 14, 2014
  
  4b3b2f6c
- T
  Make qemuGetDomainTotalCPUStats a virCgroup function. · 65158899
  由 Thorsten Behrens 提交于 2月 14, 2014
```
To reuse this from other drivers, like lxc.
```
  65158899
- T
  Add util virCgroupGetBlkioIo*Serviced methods. · a2bb187c
  由 Thorsten Behrens 提交于 2月 14, 2014
```
This reads blkio stats from blkio.throttle.io_service_bytes and
blkio.throttle.io_serviced.
```
  a2bb187c
20 1月, 2014 1 次提交

blkio: Setting throttle blkio cgroup for domain · 3b431929

由 Gao feng 提交于 12月 11, 2013

This patch introduces virCgroupSetBlkioDeviceReadIops,
virCgroupSetBlkioDeviceWriteIops,
virCgroupSetBlkioDeviceReadBps and
virCgroupSetBlkioDeviceWriteBps,

we can use these interfaces to set up throttle
blkio cgroup for domain.

This patch also adds the new throttle blkio cgroup
elements to the test xml.
Signed-off-by: NGuan Qiang <hzguanqiang@corp.netease.com>
Signed-off-by: NGao feng <gaofeng@cn.fujitsu.com>

3b431929

16 9月, 2013 1 次提交

cgroup: Move [qemu|lxc]GetCpuBWStatus to vicgroup.c and refactor it · d79fe8b5

由 Peter Krempa 提交于 9月 13, 2013

The function existed in two identical instances in lxc and qemu. Move it
to vircgroup.c and simplify it. Refactor the callers too.

d79fe8b5

13 8月, 2013 1 次提交

cgroup: functional sort · 2ff9e54c

由 Eric Blake 提交于 8月 12, 2013

Make future patches smaller by matching a sane header listing in
the first place.  No semantic change.

* src/util/vircgroup.h: Move free next to new, and controller
functions next to each other.
* src/util/vircgroup.c (virCgroupFree, virCgroupHasController)
(virCgroupPathOfController, virCgroupRemoveRecursively)
(virCgroupRemove): Sort implementation to be closer to header.
Signed-off-by: NEric Blake <eblake@redhat.com>

2ff9e54c

01 8月, 2013 2 次提交

Enable support for systemd-machined in cgroups creation · 2fe24701

由 Daniel P. Berrange 提交于 7月 22, 2013

Make the virCgroupNewMachine method try to use systemd-machined
first. If that fails, then fallback to using the traditional
cgroup setup code path.
Signed-off-by: NDaniel P. Berrange <berrange@redhat.com>

2fe24701

Add support for systemd cgroup mount · aedd46e7

由 Daniel P. Berrange 提交于 7月 25, 2013

Systemd uses a named cgroup mount for tracking processes. Add
it as another type of controller, albeit one which we have to
special case in a number of places. In particular we must
never create/delete directories there, nor add tasks. Essentially
the systemd mount is to be considered read-only for libvirt.

With this change both the virCgroupDetectPlacement and
virCgroupCopyPlacement methods must be invoked. The copy
placement method will copy setup for resource controllers
only. The detect placement method will probe for any
named controllers, or resource controllers not already
setup.
Signed-off-by: NDaniel P. Berrange <berrange@redhat.com>

aedd46e7

26 7月, 2013 3 次提交

Add 'controllers' arg to virCgroupNewDetect · 5ec5a224

由 Daniel P. Berrange 提交于 7月 24, 2013

When detecting cgroups we must honour any controllers
whitelist the driver may have.
Signed-off-by: NDaniel P. Berrange <berrange@redhat.com>

5ec5a224

Make virCgroupIsValidMachine static · 525c9d5a

由 Daniel P. Berrange 提交于 7月 24, 2013

The virCgroupIsValidMachine does not need to be called from
outside the cgroups file now, so make it static.
Signed-off-by: NDaniel P. Berrange <berrange@redhat.com>

525c9d5a

Introduce a more convenient virCgroupNewDetectMachine · a45b99ea

由 Daniel P. Berrange 提交于 7月 24, 2013

Instead of requiring drivers to use a combination of calls
to virCgroupNewDetect and virCgroupIsValidMachine, combine
the two into virCgroupNewDetectMachine
Signed-off-by: NDaniel P. Berrange <berrange@redhat.com>

a45b99ea

25 7月, 2013 1 次提交

New cgroups API for atomically creating machine cgroups · b333330a

由 Daniel P. Berrange 提交于 7月 18, 2013

Instead of requiring one API call to create a cgroup and
another to add a task to it, introduce a new API
virCgroupNewMachine which does both jobs at once. This
will facilitate the later code to talk to systemd to
achieve this job which is also atomic.
Signed-off-by: NDaniel P. Berrange <berrange@redhat.com>

b333330a

24 7月, 2013 3 次提交

Remove obsolete cgroups creation apis · d64e852b

由 Daniel P. Berrange 提交于 7月 22, 2013

The virCgroupNewDomainDriver and virCgroupNewDriver methods
are obsolete now that we can auto-detect existing cgroup
placement. Delete them to reduce code bloat.
Signed-off-by: NDaniel P. Berrange <berrange@redhat.com>

d64e852b

Add API for checking if a cgroup is valid for a domain · e638778e

由 Daniel P. Berrange 提交于 7月 23, 2013

Add virCgroupIsValidMachine API to check whether an auto
detected cgroup is valid for a machine. This lets us
check if a VM has just been placed into some generic
shared cgroup, or worse, the root cgroup
Signed-off-by: NDaniel P. Berrange <berrange@redhat.com>

e638778e

Add a virCgroupNewDetect API for finding cgroup placement · 66a7f857

由 Daniel P. Berrange 提交于 7月 19, 2013

Add a virCgroupNewDetect API which is used to initialize a
cgroup object with the placement of an arbitrary process.
Signed-off-by: NDaniel P. Berrange <berrange@redhat.com>

66a7f857