提交 · 7ba5defb5a34bb82de2f16467c2b8d157cd14d2d · openeuler / libvirt

15 10月, 2012 18 次提交

Add support for SUSPEND_DISK event · 7ba5defb

由 Martin Kletzander 提交于 10月 12, 2012

This patch adds support for SUSPEND_DISK event; both lifecycle and
separated.  The support is added for QEMU, machines are changed to
PMSUSPENDED, but as QEMU sends SHUTDOWN afterwards, the state changes
to shut-off.  This and much more needs to be done in order for libvirt
to work with transient devices, wake-ups etc.  This patch is not
aiming for that functionality.

7ba5defb

util: switch virLogEatParams to virLogSource · a9e3b4f7

由 Ján Tomko 提交于 10月 15, 2012

Commit e8fd8757 changed 'const char *'
category to virLogSource enum. This changes it in virLogEatParams as
well, thus fixing the build with --disable-debug.
--
Hopefully moving the enum declarations is less ugly than using int.

a9e3b4f7

node_memory: Add new parameter field to tune the new sysfs knob · f81f0f2f

由 Osier Yang 提交于 10月 12, 2012

Upstream kernel introduced new sysfs knob "merge_across_nodes" to
specify if pages from different numa nodes can be merged. When set
to 0, only pages which physically reside in the memory area of
same NUMA node can be merged. When set to 1, pages from all nodes
can be merged.

This patch supports the tuning by adding new param field
"shm_merge_across_nodes".

f81f0f2f

qemu: reorganize qemuDomainChangeNet and qemuDomainChangeNetBridge · 6bde0a1a

由 Laine Stump 提交于 10月 10, 2012

This patch resolves:

  https://bugzilla.redhat.com/show_bug.cgi?id=805071

to the extent that it can be resolved with current qemu functionality.
It attempts to detect as many situations as possible when the simple
operation of disconnecting an existing tap device from one bridge and
attaching it to another will satisfy the change requested in
virDomainUpdateDeviceFlags() for a network device. Before this patch,
that situation could only be detected if the pre-change interface
*and* the post-change interface definition were both "type='bridge'".
After this patch, it can also be detected if the before or after
interfaces are any combination of type='bridge' and type='network'
(the networks can be <forward mode='nat|route|bridge'>, as long as
they use a Linux host bridge and not macvtap connections).

This extra effort is especially useful since the recent discovery that
a netdev_del+netdev_add combo (to reconnect the network device with
completely different hostside configuration) doesn't work properly
with current qemu (1.2) unless it is accompanied by the matching
device_del+device_add - see this mailing list message for details:

  http://lists.nongnu.org/archive/html/qemu-devel/2012-10/msg02355.html

(A slight modification of the patch referenced there has been prepared
to apply on top of this patch, but won't be pushed until qemu can be
made to work with it.)

* qemuDomainChangeNet needs access to the virDomainDeviceDef that
holds the new netdef (so that it can clear out the virDomainDeviceDef
if it ends up using the NetDef to replace the original), so the
virDomainNetDefPtr arg is replaced with a virDomainDeviceDefPtr.

* qemuDomainChangeNet previously checked for *some* changes to the
interface config, but this check was by no means complete. It was also
a bit disorganized.

This refactoring of the code is (I believe) complete in its check of
all NetDef attributes that might be changed, and either returns a
failure (for changes that are simply impossible), or sets one of three
flags:

  needLinkStateChange - if the device link state needs to go up/down
  needBridgeChange    - if everything else is the same, but it needs
                        to be connected to a difference linux host
                        bridge
  needReconnect       - if the entire host side of the device needs
                        to be torn down and reconstructed (currently
                        non-working, as mentioned above)

Note that this function will refuse to make any change that requires
the *guest* side of the device to be detached (e.g. changing the PCI
address or mac address). Those would be disruptive enough to the guest
that it's reasonable to require an explicit detach/attach sequence
from the management application.

* As mentioned above, qemuDomainChangeNet also does its best to
understand when a simple change in attached bridge for the existing
tap device will work vs. the need to completely tear down/reconstruct
the host side of the device (including tap device).

This patch *does not* implement the "reconnect" code anyway - there is
a placeholder that turns that into an error. Rather, the purpose of
this patch is to replicate existing behavior with code that is ready
to have that functionality plugged in in a later patch.

* The expanded uses for qemuDomainChangeNetBridge meant that it needed
to be enhanced as well - it no longer replaces the original brname
string in olddev with the new brname; instead, it relies on the
caller to replace the *entire* olddev with newdev (since we've gone
to great lengths to assure they are functionally identical other
than the name of the bridge, this is now not only safe, but more
correct). Additionally, qemuDomainNetChangeBridge can now set the
bridge for type='network' interfaces as well as plain type='bridge'
interfaces. (Note that I had to make this change simultaneous to the
reorganization of qemuDomainChangeNet because the two are too
closely intertwined to separate).

6bde0a1a

Avoid straying </cpuset> · dc9d7a17

由 Guido Günther 提交于 10月 15, 2012

by using the same condition as for the <cpuset>.

Fixes "make check" found by
    http://honk.sigxcpu.org:8001/job/libvirt-check/160/

dc9d7a17

conf: virDomainDeviceInfoCopy utility function · 11c47d97

由 Laine Stump 提交于 10月 11, 2012

This does a shallow copy of all the bits, then strdups the two items
that are actually allocated separately.

11c47d97

conf: fix virDevicePCIAddressEqual args · 31094559

由 Laine Stump 提交于 10月 11, 2012

This function really should have been taking virDevicePCIAddress*
instead of the inefficient virDevicePCIAddress (results in copying two
entire structs onto the stack rather than just two pointers), and
returning a bool true/false (not matching is not necessarily a
"failure", as a -1 return would imply, and also using "if
(!virDevicePCIAddressEqual(x, y))" to mean "if x == y" is just a bit
counterintuitive).

31094559

Fix tab vs space · a2b80edb

由 Guido Günther 提交于 10月 15, 2012

that broke "make syntax-check"

found by http://honk.sigxcpu.org:8001/job/libvirt-syntax-check/157/

Pushed under the build breaker rule.

a2b80edb

O
qemu: Ignore def->cpumask if emulatorpin is specified · 3635b41e
由 Osier Yang 提交于 10月 12, 2012
```
If the vcpu placement is "static", it's just fine to ignore the
def->cpumask if emulatorpin is specified.
```
3635b41e

conf: Ignore emulatorpin if vcpu placement is auto · 5378effd

由 Osier Yang 提交于 10月 12, 2012

When vcpu placement is "auto", the domain process will be pinned
to advisory nodeset from querying numad, While emulatorpin will
override the pinning. That means both of them are to set the
pinning policy for domain process, but conflicts with each other.

This patch ingore emulatorpin if vcpu placement is "auto", because
<vcpu> placement can't be simply ignored for <numatune> placement
could default to it.

5378effd

qemu: Initialize cpuset for hotplugged vcpu as def->cpuset · 0df1a790

由 Osier Yang 提交于 10月 12, 2012

The onlined vcpu pinning policy should inherit def->cpuset if
it's not specified explicitly, and the affinity should be set
in this case. Oppositely, the offlined vcpu pinning policy should
be free()'ed.

0df1a790

qemu: Create or remove cgroup when doing vcpu hotpluging · a9bfe887

由 Osier Yang 提交于 10月 12, 2012

Various APIs use cgroup to either set or get the statistics of
host or guest. Hotplug or hot unplug new vcpus without creating
or removing the cgroup for the vcpus could cause problems for
those APIs. E.g.

% virsh vcpucount dom
maximum      config        10
maximum      live          10
current      config         1
current      live           1

% virsh setvcpu dom 2

% virsh schedinfo dom --set vcpu_quota=1000
Scheduler      : posix
error: Unable to find vcpu cgroup for rhel6.2(vcpu: 1): No such file or
directory

This patch fixes the problem by creating cgroups for each of the
onlined vcpus, and destroying cgroups for each of the offlined
vcpus.

a9bfe887

conf: Initialize the pinning policy for vcpus · 10f8a45d

由 Osier Yang 提交于 10月 12, 2012

Document for <vcpu>'s "cpuset" says:

Since 0.4.4, this element can contain an optional cpuset attribute,
which is a comma-separated list of physical CPU numbers that virtual
CPUs can be pinned to.

However, it's not the truth, libvirt actually pins the domain
process to the specified pCPUs by "cpuset" of <vcpu>. And the
vcpu thread are pinned to all available pCPUs if no <vcpupin>
is specified for it.

This patch is to implement the codes to inherit <vcpu>'s "cpuset" for
vcpu that doesn't have <vcpupin> specified, and <vcpupin>
for these vcpu will be ignored when formating. Underlying
driver implementation will make sure the vcpu thread pinned
to correct pCPUs.

10f8a45d

conf: Ignore vcpupin for not onlined vcpus when parsing · 60b176c3

由 Osier Yang 提交于 10月 12, 2012

Setting pinning policy for vcpu which exceeds current vcpus number
just makes no sense, however, it could cause various problems, E.g.

<vcpu current='1'>4</vcpu>
<cputune>
  <vcpupin vcpuid='3' cpuset='4'/>
</cputune>

% virsh start linux
error: Failed to start domain linux
error: cannot set CPU affinity on process 32534: No such process

We must have some odd codes underlying which produces the
"on process 32534", but the point is why we not to prevent
earlier when parsing? Note that this is only one of the
problem it could cause.

This patch is to ignore the <vcpupin> for not onlined vcpus.

60b176c3

doc: Sort out the relationship between <vcpu>, <vcpupin>, and <emulatorpin> · f108944a

由 Osier Yang 提交于 10月 12, 2012

These 3 elements conflicts with each other in either the doc
or the underlying codes.

Current problems:

Problem 1:

The doc shouldn't simply say "These settings are superseded
by CPU tuning. " for element <vcpu>. As except the tuning, <vcpu>
allows to specify the current, maxmum vcpu number. Apart from that,
<vcpu> also allows to specify the placement as "auto", which binds
the domain process to the advisory nodeset from numad.

Problem 2:

Doc for <vcpu> says its "cpuset" specify the physical CPUs
that the vcpus can be pinned. But it's not the truth, as
actually it only pin domain process to the specified physical
CPUs. So either it's a document bug, or code bug.

Problem 3:

Doc for <vcpupin> says it supersed "cpuset" of <vcpu>, it's
not quite correct, as each <vcpupin> specify the pinning policy
only for one vcpu. How about the ones which doesn't have
<vcpupin> specified? it says the vcpu will be pinned to all
available physical CPUs, but what's the meaning of attribute
"cpuset" of <vcpu> then?

Problem 4:

Doc for <emulatorpin> says it pin the emulator threads (domain
process in other context, perhaps another follow up patch to
cleanup the inconsistency is needed) to the physical CPUs
specified its attribute "cpuset". Which conflicts with
<vcpu>'s "cpuset". And actually in the underlying codes,
it set the affinity for domain process twice if both
"cpuset" for <vcpu> and <emulatorpin> are specified,
and <emulatorpin>'s pinning will override <vcpu>'s.

Problem 5:

When "placement" of <vcpu> is "auto" (I.e. uses numad to
get the advisory nodeset to which the domain process is
pinned to), it will also be overridden by <emulatorpin>,

This patch is trying to sort out the conflicts or bugs by:

1) Don't say <vcpu> is superseded by <cputune>

2) Keep the semanteme for "cpuset" of <vcpu> (I.e. Still says it
   specify the physical CPUs the virtual CPUs). But modifying it
   to mention it also set the pinning policy for domain process,
   and the CPU placement of domain process specified by "cpuset"
   of <vcpu> will be ingored if <emulatorpin> specified, and
   similary, the CPU placement of vcpu thread will be ignored
   if it has <vcpupin> specified, for vcpu which doesn't have
   <vcpupin> specified, it inherits "cpuset" of <vcpu>.

3) Don't say <vcpu> is supersed by <vcpupin>. If neither <vcpupin>
   nor "cpuset" of <vcpu> is specified, the vcpu will be pinned
   to all available pCPUs.

4) If neither <emulatorpin> nor "cpuset" of <vcpu> is specified,
   the domain process (emulator threads in the context) will be
   pinned to all available pCPUs.

5) If "placement" of <vcpu> is "auto", <emulatorpin> is not allowed.

6) hotplugged vcpus will also inherit "cpuset" of <vcpu>

Codes changes according to above document changes:

1) Inherit def->cpumask for each vcpu which doesn't have <vcpupin>
   specified, during parsing.

2) ping the vcpu which doesn't have <vcpupin> specified to def->cpumask
   either by cgroup for sched_setaffinity(2), which is actually done
   by 1).

3) Error out if "placement" == "auto", and <emulatorpin> is specified.
   Otherwise, <emulatorpin> is honored, and "cpuset" of <cpuset> is
   ignored.

4) Setup cgroup for each hotplugged vcpu, and setup the pinning policy
   by either cgroup or sched_setaffinity(2).

5) Remove cgroup and <vcpupin> for each hot unplugged vcpu.

Patches are following (6 in total except this patch)

f108944a

Tweak comments in the policykit rules file · d04c53bc

由 Cole Robinson 提交于 10月 12, 2012

- Add the XML header so vim gives us syntax highlighting
- polkit-policy-file-validate hasn't existed for 3 years
- Permissions comment was not accurate

d04c53bc

C
Only keep one polkit rules file · e1019e9e
由 Cole Robinson 提交于 10月 12, 2012
```
Just tweak it at build time depending on what polkit version we are
building for.
```
e1019e9e
C

daemon: Use $(AM_V_GEN) in a few more places · 0801c149
由 Cole Robinson 提交于 10月 13, 2012

0801c149

13 10月, 2012 2 次提交

G
Properly parse (unsigned) long long · d78035d0
由 Guido Günther 提交于 10月 13, 2012
```
This fixes problems on platforms where sizeof(long) != sizeof(long long)
like ia32.
```
d78035d0

spec: Add support for libssh2 transport · 1e25c54f

由 Peter Krempa 提交于 10月 12, 2012

Libssh2 transport support was enabled lately but the spec file wasn't
updated to take this into account. This caused libvirt to be built
without libssh2 support in Red Hat based OSes.

1e25c54f

12 10月, 2012 5 次提交

selinux: Use raw contexts · 9674f2c6

由 Martin Kletzander 提交于 10月 05, 2012

We are currently able to work only with non-translated SELinux
contexts, but we are using functions that work with translated
contexts throughout the code.  This patch swaps all SELinux context
translation relative calls with their raw sisters to avoid parsing
problems.

The problems can be experienced with mcstrans for example.  The
difference is that if you have translations enabled (yum install
mcstrans; service mcstrans start), fgetfilecon_raw() will get you
something like 'system_u:object_r:virt_image_t:s0', whereas
fgetfilecon() will return 'system_u:object_r:virt_image_t:SystemLow'
that we cannot parse.

I was trying to confirm that the _raw variants were here since the dawn of
time, but the only thing I see now is that it was imported together in
the upstream repo [1] from svn, so before 2008.

Thanks Laurent Bigonville for finding this out.

[1] http://oss.tresys.com/git/selinux.git

9674f2c6

conf: Mark missing optional USB devices in domain XML · f95560b3

由 Jiri Denemark 提交于 10月 11, 2012

When startupPolicy set for a USB devices allows such device to be
missing, there was no way this could be detected from domain XML. With
this patch, libvirt emits a new missing='yes' attribute for such devices
when active domain XML is generated.

f95560b3

J

virsh: remove reference to migration in blockcopy · c0fab871
由 Ján Tomko 提交于 10月 11, 2012

c0fab871

virsh: block SIGINT while getting BlockJobInfo · 13fefaf3

由 Ján Tomko 提交于 10月 11, 2012

SIGINT hasn't been blocked, which could lead to losing it somewhere in
virDomainGetBlockJobInfo and not aborting the job.

13fefaf3

J

Various typos and misspellings · 149c87b4
由 Ján Tomko 提交于 10月 11, 2012

149c87b4

11 10月, 2012 15 次提交

qemu: Fix misleading comment for qemuDomainObjBeginJobWithDriver() · 36f7dbf4

由 Peter Krempa 提交于 9月 27, 2012

The comment stated that you may call qemuDomainObjBeginJobWithDriver
without passing qemud_driver to signal it's not locked.
qemuDomainObjBeginJobWithDriver still accesses the qemud_driver
structure and the lock singaling is done through a separate parameter.

36f7dbf4

qemu: Make save/restore with USB devices usable · bd1282d6

由 Jiri Denemark 提交于 10月 09, 2012

Save/restore with passed through USB devices currently only works if the
USB device can be found at the same USB address where it used to be
before saving a domain. This makes sense in case a user explicitly
configure the USB address in domain XML. However, if the device was
found automatically by vendor/product identification, we should try to
search for that device when restoring the domain and use any device we
find as long as there is only one available. In other words, the USB
device can now be removed and plugged again or the host can be rebooted
between saving and restoring the domain.

bd1282d6

Add MIGRATABLE flag for virDomainGetXMLDesc · 28f8dfdc

由 Jiri Denemark 提交于 10月 08, 2012

Using VIR_DOMAIN_XML_MIGRATABLE flag, one can request domain's XML
configuration that is suitable for migration or save/restore. Such XML
may contain extra run-time stuff internal to libvirt and some default
configuration may be removed for better compatibility of the XML with
older libvirt releases.

This flag may serve as an easy way to get the XML that can be passed
(after desired modifications) to APIs that accept custom XMLs, such as
virDomainMigrate{,ToURI}2 or virDomainSaveFlags.

28f8dfdc

J

qemu: Implement startupPolicy for USB passed through devices · edc9269a
由 Jiri Denemark 提交于 10月 04, 2012

edc9269a

qemu: Add option to treat missing USB devices as success · 059aff6b

由 Jiri Denemark 提交于 10月 03, 2012

All USB device lookup functions emit an error when they cannot find the
requested device. With this patch, their caller can choose if a missing
device is an error or normal condition.

059aff6b

qemu: Introduce qemuFindHostdevUSBDevice · 7bcc7278

由 Jiri Denemark 提交于 10月 03, 2012

The code which looks up a USB device specified by hostdev is duplicated
in two places. This patch creates a dedicated function that can be
called in both places.

7bcc7278

conf: Add support for startupPolicy for USB devices · e658daeb

由 Jiri Denemark 提交于 10月 02, 2012

USB devices can disappear without OS being mad about it, which makes
them ideal for startupPolicy. With this attribute, USB devices can be
configured to be mandatory (the default), requisite (will disappear
during migration if they cannot be found), or completely optional.

e658daeb

locking: Implement lock failure action in sanlock driver · 89364767

由 Jiri Denemark 提交于 9月 18, 2012

While the changes to sanlock driver should be stable, the actual
implementation of sanlock_helper is supposed to be replaced in the
future. However, before we can implement a better sanlock_helper, we
need an administrative interface to libvirtd so that the helper can just
pass a "leases lost" event to the particular libvirt driver and
everything else will be taken care of internally. This approach will
also allow libvirt to pass such event to applications and use
appropriate reasons when changing domain states.

The temporary implementation handles all actions directly by calling
appropriate libvirt APIs (which among other things means that it needs
to know the credentials required to connect to libvirtd).

89364767

J

locking: Add support for lock failure action · 297c704a
由 Jiri Denemark 提交于 9月 18, 2012

297c704a
J
locking: Pass hypervisor driver name when acquiring locks · d236f3fc
由 Jiri Denemark 提交于 9月 17, 2012
```
This is required in case a lock manager needs to contact libvirtd in
case of an unexpected event.
```
d236f3fc
J

locking: Add const char * parameter to avoid ugly typecasts · e55ff49c
由 Jiri Denemark 提交于 9月 17, 2012

e55ff49c

conf: Add on_lockfailure event configuration · 76f5bcab

由 Jiri Denemark 提交于 9月 06, 2012

Using this new element, one can configure an action that should be
performed when resource locks are lost.

76f5bcab

conf: Rename life cycle actions to event actions · d0ea530b

由 Jiri Denemark 提交于 9月 06, 2012

While current on_{poweroff,reboot,crash} action configuration is about
configuring life cycle actions, they can all be considered events and
actions that need to be done on a particular event. Let's generalize the
code by renaming life cycle actions to event actions so that it can be
reused later for non-lifecycle events.

d0ea530b

Z

Correct name of domain/pm/suspend-to-mem in docs · 0ec6aebb
由 Zeeshan Ali (Khattak) 提交于 10月 10, 2012

0ec6aebb

storage: Report UUID/name consistently in driver errors · 3af8280b

由 Cole Robinson 提交于 10月 08, 2012

Done with:

sed -i -e "s/no pool with matching uuid/no storage pool with matching uuid/g" src/storage/storage_driver.c
sed -i -e 's/"%s", _("no storage pool with matching uuid")/_("no storage pool with matching uuid %s"), obj->uuid/g' src/storage/storage_driver.c
sed -i -e 's/"%s", _("storage pool is not active")/_("storage pool '%s' is not active"), pool->def->name/g' src/storage/storage_driver.c

And a couple fixups before, during, and after, and a manual inspection
pass to make sure nothing was wonky.

3af8280b