提交 · 2384b6f019e6c3a03297856255a2e349e9174505 · openeuler / libvirt

26 10月, 2017 2 次提交

qemu: Enabled pause-before-switchover migration capability · 32c29f10

由 Jiri Denemark 提交于 10月 20, 2017

QEMU identified a race condition between the device state serialization
and the end of storage migration. Both QEMU and libvirt needs to be
updated to fix this.

Our migration work flow is modified so that after starting the migration
we to wait for QEMU to enter "pre-switchover", "postcopy-active", or
"completed" state. Once there, we cancel all block jobs as usual. But if
QEMU is in "pre-switchover", we need to resume the migration afterwards
and wait again for the real end (either "postcopy-active" or
"completed" state).

Old QEMU will just enter either "postcopy-active" or "completed"
directly, which is still correctly handled even by new libvirt. The
"pre-switchover" state will only be entered if QEMU supports it and the
pause-before-switchover capability was enabled. Thus all combinations of
libvirt and QEMU will work, but only new QEMU with new libvirt will
avoid the race condition.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>

32c29f10

qemu: Add pause-before-switchover migration capability · 6addde24

由 Jiri Denemark 提交于 10月 20, 2017

This new capability enables a pause before device state serialization so
that we can finish all block jobs without racing with the end of the
migration. The pause is indicated by "pre-switchover" state. Once we're
done QEMU enters "device" migration state.

This patch just defines the new capability and QEMU migration states and
their mapping to our job states.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>

6addde24

23 10月, 2017 7 次提交

qemu: Set correct job status when qemuMigrationRun fails · 55ac6a5d

由 Jiri Denemark 提交于 10月 19, 2017

Instead of enumerating all states which need to be turned into
QEMU_DOMAIN_JOB_STATUS_FAILED (and failing to add all of them), it's
better to mention just the one which needs to be left alone.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>
Reviewed-by: NJohn Ferlan <jferlan@redhat.com>

55ac6a5d

qemu: Consistently use exit_monitor in qemuMigrationRun · 73a35226

由 Jiri Denemark 提交于 10月 19, 2017

Almost every failure in qemuMigrationRun while we are talking to QEMU
monitor results in a jump to exit_monitor label. The only exception is
removed by this patch.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>
Reviewed-by: NJohn Ferlan <jferlan@redhat.com>

73a35226

qemu: Don't misuse "ret" in qemuMigrationRun · af32e57f

由 Jiri Denemark 提交于 10月 19, 2017

The "ret" variable is used for storing the return value of a function
and should not be used as a temporary variable.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>
Reviewed-by: NJohn Ferlan <jferlan@redhat.com>

af32e57f

qemu: Unite error handling in qemuMigrationRun · 7d2fbabc

由 Jiri Denemark 提交于 10月 19, 2017

Merge cancel and cancelPostCopy sections with the generic error section,
where we can easily decide whether canceling the ongoing migration is
required.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>
Reviewed-by: NJohn Ferlan <jferlan@redhat.com>

7d2fbabc

qemu: Split cleanup and error code in qemuMigrationRun · c1a643b6

由 Jiri Denemark 提交于 10月 19, 2017

Let cleanup only do things common to both failure and success paths and
move error handling code inside the new "error" section.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>
Reviewed-by: NJohn Ferlan <jferlan@redhat.com>

c1a643b6

qemu: Refactor qemuMigrationRun a bit · f8ede9cc

由 Jiri Denemark 提交于 10月 18, 2017

Some code which was supposed to be executed only when migration
succeeded was buried inside the cleanup code.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>
Reviewed-by: NJohn Ferlan <jferlan@redhat.com>

f8ede9cc

qemu: Use switch in qemuMigrationCompleted · 96032623

由 Jiri Denemark 提交于 10月 17, 2017

When adding a new job state it's useful to let the compiler complain
about places where we need to think about what to do with the new
state.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>
Reviewed-by: NJohn Ferlan <jferlan@redhat.com>

96032623

20 10月, 2017 1 次提交

qemu: Use bitmap with migration capabilities · 310287b1

由 Jiri Denemark 提交于 10月 17, 2017

All calls to qemuMonitorGetMigrationCapability in QEMU driver are
replaced with qemuMigrationCapsGet.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>
Reviewed-by: NJohn Ferlan <jferlan@redhat.com>

310287b1

19 10月, 2017 1 次提交

qemu: send allowReboot in migration cookie · e859da6f

由 Pavel Hrdina 提交于 10月 13, 2017

We need to send allowReboot in the migration cookie to ensure the same
behavior of the virDomainSetLifecycleAction() API on the destination.

Consider this scenario:

    1. On the source the domain is started with:
        <on_poweroff>destroy</on_poweroff>
        <on_reboot>restart</on_reboot>
        <on_crash>destroy</on_crash>

    2. User calls an API to set "destroy" for <on_reboot>:
        <on_poweroff>destroy</on_poweroff>
        <on_reboot>destroy</on_reboot>
        <on_crash>destroy</on_crash>

    3. The guest is migrated to a different host

    4a. Without the allowReboot in the migration cookie the QEMU
        process on destination would be started with -no-reboot
        which would prevent using the virDomainSetLifecycleAction() API
        for the rest of the guest lifetime.

    4b. With the allowReboot in the migration cookie the QEMU process
        on destination is started without -no-reboot like it was started
        on the source host and the virDomainSetLifecycleAction() API
        continues to work.

The following patch adds a QEMU implementation of the
virDomainSetLifecycleAction() API and that implementation disallows
using the API if all actions are set to "destroy" because we add
"-no-reboot" on the QEMU command line.  Changing the lifecycle action
is in this case pointless because the QEMU process is always terminated.
Reviewed-by: NJohn Ferlan <jferlan@redhat.com>
Signed-off-by: NPavel Hrdina <phrdina@redhat.com>

e859da6f

17 10月, 2017 1 次提交

qemu: Check QEMU error on failed migration · e1ca8ecb

由 Jiri Denemark 提交于 10月 12, 2017

When migration fails, QEMU may provide a description of the error in
the reply to query-migrate QMP command. We can fetch this error and use
it instead of the generic "unexpectedly failed" message.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>
Reviewed-by: NPavel Hrdina <phrdina@redhat.com>

e1ca8ecb

05 10月, 2017 2 次提交

qemu: process: Pass flags to qemuProcessPrepareHost · 2e78c588

由 Peter Krempa 提交于 10月 03, 2017

Pass flags to the function rather than just whether we have incoming
migration. This also enforces correct startup policy for USB devices
when reverting from a snapshot.

2e78c588

qemu: migration: Extract flags for starting VM into a variable · b8c0262e

由 Peter Krempa 提交于 10月 03, 2017

qemuMigrationPrepareAny called multiple of the functions starting the
qemu process for incoming migration by adding the flags explicitly.
Extract them to a variable so that they can be easily used for other
calls or changed in the future.

b8c0262e

25 9月, 2017 1 次提交

Print hex values with '0x' prefix and octal with '0' in debug messages · 32d6c738

由 Daniel P. Berrange 提交于 9月 25, 2017

Seeing a log message saying 'flags=93' is ambiguous & confusing unless
you happen to know that libvirt always prints flags as hex. Change our
debug messages so that they always add a '0x' prefix when printing flags,
and '0' prefix when printing mode. A few other misc places gain a '0x'
prefix in error messages too.
Signed-off-by: NDaniel P. Berrange <berrange@redhat.com>

32d6c738

14 9月, 2017 1 次提交

qemu: monitor: Remove support for "legacy" block jobs · 771a3860

由 Peter Krempa 提交于 9月 13, 2017

Drop all the monitor code necessary to do the downstream block jobs.
Reviewed-by: NEric Blake <eblake@redhat.com>

771a3860

07 9月, 2017 9 次提交

qemu: migration: don't expose incomplete job as complete · 3f2d6d82

由 Nikolay Shirokovskiy 提交于 9月 01, 2017

In case of real migration (not migrating to file on save, dump etc)
migration info is not complete at time qemu finishes migration
in normal (non postcopy) mode. We need to update disks stats,
downtime info etc. Thus let's not expose this job status as
completed.

To archive this let's set status to 'qemu completed' after
qemu reports migration is finished. It is not visible as complete
job to clients. Cookie code on confirm phase will finally turn
job into completed. As we don't need more things to do when
migrating to file status is set to 'completed' as before
in this case.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>

3f2d6d82

qemu: migrate: add mirror stats to migration stats · 8c466583

由 Nikolay Shirokovskiy 提交于 9月 01, 2017

When getting job info in case mirror does not reach ready phase
fetch mirror stats from qemu. Otherwise mirror stats are already
saved in current job.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>

8c466583

qemu: introduce migrating job status · 5a274d4f

由 Nikolay Shirokovskiy 提交于 9月 01, 2017

Instead of checking stat.status let's set status to migrating
as soon as migrate command is send (waiting for completion
is a good place too).
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>

5a274d4f

qemu: start all async job with job status active · b6868c3c

由 Nikolay Shirokovskiy 提交于 9月 01, 2017

Setting status to none has little value - getting job status
will not return even elapsed time.

After this patch getting job stats stays correct in a sence
it will not fetch migration stats because it consults
stats.status before doing the fetch.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>

b6868c3c

qemu: refactor fetching migration stats · 6a2a80c6

由 Nikolay Shirokovskiy 提交于 9月 01, 2017

qemuMigrationFetchJobStatus is rather inconvinient. Some of its
callers don't need status to be updated, some don't need to update
elapsed time right away. So let's update status or elapsed time
in callers instead.

This patch drops updating job status on getting job stats by
client. This way we will not provide status 'completed' while
it is not yet updated by migration routine.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>

6a2a80c6

N
qemu: drop excessive zero-out in qemuMigrationFetchJobStatus · e7967470
由 Nikolay Shirokovskiy 提交于 9月 01, 2017
```
qemuMonitorGetMigrationStats will do it for us anyway.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>
```
e7967470

qemu: drop QEMU_MIGRATION_COMPLETED_UPDATE_STATS · e87d4b9e

由 Nikolay Shirokovskiy 提交于 9月 01, 2017

This way we get stats only in one place. The former code waits for
complete/postcopy status basically and don't need to mess with stats.

The patch drops raising an error on stats updates failure. This
does not make much sense anyway.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>

e87d4b9e

qemu: introduce QEMU_DOMAIN_JOB_STATUS_POSTCOPY · 09f57f9a

由 Nikolay Shirokovskiy 提交于 9月 01, 2017

Let's introduce QEMU_DOMAIN_JOB_STATUS_POSTCOPY state for job.current->status
instead of checking job.current->stats.status. The latter can be changed
when fetching migration statistics. Moving state function from the variable
and leave only store function seems more managable.

This patch removes all state checking usage of stats except for
qemuDomainGetJobStatsInternal. This place will be handled separately.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>

09f57f9a

qemu: introduce qemu domain job status · 751a1c7f

由 Nikolay Shirokovskiy 提交于 9月 01, 2017

This patch simply switches code from using VIR_DOMAIN_JOB_* to
introduced QEMU_DOMAIN_JOB_STATUS_*. Later this gives us freedom
to introduce states for postcopy and mirroring phases.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>

751a1c7f

29 8月, 2017 1 次提交

qemu: Introduce and use qemuDomainRemoveInactiveJob · 9115dcd8

由 Michal Privoznik 提交于 8月 15, 2017

At some places we either already have synchronous job or we just
released it. Also, some APIs might want to use this code without
having to release their job. Anyway, the job acquire code is
moved out to qemuDomainRemoveInactiveJob so that
qemuDomainRemoveInactive does just what it promises.
Signed-off-by: NMichal Privoznik <mprivozn@redhat.com>
Reviewed-by: NJohn Ferlan <jferlan@redhat.com>

9115dcd8

18 8月, 2017 1 次提交

qemu: don't check whether offline migration is safe · abab46a2

由 Pavel Hrdina 提交于 8月 17, 2017

Offline migration transfers only the domain definition.

Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1449715Signed-off-by: NPavel Hrdina <phrdina@redhat.com>

abab46a2

20 7月, 2017 1 次提交

qemu: shared disks with cache=directsync should be safe for migration · fed9cc85

由 Hao Peng 提交于 7月 15, 2017

At present shared disks can be migrated with either readonly or cache=none. But
cache=directsync should be safe for migration, because both cache=directsync and cache=none
don't use the host page cache, and cache=direct write through qemu block layer cache.
Signed-off-by: NPeng Hao <peng.hao2@zte.com.cn>
Reviewed-by: NWang Yechao <wang.yechao255@zte.com.cn>

fed9cc85

19 7月, 2017 1 次提交

qemu: avoid deadlock on domain object enter monitor fail · 057c2fba

由 Wang King 提交于 7月 19, 2017

Should be followed with qemuDomainObjExitMonitor only if
qemuDomainObjEnterMonitorAsync returns 0.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>

057c2fba

26 6月, 2017 1 次提交

qemu: Avoid fd leak on incoming tunneled migration · 2abb0e4b

由 Jiri Denemark 提交于 6月 19, 2017

While qemuProcessIncomingDefNew takes an fd argument and stores it in
qemuProcessIncomingDef structure, the caller is still responsible for
closing the file descriptor.

Introduced by commit v1.2.21-140-ge7c6f457.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>
Reviewed-by: NJohn Ferlan <jferlan@redhat.com>

2abb0e4b

14 6月, 2017 1 次提交

qemu: Use qemuDomainCheckABIStability where needed · f0a3fe1b

由 Jiri Denemark 提交于 6月 14, 2017

Most places which want to check ABI stability for an active domain need
to call this API rather than the original
qemuDomainDefCheckABIStability. The only exception is in snapshots where
we need to decide what to do depending on the saved image data.

https://bugzilla.redhat.com/show_bug.cgi?id=1460952Signed-off-by: NJiri Denemark <jdenemar@redhat.com>
Reviewed-by: NPavel Hrdina <phrdina@redhat.com>

f0a3fe1b

13 6月, 2017 1 次提交

Use ATTRIBUTE_FALLTHROUGH · adf846d3

由 Marc Hartmayer 提交于 6月 07, 2017

Use ATTRIBUTE_FALLTHROUGH, introduced by commit
5d84f596, instead of comments to
indicate that the fall through is an intentional behavior.
Signed-off-by: NMarc Hartmayer <mhartmay@linux.vnet.ibm.com>
Reviewed-by: NBoris Fiuczynski <fiuczy@linux.vnet.ibm.com>
Reviewed-by: NBjoern Walk <bwalk@linux.vnet.ibm.com>

adf846d3

07 6月, 2017 4 次提交

qemu: Use updated CPU when starting QEMU if possible · 8e34f478

由 Jiri Denemark 提交于 5月 31, 2017

If QEMU is new enough and we have the live updated CPU definition in
either save or migration cookie, we can use it to enforce ABI. The
original guest CPU from domain XML will be stored in private data.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>
Reviewed-by: NPavel Hrdina <phrdina@redhat.com>

8e34f478

qemu: Send updated CPU in migration cookie · 48bc3053

由 Jiri Denemark 提交于 5月 26, 2017

Since the domain XML send during migration uses the original guest CPU
definition but we still want the destination to enforce ABI if it is new
enough, we send the live updated CPU definition in a migration cookie.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>
Reviewed-by: NPavel Hrdina <phrdina@redhat.com>

48bc3053

qemu: Always send persistent XML during migration · b0a16641

由 Jiri Denemark 提交于 6月 07, 2017

When persistent migration of a transient domain is requested but no
custom XML is passed to the migration API we would just let the
destination daemon make a persistent definition from the live definition
itself. This is not a problem now, but once the destination daemon
starts replacing the original CPU definition with the one from migration
cookie before starting a domain, it would need to add more ugly hacks to
reverse the operation. Let's just always send the persistent definition
in the cookie to make things a bit cleaner.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>
Reviewed-by: NPavel Hrdina <phrdina@redhat.com>

b0a16641

qemu: Report the original CPU in migratable xml · 356a2161

由 Jiri Denemark 提交于 5月 19, 2017

The destination host may not be able to start a domain using the live
updated CPU definition because either libvirt or QEMU may not be new
enough. Thus we need to send the original guest CPU definition.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>
Reviewed-by: NPavel Hrdina <phrdina@redhat.com>

356a2161

03 5月, 2017 1 次提交

qemu: Fix persistent migration of transient domains · 59307fad

由 Jiri Denemark 提交于 5月 02, 2017

While fixing a bug with incorrectly freed memory in commit
v3.1.0-399-g5498aa29, I accidentally broke persistent migration of
transient domains. Before adding qemuDomainDefCopy in the path, the code
just took NULL from vm->newDef and used it as the persistent def, which
resulted in no persistent XML being sent in the migration cookie. This
scenario is perfectly valid and the destination correctly handles it by
using the incoming live definition and storing it as the persistent one.

After the mentioned commit libvirtd would just segfault in the described
scenario.

https://bugzilla.redhat.com/show_bug.cgi?id=1446205Signed-off-by: NJiri Denemark <jdenemar@redhat.com>

59307fad

02 5月, 2017 1 次提交

qemu: Don't reset "events" migration capability · fc48fc79

由 Jiri Denemark 提交于 4月 28, 2017

When creating v3.2.0-77-g8be3ccd0 commit, I completely forgot that one
migration capability is very special. It's the "events" capability which
tells QEMU to report "MIGRATION" events. Since libvirt always wants the
events, it is enabled in qemuConnectMonitor and the rest of the code
should not touch it.

https://bugzilla.redhat.com/show_bug.cgi?id=1439841
https://bugzilla.redhat.com/show_bug.cgi?id=1441165Messed-up-by: NJiri Denemark <jdenemar@redhat.com>
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>

fc48fc79

27 4月, 2017 2 次提交

qemu: Report VIR_DOMAIN_JOB_OPERATION · 2a978269

由 Jiri Denemark 提交于 4月 26, 2017

Not all async jobs are visible via virDomainGetJobStats (either they are
too fast or getting the stats is not allowed during the job), but
forcing all of them to advertise the operation is easier than hunting
the jobs for which fetching statistics is allowed. And we won't need to
think about this when we add support for getting stats for more jobs.

https://bugzilla.redhat.com/show_bug.cgi?id=1441563Signed-off-by: NJiri Denemark <jdenemar@redhat.com>

2a978269

qemu: migration: fix race on cancelling drive mirror · bc82d1ea

由 Nikolay Shirokovskiy 提交于 4月 07, 2017

0feebab2 adds calling qemuBlockNodeNamesDetect for completed job
on updating block jobs. This affects cancelling drive mirror logic as
this function drops vm lock. Now we have to recheck all disks
before the disk with the completed block job before going
to wait for block job events.
Signed-off-by: NJiri Denemark <jdenemar@redhat.com>

bc82d1ea