1. 17 6月, 2015 1 次提交
    • E
      lxc: set nosuid+nodev+noexec flags on /proc/sys mount · bda5f2bd
      Eric W. Biederman 提交于
      Future kernels will mandate the use of nosuid+nodev+noexec
      flags when mounting the /proc/sys filesystem. Unconditionally
      add them now since they don't harm things regardless and could
      mitigate future security attacks.
      
      (cherry picked from commit 24710414)
      
      Conflicts:
          src/lxc/lxc_container.c
      bda5f2bd
  2. 22 5月, 2015 1 次提交
    • T
      conf: fix issue on virCPUDefCopy · 62d46ead
      Thibaut Collet 提交于
      The cpu xml copy is incorrect: the memAccess field is not copied.
      The lack of copy of this memAccess field can cause unexpected behaviour for live
      migration when vhost user is used.
      
      For example if guest has the following configuration:
      ....
      <cpu>
      <model>Westmere</model>
      <topology sockets="1" cores="4" threads="1"/>
      <numa>
      <cell id='0' cpus='0-3' memory='2097152' memAccess='shared'/>
      </numa>
      </cpu>
      ....
      
      The used configuration on the remote host in case of live migration is:
      ....
        <cpu mode='custom' match='exact'>
          <model fallback='allow'>Westmere</model>
          <topology sockets='1' cores='4' threads='1'/>
          <numa>
            <cell id='0' cpus='0-3' memory='2097152' unit='KiB'/>
          </numa>
        </cpu>
      ....
      
      On the remote host the lack of memAccess info can cause unexpected error on the
      qemu backend vhost user driver.
      
      Fixes: def6b359 ("docs, conf, schema: add support for shared memory mapping")
      
      This issue is present only for libvirt1.2.9 to libvirt1.2.12
      With patch 181742d4 ("conf: Move all NUMA configuration to virDomainNuma")
      present since libvirt1.2.13 the problem does not exist anymore as NUMA
      information are no more in the CPU configuration.
      Signed-off-by: NThibaut Collet <thibaut.collet@6wind.com>
      62d46ead
  3. 28 4月, 2015 18 次提交
    • C
      Prep for release 1.2.9.3 · 09661a8e
      Cole Robinson 提交于
      09661a8e
    • C
      storage: fs: Ignore volumes that fail to open with EACCESS/EPERM · 96b8510f
      Cole Robinson 提交于
      Trying to use qemu:///session to create a storage pool pointing at
      /tmp will usually fail with something like:
      
      $ virsh pool-start tmp
      error: Failed to start pool tmp
      error: cannot open volume '/tmp/systemd-private-c38cf0418d7a4734a66a8175996c384f-colord.service-kEyiTA': Permission denied
      
      If any volume in an FS pool can't be opened by the daemon, the refresh
      fails, and the pool can't be used.
      
      This causes pain for virt-install/virt-manager though. Imaging a user
      downloads a disk image to /tmp. virt-manager wants to import /tmp as
      a storage pool, so we can detect what disk format it is, and set the
      XML correctly. However this case will likely fail as explained above.
      
      Change the logic here to skip volumes that fail to open. This could
      conceivably cause user complaints along the lines of 'why doesn't
      libvirt show $ROOT-OWNED-VOLUME-FOO', but figuring that currently
      the pool won't even startup, I don't think there are any current
      users that care about that case.
      
      https://bugzilla.redhat.com/show_bug.cgi?id=1103308
      (cherry picked from commit 56476f6a)
      96b8510f
    • C
      domain: conf: Don't validate VM ostype/arch at daemon startup · 55540339
      Cole Robinson 提交于
      When parsing XML, we validate the passed ostype + arch combo against
      the detected hypervisor capabilities. This has led to the following
      problem:
      
      - Define x86 qemu guest
      - qemu is inadvertently removed from the host
      - libvirtd is restarted. fails to parse VM config since arch is removed
      - 'virsh list --all' is now empty, user is wondering where their VMs went
      
      Add a new internal flag VIR_DOMAIN_DEF_PARSE_SKIP_OSTYPE_CHECKS. Use
      it when loading VM and snapshot configs from disk.
      
      https://bugzilla.redhat.com/show_bug.cgi?id=1043572
      (cherry picked from commit f1a89a8b)
      55540339
    • C
      domain: conf: Better errors on bad os <type> values · 7d9739f2
      Cole Robinson 提交于
      If no <os><type> was specified:
        before: unknown OS type no OS type
        after : xml error: an os <type> must be specified
      
      If an <os><type> is specified that's not in our capabiliities data:
        before: unknown OS type: $type
        after : unsupported configuration: no support found for os <type> '$type'
      
      VIR_ERR_OS_TYPE is now unused (as it should be frankly) so drop its strings
      as well to save our translators some effort.
      
      (cherry picked from commit 3700c065)
      7d9739f2
    • D
      Report original error when QMP probing fails with new QEMU · a514d325
      Daniel P. Berrange 提交于
      If probing capabilities via QMP fails, we now have a check
      that prevents us falling back to -help parsing. Unfortunately
      the error message
      
        "Failed to probe capabilities for /usr/bin/qemu-kvm:
         unsupported configuration: QEMU 2.1.2 is too new for help parsing"
      
      is proving rather unhelpful to the user. We need to be telling
      them why QMP failed (the root cause), rather than they can't
      use -help (the side effect).
      
      To do this we should capture stderr during QMP probing, and
      if -help parsing then sees a new QEMU version, we know that
      QMP should have worked, and so we can show the messages from
      stderr. The message thus becomes
      
        "Failed to probe capabilities for /usr/bin/qemu-kvm:
         internal error: QEMU / QMP failed: Could not access
         KVM kernel module: No such file or directory
         failed to initialize KVM: No such file or directory"
      
      (cherry picked from commit 25bf888a)
      a514d325
    • J
      cpu: Add {Haswell,Broadwell}-noTSX CPU models · b2436daf
      Jiri Denemark 提交于
      QEMU 2.3 adds these new models to cover Haswell and Broadwell CPUs with
      updated microcode. Luckily, they also reverted former the machine type
      specific changes to existing models. And since these changes were never
      released, we don't need to hack around them in libvirt.
      Signed-off-by: NJiri Denemark <jdenemar@redhat.com>
      (cherry picked from commit c563b506)
      b2436daf
    • P
      storage: qemu: Fix security labelling of new image chain elements · 5d6cf6d4
      Peter Krempa 提交于
      When creating a disk image snapshot the libvirt code would blindly copy
      the parents label to the newly created image. This runs into problems
      when you start a VM from an image hosted on NFS (or other storage system
      that doesn't support selinux labels) and the snapshot destination is on
      a storage system that does support selinux labels. Libvirt's code in
      that case generates a different security label for the image hosted on
      NFS. This label is valid only for NFS images and doesn't allow access in
      case of a locally stored image.
      
      To fix this issue libvirt needs to refrain from copying security
      information in cases where the default domain seclabel is a better
      choice.
      
      This patch repurposes the now unused @force argument of
      virStorageSourceInitChainElement to denote whether a copy of the
      security labelling stuff should be attempted or not. This allows to
      fine-control the copy operation for cases where we need to keep the
      label of the old disk vs. the cases where we need to keep the label
      unset to use the default domain imagelabel.
      
      Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1151718
      (cherry picked from commit 7e130e8b)
      5d6cf6d4
    • J
      Ignore CPU features without a model for host-passthrough · 66a7f51a
      Ján Tomko 提交于
      This fixes reverting to snapshots created by older libvirt
      and allows libvirt not to lose track of a domain that
      has this in its live status XML (such as a domain
      restored from managedsave)
      
      https://bugzilla.redhat.com/show_bug.cgi?id=1030793
      https://bugzilla.redhat.com/show_bug.cgi?id=1151885
      (cherry picked from commit 15abebde)
      66a7f51a
    • J
      Do not format CPU features without a model · c9da7af6
      Ján Tomko 提交于
      For host-passthrough CPU we don't honor the CPU
      features specified in the XML, but we allow
      outputting them via the UPDATE_CPU flag for dumpxml,
      this gives user a rough idea of what features the CPU
      might have.
      
      After restoring a managedsave'd domain, the features
      might end up in the live status XML (in /var/run) without
      the model. This XML cannot be parsed by the daemon after
      restart and the domain might disappear.
      
      This fix skips formatting the features for HOST_PASSTHROUGH
      when UPDATE_CPU is not specified, so the newly restored domains
      and newly created snapshots won't be affected.
      
      Note: this doesn't fix existing snapshots or already restored
      running domains.
      
      https://bugzilla.redhat.com/show_bug.cgi?id=1030793
      https://bugzilla.redhat.com/show_bug.cgi?id=1151885
      (cherry picked from commit dd324bb2)
      c9da7af6
    • M
      domcaps: Check for architecture more wisely · 8feb6bad
      Michal Privoznik 提交于
      https://bugzilla.redhat.com/show_bug.cgi?id=1209948
      
      So we have this bug. The virConnectGetDomainCapabilities() API
      performs a couple of checks before it produces any result. One of
      the checks is if the architecture requested by user can be run by
      the binary (again user provided). However, the check is pretty
      dumb. It merely compares if the default binary architecture
      matches the one provided by user. However, a qemu binary can run
      multiple architectures. For instance: qemu-system-ppc64 can run:
      ppc, ppcle, ppc64, ppc64le and ppcemb. The default is ppc64, so
      if user requested something else, like ppc64le, the check would
      have failed without obvious reason.
      Signed-off-by: NMichal Privoznik <mprivozn@redhat.com>
      (cherry picked from commit 0af9325e)
      8feb6bad
    • P
      daemon: Clear fake domain def object that is used to check ACL prior to use · 4650e48a
      Peter Krempa 提交于
      The fake object is used to pass the domain name and UUID to the ACL code
      for events where we don't have the full domain def when dispatching
      events. The rest of the entries would be left uninitialized. While this
      is not a problem code-wise as the used fields are initialized it looks
      ugly in the debugger.
      
      (cherry picked from commit 6ca857c7)
      4650e48a
    • P
      util: identity: Harden virIdentitySetCurrent() · 363bc010
      Peter Krempa 提交于
      Don't unref the old identity unless we set the new one correctly and
      unref the new one on failure to set it so that we don't leak any
      references or use invalid pointers.
      
      (cherry picked from commit ad886fa6)
      363bc010
    • C
      qemu: Build nvram directory at driver startup · b843bea3
      Cole Robinson 提交于
      Similar to what was done for the channel socket in the previous commit.
      
      (cherry picked from commit 19425d11)
      b843bea3
    • C
      qemu: Build channel autosocket directory at driver startup · a86252af
      Cole Robinson 提交于
      Rather than depend on the RPM to put it in place, since this doesn't
      cover the qemu:///session case. Currently auto allocated socket path is
      completely busted with qemu:///session
      
      https://bugzilla.redhat.com/show_bug.cgi?id=1105274
      
      And because we chown the directory at driver startup now, this also fixes
      autosocket startup failures when using user/group=root
      
      https://bugzilla.redhat.com/show_bug.cgi?id=1044561
      https://bugzilla.redhat.com/show_bug.cgi?id=1146886
      (cherry picked from commit e31ab02f)
      a86252af
    • C
      qemu: chown autoDumpPath on driver startup · 116742ee
      Cole Robinson 提交于
      Not sure if this is required, but it makes things consistent with the
      rest of the directories.
      
      (cherry picked from commit db3ccd58)
      116742ee
    • C
      qemu: conf: Clarify paths that are relative to libDir · 808a6382
      Cole Robinson 提交于
      Rather than duplicate libDir for each new path
      
      (cherry picked from commit c19f43ae)
      808a6382
    • D
      avoid using deprecated udev logging functions · 96789b1e
      Daniel P. Berrange 提交于
      In systemd >= 218, the udev_set_log_fn method has been marked
      deprecated and turned into a no-op. Nothing in the udev client
      library will print to stderr by default anymore, so we can
      just stop installing a logging hook for new enough udev.
      
      (cherry picked from commit a93a3b97)
      96789b1e
    • C
      qemu: Always refresh capabilities if no <guests> found · 9ebc1631
      Cole Robinson 提交于
      - Remove all qemu emulators
      - Restart libvirtd
      - Install qemu emulators
      - Call 'virsh version' -> errors
      
      The only thing that will force the qemu driver to refresh it's cached
      capablities info is an explict API call to GetCapabilities.
      
      However in the case when the initial caps lookup at driver connect didn't
      find a single qemu emulator to poll, the driver is effectively useless
      and really can't do anything until it's populated some qemu capabilities
      info.
      
      With the above steps, the user would have to either know about the
      magic refresh capabilities call, or restart libvirtd to pick up the
      changes.
      
      Instead, this patch changes things so that every time a part of th
      driver requests access to capabilities info, check to see if
      we've previously seen any emulators. If not, force a refresh.
      
      In the case of 'still no emulators found', this is still very quick, so
      I can't think of a downside.
      
      https://bugzilla.redhat.com/show_bug.cgi?id=1000116
      (cherry picked from commit 95546c43)
      9ebc1631
  4. 23 4月, 2015 1 次提交
    • Z
      qemu: move setting emulatorpin ahead of monitor showing up · 91ced385
      Zhou yimin 提交于
      If VM is configured with many devices(including passthrough devices)
      and large memory, libvirtd will take seconds(in the worst case) to
      wait for monitor. In this period the qemu process may run on any
      PCPU though I intend to pin emulator to the specified PCPU in xml
      configuration.
      
      Actually qemu process takes high cpu usage during vm startup.
      So this is not the strict CPU isolation in this case.
      Signed-off-by: NZhou yimin <zhouyimin@huawei.com>
      (cherry picked from commit 411cea63)
      91ced385
  5. 16 4月, 2015 9 次提交
  6. 10 4月, 2015 1 次提交
    • M
      virNetworkDefUpdateIPDHCPHost: Don't crash when updating network · cf2289f2
      Michal Privoznik 提交于
      https://bugzilla.redhat.com/show_bug.cgi?id=1182486
      
      When updating a network and adding new ip-dhcp-host entry, the deamon
      may crash. The problem is, we iterate over existing <host/> entries
      trying to compare MAC addresses to see if there's already an existing
      rule. However, not all entries are required to have MAC address. For
      instance, the following is perfectly valid entry:
      
      <host id='00:04:58:fd:e4:15:1b:09:4c:0e:09:af:e4:d3:8c:b8:ca:1e'
      name='redhatipv6.redhat.com' ip='2001:db8:ca2:2::119'/>
      
      When the checking loop iterates over this, the entry's MAC address is
      accessed directly. Well, the fix is obvious - check if the address is
      defined before trying to compare it.
      Signed-off-by: NMichal Privoznik <mprivozn@redhat.com>
      (cherry picked from commit 7d3ae359)
      cf2289f2
  7. 17 3月, 2015 1 次提交
    • E
      daemon: avoid memleak when ListAll returns nothing · b9dacdd4
      Eric Blake 提交于
      Commit 4f25146b (v1.2.8) managed to silence Coverity, but at the
      cost of a memory leak detected by valgrind:
      ==24129== 40 bytes in 5 blocks are definitely lost in loss record 355 of 637
      ==24129==    at 0x4A08B1C: realloc (in /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so)
      ==24129==    by 0x5084B8E: virReallocN (viralloc.c:245)
      ==24129==    by 0x514D5AA: virDomainObjListExport (domain_conf.c:22200)
      ==24129==    by 0x201227DB: qemuConnectListAllDomains (qemu_driver.c:18042)
      ==24129==    by 0x51CC1B6: virConnectListAllDomains (libvirt-domain.c:6797)
      ==24129==    by 0x14173D: remoteDispatchConnectListAllDomains (remote.c:1580)
      ==24129==    by 0x121BE1: remoteDispatchConnectListAllDomainsHelper (remote_dispatch.h:1072)
      
      In short, every time a client calls a ListAll variant and asks
      for the resulting list, but there are 0 elements to return, we
      end up leaking the 1-entry array that holds the NULL terminator.
      
      What's worse, a read-only client can access these functions in a
      tight loop to cause libvirtd to eventually run out of memory; and
      this can be considered a denial of service attack against more
      privileged clients.  Thankfully, the leak is so small (8 bytes per
      call) that you would already have some other denial of service with
      any guest calling the API that frequently, so an out-of-memory
      crash is unlikely enough that this did not warrant a CVE.
      
      * daemon/remote.c (remoteDispatchConnectListAllDomains)
      (remoteDispatchDomainListAllSnapshots)
      (remoteDispatchDomainSnapshotListAllChildren)
      (remoteDispatchConnectListAllStoragePools)
      (remoteDispatchStoragePoolListAllVolumes)
      (remoteDispatchConnectListAllNetworks)
      (remoteDispatchConnectListAllInterfaces)
      (remoteDispatchConnectListAllNodeDevices)
      (remoteDispatchConnectListAllNWFilters)
      (remoteDispatchConnectListAllSecrets)
      (remoteDispatchNetworkGetDHCPLeases): Plug leak.
      Signed-off-by: NEric Blake <eblake@redhat.com>
      (cherry picked from commit 3c2ff502)
      Signed-off-by: NEric Blake <eblake@redhat.com>
      
      Conflicts:
      	daemon/remote.c - context with older cleanup styles
      b9dacdd4
  8. 26 2月, 2015 2 次提交
  9. 14 2月, 2015 1 次提交
  10. 11 2月, 2015 1 次提交
    • Z
      conf: Fix libvirtd crash and memory leak caused by virDomainVcpuPinDel() · 0df83949
      Zhang Bo 提交于
      The function virDomainVcpuPinDel() used vcpupin_list to stand for
      def->cputune.vcpupin, which made the codes more readable.
      However, in this function, it will realloc vcpupin_list later.
      As the definition of realloc(), it may free vcpupin_list and then
      points it to a new-realloced address, but def->cputune.vcpupin doesn't
      point to the new address(it's freed however).
      Thus,
      1) When we refer to the def->cputune.vcpupin afterwards, which was freed
      by realloc(), an INVALID READ occurs, and libvirtd may crash.
      2) As no one will use vcpupin_list any more, and no one frees it(it's just
      alloced by realloc()), memory leak occurs.
      
      Part of the valgrind logs are shown as below:
      ==1837== Thread 15:
      ==1837== Invalid read of size 8
      ==1837==    at 0x5367337: virDomainDefFormatInternal (domain_conf.c:18392)
              which is : virBufferAsprintf(buf, "<vcpupin vcpu='%u' ",
                                def->cputune.vcpupin[i]->vcpuid);
      ==1837==    by 0x536966C: virDomainObjFormat (domain_conf.c:18970)
      ==1837==    by 0x5369743: virDomainSaveStatus (domain_conf.c:19166)
      ==1837==    by 0x117B26DC: qemuDomainPinVcpuFlags (qemu_driver.c:4586)
      ==1837==    by 0x53EA313: virDomainPinVcpuFlags (libvirt.c:9803)
      ==1837==    by 0x14CB7D: remoteDispatchDomainPinVcpuFlags (remote_dispatch.h:6762)
      ==1837==    by 0x14CC81: remoteDispatchDomainPinVcpuFlagsHelper (remote_dispatch.h:6740)
      ==1837==    by 0x5464C30: virNetServerProgramDispatchCall (virnetserverprogram.c:437)
      ==1837==    by 0x546507A: virNetServerProgramDispatch (virnetserverprogram.c:307)
      ==1837==    by 0x171B83: virNetServerProcessMsg (virnetserver.c:172)
      ==1837==    by 0x171E6E: virNetServerHandleJob (virnetserver.c:193)
      ==1837==    by 0x5318E78: virThreadPoolWorker (virthreadpool.c:145)
      ==1837==  Address 0x12ea2870 is 0 bytes inside a block of size 16 free'd
      ==1837==    at 0x4C291AC: realloc (in /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so)
      ==1837==    by 0x52A3D14: virReallocN (viralloc.c:245)
      ==1837==    by 0x52A3DFB: virShrinkN (viralloc.c:372)
      ==1837==    by 0x52A3F57: virDeleteElementsN (viralloc.c:503)
      ==1837==    by 0x533939E: virDomainVcpuPinDel (domain_conf.c:15405)  //doReset为true时才会进到。
      ==1837==    by 0x117B2642: qemuDomainPinVcpuFlags (qemu_driver.c:4573)
      ==1837==    by 0x53EA313: virDomainPinVcpuFlags (libvirt.c:9803)
      ==1837==    by 0x14CB7D: remoteDispatchDomainPinVcpuFlags (remote_dispatch.h:6762)
      ==1837==    by 0x14CC81: remoteDispatchDomainPinVcpuFlagsHelper (remote_dispatch.h:6740)
      ==1837==    by 0x5464C30: virNetServerProgramDispatchCall (virnetserverprogram.c:437)
      ==1837==    by 0x546507A: virNetServerProgramDispatch (virnetserverprogram.c:307)
      ==1837==    by 0x171B83: virNetServerProcessMsg (virnetserver.c:172)
      
      Steps to reproduce the problem:
      1) use virDomainPinVcpuFlags() to pin a guest's vcpu to all the pcpus
      of the host.
      
      This patch uses def->cputune.vcpupin instead of vcpupin_list to do the
      realloc() job, to avoid invalid read or memory leaking.
      Signed-off-by: NZhang Bo <oscar.zhangbo@huawei.com>
      Signed-off-by: Yue Wenyuan <yuewenyuan@huawei.com@huawei.com>
      (cherry picked from commit 2d27dcb0)
      Signed-off-by: NJán Tomko <jtomko@redhat.com>
      0df83949
  11. 08 2月, 2015 3 次提交
    • C
      Prep for release 1.2.9.2 · 2ed8e729
      Cole Robinson 提交于
      2ed8e729
    • P
      util: storage: Fix parsing of nbd:// URI without path · f6dd410d
      Peter Krempa 提交于
      If a storage file would be backed with a NBD device without path
      (nbd://localhost) libvirt would crash when parsing the backing path for
      the disk as the URI structure's path element is NULL in such case but
      the NBD parser would access it shamelessly.
      
      (cherry picked from commit fdb80ed4)
      f6dd410d
    • W
      qemu: fix domain startup failing with 'strict' mode in numatune · e2267726
      Wang Rui 提交于
      If the memory mode is specified as 'strict' and with one node, we
      get the following error when starting domain.
      
      error: Unable to write to '$cgroup_path/cpuset.mems': Device or resource busy
      
      XML is configured with numatune as follows:
        <numatune>
          <memory mode='strict' nodeset='0'/>
        </numatune>
      
      It's broken by Commit 411cea63
      which moved qemuSetupCgroupForEmulator() before setting cpuset.mems
      in qemuSetupCgroupPostInit.
      
      Directory '$cgroup_path/emulator/' is created in qemuSetupCgroupForEmulator.
      But '$cgroup_path/emulator/cpuset.mems' it not set and has a default value
      (all nodes, such as 0-1). Then we setup '$cgroup_path/cpuset.mems' to the
      nodemask (in this case it's '0') in qemuSetupCgroupPostInit. It must fail.
      
      This patch makes '$cgroup_path/emulator/cpuset.mems' is set before
      '$cgroup_path/cpuset.mems'. The action is similar with that in
      qemuDomainSetNumaParamsLive.
      Signed-off-by: NWang Rui <moon.wangrui@huawei.com>
      (cherry picked from commit c6e90248)
      e2267726
  12. 02 2月, 2015 1 次提交
    • J
      storage: Need to clear pool prior to refreshPool during Autostart · ca1701e3
      John Ferlan 提交于
      https://bugzilla.redhat.com/show_bug.cgi?id=1176510
      
      When storageDriverAutostart is called path virStateReload via a 'service
      libvirtd reload', then because the volume list in the pool wasn't cleared
      prior to the call, each volume would be listed multiple times (as many
      times as we reload). I believe the issue would be introduced by commit
      id '9e093f0b' at least for the libvirtd reload path, although I suppose
      the introduction of virStateReload (commit id '70da0494') could be a
      different cause.
      
      Thus like other places prior to calling refreshPool, we need to call
      virStoragePoolObjClearVols
      
      (cherry picked from commit 1d2e4d8c)
      ca1701e3