提交 · 81f70ba233d5f660e1ea5fe23260ee323af5d53a · openeuler / Kernel

22 12月, 2015 1 次提交

由 Alex Williamson 提交于 12月 21, 2015

There is really no way to safely give a user full access to a DMA
capable device without an IOMMU to protect the host system. There is
also no way to provide DMA translation, for use cases such as device
assignment to virtual machines. However, there are still those users
that want userspace drivers even under those conditions. The UIO
driver exists for this use case, but does not provide the degree of
device access and programming that VFIO has. In an effort to avoid
code duplication, this introduces a No-IOMMU mode for VFIO.

This mode requires building VFIO with CONFIG_VFIO_NOIOMMU and enabling
the "enable_unsafe_noiommu_mode" option on the vfio driver. This
should make it very clear that this mode is not safe. Additionally,
CAP_SYS_RAWIO privileges are necessary to work with groups and
containers using this mode. Groups making use of this support are
named /dev/vfio/noiommu-$GROUP and can only make use of the special
VFIO_NOIOMMU_IOMMU for the container. Use of this mode, specifically
binding a device without a native IOMMU group to a VFIO bus driver
will taint the kernel and should therefore not be considered
supported. This patch includes no-iommu support for the vfio-pci bus
driver only.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>
Acked-by: NMichael S. Tsirkin <mst@redhat.com>

03a76b60

04 12月, 2015 1 次提交

Revert: "vfio: Include No-IOMMU mode" · ae5515d6

由 Alex Williamson 提交于 12月 04, 2015

Revert commit 033291ec ("vfio: Include No-IOMMU mode") due to lack
of a user. This was originally intended to fill a need for the DPDK
driver, but uptake has been slow so rather than support an unproven
kernel interface revert it and revisit when userspace catches up.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

ae5515d6

20 11月, 2015 1 次提交

vfio-pci: constify pci_error_handlers structures · 7d10f4e0

由 Julia Lawall 提交于 11月 14, 2015

This pci_error_handlers structure is never modified, like all the other
pci_error_handlers structures, so declare it as const.

Done with the help of Coccinelle.
Signed-off-by: NJulia Lawall <Julia.Lawall@lip6.fr>
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

7d10f4e0

05 11月, 2015 1 次提交

vfio: Include No-IOMMU mode · 033291ec

由 Alex Williamson 提交于 10月 15, 2015

033291ec

10 6月, 2015 1 次提交

vfio/pci: Fix racy vfio_device_get_from_dev() call · 20f30017

由 Alex Williamson 提交于 6月 09, 2015

Testing the driver for a PCI device is racy, it can be all but
complete in the release path and still report the driver as ours.
Therefore we can't trust drvdata to be valid. This race can sometimes
be seen when one port of a multifunction device is being unbound from
the vfio-pci driver while another function is being released by the
user and attempting a bus reset. The device in the remove path is
found as a dependent device for the bus reset of the release path
device, the driver is still set to vfio-pci, but the drvdata has
already been cleared, resulting in a null pointer dereference.

To resolve this, fix vfio_device_get_from_dev() to not take the
dev_get_drvdata() shortcut and instead traverse through the
iommu_group, vfio_group, vfio_device path to get a reference we
can trust. Once we have that reference, we know the device isn't
in transition and we can test to make sure the driver is still what
we expect, so that we don't interfere with devices we don't own.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

20f30017

02 5月, 2015 1 次提交

vfio-pci: Log device requests more verbosely · 5f55d2ae

由 Alex Williamson 提交于 4月 28, 2015

Log some clues indicating whether the user is receiving device
request interfaces or not listening.  This can help indicate why a
driver unbind is blocked or explain why QEMU automatically unplugged
a device from the VM.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

5f55d2ae

08 4月, 2015 6 次提交

vfio-pci: Fix use after free · 5a0ff177

由 Alex Williamson 提交于 4月 08, 2015

Reported by 0-day test infrastructure.

Fixes: ecaa1f6a ("vfio-pci: Add VGA arbiter client")
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

5a0ff177

vfio-pci: Move idle devices to D3hot power state · 6eb70187

由 Alex Williamson 提交于 4月 07, 2015

We can save some power by putting devices that are bound to vfio-pci
but not in use by the user in the D3hot power state. Devices get
woken into D0 when opened by the user. Resets return the device to
D0, so we need to re-apply the low power state after a bus reset.
It's tempting to try to use D3cold, but we have no reason to inhibit
hotplug of idle devices and we might get into a loop of having the
device disappear before we have a chance to try to use it.

A new module parameter allows this feature to be disabled if there are
devices that misbehave as a result of this change.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

6eb70187

vfio-pci: Remove warning if try-reset fails · 561d72dd

由 Alex Williamson 提交于 4月 07, 2015

As indicated in the comment, this is not entirely uncommon and
causes user concern for no reason.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

561d72dd

vfio-pci: Allow PCI IDs to be specified as module options · 80c7e8cc

由 Alex Williamson 提交于 4月 07, 2015

This copies the same support from pci-stub for exactly the same
purpose, enabling a set of PCI IDs to be automatically added to the
driver's dynamic ID table at module load time. The code here is
pretty simple and both vfio-pci and pci-stub are fairly unique in
being meta drivers, capable of attaching to any device, so there's no
attempt made to generalize the code into pci-core.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

80c7e8cc

vfio-pci: Add VGA arbiter client · ecaa1f6a

由 Alex Williamson 提交于 4月 07, 2015

If VFIO VGA access is disabled for the user, either by CONFIG option
or module parameter, we can often opt-out of VGA arbitration.  We can
do this when PCI bridge control of VGA routing is possible.  This
means that we must have a parent bridge and there must only be a
single VGA device below that bridge.  Fortunately this is the typical
case for discrete GPUs.

Doing this allows us to minimize the impact of additional GPUs, in
terms of VGA arbitration, when they are only used via vfio-pci for
non-VGA applications.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

ecaa1f6a

vfio-pci: Add module option to disable VGA region access · 88c0dead

由 Alex Williamson 提交于 4月 07, 2015

Add a module option so that we don't require a CONFIG change and
kernel rebuild to disable VGA support. Not only can VGA support be
troublesome in itself, but by disabling it we can reduce the impact
to host devices by doing a VGA arbitration opt-out.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

88c0dead

17 3月, 2015 2 次提交

vfio: initialize the virqfd workqueue in VFIO generic code · 42ac9bd1

由 Antonios Motakis 提交于 3月 16, 2015

Now we have finally completely decoupled virqfd from VFIO_PCI. We can
initialize it from the VFIO generic code, in order to safely use it from
multiple independent VFIO bus drivers.
Signed-off-by: NAntonios Motakis <a.motakis@virtualopensystems.com>
Signed-off-by: NBaptiste Reynal <b.reynal@virtualopensystems.com>
Reviewed-by: NEric Auger <eric.auger@linaro.org>
Tested-by: NEric Auger <eric.auger@linaro.org>
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

42ac9bd1

vfio: virqfd: rename vfio_pci_virqfd_init and vfio_pci_virqfd_exit · bb78e9ea

由 Antonios Motakis 提交于 3月 16, 2015

The functions vfio_pci_virqfd_init and vfio_pci_virqfd_exit are not really
PCI specific, since we plan to reuse the virqfd code with more VFIO drivers
in addition to VFIO_PCI.
Signed-off-by: NAntonios Motakis <a.motakis@virtualopensystems.com>
[Baptiste Reynal: Move rename vfio_pci_virqfd_init and vfio_pci_virqfd_exit
from "vfio: add a vfio_ prefix to virqfd_enable and virqfd_disable and export"]
Signed-off-by: NBaptiste Reynal <b.reynal@virtualopensystems.com>
Reviewed-by: NEric Auger <eric.auger@linaro.org>
Tested-by: NEric Auger <eric.auger@linaro.org>
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

bb78e9ea

11 2月, 2015 1 次提交

vfio-pci: Add device request interface · 6140a8f5

由 Alex Williamson 提交于 2月 06, 2015

Userspace can opt to receive a device request notification,
indicating that the device should be released.  This is setup
the same way as the error IRQ and also supports eventfd signaling.
Future support may forcefully remove the device from the user if
the request is ignored.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

6140a8f5

08 1月, 2015 1 次提交

vfio-pci: Fix the check on pci device type in vfio_pci_probe() · 7c2e211f

由 Wei Yang 提交于 1月 07, 2015

Current vfio-pci just supports normal pci device, so vfio_pci_probe() will
return if the pci device is not a normal device. While current code makes a
mistake. PCI_HEADER_TYPE is the offset in configuration space of the device
type, but we use this value to mask the type value.

This patch fixs this by do the check directly on the pci_dev->hdr_type.
Signed-off-by: NWei Yang <weiyang@linux.vnet.ibm.com>
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>
Cc: stable@vger.kernel.org # v3.6+

7c2e211f

08 11月, 2014 1 次提交

vfio: make vfio run on s390 · 1d53a3a7

由 Frank Blaschka 提交于 11月 07, 2014

add Kconfig switch to hide INTx
add Kconfig switch to let vfio announce PCI BARs are not mapable
Signed-off-by: NFrank Blaschka <frank.blaschka@de.ibm.com>
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

1d53a3a7

30 9月, 2014 1 次提交

vfio-pci: Fix remove path locking · 93899a67

由 Alex Williamson 提交于 9月 29, 2014

Locking both the remove() and release() path results in a deadlock
that should have been obvious.  To fix this we can get and hold the
vfio_device reference as we evaluate whether to do a bus/slot reset.
This will automatically block any remove() calls, allowing us to
remove the explict lock.  Fixes 61d79256.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>
Cc: stable@vger.kernel.org	[3.17]

93899a67

09 8月, 2014 1 次提交

drivers/vfio: Enable VFIO if EEH is not supported · 9b936c96

由 Alexey Kardashevskiy 提交于 8月 08, 2014

The existing vfio_pci_open() fails upon error returned from
vfio_spapr_pci_eeh_open(), which breaks POWER7's P5IOC2 PHB
support which this patch brings back.

The patch fixes the issue by dropping the return value of
vfio_spapr_pci_eeh_open().
Signed-off-by: NAlexey Kardashevskiy <aik@ozlabs.ru>
Signed-off-by: NGavin Shan <gwshan@linux.vnet.ibm.com>
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

9b936c96

08 8月, 2014 3 次提交

vfio-pci: Attempt bus/slot reset on release · bc4fba77

由 Alex Williamson 提交于 8月 07, 2014

Each time a device is released, mark whether a local reset was
successful or whether a bus/slot reset is needed.  If a reset is
needed and all of the affected devices are bound to vfio-pci and
unused, allow the reset.  This is most useful when the userspace
driver is killed and releases all the devices in an unclean state,
such as when a QEMU VM quits.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

bc4fba77

vfio-pci: Use mutex around open, release, and remove · 61d79256

由 Alex Williamson 提交于 8月 07, 2014

Serializing open/release allows us to fix a refcnt error if we fail
to enable the device and lets us prevent devices from being unbound
or opened, giving us an opportunity to do bus resets on release.  No
restriction added to serialize binding devices to vfio-pci while the
mutex is held though.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

61d79256

vfio-pci: Release devices with BusMaster disabled · 9c22e660

由 Alex Williamson 提交于 8月 07, 2014

Our current open/release path looks like this:

vfio_pci_open
  vfio_pci_enable
    pci_enable_device
    pci_save_state
    pci_store_saved_state

vfio_pci_release
  vfio_pci_disable
    pci_disable_device
    pci_restore_state

pci_enable_device() doesn't modify PCI_COMMAND_MASTER, so if a device
comes to us with it enabled, it persists through the open and gets
stored as part of the device saved state.  We then restore that saved
state when released, which can allow the device to attempt to continue
to do DMA.  When the group is disconnected from the domain, this will
get caught by the IOMMU, but if there are other devices in the group,
the device may continue running and interfere with the user.  Even in
the former case, IOMMUs don't necessarily behave well and a stream of
blocked DMA can result in unpleasant behavior on the host.

Explicitly disable Bus Master as we're enabling the device and
slightly re-work release to make sure that pci_disable_device() is
the last thing that touches the device.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

9c22e660

05 8月, 2014 1 次提交

drivers/vfio: EEH support for VFIO PCI device · 1b69be5e

由 Gavin Shan 提交于 6月 10, 2014

The patch adds new IOCTL commands for sPAPR VFIO container device
to support EEH functionality for PCI devices, which have been passed
through from host to somebody else via VFIO.
Signed-off-by: NGavin Shan <gwshan@linux.vnet.ibm.com>
Acked-by: NAlexander Graf <agraf@suse.de>
Acked-by: NAlex Williamson <alex.williamson@redhat.com>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

1b69be5e

31 5月, 2014 2 次提交

drivers/vfio/pci: Fix wrong MSI interrupt count · fd49c81f

由 Gavin Shan 提交于 5月 30, 2014

According PCI local bus specification, the register of Message
Control for MSI (offset: 2, length: 2) has bit#0 to enable or
disable MSI logic and it shouldn't be part contributing to the
calculation of MSI interrupt count. The patch fixes the issue.
Signed-off-by: NGavin Shan <gwshan@linux.vnet.ibm.com>
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

fd49c81f

vfio/pci: Fix unchecked return value · eb5685f0

由 Alex Williamson 提交于 5月 30, 2014

There's nothing we can do different if pci_load_and_free_saved_state()
fails, other than maybe print some log message, but the actual re-load
of the state is an unnecessary step here since we've only just saved
it.  We can cleanup a coverity warning and eliminate the unnecessary
step by freeing the state ourselves.

Detected by Coverity: CID 753101
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

eb5685f0

16 1月, 2014 1 次提交

vfio-pci: Use pci "try" reset interface · 890ed578

由 Alex Williamson 提交于 1月 14, 2014

PCI resets will attempt to take the device_lock for any device to be
reset. This is a problem if that lock is already held, for instance
in the device remove path. It's not sufficient to simply kill the
user process or skip the reset if called after .remove as a race could
result in the same deadlock. Instead, we handle all resets as "best
effort" using the PCI "try" reset interfaces. This prevents the user
from being able to induce a deadlock by triggering a reset.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>
Signed-off-by: NBjorn Helgaas <bhelgaas@google.com>

890ed578

15 1月, 2014 1 次提交

vfio-pci: Don't use device_lock around AER interrupt setup · 3be3a074

由 Alex Williamson 提交于 1月 14, 2014

device_lock is much too prone to lockups. For instance if we have a
pending .remove then device_lock is already held. If userspace
attempts to modify AER signaling after that point, a deadlock occurs.
eventfd setup/teardown is already protected in vfio with the igate
mutex. AER is not a high performance interrupt, so we can also use
the same mutex to protect signaling versus setup races.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

3be3a074

05 9月, 2013 1 次提交

vfio-pci: PCI hot reset interface · 8b27ee60

由 Alex Williamson 提交于 9月 04, 2013

The current VFIO_DEVICE_RESET interface only maps to PCI use cases
where we can isolate the reset to the individual PCI function.  This
means the device must support FLR (PCIe or AF), PM reset on D3hot->D0
transition, device specific reset, or be a singleton device on a bus
for a secondary bus reset.  FLR does not have widespread support,
PM reset is not very reliable, and bus topology is dictated by the
system and device design.  We need to provide a means for a user to
induce a bus reset in cases where the existing mechanisms are not
available or not reliable.

This device specific extension to VFIO provides the user with this
ability.  Two new ioctls are introduced:
 - VFIO_DEVICE_PCI_GET_HOT_RESET_INFO
 - VFIO_DEVICE_PCI_HOT_RESET

The first provides the user with information about the extent of
devices affected by a hot reset.  This is essentially a list of
devices and the IOMMU groups they belong to.  The user may then
initiate a hot reset by calling the second ioctl.  We must be
careful that the user has ownership of all the affected devices
found via the first ioctl, so the second ioctl takes a list of file
descriptors for the VFIO groups affected by the reset.  Each group
must have IOMMU protection established for the ioctl to succeed.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

8b27ee60

25 7月, 2013 1 次提交

vfio-pci: Avoid deadlock on remove · d24cdbfd

由 Alex Williamson 提交于 6月 10, 2013

If an attempt is made to unbind a device from vfio-pci while that
device is in use, the request is blocked until the device becomes
unused. Unfortunately, that unbind path still grabs the device_lock,
which certain things like __pci_reset_function() also want to take.
This means we need to try to acquire the locks ourselves and use the
pre-locked version, __pci_reset_function_locked().
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

d24cdbfd

29 6月, 2013 1 次提交
- A
  vfio: remap_pfn_range() sets all those flags... · a47df151
  由 Al Viro 提交于 5月 11, 2013
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  a47df151
25 4月, 2013 2 次提交

vfio-pci: Use cached MSI/MSI-X capabilities · a9047f24

由 Bjorn Helgaas 提交于 4月 18, 2013

We now cache the MSI/MSI-X capability offsets in the struct pci_dev,
so no need to find the capabilities again.
Signed-off-by: NBjorn Helgaas <bhelgaas@google.com>
Acked-by: NAlex Williamson <alex.williamson@redhat.com>

a9047f24

vfio-pci: Use PCI_MSIX_TABLE_BIR, not PCI_MSIX_FLAGS_BIRMASK · 508d1aa6

由 Bjorn Helgaas 提交于 4月 18, 2013

PCI_MSIX_FLAGS_BIRMASK is mis-named because the BIR mask is in the
Table Offset register, not the flags ("Message Control" per spec)
register.
Signed-off-by: NBjorn Helgaas <bhelgaas@google.com>
Acked-by: NAlex Williamson <alex.williamson@redhat.com>

508d1aa6

27 3月, 2013 1 次提交

vfio-pci: Fix possible integer overflow · 904c680c

由 Alex Williamson 提交于 3月 26, 2013

The VFIO_DEVICE_SET_IRQS ioctl takes a start and count parameter, both
of which are unsigned.  We attempt to bounds check these, but fail to
account for the case where start is a very large number, allowing
start + count to wrap back into the valid range.  Bounds check both
start and start + count.
Reported-by: NDan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

904c680c

11 3月, 2013 1 次提交

VFIO-AER: Vfio-pci driver changes for supporting AER · dad9f897

由 Vijay Mohan Pandarathil 提交于 3月 11, 2013

- New VFIO_SET_IRQ ioctl option to pass the eventfd that is signaled when
  an error occurs in the vfio_pci_device

- Register pci_error_handler for the vfio_pci driver

- When the device encounters an error, the error handler registered by
  the vfio_pci driver gets invoked by the AER infrastructure

- In the error handler, signal the eventfd registered for the device.

- This results in the qemu eventfd handler getting invoked and
  appropriate action taken for the guest.
Signed-off-by: NVijay Mohan Pandarathil <vijaymohan.pandarathil@hp.com>
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

dad9f897

19 2月, 2013 1 次提交

vfio-pci: Add support for VGA region access · 84237a82

由 Alex Williamson 提交于 2月 18, 2013

PCI defines display class VGA regions at I/O port address 0x3b0, 0x3c0
and MMIO address 0xa0000.  As these are non-overlapping, we can ignore
the I/O port vs MMIO difference and expose them both in a single
region.  We make use of the VGA arbiter around each access to
configure chipset access as necessary.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

84237a82

15 2月, 2013 2 次提交

vfio-pci: Cleanup BAR access · 906ee99d

由 Alex Williamson 提交于 2月 14, 2013

We can actually handle MMIO and I/O port from the same access function
since PCI already does abstraction of this.  The ROM BAR only requires
a minor difference, so it gets included too.  vfio_pci_config_readwrite
gets renamed for consistency.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

906ee99d

vfio-pci: Cleanup read/write functions · 5b279a11

由 Alex Williamson 提交于 2月 14, 2013

The read and write functions are nearly identical, combine them
and convert to a switch statement.  This also makes it easy to
narrow the scope of when we use the io/mem accessors in case new
regions are added.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

5b279a11

08 12月, 2012 3 次提交

vfio-pci: Enable device before attempting reset · 9a92c509

由 Alex Williamson 提交于 12月 07, 2012

Devices making use of PM reset are getting incorrectly identified as
not supporting reset because pci_pm_reset() fails unless the device is
in D0 power state. When first attached to vfio_pci devices are
typically in an unknown power state. We can fix this by explicitly
setting the power state or simply calling pci_enable_device() before
attempting a pci_reset_function(). We need to enable the device
anyway, so move this up in our vfio_pci_enable() function, which also
simplifies the error path a bit.

Note that pci_disable_device() does not explicitly set the power
state, so there's no need to re-order vfio_pci_disable().
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

9a92c509

VFIO: fix out of order labels for error recovery in vfio_pci_init() · 05bf3aac

由 Jiang Liu 提交于 12月 07, 2012

The two labels for error recovery in function vfio_pci_init() is out of
order, so fix it.
Signed-off-by: NJiang Liu <jiang.liu@huawei.com>
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

05bf3aac

vfio-pci: Re-order device reset · 2007722a

由 Alex Williamson 提交于 12月 07, 2012

Move the device reset to the end of our disable path, the device
should already be stopped from pci_disable_device().  This also allows
us to manipulate the save/restore to avoid the save/reset/restore +
save/restore that we had before.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>

2007722a

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功