提交 · d8902adcc1a9fd484c8cb5e575152e32192c1ff8 · openeuler / Kernel

09 9月, 2009 40 次提交

dmaengine: sh: Add Support SuperH DMA Engine driver · d8902adc

由 Nobuhiro Iwamatsu 提交于 9月 07, 2009

This supported all DMA channels, and it was tested in SH7722,
SH7780, SH7785 and SH7763.
This can not use with SH DMA API.
Signed-off-by: NNobuhiro Iwamatsu <iwamatsu.nobuhiro@renesas.com>
Reviewed-by: NMatt Fleming <matt@console-pimps.org>
Acked-by: NMaciej Sosnowski <maciej.sosnowski@intel.com>
Acked-by: NPaul Mundt <lethal@linux-sh.org>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

d8902adc

D
Merge commit 'md/for-linus' into async-tx-next · 9134d02b
由 Dan Williams 提交于 9月 08, 2009
```
Conflicts:
	drivers/md/raid5.c
```
9134d02b

Merge branch 'dmaengine' into async-tx-next · bbb20089

由 Dan Williams 提交于 9月 08, 2009

Conflicts:
	crypto/async_tx/async_xor.c
	drivers/dma/ioat/dma_v2.h
	drivers/dma/ioat/pci.c
	drivers/md/raid5.c

bbb20089

D

Merge branch 'iop-raid6' into async-tx-next · 3e48e656
由 Dan Williams 提交于 9月 08, 2009

3e48e656

dmaengine: Move all map_sg/unmap_sg for slave channel to its client · 657a77fa

由 Atsushi Nemoto 提交于 9月 08, 2009

Dan Williams wrote:
... DMA-slave clients request specific channels and know the hardware
details at a low level, so it should not be too high an expectation to
push dma mapping responsibility to the client.

Also this patch includes DMA_COMPL_{SRC,DEST}_UNMAP_SINGLE support for
dw_dmac driver.
Acked-by: NMaciej Sosnowski <maciej.sosnowski@intel.com>
Acked-by: NNicolas Ferre <nicolas.ferre@atmel.com>
Signed-off-by: NAtsushi Nemoto <anemo@mba.ocn.ne.jp>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

657a77fa

fsldma: Add DMA_SLAVE support · bbea0b6e

由 Ira Snyder 提交于 9月 08, 2009

Use the DMA_SLAVE capability of the DMAEngine API to copy/from a
scatterlist into an arbitrary list of hardware address/length pairs.

This allows a single DMA transaction to copy data from several different
devices into a scatterlist at the same time.

This also adds support to enable some controller-specific features such as
external start and external pause for a DMA transaction.

[dan.j.williams@intel.com: rebased on tx_list movement]
Signed-off-by: NIra W. Snyder <iws@ovro.caltech.edu>
Acked-by: NLi Yang <leoli@freescale.com>
Acked-by: NKumar Gala <galak@kernel.crashing.org>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

bbea0b6e

fsldma: split apart external pause and request count features · e6c7ecb6

由 Ira Snyder 提交于 9月 08, 2009

When using the Freescale DMA controller in external control mode, both the
request count and external pause bits need to be setup correctly. This was
being done with the same function.

The 83xx controller lacks the external pause feature, but has a similar
feature called external start. This feature requires that the request count
bits be setup correctly.

Split the function into two parts, to make it possible to use the external
start feature on the 83xx controller.
Signed-off-by: NIra W. Snyder <iws@ovro.caltech.edu>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

e6c7ecb6

ioat2,3: cacheline align software descriptor allocations · 162b96e6

由 Dan Williams 提交于 9月 08, 2009

All the necessary fields for handling an ioat2,3 ring entry can fit into
one cacheline. Move ->len prior to ->txd in struct ioat_ring_ent, and
move allocation of these entries to a hw-cache-aligned kmem cache to
reduce the number of cachelines dirtied for descriptor management.
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

162b96e6

dmaengine: kill tx_list · 08031727

由 Dan Williams 提交于 9月 08, 2009

The tx_list attribute of struct dma_async_tx_descriptor is common to
most, but not all dma driver implementations.  None of the upper level
code (dmaengine/async_tx) uses it, so allow drivers to implement it
locally if they need it.  This saves sizeof(struct list_head) bytes for
drivers that do not manage descriptors with a linked list (e.g.: ioatdma
v2,3).
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

08031727

txx9dmac: implement a private tx_list · 1979b186

由 Dan Williams 提交于 9月 08, 2009

Drop txx9dmac's use of tx_list from struct dma_async_tx_descriptor in
preparation for removal of this field.

Cc: Atsushi Nemoto <anemo@mba.ocn.ne.jp>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

1979b186

at_hdmac: implement a private tx_list · 285a3c71

由 Dan Williams 提交于 9月 08, 2009

Drop at_hdmac's use of tx_list from struct dma_async_tx_descriptor in
preparation for removal of this field.

Cc: Nicolas Ferre <nicolas.ferre@atmel.com>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

285a3c71

mv_xor: implement a private tx_list · 64203b67

由 Dan Williams 提交于 9月 08, 2009

Drop mv_xor's use of tx_list from struct dma_async_tx_descriptor in
preparation for removal of this field.

Cc: Saeed Bishara <saeed@marvell.com>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

64203b67

ioat: implement a private tx_list · ea25968a

由 Dan Williams 提交于 9月 08, 2009

Drop ioatdma's use of tx_list from struct dma_async_tx_descriptor in
preparation for removal of this field.

Cc: Maciej Sosnowski <maciej.sosnowski@intel.com>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

ea25968a

iop-adma: implement a private tx_list · 308136d1

由 Dan Williams 提交于 9月 08, 2009

    
Drop iop-adma's use of tx_list from struct dma_async_tx_descriptor in
preparation for removal of this field.
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

308136d1

fsldma: implement a private tx_list · eda34234

由 Dan Williams 提交于 9月 08, 2009

Drop fsldma's use of tx_list from struct dma_async_tx_descriptor in
preparation for removal of this field.

Cc: Li Yang <leoli@freescale.com>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

eda34234

dw_dmac: implement a private tx_list · e0bd0f8c

由 Dan Williams 提交于 9月 08, 2009

Drop dw_dmac's use of tx_list from struct dma_async_tx_descriptor in
preparation for removal of this field.

Cc: Haavard Skinnemoen <haavard.skinnemoen@atmel.com>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

e0bd0f8c

D

Merge branch 'ioat' into dmaengine · e12c4fa3
由 Dan Williams 提交于 9月 08, 2009

e12c4fa3

I/OAT: Convert to PCI_VDEVICE() · a6417dd5

由 Roland Dreier 提交于 9月 08, 2009

Trivial cleanup to make the PCI ID table easier to read.

[dan.j.williams@intel.com: extended to v3.2 devices]
Signed-off-by: NRoland Dreier <rolandd@cisco.com>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

a6417dd5

Add MODULE_DEVICE_TABLE() so ioatdma module is autoloaded · 6506cbca

由 Roland Dreier 提交于 9月 08, 2009

The ioatdma module is missing aliases for the PCI devices it supports,
so it is not autoloaded on boot.  Add a MODULE_DEVICE_TABLE() to get
these aliases.
Signed-off-by: NRoland Dreier <rolandd@cisco.com>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

6506cbca

ioat3: segregate raid engines · e3232714

由 Dan Williams 提交于 9月 08, 2009

The cleanup routine for the raid cases imposes extra checks for handling
raid descriptors and extended descriptors.  If the channel does not
support raid it can avoid this extra overhead by using the ioat2 cleanup
path.
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

e3232714

ioat3: ioat3.2 pci ids for Jasper Forest · b265b11f

由 Tom Picard 提交于 9月 08, 2009

Jasper Forest introduces raid offload support via ioat3.2 support.  When
raid offload is enabled two (out of 8 channels) will report raid5/raid6
offload capabilities.  The remaining channels will only report ioat3.0
capabilities (memcpy).
Signed-off-by: NTom Picard <tom.s.picard@intel.com>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

b265b11f

ioat3: interrupt descriptor support · 58c8649e

由 Dan Williams 提交于 9月 08, 2009

The async_tx api uses the DMA_INTERRUPT operation type to terminate a
chain of issued operations with a callback routine.
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

58c8649e

ioat3: support xor via pq descriptors · ae786624

由 Dan Williams 提交于 9月 08, 2009

If a platform advertises pq capabilities, but not xor, then use
ioat3_prep_pqxor and ioat3_prep_pqxor_val to simulate xor support.
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

ae786624

ioat3: pq support · d69d235b

由 Dan Williams 提交于 9月 08, 2009

ioat3.2 adds support for raid6 syndrome generation (xor sum of galois
field multiplication products) using up to 8 sources. It can also
perform an pq-zero-sum operation to validate whether the syndrome for a
given set of sources matches a previously computed syndrome.
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

d69d235b

ioat3: xor self test · 9de6fc71

由 Dan Williams 提交于 9月 08, 2009

This adds a hardware specific self test to be called from ioat_probe.
In the ioat3 case we will have tests for all the different raid
operations, while ioat1 and ioat2 will continue to just test memcpy.
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

9de6fc71

ioat3: xor support · b094ad3b

由 Dan Williams 提交于 9月 08, 2009

ioat3.2 adds xor offload support for up to 8 sources. It can also
perform an xor-zero-sum operation to validate whether all given sources
sum to zero, without writing to a destination. Xor descriptors differ
from memcpy in that one operation may require multiple descriptors
depending on the number of sources. When the number of sources exceeds
5 an extended descriptor is needed. These descriptors need to be
accounted for when updating the DMA_COUNT register.
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

b094ad3b

ioat3: enable dca for completion writes · e61dacae

由 Dan Williams 提交于 9月 08, 2009

Tag completion writes for direct cache access to reduce the latency of
checking for descriptor completions.
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

e61dacae

ioat: add 'ioat' sysfs attributes · 5669e31c

由 Dan Williams 提交于 9月 08, 2009

Export driver attributes for diagnostic purposes:
'ring_size': total number of descriptors available to the engine
'ring_active': number of descriptors in-flight
'capabilities': supported operation types for this channel
'version': Intel(R) QuickData specfication revision

This also allows some chattiness to be removed from the driver startup
as this information is now available via sysfs.
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

5669e31c

ioat3: split ioat3 support to its own file, add memset · bf40a686

由 Dan Williams 提交于 9月 08, 2009

Up until this point the driver for Intel(R) QuickData Technology
engines, specification versions 2 and 3, were mostly identical save for
a few quirks. Version 3.2 hardware adds many new capabilities (like
raid offload support) requiring some infrastructure that is not relevant
for v2. For better code organization of the new funcionality move v3
and v3.2 support to its own file dma_v3.c, and export some routines from
the base files (dma.c and dma_v2.c) that can be reused directly.

The first new capability included in this code reorganization is support
for v3.2 memset operations.
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

bf40a686

ioat3: hardware version 3.2 register / descriptor definitions · 2aec048c

由 Dan Williams 提交于 9月 08, 2009

ioat3.2 adds raid5 and raid6 offload capabilities.
Signed-off-by: NTom Picard <tom.s.picard@intel.com>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

2aec048c

ioat2+: add fence support · 128f2d56

由 Dan Williams 提交于 9月 08, 2009

In preparation for adding more operation types to the ioat3 path the
driver needs to honor the DMA_PREP_FENCE flag. For example the async_tx api
will hand xor->memcpy->xor chains to the driver with the 'fence' flag set on
the first xor and the memcpy operation. This flag in turn sets the 'fence'
flag in the descriptor control field telling the hardware that future
descriptors in the chain depend on the result of the current descriptor, so
wait for all writes to complete before starting the next operation.

Note that ioat1 does not prefetch the descriptor chain, so does not
require/support fenced operations.
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

128f2d56

dmaengine, async_tx: support alignment checks · 83544ae9

由 Dan Williams 提交于 9月 08, 2009

Some engines have transfer size and address alignment restrictions. Add
a per-operation alignment property to struct dma_device that the async
routines and dmatest can use to check alignment capabilities.
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

83544ae9

dmaengine: cleanup unused transaction types · 9308add6

由 Dan Williams 提交于 9月 08, 2009

No drivers currently implement these operation types, so they can be
deleted.
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

9308add6

dmaengine, async_tx: add a "no channel switch" allocator · 138f4c35

由 Dan Williams 提交于 9月 08, 2009

Channel switching is problematic for some dmaengine drivers as the
architecture precludes separating the ->prep from ->submit.  In these
cases the driver can select ASYNC_TX_DISABLE_CHANNEL_SWITCH to modify
the async_tx allocator to only return channels that support all of the
required asynchronous operations.

For example MD_RAID456=y selects support for asynchronous xor, xor
validate, pq, pq validate, and memcpy.  When
ASYNC_TX_DISABLE_CHANNEL_SWITCH=y any channel with all these
capabilities is marked DMA_ASYNC_TX allowing async_tx_find_channel() to
quickly locate compatible channels with the guarantee that dependency
chains will remain on one channel.  When
ASYNC_TX_DISABLE_CHANNEL_SWITCH=n async_tx_find_channel() may select
channels that lead to operation chains that need to cross channel
boundaries using the async_tx channel switch capability.
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

138f4c35

dmaengine: add fence support · 0403e382

由 Dan Williams 提交于 9月 08, 2009

Some engines optimize operation by reading ahead in the descriptor chain
such that descriptor2 may start execution before descriptor1 completes.
If descriptor2 depends on the result from descriptor1 then a fence is
required (on descriptor2) to disable this optimization. The async_tx
api could implicitly identify dependencies via the 'depend_tx'
parameter, but that would constrain cases where the dependency chain
only specifies a completion order rather than a data dependency. So,
provide an ASYNC_TX_FENCE to explicitly identify data dependencies.
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

0403e382

D
Merge branch 'md-raid6-accel' into ioat3.2 · f9dd2134
由 Dan Williams 提交于 9月 08, 2009
```
Conflicts:
	include/linux/dmaengine.h
```
f9dd2134

net_dma: poll for a descriptor after allocation failure · 4b652f0d

由 Dan Williams 提交于 9月 08, 2009

Handle descriptor allocation failures by polling for a descriptor. The
driver will force forward progress when polled. In the best case this
polling interval will be the time it takes for one dma memcpy
transaction to complete. In the worst case, channel hang, we will need
to wait 100ms for the cleanup watchdog to fire (ioatdma driver).
Signed-off-by: NMaciej Sosnowski <maciej.sosnowski@intel.com>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

4b652f0d

ioat2,3: dynamically resize descriptor ring · a309218a

由 Dan Williams 提交于 9月 08, 2009

Increment the allocation order of the descriptor ring every time we run
out of descriptors up to a maximum of allocation order specified by the
module parameter 'ioat_max_alloc_order'.  After each idle period
decrement the allocation order to a minimum order of
'ioat_ring_alloc_order' (i.e. the default ring size, tunable as a module
parameter).
Signed-off-by: NMaciej Sosnowski <maciej.sosnowski@intel.com>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

a309218a

ioat: switch watchdog and reset handler from workqueue to timer · 09c8a5b8

由 Dan Williams 提交于 9月 08, 2009

In order to support dynamic resizing of the descriptor ring or polling
for a descriptor in the presence of a hung channel the reset handler
needs to make progress while in a non-preemptible context.  The current
workqueue implementation precludes polling channel reset completion
under spin_lock().

This conversion also allows us to return to opportunistic cleanup in the
ioat2 case as the timer implementation guarantees at least one cleanup
after every descriptor is submitted.  This means the worst case
completion latency becomes the timer frequency (for exceptional
circumstances), but with the benefit of avoiding busy waiting when the
lock is contended.
Signed-off-by: NMaciej Sosnowski <maciej.sosnowski@intel.com>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

09c8a5b8

ioat1: trim ioat_dma_desc_sw · ad643f54

由 Dan Williams 提交于 9月 08, 2009

Save 4 bytes per software descriptor by transmitting tx_cnt in an unused
portion of the hardware descriptor.
Signed-off-by: NMaciej Sosnowski <maciej.sosnowski@intel.com>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

ad643f54

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功