Commits · 63256ec5347fb2344a42adbae732b90603c92f35 · openeuler / Kernel

12 Jan, 2011 1 commit

drm/i915: Enforce write ordering through the GTT · 63256ec5

Committed by Chris Wilson on Jan 04, 2011

We need to ensure that writes through the GTT land before any
modification to the MMIO registers and so must impose a mandatory write
barrier when flushing the GTT domain. This was revealed by relaxing the
write ordering by experimentally mapping the registers and the GATT as
write-combining.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

63256ec5
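
The ordering rule in this commit message can be sketched in userspace C as follows. This is only an illustration, not the driver's code: `struct gem_object`, `DOMAIN_GTT`, `write_barrier()` and `barrier_count` are hypothetical names, and the C11 release fence merely stands in for the kernel's write barrier (it does not by itself order write-combining device memory).

```c
#include <stdatomic.h>
#include <stdint.h>

/* Hypothetical domain flags, for illustration only. */
#define DOMAIN_CPU 0x01u
#define DOMAIN_GTT 0x40u

/* Hypothetical, minimal stand-in for a GEM buffer object. */
struct gem_object {
    uint32_t write_domain;
};

/* Counts barriers so the sketch can be checked from a test. */
static unsigned barrier_count;

/* Stand-in for a mandatory write barrier: earlier stores must not be
 * reordered after later ones. */
static void write_barrier(void)
{
    atomic_thread_fence(memory_order_release);
    barrier_count++;
}

/* Flush the GTT write domain: the barrier is issued unconditionally
 * whenever the object was last written through the GTT, so those writes
 * land before any later register update. */
static void flush_gtt_write_domain(struct gem_object *obj)
{
    if (obj->write_domain != DOMAIN_GTT)
        return;

    write_barrier();
    obj->write_domain = 0;
}
```

A caller would invoke `flush_gtt_write_domain()` before touching any register that depends on the buffer contents; objects last written through another domain are left unchanged.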

20 Dec, 2010 1 commit

drm/i915: Allow the application to choose the constant addressing mode · 72bfa19c

Committed by Chris Wilson on Dec 19, 2010

The relative-to-general state default is useless as it means having to
rewrite the streaming kernels for each batch. Relative-to-surface is
more useful, as that stream usually needs to be rewritten for each
batch. And absolute addressing mode, vital if you start streaming
state, is also only available by adjusting the register...
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

72bfa19c

10 Dec, 2010 2 commits

drm/i915: Mark the user reloc error paths as unlikely · b8f7ab17

Committed by Chris Wilson on Dec 08, 2010

Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

b8f7ab17

drm/i915: Eliminate drm_gem_object_lookup during relocation · 67731b87

Committed by Chris Wilson on Dec 08, 2010

As we provide a list of all objects that will be accessed from the
batchbuffer, we can build a lut of the handles associated with those
objects for this invocation and use that to avoid the overhead of
looking up those objects again for every relocation.

The cost of building and searching a small hash table is much less than
that of acquiring a spinlock, searching a radix tree and manipulating an
atomic refcnt per relocation.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

67731b87
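
The idea of a per-invocation handle lookup table can be sketched as a small chained hash table. This is a minimal illustration under stated assumptions, not the driver's data structure: `struct exec_object`, `struct handle_lut`, `LUT_BUCKETS` and the `lut_*` functions are all hypothetical names.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical bucket count; a power of two so masking selects a bucket. */
#define LUT_BUCKETS 16u

/* Hypothetical stand-in for an object named by the batchbuffer. */
struct exec_object {
    uint32_t handle;
    struct exec_object *next; /* chain within one bucket */
};

struct handle_lut {
    struct exec_object *bucket[LUT_BUCKETS];
};

static void lut_init(struct handle_lut *lut)
{
    for (size_t i = 0; i < LUT_BUCKETS; i++)
        lut->bucket[i] = NULL;
}

/* Built once per submission from the list of objects it will access. */
static void lut_add(struct handle_lut *lut, struct exec_object *obj)
{
    size_t i = obj->handle & (LUT_BUCKETS - 1);

    obj->next = lut->bucket[i];
    lut->bucket[i] = obj;
}

/* Used for every relocation: no lock, tree walk or refcount needed. */
static struct exec_object *lut_find(const struct handle_lut *lut,
                                    uint32_t handle)
{
    struct exec_object *obj = lut->bucket[handle & (LUT_BUCKETS - 1)];

    while (obj != NULL && obj->handle != handle)
        obj = obj->next;
    return obj;
}
```

Because the table lives only for one submission and is touched by one thread, lookups reduce to a mask and a short chain walk.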

06 Dec, 2010 1 commit

drm/i915: Ignore fenced commands for gpu access on gen4 · 9b3826bf

Committed by Chris Wilson on Dec 05, 2010

Userspace should not have been declaring that it needed fenced GPU
access with gen4+ as those GPUs have no fenced commands, but to be on
the safe side it is easier to ignore userspace in case they did.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

9b3826bf

05 Dec, 2010 1 commit

drm/i915: Implement GPU semaphores for inter-ring synchronisation on SNB · 1ec14ad3

Committed by Chris Wilson on Dec 04, 2010

The bulk of the change is to convert the growing list of rings into an
array so that the relationship between the rings and the semaphore sync
registers can be easily computed.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

1ec14ad3
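
Once the rings sit in an array, the slot a ring uses to synchronise with another ring can be derived from the two array positions. The sketch below shows one way such a computation might look, assuming three rings and two sync slots per ring; the enum values and `ring_sync_index()` are hypothetical names, not taken from this commit.

```c
/* Hypothetical ring identifiers, in array order. */
enum ring_id {
    RING_RENDER = 0,
    RING_VIDEO  = 1,
    RING_BLIT   = 2,
    NUM_RINGS   = 3
};

/* Each ring has NUM_RINGS - 1 sync slots, one per other ring.
 * Slot 0 belongs to the next ring in array order, slot 1 to the one
 * after that, wrapping around the end of the array:
 *   render -> 0 = video,  1 = blit
 *   video  -> 0 = blit,   1 = render
 *   blit   -> 0 = render, 1 = video
 */
static int ring_sync_index(enum ring_id ring, enum ring_id other)
{
    int idx = ((int)other - (int)ring) - 1;

    if (idx < 0)
        idx += NUM_RINGS;
    return idx;
}
```

With a linked list of rings the same relationship would need a table or a chain of special cases; with array indices it is plain arithmetic.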

02 Dec, 2010 2 commits

drm/i915: Pipelined fencing [infrastructure] · d9e86c0e

Committed by Chris Wilson on Nov 10, 2010

With this change, every batchbuffer can use all available fences (save
pinned and scanout, of course) without ever stalling the gpu!

In theory. Currently the actual pipelined update of the register is
disabled due to some stability issues. However, just the deferred update
is a significant win.

Based on a series of patches by Daniel Vetter.

The premise is that before every access to a buffer through the GTT we
have to declare whether we need a register or not. If the access is by
the GPU, a pipelined update to the register is made via the ringbuffer,
and we track the last seqno of the batches that access it. If by the
CPU we wait for the last GPU access and update the register (either
to clear or to set it for the current buffer).

One advantage of being able to pipeline changes is that we can defer the
actual updating of the fence register until we first need to access the
object through the GTT, i.e. we can eliminate the stall on set_tiling.
This is important as the userspace bo cache does not track the tiling
status of active buffers which generate frequent stalls on gen3 when
enabling tiling for an already bound buffer.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch>

d9e86c0e

drm/i915: Prevent stalling for a GTT read back from a read-only GPU target · 87ca9c8a

Committed by Chris Wilson on Dec 02, 2010

Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

87ca9c8a

30 Nov, 2010 1 commit

drm/i915/ringbuffer: Handle cliprects in the caller · c4e7a414

Committed by Chris Wilson on Nov 30, 2010

This makes the various rings more consistent by removing the anomalous
handling of the rendering ring execbuffer dispatch.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

c4e7a414

28 Nov, 2010 1 commit

drm/i915/execbuffer: On error, starting unwinding from the previous object · 602606a4

Committed by Chris Wilson on Nov 28, 2010

As the error occurred on the current object, it means that its state was
not changed and so it should be excluded from the unwind.
Reported-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

602606a4
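
The unwind rule described here (skip the object that failed, undo only the ones before it) can be sketched generically. This is an illustration under assumptions, not the execbuffer code: `struct object`, its `fail` flag, and the `pin_*`/`unpin_*` functions are hypothetical.

```c
#include <stddef.h>

/* Hypothetical object: pin_count tracks state changes, fail forces an error. */
struct object {
    int pin_count;
    int fail;
};

static int pin_object(struct object *obj)
{
    if (obj->fail)
        return -1;      /* error: state of this object is left untouched */
    obj->pin_count++;
    return 0;
}

static void unpin_object(struct object *obj)
{
    obj->pin_count--;
}

/* Pin every object; on error, unwind starting from the previous object,
 * because the current one never had its state changed. */
static int pin_all(struct object *objs, size_t count)
{
    for (size_t i = 0; i < count; i++) {
        int ret = pin_object(&objs[i]);

        if (ret) {
            while (i--)
                unpin_object(&objs[i]);
            return ret;
        }
    }
    return 0;
}
```

Starting the unwind at the failing index instead would decrement a count that was never incremented, which is the kind of imbalance the commit message describes avoiding.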

26 Nov, 2010 2 commits

drm/i915: Avoid allocation for execbuffer object list · 432e58ed

Committed by Chris Wilson on Nov 25, 2010

Besides the minimal improvement in reducing the execbuffer overhead, the
real benefit is clarifying a few routines.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

432e58ed

drm/i915: Split i915_gem_execbuffer into its own file. · 54cf91dc

Committed by Chris Wilson on Nov 25, 2010

A number of dragons have been seen lurking within the execbuffer code.
The first step is then to isolate them from the rest and begin to
scrutinise them in depth. Suggested by Daniel Vetter.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

54cf91dc

openeuler / Kernel synced successfully more than 1 year ago
