Commits · 60de2ba51eaba9eefcc355cb20c8582b1481e755 · openeuler / raspberrypi-kernel

02 Dec, 2010 4 commits

drm/i915: Kill the get_fence tracepoint · 60de2ba5

Authored by Chris Wilson on Nov 12, 2010

As the tracepoint is now decoupled from when the actual register is
assigned and was never complemented by detailing when the object lost
its fence, it has outlived its limited usefulness. Profiling the actual
stalls is a far more profitable venture anyway.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

drm/i915: Remove inactive LRU tracking from set_domain_ioctl · c6748e09

Authored by Chris Wilson on Nov 12, 2010

As the userspace mappings are torn down on every GPU write, we prefer to
track when the buffer is activated (via a fresh i915_gem_fault). This
makes the LRU conceptually simpler. With coherent mappings, the
remaining use-case for set_domain_ioctl is GPU synchronisation.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

drm/i915: Pipelined fencing [infrastructure] · d9e86c0e

Authored by Chris Wilson on Nov 10, 2010

With this change, every batchbuffer can use all available fences (save
pinned and scanout, of course) without ever stalling the gpu!

In theory. Currently the actual pipelined update of the register is
disabled due to some stability issues. However, just the deferred update
is a significant win.

Based on a series of patches by Daniel Vetter.

The premise is that before every access to a buffer through the GTT we
have to declare whether we need a register or not. If the access is by
the GPU, a pipelined update to the register is made via the ringbuffer,
and we track the last seqno of the batches that access it. If by the
CPU we wait for the last GPU access and update the register (either
to clear or to set it for the current buffer).

One advantage of being able to pipeline changes is that we can defer the
actual updating of the fence register until we first need to access the
object through the GTT, i.e. we can eliminate the stall on set_tiling.
This is important as the userspace bo cache does not track the tiling
status of active buffers which generate frequent stalls on gen3 when
enabling tiling for an already bound buffer.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch>
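
The deferred update described in this commit message can be illustrated with a small toy model. This is only a sketch under stated assumptions: every `toy_*` name is hypothetical and nothing here is taken from the i915 driver, the fence "register" is a plain struct field, waiting on the GPU is modelled by bumping a counter, and the pipelined ringbuffer write is modelled as an immediate store. It shows set_tiling merely marking the fence as stale, with the register write postponed until the first access through the GTT.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical toy model; none of these names exist in the i915 driver. */
struct toy_fence_reg {
    uint32_t value;      /* what the modelled register currently holds */
    unsigned int writes; /* how many times the register was written */
};

struct toy_object {
    uint32_t tiling;         /* requested tiling mode, 0 meaning linear */
    bool fence_dirty;        /* register no longer matches `tiling` */
    uint32_t last_gpu_seqno; /* last GPU batch that used the fence */
};

/* Record the request only: no stall and no register write here. */
void toy_set_tiling(struct toy_object *obj, uint32_t tiling)
{
    assert(obj != NULL);
    if (obj->tiling != tiling) {
        obj->tiling = tiling;
        obj->fence_dirty = true;
    }
}

/* Bring the register up to date if a tiling change is still pending. */
static void toy_flush_fence(struct toy_object *obj, struct toy_fence_reg *reg)
{
    if (obj->fence_dirty) {
        reg->value = obj->tiling;
        reg->writes++;
        obj->fence_dirty = false;
    }
}

/* GPU access: the update travels with the batch (modelled as an immediate
 * store) and we remember the seqno of the batch that used the fence. */
void toy_gpu_access(struct toy_object *obj, struct toy_fence_reg *reg,
                    uint32_t seqno)
{
    toy_flush_fence(obj, reg);
    obj->last_gpu_seqno = seqno;
}

/* CPU access: wait for the last GPU user (modelled by advancing the
 * completed seqno), then update the register. */
void toy_cpu_gtt_access(struct toy_object *obj, struct toy_fence_reg *reg,
                        uint32_t *completed_seqno)
{
    if (*completed_seqno < obj->last_gpu_seqno)
        *completed_seqno = obj->last_gpu_seqno;
    toy_flush_fence(obj, reg);
}
```

In this model, two back-to-back `toy_set_tiling()` calls cost a single register write, and only once the object is actually used.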

drm/i915: Prevent stalling for a GTT read back from a read-only GPU target · 87ca9c8a

Authored by Chris Wilson on Dec 02, 2010

Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

29 Nov, 2010 1 commit

drm/i915: Release fenced GTT mapping on suspend · 7d2cb39c

Authored by Chris Wilson on Nov 27, 2010

... so that upon first use after resume we will reacquire the fence reg.
Reported-by: Keith Packard <keithp@keithp.com>
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

28 Nov, 2010 1 commit

drm/i915: fix regression due to · de18a29e

Authored by Daniel Vetter on Nov 27, 2010

We don't track gpu flush requests in any special way. So even with
obj->write_domain == 0, a gpu flush might be outstanding but not
yet executed. Even worse, the latest request might use the object
only for reading. So an unconditional call to object_wait_rendering
is needed for !pipelined.

Hence revert that patch fully and untangle the flushing from the
synchronization again.
Reported-by: Keith Packard <keithp@keithp.com>
Tested-by: Keith Packard <keithp@keithp.com>
Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

26 Nov, 2010 2 commits

drm/i915: Avoid allocation for execbuffer object list · 432e58ed

Authored by Chris Wilson on Nov 25, 2010

Besides the minimal improvement in reducing the execbuffer overhead, the
real benefit is clarifying a few routines.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

drm/i915: Split i915_gem_execbuffer into its own file. · 54cf91dc

Authored by Chris Wilson on Nov 25, 2010

A number of dragons have been seen lurking within the execbuffer code.
The first step is then to isolate them from the rest and begin to
scrutinise them in depth. Suggested by Daniel Vetter.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

25 Nov, 2010 4 commits

drm/i915: Defer accounting until read from debugfs · 6299f992

Authored by Chris Wilson on Nov 24, 2010

Simply remove our accounting of objects inside the aperture, keeping
track only of what is in the aperture and its current usage. This
removes the over-complication of BUGs that were attempting to keep the
accounting correct and also removes the overhead of the accounting on
the hot-paths.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

drm/i915: Mark a few functions as __must_check · 2021746e

Authored by Chris Wilson on Nov 23, 2010

... to benefit from the compiler checking that we remember to handle
and propagate errors.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
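
As a rough illustration of the compiler check this commit relies on: in the kernel, `__must_check` expands to the GCC/Clang `warn_unused_result` attribute. The sketch below defines a local stand-in macro so that it builds in userspace; `toy_must_check`, `struct toy_bo` and both functions are hypothetical names chosen for the example, not the functions the commit annotates.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>

/* Userspace stand-in for the kernel's __must_check (GCC and Clang). */
#define toy_must_check __attribute__((warn_unused_result))

/* Hypothetical buffer object, for illustration only. */
struct toy_bo {
    int pin_count;
    int bound;
};

/* Callers that discard this return value get a compiler warning. */
toy_must_check int toy_bo_pin(struct toy_bo *bo)
{
    assert(bo != NULL);
    if (!bo->bound)
        return -ENOSPC; /* nothing to pin: report it to the caller */
    bo->pin_count++;
    return 0;
}

/* Handling and propagating the error keeps the compiler quiet. */
toy_must_check int toy_prepare(struct toy_bo *bo)
{
    int ret = toy_bo_pin(bo);

    if (ret)
        return ret;
    return 0;
}
```

Writing a bare `toy_bo_pin(bo);` statement inside `toy_prepare()` would draw an unused-result warning at build time, which is the point of the annotation.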

drm/i915: Only save and restore fences for UMS · 312817a3

Authored by Chris Wilson on Nov 22, 2010

With KMS, we can simply relinquish the fence when we idle the GPU and
reassign it upon first use.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

drm/i915: Add a mechanism for pipelining fence register updates · c6642782

Authored by Daniel Vetter on Nov 12, 2010

Not employed just yet...
Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

24 Nov, 2010 12 commits

drm/i915: More accurately track last fence usage by the GPU · caea7476

Authored by Chris Wilson on Nov 12, 2010

Based on a patch by Daniel Vetter.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

drm/i915: Rework execbuffer pinning · a7a09aeb

Authored by Chris Wilson on Nov 12, 2010

Avoid evicting buffers that will be used later in the batch in order to
make room for the initial buffers by pinning all bound buffers in a
single pass before binding (and evicting for) fresh buffers.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
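
The two-pass idea in this message can be sketched with a toy aperture. Everything below is an assumption made for illustration: the `toy_*` names are hypothetical, the aperture is a simple page counter with no fragmentation, and error unwinding (dropping the pins on failure) is omitted. The first pass pins every buffer of the batch that is already resident, so the eviction done in the second pass can only remove buffers the batch does not need.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical toy buffer; not the driver's object type. */
struct toy_buf {
    size_t pages;
    bool bound;
    int pin_count;
    int evictions; /* how often this buffer was thrown out */
};

struct toy_aperture {
    size_t free_pages;
    struct toy_buf **lru; /* all known buffers, oldest first */
    size_t lru_len;
};

/* Evict the oldest bound, unpinned buffer; false if none is left. */
static bool toy_evict_one(struct toy_aperture *ap)
{
    for (size_t i = 0; i < ap->lru_len; i++) {
        struct toy_buf *b = ap->lru[i];

        if (b->bound && b->pin_count == 0) {
            b->bound = false;
            b->evictions++;
            ap->free_pages += b->pages;
            return true;
        }
    }
    return false;
}

/* Make every buffer of the batch resident and pinned. */
int toy_reserve(struct toy_aperture *ap, struct toy_buf **batch, size_t n)
{
    assert(ap != NULL && batch != NULL);

    /* Pass 1: pin what is already bound, protecting it from eviction. */
    for (size_t i = 0; i < n; i++)
        if (batch[i]->bound)
            batch[i]->pin_count++;

    /* Pass 2: bind the fresh buffers, evicting only unpinned ones. */
    for (size_t i = 0; i < n; i++) {
        struct toy_buf *b = batch[i];

        if (b->bound)
            continue;
        while (ap->free_pages < b->pages)
            if (!toy_evict_one(ap))
                return -ENOSPC;
        ap->free_pages -= b->pages;
        b->bound = true;
        b->pin_count++;
    }
    return 0;
}
```

With a single interleaved pass, a fresh buffer listed early in the batch could evict a resident buffer listed later, which would then have to be bound again.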

drm/i915: Thread the pipelining ring through the callers. · 919926ae

Authored by Chris Wilson on Nov 12, 2010

Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

drm/i915: Remove a defunct BUG_ON · dddbc0e5

Authored by Chris Wilson on Nov 12, 2010

This used to check the precondition that all fences were to be located
in a mappable area, redundant now as those two parameters are combined
into one.

After pinning, we assert that the buffer is bound into the desired
region.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

drm/i915: Move the implementation details of PIPE_CONTROL to the ringbuffer · b6913e4b

Authored by Chris Wilson on Nov 12, 2010

The pipe control object is allocated by the device for the sole use of the
render ringbuffer. Move this detail from the general code to the render
ring buffer initialisation.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

drm/i915: Not all mappable regions require GTT fence regions · 92b88aeb

Authored by Chris Wilson on Nov 09, 2010

Combining map_and_fenceable revealed a bug in
i915_gem_object_gtt_size() in that it always computed the appropriate
fence size for the object regardless of tiling state, which caused us to
over-allocate linear buffers when binding to the GTT.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

drm/i915: Use drm_i915_gem_object as the preferred type · 05394f39

Authored by Chris Wilson on Nov 08, 2010

A glorified s/obj_priv/obj/ with a net reduction of over 100 lines and
many characters!
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

drm/i915: move gtt handling to i915_gem_gtt.c · 7c2e6fdf

Authored by Daniel Vetter on Nov 06, 2010

No more drm_*_agp in i915_gem.c!
Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

drm/i915: track objects in the gtt · 93a37f20

Authored by Daniel Vetter on Nov 05, 2010

This is required to restore gtt mappings on resume when agp is gone.

The right way to do this would be to make struct drm_mm_node embeddable
and use the allocation list maintained by the drm memory manager. But
that's a bigger project. Getting rid of the per bo agp_mem will save
more memory than this wastes, anyway.
Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

drm/i915/gtt: call chipset flush directly · 40ce6575

Authored by Daniel Vetter on Nov 05, 2010

Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

drm/i915|intel-gtt: consolidate intel-gtt.h headers · 23ed992a

Authored by Daniel Vetter on Nov 05, 2010

... and a few other defines.
Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

drm/i915: Handle pagefaults in execbuffer user relocations · bcf50e27

Authored by Chris Wilson on Nov 21, 2010

Currently if we hit a pagefault when applying a user relocation for the
execbuffer, we bail and return EFAULT to the application. Instead, we
need to unwind, drop the dev->struct_mutex, copy all the relocation
entries to a vmalloc array (to avoid any potential circular deadlocks
when resolving the pagefault), retake the mutex and then apply the
relocations. Afterwards, we need to again drop the lock and copy the
vmalloc array back to userspace.

v2: Incorporate feedback from Daniel Vetter.
Reported-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch>

21 Nov, 2010 1 commit

drm/i915: Prevent integer overflow when validating the execbuffer · d1d78830

Authored by Chris Wilson on Nov 21, 2010

Commit 2549d6c2 removed the vmalloc used for temporary storage of the
relocation lists used during execbuffer. However, our use of vmalloc was
being protected by an integer overflow check which we do want to
preserve!
Reported-by: Dan Carpenter <error27@gmail.com>
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
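
The kind of check being preserved here can be sketched as follows. The exact condition used by the commit is not shown on this page, so this is an assumption: the sketch rejects any user-supplied entry count whose byte size would exceed `INT_MAX`, and it tests the count by division so that the multiplication itself can never wrap. `struct toy_reloc` and `toy_reloc_bytes()` are hypothetical names.

```c
#include <assert.h>
#include <errno.h>
#include <limits.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical relocation entry; the layout is for illustration only. */
struct toy_reloc {
    uint64_t offset;
    uint64_t delta;
    uint32_t target;
    uint32_t flags;
};

/*
 * Validate a user-supplied entry count before it is used to size a copy.
 * Comparing against INT_MAX / sizeof(entry) avoids computing
 * count * sizeof(entry) until the product is known to fit.
 */
int toy_reloc_bytes(uint32_t count, size_t *bytes)
{
    assert(bytes != NULL);
    if (count > INT_MAX / sizeof(struct toy_reloc))
        return -EINVAL;
    *bytes = (size_t)count * sizeof(struct toy_reloc);
    return 0;
}
```

The check belongs next to the point where the count is first read from userspace, independent of how the temporary storage is later allocated.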

19 Nov, 2010 1 commit

drm/i915: Do not hold mutex when faulting in user addresses · 51311d0a

Authored by Chris Wilson on Nov 17, 2010

Linus Torvalds found that it was rather trivial to trigger a system
freeze:

  In fact, with lockdep, I don't even need to do the sysrq-d thing: it
  shows the bug as it happens. It's the X server taking the same lock
  recursively.

  Here's the problem:

    =============================================
    [ INFO: possible recursive locking detected ]
    2.6.37-rc2-00012-gbdbd01ac #7
    ---------------------------------------------
    Xorg/2816 is trying to acquire lock:
     (&dev->struct_mutex){+.+.+.}, at: [<ffffffff812c626c>] i915_gem_fault+0x50/0x17e

    but task is already holding lock:
     (&dev->struct_mutex){+.+.+.}, at: [<ffffffff812c403b>] i915_mutex_lock_interruptible+0x28/0x4a

    other info that might help us debug this:
    2 locks held by Xorg/2816:
     #0:  (&dev->struct_mutex){+.+.+.}, at: [<ffffffff812c403b>] i915_mutex_lock_interruptible+0x28/0x4a
     #1:  (&mm->mmap_sem){++++++}, at: [<ffffffff81022d4f>] page_fault+0x156/0x37b

This recursion was introduced by rearranging the locking to avoid the
double locking on the fast path (4f27b5d and fbd5a26d) and the
introduction of the prefault to encourage the fast paths (b5e4f2b). In
order to undo the problem, we rearrange the code to perform the access
validation upfront, attempt to prefault and then fight for control of the
mutex. In the best-case scenario, where the mutex is uncontended, the
prefaulting is not wasted.
Reported-and-tested-by: Linus Torvalds <torvalds@linux-foundation.org>
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

15 Nov, 2010 1 commit

drm/i915: fix relaxed tiling for gen <= 3 && !g33 · 5e783301

Authored by Daniel Vetter on Nov 14, 2010

g33/pineview doesn't have any alignment constraints for unfenced tiled
buffers. But older chips do. Fix this.

Problem introduced in a00b10c3.
Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>

13 Nov, 2010 1 commit

drm/i915: Retire any pending operations on the old scanout when switching · 85345517

Authored by Chris Wilson on Nov 13, 2010

An old and oft-reported bug is that of the GPU hanging on a
MI_WAIT_FOR_EVENT following a mode switch. The cause is that the GPU is
waiting on a scanline counter on an inactive pipe, and so waits for a
very long time until eventually the user reboots his machine.

We can prevent this either by moving the WAIT into the kernel and
thereby incurring considerable cost on every swapbuffers, or by waiting
for the GPU to retire the last batch that accesses the framebuffer
before installing a new one. As mode switches are much rarer than swap
buffers, this looks like an easy choice.

Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=28964
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=29252
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
Cc: stable@kernel.org

11 Nov, 2010 1 commit

drm/i915: Only add the lazy request if we end up waiting for it. · 5d97eb69

Authored by Chris Wilson on Nov 10, 2010

Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

09 Nov, 2010 1 commit

drivers/gpu/drm: Update WARN uses · fce7d61b

Authored by Joe Perches on Oct 30, 2010

Coalesce long formats.
Align arguments.
Add missing newlines.
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Dave Airlie <airlied@redhat.com>

08 Nov, 2010 1 commit

drm/i915: Avoid might_fault during pwrite whilst holding our mutex · b47b30cc

Authored by Chris Wilson on Nov 08, 2010

... and so prevent a potential circular reference:

  [ INFO: possible circular locking dependency detected ]
  2.6.37-rc1-uwe1+ #4
  -------------------------------------------------------
  Xorg/1401 is trying to acquire lock:
   (&mm->mmap_sem){++++++}, at: [<c01e4ddb>] might_fault+0x4b/0xa0

  but task is already holding lock:
   (&dev->struct_mutex){+.+.+.}, at: [<f869c3ac>]
  i915_mutex_lock_interruptible+0x3c/0x60 [i915]

  which lock already depends on the new lock.

When the locking around the pwrite ioctl was simplified, I did not spot
that the phys path never took any locks and so we introduced this
potential circular reference.
Reported-by: Uwe Helm <uwe.helm@googlemail.com>
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

07 Nov, 2010 1 commit

drm/i915: Handle GPU hangs during fault gracefully. · 045e769a

Authored by Chris Wilson on Nov 07, 2010

Instead of killing the process, just return no page found and reschedule
the process giving the GPU some time to (hopefully) recover.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

05 Nov, 2010 1 commit

drm/i915: kill mappable/fenceable disdinction · 75e9e915

Authored by Daniel Vetter on Nov 04, 2010

a00b10c3 "Only enforce fence limits inside the GTT" also
added a fenceable/mappable distinction when binding/pinning buffers.
This only complicates the code with no practical gain:

- In execbuffer this matters only for g33/pineview, as this is the only
  chip that needs fences and has an unmappable gtt area. But fences
  are only possible in the mappable part of the gtt, so need_fence
  implies need_mappable. And need_mappable is only set independently
  with relocations which implies (for sane userspace) that the buffer
  is untiled.

- The overlay code is only really used on i8xx, which doesn't have
  unmappable gtt. And it doesn't support tiled buffers, currently.

- For all other buffers it's a bug to pass in a tiled bo.

In short, this distinction doesn't have any practical gain.

I've also reverted mapping the overlay and context pages as possibly
unmappable. It's not worth being overly clever here, all the big
gains from unmappable are for execbuf bos.

Also add a comment for a clever optimization that confused me
while reading the original patch by Chris Wilson.
Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

03 Nov, 2010 1 commit

drm/i915: Ensure that if we ever try to pin+fence it is mappable. · 085ce264

Authored by Chris Wilson on Nov 03, 2010

When merging Daniel's full-gtt patches I had a set of tweaks which I
thought I had undone. I was half right...

Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=31286
Reported-by: jinjin.wang@intel.com
Reported-by: Alexey Fisher <bug-track@fisher-privat.net>
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

01 Nov, 2010 3 commits

drm/i915: Apply big hammer to serialise buffer access between rings · c6afd658

Authored by Chris Wilson on Nov 01, 2010

Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
Cc: stable@kernel.org

drm/i915: Move the invalidate|flush information out of the device struct · 0f8c6d7c

Authored by Chris Wilson on Nov 01, 2010

... and into a local structure scoped for the single function in which
it is used.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

drm/i915: Apply big hammer to serialise buffer access between rings · 13b29289

Authored by Chris Wilson on Nov 01, 2010

Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

31 Oct, 2010 2 commits

drm/i915: Evict just the purgeable GTT entries on the first pass · 5eac3ab4

Authored by Chris Wilson on Oct 31, 2010

Take two passes to evict everything whilst searching for sufficient free
space to bind the batchbuffer. After searching for sufficient free space
using LRU eviction, evict everything that is purgeable and try again.
Only then if there is insufficient free space (or the GTT is too badly
fragmented) evict everything from the aperture and try one last time.
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

drm/i915: Fix typo from in i915_gem_attach_phys_object() · ff75b9bc

Authored by Chris Wilson on Oct 30, 2010

Accessing the uninitialised obj->pages instead of the local page led to
an oops.
Reported-by: Xavier Chantry <chantry.xavier@gmail.com>
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>

29 Oct, 2010 1 commit

drm/i915: Remove the duplicate domain-change tracepoint for GPU flush · 872d860c

Authored by Chris Wilson on Oct 28, 2010

Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>