提交 · bfb38d35c1cacb182d8bbda23379397bffeafc8c · openeuler / raspberrypi-kernel

18 7月, 2012 1 次提交

drm/radeon: let sa manager block for fences to wait for v2 · bfb38d35

由 Christian König 提交于 7月 11, 2012

Otherwise we can encounter out of memory situations under extreme load.

v2: add documentation for the new function
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

bfb38d35

17 7月, 2012 7 次提交

drm/radeon: implement ring saving on reset v4 · 55d7c221

由 Christian König 提交于 7月 09, 2012

Try to save whatever is on the rings when
we encounter an lockup.

v2: Fix spelling error. Free saved ring data if reset fails.
    Add documentation for the new functions.
v3: Some more spelling fixes
v4: It doesn't make sense to save anything if all fences
    are signaled
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NMichel Dänzer <michel.daenzer@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

55d7c221

drm/radeon: record what is next valid wptr for each ring v4 · 45df6803

由 Christian König 提交于 7月 06, 2012

Before emitting any indirect buffer, emit the offset of the next
valid ring content if any. This allow code that want to resume
ring to resume ring right after ib that caused GPU lockup.

v2: use scratch registers instead of storing it into memory
v3: skip over the surface sync for ni and si as well
v4: use SET_CONFIG_REG instead of PACKET0
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

45df6803

drm/radeon: remove vm_manager start/suspend · c6105f24

由 Christian König 提交于 7月 05, 2012

Just restore the page table instead. Addressing three
problem with this change:

1. Calling vm_manager_suspend in the suspend path is
   problematic cause it wants to wait for the VM use
   to end, which in case of a lockup never happens.

2. In case of a locked up memory controller
   unbinding the VM seems to make it even more
   unstable, creating an unrecoverable lockup
   in the end.

3. If we want to backup/restore the leftover ring
   content we must not unbind VMs in between.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

c6105f24

drm/radeon: remove r600_blit_suspend · 6f72a631

由 Christian König 提交于 7月 05, 2012

Just reinitialize the shader content on resume instead.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

6f72a631

drm/radeon: remove ip_pool start/suspend · 2898c348

由 Christian König 提交于 7月 05, 2012

The IB pool is in gart memory, so it is completely
superfluous to unpin / repin it on suspend / resume.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

2898c348

drm/radeon: add an exclusive lock for GPU reset v2 · dee53e7f

由 Jerome Glisse 提交于 7月 02, 2012

GPU reset need to be exclusive, one happening at a time. For this
add a rw semaphore so that any path that trigger GPU activities
have to take the semaphore as a reader thus allowing concurency.

The GPU reset path take the semaphore as a writer ensuring that
no concurrent reset take place.

v2: init rw semaphore
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

dee53e7f

drm/radeon: add error handling to fence_wait_empty_locked · 7ecc45e3

由 Christian König 提交于 6月 29, 2012

Instead of returning the error handle it directly
and while at it fix the comments about the ring lock.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NMichel Dänzer <michel.daenzer@amd.com>
Reviewed-by: NJerome Glisse <jglisse@redhat.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

7ecc45e3

21 6月, 2012 9 次提交

drm/radeon: replace cs_mutex with vm_mutex v3 · 36ff39c4

由 Christian König 提交于 5月 09, 2012

Try to remove or replace the cs_mutex with a
vm_mutex where it is still needed.

v2: fix locking order
v3: rebased on drm-next
Signed-off-by: NChristian König <deathsimple@vodafone.de>

36ff39c4

drm/radeon: replace pflip and sw_int counters with atomics · 736fc37f

由 Christian Koenig 提交于 5月 17, 2012

So we can skip the locking. Also renames sw_int to
ring_int, cause that better matches its purpose.
Signed-off-by: NChristian Koenig <christian.koenig@amd.com>

736fc37f

drm/radeon: apply Murphy's law to the kms irq code v3 · fb98257a

由 Christian Koenig 提交于 5月 17, 2012

1. It is really dangerous to have more than one
   spinlock protecting the same information.

2. radeon_irq_set sometimes wasn't called with lock
   protection, so it can happen that more than one
   CPU would tamper with the irq regs at the same
   time.

3. The pm.gui_idle variable was assuming that the 3D
   engine wasn't becoming idle between testing the
   register and setting the variable. So just remove
   it and test the register directly.

v2: Also handle the hpd irq code the same way.
v3: Rename hpd parameter for clarification.
Signed-off-by: NChristian Koenig <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

fb98257a

drm/radeon: fix & improve ih ring handling v3 · c20dc369

由 Christian Koenig 提交于 5月 16, 2012

The spinlock was actually there to protect the
rptr, but rptr was read outside of the locked area.

Also we don't really need a spinlock here, an
atomic should to quite fine since we only need to
prevent it from being reentrant.

v2: Keep the spinlock....
v3: Back to an atomic again after finding & fixing the real bug.
Signed-off-by: NChristian Koenig <christian.koenig@amd.com>

c20dc369

drm/radeon: remove some unneeded structure members · 6823d740

由 Christian Koenig 提交于 5月 16, 2012

Signed-off-by: NChristian Koenig <christian.koenig@amd.com>
Reviewed-by: NJerome Glisse <jglisse@redhat.com>

6823d740

drm/radeon: replace vmram_mutex with mclk_lock v2 · db7fce39

由 Christian König 提交于 5月 11, 2012

It is a rw_semaphore now and only write locked
while changing the clock. Also the lock is renamed
to better reflect what it is protecting.

v2: Keep the ttm_vm_ops on IGPs
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NJerome Glisse <jglisse@redhat.com>

db7fce39

drm/radeon: rework ring syncing code · 220907d9

由 Christian König 提交于 5月 10, 2012

Move inter ring syncing with semaphores into the
existing ring allocations, with that we need to
lock the ring mutex only once.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NJerome Glisse <jglisse@redhat.com>

220907d9

drm/radeon: add infrastructure for advanced ring synchronization v2 · 68e250b7

由 Christian König 提交于 5月 10, 2012

v2: BUG_ON not matching rings.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NJerome Glisse <jglisse@redhat.com>

68e250b7

drm/radeon: remove radeon_fence_create · 876dc9f3

由 Christian König 提交于 5月 08, 2012

It is completely unnecessary to create fences
before they are emitted, so remove it and a bunch
of checks if fences are emitted or not.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NJerome Glisse <jglisse@redhat.com>

876dc9f3

05 6月, 2012 1 次提交

drm/radeon: fix gpu_init on si · 1a8ca750

由 Alex Deucher 提交于 6月 01, 2012

- Properly set up the RBs
- Properly set up the SPI
- Properly set up gb_addr_config

This should fix rendering issues on certain cards.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

1a8ca750

02 6月, 2012 1 次提交

drm/radeon: fixup tiling group size and backendmap on r6xx-r9xx (v4) · 416a2bd2

由 Alex Deucher 提交于 5月 31, 2012

Tiling group size is always 256bits on r6xx/r7xx/r8xx/9xx. Also fix and
simplify render backend map. This now properly sets up the backend map
on r6xx-9xx which should improve 3D performance.

Vadim benchmarked also:
Some benchmarks on juniper (5750), fullscreen 1920x1080,
first result - kernel 3.4.0+ (fb21affa), second - with these patches:

Lightsmark:   91 fps => 123 fps    +35%
Doom3:        74 fps => 101 fps    +36%
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

416a2bd2

31 5月, 2012 1 次提交

radeon: add radeon prime vmap support. · 63bc620b

由 Dave Airlie 提交于 5月 31, 2012

This is the same as the nouveau code pretty much.
Signed-off-by: NDave Airlie <airlied@redhat.com>

63bc620b

29 5月, 2012 1 次提交

radeon: make radeon_cs_update_pages static. · c4c7f314

由 Dave Airlie 提交于 5月 26, 2012

Just move its only caller into the same file as it and make it static.
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

c4c7f314

22 5月, 2012 1 次提交

drm/radeon/hdmi: compile audio status in 1 function · 3299de95

由 Rafał Miłecki 提交于 5月 14, 2012

This optmizes calls, registers reads and assignments.
Signed-off-by: NRafał Miłecki <zajec5@gmail.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

3299de95

13 5月, 2012 2 次提交

drm/radeon/hdmi: separate evergreen code · e55d3e6c

由 Rafał Miłecki 提交于 5月 06, 2012

Signed-off-by: NRafał Miłecki <zajec5@gmail.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

e55d3e6c

drm/radeon/kms/hdmi: helper getting ready ACR entry · 1b688d08

由 Rafał Miłecki 提交于 4月 30, 2012

Signed-off-by: NRafał Miłecki <zajec5@gmail.com>
Reviewed-by: NAlex Deucher <alexdeucher@gmail.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

1b688d08

10 5月, 2012 15 次提交

drm/radeon: make the ib an inline object · f2e39221

由 Jerome Glisse 提交于 5月 09, 2012

No need to malloc it any more.
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NDave Airlie <airlied@redhat.com>

f2e39221

drm/radeon: remove r600 blit mutex v2 · f237750f

由 Christian König 提交于 5月 09, 2012

If we don't store local data into global variables
it isn't necessary to lock anything.

v2: rebased on new SA interface
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NDave Airlie <airlied@redhat.com>

f237750f

drm/radeon: move the semaphore from the fence into the ib · 68470ae7

由 Jerome Glisse 提交于 5月 09, 2012

It never really belonged there in the first place.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NDave Airlie <airlied@redhat.com>

68470ae7

drm/radeon: rip out the ib pool · c507f7ef

由 Jerome Glisse 提交于 5月 09, 2012

It isn't necessary any more and the suballocator seems to perform
even better.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

c507f7ef

drm/radeon: simplify semaphore handling v2 · a8c05940

由 Jerome Glisse 提交于 5月 09, 2012

Directly use the suballocator to get small chunks of memory.
It's equally fast and doesn't crash when we encounter a GPU reset.

v2: rebased on new SA interface.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

a8c05940

drm/radeon: multiple ring allocator v3 · c3b7fe8b

由 Christian König 提交于 5月 09, 2012

A startover with a new idea for a multiple ring allocator.
Should perform as well as a normal ring allocator as long
as only one ring does somthing, but falls back to a more
complex algorithm if more complex things start to happen.

We store the last allocated bo in last, we always try to allocate
after the last allocated bo. Principle is that in a linear GPU ring
progression was is after last is the oldest bo we allocated and thus
the first one that should no longer be in use by the GPU.

If it's not the case we skip over the bo after last to the closest
done bo if such one exist. If none exist and we are not asked to
block we report failure to allocate.

If we are asked to block we wait on all the oldest fence of all
rings. We just wait for any of those fence to complete.

v2: We need to be able to let hole point to the list_head, otherwise
    try free will never free the first allocation of the list. Also
    stop calling radeon_fence_signalled more than necessary.

v3: Don't free allocations without considering them as a hole,
    otherwise we might lose holes. Also return ENOMEM instead of ENOENT
    when running out of fences to wait for. Limit the number of holes
    we try for each ring to 3.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

c3b7fe8b

drm/radeon: use one wait queue for all rings add fence_wait_any v2 · 0085c950

由 Jerome Glisse 提交于 5月 09, 2012

Use one wait queue for all rings. When one ring progress, other
likely does to and we are not expecting to have a lot of waiter
anyway.

Also add a fence_wait_any that will wait until the first fence
in the fence array (one fence per ring) is signaled. This allow
to wait on all rings.

v2: some minor cleanups and improvements.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

0085c950

drm/radeon: define new SA interface v3 · 557017a0

由 Christian König 提交于 5月 09, 2012

Define the interface without modifying the allocation
algorithm in any way.

v2: rebase on top of fence new uint64 patch
v3: add ring to debugfs output
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NDave Airlie <airlied@redhat.com>

557017a0

drm/radeon: make sa bo a stand alone object · 2e0d9910

由 Christian König 提交于 5月 09, 2012

Allocating and freeing it seperately.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NDave Airlie <airlied@redhat.com>

2e0d9910

drm/radeon: keep start and end offset in the SA · e6661a96

由 Christian König 提交于 5月 09, 2012

Instead of offset + size keep start and end offset directly.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NDave Airlie <airlied@redhat.com>

e6661a96

drm/radeon: add proper locking to the SA v3 · a651c55a

由 Christian König 提交于 5月 09, 2012

Make the suballocator self containing to locking.

v2: split the bugfix into a seperate patch.
v3: remove some unreleated changes.
Sig-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NDave Airlie <airlied@redhat.com>

a651c55a

drm/radeon: rework locking ring emission mutex in fence deadlock detection v2 · 8a47cc9e

由 Christian König 提交于 5月 09, 2012

Some callers illegal called fence_wait_next/empty
while holding the ring emission mutex. So don't
relock the mutex in that cases, and move the actual
locking into the fence code.

v2: Don't try to unlock the mutex if it isn't locked.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NDave Airlie <airlied@redhat.com>

8a47cc9e

drm/radeon: rework fence handling, drop fence list v7 · 3b7a2b24

由 Jerome Glisse 提交于 5月 09, 2012

Using 64bits fence sequence we can directly compare sequence
number to know if a fence is signaled or not. Thus the fence
list became useless, so does the fence lock that mainly
protected the fence list.

Things like ring.ready are no longer behind a lock, this should
be ok as ring.ready is initialized once and will only change
when facing lockup. Worst case is that we return an -EBUSY just
after a successfull GPU reset, or we go into wait state instead
of returning -EBUSY (thus delaying reporting -EBUSY to fence
wait caller).

v2: Remove left over comment, force using writeback on cayman and
    newer, thus not having to suffer from possibly scratch reg
    exhaustion
v3: Rebase on top of change to uint64 fence patch
v4: Change DCE5 test to force write back on cayman and newer but
    also any APU such as PALM or SUMO family
v5: Rebase on top of new uint64 fence patch
v6: Just break if seq doesn't change any more. Use radeon_fence
    prefix for all function names. Even if it's now highly optimized,
    try avoiding polling to often.
v7: We should never poll the last_seq from the hardware without
    waking the sleeping threads, otherwise we might lose events.
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NDave Airlie <airlied@redhat.com>

3b7a2b24

drm/radeon: convert fence to uint64_t v4 · bb635567

由 Jerome Glisse 提交于 5月 09, 2012

This convert fence to use uint64_t sequence number intention is
to use the fact that uin64_t is big enough that we don't need to
care about wrap around.

Tested with and without writeback using 0xFFFFF000 as initial
fence sequence and thus allowing to test the wrap around from
32bits to 64bits.

v2: Add comment about possible race btw CPU & GPU, add comment
    stressing that we need 2 dword aligned for R600_WB_EVENT_OFFSET
    Read fence sequenc in reverse order of GPU write them so we
    mitigate the race btw CPU and GPU.

v3: Drop the need for ring to emit the 64bits fence, and just have
    each ring emit the lower 32bits of the fence sequence. We
    handle the wrap over 32bits in fence_process.

v4: Just a small optimization: Don't reread the last_seq value
    if loop restarts, since we already know its value anyway.
    Also start at zero not one for seq value and use pre instead
    of post increment in emmit, otherwise wait_empty will deadlock.
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NDave Airlie <airlied@redhat.com>

bb635567

drm/radeon: replace the per ring mutex with a global one · d6999bc7

由 Christian König 提交于 5月 09, 2012

A single global mutex for ring submissions seems sufficient.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NDave Airlie <airlied@redhat.com>

d6999bc7

03 5月, 2012 1 次提交

drm/radeon: make forcing ring activity a common function · 7b9ef16b

由 Christian König 提交于 5月 02, 2012

Nothing chipset or ring specific with it,
so also move it to radon_ring.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

7b9ef16b