提交 · 4ff63e47f7b9dbd72031c364db44526b3c295591 · gsplhtlxg / clone-Linux

20 8月, 2012 1 次提交

drm/radeon: split ATRM support out from the ATPX handler (v3) · c61e2775

由 Alex Deucher 提交于 8月 16, 2012

There are systems that use ATRM, but not ATPX.

Fixes:
https://bugs.freedesktop.org/show_bug.cgi?id=41265

V2: fix #ifdefs as per Greg's comments
V3: fix it harder
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Cc: stable@vger.kernel.org

c61e2775

13 8月, 2012 3 次提交

drm/radeon/kms: implement timestamp userspace query (v2) · 6759a0a7

由 Marek Olšák 提交于 8月 09, 2012

Returns a snapshot of the GPU clock counter.  Needed
for certain OpenGL extensions.

v2: agd5f
- address Jerome's comments
- add function documentation
Signed-off-by: NMarek Olšák <maraeo@gmail.com>
Reviewed-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

6759a0a7

drm/radeon: fence virtual address and free it once idle v4 · e43b5ec0

由 Jerome Glisse 提交于 8月 06, 2012

Virtual address need to be fenced to know when we can safely remove it.
This patch also properly clear the pagetable. Previously it was
serouisly broken.

Kernel 3.5/3.4 need a similar patch but adapted for difference in mutex locking.

v2: For to update pagetable when unbinding bo (don't bailout if
    bo_va->valid is true).
v3: Add kernel 3.5/3.4 comment.
v4: Fix compilation warnings.
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

e43b5ec0

drm/radeon: fix some missing parens in asic macros · 69b62ad8

由 Alex Deucher 提交于 8月 03, 2012

Better safe than sorry.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Reviewed-by: NJerome Glisse <jglisse@redhat.com>

69b62ad8

18 7月, 2012 4 次提交

drm/radeon: update rptr saving logic for memory buffers · 89d35807

由 Alex Deucher 提交于 7月 17, 2012

Add support for using memory buffers rather than
scratch registers.  Some rings may not be able to
write to scratch registers.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>

89d35807

drm/radeon: remove radeon_ring_index() · 8b25ed34

由 Alex Deucher 提交于 7月 17, 2012

Just store the index in the ring structure.
Idea taken from one of Jerome's wip rptr patches.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>

8b25ed34

drm/radeon: fix const IB handling v2 · 4ef72566

由 Christian König 提交于 7月 13, 2012

Const IBs are executed on the CE not the CP, so we can't
fence them in the normal way.

So submit them directly before the IB instead, just as
the documentation says.

v2: keep the extra documentation
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

4ef72566

drm/radeon: let sa manager block for fences to wait for v2 · bfb38d35

由 Christian König 提交于 7月 11, 2012

Otherwise we can encounter out of memory situations under extreme load.

v2: add documentation for the new function
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

bfb38d35

17 7月, 2012 7 次提交

drm/radeon: implement ring saving on reset v4 · 55d7c221

由 Christian König 提交于 7月 09, 2012

Try to save whatever is on the rings when
we encounter an lockup.

v2: Fix spelling error. Free saved ring data if reset fails.
    Add documentation for the new functions.
v3: Some more spelling fixes
v4: It doesn't make sense to save anything if all fences
    are signaled
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NMichel Dänzer <michel.daenzer@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

55d7c221

drm/radeon: record what is next valid wptr for each ring v4 · 45df6803

由 Christian König 提交于 7月 06, 2012

Before emitting any indirect buffer, emit the offset of the next
valid ring content if any. This allow code that want to resume
ring to resume ring right after ib that caused GPU lockup.

v2: use scratch registers instead of storing it into memory
v3: skip over the surface sync for ni and si as well
v4: use SET_CONFIG_REG instead of PACKET0
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

45df6803

drm/radeon: remove vm_manager start/suspend · c6105f24

由 Christian König 提交于 7月 05, 2012

Just restore the page table instead. Addressing three
problem with this change:

1. Calling vm_manager_suspend in the suspend path is
   problematic cause it wants to wait for the VM use
   to end, which in case of a lockup never happens.

2. In case of a locked up memory controller
   unbinding the VM seems to make it even more
   unstable, creating an unrecoverable lockup
   in the end.

3. If we want to backup/restore the leftover ring
   content we must not unbind VMs in between.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

c6105f24

drm/radeon: remove r600_blit_suspend · 6f72a631

由 Christian König 提交于 7月 05, 2012

Just reinitialize the shader content on resume instead.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

6f72a631

drm/radeon: remove ip_pool start/suspend · 2898c348

由 Christian König 提交于 7月 05, 2012

The IB pool is in gart memory, so it is completely
superfluous to unpin / repin it on suspend / resume.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

2898c348

drm/radeon: add an exclusive lock for GPU reset v2 · dee53e7f

由 Jerome Glisse 提交于 7月 02, 2012

GPU reset need to be exclusive, one happening at a time. For this
add a rw semaphore so that any path that trigger GPU activities
have to take the semaphore as a reader thus allowing concurency.

The GPU reset path take the semaphore as a writer ensuring that
no concurrent reset take place.

v2: init rw semaphore
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

dee53e7f

drm/radeon: add error handling to fence_wait_empty_locked · 7ecc45e3

由 Christian König 提交于 6月 29, 2012

Instead of returning the error handle it directly
and while at it fix the comments about the ring lock.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NMichel Dänzer <michel.daenzer@amd.com>
Reviewed-by: NJerome Glisse <jglisse@redhat.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

7ecc45e3

21 6月, 2012 9 次提交

drm/radeon: replace cs_mutex with vm_mutex v3 · 36ff39c4

由 Christian König 提交于 5月 09, 2012

Try to remove or replace the cs_mutex with a
vm_mutex where it is still needed.

v2: fix locking order
v3: rebased on drm-next
Signed-off-by: NChristian König <deathsimple@vodafone.de>

36ff39c4

drm/radeon: replace pflip and sw_int counters with atomics · 736fc37f

由 Christian Koenig 提交于 5月 17, 2012

So we can skip the locking. Also renames sw_int to
ring_int, cause that better matches its purpose.
Signed-off-by: NChristian Koenig <christian.koenig@amd.com>

736fc37f

drm/radeon: apply Murphy's law to the kms irq code v3 · fb98257a

由 Christian Koenig 提交于 5月 17, 2012

1. It is really dangerous to have more than one
   spinlock protecting the same information.

2. radeon_irq_set sometimes wasn't called with lock
   protection, so it can happen that more than one
   CPU would tamper with the irq regs at the same
   time.

3. The pm.gui_idle variable was assuming that the 3D
   engine wasn't becoming idle between testing the
   register and setting the variable. So just remove
   it and test the register directly.

v2: Also handle the hpd irq code the same way.
v3: Rename hpd parameter for clarification.
Signed-off-by: NChristian Koenig <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

fb98257a

drm/radeon: fix & improve ih ring handling v3 · c20dc369

由 Christian Koenig 提交于 5月 16, 2012

The spinlock was actually there to protect the
rptr, but rptr was read outside of the locked area.

Also we don't really need a spinlock here, an
atomic should to quite fine since we only need to
prevent it from being reentrant.

v2: Keep the spinlock....
v3: Back to an atomic again after finding & fixing the real bug.
Signed-off-by: NChristian Koenig <christian.koenig@amd.com>

c20dc369

drm/radeon: remove some unneeded structure members · 6823d740

由 Christian Koenig 提交于 5月 16, 2012

Signed-off-by: NChristian Koenig <christian.koenig@amd.com>
Reviewed-by: NJerome Glisse <jglisse@redhat.com>

6823d740

drm/radeon: replace vmram_mutex with mclk_lock v2 · db7fce39

由 Christian König 提交于 5月 11, 2012

It is a rw_semaphore now and only write locked
while changing the clock. Also the lock is renamed
to better reflect what it is protecting.

v2: Keep the ttm_vm_ops on IGPs
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NJerome Glisse <jglisse@redhat.com>

db7fce39

drm/radeon: rework ring syncing code · 220907d9

由 Christian König 提交于 5月 10, 2012

Move inter ring syncing with semaphores into the
existing ring allocations, with that we need to
lock the ring mutex only once.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NJerome Glisse <jglisse@redhat.com>

220907d9

drm/radeon: add infrastructure for advanced ring synchronization v2 · 68e250b7

由 Christian König 提交于 5月 10, 2012

v2: BUG_ON not matching rings.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NJerome Glisse <jglisse@redhat.com>

68e250b7

drm/radeon: remove radeon_fence_create · 876dc9f3

由 Christian König 提交于 5月 08, 2012

It is completely unnecessary to create fences
before they are emitted, so remove it and a bunch
of checks if fences are emitted or not.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Reviewed-by: NJerome Glisse <jglisse@redhat.com>

876dc9f3

05 6月, 2012 1 次提交

drm/radeon: fix gpu_init on si · 1a8ca750

由 Alex Deucher 提交于 6月 01, 2012

- Properly set up the RBs
- Properly set up the SPI
- Properly set up gb_addr_config

This should fix rendering issues on certain cards.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

1a8ca750

02 6月, 2012 1 次提交

drm/radeon: fixup tiling group size and backendmap on r6xx-r9xx (v4) · 416a2bd2

由 Alex Deucher 提交于 5月 31, 2012

Tiling group size is always 256bits on r6xx/r7xx/r8xx/9xx. Also fix and
simplify render backend map. This now properly sets up the backend map
on r6xx-9xx which should improve 3D performance.

Vadim benchmarked also:
Some benchmarks on juniper (5750), fullscreen 1920x1080,
first result - kernel 3.4.0+ (fb21affa), second - with these patches:

Lightsmark:   91 fps => 123 fps    +35%
Doom3:        74 fps => 101 fps    +36%
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

416a2bd2

31 5月, 2012 1 次提交

radeon: add radeon prime vmap support. · 63bc620b

由 Dave Airlie 提交于 5月 31, 2012

This is the same as the nouveau code pretty much.
Signed-off-by: NDave Airlie <airlied@redhat.com>

63bc620b

29 5月, 2012 1 次提交

radeon: make radeon_cs_update_pages static. · c4c7f314

由 Dave Airlie 提交于 5月 26, 2012

Just move its only caller into the same file as it and make it static.
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

c4c7f314

22 5月, 2012 1 次提交

drm/radeon/hdmi: compile audio status in 1 function · 3299de95

由 Rafał Miłecki 提交于 5月 14, 2012

This optmizes calls, registers reads and assignments.
Signed-off-by: NRafał Miłecki <zajec5@gmail.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

3299de95

13 5月, 2012 2 次提交

drm/radeon/hdmi: separate evergreen code · e55d3e6c

由 Rafał Miłecki 提交于 5月 06, 2012

Signed-off-by: NRafał Miłecki <zajec5@gmail.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

e55d3e6c

drm/radeon/kms/hdmi: helper getting ready ACR entry · 1b688d08

由 Rafał Miłecki 提交于 4月 30, 2012

Signed-off-by: NRafał Miłecki <zajec5@gmail.com>
Reviewed-by: NAlex Deucher <alexdeucher@gmail.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

1b688d08

10 5月, 2012 9 次提交

drm/radeon: make the ib an inline object · f2e39221

由 Jerome Glisse 提交于 5月 09, 2012

No need to malloc it any more.
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NDave Airlie <airlied@redhat.com>

f2e39221

drm/radeon: remove r600 blit mutex v2 · f237750f

由 Christian König 提交于 5月 09, 2012

If we don't store local data into global variables
it isn't necessary to lock anything.

v2: rebased on new SA interface
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NDave Airlie <airlied@redhat.com>

f237750f

drm/radeon: move the semaphore from the fence into the ib · 68470ae7

由 Jerome Glisse 提交于 5月 09, 2012

It never really belonged there in the first place.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NDave Airlie <airlied@redhat.com>

68470ae7

drm/radeon: rip out the ib pool · c507f7ef

由 Jerome Glisse 提交于 5月 09, 2012

It isn't necessary any more and the suballocator seems to perform
even better.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

c507f7ef

drm/radeon: simplify semaphore handling v2 · a8c05940

由 Jerome Glisse 提交于 5月 09, 2012

Directly use the suballocator to get small chunks of memory.
It's equally fast and doesn't crash when we encounter a GPU reset.

v2: rebased on new SA interface.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

a8c05940

drm/radeon: multiple ring allocator v3 · c3b7fe8b

由 Christian König 提交于 5月 09, 2012

A startover with a new idea for a multiple ring allocator.
Should perform as well as a normal ring allocator as long
as only one ring does somthing, but falls back to a more
complex algorithm if more complex things start to happen.

We store the last allocated bo in last, we always try to allocate
after the last allocated bo. Principle is that in a linear GPU ring
progression was is after last is the oldest bo we allocated and thus
the first one that should no longer be in use by the GPU.

If it's not the case we skip over the bo after last to the closest
done bo if such one exist. If none exist and we are not asked to
block we report failure to allocate.

If we are asked to block we wait on all the oldest fence of all
rings. We just wait for any of those fence to complete.

v2: We need to be able to let hole point to the list_head, otherwise
    try free will never free the first allocation of the list. Also
    stop calling radeon_fence_signalled more than necessary.

v3: Don't free allocations without considering them as a hole,
    otherwise we might lose holes. Also return ENOMEM instead of ENOENT
    when running out of fences to wait for. Limit the number of holes
    we try for each ring to 3.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

c3b7fe8b

drm/radeon: use one wait queue for all rings add fence_wait_any v2 · 0085c950

由 Jerome Glisse 提交于 5月 09, 2012

Use one wait queue for all rings. When one ring progress, other
likely does to and we are not expecting to have a lot of waiter
anyway.

Also add a fence_wait_any that will wait until the first fence
in the fence array (one fence per ring) is signaled. This allow
to wait on all rings.

v2: some minor cleanups and improvements.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NDave Airlie <airlied@redhat.com>

0085c950

drm/radeon: define new SA interface v3 · 557017a0

由 Christian König 提交于 5月 09, 2012

Define the interface without modifying the allocation
algorithm in any way.

v2: rebase on top of fence new uint64 patch
v3: add ring to debugfs output
Signed-off-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NDave Airlie <airlied@redhat.com>

557017a0

drm/radeon: make sa bo a stand alone object · 2e0d9910

由 Christian König 提交于 5月 09, 2012

Allocating and freeing it seperately.
Signed-off-by: NChristian König <deathsimple@vodafone.de>
Signed-off-by: NDave Airlie <airlied@redhat.com>

2e0d9910