提交 · 2fc5703abda201f138faf63bdca743d04dbf4b1a · openeuler / Kernel

20 5月, 2014 1 次提交

drm/radeon: check VCE relocation buffer range v3 · 2fc5703a

由 Leo Liu 提交于 5月 05, 2014

v2 (chk): fix image size storage
v3 (chk): fix UV size calculation
Signed-off-by: NLeo Liu <leo.liu@amd.com>
Signed-off-by: NChristian König <christian.koenig@amd.com>

2fc5703a

01 5月, 2014 1 次提交

drm/radeon: use pflip irq on R600+ v2 · f5d636d2

由 Christian König 提交于 4月 23, 2014

Testing the update pending bit directly after issuing an
update is nonsense cause depending on the pixel clock the
CRTC needs a bit of time to execute the flip even when we
are in the VBLANK period.

This is just a non invasive patch to solve the problem at
hand, a more complete and cleaner solution should follow
in the next merge window.

Fixes: https://bugs.freedesktop.org/show_bug.cgi?id=76564

v2: fix source IDs for CRTC2-6
Signed-off-by: NChristian König <christian.koenig@amd.com>
Cc: stable@vger.kernel.org

f5d636d2

17 4月, 2014 1 次提交

drm/radeon: fix runpm handling on APUs (v4) · 90c4cde9

由 Alex Deucher 提交于 4月 10, 2014

Don't try and runtime suspend the APU in PX systems.  We
only want to power down the dGPU.

v2: fix harder
v3: fix stupid typo
v4: consolidate runpm enablement to a single flag

bugs:
https://bugs.freedesktop.org/show_bug.cgi?id=75127
https://bugzilla.kernel.org/show_bug.cgi?id=72701Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Cc: stable@vger.kernel.org

90c4cde9

08 4月, 2014 1 次提交

drm/radeon: fix audio pin counts for DCE6+ (v2) · be0949f5

由 Alex Deucher 提交于 4月 08, 2014

There is actually quite a bit of variance based on
the asic.

v2: fix typo noticed by Jerome.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NChristian König <christian.koenig@amd.com>
Cc: stable@vger.kernel.org

be0949f5

04 3月, 2014 1 次提交

drm/radeon: remove struct radeon_bo_list · df0af440

由 Christian König 提交于 3月 03, 2014

Just move all fields into radeon_cs_reloc, removing unused/duplicated fields.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

df0af440

03 3月, 2014 7 次提交

drm/radeon: remove global vm lock · 529364e0

由 Christian König 提交于 2月 20, 2014

Not needed any more.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

529364e0

drm/radeon: use normal BOs for the page tables v4 · 6d2f2944

由 Christian König 提交于 2月 20, 2014

No need to make it more complicated than necessary,
just allocate the page tables as normal BO and
flush whenever the address change.

v2: update comments and function name
v3: squash bug fixes, page directory and tables patch
v4: rebased on Mareks changes
Signed-off-by: NChristian König <christian.koenig@amd.com>

6d2f2944

drm/radeon: further cleanup vm flushing & fencing · fa688343

由 Christian König 提交于 2月 20, 2014

Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

fa688343

C
drm/radeon: fix VCE suspend/resume · b03b4e4b
由 Christian König 提交于 2月 28, 2014
```
Signed-off-by: NChristian König <christian.koenig@amd.com>
```
b03b4e4b

drm/radeon: validate relocations in the order determined by userspace v3 · c9b76548

由 Marek Olšák 提交于 3月 02, 2014

Userspace should set the first 4 bits of drm_radeon_cs_reloc::flags to
a number from 0 to 15. The higher the number, the higher the priority,
which means a buffer with a higher number will be validated sooner.

The old behavior is preserved: Buffers used for write are prioritized over
read-only buffers if the userspace doesn't set the number.

v2: add buffers to buckets directly, then concatenate them
v3: use a stable sort
Signed-off-by: NMarek Olšák <marek.olsak@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>

c9b76548

drm/radeon: track memory statistics about VRAM and GTT usage and buffer moves v2 · 67e8e3f9

由 Marek Olšák 提交于 3月 02, 2014

The statistics are:
- VRAM usage in bytes
- GTT usage in bytes
- number of bytes moved by TTM

The last one is actually a counter, so you need to sample it before and after
command submission and take the difference.

This is useful for finding performance bottlenecks. Userspace queries are
also added.

v2: use atomic64_t
Signed-off-by: NMarek Olšák <marek.olsak@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>

67e8e3f9

drm/radeon: add a way to get and set initial buffer domains v2 · bda72d58

由 Marek Olšák 提交于 3月 02, 2014

When passing buffers between processes, the receiving process needs to know
the original buffer domain, so that it doesn't accidentally move the buffer.

v2: reserve the buffer
Signed-off-by: NMarek Olšák <marek.olsak@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>

bda72d58

28 2月, 2014 3 次提交

drm/radeon: cleanup the fence ring locking code · 37615527

由 Christian König 提交于 2月 18, 2014

We no longer need to take the ring lock while checking for
a gpu lockup, so just cleanup the code.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

37615527

drm/radeon: improve ring lockup detection code v2 · aee4aa73

由 Christian König 提交于 2月 18, 2014

Use atomics and jiffies_64, so that we don't need to have the
ring mutex locked any more and avoid wrap arounds.

v2: fix some checkpatch warnings
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

aee4aa73

drm/radeon: change audio enable logic · 832eafaf

由 Alex Deucher 提交于 2月 18, 2014

Disable audio around audio hw setup.  This may avoid
hangs on certain asics.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>

832eafaf

19 2月, 2014 3 次提交

drm/radeon: fix CP semaphores on CIK · 8f53492f

由 Christian König 提交于 2月 18, 2014

The CP semaphore queue on CIK has a bug that triggers if uncompleted
waits use the same address while a signal is still pending. Work around
this by using different addresses for each sync.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Cc: stable@vger.kernel.org
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

8f53492f

drm/radeon: drop radeon_ring_force_activity · 2d2fe3f9

由 Christian König 提交于 2月 18, 2014

The reason for the false positives was fixed quite some time ago and since
most engines can still execute NOPs while being locked up it leads to false
negatives.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

2d2fe3f9

drm/radeon: drop drivers copy of the rptr · ff212f25

由 Christian König 提交于 2月 18, 2014

In all cases where it really matters we are using the read functions anyway.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

ff212f25

18 2月, 2014 7 次提交

drm/radeon/dpm: enable dynamic vce state switching v2 · 03afe6f6

由 Alex Deucher 提交于 8月 23, 2013

enable vce states when vce is active.  When vce is active,
it adjusts the currently selected state (performance, battery,
uvd, etc.)

v2: add code comments
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NChristian König <christian.koenig@amd.com>

03afe6f6

A
drm/radeon/dpm: fetch vce states from the vbios · 58bd2a88
由 Alex Deucher 提交于 9月 04, 2013
```
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
```
58bd2a88
A
drm/radeon/dpm: fill in some initial vce infrastructure · b62d628b
由 Alex Deucher 提交于 8月 20, 2013
```
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
```
b62d628b
A
drm/radeon: add callback for setting vce clocks · b59b7333
由 Alex Deucher 提交于 8月 20, 2013
```
Similar to uvd clock setting.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
```
b59b7333
C
drm/radeon: add VCE version parsing and checking · 98ccc291
由 Christian König 提交于 1月 23, 2014
```
Also make the result available to userspace.
Signed-off-by: NChristian König <christian.koenig@amd.com>
```
98ccc291

drm/radeon: initial VCE support v4 · d93f7937

由 Christian König 提交于 5月 23, 2013

Only VCE 2.0 support so far.

v2: squashing multiple patches into this one
v3: add IRQ support for CIK, major cleanups,
    basic code documentation
v4: remove HAINAN from chipset list
Signed-off-by: NChristian König <christian.koenig@amd.com>

d93f7937

drm/radeon: fix CP semaphores on CIK · 1c61eae4

由 Christian König 提交于 2月 18, 2014

The CP semaphore queue on CIK has a bug that triggers if uncompleted
waits use the same address while a signal is still pending. Work around
this by using different addresses for each sync.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Cc: stable@vger.kernel.org

1c61eae4

30 1月, 2014 1 次提交

drm/radeon: fix VMID use tracking · 593b2635

由 Christian König 提交于 1月 23, 2014

Otherwise we allocate a new VMID on nearly every submit.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

593b2635

09 1月, 2014 2 次提交

drm/radeon: add pci config hard reset · 1a0041b8

由 Alex Deucher 提交于 10月 02, 2013

This is used to hard reset the asic.  If a soft
reset is not able to reset things, a hard reset
can be used.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

1a0041b8

drm/radeon: add hard_reset module parameter · 363eb0b4

由 Alex Deucher 提交于 1月 08, 2014

Enabling this parameter enables pci config reset,
aka hard reset, which is a bus level chip reset.
In some cases this works more reliably than a soft
reset.  Disabled by default.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

363eb0b4

25 12月, 2013 4 次提交

drm/radeon: remove generic rptr/wptr functions (v2) · ea31bf69

由 Alex Deucher 提交于 12月 09, 2013

Fill in asic family specific versions rather than
using the generic version.  This lets us handle asic
specific differences more easily.  In this case, we
disable sw swapping of the rtpr writeback value on
r6xx+ since the hw does it for us.  Fixes bogus
rptr readback on BE systems.

v2: remove missed cpu_to_le32(), add comments
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

ea31bf69

drm/radeon/dpm: add a late enable callback · 914a8987

由 Alex Deucher 提交于 12月 19, 2013

Certain features need to be enabled after ring tests
(e.g., powergating, etc.).  Add a function pointer
to split out late enable features.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

914a8987

drm/radeon: add GART debugfs access v3 · dd66d20e

由 Christian König 提交于 12月 18, 2013

v2: add default_llseek
v3: set inode size in the open callback
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

dd66d20e

drm/radeon: add VRAM debugfs access v3 · 2014b569

由 Christian König 提交于 12月 18, 2013

Not very fast, but makes it possible to access even the
normally inaccessible parts of VRAM from userspace.

v2: use MM_INDEX_HI for >2GB mem access, add default_llseek
v3: set inode size in the open callback
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

2014b569

23 12月, 2013 1 次提交

drm/radeon: expose render backend mask to the userspace · 439a1cff

由 Marek Olšák 提交于 12月 22, 2013

This will allow userspace to correctly program the PA_SC_RASTER_CONFIG
register, so it can be considered a fix.
Signed-off-by: NMarek Olšák <marek.olsak@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Cc: stable@vger.kernel.org

439a1cff

03 12月, 2013 1 次提交

drm/radeon: add radeon_vm_bo_update trace point · 9c57a6bd

由 Christian König 提交于 11月 25, 2013

Also rename the function to better reflect what it is doing.

agd5f: fix argument size warning
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

9c57a6bd

18 11月, 2013 1 次提交

drm/radeon/cik: Add macrotile mode array query · 32f79a8a

由 Michel Dänzer 提交于 11月 18, 2013

This is required to properly calculate the tiling parameters
in userspace.
Signed-off-by: NMichel Dänzer <michel.daenzer@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

32f79a8a

16 11月, 2013 2 次提交

drm/radeon: use a single doorbell for cik kms compute · d5754ab8

由 Andrew Lewycky 提交于 11月 13, 2013

A single doorbell page is plenty for cik kms compute.
Use a single page and manage doorbell allocation by
individual doorbells rather than pages.  Identify
doorbells by their index rather than byte offset.
Signed-off-by: NAndrew Lewycky <Andrew.Lewycky@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

d5754ab8

drm/radeon: allow semaphore emission to fail · 1654b817

由 Christian König 提交于 11月 12, 2013

To workaround bugs and/or certain limits it's sometimes
useful to fall back to waiting on fences.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Cc: stable@vger.kernel.org

1654b817

02 11月, 2013 3 次提交

drm/radeon: fixup locking inversion between, mmap_sem and reservations · 28a326c5

由 Maarten Lankhorst 提交于 10月 09, 2013

op 08-10-13 18:58, Thomas Hellstrom schreef:
> On 10/08/2013 06:47 PM, Jerome Glisse wrote:
>> On Tue, Oct 08, 2013 at 06:29:35PM +0200, Thomas Hellstrom wrote:
>>> On 10/08/2013 04:55 PM, Jerome Glisse wrote:
>>>> On Tue, Oct 08, 2013 at 04:45:18PM +0200, Christian König wrote:
>>>>> Am 08.10.2013 16:33, schrieb Jerome Glisse:
>>>>>> On Tue, Oct 08, 2013 at 04:14:40PM +0200, Maarten Lankhorst wrote:
>>>>>>> Allocate and copy all kernel memory before doing reservations. This prevents a locking
>>>>>>> inversion between mmap_sem and reservation_class, and allows us to drop the trylocking
>>>>>>> in ttm_bo_vm_fault without upsetting lockdep.
>>>>>>>
>>>>>>> Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com>
>>>>>> I would say NAK. Current code only allocate temporary page in AGP case.
>>>>>> So AGP case is userspace -> temp page -> cs checker -> radeon ib.
>>>>>>
>>>>>> Non AGP is directly memcpy to radeon IB.
>>>>>>
>>>>>> Your patch allocate memory memcpy userspace to it and it will then be
>>>>>> memcpy to IB. Which means you introduce an extra memcpy in the process
>>>>>> not something we want.
>>>>> Totally agree. Additional to that there is no good reason to provide
>>>>> anything else than anonymous system memory to the CS ioctl, so the
>>>>> dependency between the mmap_sem and reservations are not really
>>>>> clear to me.
>>>>>
>>>>> Christian.
>>>> I think is that in other code path you take mmap_sem first then reserve
>>>> bo. But here we reserve bo and then we take mmap_sem because of copy
>>> >from user.
>>>> Cheers,
>>>> Jerome
>>>>
>>> Actually the log message is a little confusing. I think the mmap_sem
>>> locking inversion problem is orthogonal to what's being fixed here.

> >>> This patch fixes the possible recursive bo::reserve caused by
> >>> malicious user-space handing a pointer to ttm memory so that the ttm
> >>> fault handler is called when bos are already reserved. That may
> >>> cause a (possibly interruptible) livelock.

>>> Once that is fixed, we are free to choose the mmap_sem ->
>>> bo::reserve locking order. Currently it's bo::reserve->mmap_sem(),
>>> but the hack required in the ttm fault handler is admittedly a bit
>>> ugly.  The plan is to change the locking order to
>>> mmap_sem->bo::reserve

> >>> I'm not sure if it applies to this particular case, but it should be
> >>> possible to make sure that copy_from_user_inatomic() will always
> >>> succeed, by making sure the pages are present using
> >>> get_user_pages(), and release the pages after
> >>> copy_from_user_inatomic() is done. That way there's no need for a
> >>> double memcpy slowpath, but if the copied data is very fragmented I
> >>> guess the resulting code may look ugly. The get_user_pages()
> >>> function will return an error if it hits TTM pages.

>>> /Thomas
>> get_user_pages + copy_from_user_inatomic is overkill. We should just
>> do get_user_pages which fails with ttm memory and then use copy_highpage
>> helper.
>>
>> Cheers,
>> Jerome
> Yeah, it may well be that that's the preferred solution.
>
> /Thomas
>
I still disagree, and shuffled radeon_ib_get around to be called sooner.

How does the patch below look?
8<-------
Allocate and copy all kernel memory before doing reservations. This prevents a locking
inversion between mmap_sem and reservation_class, and allows us to drop the trylocking
in ttm_bo_vm_fault without upsetting lockdep.

Changes since v1:
- Kill extra memcpy for !AGP case.
Signed-off-by: NMaarten Lankhorst <maarten.lankhorst@canonical.com>
Reviewed-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

28a326c5

drm/radeon: drop CP page table updates & cleanup v2 · 24c16439

由 Christian König 提交于 10月 30, 2013

The DMA ring seems to be stable now.

v2: remove pt_ring_index as well
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

24c16439

drm/radeon: rework and fix reset detection v2 · f9eaf9ae

由 Christian König 提交于 10月 29, 2013

Stop fiddling with jiffies, always wait for RADEON_FENCE_JIFFIES_TIMEOUT.
Consolidate the two wait sequence implementations into just one function.
Activate all waiters and remember if the reset was already done instead of
trying to reset from only one thread.

v2: clear reset flag earlier to avoid timeout in IB test
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

f9eaf9ae

openeuler / Kernel 大约 1 年 前同步成功

openeuler / Kernel
大约 1 年前同步成功