Commits · 701e1e789142042144c8cc10b8f6d1554e960144 · openanolis / cloud-kernel

19 Aug, 2014 1 commit

drm/radeon: properly document reloc priority mask · 701e1e78

Authored by Christian König on Aug 15, 2014

Instead of hard coding the value, properly document
that this is a userspace interface.

No intended functional change.
Signed-off-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
Cc: stable@vger.kernel.org

701e1e78

05 Aug, 2014 1 commit

drm/radeon: invalidate moved BOs in the VM (v2) · e31ad969

Authored by Christian König on Jul 18, 2014

Don't wait for the BO to be used again, just
update the PT on the next VM use.

v2: remove stray semicolon.
Signed-off-by: Christian König <christian.koenig@amd.com>
Tested-by: Michel Dänzer <michel.daenzer@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

e31ad969

22 Jul, 2014 2 commits

drm/radeon: fix VM IB handling · cc9e67e3

Authored by Christian König on Jul 18, 2014

Calling radeon_vm_bo_find on the IB BO during CS
is illegal and can lead to a crash.
Signed-off-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

cc9e67e3

drm/radeon: fix handling of radeon_vm_bo_rmv v3 · 036bf46a

Authored by Christian König on Jul 18, 2014

v3: completely rewritten. We now just remember which areas
of the PT to clear and do so on the next command submission.

Bug: https://bugs.freedesktop.org/show_bug.cgi?id=79980
Signed-off-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

036bf46a

10 Jun, 2014 1 commit

drm/radeon: rename alt_domain to allowed_domains · ce6758c8

Authored by Christian König on Jun 02, 2014

And also domain to prefered_domains. That matches better
what those values represent.
Signed-off-by: Christian König <christian.koenig@amd.com>
Cc: Marek Olšák <maraeo@gmail.com>
Reviewed-by: Marek Olšák <marek.olsak@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

ce6758c8

30 May, 2014 2 commits

drm/radeon: don't allow RADEON_GEM_DOMAIN_CPU for command submission · ec65da38

Authored by Marek Olšák on May 27, 2014

It hangs the hardware.
Signed-off-by: Marek Olšák <marek.olsak@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
Cc: stable@vger.kernel.org

ec65da38

drm/radeon: avoid crash if VM command submission isn't available · 60a44540

Authored by Christian König on May 21, 2014

Signed-off-by: Christian König <christian.koenig@amd.com>
CC: stable@vger.kernel.org

60a44540

04 Mar, 2014 1 commit

drm/radeon: remove struct radeon_bo_list · df0af440

Authored by Christian König on Mar 03, 2014

Just move all fields into radeon_cs_reloc, removing unused/duplicated fields.
Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>

df0af440

03 Mar, 2014 6 commits

drm/radeon: remove global vm lock · 529364e0

Authored by Christian König on Feb 20, 2014

Not needed any more.
Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>

529364e0

drm/radeon: use normal BOs for the page tables v4 · 6d2f2944

Authored by Christian König on Feb 20, 2014

No need to make it more complicated than necessary,
just allocate the page tables as normal BO and
flush whenever the address changes.

v2: update comments and function name
v3: squash bug fixes, page directory and tables patch
v4: rebased on Marek's changes
Signed-off-by: Christian König <christian.koenig@amd.com>

6d2f2944

drm/radeon: further cleanup vm flushing & fencing · fa688343

Authored by Christian König on Feb 20, 2014

Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>

fa688343

drm/radeon: limit how much memory TTM can move per IB according to VRAM usage · 19dff56a

Authored by Marek Olšák on Mar 02, 2014

Signed-off-by: Marek Olšák <marek.olsak@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>

19dff56a

drm/radeon: validate relocations in the order determined by userspace v3 · c9b76548

Authored by Marek Olšák on Mar 02, 2014

Userspace should set the first 4 bits of drm_radeon_cs_reloc::flags to
a number from 0 to 15. The higher the number, the higher the priority,
which means a buffer with a higher number will be validated sooner.

The old behavior is preserved: Buffers used for write are prioritized over
read-only buffers if the userspace doesn't set the number.

v2: add buffers to buckets directly, then concatenate them
v3: use a stable sort
Signed-off-by: Marek Olšák <marek.olsak@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>

c9b76548
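
The ordering described in c9b76548 can be illustrated with a small userspace sketch. This is not the kernel code: `struct reloc`, `PRIO_MASK`, `reloc_priority()` and `order_by_priority()` are hypothetical names, and the sketch assumes that "the first 4 bits" means the low 4 bits of the flags word. It shows only the bucket-then-concatenate idea from v2 and the stable ordering from v3; the fallback for userspace that does not set a priority is left out.

```c
#include <stddef.h>
#include <stdint.h>

/* Assumption: the priority lives in the low 4 bits of the flags word. */
#define PRIO_MASK   0xfu
#define NUM_BUCKETS 16

/* Hypothetical stand-in for a relocation entry, not the kernel struct. */
struct reloc {
    uint32_t handle;
    uint32_t flags;
};

static unsigned reloc_priority(const struct reloc *r)
{
    return r->flags & PRIO_MASK;
}

/*
 * Stable bucket ordering: fills order[] with indices into in[],
 * highest priority first. Entries with equal priority keep their
 * submission order, which is what makes the sort stable.
 */
static void order_by_priority(const struct reloc *in, size_t n, size_t *order)
{
    size_t count[NUM_BUCKETS] = {0};
    size_t start[NUM_BUCKETS];
    size_t i, pos = 0;
    int p;

    /* Pass 1: count how many entries fall into each bucket. */
    for (i = 0; i < n; i++)
        count[reloc_priority(&in[i])]++;

    /* Concatenate buckets from priority 15 down to priority 0. */
    for (p = NUM_BUCKETS - 1; p >= 0; p--) {
        start[p] = pos;
        pos += count[p];
    }

    /* Pass 2: place each entry at the next free slot of its bucket. */
    for (i = 0; i < n; i++)
        order[start[reloc_priority(&in[i])]++] = i;
}
```

With five entries whose priorities are 3, 15, 3, 0 and 15, the resulting index order is 1, 4, 0, 2, 3: both priority-15 buffers come first in their original order, then both priority-3 buffers, then the priority-0 buffer.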

drm/radeon: add buffers to the LRU list from smallest to largest · 4330441a

Authored by Marek Olšák on Mar 02, 2014

Signed-off-by: Marek Olšák <marek.olsak@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>

4330441a

18 Feb, 2014 2 commits

drm/radeon/dpm: enable dynamic vce state switching v2 · 03afe6f6

Authored by Alex Deucher on Aug 23, 2013

enable vce states when vce is active.  When vce is active,
it adjusts the currently selected state (performance, battery,
uvd, etc.)

v2: add code comments
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Christian König <christian.koenig@amd.com>

03afe6f6

drm/radeon: initial VCE support v4 · d93f7937

Authored by Christian König on May 23, 2013

Only VCE 2.0 support so far.

v2: squashing multiple patches into this one
v3: add IRQ support for CIK, major cleanups,
    basic code documentation
v4: remove HAINAN from chipset list
Signed-off-by: Christian König <christian.koenig@amd.com>

d93f7937

30 Jan, 2014 1 commit

drm/radeon: skip async dma init on r6xx · b9ace36f

Authored by Alex Deucher on Jan 27, 2014

The hw is buggy and it's not currently used, but it's
currently still initialized by the driver.  Skip the init.
Skipping init also seems to improve stability with dpm on
some r6xx asics.

bug:
https://bugs.freedesktop.org/show_bug.cgi?id=66963
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>

b9ace36f

18 Dec, 2013 1 commit

drm: Kill DRM_COPY_(TO|FROM)_USER · 1d6ac185

Authored by Daniel Vetter on Dec 11, 2013

Less yelling ftw!
Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Signed-off-by: Dave Airlie <airlied@redhat.com>

1d6ac185

03 Dec, 2013 1 commit

drm/radeon: add radeon_vm_bo_update trace point · 9c57a6bd

Authored by Christian König on Nov 25, 2013

Also rename the function to better reflect what it is doing.

agd5f: fix argument size warning
Signed-off-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

9c57a6bd

16 Nov, 2013 1 commit

drm/radeon: allow semaphore emission to fail · 1654b817

Authored by Christian König on Nov 12, 2013

To work around bugs and/or certain limits it's sometimes
useful to fall back to waiting on fences.
Signed-off-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
Cc: stable@vger.kernel.org

1654b817

02 Nov, 2013 1 commit

drm/radeon: fixup locking inversion between, mmap_sem and reservations · 28a326c5

Authored by Maarten Lankhorst on Oct 09, 2013

On 08-10-13 18:58, Thomas Hellstrom wrote:
> On 10/08/2013 06:47 PM, Jerome Glisse wrote:
>> On Tue, Oct 08, 2013 at 06:29:35PM +0200, Thomas Hellstrom wrote:
>>> On 10/08/2013 04:55 PM, Jerome Glisse wrote:
>>>> On Tue, Oct 08, 2013 at 04:45:18PM +0200, Christian König wrote:
>>>>> On 08.10.2013 16:33, Jerome Glisse wrote:
>>>>>> On Tue, Oct 08, 2013 at 04:14:40PM +0200, Maarten Lankhorst wrote:
>>>>>>> Allocate and copy all kernel memory before doing reservations. This prevents a locking
>>>>>>> inversion between mmap_sem and reservation_class, and allows us to drop the trylocking
>>>>>>> in ttm_bo_vm_fault without upsetting lockdep.
>>>>>>>
>>>>>>> Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com>
>>>>>> I would say NAK. Current code only allocate temporary page in AGP case.
>>>>>> So AGP case is userspace -> temp page -> cs checker -> radeon ib.
>>>>>>
>>>>>> Non AGP is directly memcpy to radeon IB.
>>>>>>
>>>>>> Your patch allocate memory memcpy userspace to it and it will then be
>>>>>> memcpy to IB. Which means you introduce an extra memcpy in the process
>>>>>> not something we want.
>>>>> Totally agree. Additional to that there is no good reason to provide
>>>>> anything else than anonymous system memory to the CS ioctl, so the
>>>>> dependency between the mmap_sem and reservations are not really
>>>>> clear to me.
>>>>>
>>>>> Christian.
>>>> I think is that in other code path you take mmap_sem first then reserve
>>>> bo. But here we reserve bo and then we take mmap_sem because of copy
>>>> from user.
>>>> Cheers,
>>>> Jerome
>>>>
>>> Actually the log message is a little confusing. I think the mmap_sem
>>> locking inversion problem is orthogonal to what's being fixed here.
>>>
>>> This patch fixes the possible recursive bo::reserve caused by
>>> malicious user-space handing a pointer to ttm memory so that the ttm
>>> fault handler is called when bos are already reserved. That may
>>> cause a (possibly interruptible) livelock.
>>>
>>> Once that is fixed, we are free to choose the mmap_sem ->
>>> bo::reserve locking order. Currently it's bo::reserve->mmap_sem(),
>>> but the hack required in the ttm fault handler is admittedly a bit
>>> ugly.  The plan is to change the locking order to
>>> mmap_sem->bo::reserve
>>>
>>> I'm not sure if it applies to this particular case, but it should be
>>> possible to make sure that copy_from_user_inatomic() will always
>>> succeed, by making sure the pages are present using
>>> get_user_pages(), and release the pages after
>>> copy_from_user_inatomic() is done. That way there's no need for a
>>> double memcpy slowpath, but if the copied data is very fragmented I
>>> guess the resulting code may look ugly. The get_user_pages()
>>> function will return an error if it hits TTM pages.
>>>
>>> /Thomas
>> get_user_pages + copy_from_user_inatomic is overkill. We should just
>> do get_user_pages which fails with ttm memory and then use copy_highpage
>> helper.
>>
>> Cheers,
>> Jerome
> Yeah, it may well be that that's the preferred solution.
>
> /Thomas
>
I still disagree, and shuffled radeon_ib_get around to be called sooner.

How does the patch below look?
8<-------
Allocate and copy all kernel memory before doing reservations. This prevents a locking
inversion between mmap_sem and reservation_class, and allows us to drop the trylocking
in ttm_bo_vm_fault without upsetting lockdep.

Changes since v1:
- Kill extra memcpy for !AGP case.
Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com>
Reviewed-by: Jerome Glisse <jglisse@redhat.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

28a326c5

19 Oct, 2013 1 commit

drm/radeon/uvd: revert lower msg&fb buffer requirements on UVD3 · bcf6f1e9

Authored by Christian König on Oct 15, 2013

This only seems to work for H.264 but not for VC-1 streams.

Need to investigate further why exactly.

This reverts commit 4b40e592.
Signed-off-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

bcf6f1e9

23 Sep, 2013 1 commit

drm/radeon/uvd: lower msg&fb buffer requirements on UVD3 · 4b40e592

Authored by Christian König on Sep 23, 2013

Starting with UVD3 message and feedback buffers have their
own 256MB segment, so no need to force them into VRAM any more.
Signed-off-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

4b40e592

21 Sep, 2013 1 commit

drm/radeon: avoid UVD corruption on AGP cards using GPU gart · 4ca5a6cb

Authored by Alex Deucher on Sep 15, 2013

If the user has forced the driver to use the internal GPU gart
rather than AGP on an AGP card, force the buffers to vram
as well.
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>
Cc: stable@vger.kernel.org

4ca5a6cb

16 Sep, 2013 1 commit

drm/radeon: avoid UVD corruptions on AGP cards · 4f66c599

Authored by Christian König on Sep 15, 2013

Putting everything into VRAM seems to help.
Signed-off-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
Cc: stable@vger.kernel.org

4f66c599

11 Sep, 2013 1 commit

drm/radeon: add command submission tracepoint · 860024e5

Authored by Christian König on Sep 07, 2013

Neither complete nor perfect, but solves my problem at hand
and might be useful in the future.
Signed-off-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

860024e5

31 Aug, 2013 2 commits

drm/radeon: rework ring function handling · 76a0df85

Authored by Christian König on Aug 13, 2013

Give the ring functions a separate structure and let the asic
structure point to the ring specific functions. This simplifies
the code and allows us to make changes at only one point.

No change in functionality.
Signed-off-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

76a0df85

drm/radeon/dpm: use multiple UVD power states (v3) · ce3537d5

Authored by Alex Deucher on Jul 24, 2013

Use the UVD handle information to determine
which power states to select when using UVD.  For
example, decoding a single SD stream requires much
lower clocks than multiple HD streams.

v2: switch to a cleaner dpm/uvd interface
v3: change the uvd power state while streams
are active if need be
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

ce3537d5

28 Jun, 2013 2 commits

drm/ttm: make ttm reservation calls behave like reservation calls · ecff665f

Authored by Maarten Lankhorst on Jun 27, 2013

This commit converts the source of the val_seq counter to
the ww_mutex api. The reservation objects are converted later,
because there is still a lockdep splat in nouveau that has to
be resolved first.
Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com>
Reviewed-by: Jerome Glisse <jglisse@redhat.com>
Signed-off-by: Dave Airlie <airlied@redhat.com>

ecff665f

drm/radeon/kms: enable UVD as needed (v9) · 8a227555

Authored by Alex Deucher on Jun 21, 2013

When using UVD, the driver must switch to a special UVD power
state.  In the CS ioctl, switch to the power state and schedule
work to change the power state back, when the work comes up,
check if uvd is still busy and if not, switch back to the user
state, otherwise, reschedule the work.

Note:  We really need some better way to decide when to
switch out of the uvd power state.  Switching power states
while playback is active makes uvd angry.

V2: fix locking.

V3: switch from timer to delayed work

V4: check fence driver for UVD jobs, reduce timeout to
    1 second and rearm timeout on activity

v5: rebase on new dpm tree

v6: rebase on interim uvd on demand changes

v7: fix UVD when DPM is disabled

v8: unify non-DPM and DPM UVD handling

v9: remove leftover idle work struct
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Christian König <deathsimple@vodafone.de>

8a227555

27 Jun, 2013 1 commit

drm/radeon/cik: Add support for compute queues (v4) · 963e81f9

Authored by Alex Deucher on Jun 26, 2013

On CIK, the compute rings work slightly differently than
on previous asics, however the basic concepts are the same.

The main differences:
- New MEC engines for compute queues
- Multiple queues per MEC:
  - CI/KB: 1 MEC, 4 pipes per MEC, 8 queues per pipe = 32 queues
  -    KV: 2 MEC, 4 pipes per MEC, 8 queues per pipe = 64 queues
- Queues can be allocated and scheduled by another queue
- New doorbell aperture allows you to assign space in the aperture
  for the wptr which allows for userspace access to queues

v2: add wptr shadow, fix eop setup
v3: fix comment
v4: switch to new callback method
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
Reviewed-by: Jerome Glisse <jglisse@redhat.com>

963e81f9
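
The queue totals quoted in 963e81f9 follow from multiplying MECs, pipes per MEC and queues per pipe. The sketch below only restates that arithmetic; `struct mec_layout` and `total_queues()` are hypothetical names for illustration and do not appear in the driver.

```c
/* Hypothetical description of a compute engine layout (illustration only). */
struct mec_layout {
    const char *name;          /* asic family label from the commit message */
    unsigned mecs;             /* number of MEC engines */
    unsigned pipes_per_mec;    /* pipes per MEC */
    unsigned queues_per_pipe;  /* queues per pipe */
};

/* Total compute queues = MECs * pipes per MEC * queues per pipe. */
static unsigned total_queues(const struct mec_layout *l)
{
    return l->mecs * l->pipes_per_mec * l->queues_per_pipe;
}
```

Using the figures from the commit message, a layout of 1 MEC, 4 pipes and 8 queues per pipe (CI/KB) gives 32 queues, and 2 MECs with the same pipe and queue counts (KV) gives 64.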

26 Jun, 2013 1 commit

drm/radeon: Add CP init for CIK (v7) · 841cf442

Authored by Alex Deucher on Dec 18, 2012

Sets up the GFX ring and loads ucode for GFX and Compute.

Todo:
- handle compute queue setup.

v2: add documentation
v3: integrate with latest reset changes
v4: additional init fixes
v5: scratch reg write back no longer supported on CIK
v6: properly set CP_RB0_BASE_HI
v7: rebase
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

841cf442

24 Apr, 2013 1 commit

drm/radeon: raise UVD clocks only on demand · 55b51c88

Authored by Christian König on Apr 18, 2013

That not only saves some power, but also solves problems with
older chips where an idle UVD block on higher clocks can
cause problems.
Signed-off-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

55b51c88

09 Apr, 2013 3 commits

drm/radeon: UVD bringup v8 · f2ba57b5

Authored by Christian König on Apr 08, 2013

Just everything needed to decode videos using UVD.

v6: just all the bugfixes and support for R7xx-SI merged in one patch
v7: UVD_CGC_GATE is a write only register, lockup detection fix
v8: split out VRAM fallback changes, remove support for RV770,
    add support for HEMLOCK, add buffer sizes checks
Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Jerome Glisse <jglisse@redhat.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

f2ba57b5

drm/radeon: rework fallback handling v2 · 4474f3a9

Authored by Christian König on Apr 08, 2013

Let the CS module decide if we can fall back to VRAM or not.

v2: remove unintended change
Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Jerome Glisse <jglisse@redhat.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

4474f3a9

drm/radeon: UVD doesn't needs VM on SI v2 · 57449040

Authored by Christian König on Apr 08, 2013

v2: update error message and comment
Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Jerome Glisse <jglisse@redhat.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

57449040

02 Feb, 2013 1 commit

drm/radeon: use IBs for VM page table updates v2 · 43f1214a

Authored by Alex Deucher on Feb 01, 2013

For very large page table updates, we can exceed the
size of the ring.  To avoid this, use an IB to perform
the page table update.

v2(ck): cleanup the IB infrastructure and then use it instead
        of filling the struct ourselves.
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Christian König <christian.koenig@amd.com>

43f1214a

01 Feb, 2013 3 commits

drm/radeon: pull out common next_reloc function · e9716993

Authored by Ilija Hadzic on Jan 02, 2013

next_reloc function does the same thing in all ASICs with
the exception of R600 which has a special case in legacy mode.
Pull out the common function in preparation for refactoring.
Signed-off-by: Ilija Hadzic <ihadzic@research.bell-labs.com>
Reviewed-by: Marek Olšák <maraeo@gmail.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

e9716993

drm/radeon: rename r100_cs_dump_packet to radeon_cs_dump_packet · c3ad63af

Authored by Ilija Hadzic on Jan 02, 2013

This function is not limited to r100, but it can dump a
(raw) packet for any ASIC. Rename it accordingly and move
its declaration to radeon.h
Signed-off-by: Ilija Hadzic <ihadzic@research.bell-labs.com>
Reviewed-by: Marek Olšák <maraeo@gmail.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

c3ad63af

drm/radeon: factor out cs_next_is_pkt3_nop function · 9ffb7a6d

Authored by Ilija Hadzic on Jan 02, 2013

Once we factored out radeon_cs_packet_parse function,
evergreen_cs_next_is_pkt3_nop and r600_cs_next_is_pkt3_nop
functions became identical, so they can be factored out
into a common function.
Signed-off-by: Ilija Hadzic <ihadzic@research.bell-labs.com>
Reviewed-by: Marek Olšák <maraeo@gmail.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

9ffb7a6d

openanolis / cloud-kernel synced successfully more than 1 year ago