提交 · 2280ab57b6edc8581497d5e101c4694faf839c3e · openanolis / cloud-kernel

03 3月, 2014 4 次提交

C
drm/radeon: fix VCE suspend/resume · b03b4e4b
由 Christian König 提交于 2月 28, 2014
```
Signed-off-by: NChristian König <christian.koenig@amd.com>
```
b03b4e4b

drm/radeon: validate relocations in the order determined by userspace v3 · c9b76548

由 Marek Olšák 提交于 3月 02, 2014

Userspace should set the first 4 bits of drm_radeon_cs_reloc::flags to
a number from 0 to 15. The higher the number, the higher the priority,
which means a buffer with a higher number will be validated sooner.

The old behavior is preserved: Buffers used for write are prioritized over
read-only buffers if the userspace doesn't set the number.

v2: add buffers to buckets directly, then concatenate them
v3: use a stable sort
Signed-off-by: NMarek Olšák <marek.olsak@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>

c9b76548

drm/radeon: track memory statistics about VRAM and GTT usage and buffer moves v2 · 67e8e3f9

由 Marek Olšák 提交于 3月 02, 2014

The statistics are:
- VRAM usage in bytes
- GTT usage in bytes
- number of bytes moved by TTM

The last one is actually a counter, so you need to sample it before and after
command submission and take the difference.

This is useful for finding performance bottlenecks. Userspace queries are
also added.

v2: use atomic64_t
Signed-off-by: NMarek Olšák <marek.olsak@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>

67e8e3f9

drm/radeon: add a way to get and set initial buffer domains v2 · bda72d58

由 Marek Olšák 提交于 3月 02, 2014

When passing buffers between processes, the receiving process needs to know
the original buffer domain, so that it doesn't accidentally move the buffer.

v2: reserve the buffer
Signed-off-by: NMarek Olšák <marek.olsak@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>

bda72d58

28 2月, 2014 2 次提交

drm/radeon: cleanup the fence ring locking code · 37615527

由 Christian König 提交于 2月 18, 2014

We no longer need to take the ring lock while checking for
a gpu lockup, so just cleanup the code.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

37615527

drm/radeon: improve ring lockup detection code v2 · aee4aa73

由 Christian König 提交于 2月 18, 2014

Use atomics and jiffies_64, so that we don't need to have the
ring mutex locked any more and avoid wrap arounds.

v2: fix some checkpatch warnings
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

aee4aa73

19 2月, 2014 3 次提交

drm/radeon: fix CP semaphores on CIK · 8f53492f

由 Christian König 提交于 2月 18, 2014

The CP semaphore queue on CIK has a bug that triggers if uncompleted
waits use the same address while a signal is still pending. Work around
this by using different addresses for each sync.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Cc: stable@vger.kernel.org
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

8f53492f

drm/radeon: drop radeon_ring_force_activity · 2d2fe3f9

由 Christian König 提交于 2月 18, 2014

The reason for the false positives was fixed quite some time ago and since
most engines can still execute NOPs while being locked up it leads to false
negatives.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

2d2fe3f9

drm/radeon: drop drivers copy of the rptr · ff212f25

由 Christian König 提交于 2月 18, 2014

In all cases where it really matters we are using the read functions anyway.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

ff212f25

18 2月, 2014 7 次提交

drm/radeon/dpm: enable dynamic vce state switching v2 · 03afe6f6

由 Alex Deucher 提交于 8月 23, 2013

enable vce states when vce is active.  When vce is active,
it adjusts the currently selected state (performance, battery,
uvd, etc.)

v2: add code comments
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NChristian König <christian.koenig@amd.com>

03afe6f6

A
drm/radeon/dpm: fetch vce states from the vbios · 58bd2a88
由 Alex Deucher 提交于 9月 04, 2013
```
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
```
58bd2a88
A
drm/radeon/dpm: fill in some initial vce infrastructure · b62d628b
由 Alex Deucher 提交于 8月 20, 2013
```
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
```
b62d628b
A
drm/radeon: add callback for setting vce clocks · b59b7333
由 Alex Deucher 提交于 8月 20, 2013
```
Similar to uvd clock setting.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
```
b59b7333
C
drm/radeon: add VCE version parsing and checking · 98ccc291
由 Christian König 提交于 1月 23, 2014
```
Also make the result available to userspace.
Signed-off-by: NChristian König <christian.koenig@amd.com>
```
98ccc291

drm/radeon: initial VCE support v4 · d93f7937

由 Christian König 提交于 5月 23, 2013

Only VCE 2.0 support so far.

v2: squashing multiple patches into this one
v3: add IRQ support for CIK, major cleanups,
    basic code documentation
v4: remove HAINAN from chipset list
Signed-off-by: NChristian König <christian.koenig@amd.com>

d93f7937

drm/radeon: fix CP semaphores on CIK · 1c61eae4

由 Christian König 提交于 2月 18, 2014

The CP semaphore queue on CIK has a bug that triggers if uncompleted
waits use the same address while a signal is still pending. Work around
this by using different addresses for each sync.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Cc: stable@vger.kernel.org

1c61eae4

30 1月, 2014 1 次提交

drm/radeon: fix VMID use tracking · 593b2635

由 Christian König 提交于 1月 23, 2014

Otherwise we allocate a new VMID on nearly every submit.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

593b2635

09 1月, 2014 2 次提交

drm/radeon: add pci config hard reset · 1a0041b8

由 Alex Deucher 提交于 10月 02, 2013

This is used to hard reset the asic.  If a soft
reset is not able to reset things, a hard reset
can be used.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

1a0041b8

drm/radeon: add hard_reset module parameter · 363eb0b4

由 Alex Deucher 提交于 1月 08, 2014

Enabling this parameter enables pci config reset,
aka hard reset, which is a bus level chip reset.
In some cases this works more reliably than a soft
reset.  Disabled by default.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

363eb0b4

25 12月, 2013 4 次提交

drm/radeon: remove generic rptr/wptr functions (v2) · ea31bf69

由 Alex Deucher 提交于 12月 09, 2013

Fill in asic family specific versions rather than
using the generic version.  This lets us handle asic
specific differences more easily.  In this case, we
disable sw swapping of the rtpr writeback value on
r6xx+ since the hw does it for us.  Fixes bogus
rptr readback on BE systems.

v2: remove missed cpu_to_le32(), add comments
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

ea31bf69

drm/radeon/dpm: add a late enable callback · 914a8987

由 Alex Deucher 提交于 12月 19, 2013

Certain features need to be enabled after ring tests
(e.g., powergating, etc.).  Add a function pointer
to split out late enable features.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

914a8987

drm/radeon: add GART debugfs access v3 · dd66d20e

由 Christian König 提交于 12月 18, 2013

v2: add default_llseek
v3: set inode size in the open callback
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

dd66d20e

drm/radeon: add VRAM debugfs access v3 · 2014b569

由 Christian König 提交于 12月 18, 2013

Not very fast, but makes it possible to access even the
normally inaccessible parts of VRAM from userspace.

v2: use MM_INDEX_HI for >2GB mem access, add default_llseek
v3: set inode size in the open callback
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

2014b569

23 12月, 2013 1 次提交

drm/radeon: expose render backend mask to the userspace · 439a1cff

由 Marek Olšák 提交于 12月 22, 2013

This will allow userspace to correctly program the PA_SC_RASTER_CONFIG
register, so it can be considered a fix.
Signed-off-by: NMarek Olšák <marek.olsak@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Cc: stable@vger.kernel.org

439a1cff

03 12月, 2013 1 次提交

drm/radeon: add radeon_vm_bo_update trace point · 9c57a6bd

由 Christian König 提交于 11月 25, 2013

Also rename the function to better reflect what it is doing.

agd5f: fix argument size warning
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

9c57a6bd

18 11月, 2013 1 次提交

drm/radeon/cik: Add macrotile mode array query · 32f79a8a

由 Michel Dänzer 提交于 11月 18, 2013

This is required to properly calculate the tiling parameters
in userspace.
Signed-off-by: NMichel Dänzer <michel.daenzer@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

32f79a8a

16 11月, 2013 2 次提交

drm/radeon: use a single doorbell for cik kms compute · d5754ab8

由 Andrew Lewycky 提交于 11月 13, 2013

A single doorbell page is plenty for cik kms compute.
Use a single page and manage doorbell allocation by
individual doorbells rather than pages.  Identify
doorbells by their index rather than byte offset.
Signed-off-by: NAndrew Lewycky <Andrew.Lewycky@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

d5754ab8

drm/radeon: allow semaphore emission to fail · 1654b817

由 Christian König 提交于 11月 12, 2013

To workaround bugs and/or certain limits it's sometimes
useful to fall back to waiting on fences.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Cc: stable@vger.kernel.org

1654b817

02 11月, 2013 5 次提交

drm/radeon: fixup locking inversion between, mmap_sem and reservations · 28a326c5

由 Maarten Lankhorst 提交于 10月 09, 2013

op 08-10-13 18:58, Thomas Hellstrom schreef:
> On 10/08/2013 06:47 PM, Jerome Glisse wrote:
>> On Tue, Oct 08, 2013 at 06:29:35PM +0200, Thomas Hellstrom wrote:
>>> On 10/08/2013 04:55 PM, Jerome Glisse wrote:
>>>> On Tue, Oct 08, 2013 at 04:45:18PM +0200, Christian König wrote:
>>>>> Am 08.10.2013 16:33, schrieb Jerome Glisse:
>>>>>> On Tue, Oct 08, 2013 at 04:14:40PM +0200, Maarten Lankhorst wrote:
>>>>>>> Allocate and copy all kernel memory before doing reservations. This prevents a locking
>>>>>>> inversion between mmap_sem and reservation_class, and allows us to drop the trylocking
>>>>>>> in ttm_bo_vm_fault without upsetting lockdep.
>>>>>>>
>>>>>>> Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com>
>>>>>> I would say NAK. Current code only allocate temporary page in AGP case.
>>>>>> So AGP case is userspace -> temp page -> cs checker -> radeon ib.
>>>>>>
>>>>>> Non AGP is directly memcpy to radeon IB.
>>>>>>
>>>>>> Your patch allocate memory memcpy userspace to it and it will then be
>>>>>> memcpy to IB. Which means you introduce an extra memcpy in the process
>>>>>> not something we want.
>>>>> Totally agree. Additional to that there is no good reason to provide
>>>>> anything else than anonymous system memory to the CS ioctl, so the
>>>>> dependency between the mmap_sem and reservations are not really
>>>>> clear to me.
>>>>>
>>>>> Christian.
>>>> I think is that in other code path you take mmap_sem first then reserve
>>>> bo. But here we reserve bo and then we take mmap_sem because of copy
>>> >from user.
>>>> Cheers,
>>>> Jerome
>>>>
>>> Actually the log message is a little confusing. I think the mmap_sem
>>> locking inversion problem is orthogonal to what's being fixed here.

> >>> This patch fixes the possible recursive bo::reserve caused by
> >>> malicious user-space handing a pointer to ttm memory so that the ttm
> >>> fault handler is called when bos are already reserved. That may
> >>> cause a (possibly interruptible) livelock.

>>> Once that is fixed, we are free to choose the mmap_sem ->
>>> bo::reserve locking order. Currently it's bo::reserve->mmap_sem(),
>>> but the hack required in the ttm fault handler is admittedly a bit
>>> ugly.  The plan is to change the locking order to
>>> mmap_sem->bo::reserve

> >>> I'm not sure if it applies to this particular case, but it should be
> >>> possible to make sure that copy_from_user_inatomic() will always
> >>> succeed, by making sure the pages are present using
> >>> get_user_pages(), and release the pages after
> >>> copy_from_user_inatomic() is done. That way there's no need for a
> >>> double memcpy slowpath, but if the copied data is very fragmented I
> >>> guess the resulting code may look ugly. The get_user_pages()
> >>> function will return an error if it hits TTM pages.

>>> /Thomas
>> get_user_pages + copy_from_user_inatomic is overkill. We should just
>> do get_user_pages which fails with ttm memory and then use copy_highpage
>> helper.
>>
>> Cheers,
>> Jerome
> Yeah, it may well be that that's the preferred solution.
>
> /Thomas
>
I still disagree, and shuffled radeon_ib_get around to be called sooner.

How does the patch below look?
8<-------
Allocate and copy all kernel memory before doing reservations. This prevents a locking
inversion between mmap_sem and reservation_class, and allows us to drop the trylocking
in ttm_bo_vm_fault without upsetting lockdep.

Changes since v1:
- Kill extra memcpy for !AGP case.
Signed-off-by: NMaarten Lankhorst <maarten.lankhorst@canonical.com>
Reviewed-by: NJerome Glisse <jglisse@redhat.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

28a326c5

drm/radeon: drop CP page table updates & cleanup v2 · 24c16439

由 Christian König 提交于 10月 30, 2013

The DMA ring seems to be stable now.

v2: remove pt_ring_index as well
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

24c16439

drm/radeon: rework and fix reset detection v2 · f9eaf9ae

由 Christian König 提交于 10月 29, 2013

Stop fiddling with jiffies, always wait for RADEON_FENCE_JIFFIES_TIMEOUT.
Consolidate the two wait sequence implementations into just one function.
Activate all waiters and remember if the reset was already done instead of
trying to reset from only one thread.

v2: clear reset flag earlier to avoid timeout in IB test
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

f9eaf9ae

drm/radeon: add runtime PM support (v2) · 10ebc0bc

由 Dave Airlie 提交于 9月 17, 2012

This hooks radeon up to the runtime PM system to enable
dynamic power management for secondary GPUs in switchable
and powerxpress laptops.

v2: agd5f: clean up, add module parameter
Signed-off-by: NDave Airlie <airlied@redhat.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

10ebc0bc

drm/radeon: convert to pmops · 7473e830

由 Dave Airlie 提交于 9月 13, 2012

This is a pre-requisite for runtime pm on powerxpress systems.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

7473e830

24 10月, 2013 1 次提交

drm/radeon/dpm: fix incompatible casting on big endian · cdf6e805

由 Alex Deucher 提交于 10月 23, 2013

We use u16 for voltage values throughout the driver so switch
the table values to a u16 as well.  Fixes an incompatible
cast error in ci_patch_clock_voltage_limits_with_vddc_leakage()
picked up by coverity.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

cdf6e805

11 9月, 2013 4 次提交

drm/radeon/dpm: add infrastructure to properly handle bapm · 1c71bda0

由 Alex Deucher 提交于 9月 09, 2013

bapm is a pm feature for sharing the power budget between
the GPU and the CPU on APUs.  It needs to be enabled or
disabled in certain circumstances.  For now, disable it
when on battery and enable it when on AC power.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

1c71bda0

drm/radeon: fix typo in PG flags · 2b19d17f

由 Alex Deucher 提交于 9月 04, 2013

s/CG/PG/ in the GFX powergating flag name.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

2b19d17f

drm/radeon: add spinlocks for indirect register accesss · 0a5b7b0b

由 Alex Deucher 提交于 9月 03, 2013

This adds spinlocks to protect access to other
indirect register apertures.  These indirect spaces are
used pretty infrequently and we haven't had an reported
problems, but better safe than sorry.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

0a5b7b0b

drm/radeon: protect concurrent smc register access with a spinlock · fe78118c

由 Alex Deucher 提交于 9月 03, 2013

smc registers are access indirectly via the main mmio aperture, so
there may be problems with concurrent access.  This adds a spinlock
to protect access to this register space.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

fe78118c

31 8月, 2013 2 次提交

drm/radeon/si: restructure cg code (v3) · e16866ec

由 Alex Deucher 提交于 8月 08, 2013

Resturcture clockgating code so that it can be
enabled/disabled from other components such as
dpm.

v2: make function static
v3: add fine grained cg controls
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

e16866ec

drm/radeon: add cg and pg flags · 64d8a728

由 Alex Deucher 提交于 8月 08, 2013

This commits adds flags for supported clockgating and
powergating features.  This allows us to more easily
track which features are supported on a particular
asic and to enable/disable features for debugging.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

64d8a728

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功