提交 · d474ea7e52cbaaae22711d857949ba6018562c29 · openeuler / Kernel

21 11月, 2014 4 次提交

drm/radeon: use one VMID for each ring · 7c42bc1a

由 Christian König 提交于 11月 19, 2014

Use multiple VMIDs for each VM, one for each ring. That allows
us to execute flushes separately on each ring, still not ideal
cause in a lot of cases rings can share IDs.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

7c42bc1a

drm/radeon: split semaphore and sync object handling v2 · 975700d2

由 Christian König 提交于 11月 19, 2014

Previously we just allocated space for four hardware semaphores
in each software semaphore object. Make software semaphore objects
represent only one hardware semaphore address again by splitting
the sync code into it's own object.

v2: fix typo in comment
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

975700d2

drm/radeon: rework vm_flush parameters · faffaf62

由 Christian König 提交于 11月 19, 2014

Use ring structure instead of index and provide vm_id and pd_addr separately.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

faffaf62

drm/radeon: work around a hw bug in MGCG on CIK · 4bb62c95

由 Alex Deucher 提交于 11月 17, 2014

Always need to set bit 0 of RLC_CGTT_MGCG_OVERRIDE
to avoid unreliable doorbell updates in some cases.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Cc: stable@vger.kernel.org

4bb62c95

13 11月, 2014 1 次提交

drm/radeon: fix for memory training on bonaire 0x6649 · 9feb3dda

由 Alex Deucher 提交于 11月 07, 2014

Workaround for memory link training on certain variants
of 0x6649.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

9feb3dda

07 11月, 2014 2 次提交

drm/radeon: make sure mode init is complete in bandwidth_update · 8efe82ca

由 Alex Deucher 提交于 11月 03, 2014

The power management code calls into the display code for
certain things.  If certain power management sysfs attributes
are called before the driver has finished initializing all of
the hardware we can run into problems with uninitialized
modesetting state.  Add a check to make sure modesetting
init has completed to the bandwidth update callbacks to
fix this.  Can be triggered by the tlp and laptop start
up scripts depending on the timing.

bugs:
https://bugzilla.kernel.org/show_bug.cgi?id=83611
https://bugs.freedesktop.org/show_bug.cgi?id=85771Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Cc: stable@vger.kernel.org

8efe82ca

drm/radeon: set correct CE ram size for CIK · dc4edad6

由 Jammy Zhou 提交于 11月 03, 2014

CE ram size is 32k/0k/0k for GFX/CS0/CS1 with CIK

Ported from amdgpu driver.
Signed-off-by: NJammy Zhou <Jammy.Zhou@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Cc: stable@vger.kernel.org

dc4edad6

03 10月, 2014 2 次提交

drm/radeon: export reservation_object from dmabuf to ttm · 831b6966

由 Maarten Lankhorst 提交于 9月 18, 2014

Adds an extra argument to radeon_bo_create, which is only used in radeon_prime.c.
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NMaarten Lankhorst <maarten.lankhorst@canonical.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

831b6966

drm/radeon: cope with foreign fences inside the reservation object · 392a250b

由 Maarten Lankhorst 提交于 9月 25, 2014

Not the whole world is a radeon! :-)
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NMaarten Lankhorst <maarten.lankhorst@canonical.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

392a250b

01 10月, 2014 1 次提交

drm/radeon/cik: write gfx ucode version to ucode addr reg · 38aea071

由 Alex Deucher 提交于 9月 30, 2014

Helpful for debugging as the version shows up in a
register dump.

Cc: Jay Cornwall <jay.cornwall@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

38aea071

23 9月, 2014 4 次提交

drm/radeon/cik: use a separate counter for CP init timeout · 370ce45b

由 Alex Deucher 提交于 9月 23, 2014

Otherwise we may fail to init the second compute ring.
Noticed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Cc: stable@vger.kernel.org

370ce45b

drm/radeon: Update IH_RB_RPTR register after each processed interrupt · f55e03b9

由 Michel Dänzer 提交于 9月 19, 2014

This might decrease the chance of IH ring buffer overflows.
Signed-off-by: NMichel Dänzer <michel.daenzer@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

f55e03b9

drm/radeon: Make IH ring overflow debugging output more useful · 6cc2fda2

由 Michel Dänzer 提交于 9月 19, 2014

Use the same format for all ring indices, and fix the calculation of the
post-overflow RPTR.
Signed-off-by: NMichel Dänzer <michel.daenzer@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

6cc2fda2

drm/radeon: Clear RB_OVERFLOW bit earlier · 11bab0ae

由 Michel Dänzer 提交于 9月 19, 2014

Otherwise the bit remains set in rdev->ih.rptr, so the wptr can never
match that and we still have an infinite loop.

This fix allows me to successfully recover from an IH ring buffer
overflow.
Signed-off-by: NMichel Dänzer <michel.daenzer@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

11bab0ae

11 9月, 2014 1 次提交

drm/radeon: add the infrastructure for concurrent buffer access · 57d20a43

由 Christian König 提交于 9月 04, 2014

This allows us to specify if we want to sync to
the shared fences of a reservation object or not.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

57d20a43

28 8月, 2014 2 次提交

radeon: Test for PCI root bus before assuming bus->self · 0bd252de

由 Alex Williamson 提交于 8月 27, 2014

If we assign a Radeon device to a virtual machine, we can no longer
assume a fixed hardware topology, like the GPU having a parent device.
This patch simply adds a few pci_is_root_bus() tests to avoid passing
a NULL pointer to PCI access functions, allowing the radeon driver to
work in a QEMU 440FX machine with an assigned HD8570 on the emulated
PCI root bus.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

0bd252de

drm/radeon: drop doing resets in a work item · 3c036389

由 Christian König 提交于 8月 27, 2014

Blocking completely innocent processes with a GPU reset is
a pretty bad idea. Just set needs_reset and let the next
command submission or fence wait do the job.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NMaarten Lankhorst <maarten.lankhorst@canonical.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

3c036389

27 8月, 2014 1 次提交

drm/radeon: save/restore the PD addr on suspend/resume · 054e01d6

由 Christian König 提交于 8月 26, 2014

This fixes a problem with GPU resets and TLB flushes on SI/CIK.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

054e01d6

22 8月, 2014 1 次提交

drm/radeon: add new KV pci id · 6dc14baf

由 Alex Deucher 提交于 8月 21, 2014

bug:
https://bugs.freedesktop.org/show_bug.cgi?id=82912Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Cc: stable@vger.kernel.org

6dc14baf

20 8月, 2014 2 次提交

drm/radeon: fix active_cu mask on SI and CIK after re-init (v3) · 52da51f0

由 Alex Deucher 提交于 8月 19, 2014

Need to initialize the mask to 0 on init, otherwise it
keeps increasing.

bug:
https://bugzilla.kernel.org/show_bug.cgi?id=82581

v2: also fix cu count
v3: split count fix into separate patch
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Reviewed-by: NMichel Dänzer <michel.daenzer@amd.com>
Cc: stable@vger.kernel.org

52da51f0

drm/radeon: fix active cu count for SI and CIK · 6101b3ae

由 Alex Deucher 提交于 8月 19, 2014

This fixes the CU count reported to userspace for
OpenCL.

bug:
https://bugzilla.kernel.org/show_bug.cgi?id=82581Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Reviewed-by: NMichel Dänzer <michel.daenzer@amd.com>
Cc: stable@vger.kernel.org

6101b3ae

19 8月, 2014 2 次提交

drm/radeon: Sync ME and PFP after CP semaphore waits v4 · 86302eea

由 Christian König 提交于 8月 18, 2014

Fixes lockups due to CP read GPUVM faults when running piglit on Cape
Verde.

v2 (chk): apply the fix to R600+ as well, on CIK only the GFX CP has
	  a PFP, add more comments to R600 code, enable flushing again
v3: (agd5f): only apply to 7xx+.  r6xx does not have the packet.
v4: (agd5f): split flush change into a separate patch, fix formatting
Signed-off-by: NMichel Dänzer <michel.daenzer@amd.com>
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Tested-by: NMichel Dänzer <michel.daenzer@amd.com>

86302eea

drm/radeon: Only flush HDP cache for indirect buffers from userspace · 1538a9e0

由 Michel Dänzer 提交于 8月 18, 2014

It isn't necessary for command streams generated by the kernel (at least
not while we aren't storing ring or indirect buffers in VRAM).
Signed-off-by: NMichel Dänzer <michel.daenzer@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

1538a9e0

15 8月, 2014 1 次提交
- A
  drm/radeon: use pfp for all vm_flush related updates · 4fb0bbd5
  由 Alex Deucher 提交于 8月 07, 2014
```
May fix hangs in some cases.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
```
  4fb0bbd5
05 8月, 2014 8 次提交

drm/radeon: Use pflip irqs for pageflip completion if possible. (v2) · 39dc5454

由 Mario Kleiner 提交于 7月 29, 2014

Skip the "manual" pageflip completion checks via polling and
guessing in the vblank handler radeon_crtc_handle_vblank() on
asics which are known to reliably support hw pageflip completion
irqs. Those pflip irqs are a more reliable and race-free method
of handling pageflip completion detection, whereas the "classic"
polling method has some small races in combination with dpm on,
and with the reworked pageflip implementation since Linux 3.16.

On old asics without pflip irqs, the classic method is used.

On asics with known good pflip irqs, only pflip irqs are used
by default, but a new module parameter "use_pflipirqs" allows to
override this in case we encounter asics in the wild with
unreliable or faulty pflip irqs. A module parameter of 0 allows
to use the classic method only in such a case. A parameter of 1
allows to use both classic method and pflip irqs as additional
band-aid to avoid some small races which could happen with the
classic method alone. The setting 1 gives Linux 3.16 behaviour.

Hw pflip irqs are available since R600.

Tested on DCE-4, AMD Cedar - FirePro 2270.

v2:  agd5f: only enable pflip interrupts on DCE4+ as they are not
reliable on older asics.
Signed-off-by: NMario Kleiner <mario.kleiner.de@gmail.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

39dc5454

drm/radeon: use packet3 for nop on hawaii with new firmware · 78cd3661

由 Alex Deucher 提交于 8月 01, 2014

Older firmware didn't support the new nop packet.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAndreas Boll <andreas.boll.dev@gmail.com>

78cd3661

drm/radeon: use packet2 for nop on hawaii with old firmware · 0e16e4cf

由 Alex Deucher 提交于 8月 01, 2014

Older firmware didn't support the new nop packet.

v2 (Andreas Boll):
 - Drop usage of packet3 for new firmware
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com> (v1)
Signed-off-by: NAndreas Boll <andreas.boll.dev@gmail.com>
Cc: stable@vger.kernel.org

0e16e4cf

drm/radeon: Always flush the HDP cache before submitting a CS to the GPU · 72a9987e

由 Michel Dänzer 提交于 7月 31, 2014

This ensures the GPU sees all previous CPU writes to VRAM, which makes it
safe:

* For userspace to stream data from CPU to GPU via VRAM instead of GTT
* For IBs to be stored in VRAM instead of GTT
* For ring buffers to be stored in VRAM instead of GTT, if the HPD flush
  is performed via MMIO
Signed-off-by: NMichel Dänzer <michel.daenzer@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

72a9987e

drm/radeon: set VM base addr using the PFP v2 · f1d2a26b

由 Christian König 提交于 7月 30, 2014

Seems to make VM flushes more stable on SI and CIK.

v2: only use the PFP on the GFX ring on CIK
Signed-off-by: NChristian König <christian.koenig@amd.com>
Cc: stable@vger.kernel.org
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

f1d2a26b

drm/radeon: Allow write-combined CPU mappings of BOs in GTT (v2) · 02376d82

由 Michel Dänzer 提交于 7月 17, 2014

v2: fix rebase onto drm-fixes
Signed-off-by: NMichel Dänzer <michel.daenzer@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

02376d82

drm/radeon: Remove radeon_gart_restore() · a3eb06db

由 Michel Dänzer 提交于 7月 09, 2014

Doesn't seem necessary, the GART table memory should be persistent.
Signed-off-by: NMichel Dänzer <michel.daenzer@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

a3eb06db

drm/radeon/cik: Add support for new ucode format (v5) · f2c6b0f4

由 Alex Deucher 提交于 6月 25, 2014

This adds CIK support for the new ucode format.

v2: add size validation, integrate debug info
v3: add support for MEC2 on KV
v4: fix typos
v4: update to latest format
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

f2c6b0f4

25 7月, 2014 1 次提交

drm/radeon: fix cut and paste issue for hawaii. · 1b2c4869

由 Jerome Glisse 提交于 7月 24, 2014

This is a halfway fix for hawaii acceleration. More fixes to come
but hopefully isolated to userspace.
Signed-off-by: NJérôme Glisse <jglisse@redhat.com>
Cc: stable@vger.kernel.org
Signed-off-by: NDave Airlie <airlied@redhat.com>

1b2c4869

23 7月, 2014 1 次提交

drm/radeon: fix irq ring buffer overflow handling · e8c214d2

由 Christian König 提交于 7月 23, 2014

We must mask out the overflow bit as well, otherwise
the wptr will never match the rptr again and the interrupt
handler will loop forever.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Cc: stable@vger.kernel.org
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Reviewed-by: NMichel Dänzer <michel.daenzer@amd.com>

e8c214d2

15 7月, 2014 1 次提交

drm/radeon: Add radeon <--> amdkfd interface · e28740ec

由 Oded Gabbay 提交于 7月 15, 2014

This patch adds the interface between the radeon driver and the amdkfd driver.
The interface implementation is contained in radeon_kfd.c and radeon_kfd.h.

The interface itself is represented by a pointer to struct
kfd_dev. The pointer is located inside radeon_device structure.

All the register accesses that amdkfd need are done using this interface. This
allows us to avoid direct register accesses in amdkfd proper,  while also
avoiding locking between amdkfd and radeon.

The single exception is the doorbells that are used in both of the drivers.
However, because they are located in separate pci bar pages, the danger of
sharing registers between the drivers is minimal.

Having said that, we are planning to move the doorbells as well to radeon.

v3:

Add interface for sa manager init and fini. The init function will allocate a
buffer on system memory and pin it to the GART address space via the radeon sa
manager.

All mappings of buffers to GART address space are done via the radeon sa
manager. The interface of allocate memory will use the radeon sa manager to sub
allocate from the single buffer that was allocated during the init function.

Change lower_32/upper_32 calls to use linux macros

Add documentation for the interface

v4:

Change ptr field type in kgd_mem from uint32_t* to void* to match to type that
is returned by radeon_sa_bo_cpu_addr

v5:

Change format of mqd structure to work with latest KV firmware
Add support for AQL queues creation to enable working with open-source HSA
runtime.
Move generic kfd-->kgd interface and other generic kgd definitions to a generic
header file that will be used by AMD's radeon and amdgpu drivers
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>

e28740ec

14 7月, 2014 1 次提交

drm/radeon: adding synchronization for GRBM GFX · 1c0a4625

由 Oded Gabbay 提交于 7月 14, 2014

Implementing a lock for selecting and accessing shader engines and arrays.
This lock will make sure that radeon and amdkfd are not colliding when
accessing shader engines and arrays with GRBM_GFX_INDEX register.
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>

1c0a4625

11 7月, 2014 1 次提交

drm/radeon: only print meaningful VM faults · 9b7d786b

由 Christian König 提交于 7月 07, 2014

Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

9b7d786b

10 6月, 2014 3 次提交

drm/radeon: add query for number of active CUs · 65fcf668

由 Alex Deucher 提交于 6月 02, 2014

Query to find out how many compute units on a GPU.
Useful for OpenCL usermode drivers.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

65fcf668

drm/radeon: make vm_block_size a module parameter · 4510fb98

由 Christian König 提交于 6月 05, 2014

Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

4510fb98

drm/radeon: use lower_32_bits where appropriate · 5e167cdb

由 Christian König 提交于 6月 03, 2014

Replace occurrences of "v & 0xffffffff" with lower_32_bits(v)
when it's next to an upper_32_bits(v). Also remove unnecessary
"upper_32_bits(v) & 0xffffffff" code snippets.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

5e167cdb

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功