Commits · 894a8293aaa702a5aef758bc069162a671ca7a07 · openeuler / raspberrypi-kernel

02 Nov, 2017 8 commits

Authored by Felix Kuehling on Nov 01, 2017

These were missed previously when rebasing changes for upstreaming.

v2: Remove redundant sched_policy conditions
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

894a8293

drm/amdkfd: Update queue_count before mapping queues · 096d1a3e

Authored by Felix Kuehling on Nov 01, 2017

map_queues_cpsch uses the queue_count to decide whether to upload
a new runlist. So update the counter before calling it.
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

096d1a3e
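
The ordering issue described in this commit can be illustrated with a small userspace model. This is only a sketch: struct dqm_model, map_queues() and the two add_queue_* helpers are hypothetical stand-ins, not the driver's code. The only behavior taken from the commit message is that the mapping step consults queue_count to decide whether to upload a runlist.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-in for the device queue manager state. */
struct dqm_model {
    int queue_count;        /* queues that should appear in the runlist */
    int runlists_uploaded;  /* how many times a runlist was uploaded */
};

/* Models the mapping step: upload a runlist only if queues exist. */
static bool map_queues(struct dqm_model *dqm)
{
    assert(dqm);
    if (dqm->queue_count <= 0)
        return false;       /* nothing to map, so no upload */
    dqm->runlists_uploaded++;
    return true;
}

/* Problematic order: the counter is still stale when mapping runs. */
static bool add_queue_count_after(struct dqm_model *dqm)
{
    bool uploaded = map_queues(dqm);

    dqm->queue_count++;
    return uploaded;
}

/* Order described by the commit: update the counter, then map. */
static bool add_queue_count_before(struct dqm_model *dqm)
{
    dqm->queue_count++;
    return map_queues(dqm);
}
```

In this model, adding the first queue with the stale counter uploads nothing, while the corrected order uploads a runlist immediately.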

drm/amdkfd: Cleanup DQM ASIC-specific ops · bfd5e378

Authored by Yong Zhao on Nov 01, 2017

Remove empty initialize function.

Rename register_process to update_qpd to avoid confusion with the
non-ASIC-specific register_process.

Shorten ops_asic_specific to asic_ops.
Signed-off-by: Yong Zhao <yong.zhao@amd.com>
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

bfd5e378

drm/amdkfd: Register/Deregister process on qpd resolution · 5a29ad6b

Authored by Ben Goz on Nov 01, 2017

Process registration needs to happen on each device. So use per-device
queue lists to determine when to register/deregister the process.
Signed-off-by: Ben Goz <ben.goz@amd.com>
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

5a29ad6b

drm/amdkfd: Fix debug unregister procedure on process termination · 062c5672

Authored by Yair Shachar on Nov 01, 2017

Take the dbgmgr lock and unregister before destroying the debug manager.
Do this before destroying the queues.

v2: Correct locking order in kfd_ioctl_dbg_register to make sure the
process mutex and dbgmgr mutex are always taken in the same order.
Signed-off-by: Yair Shachar <yair.shachar@amd.com>
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

062c5672

drm/amdkfd: Avoid calling amd_iommu_unbind_pasid() when suspending · e2a8e999

Authored by Yong Zhao on Nov 01, 2017

When kfd is suspending on an APU, we do not need to call
amd_iommu_unbind_pasid(), because the pasid will be unbound automatically
when power goes off.

On the other hand, calling amd_iommu_unbind_pasid() will trigger
kfd_process_iommu_unbind_callback() if the process is not terminating.
By design, kfd_process_iommu_unbind_callback() should only be called
when a process is terminating. So we would rather not call
amd_iommu_unbind_pasid() when suspending.
Signed-off-by: Yong Zhao <yong.zhao@amd.com>
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

e2a8e999

drm/amdkfd: Disable CP/SDMA ring/doorbell in MQD · bba9662d

Authored by Jay Cornwall on Nov 01, 2017

The MQD represents an inactive context and should not have ring or
doorbell enable bits set. Doing so interferes with HWS, which streams
the MQD onto the HQD. If enable bits are set, this activates the ring
or doorbell before the HQD is fully configured.
Signed-off-by: Jay Cornwall <Jay.Cornwall@amd.com>
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

bba9662d

drm/amdkfd: Clean up the data structure in kfd_process · ab40cba3

Authored by Yong Zhao on Nov 01, 2017

A list of per-process queues is maintained in the
kfd_process_queue_manager, so the queues array in kfd_process is
redundant and in fact unused.
Signed-off-by: Yong Zhao <yong.zhao@amd.com>
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

ab40cba3

30 Oct, 2017 1 commit

drm/radeon: deprecate and remove KFD interface · f4fa88ab

Authored by Christian König on Oct 30, 2017

To quote Felix: "For testing KV with current user mode stack, please use
amdgpu. I don't expect this to work with radeon and I'm not planning to
spend any effort on making radeon work with a current user mode stack."

Only compile tested, but should be straightforward.
Signed-off-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

f4fa88ab

28 Oct, 2017 16 commits

drm/amdkfd: use a high priority workqueue for IH work · 48e876a2

Authored by Andres Rodriguez on Oct 27, 2017

In systems under heavy load the IH work may experience significant
scheduling delays.

Under load + system workqueue:
    Max Latency: 7.023695 ms
    Avg Latency: 0.263994 ms

Under load + high priority workqueue:
    Max Latency: 1.162568 ms
    Avg Latency: 0.163213 ms

Further work is required to measure the impact of per-cpu settings on IH
performance.
Signed-off-by: Andres Rodriguez <andres.rodriguez@amd.com>
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Acked-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

48e876a2

drm/amdkfd: wait only for IH work on IH exit · 0f875e3f

Authored by Andres Rodriguez on Oct 27, 2017

We don't need to wait for all work to complete in the IH exit function.
We only need to make sure the interrupt_work has finished executing to
guarantee that ih_kfifo is no longer in use.
Signed-off-by: Andres Rodriguez <andres.rodriguez@amd.com>
Acked-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

0f875e3f

drm/amdkfd: increase IH num entries to 8192 · 27232055

Authored by Andres Rodriguez on Oct 27, 2017

A larger buffer will let us accommodate applications with a large number
of semi-simultaneous event signals.
Signed-off-by: Andres Rodriguez <andres.rodriguez@amd.com>
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Acked-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

27232055

drm/amdkfd: use standard kernel kfifo for IH · 04ad47bd

Authored by Andres Rodriguez on Oct 27, 2017

Replace our implementation of a lockless ring buffer with the standard
Linux kernel kfifo.

We shouldn't maintain our own version of a standard data structure.
Signed-off-by: Andres Rodriguez <andres.rodriguez@amd.com>
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Acked-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

04ad47bd
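
For readers unfamiliar with this kind of structure, the following is a minimal userspace sketch of a single-producer, single-consumer FIFO that uses free-running indices and a power-of-two size, which is the general scheme a kfifo-style buffer relies on. It is not the kernel kfifo API and not the driver's code; fifo_model, fifo_put() and fifo_get() are hypothetical names, and the memory barriers a real lockless implementation needs are omitted.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define FIFO_SIZE 8u   /* number of entries; must be a power of two */

static_assert((FIFO_SIZE & (FIFO_SIZE - 1)) == 0,
              "FIFO_SIZE must be a power of two");

/* Hypothetical model: 'in' and 'out' only ever increase. */
struct fifo_model {
    uint32_t in;               /* total entries ever written */
    uint32_t out;              /* total entries ever read */
    uint32_t data[FIFO_SIZE];
};

static uint32_t fifo_len(const struct fifo_model *f)
{
    return f->in - f->out;     /* unsigned wraparound keeps this correct */
}

/* Returns false when the FIFO is full and the entry was dropped. */
static bool fifo_put(struct fifo_model *f, uint32_t value)
{
    if (fifo_len(f) == FIFO_SIZE)
        return false;
    f->data[f->in & (FIFO_SIZE - 1)] = value;  /* mask instead of modulo */
    f->in++;
    return true;
}

/* Returns false when the FIFO is empty. */
static bool fifo_get(struct fifo_model *f, uint32_t *value)
{
    if (fifo_len(f) == 0)
        return false;
    *value = f->data[f->out & (FIFO_SIZE - 1)];
    f->out++;
    return true;
}
```

Because only the producer writes in and only the consumer writes out, the two sides never update the same index, which is what makes the lockless variant possible once proper barriers are added.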

drm/amdkfd: Make event limit dependent on user mode mapping size · b9a5d0a5

Authored by Felix Kuehling on Oct 27, 2017

This allows increasing the KFD_SIGNAL_EVENT_LIMIT in kfd_ioctl.h
without breaking processes built with older kfd_ioctl.h versions.
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

b9a5d0a5
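
One way to picture a limit that depends on the user mode mapping size is sketched below. Everything here is an assumption made for illustration: the 8-byte slot size, the BUILTIN_EVENT_LIMIT value and the event_limit() helper are hypothetical and are not taken from the commit.

```c
#include <assert.h>
#include <stddef.h>

#define SLOT_SIZE 8u              /* assumption: bytes per signal slot */
#define BUILTIN_EVENT_LIMIT 4096u /* hypothetical limit in a newer header */

/*
 * Derive the number of usable events from the size of the region the
 * process actually mapped, capped by the limit the kernel was built with.
 */
static unsigned int event_limit(size_t mapped_size)
{
    size_t slots = mapped_size / SLOT_SIZE;

    assert(BUILTIN_EVENT_LIMIT > 0);
    return slots < BUILTIN_EVENT_LIMIT ? (unsigned int)slots
                                       : BUILTIN_EVENT_LIMIT;
}
```

Under these assumptions, a process built against an older header that maps a smaller region simply gets a smaller limit, so event IDs never point past the end of its mapping.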

drm/amdkfd: Use IH context ID for signal lookup · 3f04f961

Authored by Felix Kuehling on Oct 27, 2017

This speeds up signal lookup when the IH ring entry includes a
valid context ID or partial context ID. Only if the context ID is
found to be invalid, fall back to an exhaustive search of all
signaled events.
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Acked-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

3f04f961
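
The fast-path and fallback structure described here can be sketched in plain C as follows. This is a simplified model, not the driver's implementation: event_model, wake_event() and handle_signal_interrupt() are hypothetical, the event table is a fixed array rather than the driver's real data structure, and partial context IDs are not modeled.

```c
#include <assert.h>
#include <stdbool.h>

#define MAX_EVENTS 16u  /* hypothetical table size */

/* Hypothetical per-event state; the array index doubles as the event ID. */
struct event_model {
    bool in_use;
    bool signaled;  /* set when the event's signal slot was written */
    int wakeups;    /* how many times waiters were woken */
};

static void wake_event(struct event_model *ev)
{
    ev->signaled = false;
    ev->wakeups++;
}

/* Returns the number of events woken for one interrupt ring entry. */
static int handle_signal_interrupt(struct event_model *table,
                                   unsigned int context_id)
{
    int woken = 0;

    assert(table);

    /* Fast path: the context ID names a valid, signaled event. */
    if (context_id < MAX_EVENTS && table[context_id].in_use &&
        table[context_id].signaled) {
        wake_event(&table[context_id]);
        return 1;
    }

    /* Slow path: the ID was not usable, so scan every slot. */
    for (unsigned int id = 0; id < MAX_EVENTS; id++) {
        if (table[id].in_use && table[id].signaled) {
            wake_event(&table[id]);
            woken++;
        }
    }
    return woken;
}
```

The exhaustive scan stays in place as a safety net, so an invalid context ID costs time but never loses a signal.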

drm/amdkfd: Simplify event ID and signal slot management · 482f0777

Authored by Felix Kuehling on Oct 27, 2017

Signal slots are identical to event IDs.

Replace the used_slot_bitmap and events hash table with an IDR to
allocate and look up event IDs and signal slots more efficiently.
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Acked-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

482f0777

drm/amdkfd: Simplify events page allocator · 50cb7dd9

Authored by Felix Kuehling on Oct 27, 2017

The first event page is always big enough to handle all events.
Handling of multiple events pages is not supported by user mode, and
not necessary.
Signed-off-by: Yong Zhao <yong.zhao@amd.com>
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Acked-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

50cb7dd9

drm/amdkfd: Use wait_queue_t to implement event waiting · 74e40716

Authored by Felix Kuehling on Oct 27, 2017

Use standard wait queues for waiting and waking up waiting threads
instead of inventing our own. We still have our own wait loop
because the HSA event semantics require the ability to have one
thread waiting on multiple wait queues (events) at the same time.
Signed-off-by: Kent Russell <kent.russell@amd.com>
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Acked-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

74e40716

drm/amdkfd: remove redundant kfd_event_waiter.input_index · ebf947fe

Authored by Felix Kuehling on Oct 27, 2017

This is always identical to the index of the event_waiter in the array.
No need to store it in the waiter record.
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

ebf947fe

drm/amdkfd: Fix event destruction with pending waiters · fe528c13

Authored by Felix Kuehling on Oct 27, 2017

When an event with pending waiters is destroyed, those waiters may
end up sleeping forever unless they are notified and woken up.
Implement the notification by clearing the waiter->event pointer,
which becomes invalid anyway, when the event is freed, and waking
up the waiting tasks.

Waiters on an event that's destroyed return failure.
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Acked-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

fe528c13

drm/amdkfd: Clean up kfd_wait_on_events · fdf0c833

Authored by Felix Kuehling on Oct 27, 2017

Cleaned up the code while resolving some potential bugs and
inconsistencies in the process.

Clean-ups:
* Remove enum kfd_event_wait_result, which duplicates
  KFD_IOC_EVENT_RESULT definitions
* alloc_event_waiters can be called without holding p->event_mutex
* Return an error code from copy_signaled_event_data instead of bool
* Clean up error handling code paths to minimize duplication in
  kfd_wait_on_events

Fixes:
* Consistently return an error code from kfd_wait_on_events and set
  wait_result to KFD_IOC_WAIT_RESULT_FAIL in all failure cases.
* Always call free_waiters while holding p->event_mutex
* copy_signaled_event_data might sleep. Don't call it while the task state
  is TASK_INTERRUPTIBLE.
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Acked-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

fdf0c833

drm/amdkfd: Fix scheduler race in kfd_wait_on_events sleep loop · d9aeec4c

Authored by Sean Keely on Oct 27, 2017

Signed-off-by: Sean Keely <sean.keely@amd.com>
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Acked-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

d9aeec4c

drm/amdkfd: Short cut for kfd_wait_on_events without waiting · 1f9d09be

Authored by Sean Keely on Oct 27, 2017

If kfd_wait_on_events can return immediately, we don't need to populate
the wait list and don't need to enter the sleep-loop.
Signed-off-by: Sean Keely <sean.keely@amd.com>
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Acked-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

1f9d09be

drm/amdkfd: Don't dereference kfd_process.mm · 9b56bb11

Authored by Felix Kuehling on Oct 27, 2017

The kfd_process doesn't own a reference to the mm_struct, so it can
disappear without warning even while the kfd_process still exists.

Therefore, avoid dereferencing the kfd_process.mm pointer and make
it opaque. Use get_task_mm to get a temporary reference to the mm
when it's needed.

v2: removed unnecessary WARN_ON
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

9b56bb11

drm/amdkfd: Add SDMA trap src id to the KFD isr wanted list · 66b783b4

Authored by Besar Wicaksono on Oct 27, 2017

This enables SDMA signalling with event interrupt.
Signed-off-by: Besar Wicaksono <Besar.Wicaksono@amd.com>
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>

66b783b4

26 Oct, 2017 8 commits

drm/amd/amdgpu: Remove workaround for suspend/resume in uvd7 · 4a0144bf

Authored by Tom St Denis on Oct 24, 2017

The workaround is not required anymore and would result in
hangs during suspend/resume cycles if the uvd block were busy.
Signed-off-by: Tom St Denis <tom.stdenis@amd.com>
Acked-by: Leo Liu <leo.liu@amd.com>
Acked-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

4a0144bf

drm/amdgpu: don't flush the TLB before initializing GART · fa2cd036

Authored by Christian König on Oct 16, 2017

No point in doing this.
Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

fa2cd036

drm/amdgpu: minor cleanup for amdgpu_ttm_bind · ec8c9f8b

Authored by Christian König on Oct 16, 2017

Filter the placement mask before using it. In theory it could be that we
have other flags set here as well.
Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

ec8c9f8b

drm/amdgpu/psp: prevent page fault by checking write_frame address(v4) · 4694257e

Authored by Evan Quan on Oct 16, 2017

 - Prevent a possible buffer overflow when updating the ring buffer by
    bounds checking the command frame against the available space in the
    ring buffer.

 v2: update the ring_buffer_end address
 v3: update the commit log
 v4: squash in print fix (Michel)
Signed-off-by: Evan Quan <evan.quan@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

4694257e
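
The kind of bounds check this commit describes can be sketched in userspace as below. This is an illustrative model only: ring_model, FRAME_SIZE and ring_write_frame() are hypothetical names and sizes, and the real driver works with device addresses rather than a plain byte array.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

#define FRAME_SIZE 16u   /* hypothetical command frame size in bytes */
#define RING_BYTES 64u   /* hypothetical ring buffer size in bytes */

static_assert(RING_BYTES >= FRAME_SIZE, "ring must hold at least one frame");

struct ring_model {
    unsigned char buf[RING_BYTES];
    size_t wptr;         /* byte offset where the next frame goes */
};

/* Copies one frame into the ring; refuses writes that would overflow. */
static bool ring_write_frame(struct ring_model *r, const unsigned char *frame)
{
    /* Reject a write pointer whose frame would not fit in the buffer. */
    if (r->wptr > RING_BYTES - FRAME_SIZE)
        return false;

    memcpy(r->buf + r->wptr, frame, FRAME_SIZE);
    r->wptr += FRAME_SIZE;
    return true;
}
```

Checking before the copy turns a would-be out-of-bounds write into an error the caller can report.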

drm/amd/powerplay: retrieve the real-time coreClock values · 0722382d

Authored by Evan Quan on Oct 20, 2017

 - Currently, the coreClock value for min/max performance level on Raven
   is hard-coded. Use the real-time value retrieved by the
   GetGfxMinFreqLimit and GetGfxMaxFreqLimit PPSMC messages.
Signed-off-by: Evan Quan <evan.quan@amd.com>
Acked-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

0722382d

drm/amd/powerplay: fix performance drop on Vega10 · b87079ec

Authored by Eric Huang on Oct 19, 2017

Setting package power PID to 1 fixes performance drop caused by
updated SMU FW, before DPM is enabled.
Signed-off-by: Eric Huang <JinHuiEric.Huang@amd.com>
Acked-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

b87079ec

drm/amd/powerplay: add one smc message for Vega10 · 75e50086

Authored by Eric Huang on Oct 19, 2017

This is used to fix performance drop caused by updated SMU FW.
Signed-off-by: Eric Huang <JinHuiEric.Huang@amd.com>
Acked-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

75e50086

drm/amd/powerplay: fix amd_powerplay_reset() · 7265d50e

Authored by Dan Carpenter on Oct 24, 2017

We accidentally inverted an if statement and turned amd_powerplay_reset()
into a no-op.

Fixes: ae97988f ("drm/amd/powerplay: tidy up ret checks in amd_powerplay.c (v3)")
Signed-off-by: Dan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

7265d50e

25 Oct, 2017 1 commit

drm/amd/amdgpu: Remove workaround check for UVD6 on APUs · d3daa2c7

Authored by Tom St Denis on Oct 23, 2017

On APUs the uvd6 driver was skipping proper suspend/resume routines, resulting
in a broken state upon resume.
Signed-off-by: Tom St Denis <tom.stdenis@amd.com>
Acked-by: Alex Deucher <alexander.deucher@amd.com>
Reviewed-by: Leo Liu <leo.liu@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

d3daa2c7

22 Oct, 2017 1 commit

drm/amd/powerplay: fix uninitialized variable · 8b95f4f7

Authored by Rex Zhu on Oct 20, 2017

refresh_rate was not initialized when programming the
display gap.
This patch fixes the VCE ring test failure
when doing S3 on Polaris10.

bug: https://bugs.freedesktop.org/show_bug.cgi?id=103102
bug: https://bugzilla.kernel.org/show_bug.cgi?id=196615
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Rex Zhu <Rex.Zhu@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
Cc: stable@vger.kernel.org

8b95f4f7

21 Oct, 2017 4 commits

drm/amdgpu:fix wb_clear · 63ae07ca

Authored by Monk Liu on Oct 17, 2017

Properly shift the index when clearing so we clear
the right bit.
Signed-off-by: Monk Liu <Monk.Liu@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

63ae07ca
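
The class of bug fixed here, freeing with a scaled offset instead of the slot index, can be modeled as follows. This is a sketch under stated assumptions: wb_model, wb_get(), wb_free(), the 32-slot bitmap and the shift of 3 (each slot spanning 8 units) are all chosen for illustration and are not taken from the commit.

```c
#include <assert.h>
#include <stdint.h>

#define SLOT_SHIFT 3u   /* assumption: each slot spans 8 offset units */
#define NUM_SLOTS 32u   /* hypothetical number of slots */

/* Hypothetical allocator: one bit per slot in a 32-bit bitmap. */
struct wb_model {
    uint32_t used;
};

/* Allocates a slot and hands back its offset, not its index. */
static int wb_get(struct wb_model *wb, uint32_t *offset)
{
    for (uint32_t slot = 0; slot < NUM_SLOTS; slot++) {
        if (!(wb->used & (UINT32_C(1) << slot))) {
            wb->used |= UINT32_C(1) << slot;
            *offset = slot << SLOT_SHIFT;
            return 0;
        }
    }
    return -1;  /* no free slot */
}

/* Converts the offset back to a slot index before clearing the bit. */
static void wb_free(struct wb_model *wb, uint32_t offset)
{
    uint32_t slot = offset >> SLOT_SHIFT;

    assert(wb);
    if (slot < NUM_SLOTS)
        wb->used &= ~(UINT32_C(1) << slot);
}
```

Without the shift in wb_free(), an offset of 8 would clear bit 8 instead of bit 1, leaking the slot that was actually freed and possibly releasing one still in use.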

drm/amdgpu:fix vf_error_put · 6867e1b5

Authored by Monk Liu on Oct 16, 2017

1. It should have no effect in the non-SR-IOV case.
2. The NO_VBIOS error is incorrect; it should be
   handled under detect_sriov_bios.
3. Wrap the whole detect_sriov_bios with an SR-IOV check.
Signed-off-by: Monk Liu <Monk.Liu@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

6867e1b5

drm/amdgpu/sriov:now must reinit psp · ef4c166d

Authored by Monk Liu on Sep 22, 2017

Otherwise the KIQ cannot work after a VF FLR.
Signed-off-by: Monk Liu <Monk.Liu@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

ef4c166d

drm/amdgpu: merge bios post checking functions · 91fe77eb

Authored by pding on Oct 19, 2017

Merge the post checking functions to avoid confusion and take
virtualization into account in all cases.
Signed-off-by: pding <Pixel.Ding@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

91fe77eb

20 Oct, 2017 1 commit

drm/amd/powerplay: Place the constant on the right side of the test · 96687ec0

Authored by Georgiana Chelu on Oct 17, 2017

Move the constant to the right side of the comparison in order to
make the code easier to read.

Issue found by checkpatch script:
* WARNING: Comparisons should place the constant on the right side of
the test
Signed-off-by: Georgiana Chelu <georgiana.chelu93@gmail.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

96687ec0
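
A small illustration of the style point raised by checkpatch is given below. MAX_LEVELS and both helper functions are hypothetical and exist only to show the two equivalent spellings side by side; they are not from the powerplay code.

```c
#include <assert.h>
#include <stdbool.h>

#define MAX_LEVELS 8  /* hypothetical limit, for illustration only */

static_assert(MAX_LEVELS > 0, "limit must be positive");

/* Style flagged by checkpatch: the constant sits on the left. */
static bool too_many_levels_flagged(int count)
{
    return MAX_LEVELS < count;
}

/* Preferred style: variable on the left, constant on the right. */
static bool too_many_levels(int count)
{
    return count > MAX_LEVELS;
}
```

Both functions compute the same result; only the reading order changes, so this kind of patch is expected to be behavior-neutral.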