提交 · 374200b154ae48e8f011fb74dab21f80459f9e47 · openeuler / raspberrypi-kernel

16 3月, 2018 15 次提交

drm/amdkfd: Add module option for testing large-BAR functionality · 374200b1

由 Felix Kuehling 提交于 3月 15, 2018

Simulate large-BAR system by exporting only visible memory. This
limits the amount of available VRAM to the size of the BAR, but
enables CPU access to VRAM.
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Acked-by: NOded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

374200b1

drm/amdkfd: Kmap event page for dGPUs · 0fc8011f

由 Felix Kuehling 提交于 3月 15, 2018

The events page must be accessible in user mode by the GPU and CPU
as well as in kernel mode by the CPU. On dGPUs user mode virtual
addresses are managed by the Thunk's GPU memory allocation code.
Therefore we can't allocate the memory in kernel mode like we do
on APUs. But KFD still needs to map the memory for kernel access.
To facilitate this, the Thunk provides the buffer handle of the
events page to KFD when creating the first event.
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

0fc8011f

drm/amdkfd: Add ioctls for GPUVM memory management · 5ec7e028

由 Felix Kuehling 提交于 3月 15, 2018

v2:
* Fix error handling after kfd_bind_process_to_device in
  kfd_ioctl_map_memory_to_gpu
v3:
* Add ioctl to acquire VM from a DRM FD
v4:
* Return number of successful map/unmap operations in failure cases
* Facilitate partial retry after failed map/unmap
* Added comments with parameter descriptions to new APIs
* Defined AMDKFD_IOC_FREE_MEMORY_OF_GPU write-only
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

5ec7e028

drm/amdkfd: Add TC flush on VMID deallocation for Hawaii · 552764b6

由 Felix Kuehling 提交于 3月 15, 2018

On GFX7 the CP does not perform a TC flush when queues are unmapped.
To avoid TC eviction from accessing an invalid VMID, flush it
explicitly before releasing a VMID.

v2: Fix unnecessary list_for_each_entry_safe
v3: Moved allocation to kfd_process_device_init_vm
Signed-off-by: NAmber Lin <Amber.Lin@amd.com>
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

552764b6

drm/amdkfd: Allocate CWSR trap handler memory for dGPUs · f35751b8

由 Felix Kuehling 提交于 3月 15, 2018

Add helpers for allocating GPUVM memory in kernel mode and use them
to allocate memory for the CWSR trap handler.

v2: Use dev instead of pdd->dev in kfd_process_free_gpuvm
v3:
* Cleaned up and simplified kfd_process_alloc_gpuvm
* Moved allocation for dGPU to kfd_process_device_init_vm
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

f35751b8

drm/amdkfd: Add per-process IDR for buffer handles · 52b29d73

由 Felix Kuehling 提交于 3月 15, 2018

Also used for cleaning up on process termination.

v2: Refactored cleanup on process termination
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

52b29d73

drm/amdkfd: Aperture setup for dGPUs · d01994c2

由 Felix Kuehling 提交于 3月 15, 2018

Set up the GPUVM aperture for SVM (shared virtual memory) that allows
sharing a part of virtual address space between GPUs and CPUs.

Report the size of the GPUVM aperture that is supported by KGD accurately.

The low part of the GPUVM aperture is reserved for kernel use. This is
for kernel-allocated buffers that are only accessed on the GPU:
- CWSR trap handler
- IB for submitting commands in user-mode context from kernel mode
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

d01994c2

drm/amdkfd: Remove limit on number of GPUs · c7bcbfa4

由 Felix Kuehling 提交于 3月 15, 2018

Currently the number of GPUs is limited by aperture placement options
available on GFX7 and GFX8 hardware. This limitation is not necessary.
Scratch and LDS represent per-work-item and per-work-group storage
respectively. Different work-items and work-groups use the same virtual
address to access their own data. Work running on different GPUs is by
definition in different work-groups (different dispatches, in fact).
That means the same virtual addresses can be used for these apertures
on different GPUs.

Add a new AMDKFD_IOC_GET_PROCESS_APERTURES_NEW ioctl that removes the
artificial limitation on the number of GPUs that can be supported. The
new ioctl allows user mode to query the number of GPUs to allocate
enough memory for all GPUs to be reported.

This deprecates AMDKFD_IOC_GET_PROCESS_APERTURES.
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

c7bcbfa4

drm/amdkfd: Populate DRM render device minor · 7c9b7171

由 Oak Zeng 提交于 3月 15, 2018

Populate DRM render device minor in kfd topology
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

7c9b7171

drm/amdkfd: Create KFD VMs on demand · b84394e2

由 Felix Kuehling 提交于 3月 15, 2018

Instead of creating all VMs on process creation, create them when
a process is bound to a device. This will later allow registering
an existing VM from a DRM render node FD at runtime, before the
process is bound to the device. This way the render node VM can be
used for KFD instead of creating our own redundant VM.
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

b84394e2

drm/amdgpu: Add kfd2kgd interface to acquire an existing VM · ede0dd86

由 Felix Kuehling 提交于 3月 15, 2018

This allows acquiring an existing VM from a render node FD to use it
for a compute process.

Such VMs get destroyed when the original file descriptor is released.
Added a callback from amdgpu_vm_fini to handle KFD VM destruction
correctly in this case.

v2:
* Removed vm->vm_context check in amdgpu_amdkfd_gpuvm_destroy_cb,
  check vm->process_info earlier instead
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

ede0dd86

drm/amdgpu: Add helper to turn an existing VM into a compute VM · b236fa1d

由 Felix Kuehling 提交于 3月 15, 2018

v2: Removed updating and checking of vm->vm_context
v3: Enable amdgpu_vm_clear_bo in amdgpu_vm_make_compute
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

b236fa1d

drm/amdgpu: Fix initial validation of PD BO for KFD VMs · 3486625b

由 Felix Kuehling 提交于 3月 15, 2018

Make sure the PD BO is valid and attach the eviction fence during VM
creation. This ensures that the pd_phys_address is actually valid
and an eviction that would invalidate it triggers a KFD process
eviction like it should.

v2: Use uninterruptible waiting in initial PD validation
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

3486625b

drm/amdgpu: Move KFD-specific fields into struct amdgpu_vm · 5b21d3e5

由 Felix Kuehling 提交于 3月 15, 2018

Remove struct amdkfd_vm and move the fields into struct amdgpu_vm.
This will allow turning a VM created by a DRM render node into a
KFD VM.

v2: Removed vm_context field
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

5b21d3e5

drm/amdkfd: fix uninitialized variable use · 48a44387

由 Arnd Bergmann 提交于 3月 15, 2018

When CONFIG_ACPI is disabled, we never initialize the acpi_table
structure in kfd_create_crat_image_virtual:

drivers/gpu/drm/amd/amdkfd/kfd_crat.c: In function 'kfd_create_crat_image_virtual':
drivers/gpu/drm/amd/amdkfd/kfd_crat.c:888:40: error: 'acpi_table' may be used uninitialized in this function [-Werror=maybe-uninitialized]

The undefined behavior also happens for any other acpi_get_table()
failure, but then the compiler can't warn about it.

This adds an error check that prevents the structure from
being used in error, avoiding both the undefined behavior and
the warning about it.

Fixes: 520b8fb7 ("drm/amdkfd: Add topology support for CPUs")
Signed-off-by: NArnd Bergmann <arnd@arndb.de>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

48a44387

15 3月, 2018 1 次提交

drm/amdkfd: add missing include of mm.h · 7420f482

由 Oded Gabbay 提交于 3月 15, 2018

This patch fixes kernel build in ARCH=frv
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

7420f482

23 3月, 2018 9 次提交

drm/amd/pp: clean header file hwmgr.h · 09695ad7

由 Rex Zhu 提交于 3月 22, 2018

Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NRex Zhu <Rex.Zhu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

09695ad7

drm/amd/pp: use mlck_table.count for array loop index limit · 5b293355

由 Colin Ian King 提交于 3月 21, 2018

v2: use temporaries to trivially reduces object size.

The for-loops process data in the mclk_table but use slck_table.count
as the loop index limit.  I believe these are cut-n-paste errors from
the previous almost identical loops as indicated by static analysis.
Fix these.

Detected by CoverityScan, CID#1466001 ("Copy-paste error")

Fixes: 5d97cf39 ("drm/amd/pp: Add and initialize OD_dpm_table for CI/VI.")
Fixes: 5e4d4fbe ("drm/amd/pp: Implement edit_dpm_table on smu7")
Reviewed-by: NRex Zhu <Rex.Zhu@amd.com>
Signed-off-by: NColin Ian King <colin.king@canonical.com>
Signed-off-by: NRex Zhu <Rex.Zhu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

5b293355

drm/amdgpu: Add an ATPX quirk for hybrid laptop · 13b40935

由 Alex Deucher 提交于 3月 21, 2018

_PR3 doesn't seem to work properly, use ATPX instead.

Bug: https://bugs.freedesktop.org/show_bug.cgi?id=104064Reviewed-by: NHuang Rui <ray.huang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Cc: stable@vger.kernel.org

13b40935

drm/amdgpu: fix spelling mistake: "asssert" -> "assert" · 36b3f84a

由 Colin Ian King 提交于 3月 22, 2018

Trivial fix to spelling mistake in pr_err error message text
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NColin Ian King <colin.king@canonical.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

36b3f84a

drm/amd/pp: Add new asic support in pp_psm.c · 8ebde09b

由 Rex Zhu 提交于 3月 21, 2018

In new asics(vega12), no power state management in driver,
So no need to implement related callback functions.
and add some ps checks in pp_psm.c

Revert "drm/amd/powerplay: add new pp_psm infrastructure for vega12 (v2)"
This reverts commit 7d1a63f3aa331b853e41f92d0e7890ed31de8c13.
Acked-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NRex Zhu <Rex.Zhu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

8ebde09b

drm/amd/pp: Clean up powerplay code on Vega12 · bbfcc8af

由 Rex Zhu 提交于 3月 21, 2018

Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NRex Zhu <Rex.Zhu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

bbfcc8af

drm/amd/pp: Add smu irq handlers for legacy asics · 031ec948

由 Rex Zhu 提交于 3月 21, 2018

Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NRex Zhu <Rex.Zhu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

031ec948

drm/amd/pp: Fix set wrong temperature range on smu7 · 3c796843

由 Rex Zhu 提交于 3月 21, 2018

Fix the issue thermal irq was always triggered
as GPU under temperature range detected

The low temp in default thermal policy
was set to -273. so need to use int type for the low temp.
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NRex Zhu <Rex.Zhu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

3c796843

drm/amdgpu: Don't change preferred domian when fallback GTT v5 · cc15dfaa

由 Chunming Zhou 提交于 3月 16, 2018

v2: add sanity checking
v3: make code open
v4: also handle visible to invisible fallback
v5: Since two fallback cases, re-use goto retry
Signed-off-by: NChunming Zhou <david1.zhou@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

cc15dfaa

22 3月, 2018 15 次提交

drm/amdgpu: Fix NULL ptr on driver unload due to init failure. · b6356df3

由 Andrey Grodzovsky 提交于 3月 21, 2018

Problem:
When unloading due to failure amdgpu_device_fini was called twice
which was leading to NULL ptr in amdgpu_irq_disable_all.

Fix:
Call amdgpu_device_fini only once from amdgpu_driver_unload_kms.
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAndrey Grodzovsky <andrey.grodzovsky@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

b6356df3

drm/amdgpu: fix "mitigate workaround for i915" · 2333bf9a

由 Christian König 提交于 3月 21, 2018

Mixed up exporter and importer here. E.g. while mapping the BO we need
to check the importer not the exporter.

Bug: https://bugs.freedesktop.org/show_bug.cgi?id=105633Signed-off-by: NChristian König <christian.koenig@amd.com>
Acked-by: NAlex Deucher <alexander.deucher@amd.com>
Tested-by: NMike Lothian <mike@fireburn.co.uk>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

2333bf9a

drm/amd/pp: Add smu irq handlers in sw_init instand of hw_init · 3296c4ae

由 Rex Zhu 提交于 3月 21, 2018

Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NRex Zhu <Rex.Zhu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

3296c4ae

drm/amd/pp: Refine register_thermal_interrupt function · 4d200372

由 Rex Zhu 提交于 3月 21, 2018

v2: add Vega12 support

1. delete useless argument in function register_thermal_interrupt
2. rename function name register_thermal_interrupt to register_irq_handlers
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NRex Zhu <Rex.Zhu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

4d200372

drm/amdgpu: Remove wrapper layer of cgs irq handling · 160b8e75

由 Rex Zhu 提交于 3月 20, 2018

v2: add Vega12 support

1. remove struct cgs_os_ops
2. delete cgs_linux.h
3. refine the irq code for vega10, can fix set pp table
   failed issue.
4. add common smu irq process function
Acked-by: NChristian König <christian.koenig@amd.com>
Acked-by: NJunwei Zhang <Jerry.Zhang@amd.com>
Signed-off-by: NRex Zhu <Rex.Zhu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

160b8e75

drm/amd/powerplay: Return per DPM level clock · 7436854e

由 Kenneth Feng 提交于 3月 20, 2018

Add change to return per DPM level clock in DAL interface
Signed-off-by: NKenneth Feng <kenneth.feng@amd.com>
Reviewed-by: NEvan Quan <evan.quan@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

7436854e

drm/amd/powerplay: Remove the SOC floor voltage setting · 7f3f106e

由 Kenneth Feng 提交于 3月 20, 2018

Remove W/A carried over from VG10 to set VDDSOC Floor Voltage
prior to enabling DPM since the VBIOS covers the floor voltage
setting now
Signed-off-by: NKenneth Feng <kenneth.feng@amd.com>
Reviewed-by: NEvan Quan <evan.quan@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

7f3f106e

drm/amdgpu: no job timeout setting on compute queues · f0c2b16b

由 Evan Quan 提交于 3月 15, 2018

Under some heavy computing environment(e.g. dgemm test), it
takes the asic over 10+ seconds to finish the dispatched job
which will trigger the timeout.

It's quite confusing although it does not seem to bring any
real problems. As a quick workround, we choose to not enfoce
the timeout setting on compute queues.
Signed-off-by: NEvan Quan <evan.quan@amd.com>
Acked-by: NAlex Deucher <alexander.deucher@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

f0c2b16b

drm/amdgpu: add vega12 pci ids (v2) · dc53d543

由 Alex Deucher 提交于 9月 01, 2017

v2: add additional pci ids
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

dc53d543

drm/amd/powerplay: add the hw manager for vega12 (v4) · 2cac05de

由 Evan Quan 提交于 3月 19, 2018

handles the driver power state setup

v2: squash in the following:
- handle negative temperature ranges
- add vega12 thermal ranges
- use ffs/fls
- remove ACG code
- resend NumOfDisplays message
- correct max dpm levels
- remove power containment settings
- fix warnings
- add sensors interface
- delete unused overdrive arbiter
- drop get_temperature callback
- smu table cleanup
- atomfirmware smu dpm table updates
v3: rebase
v4: rebase
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NEvan Quan <evan.quan@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

2cac05de

drm/amd/powerplay: add the smu manager for vega12 (v4) · fa969db4

由 Evan Quan 提交于 3月 19, 2018

handles the driver interaction with the smu firmware

v2: squash in:
- s3 fix for firmware loading
- smu loading through the psp
- unecessary calls to is_smc_ram_running()
- smu table cleanups
v3: rebase
v4: rebase, smu bo allocation fixes, add dpm running callback
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NEvan Quan <evan.quan@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

fa969db4

drm/amd/powerplay: add new pp_psm infrastructure for vega12 (v2) · d33edb64

由 Evan Quan 提交于 3月 19, 2018

New psm infrastructure for vega12.

v2: rebase (Alex)
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NEvan Quan <evan.quan@amd.com>
Acked-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

d33edb64

drm/amd/powerplay: update ppatomfwctl (v2) · 3503d588

由 Evan Quan 提交于 12月 25, 2017

Add new get_smc_dpm_information api to fetch the smu dpm
info from the vbios.

v2: deal with updated table format.
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NEvan Quan <evan.quan@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

3503d588

drm/amd/powerplay: add vega12_pptable.h · c042c9b4

由 Evan Quan 提交于 12月 25, 2017

Defines the power table format in the vbios.
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NEvan Quan <evan.quan@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

c042c9b4

drm/amd/powerplay: add vega12_ppsmc.h · c4a4f4b6

由 Evan Quan 提交于 12月 25, 2017

Defines the smc message interface with the driver.
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NEvan Quan <evan.quan@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

c4a4f4b6