提交 · f42e2c2429c0a76d99565e20d70d4fbf21c97498 · openanolis / cloud-kernel

17 3月, 2015 1 次提交

drm/amdkfd: Fix SDMA queue init. in non-HWS mode · 4fadf6b6

由 Ben Goz 提交于 3月 10, 2015

This patch fixes the SDMA queue initialization, when running in non-HWS mode.

The first fix is to move the initialization of SDMA VM parameters before the
initialization of the SDMA MQD.

The second fix is to load the MQD to an HQD after the initialization of the MQD.
Signed-off-by: NBen Goz <ben.goz@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

4fadf6b6

23 2月, 2015 2 次提交

drm/amdkfd: don't set get_pipes_num() as inline · 64ea8f4a

由 Oded Gabbay 提交于 2月 17, 2015

get_pipes_num() calls BUG_ON so we can't set it as inline because it produces a
warning as BUG_ON() uses static variables when it is expanded.
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>

64ea8f4a

drm/amdkfd: Initialize only amdkfd's assigned pipelines · 1365aa62

由 Oded Gabbay 提交于 2月 17, 2015

This patch fixes a bug in the initialization of the pipelines. The
init_pipelines() function was called with a constant value of 0 in the
first_pipe argument. This is an error because amdkfd doesn't handle pipe 0.

The correct way is to pass the value that get_first_pipe() returns as the
argument for first_pipe.

This bug appeared in 3.19 (first version with amdkfd) and it causes around 15%
drop in CPU performance of Kaveri (A10-7850).

v2: Don't set get_first_pipe() as inline because it calls BUG_ON()
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Cc: stable@vger.kernel.org
Tested-by: NMichel Dänzer <michel.daenzer@amd.com>

1365aa62

02 2月, 2015 1 次提交

drm/amdkfd: Fix bug in accounting of queues · 8b58f261

由 Oded Gabbay 提交于 1月 29, 2015

Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Reviewed-by: NJammy Zhou <Jammy.Zhou@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

8b58f261

22 1月, 2015 6 次提交

drm/amdkfd: Fix sparse errors · 0b3674ae

由 Oded Gabbay 提交于 1月 22, 2015

Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

0b3674ae

drm/amdkfd: Fix bug in call to init_pipelines() · 9fa843e7

由 Oded Gabbay 提交于 1月 22, 2015

This patch fixes a bug where the first_pipe index passed into init_pipelines()
was a #define instead of the value that is passed into amdkfd by radeon
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Reviewed-by: NJammy Zhou <Jammy.Zhou@amd.com>

9fa843e7

drm/amdkfd: Handle case of invalid queue type · 7113cd65

由 Oded Gabbay 提交于 1月 22, 2015

This patch handles a case where amdkfd tries to destroy a queue but the queue
type is invalid.
This case occurs in non-HWS path.
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Reviewed-by: NJammy Zhou <Jammy.Zhou@amd.com>

7113cd65

drm/amdkfd: Add break at the end of case · 300dec95

由 Oded Gabbay 提交于 1月 22, 2015

Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Reviewed-by: NJammy Zhou <Jammy.Zhou@amd.com>

300dec95

O
drm/amdkfd: Remove negative check of uint variable · 010b82e7
由 Oded Gabbay 提交于 1月 22, 2015
```
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Reviewed-by: NJammy Zhou <Jammy.Zhou@amd.com>
```
010b82e7

drm/amdkfd: Fix bug in pipelines initialization · 749042b0

由 Oded Gabbay 提交于 1月 22, 2015

This patch fixes a bug when calling to init_pipeline() interface.
The index that was passed to that function didn't take into account the
first_pipe value, which represents the first pipe index that is under amdkfd's
responsibility.
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Reviewed-by: NJammy Zhou <Jammy.Zhou@amd.com>

749042b0

20 1月, 2015 1 次提交

drm/amdkfd: Fix dqm->queue_count tracking · b6819cec

由 Jay Cornwall 提交于 1月 19, 2015

dqm->queue_count tracks queues in the active state only. In a few
places this count is modified unconditionally, leading to an incorrect
value when the UPDATE_QUEUE ioctl is used to make a queue inactive.
Signed-off-by: NJay Cornwall <jay.cornwall@amd.com>
Reviewed-by: NBen Goz <ben.goz@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>

b6819cec

18 1月, 2015 1 次提交

drm/amdkfd: Allow user to limit only queues per device · b8cbab04

由 Oded Gabbay 提交于 1月 18, 2015

This patch replaces the two current amdkfd module parameters with a new one.

The current parameters that are being replaced are:

- Maximum number of HSA processes
- Maximum number of queues per process

The new parameter that replaces them is called "Maximum queues per device"

This replacement achieves two goals:

- Allows the user to have as many HSA processes as it wants (until
  a maximum of 512 HSA processes in Kaveri).

- Removes the limitation the user had on maximum number of queues per HSA
  process. E.g. the user can now have processes which only have one queue and
  other processes which have hundreds of queues, while before the user
  couldn't have more than 128 queues per process (as default).

The default value of the new parameter is 4096 (32 * 128, which were the
defaults of the old parameters). There is almost no additional GART memory
required for the default case. As a reminder, this amount of queues requires a
little bit below 4MB of GART memory.

v2:
In addition, This patch defines a new counter for queues accounting in the DQM
structure. This is done because the current counter only counts active queues
which allows the user to create more queues than the
max_num_of_queues_per_device module parameter allows.

However, we need the current counter for the runlist packet build process, so
the solution is to have a dedicated counter for this accounting.
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Reviewed-by: NBen Goz <ben.goz@amd.com>

b8cbab04

15 1月, 2015 1 次提交

drm/amdkfd: Replace cpu_relax() with schedule() in DQM · 99331a51

由 Oded Gabbay 提交于 1月 15, 2015

In order not to occupy the current core and thus prevent the core from
servicing IOMMU PPR requests, this patch replaces the call in DQM to
cpu_relax() with a call to schedule().
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Acked-by: NAlex Deucher <alexander.deucher@amd.com>

99331a51

13 1月, 2015 1 次提交

drm/amdkfd: Fix for-loop when allocating HQD (non-HWS) · f0ec5b99

由 Ben Goz 提交于 1月 13, 2015

This patch fixes a minor bug in allocate_hqd(), where the loop run from the
next-to-allocate pipe until the number of pipes.

This is wrong because we need to consider the possibility where
next-to-allocate pipe is not 0, and thus, the for-loop only checks part of the
pipes and doesn't wrap-around, as it supposed to do.

Therefore, we add another counting variable to make sure we go over all the
pipes, regardless of where we start to look at the first iteration of the loop.

This bug only affected non-HWS mode. In HWS mode, the CP fw is responsible for
allocating the HQD.
Signed-off-by: NBen Goz <ben.goz@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Acked-by: NAlex Deucher <alexander.deucher@amd.com>

f0ec5b99

10 1月, 2015 3 次提交

drm/amdkfd: Using new gtt sa in amdkfd · a86aa3ca

由 Oded Gabbay 提交于 10月 26, 2014

This patch change the calls throughout the amdkfd driver from the old kfd-->kgd
interface to the new kfd gtt sa inside amdkfd

v2: change the new call in sdma code that appeared because of the sdma feature
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Reviewed-by: NAlexey Skidanov <Alexey.skidanov@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

a86aa3ca

drm/amdkfd: Add SDMA user-mode queues support to QCM · bcea3081

由 Ben Goz 提交于 1月 03, 2015

This patch adds support for SDMA user-mode queues to the QCM - the Queue
management system that manages queues-per-device and queues-per-process.

v2: Remove calls to interface function that initializes sdma engines.

v3: Use the new names of some of the defines.
Signed-off-by: NBen Goz <ben.goz@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

bcea3081

drm/amdkfd: Process-device data creation and lookup split · 093c7d8c

由 Alexey Skidanov 提交于 11月 18, 2014

This patch splits the current kfd_get_process_device_data() to two
functions, one that specifically creates a pdd and another one which
just do lookup.

This is done to enhance the readability and maintainability of the code.
Signed-off-by: NAlexey Skidanov <Alexey.Skidanov@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>

093c7d8c

08 1月, 2015 1 次提交
- O
  drm/amdkfd: Fix sparse warning (different address space) · 8dfe58b2
  由 Oded Gabbay 提交于 1月 08, 2015
```
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
```
  8dfe58b2
05 1月, 2015 2 次提交

drm/amdkfd: unmap VMID<-->PASID when relesing VMID (non-HWS) · 2030664b

由 Ben Goz 提交于 1月 05, 2015

This patch fixes a bug where deallocate_vmid() didn't actually unmap the
VMID<-->PASID mapping (in the registers).
That can cause undefined behavior.

This bug only occurs in non-HWS mode.
Signed-off-by: NBen Goz <ben.goz@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Acked-by: NAlex Deucher <alexander.deucher@amd.com>

2030664b

drm/amdkfd: Load mqd to hqd in non-HWS mode · 030e416b

由 Ben Goz 提交于 1月 04, 2015

This patch fixes a bug in DQM, where the MQD of a newly created compute queue
is not loaded to an HQD slot. As a result, the CP never reads packets from this
queue.

This bug happens only in non-HWS (hardware scheduling) mode. In HWS mode, the
CP is responsible of loading MQDs to HQDs slots.
Signed-off-by: NBen Goz <ben.goz@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Acked-by: NAlex Deucher <alexander.deucher@amd.com>

030e416b

08 12月, 2014 1 次提交

amdkfd: Fix accounting of device queues · b6ffbab8

由 Oded Gabbay 提交于 12月 07, 2014

This patch fixes a device QCM bug, where the number of queues were not
counted correctly for the operation of update queue. The count was incorrect
as there was no regard to the previous state of the queue.
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>

b6ffbab8

12 1月, 2015 2 次提交

drm/amdkfd: Add initial VI support for DQM · a22fc854

由 Ben Goz 提交于 1月 12, 2015

This patch starts to add support for the VI APU in the DQM module.

Because most (more than 90%) of the DQM code is shared among AMD's APUs, we
chose a design that performs most/all the code in the shared DQM file
(kfd_device_queue_manager.c). If there is H/W specific code to be executed,
than it is written in an asic-specific extension function for that H/W.

That asic-specific extension function is called from the shared function at the
appropriate time. This requires that for every asic-specific extension function
that is implemented in a specific ASIC, there will be an equivalent
implementation in ALL ASICs, even if those implementations are just stubs.

That way we achieve:

- Maintainability: by having one copy of most of the code, we only need to
  fix bugs at one locations

- Readability: very clear what is the shared code and what is done per ASIC

- Extensibility: very easy to add new H/W specific files/functions
Signed-off-by: NBen Goz <ben.goz@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

a22fc854

drm/amdkfd: Encapsulate DQM functions in ops structure · 45c9a5e4

由 Oded Gabbay 提交于 1月 12, 2015

This patch does some re-org on the device_queue_manager structure. It takes out
all the function pointers from the structure and puts them in a new structure,
called device_queue_manager_ops. Then, it puts an instance of that structure
inside device_queue_manager.

This re-org is done to prepare the DQM module to support more than one AMD APU
(Kaveri).
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

45c9a5e4

25 11月, 2014 1 次提交

amdkfd: Fix memory leak of mqds on dqm fini · 6f9d54fd

由 Oded Gabbay 提交于 11月 25, 2014

The mqds array members are not freed when dqm is uninitialized.
Reviewed-by: NBen Goz <Ben.Goz@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>

6f9d54fd

19 11月, 2014 1 次提交

amdkfd: Instead of using get function, use container_of · 52a5fdce

由 Alexey Skidanov 提交于 11月 19, 2014

Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlexey Skidanov <Alexey.Skidanov@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>

52a5fdce

21 11月, 2014 1 次提交

amdkfd: Fix memory leak on process deregistration · f5d896bb

由 Jay Cornwall 提交于 11月 20, 2014

struct device_process_node was allocated during process registration but
not released at process deregistration.
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NJay Cornwall <jay.cornwall@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>

f5d896bb

20 11月, 2014 1 次提交

amdkfd: fence_wait_timeout() can be static · d80d19bd

由 Oded Gabbay 提交于 11月 20, 2014

Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>

d80d19bd

18 8月, 2014 1 次提交

drm/amdkfd: Fix logic of destroy_queue_nocpsch() · c2e1b3a4

由 Ben Goz 提交于 8月 18, 2014

This patch rewrites destroy_queue_nocpsch() as the current logic that is
implemented in the function is completely flawed.

This function is used only in non-HWS mode.
Signed-off-by: NBen Goz <ben.goz@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Acked-by: NAlex Deucher <alexander.deucher@amd.com>

c2e1b3a4

04 1月, 2015 1 次提交

drm/amdkfd: Make KFD_MQD_TYPE enum types H/W agnostic · 85d258f9

由 Ben Goz 提交于 1月 04, 2015

As the MQD types are common across all AMD GPUs/APUs, let's remove the CIK part
from the name.
Signed-off-by: NBen Goz <ben.goz@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

85d258f9

03 1月, 2015 1 次提交

drm/amdkfd: Don't include header files from radeon · 71273adc

由 Oded Gabbay 提交于 1月 02, 2015

Because amdkfd will need to work both with radeon and amdgpu, don't include
header files that are in radeon's folder.

Instead, use the common amd include folder and move amdkfd specific defines to
amdkfd header files.
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

71273adc

27 10月, 2014 1 次提交

drm/amdkfd: Remove call to deprecated init_memory interface · fe502804

由 Ben Goz 提交于 10月 26, 2014

This patch removes a call to kfd-->kgd interface function that is doing H/W
initialization. That function is moved into radeon to be part of the common
H/W initialization sequence. The interface function will be deleted.
Signed-off-by: NBen Goz <ben.goz@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

fe502804

17 7月, 2014 1 次提交

amdkfd: Add device queue manager module · 64c7f8cf

由 Ben Goz 提交于 7月 17, 2014

The queue scheduler divides into two sections, one section is process bounded
and the other section is device bounded.
The device bounded section is handled by this module.
The DQM module handles queue setup, update and tear-down from the device side.
It also supports suspend/resume operation.

v3: Changed device_init, added the use of the new gart allocation functions an
Added documentation.

v4:

Fixed a race in DQM queue scheduler where dqm->lock must be held when accessing
dqm->queue_count and dqm->processes_count. This fixes runlist IB allocation
failures when DQM is under load.

Fixed race in DQM queue destruction where queues being destroyed must be
removed from qpd->queues_list prior to preemption, or concurrent queue
creation activity may reschedule them while their MQD is destroyed.

Fixed EOP queue size setting in CP_HPD_EOP_CONTROL, because the size is
specified as (log2(size_dwords)-1). The previous calculation assumed the
size was specified in bytes, which caused interference between EOP queues
when multiple MEC pipelines were active.

v5:

Move amdkfd from drm/radeon/ to drm/amd/
Change format of mqd structure to match latest KV firmware
Add support for AQL queues creation to enable working with open-source HSA
runtime
Remove unused unmap_queue function
Various fixes (Style, typos)
Signed-off-by: NBen Goz <ben.goz@amd.com>
Signed-off-by: NJay Cornwall <jay.cornwall@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@amd.com>

64c7f8cf

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功