提交 · b1fc2839d2f92d09da90d1e09156a73ddaba8a93 · openeuler / Kernel

28 10月, 2017 3 次提交

drm/msm: Implement preemption for A5XX targets · b1fc2839

由 Jordan Crouse 提交于 10月 20, 2017

Implement preemption for A5XX targets - this allows multiple
ringbuffers for different priorities with automatic preemption
of a lower priority ringbuffer if a higher one is ready.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

b1fc2839

drm/msm: Support multiple ringbuffers · f97decac

由 Jordan Crouse 提交于 10月 20, 2017

Add the infrastructure to support the idea of multiple ringbuffers.
Assign each ringbuffer an id and use that as an index for the various
ring specific operations.

The biggest delta is to support legacy fences. Each fence gets its own
sequence number but the legacy functions expect to use a unique integer.
To handle this we return a unique identifier for each submission but
map it to a specific ring/sequence under the covers. Newer users use
a dma_fence pointer anyway so they don't care about the actual sequence
ID or ring.

The actual mechanics for multiple ringbuffers are very target specific
so this code just allows for the possibility but still only defines
one ringbuffer for each target family.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

f97decac

drm/msm: Move memptrs to msm_gpu · cd414f3d

由 Jordan Crouse 提交于 10月 20, 2017

When we move to multiple ringbuffers we're going to store the data
in the memptrs on a per-ring basis. In order to prepare for that
move the current memptrs from the adreno namespace into msm_gpu.
This is way cleaner and immediately lets us kill off some sub
functions so there is much less cost later when we do move to
per-ring structs.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

cd414f3d

23 8月, 2017 1 次提交

drm/msm: Attach the GPU MMU when it is created · 1267a4df

由 Jordan Crouse 提交于 7月 27, 2017

Currently the GPU MMU is attached in the adreno_gpu code but as
more and more of the GPU initialization moves to the generic
GPU path we have a need to map and use GPU memory earlier and
earlier.  There isn't any reason to defer attaching the MMU
until later so attach it right after the address space is
created so it can be used immediately.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

1267a4df

17 6月, 2017 1 次提交

drm/msm: Separate locking of buffer resources from struct_mutex · 0e08270a

由 Sushmita Susheelendra 提交于 6月 13, 2017

Buffer object specific resources like pages, domains, sg list
need not be protected with struct_mutex. They can be protected
with a buffer object level lock. This simplifies locking and
makes it easier to avoid potential recursive locking scenarios
for SVM involving mmap_sem and struct_mutex. This also removes
unnecessary serialization when creating buffer objects, and also
between buffer object creation and GPU command submission.
Signed-off-by: NSushmita Susheelendra <ssusheel@codeaurora.org>
[robclark: squash in handling new locking for shrinker]
Signed-off-by: NRob Clark <robdclark@gmail.com>

0e08270a

16 6月, 2017 4 次提交

drm/msm: remove address-space id · 8432a903

由 Rob Clark 提交于 6月 13, 2017

Now that the msm_gem supports an arbitrary number of vma's, we no longer
need to assign an id (index) to each address space.  So rip out the
associated code.
Signed-off-by: NRob Clark <robdclark@gmail.com>

8432a903

drm/msm: pass address-space to _get_iova() and friends · 8bdcd949

由 Rob Clark 提交于 6月 13, 2017

No functional change, that will come later.  But this will make it
easier to deal with dynamically created address spaces (ie. per-
process pagetables for gpu).
Signed-off-by: NRob Clark <robdclark@gmail.com>

8bdcd949

drm/msm: fix locking inconsistency for gpu->hw_init() · cb1e3818

由 Rob Clark 提交于 6月 13, 2017

Most, but not all, paths where calling the with struct_mutex held.  The
fast-path in msm_gem_get_iova() (plus some sub-code-paths that only run
the first time) was masking this issue.

So lets just always hold struct_mutex for hw_init().  And sprinkle some
WARN_ON()'s and might_lock() to avoid this sort of problem in the
future.
Signed-off-by: NRob Clark <robdclark@gmail.com>

cb1e3818

drm/msm: Add a struct to pass configuration to msm_gpu_init() · 5770fc7a

由 Jordan Crouse 提交于 5月 08, 2017

The amount of information that we need to pass into msm_gpu_init()
is steadily increasing, so add a new struct to stabilize the function
call and make it easier to add new configuration down the line.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

5770fc7a

28 5月, 2017 1 次提交

drm/msm/gpu: check legacy clk names in get_clocks() · 134ccada

由 Rob Clark 提交于 5月 03, 2017

Otherwise if someone was using old bindings with "core_clk" instead of
"core" as the clock name, we'd never find it and gpu would be stuck at
27MHz (or whatever it's slowest rate is).

Fixes: 98db803f ("msm/drm: gpu: Dynamically locate the clocks from the device tree")
Signed-off-by: NRob Clark <robdclark@gmail.com>

134ccada

08 4月, 2017 4 次提交

msm/drm: gpu: Dynamically locate the clocks from the device tree · 98db803f

由 Jordan Crouse 提交于 3月 07, 2017

Instead of using a fixed list of clock names use the clock-names
list in the device tree to discover and get the list of clocks
that we need.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

98db803f

drm/msm: Hard code the GPU "slow frequency" · bf5af4ae

由 Jordan Crouse 提交于 3月 07, 2017

Some A3XX and A4XX GPU targets required that the GPU clock be
programmed to a non zero value when it was disabled so
27Mhz was chosen as the "invalid" frequency.

Even though newer targets do not have the same clock restrictions
we still write 27Mhz on clock disable and expect the clock subsystem
to round down to zero.

For unknown reasons even though the slow clock speed is always
27Mhz and it isn't actually a functional level the legacy device tree
frequency tables always defined it and then did gymnastics to work
around it.

Instead of playing the same silly games just hard code the "slow" clock
speed in the code as 27MHz and save ourselves a bit of infrastructure.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

bf5af4ae

drm/msm: Make sure to detach the MMU during GPU cleanup · 9873ef07

由 Jordan Crouse 提交于 2月 06, 2017

We should be detaching the MMU before destroying the address
space. To do this cleanly, the detach has to happen in
adreno_gpu_cleanup() because it needs access to structs
in adreno_gpu.c.  Plus it is better symmetry to have
the attach and detach at the same code level.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

9873ef07

drm/msm/gpu: use pm-runtime · eeb75474

由 Rob Clark 提交于 2月 10, 2017

We need to use pm-runtime properly when IOMMU is using device_link() to
control it's own clocks.
Signed-off-by: NRob Clark <robdclark@gmail.com>

eeb75474

04 4月, 2017 1 次提交

drm/msm: Make sure to detach the MMU during GPU cleanup · 028402d4

由 Jordan Crouse 提交于 2月 06, 2017

We should be detaching the MMU before destroying the address
space. To do this cleanly, the detach has to happen in
adreno_gpu_cleanup() because it needs access to structs
in adreno_gpu.c.  Plus it is better symmetry to have
the attach and detach at the same code level.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

028402d4

07 2月, 2017 1 次提交

drm/msm: drop _clk suffix from clk names · 720c3bb8

由 Rob Clark 提交于 1月 30, 2017

Suggested by Rob Herring.  We still support the old names for
compatibility with downstream android dt files.

Cc: Rob Herring <robh@kernel.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>
Reviewed-by: NEric Anholt <eric@anholt.net>
Acked-by: NRob Herring <robh@kernel.org>

720c3bb8

29 11月, 2016 3 次提交

drm/msm: gpu: Add A5XX target support · b5f103ab

由 Jordan Crouse 提交于 11月 28, 2016

Add support for the A5XX family of Adreno GPUs.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

b5f103ab

drm/msm: Remove 'src_clk' from adreno configuration · 89d777a5

由 Jordan Crouse 提交于 11月 28, 2016

The adreno code inherited a silly workaround from downstream
from the bad old days before decent clock control. grp_clk[0]
(named 'src_clk') doesn't actually exist - it was used as a proxy
for whatever the core clock actually was (usually 'core_clk').

All targets should be able to correctly request 'core_clk' and
get the right thing back so zap the anachronism and directly
use grp_clk[0] to control the clock rate.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

89d777a5

drm/msm: convert iova to 64b · 78babc16

由 Rob Clark 提交于 11月 11, 2016

For a5xx the gpu is 64b so we need to change iova to 64b everywhere.  On
the display side, iova is still 32b so it can ignore the upper bits.
(Although all the armv8 devices have an iommu that can map 64b pa to 32b
iova.)
Signed-off-by: NRob Clark <robdclark@gmail.com>

78babc16

28 11月, 2016 1 次提交

drm/msm: support multiple address spaces · 667ce33e

由 Rob Clark 提交于 9月 28, 2016

We can have various combinations of 64b and 32b address space, ie. 64b
CPU but 32b display and gpu, or 64b CPU and GPU but 32b display.  So
best to decouple the device iova's from mmap offset.
Signed-off-by: NRob Clark <robdclark@gmail.com>

667ce33e

25 10月, 2016 1 次提交

dma-buf: Rename struct fence to dma_fence · f54d1867

由 Chris Wilson 提交于 10月 25, 2016

I plan to usurp the short name of struct fence for a core kernel struct,
and so I need to rename the specialised fence/timeline for DMA
operations to make room.

A consensus was reached in
https://lists.freedesktop.org/archives/dri-devel/2016-July/113083.html
that making clear this fence applies to DMA operations was a good thing.
Since then the patch has grown a bit as usage increases, so hopefully it
remains a good thing!

(v2...: rebase, rerun spatch)
v3: Compile on msm, spotted a manual fixup that I broke.
v4: Try again for msm, sorry Daniel

coccinelle script:
@@

@@
- struct fence
+ struct dma_fence
@@

@@
- struct fence_ops
+ struct dma_fence_ops
@@

@@
- struct fence_cb
+ struct dma_fence_cb
@@

@@
- struct fence_array
+ struct dma_fence_array
@@

@@
- enum fence_flag_bits
+ enum dma_fence_flag_bits
@@

@@
(
- fence_init
+ dma_fence_init
|
- fence_release
+ dma_fence_release
|
- fence_free
+ dma_fence_free
|
- fence_get
+ dma_fence_get
|
- fence_get_rcu
+ dma_fence_get_rcu
|
- fence_put
+ dma_fence_put
|
- fence_signal
+ dma_fence_signal
|
- fence_signal_locked
+ dma_fence_signal_locked
|
- fence_default_wait
+ dma_fence_default_wait
|
- fence_add_callback
+ dma_fence_add_callback
|
- fence_remove_callback
+ dma_fence_remove_callback
|
- fence_enable_sw_signaling
+ dma_fence_enable_sw_signaling
|
- fence_is_signaled_locked
+ dma_fence_is_signaled_locked
|
- fence_is_signaled
+ dma_fence_is_signaled
|
- fence_is_later
+ dma_fence_is_later
|
- fence_later
+ dma_fence_later
|
- fence_wait_timeout
+ dma_fence_wait_timeout
|
- fence_wait_any_timeout
+ dma_fence_wait_any_timeout
|
- fence_wait
+ dma_fence_wait
|
- fence_context_alloc
+ dma_fence_context_alloc
|
- fence_array_create
+ dma_fence_array_create
|
- to_fence_array
+ to_dma_fence_array
|
- fence_is_array
+ dma_fence_is_array
|
- trace_fence_emit
+ trace_dma_fence_emit
|
- FENCE_TRACE
+ DMA_FENCE_TRACE
|
- FENCE_WARN
+ DMA_FENCE_WARN
|
- FENCE_ERR
+ DMA_FENCE_ERR
)
 (
 ...
 )
Signed-off-by: NChris Wilson <chris@chris-wilson.co.uk>
Reviewed-by: NGustavo Padovan <gustavo.padovan@collabora.co.uk>
Acked-by: NSumit Semwal <sumit.semwal@linaro.org>
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NDaniel Vetter <daniel.vetter@ffwll.ch>
Link: http://patchwork.freedesktop.org/patch/msgid/20161025120045.28839-1-chris@chris-wilson.co.uk

f54d1867

16 9月, 2016 1 次提交
- R
  drm/msm: move fence allocation out of msm_gpu_submit() · f44d32c7
  由 Rob Clark 提交于 6月 16, 2016
```
Prep work for next patch.
Signed-off-by: NRob Clark <robdclark@gmail.com>
```
  f44d32c7
08 5月, 2016 7 次提交

drm/msm: print offender task name on hangcheck recovery · 4816b626

由 Rob Clark 提交于 5月 03, 2016

Track the pid per submit, so we can print the name of the task which
submitted the batch that caused the gpu to hang.
Signed-off-by: NRob Clark <robdclark@gmail.com>

4816b626

R
drm/msm: fix leak in failed submit path · 40e6815b
由 Rob Clark 提交于 5月 03, 2016
```
Signed-off-by: NRob Clark <robdclark@gmail.com>
```
40e6815b

drm/msm: drop return from gpu->submit() · 1193c3bc

由 Rob Clark 提交于 5月 03, 2016

At this point, there is nothing left to fail.  And submit already has a
fence assigned and is added to the submit_list.  Any problems from here
on out are asynchronous (ie. hangcheck/recovery).
Signed-off-by: NRob Clark <robdclark@gmail.com>

1193c3bc

R
drm/msm: 'struct fence' conversion · b6295f9a
由 Rob Clark 提交于 3月 15, 2016
```
Signed-off-by: NRob Clark <robdclark@gmail.com>
```
b6295f9a

drm/msm: introduce msm_fence_context · ca762a8a

由 Rob Clark 提交于 3月 15, 2016

Better encapsulate the per-timeline stuff into fence-context.  For now
there is just a single fence-context, but eventually we'll also have one
per-CRTC to enable fully explicit fencing.
Signed-off-by: NRob Clark <robdclark@gmail.com>

ca762a8a

drm/msm/gpu: simplify tracking in-flight bo's · 7d12a279

由 Rob Clark 提交于 3月 16, 2016

Since we already track the array of bo's in the submit object, just
unconditionally take and drop ref's per submit (rather than only taking
ref's if bo is not already active). This simplifies later patches.
Signed-off-by: NRob Clark <robdclark@gmail.com>

7d12a279

R
drm/msm: move fence code to it's own file · fde5de6c
由 Rob Clark 提交于 3月 15, 2016
```
Signed-off-by: NRob Clark <robdclark@gmail.com>
```
fde5de6c

23 10月, 2015 1 次提交

drm/msm: Fix IOMMU clean up path in case msm_iommu_new() fails · 5e921b19

由 Stephane Viau 提交于 9月 15, 2015

msm_iommu_new() can fail and this change makes sure that we
detect the failure and free the allocated domain before going
any further.
Signed-off-by: NStephane Viau <sviau@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

5e921b19

12 6月, 2015 3 次提交

drm/msm: restart queued submits after hang · 1a370be9

由 Rob Clark 提交于 6月 07, 2015

Track the list of in-flight submits. If the gpu hangs, retire up to an
including the offending submit, and then re-submit the remainder. This
way, for concurrently running piglit tests (for example), one failing
test doesn't cause unrelated tests to fail simply because it's submit
was queued up after one that triggered a hang.
Signed-off-by: NRob Clark <robdclark@gmail.com>

1a370be9

drm/msm: adreno a306 support · de558cd2

由 Rob Clark 提交于 5月 06, 2015

As found in apq8016 (used in DragonBoard 410c) and msm8916.

Note that numerically a306 is actually 307 (since a305c already claimed
306).  Nice and confusing.
Signed-off-by: NRob Clark <robdclark@gmail.com>

de558cd2

drm/msm: clarify downstream bus scaling · 6490ad47

由 Rob Clark 提交于 6月 04, 2015

A few spots in the driver have support for downstream android
CONFIG_MSM_BUS_SCALING.  This is mainly to simplify backporting the
driver for various devices which do not have sufficient upstream
kernel support.  But the intentionally dead code seems to cause
some confusion.  Rename the #define to make this more clear.
Signed-off-by: NRob Clark <robdclark@gmail.com>

6490ad47

04 8月, 2014 2 次提交

drm/msm: fix potential deadlock in gpu init · a1ad3523

由 Rob Clark 提交于 7月 11, 2014

Somewhere along the way, the firmware loader sprouted another lock
dependency, resulting in possible deadlock scenario:

 &dev->struct_mutex --> &sb->s_type->i_mutex_key#2 --> &mm->mmap_sem

which is problematic vs things like gem mmap.

So introduce a separate mutex to synchronize gpu init.
Signed-off-by: NRob Clark <robdclark@gmail.com>

a1ad3523

drm/msm: use upstream iommu · 944fc36c

由 Rob Clark 提交于 7月 09, 2014

Downstream kernel IOMMU had a non-standard way of dealing with multiple
devices and multiple ports/contexts.  We don't need that on upstream
kernel, so rip out the crazy.

Note that we have to move the pinning of the ringbuffer to after the
IOMMU is attached.  No idea how that managed to work properly on the
downstream kernel.

For now, I am leaving the IOMMU port name stuff in place, to simplify
things for folks trying to backport latest drm/msm to device kernels.
Once we no longer have to care about pre-DT kernels, we can drop this
and instead backport upstream IOMMU driver.
Signed-off-by: NRob Clark <robdclark@gmail.com>

944fc36c

02 6月, 2014 2 次提交

R
drm/msm: add perf logging debugfs · 70c70f09
由 Rob Clark 提交于 5月 30, 2014
```
Signed-off-by: NRob Clark <robdclark@gmail.com>
```
70c70f09

drm/msm: add rd logging debugfs · a7d3c950

由 Rob Clark 提交于 5月 30, 2014

To ease debugging, add debugfs file which can be cat/tail'd to log
submits, along with fence #.  If GPU hangs, you can look at 'gpu'
debugfs file to find last completed fence and current register state,
and compare with logged rd file to narrow down the DRAW_INDX which
triggered the GPU hang.
Signed-off-by: NRob Clark <robdclark@gmail.com>

a7d3c950

31 3月, 2014 1 次提交

drm/msm: crank down gpu when inactive · 37d77c3a

由 Rob Clark 提交于 1月 11, 2014

Shut down the clks when the gpu has nothing to do.  A short inactivity
timer is used to provide a low pass filter for power transitions.
Signed-off-by: NRob Clark <robdclark@gmail.com>

37d77c3a

07 2月, 2014 1 次提交

drm/msm: bigger synchronization hammer · c2703b13

由 Rob Clark 提交于 2月 06, 2014

Because we use a list_head in the bo to track it's position in a submit,
we need to serialize at a higher layer.  Otherwise there are problems
when multiple contexts are SUBMIT'ing in parallel cmdstreams referencing
a shared bo.
Signed-off-by: NRob Clark <robdclark@gmail.com>

c2703b13

10 1月, 2014 1 次提交

drm/msm: add support for non-IOMMU systems · 871d812a

由 Rob Clark 提交于 11月 16, 2013

Add a VRAM carveout that is used for systems which do not have an IOMMU.

The VRAM carveout uses CMA.  The arch code must setup a CMA pool for the
device (preferrably in highmem.. a 256m-512m VRAM pool in lowmem is not
cool).  The user can configure the VRAM pool size using msm.vram module
param.

Technically, the abstraction of IOMMU behind msm_mmu is not strictly
needed, but it simplifies the GEM code a bit, and will be useful later
when I add support for a2xx devices with GPUMMU, so I decided to keep
this part.

It appears to be possible to configure the GPU to restrict access to
addresses within the VRAM pool, but this is not done yet.  So for now
the GPU will refuse to load if there is no sort of mmu.  Once address
based limits are supported and tested to confirm that we aren't giving
the GPU access to arbitrary memory, this restriction can be lifted
Signed-off-by: NRob Clark <robdclark@gmail.com>

871d812a

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功