提交 · 3a9016ba0e8f3cf4196eae99b5b18172c9b71045 · openanolis / cloud-kernel

14 12月, 2017 1 次提交

drm/msm: fix spelling mistake: "ringubffer" -> "ringbuffer" · 3a9016ba

由 Colin Ian King 提交于 11月 02, 2017

Trivial fix to spelling mistake in DRM_DEV_ERROR error message
Signed-off-by: NColin Ian King <colin.king@canonical.com>
Signed-off-by: NRob Clark <robdclark@gmail.com>

3a9016ba

28 10月, 2017 9 次提交

drm/msm: Implement preemption for A5XX targets · b1fc2839

由 Jordan Crouse 提交于 10月 20, 2017

Implement preemption for A5XX targets - this allows multiple
ringbuffers for different priorities with automatic preemption
of a lower priority ringbuffer if a higher one is ready.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

b1fc2839

drm/msm: Make the value of RB_CNTL (almost) generic · 4d87fc32

由 Jordan Crouse 提交于 10月 20, 2017

We use a global ringbuffer size and block size for all targets and
at least for 5XX preemption we need to know the value the RB_CNTL
in several locations so it makes sense to calculate it once and use
it everywhere.

The only monkey wrench is that we need to disable the RPTR shadow
for A430 targets but that only needs to be done once and doesn't
affect A5XX so we can or in the value at init time.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

4d87fc32

drm/msm: Shadow current pointer in the ring until command is complete · 4c7085a5

由 Jordan Crouse 提交于 10月 20, 2017

Add a shadow pointer to track the current command being written into
the ring. Don't commit it as 'cur' until the command is submitted.
Because 'cur' is used to construct the software copy of the wptr this
ensures that somebody peeking in on the ring doesn't assume that a
command is inflight while it is being written. This isn't a huge deal
with a single ring (though technically the hangcheck could assume
the system is prematurely busy when it isn't) but it will be rather
important for preemption where the decision to preempt is based
on a non-empty ringbuffer. Without a shadow an aggressive preemption
scheme could assume that the ringbuffer is non empty and switch to it
before the CPU is done writing the command and boom.

Even though preemption won't be supported for all targets because of
the way the code is organized it is simpler to make this generic for
all targets. The extra load for non-preemption targets should be
minimal.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

4c7085a5

drm/msm: Add a parameter query for the number of ringbuffers · a6e29a0e

由 Jordan Crouse 提交于 10月 20, 2017

In order to manage ringbuffer priority to its fullest userspace
should know how many ringbuffers it has to work with. Add a
parameter to return the number of active rings.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

a6e29a0e

drm/msm: Support multiple ringbuffers · f97decac

由 Jordan Crouse 提交于 10月 20, 2017

Add the infrastructure to support the idea of multiple ringbuffers.
Assign each ringbuffer an id and use that as an index for the various
ring specific operations.

The biggest delta is to support legacy fences. Each fence gets its own
sequence number but the legacy functions expect to use a unique integer.
To handle this we return a unique identifier for each submission but
map it to a specific ring/sequence under the covers. Newer users use
a dma_fence pointer anyway so they don't care about the actual sequence
ID or ring.

The actual mechanics for multiple ringbuffers are very target specific
so this code just allows for the possibility but still only defines
one ringbuffer for each target family.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

f97decac

drm/msm: Move memptrs to msm_gpu · cd414f3d

由 Jordan Crouse 提交于 10月 20, 2017

When we move to multiple ringbuffers we're going to store the data
in the memptrs on a per-ring basis. In order to prepare for that
move the current memptrs from the adreno namespace into msm_gpu.
This is way cleaner and immediately lets us kill off some sub
functions so there is much less cost later when we do move to
per-ring structs.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

cd414f3d

drm/msm/adreno: deal with linux-firmware fw paths · 2c41ef1b

由 Rob Clark 提交于 10月 16, 2017

When firmware was added to linux-firmware, it was put in a qcom sub-
directory, unlike what we'd been using before.  For a300_pfp.fw and
a300_pm4.fw symlinks were created, but we'd prefer not to have to do
this in the future.  So add support to look in both places when
loading firmware.
Signed-off-by: NRob Clark <robdclark@gmail.com>

2c41ef1b

R
drm/msm/adreno: split out helper to load fw · e8f3de96
由 Rob Clark 提交于 10月 16, 2017
```
Prep work for the next patch.
Signed-off-by: NRob Clark <robdclark@gmail.com>
```
e8f3de96

drm/msm/adreno: load gpu at probe/bind time · eec874ce

由 Rob Clark 提交于 10月 16, 2017

Previously, in an effort to defer initializing the gpu until firmware
was available (ie. rootfs mounted), the gpu was not loaded at when the
subdevice was bound. Which resulted that clks/etc were requested in a
place that devm couldn't really help unwind if something failed.

Instead move request_firmware() to gpu->hw_init() and construct the gpu
earlier in adreno_bind(). To avoid the rest of the driver needing to
be aware of a gpu that hasn't managed to load firmware and hw_init()
yet, stash the gpu ptr in the adreno device's drvdata, and don't set
priv->gpu() until hw_init() succeeds.
Signed-off-by: NRob Clark <robdclark@gmail.com>

eec874ce

23 8月, 2017 2 次提交

drm/msm: Add a helper function for in-kernel buffer allocations · 8223286d

由 Jordan Crouse 提交于 7月 27, 2017

Nearly all of the buffer allocations for kernel allocate an buffer object,
virtual address and GPU iova at the same time. Make a helper function to
handle the details.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
[dropped msm_fbdev conversion to new helper, since it interferes with
display-handover work, where we want to separate allocation and mapping]
Signed-off-by: NRob Clark <robdclark@gmail.com>

8223286d

drm/msm: Attach the GPU MMU when it is created · 1267a4df

由 Jordan Crouse 提交于 7月 27, 2017

Currently the GPU MMU is attached in the adreno_gpu code but as
more and more of the GPU initialization moves to the generic
GPU path we have a need to map and use GPU memory earlier and
earlier.  There isn't any reason to defer attaching the MMU
until later so attach it right after the address space is
created so it can be used immediately.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

1267a4df

02 8月, 2017 1 次提交

drm/msm/adreno: Prevent unclocked access when retrieving timestamps · 541de4c9

由 Archit Taneja 提交于 7月 28, 2017

msm_gpu's get_timestamp() op (called by the MSM_GET_PARAM ioctl) can
result in register accesses. We need our power domain and clocks to
be active for that. Make sure they are enabled here.
Signed-off-by: NArchit Taneja <architt@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

541de4c9

17 6月, 2017 1 次提交

drm/msm: Separate locking of buffer resources from struct_mutex · 0e08270a

由 Sushmita Susheelendra 提交于 6月 13, 2017

Buffer object specific resources like pages, domains, sg list
need not be protected with struct_mutex. They can be protected
with a buffer object level lock. This simplifies locking and
makes it easier to avoid potential recursive locking scenarios
for SVM involving mmap_sem and struct_mutex. This also removes
unnecessary serialization when creating buffer objects, and also
between buffer object creation and GPU command submission.
Signed-off-by: NSushmita Susheelendra <ssusheel@codeaurora.org>
[robclark: squash in handling new locking for shrinker]
Signed-off-by: NRob Clark <robdclark@gmail.com>

0e08270a

16 6月, 2017 4 次提交

drm/msm: pass address-space to _get_iova() and friends · 8bdcd949

由 Rob Clark 提交于 6月 13, 2017

No functional change, that will come later.  But this will make it
easier to deal with dynamically created address spaces (ie. per-
process pagetables for gpu).
Signed-off-by: NRob Clark <robdclark@gmail.com>

8bdcd949

drm/msm: fix locking inconsistency for gpu->hw_init() · cb1e3818

由 Rob Clark 提交于 6月 13, 2017

Most, but not all, paths where calling the with struct_mutex held.  The
fast-path in msm_gem_get_iova() (plus some sub-code-paths that only run
the first time) was masking this issue.

So lets just always hold struct_mutex for hw_init().  And sprinkle some
WARN_ON()'s and might_lock() to avoid this sort of problem in the
future.
Signed-off-by: NRob Clark <robdclark@gmail.com>

cb1e3818

drm/msm: Remove memptrs->wptr · 42a105e9

由 Jordan Crouse 提交于 5月 08, 2017

memptrs->wptr seems to be unused. Remove it to avoid
confusing the upcoming preemption code.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

42a105e9

drm/msm: Add a struct to pass configuration to msm_gpu_init() · 5770fc7a

由 Jordan Crouse 提交于 5月 08, 2017

The amount of information that we need to pass into msm_gpu_init()
is steadily increasing, so add a new struct to stabilize the function
call and make it easier to add new configuration down the line.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

5770fc7a

08 4月, 2017 7 次提交

drm/msm: Hard code the GPU "slow frequency" · bf5af4ae

由 Jordan Crouse 提交于 3月 07, 2017

Some A3XX and A4XX GPU targets required that the GPU clock be
programmed to a non zero value when it was disabled so
27Mhz was chosen as the "invalid" frequency.

Even though newer targets do not have the same clock restrictions
we still write 27Mhz on clock disable and expect the clock subsystem
to round down to zero.

For unknown reasons even though the slow clock speed is always
27Mhz and it isn't actually a functional level the legacy device tree
frequency tables always defined it and then did gymnastics to work
around it.

Instead of playing the same silly games just hard code the "slow" clock
speed in the code as 27MHz and save ourselves a bit of infrastructure.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

bf5af4ae

drm/msm: Add MSM_PARAM_GMEM_BASE · e3689e47

由 Jordan Crouse 提交于 3月 07, 2017

User space needs to know where the GMEM whole starts so that they
can set up the addressing correctly.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

e3689e47

drm/msm: Reference count address spaces · ee546cd3

由 Jordan Crouse 提交于 3月 07, 2017

There are reasons for a memory object to outlive the file descriptor
that created it and so the address space that a buffer object is
attached to must also outlive the file descriptor. Reference count
the address space so that it can remain viable until all the objects
have released their addresses.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

ee546cd3

drm/msm: Make sure to detach the MMU during GPU cleanup · 9873ef07

由 Jordan Crouse 提交于 2月 06, 2017

We should be detaching the MMU before destroying the address
space. To do this cleanly, the detach has to happen in
adreno_gpu_cleanup() because it needs access to structs
in adreno_gpu.c.  Plus it is better symmetry to have
the attach and detach at the same code level.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

9873ef07

drm/msm/adreno: reset ringbuffer in hw_init · de098e5f

由 Rob Clark 提交于 2月 12, 2017

We need to do this also in resume path when we need to re-hw_init().
Signed-off-by: NRob Clark <robdclark@gmail.com>

de098e5f

drm/msm/gpu: use pm-runtime · eeb75474

由 Rob Clark 提交于 2月 10, 2017

We need to use pm-runtime properly when IOMMU is using device_link() to
control it's own clocks.
Signed-off-by: NRob Clark <robdclark@gmail.com>

eeb75474

drm/msm/gpu: move suspend/resume into debugfs->show · c3c3ab19

由 Rob Clark 提交于 2月 10, 2017

Each of the per-generation callbacks was doing this.  Lets just simplify
and move it into toplevel show() fxn.
Signed-off-by: NRob Clark <robdclark@gmail.com>

c3c3ab19

04 4月, 2017 1 次提交

drm/msm: Make sure to detach the MMU during GPU cleanup · 028402d4

由 Jordan Crouse 提交于 2月 06, 2017

We should be detaching the MMU before destroying the address
space. To do this cleanly, the detach has to happen in
adreno_gpu_cleanup() because it needs access to structs
in adreno_gpu.c.  Plus it is better symmetry to have
the attach and detach at the same code level.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

028402d4

07 2月, 2017 1 次提交

drm/msm: drop quirks binding · 4e09b95d

由 Rob Clark 提交于 1月 30, 2017

This was never documented or used in upstream dtb.  It is used by
downstream bindings from android device kernels.  But the quirks are
a property of the gpu revision, and as such are redundant to be listed
separately in dt.  Instead, move the quirks to the device table.
Signed-off-by: NRob Clark <robdclark@gmail.com>
Reviewed-by: NEric Anholt <eric@anholt.net>

4e09b95d

13 1月, 2017 1 次提交

drm/msm: fix potential null ptr issue in non-iommu case · de85d2b3

由 Rob Clark 提交于 1月 12, 2017

Fixes: 9cb07b099fb ("drm/msm: support multiple address spaces")
Reported-by: NRiku Voipio <riku.voipio@linaro.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

de85d2b3

30 12月, 2016 1 次提交

drm/msm: Ensure that the hardware write pointer is valid · 88b333b0

由 Jordan Crouse 提交于 12月 20, 2016

Currently the value written to CP_RB_WPTR is calculated on the fly as
(rb->next - rb->start). But as the code is designed rb->next is wrapped
before writing the commands so if a series of commands happened to
fit perfectly in the ringbuffer, rb->next would end up being equal to
rb->size / 4 and thus result in an out of bounds address to CP_RB_WPTR.

The easiest way to fix this is to mask WPTR when writing it to the
hardware; it makes the hardware happy and the rest of the ringbuffer
math appears to work and there isn't any point in upsetting anything.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
[squash in is_power_of_2() check]
Signed-off-by: NRob Clark <robdclark@gmail.com>

88b333b0

29 11月, 2016 5 次提交

drm/msm: gpu: Add A5XX target support · b5f103ab

由 Jordan Crouse 提交于 11月 28, 2016

Add support for the A5XX family of Adreno GPUs.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

b5f103ab

drm/msm: Disable interrupts during init · 4ac277cd

由 Jordan Crouse 提交于 11月 28, 2016

Disable the interrupt during the init sequence to avoid having
interrupts fired for errors and other things that we are not
ready to handle while initializing.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

4ac277cd

drm/msm: Add adreno_gpu_write64() · fb039981

由 Jordan Crouse 提交于 11月 28, 2016

Add a new generic function to write a "64" bit value. This isn't
actually a 64 bit operation, it just writes the upper and lower
32 bit of a 64 bit value to a specified LO and HI register.  If
a particular target doesn't support one of the registers it can
mark that register as SKIP and writes/reads from that register
will be quietly dropped.

This can be immediately put in place for the ringbuffer base and
the RPTR address.  Both writes are converted to use
adreno_gpu_write64() with their respective high and low registers
and the high register appropriately marked as SKIP for both 32 bit
targets (a3xx and a4xx). When a5xx comes it will define valid target
registers for the 'hi' option and everything else will just work.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

fb039981

drm/msm: gpu: Return error on hw_init failure · c4a8d475

由 Jordan Crouse 提交于 11月 28, 2016

When the GPU hardware init function fails (like say, ME_INIT timed
out) return error instead of blindly continuing on. This gives us
a small chance of saving the system before it goes boom.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

c4a8d475

drm/msm/adreno: move scratch register dumping to per-gen code · 398efc46

由 Rob Clark 提交于 11月 11, 2016

Scratch registers move, annoyingly enough, in a5xx.  Move to
per-generation aNxx_recover() fxn.
Signed-off-by: NRob Clark <robdclark@gmail.com>

398efc46

28 11月, 2016 1 次提交

drm/msm: support multiple address spaces · 667ce33e

由 Rob Clark 提交于 9月 28, 2016

We can have various combinations of 64b and 32b address space, ie. 64b
CPU but 32b display and gpu, or 64b CPU and GPU but 32b display.  So
best to decouple the device iova's from mmap offset.
Signed-off-by: NRob Clark <robdclark@gmail.com>

667ce33e

16 7月, 2016 2 次提交

drm/msm: deal with arbitrary # of cmd buffers · 6b597ce2

由 Rob Clark 提交于 6月 01, 2016

For some optimizations coming on the userspace side, splitting larger
draw or gmem cmds into multiple cmdstream buffers, we need to support
much more than the previous small/arbitrary limit.
Signed-off-by: NRob Clark <robdclark@gmail.com>

6b597ce2

drm/msm: change gem->vmap() to get/put · 18f23049

由 Rob Clark 提交于 5月 26, 2016

Before we can add vmap shrinking, we really need to know which vmap'ings
are currently being used.  So switch to get/put interface.  Stubbed put
fxns for now.
Signed-off-by: NRob Clark <robdclark@gmail.com>

18f23049

05 6月, 2016 1 次提交

drm/msm: deal with exhausted vmap space better · 69a834c2

由 Rob Clark 提交于 5月 24, 2016

Some, but not all, callers of obj->vmap() would check if return
IS_ERR().  So let's actually return an error if vmap() fails.  And fixup
the call-sites that were not handling this properly.
Signed-off-by: NRob Clark <robdclark@gmail.com>

69a834c2

08 5月, 2016 2 次提交

drm/msm: drop return from gpu->submit() · 1193c3bc

由 Rob Clark 提交于 5月 03, 2016

At this point, there is nothing left to fail.  And submit already has a
fence assigned and is added to the submit_list.  Any problems from here
on out are asynchronous (ie. hangcheck/recovery).
Signed-off-by: NRob Clark <robdclark@gmail.com>

1193c3bc

drm/msm: fix ->last_fence() after recover · 27557343

由 Rob Clark 提交于 3月 17, 2016

It is no longer true that we discard all in-flight submits on recover
(these days we only discard the first one that hung).  After the first
re-submitted batch completes it would overwrite the fence with a correct
value, but there would be a window of time which showed all re-submitted
batches as already complete.
Signed-off-by: NRob Clark <robdclark@gmail.com>

27557343

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功