Commits · d674c963af7470b6394b4b7f98cf2716b3a757d7 · openeuler / Kernel

20 Apr, 2019 1 commit

drm/msm/gpu: add per-process pagetables param · d674c963

Authored by Rob Clark on Apr 15, 2019

For now it always returns '0' (false), but once the iommu work is in
place to enable per-process pagetables we can update the value returned.

Userspace needs to know this to make an informed decision about exposing
KHR_robustness.
Signed-off-by: Rob Clark <robdclark@chromium.org>
Reviewed-by: Jordan Crouse <jcrouse@codeaurora.org>

d674c963

19 Feb, 2019 1 commit

drm/msm/a6xx: Add support for an interconnect path · fcf9d0b7

Authored by Jordan Crouse on Feb 12, 2019

Try to get the interconnect path for the GPU and vote for the maximum
bandwidth to support all frequencies. This is needed for performance.
Later we will want to scale the bandwidth based on the frequency to
also optimize for power but that will require some device tree
infrastructure that does not yet exist.

v6: use icc_set_bw() instead of icc_set()
v5: Remove hardcoded interconnect name and just use the default
v4: Don't use a port string at all to skip the need for names in the DT
v3: Use macros and change port string per Georgi Djakov
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Acked-by: Rob Clark <robdclark@gmail.com>
Reviewed-by: Evan Green <evgreen@chromium.org>
Signed-off-by: Georgi Djakov <georgi.djakov@linaro.org>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

fcf9d0b7

29 Jan, 2019 1 commit

drm/msm/gpu: Remove hardcoded interrupt name · 2255f244

Authored by Jordan Crouse on Dec 18, 2018

Every GPU core only has one interrupt so there isn't any
value in looking up the interrupt by name. Remove the name (which
is legacy anyway) and use platform_get_irq() instead.
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Reviewed-by: Douglas Anderson <dianders@chromium.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

2255f244

25 Jan, 2019 1 commit

drm/msm/gpu: Remove hardcoded interrupt name · 878411ae

Authored by Jordan Crouse on Dec 18, 2018

Every GPU core only has one interrupt so there isn't any
value in looking up the interrupt by name. Remove the name (which
is legacy anyway) and use platform_get_irq() instead.
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Reviewed-by: Douglas Anderson <dianders@chromium.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

878411ae

12 Dec, 2018 7 commits

drm/msm: implement a2xx mmu · c2052a4e

Authored by Jonathan Marek on Nov 14, 2018

A2XX has its own very simple MMU.

Added a msm_use_mmu() function because we can't rely on iommu_present to
decide to use MMU or not.
Signed-off-by: Jonathan Marek <jonathan@marek.ca>
Signed-off-by: Rob Clark <robdclark@gmail.com>

c2052a4e

drm/msm/adreno: add a2xx · 21af872c

Authored by Jonathan Marek on Nov 21, 2018

derived from the a3xx driver and tested on the following hardware:
imx51-zii-rdu1 (a200 with 128kb gmem)
imx53-qsrb (a200)
msm8060-tenderloin (a220)
Signed-off-by: Jonathan Marek <jonathan@marek.ca>
Reviewed-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

21af872c

drm/msm: Optimize adreno_show_object() · 1df4289d

Authored by Sharat Masetty on Nov 01, 2018

When the userspace tries to read the crashstate dump, the read side
implementation in the driver currently ascii85 encodes all the binary
buffers and it does this each time the read system call is called.
A userspace tool like cat typically does a page by page read and the
number of read calls depends on the size of the data captured by the
driver. This is certainly not desirable and does not scale well with
large captures.

This patch encodes the buffer only once in the read path. With this there
is an immediate >10X speed improvement in crashstate save time.
Signed-off-by: Sharat Masetty <smasetty@codeaurora.org>
Reviewed-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

1df4289d
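
The encode-once idea in the commit above can be sketched in userspace C. This is only an illustration under stated assumptions: `struct captured_buf`, `captured_buf_text()` and `captured_buf_read()` are hypothetical names, and a hex encoder stands in for ascii85 to keep the sketch short. The point is that the text form is produced on the first read and then served from a cache for every later page-sized read.

```c
#include <assert.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in for one captured binary buffer. */
struct captured_buf {
    const unsigned char *data;  /* raw binary capture */
    size_t len;                 /* size of data in bytes */
    char *encoded;              /* cached text form, NULL until first read */
    unsigned encode_calls;      /* instrumentation for this sketch only */
};

/* Stand-in encoder: hex instead of ascii85 (an assumption of this sketch). */
static char *encode_hex(const unsigned char *data, size_t len)
{
    char *out = malloc(len * 2 + 1);

    if (!out)
        return NULL;
    for (size_t i = 0; i < len; i++)
        snprintf(out + i * 2, 3, "%02x", data[i]);
    out[len * 2] = '\0';
    return out;
}

/* Return the text form, running the encoder only while no cached copy exists. */
static const char *captured_buf_text(struct captured_buf *buf)
{
    if (!buf->encoded) {
        buf->encoded = encode_hex(buf->data, buf->len);
        buf->encode_calls++;
    }
    return buf->encoded;
}

/* Copy up to count bytes of text starting at offset, like one read() call. */
static size_t captured_buf_read(struct captured_buf *buf, char *dst,
                                size_t count, size_t offset)
{
    const char *text = captured_buf_text(buf);
    size_t total;

    if (!text)
        return 0;
    total = strlen(text);
    if (offset >= total)
        return 0;
    if (count > total - offset)
        count = total - offset;
    memcpy(dst, text + offset, count);
    return count;
}
```

With this shape, a reader that pulls the dump in many small chunks pays the encoding cost once, however many read calls it issues.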

drm/msm/gpu: Map the ringbuffer in the iova at create time · 84c61275

Authored by Jordan Crouse on Nov 07, 2018

For reasons that I'm sure made perfect sense at the time we were
opting to defer the iova alloc / pin on the ringbuffer until HW
init time so when we moved to iova reference counting we ended
up adding a reference count every time the hardware started.
Not that it mattered (because the ring is always around) but
it did make the debug output look odd. Allocate and pin the iova
at create time instead.
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

84c61275

drm/msm: Add msm_gem_get_and_pin_iova() · 9fe041f6

Authored by Jordan Crouse on Nov 07, 2018

Add a new function to get and pin the iova memory in one
step (basically renaming the old msm_gem_get_iova function)
and switch msm_gem_get_iova() to only allocate an iova but
not map it in the IOMMU. This is only currently used by
msm_ioctl_gem_info() since all other users of the iova
expect that the memory be immediately available.
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

9fe041f6

drm/msm/adreno: Don't capture register values if target doesn't define them · b9fc2302

Authored by Jordan Crouse on Nov 02, 2018

If the GPU target doesn't define a list of registers then gracefully skip
capturing and/or printing them. This is used by more complex targets like
6xx that have other means of capturing register values.
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

b9fc2302

drm: msm: Use DRM_DEV_* instead of dev_* · 6a41da17

Authored by Mamta Shukla on Oct 20, 2018

Use DRM_DEV_INFO/ERROR/WARN instead of dev_info/err/debug to generate
drm-formatted specific log messages so that it will be easy to
differentiate in case of multiple instances of driver.
Signed-off-by: Mamta Shukla <mamtashukla555@gmail.com>
Signed-off-by: Rob Clark <robdclark@gmail.com>

6a41da17

24 Oct, 2018 1 commit

drm/msm: fix OF child-node lookup · f9a70823

Authored by Johan Hovold on Aug 27, 2018

Use the new of_get_compatible_child() helper to lookup the legacy
pwrlevels child node instead of using of_find_compatible_node(), which
searches the entire tree from a given start node and thus can return an
unrelated (i.e.  non-child) node.

This also addresses a potential use-after-free (e.g. after probe
deferral) as the tree-wide helper drops a reference to its first
argument (i.e. the probed device's node).

While at it, also fix the related child-node reference leak.

Fixes: e2af8b6b ("drm/msm: gpu: Use OPP tables if we can")
Cc: stable <stable@vger.kernel.org>     # 4.12
Cc: Jordan Crouse <jcrouse@codeaurora.org>
Cc: Rob Clark <robdclark@gmail.com>
Cc: David Airlie <airlied@linux.ie>
Signed-off-by: Johan Hovold <johan@kernel.org>
Signed-off-by: Rob Herring <robh@kernel.org>

f9a70823
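
The difference between a child-only lookup and a tree-wide lookup can be shown with a toy tree. This is a sketch, not the device tree API: `struct node`, `find_compatible_child()` and `find_compatible_in_tree()` are hypothetical, and the node names and compatible string used with them are only examples. Reference counting, which the commit also fixes, is not modelled here.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical toy node; real device tree nodes carry far more state. */
struct node {
    const char *name;
    const char *compatible;   /* may be NULL */
    struct node *child;       /* first child */
    struct node *sibling;     /* next sibling */
};

/* Search only the direct children of parent. */
static struct node *find_compatible_child(struct node *parent,
                                          const char *compat)
{
    for (struct node *c = parent->child; c; c = c->sibling)
        if (c->compatible && strcmp(c->compatible, compat) == 0)
            return c;
    return NULL;
}

/* Search n, its descendants, and its following siblings (depth first). */
static struct node *find_compatible_in_tree(struct node *n,
                                            const char *compat)
{
    struct node *hit;

    if (!n)
        return NULL;
    if (n->compatible && strcmp(n->compatible, compat) == 0)
        return n;
    hit = find_compatible_in_tree(n->child, compat);
    if (hit)
        return hit;
    return find_compatible_in_tree(n->sibling, compat);
}
```

If the device being probed has no matching child but some other device does, the tree-wide search returns that unrelated node, while the child-only search correctly returns NULL.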

11 Aug, 2018 1 commit

drm/msm/adreno: Load the firmware before bringing up the hardware · 2c087a33

Authored by Jordan Crouse on Aug 06, 2018

Failure to load firmware is the primary reason to fail adreno_load_gpu().
Try to load it first before going into the hardware initialization code and
unwinding it. This is important for a6xx because the GMU gets loaded from
the runtime power code and it is more costly to fail in that path because
of missing firmware.
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

2c087a33

05 Aug, 2018 1 commit

drm/msm/adreno: Remove VLA usage · bec2dd69

Authored by Kees Cook on Jun 29, 2018

In the quest to remove all stack VLA usage from the kernel[1], this
switches to using a kasprintf()ed buffer. Return paths are updated
to free the allocation.

[1] https://lkml.kernel.org/r/CA+55aFzCG-zNmZwX4A2FQpadafLfEzK6CC=qPXydAacU1RqZWA@mail.gmail.com
Signed-off-by: Kees Cook <keescook@chromium.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

bec2dd69
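
The pattern described above, replacing a stack VLA with a formatted heap buffer and freeing it on every return path, can be sketched in userspace C. Here `alloc_printf()` is a hypothetical stand-in for kasprintf(), and `prefixed_name_fits()` with its prefix and limit arguments is invented purely to show the return paths.

```c
#include <assert.h>
#include <stdarg.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Userspace stand-in for kasprintf(): format into a freshly allocated buffer. */
static char *alloc_printf(const char *fmt, ...)
{
    va_list ap, ap2;
    char *buf;
    int len;

    va_start(ap, fmt);
    va_copy(ap2, ap);
    len = vsnprintf(NULL, 0, fmt, ap);   /* first pass: measure */
    va_end(ap);
    if (len < 0) {
        va_end(ap2);
        return NULL;
    }
    buf = malloc((size_t)len + 1);
    if (buf)
        vsnprintf(buf, (size_t)len + 1, fmt, ap2);   /* second pass: write */
    va_end(ap2);
    return buf;
}

/*
 * Hypothetical caller: returns 1 if "prefix/name" fits in limit bytes
 * (including the terminator), 0 if it does not, -1 on allocation failure.
 * Every path that got a buffer frees it before returning.
 */
static int prefixed_name_fits(const char *prefix, const char *name,
                              size_t limit)
{
    char *full = alloc_printf("%s/%s", prefix, name);

    if (!full)
        return -1;
    if (strlen(full) >= limit) {
        free(full);
        return 0;
    }
    free(full);
    return 1;
}
```

The cost of the heap version is that the allocation can fail and must be released, which is why the early-return path needs its own free().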

30 Jul, 2018 8 commits

drm/msm/gpu: avoid deprecated do_gettimeofday · 3530a17f

Authored by Arnd Bergmann on Jul 26, 2018

All users of do_gettimeofday() have been removed, but this one recently
crept in, along with an incorrect printing of the microseconds portion.

This converts it to using ktime_get_real_timespec64() as a direct
replacement, and adds the leading zeroes. I considered using monotonic
times (ktime_get()) instead, but as this timestamp appears to only
be used for humans rather than compared with other timestamps, the
real time domain is probably good enough.

Fixes: e43b045e2c82 ("drm/msm/gpu: Capture the state of the GPU")
Signed-off-by: Arnd Bergmann <arnd@arndb.de>
Signed-off-by: Rob Clark <robdclark@gmail.com>

3530a17f
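
The "leading zeroes" problem mentioned above is easy to reproduce in plain C. The two helpers below are hypothetical and only illustrate the formatting issue for a seconds plus microseconds pair; they are not the driver's code, and the field width the driver uses depends on the unit it prints.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Buggy form: the fractional part is printed without zero padding. */
static void format_time_unpadded(char *buf, size_t len,
                                 long long sec, long usec)
{
    snprintf(buf, len, "%lld.%ld", sec, usec);
}

/* Fixed form: microseconds are always six digits wide. */
static void format_time_padded(char *buf, size_t len,
                               long long sec, long usec)
{
    snprintf(buf, len, "%lld.%06ld", sec, usec);
}
```

For 12 seconds and 5000 microseconds, the unpadded form prints 12.5000, which reads as twelve and a half seconds, while the padded form prints 12.005000.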

drm/msm/gpu: Add the buffer objects from the submit to the crash dump · cdb95931

Authored by Jordan Crouse on Jul 24, 2018

For hangs, copy out the contents of the buffer objects attached to the
guilty submission and print them in the crash dump report.
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

cdb95931

drm/msm/adreno: Add a5xx specific registers for the GPU state · 50f8d218

Authored by Jordan Crouse on Jul 24, 2018

HLSQ, SP and TP registers are only accessible from a special
aperture and to make matters worse the aperture is blocked from
the CPU on targets that can support secure rendering. Luckily the
GPU hardware has its own purpose built register dumper that can
access the registers from the aperture. Add a5xx specific code
to program the crashdumper and retrieve the wayward registers
and dump them for the crash state.

Also, remove a block of registers from the regular CPU accessible
list that aren't useful for debug which helps reduce the size
of the crash state file by a goodly amount.
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

50f8d218

drm/msm/adreno: Add ringbuffer data to the GPU state · 43a56687

Authored by Jordan Crouse on Jul 24, 2018

Add the contents of each ringbuffer to the GPU state and dump the
data in the crash file encoded with ascii85. To save space only
the used portions of the ringbuffer are dumped.
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

43a56687
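
A rough sketch of dumping only the used part of a ring as ascii85 text follows. Two assumptions are baked in and may not match the driver: "used" is taken to mean everything up to the last non-zero 32-bit word, and each word is encoded independently with the common ascii85 shorthand of a single 'z' for an all-zero word. All function names are hypothetical.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Count the words to dump: everything up to the last non-zero word. */
static size_t used_words(const uint32_t *ring, size_t nwords)
{
    size_t used = 0;

    for (size_t i = 0; i < nwords; i++)
        if (ring[i])
            used = i + 1;
    return used;
}

/*
 * Encode one 32-bit word: "z" for zero, otherwise five characters in
 * the range '!'..'u', most significant digit first. out must hold 6 bytes.
 */
static const char *ascii85_word(uint32_t in, char *out)
{
    if (in == 0) {
        strcpy(out, "z");
        return out;
    }
    out[5] = '\0';
    for (int i = 5; i-- > 0; ) {
        out[i] = (char)('!' + in % 85);
        in /= 85;
    }
    return out;
}

/*
 * Encode the used portion of the ring into dst and return the number of
 * characters written. dst must hold at least nwords * 5 + 1 bytes.
 */
static size_t dump_ring(const uint32_t *ring, size_t nwords, char *dst)
{
    size_t n = used_words(ring, nwords);
    size_t pos = 0;
    char tmp[6];

    for (size_t i = 0; i < n; i++) {
        size_t len;

        ascii85_word(ring[i], tmp);
        len = strlen(tmp);
        memcpy(dst + pos, tmp, len);
        pos += len;
    }
    dst[pos] = '\0';
    return pos;
}
```

Trailing zero words are dropped entirely, and zero words inside the used region cost one character instead of five, which is where the space saving comes from in this sketch.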

drm/msm/adreno: Convert the show/crash file format · bcf1d9fa

Authored by Jordan Crouse on Jul 24, 2018

Convert the format of the 'show' debugfs file and the crash
dump to a format resembling YAML. This should be easier to
parse and be more flexible for future changes and expansions.

v2: Use a standard .rst for the msm crashdump documentation
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

bcf1d9fa

drm/msm/gpu: Capture the GPU state on a GPU hang · c0fec7f5

Authored by Jordan Crouse on Jul 24, 2018

Capture the GPU state on a GPU hang and store it for later playback
via the devcoredump facility. Only one crash state is stored at a
time on the assumption that the first hang is usually the most
interesting. The existing crash state can be cleared after capturing
it and then a new one will be captured on the next hang.
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

c0fec7f5
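
The "keep only the first crash state until it is cleared" policy can be sketched as a single-slot store. The types and functions below (`struct crash_state`, `struct gpu_sketch`, `capture_on_hang()`, `clear_crash_state()`) are hypothetical and leave out locking and the devcoredump hand-off entirely.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical captured state; a real one would hold registers, rings, etc. */
struct crash_state {
    char reason[32];
};

/* Hypothetical GPU object with a single crash-state slot. */
struct gpu_sketch {
    struct crash_state *crashstate;   /* NULL while the slot is empty */
};

/*
 * Called on a hang. Captures only if no earlier state is stored.
 * Returns 1 if a state was captured, 0 if it was skipped or the
 * allocation failed.
 */
static int capture_on_hang(struct gpu_sketch *gpu, const char *reason)
{
    struct crash_state *st;

    if (gpu->crashstate)
        return 0;   /* the first hang wins */
    st = calloc(1, sizeof(*st));
    if (!st)
        return 0;
    strncpy(st->reason, reason, sizeof(st->reason) - 1);
    gpu->crashstate = st;
    return 1;
}

/* Called after the state has been read out; frees the slot for the next hang. */
static void clear_crash_state(struct gpu_sketch *gpu)
{
    free(gpu->crashstate);
    gpu->crashstate = NULL;
}
```

A second hang before the slot is cleared is ignored, so follow-on hangs cannot overwrite the state of the hang that most likely started the trouble.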

drm/msm/gpu: Convert the GPU show function to use the GPU state · 4f776f45

Authored by Jordan Crouse on Jul 24, 2018

Convert the existing GPU show function to use the GPU state to
dump the information rather than reading it directly from the hardware.
This will require an additional step to capture the state before
dumping it for the existing nodes but it will greatly facilitate reusing
the same code for dumping a previously captured state from a GPU hang.
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

4f776f45

drm/msm/gpu: Capture the state of the GPU · e00e473d

Authored by Jordan Crouse on Jul 24, 2018

Add the infrastructure to capture the current state of the GPU and
store it in memory so that it can be dumped later.

For now grab the same basic ringbuffer information and registers
that are provided by the debugfs 'gpu' node but obviously this should
be extended to capture a much larger set of GPU information.
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

e00e473d

25 Jul, 2018 1 commit

drm/msm/gpu: Increase the pm runtime autosuspend for 5xx · 64709686

Authored by Jordan Crouse on May 07, 2018

Experimentation shows that resuming power quickly after suspending
ends up forcing a system hang for unknown reasons on 5xx targets.
To avoid cycling the power too much (especially during init)
turn up the autosuspend time for a5xx to 250ms and use
pm_runtime_put_autosuspend() when applicable.
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

64709686

19 Mar, 2018 1 commit

drm/msm: Trigger fence completion from GPU · 79d57bf6

Authored by Bjorn Andersson on Feb 13, 2018

Interrupt commands cause the CP to trigger an interrupt as the command
is processed, regardless of the GPU being done processing previous
commands. This is seen by the interrupt being delivered before the
fence is written on 8974 and is likely the cause of the additional
CP_WAIT_FOR_IDLE workaround found for a306, which would cause the CP to
wait for the GPU to go idle before triggering the interrupt.

Instead we can set the (undocumented) BIT(31) of the CACHE_FLUSH_TS
which will cause a special CACHE_FLUSH_TS interrupt to be triggered from
the GPU as the write event is processed.

Add CACHE_FLUSH_TS to the IRQ masks of A3xx and A4xx and remove the
workaround for A306.
Suggested-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Bjorn Andersson <bjorn.andersson@linaro.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

79d57bf6

20 Feb, 2018 2 commits

drm/msm/adreno: Use generic function to load firmware to a buffer object · 9de43e79

Authored by Jordan Crouse on Feb 01, 2018

Move a5xx specific code to load firmware into a buffer object to
the generic Adreno code. This will come in useful for future targets.
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

9de43e79

drm/msm/adreno: Define a list of firmware files to load per target · c5e3548c

Authored by Jordan Crouse on Feb 01, 2018

The number and type of firmware files required differs for each
target. Instead of using a fixed struct member for each possible
firmware file use a generic list of files that should be loaded
on boot.  Use some semi-target specific enums to help each target
find the appropriate firmware(s) that it needs to load.
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

c5e3548c

11 Jan, 2018 1 commit

drm/msm: Add devfreq support for the GPU · f91c14ab

Authored by Jordan Crouse on Jan 10, 2018

Add support for devfreq to dynamically control the GPU frequency.
By default try to use the 'simple_ondemand' governor which can
adjust the frequency based on GPU load.

v2: Fix __aeabi_uldivmod issue from the 0 day bot and use
devfreq_recommended_opp() as suggested by Rob.
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

f91c14ab

10 Jan, 2018 2 commits

drm/msm/adreno: Move clock parsing to adreno_gpu_init() · 999ae6ed

Authored by Jordan Crouse on Nov 21, 2017

Move the clock parsing to adreno_gpu_init() to allow for target
specific probing and manipulation of the clock tables.
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

999ae6ed

drm/msm/gpu: Remove unused bus scaling code · 1babd706

Authored by Jordan Crouse on Nov 21, 2017

Remove the downstream bus scaling code. It isn't needed for
compatibility with a downstream or vendor kernel. Get it out of the
way to clear space for devfreq support.
Signed-off-by: Jordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: Rob Clark <robdclark@gmail.com>

1babd706

14 Dec, 2017 1 commit

drm/msm: fix spelling mistake: "ringubffer" -> "ringbuffer" · 3a9016ba

Authored by Colin Ian King on Nov 02, 2017

Trivial fix to spelling mistake in DRM_DEV_ERROR error message
Signed-off-by: Colin Ian King <colin.king@canonical.com>
Signed-off-by: Rob Clark <robdclark@gmail.com>

3a9016ba

28 Oct, 2017 9 commits

drm/msm: Implement preemption for A5XX targets · b1fc2839