提交 · 9aa0d2dde6ebd14e9d16e28081a24721d5b41cc8 · openanolis / cloud-kernel

13 10月, 2017 1 次提交

drm/msm: fix _NO_IMPLICIT fencing case · 06451a3d

由 Rob Clark 提交于 9月 12, 2017

We need to call reservation_object_reserve_shared() in both cases, but
this wasn't happening in the _NO_IMPLICIT submit case.

Fixes: f0a42bb5 ("drm/msm: submit support for in-fences")
Reported-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

06451a3d

14 9月, 2017 1 次提交

mm: treewide: remove GFP_TEMPORARY allocation flag · 0ee931c4

由 Michal Hocko 提交于 9月 13, 2017

GFP_TEMPORARY was introduced by commit e12ba74d ("Group short-lived
and reclaimable kernel allocations") along with __GFP_RECLAIMABLE.  It's
primary motivation was to allow users to tell that an allocation is
short lived and so the allocator can try to place such allocations close
together and prevent long term fragmentation.  As much as this sounds
like a reasonable semantic it becomes much less clear when to use the
highlevel GFP_TEMPORARY allocation flag.  How long is temporary? Can the
context holding that memory sleep? Can it take locks? It seems there is
no good answer for those questions.

The current implementation of GFP_TEMPORARY is basically GFP_KERNEL |
__GFP_RECLAIMABLE which in itself is tricky because basically none of
the existing caller provide a way to reclaim the allocated memory.  So
this is rather misleading and hard to evaluate for any benefits.

I have checked some random users and none of them has added the flag
with a specific justification.  I suspect most of them just copied from
other existing users and others just thought it might be a good idea to
use without any measuring.  This suggests that GFP_TEMPORARY just
motivates for cargo cult usage without any reasoning.

I believe that our gfp flags are quite complex already and especially
those with highlevel semantic should be clearly defined to prevent from
confusion and abuse.  Therefore I propose dropping GFP_TEMPORARY and
replace all existing users to simply use GFP_KERNEL.  Please note that
SLAB users with shrinkers will still get __GFP_RECLAIMABLE heuristic and
so they will be placed properly for memory fragmentation prevention.

I can see reasons we might want some gfp flag to reflect shorterm
allocations but I propose starting from a clear semantic definition and
only then add users with proper justification.

This was been brought up before LSF this year by Matthew [1] and it
turned out that GFP_TEMPORARY really doesn't have a clear semantic.  It
seems to be a heuristic without any measured advantage for most (if not
all) its current users.  The follow up discussion has revealed that
opinions on what might be temporary allocation differ a lot between
developers.  So rather than trying to tweak existing users into a
semantic which they haven't expected I propose to simply remove the flag
and start from scratch if we really need a semantic for short term
allocations.

[1] http://lkml.kernel.org/r/20170118054945.GD18349@bombadil.infradead.org

[akpm@linux-foundation.org: fix typo]
[akpm@linux-foundation.org: coding-style fixes]
[sfr@canb.auug.org.au: drm/i915: fix up]
  Link: http://lkml.kernel.org/r/20170816144703.378d4f4d@canb.auug.org.au
Link: http://lkml.kernel.org/r/20170728091904.14627-1-mhocko@kernel.orgSigned-off-by: NMichal Hocko <mhocko@suse.com>
Signed-off-by: NStephen Rothwell <sfr@canb.auug.org.au>
Acked-by: NMel Gorman <mgorman@suse.de>
Acked-by: NVlastimil Babka <vbabka@suse.cz>
Cc: Matthew Wilcox <willy@infradead.org>
Cc: Neil Brown <neilb@suse.de>
Cc: "Theodore Ts'o" <tytso@mit.edu>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

0ee931c4

02 8月, 2017 2 次提交

drm/msm: args->fence should be args->flags · b0135ab9

由 Jordan Crouse 提交于 7月 27, 2017

Fix a typo in msm_ioctl_gem_submit - check args->flags for the
MSM_SUBMIT_NO_IMPLICIT flag instead of args->fence.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

b0135ab9

drm/msm: fix an integer overflow test · 65e93108

由 Dan Carpenter 提交于 6月 30, 2017

We recently added an integer overflow check but it needs an additional
tweak to work properly on 32 bit systems.

The problem is that we're doing the right hand side of the assignment as
type unsigned long so the max it will have an integer overflow instead
of being larger than SIZE_MAX.  That means the "sz > SIZE_MAX" condition
is never true even on 32 bit systems.  We need to first cast it to u64
and then do the math.

Fixes: 4a630fad ("drm/msm: Fix potential buffer overflow issue")
Signed-off-by: NDan Carpenter <dan.carpenter@oracle.com>
Acked-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

65e93108

20 6月, 2017 1 次提交

drm/msm: Fix potential buffer overflow issue · 4a630fad

由 Kasin Li 提交于 6月 19, 2017

In function submit_create, if nr_cmds or nr_bos is assigned with
negative value, the allocated buffer may be small than intended.
Using this buffer will lead to buffer overflow issue.
Signed-off-by: NKasin Li <donglil@codeaurora.org>
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

4a630fad

17 6月, 2017 1 次提交

drm/msm: Separate locking of buffer resources from struct_mutex · 0e08270a

由 Sushmita Susheelendra 提交于 6月 13, 2017

Buffer object specific resources like pages, domains, sg list
need not be protected with struct_mutex. They can be protected
with a buffer object level lock. This simplifies locking and
makes it easier to avoid potential recursive locking scenarios
for SVM involving mmap_sem and struct_mutex. This also removes
unnecessary serialization when creating buffer objects, and also
between buffer object creation and GPU command submission.
Signed-off-by: NSushmita Susheelendra <ssusheel@codeaurora.org>
[robclark: squash in handling new locking for shrinker]
Signed-off-by: NRob Clark <robdclark@gmail.com>

0e08270a

16 6月, 2017 1 次提交

drm/msm: pass address-space to _get_iova() and friends · 8bdcd949

由 Rob Clark 提交于 6月 13, 2017

No functional change, that will come later.  But this will make it
easier to deal with dynamically created address spaces (ie. per-
process pagetables for gpu).
Signed-off-by: NRob Clark <robdclark@gmail.com>

8bdcd949

28 5月, 2017 2 次提交

drm/msm: Fix the check for the command size · d72fea53

由 Jordan Crouse 提交于 5月 08, 2017

The overrun check for the size of submitted commands is off by one.
It should allow the offset plus the size to be equal to the
size of the memory object when the command stream is very tightly
constructed.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

d72fea53

drm/msm: for array in-fences, check if all backing fences are from our own context before waiting · 3cfac69c

由 Philipp Zabel 提交于 3月 17, 2017

Use the dma_fence_match_context helper to check if all backing fences
are from our own context, in which case we don't have to wait.
Signed-off-by: NPhilipp Zabel <p.zabel@pengutronix.de>
Cc: Rob Clark <robdclark@gmail.com>
Cc: Gustavo Padovan <gustavo.padovan@collabora.com>
[rebased on code-motion]
Signed-off-by: NRob Clark <robdclark@gmail.com>

3cfac69c

08 4月, 2017 1 次提交

drm/msm: move submit fence wait out of struct_mutex · 48f243c9

由 Rob Clark 提交于 2月 25, 2017

Probably a symptom of needing finer grained locking, but if we wait on
the incoming fence-fd (which could come from a different context) while
holding struct_mutex, that blocks retire_worker so gpu fences cannot get
signalled.

This causes a problem if userspace manages to get more than a frame
ahead, leaving the atomic-commit worker blocked waiting on fences that
cannot be signaled because submit is blocked waiting for a fence
signalled from vblank (after the atomic commit which is blocked).

If we start having multiple fence ctxs for the gpu, submit_fence_sync()
would probably need to move outside of struct_mutex as well.
Signed-off-by: NRob Clark <robdclark@gmail.com>

48f243c9

07 2月, 2017 1 次提交

drm/msm: return -EFAULT if copy_from_user() fails · 21c42da1

由 Dan Carpenter 提交于 1月 16, 2017

copy_from_user_inatomic() is actually a local function that returns
-EFAULT or positive values on error.  Otherwise copy_from_user() returns
the number of bytes remaining to be copied.  We want to return -EFAULT
here.

I removed an unlikely() because we just did a copy_from_user()
so I don't think it can possibly make a difference.
Signed-off-by: NDan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: NRob Clark <robdclark@gmail.com>

21c42da1

30 12月, 2016 2 次提交

drm/msm: Verify that MSM_SUBMIT_BO_FLAGS are set · a6cb3b86

由 Jordan Crouse 提交于 12月 20, 2016

For every submission buffer object one of MSM_SUBMIT_BO_WRITE
and MSM_SUBMIT_BO_READ must be set (and nothing else). If we
allowed zero then the buffer object would never get queued to
be unreferenced.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

a6cb3b86

drm/msm: Put back the vaddr in submit_reloc() · 6490abc4

由 Jordan Crouse 提交于 12月 20, 2016

The error cases in submit_reloc() need to put back the virtual
address of the bo before failling. Add a single failure path
for the function.
Signed-off-by: NJordan Crouse <jcrouse@codeaurora.org>
Signed-off-by: NRob Clark <robdclark@gmail.com>

6490abc4

29 11月, 2016 1 次提交

drm/msm: convert iova to 64b · 78babc16

由 Rob Clark 提交于 11月 11, 2016

For a5xx the gpu is 64b so we need to change iova to 64b everywhere.  On
the display side, iova is still 32b so it can ignore the upper bits.
(Although all the armv8 devices have an iommu that can map 64b pa to 32b
iova.)
Signed-off-by: NRob Clark <robdclark@gmail.com>

78babc16

25 10月, 2016 1 次提交

dma-buf: Rename struct fence to dma_fence · f54d1867

由 Chris Wilson 提交于 10月 25, 2016

I plan to usurp the short name of struct fence for a core kernel struct,
and so I need to rename the specialised fence/timeline for DMA
operations to make room.

A consensus was reached in
https://lists.freedesktop.org/archives/dri-devel/2016-July/113083.html
that making clear this fence applies to DMA operations was a good thing.
Since then the patch has grown a bit as usage increases, so hopefully it
remains a good thing!

(v2...: rebase, rerun spatch)
v3: Compile on msm, spotted a manual fixup that I broke.
v4: Try again for msm, sorry Daniel

coccinelle script:
@@

@@
- struct fence
+ struct dma_fence
@@

@@
- struct fence_ops
+ struct dma_fence_ops
@@

@@
- struct fence_cb
+ struct dma_fence_cb
@@

@@
- struct fence_array
+ struct dma_fence_array
@@

@@
- enum fence_flag_bits
+ enum dma_fence_flag_bits
@@

@@
(
- fence_init
+ dma_fence_init
|
- fence_release
+ dma_fence_release
|
- fence_free
+ dma_fence_free
|
- fence_get
+ dma_fence_get
|
- fence_get_rcu
+ dma_fence_get_rcu
|
- fence_put
+ dma_fence_put
|
- fence_signal
+ dma_fence_signal
|
- fence_signal_locked
+ dma_fence_signal_locked
|
- fence_default_wait
+ dma_fence_default_wait
|
- fence_add_callback
+ dma_fence_add_callback
|
- fence_remove_callback
+ dma_fence_remove_callback
|
- fence_enable_sw_signaling
+ dma_fence_enable_sw_signaling
|
- fence_is_signaled_locked
+ dma_fence_is_signaled_locked
|
- fence_is_signaled
+ dma_fence_is_signaled
|
- fence_is_later
+ dma_fence_is_later
|
- fence_later
+ dma_fence_later
|
- fence_wait_timeout
+ dma_fence_wait_timeout
|
- fence_wait_any_timeout
+ dma_fence_wait_any_timeout
|
- fence_wait
+ dma_fence_wait
|
- fence_context_alloc
+ dma_fence_context_alloc
|
- fence_array_create
+ dma_fence_array_create
|
- to_fence_array
+ to_dma_fence_array
|
- fence_is_array
+ dma_fence_is_array
|
- trace_fence_emit
+ trace_dma_fence_emit
|
- FENCE_TRACE
+ DMA_FENCE_TRACE
|
- FENCE_WARN
+ DMA_FENCE_WARN
|
- FENCE_ERR
+ DMA_FENCE_ERR
)
 (
 ...
 )
Signed-off-by: NChris Wilson <chris@chris-wilson.co.uk>
Reviewed-by: NGustavo Padovan <gustavo.padovan@collabora.co.uk>
Acked-by: NSumit Semwal <sumit.semwal@linaro.org>
Acked-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NDaniel Vetter <daniel.vetter@ffwll.ch>
Link: http://patchwork.freedesktop.org/patch/msgid/20161025120045.28839-1-chris@chris-wilson.co.uk

f54d1867

16 9月, 2016 4 次提交

R
drm/msm: submit support for out-fences · 4cd09459
由 Rob Clark 提交于 6月 16, 2016
```
Signed-off-by: NRob Clark <robdclark@gmail.com>
```
4cd09459
R
drm/msm: move fence allocation out of msm_gpu_submit() · f44d32c7
由 Rob Clark 提交于 6月 16, 2016
```
Prep work for next patch.
Signed-off-by: NRob Clark <robdclark@gmail.com>
```
f44d32c7
R
drm/msm: submit support for in-fences · f0a42bb5
由 Rob Clark 提交于 6月 16, 2016
```
Signed-off-by: NRob Clark <robdclark@gmail.com>
```
f0a42bb5

drm/msm: extend the submit ioctl to pass in flags · d9c181e2

由 Rob Clark 提交于 4月 23, 2016

We'll want to be able to pass in flags, such as asking for explicit
fencing, and possibly other things down the road.  Fortunately we
don't need a full 32b for the pipe-id.  So use the upper 16 bits
for flags (which could be extended or reduced later if needed, so
start adding flags from the high bits).

Since anything with the upper bits set would not be a valid pipe-id,
an old userspace would not set any of the upper bits, and an old
kernel would reject it as an invalid pipe-id.
Signed-off-by: NRob Clark <robdclark@gmail.com>

d9c181e2

29 8月, 2016 2 次提交

drm/msm: protect against faults from copy_from_user() in submit ioctl · d78d383a

由 Rob Clark 提交于 8月 22, 2016

An evil userspace could try to cause deadlock by passing an unfaulted-in
GEM bo as submit->bos (or submit->cmds) table.  Which will trigger
msm_gem_fault() while we already hold struct_mutex.  See:

https://github.com/freedreno/msmtest/blob/master/evilsubmittest.c

Cc: stable@vger.kernel.org
Signed-off-by: NRob Clark <robdclark@gmail.com>

d78d383a

drm/msm: fix use of copy_from_user() while holding spinlock · 89f82cbb

由 Rob Clark 提交于 8月 22, 2016

Use instead __copy_from_user_inatomic() and fallback to slow-path where
we drop and re-aquire the lock in case of fault.

Cc: stable@vger.kernel.org
Reported-by: NVaishali Thakkar <vaishali.thakkar@oracle.com>
Signed-off-by: NRob Clark <robdclark@gmail.com>

89f82cbb

16 7月, 2016 3 次提交

drm/msm: deal with arbitrary # of cmd buffers · 6b597ce2

由 Rob Clark 提交于 6月 01, 2016

For some optimizations coming on the userspace side, splitting larger
draw or gmem cmds into multiple cmdstream buffers, we need to support
much more than the previous small/arbitrary limit.
Signed-off-by: NRob Clark <robdclark@gmail.com>

6b597ce2

drm/msm: change gem->vmap() to get/put · 18f23049

由 Rob Clark 提交于 5月 26, 2016

Before we can add vmap shrinking, we really need to know which vmap'ings
are currently being used.  So switch to get/put interface.  Stubbed put
fxns for now.
Signed-off-by: NRob Clark <robdclark@gmail.com>

18f23049

drm/msm: use mutex_lock_interruptible for submit ioctl · b5b4c264

由 Rob Clark 提交于 5月 17, 2016

Be kinder to things that do lots of signal handling (ie. Xorg)
Signed-off-by: NRob Clark <robdclark@gmail.com>

b5b4c264

05 6月, 2016 2 次提交

R
drm/msm: fix potential submit error path issue · a9e26cab
由 Rob Clark 提交于 6月 01, 2016
```
Signed-off-by: NRob Clark <robdclark@gmail.com>
```
a9e26cab

drm/msm: fix some crashes in submit fail path · ba344afd

由 Rob Clark 提交于 5月 24, 2016

If submit fails, before fence is created or before submit is added to
submit-list, then unitialized fields cause problems in the clean-up
path.
Signed-off-by: NRob Clark <robdclark@gmail.com>

ba344afd

08 5月, 2016 5 次提交

drm/msm: print offender task name on hangcheck recovery · 4816b626

由 Rob Clark 提交于 5月 03, 2016

Track the pid per submit, so we can print the name of the task which
submitted the batch that caused the gpu to hang.
Signed-off-by: NRob Clark <robdclark@gmail.com>

4816b626

R
drm/msm: fix leak in failed submit path · 40e6815b
由 Rob Clark 提交于 5月 03, 2016
```
Signed-off-by: NRob Clark <robdclark@gmail.com>
```
40e6815b
R
drm/msm: de-indent submit_create() · 6860b56c
由 Rob Clark 提交于 5月 03, 2016
```
Signed-off-by: NRob Clark <robdclark@gmail.com>
```
6860b56c
R
drm/msm: 'struct fence' conversion · b6295f9a
由 Rob Clark 提交于 3月 15, 2016
```
Signed-off-by: NRob Clark <robdclark@gmail.com>
```
b6295f9a

drm/msm: split locking and pinning BO's · 340faef2

由 Rob Clark 提交于 3月 14, 2016

Split up locking and pinning buffers in the submit path.  This is needed
because we'll want to insert fencing in between the two steps.

This makes things end up looking more similar to etnaviv submit code
(which was originally modelled on the msm code but has already added
'struct fence' support).
Signed-off-by: NRob Clark <robdclark@gmail.com>

340faef2

30 4月, 2016 1 次提交

kernel.h: add u64_to_user_ptr() · 3ed605bc

由 Gustavo Padovan 提交于 4月 26, 2016

This function had copies in 3 different files. Unify them in kernel.h.

Cc: Joe Perches <joe@perches.com>
Cc: Andrew Morton <akpm@linux-foundation.org>
Cc: David Airlie <airlied@linux.ie>
Cc: Daniel Vetter <daniel.vetter@intel.com>
Cc: Rob Clark <robdclark@gmail.com>
Signed-off-by: NGustavo Padovan <gustavo.padovan@collabora.co.uk>
Acked-by: Daniel Vetter <daniel.vetter@intel.com>	[drm/i915/]
Acked-by: Rob Clark <robdclark@gmail.com>		[drm/msm/]
Acked-by: Lucas Stach <l.stach@pengutronix.de>		[drm/etinav/]
Acked-by: NMaarten Lankhorst <maarten.lankhorst@linux.intel.com>
Signed-off-by: NGreg Kroah-Hartman <gregkh@linuxfoundation.org>

3ed605bc

04 3月, 2016 2 次提交

drm/msm: grab struct_mutex after allocating submit · 687f084a

由 Rob Clark 提交于 2月 03, 2016

No real need to hold the lock over allocation, and simplifies things
slightly if we change the order.
Signed-off-by: NRob Clark <robdclark@gmail.com>

687f084a

drm/msm: reject submit ioctl if no gpu · c01a958e

由 Rob Clark 提交于 2月 03, 2016

Existing userspace wouldn't get this far, since getparam ioctl would
have failed and it would have bailed out creating a screen/context.

But all the same, we shouldn't let evil or confused userspace cause a
null ptr deref.
Signed-off-by: NRob Clark <robdclark@gmail.com>

c01a958e

12 6月, 2015 1 次提交

drm/msm: restart queued submits after hang · 1a370be9

由 Rob Clark 提交于 6月 07, 2015

Track the list of in-flight submits. If the gpu hangs, retire up to an
including the offending submit, and then re-submit the remainder. This
way, for concurrently running piglit tests (for example), one failing
test doesn't cause unrelated tests to fail simply because it's submit
was queued up after one that triggered a hang.
Signed-off-by: NRob Clark <robdclark@gmail.com>

1a370be9

02 6月, 2014 1 次提交

drm/msm: add rd logging debugfs · a7d3c950

由 Rob Clark 提交于 5月 30, 2014

To ease debugging, add debugfs file which can be cat/tail'd to log
submits, along with fence #.  If GPU hangs, you can look at 'gpu'
debugfs file to find last completed fence and current register state,
and compare with logged rd file to narrow down the DRAW_INDX which
triggered the GPU hang.
Signed-off-by: NRob Clark <robdclark@gmail.com>

a7d3c950

31 3月, 2014 1 次提交

drm/msm: validate flags, etc · 93ddb0d3

由 Rob Clark 提交于 3月 03, 2014

After reading a nice article on LWN[1], I went back and double checked
my handling of invalid-input checking.  Turns out there were a couple
places I had missed.

Since the driver is fairly young, and the devices it supports are really
only just barely usable for basic stuff (serial console) with an
upstream kernel, I think we should fix this now and revert specific
parts of this patch later in the unlikely event that a regression is
reported.

[1] https://lwn.net/Articles/588444/Signed-off-by: NRob Clark <robdclark@gmail.com>

93ddb0d3

07 2月, 2014 1 次提交

drm/msm: bigger synchronization hammer · c2703b13

由 Rob Clark 提交于 2月 06, 2014

Because we use a list_head in the bo to track it's position in a submit,
we need to serialize at a higher layer.  Otherwise there are problems
when multiple contexts are SUBMIT'ing in parallel cmdstreams referencing
a shared bo.
Signed-off-by: NRob Clark <robdclark@gmail.com>

c2703b13

11 9月, 2013 1 次提交

drm/msm: fix cmdstream size check · 19872533

由 Rob Clark 提交于 9月 06, 2013

Need to check size+offset against bo size (duh!).. now we have a test
case to make sure I've done it right:

https://github.com/freedreno/msmtest/blob/master/submittest.c

Also, use DRM_ERROR() for error case traces, which makes debugging
userspace easier when enabling debug traces is too much.
Signed-off-by: NRob Clark <robdclark@gmail.com>

19872533

25 8月, 2013 1 次提交

drm/msm: add a3xx gpu support · 7198e6b0

由 Rob Clark 提交于 7月 19, 2013

Add initial support for a3xx 3d core.

So far, with hardware that I've seen to date, we can have:
 + zero, one, or two z180 2d cores
 + a3xx or a2xx 3d core, which share a common CP (the firmware
   for the CP seems to implement some different PM4 packet types
   but the basics of cmdstream submission are the same)

Which means that the eventual complete "class" hierarchy, once
support for all past and present hw is in place, becomes:
 + msm_gpu
   + adreno_gpu
     + a3xx_gpu
     + a2xx_gpu
   + z180_gpu

This commit splits out the parts that will eventually be common
between a2xx/a3xx into adreno_gpu, and the parts that are even
common to z180 into msm_gpu.

Note that there is no cmdstream validation required.  All memory access
from the GPU is via IOMMU/MMU.  So as long as you don't map silly things
to the GPU, there isn't much damage that the GPU can do.
Signed-off-by: NRob Clark <robdclark@gmail.com>

7198e6b0

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功