提交 · ecf6a637090c8034bc0d843c48905bfa5e2a4e0c · openeuler / raspberrypi-kernel

08 8月, 2016 1 次提交

drm/amdgpu: implement amdgpu_fill_buffer() · 59b4a977

由 Flora Cui 提交于 7月 19, 2016

so that bo could be set to some pattern
Signed-off-by: NFlora Cui <Flora.Cui@amd.com>
Reviewed-by: NChunming Zhou <David1.Zhou@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

59b4a977

08 7月, 2016 6 次提交

drm/amdgpu: add eviction counter · dbd5ed60

由 Christian König 提交于 6月 21, 2016

Keep track of the number of evictions since boot.
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

dbd5ed60

drm/amdgpu: pipeline evictions as well · ce64bc25

由 Christian König 提交于 6月 15, 2016

This boosts Xonotic from 38fps to 47fps when artificially limiting VRAM to
256MB for testing. It should improve all CPU bound rendering situations
where we have a lot of swapping to/from VRAM.
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

ce64bc25

drm/ttm: remove no_gpu_wait param from ttm_bo_move_accel_cleanup · 74561cd4

由 Christian König 提交于 6月 15, 2016

It isn't used and not waiting for the GPU after scheduling a move is
actually quite dangerous.
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

74561cd4

drm/amdgpu: remove pre move wait · 99c44632

由 Christian König 提交于 6月 06, 2016

Not needed any more.
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

99c44632

drm/ttm: wait for BO idle in ttm_bo_move_memcpy · 77dfc28b

由 Christian König 提交于 6月 06, 2016

When we want to pipeline accelerated moves we need to wait in the fallback path.
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

77dfc28b

drm/ttm: add wait for idle in all drivers bo_move functions · 88932a7b

由 Christian König 提交于 6月 06, 2016

Wait for idle before moving the BO in all drivers implementing
an accelerated move function.

This should keep the current behavior when removing the pre move wait.
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

88932a7b

05 5月, 2016 3 次提交

drm/amdgpu: group BOs by log2 of the size on the LRU v2 · 29b3259a

由 Christian König 提交于 4月 15, 2016

This allows us to have small BOs on the LRU before big ones.

v2: fix of by one and list corruption bug
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

29b3259a

drm/ttm: implement LRU add callbacks v2 · 98c2872a

由 Christian König 提交于 4月 06, 2016

This allows fine grained control for the driver where to add a BO into the LRU.

v2: fix typo in comment
Reviewed-by: NSinclair Yeh <syeh@vmware.com>
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

98c2872a

drm/amdgpu: Mark all instances of struct drm_info_list as const · 06ab6832

由 Nils Wallménius 提交于 5月 02, 2016

All these are compile time constand and the
drm_debugfs_create/remove_files functions take a const
pointer argument.
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NNils Wallménius <nils.wallmenius@gmail.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

06ab6832

03 5月, 2016 1 次提交

drm/amdgpu: optionally enable GART debugfs file · a1d29476

由 Christian König 提交于 3月 30, 2016

Keeping the pages array around can use a lot of system memory
when you want a large GART.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

a1d29476

22 4月, 2016 1 次提交

drm/amdgpu: forbid mapping of userptr bo through radeon device file · 054892ed

由 Jérôme Glisse 提交于 4月 19, 2016

Allowing userptr bo which are basicly a list of page from some vma
(so either anonymous page or file backed page) would lead to serious
corruption of kernel structures and counters (because we overwrite
the page->mapping field when mapping buffer).

This will already block if the buffer was populated before anyone does
try to mmap it because then TTM_PAGE_FLAG_SG would be set in in the
ttm_tt flags. But that flag is check before ttm_tt_populate in the ttm
vm fault handler.

So to be safe just add a check to verify_access() callback.
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NJérôme Glisse <jglisse@redhat.com>
Cc: <stable@vger.kernel.org>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

054892ed

05 4月, 2016 1 次提交

mm, fs: get rid of PAGE_CACHE_* and page_cache_{get,release} macros · 09cbfeaf

由 Kirill A. Shutemov 提交于 4月 01, 2016

PAGE_CACHE_{SIZE,SHIFT,MASK,ALIGN} macros were introduced *long* time
ago with promise that one day it will be possible to implement page
cache with bigger chunks than PAGE_SIZE.

This promise never materialized.  And unlikely will.

We have many places where PAGE_CACHE_SIZE assumed to be equal to
PAGE_SIZE.  And it's constant source of confusion on whether
PAGE_CACHE_* or PAGE_* constant should be used in a particular case,
especially on the border between fs and mm.

Global switching to PAGE_CACHE_SIZE != PAGE_SIZE would cause to much
breakage to be doable.

Let's stop pretending that pages in page cache are special.  They are
not.

The changes are pretty straight-forward:

 - <foo> << (PAGE_CACHE_SHIFT - PAGE_SHIFT) -> <foo>;

 - <foo> >> (PAGE_CACHE_SHIFT - PAGE_SHIFT) -> <foo>;

 - PAGE_CACHE_{SIZE,SHIFT,MASK,ALIGN} -> PAGE_{SIZE,SHIFT,MASK,ALIGN};

 - page_cache_get() -> get_page();

 - page_cache_release() -> put_page();

This patch contains automated changes generated with coccinelle using
script below.  For some reason, coccinelle doesn't patch header files.
I've called spatch for them manually.

The only adjustment after coccinelle is revert of changes to
PAGE_CAHCE_ALIGN definition: we are going to drop it later.

There are few places in the code where coccinelle didn't reach.  I'll
fix them manually in a separate patch.  Comments and documentation also
will be addressed with the separate patch.

virtual patch

@@
expression E;
@@
- E << (PAGE_CACHE_SHIFT - PAGE_SHIFT)
+ E

@@
expression E;
@@
- E >> (PAGE_CACHE_SHIFT - PAGE_SHIFT)
+ E

@@
@@
- PAGE_CACHE_SHIFT
+ PAGE_SHIFT

@@
@@
- PAGE_CACHE_SIZE
+ PAGE_SIZE

@@
@@
- PAGE_CACHE_MASK
+ PAGE_MASK

@@
expression E;
@@
- PAGE_CACHE_ALIGN(E)
+ PAGE_ALIGN(E)

@@
expression E;
@@
- page_cache_get(E)
+ get_page(E)

@@
expression E;
@@
- page_cache_release(E)
+ put_page(E)
Signed-off-by: NKirill A. Shutemov <kirill.shutemov@linux.intel.com>
Acked-by: NMichal Hocko <mhocko@suse.com>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

09cbfeaf

28 3月, 2016 1 次提交

drm/amdgpu: Don't move pinned BOs · 104ece97

由 Michel Dänzer 提交于 3月 28, 2016

The purpose of pinning is to prevent a buffer from moving.
Reviewed-by: NChristian König <christian.koenig@amd.com>
Tested-by: NRex Zhu <Rex.Zhu@amd.com>
Signed-off-by: NMichel Dänzer <michel.daenzer@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

104ece97

09 3月, 2016 2 次提交

drm/amdgpu: move get_user_pages out of amdgpu_ttm_tt_pin_userptr v6 · 2f568dbd

由 Christian König 提交于 2月 23, 2016

That avoids lock inversion between the BO reservation lock
and the anon_vma lock.

v2:
* Changed amdgpu_bo_list_entry.user_pages to an array of pointers
* Lock mmap_sem only for get_user_pages
* Added invalidation of unbound userpointer BOs
* Fixed memory leak and page reference leak

v3 (chk):
* Revert locking mmap_sem only for_get user_pages
* Revert adding invalidation of unbound userpointer BOs
* Sanitize and fix error handling

v4 (chk):
* Init userpages pointer everywhere.
* Fix error handling when get_user_pages() fails.
* Add invalidation of unbound userpointer BOs again.

v5 (chk):
* Add maximum number of tries.

v6 (chk):
* Fix error handling when we run out of tries.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com> (v4)
Acked-by: NAlex Deucher <alexander.deucher@amd.com>

2f568dbd

drm/amdgpu: prevent get_user_pages recursion · 637dd3b5

由 Christian König 提交于 3月 03, 2016

Remember the tasks which are inside get_user_pages()
and ignore MMU callbacks from there.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

637dd3b5

16 2月, 2016 2 次提交

drm/amdgpu: use post-decrement in error handling · 09ccbb74

由 Rasmus Villemoes 提交于 2月 15, 2016

We need to use post-decrement to get the pci_map_page undone also for
i==0, and to avoid some very unpleasant behaviour if pci_map_page
failed already at i==0.
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NRasmus Villemoes <linux@rasmusvillemoes.dk>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
Cc: stable@vger.kernel.org

09ccbb74

mm/gup: Switch all callers of get_user_pages() to not pass tsk/mm · d4edcf0d

由 Dave Hansen 提交于 2月 12, 2016

We will soon modify the vanilla get_user_pages() so it can no
longer be used on mm/tasks other than 'current/current->mm',
which is by far the most common way it is called.  For now,
we allow the old-style calls, but warn when they are used.
(implemented in previous patch)

This patch switches all callers of:

	get_user_pages()
	get_user_pages_unlocked()
	get_user_pages_locked()

to stop passing tsk/mm so they will no longer see the warnings.
Signed-off-by: NDave Hansen <dave.hansen@linux.intel.com>
Reviewed-by: NThomas Gleixner <tglx@linutronix.de>
Cc: Andrea Arcangeli <aarcange@redhat.com>
Cc: Andrew Morton <akpm@linux-foundation.org>
Cc: Andy Lutomirski <luto@amacapital.net>
Cc: Borislav Petkov <bp@alien8.de>
Cc: Brian Gerst <brgerst@gmail.com>
Cc: Dave Hansen <dave@sr71.net>
Cc: Denys Vlasenko <dvlasenk@redhat.com>
Cc: H. Peter Anvin <hpa@zytor.com>
Cc: Kirill A. Shutemov <kirill.shutemov@linux.intel.com>
Cc: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Naoya Horiguchi <n-horiguchi@ah.jp.nec.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Rik van Riel <riel@redhat.com>
Cc: Srikar Dronamraju <srikar@linux.vnet.ibm.com>
Cc: Vlastimil Babka <vbabka@suse.cz>
Cc: jack@suse.cz
Cc: linux-mm@kvack.org
Link: http://lkml.kernel.org/r/20160212210156.113E9407@viggo.jf.intel.comSigned-off-by: NIngo Molnar <mingo@kernel.org>

d4edcf0d

13 2月, 2016 2 次提交

drm/amdgpu: use separate scheduler entitiy for buffer moves · 703297c1

由 Christian König 提交于 2月 10, 2016

This allows us to remove the global kernel context.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

703297c1

drm/amdgpu: use per VM entity for page table updates (v2) · 2bd9ccfa

由 Christian König 提交于 2月 01, 2016

Updates from different VMs can be processed independently.

v2: agd: rebase on upstream
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

2bd9ccfa

11 2月, 2016 8 次提交

drm/amdgpu: move sync into job object · e86f9cee

由 Christian König 提交于 2月 08, 2016

No need to keep that for every IB.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Acked-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

e86f9cee

drm/amdgpu: cleanup in kernel job submission · d71518b5

由 Christian König 提交于 2月 01, 2016

Add a job_alloc_with_ib helper and proper job submission.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucer@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

d71518b5

drm/amdgpu: move ring from IBs into job · b07c60c0

由 Christian König 提交于 1月 31, 2016

We can't submit to multiple rings at the same time anyway.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucer@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

b07c60c0

drm/amdgpu: make pad_ib a ring function v3 · 9e5d5309

由 Christian König 提交于 1月 31, 2016

The padding depends on the firmware version and we need that for BO moves as
well, not only for VM updates.

v2: new approach of making pad_ib a ring function
v3: fix typo in macro name
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucer@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

9e5d5309

drm/amdgpu: check userptrs mm earlier · cc325d19

由 Christian König 提交于 2月 08, 2016

Instead of when we try to bind it check the usermm when
we try to use it in the IOCTLs.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Acked-by: NAlex Deucher <alexander.deucher@amd.com>

cc325d19

drm/amdgpu: clean up non-scheduler code path (v2) · cadf97b1

由 Chunming Zhou 提交于 1月 15, 2016

Non-scheduler code is longer supported.

v2: agd: rebased on upstream
Signed-off-by: NChunming Zhou <David1.Zhou@amd.com>
Reviewed-by: NKen Wang  <Qingqing.Wang@amd.com>
Reviewed-by: NMonk Liu <monk.liu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

cadf97b1

drm/amdgpu: fix issue with overlapping userptrs · d7006964

由 Christian König 提交于 2月 08, 2016

Otherwise we could try to evict overlapping userptr BOs in get_user_pages(),
leading to a possible circular locking dependency.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Acked-by: NAlex Deucher <alexander.deucher@amd.com>

d7006964

drm/amdgpu: fix issue with overlapping userptrs · cc1de6e8

由 Christian König 提交于 2月 08, 2016

Otherwise we could try to evict overlapping userptr BOs in get_user_pages(),
leading to a possible circular locking dependency.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Acked-by: NAlex Deucher <alexander.deucher@amd.com>
Cc: stable@vger.kernel.org

cc1de6e8

03 2月, 2016 1 次提交

drm/amdgpu: The VI specific EXE bit should only apply to GMC v8.0 above · 8f3c1629

由 Ken Wang 提交于 2月 03, 2016

Reviewed-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NKen Wang <Qingqing.Wang@amd.com>
Cc: stable@vger.kernel.org

8f3c1629

05 12月, 2015 2 次提交

drm/amdgpu: set snooped flags only on system addresses v2 · 6d99905a

由 Christian König 提交于 12月 04, 2015

Not necessary for VRAM.

v2: no need to check if ttm is NULL.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

6d99905a

drm/amdgpu: add err check for pin userptr · ba98f9e5

由 Chunming Zhou 提交于 11月 26, 2015

Missing error check if the operation failed.
Signed-off-by: NChunming Zhou <David1.Zhou@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>

ba98f9e5

03 12月, 2015 1 次提交

drm/amdgpu: add err check for pin userptr · e2f784fa

由 Chunming Zhou 提交于 11月 26, 2015

Missing error check if the operation failed.
Signed-off-by: NChunming Zhou <David1.Zhou@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>

e2f784fa

17 11月, 2015 1 次提交

drm/amdgpu: fix seq_printf format string · e1b35f61

由 Arnd Bergmann 提交于 11月 10, 2015

The amdgpu driver has a debugfs interface that shows the amount of
VRAM in use, but the newly added code causes a build error on
all 32-bit architectures:

drivers/gpu/drm/amd/amdgpu/amdgpu_ttm.c:1076:17: warning: format '%lu' expects argument of type 'long unsigned int', but argument 4 has type 'long long int' [-Wformat=]

This fixes the format string to use "%llu" for printing 64-bit
numbers, which works everywhere, as long as we also cast to 'u64'.
Unlike atomic64_t, u64 is defined as 'unsigned long long' on
all architectures.
Signed-off-by: NArnd Bergmann <arnd@arndb.de>
Fixes: a2ef8a97 ("drm/amdgpu: add vram usage into debugfs")
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

e1b35f61

05 11月, 2015 1 次提交

drm/amdgpu: remove AMDGPU_FENCE_OWNER_MOVE · 7a91d6cb

由 Christian König 提交于 10月 27, 2015

Moves are exclusive operations anyway, just use the undefined owner for those.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NChunming Zhou <david1.zhou@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

7a91d6cb

08 10月, 2015 1 次提交

drm/amdgpu: add vram usage into debugfs · a2ef8a97

由 Chunming Zhou 提交于 9月 22, 2015

Signed-off-by: NChunming Zhou <david1.zhou@amd.com>
Reviewed-by: NJammy Zhou <Jammy.Zhou@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>

a2ef8a97

24 9月, 2015 1 次提交

drm/amdgpu: export reservation_object from dmabuf to ttm (v2) · 72d7668b

由 Christian König 提交于 9月 03, 2015

Adds an extra argument to amdgpu_bo_create, which is only used in amdgpu_prime.c.

Port of radeon commit 831b6966.

v2: fix up kfd.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NJammy Zhou <Jammy.Zhou@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>

72d7668b

03 9月, 2015 1 次提交

drm/amdgpu: be explicit about cpu vram access for driver BOs (v2) · 857d913d

由 Alex Deucher 提交于 8月 27, 2015

For kernel driver BOs, be explicit about whether we need
vram access up front.  This avoids unecessary migrations and
avoids using visible vram for buffers were it's not needed.

v2: line wrap fixes
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

857d913d

27 8月, 2015 1 次提交

drm/amdgpu: use IB for copy buffer of eviction · c7ae72c0

由 Chunming Zhou 提交于 8月 25, 2015

This aids handling buffers moves with the scheduler.
Signed-off-by: NChunming Zhou <david1.zhou@amd.com>
Reviewed-by: NChristian K?nig <christian.koenig@amd.com>

c7ae72c0

25 8月, 2015 2 次提交

drm/amdgpu: fix no sync_wait in copy_buffer · 9066b0c3

由 Chunming Zhou 提交于 8月 25, 2015

when eviction is happening, if don't handle
dependency, then the fence could be dead off.
Signed-off-by: NChunming Zhou <david1.zhou@amd.com>
Reviewed-by: NJammy Zhou <Jammy.Zhou@amd.com>
Reviewed-by: NChristian K?nig <christian.koenig@amd.com>

9066b0c3

drm/amdgpu: improve sa_bo->fence by kernel fence · 4ce9891e

由 Chunming Zhou 提交于 8月 19, 2015

Signed-off-by: NChunming Zhou <david1.zhou@amd.com>
Reviewed-by: NChristian K?nig <christian.koenig@amd.com>

4ce9891e