提交 · c6a3c495f05a070d4c4016d4a51c384cba723971 · openanolis / cloud-kernel

14 12月, 2015 20 次提交

powerpc/mm: Add helper for converting pte bit to hpte bits · c6a3c495

由 Aneesh Kumar K.V 提交于 12月 01, 2015

Instead of open coding it in multiple code paths, export the helper
and add more documentation. Also make sure we don't make assumption
regarding pte bit position
Acked-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

c6a3c495

powerpc/mm: Convert 4k insert from asm to C · a43c0eb8

由 Aneesh Kumar K.V 提交于 12月 01, 2015

This is similar to 64K insert. May be we want to consolidate
Acked-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

a43c0eb8

powerpc/mm: Convert __hash_page_64K to C · 89ff7250

由 Aneesh Kumar K.V 提交于 12月 01, 2015

Convert from asm to C
Acked-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

89ff7250

powerpc/mm: Increase the width of #define · 227fdbee

由 Aneesh Kumar K.V 提交于 12月 01, 2015

No real change, only style changes
Acked-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

227fdbee

A
powerpc/mm: Remove pte_val usage for the second half of pgtable_t · 506b863c
由 Aneesh Kumar K.V 提交于 12月 01, 2015
```
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>
```
506b863c

powerpc/mm: Don't track subpage valid bit in pte_t · bf680d51

由 Aneesh Kumar K.V 提交于 12月 01, 2015

This free up 11 bits in pte_t. In the later patch we also change
the pte_t format so that we can start supporting migration pte
at pmd level. We now track 4k subpage valid bit as below

If we have _PAGE_COMBO set, we override the _PAGE_F_GIX_SHIFT
and _PAGE_F_SECOND. Together we have 4 bits, each of them
used to indicate whether any of the 4 4k subpage in that group
is valid. ie,

[ group 1 bit ]   [ group 2 bit ]  ..... [ group 4 ]
[ subpage 1 - 4]  [ subpage 5- 8]  ..... [ subpage 13 - 16]

We still track each 4k subpage slot number and secondary hash
information in the second half of pgtable_t. Removing the subpage
tracking have some significant overhead on aim9 and ebizzy benchmark and
to support THP with 4K subpage, we do need a pgtable_t of 4096 bytes.
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

bf680d51

powerpc/mm: Remove the dependency on pte bit position in asm code · 106713a1

由 Aneesh Kumar K.V 提交于 12月 01, 2015

We should not expect pte bit position in asm code. Simply
by moving part of that to C
Acked-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

106713a1

powerpc/mm: Convert 4k hash insert to C · 91f1da99

由 Aneesh Kumar K.V 提交于 12月 01, 2015

Acked-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

91f1da99

powerpc/booke: Move nohash headers · 17ed9e31

由 Aneesh Kumar K.V 提交于 12月 01, 2015

Move the booke related headers below booke/32 or booke/64
Acked-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

17ed9e31

powerpc/mm: Move PTE bits from generic functions to hash64 functions. · 1ca72129

由 Aneesh Kumar K.V 提交于 12月 01, 2015

functions which operate on pte bits are moved to hash*.h and other
generic functions are moved to pgtable.h
Acked-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

1ca72129

powerpc/mm: Move hash64 PTE bits from book3s/64/pgtable.h to hash.h · 371352ca

由 Aneesh Kumar K.V 提交于 12月 01, 2015

This enables us to keep hash64 related bits together, and makes it easy
to follow.
Acked-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

371352ca

powerpc/mm: Don't use pmd_val, pud_val and pgd_val as lvalue · f281b5d5

由 Aneesh Kumar K.V 提交于 12月 01, 2015

We convert them static inline function here as we did with pte_val in
the previous patch
Acked-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

f281b5d5

powerpc/mm: Don't use pte_val as lvalue · 10bd3808

由 Aneesh Kumar K.V 提交于 12月 01, 2015

We also convert few #define to static inline in this patch for better
type checking
Acked-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

10bd3808

powerpc/mm: Drop pte-common.h from BOOK3S 64 · b0412ea9

由 Aneesh Kumar K.V 提交于 12月 01, 2015

We copy only needed PTE bits define from pte-common.h to respective
hash related header. This should greatly simply later patches in which
we are going to change the pte format for hash config
Acked-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

b0412ea9

powerpc/mm: Don't have generic headers introduce functions touching pte bits · ee4889c7

由 Aneesh Kumar K.V 提交于 12月 01, 2015

We are going to drop pte_common.h in the later patch. The idea is to
enable hash code not require to define all PTE bits. Having PTE bits
defined in pte_common.h made the code unnecessarily complex.
Acked-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

ee4889c7

powerpc/mm: Delete booke bits from book3s · cbbb8683

由 Aneesh Kumar K.V 提交于 12月 01, 2015

We also move __ASSEMBLY__ towards the end of header. This avoid
having #ifndef __ASSEMBLY___ all over the header
Acked-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

cbbb8683

powerpc/mm: Move hash specific pte width and other defines to book3s · ab537dca

由 Aneesh Kumar K.V 提交于 12月 01, 2015

This further make a copy of pte defines to book3s/64/hash*.h. This
remove the dependency on pgtable-ppc64-4k.h and pgtable-ppc64-64k.h
Acked-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

ab537dca

powerpc/mm: make a separate copy for book3s · 3dfcb315

由 Aneesh Kumar K.V 提交于 12月 01, 2015

In this patch we do:
cp pgtable-ppc32.h book3s/32/pgtable.h
cp pgtable-ppc64.h book3s/64/pgtable.h

This enable us to do further changes to hash specific config.
We will change the page table format for 64bit hash in later patches.
Acked-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

3dfcb315

powerpc/mm: move pte headers to book3s directory · 26b6a3d9

由 Aneesh Kumar K.V 提交于 12月 01, 2015

Acked-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

26b6a3d9

powerpc/mm: Fix infinite loop in hash fault with 4K page size · 0863d7f2

由 Aneesh Kumar K.V 提交于 11月 28, 2015

This is the same bug we fixed as part of 09567e7f
("powerpc/mm: Check paca psize is up to date for huge mappings"). Please
check that for details. The difference here is that faults were
happening on a 4K page at an address previously mapped by hugetlb.
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Reviewed-by: NAnshuman Khandual <khandual@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

0863d7f2

10 12月, 2015 3 次提交

powerpc: Fix DSCR inheritance over fork() · db1231dc

由 Anton Blanchard 提交于 12月 09, 2015

Two DSCR tests have a hack in them:

	/*
	 * XXX: Force a context switch out so that DSCR
	 * current value is copied into the thread struct
	 * which is required for the child to inherit the
	 * changed value.
	 */
	sleep(1);

We should not be working around this in the testcase, it is a kernel bug.
Fix it by copying the current DSCR to the child, instead of what we
had in the thread struct at last context switch.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

db1231dc

powerpc: Call restore_sprs() before _switch() · 20dbe670

由 Anton Blanchard 提交于 12月 10, 2015

commit 152d523e ("powerpc: Create context switch helpers save_sprs()
and restore_sprs()") moved the restore of SPRs after the call to _switch().

There is an issue with this approach - new tasks do not return through
_switch(), they are set up by copy_thread() to directly return through
ret_from_fork() or ret_from_kernel_thread(). This means restore_sprs() is
not getting called for new tasks.

Fix this by moving restore_sprs() before _switch().

Fixes: 152d523e ("powerpc: Create context switch helpers save_sprs() and restore_sprs()")
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

20dbe670

powerpc: Call check_if_tm_restore_required() in enable_kernel_*() · d64d02ce

由 Anton Blanchard 提交于 12月 10, 2015

Commit a0e72cf1 ("powerpc: Create msr_check_and_{set,clear}()")
removed a call to check_if_tm_restore_required() in the
enable_kernel_*() functions. Add them back in.

Fixes: a0e72cf1 ("powerpc: Create msr_check_and_{set,clear}()")
Reported-by: NRashmica Gupta <rashmicy@gmail.com>
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

d64d02ce

02 12月, 2015 4 次提交

powerpc: clean up asm/switch_to.h · d1e1cf2e

由 Anton Blanchard 提交于 10月 29, 2015

Remove a bunch of unnecessary fallback functions and group
things in a more logical way.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

d1e1cf2e

powerpc: Rearrange __switch_to() · f3d885cc

由 Anton Blanchard 提交于 10月 29, 2015

Most of __switch_to() is housekeeping, TLB batching, timekeeping etc.
Move these away from the more complex and critical context switching
code.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

f3d885cc

powerpc: create flush_all_to_thread() · 579e633e

由 Anton Blanchard 提交于 10月 29, 2015

Create a single function that flushes everything (FP, VMX, VSX, SPE).
Doing this all at once means we only do one MSR write.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

579e633e

powerpc: create giveup_all() · c2085059

由 Anton Blanchard 提交于 10月 29, 2015

Create a single function that gives everything up (FP, VMX, VSX, SPE).
Doing this all at once means we only do one MSR write.

A context switch microbenchmark using yield():

http://ozlabs.org/~anton/junkcode/context_switch2.c

./context_switch2 --test=yield --fp --altivec --vector 0 0

shows an improvement of 3% on POWER8.
Signed-off-by: NAnton Blanchard <anton@samba.org>
[mpe: giveup_all() needs to be EXPORT_SYMBOL'ed]
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

c2085059

01 12月, 2015 13 次提交

powerpc: Remove fp_enable() and vec_enable(), use msr_check_and_{set, clear}() · 1f2e25b2

由 Anton Blanchard 提交于 10月 29, 2015

More consolidation of our MSR available bit handling.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

1f2e25b2

powerpc: Add ppc_strict_facility_enable boot option · 3eb5d588

由 Anton Blanchard 提交于 10月 29, 2015

Add a boot option that strictly manages the MSR unavailable bits.
This catches kernel uses of FP/Altivec/SPE that would otherwise
corrupt user state.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

3eb5d588

powerpc: Create disable_kernel_{fp,altivec,vsx,spe}() · dc4fbba1

由 Anton Blanchard 提交于 10月 29, 2015

The enable_kernel_*() functions leave the relevant MSR bits enabled
until we exit the kernel sometime later. Create disable versions
that wrap the kernel use of FP, Altivec VSX or SPE.

While we don't want to disable it normally for performance reasons
(MSR writes are slow), it will be used for a debug boot option that
does this and catches bad uses in other areas of the kernel.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

dc4fbba1

powerpc: Create msr_check_and_{set,clear}() · a0e72cf1

由 Anton Blanchard 提交于 10月 29, 2015

Create helper functions to set and clear MSR bits after first
checking if they are already set. Grouping them will make it
easy to avoid the MSR writes in a subsequent optimisation.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

a0e72cf1

crypto: vmx: Only call enable_kernel_vsx() · 1552cd70

由 Anton Blanchard 提交于 10月 29, 2015

With the recent change to enable_kernel_vsx(), we no longer need
to call enable_kernel_fp() and enable_kernel_altivec().
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

1552cd70

powerpc: Move part of giveup_vsx into c · a7d623d4

由 Anton Blanchard 提交于 10月 29, 2015

Move the MSR modification into c. Removing it from the assembly
function will allow us to avoid costly MSR writes by batching them
up.

Check the FP and VMX bits before calling the relevant giveup_*()
function. This makes giveup_vsx() and flush_vsx_to_thread() perform
more like their sister functions, and allows us to use
flush_vsx_to_thread() in the signal code.

Move the check_if_tm_restore_required() check in.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

a7d623d4

powerpc: Move part of giveup_fpu,altivec,spe into c · 98da581e

由 Anton Blanchard 提交于 10月 29, 2015

Move the MSR modification into new c functions. Removing it from
the low level functions will allow us to avoid costly MSR writes
by batching them up.

Move the check_if_tm_restore_required() check into these new functions.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

98da581e

powerpc: Remove NULL task struct pointer checks in FP and vector code · b51b1153

由 Anton Blanchard 提交于 10月 29, 2015

We used to allow giveup_*() to be called with a NULL task struct
pointer. Now those cases are handled in the caller we can remove
the checks. We can also remove giveup_altivec_notask() which is also
unused.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

b51b1153

powerpc: Create mtmsrd_isync() · 611b0e5c

由 Anton Blanchard 提交于 10月 29, 2015

mtmsrd_isync() will do an mtmsrd followed by an isync on older
processors. On newer processors we avoid the isync via a feature fixup.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

611b0e5c

powerpc: Simplify TM restore checks · b86fd2bd

由 Anton Blanchard 提交于 10月 29, 2015

Instead of having multiple giveup_*_maybe_transactional() functions,
separate out the TM check into a new function called
check_if_tm_restore_required().

This will make it easier to optimise the giveup_*() functions in a
subsequent patch.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

b86fd2bd

powerpc: Remove UP only lazy floating point and vector optimisations · af1bbc3d

由 Anton Blanchard 提交于 10月 29, 2015

The UP only lazy floating point and vector optimisations were written
back when SMP was not common, and neither glibc nor gcc used vector
instructions. Now SMP is very common, glibc aggressively uses vector
instructions and gcc autovectorises.

We want to add new optimisations that apply to both UP and SMP, but
in preparation for that remove these UP only optimisations.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

af1bbc3d

powerpc: Remove redundant mflr in _switch · 68bfa962

由 Anton Blanchard 提交于 10月 29, 2015

No need to execute mflr twice.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

68bfa962

powerpc: Create context switch helpers save_sprs() and restore_sprs() · 152d523e

由 Anton Blanchard 提交于 10月 29, 2015

Move all our context switch SPR save and restore code into two
helpers. We do a few optimisations:

- Group all mfsprs and all mtsprs. In many cases an mtspr sets a
scoreboarding bit that an mfspr waits on, so the current practise of
mfspr A; mtspr A; mfpsr B; mtspr B is the worst scheduling we can
do.

- SPR writes are slow, so check that the value is changing before
writing it.

A context switch microbenchmark using yield():

http://ozlabs.org/~anton/junkcode/context_switch2.c

./context_switch2 --test=yield 0 0

shows an improvement of almost 10% on POWER8.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

152d523e

openanolis / cloud-kernel 大约 1 年 前同步成功

openanolis / cloud-kernel
大约 1 年前同步成功