提交 · 8703d6e0fcfdcc9323d5316a443882e790efc1a6 · openeuler / Kernel

26 5月, 2012 10 次提交

arch/tile: allow querying cpu module information from the hypervisor · 8703d6e0

由 Chris Metcalf 提交于 3月 30, 2012

This just adds a few more attributes to the information Linux
can query from the hypervisor for the /sys/hypervisor/board/ directory,
providing part, serial#, revision#, and description for cpu modules
(as opposed to the board itself, or any mezzanine boards).
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

8703d6e0

arch/tile: fix hardwall for tilegx and generalize for idn and ipi · b8ace083

由 Chris Metcalf 提交于 3月 30, 2012

The hardwall drain code was not properly implemented for tilegx,
just tilepro, so you couldn't reliably restart an application that
made use of the udn.

In addition, the code was only applicable to the udn (user dynamic
network).  On tilegx there is a second user network that is available
(the "idn"), and there is support for having I/O shims deliver
user-level interrupts to applications ("ipi") which functions in a
very similar way to the inter-core permissions used for udn/idn.
So this change also generalizes the code from supporting just the udn
to supports udn/idn/ipi on tilegx.

By default we now use /dev/hardwall/{udn,idn,ipi} with separate
minor numbers for the three devices.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

b8ace083

arch/tile: support multiple huge page sizes dynamically · 621b1955

由 Chris Metcalf 提交于 4月 01, 2012

This change adds support for a new "super" bit in the PTE, using the new
arch_make_huge_pte() method. The Tilera hypervisor sees the bit set at a
given level of the page table and gangs together 4, 16, or 64 consecutive
pages from that level of the hierarchy to create a larger TLB entry.

One extra "super" page size can be specified at each of the three levels
of the page table hierarchy on tilegx, using the "hugepagesz" argument
on the boot command line. A new hypervisor API is added to allow Linux
to tell the hypervisor how many PTEs to gang together at each level of
the page table.

To allow pre-allocating huge pages larger than the buddy allocator can
handle, this change modifies the Tilera bootmem support to put all of
memory on tilegx platforms into bootmem.

As part of this change I eliminate the vestigial CONFIG_HIGHPTE support,
which never worked anyway, and eliminate the hv_page_size() API in favor
of the standard vma_kernel_pagesize() API.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

621b1955

C
arch/tile: support kexec() for tilegx · fc0c49f5
由 Chris Metcalf 提交于 3月 29, 2012
```
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>
```
fc0c49f5

arch/tile: support <asm/cachectl.h> header for cacheflush() syscall · cd6f32aa

由 Chris Metcalf 提交于 3月 29, 2012

We already had a syscall that did some dcache flushing, but it was
not used in practice.  Make it MIPS compatible instead so it can
do both the DCACHE and ICACHE actions.  We have code that wants to
be able to use the ICACHE flush mode from userspace so this change
enables that.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

cd6f32aa

arch/tile: Allow tilegx to build with either 16K or 64K page size · d5d14ed6

由 Chris Metcalf 提交于 3月 29, 2012

This change introduces new flags for the hv_install_context()
API that passes a page table pointer to the hypervisor. Clients
can explicitly request 4K, 16K, or 64K small pages when they
install a new context. In practice, the page size is fixed at
kernel compile time and the same size is always requested every
time a new page table is installed.

The <hv/hypervisor.h> header changes so that it provides more abstract
macros for managing "page" things like PFNs and page tables. For
example there is now a HV_DEFAULT_PAGE_SIZE_SMALL instead of the old
HV_PAGE_SIZE_SMALL. The various PFN routines have been eliminated and
only PA- or PTFN-based ones remain (since PTFNs are always expressed
in fixed 2KB "page" size). The page-table management macros are
renamed with a leading underscore and take page-size arguments with
the presumption that clients will use those macros in some single
place to provide the "real" macros they will use themselves.

I happened to notice the old hv_set_caching() API was totally broken
(it assumed 4KB pages) so I changed it so it would nominally work
correctly with other page sizes.

Tag modules with the page size so you can't load a module built with
a conflicting page size. (And add a test for SMP while we're at it.)
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

d5d14ed6

arch/tile: optimize get_user/put_user and friends · 47d632f9

由 Chris Metcalf 提交于 3月 29, 2012

Use direct load/store for the get_user/put_user.

Previously, we would call out to a helper routine that would do the
appropriate thing and then return, handling the possible exception
internally.  Now we inline the load or store, along with a "we succeeded"
indication in a register; if the load or store faults, we write a
"we failed" indication into the same register and then return to the
following instruction.  This is more efficient and gives us more compact
code, as well as being more in line with what other architectures do.

The special futex assembly source file for TILE-Gx also disappears in
this change; we just use the same inlining idiom there as well, putting
the appropriate atomic operations directly into futex_atomic_op_inuser()
(and thus into the FUTEX_WAIT function).

The underlying atomic copy_from_user, copy_to_user functions were
renamed using the (cryptic) x86 convention as copy_from_user_ll and
copy_to_user_ll.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

47d632f9

arch/tile: support building big-endian kernel · 1efea40d

由 Chris Metcalf 提交于 3月 29, 2012

The toolchain supports big-endian mode now, so add support for building
the kernel to run big-endian as well.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

1efea40d

arch/tile: allow building Linux with transparent huge pages enabled · 73636b1a

由 Chris Metcalf 提交于 3月 28, 2012

The change adds some infrastructure for managing tile pmd's more generally,
using pte_pmd() and pmd_pte() methods to translate pmd values to and
from ptes, since on TILEPro a pmd is really just a nested structure
holding a pgd (aka pte). Several existing pmd methods are moved into
this framework, and a whole raft of additional pmd accessors are defined
that are used by the transparent hugepage framework.

The tile PTE now has a "client2" bit. The bit is used to indicate a
transparent huge page is in the process of being split into subpages.

This change also fixes a generic bug where the return value of the
generic pmdp_splitting_flush() was incorrect.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

73636b1a

arch/tile: use interrupt critical sections less · 51007004

由 Chris Metcalf 提交于 3月 27, 2012

In general we want to avoid ever touching memory while within an
interrupt critical section, since the page fault path goes through
a different path from the hypervisor when in an interrupt critical
section, and we carefully decided with tilegx that we didn't need
to support this path in the kernel. (On tilepro we did implement
that path as part of supporting atomic instructions in software.)

In practice we always need to touch the kernel stack, since that's
where we store the interrupt state before releasing the critical
section, but this change cleans up a few things. The IRQ_ENABLE
macro is split up so that when we want to enable interrupts in a
deferred way (e.g. for cpu_idle or for interrupt return) we can
read the per-cpu enable mask before entering the critical section.
The cache-migration code is changed to use interrupt masking instead
of interrupt critical sections. And, the interrupt-entry code is
changed so that we defer loading "tp" from per-cpu data until after
we have released the interrupt critical section.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

51007004

19 5月, 2012 1 次提交

tilegx: enable SYSCALL_WRAPPERS support · e6d9668e

由 Chris Metcalf 提交于 5月 18, 2012

Some discussion with the glibc mailing lists revealed that this was
necessary for 64-bit platforms with MIPS-like sign-extension rules
for 32-bit values. The original symptom was that passing (uid_t)-1 to
setreuid() was failing in programs linked -pthread because of the "setxid"
mechanism for passing setxid-type function arguments to the syscall code.
SYSCALL_WRAPPERS handles ensuring that all syscall arguments end up with
proper sign-extension and is thus the appropriate fix for this problem.

On other platforms (s390, powerpc, sparc64, and mips) this was fixed
in 2.6.28.6. The general issue is tracked as CVE-2009-0029.

Cc: <stable@vger.kernel.org>
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

e6d9668e

17 5月, 2012 2 次提交

arch/tile: apply commit to the compat signal handling as well · a134d228

由 Chris Metcalf 提交于 5月 16, 2012

This passes siginfo and mcontext to tilegx32 signal handlers that
don't have SA_SIGINFO set just as we have been doing for tilegx64.

Cc: stable@vger.kernel.org
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

a134d228

arch/tile: fix up some issues in calling do_work_pending() · fc327e26

由 Chris Metcalf 提交于 4月 28, 2012

First, we were at risk of handling thread-info flags, in particular
do_signal(), when returning from kernel space. This could happen
after a failed kernel_execve(), or when forking a kernel thread.
The fix is to test in do_work_pending() for user_mode() and return
immediately if so; we already had this test for one of the flags,
so I just hoisted it to the top of the function.

Second, if a ptraced process updated the callee-saved registers
in the ptregs struct and then processed another thread-info flag, we
would overwrite the modifications with the original callee-saved
registers. To fix this, we add a register to note if we've already
saved the registers once, and skip doing it on additional passes
through the loop. To avoid a performance hit from the couple of
extra instructions involved, I modified the GET_THREAD_INFO() macro
to be guaranteed to be one instruction, then bundled it with adjacent
instructions, yielding an overall net savings.
Reported-By: NAl Viro <viro@ZenIV.linux.org.uk>
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

fc327e26

26 4月, 2012 1 次提交

arch/tile: fix a couple of functions that should be __init · 05ef1b79

由 Chris Metcalf 提交于 4月 25, 2012

They were marked __devinit by mistake, causing some warnings at link time.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

05ef1b79

21 4月, 2012 1 次提交

VM: add "vm_mmap()" helper function · 6be5ceb0

由 Linus Torvalds 提交于 4月 20, 2012

This continues the theme started with vm_brk() and vm_munmap():
vm_mmap() does the same thing as do_mmap(), but additionally does the
required VM locking.

This uninlines (and rewrites it to be clearer) do_mmap(), which sadly
duplicates it in mm/mmap.c and mm/nommu.c.  But that way we don't have
to export our internal do_mmap_pgoff() function.

Some day we hopefully don't have to export do_mmap() either, if all
modular users can become the simpler vm_mmap() instead.  We're actually
very close to that already, with the notable exception of the (broken)
use in i810, and a couple of stragglers in binfmt_elf.
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

6be5ceb0

12 4月, 2012 1 次提交

arch/tile: avoid unused variable warning in proc.c for tilegx · e72d5c7e

由 Chris Metcalf 提交于 4月 11, 2012

Until we push the unaligned access support for tilegx, it's silly
to have arch/tile/kernel/proc.c generate a warning about an unused
variable. Extend the #ifdef to cover all the code and data for now.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

e72d5c7e

10 4月, 2012 1 次提交

tile/CPU hotplug: Add missing call to notify_cpu_starting() · d1640130

由 Srivatsa S. Bhat 提交于 3月 22, 2012

The scheduler depends on receiving the CPU_STARTING notification, without
which we end up into a lot of trouble. So add the missing call to
notify_cpu_starting() in the bringup code.
Signed-off-by: NSrivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com>
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

d1640130

03 4月, 2012 23 次提交

arch/tile: avoid accidentally unmasking NMI-type interrupt accidentally · e1d5c019

由 Chris Metcalf 提交于 3月 30, 2012

The return path as we reload registers and core state requires that r30
hold a boolean indicating whether we are returning from an NMI, but in a
couple of cases we weren't setting this properly, with the result that we
could accidentally unmask the NMI interrupt(s), which could cause confusion.
Now we set r30 in every place where we jump into the interrupt return path.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

e1d5c019

arch/tile: remove bogus performance optimization · b1760c84

由 Chris Metcalf 提交于 3月 30, 2012

We were re-homing the initial task's kernel stack on the boot cpu,
but in fact it's better to let it stay globally homed, since that
task isn't bound to the boot cpu anyway.  This is more of a general
cleanup than an actual performance optimization, but it removes
code, which is a good thing. :-)
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

b1760c84

arch/tile: return SIGBUS for addresses that are unaligned AND invalid · cdd8e16f

由 Chris Metcalf 提交于 3月 30, 2012

Previously we were returning SIGSEGV in this case. It seems cleaner
to return SIGBUS since the hardware figures out alignment traps
before TLB violations, so SIGBUS is the "more correct" signal.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

cdd8e16f

arch/tile: fix finv_buffer_remote() for tilegx · 54229ff3

由 Chris Metcalf 提交于 3月 30, 2012

There were some correctness issues with this code that are now fixed
with this change.  The change is likely less performant than it could
be, but it should no longer be vulnerable to any races with memory
operations on the memory network while invalidating a range of memory.
This code is run infrequently so performance isn't critical, but
correctness definitely is.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

54229ff3

arch/tile: use atomic exchange in arch_write_unlock() · ab306cae

由 Chris Metcalf 提交于 3月 30, 2012

This idiom is used elsewhere when we do an unlock by writing a zero,
but I missed it here. Using an atomic operation avoids waiting
on the write buffer for the unlocking write to be sent to the home cache.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

ab306cae

arch/tile: stop mentioning the "kvm" subdirectory · b14f2190

由 Chris Metcalf 提交于 3月 30, 2012

It causes "make clean" to fail, for example.  Once we have KVM support
complete, we'll reinstate the subdir reference.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

b14f2190

arch/tile: export the page_home() function. · e81510e0

由 Chris Metcalf 提交于 3月 29, 2012

This avois a bug in modules trying to use the function.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

e81510e0

arch/tile: fix pointer cast in cacheflush.c · 918cbd38

由 Chris Metcalf 提交于 3月 29, 2012

Pragmatically it couldn't be wrong to cast pointers to long to compare
them (since all kernel addresses are in the top half of VA space),
but it's more correct to cast to unsigned long.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

918cbd38

arch/tile: fix single-stepping over swint1 instructions on tilegx · 2858f856

由 Chris Metcalf 提交于 3月 29, 2012

If we are single-stepping and make a syscall, we call ptrace_notify()
explicitly on the return path back to user space, since we are returning
to a pc value set artificially to the next instruction, and otherwise
we won't register that we stepped over the syscall instruction (swint1).
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

2858f856

arch/tile: implement panic_smp_self_stop() · cb210ee3

由 Chris Metcalf 提交于 3月 29, 2012

This allows the later-panicking tiles to wait in a lower power state
until they get interrupted with an smp_send_stop().
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

cb210ee3

arch/tile: add "nop" after "nap" to help GX idle power draw · 8c92ba6c

由 Chris Metcalf 提交于 3月 29, 2012

This avoids the hardware istream prefetcher doing unnecessary work.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

8c92ba6c

arch/tile: use proper memparse() for "maxmem" options · bfffe79b

由 Chris Metcalf 提交于 3月 29, 2012

This is more standard and avoids having to remember what units
the options actually take.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

bfffe79b

arch/tile: fix up locking in pgtable.c slightly · 719ea79e

由 Chris Metcalf 提交于 3月 29, 2012

We should be holding the init_mm.page_table_lock in shatter_huge_page()
since we are modifying the kernel page tables.  Then, only if we are
walking the other root page tables to update them, do we want to take
the pgd_lock.

Add a comment about taking the pgd_lock that we always do it with
interrupts disabled and therefore are not at risk from the tlbflush
IPI deadlock as is seen on x86.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

719ea79e

C
arch/tile: don't leak kernel memory when we unload modules · 5f220704
由 Chris Metcalf 提交于 3月 29, 2012
```
We were failing to track the memory when we allocated it.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>
```
5f220704

arch/tile: fix bug in delay_backoff() · 444eef1b

由 Chris Metcalf 提交于 3月 29, 2012

We were carefully computing a value to use for the number of loops
to spin for, and then ignoring it.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

444eef1b

arch/tile: fix bug in loading kernels larger than 16 MB · 7a7039ee

由 Chris Metcalf 提交于 3月 29, 2012

Previously we only handled kernels up to a single huge page in size.
Now we create additional PTEs appropriately.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

7a7039ee

arch/tile: don't enable irqs unconditionally in page fault handler · b230ff2d

由 Chris Metcalf 提交于 3月 29, 2012

If we took a page fault while we had interrupts disabled, we
shouldn't enable them in the page fault handler.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

b230ff2d

arch/tile: don't set the homecache of a PTE unless appropriate · 12400f1f

由 Chris Metcalf 提交于 3月 29, 2012

We make sure not to try to set the home for an MMIO PTE (on tilegx)
or a PTE that isn't referencing memory managed by Linux.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

12400f1f

arch/tile: don't wait for migrating PTEs in an NMI handler · 48292738

由 Chris Metcalf 提交于 3月 29, 2012

Doing so raises the possibility of self-deadlock if we are waiting
for a backtrace for an oprofile or perf interrupt while we are
in the middle of migrating our own stack page.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

48292738

C
arch/tile/Makefile: use KCFLAGS when figuring out the libgcc path. · 6731aa9e
由 Chris Metcalf 提交于 3月 29, 2012
```
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>
```
6731aa9e

arch/tile: fix a couple of comments that needed updating · 51bcdf88

由 Chris Metcalf 提交于 3月 29, 2012

Not associated with any code changes, so I'm just lumping these
comment changes into a commit by themselves.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

51bcdf88

arch/tile: fix up some minor trap handling issues · a714ffff

由 Chris Metcalf 提交于 3月 29, 2012

We now respond to MEM_ERROR traps (e.g. an atomic instruction to
non-cacheable memory) with a SIGBUS.

We also no longer generate a console crash message if a user
process die due to a SIGTRAP.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

a714ffff

arch/tile: work around a hardware issue with the return-address stack · e1723538

由 Chris Metcalf 提交于 3月 29, 2012

In certain circumstances we need to do a bunch of jump-and-link
instructions to fill the hardware return-address stack with nonzero values.
Signed-off-by: NChris Metcalf <cmetcalf@tilera.com>

e1723538

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功