提交 · 6822190882ce02ae8ae135026c2b3f17c006960b · openeuler / Kernel

07 3月, 2010 3 次提交

frv: remove pci_dma_sync_single() and pci_dma_sync_sg() · 68221908

由 FUJITA Tomonori 提交于 3月 05, 2010

No architecture except for frv has pci_dma_sync_single() and
pci_dma_sync_sg().  The APIs are deprecated.
Signed-off-by: NFUJITA Tomonori <fujita.tomonori@lab.ntt.co.jp>
Acked-by: NDavid S. Miller <davem@davemloft.net>
Acked-by: NDavid Howells <dhowells@redhat.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

68221908

mm: change anon_vma linking to fix multi-process server scalability issue · 5beb4930

由 Rik van Riel 提交于 3月 05, 2010

The old anon_vma code can lead to scalability issues with heavily forking
workloads.  Specifically, each anon_vma will be shared between the parent
process and all its child processes.

In a workload with 1000 child processes and a VMA with 1000 anonymous
pages per process that get COWed, this leads to a system with a million
anonymous pages in the same anon_vma, each of which is mapped in just one
of the 1000 processes.  However, the current rmap code needs to walk them
all, leading to O(N) scanning complexity for each page.

This can result in systems where one CPU is walking the page tables of
1000 processes in page_referenced_one, while all other CPUs are stuck on
the anon_vma lock.  This leads to catastrophic failure for a benchmark
like AIM7, where the total number of processes can reach in the tens of
thousands.  Real workloads are still a factor 10 less process intensive
than AIM7, but they are catching up.

This patch changes the way anon_vmas and VMAs are linked, which allows us
to associate multiple anon_vmas with a VMA.  At fork time, each child
process gets its own anon_vmas, in which its COWed pages will be
instantiated.  The parents' anon_vma is also linked to the VMA, because
non-COWed pages could be present in any of the children.

This reduces rmap scanning complexity to O(1) for the pages of the 1000
child processes, with O(N) complexity for at most 1/N pages in the system.
 This reduces the average scanning cost in heavily forking workloads from
O(N) to 2.

The only real complexity in this patch stems from the fact that linking a
VMA to anon_vmas now involves memory allocations.  This means vma_adjust
can fail, if it needs to attach a VMA to anon_vma structures.  This in
turn means error handling needs to be added to the calling functions.

A second source of complexity is that, because there can be multiple
anon_vmas, the anon_vma linking in vma_adjust can no longer be done under
"the" anon_vma lock.  To prevent the rmap code from walking up an
incomplete VMA, this patch introduces the VM_LOCK_RMAP VMA flag.  This bit
flag uses the same slot as the NOMMU VM_MAPPED_COPY, with an ifdef in mm.h
to make sure it is impossible to compile a kernel that needs both symbolic
values for the same bitflag.

Some test results:

Without the anon_vma changes, when AIM7 hits around 9.7k users (on a test
box with 16GB RAM and not quite enough IO), the system ends up running
>99% in system time, with every CPU on the same anon_vma lock in the
pageout code.

With these changes, AIM7 hits the cross-over point around 29.7k users.
This happens with ~99% IO wait time, there never seems to be any spike in
system time.  The anon_vma lock contention appears to be resolved.

[akpm@linux-foundation.org: cleanups]
Signed-off-by: NRik van Riel <riel@redhat.com>
Cc: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>
Cc: Larry Woodman <lwoodman@redhat.com>
Cc: Lee Schermerhorn <Lee.Schermerhorn@hp.com>
Cc: Minchan Kim <minchan.kim@gmail.com>
Cc: Andrea Arcangeli <aarcange@redhat.com>
Cc: Hugh Dickins <hugh.dickins@tiscali.co.uk>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

5beb4930

bitops: rename for_each_bit() to for_each_set_bit() · 984b3f57

由 Akinobu Mita 提交于 3月 05, 2010

Rename for_each_bit to for_each_set_bit in the kernel source tree.  To
permit for_each_clear_bit(), should that ever be added.

The patch includes a macro to map the old for_each_bit() onto the new
for_each_set_bit().  This is a (very) temporary thing to ease the migration.

[akpm@linux-foundation.org: add temporary for_each_bit()]
Suggested-by: NAlexey Dobriyan <adobriyan@gmail.com>
Suggested-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NAkinobu Mita <akinobu.mita@gmail.com>
Cc: "David S. Miller" <davem@davemloft.net>
Cc: Russell King <rmk@arm.linux.org.uk>
Cc: David Woodhouse <dwmw2@infradead.org>
Cc: Artem Bityutskiy <dedekind@infradead.org>
Cc: Stephen Rothwell <sfr@canb.auug.org.au>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

984b3f57

06 3月, 2010 1 次提交

x86: fix mtrr missing kernel-doc · 6c550ee4

由 Randy Dunlap 提交于 3月 05, 2010

Fix missing kernel-doc notation in mtrr/main.c:

Warning(arch/x86/kernel/cpu/mtrr/main.c:152): No description found for parameter 'info'
Signed-off-by: NRandy Dunlap <randy.dunlap@oracle.com>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

6c550ee4

04 3月, 2010 5 次提交

x86: Issue at least one memory barrier in stop_machine_text_poke() · e5a11016

由 Masami Hiramatsu 提交于 3月 03, 2010

Fix stop_machine_text_poke() to issue smp_mb() before exiting
waiting loop, and use cpu_relax() for waiting.

Changes in v2:
 - Don't use ACCESS_ONCE().
Signed-off-by: NMasami Hiramatsu <mhiramat@redhat.com>
Acked-by: NMathieu Desnoyers <mathieu.desnoyers@efficios.com>
Cc: systemtap <systemtap@sources.redhat.com>
Cc: DLE <dle-develop@lists.sourceforge.net>
Cc: Jason Baron <jbaron@redhat.com>
LKML-Reference: <20100304033850.3819.74590.stgit@localhost6.localdomain6>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

e5a11016

Simplify failure exits in s390/hypfs fill_super() · f1771ffa

由 Al Viro 提交于 1月 25, 2010

->kill_sb() will be called after any failure exit, so no need
to duplicate what it can do.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

f1771ffa

A
Switch may_open() and break_lease() to passing O_... · 8737c930
由 Al Viro 提交于 12月 24, 2009
```
... instead of mixing FMODE_ and O_
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
8737c930

sparc64: Make prom entry spinlock NMI safe. · 8a4fd1e4

由 David S. Miller 提交于 3月 03, 2010

If we do something like try to print to the OF console from an NMI
while we're already in OpenFirmware, we'll deadlock on the spinlock.

Use a raw spinlock and disable NMIs when we take it.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

8a4fd1e4

sparc64: Kill off old sys_perfctr system call and state. · c7d5a005

由 David S. Miller 提交于 3月 03, 2010

People should be using the perf events interfaces, and
the way these system call facilities used the %pcr conflicts
with the usage of the NMI watchdog and perf events.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

c7d5a005

03 3月, 2010 8 次提交

D
sparc: Update defconfigs. · 6c5ae5b2
由 David S. Miller 提交于 3月 03, 2010
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
6c5ae5b2
D
sparc: Provide io{read,write}{16,32}be(). · 1bff4dbb
由 David S. Miller 提交于 3月 03, 2010
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
1bff4dbb

USB: atmel uaba: Adding invert vbus_pin · 640e95ab

由 Eirik Aanonsen 提交于 2月 05, 2010

Adding vbus_pin_inverted so that the usb detect pin can be active high
or low depending on HW implementation also replaced the
gpio_get_value(udc->vbus_pin); with a call to vbus_is_present(udc); This
allows the driver to be loaded and save about 0,15W on the consumption.
Signed-off-by: NEirik Aanonsen <eaa@wprmedical.com>
Signed-off-by: NGreg Kroah-Hartman <gregkh@suse.de>

640e95ab

arm: defconfig: rx51: enable phonet and g_nokia · bce54fed

由 Felipe Balbi 提交于 1月 05, 2010

trivial patch enabling g_nokia on rx51_defconfig.
Signed-off-by: NFelipe Balbi <felipe.balbi@nokia.com>
Signed-off-by: NGreg Kroah-Hartman <gregkh@suse.de>

bce54fed

M
USB: MXC: add platform resources for i.MX21 USB host controller. · 4e0fa90d
由 Martin Fuzzey 提交于 11月 21, 2009
```
Signed-off-by: NMartin Fuzzey <mfuzzey@gmail.com>
Signed-off-by: NGreg Kroah-Hartman <gregkh@suse.de>
```
4e0fa90d

USB: MXC: use DMA_BIT_MASK macro rather than hardcoded constants. · 3eb352c7

由 Martin Fuzzey 提交于 11月 21, 2009

Also fixes tab/space issue causing checkpatch to complain.
Signed-off-by: NMartin Fuzzey <mfuzzey@gmail.com>
Signed-off-by: NGreg Kroah-Hartman <gregkh@suse.de>

3eb352c7

USB: MXC: Add i.MX21 specific USB host controller driver. · 23d3e7a6

由 Martin Fuzzey 提交于 11月 21, 2009

This driver is a Full / Low speed only USB host for the i.MX21.
Signed-off-by: NMartin Fuzzey <mfuzzey@gmail.com>
Signed-off-by: NGreg Kroah-Hartman <gregkh@suse.de>

23d3e7a6

DMAENGINE: COH 901 318 descriptor pool refactoring · b87108a7

由 Linus Walleij 提交于 3月 02, 2010

This centralize some spread-out initialization of descriptors into
one function and cleans up the error paths.
Signed-off-by: NLinus Walleij <linus.walleij@stericsson.com>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

b87108a7

02 3月, 2010 5 次提交

DaVinci DM365: Adding support for SPI EEPROM · 5f19daa1

由 Sandeep Paulraj 提交于 2月 01, 2010

The DM365 Spectrum Digital EVM comes with an EEPROM
connected to SPI0.
This patch adds support for the SPI EEPROM.
Signed-off-by: NSandeep Paulraj <s-paulraj@ti.com>
Signed-off-by: NKevin Hilman <khilman@deeprootsystems.com>

5f19daa1

DaVinci DM365: Adding DM365 SPI support · a3e13e89

由 Sandeep Paulraj 提交于 2月 01, 2010

This patch adds SPI init for DM365.
It does the following
1) Initializes SPI0
2) Defines resources to be used by SPI0
3) Adds platform data for SPI0
Signed-off-by: NSandeep Paulraj <s-paulraj@ti.com>
Signed-off-by: NKevin Hilman <khilman@deeprootsystems.com>

a3e13e89

DaVinci DM355: Modifications to DM355 SPI support · 15e86585

由 Sandeep Paulraj 提交于 2月 01, 2010

This patch does the following

1) Minor change to the SPI clocks making it
similar to DM365.
2) Changing the interrupt used by SPI0
3) Adding EDMA resources that can be used by SPI0
4) Adding platform specific data.
Signed-off-by: NSandeep Paulraj <s-paulraj@ti.com>
Signed-off-by: NKevin Hilman <khilman@deeprootsystems.com>

15e86585

DaVinci: SPI: Adding header file for SPI support. · 8e2a0013

由 Sandeep Paulraj 提交于 2月 01, 2010

This patch adds "spi.h" header file that will be used by board and
architecture specific code.
Signed-off-by: NSandeep Paulraj <s-paulraj@ti.com>
Signed-off-by: NKevin Hilman <khilman@deeprootsystems.com>

8e2a0013

davinci: dm646x: CDCE clocks: davinci_clk converted to clk_lookup · c564191b

由 Kevin Hilman 提交于 1月 11, 2010

Remove unneeded 'struct davinci_clk' wrapper around 'struct
clk_lookup' and use clk_lookup directly.
Signed-off-by: NKevin Hilman <khilman@deeprootsystems.com>

c564191b

01 3月, 2010 18 次提交

KVM: x86: Add KVM_CAP_X86_ROBUST_SINGLESTEP · d2be1651

由 Jan Kiszka 提交于 2月 23, 2010

This marks the guest single-step API improvement of 94fe45da and
91586a3b with a capability flag to allow reliable detection by user
space.
Signed-off-by: NJan Kiszka <jan.kiszka@siemens.com>
Cc: stable@kernel.org (2.6.33)
Signed-off-by: NAvi Kivity <avi@redhat.com>

d2be1651

KVM: VMX: Update instruction length on intercepted BP · c573cd22

由 Jan Kiszka 提交于 2月 23, 2010

We intercept #BP while in guest debugging mode. As VM exits due to
intercepted exceptions do not necessarily come with valid
idt_vectoring, we have to update event_exit_inst_len explicitly in such
cases. At least in the absence of migration, this ensures that
re-injections of #BP will find and use the correct instruction length.
Signed-off-by: NJan Kiszka <jan.kiszka@siemens.com>
Cc: stable@kernel.org (2.6.32, 2.6.33)
Signed-off-by: NAvi Kivity <avi@redhat.com>

c573cd22

KVM: Fix emulate_sys[call, enter, exit]()'s fault handling · e54cfa97

由 Takuya Yoshikawa 提交于 2月 18, 2010

This patch fixes emulate_syscall(), emulate_sysenter() and
emulate_sysexit() to handle injected faults properly.

Even though original code injects faults in these functions,
we cannot handle these unless we use the different return
value from the UNHANDLEABLE case. So this patch use X86EMUL_*
codes instead of -1 and 0 and makes x86_emulate_insn() to
handle these propagated faults.

Be sure that, in x86_emulate_insn(), goto cannot_emulate and
goto done with rc equals X86EMUL_UNHANDLEABLE have same effect.
Signed-off-by: NTakuya Yoshikawa <yoshikawa.takuya@oss.ntt.co.jp>
Signed-off-by: NGleb Natapov <gleb@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

e54cfa97

KVM: Fix segment descriptor loading · c697518a

由 Gleb Natapov 提交于 2月 18, 2010

Add proper error and permission checking. This patch also change task
switching code to load segment selectors before segment descriptors, like
SDM requires, otherwise permission checking during segment descriptor
loading will be incorrect.

Cc: stable@kernel.org (2.6.33, 2.6.32)
Signed-off-by: NGleb Natapov <gleb@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

c697518a

KVM: Fix load_guest_segment_descriptor() to inject page fault · 6f550484

由 Takuya Yoshikawa 提交于 2月 18, 2010

This patch injects page fault when reading descriptor in
load_guest_segment_descriptor() fails with FAULT.

Effects of this injection: This function is used by
kvm_load_segment_descriptor() which is necessary for the
following instructions:

 - mov seg,r/m16
 - jmp far
 - pop ?s

This patch makes it possible to emulate the page faults
generated by these instructions. But be sure that unless
we change the kvm_load_segment_descriptor()'s ret value
propagation this patch has no effect.
Signed-off-by: NTakuya Yoshikawa <yoshikawa.takuya@oss.ntt.co.jp>
Signed-off-by: NGleb Natapov <gleb@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

6f550484

KVM: x86 emulator: Forbid modifying CS segment register by mov instruction · 8b9f4414

由 Gleb Natapov 提交于 2月 18, 2010

Inject #UD if guest attempts to do so. This is in accordance to Intel
SDM.

Cc: stable@kernel.org (2.6.33, 2.6.32)
Signed-off-by: NGleb Natapov <gleb@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

8b9f4414

KVM: Convert i8254/i8259 locks to raw_spinlocks · fa8273e9

由 Thomas Gleixner 提交于 2月 17, 2010

The i8254/i8259 locks need to be real spinlocks on preempt-rt. Convert
them to raw_spinlock. No change for !RT kernels.
Signed-off-by: NThomas Gleixner <tglx@linutronix.de>
Signed-off-by: NAvi Kivity <avi@redhat.com>

fa8273e9

KVM: x86 emulator: disallow opcode 82 in 64-bit mode · e424e191

由 Gleb Natapov 提交于 2月 11, 2010

Instructions with opcode 82 are not valid in 64 bit mode.
Signed-off-by: NGleb Natapov <gleb@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

e424e191

KVM: x86 emulator: code style cleanup · 1d327eac

由 Wei Yongjun 提交于 2月 11, 2010

Just remove redundant semicolon.
Signed-off-by: NWei Yongjun <yjwei@cn.fujitsu.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

1d327eac

KVM: x86 emulator: Add LOCK prefix validity checking · d380a5e4