提交 · 30ddbdb03339fc62480ddbff800a44066bb14455 · openeuler / Kernel

20 3月, 2006 23 次提交

D
[SPARC64]: Add Niagara init-store twin-load ASI defines. · 30ddbdb0
由 David S. Miller 提交于 2月 04, 2006
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
30ddbdb0
D
[SPARC64]: Add 'hypervisor' to ultra_tlb_type enumeration. · 1633a53c
由 David S. Miller 提交于 2月 04, 2006
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
1633a53c
D
[SPARC64]: SUN4V hypervisor interface defines. · 766f861f
由 David S. Miller 提交于 2月 04, 2006
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
766f861f

[SPARC64]: Refine register window trap handling. · 314ef685

由 David S. Miller 提交于 2月 04, 2006

When saving and restoing trap state, do the window spill/fill
handling inline so that we never trap deeper than 2 trap levels.
This is important for chips like Niagara.

The window fixup code is massively simplified, and many more
improvements are now possible.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

314ef685

[SPARC64]: Add explicit register args to trap state loading macros. · ffe483d5

由 David S. Miller 提交于 2月 02, 2006

This, as well as making the code cleaner, allows a simplification in
the TSB miss handling path.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

ffe483d5

[SPARC64]: Refine code sequences to get the cpu id. · 92704a1c

由 David S. Miller 提交于 2月 26, 2006

On uniprocessor, it's always zero for optimize that.

On SMP, the jmpl to the stub kills the return address stack in the cpu
branch prediction logic, so expand the code sequence inline and use a
code patching section to fix things up.  This also always better and
explicit register selection, which will be taken advantage of in a
future changeset.

The hard_smp_processor_id() function is big, so do not inline it.

Fix up tests for Jalapeno to also test for Serrano chips too.  These
tests want "jbus Ultra-IIIi" cases to match, so that is what we should
test for.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

92704a1c

[SPARC64]: Correctable ECC errors cannot occur at trap level > 0. · 7bec08e3

由 David S. Miller 提交于 2月 02, 2006

The are distrupting, which by the sparc v9 definition means they
can only occur when interrupts are enabled in the %pstate register.
This never occurs in any of the trap handling code running at
trap levels > 0.

So just mark it as an unexpected trap.

This allows us to kill off the cee_stuff member of struct thread_info.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

7bec08e3

[SPARC64]: Access TSB with physical addresses when possible. · 517af332

由 David S. Miller 提交于 2月 01, 2006

This way we don't need to lock the TSB into the TLB.
The trick is that every TSB load/store is registered into
a special instruction patch section.  The default uses
virtual addresses, and the patch instructions use physical
address load/stores.

We can't do this on all chips because only cheetah+ and later
have the physical variant of the atomic quad load.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

517af332

D
[SPARC64]: Kill out-of-date commentary in asm-sparc64/tsb.h · b0fd4e49
由 David S. Miller 提交于 1月 31, 2006
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
b0fd4e49

[SPARC64]: Fix race in LOAD_PER_CPU_BASE() · 86b81868

由 David S. Miller 提交于 1月 31, 2006

Since we use %g5 itself as a temporary, it can get clobbered
if we take an interrupt mid-stream and thus cause end up with
the final %g5 value too early as a result of rtrap processing.

Set %g5 at the very end, atomically, to avoid this problem.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

86b81868

D
[SPARC64]: Increase swapper_tsb size to 32K. · 2f7ee7c6
由 David S. Miller 提交于 1月 31, 2006
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
2f7ee7c6

[SPARC64]: Kill sole argument passed to setup_tba(). · a8b900d8

由 David S. Miller 提交于 1月 31, 2006

No longer used, and move extern declaration to a header file.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

a8b900d8

[SPARC64]: Fix incorrect TSB lock bit handling. · 4753eb2a

由 David S. Miller 提交于 1月 31, 2006

The TSB_LOCK_BIT define is actually a special
value shifted down by 32-bits for the assembler
code macros.

In C code, this isn't what we want.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

4753eb2a

D
[SPARC64]: Preload TSB entries from update_mmu_cache(). · b70c0fa1
由 David S. Miller 提交于 1月 31, 2006
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
b70c0fa1

[SPARC64]: Dynamically grow TSB in response to RSS growth. · bd40791e

由 David S. Miller 提交于 1月 31, 2006

As the RSS grows, grow the TSB in order to reduce the likelyhood
of hash collisions and thus poor hit rates in the TSB.

This definitely needs some serious tuning.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

bd40791e

[SPARC64]: Add infrastructure for dynamic TSB sizing. · 98c5584c

由 David S. Miller 提交于 1月 31, 2006

This also cleans up tsb_context_switch().  The assembler
routine is now __tsb_context_switch() and the former is
an inline function that picks out the bits from the mm_struct
and passes it into the assembler code as arguments.

setup_tsb_parms() computes the locked TLB entry to map the
TSB.  Later when we support using the physical address quad
load instructions of Cheetah+ and later, we'll simply use
the physical address for the TSB register value and set
the map virtual and PTE both to zero.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

98c5584c

[SPARC64]: TSB refinements. · 09f94287

由 David S. Miller 提交于 1月 31, 2006

Move {init_new,destroy}_context() out of line.

Do not put huge pages into the TSB, only base page size translations.
There are some clever things we could do here, but for now let's be
correct instead of fancy.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

09f94287

[SPARC64]: Elminate all usage of hard-coded trap globals. · 56fb4df6

由 David S. Miller 提交于 2月 26, 2006

UltraSPARC has special sets of global registers which are switched to
for certain trap types.  There is one set for MMU related traps, one
set of Interrupt Vector processing, and another set (called the
Alternate globals) for all other trap types.

For what seems like forever we've hard coded the values in some of
these trap registers.  Some examples include:

1) Interrupt Vector global %g6 holds current processors interrupt
   work struct where received interrupts are managed for IRQ handler
   dispatch.

2) MMU global %g7 holds the base of the page tables of the currently
   active address space.

3) Alternate global %g6 held the current_thread_info() value.

Such hardcoding has resulted in some serious issues in many areas.
There are some code sequences where having another register available
would help clean up the implementation.  Taking traps such as
cross-calls from the OBP firmware requires some trick code sequences
wherein we have to save away and restore all of the special sets of
global registers when we enter/exit OBP.

We were also using the IMMU TSB register on SMP to hold the per-cpu
area base address, which doesn't work any longer now that we actually
use the TSB facility of the cpu.

The implementation is pretty straight forward.  One tricky bit is
getting the current processor ID as that is different on different cpu
variants.  We use a stub with a fancy calling convention which we
patch at boot time.  The calling convention is that the stub is
branched to and the (PC - 4) to return to is in register %g1.  The cpu
number is left in %g6.  This stub can be invoked by using the
__GET_CPUID macro.

We use an array of per-cpu trap state to store the current thread and
physical address of the current address space's page tables.  The
TRAP_LOAD_THREAD_REG loads %g6 with the current thread from this
table, it uses __GET_CPUID and also clobbers %g1.

TRAP_LOAD_IRQ_WORK is used by the interrupt vector processing to load
the current processor's IRQ software state into %g6.  It also uses
__GET_CPUID and clobbers %g1.

Finally, TRAP_LOAD_PGD_PHYS loads the physical address base of the
current address space's page tables into %g7, it clobbers %g1 and uses
__GET_CPUID.

Many refinements are possible, as well as some tuning, with this stuff
in place.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

56fb4df6

[SPARC64]: Kill pgtable quicklists and use SLAB. · 3c936465

由 David S. Miller 提交于 1月 31, 2006

Taking a nod from the powerpc port.

With the per-cpu caching of both the page allocator and SLAB, the
pgtable quicklist scheme becomes relatively silly and primitive.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

3c936465

[SPARC64]: No need to D-cache color page tables any longer. · 05e28f9d

由 David S. Miller 提交于 1月 31, 2006

Unlike the virtual page tables, the new TSB scheme does not
require this ugly hack.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

05e28f9d

[SPARC64]: Move away from virtual page tables, part 1. · 74bf4312

由 David S. Miller 提交于 1月 31, 2006

We now use the TSB hardware assist features of the UltraSPARC
MMUs.

SMP is currently knowingly broken, we need to find another place
to store the per-cpu base pointers.  We hid them away in the TSB
base register, and that obviously will not work any more :-)

Another known broken case is non-8KB base page size.

Also noticed that flush_tlb_all() is not referenced anywhere, only
the internal __flush_tlb_all() (local cpu only) is used by the
sparc64 port, so we can get rid of flush_tlb_all().

The kernel gets it's own 8KB TSB (swapper_tsb) and each address space
gets it's own private 8K TSB.  Later we can add code to dynamically
increase the size of per-process TSB as the RSS grows.  An 8KB TSB is
good enough for up to about a 4MB RSS, after which the TSB starts to
incur many capacity and conflict misses.

We even accumulate OBP translations into the kernel TSB.

Another area for refinement is large page size support.  We could use
a secondary address space TSB to handle those.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

74bf4312

[TG3]: 40-bit DMA workaround part 2 · 4a29cc2e

由 Michael Chan 提交于 3月 19, 2006

The 40-bit DMA workaround recently implemented for 5714, 5715, and
5780 needs to be expanded because there may be other tg3 devices
behind the EPB Express to PCIX bridge in the 5780 class device.

For example, some 4-port card or mother board designs have 5704 behind
the 5714.

All devices behind the EPB require the 40-bit DMA workaround.

Thanks to Chris Elmquist again for reporting the problem and testing
the patch.
Signed-off-by: NMichael Chan <mchan@broadcom.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

4a29cc2e

[AX.25]: Fix potencial memory hole. · c7c694d1

由 Ralf Baechle DL5RB 提交于 3月 19, 2006

If the AX.25 dialect chosen by the sysadmin is set to DAMA master / 3
(or DAMA slave / 2, if CONFIG_AX25_DAMA_SLAVE=n) ax25_kick() will fall
through the switch statement without calling ax25_send_iframe() or any
other function that would eventually free skbn thus leaking the packet.

Fix by restricting the sysctl inferface to allow only actually supported
AX.25 dialects.

The system administration mistake needed for this to happen is rather
unlikely, so this is an uncritical hole.

Coverity #651.
Signed-off-by: NRalf Baechle DL5RB <ralf@linux-mips.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

c7c694d1

19 3月, 2006 5 次提交

[MIPS] Sibyte: Fix race in sb1250_gettimeoffset(). · a904f747

由 Ralf Baechle 提交于 3月 15, 2006

    
From Dave Johnson <djohnson+linuxmips@sw.starentnetworks.com>:
    
sb1250_gettimeoffset() simply reads the current cpu 0 timer remaining
value, however once this counter reaches 0 and the interrupt is raised,
it immediately resets and begins to count down again.
    
If sb1250_gettimeoffset() is called on cpu 1 via do_gettimeofday() after
the timer has reset but prior to cpu 0 processing the interrupt and
taking write_seqlock() in timer_interrupt() it will return a full value
(or close to it) causing time to jump backwards 1ms. Once cpu 0 handles
the interrupt and timer_interrupt() gets far enough along it will jump
forward 1ms.
    
Fix this problem by implementing mips_hpt_*() on sb1250 using a spare
timer unrelated to the existing periodic interrupt timers. It runs at
1Mhz with a full 23bit counter.  This eliminated the custom
do_gettimeoffset() for sb1250 and allowed use of the generic
fixed_rate_gettimeoffset() using mips_hpt_*() and timerhi/timerlo.
Signed-off-by: NRalf Baechle <ralf@linux-mips.org>

a904f747

[MIPS] Sibyte: Fix M_SCD_TIMER_INIT and M_SCD_TIMER_CNT wrong field width. · a77f1242

由 Ralf Baechle 提交于 3月 14, 2006

    
From Dave Johnson <djohnson+linuxmips@sw.starentnetworks.com>:
    
Field width should be 23 bits not 20 bits.
Signed-off-by: NRalf Baechle <ralf@linux-mips.org>

a77f1242

[MIPS] Work around bad code generation for <asm/io.h>. · 966f4406

由 Ralf Baechle 提交于 3月 15, 2006

If a call to set_io_port_base() was being followed by usage of
mips_io_port_base in the same function gcc was possibly using the old
value due to some clever abuse of const. Adding a barrier will keep
the optimization and result in correct code with latest gcc.
Signed-off-by: NRalf Baechle <ralf@linux-mips.org>

966f4406

[MIPS] local_r4k_flush_cache_page fix · de62893b

由 Atsushi Nemoto 提交于 3月 13, 2006

    
If dcache_size != icache_size or dcache_size != scache_size, or
set-associative cache, icache/scache does not flushed properly.  Make
blast_?cache_page_indexed() masks its index value correctly.  Also,
use physical address for physically indexed pcache/scache.
Signed-off-by: NAtsushi Nemoto <anemo@mba.ocn.ne.jp>
Signed-off-by: NRalf Baechle <ralf@linux-mips.org>

de62893b

[MIPS] SB1: Fix interrupt disable hazard. · a3c4946d

由 Ralf Baechle 提交于 3月 13, 2006

    
The SB1 core has a three cycle interrupt disable hazard but we were
wrongly treating it as fully interlocked.
Signed-off-by: NRalf Baechle <ralf@linux-mips.org>

a3c4946d

18 3月, 2006 1 次提交

[NET]: Fix race condition in sk_wait_event(). · 265a9285

由 Alexey Kuznetsov 提交于 3月 17, 2006

It is broken, the condition is checked out of socket lock. It is
wonderful the bug survived for so long time.

[ This fixes bugzilla #6233:
  race condition in tcp_sendmsg when connection became established ]
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

265a9285

16 3月, 2006 2 次提交

[PATCH] powerpc: properly configure DDR/P5IOC children devs · 92eb4602

由 John Rose 提交于 3月 14, 2006

The dynamic add path for PCI Host Bridges can fail to configure children
adapters under P5IOC controllers.  It fails to properly fixup bus/device
resources, and it fails to properly enable EEH.  Both of these steps
need to occur before any children devices are enabled in
pci_bus_add_devices().
Signed-off-by: NJohn Rose <johnrose@austin.ibm.com>
Signed-off-by: NPaul Mackerras <paulus@samba.org>

92eb4602

[ARM] 3364/1: [cleanup] warning fix - definitions for enable_hlt and disable_hlt · dabaeff0

由 Ben Dooks 提交于 3月 15, 2006

Patch from Ben Dooks

The enable_hlt and disable_hlt should be declared in
include/asm/setup.h. This fixes sparse errors from
arch/arm/kernel/process.c
Signed-off-by: NBen Dooks <ben-linux@fluff.org>
Signed-off-by: NRussell King <rmk+kernel@arm.linux.org.uk>

dabaeff0

13 3月, 2006 1 次提交

[ARM] iwmmxt thread state alignment · cdaabbd7

由 Russell King 提交于 3月 12, 2006

This patch removes the reliance of iwmmxt on hand coded alignments.
Since thread_info is always 8K aligned, specifying that fpstate is
8-byte aligned achieves the same effect without needing to resort
to hand coded alignments.
Signed-off-by: NRussell King <rmk+kernel@arm.linux.org.uk>

cdaabbd7

12 3月, 2006 2 次提交

[PATCH] remove __put_task_struct_cb export again · 7cd9013b

由 Christoph Hellwig 提交于 3月 11, 2006

The patch '[PATCH] RCU signal handling' [1] added an export for
__put_task_struct_cb, a put_task_struct helper newly introduced in that
patch.  But the put_task_struct couldn't be used modular previously as
__put_task_struct wasn't exported.  There are not callers of it in modular
code, and it shouldn't be exported because we don't want drivers to hold
references to task_structs.

This patch removes the export and folds __put_task_struct into
__put_task_struct_cb as there's no other caller.

[1] http://www2.kernel.org/git/gitweb.cgi?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=e56d090310d7625ecb43a1eeebd479f04affb48bSigned-off-by: NChristoph Hellwig <hch@lst.de>
Acked-by: NPaul E. McKenney <paulmck@us.ibm.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

7cd9013b

[PATCH] ext3: ext3_symlink should use GFP_NOFS allocations inside · 0adb25d2

由 Kirill Korotaev 提交于 3月 11, 2006

This patch fixes illegal __GFP_FS allocation inside ext3 transaction in
ext3_symlink(). Such allocation may re-enter ext3 code from
try_to_free_pages. But JBD/ext3 code keeps a pointer to current journal
handle in task_struct and, hence, is not reentrable.

This bug led to "Assertion failure in journal_dirty_metadata()" messages.

http://bugzilla.openvz.org/show_bug.cgi?id=115Signed-off-by: NAndrey Savochkin <saw@saw.sw.com.sg>
Signed-off-by: NKirill Korotaev <dev@openvz.org>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

0adb25d2

10 3月, 2006 4 次提交

[PATCH] slab: Node rotor for freeing alien caches and remote per cpu pages. · 8fce4d8e

由 Christoph Lameter 提交于 3月 09, 2006

The cache reaper currently tries to free all alien caches and all remote
per cpu pages in each pass of cache_reap. For a machines with large number
of nodes (such as Altix) this may lead to sporadic delays of around ~10ms.
Interrupts are disabled while reclaiming creating unacceptable delays.

This patch changes that behavior by adding a per cpu reap_node variable.
Instead of attempting to free all caches, we free only one alien cache and
the per cpu pages from one remote node. That reduces the time spend in
cache_reap. However, doing so will lengthen the time it takes to
completely drain all remote per cpu pagesets and all alien caches. The
time needed will grow with the number of nodes in the system. All caches
are drained when they overflow their respective capacity. So the drawback
here is only that a bit of memory may be wasted for awhile longer.

Details:

1. Rename drain_remote_pages to drain_node_pages to allow the specification
of the node to drain of pcp pages.

2. Add additional functions init_reap_node, next_reap_node for NUMA
that manage a per cpu reap_node counter.

3. Add a reap_alien function that reaps only from the current reap_node.

For us this seems to be a critical issue. Holdoffs of an average of ~7ms
cause some HPC benchmarks to slow down significantly. F.e. NAS parallel
slows down dramatically. NAS parallel has a 12-16 seconds runtime w/o rotor
compared to 5.8 secs with the rotor patches. It gets down to 5.05 secs with
the additional interrupt holdoff reductions.
Signed-off-by: NChristoph Lameter <clameter@sgi.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

8fce4d8e

[PATCH] m68k: fix cmpxchg compile errors if CONFIG_RMW_INSNS=n · 7b61fcda

由 Roman Zippel 提交于 3月 09, 2006

We require that all archs implement atomic_cmpxchg(), for the generic
version of atomic_add_unless().
Signed-off-by: NRoman Zippel <zippel@linux-m68k.org>
Cc: Geert Uytterhoeven <geert@linux-m68k.org>
Cc: Adrian Bunk <bunk@stusta.de>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

7b61fcda

[PATCH] mtd: 64 bit fixes · 0ef675d4

由 Atsushi Nemoto 提交于 3月 09, 2006

Fix some bugs in mtd/jffs2 on 64bit platform.

The MEMGETBADBLOCK/MEMSETBADBLOCK ioctl are not listed in compat_ioctl.h.

And some variables in jffs2 are declared as uint32_t but used to hold
size_t values.
Signed-off-by: NAtsushi Nemoto <anemo@mba.ocn.ne.jp>
Cc: Thomas Gleixner <tglx@linutronix.de>
Acked-by: NDavid Woodhouse <dwmw2@infradead.org>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

0ef675d4

[MIPS] Undefine scr_writew and scr_readw in <asm/vga.h>. · fd2a4f11

由 Ralf Baechle 提交于 3月 08, 2006

This is gluing the build of cirrusfb but really the mess that would need
cleaning and fixing is <video/vga.h> and <linux/vt_buffer.h> ...
Signed-off-by: NRalf Baechle <ralf@linux-mips.org>

fd2a4f11

09 3月, 2006 2 次提交

[PATCH] i386: port ATI timer fix from x86_64 to i386 II · f9262c12

由 Andi Kleen 提交于 3月 08, 2006

ATI chipsets tend to generate double timer interrupts for the local APIC
timer when both the 8254 and the IO-APIC timer pins are enabled.  This is
because they route it to both and the result is anded together and the CPU
ends up processing it twice.

This patch changes check_timer to disable the 8254 routing for interrupt 0.

I think it would be safe on all chipsets actually (i tested it on a couple
and it worked everywhere) and Windows seems to do it in a similar way, but
to be conservative this patch only enables this mode on ATI (and adds
options to enable/disable too)

Ported over from a similar x86-64 change.

I reused the ACPI earlyquirk infrastructure for the ATI bridge check, but
tweaked it a bit to work even without ACPI.

Inspired by a patch from Chuck Ebbert, but redone.

Cc: Chuck Ebbert <76306.1226@compuserve.com>
Cc: "Brown, Len" <len.brown@intel.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

f9262c12

[PATCH] fix kexec asm · 2ec5e3a8

由 Michael Matz 提交于 3月 07, 2006

While testing kexec and kdump we hit problems where the new kernel would
freeze or instantly reboot.  The easiest way to trigger it was to kexec a
kernel compiled for CONFIG_M586 on an athlon cpu.  Compiling for CONFIG_MK7
instead would work fine.

The patch fixes a few problems with the kexec inline asm.
Signed-off-by: NChris Mason <mason@suse.com>
Acked-by: N"Eric W. Biederman" <ebiederm@xmission.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

2ec5e3a8

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功