提交 · 05b3cbd8bb98736387df8a2e1efe311b1fb4d2ad · openeuler / Kernel

12 1月, 2006 40 次提交

[PATCH] x86_64: Early initialization of cpu_to_node · 05b3cbd8

由 Ravikiran Thirumalai 提交于 1月 11, 2006

Patch enables early intialization of cpu_to_node.
apicid_to_node is built by reading the SRAT table, from acpi_numa_init with
ACPI_NUMA and k8_scan_nodes with K8_NUMA.
x86_cpu_to_apicid is built by parsing the ACPI MADT table, from acpi_boot_init.
We combine these two tables and setup cpu_to_node.

Early intialization helps the static per_cpu_areas in getting pages from
correct node.

Change since last release:
Do not initialize early init_cpu_to_node for faking node cases.

Patch tested on TYAN dual core 4P board with K8 only, ACPI_NUMA.
Tested on EM64T NUMA. Also tested with numa=off, numa=fake, and  running
a kernel compiled with NUMA on a regular EM64 2 way SMP.
Signed-off-by: NAlok N Kataria <alokk@calsoftinc.com>
Signed-off-by: NRavikiran Thirumalai <kiran@scalex86.org>
Signed-off-by: NShai Fultheim <shai@scalex86.org>
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

05b3cbd8

[PATCH] x86_64: Fix up white space in time.c · 0b91317e

由 Andi Kleen 提交于 1月 11, 2006

No functional changes.

And remove one redundant prototype.
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

0b91317e

[PATCH] x86_64: Use standard __always_inline in vsyscall.c · 2c8bc944

由 Andi Kleen 提交于 1月 11, 2006

Replacing the old home brewn __force_inline.
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

2c8bc944

[PATCH] i386: Replace broken serialize_cpu in microcode driver with correct sync_core · 487472bc

由 Andi Kleen 提交于 1月 11, 2006

Passing random input values in eax to cpuid is not a good idea
because the CPU will GPF for unknown ones.
Use the correct x86-64 version that exists for a longer time too.
This also adds a memory barrier to prevent the optimizer from
reordering.

Cc: tigran@veritas.com
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

487472bc

[PATCH] x86_64: On Intel CPUs don't do an additional CPU sync before RDTSC · c818a181

由 Andi Kleen 提交于 1月 11, 2006

RDTSC serialization using cpuid is not needed for Intel platforms.
This increases gettimeofday performance.

Cc: vojtech@suse.cz
Cc: rohit.seth@intel.com
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

c818a181

[PATCH] x86_64: Support alternative() in vsyscalls · 7f6c5b04

由 Andi Kleen 提交于 1月 11, 2006

The real vsyscall .text addresses are not mapped when the alternative()
replacement runs early, so use some black magic to access them using
the direct mapping.
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

7f6c5b04

[PATCH] x86_64: Support alternative() with a output argument. · 6e54d95f

由 Andi Kleen 提交于 1月 11, 2006

Needed for follow on patches
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

6e54d95f

[PATCH] x86_64: Don't try to synchronize the TSC over CPUs on Intel CPUs at boot. · 737c5c3b

由 Andi Kleen 提交于 1月 11, 2006

They already do this in hardware and the Linux algorithm
actually adds errors.

Cc: mingo@elte.hu
Cc: rohit.seth@intel.com
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

737c5c3b

[PATCH] x86_64: Fix compile error with !CONFIG_COMPAT · 3c021751

由 Andi Kleen 提交于 1月 11, 2006

cpumask.h wasn't included implicitely into proto.h in this case.
Just move it over to smp.h
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

3c021751

[PATCH] x86_64: x86_64 write apic id fix · b9d1e4bd

由 Vivek Goyal 提交于 1月 11, 2006

o Apic id is in most significant 8 bits of APIC_ID register. Current code
  is trying to write apic id to least significant 8 bits. This patch fixes
  it.

o This fix enables booting uni kdump capture kernel on a cpu with non-zero
  apic id.
Signed-off-by: NVivek Goyal <vgoyal@in.ibm.com>
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

b9d1e4bd

[PATCH] x86_64: Remove duplicate exports · aea9fca1

由 Brian Gerst 提交于 1月 11, 2006

Remove exports that are already exported from the object's source file.
Signed-off-by: NBrian Gerst <bgerst@didntduck.org>
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

aea9fca1

[PATCH] x86_64: unexport pci_*_consistent · e3602824

由 Brian Gerst 提交于 1月 11, 2006

These functions are inlines and shouldn't be exported.
Signed-off-by: NBrian Gerst <bgerst@didntduck.org>
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

e3602824

[PATCH] x86_64: Remove unused apic_write_atomic · 2d0db401

由 Andi Kleen 提交于 1月 11, 2006

This function is never used for x86_64.
Signed-off-by: NBrian Gerst <bgerst@didntduck.org>
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

2d0db401

[PATCH] x86_64: Make the cpu_*_maps in kernel/sched.c read mostly · 4cef0c61

由 Andi Kleen 提交于 1月 11, 2006

They are referred to often so avoid potential false sharing for them.
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

4cef0c61

[PATCH] i386: make pci_map_single/pci_map_sg warn for zero length. · fd78f117

由 Andi Kleen 提交于 1月 11, 2006

As suggested by Linus.

This catches driver bugs that could cause corruption on IOMMU architectures.

Also I converted the BUGs to out_of_line_bug()s to save a bit
of text space.
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

fd78f117

[PATCH] x86_64: Enable sound in old style OSS driver for NForce4 CK804 · 3d831d92

由 Andi Kleen 提交于 1月 11, 2006

Just add the missing PCI ID.

Cc: perex@suse.cz
Cc: tiwai@suse.de
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

3d831d92

[PATCH] x86_64: Make it clear in machine checks that it's an hardware problem · 4855170f

由 Andi Kleen 提交于 1月 11, 2006

Hopefully the users will take the hint.
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

4855170f

[PATCH] x86_64: Clean up copy_*_user · 2cbc9ee3

由 Andi Kleen 提交于 1月 11, 2006

- Remove optimization for old B stepping Opteron
- Make the fast path for copies with a multiple of eight length faster.
- Minor instruction rearrangement to hopefully avoid a pipeline
stall or two.
- Add comment about errata to consider.
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

2cbc9ee3

[PATCH] x86_64: Use function pointers to call DMA mapping functions · 17a941d8

由 Muli Ben-Yehuda 提交于 1月 11, 2006

AK: I hacked Muli's original patch a lot and there were a lot
of changes - all bugs are probably to blame on me now.
There were also some changes in the fall back behaviour
for swiotlb - in particular it doesn't try to use GFP_DMA
now anymore. Also all DMA mapping operations use the
same core dma_alloc_coherent code with proper fallbacks now.
And various other changes and cleanups.

Known problems: iommu=force swiotlb=force together breaks
                needs more testing.

This patch cleans up x86_64's DMA mapping dispatching code. Right now
we have three possible IOMMU types: AGP GART, swiotlb and nommu, and
in the future we will also have Xen's x86_64 swiotlb and other HW
IOMMUs for x86_64. In order to support all of them cleanly, this
patch:

- introduces a struct dma_mapping_ops with function pointers for each
  of the DMA mapping operations of gart (AMD HW IOMMU), swiotlb
  (software IOMMU) and nommu (no IOMMU).

- gets rid of:

  if (swiotlb)
      return swiotlb_xxx();

- PCI_DMA_BUS_IS_PHYS is now checked against the dma_ops being set
This makes swiotlb faster by avoiding double copying in some cases.
Signed-Off-By: NMuli Ben-Yehuda <mulix@mulix.org>
Signed-Off-By: NJon D. Mason <jdmason@us.ibm.com>
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

17a941d8

[PATCH] x86_64: Reject SRAT tables that don't cover all memory · 8a6fdd3e

由 Andi Kleen 提交于 1月 11, 2006

Broken BIOS on Iwill 8way systems reports these and it causes the bootmem
allocator to crash. Add a sanity check if all the PXMs in the
SRAT table cover all memory as reported by e820. If the sanity
check fails the SRAT is rejected and the code will fall back
to discover the NUMA topology using the K8 northbridge registers
when applicable.
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

8a6fdd3e

[PATCH] x86_64: Add idle notifiers · 95833c83

由 Andi Kleen 提交于 1月 11, 2006

This adds a new notifier chain that is called with IDLE_START
when a CPU goes idle and IDLE_END when it goes out of idle.
The context can be idle thread or interrupt context.

Since we cannot rely on MONITOR/MWAIT existing the idle
end check currently has to be done in all interrupt
handlers.

They were originally inspired by the similar s390 implementation.

They have a variety of applications:
- They will be needed for CONFIG_NO_IDLE_HZ
- They can be used for oprofile to fix up the missing time
in idle when performance counters don't tick.
- They can be used for better C state management in ACPI
- They could be used for microstate accounting.

This is just infrastructure so far, no users.
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

95833c83

A
[PATCH] x86_64: Clean up some printks in NUMA code · 6b050f80
由 Andi Kleen 提交于 1月 11, 2006
```
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>
```
6b050f80

[PATCH] x86_64: Fix up coding style in numa.c · d18ff470

由 Andi Kleen 提交于 1月 11, 2006

No functional changes
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

d18ff470

[PATCH] x86_64: Fix off by one in IOMMU check · ca8642f6

由 Andi Kleen 提交于 1月 11, 2006

Fix off by one when checking if the machine has enougn memory to need IOMMU
This caused the IOMMUs to be needlessly enabled for mem=4G

Based on a patch from Jon Mason

Signed-off-by: jdmason@us.ibm.com
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

ca8642f6

[PATCH] x86_64: Handle missing local APIC timer interrupts on C3 state · d25bf7e5

由 Venkatesh Pallipadi 提交于 1月 11, 2006

Whenever we see that a CPU is capable of C3 (during ACPI cstate init), we
disable local APIC timer and switch to using a broadcast from external timer
interrupt (IRQ 0).

Patch below adds the code for x86_64.
Signed-off-by: NVenkatesh Pallipadi <venkatesh.pallipadi@intel.com>
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

d25bf7e5

[PATCH] i386: Handle missing local APIC timer interrupts on C3 state · 6eb0a0fd

由 Venkatesh Pallipadi 提交于 1月 11, 2006

Whenever we see that a CPU is capable of C3 (during ACPI cstate init), we
disable local APIC timer and switch to using a broadcast from external timer
interrupt (IRQ 0). This is needed because Intel CPUs stop the local
APIC timer in C3. This is currently only enabled for Intel CPUs.

Patch below adds the code for i386 and also the ACPI hunk.
Signed-off-by: NVenkatesh Pallipadi <venkatesh.pallipadi@intel.com>
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

6eb0a0fd

[PATCH] i386/x86-64: Remove sub jiffy profile timer support · 5a07a30c

由 Venkatesh Pallipadi 提交于 1月 11, 2006

Remove the finer control of local APIC timer. We cannot provide a sub-jiffy
control like this when we use broadcast from external timer in place of
local APIC. Instead of removing this only on systems that may end up using
broadcast from external timer (due to C3), I am going the
"I'm feeling lucky" way to remove this fully. Basically, I am not sure about
usefulness of this code today. Few other architectures also don't seem to
support this today.

If you are using profiling and fine grained control and don't like this going
away in normal case, yell at me right now.
Signed-off-by: NVenkatesh Pallipadi <venkatesh.pallipadi@intel.com>
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

5a07a30c

[PATCH] x86_64: Report hardware breakpoints in user space when triggered by the kernel · 01b8faae

由 John Blackwood 提交于 1月 11, 2006

I would like to throw out a suggestion for a possible change in the way that
the debug register traps are handled in do_debug() when the trap occurs
in kernel-mode.

In the x86_64 version of do_debug(), the code will skip around sending
a SIGTRAP to the current task if the trap occurred while in kernel mode.

On the i386-side of things, if the access happens to occur in kernel mode
(say during a read(2) of user's buffer that matches the address of a
debug register trap), then the do_debug() routine for i386 will go ahead
and call send_sigtrap() and send the SIGTRAP signal. The send_sigtrap()
code will also set the info.si_addr to NULL in this case (even though I
don't understand why, since the SIGTRAP siginfo processing doesn't use
the si_addr field...).

So I would like to suggest that the x86_64 do_debug() routine also
follow this type of behavior and have it go ahead and send the
SIGTRAP signal to the current task, even if the debug register trap
happens to have occurred in kernel mode. I have taken a stab at
a patch for this change below. (It includes the i386-ish change
for setting si_addr to NULL when the trap occurred in kernel mode.)

It seems like a useful feature to be able to 'watch' a user location that
might also be modified in the kernel via a system service call, and have the
debugger report that information back to the user, rather than to just
silently ignore the trap.

Additionally, I realize that users that pull in a kernel debugger such as
KGDB into their kernel might want to remove this change below when they add
in KGDB support. However, they could alternatively look at the current
task's thread.debugreg[] values to see if the trap occurred due to KGDB
or instead because of a user-space debugger trap, and still honor the
user SIGTRAP processing (instead of the KGDB breakpoint processing)
if the trap matches up with the thread.debugreg[] registers.
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

01b8faae

[PATCH] x86_64: "extern inline" -> "static inline" in pgtable.h · 4839057c

由 Adrian Bunk 提交于 1月 11, 2006

Signed-off-by: NAdrian Bunk <bunk@stusta.de>
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

4839057c

[PATCH] x86_64: Convert page fault error codes to symbolic constants. · 66c58156

由 Andi Kleen 提交于 1月 11, 2006

Much better to deal with these than with the magic numbers.

And remove the comment describing the bits - kernel source
is no replacement for an architecture manual.
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

66c58156

[PATCH] x86_64: Implement is_compat_task the right way · bf2fcc6f

由 Andi Kleen 提交于 1月 11, 2006

By setting a flag during a 32bit system call only
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

bf2fcc6f

[PATCH] x86_64: Implement compat code for sg driver SG_GET_REQUEST_TABLE ioctl · 2966387b

由 Andi Kleen 提交于 1月 11, 2006

Apparently helps with some non SANE scanner drivers.

Cc: axboe@suse.de
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

2966387b

[PATCH] x86_64: Remove unnecessary case from the page fault handler · f95190b2

由 Andi Kleen 提交于 1月 11, 2006

Don't need to do the vmalloc check for the module range because its
PML4 is shared with the kernel text.

Also removed an unnecessary TLB flush.

Pointed out by Jan Beulich

Cc: jbeulich@novell.com
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

f95190b2

[PATCH] x86_64: Align and pad x86_64 GDT on page boundary · c11efdf9

由 Ravikiran G Thirumalai 提交于 1月 11, 2006

This patch is on the same lines as Zachary Amsden's i386 GDT page alignemnt
patch in -mm, but for x86_64.

Patch to align and pad x86_64 GDT on page boundries.

[AK: some minor cleanups and fixed incorrect TLS initialization
in CPU init.]
Signed-off-by: NNippun Goel <nippung@calsoftinc.com>
Signed-off-by: NRavikiran Thirumalai <kiran@scalex86.org>
Signed-off-by: NShai Fultheim <shai@scalex86.org>
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

c11efdf9

[PATCH] x86_64: Allow compilation on a 32bit biarch toolchain · bb33421d

由 Andi Kleen 提交于 1月 11, 2006

This might help on distributions that use a 32bit biarch compiler.

First pass -m64 by default.

Secondly add some more .code32s because at least the Ubuntu biarch
32bit as called by gcc doesn't seem to handle -m64 -m32 as generated
by the Makefile without such assistance.

And finally make sure the linker script can be preprocessed
with a 32bit cpp.
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

bb33421d

[PATCH] x86_64: Make udelay more accurate · 79c62cf1

由 Ross Biro 提交于 1月 11, 2006

The attempt to avoid overflow in __delay caused varying precision
on different CPUs depending on differences in the CPU speed.

We should be able to do this multiplication with out overflowing
provided the
cpu is running at less than about 128 GHz.  xloops < 20000 * 0x10c6.
loops_per_jiffy * HZ <= cpu_clock_speed.  So if the cpu clock speed
< 2^64/(20000 * 0x10c6) = 2^64/ 51E6CC0 < 2^64/2^27 = 2^37 = 128G we
will not overflow the calculation.
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

79c62cf1

[PATCH] x86_64: Return -1 for unknown PCI bus affinity · e4e94072

由 Andi Kleen 提交于 1月 11, 2006

When we don't know the node a PCI bus is connected to return -1.
This matches the generic code.

Noticed by Ravikiran G Thirumalai <kiran@scalex86.org>

Cc: Ravikiran G Thirumalai <kiran@scalex86.org>
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

e4e94072

[PATCH] x86_64: Handle unknown node (-1) in alloc_pages_node · 819a6928

由 Andi Kleen 提交于 1月 11, 2006

Following kmalloc_node.

Needed for another patch to return -1 for unknown nodes in x86-64.

Cc: Christoph Lameter <clameter@engr.sgi.com>
Cc: kiran@scalex86.org
Signed-off-by: NAndi Kleen <ak@suse.de>
[ Changed 0 to numa_node_id() on suggestion by Christoph Lameter ]
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

819a6928

[PATCH] x86_64: Validate SLIT table · 1584b89c

由 Andi Kleen 提交于 1月 11, 2006

A lot of Opteron BIOS just pass 10 in all SLIT entries (10 is the
normalized unit). This is actually worse than the default heuristic
because it leads to pci_distance not knowing the difference between
local and remote nodes anymore. This messes up some NUMA
heuristics in generic code.

In this case it's better to fall back to the default heuristic
which just does nodea == nodeb ? 10 : 20.

This patch does some basic sanity checking on the SLIT and only accepts
the SLIT when it passes.

Invariants enforced are:
- Node to itself shall be 10
- Any other distance shouldn't be 10
- Distances smaller than 10 are illegal
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

1584b89c

[PATCH] x86_64: Fix off by one in acpi table mapping · 7a4a76cc

由 Andi Kleen 提交于 1月 11, 2006

And fix the test to include the size

Noticed by Vivek Goyal
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

7a4a76cc

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功