Commits · 4f0234f4f9da485ecb9729af1b88567700fd4767 · openeuler / Kernel

16 Jul, 2007 8 commits

[SPARC64]: Initial LDOM cpu hotplug support. · 4f0234f4

Authored by David S. Miller on Jul 13, 2007

Only adding cpus is supported at the moment, removal
will come next.

When new cpus are configured, the machine description is
updated.  When we get the configure request we pass in a
cpu mask of to-be-added cpus to the mdesc CPU node parser
so it only fetches information for those cpus.  That code
also proceeds to update the SMT/multi-core scheduling bitmaps.

cpu_up() does all the work and we return the status back
over the DS channel.

CPUs via dr-cpu need to be booted straight out of the
hypervisor, and this requires:

1) A new trampoline mechanism.  CPUs are booted straight
   out of the hypervisor with MMU disabled and running in
   physical addresses with no mappings installed in the TLB.

   The new hvtramp.S code sets up the critical cpu state,
   installs the locked TLB mappings for the kernel, and
   turns the MMU on.  It then proceeds to follow the logic
   of the existing trampoline.S SMP cpu bringup code.

2) All calls into OBP have to be disallowed when domaining
   is enabled.  Since cpus boot straight into the kernel from
   the hypervisor, OBP has no state about that cpu and therefore
   cannot handle being invoked on that cpu.

   Luckily it's only a handful of interfaces which can be called
   after the OBP device tree is obtained.  For example, rebooting,
   halting, powering-off, and setting options node variables.

CPU removal support will require some infrastructure changes
here.  Namely we'll have to process the requests via a true
kernel thread instead of in a workqueue.  workqueues run on
a per-cpu thread, but when unconfiguring we might need to
force the thread to execute on another cpu if the current cpu
is the one being removed.  Removal of a cpu also causes the kernel
to destroy that cpu's workqueue running thread.

Another issue on removal is that we may have interrupts still
pointing to the cpu-to-be-removed.  So new code will be needed
to walk the active INO list and retarget those interrupts as needed.
Signed-off-by: David S. Miller <davem@davemloft.net>

4f0234f4

[SPARC64]: Fix setting of variables in LDOM guest. · b3e13fbe

Authored by David S. Miller on Jul 12, 2007

There is a special domain services capability for setting
variables in the OBP options node.  Guests don't have permanent
store for the OBP variables like a normal system, so they are
instead maintained in the LDOM control node or in the SC.
Signed-off-by: David S. Miller <davem@davemloft.net>

b3e13fbe

[SPARC64]: Fix MD property lifetime bugs. · 83292e0a

Authored by David S. Miller on Jul 12, 2007

Property values cannot be referenced outside of
mdesc_grab()/mdesc_release() pairs.  The only major
offender was the VIO bus layer, easily fixed.

Add some commentary to mdesc.h describing these rules.
Signed-off-by: David S. Miller <davem@davemloft.net>

83292e0a

[SPARC64]: Abstract out mdesc accesses for better MD update handling. · 43fdf274

Authored by David S. Miller on Jul 12, 2007

Since we have to be able to handle MD updates, having an in-tree
set of data structures representing the MD objects actually makes
things more painful.

The MD itself is easy to parse, and we can implement the existing
interfaces using direct parsing of the MD binary image.

The MD is now reference counted, so accesses have to now take the
form:

	handle = mdesc_grab();

	... operations on MD ...

	mdesc_release(handle);

The only remaining issues are cases where code holds on to references
to MD property values.  mdesc_get_property() returns a direct pointer
to the property value.  Most cases just pull in the information they
need and discard the pointer, but there are a few that use the pointer
directly over a long lifetime.  Those will be fixed up in a subsequent
changeset.

A preliminary handler for MD update events from domain services is
there.  It is rudimentary, but it works and handles all of the reference
counting.  It does not check the generation number of the MDs,
and it does not generate an "add/delete" list for notification to
interested parties about MD changes, but that will be forthcoming.
Signed-off-by: David S. Miller <davem@davemloft.net>

43fdf274
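
The grab/release discipline described in the two mdesc commits above can be pictured with a small user-space sketch. This is not the kernel's implementation: `struct md_handle`, `md_grab()`, `md_release()`, `md_update()` and `md_copy_name()` are hypothetical names, the "property" is a single string, and locking is left out. It only illustrates why a property value must be copied out before the handle is released.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in for a reference-counted machine description. */
struct md_handle {
    int refcnt;        /* holders, including the "current MD" pointer */
    char name[16];     /* one property value stored inside the image */
};

static struct md_handle *cur_md;   /* currently published MD */

static struct md_handle *md_grab(void)
{
    assert(cur_md != NULL);
    cur_md->refcnt++;
    return cur_md;
}

static void md_release(struct md_handle *hp)
{
    assert(hp->refcnt > 0);
    if (--hp->refcnt == 0)
        free(hp);      /* property pointers into hp are now invalid */
}

/* Publish a new MD; the old one lives on until its last holder releases. */
static void md_update(const char *name)
{
    struct md_handle *old = cur_md;
    struct md_handle *hp = calloc(1, sizeof(*hp));

    assert(hp != NULL);
    hp->refcnt = 1;    /* reference owned by cur_md */
    strncpy(hp->name, name, sizeof(hp->name) - 1);
    cur_md = hp;
    if (old)
        md_release(old);
}

/* Safe pattern: copy the value while the MD is held, then release. */
static void md_copy_name(char *dst, size_t len)
{
    struct md_handle *hp = md_grab();

    strncpy(dst, hp->name, len - 1);
    dst[len - 1] = '\0';
    md_release(hp);
}
```

A caller that keeps `hp->name` after `md_release()` may end up pointing into freed memory once an update arrives, which is the lifetime bug class the commits describe.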

[SPARC64]: Use more meaningful names for IRQ registry. · 133f09a1

Authored by David S. Miller on Jul 11, 2007

All of the interrupts say "LDX RX" and "LDX TX" currently
which is next to useless.  Put a device specific prefix
before "RX" and "TX" instead which makes it much more
useful.
Signed-off-by: David S. Miller <davem@davemloft.net>

133f09a1

[SPARC64]: Export powerd facilities for external entities. · 13077d80

Authored by David S. Miller on Jul 11, 2007

Besides the existing usage for power-button interrupts, we'll
want to make use of this code for domain-services where the
LDOM manager can send reboot requests to the guest node.
Signed-off-by: David S. Miller <davem@davemloft.net>

13077d80

[SPARC64]: Assorted LDC bug cures. · cb481235

Authored by David S. Miller on Jul 11, 2007

1) LDC_MODE_RELIABLE is deprecated and unused by anything, plus
   it and LDC_MODE_STREAM were mis-numbered.

2) read_stream() should try to read as much as possible into
   the per-LDC stream buffer area, so do not trim the read_nonraw()
   length by the caller's size parameter.

3) Send data ACKs when necessary in read_nonraw().

4) In read_nonraw() when we get a pure ACK, advance the RX head
   unconditionally past it.

5) Provide the ACKID field in the ldcdbg() packet dump in read_nonraw().
   This helps debugging stream mode LDC channel problems.

6) Decrease verbosity of rx_data_wait() so that it is more useful.
   A debugging message each loop iteration is too much.

7) In process_data_ack() stop the loop checking when we hit lp->tx_tail
   not lp->tx_head.

8) Set the seqid field properly in send_data_nack().
Signed-off-by: David S. Miller <davem@davemloft.net>

cb481235
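
Item 7 of the list above concerns where a walk over the transmit ring must stop when matching an ACK. The following is a minimal sketch under stated assumptions, not the LDC driver code: `struct tx_ring`, `ring_queue()` and `ring_process_ack()` are hypothetical, and the ring simply tracks the first unacknowledged slot (`acked`) and the next free slot (`tail`). The walk ends at `tail`, because nothing at or beyond it has been queued.

```c
#include <assert.h>

#define RING_ENTRIES 8u   /* hypothetical ring size, must be a power of two */

struct tx_ring {
    unsigned int seqid[RING_ENTRIES]; /* sequence id of each queued packet */
    unsigned int acked;               /* first slot not yet acknowledged */
    unsigned int tail;                /* next free slot */
};

static unsigned int ring_next(unsigned int idx)
{
    return (idx + 1) & (RING_ENTRIES - 1);
}

static void ring_queue(struct tx_ring *r, unsigned int seqid)
{
    assert(ring_next(r->tail) != r->acked);   /* ring must not be full */
    r->seqid[r->tail] = seqid;
    r->tail = ring_next(r->tail);
}

/*
 * Acknowledge everything up to and including the packet whose sequence
 * id equals ackid. Returns 0 on success, -1 if no queued packet matches.
 */
static int ring_process_ack(struct tx_ring *r, unsigned int ackid)
{
    unsigned int idx = r->acked;

    while (idx != r->tail) {          /* stop at the tail */
        unsigned int cur = idx;

        idx = ring_next(idx);
        if (r->seqid[cur] == ackid) {
            r->acked = idx;
            return 0;
        }
    }
    return -1;
}
```

Stopping at any earlier marker would miss packets that were queued but not yet consumed, so an ACK for one of them would be treated as an error.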

[SPARC64]: Add LDOM virtual channel driver and VIO device layer. · e53e97ce

Authored by David S. Miller on Jul 09, 2007

Virtual devices on Sun Logical Domains are built on top
of a virtual channel framework.  This, with help of hypervisor
interfaces, provides a link layer protocol with basic
handshaking over which virtual device clients and servers
communicate.

Built on top of this is a VIO device protocol which has its
own handshaking and message types.  At this layer attributes
are exchanged (disk size, network device addresses, etc.),
descriptor rings are registered, and data transfers are
triggered and replied to.
Signed-off-by: David S. Miller <davem@davemloft.net>

e53e97ce

12 Jul, 2007 2 commits

PCI: remove pci_dac_dma_... APIs · caa51716

Authored by Jan Beulich on Jul 09, 2007

Based on replies to a respective query, remove the pci_dac_dma_...() APIs
(except for pci_dac_dma_supported() on Alpha, where this function is used
in non-DAC PCI DMA code).
Signed-off-by: Jan Beulich <jbeulich@novell.com>
Cc: Andi Kleen <ak@suse.de>
Cc: Jesse Barnes <jesse.barnes@intel.com>
Cc: Christoph Hellwig <hch@infradead.org>
Acked-by: David Miller <davem@davemloft.net>
Cc: Jeff Garzik <jeff@garzik.org>
Cc: <linux-arch@vger.kernel.org>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Greg Kroah-Hartman <gregkh@suse.de>

caa51716

PCI: Use a weak symbol for the empty version of pcibios_add_platform_entries() · 575e3348

Authored by Michael Ellerman on May 08, 2007

I'm not sure if this is going to fly.  Weak symbols work on the compilers I'm
using, but whether they work for all of the affected architectures I can't say.
I've cc'ed as many arch maintainers/lists as I could find.

But assuming they do, we can use a weak empty definition of
pcibios_add_platform_entries() to avoid having an empty definition on every
arch.
Signed-off-by: Michael Ellerman <michael@ellerman.id.au>
Signed-off-by: Greg Kroah-Hartman <gregkh@suse.de>

575e3348
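
The weak-definition idea in the commit above can be sketched in a few lines. The names here are hypothetical (`platform_add_entries()` stands in for the per-arch hook, `generic_register_device()` for the generic caller), and `__attribute__((weak))` is the GCC/Clang spelling, so this assumes one of those compilers.

```c
#include <assert.h>

/*
 * Weak default: does nothing and reports success. An architecture that
 * needs real work supplies a normal (strong) definition with the same
 * name in another file, and the linker picks that one instead.
 */
__attribute__((weak)) int platform_add_entries(int dev_id)
{
    (void)dev_id;
    return 0;
}

/* Generic code calls the hook unconditionally, with no #ifdef per arch. */
static int generic_register_device(int dev_id)
{
    assert(dev_id >= 0);
    return platform_add_entries(dev_id);
}
```

With only the weak definition linked in, the generic path sees a successful no-op, which is the behaviour every architecture previously had to spell out with its own empty function.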

29 Jun, 2007 1 commit

[SPARC64]: Add linux/pagemap.h to asm/tlb.h · 9f462a1a

Authored by Alexey Dobriyan on Jun 28, 2007

As seen on sparc64-allnoconfig:

  CC      arch/sparc64/mm/tlb.o
In file included from arch/sparc64/mm/tlb.c:19:
include/asm/tlb.h: In function 'tlb_flush_mmu':
include/asm/tlb.h:60: warning: implicit declaration of function 'release_pages'
include/asm/tlb.h: In function 'tlb_remove_page':
include/asm/tlb.h:92: warning: implicit declaration of function 'page_cache_release'
Signed-off-by: Alexey Dobriyan <adobriyan@sw.ru>
Signed-off-by: David S. Miller <davem@davemloft.net>

9f462a1a

26 Jun, 2007 1 commit

[SPARC64]: Add irqs to mdesc_node. · 701271df

Authored by David S. Miller on Jun 26, 2007

Will be used to store translated LDC rx-ino and tx-ino.
Signed-off-by: David S. Miller <davem@davemloft.net>

701271df

13 Jun, 2007 3 commits

[SPARC64]: Fix args to sun4v_ldc_revoke(). · fc395f8d

Authored by David S. Miller on Jun 12, 2007

First argument is LDC channel ID, then mapping cookie,
then the MTE revoke cookie.
Signed-off-by: David S. Miller <davem@davemloft.net>

fc395f8d

[SPARC64]: Really fix parport. · f467b998

Authored by David S. Miller on Jun 12, 2007

We were passing a "struct pci_dev *" instead of a
"struct device *" to the parport registry routines.
No wonder things exploded.

The ebus_bus_type hacks can be backed out from
asm-sparc64/dma-mapping.h, those were wrong.
Signed-off-by: David S. Miller <davem@davemloft.net>

f467b998

[SPARC64]: Wire up cookie based sun4v interrupt registry. · 4a907dec

Authored by David S. Miller on Jun 13, 2007

This will be used for logical domain channel interrupts.
Signed-off-by: David S. Miller <davem@davemloft.net>

4a907dec

05 Jun, 2007 4 commits

[SPARC64]: Fill in gaps in non-PCI dma_*() NOP implementation. · f04dbac2

Authored by David S. Miller on Jun 04, 2007

Signed-off-by: David S. Miller <davem@davemloft.net>

f04dbac2

[SPARC64]: Fix {mc,smt}_capable(). · a2f9f6bb

Authored by David S. Miller on Jun 04, 2007

It's not just sun4v hypervisor platforms that should return true
for this, sun4u with UltraSPARC-IV should return true too.
Signed-off-by: David S. Miller <davem@davemloft.net>

a2f9f6bb

[SPARC64]: Proper multi-core scheduling support. · f78eae2e

Authored by David S. Miller on Jun 04, 2007

The scheduling domain hierarchy is:

   all cpus -->
      cpus that share an instruction cache -->
          cpus that share an integer execution unit
Signed-off-by: David S. Miller <davem@davemloft.net>

f78eae2e

[SPARC64]: Provide mmu statistics via sysfs. · d887ab3a

Authored by David Miller on Jun 03, 2007

If the system supports hypervisor based statistics, allow them to
be fetched, enabled, and disabled via sysfs.

Enable and disable via the boolean:

/sys/devices/system/cpu/cpuN/mmustat_enable

Statistic values are provided under:

/sys/devices/system/cpu/cpuN/mmu_stats/
Signed-off-by: David S. Miller <davem@davemloft.net>

d887ab3a

31 May, 2007 1 commit

[SPARC64]: Add missing NCS and SVC hypervisor interfaces. · dbbe3cb8

Authored by David S. Miller on May 30, 2007

Signed-off-by: David S. Miller <davem@davemloft.net>

dbbe3cb8

29 May, 2007 6 commits

[SPARC64]: Fill holes in hypervisor APIs and fix KTSB registry. · 7db35f31

Authored by David S. Miller on May 29, 2007

Several interfaces were missing and others misnumbered or
improperly documented.

Also, make sure to check the return value when registering
the kernel TSBs with the hypervisor.  This helped to find
the 4MB kernel TSB alignment bug fixed in a previous changeset.
Signed-off-by: David S. Miller <davem@davemloft.net>

7db35f31

[SPARC64]: Fix two bugs wrt. kernel 4MB TSB. · 2d9e2763

Authored by David S. Miller on May 29, 2007

1) The TSB lookup was not using the correct hash mask.

2) It was not aligned on a boundary equal to its size,
   which is required by the sun4v Hypervisor.

The kernel TSB registration with the hypervisor wasn't having its return
value checked, and that bug will be fixed up as well in a subsequent changeset.
Signed-off-by: David S. Miller <davem@davemloft.net>

2d9e2763
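
Both bugs in the commit above come down to power-of-two arithmetic, which a short sketch can make concrete. The helpers and the example numbers are illustrative assumptions, not the kernel's TSB code or its actual sizes: `aligned_to_size()` checks that a table sits on a boundary equal to its own size, and `table_index()` shows a lookup whose mask matches the number of entries.

```c
#include <assert.h>
#include <stdint.h>

/* Nonzero when addr sits on a boundary equal to size (a power of two). */
static int aligned_to_size(uint64_t addr, uint64_t size)
{
    assert(size != 0 && (size & (size - 1)) == 0);
    return (addr & (size - 1)) == 0;
}

/* Round addr up to the next size boundary (size is a power of two). */
static uint64_t align_up(uint64_t addr, uint64_t size)
{
    assert(size != 0 && (size & (size - 1)) == 0);
    return (addr + size - 1) & ~(size - 1);
}

/*
 * Slot for vaddr in a table of nentries slots (a power of two), where
 * each slot covers 1 << page_shift bytes. The mask must be derived from
 * the size of this table; reusing the mask of a different table would
 * select the wrong slot.
 */
static uint64_t table_index(uint64_t vaddr, unsigned int page_shift,
                            uint64_t nentries)
{
    assert(nentries != 0 && (nentries & (nentries - 1)) == 0);
    return (vaddr >> page_shift) & (nentries - 1);
}
```

For example, with a hypothetical 32KB table, an address of 0x404000 fails the alignment check and `align_up()` moves it to 0x408000.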

[SPARC64]: Eliminate NR_CPUS limitations. · 22adb358

Authored by David S. Miller on May 26, 2007

Cheetah systems can have cpuids as large as 1023, although physical
systems don't have that many cpus.

Only three limitations existed in the kernel preventing arbitrary
NR_CPUS values:

1) dcache dirty cpu state stored in page->flags on
   D-cache aliasing platforms.  With some build time
   calculations and some build-time BUG checks on
   page->flags layout, this one was easily solved.

2) The cheetah XCALL delivery code could only handle
   a cpumask with up to 32 cpus set.  Some simple looping
   logic clears that up too.

3) thread_info->cpu was a u8, easily changed to a u16.

There are a few spots in the kernel that still put NR_CPUS
sized arrays on the kernel stack, but that's not a sparc64
specific problem.
Signed-off-by: David S. Miller <davem@davemloft.net>

22adb358
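
The "simple looping logic" mentioned in item 2 of the commit above can be illustrated with a generic batching loop. This is a sketch under assumptions, not the cheetah cross-call code: `struct cpu_mask`, `dispatch_batch()` and `deliver_all()` are hypothetical, `MAX_CPUS` is an arbitrary value, and the only fact taken from the text is the limit of 32 targets per delivery.

```c
#include <assert.h>
#include <stdint.h>

#define MAX_CPUS   1024   /* hypothetical NR_CPUS */
#define BATCH_MAX  32     /* targets a single dispatch can handle */
#define MASK_WORDS (MAX_CPUS / 64)

struct cpu_mask { uint64_t bits[MASK_WORDS]; };

static void mask_set(struct cpu_mask *m, unsigned int cpu)
{
    assert(cpu < MAX_CPUS);
    m->bits[cpu / 64] |= (uint64_t)1 << (cpu % 64);
}

/* Stand-in for one hardware dispatch; only counts the cpus it was given. */
static unsigned int dispatched_total;

static void dispatch_batch(const unsigned int *cpus, unsigned int n)
{
    (void)cpus;
    assert(n > 0 && n <= BATCH_MAX);
    dispatched_total += n;
}

/* Deliver to every cpu in the mask, BATCH_MAX at a time. Returns batches. */
static unsigned int deliver_all(const struct cpu_mask *m)
{
    unsigned int batch[BATCH_MAX];
    unsigned int n = 0, batches = 0;

    for (unsigned int cpu = 0; cpu < MAX_CPUS; cpu++) {
        if (!(m->bits[cpu / 64] & ((uint64_t)1 << (cpu % 64))))
            continue;
        batch[n++] = cpu;
        if (n == BATCH_MAX) {         /* batch full: flush and start over */
            dispatch_batch(batch, n);
            batches++;
            n = 0;
        }
    }
    if (n > 0) {                      /* flush the final partial batch */
        dispatch_batch(batch, n);
        batches++;
    }
    return batches;
}
```

A mask with 70 cpus set is delivered as two full batches of 32 and one batch of 6, so the per-dispatch limit no longer caps the size of the mask.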

[SPARC64]: Use machine description and OBP properly for cpu probing. · 5cbc3073

Authored by David S. Miller on May 25, 2007

Signed-off-by: David S. Miller <davem@davemloft.net>

5cbc3073

[SPARC64]: Report proper system soft state to the hypervisor. · 22d6a1cb

Authored by David S. Miller on May 25, 2007

Signed-off-by: David S. Miller <davem@davemloft.net>

22d6a1cb

[SPARC64]: Kill unused DIE_PAGE_FAULT enum value. · a1aadd55

Authored by Christoph Hellwig on May 23, 2007

sparc64 got rid of the pagefault notifiers, so the enum value for them
can go away as well.
Signed-off-by: Christoph Hellwig <hch@lst.de>
Signed-off-by: David S. Miller <davem@davemloft.net>

a1aadd55

16 May, 2007 1 commit

[SPARC64]: Add hypervisor API negotiation and fix console bugs. · c7754d46

Authored by David S. Miller on May 15, 2007

Hypervisor interfaces need to be negotiated in order to use
some API calls reliably.  So add a small set of interfaces
to request API versions and query current settings.

This allows us to fix some bugs in the hypervisor console:

1) If we can negotiate API group CORE of at least major 1
   minor 1 we can use con_read and con_write which can improve
   console performance quite a bit.

2) When we do a console write request, we should hold the
   spinlock around the whole request, not a byte at a time.
   What would happen is that it's easy for output from
   different cpus to get mixed with each other.

3) Use consistent udelay() based polling, udelay(1) each
   loop with a limit of 1000 polls to handle stuck hypervisor
   console.
Signed-off-by: David S. Miller <davem@davemloft.net>

c7754d46
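
Items 2 and 3 of the commit above (one lock hold per write request, bounded polling) can be sketched in user space. Everything here is an assumption made for illustration: `con_write_locked()`, the `putc_fn` hook and the two fake devices are hypothetical, a C11 `atomic_flag` plays the role of the spinlock, and `delay_one_us()` is an empty stand-in for `udelay(1)`. Only the limit of 1000 polls comes from the text.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>

#define POLL_LIMIT 1000   /* limit of 1000 polls, as in the text */

static atomic_flag con_lock = ATOMIC_FLAG_INIT;

/* Hypothetical device hook: 0 if the byte was accepted, -1 if busy. */
typedef int (*putc_fn)(char c);

/* Stand-in for udelay(1); a real driver would wait one microsecond. */
static void delay_one_us(void) { }

/* Fake device that never accepts a byte. */
static int stuck_calls;
static int stuck_putc(char c) { (void)c; stuck_calls++; return -1; }

/* Fake device that stores bytes in a small buffer. */
static char sink[64];
static size_t sink_len;
static int sink_putc(char c)
{
    if (sink_len >= sizeof(sink))
        return -1;
    sink[sink_len++] = c;
    return 0;
}

/*
 * Write the whole buffer under a single lock hold so output from
 * concurrent writers cannot interleave. Returns the bytes written.
 */
static size_t con_write_locked(putc_fn putc_hook, const char *buf, size_t len)
{
    size_t done = 0;

    assert(putc_hook != NULL && buf != NULL);
    while (atomic_flag_test_and_set_explicit(&con_lock, memory_order_acquire))
        ;                             /* spin until the lock is free */
    while (done < len) {
        int polls = POLL_LIMIT;

        while (putc_hook(buf[done]) != 0 && --polls > 0)
            delay_one_us();
        if (polls == 0)
            break;                    /* device looks stuck; give up */
        done++;
    }
    atomic_flag_clear_explicit(&con_lock, memory_order_release);
    return done;
}
```

Taking the lock once per request, rather than once per byte, is what keeps messages from different cpus from being mixed together, and the poll limit keeps a stuck console from hanging the writer forever.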

14 May, 2007 1 commit

[SPARC64]: Accept ebus_bus_type for generic DMA ops. · 9ac6d4a4

Authored by David S. Miller on May 14, 2007

Based upon a bug report by Meelis Roos.
Signed-off-by: David S. Miller <davem@davemloft.net>

9ac6d4a4

12 May, 2007 1 commit

[SPARC]: Wire up signalfd/timerfd/eventfd syscalls. · 8354c5b7

Authored by David S. Miller on May 11, 2007

Signed-off-by: David S. Miller <davem@davemloft.net>

8354c5b7

11 May, 2007 1 commit

Consolidate asm/poll.h · 04dd08b4

Authored by Stephen Rothwell on May 10, 2007

These files are almost all the same.

This patch could be made even simpler if we don't mind POLLREMOVE turning
up in a few architectures that didn't have it previously (which should be
OK as POLLREMOVE is not used anywhere in the current tree).
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>
Cc: <linux-arch@vger.kernel.org>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>

04dd08b4

10 May, 2007 2 commits

[SPARC64]: Bump PROMINTR_MAX to 32. · 9245df0c

Authored by David S. Miller on May 10, 2007

Some devices have more than 15, which was the previous
setting.
Signed-off-by: David S. Miller <davem@davemloft.net>

9245df0c

Remove hardcoding of hard_smp_processor_id on UP systems · 2f4dfe20

Authored by Fernando Luis Vazquez Cao on May 09, 2007

With the advent of kdump, the assumption that the boot CPU when booting an UP
kernel is always the CPU with a particular hardware ID (often 0) (usually
referred to as BSP on some architectures) is not valid anymore.  The reason
is that the dump capture kernel boots on the crashed CPU (the CPU that
invoked crash_kexec), which may or may not be that particular CPU.

Move definition of hard_smp_processor_id for the UP case to
architecture-specific code ("asm/smp.h") where it belongs, so that each
architecture can provide its own implementation.
Signed-off-by: Fernando Luis Vazquez Cao <fernando@oss.ntt.co.jp>
Cc: "Luck, Tony" <tony.luck@intel.com>
Acked-by: Andi Kleen <ak@suse.de>
Cc: "Eric W. Biederman" <ebiederm@xmission.com>
Cc: Vivek Goyal <vgoyal@in.ibm.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>

2f4dfe20

09 May, 2007 8 commits

[SPARC64]: Optimize fault kprobe handling just like powerpc. · 127cda1e

Authored by David S. Miller on May 08, 2007

And eliminate DIE_GPF while we're at it.
Signed-off-by: David S. Miller <davem@davemloft.net>

127cda1e

[SPARC]: Wire up utimensat syscall. · 6c114260

Authored by David S. Miller on May 08, 2007

Signed-off-by: David S. Miller <davem@davemloft.net>

6c114260

[SPARC64]: Kill asm-sparc64/pbm.h · c57c2ffb

Authored by David S. Miller on May 08, 2007

Everything it contains can be hidden in pci_impl.h
Signed-off-by: David S. Miller <davem@davemloft.net>

c57c2ffb

[SPARC64]: Move index into pci_pbm_info. · 6c108f12

Authored by David S. Miller on May 07, 2007

Signed-off-by: David S. Miller <davem@davemloft.net>

6c108f12

[SPARC64]: Move {setup,teardown}_msi_irq into pci_pbm_info. · e9870c4c

Authored by David S. Miller on May 07, 2007

Signed-off-by: David S. Miller <davem@davemloft.net>

e9870c4c

[SPARC64]: Move pci_ops into pci_pbm_info. · f1cd8de2

Authored by David S. Miller on May 07, 2007

Signed-off-by: David S. Miller <davem@davemloft.net>

f1cd8de2

[SPARC64] PCI: Use root list of pbm's instead of pci_controller_info's · 34768bc8

Authored by David S. Miller on May 07, 2007

The idea is to move more and more things into the pbm,
with the eventual goal of eliminating the pci_controller_info
entirely as there really isn't any need for it.

This stage of the transformations requires some reworking of
the PCI error interrupt handling.

It might be tricky to get rid of the pci_controller_info parenting for
a few reasons:

1) When we get an uncorrectable or correctable error we want
   to interrogate the IOMMU and streaming cache of both
   PBMs for error status.  These errors come from the UPA
   front-end which is shared between the two PBM PCI bus
   segments.

   Historically speaking this is why I chose the data structure
   hierarchy of pci_controller_info-->pci_pbm_info.

2) The probing does a portid/devhandle match to look for the
   'other' pbm, but this is entirely an artifact and can be
   eliminated trivially.

What we could do to solve #1 is to have a "buddy" pointer from one pbm
to another.
Signed-off-by: David S. Miller <davem@davemloft.net>

34768bc8

[SPARC64] PCI: Kill PROM_PCIRNG_MAX and PROM_PCIIMAP_MAX. · 5a4a3e59

Authored by David S. Miller on May 07, 2007

They are totally unused.
Signed-off-by: David S. Miller <davem@davemloft.net>

5a4a3e59

openeuler / Kernel synced successfully more than 1 year ago