提交 · 90603d15fa95605d1d08235b73e220d766f04bb0 · openeuler / Kernel

12 6月, 2009 7 次提交

lguest: use native_set_* macros, which properly handle 64-bit entries when PAE is activated · 90603d15

由 Matias Zabaljauregui 提交于 6月 12, 2009

Some cleanups and replace direct assignment with native_set_* macros which properly handle 64-bit entries when PAE is activated
Signed-off-by: NMatias Zabaljauregui <zabaljauregui@gmail.com>
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

90603d15

lguest: map switcher with executable page table entries · ed1dc778

由 Matias Zabaljauregui 提交于 5月 30, 2009

Map switcher with executable page table entries.
(This bug didn't matter before PAE and hence NX support -- RR)
Signed-off-by: NMatias Zabaljauregui <zabaljauregui@gmail.com>
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

ed1dc778

lguest: Segment selectors are 16-bit long. Fix lg_cpu.ss1 definition. · f086122b

由 Matias Zabaljauregui 提交于 6月 12, 2009

If GDT_ENTRIES were every > 256, this could become a problem.

Signed-off-by: Matias Zabaljauregui <zabaljauregui at gmail.com>
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

f086122b

lguest: beyond ARRAY_SIZE of cpu->arch.gdt · 81b79b01

由 Roel Kluin 提交于 5月 20, 2009

Do not go beyond ARRAY_SIZE of cpu->arch.gdt
Signed-off-by: NRoel Kluin <roel.kluin@gmail.com>
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

81b79b01

lguest: improve interrupt handling, speed up stream networking · a32a8813

由 Rusty Russell 提交于 6月 12, 2009

lguest never checked for pending interrupts when enabling interrupts, and
things still worked.  However, it makes a significant difference to TCP
performance, so it's time we fixed it by introducing a pending_irq flag
and checking it on irq_restore and irq_enable.

These two routines are now too big to patch into the 8/10 bytes
patch space, so we drop that code.

Note: The high latency on interrupt delivery had a very curious
effect: once everything else was optimized, networking without GSO was
faster than networking with GSO, since more interrupts were sent and
hence a greater chance of one getting through to the Guest!

Note2: (Almost) Closing the same loophole for iret doesn't have any
measurable effect, so I'm leaving that patch for the moment.

Before:
	1GB tcpblast Guest->Host:		30.7 seconds
	1GB tcpblast Guest->Host (no GSO):	76.0 seconds

After:
	1GB tcpblast Guest->Host:		6.8 seconds
	1GB tcpblast Guest->Host (no GSO):	27.8 seconds
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

a32a8813

lguest: fix race in halt code · abd41f03

由 Rusty Russell 提交于 6月 12, 2009

When the Guest does the LHCALL_HALT hypercall, we go to sleep, expecting
that a timer or the Waker will wake_up_process() us.

But we do it in a stupid way, leaving a classic missing wakeup race.

So split maybe_do_interrupt() into interrupt_pending() and
try_deliver_interrupt(), and check maybe_do_interrupt() and the
"break_out" flag before calling schedule.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

abd41f03

lguest: fix lguest wake on guest clock tick, or fd activity · a6c372de

由 Rusty Russell 提交于 6月 12, 2009

The Launcher could be inside the Guest on another CPU; wake_up_process
will do nothing because it is "running".  kick_process will knock it
back into our kernel in this case, otherwise we'll miss it until the
next guest exit.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

a6c372de

27 5月, 2009 1 次提交

lguest: fix on Intel when KVM loaded (unhandled trap 13) · 56434622

由 Rusty Russell 提交于 5月 26, 2009

When KVM is loaded, and hence VT set up, the vmcall instruction in an
lguest guest causes a #GP, not #UD.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

56434622

19 4月, 2009 2 次提交

lguest: fix guest crash on non-linear addresses in gdt pvops · a489f0b5

由 Rusty Russell 提交于 4月 19, 2009

Fixes guest crash 'lguest: bad read address 0x4800000 len 256'

The new per-cpu allocator ends up handing a non-linear address to
write_gdt_entry.  We do __pa() on it, and hand it to the host, which
kills us.

I've long wanted to make the hypercall "LOAD_GDT_ENTRY" to match the IDT
code, but had no pressing reason until now.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>
Cc: lguest@ozlabs.org

a489f0b5

lguest: fix crash on vmlinux images · 88df781a

由 Matias Zabaljauregui 提交于 4月 08, 2009

Typical message: 'lguest: unhandled trap 6 at 0x418726 (0x0)'

vmlinux guests were broken by 4cd8b5e2
'lguest: use KVM hypercalls', which rewrites guest text from kvm hypercalls
to trap 31.

The Launcher mmaps the kernel image.  The Guest executes and
immediately faults in the first text page (read-only).  Then it hits a
hypercall, and we rewrite that hypercall, causing a copy-on-write.
But the Guest pagetables still refer to the old page: we fault again,
but as Host we see the hypercall already rewritten, and pass the fault
back to the Guest.  The Guest hasn't set up an IDT yet, so we kill it.

This doesn't happen with bzImages: they unpack themselves and so the
text pages are already read-write.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>
Tested-by: NPatrick McHardy <kaber@trash.net>

88df781a

30 3月, 2009 3 次提交

lguest: use bool instead of int · df1693ab

由 Matias Zabaljauregui 提交于 3月 18, 2009

Impact: clean up

Rusty told me, some time ago, that he had become a fan of "bool".
So, here are some replacements.

Signed-off-by: Matias Zabaljauregui <zabaljauregui at gmail.com>
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

df1693ab

lguest: use KVM hypercalls · 4cd8b5e2

由 Matias Zabaljauregui 提交于 3月 14, 2009

Impact: cleanup

This patch allow us to use KVM hypercalls

Signed-off-by: Matias Zabaljauregui <zabaljauregui at gmail.com>
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

4cd8b5e2

lguest: fix spurious BUG_ON() on invalid guest stack. · 6afbdd05

由 Rusty Russell 提交于 3月 30, 2009

Impact: fix crash on misbehaving guest

gpte_addr() contains a BUG_ON(), insisting that the present flag is
set.  We need to return before we call it if that isn't the case.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>
Cc: stable@kernel.org

6afbdd05

09 3月, 2009 1 次提交

lguest: fix for CONFIG_SPARSE_IRQ=y · 6db6a5f3

由 Rusty Russell 提交于 3月 09, 2009

Impact: remove lots of lguest boot WARN_ON() when CONFIG_SPARSE_IRQ=y

We now need to call irq_to_desc_alloc_cpu() before
set_irq_chip_and_handler_name(), but we can't do that from init_IRQ (no
kmalloc available).

So do it as we use interrupts instead.  Also means we only alloc for
irqs we use, which was the intent of CONFIG_SPARSE_IRQ anyway.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>
Cc: Ingo Molnar <mingo@redhat.com>

6db6a5f3

23 2月, 2009 1 次提交

x86: remove the Voyager 32-bit subarch · 965c7eca

由 Ingo Molnar 提交于 2月 22, 2009

Impact: remove unused/broken code

The Voyager subarch last built successfully on the v2.6.26 kernel
and has been stale since then and does not build on the v2.6.27,
v2.6.28 and v2.6.29-rc5 kernels.

No actual users beyond the maintainer reported this breakage.
Patches were sent and most of the fixes were accepted but the
discussion around how to do a few remaining issues cleanly
fizzled out with no resolution and the code remained broken.

In the v2.6.30 x86 tree development cycle 32-bit subarch support
has been reworked and removed - and the Voyager code, beyond the
build problems already known, needs serious and significant
changes and probably a rewrite to support it.

CONFIG_X86_VOYAGER has been marked BROKEN then. The maintainer has
been notified but no patches have been sent so far to fix it.

While all other subarchs have been converted to the new scheme,
voyager is still broken. We'd prefer to receive patches which
clean up the current situation in a constructive way, but even in
case of removal there is no obstacle to add that support back
after the issues have been sorted out in a mutually acceptable
fashion.

So remove this inactive code for now.
Signed-off-by: NIngo Molnar <mingo@elte.hu>

965c7eca

30 1月, 2009 2 次提交

lguest: Fix a memory leak with the lg object during launcher close · 05dfdbbd

由 Mark Wallis 提交于 1月 26, 2009

Fix a memory leak identified by Rusty Russell during LCA09 by
kfree'ing the lg object instead of just clearing it when the
launcher closes.
Signed-off-by: NMark Wallis <mwallis@serialmonkey.com>
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

05dfdbbd

lguest: typos fix · 72410af9

由 Atsushi SAKAI 提交于 1月 16, 2009

3 points

lguest_asm.S => i386_head.S
LHCALL_BREAK => LHREQ_BREAK
perferred    => preferred
Signed-off-by: NAtsushi SAKAI <sakaia@jp.fujitsu.com>
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

72410af9

07 1月, 2009 1 次提交

lguest: do not statically allocate root device · ff8561c4

由 Mark McLoughlin 提交于 12月 15, 2008

We shouldn't be statically allocating the root device object,
so dynamically allocate it using root_device_register()
instead.
Signed-off-by: NMark McLoughlin <markmc@redhat.com>
Acked-by: NRusty Russell <rusty@rustcorp.com.au>
Signed-off-by: NGreg Kroah-Hartman <gregkh@suse.de>

ff8561c4

30 12月, 2008 4 次提交

lguest: struct device - replace bus_id with dev_name() · bda53cd5

由 Mark McLoughlin 提交于 12月 10, 2008

bus_id is gradually being removed, so use dev_name() instead.
Signed-off-by: NMark McLoughlin <markmc@redhat.com>
Cc: Kay Sievers <kay.sievers@vrfy.org>
Cc: Greg Kroah-Hartman <gregkh@suse.de>
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

bda53cd5

lguest: move the initial guest page table creation code to the host · 58a24566

由 Matias Zabaljauregui 提交于 9月 29, 2008

This patch moves the initial guest page table creation code to the host,
so the launcher keeps working with PAE enabled configs.
Signed-off-by: NMatias Zabaljauregui <zabaljauregui@gmail.com>
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

58a24566

virtio: hand virtio ring alignment as argument to vring_new_virtqueue · 87c7d57c

由 Rusty Russell 提交于 12月 30, 2008

This allows each virtio user to hand in the alignment appropriate to
their virtio_ring structures.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>
Acked-by: NChristian Borntraeger <borntraeger@de.ibm.com>

87c7d57c

virtio: use LGUEST_VRING_ALIGN instead of relying on pagesize · 2966af73

由 Rusty Russell 提交于 12月 30, 2008

This doesn't really matter, since lguest is i386 only at the moment,
but we could actually choose a different value.  (lguest doesn't have
a guarenteed ABI).
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

2966af73

24 12月, 2008 1 次提交

x86: fix lguest used_vectors breakage, -v2 · b77b881f

由 Yinghai Lu 提交于 12月 19, 2008

Impact: fix lguest, clean up

32-bit lguest used used_vectors to record vectors, but that model of
allocating vectors changed and got broken, after we changed vector
allocation to a per_cpu array.

Try enable that for 64bit, and the array is used for all vectors that
are not managed by vector_irq per_cpu array.

Also kill system_vectors[], that is now a duplication of the
used_vectors bitmap.

[ merged in cpus4096 due to io_apic.c cpumask changes. ]
[ -v2, fix build failure ]
Signed-off-by: NYinghai Lu <yinghai@kernel.org>
Signed-off-by: NIngo Molnar <mingo@elte.hu>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

b77b881f

25 8月, 2008 1 次提交
- R
  lguest: update commentry · 1dc3e3bc
  由 Rusty Russell 提交于 8月 26, 2008
```
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>
```
  1dc3e3bc
12 8月, 2008 1 次提交

lguest: use get_user_pages_fast() instead of get_user_pages() · 71a3f4ed

由 Rusty Russell 提交于 8月 12, 2008

Using a simple page table thrashing program I measure a slight
improvement.  The program creates five processes.  Each touches 1000
pages then schedules the next process.  We repeat this 1000 times.  As
lguest only caches 4 cr3 values, this rebuilds a lot of shadow page
tables requiring virt->phys mappings.

	Before: 5.93 seconds
	After: 5.40 seconds

(Counts of slow vs fastpath in this usage are 6092 and 2852462 respectively.)

And more importantly for lguest, the code is simpler.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

71a3f4ed

29 7月, 2008 3 次提交

lguest: use cpu capability accessors · cf485e56

由 Andrew Morton 提交于 6月 09, 2008

To support my little make-x86-bitops-use-proper-typechecking projectlet.

Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: Andrea Arcangeli <andrea@qumranet.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Acked-by: NIngo Molnar <mingo@elte.hu>
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

cf485e56

lguest: fix switcher_page leak on unload · 0a707210

由 Johannes Weiner 提交于 7月 08, 2008

map_switcher allocates the array, unmap_switcher has to free it
accordingly.
Signed-off-by: NJohannes Weiner <hannes@saeurebad.de>
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

0a707210

lguest: Guest int3 fix · 0c12091d

由 Rusty Russell 提交于 7月 29, 2008

Ron Minnich noticed that guest userspace gets a GPF when it tries to int3:
we need to copy the privilege level from the guest-supplied IDT to the real
IDT. int3 is the only common case where guest userspace expects to invoke
an interrupt, so that's the symptom of failing to do this.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

0c12091d

25 7月, 2008 2 次提交

virtio: Add transport feature handling stub for virtio_ring. · e34f8725

由 Rusty Russell 提交于 7月 25, 2008

To prepare for virtio_ring transport feature bits, hook in a call in
all the users to manipulate them.  This currently just clears all the
bits, since it doesn't understand any features.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

e34f8725

virtio: Rename set_features to finalize_features · c624896e

由 Rusty Russell 提交于 7月 25, 2008

Rather than explicitly handing the features to the lower-level, we just
hand the virtio_device and have it set the features. This make it clear
that it has the chance to manipulate the features of the device at this
point (and that all feature negotiation is already done).
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

c624896e

11 7月, 2008 1 次提交
- I
  x86, VisWS: turn into generic arch, eliminate Kconfig specials · 15e551d2
  由 Ingo Molnar 提交于 7月 10, 2008
```
remove leftover traces of various VISWS related Kconfig specials.
Signed-off-by: NIngo Molnar <mingo@elte.hu>
```
  15e551d2
26 6月, 2008 1 次提交

on_each_cpu(): kill unused 'retry' parameter · 15c8b6c1

由 Jens Axboe 提交于 5月 09, 2008

It's not even passed on to smp_call_function() anymore, since that
was removed. So kill it.
Acked-by: NJeremy Fitzhardinge <jeremy.fitzhardinge@citrix.com>
Reviewed-by: NPaul E. McKenney <paulmck@linux.vnet.ibm.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

15c8b6c1

20 6月, 2008 1 次提交

x86: fix NULL pointer deref in __switch_to · 54481cf8

由 Suresh Siddha 提交于 6月 19, 2008

I am able to reproduce the oops reported by Simon in __switch_to() with
lguest.

My debug showed that there is at least one lguest specific
issue (which should be present in 2.6.25 and before aswell) and it got
exposed with a kernel oops with the recent fpu dynamic allocation patches.

In addition to the previous possible scenario (with fpu_counter), in the
presence of lguest, it is possible that the cpu's TS bit it still set and the
lguest launcher task's thread_info has TS_USEDFPU still set.

This is because of the way the lguest launcher handling the guest's TS bit.
(look at lguest_set_ts() in lguest_arch_run_guest()). This can result
in a DNA fault while doing unlazy_fpu() in __switch_to(). This will
end up causing a DNA fault in the context of new process thats
getting context switched in (as opossed to handling DNA fault in the context
of lguest launcher/helper process).

This is wrong in both pre and post 2.6.25 kernels. In the recent
2.6.26-rc series, this is showing up as NULL pointer dereferences or
sleeping function called from atomic context(__switch_to()), as
we free and dynamically allocate the FPU context for the newly
created threads. Older kernels might show some FPU corruption for processes
running inside of lguest.

With the appended patch, my test system is running for more than 50 mins
now. So atleast some of your oops (hopefully all!) should get fixed.
Please give it a try. I will spend more time with this fix tomorrow.
Reported-by: NSimon Holm Thøgersen <odie@cs.aau.dk>
Reported-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NSuresh Siddha <suresh.b.siddha@intel.com>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

54481cf8

30 5月, 2008 2 次提交

virtio: set device index in common code. · b769f579

由 Rusty Russell 提交于 5月 30, 2008

Anthony Liguori points out that three different transports use the virtio code,
but each one keeps its own counter to set the virtio_device's index field.  In
theory (though not in current practice) this means that names could be
duplicated, and that risk grows as more transports are created.

So we move the selection of the unique virtio_device.index into the common code
in virtio.c, which has the side-benefit of removing duplicate code.

The only complexity is that lguest and S/390 use the index to uniquely identify
the device in case of catastrophic failure before register_virtio_device() is
called: now we use the offset within the descriptor page as a unique identifier
for the printks.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>
Cc: Christian Borntraeger <borntraeger@de.ibm.com>
Cc: Martin Schwidefsky <schwidefsky@de.ibm.com>
Cc: Carsten Otte <cotte@de.ibm.com>
Cc: Heiko Carstens <heiko.carstens@de.ibm.com>
Cc: Chris Lalancette <clalance@redhat.com>
Cc: Anthony Liguori <anthony@codemonkey.ws>

b769f579

lguest: use ioremap_cache, not ioremap · e27810f1

由 Rusty Russell 提交于 5月 30, 2008

Thanks to Jon Corbet & LWN.  Only took me a day to join the dots.

Host->Guest netcat before (with unnecessily large receive buffers):
1073741824 bytes (1.1 GB) copied, 24.7528 seconds, 43.4 MB/s

After:
1073741824 bytes (1.1 GB) copied, 17.6369 seconds, 60.9 MB/s
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

e27810f1

27 5月, 2008 1 次提交

x86/paravirt: add pte_flags to just get pte flags · a15af1c9

由 Jeremy Fitzhardinge 提交于 5月 26, 2008

Add pte_flags() to extract the flags from a pte.  This is a special
case of pte_val() which is only guaranteed to return the pte's flags
correctly; the page number may be corrupted or missing.

The intent is to allow paravirt implementations to return pte flags
without having to do any translation of the page number (most notably,
Xen).
Signed-off-by: NJeremy Fitzhardinge <jeremy.fitzhardinge@citrix.com>
Signed-off-by: NThomas Gleixner <tglx@linutronix.de>

a15af1c9

02 5月, 2008 4 次提交

lguest: make Launcher see device status updates · a007a751

由 Rusty Russell 提交于 5月 02, 2008

This brings us closer to Real Life, where we'd examine the device
features once it's set the DRIVER_OK status bit.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

a007a751

lguest: remove bogus NULL cpu check · 9f3f7467

由 Rusty Russell 提交于 5月 02, 2008

If lg isn't NULL, and cpu_id is sane, &lg->cpus[cpu_id] can't be NULL.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

9f3f7467

lguest: avoid using NR_CPUS as a bounds check. · 24adf127

由 Rusty Russell 提交于 5月 02, 2008

NR_CPUS (being a host number) is an arbitrary limit for the Guest.
Using the array size directly (which currently happes to be NR_CPUS)
is more futureproof.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

24adf127

virtio: explicit advertisement of driver features · c45a6816

由 Rusty Russell 提交于 5月 02, 2008

A recent proposed feature addition to the virtio block driver revealed
some flaws in the API: in particular, we assume that feature
negotiation is complete once a driver's probe function returns.

There is nothing in the API to require this, however, and even I
didn't notice when it was violated.

So instead, we require the driver to specify what features it supports
in a table, we can then move the feature negotiation into the virtio
core.  The intersection of device and driver features are presented in
a new 'features' bitmap in the struct virtio_device.

Note that this highlights the difference between Linux unsigned-long
bitmaps where each unsigned long is in native endian, and a
straight-forward little-endian array of bytes.

Drivers can still remove feature bits in their probe routine if they
really have to.

API changes:
- dev->config->feature() no longer gets and acks a feature.
- drivers should advertise their features in the 'feature_table' field
- use virtio_has_feature() for extra sanity when checking feature bits
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

c45a6816

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功