1. 26 Oct 2016, 4 commits
  2. 16 Sep 2016, 2 commits
    • tcg: Merge GETPC and GETRA · 01ecaf43
      Committed by Richard Henderson
      The return address argument to the softmmu template helpers was
      confused.  In the legacy case, we wanted to indicate that there
      is no return address, and so passed in NULL.  However, we then
      immediately subtracted GETPC_ADJ from NULL, resulting in a non-zero
      value, indicating the presence of an (invalid) return address.
      
      Push the GETPC_ADJ subtraction down to the only point it's required:
      immediately before use within cpu_restore_state_from_tb, after all
      NULL pointer checks have been completed.
      
      This makes GETPC and GETRA identical.  Remove GETRA as the lesser
      used macro, replacing all uses with GETPC.
      Signed-off-by: Richard Henderson <rth@twiddle.net>
      01ecaf43
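
      A minimal sketch of the resulting shape, for readers unfamiliar with these
      macros (simplified and hedged; the real definitions and the exact signature
      of cpu_restore_state_from_tb live in QEMU's sources and differ in detail):

        /* Simplified sketch, not QEMU's verbatim code. */
        #include <stdint.h>
        #include <stdbool.h>

        #define GETPC_ADJ 2
        /* With GETRA removed, GETPC is just the raw host return address. */
        #define GETPC() ((uintptr_t)__builtin_return_address(0))

        static bool restore_state_sketch(uintptr_t retaddr)
        {
            if (retaddr == 0) {
                return false;          /* no return address to restore from */
            }
            retaddr -= GETPC_ADJ;      /* the adjustment happens here, after the check */
            /* ... locate the TB containing retaddr, restore guest state ... */
            return true;
        }

        /* Helpers now pass GETPC() (or 0 for "no return address") unmodified,
         * instead of pre-subtracting GETPC_ADJ at every call site. */
        void helper_sketch(void)
        {
            restore_state_sketch(GETPC());
        }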
    • tcg: Support arbitrary size + alignment · 85aa8081
      Committed by Richard Henderson
      Previously we allowed fully unaligned operations, but not operations
      that are aligned but with less alignment than the operation size.
      
      In addition, arm32, ia64, mips, and sparc had been omitted from the
      previous overalignment patch, which would have led to that alignment
      being enforced.
      Signed-off-by: Richard Henderson <rth@twiddle.net>
      85aa8081
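
      In other words, the required alignment and the access size are now
      independent. A purely illustrative sketch of the check this permits
      (names and values are assumptions, not QEMU's code):

        /* Illustrative only.  An 8-byte access that only requires 4-byte
         * alignment:
         *   addr = 0x1000  -> allowed (4-byte aligned)
         *   addr = 0x1004  -> allowed (4-byte aligned, though not 8-byte aligned)
         *   addr = 0x1002  -> alignment fault
         */
        #include <stdbool.h>
        #include <stdint.h>

        static bool aligned_to(uint64_t addr, uint64_t align_bytes)
        {
            /* align_bytes is a power of two and may be smaller (or larger)
             * than the access size. */
            return (addr & (align_bytes - 1)) == 0;
        }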
  3. 09 Jul 2016, 3 commits
    • cputlb: Fix for self-modifying writes across page boundaries · 81daabaf
      Committed by Samuel Damashek
      As it currently stands, QEMU does not properly handle self-modifying code
      when the write is unaligned and crosses a page boundary. The procedure
      for handling a write to the current translation block is to write-protect
      the current translation block, catch the write, split up the translation
      block into the current instruction (which remains write-protected so that
      the current instruction is not modified) and the remaining instructions
      in the translation block, and then restore the CPU state to before the
      write occurred so the write will be retried and successfully executed.
      However, since unaligned writes across pages are split into one-byte
      writes for simplicity, writes to the second page (which is not the
      current TB) may succeed before a write to the current TB is attempted,
      and since these writes are not invalidated before resuming state after
      splitting the TB, these writes will be performed a second time, thus
      corrupting the second page. Credit goes to Patrick Hulin for
      discovering this.
      
      In recent 64-bit versions of Windows running in emulated mode, this
      results in either being very unstable (a BSOD after a couple minutes of
      uptime), or being entirely unable to boot. Windows performs one or more
      8-byte unaligned self-modifying writes (xors) which intersect the end
      of the current TB and the beginning of the next TB, which runs into the
      aforementioned issue. This commit fixes that issue by making the
      unaligned write loop perform the writes in forwards order, instead of
      reverse order. This way, QEMU immediately tries to write to the current
      TB, and splits the TB before any write to the second page is executed.
      The write then proceeds as intended. With this patch applied, I am able
      to boot and use Windows 7 64-bit and Windows 10 64-bit in QEMU without
      KVM.
      
      Per Richard Henderson's input, this patch also ensures the second page
      is in the TLB before executing the write loop, to ensure the second
      page is mapped.
      
      The original discussion of the issue is located at
      http://lists.nongnu.org/archive/html/qemu-devel/2014-08/msg02161.html.
      Signed-off-by: Samuel Damashek <samuel.damashek@invincea.com>
      Message-Id: <20160706182652.16190-1-samuel.damashek@invincea.com>
      Signed-off-by: Richard Henderson <rth@twiddle.net>
      81daabaf
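
      A self-contained sketch of the fixed slow path described above (the helper
      names below are placeholders, not QEMU's actual softmmu functions):

        #include <stdint.h>

        #define TARGET_PAGE_SIZE 4096
        #define TARGET_PAGE_MASK (~(uint64_t)(TARGET_PAGE_SIZE - 1))

        /* Placeholder hooks standing in for the real softmmu machinery. */
        static void ensure_page_in_tlb(uint64_t page_addr) { (void)page_addr; }
        static void store_single_byte(uint64_t addr, uint8_t byte)
        {
            (void)addr; (void)byte;
        }

        static void store_unaligned_cross_page(uint64_t addr, uint64_t val, int size)
        {
            /* Make sure the second page is mapped before the loop starts. */
            ensure_page_in_tlb((addr + size - 1) & TARGET_PAGE_MASK);

            /* Forward order (ascending addresses), not reverse: the byte that
             * hits the write-protected page containing the current TB faults
             * first, the TB is split, and the whole write is retried before
             * anything has landed on the second page. */
            for (int i = 0; i < size; i++) {
                store_single_byte(addr + i, (uint8_t)(val >> (i * 8)));  /* LE split */
            }
        }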
    • cputlb: Add address parameter to VICTIM_TLB_HIT · a390284b
      Committed by Samuel Damashek
      [rth: Split out from the original patch.]
      Signed-off-by: Samuel Damashek <samuel.damashek@invincea.com>
      Message-Id: <20160706182652.16190-1-samuel.damashek@invincea.com>
      Signed-off-by: Richard Henderson <rth@twiddle.net>
      a390284b
    • cputlb: Move VICTIM_TLB_HIT out of line · 7e9a7c50
      Committed by Richard Henderson
      There are currently 22 invocations of this function,
      and we're about to increase that number.
      Signed-off-by: Richard Henderson <rth@twiddle.net>
      7e9a7c50
  4. 06 Jul 2016, 1 commit
    • tcg: Improve the alignment check infrastructure · 1f00b27f
      Committed by Sergey Sorokin
      Some architectures (e.g. ARMv8) require an address to be aligned to a
      size larger than the size of the memory access itself. The existing
      zero-cost alignment check in QEMU is sufficient to implement such a
      check, but it needs a way for the alignment size to be specified
      separately from the access size.
      Signed-off-by: Sergey Sorokin <afarallax@yandex.ru>
      Message-Id: <1466705806-679898-1-git-send-email-afarallax@yandex.ru>
      Signed-off-by: Richard Henderson <rth@twiddle.net>
      [rth: Assert in tcg_canonicalize_memop.  Leave get_alignment_bits
      available for, though unused by, user-mode.  Retain logging difference
      based on ALIGNED_ONLY.]
      1f00b27f
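
      A hedged sketch of the idea (the field layout below is an illustration, not
      QEMU's actual TCGMemOp encoding): the required alignment travels with the
      memory operation as its own power of two, so it can exceed the access size,
      e.g. a 4-byte load that must be 16-byte aligned.

        #include <stdbool.h>
        #include <stdint.h>

        /* Illustrative packing: low bits hold log2(access size), a separate
         * field holds log2(required alignment). */
        #define MO_SIZE_MASK    0x3u
        #define MO_ALIGN_SHIFT  4
        #define MO_ALIGN_MASK   (0x7u << MO_ALIGN_SHIFT)

        static unsigned get_alignment_bits_sketch(uint32_t memop)
        {
            return (memop & MO_ALIGN_MASK) >> MO_ALIGN_SHIFT;
        }

        static bool address_ok(uint64_t addr, uint32_t memop)
        {
            uint64_t amask = ((uint64_t)1 << get_alignment_bits_sketch(memop)) - 1;
            return (addr & amask) == 0;   /* e.g. 16-byte alignment on a 4-byte load */
        }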
  5. 21 Jan 2016, 1 commit
  6. 11 Sep 2015, 2 commits
  7. 15 Aug 2015, 1 commit
    • exec: drop cpu_can_do_io, just read cpu->can_do_io · 414b15c9
      Committed by Paolo Bonzini
      After commit 626cf8f4 (icount: set can_do_io outside TB execution,
      2014-12-08), can_do_io is set to 1 if not executing code.  It is
      no longer necessary to make this assumption in cpu_can_do_io.
      
      It is also possible to remove the use_icount test, simply by
      never setting cpu->can_do_io to 0 unless use_icount is true.
      
      With these changes cpu_can_do_io boils down to a read of
      cpu->can_do_io.
      Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>
      414b15c9
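
      A minimal sketch of what the simplification amounts to (struct and names
      reduced to the bare minimum, not QEMU's definitions):

        /* With can_do_io kept at 1 whenever icount is not in use, the old
         * helper reduces to a plain field read, and callers can simply read
         * cpu->can_do_io themselves. */
        typedef struct CPUState {
            int can_do_io;    /* 0 only while executing a TB under icount */
        } CPUState;

        static inline int cpu_can_do_io_sketch(const CPUState *cpu)
        {
            return cpu->can_do_io;
        }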
  8. 11 Jun 2015, 1 commit
  9. 15 May 2015, 2 commits
  10. 26 Apr 2015, 3 commits
  11. 17 Feb 2015, 1 commit
    • exec: make iotlb RCU-friendly · 9d82b5a7
      Committed by Paolo Bonzini
      After the previous patch, TLBs will be flushed on every change to
      the memory mapping.  This patch augments that with synchronization
      of the MemoryRegionSections referred to in the iotlb array.
      
      With this change, it is guaranteed that iotlb_to_region will access
      the correct memory map, even once the TLB is accessed outside
      the BQL.
      Reviewed-by: Fam Zheng <famz@redhat.com>
      Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>
      9d82b5a7
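
      A hedged sketch of the access pattern this enables (all names below are
      placeholders; QEMU's real code uses its own rcu_read_lock()/rcu_read_unlock()
      and dispatch structures): the sections referenced by the iotlb are resolved
      inside an RCU read-side critical section, so a concurrent memory-map change
      cannot free them out from under the reader.

        #include <stddef.h>

        typedef struct MemoryRegionSection { int mr_data; } MemoryRegionSection;
        typedef struct Dispatch {
            MemoryRegionSection *sections;   /* array referenced by iotlb entries */
        } Dispatch;

        static void rcu_read_lock_sketch(void)   { /* stub */ }
        static void rcu_read_unlock_sketch(void) { /* stub */ }

        static MemoryRegionSection section_storage[16];
        static Dispatch dispatch_storage = { section_storage };
        static Dispatch *current_dispatch = &dispatch_storage;   /* RCU-published */

        static int access_via_iotlb_sketch(size_t index)
        {
            rcu_read_lock_sketch();
            /* The dispatch pointer is published with RCU, so the sections it
             * points to stay valid for this whole read-side critical section,
             * even if a memory-map change swaps in a new dispatch meanwhile. */
            MemoryRegionSection *s = &current_dispatch->sections[index];
            int data = s->mr_data;            /* stand-in for the device access */
            rcu_read_unlock_sketch();
            return data;
        }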
  12. 03 Nov 2014, 1 commit
  13. 02 Sep 2014, 1 commit
    • implementing victim TLB for QEMU system emulated TLB · 88e89a57
      Committed by Xin Tong
      QEMU system mode page table walks are expensive. Measurements taken by
      running qemu-system-x86_64 under Intel PIN show that a TLB miss and the
      walk of a 4-level page table in the guest Linux OS take ~450 x86
      instructions on average.
      
      The QEMU system mode TLB is implemented as a direct-mapped hash table.
      This structure suffers from conflict misses. Increasing the
      associativity of the TLB is not necessarily the answer to conflict
      misses, as all the ways may have to be searched serially.
      
      A victim TLB is a TLB used to hold translations evicted from the
      primary TLB upon replacement. The victim TLB lies between the main TLB
      and its refill path. The victim TLB has greater associativity (fully
      associative in this patch). It takes longer to look up the victim TLB,
      but it is likely still cheaper than a full page table walk. The memory
      translation path is changed as follows:
      
      Before Victim TLB:
      1. Inline TLB lookup
      2. Exit code cache on TLB miss.
      3. Check for unaligned, IO accesses
      4. TLB refill.
      5. Do the memory access.
      6. Return to code cache.
      
      After Victim TLB:
      1. Inline TLB lookup
      2. Exit code cache on TLB miss.
      3. Check for unaligned, IO accesses
      4. Victim TLB lookup.
      5. If victim TLB misses, TLB refill
      6. Do the memory access.
      7. Return to code cache
      
      The advantage is that the victim TLB adds associativity behind the
      direct-mapped TLB, and thus potentially saves page table walks, while
      still keeping the time taken to flush within reasonable limits.
      However, placing the victim TLB before the refill path lengthens that
      path, since the victim TLB is consulted before the TLB refill. The
      performance results demonstrate that the pros outweigh the cons.
      
      Some performance results, taken on the SPECINT2006 train datasets, a
      kernel boot, and the QEMU configure script on an Intel(R) Xeon(R) CPU
      E5620 @ 2.40GHz Linux machine, are shown in the Google Doc linked
      below.
      
      https://docs.google.com/spreadsheets/d/1eiItzekZwNQOal_h-5iJmC4tMDi051m9qidi5_nwvH4/edit?usp=sharing
      
      In summary, the victim TLB improves the performance of qemu-system-x86_64
      by 11% on average across SPECINT2006, the kernel boot, and the QEMU
      configure script, with the highest improvement of 26% in 456.hmmer. The
      victim TLB does not cause a performance degradation in any of the
      measured benchmarks. Furthermore, the implementation is architecture
      independent and is expected to benefit other architectures in QEMU as
      well.
      
      Although there are measurement fluctuations, the performance
      improvement is significant and well outside the range of measurement
      noise.
      Signed-off-by: Xin Tong <trent.tong@gmail.com>
      Message-id: 1407202523-23553-1-git-send-email-trent.tong@gmail.com
      Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
      Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
      88e89a57
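
      A hedged sketch of step 4 in the "After Victim TLB" path above (structure
      and names are illustrative, not QEMU's actual cputlb code): the victim TLB
      is scanned associatively, and on a hit the entry is swapped back into the
      direct-mapped primary TLB so the next access hits the inline lookup.

        #include <stdbool.h>
        #include <stdint.h>

        #define TLB_SIZE       256       /* direct-mapped primary TLB (illustrative) */
        #define CPU_VTLB_SIZE  8         /* small fully associative victim TLB */

        typedef struct TLBEntry {
            uint64_t  page_addr;         /* guest page this entry translates */
            uintptr_t addend;            /* host offset for the translation */
        } TLBEntry;

        static TLBEntry tlb_main[TLB_SIZE];
        static TLBEntry tlb_victim[CPU_VTLB_SIZE];

        static bool victim_tlb_hit_sketch(unsigned primary_index, uint64_t page_addr)
        {
            for (int i = 0; i < CPU_VTLB_SIZE; i++) {
                if (tlb_victim[i].page_addr == page_addr) {
                    /* Swap the victim entry with the conflicting primary entry,
                     * so the next access to this page hits the inline lookup. */
                    TLBEntry tmp = tlb_main[primary_index];
                    tlb_main[primary_index] = tlb_victim[i];
                    tlb_victim[i] = tmp;
                    return true;
                }
            }
            return false;                /* fall through to the page-table walk */
        }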
  14. 05 Jun 2014, 3 commits
  15. 14 Mar 2014, 4 commits
  16. 11 Feb 2014, 2 commits
  17. 01 Feb 2014, 1 commit
  18. 11 Oct 2013, 2 commits
  19. 03 Sep 2013, 3 commits
  20. 27 Aug 2013, 2 commits