Commits · 03eebc9e3246b9b3f5925aa41f7dfd7c1e467875 · openeuler / qemu

05 Jun, 2015 2 commits

memory: replace cpu_physical_memory_reset_dirty() with test-and-clear · 03eebc9e

Authored by Stefan Hajnoczi on Dec 02, 2014

The cpu_physical_memory_reset_dirty() function is sometimes used
together with cpu_physical_memory_get_dirty().  This is not atomic since
two separate accesses to the dirty memory bitmap are made.

Turn cpu_physical_memory_reset_dirty() and
cpu_physical_memory_clear_dirty_range_type() into the atomic
cpu_physical_memory_test_and_clear_dirty().
Signed-off-by: Stefan Hajnoczi <stefanha@redhat.com>
Message-Id: <1417519399-3166-6-git-send-email-stefanha@redhat.com>
Reviewed-by: Fam Zheng <famz@redhat.com>
Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>

03eebc9e
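
The difference between a separate get/reset pair and a single test-and-clear can be sketched as follows. This is a minimal illustration using C11 atomics, not QEMU's implementation: the bitmap layout and the names `demo_dirty`, `demo_set_dirty()` and `demo_test_and_clear_dirty()` are hypothetical, and the sketch handles one page at a time rather than a range.

```c
#include <limits.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

#define DEMO_NPAGES        256
#define DEMO_BITS_PER_WORD (sizeof(unsigned long) * CHAR_BIT)
#define DEMO_NWORDS \
    ((DEMO_NPAGES + DEMO_BITS_PER_WORD - 1) / DEMO_BITS_PER_WORD)

/* Hypothetical dirty bitmap: one bit per page. */
static _Atomic unsigned long demo_dirty[DEMO_NWORDS];

/* Mark one page dirty. */
static void demo_set_dirty(size_t page)
{
    unsigned long mask = 1UL << (page % DEMO_BITS_PER_WORD);

    atomic_fetch_or(&demo_dirty[page / DEMO_BITS_PER_WORD], mask);
}

/*
 * Read and clear the bit in a single atomic read-modify-write.
 * With a separate "get" followed by a "reset", a bit set by another
 * thread between the two calls could be cleared without being seen.
 */
static bool demo_test_and_clear_dirty(size_t page)
{
    unsigned long mask = 1UL << (page % DEMO_BITS_PER_WORD);
    unsigned long old =
        atomic_fetch_and(&demo_dirty[page / DEMO_BITS_PER_WORD], ~mask);

    return (old & mask) != 0;
}
```

A caller would use the return value to decide whether the page needs processing, knowing that the bit it observed is exactly the bit it cleared.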

cputlb: remove useless arguments to tlb_unprotect_code_phys, rename · 9564f52d

Authored by Paolo Bonzini on Apr 22, 2015

These days modification of the TLB is done in notdirty_mem_write,
so the virtual address and env pointer are unnecessary.

The new name of the function, tlb_unprotect_code, is consistent with
tlb_protect_code.
Reviewed-by: Fam Zheng <famz@redhat.com>
Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>

9564f52d

26 Apr, 2015 2 commits

Add MemTxAttrs to the IOTLB · fadc1cbe

Authored by Peter Maydell on Apr 26, 2015

Add a MemTxAttrs field to the IOTLB, and allow target-specific
code to set it via a new tlb_set_page_with_attrs() function;
pass the attributes through to the device when making IO accesses.
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
Reviewed-by: Paolo Bonzini <pbonzini@redhat.com>
Reviewed-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
Reviewed-by: Alex Bennée <alex.bennee@linaro.org>

fadc1cbe

Make CPU iotlb a structure rather than a plain hwaddr · e469b22f

Authored by Peter Maydell on Apr 26, 2015

Make the CPU iotlb a structure rather than a plain hwaddr;
this will allow us to add transaction attributes to it.
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
Reviewed-by: Paolo Bonzini <pbonzini@redhat.com>
Reviewed-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>
Reviewed-by: Alex Bennée <alex.bennee@linaro.org>

e469b22f
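
The idea of turning a plain address into a structure so that transaction attributes can travel with it might look roughly like this. The sketch is illustrative only: `demo_hwaddr`, `DemoTxAttrs`, `DemoIOTLBEntry` and `demo_set_iotlb()` are hypothetical stand-ins, and the two attribute bits are assumptions rather than QEMU's actual field list.

```c
#include <stdint.h>

typedef uint64_t demo_hwaddr;

/* Hypothetical per-transaction attributes (assumed fields). */
typedef struct {
    unsigned int secure : 1;
    unsigned int user : 1;
} DemoTxAttrs;

/*
 * Before: the table held bare addresses (demo_hwaddr iotlb[...]).
 * After: each entry is a structure, so new fields such as the
 * attributes can be added without changing every user of the table.
 */
typedef struct {
    demo_hwaddr addr;
    DemoTxAttrs attrs;
} DemoIOTLBEntry;

#define DEMO_TLB_SIZE 256

static DemoIOTLBEntry demo_iotlb[DEMO_TLB_SIZE];

/* Record the address and attributes for one TLB slot. */
static void demo_set_iotlb(unsigned int index, demo_hwaddr addr,
                           DemoTxAttrs attrs)
{
    DemoIOTLBEntry *entry = &demo_iotlb[index % DEMO_TLB_SIZE];

    entry->addr = addr;
    entry->attrs = attrs;
}
```

An IO access path could then read `attrs` from the same entry it already consults for the address and hand both to the device.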

17 Feb, 2015 2 commits

exec: RCUify AddressSpaceDispatch · 79e2b9ae

Authored by Paolo Bonzini on Jan 21, 2015

Note that even after this patch, most callers of address_space_*
functions must still be under the big QEMU lock, otherwise the memory
region returned by address_space_translate can disappear as soon as
address_space_translate returns.  This will be fixed in the next part
of this series.
Reviewed-by: Fam Zheng <famz@redhat.com>
Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>

79e2b9ae

exec: make iotlb RCU-friendly · 9d82b5a7

Authored by Paolo Bonzini on Aug 16, 2013

After the previous patch, TLBs will be flushed on every change to
the memory mapping.  This patch augments that with synchronization
of the MemoryRegionSections referred to in the iotlb array.

With this change, it is guaranteed that iotlb_to_region will access
the correct memory map, even once the TLB is accessed outside
the BQL.
Reviewed-by: Fam Zheng <famz@redhat.com>
Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>

9d82b5a7

17 Dec, 2014 1 commit

qemu-log: add log category for MMU info · 339aaf5b

Authored by Antony Pavlov on Dec 13, 2014

Running barebox on qemu-system-mips* with '-d unimp' floods
stderr with a very large number of mips_cpu_handle_mmu_fault() messages:

  mips_cpu_handle_mmu_fault address=b80003fd ret 0 physical 00000000180003fd prot 3
  mips_cpu_handle_mmu_fault address=a0800884 ret 0 physical 0000000000800884 prot 3
  mips_cpu_handle_mmu_fault pc a080cd80 ad b80003fd rw 0 mmu_idx 0

So it's very difficult to find the LOG_UNIMP messages.

The mips_cpu_handle_mmu_fault() messages appear when ANY
logging is enabled, which is not very convenient.

Adding a separate log category for *_cpu_handle_mmu_fault()
logging fixes the problem.
Signed-off-by: Antony Pavlov <antonynpavlov@gmail.com>
Acked-by: Alexander Graf <agraf@suse.de>
Reviewed-by: Richard Henderson <rth@twiddle.net>
Message-id: 1418489298-1184-1-git-send-email-antonynpavlov@gmail.com
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>

339aaf5b
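
The effect of giving MMU messages their own category can be sketched with a simple bitmask logger. This is a hedged illustration, not QEMU's logging code: `DEMO_LOG_UNIMP`, `DEMO_LOG_MMU`, `demo_loglevel` and `demo_log_mask()` are hypothetical names chosen for the example.

```c
#include <stdarg.h>
#include <stdio.h>

/* Hypothetical log categories, one bit each. */
enum {
    DEMO_LOG_UNIMP = 1 << 0,
    DEMO_LOG_MMU   = 1 << 1,
};

/* Categories currently enabled (for example from a command-line flag). */
static int demo_loglevel;

/*
 * Print the message only if its category is enabled.
 * Returns 1 if the message was printed, 0 if it was filtered out.
 */
static int demo_log_mask(int mask, const char *fmt, ...)
{
    va_list ap;

    if (!(demo_loglevel & mask)) {
        return 0;
    }
    va_start(ap, fmt);
    vfprintf(stderr, fmt, ap);
    va_end(ap);
    return 1;
}
```

With only the "unimplemented" category enabled, messages tagged with the MMU category are dropped, so the output is no longer dominated by MMU fault traces.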

02 Sep, 2014 1 commit

implementing victim TLB for QEMU system emulated TLB · 88e89a57

Authored by Xin Tong on Aug 04, 2014

QEMU system mode page table walks are expensive. Measured by running
qemu-system-x86_64 in system mode under Intel PIN, a TLB miss followed by a
walk of the 4-level page table in a guest Linux OS takes ~450 x86 instructions on
average.

QEMU system mode TLB is implemented using a directly-mapped hashtable.
This structure suffers from conflict misses. Increasing the
associativity of the TLB may not be the solution to conflict misses as
all the ways may have to be walked in serial.

A victim TLB is a TLB used to hold translations evicted from the
primary TLB upon replacement. The victim TLB lies between the main TLB
and its refill path. The victim TLB has greater associativity (fully
associative in this patch). It takes longer to look up the victim TLB,
but it's likely better than a full page table walk. The memory
translation path is changed as follows:

Before Victim TLB:
1. Inline TLB lookup
2. Exit code cache on TLB miss.
3. Check for unaligned, IO accesses
4. TLB refill.
5. Do the memory access.
6. Return to code cache.

After Victim TLB:
1. Inline TLB lookup
2. Exit code cache on TLB miss.
3. Check for unaligned, IO accesses
4. Victim TLB lookup.
5. If victim TLB misses, TLB refill
6. Do the memory access.
7. Return to code cache

The advantage is that victim TLB can offer more associativity to a
directly mapped TLB and thus potentially fewer page table walks while
still keeping the time taken to flush within reasonable limits.
However, placing a victim TLB before the refill path lengthens the TLB
refill path, as the victim TLB is consulted before the TLB refill. The
performance results demonstrate that the pros outweigh the cons.

Some performance results taken on SPECINT2006 train
datasets, a kernel boot, and the qemu configure script on an
Intel(R) Xeon(R) CPU E5620 @ 2.40GHz Linux machine are shown in the
Google Doc link below.

https://docs.google.com/spreadsheets/d/1eiItzekZwNQOal_h-5iJmC4tMDi051m9qidi5_nwvH4/edit?usp=sharing

In summary, victim TLB improves the performance of qemu-system-x86_64 by
11% on average on SPECINT2006, kernel boot and the qemu configure script, with
the highest improvement of 26% in 456.hmmer. Victim TLB does not result
in performance degradation in any of the measured benchmarks. Furthermore,
the implemented victim TLB is architecture independent and is expected to
benefit other architectures in QEMU as well.

Although there are measurement fluctuations, the performance
improvement is significant and well outside the range of
noise.
Signed-off-by: Xin Tong <trent.tong@gmail.com>
Message-id: 1407202523-23553-1-git-send-email-trent.tong@gmail.com
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>

88e89a57
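
The "After Victim TLB" path described in this commit message can be sketched as a direct-mapped table backed by a small fully associative one. This is a simplified model under stated assumptions, not the patch itself: the sizes, the `DemoTLB` types, the round-robin replacement, the swap on a victim hit, and the fake `demo_page_table_walk()` are all choices made for the illustration.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define DEMO_PAGE_BITS 12
#define DEMO_TLB_SIZE  256   /* direct-mapped main TLB */
#define DEMO_VTLB_SIZE 8     /* fully associative victim TLB */

typedef struct {
    uint64_t vpage;          /* virtual page number */
    uint64_t ppage;          /* physical page number */
    bool valid;
} DemoTLBEntry;

typedef struct {
    DemoTLBEntry main[DEMO_TLB_SIZE];
    DemoTLBEntry victim[DEMO_VTLB_SIZE];
    unsigned int next_victim;  /* round-robin replacement index */
    unsigned int walks;        /* number of simulated page table walks */
} DemoTLB;

/* Stand-in for the expensive guest page table walk. */
static uint64_t demo_page_table_walk(DemoTLB *tlb, uint64_t vpage)
{
    tlb->walks++;
    return vpage + 0x1000;     /* arbitrary fake mapping */
}

static uint64_t demo_translate(DemoTLB *tlb, uint64_t vaddr)
{
    uint64_t vpage = vaddr >> DEMO_PAGE_BITS;
    DemoTLBEntry *e = &tlb->main[vpage % DEMO_TLB_SIZE];

    if (!(e->valid && e->vpage == vpage)) {
        size_t i;

        /* Main TLB miss: search every victim entry. */
        for (i = 0; i < DEMO_VTLB_SIZE; i++) {
            DemoTLBEntry *v = &tlb->victim[i];

            if (v->valid && v->vpage == vpage) {
                /* Victim hit: swap it with the conflicting main entry. */
                DemoTLBEntry tmp = *e;
                *e = *v;
                *v = tmp;
                break;
            }
        }
        if (i == DEMO_VTLB_SIZE) {
            /* Victim miss: keep the evicted entry, then refill. */
            if (e->valid) {
                tlb->victim[tlb->next_victim] = *e;
                tlb->next_victim = (tlb->next_victim + 1) % DEMO_VTLB_SIZE;
            }
            e->vpage = vpage;
            e->ppage = demo_page_table_walk(tlb, vpage);
            e->valid = true;
        }
    }
    return (e->ppage << DEMO_PAGE_BITS) |
           (vaddr & ((1ULL << DEMO_PAGE_BITS) - 1));
}
```

Two addresses that map to the same main TLB slot would evict each other on every access in a purely direct-mapped design. In this sketch they alternate between the main slot and the victim table, so only the first access to each page pays for a walk.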

05 Jun, 2014 4 commits

softmmu: introduce cpu_ldst.h · f08b6170

Authored by Paolo Bonzini on Mar 28, 2014

This will collect all load and store helpers soon.  For now
it is just a replacement for softmmu_exec.h, which this patch
stops including directly, but we also include it where this will
be necessary in order to simplify the next patch.
Reviewed-by: Richard Henderson <rth@twiddle.net>
Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>

f08b6170

softmmu: move softmmu_template.h out of include/ · 58ed270d

Authored by Paolo Bonzini on Mar 28, 2014

It is only included in cputlb.c now.
Reviewed-by: Richard Henderson <rth@twiddle.net>
Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>

58ed270d

softmmu: commonize helper definitions · 0f590e74

Authored by Paolo Bonzini on Mar 28, 2014

They do not need to be in op_helper.c.  Because cputlb.c now includes
softmmu_template.h twice for each size, io_readX must be elided the
second time through.
Reviewed-by: Richard Henderson <rth@twiddle.net>
Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>

0f590e74

cputlb: Fix regression with TCG interpreter (bug 1310324) · 7e4e8865

Authored by Stefan Weil on Apr 28, 2014

Commit 0f842f8a replaced GETPC_EXT() which
was derived from GETPC() by GETRA_EXT() without fixing cputlb.c. A later
patch replaced GETRA_EXT() by GETRA() in exec/softmmu_template.h which
is included in cputlb.c.

The TCG interpreter failed because the values returned by GETRA() were no
longer explicitly set to 0. The redefinition of GETRA() introduced here
fixes this.

In addition, GETPC_ADJ which is also used in exec/softmmu_template.h is
set to 0. Both changes reduce the compiled code size for cputlb.c by more
than 100 bytes, so the normal TCG without interpreter also profits from
the reduced code size and slightly faster code.

Cc: qemu-stable@nongnu.org
Reported-by: Giovanni Mascellani <gio@debian.org>
Signed-off-by: Stefan Weil <sw@weilnetz.de>
Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>

7e4e8865

14 Mar, 2014 8 commits

cputlb: Change tlb_set_page() argument to CPUState · 0c591eb0

Authored by Andreas Färber on Sep 03, 2013

Signed-off-by: Andreas Färber <afaerber@suse.de>

0c591eb0

cputlb: Change tlb_flush() argument to CPUState · 00c8cb0a

Authored by Andreas Färber on Sep 04, 2013

Signed-off-by: Andreas Färber <afaerber@suse.de>

00c8cb0a

cputlb: Change tlb_flush_page() argument to CPUState · 31b030d4

Authored by Andreas Färber on Sep 04, 2013

Signed-off-by: Andreas Färber <afaerber@suse.de>

31b030d4

exec: Change cpu_abort() argument to CPUState · a47dddd7

Authored by Andreas Färber on Sep 03, 2013

Signed-off-by: Andreas Färber <afaerber@suse.de>

a47dddd7

exec: Change memory_region_section_get_iotlb() argument to CPUState · bb0e627a

Authored by Andreas Färber on Sep 03, 2013

It no longer needs CPUArchState since moving watchpoints to CPUState.
Signed-off-by: Andreas Färber <afaerber@suse.de>

bb0e627a

cputlb: Change tlb_unprotect_code_phys() argument to CPUState · baea4fae

Authored by Andreas Färber on Sep 03, 2013

Note that the argument is unused.
Signed-off-by: Andreas Färber <afaerber@suse.de>

baea4fae

translate-all: Change tb_flush_jmp_cache() argument to CPUState · 611d4f99

Authored by Andreas Färber on Sep 01, 2013

Signed-off-by: Andreas Färber <afaerber@suse.de>

611d4f99

cpu: Move tb_jmp_cache field from CPU_COMMON to CPUState · 8cd70437

Authored by Andreas Färber on Aug 26, 2013

Clear it on reset.
Signed-off-by: Andreas Färber <afaerber@suse.de>

8cd70437

11 Feb, 2014 2 commits

cpu: Add per-cpu address space · 09daed84

Authored by Edgar E. Iglesias on Dec 17, 2013

Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>

09daed84

exec: Make iotlb_to_region input an AS · 77717094

Authored by Edgar E. Iglesias on Nov 07, 2013

Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Signed-off-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>

77717094

13 Jan, 2014 5 commits

memory: split cpu_physical_memory_* functions to its own include · 220c3ebd

Authored by Juan Quintela on Oct 14, 2013

All the functions that use ram_addr_t should be here.
Signed-off-by: Juan Quintela <quintela@redhat.com>
Reviewed-by: Orit Wasserman <owasserm@redhat.com>

220c3ebd

memory: make cpu_physical_memory_reset_dirty() take a length parameter · a2f4d5be

Authored by Juan Quintela on Oct 10, 2013

We have an end parameter in all the callers, and this makes it consistent
with the rest of the cpu_physical_memory_* functions, which also take a
length parameter.

While here, move the start/end calculation to
tlb_reset_dirty_range_all() as we don't need it here anymore.
Signed-off-by: Juan Quintela <quintela@redhat.com>
Reviewed-by: Eric Blake <eblake@redhat.com>
Reviewed-by: Orit Wasserman <owasserm@redhat.com>

a2f4d5be

memory: s/dirty/clean/ in cpu_physical_memory_is_dirty() · a2cd8c85

Authored by Juan Quintela on Oct 10, 2013

All uses except one really want the other meaning.
Signed-off-by: Juan Quintela <quintela@redhat.com>
Reviewed-by: Eric Blake <eblake@redhat.com>
Reviewed-by: Orit Wasserman <owasserm@redhat.com>

a2cd8c85

memory: cpu_physical_memory_mask_dirty_range() always clears a single flag · 52159192

Authored by Juan Quintela on Oct 08, 2013

Document it.
Signed-off-by: Juan Quintela <quintela@redhat.com>
Reviewed-by: Eric Blake <eblake@redhat.com>
Reviewed-by: Orit Wasserman <owasserm@redhat.com>

52159192

memory: create function to set a single dirty bit · a1390db4

Authored by Juan Quintela on Oct 08, 2013

Signed-off-by: Juan Quintela <quintela@redhat.com>
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
Reviewed-by: Eric Blake <eblake@redhat.com>

a1390db4

23 Dec, 2013 2 commits

cputlb: Tidy memset() of arrays · eb2535f4

Authored by Richard Henderson on Dec 07, 2013

Don't duplicate the array length computation in the memset()
when plain sizeof() can produce the correct results.
Signed-off-by: Richard Henderson <rth@twiddle.net>
Reviewed-by: Aurelien Jarno <aurelien@aurel32.net>
Signed-off-by: Andreas Färber <afaerber@suse.de>

eb2535f4
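
The tidy-up described here relies on `sizeof` applied to an array object yielding the size of the whole array. A minimal sketch, assuming hypothetical `DemoTLBEntry` and `DemoEnv` types and made-up dimensions rather than QEMU's real definitions:

```c
#include <stddef.h>
#include <string.h>

#define DEMO_NB_MMU_MODES 3
#define DEMO_TLB_SIZE     256

/* Hypothetical TLB entry layout. */
typedef struct {
    unsigned long addr_read;
    unsigned long addr_write;
    unsigned long addr_code;
    unsigned long addend;
} DemoTLBEntry;

/* Hypothetical CPU state holding one table per MMU mode. */
typedef struct {
    DemoTLBEntry tlb_table[DEMO_NB_MMU_MODES][DEMO_TLB_SIZE];
} DemoEnv;

static void demo_tlb_flush(DemoEnv *env)
{
    /*
     * Equivalent to the duplicated length computation
     *   DEMO_NB_MMU_MODES * DEMO_TLB_SIZE * sizeof(DemoTLBEntry)
     * but stays correct if the array dimensions ever change.
     * Filling with -1 marks every entry as invalid in this sketch.
     */
    memset(env->tlb_table, -1, sizeof(env->tlb_table));
}
```

This only works because `tlb_table` is a true array member. If it were a pointer, `sizeof` would give the pointer size and the explicit computation would be needed.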

cputlb: Use memset() when flushing entries · 4fadb3bb

Authored by Richard Henderson on Dec 07, 2013

The size of tlb_table is 4k on a 64-bit host.  For overwriting
memory at this size, cacheline tricks can help.
Signed-off-by: Richard Henderson <rth@twiddle.net>
Reviewed-by: Aurelien Jarno <aurelien@aurel32.net>
Signed-off-by: Andreas Färber <afaerber@suse.de>

4fadb3bb

07 Oct, 2013 1 commit

cputlb: Remove dead function tlb_update_dirty() · 81258640

Authored by liguang on Sep 03, 2013

Signed-off-by: liguang <lig.fnst@cn.fujitsu.com>
Reviewed-by: Paolo Bonzini <pbonzini@redhat.com>
Signed-off-by: Andreas Färber <afaerber@suse.de>

81258640

03 Sep, 2013 1 commit

cpu: Use QTAILQ for CPU list · bdc44640

Authored by Andreas Färber on Jun 24, 2013

Introduce CPU_FOREACH(), CPU_FOREACH_SAFE() and CPU_NEXT() shorthand
macros.
Signed-off-by: Andreas Färber <afaerber@suse.de>

bdc44640

10 Jul, 2013 1 commit

cpu: Make first_cpu and next_cpu CPUState · 182735ef

Authored by Andreas Färber on May 29, 2013

Move next_cpu from CPU_COMMON to CPUState.
Move first_cpu variable to qom/cpu.h.

gdbstub needs to use CPUState::env_ptr for now.
cpu_copy() no longer needs to save and restore cpu_next.
Acked-by: Paolo Bonzini <pbonzini@redhat.com>
[AF: Rebased, simplified cpu_copy()]
Signed-off-by: Andreas Färber <afaerber@suse.de>

182735ef

04 Jul, 2013 2 commits

memory: return MemoryRegion from qemu_ram_addr_from_host · 1b5ec234

Authored by Paolo Bonzini on May 06, 2013

It will be needed in the next patch.
Reviewed-by: Jan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>

1b5ec234

exec: move qemu_ram_addr_from_host_nofail to cputlb.c · 7443b437

Authored by Paolo Bonzini on Jun 03, 2013

After the next patch it would not be used elsewhere anyway.  Also,
the _nofail and the standard versions of this function return different
things, which is confusing.  Removing the function from the public headers
limits the confusion.
Reviewed-by: Jan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>

7443b437

28 Jun, 2013 1 commit

cpu: Turn cpu_unassigned_access() into a CPUState hook · c658b94f

Authored by Andreas Färber on May 27, 2013

Use it for all targets, but be careful not to pass invalid CPUState.
cpu_single_env can be NULL, e.g. on Xen.
Signed-off-by: Andreas Färber <afaerber@suse.de>

c658b94f

20 Jun, 2013 1 commit

exec: Resolve subpages in one step except for IOTLB fills · 90260c6c

Authored by Jan Kiszka on May 26, 2013

Except for the case of setting the IOTLB entry in TCG mode, we can avoid
the subpage dispatching handlers and do the resolution directly on
address_space_lookup_region. An IOTLB entry describes a full page, not
only the region that the first access to a sub-divided page may return.

This patch therefore introduces a special translation function,
address_space_translate_for_iotlb, that avoids the subpage resolutions.
In contrast, callers of the existing address_space_translate service
will now always receive the terminal memory region section. This will be
important for breaking the BQL and for enabling unaligned memory regions.
Signed-off-by: Jan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>

90260c6c

14 Jun, 2013 1 commit

cputlb: fix debug logs · 54b949d2

Authored by Hervé Poussineau on Jun 05, 2013

'pd' variable has been removed in 06ef3525.
Signed-off-by: Hervé Poussineau <hpoussin@reactos.org>
Signed-off-by: Michael Tokarev <mjt@tls.msk.ru>

54b949d2

29 May, 2013 2 commits

memory: add address_space_translate · 149f54b5

Authored by Paolo Bonzini on May 24, 2013

Using phys_page_find to translate an AddressSpace to a MemoryRegionSection
is unwieldy.  It requires passing the page index rather than the address,
and later memory_region_section_addr has to be called.  Replace
memory_region_section_addr with a function that does all of it: call
phys_page_find, compute the offset within the region, and check how
big the current mapping is.  This way, a large flat region can be written
with a single lookup rather than a page at a time.

address_space_translate will also provide a single point where IOMMU
forwarding is implemented.
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
Reviewed-by: Richard Henderson <rth@twiddle.net>
Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>

149f54b5
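
The shape of a translate function that does the lookup, computes the offset within the region, and reports how much of the request the current mapping covers can be sketched as below. This is an illustration only: `DemoSection` and `demo_translate()` are hypothetical, and a linear search over a small array stands in for the real lookup structure.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical flat description of one mapped region. */
typedef struct {
    uint64_t base;   /* start address in the address space */
    uint64_t size;   /* length of the mapping in bytes */
} DemoSection;

/*
 * Find the section containing addr.
 * On success, *xlat receives the offset within the section and *plen
 * is clamped to the number of bytes left in that mapping, so a caller
 * can handle a large flat region with one lookup instead of one per page.
 * Returns NULL if the address is unmapped.
 */
static const DemoSection *demo_translate(const DemoSection *sections,
                                         size_t count, uint64_t addr,
                                         uint64_t *xlat, uint64_t *plen)
{
    for (size_t i = 0; i < count; i++) {
        const DemoSection *s = &sections[i];

        if (addr >= s->base && addr - s->base < s->size) {
            uint64_t offset = addr - s->base;
            uint64_t remaining = s->size - offset;

            *xlat = offset;
            if (*plen > remaining) {
                *plen = remaining;
            }
            return s;
        }
    }
    return NULL;
}
```

A caller copying a large buffer would loop: translate, transfer `*plen` bytes, advance the address, and repeat until the whole request is done.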

cputlb: simplify tlb_set_page · 8f3e03cb

Authored by Paolo Bonzini on May 24, 2013

The same "if" condition is repeated twice.
Reviewed-by: Richard Henderson <rth@twiddle.net>
Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>

8f3e03cb

16 Feb, 2013 1 commit

cpu: Move current_tb field to CPUState · d77953b9

Authored by Andreas Färber on Jan 16, 2013

Explicitly NULL it on CPU reset since it was located before breakpoints.

Change vapic_report_tpr_access() argument to CPUState. This also
resolves the use of void* for cpu.h independence.
Change vAPIC patch_instruction() argument to X86CPU.
Signed-off-by: Andreas Färber <afaerber@suse.de>

d77953b9