1. 24 1月, 2020 2 次提交
  2. 22 10月, 2019 1 次提交
  3. 19 6月, 2019 1 次提交
  4. 17 12月, 2018 1 次提交
  5. 14 12月, 2018 1 次提交
  6. 09 10月, 2018 2 次提交
    • P
      KVM: PPC: Use ccr field in pt_regs struct embedded in vcpu struct · fd0944ba
      Paul Mackerras 提交于
      When the 'regs' field was added to struct kvm_vcpu_arch, the code
      was changed to use several of the fields inside regs (e.g., gpr, lr,
      etc.) but not the ccr field, because the ccr field in struct pt_regs
      is 64 bits on 64-bit platforms, but the cr field in kvm_vcpu_arch is
      only 32 bits.  This changes the code to use the regs.ccr field
      instead of cr, and changes the assembly code on 64-bit platforms to
      use 64-bit loads and stores instead of 32-bit ones.
      Reviewed-by: NDavid Gibson <david@gibson.dropbear.id.au>
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>
      fd0944ba
    • P
      KVM: PPC: Book3S: Simplify external interrupt handling · d24ea8a7
      Paul Mackerras 提交于
      Currently we use two bits in the vcpu pending_exceptions bitmap to
      indicate that an external interrupt is pending for the guest, one
      for "one-shot" interrupts that are cleared when delivered, and one
      for interrupts that persist until cleared by an explicit action of
      the OS (e.g. an acknowledge to an interrupt controller).  The
      BOOK3S_IRQPRIO_EXTERNAL bit is used for one-shot interrupt requests
      and BOOK3S_IRQPRIO_EXTERNAL_LEVEL is used for persisting interrupts.
      
      In practice BOOK3S_IRQPRIO_EXTERNAL never gets used, because our
      Book3S platforms generally, and pseries in particular, expect
      external interrupt requests to persist until they are acknowledged
      at the interrupt controller.  That combined with the confusion
      introduced by having two bits for what is essentially the same thing
      makes it attractive to simplify things by only using one bit.  This
      patch does that.
      
      With this patch there is only BOOK3S_IRQPRIO_EXTERNAL, and by default
      it has the semantics of a persisting interrupt.  In order to avoid
      breaking the ABI, we introduce a new "external_oneshot" flag which
      preserves the behaviour of the KVM_INTERRUPT ioctl with the
      KVM_INTERRUPT_SET argument.
      Reviewed-by: NDavid Gibson <david@gibson.dropbear.id.au>
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>
      d24ea8a7
  7. 24 8月, 2018 1 次提交
  8. 30 7月, 2018 1 次提交
  9. 13 6月, 2018 4 次提交
    • P
      KVM: PPC: Book3S PR: Enable use on POWER9 bare-metal hosts in HPT mode · db96a04a
      Paul Mackerras 提交于
      It turns out that PR KVM has no dependency on the format of HPTEs,
      because it uses functions pointed to by mmu_hash_ops which do all
      the formatting and interpretation of HPTEs.  Thus we can allow PR
      KVM to load on POWER9 bare-metal hosts as long as they are running
      in HPT mode.
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      db96a04a
    • P
      KVM: PPC: Book3S PR: Don't let PAPR guest set MSR hypervisor bit · 4f169d21
      Paul Mackerras 提交于
      PAPR guests run in supervisor mode and should not be able to set the
      MSR HV (hypervisor mode) bit or clear the ME (machine check enable)
      bit by mtmsrd or any other means.  To enforce this, we force MSR_HV
      off and MSR_ME on in kvmppc_set_msr_pr.  Without this, the guest
      can appear to be in hypervisor mode to itself and to userspace.
      This has been observed to cause a crash in QEMU when it tries to
      deliver a system reset interrupt to the guest.
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      4f169d21
    • P
      KVM: PPC: Book3S PR: Fix MSR setting when delivering interrupts · 916ccadc
      Paul Mackerras 提交于
      This makes sure that MSR "partial-function" bits are not transferred
      to SRR1 when delivering an interrupt.  This was causing failures in
      guests running kernels that include commit f3d96e69 ("powerpc/mm:
      Overhaul handling of bad page faults", 2017-07-19), which added code
      to check bits of SRR1 on instruction storage interrupts (ISIs) that
      indicate a bad page fault.  The symptom was that a guest user program
      that handled a signal and attempted to return from the signal handler
      would get a SIGBUS signal and die.
      
      The code that generated ISIs and some other interrupts would
      previously set bits in the guest MSR to indicate the interrupt status
      and then call kvmppc_book3s_queue_irqprio().  This technique no
      longer works now that kvmppc_inject_interrupt() is masking off those
      bits.  Instead we make kvmppc_core_queue_data_storage() and
      kvmppc_core_queue_inst_storage() call kvmppc_inject_interrupt()
      directly, and make sure that all the places that generate ISIs or
      DSIs call kvmppc_core_queue_{data,inst}_storage instead of
      kvmppc_book3s_queue_irqprio().
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      916ccadc
    • C
      KVM: PPC: Book3S PR: Handle additional interrupt types · b71dc519
      Cameron Kaiser 提交于
      This adds trivial handling for additional interrupt types that KVM-PR must
      support for proper virtualization on a POWER9 host in HPT mode, as a further
      prerequisite to enabling KVM-PR on that configuration.
      Signed-off-by: NCameron Kaiser <spectre@floodgap.com>
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      b71dc519
  10. 01 6月, 2018 12 次提交
    • S
      KVM: PPC: Book3S PR: Enable kvmppc_get/set_one_reg_pr() for HTM registers · deeb879d
      Simon Guo 提交于
      We need to migrate PR KVM during transaction and userspace will use
      kvmppc_get_one_reg_pr()/kvmppc_set_one_reg_pr() APIs to get/set
      transaction checkpoint state. This patch adds support for that.
      
      So far, QEMU on PR KVM doesn't fully function for migration but the
      savevm/loadvm can be done against a RHEL72 guest. During savevm/
      loadvm procedure, the kvm ioctls will be invoked as well.
      
      Test has been performed to savevm/loadvm for a guest running
      a HTM test program:
      https://github.com/justdoitqd/publicFiles/blob/master/test-tm-mig.cSigned-off-by: NSimon Guo <wei.guo.simon@gmail.com>
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      deeb879d
    • S
      KVM: PPC: Book3S PR: Support TAR handling for PR KVM HTM · 7284ca8a
      Simon Guo 提交于
      Currently guest kernel doesn't handle TAR facility unavailable and it
      always runs with TAR bit on. PR KVM will lazily enable TAR. TAR is not
      a frequent-use register and it is not included in SVCPU struct.
      
      Due to the above, the checkpointed TAR val might be a bogus TAR val.
      To solve this issue, we will make vcpu->arch.fscr tar bit consistent
      with shadow_fscr when TM is enabled.
      
      At the end of emulating treclaim., the correct TAR val need to be loaded
      into the register if FSCR_TAR bit is on.
      
      At the beginning of emulating trechkpt., TAR needs to be flushed so that
      the right tar val can be copied into tar_tm.
      
      Tested with:
      tools/testing/selftests/powerpc/tm/tm-tar
      tools/testing/selftests/powerpc/ptrace/ptrace-tm-tar (remove DSCR/PPR
      related testing).
      Signed-off-by: NSimon Guo <wei.guo.simon@gmail.com>
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      7284ca8a
    • S
      KVM: PPC: Book3S PR: Add guard code to prevent returning to guest with PR=0 and Transactional state · 68ab07b9
      Simon Guo 提交于
      Currently PR KVM doesn't support transaction memory in guest privileged
      state.
      
      This patch adds a check at setting guest msr, so that we can never return
      to guest with PR=0 and TS=0b10. A tabort will be emulated to indicate
      this and fail transaction immediately.
      
      [paulus@ozlabs.org - don't change the TM_CAUSE_MISC definition, instead
       use TM_CAUSE_KVM_FAC_UNAV.]
      Signed-off-by: NSimon Guo <wei.guo.simon@gmail.com>
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      68ab07b9
    • S
      KVM: PPC: Book3S PR: Add emulation for trechkpt. · e32c53d1
      Simon Guo 提交于
      This patch adds host emulation when guest PR KVM executes "trechkpt.",
      which is a privileged instruction and will trap into host.
      
      We firstly copy vcpu ongoing content into vcpu tm checkpoint
      content, then perform kvmppc_restore_tm_pr() to do trechkpt.
      with updated vcpu tm checkpoint values.
      Signed-off-by: NSimon Guo <wei.guo.simon@gmail.com>
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      e32c53d1
    • S
      KVM: PPC: Book3S PR: Restore NV regs after emulating mfspr from TM SPRs · 19c585eb
      Simon Guo 提交于
      Currently kvmppc_handle_fac() will not update NV GPRs and thus it can
      return with GUEST_RESUME.
      
      However PR KVM guest always disables MSR_TM bit in privileged state.
      If PR privileged-state guest is trying to read TM SPRs, it will
      trigger TM facility unavailable exception and fall into
      kvmppc_handle_fac().  Then the emulation will be done by
      kvmppc_core_emulate_mfspr_pr().  The mfspr instruction can include a
      RT with NV reg. So it is necessary to restore NV GPRs at this case, to
      reflect the update to NV RT.
      
      This patch make kvmppc_handle_fac() return GUEST_RESUME_NV for TM
      facility unavailable exceptions in guest privileged state.
      Signed-off-by: NSimon Guo <wei.guo.simon@gmail.com>
      Reviewed-by: NPaul Mackerras <paulus@ozlabs.org>
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      19c585eb
    • S
      KVM: PPC: Book3S PR: Always fail transactions in guest privileged state · 5706340a
      Simon Guo 提交于
      Currently the kernel doesn't use transaction memory.
      And there is an issue for privileged state in the guest that:
      tbegin/tsuspend/tresume/tabort TM instructions can impact MSR TM bits
      without trapping into the PR host. So following code will lead to a
      false mfmsr result:
      	tbegin	<- MSR bits update to Transaction active.
      	beq 	<- failover handler branch
      	mfmsr	<- still read MSR bits from magic page with
      		transaction inactive.
      
      It is not an issue for non-privileged guest state since its mfmsr is
      not patched with magic page and will always trap into the PR host.
      
      This patch will always fail tbegin attempt for privileged state in the
      guest, so that the above issue is prevented. It is benign since
      currently (guest) kernel doesn't initiate a transaction.
      
      Test case:
      https://github.com/justdoitqd/publicFiles/blob/master/test_tbegin_pr.cSigned-off-by: NSimon Guo <wei.guo.simon@gmail.com>
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      5706340a
    • S
      KVM: PPC: Book3S PR: Emulate mtspr/mfspr using active TM SPRs · 533082ae
      Simon Guo 提交于
      The mfspr/mtspr on TM SPRs(TEXASR/TFIAR/TFHAR) are non-privileged
      instructions and can be executed by PR KVM guest in problem state
      without trapping into the host. We only emulate mtspr/mfspr
      texasr/tfiar/tfhar in guest PR=0 state.
      
      When we are emulating mtspr tm sprs in guest PR=0 state, the emulation
      result needs to be visible to guest PR=1 state. That is, the actual TM
      SPR val should be loaded into actual registers.
      
      We already flush TM SPRs into vcpu when switching out of CPU, and load
      TM SPRs when switching back.
      
      This patch corrects mfspr()/mtspr() emulation for TM SPRs to make the
      actual source/dest be the actual TM SPRs.
      Signed-off-by: NSimon Guo <wei.guo.simon@gmail.com>
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      533082ae
    • S
      KVM: PPC: Book3S PR: Add math support for PR KVM HTM · 13989b65
      Simon Guo 提交于
      The math registers will be saved into vcpu->arch.fp/vr and corresponding
      vcpu->arch.fp_tm/vr_tm area.
      
      We flush or giveup the math regs into vcpu->arch.fp/vr before saving
      transaction. After transaction is restored, the math regs will be loaded
      back into regs.
      
      If there is a FP/VEC/VSX unavailable exception during transaction active
      state, the math checkpoint content might be incorrect and we need to do
      treclaim./load the correct checkpoint val/trechkpt. sequence to retry the
      transaction. That will make our solution complicated. To solve this issue,
      we always make the hardware guest MSR math bits (shadow_msr) consistent
      with the MSR val which guest sees (kvmppc_get_msr()) when guest msr is
      with tm enabled. Then all FP/VEC/VSX unavailable exception can be delivered
      to guest and guest handles the exception by itself.
      Signed-off-by: NSimon Guo <wei.guo.simon@gmail.com>
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      13989b65
    • S
      KVM: PPC: Book3S PR: Add transaction memory save/restore skeleton · 8d2e2fc5
      Simon Guo 提交于
      The transaction memory checkpoint area save/restore behavior is
      triggered when VCPU qemu process is switching out/into CPU, i.e.
      at kvmppc_core_vcpu_put_pr() and kvmppc_core_vcpu_load_pr().
      
      MSR TM active state is determined by TS bits:
          active: 10(transactional) or 01 (suspended)
          inactive: 00 (non-transactional)
      We don't "fake" TM functionality for guest. We "sync" guest virtual
      MSR TM active state(10 or 01) with shadow MSR. That is to say,
      we don't emulate a transactional guest with a TM inactive MSR.
      
      TM SPR support(TFIAR/TFAR/TEXASR) has already been supported by
      commit 9916d57e ("KVM: PPC: Book3S PR: Expose TM registers").
      Math register support (FPR/VMX/VSX) will be done at subsequent
      patch.
      
      Whether TM context need to be saved/restored can be determined
      by kvmppc_get_msr() TM active state:
      	* TM active - save/restore TM context
      	* TM inactive - no need to do so and only save/restore
      TM SPRs.
      Signed-off-by: NSimon Guo <wei.guo.simon@gmail.com>
      Suggested-by: NPaul Mackerras <paulus@ozlabs.org>
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      8d2e2fc5
    • S
      KVM: PPC: Book3S PR: Add kvmppc_save/restore_tm_sprs() APIs · 66c33e79
      Simon Guo 提交于
      This patch adds 2 new APIs, kvmppc_save_tm_sprs() and
      kvmppc_restore_tm_sprs(), for the purpose of TEXASR/TFIAR/TFHAR
      save/restore.
      Signed-off-by: NSimon Guo <wei.guo.simon@gmail.com>
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      66c33e79
    • S
      KVM: PPC: Book3S PR: Sync TM bits to shadow msr for problem state guest · 95757bfc
      Simon Guo 提交于
      MSR TS bits can be modified with non-privileged instruction such as
      tbegin./tend.  That means guest can change MSR value "silently" without
      notifying host.
      
      It is necessary to sync the TM bits to host so that host can calculate
      shadow msr correctly.
      
      Note, privileged mode in the guest will always fail transactions so we
      only take care of problem state mode in the guest.
      
      The logic is put into kvmppc_copy_from_svcpu() so that
      kvmppc_handle_exit_pr() can use correct MSR TM bits even when preemption
      occurs.
      Signed-off-by: NSimon Guo <wei.guo.simon@gmail.com>
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      95757bfc
    • S
      KVM: PPC: Book3S PR: Pass through MSR TM and TS bits to shadow_msr · 901938ad
      Simon Guo 提交于
      PowerPC TM functionality needs MSR TM/TS bits support in hardware level.
      Guest TM functionality can not be emulated with "fake" MSR (msr in magic
      page) TS bits.
      
      This patch syncs TM/TS bits in shadow_msr with the MSR value in magic
      page, so that the MSR TS value which guest sees is consistent with actual
      MSR bits running in guest.
      Signed-off-by: NSimon Guo <wei.guo.simon@gmail.com>
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      901938ad
  11. 31 5月, 2018 1 次提交
    • P
      KVM: PPC: Book3S PR: Allow KVM_PPC_CONFIGURE_V3_MMU to succeed · 9617a0b3
      Paul Mackerras 提交于
      Currently, PR KVM does not implement the configure_mmu operation, and
      so the KVM_PPC_CONFIGURE_V3_MMU ioctl always fails with an EINVAL
      error.  This causes recent kernels to fail to boot as a PR KVM guest
      on POWER9, since recent kernels booted in HPT mode do the
      H_REGISTER_PROC_TBL hypercall, which causes userspace (QEMU) to do
      KVM_PPC_CONFIGURE_V3_MMU, which fails.
      
      This implements a minimal configure_mmu operation for PR KVM.  It
      succeeds only if the MMU is being configured for HPT mode and no
      process table is being registered.  This is enough to get recent
      kernels to boot as a PR KVM guest.
      Reviewed-by: NGreg Kurz <groug@kaod.org>
      Tested-by: NGreg Kurz <groug@kaod.org>
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      9617a0b3
  12. 22 5月, 2018 1 次提交
    • S
      KVM: PPC: Add giveup_ext() hook to PPC KVM ops · 2e6baa46
      Simon Guo 提交于
      Currently HV will save math regs(FP/VEC/VSX) when trap into host. But
      PR KVM will only save math regs when qemu task switch out of CPU, or
      when returning from qemu code.
      
      To emulate FP/VEC/VSX mmio load, PR KVM need to make sure that math
      regs were flushed firstly and then be able to update saved VCPU
      FPR/VEC/VSX area reasonably.
      
      This patch adds giveup_ext() field to KVM ops. Only PR KVM has non-NULL
      giveup_ext() ops. kvmppc_complete_mmio_load() can invoke that hook
      (when not NULL) to flush math regs accordingly, before updating saved
      register vals.
      
      Math regs flush is also necessary for STORE, which will be covered
      in later patch within this patch series.
      Signed-off-by: NSimon Guo <wei.guo.simon@gmail.com>
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      2e6baa46
  13. 18 5月, 2018 3 次提交
  14. 19 3月, 2018 1 次提交
  15. 01 2月, 2018 1 次提交
  16. 10 1月, 2018 1 次提交
    • A
      KVM: PPC: Book3S PR: Fix WIMG handling under pHyp · 6c7d47c3
      Alexey Kardashevskiy 提交于
      Commit 96df2267 ("KVM: PPC: Book3S PR: Preserve storage control bits")
      added code to preserve WIMG bits but it missed 2 special cases:
      - a magic page in kvmppc_mmu_book3s_64_xlate() and
      - guest real mode in kvmppc_handle_pagefault().
      
      For these ptes, WIMG was 0 and pHyp failed on these causing a guest to
      stop in the very beginning at NIP=0x100 (due to bd9166ff "KVM: PPC:
      Book3S PR: Exit KVM on failed mapping").
      
      According to LoPAPR v1.1 14.5.4.1.2 H_ENTER:
      
       The hypervisor checks that the WIMG bits within the PTE are appropriate
       for the physical page number else H_Parameter return. (For System Memory
       pages WIMG=0010, or, 1110 if the SAO option is enabled, and for IO pages
       WIMG=01**.)
      
      This hence initializes WIMG to non-zero value HPTE_R_M (0x10), as expected
      by pHyp.
      
      [paulus@ozlabs.org - fix compile for 32-bit]
      
      Cc: stable@vger.kernel.org # v4.11+
      Fixes: 96df2267 "KVM: PPC: Book3S PR: Preserve storage control bits"
      Signed-off-by: NAlexey Kardashevskiy <aik@ozlabs.ru>
      Tested-by: NRuediger Oertel <ro@suse.de>
      Reviewed-by: NGreg Kurz <groug@kaod.org>
      Tested-by: NGreg Kurz <groug@kaod.org>
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      6c7d47c3
  17. 01 11月, 2017 1 次提交
    • G
      KVM: PPC: Book3S PR: Only install valid SLBs during KVM_SET_SREGS · f4093ee9
      Greg Kurz 提交于
      Userland passes an array of 64 SLB descriptors to KVM_SET_SREGS,
      some of which are valid (ie, SLB_ESID_V is set) and the rest are
      likely all-zeroes (with QEMU at least).
      
      Each of them is then passed to kvmppc_mmu_book3s_64_slbmte(), which
      assumes to find the SLB index in the 3 lower bits of its rb argument.
      When passed zeroed arguments, it happily overwrites the 0th SLB entry
      with zeroes. This is exactly what happens while doing live migration
      with QEMU when the destination pushes the incoming SLB descriptors to
      KVM PR. When reloading the SLBs at the next synchronization, QEMU first
      clears its SLB array and only restore valid ones, but the 0th one is
      now gone and we cannot access the corresponding memory anymore:
      
      (qemu) x/x $pc
      c0000000000b742c: Cannot access memory
      
      To avoid this, let's filter out non-valid SLB entries. While here, we
      also force a full SLB flush before installing new entries. Since SLB
      is for 64-bit only, we now build this path conditionally to avoid a
      build break on 32-bit, which doesn't define SLB_ESID_V.
      Signed-off-by: NGreg Kurz <groug@kaod.org>
      Reviewed-by: NDavid Gibson <david@gibson.dropbear.id.au>
      Signed-off-by: NPaul Mackerras <paulus@ozlabs.org>
      f4093ee9
  18. 27 4月, 2017 1 次提交
  19. 20 4月, 2017 3 次提交
  20. 27 1月, 2017 1 次提交