1. 01 9月, 2022 1 次提交
  2. 17 8月, 2022 3 次提交
  3. 10 8月, 2022 1 次提交
  4. 04 8月, 2022 9 次提交
  5. 29 7月, 2022 3 次提交
  6. 28 7月, 2022 2 次提交
  7. 26 7月, 2022 1 次提交
  8. 21 7月, 2022 4 次提交
  9. 19 7月, 2022 2 次提交
  10. 18 7月, 2022 6 次提交
  11. 17 7月, 2022 5 次提交
  12. 13 7月, 2022 2 次提交
    • K
      Intel: AVX VNNI: x86: Enumerate AVX Vector Neural Network instructions · de9dc9ca
      Kyung Min Park 提交于
      mainline inclusion
      from mainline-5.11
      commit b85a0425
      category: feature
      feature: SPR New instructions
      bugzilla: https://gitee.com/openeuler/intel-kernel/issues/I596EH
      CVE: N/A
      
      Intel-SIG: commit b85a0425 x86: Enumerate AVX Vector Neural Network instructions
      Backport for SPR core AVX VNNI support.
      
      ----------------------------
      
      Add AVX version of the Vector Neural Network (VNNI) Instructions.
      A processor supports AVX VNNI instructions if CPUID.0x07.0x1:EAX[4] is
      present. The following instructions are available when this feature is
      present.
      1. VPDPBUS: Multiply and Add Unsigned and Signed Bytes
      2. VPDPBUSDS: Multiply and Add Unsigned and Signed Bytes with Saturation
      3. VPDPWSSD: Multiply and Add Signed Word Integers
      4. VPDPWSSDS: Multiply and Add Signed Integers with Saturation
      
      The only in-kernel usage of this is kvm passthrough. The CPU feature
      flag is shown as "avx_vnni" in /proc/cpuinfo.
      
      This instruction is currently documented in the latest "extensions"
      manual (ISE). It will appear in the "main" manual (SDM) in the future.
      Signed-off-by: NKyung Min Park <kyung.min.park@intel.com>
      Signed-off-by: NYang Zhong <yang.zhong@intel.com>
      Reviewed-by: NTony Luck <tony.luck@intel.com>
      Message-Id: <20210105004909.42000-2-yang.zhong@intel.com>
      Acked-by: NBorislav Petkov <bp@suse.de>
      Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>
      Signed-off-by: NLuming Yu <luming.yu@intel.com>
      de9dc9ca
    • K
      Intel: 5G ISA: x86: Enumerate AVX512 FP16 CPUID feature flag · 82b4f867
      Kyung Min Park 提交于
      mainline inclusion
      from mainline-5.11
      commit e1b35da5
      category: feature
      feature: SPR New instructions
      bugzilla: https://gitee.com/openeuler/intel-kernel/issues/I596EH
      CVE: N/A
      
      Intel-SIG: commit e1b35da5 x86: Enumerate AVX512 FP16 CPUID feature flag
      Backport for SPR core 5G ISA support.
      
      -------------------------------------
      
      Enumerate AVX512 Half-precision floating point (FP16) CPUID feature
      flag. Compared with using FP32, using FP16 cut the number of bits
      required for storage in half, reducing the exponent from 8 bits to 5,
      and the mantissa from 23 bits to 10. Using FP16 also enables developers
      to train and run inference on deep learning models fast when all
      precision or magnitude (FP32) is not needed.
      
      A processor supports AVX512 FP16 if CPUID.(EAX=7,ECX=0):EDX[bit 23]
      is present. The AVX512 FP16 requires AVX512BW feature be implemented
      since the instructions for manipulating 32bit masks are associated with
      AVX512BW.
      
      The only in-kernel usage of this is kvm passthrough. The CPU feature
      flag is shown as "avx512_fp16" in /proc/cpuinfo.
      
      Signed-off-by: Kyung Min Park kyung.min.park@intel.com
      Acked-by: Dave Hansen dave.hansen@intel.com
      Reviewed-by: Tony Luck tony.luck@intel.com
      Message-Id: 20201208033441.28207-2-kyung.min.park@intel.com
      Acked-by: Borislav Petkov bp@suse.de
      Signed-off-by: Paolo Bonzini pbonzini@redhat.com
      Signed-off-by: Luming Yu luming.yu@intel.com
      82b4f867
  13. 08 7月, 2022 1 次提交