1. 12 10月, 2022 1 次提交
  2. 10 10月, 2022 2 次提交
  3. 08 10月, 2022 1 次提交
  4. 27 9月, 2022 1 次提交
  5. 26 9月, 2022 2 次提交
  6. 25 9月, 2022 1 次提交
  7. 22 9月, 2022 1 次提交
  8. 21 9月, 2022 2 次提交
  9. 20 9月, 2022 2 次提交
  10. 15 9月, 2022 1 次提交
  11. 14 9月, 2022 2 次提交
  12. 08 9月, 2022 2 次提交
  13. 07 9月, 2022 1 次提交
    • W
      Layernorm shift partition (#45736) · 960109af
      wenbin 提交于
      * first commit
      
      * conver done
      
      * correct format
      
      * layernorm_shift_partition
      
      * correct convert
      
      * redefine plugin
      
      * runable
      
      * bug fix
      
      * modify ShiftPartitionPattern
      
      * correct
      
      * add UT
      
      * modify ut
      
      * compile
      
      * modify enforce
      
      * modify UT
      960109af
  14. 06 9月, 2022 1 次提交
  15. 02 9月, 2022 1 次提交
  16. 31 8月, 2022 1 次提交
  17. 22 8月, 2022 2 次提交
  18. 19 8月, 2022 2 次提交
  19. 17 8月, 2022 1 次提交
    • N
      [CodeStyle][NPU] use np.testing.assert_allclose instead of... · 2de0d676
      Nyakku Shigure 提交于
      [CodeStyle][NPU] use np.testing.assert_allclose instead of self.assertTrue(np.allclose(...)) (part 1) (#44988)
      
      * autofix
      
      * try resolve precision issues
      
      * revert some changes
      
      * clean some `err_msg`
      
      * 0.0001 -> 1e-4
      
      * update commented assert code
      
      * try to fix some shape errors
      
      * `numpy` -> `np`
      
      * empty commit, trigger kunlun ci, test=kunlun
      
      * empty commit, retrigger kunlun ci, test=kunlun
      
      * empty commit, trigger kunlun ci, try fix npu memcpy_h2d, test=kunlun
      
      * try fix npu import error, test=kunlun
      2de0d676
  20. 16 8月, 2022 1 次提交
    • F
      convert multihead to oss (#45019) · f706d95d
      feng_shuai 提交于
      * convert multihead to oss
      
      * fix:bug
      
      * fix:delete const cast
      
      * fix:don't support bias_qk
      
      * add vit pass
      
      * fix:convert bug and add preln_residual_bias
      
      * support length=-1
      
      * add UT for convert
      
      * add no_bias_qk support for gpu_multihead_op
      
      * delete infer_shape depends on bias_qk
      
      * oss just can be used in T4 and A*
      
      * fix:change api for ROCM CI
      f706d95d
  21. 15 8月, 2022 2 次提交
  22. 12 8月, 2022 1 次提交
    • S
      Offload calculations from matmul op to fuse pass (#44941) · acb78ea2
      Sławomir Siwek 提交于
      * remove v2_transpose_reshape
      
      * matmul_transpose_reshape
      
      * reshape_transpose_matmul
      
      * Add int8 support for matmulV2
      
      * restore ut
      
      * adjust old ut
      
      * restore parallel UT ruels
      
      * remove mkldnn code from base ops
      
      * move enforces to pass
      
      * remove duplicated functions
      
      * delete duplicated enforces
      
      * feedback from review
      
      * add comments to variables
      
      * enable eltwise support
      
      * dynamic attribute
      
      * remove fusepass tests from op test
      
      * remove fuse pass cases from op test
      
      * revert introduction of dynamic attributes
      
      * style
      Co-authored-by: Nwozna <joanna.wozna@intel.com>
      acb78ea2
  23. 05 8月, 2022 1 次提交
  24. 04 8月, 2022 2 次提交
    • S
      Matmuls with activation and elementwise_add fuses (#44655) · 0420d514
      Sławomir Siwek 提交于
      * Add unit tests
      
      * matmul_v2 + activation
      
      * matmuls + elementwise_add
      
      * matmul_v2 postops
      
      * transform matmul to v2
      
      * opcompat
      
      * fix fusing matmul with multipe outs
      
      * add shape constraints
      
      * remove unused vars
      
      * change pass order
      
      * - Unit tests to be debugged
      
      - fix
      
      - refactor
      
      - diagnostic
      
      - more diagnostic
      
      - fix
      
      - Fix number two
      
      - fix
      
      - fix
      
      - fix
      
      - alpha added
      
      - more fixes
      
      - compilation fix
      
      - removed diagnostic code
      
      - cosmetic fixes
      
      * lint
      
      * add alpha constraint
      
      * merge matmul refactor
      
      * trigger CI
      
      * - fix
      
      * - another fix
      
      * code style
      
      * add support for matmul+elementwise_add+activation
      
      * code style
      
      * fix bfloat16 bugs
      
      * change append_binary to append_sum
      Co-authored-by: NJacek Czaja <jacek.czaja@intel.com>
      0420d514
    • Z
      [Paddle-TRT] add Rnn (#44678) · ffc8defa
      zhoutianzi666 提交于
      * add rnn
      ffc8defa
  25. 03 8月, 2022 1 次提交
  26. 29 7月, 2022 1 次提交
  27. 22 7月, 2022 1 次提交
  28. 21 7月, 2022 1 次提交
  29. 19 7月, 2022 2 次提交
    • J
      Added pad3d and pad2d FP32 FWD oneDNN kernels (#43990) · 2792b8de
      jakpiase 提交于
      * Piotrek's changes for pad3d
      
      * my changes
      
      * first version of pad3d, single copy, unnecessary reads
      
      * optimized pad3d kernel
      
      * test upadte
      
      * removed magic numbers
      
      * added support for pad2d
      
      * reverted two files
      
      * reverted one old change
      
      * added support for Paddings tensor
      
      * CI fix
      
      * CI fix
      
      * fixed timeout of tests
      
      * fixed typo
      
      * changes to GetKernelTypeForVar
      
      * Revert "changes to GetKernelTypeForVar"
      
      This reverts commit 469106115c49682b25038a666fd71bd4a10fb66b.
      
      * added AsExtra() to pad2d
      Co-authored-by: NPiotr Paturej <piotr.paturej@intel.com>
      2792b8de
    • Y
      add labels for infer ut (#44279) · fea05f1f
      YUNSHEN XIE 提交于
      * add labels for infer ut
      
      * add RUN_TYPE=INFER for cpp ut
      
      * fix formaterror
      
      * update
      fea05f1f