1. 29 9月, 2022 1 次提交
  2. 28 9月, 2022 1 次提交
    • C
      Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e
      Chen Weihang 提交于
      * remove needless using tensor
      
      * remove needless using tensor
      
      * resolve conflict
      
      * replace tensor using
      
      * fix format error
      
      * revert needless changing
      
      * fix rocm and npu compile error
      
      * fix cinn compile error
      
      * fix format error
      
      * fix mkldnn format error
      
      * fix mkldnn format error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * resolve conflict
      e12a905e
  3. 27 9月, 2022 2 次提交
  4. 22 9月, 2022 1 次提交
  5. 21 9月, 2022 1 次提交
  6. 20 9月, 2022 1 次提交
  7. 19 9月, 2022 1 次提交
  8. 15 9月, 2022 1 次提交
  9. 14 9月, 2022 2 次提交
  10. 07 9月, 2022 1 次提交
    • W
      Layernorm shift partition (#45736) · 960109af
      wenbin 提交于
      * first commit
      
      * conver done
      
      * correct format
      
      * layernorm_shift_partition
      
      * correct convert
      
      * redefine plugin
      
      * runable
      
      * bug fix
      
      * modify ShiftPartitionPattern
      
      * correct
      
      * add UT
      
      * modify ut
      
      * compile
      
      * modify enforce
      
      * modify UT
      960109af
  11. 06 9月, 2022 2 次提交
  12. 05 9月, 2022 2 次提交
    • Y
      New format quant model support for MKLDNN (#45416) · 4e4f4586
      yeliang2258 提交于
      * support onnx format quantized model
      
      * update code
      
      * add test
      
      * add test
      
      * fix
      
      * fix test
      
      * fix cmake
      
      * update code
      
      * change scale file path to calibration file path
      
      * update code
      
      * update code
      
      * fix build bug
      
      * fix build bugs
      
      * fix
      
      * fix
      4e4f4586
    • D
      Update DlNNE engine (#45027) · 638965c5
      denglin-github 提交于
      * add config param for enable_dlnne and support calibration mode
      * remove useless file
      * refine code and add annotation
      * refine code of Warnning tips
      638965c5
  13. 02 9月, 2022 1 次提交
  14. 30 8月, 2022 1 次提交
  15. 29 8月, 2022 1 次提交
  16. 22 8月, 2022 3 次提交
  17. 18 8月, 2022 2 次提交
  18. 16 8月, 2022 2 次提交
    • F
      convert multihead to oss (#45019) · f706d95d
      feng_shuai 提交于
      * convert multihead to oss
      
      * fix:bug
      
      * fix:delete const cast
      
      * fix:don't support bias_qk
      
      * add vit pass
      
      * fix:convert bug and add preln_residual_bias
      
      * support length=-1
      
      * add UT for convert
      
      * add no_bias_qk support for gpu_multihead_op
      
      * delete infer_shape depends on bias_qk
      
      * oss just can be used in T4 and A*
      
      * fix:change api for ROCM CI
      f706d95d
    • W
      memoptim and fp16 mixed precision (#45132) · fa890092
      Wilber 提交于
      fa890092
  19. 15 8月, 2022 1 次提交
  20. 14 8月, 2022 1 次提交
  21. 10 8月, 2022 1 次提交
  22. 05 8月, 2022 2 次提交
  23. 04 8月, 2022 3 次提交
    • S
      Matmuls with activation and elementwise_add fuses (#44655) · 0420d514
      Sławomir Siwek 提交于
      * Add unit tests
      
      * matmul_v2 + activation
      
      * matmuls + elementwise_add
      
      * matmul_v2 postops
      
      * transform matmul to v2
      
      * opcompat
      
      * fix fusing matmul with multipe outs
      
      * add shape constraints
      
      * remove unused vars
      
      * change pass order
      
      * - Unit tests to be debugged
      
      - fix
      
      - refactor
      
      - diagnostic
      
      - more diagnostic
      
      - fix
      
      - Fix number two
      
      - fix
      
      - fix
      
      - fix
      
      - alpha added
      
      - more fixes
      
      - compilation fix
      
      - removed diagnostic code
      
      - cosmetic fixes
      
      * lint
      
      * add alpha constraint
      
      * merge matmul refactor
      
      * trigger CI
      
      * - fix
      
      * - another fix
      
      * code style
      
      * add support for matmul+elementwise_add+activation
      
      * code style
      
      * fix bfloat16 bugs
      
      * change append_binary to append_sum
      Co-authored-by: NJacek Czaja <jacek.czaja@intel.com>
      0420d514
    • Z
      [Paddle-TRT] add Rnn (#44678) · ffc8defa
      zhoutianzi666 提交于
      * add rnn
      ffc8defa
    • W
      convert support multi block. (#44866) · b4a4eef2
      Wilber 提交于
      * convert support multi block.
      
      * update
      b4a4eef2
  24. 02 8月, 2022 1 次提交
  25. 01 8月, 2022 3 次提交
  26. 29 7月, 2022 1 次提交
  27. 28 7月, 2022 1 次提交