1. 05 1月, 2023 1 次提交
  2. 03 1月, 2023 1 次提交
  3. 02 1月, 2023 1 次提交
  4. 14 12月, 2022 2 次提交
  5. 08 12月, 2022 2 次提交
  6. 06 12月, 2022 1 次提交
  7. 05 12月, 2022 1 次提交
    • W
      Reverse roll fuse (#46914) · feb68dd1
      Wang Bojun 提交于
      * pass
      
      * pass
      
      * draft version
      
      * share mem opt
      
      * remove sharemem
      
      * add pattern for the case with circle_shift=0
      
      * add UT
      
      * pass opt
      
      * test_fix
      
      * code-commit
      
      * code-style
      
      * code style
      
      * code-style
      
      * ut-fix
      
      * op teller refine
      
      * resolve conflict
      
      * adjust position op_teller list and pass order for swin
      
      * ut code style update
      
      * adjust paddle pass order
      
      * refine pass order
      
      * refine pass order
      
      * refine pass order
      feb68dd1
  8. 30 11月, 2022 1 次提交
  9. 23 11月, 2022 2 次提交
  10. 15 11月, 2022 1 次提交
  11. 11 11月, 2022 1 次提交
  12. 10 11月, 2022 2 次提交
  13. 08 11月, 2022 1 次提交
    • S
      Migrate old C++ unit tests to Python framework (#47006) · 0c9f09b8
      Sławomir Siwek 提交于
      * softplus+activation
      
      * fc + elementwise_add test refactored
      
      * rename MKLDNN to OneDNN
      
      * fc+activation tests refactored
      
      * remove softplus ut
      
      * whitespace
      
      * whitespace
      
      * codestyle
      
      * codestyle
      
      * add more cases to fc+act
      
      * remove softplus+hard_sigmoid pass
      
      * remove softplus + hard_sigmoid UT
      
      * add approximate for gelu
      
      * swish beta range
      
      * new codestyle
      
      * reduce number of tests
      0c9f09b8
  14. 07 11月, 2022 1 次提交
    • H
      suqeeze2 + transpose2 fuse onednn (#47592) · fa874a46
      Hui Zhang 提交于
      * suqeeze2 transpose2 fuse onednn
      
      * format
      
      * fix output shape
      
      * fix conflict
      
      * format
      
      * format
      
      * remove useless
      
      * remove log
      
      * simply pass
      
      * fix comment
      
      * fix
      
      * fix msg
      
      * fix error msg
      
      * format
      fa874a46
  15. 04 11月, 2022 1 次提交
  16. 27 10月, 2022 1 次提交
    • L
      make all cpp tests dynamic linked to libpaddle.so [except windows] (#47088) · 2096448b
      Leo Chen 提交于
      * make all cpp tests dynamic linked to libpaddle.so
      
      * add comments
      
      * keep old cc_test for some tests
      
      * fix some ut
      
      * make some ut use cc_test_old
      
      * fix typos and fit for win32
      
      * fix lib path
      
      * fix some tests
      
      * skip lite test
      
      * fit for rocm
      
      * fit for cinn
      
      * fit for mac
      
      * fit for win32
      
      * skip inference ut
      
      * skip  windows
      
      * fix coverage
      2096448b
  17. 26 10月, 2022 2 次提交
  18. 20 10月, 2022 1 次提交
  19. 18 10月, 2022 1 次提交
    • W
      Merge layernorm trt fuse (#46320) · 5e9f491e
      Wang Bojun 提交于
      * first version, accuracy corrected
      
      * disable debug print
      
      * use blockReduceSum in phi
      
      * add UT
      
      * add opCompat
      
      * code style
      
      * code refine
      
      * bug fix
      
      * code refine
      
      * test fix
      
      * bugfix
      
      * codesytle fix
      
      * code style
      
      * code-style
      
      * code-style
      
      * code-style
      5e9f491e
  20. 17 10月, 2022 1 次提交
  21. 16 10月, 2022 1 次提交
  22. 27 9月, 2022 1 次提交
  23. 22 9月, 2022 1 次提交
  24. 21 9月, 2022 1 次提交
    • Z
      Enable PaddleInference to use CINN. (#45009) · 3aa6bd57
      Zhen Wang 提交于
      * use cinn in the paddle inference
      
      * fix some cmake errors
      
      * Avoid division by zero in the arange_kernel.
      
      * Avoid dynamic ops.
      
      * Remove some useless codes.
      
      * Use OpTransInfo to encapsulate some codes used in the build_cinn_pass.
      3aa6bd57
  25. 07 9月, 2022 1 次提交
    • W
      Layernorm shift partition (#45736) · 960109af
      wenbin 提交于
      * first commit
      
      * conver done
      
      * correct format
      
      * layernorm_shift_partition
      
      * correct convert
      
      * redefine plugin
      
      * runable
      
      * bug fix
      
      * modify ShiftPartitionPattern
      
      * correct
      
      * add UT
      
      * modify ut
      
      * compile
      
      * modify enforce
      
      * modify UT
      960109af
  26. 31 8月, 2022 1 次提交
  27. 30 8月, 2022 1 次提交
  28. 22 8月, 2022 2 次提交
  29. 16 8月, 2022 1 次提交
    • F
      convert multihead to oss (#45019) · f706d95d
      feng_shuai 提交于
      * convert multihead to oss
      
      * fix:bug
      
      * fix:delete const cast
      
      * fix:don't support bias_qk
      
      * add vit pass
      
      * fix:convert bug and add preln_residual_bias
      
      * support length=-1
      
      * add UT for convert
      
      * add no_bias_qk support for gpu_multihead_op
      
      * delete infer_shape depends on bias_qk
      
      * oss just can be used in T4 and A*
      
      * fix:change api for ROCM CI
      f706d95d
  30. 15 8月, 2022 1 次提交
  31. 05 8月, 2022 1 次提交
  32. 04 8月, 2022 1 次提交
    • S
      Matmuls with activation and elementwise_add fuses (#44655) · 0420d514
      Sławomir Siwek 提交于
      * Add unit tests
      
      * matmul_v2 + activation
      
      * matmuls + elementwise_add
      
      * matmul_v2 postops
      
      * transform matmul to v2
      
      * opcompat
      
      * fix fusing matmul with multipe outs
      
      * add shape constraints
      
      * remove unused vars
      
      * change pass order
      
      * - Unit tests to be debugged
      
      - fix
      
      - refactor
      
      - diagnostic
      
      - more diagnostic
      
      - fix
      
      - Fix number two
      
      - fix
      
      - fix
      
      - fix
      
      - alpha added
      
      - more fixes
      
      - compilation fix
      
      - removed diagnostic code
      
      - cosmetic fixes
      
      * lint
      
      * add alpha constraint
      
      * merge matmul refactor
      
      * trigger CI
      
      * - fix
      
      * - another fix
      
      * code style
      
      * add support for matmul+elementwise_add+activation
      
      * code style
      
      * fix bfloat16 bugs
      
      * change append_binary to append_sum
      Co-authored-by: NJacek Czaja <jacek.czaja@intel.com>
      0420d514
  33. 26 7月, 2022 2 次提交