1. 31 3月, 2023 2 次提交
  2. 30 3月, 2023 2 次提交
  3. 29 3月, 2023 2 次提交
    • Z
      [XPU] optimize pass (#52099) · 599388e3
      zhupengyang 提交于
      599388e3
    • Y
      Add Fuse Adamw Pass (#50484) · 66098bff
      yuehuayingxueluo 提交于
      * add fuse adamw pass
      
      * fix some bugs
      
      * fix CIbug
      
      * change chunk_size
      
      * fix CI bug
      
      * rm test_fused_adam_op.py
      
      * fix CI bugs
      
      * fix fuse_adamw_op_pass.cc
      
      * change code style
      
      * fix CI bug
      
      * fix ut bug and use_adamw_op_pass.cc
      
      * fix test_fuse_adamw_pass.py
      
      * fix CI bug
      
      * remove fluid
      
      * fix ci bug
      
      * fix CI bug
      66098bff
  4. 27 3月, 2023 1 次提交
    • S
      Fused elementwise_(mul/div) (#50428) · 968f7f24
      Sławomir Siwek 提交于
      * extract Op and OPMaker to .h
      
      * extend pattern for fused_op
      
      * set "with_residual" default to false
      
      * adjust fuse passes
      
      * remove fc+eltwise flag
      
      * fused_output_scale
      
      * activation attrs
      
      * remove extra attrs
      
      * fix int8/bf16 unit tests
      
      * simplify RecomputeOutputDims
      
      * remove unused method
      
      * Add description for attributes
      
      * add extra check
      
      * adjust op compats
      
      * update quantize test
      
      * fix protobuf parsing error
      
      * fix int8 performance
      
      * fused elementwises
      
      * merge develop
      
      * remove activation
      
      * restore activation for existing add/sub ops
      968f7f24
  5. 22 3月, 2023 5 次提交
  6. 21 3月, 2023 1 次提交
    • iSerendipity's avatar
      [PHI decoupling] Move DataType* from paddle:experimental to phi namespace (#51716) · 4638a62e
      iSerendipity 提交于
      * move DataType from paddle::experimental to phi
      
      * convert namespace
      
      * convert namespace
      
      * convert namespace
      
      * clarify namespace
      
      * convert more datatype
      
      * Revert "convert more datatype"
      
      This reverts commit 083b462959e6a22d4d8767707b628b95b396642e.
      
      * convert more in auto_code_generator
      
      * fix conflicts for XPU
      
      * fix namespace conflicts
      
      * fix errors
      
      * Revert "fix errors"
      
      This reverts commit f9d9958b54ee32141112274c8a5c3c381ab0f876.
      
      * fix errors
      
      * fix formatting
      4638a62e
  7. 20 3月, 2023 2 次提交
  8. 16 3月, 2023 1 次提交
    • W
      split layernorm pass (#51228) · 3f3372b6
      wenbin 提交于
      * split pass
      
      * fix compile
      
      * fix ut
      
      * more time
      
      * modify ut
      
      * reduce dim
      
      * fix compile
      
      * reshape weight
      
      * tensor
      
      * remove enforce
      
      * static shape ut
      
      * batchsize
      
      * reorder pass
      
      * minus test cases
      
      * windows timeout
      
      * windows time out
      
      * remove test for windows
      
      * correct
      
      * sssss
      
      * xxx
      3f3372b6
  9. 15 3月, 2023 1 次提交
    • iSerendipity's avatar
      [PHI] remove operator.h in blas.h (rebase to latest codebase) (#51472) · 427712df
      iSerendipity 提交于
      * Revert "Revert "【Hackathon No.67】remove operator.h in blas.h (#50989)" (#51467)"
      
      This reverts commit b9d91531.
      
      * remove cout
      
      * add header
      
      * fix missing header
      
      * fix refer fluid error
      
      * fix missing header
      
      * 更新 repeat_interleave_grad_kernel_impl.h
      
      Change to phi style datatype.
      
      * 更新 repeat_interleave_grad_kernel_impl.h
      
      Fix missing header
      
      * datatype fluid -> phi
      
      * paddle::experimental -> phi
      
      * fix reference error
      
      * fix reference error
      
      * fix reference error
      
      * fix errors
      
      * fix missing FLAGS
      
      * fix missing headers
      
      * fix missing headers
      
      * fix missing headers
      
      * fix missing headers
      
      * fix missing header
      
      * fix missing header
      
      * fix errors
      427712df
  10. 14 3月, 2023 1 次提交
  11. 13 3月, 2023 3 次提交
    • S
      Fused softplus (#51087) · fdcfa04f
      Sławomir Siwek 提交于
      * mkldnn->onednn
      
      * fused softplus op + kernel
      
      * remove extra attributes
      
      * add missing handler
      
      * change var name
      fdcfa04f
    • Z
      [Paddle Inference ]use python to generate cutlass code (#50603) · 4e9e23cb
      zhoutianzi666 提交于
      * use python to generate cutlass code
      
      * refine CommonConvKernelPart1, CommonConvKernelPart2
      
      * remove useless code in generate_cutlass_code.sh
      
      * add more config in conv2d_residual
      
      * CommonCutlassConvKernelPart1 and CommonCutlassConvKernelPart2
      
      * add group conv support in util.cu
      
      * remove .sh
      
      * refine name
      
      * make name goodgit status!
      
      * add fuse_alpha
      
      * make code easy to understand
      
      * mot fopen generate in py
      
      * use python script to generate conv2d,group=1 cutlass code
      
      * use const &
      
      * use const & && use python script to generate conv2d/group=1 code
      4e9e23cb
    • Z
      e2cdd4a3
  12. 09 3月, 2023 1 次提交
  13. 07 3月, 2023 1 次提交
  14. 06 3月, 2023 2 次提交
  15. 02 3月, 2023 2 次提交
  16. 01 3月, 2023 2 次提交
  17. 28 2月, 2023 3 次提交
  18. 27 2月, 2023 1 次提交
  19. 24 2月, 2023 1 次提交
  20. 23 2月, 2023 2 次提交
  21. 22 2月, 2023 1 次提交
  22. 21 2月, 2023 1 次提交
    • HappyHeavyRain's avatar
      Support bw invoke fw (#50260) · d8845735
      HappyHeavyRain 提交于
      * support bw invoke fw
      
      * fix scale in static_backward.yaml
      
      * fix the bug in tensorrt/convert
      
      * move 'scale','sign' into ops.yaml
      
      * add scale_grad of scale in op_compat.yaml
      
      * change generated_static_op in CMakeLists.txt
      d8845735
  23. 20 2月, 2023 1 次提交
  24. 17 2月, 2023 1 次提交
    • S
      upgrade oneDNN to 2.7.3 (#46301) · f803b239
      Sławomir Siwek 提交于
      * change SHA
      
      * update to oneDNN 2.7
      
      * update to 2.7.1
      
      * update to 2.7.2
      
      * add supported hardsigmoid
      
      * update to 2.7.3
      
      * limit cpu threads for int8 test
      
      * group activations
      f803b239