1. 20 7月, 2023 1 次提交
  2. 19 7月, 2023 1 次提交
    • Z
      delete relu6_raw (#55383) · 56d46ccc
      zhangyuqin1998 提交于
      * delete relu6_raw
      
      * fix codestyle
      
      * Update test_mkldnn_matmul_activation_fuse_pass.py
      
      * fix
      
      * Update backward.yaml
      
      * Update ops.yaml
      
      * Update backward.yaml
      56d46ccc
  3. 18 7月, 2023 1 次提交
    • G
      batch add inpalce api (#55078) · 19302938
      GGBond8488 提交于
      * batch add inpalce api
      
      * fix inplace fn generate
      
      * add test for  new inpalce api
      
      * fix typro
      
      * fix typro
      
      * fix typro
      
      * fix test error
      
      * fix atan2
      
      * remove atan2
      
      * auto genereate inpalce api
      
      * fix inplace generate fn error
      
      * fix windows error
      
      * fix test error
      
      * fix test error
      
      * fix windows ci error
      
      * fix test error
      
      * fix test_error
      
      * fix test error
      
      * fix eigen aliasing error in inplace
      
      * remove elementwise_pow inplace
      
      * fix doc error
      
      * fix test error
      19302938
  4. 14 7月, 2023 1 次提交
  5. 11 7月, 2023 2 次提交
    • MarDino's avatar
      Integrate rmsnorm kernel (#54998) · 97d3d6ee
      MarDino 提交于
      * add rmsnorm kernel
      * add static graph test
      * fix round type
      * use alignas to avoid msvc compile error
      * remove redundant headerfile to avoid rocm compile error
      * fix rocm compile not found cub
      * Add document
      97d3d6ee
    • FormlessUnit's avatar
      Linear compress (#55128) · f4290a92
      FormlessUnit 提交于
      * rename weight_only/llm.int8
      f4290a92
  6. 10 7月, 2023 1 次提交
  7. 04 7月, 2023 1 次提交
  8. 03 7月, 2023 2 次提交
  9. 30 6月, 2023 3 次提交
  10. 28 6月, 2023 2 次提交
  11. 26 6月, 2023 2 次提交
  12. 20 6月, 2023 1 次提交
    • Z
      [IR] Change IR from Static library to dynamic library (#54729) · 24a3cb52
      zhangbo9674 提交于
      * new_ir to shared
      
      * refine code
      
      * add ir lib path to env
      
      * refine type
      
      * refine code
      
      * fix bug
      
      * fix bug
      
      * refine code
      
      * refine code
      
      * close win
      
      * refine code
      
      * refine code
      
      * refine code
      
      * add win share
      
      * refine code
      
      * refie code
      
      * refine code
      
      * refine code
      
      * refien code
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * solve conflict
      
      * solve conflict
      
      * fix bug
      
      * refine code
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * refine code
      
      * fix interpretercore program bug
      
      * delete unuse code
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix cinn bug
      
      * fix cinn bug
      
      * debug
      
      * fix cinn bug
      
      * delete unused code
      
      * fix cinn bug
      
      * fix cinn bug
      
      * fix  ug
      
      * test win openblas
      
      * test win openblas
      
      * fix win openblas bug
      
      * polish code
      
      * fix win open blas bug
      
      * close win dll
      
      * fix flag bug
      
      * test for windows
      
      * fix compile bug
      24a3cb52
  13. 16 6月, 2023 1 次提交
  14. 15 6月, 2023 1 次提交
  15. 14 6月, 2023 1 次提交
  16. 13 6月, 2023 1 次提交
  17. 09 6月, 2023 1 次提交
  18. 08 6月, 2023 1 次提交
    • Y
      [AMP] Add check_numerics API. (#54301) · a5444592
      Yiqun Liu 提交于
      * Add outputs to check_numerics_kernel.
      
      * Add check_numerics to yaml.
      
      * Add API and unittest.
      
      * Add check_nan_inf_level as argument of check_numerics_kernel.
      
      * Add more unittests.
      
      * Fix static API implementation and unittest.
      
      * Move the implementation of check_numerics to paddle.amp.
      
      * Fix import error.
      a5444592
  19. 05 6月, 2023 3 次提交
  20. 02 6月, 2023 1 次提交
  21. 01 6月, 2023 2 次提交
  22. 23 5月, 2023 2 次提交
  23. 18 5月, 2023 1 次提交
    • R
      support auto generate for op layer_norm (#53178) · 4f07b653
      RedContritio 提交于
      * simplify layer_norm_op.cc
      
      * support auto generate for op layer_norm
      
      * update unittest for composite_layer_norm
      
      * remove layer_norm_op.cc from scripts
      
      * replace layer_norm_op with generated_op
      
      * add get_expected_kernel for layer_norm
      
      * update cmake kernel register function for layer_norm_mkldnn_op
      4f07b653
  24. 17 5月, 2023 2 次提交
  25. 16 5月, 2023 2 次提交
  26. 12 5月, 2023 1 次提交
    • P
      【Hackathon 4 No.20】Add i0 / i0e to paddle (#52058) · ce256f75
      PommesPeter 提交于
      * added base code for i0 and i0e
      
      * added grad base code for i0 and i0e
      
      * added i0 and i0e python code
      
      * added ops and backward yaml config
      
      * added i0 and i0e cpu kernel, but not test.
      
      * added i0 and i0e code and unitest files
      
      * added test files
      
      * added i0/i0e gpu implementation code
      
      * updated code style
      
      * updated code style
      
      * fixed unitests code
      
      * updated i0 with eigen3
      
      * fixed bug and added more test cases
      
      * refactor: fixed static graph bug
      
      * refactor: removed i0 and i0e from op_compat
      
      * refactor: updated code style
      
      * refactor: updated op_compat.yaml
      
      * refactor: updated op_compat.yaml
      
      * refactor: fixed op name mapping and optimize unittest case
      
      * refactor: manually implement i0 / i0e
      
      * refactor: added grad kernel for i0 / i0e,didn't finish
      
      * Update math.py
      
      * refactor: added equation to doc in English and added comments for computing i0 / i0e gradient
      
      * refactor: removed eigen implementation
      
      * refactor: finished i0 / i0e cpu and gpu op
      
      * refactor: updated code style
      
      * fix: find  a bug but not fix
      
      * fix: incorrect unittest cases
      
      * update: updated code style and remove my file
      
      * update: updated unittest case
      
      * fix: fixed sign error
      
      * fix: fixed mistakes when merging
      
      * refactor: updated code style
      
      * refactor: remove unused code
      
      * refactor: updated code style
      ce256f75
  27. 10 5月, 2023 2 次提交
    • add index_put api (#52886) · f3393f49
      傅剑寒 提交于
      * add index_put api
      
      * fix value broadcast in backward and add test case in static
      
      * add timeout=120s for index_put
      
      * add op_compat for index_put
      
      * add inplace index_put test
      
      * add test case when index tensor in indices is int32 when indices.size less than x.dims
      
      * add index_put api backward in cpu place
      
      * add backward test case
      
      * refactor code to delete some duplicated code
      
      * replace reshape with resize for decrease extra memcpy
      
      * add datatype flag in backward yaml
      
      * fix bug in documentation
      
      * Update python/paddle/tensor/manipulation.py
      
      ---------
      Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
      f3393f49
    • [static op generation] lstsq (#53290) · 4f33f44b
      张春乔 提交于
      4f33f44b