1. 04 9月, 2023 1 次提交
  2. 30 8月, 2023 1 次提交
  3. 28 8月, 2023 1 次提交
    • G
      【inplace api】Batch add inplace api gt_, ge_, lt_, le_, eq_, not_equal_,... · c5fc413a
      GGBond8488 提交于
      【inplace api】Batch add inplace api gt_, ge_, lt_, le_, eq_, not_equal_, logical_and_, logical_or_, logical_xor_, logical_not_, divide_, floor_divide_, bitwise_and_ , bitwise_or_, bitwise_xor_, bitwise_not_ (#55509)
      
      * tmp commit
      
      * add atan2
      
      * add inplace api
      
      * fix error
      
      * add inpalce divide
      
      * add inplace api
      
      * add more inplace
      
      * add more inpalce
      
      * fix logical_not error
      
      * support sinh and cosh in cpu
      
      * support asin, acos, atan, asinh, acosh, atanh in cpu
      
      * fix typro
      
      * fix typro
      
      * mv out atan2 ldexp
      
      * mv out atan2 ldexp
      
      * support sinh and cosh in gpu
      
      * support asin, acos, atan, asinh, acosh, atanh in gpu
      
      * fix ge error
      
      * fix dygraph commpare error
      
      * fix dygraph commpare error
      
      * check complex in python
      
      * fix cast inpalce error
      
      * open inplace test
      
      * fix ops.yaml error
      
      * mv cast inpalce to python
      
      * fix coverage ci
      
      * add last inplace
      
      * fix inplace error
      
      * fix cast error
      
      * fix error
      
      * add nan_to_num_
      
      * fix typro
      
      * fix sparse cast error
      
      * remove gpu 4
      
      * fix static cast error
      
      * tmp commit
      
      * add atan2
      
      * add inplace api
      
      * fix error
      
      * add inpalce divide
      
      * add inplace api
      
      * add more inplace
      
      * add more inpalce
      
      * fix logical_not error
      
      * fix typro
      
      * fix typro
      
      * mv out atan2 ldexp
      
      * mv out atan2 ldexp
      
      * fix ge error
      
      * fix dygraph commpare error
      
      * fix dygraph commpare error
      
      * fix cast inpalce error
      
      * open inplace test
      
      * fix ops.yaml error
      
      * mv cast inpalce to python
      
      * fix coverage ci
      
      * add last inplace
      
      * fix inplace error
      
      * fix cast error
      
      * fix error
      
      * add nan_to_num_
      
      * fix typro
      
      * fix sparse cast error
      
      * remove gpu 4
      
      * fix static cast error
      
      * fix cast error
      
      * fix
      
      * Revert "check complex in python"
      
      This reverts commit c822064261d774dd58ad46a4f90ba8b467700a05.
      
      * add renorm , fix error
      
      * add coverage
      
      * fix cumsum inpalce version error
      
      * add cast inpalce impl
      
      * rm test.log
      
      * fix multiply_dyfunction and add multiply_backward test
      
      * add and use is_same_tensor
      
      * fix typro
      
      * fix sone error
      
      * fix typro
      
      ---------
      Co-authored-by: NScotty <jmhgchn@gmail.com>
      Co-authored-by: NScotty <527407973@qq.com>
      c5fc413a
  4. 03 8月, 2023 1 次提交
  5. 12 7月, 2023 1 次提交
  6. 20 6月, 2023 1 次提交
    • Z
      [IR] Change IR from Static library to dynamic library (#54729) · 24a3cb52
      zhangbo9674 提交于
      * new_ir to shared
      
      * refine code
      
      * add ir lib path to env
      
      * refine type
      
      * refine code
      
      * fix bug
      
      * fix bug
      
      * refine code
      
      * refine code
      
      * close win
      
      * refine code
      
      * refine code
      
      * refine code
      
      * add win share
      
      * refine code
      
      * refie code
      
      * refine code
      
      * refine code
      
      * refien code
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * solve conflict
      
      * solve conflict
      
      * fix bug
      
      * refine code
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * refine code
      
      * fix interpretercore program bug
      
      * delete unuse code
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix cinn bug
      
      * fix cinn bug
      
      * debug
      
      * fix cinn bug
      
      * delete unused code
      
      * fix cinn bug
      
      * fix cinn bug
      
      * fix  ug
      
      * test win openblas
      
      * test win openblas
      
      * fix win openblas bug
      
      * polish code
      
      * fix win open blas bug
      
      * close win dll
      
      * fix flag bug
      
      * test for windows
      
      * fix compile bug
      24a3cb52
  7. 13 6月, 2023 1 次提交
  8. 09 6月, 2023 1 次提交
  9. 05 6月, 2023 1 次提交
  10. 01 6月, 2023 1 次提交
    • T
      mv all unittests test (#53235) · b0e86d55
      tianshuo78520a 提交于
      * mv all unittests test
      
      * fix error
      
      * fix error
      
      * fix
      
      * fix
      
      * del unittests
      
      * fix paddle_build.sh
      
      * fix
      
      * fix test
      
      * fix add test
      
      * fix
      
      * fix
      
      * fix
      
      * merge develop
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * merge develop
      
      * fix test_async_read_write
      
      * fix test_async_read_write
      
      * merge develop
      
      * fix
      
      * fix import legacy_test
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix bug
      
      * fix
      
      * fix coverage test bug
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix code sstyle
      
      * fix code
      
      * fix code
      
      * fix
      
      * fix
      
      * fix
      
      * del test_sequence_enumerate_op.py
      
      * fix
      b0e86d55
  11. 26 5月, 2023 1 次提交
    • Y
      [PHI Decoupling]Create PHI shared lib (#53735) · da50a009
      YuanRisheng 提交于
      * create phi so
      
      * fix ci bugs
      
      * fix py3 bugs
      
      * add file
      
      * fix py3 bugs
      
      * fix windows bugs
      
      * perfect so
      
      * fix py3 bugs
      
      * delete all static target in phi
      
      * fix windows bugs
      
      * fix py3 bugs
      
      * fix ci bugs
      
      * fix windows bugs
      
      * fix bugs: gflags can't be linked by dynamic and static lib
      
      * fix bugs that can not load 3rd party
      
      * fix ci bugs
      
      * fix compile bugs
      
      * fix py3 bugs
      
      * fix conflict
      
      * fix xpu bugs
      
      * fix mac compile bugs
      
      * fix psgpu bugs
      
      * fix inference failed
      
      * deal with conflict
      
      * fix LIBRARY_PATH bug
      
      * fix windows bugs
      
      * fix onednn error
      
      * fix windows compile bugs
      
      * fix windows compile bugs
      
      * fix test_cuda_graph_static_mode_error aborted
      
      * fix windows bugs
      
      * fix mac-python3 error
      
      * fix hip compile bugs
      
      * change mode to static
      
      * change to static mode
      
      * fix ci bugs
      
      * fix py3 bugs
      
      * fix windows bugs
      
      * fix bugs
      
      * add static flag
      
      * add PADDLE_API
      
      * change position of PADDLE_API
      
      * fix windows bugs
      
      * change mode to dynamic lib
      
      * fix windows static bugs
      
      * deal with conflict
      
      * fix windows unit bug
      
      * fix coverage
      
      * deal with conflict
      
      * fix windows-inference
      
      * fix py3 bugs
      
      * fix bugs when compile type_info
      
      * fix compile bugs
      
      * fix py3 bugs
      
      * fix windows bugs
      
      * fix windows openblas
      
      * fix xpu bugs
      
      * fix enforce_test in windows
      
      * update code according comment
      
      * fix windows cmake bug
      
      * fix windows bugs
      
      * fix windows bugs
      
      * delete cinn unittest
      
      * fix cinn bugs
      
      ---------
      Co-authored-by: HappyHeavyRain's avatarlzydev <1528794076@qq.com>
      da50a009
  12. 19 5月, 2023 1 次提交
    • L
      Add flash attention to speedup fused_gate_attention. (#52731) · d29c1f8e
      limingshu 提交于
      * Reorganize the forward codes of flash-attention.
      
      * Fix forward.
      
      * Remove some noused codes.
      
      * Simplify codes and fix backward.
      
      * Change all LOG(INFO) to VLOG and fix the backward.
      
      * add scale for AF2 flash_attn, much thanks to xreki and shaojie for debug these codes
      
      * decrease the effect of debug print on performance
      
      * Unify the initialize of flashattn arguments.
      
      * Rewirte the reshape of temp_mask and temp_bias.
      
      * API support use_flash_attn.
      
      * Fix compiling error on CI.
      
      * Try to crop the flash-attention lib.
      
      * Correct the condition of whether can use flash-attn.
      
      * Remove the softmax_out argument.
      
      * Remove is_causal.
      
      * Polish codes.
      
      * Fix qkv_transpose_out's shape and scaling of Q * K.
      
      * Update commit of flash-attention.
      
      ---------
      Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
      d29c1f8e
  13. 16 5月, 2023 1 次提交
  14. 25 4月, 2023 1 次提交
    • Y
      [PHI]Add flags macro for PHI (#52991) · 22e96bde
      YuanRisheng 提交于
      * add flags for phi
      
      * fix compile bugs
      
      * fix ci bugs
      
      * fix inference bugs
      
      * fix cinn' bugs
      
      * fix cinn bugs
      
      * perfect code according comment
      
      * fix ci bugs
      
      * fix ci bugs
      22e96bde
  15. 23 4月, 2023 1 次提交
  16. 18 4月, 2023 1 次提交
  17. 30 3月, 2023 1 次提交
  18. 07 3月, 2023 1 次提交
  19. 31 1月, 2023 1 次提交
  20. 09 1月, 2023 1 次提交
    • J
      Prim paddle Basic (#49272) · 2f601282
      Jiabin Yang 提交于
      * proto type of composite grad in paddle
      
      * proto type of composite grad in paddle
      
      * refactor composite api with phi
      
      * fix compile error
      
      * support static graph code-gen for squeeze op
      
      * generate static graph code of unsqueeze
      
      * refine op name
      
      * fix compile error
      
      * add extra output in op_compat
      
      * remove debug log
      
      * fix clang compile error
      
      * support prim switch flag
      
      * support prim switch flag
      
      * fix dygraph error
      
      * merge develop
      
      * add code_gen
      
      * add necessary files without codegen
      
      * fix code_gen bug
      
      * add deps
      
      * modify igmnore
      
      * add ignore
      
      * delete std cout
      
      * add composite logic for backward.py
      
      * add tanh first order grad composite
      
      * support enable_prim flag for static graph
      
      * throw expection when both GrapOpMaker and GradCompOpMaker not been registered
      
      * reorganize the directory of prim api tests
      
      * fix windows error
      
      * add eager_utils
      
      * add eager_utils
      
      * modify code gen
      
      * add composite parse
      
      * add unittest for get_grad_op_desc
      
      * code optimize
      
      * fix static test on windows
      
      * support generate static graph code for imag and real op
      
      * fix windows compile error in test_static_prim
      
      * merge develop
      
      * disable test eager in inference
      
      * prim code gen
      
      * disable eager compile in inference
      
      * rm other file
      
      * rm gitignore file
      
      * code_style
      
      * add eager test
      
      * code_style
      
      * merge develop
      
      * remove useless files
      
      * modify static test
      
      * support bool flag from singlton
      
      * merge develop
      
      * recover git ignore
      
      * fix conflict
      
      * recover git ignore for generated op
      
      * fix test compile error
      
      * remove some tests
      
      * add python test
      
      * fix some name issue
      
      * add composite code gen
      
      * modify backward yaml
      
      * fix static composite grad maker code gen
      
      * remove addtional files
      
      * add some static funcs unit test
      
      * fix some bugs
      
      * fix composite grad maker register code gen
      
      * optimize some functions
      Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com>
      Co-authored-by: Nwangruting <wangruting@baidu.com>
      Co-authored-by: Ncxxly <chenxx_id@163.com>
      Co-authored-by: Ncharles-hit <wanghao107@baidu.com>
      Co-authored-by: Nxiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com>
      2f601282
  21. 13 12月, 2022 1 次提交
  22. 09 12月, 2022 1 次提交
  23. 08 12月, 2022 1 次提交
  24. 01 12月, 2022 1 次提交
  25. 07 11月, 2022 2 次提交
  26. 01 11月, 2022 1 次提交
    • C
      Adapting device-specific Extra Attributes for the PHI kernel (#46342) · c923e6c9
      Chen Weihang 提交于
      * add extra attr property set
      
      * add type_info for all context
      
      * add onednn context to all context
      
      * fix context compile error
      
      * simplify conv kernel args
      
      * pass runtime attr into dev_ctx
      
      * fix marco error
      
      * clear conv_grad_kernel extra args
      
      * merge conv_grad_grad into conv_grad
      
      * clear conv2d_grad_grad extra attrs
      
      * clear yaml and eager extra attr
      
      * fix conv1d error
      
      * change to thread local
      
      * fix npu compile failed
      
      * try to fix windows compile failed
      
      * add conv2d onednn phi kernel
      
      * fix ci bugs (#36)
      
      * fix compile bugs (#38)
      
      * fix extra input transform bug (#39)
      
      * support dynamic created attr (#40)
      
      * reset extra info gen code
      
      * rm conv_grad_grad kernel
      
      * reimpl pass attr adapting
      
      * add int attr support
      
      * remove vector inputnames creating
      
      * fix map at error
      
      * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc
      Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
      
      * remove useless extra attrs
      
      * replace mkldnn_engine by onednn_engine
      Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
      Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
      c923e6c9
  27. 26 10月, 2022 1 次提交
  28. 19 9月, 2022 1 次提交
  29. 15 9月, 2022 1 次提交
  30. 06 9月, 2022 1 次提交
  31. 05 9月, 2022 1 次提交
  32. 26 8月, 2022 1 次提交
  33. 02 8月, 2022 1 次提交
  34. 01 8月, 2022 1 次提交
    • L
      unify gpu context (#44740) · 86763023
      Leo Chen 提交于
      * remove cudaDeviceContext
      
      * remove more template
      
      * fix rocm compile
      
      * remove alias name CUDADeviceContext
      
      * fix compile
      
      * fix tests
      
      * revert changes
      86763023
  35. 29 7月, 2022 1 次提交
  36. 19 7月, 2022 1 次提交
  37. 14 7月, 2022 1 次提交
  38. 13 7月, 2022 1 次提交
  39. 09 7月, 2022 1 次提交