1. 16 8月, 2023 5 次提交
  2. 15 8月, 2023 1 次提交
  3. 14 8月, 2023 2 次提交
  4. 10 8月, 2023 2 次提交
    • L
      Add variable_length_memory_efficient_attention (#55400) · 4036c937
      lzy 提交于
      * add variable_length_memory_efficient_attention
      * update variable_length_memory_efficient_attention unittest
      * update variable_length_mem_eff_attn's docs and unittest
      * update variable_length_mem_eff_attn's docs
      * Update test_variable_length_memory_efficient_attention.py
      * Update variable_length_memory_efficient_attention.cu
      * fix codestyle
      * fix variable_length_fmha's docs and unittest
      * fix variable_length_fmha's docs
      4036c937
    • L
      add tanh_triple_grad composite logic (#56072) · 7c4a3556
      lxd-cumt 提交于
      * decompose tanh_triple_grad and add it into prim_white_list test=develop
      
      * fix TanhTripleGradKernel bugs test=develop
      
      * decompose tanh_triple_grad test=develop
      7c4a3556
  5. 09 8月, 2023 1 次提交
  6. 08 8月, 2023 2 次提交
  7. 07 8月, 2023 1 次提交
    • Y
      Add attn_mask supported for FlashAttnKernel. (#55969) · 42e0c6b8
      yin wei 提交于
      * add mask
      
      * add backword
      
      * add enforce info
      
      * update scale
      
      * integrate code
      
      * update enforce
      
      * add enforce eq
      
      * add error type
      
      * update enforce
      
      * add test_flash_attention
      
      * Polish codes and fix compiling errors.
      
      * Set num_splits to 0 for flash-attn with tensor mask.
      
      * Fix the compiling error for non flash-attn case.
      
      ---------
      Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
      42e0c6b8
  8. 04 8月, 2023 1 次提交
    • K
      [NewIR] Rename feed with place to data (#55778) · 274e5e54
      kangguangli 提交于
      * fix bug: feed_with_place should consider variable existence
      
      * fix
      
      * fix build scope
      
      * change method to set feed var name
      
      * remove feed_with_place to placeholder
      
      * fix
      
      * rename to data
      
      * fix
      
      * fix
      274e5e54
  9. 03 8月, 2023 3 次提交
  10. 02 8月, 2023 1 次提交
  11. 01 8月, 2023 1 次提交
  12. 31 7月, 2023 2 次提交
  13. 28 7月, 2023 2 次提交
  14. 27 7月, 2023 2 次提交
  15. 26 7月, 2023 1 次提交
  16. 25 7月, 2023 1 次提交
    • H
      [NewIR]new ir dygraph to static supoort gpu (#55620) · fb9bec5d
      hong 提交于
      * add kernel dialect
      
      * change DenseTensorTypeStorage to DenseTensorType
      
      * add test case`
      
      * add first pd_op to kernel dialect
      
      * lower pd op to kernel dialect
      
      * update
      
      * update
      
      * remove useless code
      
      * add attrite print test
      
      * fix bug
      
      * update
      
      * update
      
      * update
      
      * update
      
      * polish code
      
      * fix bug
      
      * polish  code  and add python test
      
      * add test
      
      * fix test error
      
      * relax constraint when inserting get_parameter
      
      * add env flag
      
      * fix bug
      
      * dygraph2static support new ir
      
      * fix bug
      
      * revert test env
      
      * change cc_test_old to cc_test
      
      * update
      
      * fix build_static bug
      
      * update test
      
      * fix type test error
      
      * udpate cmake
      
      * disable test in windows
      
      * fix inference compile
      
      * fix program translator error
      
      * only run on cpu, not support gpu yet
      
      * fix conflict
      
      * polish code
      
      * fix bug
      
      * add feed with place op
      
      * update
      
      * remove useless unitest
      
      * udpate mkldnn
      
      * update
      
      * update
      
      * align mkldnn version
      
      * new ir support builtin slice op
      
      * fix bug
      
      * fix phi kernel adaptor bug
      
      * add enable static
      
      * add enable_static
      
      * remove useless test case
      
      * change feed list to single variable
      
      * update
      
      * add feed with place and shaddow output op
      
      * fix bug
      
      * remove usless code
      
      * support gpu
      
      * fix bug
      
      * fix bug
      
      * remove template
      
      * add more data type
      
      * fix cimpile bug
      
      * udpate
      
      * remove useless code
      
      * revert dygraph2st test
      
      * remove usless code
      
      * revert op
      
      * fix bug
      
      * new ir dygraph2static support gpu
      
      * remove usless code
      
      * code polish
      
      * add const
      
      * revert code and remove useless code
      
      * revert code
      
      * revert legacy op yaml
      
      * remove useless code
      
      * delete std::move
      
      ---------
      Co-authored-by: Nkangguangli <kangguangli@hotmail.com>
      fb9bec5d
  17. 24 7月, 2023 1 次提交
  18. 21 7月, 2023 1 次提交
  19. 20 7月, 2023 2 次提交
  20. 19 7月, 2023 4 次提交
  21. 18 7月, 2023 4 次提交
    • G
      batch add inpalce api (#55078) · 19302938
      GGBond8488 提交于
      * batch add inpalce api
      
      * fix inplace fn generate
      
      * add test for  new inpalce api
      
      * fix typro
      
      * fix typro
      
      * fix typro
      
      * fix test error
      
      * fix atan2
      
      * remove atan2
      
      * auto genereate inpalce api
      
      * fix inplace generate fn error
      
      * fix windows error
      
      * fix test error
      
      * fix test error
      
      * fix windows ci error
      
      * fix test error
      
      * fix test_error
      
      * fix test error
      
      * fix eigen aliasing error in inplace
      
      * remove elementwise_pow inplace
      
      * fix doc error
      
      * fix test error
      19302938
    • H
      [NewIR]Fix new ir concat split bug (#55419) · 5e6645d7
      hong 提交于
      * fix new ir concat op bug
      
      * fix bug
      
      * using add_n_with_kernel instead of add_n impl
      
      * fix pd_op yaml bug
      
      * fix bug
      5e6645d7
    • K
      [NewIR] fix hsigmoid_loss (#55483) · 38782dc3
      kangguangli 提交于
      * fix hsigmoid_loss
      
      * add test into whitelist
      38782dc3
    • G
      [OpCompat] add cast and repeat_interleave in op_compat.yaml (#55467) · 922d2481
      gouzil 提交于
      * add cast and repeat_interleave
      
      * fix
      922d2481