1. 05 Sep, 2023 1 commit
  2. 04 Sep, 2023 3 commits
  3. 01 Sep, 2023 2 commits
    • 【Complex op】add complex support for index_select and index_sample (#56457) · 0b608393
      Scotty committed
      * support index_select op
      
      * index_sample in cpu
      
      * support index_sample in gpu
      
      * change data_transform
      
      * fix api gen and use skip_transform in yaml
    • [IR] Generate pd_op.parsed.yaml from pd_op.yaml (#56674) · 962f67d2
      chen2016013 committed
      * Generate pd_op.parsed.yaml from pd_op.yaml
      
      * Generate pd_op.parsed.yaml from pd_op.yaml
      
      * fix bug
      
      * bug fix
      
      * bug fix
      
      * bug fix
      
      * Add operators to pd_ops.yaml & change the storage path of pd_ops.parsed.yaml
      
      * Fix path dependency bug & add .gitignore file
      
      * fix bug - compat input args in save_combine op
      
      * fix compat file
      
      * fix set_value_with_tensor yaml
      
      * split backward op in original yaml file
      
      * add send_v2 & recv_v2
  4. 31 Aug, 2023 1 commit
    • Add fused_scale_bias_relu_conv_bnstats OP (#55026) · 71e28b12
      Tian Zheng committed
      * Add fused_scale_bias_relu_conv_bnstats op
      
      * Review changes
      
      * Fix no CUDNN Frontend build
      
      * Fix PADDLE_ENFORCE format
      
      * Fix PADDLE_ENFORCE CI error
      
      * Rename kernel filename
      
      * Refactor unittest to use paddle eager_op_test
      
      * Fix padding bugs
      
      * Review changes
      
      * test=cuda117
      
      * test=cuda117
  5. 30 Aug, 2023 2 commits
    • [NewIR] fix logical op infermeta (#56711) · 987cb97e
      kangguangli committed
      * fix logical op infermeta
      
      * add test
      
      * adapt inplace api
    • Add paddle custom flags support (#56256) · 2ef4ec71
      huangjiyi committed
      * update
      
      * replace gflags header
      
      * replace DEFINE_<type> with PD_DEFINE_<type>
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * update cmake
      
      * add :: before some paddle namespace
      
      * fix link error
      
      * fix CI-Py3
      
      * allow commandline parse
      
      * fix SetFlagsFromEnv
      
      * fix bug
      
      * fix bug
      
      * fix CI-CINN
      
      * fix CI-Coverage-build
      
      * fix CI-Windows-build
      
      * fix CI-Inference
      
      * fix bug
      
      * fix bug
      
      * fix CI-CINN
      
      * fix inference api test
      
      * fix infer_ut test
      
      * revert infer_ut gflags usage
      
      * update
      
      * fix inference
      
      * remove flags export macro
      
      * revert inference demo_ci gflags usage
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix bug when turn on WITH_GFLAGS
      
      * turn on WITH_GFLAGS
      
      * fix bug when turn on WITH_GFLAGS
      
      * fix bug when turn on WITH_GFLAGS
      
      * update
      
      * update and add unittest
      
      * add unittest
      
      * fix conflict
      
      * rerun ci
      
      * update
      
      * resolve conflict
  6. 29 Aug, 2023 1 commit
  7. 28 Aug, 2023 2 commits
    • 【inplace api】Batch add inplace api gt_, ge_, lt_, le_, eq_, not_equal_,... · c5fc413a
      GGBond8488 committed
      【inplace api】Batch add inplace api gt_, ge_, lt_, le_, eq_, not_equal_, logical_and_, logical_or_, logical_xor_, logical_not_, divide_, floor_divide_, bitwise_and_ , bitwise_or_, bitwise_xor_, bitwise_not_ (#55509)
      
      * tmp commit
      
      * add atan2
      
      * add inplace api
      
      * fix error
      
      * add inplace divide
      
      * add inplace api
      
      * add more inplace
      
      * add more inplace
      
      * fix logical_not error
      
      * support sinh and cosh in cpu
      
      * support asin, acos, atan, asinh, acosh, atanh in cpu
      
      * fix typo
      
      * fix typo
      
      * mv out atan2 ldexp
      
      * mv out atan2 ldexp
      
      * support sinh and cosh in gpu
      
      * support asin, acos, atan, asinh, acosh, atanh in gpu
      
      * fix ge error
      
      * fix dygraph compare error
      
      * fix dygraph compare error
      
      * check complex in python
      
      * fix cast inplace error
      
      * open inplace test
      
      * fix ops.yaml error
      
      * mv cast inplace to python
      
      * fix coverage ci
      
      * add last inplace
      
      * fix inplace error
      
      * fix cast error
      
      * fix error
      
      * add nan_to_num_
      
      * fix typo
      
      * fix sparse cast error
      
      * remove gpu 4
      
      * fix static cast error
      
      * tmp commit
      
      * add atan2
      
      * add inplace api
      
      * fix error
      
      * add inplace divide
      
      * add inplace api
      
      * add more inplace
      
      * add more inplace
      
      * fix logical_not error
      
      * fix typo
      
      * fix typo
      
      * mv out atan2 ldexp
      
      * mv out atan2 ldexp
      
      * fix ge error
      
      * fix dygraph compare error
      
      * fix dygraph compare error
      
      * fix cast inplace error
      
      * open inplace test
      
      * fix ops.yaml error
      
      * mv cast inplace to python
      
      * fix coverage ci
      
      * add last inplace
      
      * fix inplace error
      
      * fix cast error
      
      * fix error
      
      * add nan_to_num_
      
      * fix typo
      
      * fix sparse cast error
      
      * remove gpu 4
      
      * fix static cast error
      
      * fix cast error
      
      * fix
      
      * Revert "check complex in python"
      
      This reverts commit c822064261d774dd58ad46a4f90ba8b467700a05.
      
      * add renorm , fix error
      
      * add coverage
      
      * fix cumsum inplace version error
      
      * add cast inplace impl
      
      * rm test.log
      
      * fix multiply_dyfunction and add multiply_backward test
      
      * add and use is_same_tensor
      
      * fix typo
      
      * fix some error
      
      * fix typo
      
      ---------
      Co-authored-by: Scotty <jmhgchn@gmail.com>
      Co-authored-by: Scotty <527407973@qq.com>
    • [NewIR] register set_value in new ir (#56436) · deee91d8
      kangguangli committed
      * register set_value in new ir
      
      * fix
      
      * register set_value_grad
      
      * fix
      
      * fix
      
      * remove debug info
      
      * add unittest
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * resolve comments
  8. 25 Aug, 2023 2 commits
  9. 24 Aug, 2023 3 commits
  10. 23 Aug, 2023 1 commit
  11. 22 Aug, 2023 3 commits
  12. 21 Aug, 2023 1 commit
  13. 18 Aug, 2023 2 commits
  14. 16 Aug, 2023 7 commits
  15. 15 Aug, 2023 1 commit
  16. 14 Aug, 2023 2 commits
  17. 10 Aug, 2023 2 commits
    • Add variable_length_memory_efficient_attention (#55400) · 4036c937
      lzy committed
      * add variable_length_memory_efficient_attention
      * update variable_length_memory_efficient_attention unittest
      * update variable_length_mem_eff_attn's docs and unittest
      * update variable_length_mem_eff_attn's docs
      * Update test_variable_length_memory_efficient_attention.py
      * Update variable_length_memory_efficient_attention.cu
      * fix codestyle
      * fix variable_length_fmha's docs and unittest
      * fix variable_length_fmha's docs
    • add tanh_triple_grad composite logic (#56072) · 7c4a3556
      lxd-cumt committed
      * decompose tanh_triple_grad and add it into prim_white_list test=develop
      
      * fix TanhTripleGradKernel bugs test=develop
      
      * decompose tanh_triple_grad test=develop
  18. 09 Aug, 2023 1 commit
  19. 08 Aug, 2023 2 commits
  20. 07 Aug, 2023 1 commit
    • Add attn_mask supported for FlashAttnKernel. (#55969) · 42e0c6b8
      yin wei committed
      * add mask
      
      * add backward
      
      * add enforce info
      
      * update scale
      
      * integrate code
      
      * update enforce
      
      * add enforce eq
      
      * add error type
      
      * update enforce
      
      * add test_flash_attention
      
      * Polish codes and fix compiling errors.
      
      * Set num_splits to 0 for flash-attn with tensor mask.
      
      * Fix the compiling error for non flash-attn case.
      
      ---------
      Co-authored-by: Liu Yiqun <liuyiqun01@baidu.com>