1. 06 Sep, 2023 2 commits
  2. 05 Sep, 2023 3 commits
  3. 04 Sep, 2023 3 commits
  4. 31 Aug, 2023 2 commits
    • Add fused_scale_bias_relu_conv_bnstats OP (#55026) · 71e28b12
      Tian Zheng committed
      * Add fused_scale_bias_relu_conv_bnstats op
      
      * Review changes
      
      * Fix no CUDNN Frontend build
      
      * Fix PADDLE_ENFORCE format
      
      * Fix PADDLE_ENFORCE CI error
      
      * Rename kernel filename
      
      * Refactor unittest to use paddle eager_op_test
      
      * Fix padding bugs
      
      * Review changes
      
      * test=cuda117
      
      * test=cuda117
    • [AutoParallel] Adapt static spmd rules for dynamic graph (#56367) · 54fcd9a9
      Chen Weihang committed
      * move matmul spmd rules into phi
      
      * add basic infer spmd utils
      
      * add spmd factory
      
      * fix compile error
      
      * add unittest
      
      * refine infer spmd test and utils
      
      * debug infer spmd test
      
      * adapt python test
      
      * polish details
      
      * change to vector attr arg
      
      * revert needless change
      
      * update matmul spmd rule test
      
      * remove original rule
      
      * polish details
      
      * fix macro error
      
      * add comment
      
      * pass backward test
      
      * fix compile error
      
      * add cmake rule for spmd_rules_test
      
      * add dist meta tensor
      
      * update pybind impl
      
      * add macro for rules
  5. 30 Aug, 2023 2 commits
    • [NewIR] fix logical op infermeta (#56711) · 987cb97e
      kangguangli committed
      * fix logical op infermeta
      
      * add test
      
      * adapt inplace api
    • Add paddle custom flags support (#56256) · 2ef4ec71
      huangjiyi committed
      * update
      
      * replace gflags header
      
      * replace DEFINE_<type> with PD_DEFINE_<type>
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * update cmake
      
      * add :: before some paddle namespace
      
      * fix link error
      
      * fix CI-Py3
      
      * allow commandline parse
      
      * fix SetFlagsFromEnv
      
      * fix bug
      
      * fix bug
      
      * fix CI-CINN
      
      * fix CI-Coverage-build
      
      * fix CI-Windows-build
      
      * fix CI-Inference
      
      * fix bug
      
      * fix bug
      
      * fix CI-CINN
      
      * fix inference api test
      
      * fix infer_ut test
      
      * revert infer_ut gflags usage
      
      * update
      
      * fix inference
      
      * remove flags export macro
      
      * revert inference demo_ci gflags usage
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix bug when turn on WITH_GFLAGS
      
      * turn on WITH_GFLAGS
      
      * fix bug when turn on WITH_GFLAGS
      
      * fix bug when turn on WITH_GFLAGS
      
      * update
      
      * update and add unittest
      
      * add unittest
      
      * fix conflict
      
      * rerun ci
      
      * update
      
      * resolve conflict
  6. 29 Aug, 2023 2 commits
    • [NewIR] support c_sync_calc_stream/c_sync_comm_stream/send_v2/recv_v2 (#56557) · 0ce66c1c
      zhaoyingli committed
      * [AutoParallel][NewIR] support calc_sync/comm_sync/send_v2/recv_v2
      
      * pre-commit
      
      * rm unittest
      
      * tiny fix
      
      * api_gen support send_v2's output is empty
      
      * fix format
      
      * python_c_gen support send_v2
    • Remove need_move_to_phi (#56371) · daac3829
      Sonder committed
      * remove flag
      
      * open static build flag
      
      * add searchsorted to list
      
      * add register info for fused layernorm
      
      * fix fused_layernorm_kernel output register info
      
      * fix stft register info
      
      * add include
      
      * fix register info
      
      * add skip fake init for fused_layernorm:residual_out
      
      * fix error
      
      * add distributed_fused_lamb_init to StaticBuildBlackList
      
      * set static_build flag to false
  7. 28 Aug, 2023 2 commits
    • 【inplace api】Batch add inplace api gt_, ge_, lt_, le_, eq_, not_equal_,... · c5fc413a
      GGBond8488 committed
      【inplace api】Batch add inplace api gt_, ge_, lt_, le_, eq_, not_equal_, logical_and_, logical_or_, logical_xor_, logical_not_, divide_, floor_divide_, bitwise_and_ , bitwise_or_, bitwise_xor_, bitwise_not_ (#55509)
      
      * tmp commit
      
      * add atan2
      
      * add inplace api
      
      * fix error
      
      * add inplace divide
      
      * add inplace api
      
      * add more inplace
      
      * add more inplace
      
      * fix logical_not error
      
      * support sinh and cosh in cpu
      
      * support asin, acos, atan, asinh, acosh, atanh in cpu
      
      * fix typo
      
      * fix typo
      
      * mv out atan2 ldexp
      
      * mv out atan2 ldexp
      
      * support sinh and cosh in gpu
      
      * support asin, acos, atan, asinh, acosh, atanh in gpu
      
      * fix ge error
      
      * fix dygraph compare error
      
      * fix dygraph compare error
      
      * check complex in python
      
      * fix cast inplace error
      
      * open inplace test
      
      * fix ops.yaml error
      
      * mv cast inplace to python
      
      * fix coverage ci
      
      * add last inplace
      
      * fix inplace error
      
      * fix cast error
      
      * fix error
      
      * add nan_to_num_
      
      * fix typo
      
      * fix sparse cast error
      
      * remove gpu 4
      
      * fix static cast error
      
      * tmp commit
      
      * add atan2
      
      * add inplace api
      
      * fix error
      
      * add inplace divide
      
      * add inplace api
      
      * add more inplace
      
      * add more inplace
      
      * fix logical_not error
      
      * fix typo
      
      * fix typo
      
      * mv out atan2 ldexp
      
      * mv out atan2 ldexp
      
      * fix ge error
      
      * fix dygraph compare error
      
      * fix dygraph compare error
      
      * fix cast inplace error
      
      * open inplace test
      
      * fix ops.yaml error
      
      * mv cast inplace to python
      
      * fix coverage ci
      
      * add last inplace
      
      * fix inplace error
      
      * fix cast error
      
      * fix error
      
      * add nan_to_num_
      
      * fix typo
      
      * fix sparse cast error
      
      * remove gpu 4
      
      * fix static cast error
      
      * fix cast error
      
      * fix
      
      * Revert "check complex in python"
      
      This reverts commit c822064261d774dd58ad46a4f90ba8b467700a05.
      
      * add renorm , fix error
      
      * add coverage
      
      * fix cumsum inplace version error
      
      * add cast inplace impl
      
      * rm test.log
      
      * fix multiply_dyfunction and add multiply_backward test
      
      * add and use is_same_tensor
      
      * fix typo
      
      * fix some error
      
      * fix typo
      
      ---------
      Co-authored-by: NScotty <jmhgchn@gmail.com>
      Co-authored-by: NScotty <527407973@qq.com>
    • [NewIR] register set_value in new ir (#56436) · deee91d8
      kangguangli committed
      * register set_value in new ir
      
      * fix
      
      * register set_value_grad
      
      * fix
      
      * fix
      
      * remove debug info
      
      * add unittest
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * resolve comments
  8. 25 Aug, 2023 3 commits
  9. 24 Aug, 2023 2 commits
  10. 23 Aug, 2023 2 commits
  11. 22 Aug, 2023 2 commits
  12. 21 Aug, 2023 1 commit
  13. 18 Aug, 2023 1 commit
  14. 16 Aug, 2023 4 commits
  15. 15 Aug, 2023 1 commit
  16. 14 Aug, 2023 1 commit
    • Add rmsnorm residual bias add and quant (#55965) · 2ac6a7e4
      MarDino committed
      * add rmsnorm residual bias add and quant
      
      * refine python interface
      
      * add rmsnorm unittest
      
      * Add layernorm
      
      * fix layernorm unittest
      
      * refine unittest
      
      * fix example code
      
      * fix review comment
  17. 10 Aug, 2023 1 commit
    • Add variable_length_memory_efficient_attention (#55400) · 4036c937
      lzy committed
      * add variable_length_memory_efficient_attention
      * update variable_length_memory_efficient_attention unittest
      * update variable_length_mem_eff_attn's docs and unittest
      * update variable_length_mem_eff_attn's docs
      * Update test_variable_length_memory_efficient_attention.py
      * Update variable_length_memory_efficient_attention.cu
      * fix codestyle
      * fix variable_length_fmha's docs and unittest
      * fix variable_length_fmha's docs
  18. 09 Aug, 2023 1 commit
  19. 08 Aug, 2023 2 commits
  20. 03 Aug, 2023 3 commits