1. 29 Mar, 2023 1 commit
    • Add group_norm composite rule (#51874) · cabf3921
      Authored by Yichen Zhang
      * add group_norm composite rule
      
      * add test for scale_grad and bias_grad
      
      * resolve conflicts
      
      * remove amp in composite_rule.py
      
      * add float16 test
      
      * deal with NHWC format
      
      * keep the composite rule in float16 identical as original kernel
      
      * resolve conflicts
  2. 20 Mar, 2023 1 commit
    • 【prim】New layer_norm grad (#51750) · 802a81d0
      Authored by xiaoguoguo626807
      * Add flatten composite rule
      
      * get the right xshape and pass func test
      
      * add cinn unit test
      
      * Remove cinn test, wait for it to be added after repair
      
      * add comp test to test_flatten_contiguous_range_op.py
      
      * remove func test on composite_ops
      
      * Add comments to maybe_wrap_dim func
      
      * remove commented code
      
      * fix the problem with 0D tensor case
      
      * add flatten split rule comment
      
      * fix syntax issues
      
      * block flatten on resnet_prim_cinn
      
      * init change
      
      * tmp commit
      
      * add layer_norm InferMeta check
      
      * cast type modify
      
      * [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)
      
      * [CINN]Enhance CacheKey hash logic by considering input dtypes
      
      * add unittest
      
      * fix typo
      
      * fix typo
      
      * fix map.at
      
      * fix find
      
      * fix test
      
      * fix cinn cache key structure realize
      
      * using ordered map for attributes
      
      * add test by review advice
      
      ---------
      Co-authored-by: jiangcheng <thisjiang@qq.com>
      
      * [prim] enable dygraph_to_static to support custom_vjp
      
      * Pr 50885 (#7)
      
      * fix code in a dy2static-friendly way.
      
      * [dystatic] add hooker for prim
      
      ---------
      Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
      Co-authored-by: jiangcheng <thisjiang@qq.com>
      Co-authored-by: cxxly <chenxx_id@163.com>
      
      * [prim] enable dygraph_to_static to support custom_vjp
      
      * fix cast prim and vjp dtype mapping error bug
      
      * recover
      
      * big tol
      
      * Cxx prim custom vjp (#8)
      
      * [dy2static-ci] fix dy2static ci errors.
      
      ---------
      Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
      Co-authored-by: jiangcheng <thisjiang@qq.com>
      Co-authored-by: cxxly <chenxx_id@163.com>
      
      * [Prim] enable whitelist and blacklist for custom_vjp
      
      * debug log
      
      * clear log
      
      * fix
      
      * nothing
      
      * less memory
      
      * recover utils
      
      * fix
      
      * modify threshold value
      
      * skip layer_norm for test_bert
      
      * back to bert success state
      
      * add epsilon
      
      * delete unnecessary compute
      
      * modify amp dtype
      
      * modify * order
      
      * delete sqrt check and fp16
      
      ---------
      Co-authored-by: xuyongsheng <xuyongsheng@baidu.com>
      Co-authored-by: xysheng-baidu <121540080+xysheng-baidu@users.noreply.github.com>
      Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
      Co-authored-by: jiangcheng <thisjiang@qq.com>
      Co-authored-by: cxxly <chenxx_id@163.com>
      Co-authored-by: xiongkun <807377414@qq.com>
  3. 16 Mar, 2023 1 commit
  4. 14 Mar, 2023 1 commit
  5. 01 Mar, 2023 1 commit
    • Integration flash attention (#49869) · 61611786
      Authored by Chitsing KUI
      * flash attn
      
      * seed
      
      * almost
      
      * softmax
      
      * fix workspace
      
      * add unittest; Linux only
      
      * fix setup
      
      * fix datatype include
      
      * fix setup typo
      
      * fix def scope
      
      * new error api
      
      * use paddle fork
      
      * fix attr bug; complete ut
      
      * update flash hash
      
      * fix rng reset
      
      * fix offset
      
      * fix comments
  6. 12 Jan, 2023 1 commit
    • lerp support 0 Tensor (#49667) · 8cd0d5b3
      Authored by sunli
      * lerp support 0 Tensor
      
      * fix lerp grad
      
      * fix lerp zero test
      
      * fix 0D + ND/ND + 0D
      
      * fix check
      
      * update code
      
      * fix lerp infer shape
      
      * static backward test
      
      * update static graph test
  7. 16 Dec, 2022 1 commit
  8. 03 Dec, 2022 1 commit
  9. 17 Nov, 2022 1 commit
  10. 15 Nov, 2022 1 commit
  11. 14 Nov, 2022 1 commit
  12. 01 Nov, 2022 1 commit
  13. 31 Oct, 2022 1 commit
  14. 17 Oct, 2022 1 commit
  15. 12 Aug, 2022 3 commits
    • refix index resize in multiclassnms3 (#45095) · 49e2a4d8
      Authored by zhiboniu
    • fix extra output of kernels for inference (#45048) · 1cb883da
      Authored by zyfncg
    • [geometric]Add paddle.geometric.send_ue_recv API (#43174) · 615b15a3
      Authored by Siming Dai
      * add init file
      
      * add op definition and infermeta
      
      * add kernel definition funcs
      
      * add broadcast infer shape
      
      * add gpu forward kernel
      
      * delete SUB and DIV
      
      * add x_grad
      
      * add template
      
      * add e_grad for min and max
      
      * fix small bug
      
      * temp commit
      
      * temp commit
      
      * add e_grad for sum and mean
      
      * fix some compile bug
      
      * fix compile bugs
      
      * fix compile problem
      
      * add sum forward unittest
      
      * fix broadcast error, add kernel sig, register e_grad, change unit test
      
      * fix grad
      
      * add temp grad fix
      
      * temp commit
      
      * add min max unittest
      
      * add max, min unittest, fix mul bug
      
      * add cpu forward sum and mean
      
      * add forward min max, fix mean unittest
      
      * add cpu backward min max
      
      * fix code-style
      
      * add backward sum mean
      
      * fix rocm ci
      
      * set unittest timeout
      
      * fix bug of x broadcast to e, gpu grad
      
      * fix bug of x broadcast to e, cpu grad
      
      * rename BOOST_GET_CONST macro
      
      * fix rocm ci
      
      * mv graph_send_e_recv to graph_send_ue_recv
      
      * move out_size to IntArray
      
      * add eager op test
      
      * fix max pool type bug, add unittest for api
      
      * revise api doc
      
      * add fp16 for atomic min and max, add unittest
      
      * add unittest
      
      * add fp16 support for graph_send_recv
      
      * fix unittest fp16 bug
      
      * change OutSizeTensor to Out_size
      
      * move E to Y
      
      * add copyright, fix comment
      
      * review code
      
      * fix thread block size
      
      * fix thread block size
      
      * change api attribute name: pool_type to reduce_op, compute_type to message_op
      
      * change api attribute name, move pool_type to reduce_op, move compute_type to message_op
  16. 09 Aug, 2022 1 commit
    • [geometric]Add paddle.geometric.send_u_recv API (#44580) · 34b43555
      Authored by Siming Dai
      * change out_size to IntArray
      
      * fix out_size eager bug
      
      * add unittest for out_size tensor
      
      * add deprecated for paddle.incubate.graph_send_recv, add paddle.geometric.send_u_recv and unittests
      
      * fix lowest bug
      
      * fix according to review comments
      
      * add default value in yaml
      
      * change api file name
      
      * change name
  17. 03 Aug, 2022 1 commit
  18. 01 Aug, 2022 1 commit
  19. 29 Jul, 2022 1 commit
  20. 28 Jul, 2022 1 commit
    • [PHI] Move spectral_norm to phi (#44577) · 768e50c9
      Authored by Lin Manhui
      * Add kernel declarations
      
      * Copy kernel implementation code
      
      * Transfer implementation code
      
      * Fix: Move out_grad to first
      
      * Register new kernels
      
      * Remove old kernels
      
      * Move out_grad to last
      
      * Fix bugs
      
      * Transfer infermeta
      
      * Add yaml files
      
      * Add blank line
      
      * Fix code style
      
      * Optimize directory structure
      Co-authored-by: Bobholamovic <linmanhui@baidu.com>
  21. 26 Jul, 2022 1 commit
  22. 12 Jul, 2022 1 commit
  23. 08 Jun, 2022 1 commit
  24. 05 Jun, 2022 1 commit
  25. 30 May, 2022 1 commit
  26. 27 May, 2022 1 commit
  27. 26 May, 2022 1 commit
  28. 12 Apr, 2022 1 commit
    • Add layer norm yaml (#41589) · 43d5cca6
      Authored by hong
      * add layer norm infermeta
      
      * add layer norm yaml
      
      * polish layer norm infer meta
      
      * add layer norm to black list
  29. 07 Apr, 2022 1 commit
  30. 03 Apr, 2022 1 commit
  31. 31 Mar, 2022 1 commit
  32. 28 Mar, 2022 1 commit
  33. 22 Mar, 2022 1 commit
    • [phi] Update graph_send_recv OP (#40509) · 67b46e45
      Authored by Siming Dai
      * add out_size shape for graph_send_recv
      
      * fix bug in register kernel: no const int& support
      
      * add out_size in infermeta
      
      * change unittest
      
      * fix unittest
      
      * fix out_size default value
      
      * fix doc
      
      * delete arg mapping
      
      * add sig
      
      * move -1 to 0
      
      * move -1 to 0
  34. 18 Mar, 2022 1 commit
  35. 16 Mar, 2022 1 commit
  36. 13 Mar, 2022 1 commit
  37. 11 Mar, 2022 1 commit
  38. 10 Mar, 2022 1 commit