1. 25 May, 2023 1 commit
  2. 18 May, 2023 1 commit
    • R
      support auto generate for op layer_norm (#53178) · 4f07b653
      RedContritio committed
      * simplify layer_norm_op.cc
      
      * support auto generate for op layer_norm
      
      * update unittest for composite_layer_norm
      
      * remove layer_norm_op.cc from scripts
      
      * replace layer_norm_op with generated_op
      
      * add get_expected_kernel for layer_norm
      
      * update cmake kernel register function for layer_norm_mkldnn_op
  3. 22 April, 2023 1 commit
    • W
      [Zero-Dim] support output 0D for... · b406a7db
      wangfengsheng1999 committed
      [Zero-Dim] support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy, test=allcase (#52850)
      
      * [Zero-Dim] support output 0D for is_empty/as_complex/, test=allcase
      
      * [Zero-Dim] support output 0D for is_empty/as_complex/, test=allcase
      
      * add test case
      
      * modify dot/metric.accuracy/static.accuracy/static.auc
      
      * modify inner/tensordot bug
      
      * test 9 api
      
      * [Zero-Dim] support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy, test=allcase
      
      * fix bug
      
      * support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy
      
      * code style
      
      * fix bug
      
      * fix test_dot_op bug
      
      * fix accuracy bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * codestyle
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * code style
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * modify code
  4. 06 April, 2023 1 commit
  5. 29 March, 2023 1 commit
    • Y
      Add group_norm composite rule (#51874) · cabf3921
      Yichen Zhang committed
      * add group_norm composite rule
      
      * add test for scale_grad and bias_grad
      
      * resolve conflicts
      
      * remove amp in composite_rule.py
      
      * add float16 test
      
      * deal with NHWC format
      
      * keep the composite rule in float16 identical as original kernel
      
      * resolve conflicts
  6. 20 March, 2023 1 commit
    • X
      [prim] New layer_norm grad (#51750) · 802a81d0
      xiaoguoguo626807 committed
      * Add flatten composite rule
      
      * get the right xshape and pass func test
      
      * add cinn unit test
      
      * Remove cinn test, wait for it to be added after repair
      
      * add comp test to test_flatten_contiguous_range_op.py
      
      * remove func test on composite_ops
      
      * Add comments to maybe_wrap_dim func
      
      * remove commented code
      
      * fix the problem with 0D tensor case
      
      * add flatten split rule comment
      
      * fix syntax issues
      
      * block flatten on resnet_prim_cinn
      
      * init change
      
      * tmp commit
      
      * add layer_norm InferMeta check
      
      * cast type modify
      
      * [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)
      
      * [CINN]Enhance CacheKey hash logic by considering input dtypes
      
      * add unittest
      
      * fix typo
      
      * fix typo
      
      * fix map.at
      
      * fix find
      
      * fix test
      
      * fix cinn cache key structure realize
      
      * using ordered map for attributes
      
      * add test by review advice
      
      ---------
      Co-authored-by: jiangcheng <thisjiang@qq.com>
      
      * [prim] enable dygraph_to_static to support custom_vjp
      
      * Pr 50885 (#7)
      
      * [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)
      
      * [CINN]Enhance CacheKey hash logic by considering input dtypes
      
      * add unittest
      
      * fix typo
      
      * fix typo
      
      * fix map.at
      
      * fix find
      
      * fix test
      
      * fix cinn cache key structure realize
      
      * using ordered map for attributes
      
      * add test by review advice
      
      ---------
      Co-authored-by: jiangcheng <thisjiang@qq.com>
      
      * [prim] enable dygraph_to_static to support custom_vjp
      
      * fix code in a dy2static-friendly way.
      
      * [dystatic] add hooker for prim
      
      ---------
      Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
      Co-authored-by: jiangcheng <thisjiang@qq.com>
      Co-authored-by: cxxly <chenxx_id@163.com>
      
      * [prim] enable dygraph_to_static to support custom_vjp
      
      * fix cast prim and vjp dtype mapping error bug
      
      * recover
      
      * big tol
      
      * [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)
      
      * [CINN]Enhance CacheKey hash logic by considering input dtypes
      
      * add unittest
      
      * fix typo
      
      * fix typo
      
      * fix map.at
      
      * fix find
      
      * fix test
      
      * fix cinn cache key structure realize
      
      * using ordered map for attributes
      
      * add test by review advice
      
      ---------
      Co-authored-by: jiangcheng <thisjiang@qq.com>
      
      * [prim] enable dygraph_to_static to support custom_vjp
      
      * Pr 50885 (#7)
      
      * [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)
      
      * [CINN]Enhance CacheKey hash logic by considering input dtypes
      
      * add unittest
      
      * fix typo
      
      * fix typo
      
      * fix map.at
      
      * fix find
      
      * fix test
      
      * fix cinn cache key structure realize
      
      * using ordered map for attributes
      
      * add test by review advice
      
      ---------
      Co-authored-by: jiangcheng <thisjiang@qq.com>
      
      * [prim] enable dygraph_to_static to support custom_vjp
      
      * fix code in a dy2static-friendly way.
      
      * [dystatic] add hooker for prim
      
      ---------
      Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
      Co-authored-by: jiangcheng <thisjiang@qq.com>
      Co-authored-by: cxxly <chenxx_id@163.com>
      
      * [prim] enable dygraph_to_static to support custom_vjp
      
      * fix cast prim and vjp dtype mapping error bug
      
      * Cxx prim custom vjp (#8)
      
      * [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)
      
      ---------
      Co-authored-by: jiangcheng <thisjiang@qq.com>
      
      * [prim] enable dygraph_to_static to support custom_vjp
      
      * Pr 50885 (#7)
      
      * [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)
      
      * [CINN]Enhance CacheKey hash logic by considering input dtypes
      
      ---------
      Co-authored-by: jiangcheng <thisjiang@qq.com>
      
      * [prim] enable dygraph_to_static to support custom_vjp
      
      * fix code in a dy2static-friendly way.
      
      * [dystatic] add hooker for prim
      
      ---------
      Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
      Co-authored-by: jiangcheng <thisjiang@qq.com>
      Co-authored-by: cxxly <chenxx_id@163.com>
      
      * [prim] enable dygraph_to_static to support custom_vjp
      
      * fix cast prim and vjp dtype mapping error bug
      
      * [dy2static-ci] fix dy2static ci errors.
      
      ---------
      Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
      Co-authored-by: jiangcheng <thisjiang@qq.com>
      Co-authored-by: cxxly <chenxx_id@163.com>
      
      * [Prim] enable whitelist and blacklist for custom_vjp
      
      * debug log
      
      * clear log
      
      * fix
      
      * nothing
      
      * less memory
      
      * recover utils
      
      * fix
      
      * modify threshold value
      
      * skip layer_norm for test_bert
      
      * back to bert success state
      
      * add epsilon
      
      * delete unnecessary compute
      
      * modify amp dtype
      
      * modify * order
      
      * delete sqrt check and fp16
      
      ---------
      Co-authored-by: xuyongsheng <xuyongsheng@baidu.com>
      Co-authored-by: xysheng-baidu <121540080+xysheng-baidu@users.noreply.github.com>
      Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
      Co-authored-by: jiangcheng <thisjiang@qq.com>
      Co-authored-by: cxxly <chenxx_id@163.com>
      Co-authored-by: xiongkun <807377414@qq.com>
  7. 16 March, 2023 1 commit
  8. 14 March, 2023 1 commit
  9. 01 March, 2023 1 commit
    • C
      Integration flash attention (#49869) · 61611786
      Chitsing KUI committed
      * flash attn
      
      * seed
      
      * almost
      
      * softmax
      
      * fix workspace
      
      * add unittest; Linux only
      
      * fix setup
      
      * fix datatype include
      
      * fix setup typo
      
      * fix def scope
      
      * new error api
      
      * use paddle fork
      
      * fix attr bug; complete ut
      
      * update flash hash
      
      * fix rng reset
      
      * fix offset
      
      * fix comments
  10. 12 January, 2023 1 commit
    • S
      lerp support 0 Tensor (#49667) · 8cd0d5b3
      sunli committed
      * lerp support 0 Tensor
      
      * fix lerp grad
      
      * fix lerp zero test
      
      * fix 0D + ND/ND + 0D
      
      * fix check
      
      * update code
      
      * fix lerp infer shape
      
      * static backward test
      
      * update static graph test
  11. 16 December, 2022 1 commit
  12. 03 December, 2022 1 commit
  13. 17 November, 2022 1 commit
  14. 15 November, 2022 1 commit
  15. 14 November, 2022 1 commit
  16. 01 November, 2022 1 commit
  17. 31 October, 2022 1 commit
  18. 17 October, 2022 1 commit
  19. 12 August, 2022 3 commits
    • Z
      refix index resize in multiclassnms3 (#45095) · 49e2a4d8
      zhiboniu committed
    • Z
      fix extra output of kernels for inference (#45048) · 1cb883da
      zyfncg committed
    • S
      [geometric]Add paddle.geometric.send_ue_recv API (#43174) · 615b15a3
      Siming Dai committed
      * add init file
      
      * add op definition and infermeta
      
      * add kernel definition funcs
      
      * add broadcast infer shape
      
      * add gpu forward kernel
      
      * delete SUB and DIV
      
      * add x_grad
      
      * add template
      
      * add e_grad for min and max
      
      * fix small bug
      
      * temp commit
      
      * temp commit
      
      * add e_grad for sum and mean
      
      * fix some compile bug
      
      * fix compile bugs
      
      * fix compile problem
      
      * add sum forward unittest
      
      * fix broadcast error, add kernel sig, register e_grad, change unit test
      
      * fix grad
      
      * add temp grad fix
      
      * temp commit
      
      * add min max unittest
      
      * add max, min unittest, fix mul bug
      
      * add cpu forward sum and mean
      
      * add forward min max, fix mean unittest
      
      * add cpu backward min max
      
      * fix code-style
      
      * add backward sum mean
      
      * fix rocm ci
      
      * set unittest timeout
      
      * fix bug of x broadcast to e, gpu grad
      
      * fix bug of x broadcast to e, cpu grad
      
      * rename BOOST_GET_CONST macro
      
      * fix rocm ci
      
      * mv graph_send_e_recv to graph_send_ue_recv
      
      * move out_size to IntArray
      
      * add eager op test
      
      * fix max pool type bug, add unittest for api
      
      * revise api doc
      
      * add fp16 for atomic min and max, add unittest
      
      * add unittest
      
      * add fp16 support for graph_send_recv
      
      * fix unittest fp16 bug
      
      * change OutSizeTensor to Out_size
      
      * move E to Y
      
      * add copyright, fix comment
      
      * review code
      
      * fix thread block size
      
      * fix thread block size
      
      * change api attribute name: pool_type to reduce_op, compute_type to message_op
      
      * change api attribute name, move pool_type to reduce_op, move compute_type to message_op
  20. 09 August, 2022 1 commit
    • S
      [geometric]Add paddle.geometric.send_u_recv API (#44580) · 34b43555
      Siming Dai committed
      * change out_size to INTArray
      
      * fix out_size eager bug
      
      * add unittest for out_size tensor
      
      * add deprecated for paddle.incubate.graph_send_recv, add paddle.geometric.send_u_recv and unittests
      
      * fix lowest bug
      
      * fix according to review comments
      
      * add default value in yaml
      
      * change api file name
      
      * change name
  21. 03 August, 2022 1 commit
  22. 01 August, 2022 1 commit
  23. 29 July, 2022 1 commit
  24. 28 July, 2022 1 commit
    • L
      [PHI] Move spectral_norm to phi (#44577) · 768e50c9
      Lin Manhui committed
      * Add kernel declarations
      
      * Copy kernel implementation code
      
      * Transfer implementation code
      
      * Fix: Move out_grad to first
      
      * Register new kernels
      
      * Remove old kernels
      
      * Move out_grad to last
      
      * Fix bugs
      
      * Transfer infermeta
      
      * Add yaml files
      
      * Add blank line
      
      * Fix code style
      
      * Optimize directory structure
      Co-authored-by: Bobholamovic <linmanhui@baidu.com>
  25. 26 July, 2022 1 commit
  26. 12 July, 2022 1 commit
  27. 08 June, 2022 1 commit
  28. 05 June, 2022 1 commit
  29. 30 May, 2022 1 commit
  30. 27 May, 2022 1 commit
  31. 26 May, 2022 1 commit
  32. 12 April, 2022 1 commit
    • H
      Add layer norm yaml (#41589) · 43d5cca6
      hong committed
      * add layer norm infermeta
      
      * add layer norm yaml
      
      * polish layer norm infer meta
      
      * add layer norm to black list
  33. 07 April, 2022 1 commit
  34. 03 April, 2022 1 commit
  35. 31 March, 2022 1 commit
  36. 28 March, 2022 1 commit
  37. 22 March, 2022 1 commit
    • S
      [phi] Update graph_send_recv OP (#40509) · 67b46e45
      Siming Dai committed
      * add out_size shape for graph_send_recv
      
      * fix bug in register kernel: no const int& support
      
      * add out_size in infermeta
      
      * change unittest
      
      * fix unittest
      
      * fix out_size default value
      
      * fix doc
      
      * delete arg mapping
      
      * add sig
      
      * move -1 to 0
      
      * move -1 to 0
  38. 18 March, 2022 1 commit