1. 16 3月, 2023 3 次提交
  2. 15 3月, 2023 7 次提交
  3. 14 3月, 2023 2 次提交
  4. 13 3月, 2023 4 次提交
    • TaoTao Li's avatar
      Add phi operator all_gather (#51420) · afa26a59
      TaoTao Li 提交于
      * add all_gather and fix conflicts
      
      * fix code format
      
      * fix ut
      
      * fix broadcast ut
      afa26a59
    • H
      【prim】Maximum grad (#51006) · 4a484973
      heyanru 提交于
      * refresh
      
      * compat
      
      * register
      
      * testop
      
      * fix
      
      * fix
      
      * fox
      
      * cast
      
      * cast
      
      * fix
      
      * type
      
      * fix
      
      * out
      
      * cast
      
      * fix
      
      * fix
      
      * fix
      
      * broad
      
      * broad
      
      * broad
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * broad
      
      * broad
      
      * numel
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * cinn
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      4a484973
    • H
      Add from_blob api for constructing tensor from data pointer (#51085) · 74442f5e
      Huang Jiyi 提交于
      * add from_blob
      
      * fix test
      
      * fix test
      
      * fix codestyle
      
      * add gpu test
      
      * fix test
      
      * update
      
      * add comment
      
      * fix comment
      
      * update comment
      
      * fix CI bug
      
      * add thread_local
      
      * update
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix cmake
      
      * fix CI-Py3 make
      
      * update
      
      * use api_reg
      
      * fix include
      
      * update
      
      * update
      
      * update
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      74442f5e
    • S
      Fused softplus (#51087) · fdcfa04f
      Sławomir Siwek 提交于
      * mkldnn->onednn
      
      * fused softplus op + kernel
      
      * remove extra attributes
      
      * add missing handler
      
      * change var name
      fdcfa04f
  5. 10 3月, 2023 2 次提交
    • HappyHeavyRain's avatar
      [New features]Add function node in phi_kernel for MKLDNN (#51073) · a0a6dc6a
      HappyHeavyRain 提交于
      * Add function node in phi_kernel for MKLDNN
      
      * fix the bug in 'BuildInferVarKernelContext'
      
      * add infer_varkernel_utils.cc
      
      * fix the bug:the first two parametes of 'BuildInferVarKernelContext' can't be template variable
      
      * change the code according to first review
      
      * change the code according to first review
      
      * change the mode of paddle_build.sh
      
      * change 'infer_var_kernel_fn_' to 'get_kerneltype_forvar_fn_'
      
      * add the error information
      
      * fix NotFound infomation warning
      
      * fix NotFound infomation warning
      
      * fix NotFound infomation warning
      a0a6dc6a
    • C
      add flashattn raw kernel (#51383) · f951832d
      Chitsing KUI 提交于
      f951832d
  6. 09 3月, 2023 7 次提交
  7. 08 3月, 2023 2 次提交
  8. 07 3月, 2023 1 次提交
  9. 06 3月, 2023 3 次提交
    • M
      implement floor_grad by primitive logic (#51059) · 769e24ce
      Meteor Liu 提交于
      * implement floor_grad by primitive logic
      
      * implement floor_grad by primitive logic
      
      * Merge branch 'develop' into floor_grad
      769e24ce
    • N
      Add multiprecision for adadelta op (#50131) · a8a2b7f4
      niuliling123 提交于
      a8a2b7f4
    • H
      [phi decoupling] decouple dependency to device_context in phi (Part 1) (#50865) · a1006b2b
      Huang Jiyi 提交于
      * move DeviceContextPool to phi
      
      * add EmplaceExternalContextFunc
      
      * update namespace
      
      * update cmake
      
      * fix bugs and create context_pool_impl.h
      
      * replace platform::is_xxx_place
      
      * fix bugs
      
      * update generator
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix enforce usage
      
      * Revert "fix enforce usage"
      
      This reverts commit 5f521f08a69713cee506e64a00ec6d9fba709e27.
      
      * fix bugs
      
      * rm XPUDeviceContext and CustomDeviceContext
      
      * fix bugs
      
      * fix fix context init bug
      
      * fix bugs after merge
      
      * fix bugs
      
      * fix name
      
      * fix mutable_data
      
      * update and fix bugs
      
      * fix bugs
      
      * update
      
      * fix bugs
      
      * fix name
      
      * fix bugs
      
      * merge
      
      * fix bugs
      
      * create context_pool in phi/backends
      
      * create context_pool in phi/backends
      
      * fix bugs
      
      * fix xpu bugs
      
      * fix rocm bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix xpu bugs
      
      * update
      
      * update
      
      * fix bugs
      
      * fix bugs
      a1006b2b
  10. 03 3月, 2023 2 次提交
  11. 02 3月, 2023 2 次提交
    • H
      [GetCurrentCUDAStream] Add C++ API GetCurrentCUDAStream (#51027) · cce2b94d
      HongyuJia 提交于
      * polish codes according #50813
      
      * [getCurrentCUDAStream] Add C++ API getCurrentCUDAStream
      
      * change get->Get
      
      * wrap with macro
      
      * use Get instead of get
      cce2b94d
    • W
      Add concat grad cinn (#50972) · a4689c90
      wangzhen38 提交于
      * [cinn] concat_grad
      
      * [cinn] concat_grad
      
      * [cinn] concat_grad build success
      
      * [Add PGLBOX] fix unnitest
      
      * [Add PGLBOX] fix unnitest
      
      * [Add PGLBOX] fix codestyle
      
      * [cinn] update by comments
      
      * [cinn] update by comment
      
      * [cinn] add axis check
      a4689c90
  12. 01 3月, 2023 5 次提交
    • C
      Integration flash attention (#49869) · 61611786
      Chitsing KUI 提交于
      * flash attn
      
      * seed
      
      * almost
      
      * softmax
      
      * fix workspace
      
      * add unitest; linux only
      
      * fix setup
      
      * fix datatype include
      
      * fix setup typo
      
      * fix def scope
      
      * new error api
      
      * use paddle fork
      
      * fix attr bug; complete ut
      
      * update flash hash
      
      * fix rng reset
      
      * fix offset
      
      * fix comments
      61611786
    • H
      [Tensor Operants & Prim-Relevant] Tensor supports logical operants (#50983) · 1794927b
      HongyuJia 提交于
      * Add comments for #50886
      
      * [Tensor Operants & Prim-Relevant] Tensor supports logical operants
      
      * add prim dynamic unit test
      
      * add prim static unit test
      1794927b
    • Z
      add topk prim backward (#50679) · 296b3ff0
      zqw_1997 提交于
      * tmp gather vjp
      
      * support gather
      
      * remove useless code
      
      * fix compiling error
      
      * fix ut
      
      * add eager test
      
      * add eager test
      
      * add seed
      
      * small change
      
      * fix cpu error
      
      * fix transpose op compat
      
      * remove tensor index case
      
      * fix prim_cinn
      
      * small commit
      
      * add cumsum prim backward
      
      * small commit
      
      * skip aixs=None test case
      
      * fix op generante eror
      
      * fix static test error
      
      * remove unused code
      
      * fix static test error
      
      * small commit
      
      * skip cpu float16 test case
      
      * skip eager cpu cumsum float16 test case
      
      * add eager and static UT
      
      * fix ut
      
      * add composite backward rule
      
      * fix error
      
      * fix type error and format error
      
      * add try cpu+float16 test
      
      * fix test bugs
      
      * remove test for cpu+float16 and make y[0] be the grad arg
      
      * add cinn test
      
      * fix UT
      
      * fix the wrong dim of v in test cases
      
      * change y[0] to y[1] for grad in UT
      
      * reshape flatten out
      
      * Disable cinn single test
      
      * use scatter_nd_add
      
      * modify the reshape part of topk_grad
      
      * delete useless build file
      
      * to make the syntax right
      
      * modify bug
      
      * try use of put_along_axis
      
      * remove cinn test
      
      * reformat todo
      
      * add silu composite rule
      
      * fix code style.
      
      * add cinn test
      
      * fix composite grad maker code gen
      
      * add prim in cumsum op test
      
      * remove old test
      
      * fix typro
      
      * pass the static test
      
      * fix typro
      
      * modify optest and delete old test files
      
      * remove normal test_top_k_op test
      
      * fix typro
      
      * pass axis=None test case
      
      * buffer comment
      
      * for debug
      
      * add silu fp16 unit test.
      
      * add static guard
      
      * remove forward prim test
      
      * remove same name axis
      
      * modify the test_top_v2_op.py to pass all local tests
      
      * delete the useless testcase
      
      * fix mistake
      
      * add more testcases to test dtype16 and dtype32
      
      ---------
      Co-authored-by: NJiabinYang <360788950@qq.com>
      Co-authored-by: NGGBond8488 <857631483@qq.com>
      Co-authored-by: Nzxcd <228587199@qq.com>
      Co-authored-by: NCharles-hit <wanghao107@baidu.com>
      296b3ff0
    • C
      add op map (#51026) · 83f61bd5
      cyber-pioneer 提交于
      83f61bd5
    • N
      Add multiprecision for rms op (#50132) · 48060b2e
      niuliling123 提交于
      48060b2e