1. 30 Aug, 2023 1 commit
  2. 12 Mar, 2023 1 commit
  3. 16 Jan, 2023 1 commit
    • CUDA12.0 integration (#49539) · 1885d55a
      Committed by zlsh80826
      * Update warpctc for cuda-12
      
      * Deprecate cudaProfilerInitialize for CUDA > 11
      
      * Deprecate CUSPARSE_MV_ALG_DEFAULT for CUDA_VERSION >= 11040
      
      * Add the missing thrust header
  4. 16 Nov, 2022 1 commit
  5. 31 Oct, 2022 1 commit
  6. 23 Aug, 2022 1 commit
  7. 16 Aug, 2022 1 commit
    • [geometric]Add paddle.geometric.send_uv API (#44848) · 88724a53
      Committed by Siming Dai
      * initial commit
      
      * fix op maker bug
      
      * fix mul grad bug
      
      * add unittest
      
      * fix add grad bug, add cpu kernel
      
      * add paddle.geometric.message_passing
      
      * add paddle.geometric.send_uv api, add unittest
      
      * add fp16 judgement
      
      * fix file typo, move compute_type to message_op
      
      * add impl file
      
      * fix unittest timeout time
      
      * add review revise
  8. 12 Aug, 2022 1 commit
    • [geometric]Add paddle.geometric.send_ue_recv API (#43174) · 615b15a3
      Committed by Siming Dai
      * add init file
      
      * add op definition and infermeta
      
      * add kernel definition funcs
      
      * add broadcast infer shape
      
      * add gpu forward kernel
      
      * delete SUB and DIV
      
      * add x_grad
      
      * add template
      
      * add e_grad for min and max
      
      * fix small bug
      
      * temp commit
      
      * temp commit
      
      * add e_grad for sum and mean
      
      * fix some compile bug
      
      * fix compile bugs
      
      * fix compile problem
      
      * add sum forward unittest
      
      * fix broadcast error, add kernel sig, register e_grad, change unit test
      
      * fix grad
      
      * add temp grad fix
      
      * temp commit
      
      * add min max unittest
      
      * add max, min unittest, fix mul bug
      
      * add cpu forward sum and mean
      
      * add forward min max, fix mean unittest
      
      * add cpu backward min max
      
      * fix code-style
      
      * add backward sum mean
      
      * fix rocm ci
      
      * set unittest timeout
      
      * fix bug of x broadcast to e, gpu grad
      
      * fix bug of x broadcast to e, cpu grad
      
      * rename BOOST_GET_CONST macro
      
      * fix rocm ci
      
      * mv graph_send_e_recv to graph_send_ue_recv
      
      * move out_size to IntArray
      
      * add eager op test
      
      * fix max pool type bug, add unittest for api
      
      * revise api doc
      
      * add fp16 for atomic min and max, add unittest
      
      * add unittest
      
      * add fp16 support for graph_send_recv
      
      * fix unittest fp16 bug
      
      * change OutSizeTensor to Out_size
      
      * move E to Y
      
      * add copyright, fix comment
      
      * review code
      
      * fix thread block size
      
      * fix thread block size
      
      * change api attribute name: pool_type to reduce_op, compute_type to message_op
      
      * change api attribute name, move pool_type to reduce_op, move compute_type to message_op