1. 12 8月, 2022 1 次提交
    • S
      [geometric]Add paddle.geometric.send_ue_recv API (#43174) · 615b15a3
      Siming Dai 提交于
      * add init file
      
      * add op definition and infermeta
      
      * add kernel definition funcs
      
      * add broadcast infer shape
      
      * add gpu forward kernel
      
      * delete SUB and DIV
      
      * add x_grad
      
      * add template
      
      * add e_grad for min and max
      
      * fix small bug
      
      * temp commit
      
      * temp commit
      
      * add e_grad for sum and mean
      
      * fix some compile bug
      
      * fix compile bugs
      
      * fix compile problem
      
      * add sum forward unittest
      
      * fix broadcast error, add kernel sig, register e_grad, change unit test
      
      * fix grad
      
      * add temp grad fix
      
      * temp commit
      
      * add min max unittest
      
      * add max, min unittest, fix mul bug
      
      * add cpu forward sum and mean
      
      * add forward min max, fix mean unittest
      
      * add cpu backward min max
      
      * fix code-style
      
      * add backward sum mean
      
      * fix rocm ci
      
      * set uniitest timeout
      
      * fix bug of x broadcast to e, gpu grad
      
      * fix bug of x broadcast to e, cpu grad
      
      * rename BOOST_GET_CONST macro
      
      * fix rocm ci
      
      * mv graph_send_e_recv to graph_send_ue_recv
      
      * move out_size to IntArray
      
      * add eager op test
      
      * fix max pool type bug, add unittest for api
      
      * revise api doc
      
      * add fp16 for atomic min and max, add unittest
      
      * add unittest
      
      * add fp16 support for graph_send_recv
      
      * fix unittest fp16 bug
      
      * change OutSizeTensor to Out_size
      
      * move E to Y
      
      * add copyright, fix comment
      
      * review code
      
      * fix thread block size
      
      * fix thread block size
      
      * change api attribute name: pool_type to reduce_op, compute_type to message_op
      
      * change api attribute name, move pool_type to reduce_op, move compute_type to message_op
      615b15a3
  2. 11 8月, 2022 1 次提交
  3. 02 8月, 2022 2 次提交
    • S
      move renorm op (#44676) · 669353c1
      seemingwang 提交于
      * move renorm op
      
      * change python api
      
      * change op class func
      
      * alloc data
      
      * remove comments
      
      * fix grad arguments
      
      * fix python argument
      
      * fix python argument
      
      * change unit-test
      
      * remove shape func registration
      
      * recover extra-arguments
      
      * recover shape functor
      669353c1
    • R
      Skip inplace for coalesce_tensor_op outputs (#44795) · bb22e59c
      Ruibiao Chen 提交于
      * Skip inplace for coalesce_tensor_op outputs
      
      * Fix typos
      
      * Add UTs
      
      * Fix typos
      bb22e59c
  4. 01 8月, 2022 1 次提交
  5. 28 7月, 2022 1 次提交
  6. 20 7月, 2022 1 次提交
  7. 13 7月, 2022 1 次提交
    • R
      Enable test_tensordot (#42932) · d4699bd6
      Ruibiao Chen 提交于
      * Enable test_tesnsordot
      
      * Fix CI errors
      
      * Disable windows inference test
      
      * Fix typos
      
      * Fix typos
      
      * Fix CI errors
      
      * Remove disable_wingpu11_test
      d4699bd6
  8. 11 7月, 2022 1 次提交
  9. 04 7月, 2022 1 次提交
  10. 27 6月, 2022 1 次提交
  11. 24 6月, 2022 1 次提交
    • G
      Fix hang bug of TCPStore (#43724) · 4c9330d6
      gongweibao 提交于
      * tmp fix
      
      * init
      
      * compile ok
      
      * compile ok
      
      * add vlogs
      
      * add test
      
      * fix termination error
      
      * add testfile
      
      * add
      
      * fix window compile
      
      * fix window compile
      
      * fix windows compile
      
      * fix windows compile
      
      * fix windows compile
      
      * fix windows compile
      
      * fix windows compile
      
      * fix windows compile
      
      * fix kunlun compile
      
      * fix compilation
      
      * fix compilation
      
      * fix compilation
      
      * tmp fix
      
      * add windows
      
      * add windows
      
      * add more logs
      
      * change timeout to protected
      
      * SB
      
      * add
      
      * add
      
      * fix timeout
      
      * add
      
      * fix test
      
      * fix test
      
      * fix test
      
      * fix ut
      
      * fix ut
      
      * fix ut
      4c9330d6
  12. 23 6月, 2022 1 次提交
  13. 22 6月, 2022 1 次提交
  14. 20 6月, 2022 2 次提交
  15. 14 6月, 2022 1 次提交
  16. 10 6月, 2022 2 次提交
  17. 07 6月, 2022 1 次提交
  18. 04 6月, 2022 1 次提交
  19. 30 5月, 2022 3 次提交
  20. 28 5月, 2022 1 次提交
  21. 27 5月, 2022 1 次提交
  22. 23 5月, 2022 1 次提交
  23. 16 5月, 2022 1 次提交
  24. 13 5月, 2022 1 次提交
  25. 10 5月, 2022 1 次提交
    • W
      [Eager] Refactor several sharding test (#42608) · 668a0a41
      Weilong Wu 提交于
      * [Eager] fix sharding under eager mode
      
      * [Eager] fix several sharding test under eager mode
      
      * Recover using _test_eager_guard
      
      * Ensured fleet.init under legacy
      
      * Ensured fleet.init under legacy
      
      * Fix CI issue, re-definition strategy and call fleet.init() in stage2_offload
      
      * Modified dygraph_group_sharded_api.py, move fleet.init to a better line
      668a0a41
  26. 05 5月, 2022 1 次提交
  27. 28 4月, 2022 1 次提交
  28. 26 4月, 2022 2 次提交
  29. 22 4月, 2022 1 次提交
    • M
      [WIP] Algorithm Cache of cuBlasLt Epilogue (#41010) · 19650d72
      Ming-Xu Huang 提交于
      * Fix leading dimension setting error in fused_gemm_epilogue_grad_op.
      
      * Add dyload to cuBlasLt functions.
      
      * Added cublasLtMatmulAlgoGetHeuristic to improve performance.
      
      * Added FLAGS_cublaslt_exhaustive_search_times to cublasLt epilogue
      
      * Added UTs to FLAGS_cublaslt_exhaustive_search_times
      
      * Added warmup runs in algo searching of Gemm epilogue.
      
      * Update copyright and documents.
      
      * Fixed error handling.
      19650d72
  30. 19 4月, 2022 1 次提交
  31. 13 4月, 2022 2 次提交
  32. 12 4月, 2022 1 次提交
  33. 08 4月, 2022 1 次提交