1. 26 8月, 2022 2 次提交
    • R
      move collective tests into a collective directory (#45223) · 9eb4d89b
      Roc 提交于
      * add simple reformated ci files
      
      * update
      
      * add radme for new unitetsts
      
      * add radme for new unitetsts
      
      * add radme for new unitetsts
      
      * reset mlu
      
      * update for samples
      
      * add base api
      
      * reset some dist unit tests
      
      * add warning in grenerated cmakelists file
      
      * update readme for new dist unit tests
      
      * add all collective tests
      
      * remain base file and launcher file
      
      * Update README.md
      
      * Update README.md
      
      * fix env PYTHONPATH
      
      * Update gen_ut_cmakelists.py
      
      * add all collective tests
      
      * add docs for gen_ut_cmakelists.py
      
      * pretify codes
      
      * commont name == "name"
      
      * update for comments
      
      * update function's help
      
      * update for run type
      
      * update readme
      
      * add all collective tests
      
      * add all collective tests
      
      * mv  collective test files
      
      * update for all collective tests
      
      * update
      
      * update
      
      * update
      
      * update for all tests
      
      * update for checking name
      
      * Update Cmakelists.txt
      
      * update testlist.csv
      
      * remain test_parallel_dygraph_dataparallel in unittests
      
      * set broadcast op all platforms
      
      * update
      
      * remain test_broadcast_tensors_op
      
      * fix
      
      * rm some collective files
      
      * update more colective tests
      
      * update
      
      * update
      
      * update
      gen_ut_supports recursion
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix nccl version
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix a bug and try to pass
      
      * update
      
      * add csv
      
      * update for timeout
      
      * remove tcp store
      
      * fix
      
      * fix
      
      * update
      
      * update
      
      * update for more dist tests
      
      * move multi node tests
      
      * update
      
      * update
      
      * update
      
      * fix for auto parallele
      
      * update
      
      * update path in python file
      
      * update
      
      * reset some test in unittests
      
      * fix
      
      * update readme
      
      * fix
      
      * update
      
      * fix port
      9eb4d89b
    • Y
  2. 18 8月, 2022 1 次提交
  3. 16 8月, 2022 1 次提交
    • S
      [geometric]Add paddle.geometric.send_uv API (#44848) · 88724a53
      Siming Dai 提交于
      * initial commit
      
      * fix op maker bug
      
      * fix mul grad bug
      
      * add unittest
      
      * fix add grad bug, add cpu kernel
      
      * add paddle.geometric.message_passing
      
      * add paddle.geometric.send_uv api, add unittest
      
      * add fp16 judgement
      
      * fix file typo, move compute_type to message_op
      
      * add impl file
      
      * fix unittest timeout time
      
      * add review revise
      88724a53
  4. 12 8月, 2022 2 次提交
    • C
      Add Quant Row&Column ParallelLinear (#44869) · 236ad4fc
      Chang Xu 提交于
      236ad4fc
    • S
      [geometric]Add paddle.geometric.send_ue_recv API (#43174) · 615b15a3
      Siming Dai 提交于
      * add init file
      
      * add op definition and infermeta
      
      * add kernel definition funcs
      
      * add broadcast infer shape
      
      * add gpu forward kernel
      
      * delete SUB and DIV
      
      * add x_grad
      
      * add template
      
      * add e_grad for min and max
      
      * fix small bug
      
      * temp commit
      
      * temp commit
      
      * add e_grad for sum and mean
      
      * fix some compile bug
      
      * fix compile bugs
      
      * fix compile problem
      
      * add sum forward unittest
      
      * fix broadcast error, add kernel sig, register e_grad, change unit test
      
      * fix grad
      
      * add temp grad fix
      
      * temp commit
      
      * add min max unittest
      
      * add max, min unittest, fix mul bug
      
      * add cpu forward sum and mean
      
      * add forward min max, fix mean unittest
      
      * add cpu backward min max
      
      * fix code-style
      
      * add backward sum mean
      
      * fix rocm ci
      
      * set uniitest timeout
      
      * fix bug of x broadcast to e, gpu grad
      
      * fix bug of x broadcast to e, cpu grad
      
      * rename BOOST_GET_CONST macro
      
      * fix rocm ci
      
      * mv graph_send_e_recv to graph_send_ue_recv
      
      * move out_size to IntArray
      
      * add eager op test
      
      * fix max pool type bug, add unittest for api
      
      * revise api doc
      
      * add fp16 for atomic min and max, add unittest
      
      * add unittest
      
      * add fp16 support for graph_send_recv
      
      * fix unittest fp16 bug
      
      * change OutSizeTensor to Out_size
      
      * move E to Y
      
      * add copyright, fix comment
      
      * review code
      
      * fix thread block size
      
      * fix thread block size
      
      * change api attribute name: pool_type to reduce_op, compute_type to message_op
      
      * change api attribute name, move pool_type to reduce_op, move compute_type to message_op
      615b15a3
  5. 11 8月, 2022 1 次提交
  6. 02 8月, 2022 2 次提交
    • S
      move renorm op (#44676) · 669353c1
      seemingwang 提交于
      * move renorm op
      
      * change python api
      
      * change op class func
      
      * alloc data
      
      * remove comments
      
      * fix grad arguments
      
      * fix python argument
      
      * fix python argument
      
      * change unit-test
      
      * remove shape func registration
      
      * recover extra-arguments
      
      * recover shape functor
      669353c1
    • R
      Skip inplace for coalesce_tensor_op outputs (#44795) · bb22e59c
      Ruibiao Chen 提交于
      * Skip inplace for coalesce_tensor_op outputs
      
      * Fix typos
      
      * Add UTs
      
      * Fix typos
      bb22e59c
  7. 01 8月, 2022 1 次提交
  8. 28 7月, 2022 1 次提交
  9. 20 7月, 2022 1 次提交
  10. 13 7月, 2022 1 次提交
    • R
      Enable test_tensordot (#42932) · d4699bd6
      Ruibiao Chen 提交于
      * Enable test_tesnsordot
      
      * Fix CI errors
      
      * Disable windows inference test
      
      * Fix typos
      
      * Fix typos
      
      * Fix CI errors
      
      * Remove disable_wingpu11_test
      d4699bd6
  11. 11 7月, 2022 1 次提交
  12. 04 7月, 2022 1 次提交
  13. 27 6月, 2022 1 次提交
  14. 24 6月, 2022 1 次提交
    • G
      Fix hang bug of TCPStore (#43724) · 4c9330d6
      gongweibao 提交于
      * tmp fix
      
      * init
      
      * compile ok
      
      * compile ok
      
      * add vlogs
      
      * add test
      
      * fix termination error
      
      * add testfile
      
      * add
      
      * fix window compile
      
      * fix window compile
      
      * fix windows compile
      
      * fix windows compile
      
      * fix windows compile
      
      * fix windows compile
      
      * fix windows compile
      
      * fix windows compile
      
      * fix kunlun compile
      
      * fix compilation
      
      * fix compilation
      
      * fix compilation
      
      * tmp fix
      
      * add windows
      
      * add windows
      
      * add more logs
      
      * change timeout to protected
      
      * SB
      
      * add
      
      * add
      
      * fix timeout
      
      * add
      
      * fix test
      
      * fix test
      
      * fix test
      
      * fix ut
      
      * fix ut
      
      * fix ut
      4c9330d6
  15. 23 6月, 2022 1 次提交
  16. 22 6月, 2022 1 次提交
  17. 20 6月, 2022 2 次提交
  18. 14 6月, 2022 1 次提交
  19. 10 6月, 2022 2 次提交
  20. 07 6月, 2022 1 次提交
  21. 04 6月, 2022 1 次提交
  22. 30 5月, 2022 3 次提交
  23. 28 5月, 2022 1 次提交
  24. 27 5月, 2022 1 次提交
  25. 23 5月, 2022 1 次提交
  26. 16 5月, 2022 1 次提交
  27. 13 5月, 2022 1 次提交
  28. 10 5月, 2022 1 次提交
    • W
      [Eager] Refactor several sharding test (#42608) · 668a0a41
      Weilong Wu 提交于
      * [Eager] fix sharding under eager mode
      
      * [Eager] fix several sharding test under eager mode
      
      * Recover using _test_eager_guard
      
      * Ensured fleet.init under legacy
      
      * Ensured fleet.init under legacy
      
      * Fix CI issue, re-definition strategy and call fleet.init() in stage2_offload
      
      * Modified dygraph_group_sharded_api.py, move fleet.init to a better line
      668a0a41
  29. 05 5月, 2022 1 次提交
  30. 28 4月, 2022 1 次提交
  31. 26 4月, 2022 2 次提交
  32. 22 4月, 2022 1 次提交
    • M
      [WIP] Algorithm Cache of cuBlasLt Epilogue (#41010) · 19650d72
      Ming-Xu Huang 提交于
      * Fix leading dimension setting error in fused_gemm_epilogue_grad_op.
      
      * Add dyload to cuBlasLt functions.
      
      * Added cublasLtMatmulAlgoGetHeuristic to improve performance.
      
      * Added FLAGS_cublaslt_exhaustive_search_times to cublasLt epilogue
      
      * Added UTs to FLAGS_cublaslt_exhaustive_search_times
      
      * Added warmup runs in algo searching of Gemm epilogue.
      
      * Update copyright and documents.
      
      * Fixed error handling.
      19650d72