- 30 8月, 2023 1 次提交
-
-
由 Ghost Screaming 提交于
* for verify fluid operator support new comm library * u * u * u * compatiable new comm library upgrade for c_allgather, c_reduce, c_reduce_scatter and c_scatter. * Remove useless comments in process_group.py * Polish code style. * Fix some problems. * Remove use fluid api in phi comm_context_manager. * Add PPADDLE_WITH_CUDA and PADDLE_WITH_NCCL micro judgement. * Fix bug of HIP architecture. * Fix some problems. 1. remove useless loggings. 2. Fix conditional compilation for HIP. 3. Fix problems of test_pass_generation_pipeline.py. It calls paddle.distributed.init_parallel_env() at first, then auto.Engine calls _init_comm(), which will calls process_group.instantiate(). However, init_parallel_env() will call paddle.distributed.barrier(), it will call CreateNCCLEnvCache and create corresponding NCCLCommContext. But dev_id is not set, as a result, NCCLCommContext's dev_ctx is not initialized. * Fix some problems. * Polish code. * Polish code. * Revert compatiable upgrade for communication operators. Their upgrades will be submitted in another PR. * Remove StaticTCPStore. * Remove useless modification. * Remove useless set_cuda_device_id. * Polish code. * Remove fluid header files in phi files. * Remove useless comments. * Fix problems of hip arch. * Fix some problems. * Polish code. * Polish code style. --------- Co-authored-by: hitywt <yuwentao126@126.com>
-
- 25 8月, 2023 1 次提交
-
-
由 ronnywang 提交于
-
- 07 8月, 2023 1 次提交
-
-
由 LiYuRio 提交于
* make tcp store a global instance * fix windows compile error
-
- 01 8月, 2023 1 次提交
-
-
由 LiYuRio 提交于
* use string as key for comm_context_manager * remove device_id from comm_context
-
- 24 5月, 2023 1 次提交
-
-
由 Frank Lin 提交于
-
- 16 5月, 2023 1 次提交
-
-
由 zhenhailiu 提交于
* polish * polish
-
- 14 2月, 2023 1 次提交
-
-
由 HongyuJia 提交于
-
- 10 1月, 2023 1 次提交
-
-
由 Wen Sun 提交于
* refactor: gloo comm context migration * fix: headers & avoid mutable_data usage * fix: cmake gloo dep * style: rename funcs * refactor: move to new files * fix: gloo deps * refactor: simplify create device
-
- 09 1月, 2023 1 次提交
-
-
由 LiYuRio 提交于
* comm_context and static init * refactor: move to phi/core/distributed * refactor: avoid mutable_data usage * fix: windows sock * fix: device without nccl Co-authored-by: Wen Sun <syl1887415157@126.com>
-
- 24 6月, 2022 1 次提交
-
-
由 gongweibao 提交于
* tmp fix * init * compile ok * compile ok * add vlogs * add test * fix termination error * add testfile * add * fix window compile * fix window compile * fix windows compile * fix windows compile * fix windows compile * fix windows compile * fix windows compile * fix windows compile * fix kunlun compile * fix compilation * fix compilation * fix compilation * tmp fix * add windows * add windows * add more logs * change timeout to protected * SB * add * add * fix timeout * add * fix test * fix test * fix test * fix ut * fix ut * fix ut
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 13 5月, 2022 1 次提交
-
-
由 lilong12 提交于
* update
-
- 07 3月, 2022 1 次提交
-
-
由 lilong12 提交于
-
- 03 3月, 2022 1 次提交
-
-
由 lilong12 提交于
* add pg_gloo
-
- 22 2月, 2022 1 次提交
-
-
由 lilong12 提交于
* add tcp_socket and tcp_store
-
- 30 7月, 2020 1 次提交
-
-
由 tangwei12 提交于
Integrated Trainer of Parameter Server (API add `fluid.contrib.layers.sparse_embedding` only) (#22957) * Integrated Trainer of Parameter Server
-
- 30 9月, 2019 1 次提交
-
-
由 Chengmo 提交于
* refector geo sgd & communicator
-
- 23 5月, 2019 1 次提交
-
-
由 Qiao Longfei 提交于
Async exe support communicator
-
- 07 4月, 2018 1 次提交
-
-
由 Yi Wang 提交于
* cpplint test and add tesnor_py_test.cc * Update * Update
-
- 12 2月, 2018 1 次提交
-
-
由 qingqing01 提交于
-
- 10 2月, 2018 2 次提交
- 26 12月, 2017 1 次提交
-
-
由 Luo Tao 提交于
-
- 28 9月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 27 9月, 2017 1 次提交
-
-
由 Yu Yang 提交于
-
- 25 7月, 2017 1 次提交
-
-
由 liaogang 提交于
-
- 17 7月, 2017 2 次提交
-
-
由 Yu Yang 提交于
-
由 Yan Chunwei 提交于
* add NDEBUG switch to PADDLE_ENFORCE
-
- 11 7月, 2017 2 次提交
-
-
由 dongzhihong 提交于
-
由 dongzhihong 提交于
-
- 06 7月, 2017 2 次提交
- 05 7月, 2017 1 次提交
-
-
由 liaogang 提交于
-
- 04 7月, 2017 4 次提交
- 03 7月, 2017 1 次提交
-
-
由 liaogang 提交于
* Free will be added soon
-
- 28 6月, 2017 2 次提交