- 17 10月, 2022 1 次提交
-
-
由 Wen Sun 提交于
* Support both use_calc_stream and sync_op in send recv APIs (#46023) * Support both use_calc_stream and sync_op in allgather API (#46295) * Support both use_calc_stream and sync_op in collective communication API (#46761) * Move group and all reduce from collective to communication (#45848) * Completes bfloat16 dtype for collective api in eager mode (#45844) * Fix collective APIs cannot be recognized when building docs (#46962) Co-authored-by: NLiYuRio <63526175+LiYuRio@users.noreply.github.com>
-
- 07 9月, 2022 1 次提交
-
-
由 LiYuRio 提交于
-
- 06 9月, 2022 1 次提交
-
-
由 Wen Sun 提交于
-
- 01 9月, 2022 2 次提交
-
-
由 sneaxiy 提交于
* make dense_contents_ lazy init * update legacy dygraph * fix legacy dygraph bug
-
由 wangguanqun 提交于
* config * fix unittest * zero init & cache & patch config * add barrier to save and load * add unittest
-
- 31 8月, 2022 1 次提交
-
-
由 LiYuRio 提交于
-
- 26 8月, 2022 1 次提交
-
-
由 danleifeng 提交于
-
- 25 8月, 2022 1 次提交
-
-
由 danleifeng 提交于
* update brpc version;test=develop
-
- 23 8月, 2022 2 次提交
- 22 8月, 2022 1 次提交
-
-
由 ronnywang 提交于
-
- 17 8月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 16 8月, 2022 1 次提交
-
-
由 Yulong Ao 提交于
-
- 15 8月, 2022 1 次提交
-
-
由 houj04 提交于
* [XPU] add some collective ops. test=kunlun * use XPUOpTestWrapper. test=kunlun * skip kl1 for collective ops. fix typo: deivce -> device. test=kunlun
-
- 13 8月, 2022 1 次提交
-
-
由 ziyoujiyi 提交于
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud commm ready * . * . * fl-ps v1.0 * . * support N + N mode * . * . * . * . * delete print * . * . * . * . * fix bug * . * . * fl-ps with coordinator ready * merge dev * update message parse only * update fl client scheduler * fix bug * update multithreads sync * fix ci errors * update role_maker.py * update role_maker.py * fix ci error: windows py import error * fix ci error: windows py import error * fix windows ci pylib import error * add dump fields & params * try to fix windows import fleet error * fix ps FLAGS error * fix logging risk * fix logging possible risk * write trainer_desc file * support split sparse params in local & remote * fix import paddle.fluid.core.PSGPU * fix import paddle.fluid.core.PSGPU * add remote_sparse & local_sparse config * fix unittest * fix test_dist_fleet_geo table error * fix PADDLE_ENFORCE error * fix other's pr conflict
-
- 12 8月, 2022 2 次提交
-
-
由 LiYuRio 提交于
-
由 Yulong Ao 提交于
* [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh * [Auto Parallel] Fix the unittest problem * [Auto Parallel] Explicitly add the src file for auto_parallel target * [Auto Parallel] Add the proto depedency explicitly * [Auto Parallel] Fix the cmake bug on windows and mac * [Auto Parallel] Remove the pybind11 header file in process_mesh.h
-
- 11 8月, 2022 1 次提交
-
-
由 zhaocaibei123 提交于
* first refine * second refine * remove some code unuseful
-
- 09 8月, 2022 2 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Add the c++ dist attrs * [Auto Parallel] Remove some codes to be less than 1000 lines
-
由 zhaocaibei123 提交于
* save load * save load * add unittest * first commit * second commit * third commit * remove SaveLocalFS in memory sparse table * save dense param * update * push slot * fix push show clk: int -> float * add unittest * fix sample * unittest * add AsExtra for op * unittest * modify fs.py * modify fs.py * fix some bugs * add dataset hdfs config * local change * dataset use differenct hadoop ugi/fs_name * add * fix conflict * fix * remove logs * code style * fix * code style * code style * fix * code style * save_dense_param * fix * fix * fix * fix * change momentum in dense optimzer * fix * fix * change fluid => paddle.static * remove some unuseful code Co-authored-by: Nesythan <esythan@126.com>
-
- 08 8月, 2022 3 次提交
- 04 8月, 2022 1 次提交
-
-
由 houj04 提交于
* [XPU] fleet dist_model support xpu. test=kunlun * [XPU] fleet dist_model support xpu. test=kunlun * move unittest file location. test=kunlun
-
- 03 8月, 2022 1 次提交
-
-
由 ronnywang 提交于
* [CustomDevice] add custom ccl 2/2 * update * update * update launch
-
- 01 8月, 2022 2 次提交
-
-
由 Leo Chen 提交于
* remove cudaDeviceContext * remove more template * fix rocm compile * remove alias name CUDADeviceContext * fix compile * fix tests * revert changes
-
由 danleifeng 提交于
Co-authored-by: seemingwang <zsasuke@qq.com> Co-authored-by: NDesmonDay <908660116@qq.com> Co-authored-by: Nseemingwang <seemingwang@users.noreply.github.com> Co-authored-by: NThunderbrook <a754913769@163.com> Co-authored-by: Nxuewujiao <105861147+xuewujiao@users.noreply.github.com> Co-authored-by: Nroot <root@yq01-sys-hic-k8s-v100-box-a225-0693.yq01.baidu.com> Co-authored-by: NThunderbrook <52529258+Thunderbrook@users.noreply.github.com> Co-authored-by: Nroot <root@yq01-inf-hic-k8s-a100-ab2-0009.yq01.baidu.com> Co-authored-by: Nhuwei02 <53012141+huwei02@users.noreply.github.com> Co-authored-by: Nyaoxuefeng <yaoxuefeng@baidu.com> Co-authored-by: Nlxsbupt <luoxsbupt@163.com> Co-authored-by: Nmiaoli06 <106585574+miaoli06@users.noreply.github.com> Co-authored-by: Nroot <root@yq01-inf-hic-k8s-a100-ab2-0008.yq01.baidu.com> Co-authored-by: Nchao9527 <33347532+chao9527@users.noreply.github.com> Co-authored-by: Nqingshui <qshuihu@gmail.com> Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>
-
- 29 7月, 2022 2 次提交
-
-
由 Leo Chen 提交于
* init * move CUDAStream to phi * fix compilation * merge develop * add stream_owned_ member * split cuda_stream.h * fix cpu compile * fix constructor * fix bug * fix windows compile * fix inference test_levit * fix windows tests
-
由 Jiabin Yang 提交于
-
- 28 7月, 2022 1 次提交
-
-
由 LiYuRio 提交于
-
- 26 7月, 2022 1 次提交
-
-
由 ziyoujiyi 提交于
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud commm ready * . * . * fl-ps v1.0 * . * support N + N mode * . * . * . * . * delete print * . * . * . * . * fix bug * . * . * fl-ps with coordinator ready * merge dev * update message parse only * update fl client scheduler * fix bug * update multithreads sync * fix ci errors * update role_maker.py * update role_maker.py * fix ci error: windows py import error * fix ci error: windows py import error * fix windows ci pylib import error * add dump fields & params * try to fix windows import fleet error * fix ps FLAGS error
-
- 22 7月, 2022 1 次提交
-
-
由 Haohongxiang 提交于
-
- 21 7月, 2022 1 次提交
-
-
由 zhaocaibei123 提交于
* add slot attr for push sparse op * add pybind * remove fleet * add unittest * fix
-
- 20 7月, 2022 1 次提交
-
-
由 danleifeng 提交于
* add adam/sharedadam optimzier for gpups;edit optimizer struct;test=develop
-
- 19 7月, 2022 2 次提交
-
-
由 zhaocaibei123 提交于
* add save_cache/patch * add pybind * remove pybind * remove const_cast * add fleet
-
由 Ruibiao Chen 提交于
* Rename BOOST_GET macros * Fix conflicts
-
- 16 7月, 2022 1 次提交
-
-
由 Leo Chen 提交于
-
- 15 7月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
-
- 11 7月, 2022 2 次提交
-
-
由 Haohongxiang 提交于
* fix conflict * new pg apis * add docs of new apis * update * fix coverage * update * fix bug * fix reduce scatter * fix api * update Co-authored-by: NForFishes <2282912238@qq.com>
-
由 wangguanqun 提交于
* ssd pullsparseptr * update codestyle
-