- 13 Oct, 2022 1 commit
-
-
Committed by Xinger
* add rpc module in cpp side
* add rpc module in python side
* support win32 and mac for rpc
* code optimization
* optimize code
* update rpc
* update rpc launch
* rpc remove rank and world_size api
* fix logger import bug
* remove support for win and mac
* remove support for xpu, npu, cinn and rocm
* remove support for xpu, npu, cinn and rocm
* fix shutdown barrier timeout bug
* update:python_rpc_handler to shared ptr
* fix master shutdown first bug
* tests support for cpu
* update log to vlog
* update get service info api
* add single process test case
* remove process group
* remove some useless dependencies
* update rpc api comments
* update rpc comments: Example to Examples
* update rpc api comments
* update rpc api comments
* update launch api comments
* update init_rpc comments
* update rpc sync and async comments
* fix bug: init_rpc can't be called repeatedly in a process
* update rpc api comment: make master endpoint unique
* update rpc api:service to worker, timeout_ms to timeout
* rename ServiceInfo to WorkerInfo
* refactor: rename server to worker, log to vlog
* add launch test
* remove unused codes
* refine
-
- 11 Oct, 2022 3 commits
-
-
Committed by Wen Sun
-
Committed by Wen Sun
-
Committed by Chen Weihang
* remove using lodtensor part1
* polish history code format
-
- 10 Oct, 2022 1 commit
-
-
Committed by LiYuRio
-
- 08 Oct, 2022 1 commit
-
-
Committed by Haohongxiang
-
- 30 Sep, 2022 1 commit
-
-
Committed by Wen Sun
-
- 29 Sep, 2022 1 commit
-
-
Committed by Xinger
-
- 28 Sep, 2022 1 commit
-
-
Committed by Chen Weihang
* remove needless using tensor
* remove needless using tensor
* resolve conflict
* replace tensor using
* fix format error
* revert needless changing
* fix rocm and npu compile error
* fix cinn compile error
* fix format error
* fix mkldnn format error
* fix mkldnn format error
* fix cinn compile error
* fix cinn compile error
* fix cinn compile error
* resolve conflict
-
- 23 Sep, 2022 1 commit
-
-
Committed by YuanRisheng
-
- 21 Sep, 2022 1 commit
-
-
Committed by wuhuachaocoding
-
- 16 Sep, 2022 1 commit
-
-
Committed by Wen Sun
-
- 14 Sep, 2022 1 commit
-
-
Committed by Nyakku Shigure
* [CodeStyle] trim trailing whitespace in .md and .rst
* empty commit, test=document_fix
-
- 07 Sep, 2022 1 commit
-
-
Committed by LiYuRio
-
- 06 Sep, 2022 1 commit
-
-
Committed by Wen Sun
-
- 01 Sep, 2022 2 commits
-
-
Committed by sneaxiy
* make dense_contents_ lazy init
* update legacy dygraph
* fix legacy dygraph bug
-
Committed by wangguanqun
* config
* fix unittest
* zero init & cache & patch config
* add barrier to save and load
* add unittest
-
- 31 Aug, 2022 1 commit
-
-
Committed by LiYuRio
-
- 26 Aug, 2022 1 commit
-
-
Committed by danleifeng
-
- 25 Aug, 2022 1 commit
-
-
Committed by danleifeng
* update brpc version;test=develop
-
- 23 Aug, 2022 2 commits
- 22 Aug, 2022 1 commit
-
-
Committed by ronnywang
-
- 17 Aug, 2022 1 commit
-
-
Committed by Sing_chan
-
- 16 Aug, 2022 1 commit
-
-
Committed by Yulong Ao
-
- 15 Aug, 2022 1 commit
-
-
Committed by houj04
* [XPU] add some collective ops. test=kunlun
* use XPUOpTestWrapper. test=kunlun
* skip kl1 for collective ops. fix typo: deivce -> device. test=kunlun
-
- 13 Aug, 2022 1 commit
-
-
Committed by ziyoujiyi
* back fl
* delete ssl cert
* .
* make warning
* .
* unittest paral degree
* solve unittest
* heter & multi cloud comm ready
* .
* .
* fl-ps v1.0
* .
* support N + N mode
* .
* .
* .
* .
* delete print
* .
* .
* .
* .
* fix bug
* .
* .
* fl-ps with coordinator ready
* merge dev
* update message parse only
* update fl client scheduler
* fix bug
* update multithreads sync
* fix ci errors
* update role_maker.py
* update role_maker.py
* fix ci error: windows py import error
* fix ci error: windows py import error
* fix windows ci pylib import error
* add dump fields & params
* try to fix windows import fleet error
* fix ps FLAGS error
* fix logging risk
* fix logging possible risk
* write trainer_desc file
* support split sparse params in local & remote
* fix import paddle.fluid.core.PSGPU
* fix import paddle.fluid.core.PSGPU
* add remote_sparse & local_sparse config
* fix unittest
* fix test_dist_fleet_geo table error
* fix PADDLE_ENFORCE error
* fix other's pr conflict
-
- 12 Aug, 2022 2 commits
-
-
Committed by LiYuRio
-
Committed by Yulong Ao
* [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh
* [Auto Parallel] Fix the unittest problem
* [Auto Parallel] Explicitly add the src file for auto_parallel target
* [Auto Parallel] Add the proto dependency explicitly
* [Auto Parallel] Fix the cmake bug on windows and mac
* [Auto Parallel] Remove the pybind11 header file in process_mesh.h
-
- 11 Aug, 2022 1 commit
-
-
Committed by zhaocaibei123
* first refine
* second refine
* remove some code unuseful
-
- 09 Aug, 2022 2 commits
-
-
Committed by Yulong Ao
* [Auto Parallel] Add the c++ dist attrs
* [Auto Parallel] Remove some codes to be less than 1000 lines
-
Committed by zhaocaibei123
* save load
* save load
* add unittest
* first commit
* second commit
* third commit
* remove SaveLocalFS in memory sparse table
* save dense param
* update
* push slot
* fix push show clk: int -> float
* add unittest
* fix sample
* unittest
* add AsExtra for op
* unittest
* modify fs.py
* modify fs.py
* fix some bugs
* add dataset hdfs config
* local change
* dataset use different hadoop ugi/fs_name
* add
* fix conflict
* fix
* remove logs
* code style
* fix
* code style
* code style
* fix
* code style
* save_dense_param
* fix
* fix
* fix
* fix
* change momentum in dense optimizer
* fix
* fix
* change fluid => paddle.static
* remove some unuseful code
Co-authored-by: esythan <esythan@126.com>
-
- 08 Aug, 2022 3 commits
- 04 Aug, 2022 1 commit
-
-
Committed by houj04
* [XPU] fleet dist_model support xpu. test=kunlun
* [XPU] fleet dist_model support xpu. test=kunlun
* move unittest file location. test=kunlun
-
- 03 Aug, 2022 1 commit
-
-
Committed by ronnywang
* [CustomDevice] add custom ccl 2/2
* update
* update
* update launch
-
- 01 Aug, 2022 2 commits
-
-
Committed by Leo Chen
* remove cudaDeviceContext
* remove more template
* fix rocm compile
* remove alias name CUDADeviceContext
* fix compile
* fix tests
* revert changes
-
Committed by danleifeng
Co-authored-by: seemingwang <zsasuke@qq.com>
Co-authored-by: DesmonDay <908660116@qq.com>
Co-authored-by: seemingwang <seemingwang@users.noreply.github.com>
Co-authored-by: Thunderbrook <a754913769@163.com>
Co-authored-by: xuewujiao <105861147+xuewujiao@users.noreply.github.com>
Co-authored-by: root <root@yq01-sys-hic-k8s-v100-box-a225-0693.yq01.baidu.com>
Co-authored-by: Thunderbrook <52529258+Thunderbrook@users.noreply.github.com>
Co-authored-by: root <root@yq01-inf-hic-k8s-a100-ab2-0009.yq01.baidu.com>
Co-authored-by: huwei02 <53012141+huwei02@users.noreply.github.com>
Co-authored-by: yaoxuefeng <yaoxuefeng@baidu.com>
Co-authored-by: lxsbupt <luoxsbupt@163.com>
Co-authored-by: miaoli06 <106585574+miaoli06@users.noreply.github.com>
Co-authored-by: root <root@yq01-inf-hic-k8s-a100-ab2-0008.yq01.baidu.com>
Co-authored-by: chao9527 <33347532+chao9527@users.noreply.github.com>
Co-authored-by: qingshui <qshuihu@gmail.com>
Co-authored-by: yangjunchao <yangjunchao@baidu.com>
-
- 29 Jul, 2022 1 commit
-
-
Committed by Leo Chen
* init
* move CUDAStream to phi
* fix compilation
* merge develop
* add stream_owned_ member
* split cuda_stream.h
* fix cpu compile
* fix constructor
* fix bug
* fix windows compile
* fix inference test_levit
* fix windows tests
-