- 19 10月, 2022 2 次提交
- 17 10月, 2022 1 次提交
-
-
由 Ghost Screaming 提交于
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * support pure bfloat16 * support bf16 linear * update PR to pass CI * tiny fix where_grad_kernel.cu * Support bfloat16 type for reducer and sharding. * Fix some bug. * Polish code. * Polise code. * Add bfloat16 datatype in fill_grad kernels. Co-authored-by: Nsneaxiy <sneaxiy@126.com>
-
- 13 10月, 2022 1 次提交
-
-
由 Xinger 提交于
* add rpc module in cpp side * add rpc module in python side * support win32 and mac for rpc * 代码优化 * 优化代码 * update rpc * update rpc launch * rpc remove rank and world_size api * fix logger import bug * remove support for win and mac * remove support for xpu, npu, cinn and rocm * remove support for xpu, npu, cinn and rocm * fix shutdown barrier timeout bug * update:python_rpc_handler to shared ptr * fix master shutodwn first bug * tests support for cpu * update log to vlog * update get service info api * add single process test case * remove process group * remove some useless dependencies * update rpc api comments * update rpc comments: Example to Examples * update rpc api comments * update rpc api comments * update launch api comments * update init_rpc comments * update rpc sync and async comments * fix bug: init_rpc cant be called repeatly in a process * update rpc api comment: make master endpoint unique * update rpc api:service to worker, timeout_ms to timeout * rename ServiceInfo to WorkerInfo * refactor: rename server to worker, log to vlog * add launch test * remove unused codes * refine
-
- 11 10月, 2022 3 次提交
-
-
由 Wen Sun 提交于
-
由 Wen Sun 提交于
-
由 Chen Weihang 提交于
* remove using lodtensor part1 * polish history code format
-
- 10 10月, 2022 1 次提交
-
-
由 LiYuRio 提交于
-
- 08 10月, 2022 1 次提交
-
-
由 Haohongxiang 提交于
-
- 30 9月, 2022 1 次提交
-
-
由 Wen Sun 提交于
-
- 29 9月, 2022 1 次提交
-
-
由 Xinger 提交于
-
- 28 9月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* remove needless using tensor * remove needless using tensor * resolve conflict * replace tensor using * fix format error * revert needless changing * fix rocm and npu compile error * fix cinn compile error * fix format error * fix mkldnn format error * fix mkldnn format error * fix cinn compile error * fix cinn compile error * fix cinn compile error * resolve conflict
-
- 23 9月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
-
- 21 9月, 2022 1 次提交
-
-
由 wuhuachaocoding 提交于
-
- 16 9月, 2022 1 次提交
-
-
由 Wen Sun 提交于
-
- 14 9月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* [CodeStyle] trim trailing whitespace in .md and .rst * empty commit, test=document_fix
-
- 07 9月, 2022 1 次提交
-
-
由 LiYuRio 提交于
-
- 06 9月, 2022 1 次提交
-
-
由 Wen Sun 提交于
-
- 01 9月, 2022 2 次提交
-
-
由 sneaxiy 提交于
* make dense_contents_ lazy init * update legacy dygraph * fix legacy dygraph bug
-
由 wangguanqun 提交于
* config * fix unittest * zero init & cache & patch config * add barrier to save and load * add unittest
-
- 31 8月, 2022 1 次提交
-
-
由 LiYuRio 提交于
-
- 26 8月, 2022 1 次提交
-
-
由 danleifeng 提交于
-
- 25 8月, 2022 1 次提交
-
-
由 danleifeng 提交于
* update brpc version;test=develop
-
- 23 8月, 2022 2 次提交
- 22 8月, 2022 1 次提交
-
-
由 ronnywang 提交于
-
- 17 8月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 16 8月, 2022 1 次提交
-
-
由 Yulong Ao 提交于
-
- 15 8月, 2022 1 次提交
-
-
由 houj04 提交于
* [XPU] add some collective ops. test=kunlun * use XPUOpTestWrapper. test=kunlun * skip kl1 for collective ops. fix typo: deivce -> device. test=kunlun
-
- 13 8月, 2022 1 次提交
-
-
由 ziyoujiyi 提交于
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud commm ready * . * . * fl-ps v1.0 * . * support N + N mode * . * . * . * . * delete print * . * . * . * . * fix bug * . * . * fl-ps with coordinator ready * merge dev * update message parse only * update fl client scheduler * fix bug * update multithreads sync * fix ci errors * update role_maker.py * update role_maker.py * fix ci error: windows py import error * fix ci error: windows py import error * fix windows ci pylib import error * add dump fields & params * try to fix windows import fleet error * fix ps FLAGS error * fix logging risk * fix logging possible risk * write trainer_desc file * support split sparse params in local & remote * fix import paddle.fluid.core.PSGPU * fix import paddle.fluid.core.PSGPU * add remote_sparse & local_sparse config * fix unittest * fix test_dist_fleet_geo table error * fix PADDLE_ENFORCE error * fix other's pr conflict
-
- 12 8月, 2022 2 次提交
-
-
由 LiYuRio 提交于
-
由 Yulong Ao 提交于
* [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh * [Auto Parallel] Fix the unittest problem * [Auto Parallel] Explicitly add the src file for auto_parallel target * [Auto Parallel] Add the proto depedency explicitly * [Auto Parallel] Fix the cmake bug on windows and mac * [Auto Parallel] Remove the pybind11 header file in process_mesh.h
-
- 11 8月, 2022 1 次提交
-
-
由 zhaocaibei123 提交于
* first refine * second refine * remove some code unuseful
-
- 09 8月, 2022 2 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Add the c++ dist attrs * [Auto Parallel] Remove some codes to be less than 1000 lines
-
由 zhaocaibei123 提交于
* save load * save load * add unittest * first commit * second commit * third commit * remove SaveLocalFS in memory sparse table * save dense param * update * push slot * fix push show clk: int -> float * add unittest * fix sample * unittest * add AsExtra for op * unittest * modify fs.py * modify fs.py * fix some bugs * add dataset hdfs config * local change * dataset use differenct hadoop ugi/fs_name * add * fix conflict * fix * remove logs * code style * fix * code style * code style * fix * code style * save_dense_param * fix * fix * fix * fix * change momentum in dense optimzer * fix * fix * change fluid => paddle.static * remove some unuseful code Co-authored-by: Nesythan <esythan@126.com>
-
- 08 8月, 2022 3 次提交
- 04 8月, 2022 1 次提交
-
-
由 houj04 提交于
* [XPU] fleet dist_model support xpu. test=kunlun * [XPU] fleet dist_model support xpu. test=kunlun * move unittest file location. test=kunlun
-
- 03 8月, 2022 1 次提交
-
-
由 ronnywang 提交于
* [CustomDevice] add custom ccl 2/2 * update * update * update launch
-