- 20 Sep, 2022 1 commit
-
-
Committed by Roc
Unify the logger manager in FleetAPI. Hide APIs under distributed/utils that users don't need.
-
- 19 Sep, 2022 1 commit
-
-
Committed by Yulong Ao
-
- 15 Aug, 2022 1 commit
-
-
Committed by wuhuachaocoding
* refactor fleet. * refactor fleet.py. * update fleet/__init__.py. * update fleet.py * update code style. * update fleet * update fleet * update fleet * update fleet * update model.py * update fleet. * update __init__.py * update fleet. * update fleet. * update fleet * update fleet * update fleet * update fleet. * update optimizer.py * update optimizer * update fleet.py * update scaler.py * update setup.py.in
-
- 09 Aug, 2022 1 commit
-
-
Committed by zhaocaibei123
* save load * save load * add unittest * first commit * second commit * third commit * remove SaveLocalFS in memory sparse table * save dense param * update * push slot * fix push show clk: int -> float * add unittest * fix sample * unittest * add AsExtra for op * unittest * modify fs.py * modify fs.py * fix some bugs * add dataset hdfs config * local change * dataset use different hadoop ugi/fs_name * add * fix conflict * fix * remove logs * code style * fix * code style * code style * fix * code style * save_dense_param * fix * fix * fix * fix * change momentum in dense optimizer * fix * fix * change fluid => paddle.static * remove some useless code Co-authored-by: esythan <esythan@126.com>
-
- 26 Jul, 2022 1 commit
-
-
Committed by ziyoujiyi
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud comm ready * . * . * fl-ps v1.0 * . * support N + N mode * . * . * . * . * delete print * . * . * . * . * fix bug * . * . * fl-ps with coordinator ready * merge dev * update message parse only * update fl client scheduler * fix bug * update multithreads sync * fix ci errors * update role_maker.py * update role_maker.py * fix ci error: windows py import error * fix ci error: windows py import error * fix windows ci pylib import error * add dump fields & params * try to fix windows import fleet error * fix ps FLAGS error
-
- 05 Jun, 2022 1 commit
-
-
Committed by Sing_chan
* use yapf to format all python files * yapf excludes two unittest files because they rely on writing and reading files, and formatting would break them * disable diff_py_file because too many diff files cause the following command to fail
-
- 22 Apr, 2022 1 commit
-
-
Committed by zhaocaibei123
* [cherry-pick2.3]fix compile bug of windows cuda11.5 (#41464) cherry-pick fix compile bug of windows cuda11.5 #41433 * fix bug of missing boost when compile cache.cc (#41449) [cherry-pick #41430] fix bug of random compile failure, due to incorrect compile order of dependencies * Fix eager try catch (#41438) (#41477) [Cherry-Pick]Fix eager try catch (#41438) * Cherry-pick-PR41407, fix device_id bug for final_state op in multiprocess testcase (#41407) (#41475) Cherry-pick PR #41407 * [BugFix] Add error hint for one_hot gpu version (#41335) (#41495) * add one_hot gpu hint * move allow_out_of_range judgement * delete useless unittest * fix bugs of reshape double grad infermeta (#41459) (#41493) * [cherrypick-2.3] modify infer gpu memory strategy (#41427), remove cudnn_deterministic=True (#41341) (#41491) Co-authored-by: JingZhuangzhuang <75348594+JZZ-NOTE@users.noreply.github.com> * [Cherry-pick][ROCm] fix dcu error in device event base, test=develop (#41523) Cherry-pick of #41521 * [Cherry-Pick]Cherry pick PR41200, PR41474, PR41382 (#41509) * Use `self` as a parameter of _hash_with_id function to avoid error caused by hash_id reuse (#41200) * Add fill_constant_batch_size YAML and UT (#41474) * Switch some dy2st UT to eager mode (#41382) * Switch some dy2st UT to eager mode * Fix test_lstm and remove test_transformer * Run test_resnet_v2 in old dy mode * Unittest recover (#41431) * update name * update name * fix test * fix fleet bind * update name * update name * fix test * fix gpups wrapper * remove Push/Pull/Load/Save with context in client and wrapper base class * fix * fix * remove some interface * fix * remove * code style * recover * fix * remove code unused * remove some unused table & accessor & CommonDenseTable => MemoryDenseTable * fix * fix * fix * recover * remove unused code * recover unittest * fix * remove * fix * remove code unuseful * remove * fix * recover * remove Co-authored-by: esythan <esythan@126.com> * add ssd sparse table * fix * add cache shuffle * fix * fix * fix * fix * fix * fix * add unit test * fix Co-authored-by: Zhou Wei <1183042833@qq.com> Co-authored-by: Sing_chan <51314274+betterpig@users.noreply.github.com> Co-authored-by: 0x45f <23097963+0x45f@users.noreply.github.com> Co-authored-by: pangyoki <pangyoki@126.com> Co-authored-by: Siming Dai <908660116@qq.com> Co-authored-by: YuanRisheng <yuanrisheng@baidu.com> Co-authored-by: Zhang Jun <ewalker@live.cn> Co-authored-by: JingZhuangzhuang <75348594+JZZ-NOTE@users.noreply.github.com> Co-authored-by: Qi Li <qili93@qq.com> Co-authored-by: esythan <esythan@126.com>
-
- 22 Nov, 2021 1 commit
-
-
Committed by zmx
* fix api. test=develop * fix api. test=develop
-
- 11 Nov, 2021 1 commit
-
-
Committed by zmx
* change username * fix * fix * fix * fix * fix * update * update * update unittests * fix * update * fix * update * fix * fix * fix * update * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update send_and_recv op. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * update. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix. test=develop * fix ut. test=develop * fix unit. notest,test=coverage * fix ut. notest, test=coverage * update. notest,test=coverage * fix ut. notest, test=coverage * fix ut. notest, test=coverage * fix. notest, test=coverage * fix. notest, test=coverage * fix ut. notest, test=coverage * fix ut. notest, test=coverage * fix ut. notest, test=coverage * fix ut. notest, test=coverage * add func. notest, test=coverage * fix ut. notest, test=coverage * fix. test=develop * fix. test=develop
-
- 27 May, 2021 1 commit
-
-
Committed by Thunderbrook
* support ssd in PsCore * remove log * remove bz2 * default value * code style * parse table class * code style * add define
-
- 06 May, 2021 1 commit
-
-
Committed by zhiboniu
-
- 25 Apr, 2021 1 commit
-
-
Committed by ShenLiang
* add pipeline layer
-
- 22 Apr, 2021 1 commit
-
-
Committed by ShenLiang
* add clip/check * add amp & clip grad in dygraph * add logging
-
- 17 Apr, 2021 1 commit
-
-
Committed by ShenLiang
* add model parallel support in dygraph
-
- 07 Apr, 2021 1 commit
-
-
Committed by zhang wenhui
* Ascend rc (#30483) * Fix compilation on CANN20.1 and older (#30494) Fix compilation on CANN20.1 and older * Add distribution supported (#30578) Add distribution supported * Build parser for Hcom* operators (#30627) Build parser for Hcom* operators * Pass device_ids info from launch to trainer. (#30632) Pass device_ids info from launch to trainer * Add Hccl program group (#30642) Add Hccl program group * Add startup bash files of test_ascend_group. (#30645) Add startup bash files of test_ascend_group * cleanup (#30646) cleanup test_ascend_group.py * [Feature] Build parser to support distributed training (#30658) [Feature] Build parser to support distributed training * fix compilation on ascend-20.1 (#30722) fix compilation on ascend-20.1 * Dev/fix ascend string (#30749) Dev/fix ascend string * code style (#30781) code style * Merge ascend_optimizer and ascend_parser. (#30776) Merge ascend_optimizer and ascend_parser. * Ascendrc add converted op : [range/equal/range/uniform_random/expand/squeeze], fix cast op bug (#30797) Ascendrc add converted op : [range/equal/range/uniform_random/expand/squeeze], fix cast op bug * Add paddle ascend distribution training supported (#30796) Add paddle ascend distribution training supported * pass cxx_flags to gloo cmake (#30857) * Destroy session first. (#30954) Destroy session first.
* merge * fix, test=develop * fix, test=develop * fix style, test=develop * fix, test=develop * fix * fix log fatal, test=develop * fix enforce style, test=develop * fix, test=develop * fix, test=develop * fix rccl, test=develop * fix test, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix node_num, test=develop * fix ids str, test=develop * fix ids str, test=develop * fix ids str, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix style code, test=develop * fix style code, test=develop * fix style code, test=develop * fix style code, test=develop Co-authored-by: hutuxian <hutuxian2011@sina.cn> Co-authored-by: gongweibao <weibao.gong@gmail.com> Co-authored-by: Void Main <voidmain1313113@gmail.com> Co-authored-by: Leo Chen <chenqiuliang@baidu.com> Co-authored-by: dingsiyu <18369187719@163.com> Co-authored-by: OleNet <olenet@126.com>
-
- 06 Apr, 2021 1 commit
-
-
Committed by ShenLiang
* support hyparallel, add topology * fix utest
-
- 20 Feb, 2021 1 commit
-
-
Committed by 123malin
* test=develop, save/load, shrink Co-authored-by: seiriosPlus <tangwei12@baidu.com>
-
- 14 Jan, 2021 1 commit
-
-
Committed by 123malin
* test=develop, add distributed_infer
-
- 16 Oct, 2020 1 commit
-
-
Committed by WangXi
-
- 28 Sep, 2020 2 commits
-
-
Committed by yaoxuefeng
-
Committed by Dong Daxiang
* add get final strategy for user to print final strategy
-
- 24 Sep, 2020 1 commit
-
-
Committed by 123malin
* test=develop, bug fix
-
- 18 Sep, 2020 1 commit
-
-
Committed by tangwei12
* fix worker endpoints * fix gloo wrapper for hdfs * GPU fleetrun support gloo * parameterserver fleetrun support gloo * fix get server endpoint
-
- 16 Sep, 2020 1 commit
-
-
Committed by yaoxuefeng
-
- 04 Sep, 2020 1 commit
-
-
Committed by danleifeng
paddle.distributed.fleet supports dynamic graph execution.
-
- 29 Aug, 2020 1 commit
-
-
Committed by Dong Daxiang
* fix api document
-
- 20 Aug, 2020 1 commit
-
-
Committed by 123malin
* add save/load for parameter server
-
- 14 Aug, 2020 1 commit
-
-
Committed by vslyu
* add unittest for paddlerolemaker with gloo
-
- 13 Aug, 2020 1 commit
-
-
Committed by Dong Daxiang
* move paddle.fleet to paddle.distributed.fleet
-
- 06 Aug, 2020 1 commit
-
-
Committed by xujiaqi01
* move dataset to fleet test=develop * fix test=develop * fix test=develop * fix test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop
-
- 20 Jul, 2020 1 commit
-
-
Committed by Dong Daxiang
refactor fleet api under paddle.fleet; update DistributedStrategy
-
- 06 Jul, 2020 1 commit
-
-
Committed by Dong Daxiang
* add paddle.fleet.DistributedStrategy for 2.0
-
- 23 Mar, 2020 1 commit
-
-
Committed by XiaoguangHu
-
- 16 Aug, 2019 1 commit
-
-
Committed by Tao Luo
* remove unused inference_transpiler unit-tests test=develop * remove InferenceTranspiler usage in quantize_transpiler.py test=develop
-
- 20 Sep, 2018 3 commits
-
-
Committed by Dang Qingqing
-
Committed by luotao1
-
Committed by luotao1
-
- 19 Sep, 2018 1 commit
-
-
Committed by luotao1
-
- 13 Sep, 2018 1 commit
-
-
Committed by Dang Qingqing
-
- 05 Sep, 2018 1 commit
-
-
Committed by Dang Qingqing
-