Commits · 526456678613101797f931fd64a0aa4e8d051942 · Crayon鑫 / Paddle

06 May, 2021 1 commit
- update 2.0 public api in distributed (#32695) · 70eb435c
  Committed by zhiboniu on May 06, 2021
07 Apr, 2021 1 commit

【NPU】Merge ascend GE&distributed code by 0208 from ascendrc (#31957) · 8c7c53b3

Committed by zhang wenhui on Apr 07, 2021

* Ascend rc (#30483)

* Fix compilation on CANN20.1 and older (#30494)

Fix compilation on CANN20.1 and older

* Add distribution supported (#30578)

Add distribution supported

* Build parser for Hcom* operators (#30627)

Build parser for Hcom* operators

* Pass device_ids info from launch to trainer. (#30632)

Pass device_ids info from launch to trainer

* Add Hccl program group (#30642)

Add Hccl program group

* Add startup bash files of test_ascend_group. (#30645)

Add startup bash files of test_ascend_group

* cleanup (#30646)

cleanup test_ascend_group.py

* [Feature] Build parser to support distributed training (#30658)

[Feature] Build parser to support distributed training

* fix compilation on ascend-20.1 (#30722)

fix compilation on ascend-20.1

* Dev/fix ascend string (#30749)

Dev/fix ascend string

* code style (#30781)

code style

* Merge ascend_optimizer and ascend_parser. (#30776)

Merge ascend_optimizer and ascend_parser.

* Ascendrc add converted op : [range/equal/range/uniform_random/expand/squeeze], fix cast op bug  (#30797)

Ascendrc add converted op : [range/equal/range/uniform_random/expand/squeeze], fix cast op bug

* Add paddle ascend distribution training supported (#30796)

Add paddle ascend distribution training supported

* pass cxx_flags to gloo cmake (#30857)

* Destroy session first. (#30954)

Destroy session first.

* merge

* fix, test=develop

* fix, test=develop

* fix style, test=develop

* fix, test=develop

* fix

* fix log fatal, test=develop

* fix enforce style, test=develop

* fix, test=develop

* fix, test=develop

* fix rccl, test=develop

* fix test, test=develop

* fix, test=develop

* fix, test=develop

* fix, test=develop

* fix node_num, test=develop

* fix ids str, test=develop

* fix ids str, test=develop

* fix ids str, test=develop

* fix, test=develop

* fix, test=develop

* fix, test=develop

* fix, test=develop

* fix, test=develop

* fix, test=develop

* fix, test=develop

* fix, test=develop

* fix style code, test=develop

* fix style code, test=develop

* fix style code, test=develop

* fix style code, test=develop
Co-authored-by: hutuxian <hutuxian2011@sina.cn>
Co-authored-by: gongweibao <weibao.gong@gmail.com>
Co-authored-by: Void Main <voidmain1313113@gmail.com>
Co-authored-by: Leo Chen <chenqiuliang@baidu.com>
Co-authored-by: dingsiyu <18369187719@163.com>
Co-authored-by: OleNet <olenet@126.com>

31 Dec, 2020 1 commit
- Disable gloo by default (#29805) · b0bd93de
  Committed by lilong12 on Dec 31, 2020
```
* update, test=develop
```
08 Dec, 2020 1 commit
- Fix bug in gloo that gloo initialization hangs (#29447) · b122d0bb
  Committed by lilong12 on Dec 08, 2020
```
* update, test=develop
```
26 Nov, 2020 1 commit
- fix the bug in gloo (#29112) · 2a864c70
  Committed by lilong12 on Nov 26, 2020
```
* update, test=develop
```
15 Oct, 2020 1 commit

【paddle.fleet】geo send sparse optimize (#27719) · aa3b4ed7

Committed by 123malin on Oct 15, 2020

* test=develop, fix geo sgd communicator

* test=develop, gloo_init_method

* test=develop, bug fix for gloo http_init

14 Oct, 2020 1 commit
- 【paddle.fleet】bug fix for parameter_recv (#27838) · a4f85074
  Committed by 123malin on Oct 14, 2020
```
* test=develop, bug fix for parameter_recv
* test=develop, for unittest, test_fleet_rolemaker_new
```
13 Oct, 2020 1 commit

【paddle.fleet】Update fleetrun & ps-heter (#27472) · c5f2802d

Committed by Chengmo on Oct 13, 2020

* refine fleetrun.ps_launch

* update fleet run for multi device support

* ps_graph support ps-gpu

* fix heter save

* add heter save unittest

* fix unittest & simple code

* update fleetrun

* fix fleetrun

* fix launch barrier

* fix role maker

* add paddlecloud rolemaker unittest

* rename heter_worker_device_guard

30 Sep, 2020 1 commit
- [bug fix] avoiding multiple initialization of gloo for fleet in dygraph mode (#27706) · 742cbe66
  Committed by lilong12 on Sep 30, 2020
```
* add double grad for expand, test=develop
```
29 Sep, 2020 2 commits
- terminate http server used by gloo for fleet after init (#27698) · 5132f512
  Committed by lilong12 on Sep 29, 2020
- Initialize gloo for low level collective apis (#27672) · bbc2add7
  Committed by lilong12 on Sep 29, 2020
```
* add gloo initializer, test=develop
```
28 Sep, 2020 3 commits
- Revert "Initialize gloo for low level collective apis (#27356)", test=document_fix (#27665) · 36c04102
  Committed by lilong12 on Sep 28, 2020
- test=develop, rm netifaces (#27581) · 68223077
  Committed by 123malin on Sep 28, 2020
```
* test=develop, rm netifaces
```
- Initialize gloo for low level collective apis (#27356) · fa73e4a2
  Committed by lilong12 on Sep 28, 2020
```
* add gloo initializer, test=develop
```
27 Sep, 2020 1 commit
- Fix test dist fleet heter ctr (#27513) · 0e101c4f
  Committed by Chengmo on Sep 27, 2020
```
* fix test_dist_fleet_heter_ctr & performance update
```
23 Sep, 2020 1 commit
- fix server_num bug;test=develop (#27442) · 0721767b
  Committed by danleifeng on Sep 23, 2020
20 Sep, 2020 1 commit

【paddle.fleet】Fix/role maker api fix (#27326) · d6b54de4

Committed by tangwei12 on Sep 20, 2020

* fix fleet util and gloo

* fix worker endpoints

* fix

* fix UT

* fix gloo

* fix gloo

* update gloo

* update gloo

* update gloo

* update gloo

* update gloo

* fix gloo wrapper for hdfs

* add file gloo and UT

* fix UT

* fix UT

* fix UT

* hide public method of RoleMaker

* fix UT

* GPU fleetrun support gloo

* parameterserver fleetrun support gloo

* add UT

* add UT

* fix UT

* fix get server endpoint

* fix get server endpoint

* fix UT

* hide public method of rolemaker

* hide public method of rolemaker

* hide public method of rolemaker

* Update test_fleet_rolemaker_new.py

* hide public method of rolemaker

* hide public method of rolemaker

18 Sep, 2020 1 commit

【paddle.fleet】gloo and util (#27213) · 99626502

Committed by tangwei12 on Sep 18, 2020

* fix worker endpoints

* fix gloo wrapper for hdfs

* GPU fleetrun support gloo

* parameterserver fleetrun support gloo

* fix get server endpoint

17 Sep, 2020 1 commit
- 【Fleet2.0 Util】 add documents (#26698) · f36b9a7f
  Committed by 123malin on Sep 17, 2020
```
* test=develop, util documents
```
03 Sep, 2020 1 commit
- 【paddle.fleet】support running python train.py for fleet tasks (#26249) · e35ad3ee
  Committed by danleifeng on Sep 03, 2020
```
* support running python train.py for fleet-task; test=develop
```
30 Aug, 2020 1 commit
- 【paddle.fleet】Support Heter Parameter Server (#25998) · 7f2aa2db
  Committed by Chengmo on Aug 30, 2020
```
* Support Heter Parameter Server
```
29 Aug, 2020 1 commit
- 【paddle.fleet】fix api documents (#26777) · 994217ea
  Committed by Dong Daxiang on Aug 29, 2020
```
* fix api document
```
22 Aug, 2020 1 commit
- 【paddle.fleet】solve the initial configuration about fleet and rolemaker (#26368) · 66596bd2
  Committed by liuyuhui on Aug 22, 2020
```
* solve the initial configuration about fleet and rolemaker
Co-authored-by: seiriosPlus <tangwei12@baidu.com>
```
18 Aug, 2020 1 commit
- add feature to fleet2.0 role_maker, distribute_strategy, test=develop (#26267) · cd48bdad
  Committed by mapingshuo on Aug 18, 2020
```
* add feature to fleet2.0 role_maker, distribute_strategy, test=develop
```
13 Aug, 2020 1 commit
- 【paddle.fleet】paddle.fleet -> paddle.distributed.fleet. (#26186) · 50a5bcfc
  Committed by Dong Daxiang on Aug 13, 2020
```
* move paddle.fleet to paddle.distributed.fleet
```
07 Aug, 2020 1 commit

【paddle.fleet】fleet_util move to paddle.fleet (#25805) · 2191a083

Committed by 123malin on Aug 07, 2020

* test=develop,test=document_fix, remove the out args

* fleet_util move to paddle.fleet
Co-authored-by: WuHaobo <wuhaobo1994@gmail.com>
Co-authored-by: tangwei12 <tangwei12@baidu.com>

06 Jul, 2020 1 commit
- Paddle fleet distributed strategy (#25379) · d5e40d1b
  Committed by Dong Daxiang on Jul 06, 2020
```
* add paddle.fleet.DistributedStrategy for 2.0
```
23 Mar, 2020 1 commit
- reorganize the paddle api test=develop (#23151) · 194a22c5
  Committed by XiaoguangHu on Mar 23, 2020
17 Sep, 2018 1 commit
- Move trainer to contrib · 82b8a3c5
  Committed by yuyang on Sep 17, 2018
03 Sep, 2018 1 commit
- Fix high level API(Inference) bug (#13159) · ef628ab8
  Committed by chengduo on Sep 03, 2018
```
* fix high level API(Inference) bug

* patch the unit tests
```
15 Aug, 2018 1 commit
- Add print_function for all python files · 99d3f089
  Committed by minqiyang on Aug 15, 2018
26 Jul, 2018 1 commit
- Apply 2to3 to current paddle main python code · 559d3632
  Committed by minqiyang on Jul 26, 2018
18 Jun, 2018 1 commit
- add doc for Inferencer · 4363d2e4
  Committed by qiaolongfei on Jun 18, 2018
09 Jun, 2018 1 commit
- Use for_test=True in the Fluid Trainer to clone the test program (#11323) · 637827a5
  Committed by Jeff Wang on Jun 08, 2018
```
* Use for_test=True in the Fluid Trainer to clone the test program

* fix typo

* Should do the same thing to the inferencer
```
24 May, 2018 1 commit
- add return_numpy back (#10892) · cc7b4b9e
  Committed by daminglu on May 23, 2018
18 May, 2018 1 commit
- add comment · d2d671e3
  Committed by qiaolongfei on May 18, 2018
17 May, 2018 2 commits
- should load parameter before create parallel_executor · feed94e2
  Committed by qiaolongfei on May 17, 2018
- Inferencer support parallel_executor · e8d24aa1
  Committed by qiaolongfei on May 17, 2018
16 May, 2018 1 commit
- Update trainer api (#10674) · 74ca73b8
  Committed by daminglu on May 15, 2018
12 May, 2018 1 commit

Add inferencer infer (#10445) · 2a971f30

Committed by Qiao Longfei on May 12, 2018

* add Inference.infer

* optimize code

* update no_test_word2vec_new_api.py

* update trainer

* split check_and_get_place

* use inference_program to save inference model in Trainer

* update demo

* update save_inference_model

* clean code

Crayon鑫 / Paddle: in sync with the fork source project

Crayon鑫 / Paddle
与 Fork 源项目一致