Commits · 5175b3cb2b1aa05779f3a9f14f7bfca7d43a841d · BaiXuePrincess / Paddle

27 Sep, 2018 · 1 commit

Committed by chengduo on Sep 27, 2018

* add GraphNum

test=develop

* add graph number check in parallelExecutor

test=develop

* fix transformer_model bug

test=develop

* fix graph num

5175b3cb

25 Sep, 2018 · 1 commit
- pass builder allow customize pass in python. · 36c2a9af
  Committed by Xin Pan on Sep 17, 2018

20 Sep, 2018 · 1 commit

Feature/op_fuse_pass (#12440) · d402234b

Committed by chengduo on Sep 20, 2018

* Add Preface

* Add demo code

* Save file

* Refine code

* seems can work

* use elementwise strategy

* Use ElementwiseComputeEx

* Add comments

* extract functions from operator

* Refine code

* Follow comment

* code refine

* add op_fuse pass

* add backward

* code refine

* use TopologySortOperations

* follow comments

* refine IsFusible

* code enhance

* fix op_fusion_pass

* refine code

* refine fuse_elemwise_act_op

* adjust the input and output

* refine logic

* add intermediate_edge

* disable inplace

* follow comments

* refine logic

* follow comments

* Remove the removable IntermediateOut

* change strategy

* code refine

* enable fuse backward

* code refine

* code refine

* rename unit test

* follow comments

17 Sep, 2018 · 2 commits
- simplify and hide bcast_params · ec6ee0a2
  Committed by Xin Pan on Sep 17, 2018
- modification · 612e1a31
  Committed by sneaxiy on Sep 15, 2018

15 Sep, 2018 · 1 commit
- feature/eager_delete_tensor · 24ea39c4
  Committed by sneaxiy on Sep 15, 2018

10 Sep, 2018 · 2 commits
- Add kids exists detection in Scope · dc863aac
  Committed by minqiyang on Sep 10, 2018
- Make all scope pointer to shared · 681514e1
  Committed by minqiyang on Sep 10, 2018

14 Aug, 2018 · 1 commit
- Add FastExecutor · 05cadf1b
  Committed by yuyang18 on Aug 14, 2018

09 Aug, 2018 · 1 commit
- code clean up and renaming · 626abfc3
  Committed by Xin Pan on Aug 09, 2018
  ```
  Reduce one level of inheritance.
  ```

27 Jul, 2018 · 1 commit
- add pass test · 99c0c204
  Committed by Xin Pan on Jul 27, 2018

26 Jul, 2018 · 5 commits
- clean up and correctness check · ab72d28a
  Committed by Xin Pan on Jul 26, 2018
- all passes · aa1085dd
  Committed by Xin Pan on Jul 26, 2018
  ```
  add doc
  ```
- pass refactoring · e4d7d7ae
  Committed by Xin Pan on Jul 26, 2018
- pass registration · 142e832d
  Committed by Xin Pan on Jul 25, 2018
- graph viz pass · 5b183557
  Committed by Xin Pan on Jul 25, 2018

22 Jul, 2018 · 1 commit
- add namespace to Graph · c3f6e0e8
  Committed by Xin Pan on Jul 20, 2018

18 Jul, 2018 · 5 commits
- clean · 64eaa4c8
  Committed by Xin Pan on Jul 15, 2018
- separate graph building pass and graph-based pe builder · 2fa8df1c
  Committed by Xin Pan on Jul 13, 2018
- all graphs · 9605fcd1
  Committed by Xin Pan on Jul 12, 2018
- add a simple program to graph · af79b192
  Committed by Xin Pan on Jul 12, 2018
- polish attrs · 68aa5004
  Committed by Xin Pan on Jul 11, 2018

15 Jul, 2018 · 1 commit
- Add learning rate decay test (#12124) · 325fbc4f
  Committed by chengduo on Jul 15, 2018
  ```
  * Add learning rate decay test

  * fix test name

  * doesn't share @LR_DECAY_COUNTER@
  ```

13 Jul, 2018 · 1 commit

Refine multi thread cpu parallel exe (#11406) · 86b0a725

Committed by chengduo on Jul 13, 2018

* refine multi-thread CPU Parallel exe

* refine multi thread CPU Parallel exe

* Refine CPU version for ParallelExecutor

* add share_parameter_between_cards_

* Fix ParallelExecutor bug

* Fix unit test

* Fix parameter opt balance

* Fix with opti (param->grad)

* Add grad to op var

* Remove shard_param_between_cards

12 Jul, 2018 · 2 commits
- polish function name · d14afced
  Committed by Yancey1989 on Jul 12, 2018
- fix pe with cpu place · 1effba33
  Committed by Yancey1989 on Jul 12, 2018

29 Jun, 2018 · 1 commit
- Fix TensorCopy bug (#11822) · 8d76cf39
  Committed by chengduo on Jun 29, 2018
  ```
  * Fix tensorcopy bug

  * follow comment

  * Refine TensorCopy
  ```

28 Jun, 2018 · 1 commit
- fix FeedAndSplitTensorIntoLocalScopes (#11817) · 6711b7b5
  Committed by chengduo on Jun 28, 2018

26 Jun, 2018 · 4 commits
- update · 8d04d0e2
  Committed by yi.wu on Jun 26, 2018
- fix broadcast bug · 6f010712
  Committed by yi.wu on Jun 26, 2018
- wip · 8e48c77b
  Committed by yi.wu on Jun 26, 2018
- fix dist train broadcasting bug · 3d69a82b
  Committed by yi.wu on Jun 26, 2018

21 Jun, 2018 · 1 commit
- fix mac compile · 964f515e
  Committed by fengjiayi on Jun 21, 2018

20 Jun, 2018 · 1 commit
- fix compile warning · 7e6518e8
  Committed by Yancey1989 on Jun 20, 2018

14 Jun, 2018 · 1 commit

Fix NCCLBcast hang up bug in Parallel Executor (#11377) · 046bb5c8

Committed by Qiyang Min on Jun 13, 2018

* 1. Create buddy allocator in each place before NcclBcast of the variables
  2. Check the memory usage of ALL GPUs rather than the first one

* 1. Make NCCLGroupGuard guard only the ncclBcast part, which avoids ncclGroupEnd blocking exception throwing
  2. NOTE the usage of NCCLGroupGuard

* Remove the memory usage check of gpus

* Fix code style

12 Jun, 2018 · 1 commit
- use get_appropriate_dev to schedule rpc op · 6d752baf
  Committed by Yancey1989 on Jun 12, 2018

11 Jun, 2018 · 1 commit
- replace use_event with use_cuda, because use_event means the program running... · aadaadf7
  Committed by chengduoZH on Jun 11, 2018
  ```
  replace use_event with use_cuda, because use_event means the program is running with CUDA, so use_cuda may be more intuitive.
  ```

10 Jun, 2018 · 2 commits
- small fix · 1e731f59
  Committed by chengduoZH on Jun 10, 2018
- fix in c++ side · 5a3c8bf8
  Committed by chengduoZH on Jun 09, 2018

08 Jun, 2018 · 1 commit
- add SSA graph checker · 0c851cab
  Committed by chengduoZH on Jun 08, 2018

BaiXuePrincess / Paddle is in sync with the source project it was forked from