Commits · d3003a16200ee17f004f4f1877a5547fb77f387b · Crayon鑫 / Paddle

11 Jul, 2019 · 1 commit

Feature/buffer_shared_inplace (#17911) · d3003a16

Committed by Zeng Jinle on Jul 11, 2019

* feature/buffer_shared_inplace, test=develop

* refine code, test=develop

* fix elementwise_add op cpu inplace and sum inplace bug, test=develop

* add unittest and debug log, test=develop

* fix parallel_executor scope bug, polish code, test=develop

* fix sum op, activation op, single_in_place_inference bug, test=develop

* remove kLocalExecScopeName, test=develop

* fix unittest,test=develop

* fix out_var first version bug, test=develop

* follow comments,test=develop

14 Jun, 2019 · 1 commit

- Fix reinitialized ncclid error! (#18025) · f5caf344
  Committed by gongweibao on Jun 14, 2019

06 Jun, 2019 · 1 commit

- Add backward and optimizer operator dependency pass. (#17746) · fbbdc9cc
  Committed by gongweibao on Jun 06, 2019

27 May, 2019 · 1 commit

- Add multi-ncclcomm and 2D ncclallreduce support. (#17263) · 65bbf950
  Committed by gongweibao on May 27, 2019

20 May, 2019 · 1 commit

- remove unused expected_kernel_cache_pass (#17486) · 32da5e9c
  Committed by Tao Luo on May 20, 2019
  ```
  test=develop
  ```

14 May, 2019 · 1 commit

make parallel_executor support FLAGS_use_mkldnn (#17341) · 68ec0a6f

Committed by Tao Luo on May 14, 2019

* make parallel_executor support FLAGS_use_mkldnn

test=develop

* add warning when set mkldnn_enabled_op_types_ in non-mkldnn env

test=develop

08 May, 2019 · 1 commit

- Code Clean: Move all pass to paddle::framework::ir (#17228) · 04bd413a
  Committed by chengduo on May 08, 2019
  ```
  * move pass to ir

  * polish code
  test=develop

  * fix dependency
  test=develop
  ```

06 May, 2019 · 1 commit

Add use_cuda to inplace pass (#17205) · ee2028a1

Committed by Zeng Jinle on May 05, 2019

* add use_cuda to inplace pass,test=develop

* add test softmax_with_xe_inplace test,test=develop

23 Apr, 2019 · 1 commit

- Add fuse momentum ops (#16745) · a2be4b4d
  Committed by chengduo on Apr 23, 2019
  ```
  * Add fuse momentum ops
  ```

21 Apr, 2019 · 1 commit

Refine model gpu memory (#16993) · 1202d3fc

Committed by Zeng Jinle on Apr 21, 2019

* speedup gc and inplace softmax_with_cross_entropy_grad
test=develop

* refine models gpu mem
Merge skip vars and warning messages of mem opt
remove relu mem opt
test=develop

* follow comments
test=develop

12 Apr, 2019 · 1 commit

- Refine Fuse Optimize Ops (#16810) · e9409665
  Committed by chengduo on Apr 12, 2019
  ```
  * fix bug of fuse optimize ops
  ```

11 Apr, 2019 · 1 commit

Add an option to enable the cache of expected kernel in train phase. (#16724) · 112f1614

Committed by Yiqun Liu on Apr 11, 2019

* Add an option to enable the cache of expected kernel in train phase.
test=develop

* Change the default value of cache_expected_kernel to true.

08 Apr, 2019 · 2 commits

- Fix DGC bug. (#16697) · 8b793d0e
  Committed by gongweibao on Apr 08, 2019
- Enable the runtime_context_cache pass in train phase (#16640) · 3fe8cb0d
  Committed by Yiqun Liu on Apr 08, 2019
  ```
  * Try to enable the runtime_context_cache pass in train phase.

  * Put the append of runtime_context_cache pass ahead of multi_dev passes.
  test=develop
  ```

03 Apr, 2019 · 1 commit

- Fix the bug of AllReduceDepPass (#16393) · ea2a2f77
  Committed by chengduo on Apr 02, 2019

28 Mar, 2019 · 2 commits

Fuse Adam And SGD ops (#15933) · 1096746c

Committed by chengduo on Mar 28, 2019

```
* fuse optimizer
```

Fix the interface of Pass::Apply (#16484) · ed61d67c

Committed by chengduo on Mar 27, 2019

* modify the interface of Pass::Apply
test=develop

* Polish code
test=develop

* Fix Travis CI
test=develop

* fix Pass::Apply interface
test=develop

* Fix Travis CI
test=develop

22 Mar, 2019 · 1 commit

[Speed]Refine ParallelExecutor (#16190) · a6a3b2fb

Committed by chengduo on Mar 22, 2019

* refine parallelExecutor
test=develop

* Polish op_handle
test=develop

* Remove unnecessary op_handle
test=develop

* Fix Travis CI
test=develop

* Fix fetch bug
test=develop

* Remove WaitInputVarGenerated

* Fix OpHandleBase::Run
test=develop

* debug
test=develop

* use origin fetch_op_handle
test=develop

* Revert op_handle_base.cc
test=develop

* Polish code
test=develop

* Fix OpHandleBase::Run
test=develop

* code refine

* test CI and CE
test=develop

* fix OpHandle::Run
test=develop

* refine AllReduceOpHandle
test=develop

* Polish code
test=develop

20 Mar, 2019 · 1 commit

Fuse AllReduce (#15921) · f26ba5bd

Committed by chengduo on Mar 19, 2019

* fuse all_reduce
test=develop

* add fuse_parameter_groups_size
test=develop

* Polish code
test=develop

* Fix travis-ci
test=develop

* Add SetGroupAccordingToLayers and SetGroupAccordingToGroupSize
test=develop

* Add SetGroupAccordingToMemorySize
test=develop

* fix multi_devices_graph
test=develop

* reset params_grads
test=develop

* Polish code
test=develop

15 Mar, 2019 · 1 commit

Support sync batch norm. (#16121) · 8ad672a2

Committed by qingqing01 on Mar 15, 2019

* Support Sync Batch Norm.
* Note, do not enable it in one device.

Usage:

    build_strategy = fluid.BuildStrategy()
    build_strategy.sync_batch_norm = True
    binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
        loss_name=loss_mean.name,
        build_strategy=build_strategy)

07 Mar, 2019 · 1 commit

- fix compile problem · 446fdf95
  Committed by Qiao Longfei on Mar 07, 2019

05 Mar, 2019 · 1 commit

- code format test=develop · 4e218dab
  Committed by Qiao Longfei on Mar 05, 2019

23 Feb, 2019 · 1 commit

- refine code test=develop · 2b7931d5
  Committed by Qiao Longfei on Feb 23, 2019

22 Feb, 2019 · 2 commits

- polish · 19d78f67
  Committed by Xin Pan on Feb 22, 2019
  ```
  test=develop
  ```
- resolve conflicts · 32d5a160
  Committed by Xin Pan on Feb 22, 2019
  ```
  test=develop
  ```

21 Feb, 2019 · 2 commits

- allow compiler to use graph · 26e32e09
  Committed by Xin Pan on Jan 17, 2019
  ```
  test=develop
  ```
- fix multi graph test=develop · 7f3be090
  Committed by Qiao Longfei on Feb 21, 2019

19 Feb, 2019 · 1 commit

- polish code test=develop · d5090c89
  Committed by Yancey1989 on Feb 19, 2019

18 Feb, 2019 · 2 commits

- cleanup code test=develop · 0f8bd73c
  Committed by Yancey1989 on Feb 18, 2019
- polish code for reading. test=develop · d376cf71
  Committed by dzhwinter on Feb 18, 2019

14 Feb, 2019 · 3 commits

- cleanup code test=develop · 73005ee0
  Committed by Yancey1989 on Feb 14, 2019
- refine pg execution · f3463ecb
  Committed by Yancey1989 on Feb 14, 2019
- Revert "Revert "cpu reduce mode did not need to broadcast params test=develop"" · 45b19cbc
  Committed by 乔龙飞 Qiao Longfei on Feb 14, 2019

12 Feb, 2019 · 2 commits

- Revert "cpu reduce mode did not need to broadcast params test=develop" · 6e0e7061
  Committed by 乔龙飞 Qiao Longfei on Feb 12, 2019
- follow comment test=develop · fbadd4b6
  Committed by Qiao Longfei on Feb 12, 2019

11 Feb, 2019 · 2 commits

- add details. test=develop · 04e9776a
  Committed by dzhwinter on Feb 11, 2019
- async mode support dist train · c4ded17e
  Committed by Qiao Longfei on Feb 11, 2019

08 Feb, 2019 · 1 commit

- fix compiler · 76072261
  Committed by Qiao Longfei on Feb 08, 2019
  ```
  test=develop
  ```

07 Feb, 2019 · 1 commit

- add more log and fix test_dist_base in multi_batch_merge_pass · 5cf00928
  Committed by Qiao Longfei on Feb 07, 2019

31 Jan, 2019 · 1 commit

- delete graph print pass. test=develop · e537634d
  Committed by dzhwinter on Jan 31, 2019
