Commits · e6e2e53782b695331710a8a512a1df3efc08fe30 · PaddlePaddle / Paddle

05 Mar, 2020 1 commit

reduce default attrs for dynamic graph (#22850) · 5191e544

Authored by hong on Mar 05, 2020

* reduce default attrs for dynamic graph, test=develop

* add some explanations for explicit attr, test=develop

* tweak explicit attr comments, test=develop

5191e544

03 Mar, 2020 1 commit

add fluid.device_guard to specify the device type for Op (#22254) · 4e8bc024

Authored by Zhang Ting on Mar 03, 2020

* add fluid.device_guard to specify the device type for Op

4e8bc024

02 Jul, 2019 1 commit

supports collective training with programs (#18392) · a873fa84

Authored by Yi Liu on Jul 02, 2019

1. Since the allreduce op has four reduce types, we split them into four separate ops.
2. We also refined the collective op code: we separated the collective op kernel into CPUKernel and CUDAKernel, and removed the device-specific DeviceContext template parameter, since the target DeviceContext is already known.
3. We removed the newly added Collective op role to reduce the complexity of program and graph analysis.

a873fa84

27 Jun, 2019 1 commit

supports collective communicated training (#18175) · b7128bac

Authored by HaoRen on Jun 27, 2019

* fix prepare context redundant code problem, optimize executor by caching create_varaiables
test=develop

* supports collective training in executor

* make fetch_list runnable with variables, add more unittest for use_program_cache
test=develop

* fix comment
test=develop

* use unique name for nccl_id

* supports output to stream in program_to_code

* insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code

* set op role in collective training

* add collective op role

* remove orig file

* add build optimizer by strategy

* add collective strategy

* refine collective strategy

* add multi-process role maker

* refine strategy building factory so that we can easily plugin more strategy

* scale loss grad in collective sgd transpiler

* add support for distributed fc

* code format

* revert some features for dist fc

* add support for distributed fc training

* fix prepare context redundant code problem, optimize executor by caching create_varaiables
test=develop

* supports collective training in executor

* make fetch_list runnable with variables, add more unittest for use_program_cache
test=develop

* use unique name for nccl_id

* supports output to stream in program_to_code

* insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code

* set op role in collective training

* add collective op role

* fix comment
test=develop

* remove orig file

* add build optimizer by strategy

* add collective strategy

* refine collective strategy

* add multi-process role maker

* refine strategy building factory so that we can easily plugin more strategy

* scale loss grad in collective sgd transpiler

* add support for distributed fc

* code format

* revert some features for dist fc

* add support for distributed fc training

* test=develop
add collective op unittest standard

* test=develop
remove the test_collective directory

* test=develop
remove the test_collective directory

* remove slicegather test

* code format for reducescatter

* update attr of shard_index_op

* Modify macro nccl_helper

* remove test without distribute

* macro collective_helper

* macro update

* test=develop
update support python3.5

* test=develop change gpu memory use to 0.1 when test

* test=develop
update ut equal func

* test=develop
set flags to 1.5

* test=develop fix pickle dumple  py35

* test=develop
fix divide in slice and add sync_comm_stream
update atol and rtol to 1e-05
rm shard_index op and test
modify read input from file to read from memory
remove origin_program in framework and add i/o in c_sync_calc_stream

* test=develop update unittest sync operator I/O

b7128bac

08 Jan, 2019 1 commit

add the python callstack for debug support test=develop · a6f5ceee

Authored by peizhilin on Jan 08, 2019

a6f5ceee

26 Dec, 2018 1 commit

Revert "cherry-pick the #12759" · 2388d0e7

Authored by peizhilin on Dec 26, 2018

test=develop

This reverts commit 7f6d8ace.

2388d0e7

25 Dec, 2018 1 commit

cherry-pick the #12759 · 7f6d8ace

Authored by peizhilin on Dec 25, 2018

test=develop

7f6d8ace

25 Oct, 2018 1 commit

better fix · d5d09672

Authored by Xin Pan on Oct 25, 2018

test=develop

d5d09672

22 Oct, 2018 1 commit

clean up after the changes have been stopped for so long. · 8f2116d8

Authored by Xin Pan on Oct 18, 2018

test=develop

8f2116d8

30 Sep, 2018 1 commit

Revert "Merge pull request #13201 from reyoung/revert_callstack" (#13697) · 186b2b13

Authored by Yu Yang on Sep 30, 2018

This reverts commit 21bb9e91, reversing
changes made to 3fa68dc1.

test=develop

186b2b13

21 Sep, 2018 1 commit

[Feature] dist op role and lr op role, to support memory optimize with dist training (#13220) · 29c63d18

Authored by Wu Yi on Sep 21, 2018

* wip

* clean up

* should fix running with memopt

* add ut

* mark lr schedule op role

* hide lr_schedule_guard

* use op_role_var instead of ufind

* unify dist test name

* wip for py3 support

* fix var deref

* fix python3 mem_opt order

* remove comments

29c63d18

16 Sep, 2018 1 commit

Revert changes for debug · 1c87558c

Authored by Yibing Liu on Sep 16, 2018

1c87558c

14 Sep, 2018 1 commit

Get sequence length in sequence_pad op & fix sequence_mask op · f6595811

Authored by Yibing Liu on Sep 14, 2018

f6595811

04 Sep, 2018 1 commit

Revert "Revert "Add Python Callstacks when Op::Run error (#12759)"" · cda7842e

Authored by Yu Yang on Sep 04, 2018

This reverts commit 1f270275.

cda7842e

29 Aug, 2018 1 commit

allow to use name_scope for debugging and visualization · 51ef0ad7

Authored by Xin Pan on Aug 28, 2018

51ef0ad7

23 Aug, 2018 2 commits

Revert "Add Python Callstacks when Op::Run error (#12759)" · 1f270275

Authored by guochaorong on Aug 23, 2018

This reverts commit b2df1700.

1f270275

Add Python Callstacks when Op::Run error (#12759) · b2df1700

Authored by Yu Yang on Aug 23, 2018

* Add Python Callstacks when Op::Run error

* Skip op with sub-block

* refactor: refine callstack info's format

* Reshape only support matrix

* Polish Python code

* Fix UT

* Fix Py3

b2df1700

01 Aug, 2018 1 commit

explicit gradient of elementwise_add/elementwise_sub (#11970) · 595a2c83

Authored by dzhwinter on Aug 01, 2018

* "add gradient register"

* "make some enhance"

* "better format"

* "fix typo"

* "fix reuse"

* "fix get expected kernel"

* "change the mkldnn code"

* "fix mkldnn"

* "fix mkldnn failed test"

* "add comment"

595a2c83

11 Jun, 2018 1 commit

add inplace attribute to op_proto_maker (#10665) · bfa3fd6f

Authored by dzhwinter on Jun 11, 2018

* "add inplace attribute"

* "register inplace attribute"

* "change se-next model for memory-reuse"

* "fix typo"

* repick

* fix merge conflict

* "fix stupid error"

bfa3fd6f

29 May, 2018 1 commit

singleton rpc_client · 20c24c05

Authored by Yancey1989 on May 29, 2018

20c24c05

22 May, 2018 1 commit

Add default value of op_role · c9782590

Authored by yuyang18 on May 22, 2018

c9782590

15 May, 2018 2 commits

Polish op_proto_maker · 44c52a8c

Authored by yuyang18 on May 15, 2018

44c52a8c

Add op role · 017bba16

Authored by yuyang18 on May 15, 2018

017bba16

19 Apr, 2018 1 commit

Fix CPPLint errors in some framework files · cbbf08ae

Authored by Abhinav Arora on Apr 18, 2018

cbbf08ae

12 Feb, 2018 1 commit

Fix the grammar in copyright. (#8403) · 24509f4a

Authored by qingqing01 on Feb 12, 2018

24509f4a

10 Feb, 2018 2 commits

Correct #include path · fc374821

Authored by Yi Wang on Feb 09, 2018

fc374821

Move file to fluid/; Edit CMakeLists.txt · 90648f33

Authored by Yi Wang on Feb 09, 2018

90648f33

21 Sep, 2017 1 commit

move OpProtoAndCheckerMaker from operator to op_proto_maker · a7a66b80

Authored by qiaolongfei on Sep 19, 2017

a7a66b80

20 Sep, 2017 1 commit

move OpProtoAndCheckerMaker from operator to op_proto_maker · 98ef17ed

Authored by qiaolongfei on Sep 19, 2017

98ef17ed

PaddlePaddle / Paddle: last synced successfully about 1 year ago