提交 · 8f5fffca0a4b50f12bf2dae374a4ad49402a686f · BaiXuePrincess / Paddle

02 7月, 2019 2 次提交

supports collective training with programs (#18392) · a873fa84

由 Yi Liu 提交于 7月 02, 2019

1. Since allreduce op has 4 reduce types, We split these four reduce types into four ops
2. We also refined the collective op code, e.g. we separated the collective op kernel into CPUKernel and CUDAKernel, and remove the device specified DeviceContext parameter in template as we already knew the target DeviceContext
3. We remove the newly added Collective op role to reduce the complexity of program and graph analysis

a873fa84

C
Add find_no_grad_vars in backward.py (#17942) · e0d8c6ac
由 chengduo 提交于 7月 02, 2019
```
* add not_been_used_vars to no_grad_set
test=develop
```
e0d8c6ac

01 7月, 2019 1 次提交

Make roi_perspective_transform op return mask and transform matrix (#18371) · 449c7a9f

由 LielinJiang 提交于 7月 01, 2019

* modify roi_perspective_transform_op to output mask and transform matrix

* modify comment

* modify comment

* modify API.spec

* update API.spec

* remove no use header, test=develop

* resolve conflict

449c7a9f

27 6月, 2019 2 次提交

add WITH_COVERAGE option, default OFF (#17872) · 27fb9cad

由 kh2se2013 提交于 6月 27, 2019

* add WITH_COVERAGE option, default OFF

test=develop

* add coverage for python sdk

test=develop

* fix code style

* fix COVERAGE_FILE path

test=develop

* remove coverage package

test=develop

* test = develop, run coverage as module

27fb9cad

supports collective communicated training (#18175) · b7128bac

由 HaoRen 提交于 6月 27, 2019

* fix prepare context redundant code problem, optimize executor by caching create_varaiables
test=develop

* supports collective training in executor

* make fetch_list runable with variables, add more unittest for use_program_cache
test=develop

* fix comment
test=develop

* use unique name for nccl_id

* supports output to stream in program_to_code

* insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code

* set op role in collective training

* add collective op role

* remove orig file

* add build optimizer by strategy

* add collective strategy

* refine collective strategy

* add multi-process role maker

* refine strategy building factory so that we can easily plugin more strategy

* scale loss grad in collective sgd transpiler

* add support for distributed fc

* code format

* revert some features for dist fc

* add support for distributed fc training

* fix prepare context redundant code problem, optimize executor by caching create_varaiables
test=develop

* supports collective training in executor

* make fetch_list runable with variables, add more unittest for use_program_cache
test=develop

* use unique name for nccl_id

* supports output to stream in program_to_code

* insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code

* set op role in collective training

* add collective op role

* fix comment
test=develop

* remove orig file

* add build optimizer by strategy

* add collective strategy

* refine collective strategy

* add multi-process role maker

* refine strategy building factory so that we can easily plugin more strategy

* scale loss grad in collective sgd transpiler

* add support for distributed fc

* code format

* revert some features for dist fc

* add support for distributed fc training

* test=develop
add collective op unittest standard

* test=develop
remove the test_collective directory

* test=develop
remove the test_collective directory

* remove slicegather test

* code format for reducescatter

* update attr of shard_index_op

* Modify macro nccl_helper

* remove test without distribute

* macro collective_helper

* marcro update

* test=develop
update support python3.5

* test=develop change gpu memory use to 0.1 when test

* test=develop
update ut equal func

* test=develop
set flags to 1.5

* test=develop fix pickle dumple  py35

* test=develop
fix divide in slice and add sync_comm_stream
update atol and rtol to 1e-05
rm shard_index op and test
modify read input from file to read from memory
remove origin_program in framework and add i/o in c_sync_calc_stream

* test=develop update unittest sync operator I/O

b7128bac

26 6月, 2019 4 次提交
- H
  
  add ut for pipeline training (#18289) · e42057cd
  由 hutuxian 提交于 6月 26, 2019
  
  e42057cd
- J
  
  test=develop, recover ocr ut on dygraph (#18166) · bd61d899
  由 Jiabin Yang 提交于 6月 26, 2019
  
  bd61d899
- Y
  Update lamb optimizer (#18333) · 23941e43
  由 Yibing Liu 提交于 6月 26, 2019
```
* Update lamb optimizer

test=develop, test=document_preview

* Regenerate api spec

test=develop, test=document_preview
```
  23941e43
- J
  
  test=develop, disable basic gru related ut (#18329) · 79bcdbbf
  由 Jiabin Yang 提交于 6月 26, 2019
  
  79bcdbbf
25 6月, 2019 1 次提交

Sequence mask support tensor (#18249) · df2eee71

由 Hongyu Liu 提交于 6月 25, 2019

* sequnce mask support max length tensor input; test=develop

* add rnn_impl.py; test=develop

* add basic gru lstm unittest; test=develop

* fix api spec; test=develop

* fix sequence_mask op bug;
test=develop
test=document_preview

* change +-*x to elmentwise_op; test=develop

* add mkl flag; test=develop

* fix rnn impl bug; test=develop

* update api spec; test=develop

* fix doc bug; test=develop

* fix lstm bugs; test=develop

df2eee71

23 6月, 2019 1 次提交
- C
  add random seed for recurrent op test (#18274) · d54e13bb
  由 chengduo 提交于 6月 23, 2019
```
test=develop
```
  d54e13bb
21 6月, 2019 2 次提交
- X
  set src_idx > 0 for bilinear_interp_op (#18238) · b58bb802
  由 xiaoting 提交于 6月 21, 2019
```
* set src_idx > 0, test=develop

* add unittest and cu, test=develop
```
  b58bb802
- G
  add more print function for timeout issue, make timeout value larger (#18219) · 7d76e34e
  由 guru4elephant 提交于 6月 21, 2019
```
* add more print function for timeout issue, make timeout value larger
```
  7d76e34e
20 6月, 2019 2 次提交

Fix slice op shape=-1 bug (#18107) · cefd0fb5

由 Hongyu Liu 提交于 6月 20, 2019

* fix slice op bug; test=develop

* fix variabel test bug; test=develop

* remove slice while true; test=develop

cefd0fb5

J
test=develop, fix test_imperative_transformer and ocr (#18127) · b3cbc5be
由 Jiabin Yang 提交于 6月 20, 2019
```
* test=develop, fix test_imperative_transformer and ocr

* test=develop, remove ocr recovery part
```
b3cbc5be

19 6月, 2019 3 次提交
- Q
  
  disable test_async_ssa_graph_executor_mnist test=develop (#18165) · 778f6acf
  由 Qiao Longfei 提交于 6月 19, 2019
  
  778f6acf
- 翟
  fix spelling errors (#17941) · 802ea509
  由翟飞跃提交于 6月 19, 2019
```
* fix spelling errors; test=develop

* Update API.spec

update md5

* Update API.spec

* change the order of api;test=develop
```
  802ea509
- J
  test=develop, add add_multi_gpu_install_check (#18157) · 991c94f1
  由 Jiabin Yang 提交于 6月 19, 2019
```
* test=develop, add add_multi_gpu_install_check

* test=develop, refine warning doc

* test=develop, refine warning doc

* test=develop, refine warning doc

* test=develop, support multi cpu
```
  991c94f1
18 6月, 2019 1 次提交
- C
  Remove nccl dep when the number of GPU is 1 (#18158) · 4978db2c
  由 chengduo 提交于 6月 18, 2019
```
* remove nccl dep when the number of GPU is 1
test=develop
```
  4978db2c
17 6月, 2019 1 次提交
- Z
  Fix py_reader iterable bug (#18108) · 6eec66a1
  由 Zeng Jinle 提交于 6月 17, 2019
```
* fix py_reader iterable bug, test=develop

* move data from buffered_reader,test=develop
```
  6eec66a1
16 6月, 2019 5 次提交

Update backward appending stragety to support double backward and fix some bug. (#18104) · 80d2e66f

由 qingqing01 提交于 6月 16, 2019

* Update backward.py:
     - If there is no input grad var in all outputs of previous ops, do not append this op into graph.
     - Only apply this stragety when double backward.
* Update some double backward op.
* Update sum_op to judge whether a tensor is empty by numel or IsInitialized().

80d2e66f

add detection output operator for supporting retinanet (#17896) · ff83655f

由 FlyingQianMM 提交于 6月 16, 2019

* test=develop
add detection output for supporting retinanet

* test=develop
add test_layers.py

* test=develop
add API.spec

* test=develop
alter test_retinanet_detection_output.py

* test=develop
alter round 2

* test=develop
alter retinanet_detection_output

* test=develop
alter paddle/fluid/API.spec

* test=devlop
alter detection.py

* test=develop
alter retinanet_detection_output

* test=develop
alter paddle/fluid/API.spec

* test=develop
alter detection.py

* test=develop
alter API.spec

* test=develop
alter retinanet_detection_output

* test=develop
alter paddle/fluid/API.spec

* test=develop
alter python/paddle/fluid/tests/unittests/test_retinanet_detection_output.py

* test=develop
alter python/paddle/fluid/tests/unittests/test_retinanet_detection_output.py

* test=develop
fix grammer error

* test=develop
fix grammer error

* test=develop
fix grammer error

* test=develop
alter python/paddle/fluid/tests/unittests/test_layers.py

* test=develop
alter paddle/fluid/API.spec

ff83655f

G
add class name and timeline for test_dist_base.py (#18122) · 0941e3e0
由 guru4elephant 提交于 6月 16, 2019
```
* add class name and timeline for test_dist_base.py
```
0941e3e0

add sigmoid focal loss operator for supporting retinanet (#17895) · 0aee1f00

由 FlyingQianMM 提交于 6月 16, 2019

* test=develop
add sigmoid_focal_loss for supporting retinanet

* test=develop
add test_layers

* test=develop
add API.spc

* test=develop
alter sigmoid_focal_loss_op.cc

* test=develop
alter detection.py

* test=develop
alter API.spec

* test=develop
alter round 1

* test=develop
alter simooid_focal_loss

* test=develop
alter sigmoid_focal_loss_op.cc

* test=develop
alter test_layers.py

* test=develop
alter paddle/fluid/API.spec

* test=develop
alter sigmoid_focal_loss_op.cu

* test=develop
alter paddle/fluid/operators/detection/sigmoid_focal_loss_op.cc

0aee1f00

F
Update generate_proposal_labels_op to support CascadeRCNN. (#17200) · 9e4b9d97
由 FDInSky 提交于 6月 16, 2019
```
* Update generate_proposal_labels_op to support CascadeRCNN.
```
9e4b9d97

15 6月, 2019 2 次提交

add target assign operator for supporting retinanet (#17893) · 9ed2f936

由 FlyingQianMM 提交于 6月 15, 2019

* test=develop add target assign for retinanet

* test=develop
run ci

* test=developp
add test_layers

* test=develop
add APi.spec

* test=develop
alter round 1

* test=develop
alter rpn_target_assign_op.cc

* test=develop
alter test_rpn_target_assign_op.py

* test=develop
alter rpn_target_assign_op.cc

* test=develop

alter API.spec

* test=develop
alter paddle/fluid/operators/detection/rpn_target_assign_op.cc

* test=develop
alter rpn_target_assign_op.cc

* test=develop
alter python/paddle/fluid/layers/detection.py

* test=develop
alter paddle/fluid/API.spec

9ed2f936

C
Fix bug of scope_buffered_ssa_graph_executor (#18100) · 24e988a4
由 chengduo 提交于 6月 15, 2019
```
* fix code bug
test=develop
```
24e988a4

14 6月, 2019 3 次提交
- W
  
  add unit test to cover all parameters for print op test=develop (#18089) · 26a7c1a3
  由 wopeizl 提交于 6月 14, 2019
  
  26a7c1a3
- G
  Refine unittest log (#18084) · b2cfdc38
  由 guru4elephant 提交于 6月 14, 2019
```
* add print log for unittest of distributed training
test=develop
```
  b2cfdc38
- G
  
  Fix reinitialized ncclid error! (#18025) · f5caf344
  由 gongweibao 提交于 6月 14, 2019
  
  f5caf344
13 6月, 2019 2 次提交
- C
  Update CPU_NUM config (#18059) · b5a1c146
  由 chengduo 提交于 6月 13, 2019
```
* update CPU_NUM config
test=develop
```
  b5a1c146
- T
  concat op support negative axis (#18045) · 566bf2ec
  由 tensor-tang 提交于 6月 13, 2019
```
test=develop
```
  566bf2ec
12 6月, 2019 5 次提交

T
fix save/load in fleet (#17675) · 101f74cb
由 tangwei12 提交于 6月 12, 2019
```
* fix save/load in Fleet
* add UT framework of Fleet
```
101f74cb

Fix scatter and gather op when has duplicate index (#17952) · 8eb134c3

由 wawltor 提交于 6月 12, 2019

* test=develop
The scatter op has a calc bug when the indices has same index, the scatter op use overwrite mode to calculate the same index, fix this bug by using the accumulate mode to calculate the same index.At the same time, the gather op has the same bug when the op calc the grad. And we use the lib of open-blas and eigen to optimize the time cost in accumulate mode.

* test=develop
Fix some code format problem, and the same time add the test case in gather and scatter op

8eb134c3

Cherry-pick: fix random CI failure. (#18011) · 0bf25351

由 Huihuang Zheng 提交于 6月 12, 2019

* Cherry-pick fix random Python3 CI failure.

In some tests, SWEs used "print('xxx').format('xxx')". The syntax
is only supported in Python2, not python3. However, since those
lines are related to data download, if the CI machines already have
the data, it passes CI tests. That causes random failure.

* Cherry-pick: disable CUDNN case of test_warpctc_op

Also temporary disable a unit test. The test will be fixed under high priority.

0bf25351

fix logging basicConfig cannot be setting after import paddle (#17786) · 96ee528e

由 Kaipeng Deng 提交于 6月 12, 2019

* fix logging unable. test=develop

* unset sys.stdout for stream handler. test=develop

* fix newly add basicConfig. test=develop

* fix import error. test=develop

96ee528e

add deformable psroi pooling (#17827) · 871af28d

由 cjt222 提交于 6月 12, 2019

* add deformable psroi pooling

* test=develop

* test=develop

* test=develop
modify format

* fix bug

* test=develop run ci

* test=develop
add API.spec

* add test_layers.py

* run ci again

* test=develop
run ci again

* run ci again

* test=develop
run ci again

* test=develop
run ci again

* test=develop
run ci again

* add space between two lines

* test=develop
add space between two lines

* test=develop
add space between lines

* test=develop
modify comment in nn.py

* test=develop
add space between two lines

* test=develop
add space between two lines

* update API.spec

* run ci again

* test=develop
run ci again

* rerun ci

* test=develop
rerun ci

* change input shape

* run ci

* test=develop
run ci

* modify format of nn.py

* test=develop

* test=develop

* test=develop
update API.spec

* test=develop
fix API doc

* modify API comment

* modift API comment

* test=develop
update API.spec

* test=develop
modify comment

* test=develop
modift comment

* test=develop
modift comment

* test=develop
update API.spec

* test=develop
modify comment

* test=develop
add inference in nn.py

* test=develop
update API.spec

* test=develop
resolve confict

* test=develop
update API.spec

871af28d

11 6月, 2019 1 次提交

add unfold op (new op),test=develop (#17944) · 40885c22

由 SunGaofeng 提交于 6月 11, 2019

* add unfold op
test=develop

* fix divide bug in python3 when calculating output width and height
test=develop

* add name=None in python api, move redundant code into inline function

* try to trigger ci for this code
test=develop

40885c22

10 6月, 2019 2 次提交

H
Ignore a unit test which failed on cuda9/10 python3 ci task (#17950) · 9f519baf
由 Huihuang Zheng 提交于 6月 10, 2019
```
TODO: it is a temporary fix for Paddle release 1.5. We have to fix
this failed unit test soon.

test=develop
```
9f519baf

Enable seq_pool op to accept len 0 input (#17284) · 33d1e565

由 Yibing Liu 提交于 6月 10, 2019

* Enable seq_pool op to accept len 0 input

test=develop

* Update sequence_pool's api

test=develop

* Add more unittest cases for seq_pool op

test=develop

* Remove legacy comments

test=develop

* Don't use template in op maker

test=develop

33d1e565

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致