提交 · 9901f69677e9b85f1a5b8f6ac97ea1f3e2887375 · BaiXuePrincess / Paddle

23 9月, 2019 7 次提交

Forward recompute3 (#19913) · 9901f696

由 mapingshuo 提交于 9月 23, 2019

* add recompute based checkpoints methods for large batch training
test=develop

* add append_backward_with_forward_recomputation
test=develop

* refine optimizer
test=develop

* update backward and optimizer
test=develop

* make Variable usable
test=develop

* add recompute code

* refine optimizer
test=develop

* refine addup _append_backward_ops_with_checkpoints_
1) for recompute part, just cache the grad_op_desc without appending to block
2) before appending grad_op_desc to backward part, addup_repetitive_vars, remove unused branch
test=develop

* make method private

* add recompute strategy into DistributedStrategy
test=develop

* checkpoint version3
test=develop

* remove some print information
test=develop

* remove unused sumop
test=develop

* try to fix recompute with graph building modules

* add input names to vars should be held

* add memory debug tool

* backup backward

* Fix bugs

* add backward desc for op not in any segments

* add exception info for sub_block

test=develop

* modify code style

test=develop

* modify code style

test=develop

* remove print functions

test=develop

* add API spec

test=develop
test=document_preview

* make Recompute a child class of Optimizer

test=develop
test=document_preview

* add API spec

test=develop
test=document_preview

* modify API spec

test=develop
test=document_preview

* add document for Recompute

test=develop
test=document_preview

* change API doc of Rcompute

test=develop
test=document_preview

* code cleaning

test=develop
test=document_preview

* modify API spec

* fix bugs when segments hold no element

* add testcase for Recompute Optimizer

test=develop
test=document_preview

* add test for apply_gradient, and code cleaning

test=develop
test=document_preview

* add test case for load function

* enable CI

test=develop
test=document

* add test case

test=develop
test=document_preview

* add sample code for 4 function of recompute optimizer

test=develop
test=document_preview

9901f696

G

warning when user save a inference model which contains auc op test=develop (#19838) · 4836ee68
由 Ghost Under Moon 提交于 9月 23, 2019

4836ee68
W
optimize the error information when the input for while op has a wron… (#19872) · e606b175
由 wopeizl 提交于 9月 23, 2019
```
* optimize the error information when the input for while op has a wrong shape test=develop
```
e606b175
R
add mse_loss (#19759) · d31c92a2
由 ruri 提交于 9月 23, 2019
```
* add mse_loss op
```
d31c92a2

move tree_conv to fluid.contrib.layers (#19918) · a4919d36

由 Tao Luo 提交于 9月 23, 2019

* move tree_conv to fluid.contrib.layers

test=develop

* update API.spec for tree_conv

test=develop

* update tree_conv api to increase unit coverage

test=develop

a4919d36

Unify DataLoader APIs (#19305) · 0436efd6

由 Zeng Jinle 提交于 9月 23, 2019

* unify DataLoader APIs, test=develop

* integrate iterable CPU Dataset, test=develop
add GPU dataset supporting, test=develop

* add unittests for dataset, test=develop

* add more docs to dataloader apis, test=develop, test=document_preview

* refine doc, test=develop

* refine doc again, test=develop

* increase coverage, test=develop

0436efd6

T
paddle cloud role maker fix (#19646) · 278dd003
由 tangwei12 提交于 9月 23, 2019
```
* optimize cloud rolemaker, test=develop
```
278dd003

22 9月, 2019 1 次提交
- L
  add instance norm (#19500) · 4155e625
  由 lvmengsi 提交于 9月 22, 2019
```
* add instance norm op
```
  4155e625
21 9月, 2019 3 次提交

A
Add support for other axes in MKLDNN softmax op (#19907) · cb65439d
由 Adam 提交于 9月 21, 2019
```
* Initial, functional commit

* Clean commit related files
test=develop
```
cb65439d

Feature/auto prune in dygraph (#19757) · 45425411

由 Jiabin Yang 提交于 9月 21, 2019

* refactor dygraph,test=develop

* fix failed unittest,test=develop

* polish code,test=develop

* check windows ci error,test=develop
try to fix windows ci error by np.allclose,test=develop

* polish vlog and profiler, test=develop

* try to fix preceding ops order,test=develop

* test transformer in windows ci, test=develop

* use python c-api to speed up tracer.trace,test=develop

* test=develop, fix docker with paddle nccl problem

* test=develop, add ut for debug string and gradient_accumulator

* test=develop, add tests for layer/gradient_accumulator/prepared_op

* test=develop, fix complie error for test_prepared_op

* test=develop, add more ut for dygraph

* test=develop, create API.spec for dygraph api change

* test=develop, refoctor name to make it easier to understand

* test=develop, refoctor name to make it easier to understand

* test=develop, fix multi-gpu failed problem , add Tracer tests, change PADDLEENFORCE to PADDLEENFORCE_EQ

* test=develop, fix ut failed on parallel se-resnext

* test=develop, change one more PADDLE_ENFORCE

* support auto prune in dygraph mode

* test=develop, support auto prune

* test=develop, merge develop conflict

* test=develop, fix test_layer and test_tracer ut

* test=develop, fix bug which may cause stop_gradient disabled with a list of backward inputs

45425411

A

move match_matrix var_conv2d et.al api into fluid.contrib test=develop (#19859) · 418a0967
由 Aurelius84 提交于 9月 21, 2019

418a0967

20 9月, 2019 5 次提交

Z

fix readers bug, test=develop (#19868) · cee0079a
由 Zeng Jinle 提交于 9月 20, 2019

cee0079a
A
support 2-level lod of input in sequence_pool (#19839) · fcf53e55
由 Aurelius84 提交于 9月 20, 2019
```
* support 2-level lod of input in sequence_pool test=develop

* fix lod level bug in .cu test=develop
```
fcf53e55
Z
group_norm support data_layout:NHWC, test=develop, test=document_preview (#19614) · 93364b45
由 Zhang Ting 提交于 9月 20, 2019
```
1. group_norm support data_layout=NHWC
2. modified doc of group_norm
```
93364b45

modified interpolate op to support tensor attribute, test=develop, test=document_preview (#19287) · 439d95e1

由 Zhang Ting 提交于 9月 20, 2019

modified interpolate_op to support tensor attribute

1. the parameter out_shape of image_resize、resize_nearest/bilinear/trilinear can be a list or a 1-D tensor variable. If a list, each element can be an integer or a tensor variable with shape: [1].

2. the parameter scale of above Ops can be a 1-D tensor variable.
modified document of image_resize, resize_nearest, resize_bilinear, resize_trilinear and add some code example.

439d95e1

add crop_tensor_op, test=develop, test=document_preview (#19314) · b3888941

由 Zhang Ting 提交于 9月 20, 2019

add crop_tensor op. The main difference with crop is :

1. If the argument shape is a list, each element is an integer or a tensor variable with shape: [1]. This way is suitable for the case that the shape may be changed each iteration.

2. If the argument shape is a variable. Its rank must be 1. In crop op, the rank of shape must be the same as x

offsets can be a list, in which each element is an integer or a tensor variavle with shape: [1].

b3888941

19 9月, 2019 6 次提交

A
Remove constraint that last dimension is forced to be 1 in cross_entropy (#19606) · b125e327
由 Aurelius84 提交于 9月 19, 2019
```
* Remove constraint that last dimension is forced to be 1 in cross_entropy
test=develop

* modify labels last dims test=develop
```
b125e327

add precise roi pooling op test=develop (#18960) · a7c440d3

由 wopeizl 提交于 9月 19, 2019

* add precise roi pooling op test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* detail the description test=develop

* test=develop

* elaborate the doc for return type test=develop

* test=develop

a7c440d3

Add a pass to fuse fc+elementwise_add+layernorm (#19776) · 3cd985a6

由 Yiqun Liu 提交于 9月 19, 2019

* Add fc_elementwise_layernorm_fuse pass and unittest.

* Add fused_fc_elementwise_layernorm op and its GPU kernel.
test=develop

* Apply fc_elementwise_layernorm_fuse_pass to GPU inference.

* Add the setting of attrs in the definition of binary_op.
test=develop

* Add comment.

* Implement the unittest.
test=develop

* Change the unittest name of layer_norm.
test=develop

3cd985a6

W
distribute.launch use poll to query subprocess (#19853) · 8c2c8dc6
由 WangXi 提交于 9月 18, 2019
```
distribute.launch use poll to query subprocess
```
8c2c8dc6

Disable test_dygraph_mnist_fp16.py (#19844) · 8e927327

由 chengduo 提交于 9月 19, 2019

* Fix std::ostream& operator<<(std::ostream& os, const Tensor& t)
test=develop

* Fix test_dygraph_mnist_fp16
test=develop

* disable test_dygraph_mnist_fp16
test=develop

* revert tensor_util.cc fix
test=develop

8e927327

Strided slice (#19642) · 47af618f

由 wangchaochaohu 提交于 9月 19, 2019

* strided_slice op basic function test=develop

* test=develop rewrite and fix

* fix bug test=develop

* fix for the PADDLE_ENFORCE usage

* add some unit testw

* fix for the aip  test and copright and fix test=develop

* fix API.spec test=develop

* fix API.spec test=develop

* add axis parameter test=develop

* fix for the build error test=develop

* fix python api  test=develop

* fix the build test=develop

* fix build test=develop

* fix API spec test=develop

* test=develop add some comment and single op test

* fix API spece test=develop

* fix test=develop

* fix test=develop

* fix api test=develop

* fix api test=develop

* fix API.spec test=develop

* fix typo test=develop

* fix API.spec test=develop

* fix API typo test=develop

* fix doc and API.spec test=develop

47af618f

18 9月, 2019 2 次提交
- L
  
  fix_roi_transform_bug (#19785) · 6d72a86b
  由 LielinJiang 提交于 9月 18, 2019
  
  6d72a86b
- Z
  [Bug fix] Disable memory reuse on feeded variables (#19835) · db26de83
  由 Zeng Jinle 提交于 9月 18, 2019
```
* fix memory reuse bug on feeding variables, test=develop

* add comments to reference count members, test=develop
```
  db26de83
17 9月, 2019 11 次提交

C
add deformable conv v1 op and cpu version of deformable conv v2 (#18500) · 00efd1d8
由 chengjuntao 提交于 9月 17, 2019
```
* add deformable conv v1 op, test=develop
```
00efd1d8
C
Add fp16 support for dygraph (#19828) · b99fc38c
由 chengduo 提交于 9月 17, 2019
```
* Add fp16 support for dygraph
test=develop

* Add unit test
test=develop
```
b99fc38c

Enhance OpTest to support double grad inplace check (#19826) · 5fbf03d6

由 Leo Chen 提交于 9月 17, 2019

* update OpTest to support double grad inplace check, test=develop

* keep consistency of _calc_output function, test=develop

5fbf03d6

fix pow op, support tensor for agument factor. (#19313) · 677e7144

由 liym27 提交于 9月 17, 2019

improve pow op according to reviews:
1. Delete unnecessary judgement statements in PowGradOpDescMaker;
2. Improve test of test_api;

overload GetKernelTypeForVar

add stop_gradient=True when attr(factor) is tensor Variable, change examples in API pow.
test=develop,test=document_preview

677e7144

add tensor support for argument shape in reshape op; (#19268) · bd89a273

由 liym27 提交于 9月 17, 2019

add support parameter inference when argument shape is a list containing integer and tensor variable;
test=develop

fix reshape op according to reviews:
1. improve or message;
2. improve test of test_api.
test=develop,test=document_preview

fix reshape op: Add error message in nn.py, test=develop

add stop_gradient=True when attr(shape) is tensor Variable.
change examples in API reshape.
test=develop,test=document_preview

bd89a273

add tensor(tensor and tensor in list) support for argument starts and ends in slice op; (#19208) · 88628016

由 liym27 提交于 9月 17, 2019

add support parameter inference when arguments starts or ends is a list containing integer and tensor variable;
test=develop,test=document_preview

improve slice op according to review(from hongyu). test=develop

fix slice op according to review: infer_flags, test=develop

fix slice op: improve overload operator __getitem__ to support attrs(starts and ends) are Variable.
test=develop,test=document_preview

fix test_slice_op: add TestSliceOp_decs_dim_6 to resolve conflict with test_slice_ngraph_op. test=develop

add stop_gradient=True when attr(starts) or attr(ends) is tensor Variable.
test=develop,test=document_preview

88628016

fix expand op: (#19302) · e9e3c087

由 liym27 提交于 9月 17, 2019

1. add tensor support for argument expand_times in expand op;
2. add support parameter inference when argument expand_times is a list containing integer and tensor variable;

improve expand op according to reviews:
1. add doc of ExpandTimes in expand_op.cc;
2. improve the test of test_api.

add stop_gradient=True when attr(expand_times) is tensor Variable, change code examples.
test=develop,test=document_preview

e9e3c087

X
support preload thread, optimize hdfs log, fix master+patch bug (#19695) · 6bf298bf
由 xujiaqi01 提交于 9月 17, 2019
```
* support preload thread
* sleep before fleet wrapper exit for pslib core dump
* optimize hdfs log
* fix master+patch bug
```
6bf298bf

Feature/add transform data dygraph (#19707) · cc311bdf

由 Jiabin Yang 提交于 9月 17, 2019

* refactor dygraph,test=develop

* fix failed unittest,test=develop

* polish code,test=develop

* check windows ci error,test=develop
try to fix windows ci error by np.allclose,test=develop

* polish vlog and profiler, test=develop

* try to fix preceding ops order,test=develop

* test transformer in windows ci, test=develop

* use python c-api to speed up tracer.trace,test=develop

* test=develop, fix docker with paddle nccl problem

* test=develop, add ut for debug string and gradient_accumulator

* test=develop, add tests for layer/gradient_accumulator/prepared_op

* test=develop, fix complie error for test_prepared_op

* test=develop, add more ut for dygraph

* test=develop, create API.spec for dygraph api change

* add transform_data to dygraph

* test=develop, refoctor name to make it easier to understand

* test=develop, refoctor name to make it easier to understand

* add test and change input to const ref for safety

* test=develop, fix multi-gpu failed problem , add Tracer tests, change PADDLEENFORCE to PADDLEENFORCE_EQ

* add ut for data transform

* refine ut for data_transform

* test=develop, fix ut failed on parallel se-resnext

* test=develop, change one more PADDLE_ENFORCE

* add test_tracer on multiple devices

* test=develop, change place to mutable for data transform

* test=develop, add transform data on same place test and remove useless log

* test=develop, Add to do for data layout and and ut for conv2d with no bias

cc311bdf

L
cpu Conv double grad (#19672) · b76343c3
由 lvmengsi 提交于 9月 17, 2019
```
* cpu conv_grad_grad
```
b76343c3

翟

Implement FusedEmbeddingSeqPoolGradKernel with cblas_saxpy (#19770) · 93c85c93

由翟飞跃提交于 9月 17, 2019

* Implement the operator with sprase matrix multiply

* Update the URL of mklml library.

test=develop

* Disable MKLML implematation when using no-linux.

test=develop

* optimize bp with mkl sparse matrix
test=develop

* tmp add fused_emb_seq layer

* Add the support of padding_idx attribute.

test=develop

* add padding_idx support
test=develop

* implement grad refer lego
test=develop

93c85c93

16 9月, 2019 3 次提交
- R
  add unittest for square error cost op (#19746) · a0e9b7b9
  由 ruri 提交于 9月 16, 2019
```
* add unit test for square error cost op
```
  a0e9b7b9
- Z
  add kernel for squeeze_op, test=develop (#19656) · 52673956
  由 zhongpu 提交于 9月 16, 2019
```
* add kernel for squeeze_op, test=develop

* delete comment, test=develop
```
  52673956
- C
  
  Add prune_backward function to cover complicated test_program.clone situation (#19772) · 00d5375e
  由 Chen Weihang 提交于 9月 16, 2019
  
  00d5375e
12 9月, 2019 1 次提交

Remove constraint that last dimension is forced to be 1 by adding one_hot_v2 (#19716) · 8c7e4119

由 Aurelius84 提交于 9月 12, 2019

* add one_hot_v2_op to remove last_dims==1 test=develop

* add api unittest code for CI_Coverage test=develop

* improve CI_Coverage rate by adding test_with_depth test=develop

8c7e4119

11 9月, 2019 1 次提交
- C
  Fix test_parallel_executor_test_while_train (#19723) · c308c88d
  由 chengduo 提交于 9月 11, 2019
```
Fix test_parallel_executor_test_while_train 
```
  c308c88d

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致