提交 · cedc04775c85d044c6e2705701232278196d28f7 · Crayon鑫 / Paddle

24 9月, 2019 9 次提交

support change shuffle and train thread num (#19841) · cedc0477

由 xujiaqi01 提交于 9月 24, 2019

* support change shuffle thread num
* support change train thread num
* fix receive shuffle data of each channel
* data norm stop gradient
* add check thread_tensor type and root_tensor type when merge metric
* remove sleep in shuffle, add config
* add config of pslib client to client communication
* fix xbox str
* add data norm op testcase
* add flush in trainer finalize

cedc0477

K

add elementwise mod support float/double. test=develop (#19570) · 14625ffe
由 Kaipeng Deng 提交于 9月 24, 2019

14625ffe

- ReImplemented pooling fwd mkldnn (#19911) · 5b07ca9c

由 Jacek Czaja 提交于 9月 24, 2019

- First implementation of BWD and FWD of pooling mkl-dnn

- Compilation fix

- Fix

- Fix

 - Fix

- Fix to crash

- Compilation fix

- Combined AcquireBacward with Fwd

test=develop

5b07ca9c

G
give warnings when save a model without any parameters (#19931) · 790d5226
由 Ghost Under Moon 提交于 9月 24, 2019
```
* give warnings when save a model without any parameters test=develop

* delete one line comment test=develop
```
790d5226
Z
Add py_reader combination unittest (#19923) · f254b477
由 Zeng Jinle 提交于 9月 24, 2019
```
* add py_reader combination unittest,test=develop

* follow huihuang's comments, test=develop
```
f254b477
Z

fix huber loss op attr type, test=develop (#19937) · b1e83b33
由 Zeng Jinle 提交于 9月 24, 2019

b1e83b33
Z

add inplace to assign op, test=develop (#19927) · cc157d59
由 Zeng Jinle 提交于 9月 24, 2019

cc157d59
C
clean tensor array (#19930) · 55ce6969
由 chengduo 提交于 9月 24, 2019
```
test=develop
```
55ce6969

Make OpTest check grad inplace even if forward has no inplace (#19847) · 57606205

由 Leo Chen 提交于 9月 24, 2019

* make OpTest check grad inplace even if forward has no inplace, test=develop

* do not run PE when enable_inplace is False, test=develop

* add conv3d cuda kernel for float16 type, test=develop

* refactor OpTest for inplace, test=develop

* add comments, test=develop

57606205

23 9月, 2019 14 次提交

J
add fake_quant_dequant_op for average pool2d, test=develop (#19880) · b0ceed6f
由 juncaipeng 提交于 9月 23, 2019
```
* add fake_quant_dequant_op for average pool2d
* add test
```
b0ceed6f
Z

resize Ops support data_layout:channel_last, test=develop, test=document_preview (#19914) · cb8f3c03
由 Zhang Ting 提交于 9月 23, 2019

cb8f3c03

Forward recompute3 (#19913) · 9901f696

由 mapingshuo 提交于 9月 23, 2019

* add recompute based checkpoints methods for large batch training
test=develop

* add append_backward_with_forward_recomputation
test=develop

* refine optimizer
test=develop

* update backward and optimizer
test=develop

* make Variable usable
test=develop

* add recompute code

* refine optimizer
test=develop

* refine addup _append_backward_ops_with_checkpoints_
1) for recompute part, just cache the grad_op_desc without appending to block
2) before appending grad_op_desc to backward part, addup_repetitive_vars, remove unused branch
test=develop

* make method private

* add recompute strategy into DistributedStrategy
test=develop

* checkpoint version3
test=develop

* remove some print information
test=develop

* remove unused sumop
test=develop

* try to fix recompute with graph building modules

* add input names to vars should be held

* add memory debug tool

* backup backward

* Fix bugs

* add backward desc for op not in any segments

* add exception info for sub_block

test=develop

* modify code style

test=develop

* modify code style

test=develop

* remove print functions

test=develop

* add API spec

test=develop
test=document_preview

* make Recompute a child class of Optimizer

test=develop
test=document_preview

* add API spec

test=develop
test=document_preview

* modify API spec

test=develop
test=document_preview

* add document for Recompute

test=develop
test=document_preview

* change API doc of Rcompute

test=develop
test=document_preview

* code cleaning

test=develop
test=document_preview

* modify API spec

* fix bugs when segments hold no element

* add testcase for Recompute Optimizer

test=develop
test=document_preview

* add test for apply_gradient, and code cleaning

test=develop
test=document_preview

* add test case for load function

* enable CI

test=develop
test=document

* add test case

test=develop
test=document_preview

* add sample code for 4 function of recompute optimizer

test=develop
test=document_preview

9901f696

C
Delete local execution scopes (#19749) · d7251a8e
由 chengduo 提交于 9月 23, 2019
```
* Add RecordHistoryLocalExecScopes
test=develop
```
d7251a8e
G

warning when user save a inference model which contains auc op test=develop (#19838) · 4836ee68
由 Ghost Under Moon 提交于 9月 23, 2019

4836ee68
W
remove the useless warning for user to avoid confuse test=develop (#19871) · 5452b6a1
由 wopeizl 提交于 9月 23, 2019
```
* remove the useless warning for user to avoid confuse test=develop
```
5452b6a1
W
optimize the error information when the input for while op has a wron… (#19872) · e606b175
由 wopeizl 提交于 9月 23, 2019
```
* optimize the error information when the input for while op has a wrong shape test=develop
```
e606b175
R
add mse_loss (#19759) · d31c92a2
由 ruri 提交于 9月 23, 2019
```
* add mse_loss op
```
d31c92a2

Add op compatible information (#19910) · 85b398f1

由 hong 提交于 9月 23, 2019

* add op compatible infomation; test=develop

* add enum type

* add enum type; test=develop

85b398f1

K
fix softmax CE time limit check failed (#19846) · 3f021781
由 Kaipeng Deng 提交于 9月 23, 2019
```
* fix softmax ce time limit check failed. test=develop

* refine softmax calc. test=develop
```
3f021781

move tree_conv to fluid.contrib.layers (#19918) · a4919d36

由 Tao Luo 提交于 9月 23, 2019

* move tree_conv to fluid.contrib.layers

test=develop

* update API.spec for tree_conv

test=develop

* update tree_conv api to increase unit coverage

test=develop

a4919d36

石

tensor_array_to_tensor_op.cc, test=develop (#19289) · 30adea0a
由石晓伟提交于 9月 23, 2019

30adea0a

Unify DataLoader APIs (#19305) · 0436efd6

由 Zeng Jinle 提交于 9月 23, 2019

* unify DataLoader APIs, test=develop

* integrate iterable CPU Dataset, test=develop
add GPU dataset supporting, test=develop

* add unittests for dataset, test=develop

* add more docs to dataloader apis, test=develop, test=document_preview

* refine doc, test=develop

* refine doc again, test=develop

* increase coverage, test=develop

0436efd6

T
paddle cloud role maker fix (#19646) · 278dd003
由 tangwei12 提交于 9月 23, 2019
```
* optimize cloud rolemaker, test=develop
```
278dd003

22 9月, 2019 2 次提交
- L
  add instance norm (#19500) · 4155e625
  由 lvmengsi 提交于 9月 22, 2019
```
* add instance norm op
```
  4155e625
- Z
  Add lock to cudnn handle calls (#19845) · c7f36e7c
  由 Zeng Jinle 提交于 9月 22, 2019
```
* refine reallocate of workspace size, test=develop

* add lock to cudnn handle calls, test=develop
```
  c7f36e7c
21 9月, 2019 7 次提交

P
Add two extra flags for test_analyzer_int8_image_classification to disable fp32/int8 (#19840) · 2c5c6365
由 pawelpiotrowicz 提交于 9月 21, 2019
```
test=develop
```
2c5c6365
A
Add support for other axes in MKLDNN softmax op (#19907) · cb65439d
由 Adam 提交于 9月 21, 2019
```
* Initial, functional commit

* Clean commit related files
test=develop
```
cb65439d

Feature/auto prune in dygraph (#19757) · 45425411

由 Jiabin Yang 提交于 9月 21, 2019

* refactor dygraph,test=develop

* fix failed unittest,test=develop

* polish code,test=develop

* check windows ci error,test=develop
try to fix windows ci error by np.allclose,test=develop

* polish vlog and profiler, test=develop

* try to fix preceding ops order,test=develop

* test transformer in windows ci, test=develop

* use python c-api to speed up tracer.trace,test=develop

* test=develop, fix docker with paddle nccl problem

* test=develop, add ut for debug string and gradient_accumulator

* test=develop, add tests for layer/gradient_accumulator/prepared_op

* test=develop, fix complie error for test_prepared_op

* test=develop, add more ut for dygraph

* test=develop, create API.spec for dygraph api change

* test=develop, refoctor name to make it easier to understand

* test=develop, refoctor name to make it easier to understand

* test=develop, fix multi-gpu failed problem , add Tracer tests, change PADDLEENFORCE to PADDLEENFORCE_EQ

* test=develop, fix ut failed on parallel se-resnext

* test=develop, change one more PADDLE_ENFORCE

* support auto prune in dygraph mode

* test=develop, support auto prune

* test=develop, merge develop conflict

* test=develop, fix test_layer and test_tracer ut

* test=develop, fix bug which may cause stop_gradient disabled with a list of backward inputs

45425411

A

move match_matrix var_conv2d et.al api into fluid.contrib test=develop (#19859) · 418a0967
由 Aurelius84 提交于 9月 21, 2019

418a0967
P
Add TRT input shape check between model and runtime (#19864) · baccd7e2
由 Pei Yang 提交于 9月 21, 2019
```
* add TRT shape check, test=develop

* model_input_shape == runtime_input_shape, refine message, test=develop
```
baccd7e2
P
Fix BUGS: paddle-TRT repeatedly sets weight_map and overdeletes repetitive_params (#19825) · 74812d1c
由 Pei Yang 提交于 9月 21, 2019
```
* fix trt bugs when sharing params, test=develop

* add unittest for cascade_rcnn
```
74812d1c
Z

add py_reader may be deprecated msg, test=develop (#19891) · e2372750
由 Zeng Jinle 提交于 9月 21, 2019

e2372750

20 9月, 2019 8 次提交
- Z
  
  fix readers bug, test=develop (#19868) · cee0079a
  由 Zeng Jinle 提交于 9月 20, 2019
  
  cee0079a
- Z
  Refine err msg of out of gpu memory (#19779) · 747d4498
  由 Zeng Jinle 提交于 9月 20, 2019
```
* refine err msg of out of gpu memory, test=develop

* refine err msg again, test=develop

* refine errog message again, test=develop

* follow reviewer's comments, test=develop
```
  747d4498
- A
  support 2-level lod of input in sequence_pool (#19839) · fcf53e55
  由 Aurelius84 提交于 9月 20, 2019
```
* support 2-level lod of input in sequence_pool test=develop

* fix lod level bug in .cu test=develop
```
  fcf53e55
- Z
  
  remove enforce.h file written, test=develop (#19897) · b25d1e75
  由 Zeng Jinle 提交于 9月 20, 2019
  
  b25d1e75
- C
  refine optimier function (#19886) · ae31faaa
  由 chengduo 提交于 9月 20, 2019
```
test=developt
```
  ae31faaa
- Z
  group_norm support data_layout:NHWC, test=develop, test=document_preview (#19614) · 93364b45
  由 Zhang Ting 提交于 9月 20, 2019
```
1. group_norm support data_layout=NHWC
2. modified doc of group_norm
```
  93364b45
- H
  Set states of recurrent op as dependent vars in prune (#19865) · e1171142
  由 Huihuang Zheng 提交于 9月 20, 2019
```
* Set states of recurrent op as dependent vars in prune of save inference model

This PR will fix the save/load inference model problem of RNN models.

The reason of the bug is that save_inferenc_model will prune OPs that doesn't contribute to Output. But in recurrent_op, States are not Output, OPs refers States will be pruned. 

This fix adds States of recurrent_op as dependent var so that OPs referring States won't be pruned. 
```
  e1171142
- 石
  
  support MLU nums, test=develop (#19899) · c5eedcf6
  由石晓伟提交于 9月 20, 2019
  
  c5eedcf6

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致