提交 · 39ff0f9cd9b34871e916db0f242a0c884eda6521 · afeixing77 / Paddle

27 9月, 2019 5 次提交

Optimze/optimize dygraph api (#19999) · 39ff0f9c

由 Jiabin Yang 提交于 9月 27, 2019

* test=develop, fix docker with paddle nccl problem

* test=develop, Add Variable api and refine dygraph related API

* test=develop, Add Variable api and refine dygraph related API

* test=develop, refine test for new api and error info

* test=develop, refine error info and test_layers

* test=develop, add API.spec

* test=devleop, fix to_string python2 and python3 compat error and refien doc

* test=devleop, add API spec

* test=devleop, update API spec

* test=devleop, update API spec

* test=develop, invoke ci

* test=develop, fix example code

* test=develop, update API spec

* test=develop, add compat test and fix inplace campat dict error

39ff0f9c

polish pool infer shape (#20038) · e7a6567b

由 Kaipeng Deng 提交于 9月 27, 2019

* fix pool infershape. test=develop

* fix unittest converage. test=develop

* fix format. test=develop

e7a6567b

C
Add fp16 support for pad and split (#19881) · fb2a9cdf
由 chengduo 提交于 9月 27, 2019
```
* make pad and split support fp16
test=develop
```
fb2a9cdf

the integrated communicator (#19849) · 8f0b3c05

由 tangwei12 提交于 9月 27, 2019

* add a base class for the Communicator
* add AsyncCommunicator Impl for async distributed training

8f0b3c05

Z
Fix name_scope test case bug (#20034) · 5a2ecdea
由 zhaoyuchen2018 提交于 9月 27, 2019
```
test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
```
5a2ecdea

26 9月, 2019 11 次提交
- G
  
  Add `RUN_SERIAL` attribute to `exclusive` test. (#20026) · afc40a59
  由 gongweibao 提交于 9月 26, 2019
  
  afc40a59
- W
  
  fix reduce bug test=develop (#19971) · 3409db95
  由 wangchaochaohu 提交于 9月 26, 2019
  
  3409db95
- L
  improve the error message when handling ndarray with unsupported dtype (#19949) · bda7eab7
  由 liuwei1031 提交于 9月 26, 2019
```
* impove error message when passing ndarray with object dtype

* imporve message format

* change assert to raise TypeError

* remind user how to locate the irregular data instead of printing

* add unittest for input array type check
```
  bda7eab7
- J
  Fix test pool2d int8 mkldnn (#19976) · 1d32897c
  由 joanna.wozna.intel 提交于 9月 26, 2019
```
* Fix conv2d+dequantize squash for residual fusion

test=develop

* Correct int8 input

test=develop

* Add if exclude or include padding in pool2d mkldnn

test=develop
```
  1d32897c
- A
  Require x.dims=label.dims in huber_loss (#20017) · f58c8db6
  由 Aurelius84 提交于 9月 26, 2019
```
* x.dims == y.dims test=develop

* refine comment
```
  f58c8db6
- Y
  Expose `mutable_data` as python binding (#19932) · cde73a7b
  由 Yang Zhang 提交于 9月 26, 2019
```
* Expose `mutable_data` as python binding

test=develop

* Add test for device pointer binding

test=develop

* Make test compatible with python 2
```
  cde73a7b
- A
  Remove constraint that last dimension is forced to be 1 in rank_loss (#19997) · 137e6336
  由 Aurelius84 提交于 9月 26, 2019
```
* fix input shape check test=develop

* move PADDLE_ENFORCE test=develop
```
  137e6336
- C
  Add dtype for coalesce_tensor_op (#20016) · 101a2b61
  由 chengduo 提交于 9月 26, 2019
```
Add dtype for coalesce_tensor_op
```
  101a2b61
- H
  Add new data layer (#19916) · 88af4ab6
  由 Huihuang Zheng 提交于 9月 26, 2019
```
The new "fluid.data" changes old "fluid.layers.data":

1. Add shape and dtype check.
2. Remove "append_batch_size" parameter. We won't offer this in the new data layer because other deep learning platforms don't have this kind of data layer pre-processing. It may confuse users.
3. Remove "stop gradient" parameter because the data layer doesn't do back-propagation

TODO：
Now data layer feeded by executor is checked, will we want to check the feed data of readers in the future?
```
  88af4ab6
- Z
  
  fix math_op_path.py when integers, test=develop (#20008) · 1b7de894
  由 Zeng Jinle 提交于 9月 26, 2019
  
  1b7de894
- Q
  
  Remove unit testing for large shape in test_affine_channel_op (#19993) · bb271b6d
  由 qingqing01 提交于 9月 26, 2019
  
  bb271b6d
25 9月, 2019 8 次提交

add kernel for fill_op, test=develop (#19719) · b1bb2384

由 zhongpu 提交于 9月 25, 2019

* add kernel for fill_op, test=develop

* modify PADDLE_ENFORCE to PADDLE_ENFORCE_EQ, test=develop

* add op test for fill_op, test=develop

* REGISTER COP CUDA KERNEL, test=develop

* update test_fill_op.py, test=develop

* change FillConstantOpVarTypeInference to FillOpVarTypeInference, test=develop

* fix op test, test=develop

* add head file, test=develop

b1bb2384

add support tensor and tensorlist for strided_slice OP (#19929) · 382d099d

由 wangchaochaohu 提交于 9月 25, 2019

* add support tensor and tensorlist for strided_slice OP test=develop

* fix the commnet test=develop

* fix test=develop

* fix the bug test=develop

* delete log test=develop

* fix API.spec test=develop

* fix test=develop

382d099d

L
Fix OpTest of bn (#19062) · 619a241b
由 lvmengsi 提交于 9月 25, 2019
```
* fix bn
```
619a241b
S
Avoid treating broadcast as initialization operation (#19857) · 5920d69d
由 ShenLiang 提交于 9月 25, 2019
```
* treat broadcast as non-initial, test=develop

* rename the class name

* rename the class name, test=develop
```
5920d69d

add support of matmul with multiple head even different width and height (#19708) · c670058a

由 Bob Zhu 提交于 9月 25, 2019

* add support of matmul with multiple head even different width and height

Original matmul with multiple head supports only the mat_a.width == mat_b.height,
in that case, mat_b will be horizontally split. In this patch, we extend the
support when mat_a.width != mat_b.height but mat_a.width/head_number == mat_b.height,
in this case, mab_b will be vertically split.

One example is A is [3, 8], B is [2, 16], head_number is 4. In this
case, A will be split as [3, 2], B will be (vertically) split as
[2, 4]. The final result will be 4 matrix of 4 matrix of [3,4], i.e. [3, 16]

test=develop

* add support of matmul with multiple head even different width and height

Original matmul with multiple head supports only the mat_a.width == mat_b.height,
in that case, mat_b will be horizontally split. In this patch, we extend the
support when mat_a.width != mat_b.height but mat_a.width/head_number == mat_b.height,
in this case, mab_b will be vertically split.

One example is A is [3, 8], B is [2, 16], head_number is 4. In this
case, A will be split as [3, 2], B will be (vertically) split as
[2, 4]. The final result will be 4 matrix of 4 matrix of [3,4], i.e. [3, 16]

test=develop

* refactor the code of matmul with multiple head even different width and height

test=develop

c670058a

L
refine ctc align op with padding (#19926) · 6884dc80
由 Liufang Sang 提交于 9月 25, 2019
```
* refine ctc align op with padding 
* refine api sample code
```
6884dc80
T
add input type and dtype check for softmax_op (#19975) · 65a02fc1
由 Tao Luo 提交于 9月 25, 2019
```
* add input type and dtype check for softmax_op

test=develop

* refine error message

test=develop
```
65a02fc1

Removing length dims constraints of seq_pad and seq_unpad (#19497) · 99a9615a

由 Aurelius84 提交于 9月 25, 2019

* Removing last dims constraints of seq_pad and seq_unpad test=develop

* fix test_layer api code test=develop

* fix sequence_pad_op.cc conflict test=develop

* remove test_analyzer_mm_dnn test=develop

* fix vectorize bug test=develop

* fix vectorize<int> test=develop

99a9615a

24 9月, 2019 8 次提交

J

add optimizer:dpsgd,test=develop (#19915) · 766bd529
由 jhjiangcs 提交于 9月 24, 2019

766bd529

Add float16 support to `sync_batch_norm_op` (#19681) · ebff68fa

由 Yang Zhang 提交于 9月 24, 2019

* Add float16 support to `sync_batch_norm_op`

test=develop

* Add test for sync_bn with FP16 input

test=develop

ebff68fa

Remove constraint that last dimension is forced to be 1 by adding lookup_table_v2 (#19735) · 039b9710

由 Aurelius84 提交于 9月 24, 2019

* Remove constraint that last dimension is forced to be 1 by add
lookup_table_v2 test=develop

* modify into PADDLE_ENFORCE_CUDA_SUCCESS test=develop

* Revert "modify into PADDLE_ENFORCE_CUDA_SUCCESS test=develop"

This reverts commit 8a960bfc61e51aa27c3c529df8fb90b93ebd19f9.

* move api into fluid.embedding test=develop

* fix example code test=develop

* move one_hot into fluid.one_hot

* modify api.spec test=develop

* fix loss shape test=develop

039b9710

support change shuffle and train thread num (#19841) · cedc0477

由 xujiaqi01 提交于 9月 24, 2019

* support change shuffle thread num
* support change train thread num
* fix receive shuffle data of each channel
* data norm stop gradient
* add check thread_tensor type and root_tensor type when merge metric
* remove sleep in shuffle, add config
* add config of pslib client to client communication
* fix xbox str
* add data norm op testcase
* add flush in trainer finalize

cedc0477

K

add elementwise mod support float/double. test=develop (#19570) · 14625ffe
由 Kaipeng Deng 提交于 9月 24, 2019

14625ffe
G
give warnings when save a model without any parameters (#19931) · 790d5226
由 Ghost Under Moon 提交于 9月 24, 2019
```
* give warnings when save a model without any parameters test=develop

* delete one line comment test=develop
```
790d5226
Z
Add py_reader combination unittest (#19923) · f254b477
由 Zeng Jinle 提交于 9月 24, 2019
```
* add py_reader combination unittest,test=develop

* follow huihuang's comments, test=develop
```
f254b477

Make OpTest check grad inplace even if forward has no inplace (#19847) · 57606205

由 Leo Chen 提交于 9月 24, 2019

* make OpTest check grad inplace even if forward has no inplace, test=develop

* do not run PE when enable_inplace is False, test=develop

* add conv3d cuda kernel for float16 type, test=develop

* refactor OpTest for inplace, test=develop

* add comments, test=develop

57606205

23 9月, 2019 8 次提交

Z

resize Ops support data_layout:channel_last, test=develop, test=document_preview (#19914) · cb8f3c03
由 Zhang Ting 提交于 9月 23, 2019

cb8f3c03

Forward recompute3 (#19913) · 9901f696

由 mapingshuo 提交于 9月 23, 2019

* add recompute based checkpoints methods for large batch training
test=develop

* add append_backward_with_forward_recomputation
test=develop

* refine optimizer
test=develop

* update backward and optimizer
test=develop

* make Variable usable
test=develop

* add recompute code

* refine optimizer
test=develop

* refine addup _append_backward_ops_with_checkpoints_
1) for recompute part, just cache the grad_op_desc without appending to block
2) before appending grad_op_desc to backward part, addup_repetitive_vars, remove unused branch
test=develop

* make method private

* add recompute strategy into DistributedStrategy
test=develop

* checkpoint version3
test=develop

* remove some print information
test=develop

* remove unused sumop
test=develop

* try to fix recompute with graph building modules

* add input names to vars should be held

* add memory debug tool

* backup backward

* Fix bugs

* add backward desc for op not in any segments

* add exception info for sub_block

test=develop

* modify code style

test=develop

* modify code style

test=develop

* remove print functions

test=develop

* add API spec

test=develop
test=document_preview

* make Recompute a child class of Optimizer

test=develop
test=document_preview

* add API spec

test=develop
test=document_preview

* modify API spec

test=develop
test=document_preview

* add document for Recompute

test=develop
test=document_preview

* change API doc of Rcompute

test=develop
test=document_preview

* code cleaning

test=develop
test=document_preview

* modify API spec

* fix bugs when segments hold no element

* add testcase for Recompute Optimizer

test=develop
test=document_preview

* add test for apply_gradient, and code cleaning

test=develop
test=document_preview

* add test case for load function

* enable CI

test=develop
test=document

* add test case

test=develop
test=document_preview

* add sample code for 4 function of recompute optimizer

test=develop
test=document_preview

9901f696

G

warning when user save a inference model which contains auc op test=develop (#19838) · 4836ee68
由 Ghost Under Moon 提交于 9月 23, 2019

4836ee68
W
optimize the error information when the input for while op has a wron… (#19872) · e606b175
由 wopeizl 提交于 9月 23, 2019
```
* optimize the error information when the input for while op has a wrong shape test=develop
```
e606b175
R
add mse_loss (#19759) · d31c92a2
由 ruri 提交于 9月 23, 2019
```
* add mse_loss op
```
d31c92a2

move tree_conv to fluid.contrib.layers (#19918) · a4919d36

由 Tao Luo 提交于 9月 23, 2019

* move tree_conv to fluid.contrib.layers

test=develop

* update API.spec for tree_conv

test=develop

* update tree_conv api to increase unit coverage

test=develop

a4919d36

Unify DataLoader APIs (#19305) · 0436efd6

由 Zeng Jinle 提交于 9月 23, 2019

* unify DataLoader APIs, test=develop

* integrate iterable CPU Dataset, test=develop
add GPU dataset supporting, test=develop

* add unittests for dataset, test=develop

* add more docs to dataloader apis, test=develop, test=document_preview

* refine doc, test=develop

* refine doc again, test=develop

* increase coverage, test=develop

0436efd6

T
paddle cloud role maker fix (#19646) · 278dd003
由 tangwei12 提交于 9月 23, 2019
```
* optimize cloud rolemaker, test=develop
```
278dd003

afeixing77 / Paddle 与 Fork 源项目一致

afeixing77 / Paddle
与 Fork 源项目一致