- 27 Sep 2019, 3 commits

Submitted by tangwei12:
* add a base class for the Communicator
* add an AsyncCommunicator implementation for async distributed training

Submitted by danleifeng:
Polish the English docs of elementwise_add/sub/mul/div

Submitted by Li Fuchen:
Use AllocateTmpTensor() for creating temporary tensors in warpctc.

- 26 Sep 2019, 11 commits

Submitted by wangchaochaohu

Submitted by Adam:
test=develop

Submitted by joanna.wozna.intel:
* Fix conv2d + dequantize squash for residual fusion
* Correct int8 input
* Add option to exclude or include padding in pool2d mkldnn

Submitted by Aurelius84:
* x.dims == y.dims
* refine comment

Submitted by Aurelius84:
* fix input shape check
* move PADDLE_ENFORCE

Submitted by chengduo:
Add dtype for coalesce_tensor_op

Submitted by Zhaolong Xing:
test=develop test=document_fix

Submitted by gongweibao:
Polish the elementwise max/min/pow documents and add more examples

Submitted by Aurelius84

Submitted by Tao Luo:
test=develop

Submitted by Chen Weihang:
* add LoD check for sequence ops
* delete unnecessary check in expand op

- 25 Sep 2019, 6 commits

Submitted by zhongpu:
* add kernel for fill_op
* modify PADDLE_ENFORCE to PADDLE_ENFORCE_EQ
* add op test for fill_op
* register op CUDA kernel
* update test_fill_op.py
* change FillConstantOpVarTypeInference to FillOpVarTypeInference
* fix op test
* add header file

Submitted by wangchaochaohu:
* add support for Tensor and TensorList inputs to the strided_slice op
* fix the comment
* fix the bug
* delete log
* fix API.spec

Submitted by lvmengsi:
* fix bn

Submitted by Bob Zhu:
* add support for matmul with multiple heads even when width and height differ. The original multi-head matmul supports only mat_a.width == mat_b.height, in which case mat_b is split horizontally. This patch extends the support to mat_a.width != mat_b.height as long as mat_a.width / head_number == mat_b.height; in that case mat_b is split vertically. For example, if A is [3, 8], B is [2, 16] and head_number is 4, then A is split into [3, 2] blocks and B is (vertically) split into [2, 4] blocks, and the final result is 4 matrices of [3, 4], i.e. [3, 16].
* refactor the code of matmul with multiple heads
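The split-and-concatenate arithmetic described above can be sketched in plain NumPy; this is only an illustration of the stated semantics, not the Paddle kernel, and the helper name multi_head_matmul is hypothetical:

```python
import numpy as np

def multi_head_matmul(a, b, head_number):
    # a: [m, k], b: [h, n], with k / head_number == h
    m, k = a.shape
    h, n = b.shape
    assert k % head_number == 0 and k // head_number == h
    a_blocks = np.split(a, head_number, axis=1)  # head_number blocks of [m, h]
    b_blocks = np.split(b, head_number, axis=1)  # head_number blocks of [h, n/head_number]
    # one [m, n/head_number] product per head, concatenated to [m, n]
    return np.concatenate([x @ y for x, y in zip(a_blocks, b_blocks)], axis=1)

a = np.random.rand(3, 8)   # A is [3, 8]
b = np.random.rand(2, 16)  # B is [2, 16]
out = multi_head_matmul(a, b, head_number=4)
print(out.shape)  # (3, 16)
```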

Submitted by Liufang Sang:
* refine ctc align op with padding
* refine API sample code

Submitted by Aurelius84:
* remove the last-dims constraints of seq_pad and seq_unpad
* fix test_layer API code
* fix sequence_pad_op.cc conflict
* remove test_analyzer_mm_dnn
* fix vectorize bug
* fix vectorize<int>

- 24 Sep 2019, 9 commits

Submitted by jhjiangcs

Submitted by Yang Zhang:
* Add float16 support to `sync_batch_norm_op`
* Add test for sync_bn with FP16 input

Submitted by Aurelius84:
* remove the constraint that the last dimension is forced to be 1 by adding lookup_table_v2
* modify into PADDLE_ENFORCE_CUDA_SUCCESS
* Revert "modify into PADDLE_ENFORCE_CUDA_SUCCESS test=develop" (reverts commit 8a960bfc61e51aa27c3c529df8fb90b93ebd19f9)
* move the API into fluid.embedding
* fix example code
* move one_hot into fluid.one_hot
* modify api.spec
* fix loss shape
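A minimal sketch of the relaxed shape requirement, assuming the fluid 1.x layers API of the time (tensor names and sizes are illustrative):

```python
import paddle.fluid as fluid

# With fluid.embedding (lookup_table_v2), ids no longer need a
# trailing dimension of 1; a plain int64 id tensor works directly.
ids = fluid.layers.data(name='ids', shape=[10], dtype='int64')
emb = fluid.embedding(input=ids, size=[10000, 64])  # -> [batch, 10, 64]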

Submitted by xujiaqi01:
* support changing shuffle thread num
* support changing train thread num
* fix receiving shuffle data of each channel
* data norm stop gradient
* add check of thread_tensor type and root_tensor type when merging metric
* remove sleep in shuffle, add config
* add config of pslib client-to-client communication
* fix xbox str
* add data norm op test case
* add flush in trainer finalize

Submitted by Kaipeng Deng

Submitted by Jacek Czaja:
- first implementation of FWD and BWD of the pooling mkl-dnn kernel
- combined AcquireBackward with Fwd
- assorted compilation and crash fixes

Submitted by Zeng Jinle

Submitted by Zeng Jinle

Submitted by Leo Chen:
* make OpTest check grad inplace even if forward has no inplace
* do not run PE when enable_inplace is False
* add conv3d CUDA kernel for float16 type
* refactor OpTest for inplace
* add comments

- 23 Sep 2019, 3 commits

Submitted by Zhang Ting

Submitted by Kaipeng Deng:
* fix softmax CE time limit check failure
* refine softmax calculation

Submitted by 石晓伟

- 22 Sep 2019, 1 commit

Submitted by lvmengsi:
* add instance norm op

- 21 Sep 2019, 2 commits

- 20 Sep 2019, 5 commits

Submitted by Aurelius84:
* support 2-level LoD input in sequence_pool
* fix LoD level bug in .cu

Submitted by Zhang Ting:
1. group_norm supports data_layout=NHWC
2. modified doc of group_norm
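A short sketch of the new layout option, assuming the fluid 1.x group_norm signature (tensor names and sizes are illustrative):

```python
import paddle.fluid as fluid

# channels-last (NHWC) input: 32x32 spatial, 64 channels
x = fluid.layers.data(name='x', shape=[32, 32, 64], dtype='float32')
y = fluid.layers.group_norm(input=x, groups=8, data_layout='NHWC')
```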

Submitted by Jacek Czaja:
- LRN mkl-dnn kernel refactor
- optional LRN mkldnn workspace: added mid allocation, removed mid for inference, then reverted the mid removal for is_test
- removed gradient from the is_test unit test, with a workaround for tests
- adjusted PADDLE_ENFORCE; rebased onto the templatization commit and the recent codebase
- assorted compilation, lint and crash fixes

Submitted by Zhang Ting:
Modified interpolate_op to support tensor attributes:
1. the out_shape parameter of image_resize and resize_nearest/bilinear/trilinear can be a list or a 1-D Tensor variable; if a list, each element can be an integer or a Tensor variable with shape [1].
2. the scale parameter of the above ops can be a 1-D Tensor variable.
Also modified the documents of image_resize, resize_nearest, resize_bilinear and resize_trilinear, and added some code examples.
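A hedged sketch of the two out_shape forms, assuming the fluid 1.x resize_bilinear API (tensor names are illustrative):

```python
import paddle.fluid as fluid

x = fluid.layers.data(name='x', shape=[3, 32, 32], dtype='float32')

# out_shape as a plain list of integers
y1 = fluid.layers.resize_bilinear(x, out_shape=[64, 64])

# out_shape as a 1-D tensor variable resolved at runtime
shape = fluid.layers.data(name='shape', shape=[2], dtype='int32',
                          append_batch_size=False)
y2 = fluid.layers.resize_bilinear(x, out_shape=shape)
```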

Submitted by Zhang Ting:
Add crop_tensor op. The main differences from crop are:
1. if the shape argument is a list, each element can be an integer or a Tensor variable with shape [1]; this suits cases where the shape may change each iteration.
2. if the shape argument is a variable, its rank must be 1 (in the crop op, the rank of shape must be the same as that of x).
offsets can also be a list in which each element is an integer or a Tensor variable with shape [1].
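A minimal sketch of the mixed list form of shape, assuming the fluid 1.x API (values are illustrative):

```python
import paddle.fluid as fluid

x = fluid.layers.fill_constant(shape=[3, 32, 32], dtype='float32', value=1.0)

# one crop extent supplied at runtime as a shape-[1] tensor, the rest as ints
h = fluid.layers.fill_constant(shape=[1], dtype='int32', value=16)
out = fluid.layers.crop_tensor(x, shape=[3, h, 16], offsets=[0, 8, 8])
```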