Commits · 728ec1b43dd59e9ec6b92d33d2189318222c0f5b · 机器未来 / Paddle

30 Sep, 2019 6 commits
- C
  Add GEO-SGD distribute training algorithm (#20018) · 728ec1b4
  Committed by Chengmo on Sep 30, 2019
```
* refactor geo sgd & communicator
```
  728ec1b4
- L
  Set lod level of sequence_unpad's output to 1 in compile time (#20068) · 5365cd2f
  Committed by Li Fuchen on Sep 30, 2019
```
* Set lod level of sequence_unpad's output to 1 in compile time
```
  5365cd2f
- D
  Improve elementwise operators performance in same dimensions. (#19763) · 425279a5
  Committed by danleifeng on Sep 30, 2019
```
Improve elementwise operators performance in same dimensions
```
  425279a5
- L
  
  fix Windows compilation issue when compiling with VS2015, test=release/1.6 (#20114) · 292aae43
  Committed by liuwei1031 on Sep 30, 2019
  
  292aae43
- W
  fix compile paddle with anakin bug · 276b5e34
  Committed by Wilber on Sep 30, 2019
```
* fix compile with anakin bug

* remove useless deps test=develop

- Fixed bugs encountered when building together with anakin.
- test_anakin_activate failed to compile
- test_anakin_engine failed to compile
```
  276b5e34
- S
  
  Modify the style of function names (#20071) · 649bcd5f
  Committed by silingtong123 on Sep 30, 2019
  
  649bcd5f
29 Sep, 2019 3 commits

fix conv2d and conv3d: (#20042) · 3aa331d9

Committed by liym27 on Sep 29, 2019

1. support asymmetric padding;
2. support padding algorithm: "SAME" and "VALID";
3. support channel_last: data_format NHWC and NDHWC;
4. change doc of python API and c++;

test=develop, test=document_preview

3aa331d9
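
The padding options listed in this commit can be illustrated for a single spatial dimension. The following is a minimal sketch, not Paddle's implementation: the function name `conv_output_size` is hypothetical, dilation is assumed to be 1, and the "SAME" rule follows the commonly used convention of an output size of `ceil(in_size / stride)` with any odd leftover padding placed at the end, which is what makes the padding asymmetric.

```python
import math


def conv_output_size(in_size, kernel, stride, padding):
    """Return (out_size, (pad_begin, pad_end)) for one spatial dimension.

    `padding` is "SAME", "VALID", or an explicit (pad_begin, pad_end) pair.
    Hypothetical helper for illustration; dilation is assumed to be 1.
    """
    if padding == "VALID":
        # No padding at all: only fully covered windows produce output.
        pad_begin, pad_end = 0, 0
    elif padding == "SAME":
        # Pad so that the output size equals ceil(in_size / stride).
        target = math.ceil(in_size / stride)
        total = max((target - 1) * stride + kernel - in_size, 0)
        pad_begin = total // 2
        pad_end = total - pad_begin  # may exceed pad_begin by one
    else:
        # Explicit, possibly asymmetric, padding.
        pad_begin, pad_end = padding
    out_size = (in_size + pad_begin + pad_end - kernel) // stride + 1
    return out_size, (pad_begin, pad_end)


valid = conv_output_size(7, 3, 2, "VALID")
same_even = conv_output_size(7, 3, 2, "SAME")
same_odd = conv_output_size(8, 3, 2, "SAME")
explicit = conv_output_size(8, 3, 1, (1, 2))
```

With an input of size 8, a kernel of 3, and a stride of 2, the "SAME" case needs one element of padding in total, so the sketch pads 0 at the beginning and 1 at the end.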

C

Fix compiling warning in deformable conv. (#20036) · 6f184775
Committed by chengjuntao on Sep 29, 2019

6f184775
W
Refine api doc (#20037) · da892caf
Committed by wangguanzhong on Sep 29, 2019
```
* refine doc, test=document_fix

* add API.spec,test=develop,test=document_fix
```
da892caf

28 Sep, 2019 4 commits

improve op uniform_random, argument shape support tensor and tensor in list (#19786) · f1eebf75

Committed by silingtong123 on Sep 28, 2019

* test=develop, argument shape support tensor and tensor in list

* test=develop,Increasing the coverage of CI tests

* test=develop, modify the document and update API.spec

* test=develop, modify the doc and update API.spec

* test=develop, modify the doc and update API.spec

* test=develop, modify the interface of UniformInitializer

* test=develop, modify the interface of XavierInitializer and MSRAInitializer

* test=develop, modify based on review's comments

* test=develop, modify based on review's comments

* test=develop, modify based on review's comments

f1eebf75

fix pool2d pool3d,support asymmetric padding and channel_last (#19739) · 24010472

Committed by liym27 on Sep 28, 2019

* fix pool2d pool3d:
1. support asymmetric padding;
2. support padding algorithm:"SAME" and "VALID";
3. support channel_last: data_format NHWC and NDHWC;
4. support inferring shape when input with negative dims in compile time;
5. change doc of python API and c++;
6. fix bug in cuda kernel when Attr(adaptive) is true.

test=develop,test=document_preview

* fix 'tensors' to 'Tensors'. test=develop,test=document_preview

* add test for coverage of ValueError. test=develop,test=document_preview

* resolve conflict in test_pool2d. test=develop

24010472
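
The `channel_last` option in this commit concerns where the channel axis sits in memory. As a rough illustration only (the helper names are hypothetical and this is not Paddle code), the sketch below computes the flat row-major offset of the same logical element `(n, c, h, w)` under the NCHW and NHWC layouts.

```python
def offset_nchw(n, c, h, w, shape):
    """Flat offset of element (n, c, h, w) when data is stored as NCHW."""
    _, C, H, W = shape
    return ((n * C + c) * H + h) * W + w


def offset_nhwc(n, c, h, w, shape):
    """Flat offset of the same element when data is stored as NHWC."""
    _, C, H, W = shape
    return ((n * H + h) * W + w) * C + c


# Logical shape given as (N, C, H, W) regardless of the storage layout.
shape = (1, 3, 2, 2)
nchw = offset_nchw(0, 1, 0, 1, shape)
nhwc = offset_nhwc(0, 1, 0, 1, shape)
```

In NHWC the channel index varies fastest, so the values of all channels at one pixel are adjacent in memory; in NCHW each channel occupies its own contiguous plane.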

A
Minor GetMKLDNNFormat changes (#20055) · fe581b0e
Committed by Adam on Sep 28, 2019
```
test=develop
```
fe581b0e
L

fix conv_grad_grad (#20054) · c92348c3
Committed by lvmengsi on Sep 28, 2019

c92348c3

27 Sep, 2019 6 commits
- K
  polish pool infer shape (#20038) · e7a6567b
  Committed by Kaipeng Deng on Sep 27, 2019
```
* fix pool infershape. test=develop

* fix unittest coverage. test=develop

* fix format. test=develop
```
  e7a6567b
- C
  Add fp16 support for pad and split (#19881) · fb2a9cdf
  Committed by chengduo on Sep 27, 2019
```
* make pad and split support fp16
test=develop
```
  fb2a9cdf
- L
  
  fix mul double grad (#20040) · 647ff784
  Committed by lvmengsi on Sep 27, 2019
  
  647ff784
- T
  the integrated communicator (#19849) · 8f0b3c05
  Committed by tangwei12 on Sep 27, 2019
```
* add a base class for the Communicator
* add AsyncCommunicator Impl for async distributed training
```
  8f0b3c05
- D
  Polish English docs of elementwise_add/sub/mul/div (#20027) · 5cef7a2f
  Committed by danleifeng on Sep 27, 2019
```
Polish English docs of elementwise_add/sub/mul/div
```
  5cef7a2f
- L
  Fixed warpctc, test=develop (#20011) · c8e12587
  Committed by Li Fuchen on Sep 27, 2019
```
Use AllocateTmpTensor() for creating temporary tensors in warpctc.
```
  c8e12587
26 Sep, 2019 11 commits
- W
  
  fix reduce bug test=develop (#19971) · 3409db95
  Committed by wangchaochaohu on Sep 26, 2019
  
  3409db95
- A
  MKLDNN BatchNorm operator refactor (#20012) · 4b65af77
  Committed by Adam on Sep 26, 2019
```
test=develop
```
  4b65af77
- J
  Fix test pool2d int8 mkldnn (#19976) · 1d32897c
  Committed by joanna.wozna.intel on Sep 26, 2019
```
* Fix conv2d+dequantize squash for residual fusion

test=develop

* Correct int8 input

test=develop

* Add if exclude or include padding in pool2d mkldnn

test=develop
```
  1d32897c
- A
  Require x.dims=label.dims in huber_loss (#20017) · f58c8db6
  Committed by Aurelius84 on Sep 26, 2019
```
* x.dims == y.dims test=develop

* refine comment
```
  f58c8db6
- A
  Remove constraint that last dimension is forced to be 1 in rank_loss (#19997) · 137e6336
  Committed by Aurelius84 on Sep 26, 2019
```
* fix input shape check test=develop

* move PADDLE_ENFORCE test=develop
```
  137e6336
- C
  Add dtype for coalesce_tensor_op (#20016) · 101a2b61
  Committed by chengduo on Sep 26, 2019
```
Add dtype for coalesce_tensor_op
```
  101a2b61
- Z
  fix if else error info (#19974) · f04f2b23
  Committed by Zhaolong Xing on Sep 26, 2019
```
test=develop
test=document_fix
```
  f04f2b23
- G
  Polish elementwise max min pow document to add more examples. (#19946) · a7512db2
  Committed by gongweibao on Sep 26, 2019
```
Polish elementwise max min pow document to add more examples
```
  a7512db2
- A
  
  fix dataType in C++ comment in embedding op (#20004) · 2b5b4b3c
  Committed by Aurelius84 on Sep 26, 2019
  
  2b5b4b3c
- T
  enhance shape error message of mul_op (#19998) · bcb2903e
  Committed by Tao Luo on Sep 26, 2019
```
test=develop
```
  bcb2903e
- C
  Add LoD empty check for all related sequence ops (#19980) · 1409586e
  Committed by Chen Weihang on Sep 26, 2019
```
* add lod check for sequence op, test=develop

* delete unnecessary check in expand op, test=develop
```
  1409586e
25 Sep, 2019 6 commits

add kernel for fill_op, test=develop (#19719) · b1bb2384

Committed by zhongpu on Sep 25, 2019

* add kernel for fill_op, test=develop

* modify PADDLE_ENFORCE to PADDLE_ENFORCE_EQ, test=develop

* add op test for fill_op, test=develop

* REGISTER COP CUDA KERNEL, test=develop

* update test_fill_op.py, test=develop

* change FillConstantOpVarTypeInference to FillOpVarTypeInference, test=develop

* fix op test, test=develop

* add header file, test=develop

b1bb2384

add support tensor and tensorlist for strided_slice OP (#19929) · 382d099d

Committed by wangchaochaohu on Sep 25, 2019

* add support tensor and tensorlist for strided_slice OP test=develop

* fix the comment test=develop

* fix test=develop

* fix the bug test=develop

* delete log test=develop

* fix API.spec test=develop

* fix test=develop

382d099d

L
Fix OpTest of bn (#19062) · 619a241b
Committed by lvmengsi on Sep 25, 2019
```
* fix bn
```
619a241b

add support of matmul with multiple head even different width and height (#19708) · c670058a

Committed by Bob Zhu on Sep 25, 2019

* add support of matmul with multiple head even different width and height

Original matmul with multiple head supports only the mat_a.width == mat_b.height,
in that case, mat_b will be horizontally split. In this patch, we extend the
support when mat_a.width != mat_b.height but mat_a.width/head_number == mat_b.height,
in this case, mat_b will be vertically split.

One example is A is [3, 8], B is [2, 16], head_number is 4. In this
case, A will be split as [3, 2], B will be (vertically) split as
[2, 4]. The final result will be 4 matrices of [3, 4], i.e. [3, 16].

test=develop

* refactor the code of matmul with multiple head even different width and height

test=develop

c670058a
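
The worked example in this commit message (A is [3, 8], B is [2, 16], head_number is 4) can be reproduced in a few lines. This is a minimal pure-Python sketch of one reading of that description, not the Paddle kernel: the helper names are hypothetical, and both matrices are assumed to be cut into equal-width column blocks, one block per head.

```python
def matmul(a, b):
    """Plain matrix product of two row-major nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def split_columns(m, parts):
    """Cut a matrix into `parts` column blocks of equal width."""
    assert len(m[0]) % parts == 0, "width must be divisible by parts"
    width = len(m[0]) // parts
    return [[row[i * width:(i + 1) * width] for row in m]
            for i in range(parts)]


def multihead_matmul(a, b, head_number):
    """Multiply matching column blocks of a and b, then concatenate."""
    a_blocks = split_columns(a, head_number)  # e.g. 4 blocks of [3, 2]
    b_blocks = split_columns(b, head_number)  # e.g. 4 blocks of [2, 4]
    products = [matmul(x, y) for x, y in zip(a_blocks, b_blocks)]
    # Join the per-head results side by side, row by row.
    return [[v for p in products for v in p[r]] for r in range(len(a))]


a = [[1] * 8 for _ in range(3)]    # A is [3, 8]
b = [[1] * 16 for _ in range(2)]   # B is [2, 16]
out = multihead_matmul(a, b, 4)
out_shape = (len(out), len(out[0]))
```

Each head multiplies a [3, 2] block by a [2, 4] block, and the four [3, 4] products are joined along the column axis, giving the [3, 16] result described in the message.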

L
refine ctc align op with padding (#19926) · 6884dc80
Committed by Liufang Sang on Sep 25, 2019
```
* refine ctc align op with padding 
* refine api sample code
```
6884dc80

Removing length dims constraints of seq_pad and seq_unpad (#19497) · 99a9615a

Committed by Aurelius84 on Sep 25, 2019

* Removing last dims constraints of seq_pad and seq_unpad test=develop

* fix test_layer api code test=develop

* fix sequence_pad_op.cc conflict test=develop

* remove test_analyzer_mm_dnn test=develop

* fix vectorize bug test=develop

* fix vectorize<int> test=develop

99a9615a

24 Sep, 2019 4 commits

J

add optimizer:dpsgd,test=develop (#19915) · 766bd529
Committed by jhjiangcs on Sep 24, 2019

766bd529

Add float16 support to `sync_batch_norm_op` (#19681) · ebff68fa

Committed by Yang Zhang on Sep 24, 2019

* Add float16 support to `sync_batch_norm_op`

test=develop

* Add test for sync_bn with FP16 input

test=develop

ebff68fa

Remove constraint that last dimension is forced to be 1 by adding lookup_table_v2 (#19735) · 039b9710

Committed by Aurelius84 on Sep 24, 2019

* Remove constraint that last dimension is forced to be 1 by adding
lookup_table_v2 test=develop

* modify into PADDLE_ENFORCE_CUDA_SUCCESS test=develop

* Revert "modify into PADDLE_ENFORCE_CUDA_SUCCESS test=develop"

This reverts commit 8a960bfc61e51aa27c3c529df8fb90b93ebd19f9.

* move api into fluid.embedding test=develop

* fix example code test=develop

* move one_hot into fluid.one_hot

* modify api.spec test=develop

* fix loss shape test=develop

039b9710

support change shuffle and train thread num (#19841) · cedc0477

Committed by xujiaqi01 on Sep 24, 2019

* support change shuffle thread num
* support change train thread num
* fix receive shuffle data of each channel
* data norm stop gradient
* add check thread_tensor type and root_tensor type when merge metric
* remove sleep in shuffle, add config
* add config of pslib client to client communication
* fix xbox str
* add data norm op testcase
* add flush in trainer finalize

cedc0477
