提交 · c9ea317b367b61c3abc6496e76354fb297b4746c · Crayon鑫 / Paddle

27 9月, 2019 8 次提交

codegen code for reconstruction (#19728) · c9ea317b

由 wangchaochaohu 提交于 9月 27, 2019

* codegen code for reconstruction test=develop

* fix the cmake test=develop

* fix review advice test=develop

c9ea317b

L

fix mul double grad (#20040) · 647ff784
由 lvmengsi 提交于 9月 27, 2019

647ff784

the integrated communicator (#19849) · 8f0b3c05

由 tangwei12 提交于 9月 27, 2019

* add a base class for the Communicator
* add AsyncCommunicator Impl for async distributed training

8f0b3c05

W
Refine document of DGCMomentumOptimizer (#19960) · 8d92b36d
由 WangXi 提交于 9月 26, 2019
```
Refine document of DGCMomentumOptimizer
```
8d92b36d
D
Polish English docs of elementwise_add/sub/mul/div (#20027) · 5cef7a2f
由 danleifeng 提交于 9月 27, 2019
```
Polish English docs of elementwise_add/sub/mul/div
```
5cef7a2f

Paddle error message stack shaping and optimization (#19895) · b9163350

由 Chen Weihang 提交于 9月 27, 2019

* shape and optimize paddle error message stack, test=develop

* limit exception type & add unittest, test=develop

* fix multi-platform problem, test=develop

* fix related unnitest failed, test=develop

* add doc & fix unittest errors, test=develop

* fix function name error, test=develop

* update tensor test exception msg compare, test=develop

* remove unittest on win32, the dir format is different, test=develop

* remove useless package, test=develop

* add paddle enforce handler unittest, test=develop

* add exception checkout, test=develop

* fix coverage failed, test=develop

* fix op registry test failed, test=develop

* refactor whole pr, test=develop

* remove test in CMakelist, test=develop

* fix coverage, test=develop

b9163350

L
Fixed warpctc, test=develop (#20011) · c8e12587
由 Li Fuchen 提交于 9月 27, 2019
```
Use AllocateTmpTensor() for creating temporary tensors in warpctc.
```
c8e12587
H
Update API of paddle.fluid.data (#20024) · 7836f477
由 Huihuang Zheng 提交于 9月 27, 2019
```
Set output type LoDTensor only

After code experiment, I found data doens't support other type
```
7836f477

26 9月, 2019 17 次提交
- 1
  fix APIs, test=document_preview (#19954) · 6c74e738
  由 123malin 提交于 9月 26, 2019
```
* fix DistributeTranspilerConfig document, test=develop
```
  6c74e738
- W
  
  fix reduce bug test=develop (#19971) · 3409db95
  由 wangchaochaohu 提交于 9月 26, 2019
  
  3409db95
- W
  Make PaddleSlim support PyReader (#19995) · 3ea2b661
  由 whs 提交于 9月 26, 2019
```
* Make PaddleSlim support PyReader.
* Fix unittest of sensitive pruning.
* Add some assert.
```
  3ea2b661
- A
  MKLDNN BatchNorm operator refactor (#20012) · 4b65af77
  由 Adam 提交于 9月 26, 2019
```
test=develop
```
  4b65af77
- J
  Fix test pool2d int8 mkldnn (#19976) · 1d32897c
  由 joanna.wozna.intel 提交于 9月 26, 2019
```
* Fix conv2d+dequantize squash for residual fusion

test=develop

* Correct int8 input

test=develop

* Add if exclude or include padding in pool2d mkldnn

test=develop
```
  1d32897c
- C
  disable fuse_all_optimizer_ops (#19966) · 2450d15b
  由 chengduo 提交于 9月 26, 2019
```
test=develop
```
  2450d15b
- A
  Require x.dims=label.dims in huber_loss (#20017) · f58c8db6
  由 Aurelius84 提交于 9月 26, 2019
```
* x.dims == y.dims test=develop

* refine comment
```
  f58c8db6
- Y
  Expose `mutable_data` as python binding (#19932) · cde73a7b
  由 Yang Zhang 提交于 9月 26, 2019
```
* Expose `mutable_data` as python binding

test=develop

* Add test for device pointer binding

test=develop

* Make test compatible with python 2
```
  cde73a7b
- A
  Remove constraint that last dimension is forced to be 1 in rank_loss (#19997) · 137e6336
  由 Aurelius84 提交于 9月 26, 2019
```
* fix input shape check test=develop

* move PADDLE_ENFORCE test=develop
```
  137e6336
- C
  Add dtype for coalesce_tensor_op (#20016) · 101a2b61
  由 chengduo 提交于 9月 26, 2019
```
Add dtype for coalesce_tensor_op
```
  101a2b61
- Z
  fix if else error info (#19974) · f04f2b23
  由 Zhaolong Xing 提交于 9月 26, 2019
```
test=develop
test=document_fix
```
  f04f2b23
- G
  Polish elementwise max min pow document to add more examples. (#19946) · a7512db2
  由 gongweibao 提交于 9月 26, 2019
```
Polish elementwise max min pow document to add more examples
```
  a7512db2
- A
  
  fix dataType in C++ comment in embedding op (#20004) · 2b5b4b3c
  由 Aurelius84 提交于 9月 26, 2019
  
  2b5b4b3c
- T
  enhance shape error message of mul_op (#19998) · bcb2903e
  由 Tao Luo 提交于 9月 26, 2019
```
test=develop
```
  bcb2903e
- M
  fix doc of apply_optimize (#19965) · d62360fe
  由 mapingshuo 提交于 9月 26, 2019
```
* fix doc of apply_optimize
test=document_fix
test=document_preview

* modify doc of backward
test=develop
test=document_fix

* modify document hash
test=develop
test=document_preview
```
  d62360fe
- C
  Add LoD empty check for all related sequence ops (#19980) · 1409586e
  由 Chen Weihang 提交于 9月 26, 2019
```
* add lod check for sequence op, test=develop

* delete unnecessary check in expend op, test=develop
```
  1409586e
- H
  Add new data layer (#19916) · 88af4ab6
  由 Huihuang Zheng 提交于 9月 26, 2019
```
The new "fluid.data" changes old "fluid.layers.data":

1. Add shape and dtype check.
2. Remove "append_batch_size" parameter. We won't offer this in the new data layer because other deep learning platforms don't have this kind of data layer pre-processing. It may confuse users.
3. Remove "stop gradient" parameter because the data layer doesn't do back-propagation

TODO：
Now data layer feeded by executor is checked, will we want to check the feed data of readers in the future?
```
  88af4ab6
25 9月, 2019 15 次提交

X
fix memory leak in HogwildWorker (#19956) · f50e701b
由 xujiaqi01 提交于 9月 25, 2019
```
fix memory leak in HogwildWorker,  whose ops are  explicitly deleted in destructor
```
f50e701b
Z

fix buddy_allocator_test, test=develop (#19967) · b8aff5e5
由 Zeng Jinle 提交于 9月 25, 2019

b8aff5e5

Add AdadeltaOptimizer doc (#19875) · 4a5ce4fe

由 Zeng Jinle 提交于 9月 25, 2019

* add AdadeltaOptimizer doc, test=develop

* refine doc,test=develop

* folllow lanxiang's comments, test=develop, test=document_fix

4a5ce4fe

Expose set_gradient_clip API (#19869) · 7912e6ca

由 Zeng Jinle 提交于 9月 25, 2019

* expose set_gradient_clip, test=develop, test=document_preview, test=preview

* expose gradient clip, test=develop, test=document_fix

* refine doc, test=develop

* follow lanxiang's comments, test=develop, test=document_fix

7912e6ca

C
refine deformable roi pooling doc (#19944) · 0099e549
由 chengjuntao 提交于 9月 25, 2019
```
* refine doc, test=develop, test=document_preview
```
0099e549

add kernel for fill_op, test=develop (#19719) · b1bb2384

由 zhongpu 提交于 9月 25, 2019

* add kernel for fill_op, test=develop

* modify PADDLE_ENFORCE to PADDLE_ENFORCE_EQ, test=develop

* add op test for fill_op, test=develop

* REGISTER COP CUDA KERNEL, test=develop

* update test_fill_op.py, test=develop

* change FillConstantOpVarTypeInference to FillOpVarTypeInference, test=develop

* fix op test, test=develop

* add head file, test=develop

b1bb2384

add support tensor and tensorlist for strided_slice OP (#19929) · 382d099d

由 wangchaochaohu 提交于 9月 25, 2019

* add support tensor and tensorlist for strided_slice OP test=develop

* fix the commnet test=develop

* fix test=develop

* fix the bug test=develop

* delete log test=develop

* fix API.spec test=develop

* fix test=develop

382d099d

L
Fix ssdloss num and batch norm format and conv2d (#19754) · fe218df3
由 lvmengsi 提交于 9月 25, 2019
```
* update API.spec
```
fe218df3
L
Fix OpTest of bn (#19062) · 619a241b
由 lvmengsi 提交于 9月 25, 2019
```
* fix bn
```
619a241b

add support of matmul with multiple head even different width and height (#19708) · c670058a

由 Bob Zhu 提交于 9月 25, 2019

* add support of matmul with multiple head even different width and height

Original matmul with multiple head supports only the mat_a.width == mat_b.height,
in that case, mat_b will be horizontally split. In this patch, we extend the
support when mat_a.width != mat_b.height but mat_a.width/head_number == mat_b.height,
in this case, mab_b will be vertically split.

One example is A is [3, 8], B is [2, 16], head_number is 4. In this
case, A will be split as [3, 2], B will be (vertically) split as
[2, 4]. The final result will be 4 matrix of 4 matrix of [3,4], i.e. [3, 16]

test=develop

* add support of matmul with multiple head even different width and height

Original matmul with multiple head supports only the mat_a.width == mat_b.height,
in that case, mat_b will be horizontally split. In this patch, we extend the
support when mat_a.width != mat_b.height but mat_a.width/head_number == mat_b.height,
in this case, mab_b will be vertically split.

One example is A is [3, 8], B is [2, 16], head_number is 4. In this
case, A will be split as [3, 2], B will be (vertically) split as
[2, 4]. The final result will be 4 matrix of 4 matrix of [3,4], i.e. [3, 16]

test=develop

* refactor the code of matmul with multiple head even different width and height

test=develop

c670058a

L
refine ctc align op with padding (#19926) · 6884dc80
由 Liufang Sang 提交于 9月 25, 2019
```
* refine ctc align op with padding 
* refine api sample code
```
6884dc80

FIx C++ inference BUG: When open memory optim and enable trt subgraph at the... · e89b1288

由 Zhaolong Xing 提交于 9月 25, 2019

FIx C++ inference BUG: When open memory optim and enable trt subgraph at the same time, there is a bug (#19969)

* fix memory optimization type
test=develop

* 1. fix BUG: open trt and memory optim will trigger bug.
2. Clean memory optim bug.
test=develop

e89b1288

Add support for new QAT models (#18970) · 4286a627

由 Wojciech Uss 提交于 9月 25, 2019

* Add support for new QAT models

test=develop
Co-Authored-By: NMichał Gallus <michal.gallus@intel.com>
Co-Authored-By: NWojciech Uss <wojciech.uss@intel.com>

* fixed fps results

test=develop

* fix top5 accuracy drop problem

* updated for new QAT models

* skip quantizing average pooling - dirty but working

* add missing pass

* added missing conv+brelu fuse pass

* removed a call to non-existent pass

test=develop

* renamed pass

test=develop

* Adjust finding pooling scale to newest QAT models

* Remove unnecessary code from quantization_mkldnn_pass

* Copy Pooling input scale to output scale in QAT

* Refactor & remove unused code in QAT

* Incorporate fp32 FC into QAT

test=develop

* Enable graph drawing with debug flag

test=develop

* Add tests for QATv2

* Fix paths for QATv2 models

test=develop

* Add option to save transformed int8 qat model

test=develop

* Remove redundant lines from qat mkldnn pass

test=develop

* Delegate disablement of avg pooling to qat

test=develop

* fix CI bug, test=develop

* Follow Wangzhen's Review, test=develop

* Update API.spec

test=develop

* Name False in (is_unsigned, TensorScale) tuple

test=develop

4286a627

Removing length dims constraints of seq_pad and seq_unpad (#19497) · 99a9615a

由 Aurelius84 提交于 9月 25, 2019

* Removing last dims constraints of seq_pad and seq_unpad test=develop

* fix test_layer api code test=develop

* fix sequence_pad_op.cc conflict test=develop

* remove test_analyzer_mm_dnn test=develop

* fix vectorize bug test=develop

* fix vectorize<int> test=develop

99a9615a

C
polish multi process warning info (#19961) · cca26f5c
由 chengduo 提交于 9月 25, 2019
```
test=develop
```
cca26f5c

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致