1. 27 Sep 2019, 6 commits
  2. 26 Sep 2019, 15 commits
  3. 25 Sep 2019, 13 commits
    • Add AdadeltaOptimizer doc (#19875) · 4a5ce4fe
      Zeng Jinle authored
      * add AdadeltaOptimizer doc, test=develop

      * refine doc, test=develop

      * follow lanxiang's comments, test=develop, test=document_fix
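      A minimal usage sketch of the documented optimizer, assuming the fluid
      1.x API of this release (the toy network x/y/pred is illustrative only):

        import paddle.fluid as fluid

        x = fluid.layers.data(name='x', shape=[13], dtype='float32')
        y = fluid.layers.data(name='y', shape=[1], dtype='float32')
        pred = fluid.layers.fc(input=x, size=1)
        loss = fluid.layers.reduce_mean(
            fluid.layers.square_error_cost(input=pred, label=y))

        # rho and epsilon below are the optimizer's documented defaults.
        optimizer = fluid.optimizer.AdadeltaOptimizer(
            learning_rate=0.01, epsilon=1.0e-6, rho=0.95)
        optimizer.minimize(loss)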
    • Expose set_gradient_clip API (#19869) · 7912e6ca
      Zeng Jinle authored
      * expose set_gradient_clip, test=develop, test=document_preview, test=preview
      
      * expose gradient clip, test=develop, test=document_fix
      
      * refine doc, test=develop
      
      * follow lanxiang's comments, test=develop, test=document_fix
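      A minimal sketch of the newly exposed API, assuming the fluid 1.x
      interface (the toy regression network is illustrative only):

        import paddle.fluid as fluid

        x = fluid.layers.data(name='x', shape=[13], dtype='float32')
        y = fluid.layers.data(name='y', shape=[1], dtype='float32')
        pred = fluid.layers.fc(input=x, size=1)
        loss = fluid.layers.reduce_mean(
            fluid.layers.square_error_cost(input=pred, label=y))

        # Register a clipping rule before minimize(): all parameter
        # gradients are rescaled so their global norm stays under 1.0.
        fluid.clip.set_gradient_clip(
            fluid.clip.GradientClipByGlobalNorm(clip_norm=1.0))
        fluid.optimizer.SGD(learning_rate=0.01).minimize(loss)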
    • refine deformable roi pooling doc (#19944) · 0099e549
      chengjuntao authored
      * refine doc, test=develop, test=document_preview
    • add kernel for fill_op, test=develop (#19719) · b1bb2384
      zhongpu authored
      * add kernel for fill_op, test=develop
      
      * modify PADDLE_ENFORCE to PADDLE_ENFORCE_EQ, test=develop
      
      * add op test for fill_op, test=develop
      
      * REGISTER OP CUDA KERNEL, test=develop
      
      * update test_fill_op.py, test=develop
      
      * change FillConstantOpVarTypeInference to FillOpVarTypeInference, test=develop
      
      * fix op test, test=develop
      
      * add header file, test=develop
    • add support for Tensor and TensorList in strided_slice OP (#19929) · 382d099d
      wangchaochaohu authored
      * add support tensor and tensorlist for strided_slice OP test=develop
      
      * fix the comment test=develop
      
      * fix test=develop
      
      * fix the bug test=develop
      
      * delete log test=develop
      
      * fix API.spec test=develop
      
      * fix test=develop
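      A minimal sketch of the extended interface, assuming the fluid 1.x API:
      starts/ends/strides may now be Tensors (or lists containing Tensors)
      instead of plain Python ints.

        import paddle.fluid as fluid

        x = fluid.layers.data(name='x', shape=[5, 6], dtype='float32',
                              append_batch_size=False)
        # Plain Python lists still work:
        out1 = fluid.layers.strided_slice(
            x, axes=[0, 1], starts=[0, 0], ends=[3, 6], strides=[1, 2])
        # With this change, starts can also be a 1-D integer Tensor:
        starts = fluid.layers.fill_constant(shape=[2], dtype='int32', value=0)
        out2 = fluid.layers.strided_slice(
            x, axes=[0, 1], starts=starts, ends=[3, 6], strides=[1, 2])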
    • Fix ssdloss num and batch norm format and conv2d (#19754) · fe218df3
      lvmengsi authored
      * update API.spec
    • Fix OpTest of bn (#19062) · 619a241b
      lvmengsi authored
      * fix bn
    • Avoid treating broadcast as initialization operation (#19857) · 5920d69d
      ShenLiang authored
      * treat broadcast as non-initial, test=develop
      
      * rename the class name
      
      * rename the class name, test=develop
    • add support of matmul with multiple head even with different width and height (#19708) · c670058a
      Bob Zhu authored
      * add support of matmul with multiple head even with different width and height

      The original matmul with multiple head supports only mat_a.width == mat_b.height;
      in that case, mat_b will be horizontally split. This patch extends the
      support to mat_a.width != mat_b.height as long as
      mat_a.width / head_number == mat_b.height; in this case, mat_b will be
      vertically split.

      One example: A is [3, 8], B is [2, 16], head_number is 4. A will be
      split into four [3, 2] blocks and B will be (vertically) split into
      four [2, 4] blocks. The final result is 4 matrices of [3, 4],
      concatenated into [3, 16].

      test=develop

      * refactor the code of matmul with multiple head even with different width and height

      test=develop
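      A NumPy sketch of the split described above (illustrative only, not the
      Paddle kernel): A is [3, 8], B is [2, 16], head_number is 4.

        import numpy as np

        def multi_head_matmul(a, b, head_number):
            # Split A along its width: [3, 8] -> four [3, 2] blocks.
            a_blocks = np.split(a, head_number, axis=1)
            # Split B along its width: [2, 16] -> four [2, 4] blocks.
            b_blocks = np.split(b, head_number, axis=1)
            # Block-wise matmul gives four [3, 4] results; concatenate
            # them horizontally into [3, 16].
            return np.concatenate(
                [ai @ bi for ai, bi in zip(a_blocks, b_blocks)], axis=1)

        out = multi_head_matmul(np.ones((3, 8)), np.ones((2, 16)), 4)
        assert out.shape == (3, 16)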
    • refine ctc align op with padding (#19926) · 6884dc80
      Liufang Sang authored
      * refine ctc align op with padding 
      * refine api sample code
    • add input type and dtype check for softmax_op (#19975) · 65a02fc1
      Tao Luo authored
      * add input type and dtype check for softmax_op
      
      test=develop
      
      * refine error message
      
      test=develop
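      A minimal sketch of what the new check enforces, assuming the fluid 1.x
      API: softmax now rejects unsupported input dtypes up front with a clear
      message instead of failing later inside the kernel.

        import paddle.fluid as fluid

        x = fluid.layers.data(name='x', shape=[10], dtype='float32')
        out = fluid.layers.softmax(x)  # OK: float32 is supported

        x_int = fluid.layers.data(name='x_int', shape=[10], dtype='int32')
        # fluid.layers.softmax(x_int)  # now fails early with a type error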
    • Add support for new QAT models (#18970) · 4286a627
      Wojciech Uss authored
      * Add support for new QAT models
      
      test=develop
      Co-Authored-By: Michał Gallus <michal.gallus@intel.com>
      Co-Authored-By: Wojciech Uss <wojciech.uss@intel.com>
      
      * fixed fps results
      
      test=develop
      
      * fix top5 accuracy drop problem
      
      * updated for new QAT models
      
      * skip quantizing average pooling - dirty but working
      
      * add missing pass
      
      * added missing conv+brelu fuse pass
      
      * removed a call to non-existent pass
      
      test=develop
      
      * renamed pass
      
      test=develop
      
      * Adjust finding pooling scale to newest QAT models
      
      * Remove unnecessary code from quantization_mkldnn_pass
      
      * Copy Pooling input scale to output scale in QAT
      
      * Refactor & remove unused code in QAT
      
      * Incorporate fp32 FC into QAT
      
      test=develop
      
      * Enable graph drawing with debug flag
      
      test=develop
      
      * Add tests for QATv2
      
      * Fix paths for QATv2 models
      
      test=develop
      
      * Add option to save transformed int8 qat model
      
      test=develop
      
      * Remove redundant lines from qat mkldnn pass
      
      test=develop
      
      * Delegate disablement of avg pooling to qat
      
      test=develop
      
      * fix CI bug, test=develop
      
      * Follow Wangzhen's Review, test=develop
      
      * Update API.spec
      
      test=develop
      
      * Name False in (is_unsigned, TensorScale) tuple
      
      test=develop
    • Removing length dims constraints of seq_pad and seq_unpad (#19497) · 99a9615a
      Aurelius84 authored
      * Removing last dims constraints of seq_pad and seq_unpad test=develop
      
      * fix test_layer api code test=develop
      
      * fix sequence_pad_op.cc conflict test=develop
      
      * remove test_analyzer_mm_dnn test=develop
      
      * fix vectorize bug test=develop
      
      * fix vectorize<int> test=develop
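      A minimal sketch of the relaxed interface, assuming the fluid 1.x API:
      the padded sequence may now carry a feature dimension, i.e. its last
      dimension is no longer forced to be 1.

        import numpy as np
        import paddle.fluid as fluid

        # Each timestep is an 8-dim feature vector.
        x = fluid.layers.data(name='x', shape=[8], dtype='float32',
                              lod_level=1)
        pad_value = fluid.layers.assign(
            input=np.array([0.0], dtype=np.float32))
        padded, lengths = fluid.layers.sequence_pad(x=x, pad_value=pad_value)
        unpadded = fluid.layers.sequence_unpad(x=padded, length=lengths)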
  4. 24 Sep 2019, 6 commits
    • update en document of shard_index_op (#19963) · 2efdf0ef
      Yi Liu authored
      test=develop
      test=document_fix
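      A minimal sketch of the documented op, assuming the fluid 1.x API:
      labels that fall into the given shard are re-indexed within it, all
      others become ignore_value.

        import paddle.fluid as fluid

        label = fluid.layers.data(name='label', shape=[1], dtype='int64')
        # 20 classes split over 2 shards; this program owns shard 0, so
        # ids 0..9 map to 0..9 and ids 10..19 become -1.
        shard_label = fluid.layers.shard_index(
            input=label, index_num=20, nshards=2, shard_id=0, ignore_value=-1)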
    • add optimizer: dpsgd, test=develop (#19915) · 766bd529
      jhjiangcs authored
    • Add float16 support to `sync_batch_norm_op` (#19681) · ebff68fa
      Yang Zhang authored
      * Add float16 support to `sync_batch_norm_op`
      
      test=develop
      
      * Add test for sync_bn with FP16 input
      
      test=develop
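      A minimal sketch, assuming the fluid 1.x convention that sync batch
      norm is fluid.layers.batch_norm combined with
      build_strategy.sync_batch_norm = True; with this change the op also
      accepts float16 inputs.

        import paddle.fluid as fluid

        x = fluid.layers.data(name='x', shape=[3, 32, 32], dtype='float16')
        bn = fluid.layers.batch_norm(input=x)

        build_strategy = fluid.BuildStrategy()
        build_strategy.sync_batch_norm = True  # sync stats across devices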
    • Remove constraint that last dimension is forced to be 1 by adding lookup_table_v2 (#19735) · 039b9710
      Aurelius84 authored
      * Remove constraint that last dimension is forced to be 1 by adding
      lookup_table_v2, test=develop
      
      * modify into PADDLE_ENFORCE_CUDA_SUCCESS test=develop
      
      * Revert "modify into PADDLE_ENFORCE_CUDA_SUCCESS test=develop"
      
      This reverts commit 8a960bfc61e51aa27c3c529df8fb90b93ebd19f9.
      
      * move api into fluid.embedding test=develop
      
      * fix example code test=develop
      
      * move one_hot into fluid.one_hot
      
      * modify api.spec test=develop
      
      * fix loss shape test=develop
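      A minimal sketch of the new interface, assuming the fluid 1.x API:
      fluid.embedding (backed by lookup_table_v2) takes ids of any shape,
      without the old trailing dimension of 1.

        import paddle.fluid as fluid

        # ids of shape [batch, 16] -- no [..., 1] suffix needed anymore.
        ids = fluid.layers.data(name='ids', shape=[16], dtype='int64')
        emb = fluid.embedding(input=ids, size=[10000, 128])  # vocab, dim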
    • [PaddleSlim] Enhance compressor api in PaddleSlim (#19894) · bdb3e376
      whs authored
      
      1. Support customize eval function instead of eval program.
      2. Fix loading checkpoint in quantization strategy.
      3. Support saving eval model when saving a checkpoint.
      4. Fix decoder of loading context in PaddleSlim.
      5. Fix restoring from the checkpoint of uniform prune strategy.
      6. Support saving eval model and infer model during training.
      7. Add unit tests for saving eval model, saving infer model and uniform pruning restoring from the checkpoint.
      8. Fix pruning of depthwise_conv_grad op by updating the groups.
    • support changing shuffle and train thread num (#19841) · cedc0477
      xujiaqi01 authored
      * support changing shuffle thread num
      * support changing train thread num
      * fix receiving shuffle data of each channel
      * data norm stop gradient
      * add check thread_tensor type and root_tensor type when merge metric
      * remove sleep in shuffle, add config
      * add config of pslib client to client communication
      * fix xbox str
      * add data norm op testcase
      * add flush in trainer finalize