提交 · fa7ace7cf2859f927c26f1970bbc2f5551532df1 · 机器未来 / Paddle

10 1月, 2020 2 次提交

Cherry pick from #21862 (#22194) · fa7ace7c

由 Guo Sheng 提交于 1月 10, 2020

* Fix default label dim of label_smooth_op. test=develop (#21862)

* Fix unit tests of label_smooth_op's data size.

fa7ace7c

[cherry-pick] Add FC padding, ernie test unit and layernorm parallel (#22198) · 3df38f5c

由 GaoWei8 提交于 1月 10, 2020

* Optimize the kernel implementation of layernorm with openmp (#20895)

* Add ernie c++ inference test (#21015)

* Add ernie unit test
test=develop

* Add ernie unit test
test=develop

* Add ernie unit test
test=develop

* remove ngraph

* optimize gpu test
test=develop

* optimize codes
test=develop

* fix cmake fails on inference_download_and_uncompress (#21185)

* solve cmake fails on inference_download_and_uncompress
test=develop

* solve cmake fails on inference_download_and_uncompress
test=develop

* Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)

* Add fc padding to solve mkl performance
test=develop

* fix gpu pass and error information
test=develop

* fix fc_fuse_pass_test
test=develop

* fix error information
test=develop

* fix error information
test=develop

* fix name and add fc op padding test
test=develop

* fix attributes
test=develop

* optimize fc padding
test=develop

* fix test
test=develop

* Polish the codes of fc when needs padding (#21378)

test=develop

* Add ernie large c++ inference test (#21365)

* add ernie-large test
test=develop

* add ernie large c++ inference test
test=develop

* Modify padding strategy: remove weight copy in fc padding (#21650)

test=develop

* optimize fc jit (#21878)

test=develop
Co-authored-by: NYihua Xu <yihuaxu@hotmail.com>

3df38f5c

09 1月, 2020 3 次提交
- Z
  [cherry-pick] Fix windows build no kernel issue, test=develop (#22105) (#22184) · 91706d3b
  由 zhaoyuchen2018 提交于 1月 09, 2020
```
windows conv_fusion failed as no kernel， explicit declare lambda
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
```
  91706d3b
- C
  
  fix softmax_with_cross_entropy_fix bug, test=develop (#21810) (#22183) · bc385a29
  由 Chen Weihang 提交于 1月 09, 2020
  
  bc385a29
- W
  [Cherry-pick 1.6] fix batch_norm_grad shape=0 & allreduce shape enforce &... · 515b206d
  由 WangXi 提交于 1月 09, 2020
```
[Cherry-pick 1.6] fix batch_norm_grad shape=0 & allreduce shape enforce & sync_batch_norm hang in fleet (#22157)
```
  515b206d
08 1月, 2020 1 次提交
- Z
  [cherry-pick] Fix softmax cuda bug (#21720) (#22160) · b9a1d954
  由 zhaoyuchen2018 提交于 1月 08, 2020
```
* Fix softmax cuda bug

* Refine multihead log and softmax logic

* Align block to 32
```
  b9a1d954
07 1月, 2020 2 次提交

Fix optimizer op infershape failed in dygraph multi-cards mode (#21374) (#22112) · 34ef38c8

由 Chen Weihang 提交于 1月 07, 2020

* add param & grad shape check for sgd op

* add _reshape_inplece interface for dygraph parallel

* refine unittest based paddle/models scripts, test=develop

* add unittest for parallel grad fuse, test=develop

34ef38c8

【cherry-pick】fix decay param and overflow in match_matrix (#22107) · eb6d3396

由 Aurelius84 提交于 1月 07, 2020

* fix decay param in DecayAdagrad test=develop (#22026)

* fix integer overflow in match_matrix (#22036)

* fix integer overflow in match_matrix test=develop

* fix integer overflow in match_matrix test=develop

* fix typo test=develop

eb6d3396

06 12月, 2019 2 次提交
- B
  
  cherry-pick MKL-DNN NHWC FWD support fix (#21593) · 1f598dfa
  由 bingyanghuang 提交于 12月 06, 2019
  
  1f598dfa
- A
  
  cherry-pick pyramid_hash op test=develop (#20779)(#18525) (#21562) · f83254d6
  由 Aurelius84 提交于 12月 06, 2019
  
  f83254d6
05 12月, 2019 1 次提交

[Cherry-pick] fix the computation for dx (grad for x) for prelu operation. (#20949) (#21514) · 40549473

由 lilong12 提交于 12月 05, 2019

* fix the computation for dx (grad for x) for prelu operation. (#20949)

* set the default value of alpha for prelu to 0.25, test=develop

* add the call to __syncthreads(), test=develop

* fix the implementation of cpu prelu, test=develop

* repair the implementation of element mode prelu, test=develop

* modify test_prelu_op.py, test=develop

40549473

04 12月, 2019 2 次提交
- W
  
  Fix dgc clip & rampup step, test=release/1.6 (#21519) · 3f1169fe
  由 WangXi 提交于 12月 04, 2019
  
  3f1169fe
- B
  
  [cherry pick] Conv2d and Conv2d transpose MKL-DNN NHWC support (#21525) · 0e63746b
  由 bingyanghuang 提交于 12月 04, 2019
  
  0e63746b
03 12月, 2019 10 次提交
- L
  set dim[0] to -1 if dim[0] < 0 during compiling for c_allgather op (#21402) (#21512) · df2b4002
  由 lilong12 提交于 12月 03, 2019
```
* set dim[0] to -1 if dim[0] < 0 and remove assertion to runtime, test=develop
```
  df2b4002
- L
  Fix transpose conv (#21406), test=release/1.6 (#21510) · 1fbc45b7
  由 Lv Mengsi 提交于 12月 03, 2019
```
* fix transpose conv,test=develop

* fix comments
test=develop
```
  1fbc45b7
- Z
  [cherry-pick] Improve argsort performance. (#21267) (#21442) · 66c18f4a
  由 zhaoyuchen2018 提交于 12月 03, 2019
```
* Improve argsort performance.

- Give 200000 data to compute argsort on v100,
can speed up ~190x
before opt cost: 0.53s
after opt cost:0.0027s

- Add fp16 support

* Refine error message
* Refine code
* Add descending sort

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
```
  66c18f4a
- K
  [cherry-pick] add Adam beta1/beta2 support Variable (#21433) · 735a2db0
  由 Kaipeng Deng 提交于 12月 03, 2019
```
* add Adam beta1/beta2 support Variable. test=develop
```
  735a2db0
- Z
  [cherry-pick] Add Asypadding for conv fusion. (#21041) (#21439) · 2660107c
  由 zhaoyuchen2018 提交于 12月 03, 2019
```
* Add Asypadding for conv fusion.

test=develop

reference: pr/20042

* Fix eigen build link error

* Change back file mode

* Use math function & add more checks.
```
  2660107c
- L
  add the framework support for distfc (#21197) (#21463) · e06f4439
  由 lilong12 提交于 12月 03, 2019
```
* add the framework support for distfc and ut, test=develop
* fix the implementation of shard_index_op, test=develop
```
  e06f4439
- K
  [cherry-pick] add bn momentum variable (#21435) · 9c63b7c1
  由 Kaipeng Deng 提交于 12月 03, 2019
```
* batch_norm momentum support variable. test=develop
```
  9c63b7c1
- P
  
  show shape diff in wrong trt input shape errmsg, test=develop (#21451) (#21470) · badaaee6
  由 Pei Yang 提交于 12月 03, 2019
  
  badaaee6
- B
  
  cherry-pick LRN and Pool2d (FWD) NHWC support (#21476) · ccb508dc
  由 bingyanghuang 提交于 12月 03, 2019
  
  ccb508dc
- W
  
  cherry-pick fix shape check in density_prior_box, test=release/1.6 (#21474) · 9ab738aa
  由 wangguanzhong 提交于 12月 03, 2019
  
  9ab738aa
02 12月, 2019 3 次提交

[cherry-pick] Improve topk performance. (#21087) (#21441) · 5dbe9e59

由 zhaoyuchen2018 提交于 12月 02, 2019

* Improve topk performance.

give 200000 data to compute topk,
before opt: cost 1s
after opt: cost 0.0028s.

* Refine return value.
* Add cuda util funtions.
* Fix ComputeBlockSize bug & refine comments.
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

5dbe9e59

[cherry-pick] Fix multihead op bug. (#20783) (#21438) · 2f0f10b3

由 zhaoyuchen2018 提交于 12月 02, 2019

The op should handle k=1024
Fix seq_len < warpsize error.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

2f0f10b3

Z
[cherry-pick] Fix gru as small frame_size has error. (#20922) (#21440) · 873b32de
由 zhaoyuchen2018 提交于 12月 02, 2019
```
seems shuffle_sync cannot handle small size

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
```
873b32de

30 11月, 2019 1 次提交
- Y
  Fix the crash issue when scale or bias was null-pointer. (#21284) (#21444) · 408e638c
  由 Yihua Xu 提交于 11月 30, 2019
```
* Fix the crash issue when scale or bias was null-pointer.

* Add the error message for passing CI.

test=release/1.6
```
  408e638c
29 11月, 2019 1 次提交
- W
  
  Fix dgc accuracy by mv regularization to local, test=release/1.6 (#21390) · 6ce49eea
  由 WangXi 提交于 11月 29, 2019
  
  6ce49eea
26 11月, 2019 4 次提交
- L
  [Cherry pick] instance_norm, gradients and batch_norm (#21301) · 97bbab47
  由 Lv Mengsi 提交于 11月 26, 2019
```
* Fix gradients (#20857)

* fix_gradients

* fix_gradients, test=develop

* fix instance norm (#21042)

* fix instance norm

* update unitest,test=develop

* fix_bn

* revert unittest,test=develop
```
  97bbab47
- B
  
  [cherry-pick] Refactor mkldnn eletwise_mul and error message for NHWC in mkldnn (#21361) · 03dda317
  由 bingyanghuang 提交于 11月 26, 2019
  
  03dda317
- W
  
  [Cherry-pick 1.6] Fix dgc buffer illegal & reuse velocity & fix fuse (#21281) · 93c7f058
  由 WangXi 提交于 11月 26, 2019
  
  93c7f058
- W
  
  Fix INF bug of softmax_cross_entropy_op, test=release/1.6 (#21283) · 3423f0b6
  由 WangXi 提交于 11月 26, 2019
  
  3423f0b6
25 11月, 2019 2 次提交

[cherry-pick] fix crop_tensor, maxout and lrn (#21302) · 3848f720

由 Zhang Ting 提交于 11月 25, 2019

* [cherry-pick] All elements in attr(shape) of crop_tensor can be -1 and int32/64 kernel registered (#20756)

* All elements in attr(shape) of crop_tensor can be -1, test=develop, test=document_preview

* fix the bug that attr(offsets) should be initialized, test=develop

* [cherry-pick] maxout supports channel_last input (#20846)

* maxout support channel_last input, test=develop

* modified details of Input(X) and Attr(groups, axis) in doc, test=develop

* [cherry-pick] lrn supports channel_last input, test=develop (#20954)

3848f720

Add pre-condition check for fuse optimizer op pass (#21005) (#21305) · 9f004548

由 Chen Weihang 提交于 11月 25, 2019

* add pre condition check for fuse optimizer op pass, test=develop

* add log & set init to zero, test=develop

* fix test_fuse_all_reduce_pass failed, test=develop

* polish details, test=develop

* refine PADDLE_ENFORCE & remove needless VLOG, test=develop

* refactor op check method, test=develop

9f004548

24 11月, 2019 1 次提交
- A
  Fix GELU grad error (#21321) · eaf82528
  由 Adam 提交于 11月 23, 2019
```
test=develop
```
  eaf82528
23 11月, 2019 1 次提交
- K
  [cherry-pick] fix elementwise mod (#21315) · 5e35e5ea
  由 Kaipeng Deng 提交于 11月 23, 2019
```
* fix elementwise_mod FP kernel. test=develop

* fix unittest. test=develop
```
  5e35e5ea
21 11月, 2019 2 次提交

Cherry-pick error type support for release1.6 (#21294) · 974b8a83

由 Chen Weihang 提交于 11月 21, 2019

* delete paddle infershape enforce marco (#20832)

* Polish and arrange code in enforce.h (#20901)

* Enrich the type of error and declare the error type interfaces (#21024)

* Enrich the type of error and declare the error type interfaces, test=develop

* adjust tests to adapt new form, test=develop

* add inference deps with error_codes.pb.h, test=develop

* restore stack iter start pos, test=develop

* polish code based review comments, test=develop

* Add dependency for error_codes.proto (#21084)

* fix activation_functions deps, test=develop, test=document_fix

* add error_codes_proto deps, test=develop, test=document_fix

* try delete enforce.h, test=develop, test=document_fix

* change cuda enforce & add example (#21142)
test=release/1.6

974b8a83

[cherry-pick]fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation,... · 7ab85396

由 liym27 提交于 11月 21, 2019

[cherry-pick]fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation, _get_padding_with_SAME and conv2dtranspose_forward_naive. (#20997) (#21225)

* fix bug in pool/conv/conv_transpose:
    1. It should be stride[i] not stride[0] in UpdatePaddingAndDilation;
    2. fix bug of func  _get_padding_with_SAME in test_conv/conv_transpose_op.py;
    3. fix bug of the computation process in function conv2dtranspose_forward_naive.
    test=release/1.6

7ab85396

13 11月, 2019 1 次提交
- B
  
  cherry-pick #21059, test=release/1.6 (#21153) · 74ca3ae8
  由 bingyanghuang 提交于 11月 13, 2019
  
  74ca3ae8
07 11月, 2019 1 次提交

[cherry-pick] Add support for asymetric padding in MKLDNN pool, conv and conv_transpose (#21072) · e8890031

由 Adam 提交于 11月 07, 2019

* Add asymetric padding support for mkldnn pooling
test=develop

* Add asymetric padding support for mkldnn conv
test=develop

* Add asymetric padding support for mkldnn conv_transpose
test=develop

e8890031

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致