提交 · 9ab738aacdb15e6fa7722c7f7b3a90ba4178ad70 · BaiXuePrincess / Paddle

03 12月, 2019 1 次提交
- W
  
  cherry-pick fix shape check in density_prior_box, test=release/1.6 (#21474) · 9ab738aa
  由 wangguanzhong 提交于 12月 03, 2019
  
  9ab738aa
02 12月, 2019 5 次提交

[cherry-pick] find lookup table in order & support dump param (#21347) · 893ea7e0

由 Thunderbrook 提交于 12月 02, 2019

* support dump param of model into afs (#20302)

* support dump param to afs
test=develop

* code style
test=develop

* code style
test=develop

* dump param
test=develop

* dump param
test=develop

* dump param
test=develop

* dump param
test=develop

* find lookup table in order (#20932)

test=develop

* cherry-pick
test=develop

* solve pslib core in stop worker
test=develop

* print table stat info for pslib
test=develop

893ea7e0

[cherry-pick] Improve topk performance. (#21087) (#21441) · 5dbe9e59

由 zhaoyuchen2018 提交于 12月 02, 2019

* Improve topk performance.

give 200000 data to compute topk,
before opt: cost 1s
after opt: cost 0.0028s.

* Refine return value.
* Add cuda util funtions.
* Fix ComputeBlockSize bug & refine comments.
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

5dbe9e59

[cherry-pick] Fix multihead op bug. (#20783) (#21438) · 2f0f10b3

由 zhaoyuchen2018 提交于 12月 02, 2019

The op should handle k=1024
Fix seq_len < warpsize error.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

2f0f10b3

Z
[cherry-pick] Fix gru as small frame_size has error. (#20922) (#21440) · 873b32de
由 zhaoyuchen2018 提交于 12月 02, 2019
```
seems shuffle_sync cannot handle small size

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
```
873b32de
Z

CHERRY_PICK: TRT int8: refine trt int8 for dynamic range set (#21112) (#21449) · 0473cdb8
由 Zhaolong Xing 提交于 12月 02, 2019

0473cdb8

30 11月, 2019 1 次提交
- Y
  Fix the crash issue when scale or bias was null-pointer. (#21284) (#21444) · 408e638c
  由 Yihua Xu 提交于 11月 30, 2019
```
* Fix the crash issue when scale or bias was null-pointer.

* Add the error message for passing CI.

test=release/1.6
```
  408e638c
29 11月, 2019 3 次提交
- P
  fix trt weight bug (#21231) (#21443) · 77268831
  由 Pei Yang 提交于 11月 29, 2019
```
added splitter "__" between weight name and suffix number to avoid conflicts.
```
  77268831
- W
  
  Fix dgc accuracy by mv regularization to local, test=release/1.6 (#21390) · 6ce49eea
  由 WangXi 提交于 11月 29, 2019
  
  6ce49eea
- W
  
  Fp32 vs int8 qat C++ performance (#21244) (#21432) · 06545fcf
  由 Wojciech Uss 提交于 11月 29, 2019
  
  06545fcf
28 11月, 2019 1 次提交

cherry-pick1.6 fix cache table bug, add save_paddle_inference_model, fix hdfs util bug (#21339) · 072eb5b6

由 xujiaqi01 提交于 11月 28, 2019

* fix cache table bug, add save_paddle_inference_model, fix hdfs util bug (#21052)

* fix cache table bug
* add save_paddle_inference_model
* fix hdfs util bug
* test=develop

* fix several sparse table issuses (#20686)

* no longer need to define all embedding layers (no one less) of all slots in each program. make trainer_param repeated in ps.proto.
* add find_distributed_lookup_table_grads instead of hard code GRAD
* support embedding stop gradient. push sparse has error before fix this.* 
* fix fill sparse, skip slots which do not have embedding. each slot's embedding in a sparse table should be used in all training programs before fix this.
* fix pull sparse, skip slots which do not have embedding.
* fix collect feasign label info, skip slots which do not have embedding.
* support when there are multi sparse tables in one or multi training programs, each program can pull/push its own related sparse tables instead of all sparse tables.
* test=develop

* add copy table (#21086)

* copy some feasigns and corresponding embeddings from one sparse table to another
* copy all feasigns and corresponding embeddings from one sparse table to another
* copy all dense params from one table to another
* copy some local vars to other local vars

* fix fs_client_param bug (#21212)

* fix fs_client_param bug， user can set this config through fleet_desc_file or fleet config
* test=develop

* fix fleet util bug (#21254)

* fix fleet util bug in save paddle inference model
* test=develop

072eb5b6

26 11月, 2019 4 次提交
- L
  [Cherry pick] instance_norm, gradients and batch_norm (#21301) · 97bbab47
  由 Lv Mengsi 提交于 11月 26, 2019
```
* Fix gradients (#20857)

* fix_gradients

* fix_gradients, test=develop

* fix instance norm (#21042)

* fix instance norm

* update unitest,test=develop

* fix_bn

* revert unittest,test=develop
```
  97bbab47
- B
  
  [cherry-pick] Refactor mkldnn eletwise_mul and error message for NHWC in mkldnn (#21361) · 03dda317
  由 bingyanghuang 提交于 11月 26, 2019
  
  03dda317
- W
  
  [Cherry-pick 1.6] Fix dgc buffer illegal & reuse velocity & fix fuse (#21281) · 93c7f058
  由 WangXi 提交于 11月 26, 2019
  
  93c7f058
- W
  
  Fix INF bug of softmax_cross_entropy_op, test=release/1.6 (#21283) · 3423f0b6
  由 WangXi 提交于 11月 26, 2019
  
  3423f0b6
25 11月, 2019 6 次提交

cherry-pick error info check of Print_op for release1.6 (#21349) · 9a98d11e

由 lijianshe02 提交于 11月 25, 2019

* add input type and input data type check for Print_op test=develop (#21250)

* add input type and input data type check for Print_op test=develop

* cherry-pick error info check of Print_op for release1.6 test=develop

* cherry-pick error info check of Print_op for release1.6 test=develop

9a98d11e

Fix the CAPI ZeroCopy shape error and reuse the code to get output (#21240) (#21345) · c75b162a

由 liu zhengxi 提交于 11月 25, 2019

* fix the CAPI ZeroCopy shape error and reconstruct the output obtain

* use an anonymous namespace to cover the functor

* fix unit tests because of the output of typeid(T).name() is different from linux and windows, test=develop

c75b162a

fix bug of issue #21259 (#21331) · da9752fe

由 Yi Liu 提交于 11月 25, 2019

* fix bug of issue #21259 (#21287)
pass the argument `allow_out_of_range` of one_hot op to c++ back end.

da9752fe

cherry-pick (#21201) to release/1.6 (#21306) · a91b8014

由 liuwei1031 提交于 11月 25, 2019

cudaStreamSynchronize randomly hang when used in multi-thread environment, replace it with cudaStreamQuery API on windows

a91b8014

[cherry-pick] fix crop_tensor, maxout and lrn (#21302) · 3848f720

由 Zhang Ting 提交于 11月 25, 2019

* [cherry-pick] All elements in attr(shape) of crop_tensor can be -1 and int32/64 kernel registered (#20756)

* All elements in attr(shape) of crop_tensor can be -1, test=develop, test=document_preview

* fix the bug that attr(offsets) should be initialized, test=develop

* [cherry-pick] maxout supports channel_last input (#20846)

* maxout support channel_last input, test=develop

* modified details of Input(X) and Attr(groups, axis) in doc, test=develop

* [cherry-pick] lrn supports channel_last input, test=develop (#20954)

3848f720

Add pre-condition check for fuse optimizer op pass (#21005) (#21305) · 9f004548

由 Chen Weihang 提交于 11月 25, 2019

* add pre condition check for fuse optimizer op pass, test=develop

* add log & set init to zero, test=develop

* fix test_fuse_all_reduce_pass failed, test=develop

* polish details, test=develop

* refine PADDLE_ENFORCE & remove needless VLOG, test=develop

* refactor op check method, test=develop

9f004548

24 11月, 2019 2 次提交
- C
  Further simplify the C++ error info stack (#21093) (#21304) · 9110c896
  由 Chen Weihang 提交于 11月 24, 2019
```
* simplify C++ error stack by rewrite Place, test=develop

* polish assignment overload func, test=develop
```
  9110c896
- A
  Fix GELU grad error (#21321) · eaf82528
  由 Adam 提交于 11月 23, 2019
```
test=develop
```
  eaf82528
23 11月, 2019 2 次提交
- K
  
  add mkldnn include. test=develop (#21314) · f9cbe3bd
  由 Kaipeng Deng 提交于 11月 23, 2019
  
  f9cbe3bd
- K
  [cherry-pick] fix elementwise mod (#21315) · 5e35e5ea
  由 Kaipeng Deng 提交于 11月 23, 2019
```
* fix elementwise_mod FP kernel. test=develop

* fix unittest. test=develop
```
  5e35e5ea
21 11月, 2019 2 次提交

Cherry-pick error type support for release1.6 (#21294) · 974b8a83

由 Chen Weihang 提交于 11月 21, 2019

* delete paddle infershape enforce marco (#20832)

* Polish and arrange code in enforce.h (#20901)

* Enrich the type of error and declare the error type interfaces (#21024)

* Enrich the type of error and declare the error type interfaces, test=develop

* adjust tests to adapt new form, test=develop

* add inference deps with error_codes.pb.h, test=develop

* restore stack iter start pos, test=develop

* polish code based review comments, test=develop

* Add dependency for error_codes.proto (#21084)

* fix activation_functions deps, test=develop, test=document_fix

* add error_codes_proto deps, test=develop, test=document_fix

* try delete enforce.h, test=develop, test=document_fix

* change cuda enforce & add example (#21142)
test=release/1.6

974b8a83

[cherry-pick]fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation,... · 7ab85396

由 liym27 提交于 11月 21, 2019

[cherry-pick]fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation, _get_padding_with_SAME and conv2dtranspose_forward_naive. (#20997) (#21225)

* fix bug in pool/conv/conv_transpose:
    1. It should be stride[i] not stride[0] in UpdatePaddingAndDilation;
    2. fix bug of func  _get_padding_with_SAME in test_conv/conv_transpose_op.py;
    3. fix bug of the computation process in function conv2dtranspose_forward_naive.
    test=release/1.6

7ab85396

14 11月, 2019 1 次提交
- T
  fix error message in expand API, and fix two error unit-tests (#21180) · cdb81264
  由 Tao Luo 提交于 11月 14, 2019
```
test=release/1.6
```
  cdb81264
13 11月, 2019 1 次提交
- B
  
  cherry-pick #21059, test=release/1.6 (#21153) · 74ca3ae8
  由 bingyanghuang 提交于 11月 13, 2019
  
  74ca3ae8
11 11月, 2019 1 次提交
- H
  Disable cudnn_conv in Parallel Executor unit tests. (#21083) · e7d5e0ea
  由 Huihuang Zheng 提交于 11月 11, 2019
```
TODO: fix cudnn_conv and re-enable it

test=develop
test=release/1.6
```
  e7d5e0ea
08 11月, 2019 1 次提交
- M
  
  add dlpack to imdb demo, test=release/1.6 (#21068) · 108c763f
  由 mapingshuo 提交于 11月 08, 2019
  
  108c763f
07 11月, 2019 3 次提交

[cherry-pick] Add support for asymetric padding in MKLDNN pool, conv and conv_transpose (#21072) · e8890031

由 Adam 提交于 11月 07, 2019

* Add asymetric padding support for mkldnn pooling
test=develop

* Add asymetric padding support for mkldnn conv
test=develop

* Add asymetric padding support for mkldnn conv_transpose
test=develop

e8890031

[cherry-pick] fix squared_mat_sub_fuse_pass bug when elementwise_op input is... · e6ed6379

由 Wilber 提交于 11月 07, 2019

[cherry-pick] fix squared_mat_sub_fuse_pass bug when elementwise_op input is persistable param test=develop test=release/1.6 (#21044)

fix squared_mat_sub_fuse_pass bug when elementwise_op input is persistable param

e6ed6379

H
fix uniform random (#21009) (#21057) · e112ea2b
由 hong 提交于 11月 07, 2019
```
* fix uniform random; test=develop

* add uniform random test; test=develop
```
e112ea2b

06 11月, 2019 1 次提交
- B
  
  [Cherry-pick] 21028: Remove fuse_with_relu argument from batch_norm constructor (#21049) · 4c56586a
  由 bingyanghuang 提交于 11月 06, 2019
  
  4c56586a
04 11月, 2019 1 次提交
- C
  Add parameter init check add run_startup_progrom error message for fc(mul) (#20906) (#20920) · f504d6f1
  由 Chen Weihang 提交于 11月 04, 2019
```
test=release/1.6
```
  f504d6f1
02 11月, 2019 1 次提交
- 石
  fix infer crashes caused by conv/pool upgrades, test=release/1.6 (#20969) · 53f1e024
  由石晓伟提交于 11月 02, 2019
```
* fix infer crashes caused by conv/pool upgrades, test=release/1.6

* fix bug, test=release/1.6
```
  53f1e024
01 11月, 2019 3 次提交

Z
[cherry-pick] fix the bug of conv_transpose cudnn kernel, test=release/1.6 (#20958) (#20974) · 6f0b2b19
由 Zhang Ting 提交于 11月 01, 2019
```
fix the bug of conv_transpose cudnn kernel：cherry-pick #20958
```
6f0b2b19

CHERRY_PICK: 20955, 20966 (#20968) · 692a04ec

由 Zhaolong Xing 提交于 11月 01, 2019

Paddle-trt inference: filter conv, depthwise_conv, pooling when padding size > 4
fix C++ multicard  inference bug.
test=develop

692a04ec

Cherry pick bug fix for Ops: reshape,concat, split and squeeze (#20929) · 33d7aae1

由 liym27 提交于 11月 01, 2019

* [cherry-pick]fix bug in reshape: (#20781)

consider the situation that shape of input can contain more than one -1.

* [cherry-pick]support Tensor for split and concat, support -1 in num_or_sections, add check num_or_sections (#20780)

* improve split and concat op:
1. support Tensor for argument 'dim' in split op.
2. support Tensor for argument 'axis' in concat op.
* redefine function GetDataFromTensor and set unknown output shape to - 1.
* add check: Attr(sections) match Input(X).
* support Tensor for attr(sections) and attr(sections) can contain -1.
* modify error message and fix bug for concat and call Resize only when necessary.
test=release/1.6

* [cherry-pick]improve unsqueeze op to support int, Tensor for argument axes (#20824)

* improve unsqueeze op to support int, Tensor and Tensor list for argument axes.
* call Resize only when necessary. test=release/1.6

* [cherry-pick]Compatible int32 and int64 for attr in concat/split/unsqueeze. test=release/1.6 (#20912)

33d7aae1

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致