提交 · 542a226c3eb68cc8ae522b5de287017d270287e3 · BaiXuePrincess / Paddle

18 6月, 2020 8 次提交
- Z
  add new API: set_global_initializer (#24378) · 542a226c
  由 Zhou Wei 提交于 6月 18, 2020
```
* add new api (set_global_initializer/reset_global_initializer),test=develop

* add new api (set_global_initializer/reset_global_initializer),test=develop

* fix doc and example code of set_global_initializer,test=develop
```
  542a226c
- C
  [Dy2static] Add for iterate or enumerate variable list unittest (#25100) · 509d3ec5
  由 Chen Weihang 提交于 6月 18, 2020
```
* add for iter var list, test=develop

* add enumerate unittest, test=develop
```
  509d3ec5
- L
  
  [Dy2Stat]Remove unnecessary vars from gast.comprehension in LoopTransformer. (#25094) · eb1c0901
  由 liym27 提交于 6月 18, 2020
  
  eb1c0901
- J
  [oneDNN]elementwise_add and elementwise_mul int8 support (#24984) · a7944904
  由 Jacek Czaja 提交于 6月 18, 2020
```
* Start implementing int8 eltwise add

test=develop

* - Fix to Michal PR

* - Fix

test=develop

* - Lint fixes

test=develop

* - Added checking if elementwise_mul can be used

test=develop

* - Added attribs to skip_attrs_set

test=develop

* - Improved broadcasting

test=develop

- fixes to compilation

- fix

- fix

- Lint fixes

test=develop

* - removed redundant condition

test=develop
Co-authored-by: NMichal Gallus <michal.gallus@intel.com>
```
  a7944904
- Z
  fix emb eltwise layernorm (#24873) · 84358115
  由 Zhaolong Xing 提交于 6月 18, 2020
```
test=develop
```
  84358115
- L
  Add relu layer for lenet (#24874) · a01113c3
  由 LielinJiang 提交于 6月 18, 2020
```
* add relu for lenet, test=develop

* fix test model, test=develop
```
  a01113c3
- F
  
  fix dtype error in retinanet_target_assgin example codes. test=develop (#25091) · 3b28629e
  由 FlyingQianMM 提交于 6月 18, 2020
  
  3b28629e
- 石
  
  remove useless test_dot, test=develop (#24957) · 9ab3cf03
  由石晓伟提交于 6月 18, 2020
  
  9ab3cf03
17 6月, 2020 5 次提交

[Dy2Stat] Add test for dygraph seq2seq model. (#25054) · db601f70

由 liym27 提交于 6月 17, 2020

* The arg of append() can be not Tensor temporarily.

* Add Seq2Seq as ProgramTranslator Unit Test. 

* set dtype of vocab_size_tensor to int64 to pass Windows-CI.

db601f70

C

Support conv2d_traspose quantize, test=develop (#25084) · 8fc31d50
由 cc 提交于 6月 17, 2020

8fc31d50
石

fix repeat definitions in liengine.cc, test=develop (#25020) · 6783441e
由石晓伟提交于 6月 17, 2020

6783441e

fix bug of prelu when rank not equal 4, test=develop (#25067) · fa657b3d

由 Leo Chen 提交于 6月 17, 2020

* fix bug of prelu when rank not equal 4, test=develop

* fix prelu inference, test=develop

* fix api, test=develop

* fix shape when mode is chennel, test=develop

* remove debug code, test=develop

* add unittest, test=develop

fa657b3d

[Paddle-TRT] Fixes , opt for SoftmaxKernelWithEltadd kernel, test=develop (#24834) · 479c8834

由 zlsh80826 提交于 6月 17, 2020

* blockReduce opt

* launch threads align to warpSize

* reduce unnecessary shared memory for broadcast reduced value

* vectorize SoftmaxKernelWithEltadd

* add fp16 constrain

* test=develop

479c8834

16 6月, 2020 5 次提交
- H
  Handle Windows flaky test (#25070) · 2c500c30
  由 Huihuang Zheng 提交于 6月 16, 2020
```
As the title
```
  2c500c30
- H
  Monitor Framework (#24079) · 5822862d
  由 hutuxian 提交于 6月 16, 2020
```
* Add a StatValue class in the backend to represent a stat.
* Add a singleton StatRegistry to maintain the collection of stats.
* For the sake of code neatness, we only support type of int and float, which can cover most of the scenarios.
```
  5822862d
- H
  Add test_yolov3 and test_se_resnet Timeout (#25076) · 21138c05
  由 Huihuang Zheng 提交于 6月 16, 2020
```
Some big models can timeout on Windows CPU machine. I added some timeout properties.
```
  21138c05
- T
  
  don't support cmake 3.12, 3.13, 3.14 (#25021) · a73a4a8f
  由 T8T9 提交于 6月 16, 2020
  
  a73a4a8f
- L
  
  fix dtype error of compare op, test=develop (#25059) · 028de857
  由 Leo Chen 提交于 6月 16, 2020
  
  028de857
15 6月, 2020 6 次提交

Y

Fix random fail because of precision problem in unittest of fusion_group (#25051) · 9ed16a43
由 Yiqun Liu 提交于 6月 15, 2020

9ed16a43

bugfix for unique_ptr of IOptimizationProfile (#23917) · bef4afa6

由 Jeng Bai-Cheng 提交于 6月 15, 2020

This commit fixs the compiling bug regarding unique_ptr of IOptimizationProfile.

IOptimizationProfile has protected dtor and is controlled by TensorRT
internally. Application shouldn't delete the pointer of IOptimizationProfile.
See TensorRT document: https://docs.nvidia.com/deeplearning/sdk/tensorrt-api/c_api/classnvinfer1_1_1_i_builder.html#a9ac47e100454151d8206ac91d543299a
test=develop

bef4afa6

Z
[Paddle-TRT] slice kernel optimization (#24783) · 49e4ee27
由 zlsh80826 提交于 6月 15, 2020
```
* parallel move shared data test=develop

* test=develop
```
49e4ee27

update readme of 1.8.2 (#25023) · 1a7fbb73

由 tianshuo78520a 提交于 6月 15, 2020

* update readme of 1.8.2;test=document_fix

* test=develop;test=document_fix

* test=develop;test=document_fix

1a7fbb73

D

update scipy version test=develop (#25007) · 67fb840c
由 Divano 提交于 6月 14, 2020

67fb840c
H
[Dy2stat] Add TSM as ProgramTranslator Unit Test. (#25008) · 9b5b7267
由 Huihuang Zheng 提交于 6月 15, 2020
```
Add TSM as ProgramTranslator Unit Test. The TSM code is referred from PaddlePaddle/models#4229
```
9b5b7267

14 6月, 2020 1 次提交
- T
  fix make device_context error (#25045) · 770c11a1
  由 tianshuo78520a 提交于 6月 14, 2020
```
* test=develop

* test=develop

* fix bug

* test=develop

* test=develop
```
  770c11a1
12 6月, 2020 7 次提交
- L
  
  replace some logging.warn() with warings.warn(), test=develop (#25025) · c7a63908
  由 Leo Chen 提交于 6月 12, 2020
  
  c7a63908
- L
  
  add device attr for regularizer, test=develop (#24981) · ab5a1fb8
  由 lilong12 提交于 6月 12, 2020
  
  ab5a1fb8
- A
  [Dy2stat] Add MobileNet model unittest (#25018) · 0b6145e0
  由 Aurelius84 提交于 6月 12, 2020
```
* add MobileNet unittest test=develop

* fix cudnn random test=develop
```
  0b6145e0
- T
  Fix/sync barrier (#25016) · be6a315f
  由 tangwei12 提交于 6月 12, 2020
```
* fix sync barrier with barrier monitor, test=develop
```
  be6a315f
- C
  
  fix cos_sim, test=develop (#25017) · 8db66fc3
  由 ceci3 提交于 6月 12, 2020
  
  8db66fc3
- L
  Add a tool to collect environments (#24971) · 7a6f4d64
  由 Leo Chen 提交于 6月 12, 2020
```
* add summary_env, test=develop

* update issue template, test=develop

* refine link, test=develop
Co-authored-by: Nroot <root@yq01-gpu-255-129-15-00.epc.baidu.com>
```
  7a6f4d64
- H
  Enable load program state in imperative mode (#24998) · c85c7b22
  由 hong 提交于 6月 12, 2020
```
* enable load_program_state run in imperative mode; test=develop

* remove useless code; test=develop
```
  c85c7b22
11 6月, 2020 2 次提交

[Dy2Static]Convert var.shape stmt and Convert the return variables of... · f16e2778

由 liym27 提交于 6月 11, 2020

[Dy2Static]Convert var.shape stmt and Convert the return variables of Tensor-dependent 'if' staments to Tensor if it not  (#24911)

* Support int and long: int or long -> six.integer_types. 

* Modify test_tensor_shape: fix bug and modify comment. 

* Support convert_var_shape to convert var.shape stmt

* Modify code in ifelse_simple_func.py because don't support return non-Tensor in Tensor-dependent 'if' stament currently. 

* Convert the return variables of Tensor-dependent 'if' staments to Tensor if it not. test=develop

f16e2778

L
Use allow list instead of white list (#25002) · 25a4dac4
由 Leo Chen 提交于 6月 11, 2020
```
* use allow list instead of white list, test=develop

* reduce include, test=develop
```
25a4dac4

10 6月, 2020 6 次提交
- Z
  
  improve performance of instance_norm, test=develop (#25005) · 621b6385
  由 Zhang Ting 提交于 6月 10, 2020
  
  621b6385
- L
  
  decrease the input size for test_transpose_flatten_concat_fuse_pass, test=develop (#24992) · 971ebb26
  由 liu zhengxi 提交于 6月 10, 2020
  
  971ebb26
- H
  support CMatchAuc (#24990) · 1c224e26
  由 hutuxian 提交于 6月 10, 2020
```
Support CMatchAucCalculator based on CMatchRankAucCalculator with a new parameter ignore_rank
```
  1c224e26
- H
  [Dy2stat] Decrease test_yolov3 GPU usage (#24955) · 28d074e9
  由 Huihuang Zheng 提交于 6月 10, 2020
```
[Dy2stat] decrease the batch size to decrease GPU usage.
```
  28d074e9
- Z
  windows publish package scripts (#24851) · ff8ca52f
  由 Zhou Wei 提交于 6月 10, 2020
```
* windows publish package scripts,test=develop

* windows publish package scripts,test=develop

* windows publish package scripts,test=develop
```
  ff8ca52f
- Z
  fix bug in CUDA_NVCC_FALS and CMAKE_CUDA_FLAGS, and eliminate some warning,test=develop (#24982) · 3e04ed22
  由 Zhou Wei 提交于 6月 10, 2020
```
fix bug in CUDA_NVCC_FALS and CMAKE_CUDA_FLAGS
```
  3e04ed22

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致