1. 04 Nov 2020, 2 commits
    • Add broadcast_shape api (#28257) · 8b2436a7
      Committed by Leo Chen
      * add broadcast_shape api
      
      * add ut
      
      * follow comments
      
      * add example code, test=document_fix
      
      * update example code, test=document_fix
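      The commit title only names the new API, so here is a minimal usage sketch. It assumes paddle.broadcast_shape(x_shape, y_shape) takes two shapes as lists of ints and returns the NumPy-style broadcast result shape, matching the API added in #28257; the concrete shapes below are illustrative only.

      ```python
      import paddle

      # broadcast_shape computes the shape that results from broadcasting
      # two tensor shapes together, following NumPy-style broadcasting rules.
      # Illustrative shapes; any broadcast-compatible pair behaves the same.
      out_shape = paddle.broadcast_shape([2, 1, 3], [1, 4, 1])
      print(out_shape)  # [2, 4, 3]
      ```

      Note that the function operates on shapes alone, so it can check broadcast compatibility without materializing any tensors.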
    • enhance the op_version_registry, test=develop (#28347) · 21a63f6f
      Committed by 石晓伟
      * enhance the op_version_registry, test=develop
      
      * add unittests, test=develop
      
      * enhance the op_version_registry, test=develop
      
      * fix bugs, test=develop
      
      * revert pybind_boost_headers.h, test=develop
      
      * fix a attribute bug, test=develop
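      The commit messages above are terse, so as rough orientation here is a toy model, in plain Python, of what an operator version registry provides. This is a conceptual sketch only, not Paddle's actual C++ op_version_registry API: each op type accumulates checkpoints describing incompatible changes, and the checkpoint count serves as the op's current version.

      ```python
      # Toy sketch of the op-version-registry idea (NOT Paddle's real API):
      # every incompatible change to an op is recorded as a checkpoint, so a
      # model saved with an older op version can be detected and upgraded.
      class ToyOpVersionRegistry:
          def __init__(self):
              self._checkpoints = {}  # op_type -> list of change notes

          def add_checkpoint(self, op_type, note):
              # Record one incompatible change for op_type.
              self._checkpoints.setdefault(op_type, []).append(note)
              return self  # allow chained registration calls

          def version(self, op_type):
              # An op's version is the number of recorded checkpoints.
              return len(self._checkpoints.get(op_type, []))


      registry = ToyOpVersionRegistry()
      registry.add_checkpoint("my_op", "added attribute 'use_new_path'")
      assert registry.version("my_op") == 1
      ```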
  2. 03 Nov 2020, 4 commits
  3. 02 Nov 2020, 4 commits
  4. 30 Oct 2020, 4 commits
  5. 29 Oct 2020, 3 commits
  6. 28 Oct 2020, 8 commits
  7. 27 Oct 2020, 5 commits
  8. 26 Oct 2020, 9 commits
  9. 23 Oct 2020, 1 commit
    • Fix test_parallel_executor_test_while_train Random Failure by Decreasing GPU Usage (#28213) · a1e7fd4a
      Committed by Huihuang Zheng
      Recently, test_parallel_executor_test_while_train has been failing randomly on CI. All of the CI logs show that NCCL initialization or cusolver initialization failed. I found reports online that these failures are usually caused by a shortage of GPU memory. Those libraries call CUDA APIs directly, so the problem should not be in the allocator; something elsewhere in PaddlePaddle may be increasing GPU usage.
      
      However, I ran this test 1000 times on both my machine and the CI machine, and neither could reproduce the random failure. There may be something specific to the test environment that triggers it.
      
      To verify my assumption that something in PaddlePaddle increases GPU usage, and also to fix this CI failure, I decreased the batch_size to see whether the random failure disappears in the test environment.
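      The actual change is not shown in this log, but as a minimal illustration of the idea, the sketch below (hypothetical program, names, and sizes; it runs on CPU for portability, whereas the real test targets GPU) shows the kind of static-graph training step whose peak memory scales with BATCH_SIZE, so shrinking that constant leaves GPU headroom for NCCL/cusolver initialization on a shared CI machine.

      ```python
      import numpy as np
      import paddle

      paddle.enable_static()  # the 2020-era test uses the static graph API

      BATCH_SIZE = 16  # hypothetical reduced value; larger batches need more memory

      main_prog = paddle.static.Program()
      startup_prog = paddle.static.Program()
      with paddle.static.program_guard(main_prog, startup_prog):
          x = paddle.static.data(name="x", shape=[None, 784], dtype="float32")
          hidden = paddle.static.nn.fc(x, size=10)
          loss = paddle.mean(hidden)
          paddle.optimizer.SGD(learning_rate=0.01).minimize(loss)

      exe = paddle.static.Executor(paddle.CPUPlace())
      exe.run(startup_prog)

      # Activations and gradients scale with the batch dimension, so this
      # feed is where a smaller BATCH_SIZE directly lowers peak memory usage.
      feed = {"x": np.random.rand(BATCH_SIZE, 784).astype("float32")}
      exe.run(main_prog, feed=feed, fetch_list=[loss])
      ```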