Commits · a19154ca403da27bf8774c9a7aac93b09cd16f21 · Crayon鑫 / Paddle

23 Feb, 2021 · 8 commits
- [CustomOp] New custom operator extension mechanism in 2.0.1 (#31097) · a19154ca
  Committed by Chen Weihang on Feb 23, 2021
```
[CustomOp] New custom operator extension mechanism in 2.0.1

Cherry-pick New custom operator basic implementation related PRs
```
  a19154ca
- add trt transpose and flatten converter (#31022) (#31139) · 20e68a22
  Committed by Pei Yang on Feb 23, 2021
  
  20e68a22
- [cherry-pick] Fix softmax cross entropy integer overflow. (#30590) (#31134) · 30a2e7f0
  Committed by Zhong Hui on Feb 23, 2021
```
[BUG FIX] Fix softmax cross entropy overflow problem.
```
  30a2e7f0
- [cherry-pick 2.0.1] [kunlun] fix xpu bind threaded executor (#31116) · 29467060
  Committed by WangXi on Feb 23, 2021
```
* [Kunlun] Add condition_variable and notify() in BindThreadedSSAGraphExecutor (#30586)

* [Kunlun] fix dead lock for exec_op_count_ (#30718)

* Fix the problem that the number of ops executed by xpu is wrong (#30961)
Co-authored-by: liuyuhui <liuyuhui@baidu.com>
```
  29467060
- [Cherry-pick] fix ELU output for nan, test=develop (#31135) · b582be2d
  Committed by Qi Li on Feb 23, 2021
```
ATT, cherry pick of #31132
```
  b582be2d
- A fix for oneDNN matmul kernel. Fixes issue #30309 for oneDNN 1.6 (#31066) · f5007051
  Committed by Wojciech Uss on Feb 22, 2021
```
* A fix for oneDNN matmul kernel. Fixes issue #30309 (#30723)

* A fix for #30309 with oneDNN 1.6
```
  f5007051
- test=develop, save/load, shrink (#30625) (#31107) · 36710ebc
  Committed by tangwei12 on Feb 23, 2021
```
* test=develop, save/load, shrink
Co-authored-by: seiriosPlus <tangwei12@baidu.com>
Co-authored-by: 123malin <malin10@baidu.com>
```
  36710ebc
- update merge pr #31060（update trt int8 calibrator to IEntropyCalibratorV2） (#31121) · 1d2bd35e
  Committed by Shang Zhizhou on Feb 23, 2021
  
  1d2bd35e
22 Feb, 2021 · 2 commits
- [Cherry-pick]add offset parameter in roi_align,generate_proposals.etc ops (#31030) · 97dbf281
  Committed by Guanghua Yu on Feb 22, 2021
```
* add  parameter in roi_align op

* fix compatibility of ops

* fix op test & cpu kernel

* fix JaccardOverlap in nms
```
  97dbf281
- update paddle_fluid.so to paddle_inference.so (#30850) (#31076) · 6ec5f0fb
  Committed by Wilber on Feb 22, 2021
  
  6ec5f0fb
20 Feb, 2021 · 1 commit

bug fix of xpu lite engine, test=develop (#30918) (#31046) · fa0c0fb2

Committed by 石晓伟 on Feb 20, 2021

* bug fix of xpu lite engine, test=develop

* xpu zero copy tensor, test=develop

* revert paddle/fluid/inference/tests/api/CMakeLists.txt

fa0c0fb2

19 Feb, 2021 · 1 commit
- cherry-pick pr (#31043) · 656124da
  Committed by Wilber on Feb 19, 2021
  
  656124da
18 Feb, 2021 · 1 commit
- [cherry-pick][oneDNN]Extended adaptive pooling support for oneDNN pool kernel (#30993) · 25ee1a73
  Committed by Jacek Czaja on Feb 18, 2021
  
  25ee1a73
10 Feb, 2021 · 2 commits
- [cherry-pick] Solve inconsistent order in each card in dynamic graph (#30965) · 0175f566
  Committed by ShenLiang on Feb 10, 2021
```
* support if else control

* fix conflict
```
  0175f566
- Fix python3 incompatibility issues (#30698) (#30969) · aaaae6b4
  Committed by lidanqing on Feb 10, 2021
  
  aaaae6b4
09 Feb, 2021 · 1 commit

【Cherry-pick】Fix Parameter Server Bug (#30860) · 94cb210b

Committed by Chengmo on Feb 09, 2021

* 【Paddle.Fleet】Fix brpc get hostname (#30703)

* fix Brpc get hostname

* fix int64 bug (#30780)

fix push sparse int64 bug

94cb210b

07 Feb, 2021 · 1 commit
- 【cherry-pick2.0】Polish and Optimize the print/repr information of Layer (#29998) (#30893) · 7780badb
  Committed by Zhou Wei on Feb 07, 2021
```
cherry-pick #29998
* Polish and Optimize the print/repr message of all layer
* fix some code format
```
  7780badb
05 Feb, 2021 · 2 commits
- [cherry-pick ]make abs support complex types (#30889) · 963d54d1
  Committed by chentianyu03 on Feb 05, 2021
```
make abs support complex types

cherry-pick:
#30375
#30637
```
  963d54d1
- fix trt plugin clone and initialize bugs in TRT7.1+ (#30709) (#30822) · a64bea0c
  Committed by Shang Zhizhou on Feb 05, 2021
```
Co-authored-by: tianshuo78520a <707759223@qq.com>
```
  a64bea0c
04 Feb, 2021 · 1 commit
- support xpu with analysis predictor, test=develop (#30832) (#30863) · d199edd8
  Committed by 石晓伟 on Feb 04, 2021
  
  d199edd8
03 Feb, 2021 · 1 commit
- disable memory leak pass in cudnn8 (#30838) · d1ae7b98
  Committed by Wilber on Feb 03, 2021
  
  d1ae7b98
02 Feb, 2021 · 2 commits

Conv bn fuse fix (#30830) · b4be9717

Committed by alncat on Feb 02, 2021

* fixed compilation error on gcc 4.8.x due to the usage of isfinite (#30733)

* modified conv+bn fuse pass to fix wrong mask in mask rcnn (#30704)

b4be9717

add DLA support：C++&&Python api (#30165) (#30810) · a8dfff99

Committed by Shang Zhizhou on Feb 02, 2021

* add dla

* add python api
Co-authored-by: shangzhizhou <root@szth-rp-fanyi-opera49.szth.baidu.com>
Co-authored-by: shangzhizhou <root@szth-rp-fanyi-opera49.szth.baidu.com>

a8dfff99

27 Jan, 2021 · 1 commit
- Disabling oneDNN inplace pass (#30588) (#30710) · 5d604a6b
  Committed by Wojciech Uss on Jan 27, 2021
```
Co-authored-by: Jacek Czaja <jacek.czaja@intel.com>
```
  5d604a6b
22 Jan, 2021 · 1 commit
- extend trt ut timeout threshold (#30633) · 02af1a62
  Committed by Pei Yang on Jan 22, 2021
  
  02af1a62
21 Jan, 2021 · 1 commit
- fix softmax bug for multi_card in kunlun (#30600) (#30614) · c173887e
  Committed by QingshuChen on Jan 21, 2021
  
  c173887e
20 Jan, 2021 · 3 commits
- [cherry-pick]Add tf32 support for A100 tensor core acceleration for cuBLAS (#28732) (#30612) · fd9d6fda
  Committed by AshburnLee on Jan 20, 2021
```
* Add tf32 support for A100 tensor core acceleration for cuBLAS (#28732)

* Fixed an error

* Fixed an error
```
  fd9d6fda
- Add tf32 switch for cuDNN (#29192) (#30574) · 138a71b7
  Committed by AshburnLee on Jan 20, 2021
```
This PR is cherry-picked from PR: #29192
Function: Added a TF32 switch for cuDNN. It is on by default and is turned off when users set the switch to False.
```
  138a71b7
- fix compile error on sw and mips (#30584) · 619869bd
  Committed by Wilber on Jan 20, 2021
  
  619869bd
19 Jan, 2021 · 11 commits
- [Cherry-pick] PR 30520. fix error message of Inplace strategy (#30520) (#30568) · 40b3e752
  Committed by pangyoki on Jan 19, 2021
```
Cherry pick PR #30520 .
Fix error message of Inplace strategy.
```
  40b3e752
- [cherry-pick] support layer_norm fp16 in dygraph amp (#30430) #30566 · 0ea41e62
  Committed by Leo Chen on Jan 19, 2021
```
[cherry-pick] support layer_norm fp16 in dygraph amp (#30430)
```
  0ea41e62
- fix bug of multicard grad ncclAllReduce (#30554) · 96058384
  Committed by Zhou Wei on Jan 19, 2021
```
cherry-pick #30553
fix bug of multicard grad ncclAllReduce: the gradient accumulators of parameters should keep their order; otherwise, it will affect the multicard ncclAllReduce of grad.
```
  96058384
- [Cherry-Pick] Fix bug: GetAttrValue should deal with attr with attrType vector<double> (#30564) · f15bed11
  Committed by liym27 on Jan 19, 2021
```
cherry-pick #30536
```
  f15bed11
- [Cherry-pick]Fix the compiling error of update_loss_scaling when using cuda9.(#30538) #30539 · e114f892
  Committed by Zhen Wang on Jan 19, 2021
```
Fix the compiling error of update_loss_scaling when using cuda9.
```
  e114f892
- Ascend Framework Part1: OP & Wrapper (#30281) (#30546) · 6f563ace
  Committed by hutuxian on Jan 19, 2021
  
  6f563ace
- Ascend Framework Part2: pybind files (#30410) (#30547) · 9b1031f3
  Committed by hutuxian on Jan 19, 2021
  
  9b1031f3
- 【Cherry-Pick】add trainer number for pserver (#30524) · 3bdf1544
  Committed by tangwei12 on Jan 19, 2021
```
* add trainers for pserver

Change-Id: I99c0ab1cc427318f1f9bf8f8f5faff2b8890645d

* add trainers for pserver

Change-Id: I1a75793ec81ce126d07f4c47cae09b95d530bbc8
```
  3bdf1544
- Pd2.0 (#30532) · 1323e5e7
  Committed by taixiurong on Jan 19, 2021
```
* support transformer v2.0

* fix range op crash in dygraph xpu place
```
  1323e5e7
- [Kunlun]PR3: add xpu executor, multi xpu card train function optimization (#30317) (#30535) · 420fdbb2
  Committed by liuyuhui on Jan 19, 2021
  
  420fdbb2
- Recompute Offload: fixed bug in memcpy (#30484) (#30517) · 7a4ccf59
  Committed by JZ-LIANG on Jan 19, 2021
  
  7a4ccf59

Crayon鑫 / Paddle is in sync with the fork source project