提交 · 1c7ec53682483d3749b71f5d5cc2e61b39903d68 · 机器未来 / Paddle

29 1月, 2021 1 次提交

Cherry pick fix acc sample code bug (#30716) · 1c7ec536

由 Jiaqi Liu 提交于 1月 29, 2021

* Alias from  paddle.fluid.layers.auc to paddle.static.auc (#30206)

* add alias from  fluid.layers.auc to static.auc

* Update __init__.py

* add auc into all list

* alias acc, expose to users

* add auc into 'all' list (#30310)

* add auc into 'all' list

* alias acc, expose to users

* update sample code

* fix paddle.static.acc and auc sample code bug, test=document_fix

1c7ec536

27 1月, 2021 1 次提交
- W
  - Disabling oneDNN inplace pass (#30588) (#30710) · 5d604a6b
  由 Wojciech Uss 提交于 1月 27, 2021
```
Co-authored-by: NJacek Czaja <jacek.czaja@intel.com>
```
  5d604a6b
22 1月, 2021 1 次提交
- P
  
  extend trt ut timeout threshold (#30633) · 02af1a62
  由 Pei Yang 提交于 1月 22, 2021
  
  02af1a62
21 1月, 2021 1 次提交
- Q
  
  fix softmax bug for multi_card in kunlun (#30600) (#30614) · c173887e
  由 QingshuChen 提交于 1月 21, 2021
  
  c173887e
20 1月, 2021 10 次提交
- A
  [cherry-pick]Add tf32 support for A100 tensor core acceleration for cuBLAS (#28732) (#30612) · fd9d6fda
  由 AshburnLee 提交于 1月 20, 2021
```
* Add tf32 support for A100 tensor core acceleration for cuBLAS (#28732)

* Fixed an error

* Fixed an error
```
  fd9d6fda
- G
  Fix the error of save_quantized_model (#30587) · 228c1d7c
  由 guofei 提交于 1月 20, 2021
```
动态图中Conv2D保存成预测模型时，对应的Op可能是conv2d，也可能是depthwise_conv2d，但目前的save_quantized_model接口并未考虑depthwise_conv2d情况，可能会致使out_scale的值保存错误，该PR主要是修复这个问题。
```
  228c1d7c
- Z
  [Cherry-pic]Fix the bug in fleet amp_init. (#30606) (#30608) · 09aed38d
  由 Zhen Wang 提交于 1月 20, 2021
```
* Fix the bug in fleet amp_init.

* Fix the amp_init unit test.
```
  09aed38d
- C
  
  update document of paddle.vision.dataset, test=document (#30415) · 2494562d
  由 cnn 提交于 1月 20, 2021
  
  2494562d
- A
  Add tf32 switch for cuDNN (#29192) (#30574) · 138a71b7
  由 AshburnLee 提交于 1月 20, 2021
```
This PR is cherry-picked from PR: #29192
Function: Added TF32 switch for cuDNN. Turned on as default, turned off when users set the switch as False
```
  138a71b7
- A
  [Dy2static]Fix paddle prefix in is_paddle_api (#30569) (#30594) · 12c51f57
  由 Aurelius84 提交于 1月 20, 2021
```
[Dy2static]Fix paddle prefix in is_paddle_api (#30569)
cherry-pick #30569
```
  12c51f57
- H
  [cherry pick]Add pure fp16 amp_init for fleet API. (#30592) · 3317cf01
  由 huangxu96 提交于 1月 20, 2021
```
* add fleet amp.init()

* add unittest for fleet_amp_init
```
  3317cf01
- W
  
  fix compile error on sw and mips (#30584) · 619869bd
  由 Wilber 提交于 1月 20, 2021
  
  619869bd
- H
  [Cherry-pick]Implemented AddQuantDequantPass in imperative quantization. (#26692) (#30525) · a0e82c2b
  由 huangxu96 提交于 1月 20, 2021
```
* Implemented AddQuantDequantPass in imperative quantization.

* support 2.0 API such as Pool2D and ReLU
```
  a0e82c2b
- Q
  
  update kunlun dependence for aarch64 & sunway platform (#30516) (#30570) · 3688d9e9
  由 QingshuChen 提交于 1月 20, 2021
  
  3688d9e9
19 1月, 2021 20 次提交
- W
  
  fix adamw lr_to_coeff is fixed when dygraph (#30526) (#30559) · 436144e9
  由 WangXi 提交于 1月 19, 2021
  
  436144e9
- W
  [cherry pick]修复save/load相关的两个bug (#30543) · 832032c2
  由 WeiXin 提交于 1月 19, 2021
```
原始PR：#30485，#30507
```
  832032c2
- W
  [cherry pick]if pybind.cc changed, generate total report (#30557) · dbbfbccd
  由 wanghuancoder 提交于 1月 19, 2021
```
* if pybind.cc changed, generate total report
```
  dbbfbccd
- P
  [Cherry-pick] PR 30520. fix error message of Inplace strategy (#30520) (#30568) · 40b3e752
  由 pangyoki 提交于 1月 19, 2021
```
Cherry pick PR #30520 .
Fix error message of Inplace strategy.
```
  40b3e752
- L
  [cherry-pick] support layer_norm fp16 in dygraph amp (#30430) #30566 · 0ea41e62
  由 Leo Chen 提交于 1月 19, 2021
```
[cherry-pick] support layer_norm fp16 in dygraph amp (#30430)
```
  0ea41e62
- Z
  fix bug of multicard grad ncclAllReduce (#30554) · 96058384
  由 Zhou Wei 提交于 1月 19, 2021
```
cherry-pick #30553
fix bug of multicard grad ncclAllReduce, the gradient accumulater of parameters should be keep order, otherwsie, it will influence multicard ncclAllReduce of grad.
```
  96058384
- W
  [cherry pick]perfect 'var_list' of static.load/fluid.load (#30457) (#30479) · 5844dfe4
  由 WeiXin 提交于 1月 19, 2021
```
完善static.load的var_list参数。
当加载的是多个小文件时，Tensor列表可以是所有加载文件中Tensor的子集。
原始PR：#30457
```
  5844dfe4
- L
  [Cherry-Pick] Fix bug: GetAttrValue should deal with attr with attrType vector<double> (#30564) · f15bed11
  由 liym27 提交于 1月 19, 2021
```
cherry-pick #30536
```
  f15bed11
- Z
  [Cherry-pick]Fix the compiling error of update_loss_scaling when using cuda9.(#30538) #30539 · e114f892
  由 Zhen Wang 提交于 1月 19, 2021
```
Fix the compiling error of update_loss_scaling when using cuda9.
```
  e114f892
- Z
  [2.0 API] device guard (#30307) (#30562) · 46322911
  由 Zhang Ting 提交于 1月 19, 2021
```
* add 2.0 API: device_guard
```
  46322911
- H
  
  Ascend Framework Part3: Ascend Parser (#30391) (#30549) · 88c30b75
  由 hutuxian 提交于 1月 19, 2021
  
  88c30b75
- H
  
  Ascend Framework Part1: OP & Wrapper (#30281) (#30546) · 6f563ace
  由 hutuxian 提交于 1月 19, 2021
  
  6f563ace
- H
  
  Ascend Framework Part2: pybind files (#30410) (#30547) · 9b1031f3
  由 hutuxian 提交于 1月 19, 2021
  
  9b1031f3
- T
  【Cherry-Pick】add trainer number for pserver (#30524) · 3bdf1544
  由 tangwei12 提交于 1月 19, 2021
```
* add trainers for pserver

Change-Id: I99c0ab1cc427318f1f9bf8f8f5faff2b8890645d

* add trainers for pserver

Change-Id: I1a75793ec81ce126d07f4c47cae09b95d530bbc8
```
  3bdf1544
- C
  
  Collect weight threshold for lstm op in post_training_quantization, test=develop (#30515) · 42f07437
  由 cc 提交于 1月 19, 2021
  
  42f07437
- T
  Pd2.0 (#30532) · 1323e5e7
  由 taixiurong 提交于 1月 19, 2021
```
* support transformer v2.0

* fix range op crash in dygraph xpu place
```
  1323e5e7
- C
  
  Fix bug of supporting channelwise dygraph quantized model, test=develop (#30531) (#30545) · 4875b972
  由 cc 提交于 1月 19, 2021
  
  4875b972
- L
  
  [Kunlun]PR3: add xpu executor, multi xpu card train function optimization (#30317) (#30535) · 420fdbb2
  由 liuyuhui 提交于 1月 19, 2021
  
  420fdbb2
- J
  
  Recompute Offload: fixed bug in memcpy (#30484) (#30517) · 7a4ccf59
  由 JZ-LIANG 提交于 1月 19, 2021
  
  7a4ccf59
- L
  Update voc dataset url (#30450) (#30505) · 1bd284cd
  由 LielinJiang 提交于 1月 19, 2021
```
* update voc url
```
  1bd284cd
18 1月, 2021 6 次提交
- Z
  [cherry-pick] avoid calling cast twice #30528 · 2967624b
  由 Zhang Ting 提交于 1月 18, 2021
```
 cherry-pick #30527 
```
  2967624b
- L
  fix cache key for inplaced elementwise ops (#30404) (#30478) · c2a4a50e
  由 lidanqing 提交于 1月 18, 2021
```
Co-authored-by: NWojciech Uss <wojciech.uss@intel.com>
```
  c2a4a50e
- G
  [cherry-pick]Modify the calculation logic of LambOptimizer (#29313) (#30510) · b3fa899b
  由 guofei 提交于 1月 18, 2021
```
* Modify the calculation logic of LambOptimizer (#29313)

* Modify the calculation logic of LambOptimizer

* Modify the calculation logic of LambOptimizer

* Modify the calculation logic of LambOptimizer
```
  b3fa899b
- C
  [cherry-pick] add pad and concat double grad #29549 (#30432) · 5e4d54a1
  由 ceci3 提交于 1月 18, 2021
```
* add pad and concat double grad

* resolve conflict
```
  5e4d54a1
- Z
  [cherry-pick] improve perfomance of cast and tril op (#30498) · de003cee
  由 Zhang Ting 提交于 1月 18, 2021
```
* add fp16 support for tril_triu op (#30186)

* add VecCastCUDAKernel (#30296)
Co-authored-by: Nfurnace <34057289+windstamp@users.noreply.github.com>
```
  de003cee
- 1
  test=develop, fix fleet.metric (#30438) (#30473) · 2c3799d1
  由 123malin 提交于 1月 18, 2021
```
* test=develop, fix fleet.metrics(mse, rmse, mae)
```
  2c3799d1

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致