Commits · ff4654e216df6f7d19c06d22280713dc0cf7fe0e · PaddlePaddle / Paddle

25 Feb, 2021 1 commit
- L
  refactor npu device manager (#31154) · ff4654e2
  Committed by Leo Chen on Feb 25, 2021
```
refactor npu device manager (#31154)
```
  ff4654e2
23 Feb, 2021 1 commit

[NPU] Support executor with NPU (#31057) · 1435b4c0

Committed by liym27 on Feb 23, 2021

* [NPU] Support executor with NPU

* Fix code according to reviews

* Fix code

* Add unittest for sub op npu

1435b4c0

18 Feb, 2021 1 commit
- X
  support parsing ascend rank table file (#31000) · a6edbc47
  Committed by xiayanming on Feb 18, 2021
```
support parsing ascend rank table file
```
  a6edbc47
25 Jan, 2021 1 commit
- V
  [Feature] Build parser to support distributed training (#30658) · 904cc443
  Committed by Void Main on Jan 25, 2021
```
[Feature] Build parser to support distributed training
```
  904cc443
22 Jan, 2021 2 commits
- G
  cleanup (#30646) · 5b77b259
  Committed by gongweibao on Jan 22, 2021
```
cleanup test_ascend_group.py
```
  5b77b259
- G
  Add startup bash files of test_ascend_group. (#30645) · 7158061a
  Committed by gongweibao on Jan 22, 2021
```
Add startup bash files of test_ascend_group
```
  7158061a
21 Jan, 2021 4 commits
- G
  Add Hccl program group (#30642) · e4287ca6
  Committed by gongweibao on Jan 21, 2021
```
Add Hccl program group
```
  e4287ca6
- G
  Pass device_ids info from launch to trainer. (#30632) · f5aca8fb
  Committed by gongweibao on Jan 21, 2021
```
Pass device_ids info from launch to trainer
```
  f5aca8fb
- V
  Build praser for Hcom* operators (#30627) · d2404da7
  Committed by Void Main on Jan 21, 2021
```
Build praser for Hcom* operators
```
  d2404da7
- G
  Add distribution supported (#30578) · f9c97dd7
  Committed by gongweibao on Jan 21, 2021
```
Add distribution supported
```
  f9c97dd7
15 Jan, 2021 3 commits
- H
  
  Ascend rc (#30483) · 6dd52c5b
  Committed by hutuxian on Jan 15, 2021
  
  6dd52c5b
- W
  
  perfect 'var_list' of static.load/fluid.load (#30457) · e5bb4edb
  Committed by WeiXin on Jan 15, 2021
  
  e5bb4edb
- 1
  test=develop, fix fleet.metric (#30438) · 05f06d9a
  Committed by 123malin on Jan 15, 2021
```
* test=develop, fix fleet.metrics(mse, rmse, mae)
```
  05f06d9a
14 Jan, 2021 5 commits
- T
  
  support transformer v2.0 (#30381) · 6a3c8725
  Committed by taixiurong on Jan 14, 2021
  
  6a3c8725
- Z
  
  Separate AVX and NO_AVX compilation, enhance installation error message (#30413) · c94a4b94
  Committed by Zhou Wei on Jan 14, 2021
  
  c94a4b94
- J
  add auc into 'all' list (#30310) · e395bcd1
  Committed by Jiaqi Liu on Jan 14, 2021
```
* add auc into 'all' list

* alias acc, expose to users

* update sample code
```
  e395bcd1
- 1
  test=develop, add distributed_infer (#30300) · 2a98e932
  Committed by 123malin on Jan 14, 2021
```
* test=develop, add distributed_infer
```
  2a98e932
- C
  
  fix prune input bug (#30384) · ae1f3209
  Committed by Chen Weihang on Jan 13, 2021
  
  ae1f3209
13 Jan, 2021 9 commits
- H
  Decrease Batch Size for Windows CI, test=develop (#30331) · cd5f11b8
  Committed by Huihuang Zheng on Jan 13, 2021
```
As the title
```
  cd5f11b8
- C
  skip quantizing ops in cpu inference (#30342) · 8e3a2940
  Committed by cc on Jan 13, 2021
```
* skip quantizing ops in cpu inference, test=develop
```
  8e3a2940
- B
  
  fix quantize error in speical naming model (#30354) · ad6fee2f
  Committed by Bai Yifan on Jan 13, 2021
  
  ad6fee2f
- H
  
  add amp example document (#30314) · 342d62de
  Committed by huangxu96 on Jan 13, 2021
  
  342d62de
- H
  Decrease Mac Input Size Because of CI Short Memory (#30330) · 017a5348
  Committed by Huihuang Zheng on Jan 13, 2021
```
As the title
```
  017a5348
- L
  Set expected place in child thread for dataloader to avoid costing cuda memory... · 3d015f1c
  Committed by Leo Chen on Jan 13, 2021
```
Set expected place in child thread for dataloader to avoid costing cuda memory on other card (#30338)

* set expected place in child thread for dataloader

* set device id when set tensor from numpy

* revert tensor_py change

* add compile guard

* fix ci

* fix bug
```
  3d015f1c
- Q
  optimize memcpy perf for kunlun (#30291) · 2c1bba02
  Committed by QingshuChen on Jan 13, 2021
```
* optimize memcpy perf for kunlun

* remove useless unitest for kunlun mean

* minor
```
  2c1bba02
- H
  Implemented AddQuantDequantPass in imperative quantization. (#26692) · ee623bff
  Committed by huangxu96 on Jan 13, 2021
```
* Implemented AddQuantDequantPass in imperative quantization.

* Supported LeakyReLU Quantization

* For meeting coverage rate.

* Changed the file name of test of AddQuantDequant

* Implemented more Quantized NoWeightLayers.

* Fix the loss cannot align problem between static and dynamic model quantization, add swish as supported quantized layer in imperative quantization.

* remove noweight_list

* support 2.0 API such as Pool2D and ReLu
```
  ee623bff
- S
  
  Support unused parameters in dynamic graph distributed (#30224) · a60f17b8
  Committed by ShenLiang on Jan 13, 2021
  
  a60f17b8
12 Jan, 2021 9 commits

J

Recompute Offload (#30233) · 75936d83
Committed by JZ-LIANG on Jan 12, 2021

75936d83
L

Skip some conv2d_int8 tests in windows (#30128) · a2382986
Committed by lidanqing on Jan 12, 2021

a2382986

Wojtuss/upgrade one dnn 2.0 (#30295) · fc42faff

Committed by Wojciech Uss on Jan 12, 2021

* upgrade oneDNN version to 2.0 master branch

* - Added workarounds for new lib onednn change

* fix regex

Co-authored-by: Jacek Czaja <jacek.czaja@intel.com>

fc42faff

add sparse embedding & load vars for 2.0 & gloo bug fix (#30306) · 5e839e4d

Committed by tangwei12 on Jan 12, 2021

* add sparse embedding & load vars for 2.0

Change-Id: I36b59ed5f015189dc9d9d2e34a9357722d369f1b

* fix hdfs gloo

Change-Id: Ia84d579053720ad804183e54c9a04b4f031c79c6

* fix gloo hdfs

Change-Id: I5ab982fd483cddc10adcdef0b8aa83aca976cb9e

* move loadvar/sparse embedding from incubute to static

Change-Id: I57081d3545ad2efab78c72420d2162c0eacaf3a0

5e839e4d

Y
disable test_pipeline (#30204) · da3ab010
Committed by YUNSHEN XIE on Jan 12, 2021
```
* disable test_pipeline

* fix error
```
da3ab010

fix bug of celoss when using ignore_index and reduction (#30180) · 113810c5

Committed by chajchaj on Jan 12, 2021

* fix bug of using ignore_index and reduction,test=develop

* fix bug of celoss when using ignore_index and reduction, test=develop

* improve performance when ignore_index=-100, test=develop

* add test in test_cross_entropy_loss.py for coverage rate, test=develop

* rm comment in test_cross_entropy_loss.py, test=develop

* del  hard code of "float64" in python/paddle/nn/functional/loss.py, test=develop

* change mask to a more simplified implementation, test=develop

* del comment in python/paddle/nn/functional/loss.py, test=develop

* del hard code and change mask to a more simplified implementation, test=develop

* change mask to a more simplified implementation, test=develop

* change mask to a more simplified implementation, test=develop

113810c5

fix elugradgrad test fail & error message opt (#30171) · 231501fe

Committed by Double_V on Jan 12, 2021

* fix elugradgrad test fail and error message opt

* fix unitest,test=develop

* Update prroi_pool_op.h

fix error message

* opt message,test=develop

* fix ci fail,test=develop

231501fe

Z
Fix the accuracy problem of allclose op when using float64 data type in static mode. (#29890) · fb49ea38
Committed by Zhen Wang on Jan 12, 2021
```
* Fix the accuracy problem of allclose op when using float64 data type in static mode.

* Format the code style.
```
fb49ea38
F

add fp16 support for tril_triu op (#30186) · 77051cc9
Committed by furnace on Jan 12, 2021

77051cc9

11 Jan, 2021 4 commits
- L
  Support vector<double> as type of op attribute and op set_value suppport... · b4989fb7
  Committed by liym27 on Jan 11, 2021
```
Support vector<double> as type of op attribute and op set_value suppport vector<double> as value (#30126)
```
  b4989fb7
- F
  
  fix empty op unit test fail sometimes (#30225) · c6296b2b
  Committed by furnace on Jan 11, 2021
  
  c6296b2b
- A
  
  Add tf32 switch for cuDNN (#29192) · 924aac22
  Committed by AshburnLee on Jan 11, 2021
  
  924aac22
- C
  type promotion for grad (#30177) · c7371b7b
  Committed by chentianyu03 on Jan 11, 2021
```
* type promotion for grad

* add type promotion for div op
```
  c7371b7b

PaddlePaddle / Paddle synced successfully more than 1 year ago
