提交 · cf786d22ec78aacf04ca25a8fb39f04079703980 · BaiXuePrincess / Paddle

13 1月, 2021 9 次提交
- H
  Decrease Batch Size for Windows CI, test=develop (#30331) · cd5f11b8
  由 Huihuang Zheng 提交于 1月 13, 2021
```
As the title
```
  cd5f11b8
- C
  skip quantizing ops in cpu inference (#30342) · 8e3a2940
  由 cc 提交于 1月 13, 2021
```
* skip quantizing ops in cpu inference, test=develop
```
  8e3a2940
- B
  
  fix quantize error in speical naming model (#30354) · ad6fee2f
  由 Bai Yifan 提交于 1月 13, 2021
  
  ad6fee2f
- H
  
  add amp example document (#30314) · 342d62de
  由 huangxu96 提交于 1月 13, 2021
  
  342d62de
- H
  Decrease Mac Input Size Because of CI Short Memory (#30330) · 017a5348
  由 Huihuang Zheng 提交于 1月 13, 2021
```
As the title
```
  017a5348
- L
  Set expected place in child thread for dataloader to avoid costing cuda memory... · 3d015f1c
  由 Leo Chen 提交于 1月 13, 2021
```
Set expected place in child thread for dataloader to avoid costing cuda memory on other card (#30338)

* set expected place in child thread for dataloader

* set device id when set tensor from numpy

* revert tensor_py change

* add compile guard

* fix ci

* fix bug
```
  3d015f1c
- Q
  optimize memcpy perf for kunlun (#30291) · 2c1bba02
  由 QingshuChen 提交于 1月 13, 2021
```
* optimize memcpy perf for kunlun

* remove useless unitest for kunlun mean

* minor
```
  2c1bba02
- H
  Implemented AddQuantDequantPass in imperative quantization. (#26692) · ee623bff
  由 huangxu96 提交于 1月 13, 2021
```
* Implemented AddQuantDequantPass in imperative quantization.

* Supported LeakyReLU Quantization

* For meeting coverage rate.

* Changed the file name of test of AddQuantDequant

* Implemented more Quantized NoWeightLayers.

* Fix the loss cannot align problem between static and dynamic model quantization, add swish as supported quantized layer in imperative quantization.

* remove noweight_list

* support 2.0 API such as Pool2D and ReLu
```
  ee623bff
- S
  
  Support unused parameters in dynamic graph distributed (#30224) · a60f17b8
  由 ShenLiang 提交于 1月 13, 2021
  
  a60f17b8
12 1月, 2021 9 次提交

J

Recompute Offload (#30233) · 75936d83
由 JZ-LIANG 提交于 1月 12, 2021

75936d83
L

Skip some conv2d_int8 tests in windows (#30128) · a2382986
由 lidanqing 提交于 1月 12, 2021

a2382986

Wojtuss/upgrade one dnn 2.0 (#30295) · fc42faff

由 Wojciech Uss 提交于 1月 12, 2021

* upgrade oneDNN version to 2.0 master branch

* - Added workarounds for new lib onednn change

* fix regex
Co-authored-by: NJacek Czaja <jacek.czaja@intel.com>

fc42faff

add sparse embedding & load vars for 2.0 & gloo bug fix (#30306) · 5e839e4d

由 tangwei12 提交于 1月 12, 2021

* add sparse embedding & load vars for 2.0

Change-Id: I36b59ed5f015189dc9d9d2e34a9357722d369f1b

* fix hdfs gloo

Change-Id: Ia84d579053720ad804183e54c9a04b4f031c79c6

* fix gloo hdfs

Change-Id: I5ab982fd483cddc10adcdef0b8aa83aca976cb9e

* move loadvar/sparse embedding from incubute to static

Change-Id: I57081d3545ad2efab78c72420d2162c0eacaf3a0

5e839e4d

Y
disable test_pipeline (#30204) · da3ab010
由 YUNSHEN XIE 提交于 1月 12, 2021
```
* disable test_pipeline

* fix error
```
da3ab010

fix bug of celoss when using ignore_index and reduction (#30180) · 113810c5

由 chajchaj 提交于 1月 12, 2021

* fix bug of using ignore_index and reduction,test=develop

* fix bug of celoss when using ignore_index and reduction, test=develop

* improve performance when ignore_index=-100, test=develop

* add test in test_cross_entropy_loss.py for coverage rate, test=develop

* rm comment in test_cross_entropy_loss.py, test=develop

* del  hard code of "float64" in python/paddle/nn/functional/loss.py, test=develop

* change mask to a more simplified implementation, test=develop

* del comment in python/paddle/nn/functional/loss.py, test=develop

* del hard code and change mask to a more simplified implementation, test=develop

* change mask to a more simplified implementation, test=develop

* change mask to a more simplified implementation, test=develop

113810c5

fix elugradgrad test fail & error message opt (#30171) · 231501fe

由 Double_V 提交于 1月 12, 2021

* fix elugradgrad test fail and error message opt

* fix unitest,test=develop

* Update prroi_pool_op.h

fix error message

* opt message,test=develop

* fix ci fail,test=develop

231501fe

Z
Fix the accuracy problem of allclose op when using float64 data type in static mode. (#29890) · fb49ea38
由 Zhen Wang 提交于 1月 12, 2021
```
* Fix the accuracy problem of allclose op when using float64 data type in static mode.

* Format the code style.
```
fb49ea38
F

add fp16 support for tril_triu op (#30186) · 77051cc9
由 furnace 提交于 1月 12, 2021

77051cc9

11 1月, 2021 10 次提交
- L
  Support vector<double> as type of op attribute and op set_value suppport... · b4989fb7
  由 liym27 提交于 1月 11, 2021
```
Support vector<double> as type of op attribute and op set_value suppport vector<double> as value (#30126)
```
  b4989fb7
- F
  
  fix empty op unit test fail sometimes (#30225) · c6296b2b
  由 furnace 提交于 1月 11, 2021
  
  c6296b2b
- A
  
  Add tf32 switch for cuDNN (#29192) · 924aac22
  由 AshburnLee 提交于 1月 11, 2021
  
  924aac22
- C
  type promotion for grad (#30177) · c7371b7b
  由 chentianyu03 提交于 1月 11, 2021
```
* type promotion for grad

* add type promotion for div op
```
  c7371b7b
- Y
  disable ut test_tsm on windows (#30017) · 42a6442a
  由 YUNSHEN XIE 提交于 1月 11, 2021
```
* disable ut test_tsm on windows

* fix error

* add ut execuate time
```
  42a6442a
- W
  Fix bug for 'save mutiple method' (#30218) · edafb546
  由 WeiXin 提交于 1月 11, 2021
```
* Fix bug for 'save mutiple method'

* To pass coverage.

* edit code to pass coverage.

* edit code to pass coverage.

* add unittest for coverage.

* change for coverage.

* edit for coverage.
```
  edafb546
- G
  
  Fix unittests bugs. (#30250) · 8700a7bd
  由 gongweibao 提交于 1月 11, 2021
  
  8700a7bd
- B
  
  fix test_pool3d_op timeout issue (#30248) · dd6f5919
  由 Bai Yifan 提交于 1月 11, 2021
  
  dd6f5919
- H
  Add Static Variable Clone (#30208) · c372a763
  由 Huihuang Zheng 提交于 1月 11, 2021
```
Add clone method for static Variable so that this interface will be same as dygraph. It fixed some bugs in dy2stat
```
  c372a763
- X
  clean redundant API alias in 2.0 - part 2 (#30013) · 6bfdef72
  由 XiaoguangHu 提交于 1月 10, 2021
```
* delete paddle.nn.functional.assign

* fix dynamic to static error
```
  6bfdef72
10 1月, 2021 1 次提交
- W
  reduce the occupied size of memory for the fused pattern of elementwise_add... · af80859d
  由 wangchaochaohu 提交于 1月 10, 2021
```
reduce the  occupied size  of memory for the fused pattern of elementwise_add Op and activation Op(relu Op for example) (#29885)
```
  af80859d
09 1月, 2021 1 次提交

add View(reuse allocation) strategy on squeeze, unsqueeze, reshape, flatten op (#29913) · da16b33f

由 pangyoki 提交于 1月 09, 2021

* add view strategy on squeeze,unsqueeze,reshape,flatten

* add squeeze unittest

* add unittests

* use View strategy as name rather than Reuse Allacation

* fix view api doc

* fix format

* use core.ops when input of reshape2 is Tensor

* fix test_cross_entropy_loss error because of reshape2

* delete selected_rows

* change op_function

* little change

* solve HandleViewBetweenInputAndOutput

da16b33f

08 1月, 2021 10 次提交

H

fix windows bug (#29993) · be5c2e60
由 huangxu96 提交于 1月 08, 2021

be5c2e60
C

remove distributed prepare context (#30219) · 3016ba85
由 Chen Weihang 提交于 1月 08, 2021

3016ba85

Support pure fp16 training for AMP API. (#29544) · 7f7dfccf

由 Zhen Wang 提交于 1月 08, 2021

* add cast ops before and after unsupported fp16 ops.

* Keep partial net in FP32 pattern.

* Support check_finite_and_unscale and update_loss_scaling for FP16 calculation mode.

* Add fp16 support for adam op.

* add multi precision attr for adam.

* Fix the bug of test_multi_precision_fp16_train UT.

* Code format for CI.

* Fix the redefine error about MPTypeTrait on windows.

* fix bugs of the _create_accumulators func in Momentum.

* fix bug when inserting post cast op.

* Add the update_loss_scaling op in allow_set of UnusedVarCheck.

* Update for ci coverage.

* Add some doc for OptimizerWithMixedPrecision.

* Fix the code style.

* Imporve the doc of `amp_init`.

* Change for fp16 testing if users have the infer program defined in separate way.

7f7dfccf

Fix dtype of ungenerated grad var (#28511) · 8696335f

由 Leo Chen 提交于 1月 08, 2021

* fix dtype of ungenerated grad var

* update ut

* refine code

* set default dtype

* fix could_use_cudnn bug

* remove debug code

* re-implement

* fix bug

8696335f

A
Skip convert tensor shape while using Paddle.shape (#30223) · 03e07273
由 Aurelius84 提交于 1月 08, 2021
```
* fix tensor shape bug

* fix op_num

* clean code
```
03e07273
L

[Dy2Stat] Use Paddle2.0 api paddle.tensor.array_* (#30156) · 31ed9a5e
由 liym27 提交于 1月 08, 2021

31ed9a5e

[Dy2Stat] Don't convert to paddle.shape if var_x.shape is not negetive (#29965) · ad55f609

由 liym27 提交于 1月 08, 2021

1. When x is Variable, call nn.shape(x) only in following cases:
1）The shape of x is used in control flow condition.
2）The dim to be used is negetive
2. When x is Variable, but x.shape or x.shape[idx] doesn't contain negetive value, don't convert to paddle.shape()

ad55f609

Add callback after TensorCopy (#30123) · 1f97d61c

由 Leo Chen 提交于 1月 08, 2021

* change to tensor copy sync

* change to tensor copy sync

* make copy_to safe when use TensorCopy

* refine code

* add ut

* add cudapinned garbagecollector

* add testcase: cpu place -> cuda pinned place

1f97d61c

L
Fix test_slice: avoid unnecessary copying of TensorArray from subblock to parent block(#30168) · b2483d78
由 liym27 提交于 1月 08, 2021
```
In control flow, don't copy TensorArray from subblock to parent block when TensorArray is created in parent block.
```
b2483d78
C
【Paddle.Fleet】Fix tensor table (#30075) · 528e03fc
由 Chengmo 提交于 1月 08, 2021
```
* add tensor table
```
528e03fc

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致