提交 · 7f7dfccf20347eb9f0600b15a6472c32f1c34c4b · PaddlePaddle / Paddle

08 1月, 2021 17 次提交

Support pure fp16 training for AMP API. (#29544) · 7f7dfccf

由 Zhen Wang 提交于 1月 08, 2021

* add cast ops before and after unsupported fp16 ops.

* Keep partial net in FP32 pattern.

* Support check_finite_and_unscale and update_loss_scaling for FP16 calculation mode.

* Add fp16 support for adam op.

* add multi precision attr for adam.

* Fix the bug of test_multi_precision_fp16_train UT.

* Code format for CI.

* Fix the redefine error about MPTypeTrait on windows.

* fix bugs of the _create_accumulators func in Momentum.

* fix bug when inserting post cast op.

* Add the update_loss_scaling op in allow_set of UnusedVarCheck.

* Update for ci coverage.

* Add some doc for OptimizerWithMixedPrecision.

* Fix the code style.

* Imporve the doc of `amp_init`.

* Change for fp16 testing if users have the infer program defined in separate way.

7f7dfccf

L

use cuda generator in bernoulli cuda kernel (#30199) · 789743e1
由 Leo Chen 提交于 1月 08, 2021

789743e1

Fix dtype of ungenerated grad var (#28511) · 8696335f

由 Leo Chen 提交于 1月 08, 2021

* fix dtype of ungenerated grad var

* update ut

* refine code

* set default dtype

* fix could_use_cudnn bug

* remove debug code

* re-implement

* fix bug

8696335f

A
Skip convert tensor shape while using Paddle.shape (#30223) · 03e07273
由 Aurelius84 提交于 1月 08, 2021
```
* fix tensor shape bug

* fix op_num

* clean code
```
03e07273
L
In creation.assgin, reuse implamention code of layers.tensor.assign to avoid... · 49411a20
由 liym27 提交于 1月 08, 2021
```
In creation.assgin, reuse implamention code of layers.tensor.assign to avoid maintain two code (#30227)
```
49411a20
L

fix pad (#30222) · e03171b7
由 littletomatodonkey 提交于 1月 08, 2021

e03171b7
W

shape op support int8 and uint8 tensor (#30201) · 609c0222
由 Wilber 提交于 1月 08, 2021

609c0222
W

fix windows compile when WITH_PYTHON=ON and WITH_TENSORRT=ON (#30194) · 01a287bf
由 Wilber 提交于 1月 08, 2021

01a287bf
R

Add version checking, test=op_version (#30129) · e42e1e80
由 ruri 提交于 1月 08, 2021

e42e1e80
L

[Dy2Stat] Use Paddle2.0 api paddle.tensor.array_* (#30156) · 31ed9a5e
由 liym27 提交于 1月 08, 2021

31ed9a5e

[Dy2Stat] Don't convert to paddle.shape if var_x.shape is not negetive (#29965) · ad55f609

由 liym27 提交于 1月 08, 2021

1. When x is Variable, call nn.shape(x) only in following cases:
1）The shape of x is used in control flow condition.
2）The dim to be used is negetive
2. When x is Variable, but x.shape or x.shape[idx] doesn't contain negetive value, don't convert to paddle.shape()

ad55f609

Add callback after TensorCopy (#30123) · 1f97d61c

由 Leo Chen 提交于 1月 08, 2021

* change to tensor copy sync

* change to tensor copy sync

* make copy_to safe when use TensorCopy

* refine code

* add ut

* add cudapinned garbagecollector

* add testcase: cpu place -> cuda pinned place

1f97d61c

L
Fix test_slice: avoid unnecessary copying of TensorArray from subblock to parent block(#30168) · b2483d78
由 liym27 提交于 1月 08, 2021
```
In control flow, don't copy TensorArray from subblock to parent block when TensorArray is created in parent block.
```
b2483d78
C
【Paddle.Fleet】Fix tensor table (#30075) · 528e03fc
由 Chengmo 提交于 1月 08, 2021
```
* add tensor table
```
528e03fc
G
Quantization supports 2.0 APIs (#30036) · 1bdf9242
由 guofei 提交于 1月 08, 2021
```
* Quantization supports 2.0 APIs

* Fix the error of save_quantized_model
```
1bdf9242
W

disable mkldnn inplace pass on windows (#30164) · ade24494
由 Wilber 提交于 1月 08, 2021

ade24494
J
Fix analysis predictor test (#30191) · 907262ee
由 joanna.wozna.intel 提交于 1月 08, 2021
```
* Add a necessary condition

* Remove test for white list and add header
```
907262ee

07 1月, 2021 18 次提交
- L
  enhance error message of nll_loss op test=develop (#30125) · 2dc7ee27
  由 lijianshe02 提交于 1月 07, 2021
```
* enhance error message of nll_loss op test=develop
```
  2dc7ee27
- H
  Refine PADDLE_ENFORCE Error Messages. test=develop (#30149) · 54bf3f5a
  由 Huihuang Zheng 提交于 1月 07, 2021
```
Improve some error messages in parallel_executor.cc, conditional_block_op.cc, recurrent_op.cc
```
  54bf3f5a
- C
  [Complex] Simplify prepared op impl to improve performance (#30153) · d0fb06b2
  由 Chen Weihang 提交于 1月 07, 2021
```
* simplify prepared op impl to improve performance

* fix kunlun compile error

* continue fix kunlun compile error

* only transform diff place when dtype diff

* fix failed unittests

* remove useless file

* polish impl by review comment
```
  d0fb06b2
- C
  
  try multi times for sys.exit (#30188) · e5034707
  由 Chen Weihang 提交于 1月 07, 2021
  
  e5034707
- W
  
  fix adamw apply gradient (#30130) · 619c62bb
  由 WangXi 提交于 1月 07, 2021
  
  619c62bb
- T
  
  down openssl (#29958) · 7564d43b
  由 tianshuo78520a 提交于 1月 07, 2021
  
  7564d43b
- L
  
  fix paddle.pow doc, test=document_fix (#30159) · 1ff69f58
  由 LutaoChu 提交于 1月 07, 2021
  
  1ff69f58
- 1
  Improve Index select cuda kernel (#30139) · c5b415bf
  由 123malin 提交于 1月 07, 2021
```
* test=develop, add index_select_cuda kernel
```
  c5b415bf
- W
  
  refine the paddle place support using str (#28769) · 7dd551e0
  由 wangchaochaohu 提交于 1月 07, 2021
  
  7dd551e0
- W
  
  Add detailed error message for curandStatus_t, cublasStatus_t, cusolverStatus_t (#30161) · 404c1676
  由 WeiXin 提交于 1月 07, 2021
  
  404c1676
- W
  enhance error info for py_func (#30138) · 91a8a257
  由 Wilber 提交于 1月 07, 2021
```
* enhance error info for py_func

* update
```
  91a8a257
- Z
  
  open normal unittest on windows (#30167) · 6aa82e03
  由 Zhou Wei 提交于 1月 07, 2021
  
  6aa82e03
- W
  [XPU] Remove lite_xpu ut lite_resnet50_test since fusion pass changes... · b8207af6
  由 weihaoji 提交于 1月 07, 2021
```
[XPU] Remove lite_xpu ut lite_resnet50_test since fusion pass changes introduced precision diff. test=develop (#30122)
```
  b8207af6
- L
  
  fix assign_op_xpu concat_op_xpu warining (#30120) · 15fac5e7
  由 liuyuhui 提交于 1月 07, 2021
  
  15fac5e7
- J
  
  fix enforce msg of sum xpu op (#30113) · f5428eca
  由 Jack Zhou 提交于 1月 07, 2021
  
  f5428eca
- C
  Simplify the options of spawn based on fleetrun (#30144) · 8020e34e
  由 Chen Weihang 提交于 1月 06, 2021
```
* Simplify the options of spawn based on fleetrun

* polish details

* polish doc details
```
  8020e34e
- T
  pre padding in dygraph (#30163) · 4763e6bc
  由 tangwei12 提交于 1月 07, 2021
```
Change-Id: Ia5279b0cbb6a5b3970aff66e9510e0d85efa70ce
```
  4763e6bc
- 1
  Add Lookahead and ModelAverage Optimizer (#30004) · 198fbdfb
  由 123malin 提交于 1月 07, 2021
```
* test=develop, add model_average and lookahead
```
  198fbdfb
06 1月, 2021 5 次提交
- C
  fix syncbn convert (#30158) · 6a19e41f
  由 ceci3 提交于 1月 06, 2021
```
* fix syncbn convet

* add unittest
```
  6a19e41f
- L
  add dispenable input for core.ops.reshape2/expand/slice (#30072) · adac38c5
  由 Leo Chen 提交于 1月 06, 2021
```
* add dispenable input 'shape' for core.ops.reshape2

* add dispenable inputs for core.ops.reshape2/expand/slice

* add ut
```
  adac38c5
- C
  
  update readme test=document_fix (#30154) · 3be65939
  由 Chen Long 提交于 1月 06, 2021
  
  3be65939
- T
  
  fix ubuntu18 openssl error (#30077) · 35fbc484
  由 tianshuo78520a 提交于 1月 06, 2021
  
  35fbc484
- S
  
  fix error message (#30135) · becf99d2
  由 ShenLiang 提交于 1月 06, 2021
  
  becf99d2

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功