提交 · be5c2e6050bdc95cf8fc26005a1b6d16f3d33c38 · BaiXuePrincess / Paddle

08 1月, 2021 13 次提交

H

fix windows bug (#29993) · be5c2e60
由 huangxu96 提交于 1月 08, 2021

be5c2e60
C

remove distributed prepare context (#30219) · 3016ba85
由 Chen Weihang 提交于 1月 08, 2021

3016ba85

Support pure fp16 training for AMP API. (#29544) · 7f7dfccf

由 Zhen Wang 提交于 1月 08, 2021

* add cast ops before and after unsupported fp16 ops.

* Keep partial net in FP32 pattern.

* Support check_finite_and_unscale and update_loss_scaling for FP16 calculation mode.

* Add fp16 support for adam op.

* add multi precision attr for adam.

* Fix the bug of test_multi_precision_fp16_train UT.

* Code format for CI.

* Fix the redefine error about MPTypeTrait on windows.

* fix bugs of the _create_accumulators func in Momentum.

* fix bug when inserting post cast op.

* Add the update_loss_scaling op in allow_set of UnusedVarCheck.

* Update for ci coverage.

* Add some doc for OptimizerWithMixedPrecision.

* Fix the code style.

* Imporve the doc of `amp_init`.

* Change for fp16 testing if users have the infer program defined in separate way.

7f7dfccf

Fix dtype of ungenerated grad var (#28511) · 8696335f

由 Leo Chen 提交于 1月 08, 2021

* fix dtype of ungenerated grad var

* update ut

* refine code

* set default dtype

* fix could_use_cudnn bug

* remove debug code

* re-implement

* fix bug

8696335f

A
Skip convert tensor shape while using Paddle.shape (#30223) · 03e07273
由 Aurelius84 提交于 1月 08, 2021
```
* fix tensor shape bug

* fix op_num

* clean code
```
03e07273
L
In creation.assgin, reuse implamention code of layers.tensor.assign to avoid... · 49411a20
由 liym27 提交于 1月 08, 2021
```
In creation.assgin, reuse implamention code of layers.tensor.assign to avoid maintain two code (#30227)
```
49411a20
L

fix pad (#30222) · e03171b7
由 littletomatodonkey 提交于 1月 08, 2021

e03171b7
L

[Dy2Stat] Use Paddle2.0 api paddle.tensor.array_* (#30156) · 31ed9a5e
由 liym27 提交于 1月 08, 2021

31ed9a5e

[Dy2Stat] Don't convert to paddle.shape if var_x.shape is not negetive (#29965) · ad55f609

由 liym27 提交于 1月 08, 2021

1. When x is Variable, call nn.shape(x) only in following cases:
1）The shape of x is used in control flow condition.
2）The dim to be used is negetive
2. When x is Variable, but x.shape or x.shape[idx] doesn't contain negetive value, don't convert to paddle.shape()

ad55f609

Add callback after TensorCopy (#30123) · 1f97d61c

由 Leo Chen 提交于 1月 08, 2021

* change to tensor copy sync

* change to tensor copy sync

* make copy_to safe when use TensorCopy

* refine code

* add ut

* add cudapinned garbagecollector

* add testcase: cpu place -> cuda pinned place

1f97d61c

L
Fix test_slice: avoid unnecessary copying of TensorArray from subblock to parent block(#30168) · b2483d78
由 liym27 提交于 1月 08, 2021
```
In control flow, don't copy TensorArray from subblock to parent block when TensorArray is created in parent block.
```
b2483d78
C
【Paddle.Fleet】Fix tensor table (#30075) · 528e03fc
由 Chengmo 提交于 1月 08, 2021
```
* add tensor table
```
528e03fc
G
Quantization supports 2.0 APIs (#30036) · 1bdf9242
由 guofei 提交于 1月 08, 2021
```
* Quantization supports 2.0 APIs

* Fix the error of save_quantized_model
```
1bdf9242

07 1月, 2021 8 次提交
- C
  [Complex] Simplify prepared op impl to improve performance (#30153) · d0fb06b2
  由 Chen Weihang 提交于 1月 07, 2021
```
* simplify prepared op impl to improve performance

* fix kunlun compile error

* continue fix kunlun compile error

* only transform diff place when dtype diff

* fix failed unittests

* remove useless file

* polish impl by review comment
```
  d0fb06b2
- C
  
  try multi times for sys.exit (#30188) · e5034707
  由 Chen Weihang 提交于 1月 07, 2021
  
  e5034707
- W
  
  fix adamw apply gradient (#30130) · 619c62bb
  由 WangXi 提交于 1月 07, 2021
  
  619c62bb
- L
  
  fix paddle.pow doc, test=document_fix (#30159) · 1ff69f58
  由 LutaoChu 提交于 1月 07, 2021
  
  1ff69f58
- W
  
  refine the paddle place support using str (#28769) · 7dd551e0
  由 wangchaochaohu 提交于 1月 07, 2021
  
  7dd551e0
- C
  Simplify the options of spawn based on fleetrun (#30144) · 8020e34e
  由 Chen Weihang 提交于 1月 06, 2021
```
* Simplify the options of spawn based on fleetrun

* polish details

* polish doc details
```
  8020e34e
- T
  pre padding in dygraph (#30163) · 4763e6bc
  由 tangwei12 提交于 1月 07, 2021
```
Change-Id: Ia5279b0cbb6a5b3970aff66e9510e0d85efa70ce
```
  4763e6bc
- 1
  Add Lookahead and ModelAverage Optimizer (#30004) · 198fbdfb
  由 123malin 提交于 1月 07, 2021
```
* test=develop, add model_average and lookahead
```
  198fbdfb
06 1月, 2021 10 次提交
- C
  fix syncbn convert (#30158) · 6a19e41f
  由 ceci3 提交于 1月 06, 2021
```
* fix syncbn convet

* add unittest
```
  6a19e41f
- L
  add dispenable input for core.ops.reshape2/expand/slice (#30072) · adac38c5
  由 Leo Chen 提交于 1月 06, 2021
```
* add dispenable input 'shape' for core.ops.reshape2

* add dispenable inputs for core.ops.reshape2/expand/slice

* add ut
```
  adac38c5
- Z
  Polish and Optimize the print/repr information of Layer (#29998) · 30888ca3
  由 Zhou Wei 提交于 1月 06, 2021
```
* Polish and Optimize the print/repr message of all layer

* fix some code format
```
  30888ca3
- W
  
  Extend the timeout for the (#30151) · f3a23926
  由 WeiXin 提交于 1月 06, 2021
  
  f3a23926
- Z
  
  fix unittest failed on windows (#29837) · 9c99d379
  由 Zhou Wei 提交于 1月 06, 2021
  
  9c99d379
- L
  Fix bug: In dynamic mode, if start or end is negetive, __getitem__ return wrong result(#30003) · 9922bd41
  由 liym27 提交于 1月 06, 2021
```
1. when slice_item is a slice: 
 1) the start of __getitem__ should be std::max(start, 0) if slice
 2) the start of __getitem__ should be std::min(end, dim) 
2. when slice_item is an integer, it should be in [-dim_len, dim_len) 
3. Fix error message to use accurate data
```
  9922bd41
- G
  
  fix logs info test=develop (#30071) · 4d2a4bb2
  由 gongweibao 提交于 1月 06, 2021
  
  4d2a4bb2
- C
  
  fix bn docs (#30096) · a125d633
  由 ceci3 提交于 1月 06, 2021
  
  a125d633
- C
  add attribute for batch_norm (#29950) · 33424779
  由 ceci3 提交于 1月 06, 2021
```
* add attribute for batch_norm
```
  33424779
- J
  Fix beam search bug (#29824) · 2e8425b6
  由 Jiaqi Liu 提交于 1月 06, 2021
```
* fix beam search bug

* add dygraph unittest

* update dynamic_decode argument doc

* add warning info for state which has no lengths attribute
```
  2e8425b6
05 1月, 2021 8 次提交

Support storage of large parameters (#29988) · f43e1d8c

由 WeiXin 提交于 1月 05, 2021

* Support storage of large parameters

* Reduce the complexity of the unittest

* Reduce the complexity of the unittest,commented out unittest for

* add unittest for static.save/load

* Increase the timeout threshold of 'test_static_save_load'

* Increase the timeout threshold of 'test_static_save_load'

* Increase the timeout threshold of 'test_static_save_load' and 'test_paddle_save_load'

* Increase the timeout threshold of 'test_static_save_load' and 'test_paddle_save_load'

f43e1d8c

C

change the kron gradient when complex types (#29995) · 666e6651
由 chentianyu03 提交于 1月 05, 2021

666e6651
W

[fleet] combine amp and gradient merge, test=develop (#30086) · ab049978
由 WangXi 提交于 1月 05, 2021

ab049978
W

optimize momentum to speedup dygraph, a little, test=develop (#30099) · 88e6dc4a
由 wanghuancoder 提交于 1月 05, 2021

88e6dc4a
T
add topo-aware in heter-ps (#30087) · 0b8e1fad
由 Thunderbrook 提交于 1月 05, 2021
```
* add topo aware

* resource.h

* topo aware

* format
```
0b8e1fad
G

fix selected_gpus test=develop (#30044) · eea7090c
由 gongweibao 提交于 1月 05, 2021

eea7090c

Support dygraph quant model (#29927) · 1fa863da

由 cc 提交于 1月 05, 2021

* Avoid the scale to be infinity in quant2_int8_mkldnn_pass, test=develop
* support quantized model for paddle2.0 dygraph, test=develop

1fa863da

Set FLAGS_selected_gpus for spawn (#29962) · 46c46954

由 Chen Weihang 提交于 1月 04, 2021

* set flags_selectedd_gpus for spawn

* add cond for unittest

* Delete test_no_single_process_using_multi_gpus_in_spawn.py

* Update spawn.py

* Update nccl_context.cc

46c46954

04 1月, 2021 1 次提交
- W
  
  Optimization grad merge performance (#29784) · ee16006b
  由 WangXi 提交于 1月 04, 2021
  
  ee16006b

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致