提交 · 7f7dfccf20347eb9f0600b15a6472c32f1c34c4b · PaddlePaddle / Paddle

08 1月, 2021 6 次提交

Support pure fp16 training for AMP API. (#29544) · 7f7dfccf

由 Zhen Wang 提交于 1月 08, 2021

* add cast ops before and after unsupported fp16 ops.

* Keep partial net in FP32 pattern.

* Support check_finite_and_unscale and update_loss_scaling for FP16 calculation mode.

* Add fp16 support for adam op.

* add multi precision attr for adam.

* Fix the bug of test_multi_precision_fp16_train UT.

* Code format for CI.

* Fix the redefine error about MPTypeTrait on windows.

* fix bugs of the _create_accumulators func in Momentum.

* fix bug when inserting post cast op.

* Add the update_loss_scaling op in allow_set of UnusedVarCheck.

* Update for ci coverage.

* Add some doc for OptimizerWithMixedPrecision.

* Fix the code style.

* Imporve the doc of `amp_init`.

* Change for fp16 testing if users have the infer program defined in separate way.

7f7dfccf

L

use cuda generator in bernoulli cuda kernel (#30199) · 789743e1
由 Leo Chen 提交于 1月 08, 2021

789743e1

Fix dtype of ungenerated grad var (#28511) · 8696335f

由 Leo Chen 提交于 1月 08, 2021

* fix dtype of ungenerated grad var

* update ut

* refine code

* set default dtype

* fix could_use_cudnn bug

* remove debug code

* re-implement

* fix bug

8696335f

W

shape op support int8 and uint8 tensor (#30201) · 609c0222
由 Wilber 提交于 1月 08, 2021

609c0222
R

Add version checking, test=op_version (#30129) · e42e1e80
由 ruri 提交于 1月 08, 2021

e42e1e80
C
【Paddle.Fleet】Fix tensor table (#30075) · 528e03fc
由 Chengmo 提交于 1月 08, 2021
```
* add tensor table
```
528e03fc

07 1月, 2021 7 次提交
- L
  enhance error message of nll_loss op test=develop (#30125) · 2dc7ee27
  由 lijianshe02 提交于 1月 07, 2021
```
* enhance error message of nll_loss op test=develop
```
  2dc7ee27
- H
  Refine PADDLE_ENFORCE Error Messages. test=develop (#30149) · 54bf3f5a
  由 Huihuang Zheng 提交于 1月 07, 2021
```
Improve some error messages in parallel_executor.cc, conditional_block_op.cc, recurrent_op.cc
```
  54bf3f5a
- 1
  Improve Index select cuda kernel (#30139) · c5b415bf
  由 123malin 提交于 1月 07, 2021
```
* test=develop, add index_select_cuda kernel
```
  c5b415bf
- W
  
  refine the paddle place support using str (#28769) · 7dd551e0
  由 wangchaochaohu 提交于 1月 07, 2021
  
  7dd551e0
- W
  enhance error info for py_func (#30138) · 91a8a257
  由 Wilber 提交于 1月 07, 2021
```
* enhance error info for py_func

* update
```
  91a8a257
- L
  
  fix assign_op_xpu concat_op_xpu warining (#30120) · 15fac5e7
  由 liuyuhui 提交于 1月 07, 2021
  
  15fac5e7
- J
  
  fix enforce msg of sum xpu op (#30113) · f5428eca
  由 Jack Zhou 提交于 1月 07, 2021
  
  f5428eca
06 1月, 2021 6 次提交
- S
  
  fix error message (#30135) · becf99d2
  由 ShenLiang 提交于 1月 06, 2021
  
  becf99d2
- W
  
  fix error message for distribute_fpn_proposals_op (#30116) · 69839f8a
  由 wangguanzhong 提交于 1月 06, 2021
  
  69839f8a
- Q
  add aarch64 and sunway kunlun lib (#30027) · 8e1c3ddf
  由 QingshuChen 提交于 1月 06, 2021
```
* add aarch64 and sunway kunlun lib

* minor

* optimize elementwise_add for kunlun

* update kunlun dependence

* minor

* minor
```
  8e1c3ddf
- 石
  
  fix a bug in op_version_registry, test=develop, test=op_version (#29994) · 53bb1265
  由石晓伟提交于 1月 06, 2021
  
  53bb1265
- X
  
  Optimize the error message of framework. (#30134) · 3e0c4929
  由 xiemoyuan 提交于 1月 06, 2021
  
  3e0c4929
- L
  Fix bug: In dynamic mode, if start or end is negetive, __getitem__ return wrong result(#30003) · 9922bd41
  由 liym27 提交于 1月 06, 2021
```
1. when slice_item is a slice: 
 1) the start of __getitem__ should be std::max(start, 0) if slice
 2) the start of __getitem__ should be std::min(end, dim) 
2. when slice_item is an integer, it should be in [-dim_len, dim_len) 
3. Fix error message to use accurate data
```
  9922bd41
05 1月, 2021 4 次提交
- C
  
  change the kron gradient when complex types (#29995) · 666e6651
  由 chentianyu03 提交于 1月 05, 2021
  
  666e6651
- C
  add trace op_register_version and fix version bug; test=op_version (#30000) · a5e422c8
  由 chentianyu03 提交于 1月 05, 2021
```
* add trace op_register_version and fix defaulf bug; test=op_version

* add trace op_register_version; test=op_version

* add trace op_register_version; test=op_version

* add trace op_register_version; test=op_version

* fix missing the template bug of vector; test=op_version
```
  a5e422c8
- C
  Fix the formate of raising error in randperm op (#30108) · 9f34374b
  由 cc 提交于 1月 05, 2021
```
* fix the formate of raising error in randperm op
```
  9f34374b
- W
  
  fix the compiler error when gcc4 cuda9.0 (#29997) · d0a56205
  由 wangchaochaohu 提交于 1月 05, 2021
  
  d0a56205
04 1月, 2021 7 次提交
- W
  
  Optimization grad merge performance (#29784) · ee16006b
  由 WangXi 提交于 1月 04, 2021
  
  ee16006b
- Add p_norm op version info (#30042) · e891f4da
  由 myq406450149 提交于 1月 04, 2021
```
* p_norm fix op version info. test=develop
```
  e891f4da
- T
  for inference checkpoint (#30081) · 7d1c149e
  由 tangwei12 提交于 1月 04, 2021
```
* for inference checkpoint

Change-Id: I36c979240ffa55bf1ef0c9315402960762af6be4

* for inference checkpoint

Change-Id: I82025365d5b792cbea1ead506df685aecc8ac198
```
  7d1c149e
- W
  
  Add version checking (#30040) · 1b999d2b
  由 whs 提交于 1月 04, 2021
  
  1b999d2b
- C
  register ModifyAttr for instance_norm, test=op_version (#30065) · 85b2f05a
  由 ceci3 提交于 1月 04, 2021
```
* register instance norm, test=op_version
```
  85b2f05a
- C
  fix op_register_version for compare ops, test=op_version (#30007) · ddcff254
  由 channings 提交于 1月 04, 2021
```
Co-authored-by: Nzhoushunjie <zhoushunjie@baidu.com>
```
  ddcff254
- G
  
  add REGISTER_OP_VERSION for LSTM (#30038) · a6482258
  由 GaoWei8 提交于 1月 04, 2021
  
  a6482258
31 12月, 2020 7 次提交
- Y
  Register op version for linspace,test=op_version (#30025) · 6e93fb92
  由 yinhaofeng 提交于 12月 31, 2020
```
* Register op version for linspace,test=op_version

* Register op version for linspace,test=op_version

* Register op version for linspace,test=op_version

* Register op version for linspace,test=op_version

* Register op version for linspace,test=op_version
```
  6e93fb92
- 1
  test=develop, add op_register_version for roll_op (#30023) · d0056c32
  由 123malin 提交于 12月 31, 2020
```
* test=develop, add op_register_version for roll_op
```
  d0056c32
- C
  complex gradient matmul (#29966) · e012930a
  由 chentianyu03 提交于 12月 31, 2020
```
* dot op support complex types

* matmul support complex types

* add test case

* matmul broadcast gradient support complex

* move conjFunctor to complex_functor.h
```
  e012930a
- S
  Fix rank_attention op_version, test=op_version (#30006) · 893d37e5
  由 ShenLiang 提交于 12月 31, 2020
```
* fix rank_attention, test=op_version
```
  893d37e5
- A
  operator checkpoints for new attributes. (#29832) · 13aef970
  由 Adam Osewski 提交于 12月 31, 2020
```
* Add operator checkpoints for new attributes.

* Fix adding subsequent checkpoint to quantize op.
```
  13aef970
- W
  
  add REGISTER_OP_VERSION for generate_proposals, roi_align, roi_pool test=op_version (#30034) · 844d8e0c
  由 wangguanzhong 提交于 12月 31, 2020
  
  844d8e0c
- C
  Add mkldnn nearest_interp and bilinear_interp op (#30016) · c3c064a8
  由 cc 提交于 12月 31, 2020
```
* Add mkldnn nearest_interp and bilinear_interp op
* don't run mkldnn interpolate in default
* add interpolate_mkldnn_pass
```
  c3c064a8
30 12月, 2020 3 次提交
- C
  
  Revert "register ModifyAttr for instance_norm, test=op_version (#29938)" · c053bf2a
  由 chalsliu 提交于 12月 30, 2020
  
  c053bf2a
- W
  add the support the op version check for matmul, test=op_version (#30011) · cc2f9462
  由 wawltor 提交于 12月 30, 2020
```
* add the support the op version check for matmul, test=op_version
```
  cc2f9462
- W
  add the op version check for the elementwise ops, test=op_version (#30010) · b33aaea8
  由 wawltor 提交于 12月 30, 2020
```
* add the op version check for the elementwise ops, test=op_version

* add the support check for elementwise_ops, test=op_version
```
  b33aaea8

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功