提交 · c6296b2b0ed55c0d224da870338f106401fe786a · PaddlePaddle / Paddle

11 1月, 2021 6 次提交
- W
  
  register OPMaker and Infer Shape Check for fused_elementwise_add (#30259) · 8dcae0c5
  由 wangchaochaohu 提交于 1月 11, 2021
  
  8dcae0c5
- A
  
  Add tf32 switch for cuDNN (#29192) · 924aac22
  由 AshburnLee 提交于 1月 11, 2021
  
  924aac22
- C
  type promotion for grad (#30177) · c7371b7b
  由 chentianyu03 提交于 1月 11, 2021
```
* type promotion for grad

* add type promotion for div op
```
  c7371b7b
- L
  
  Check the rank of input in kernel of set_value op (#30147) · 3ce878f3
  由 liym27 提交于 1月 11, 2021
  
  3ce878f3
- W
  modify error message based on comments (#30189) · 66dc4ac7
  由 WeiXin 提交于 1月 11, 2021
```
* modify error message based on comments

* edit code according to review.

* Correct spelling according to review.
```
  66dc4ac7
- W
  just add the op error message for the matmul xpu (#30246) · fee42441
  由 wawltor 提交于 1月 11, 2021
```
 add the op error message for the matmul xpu 
```
  fee42441
10 1月, 2021 2 次提交
- G
  optimize softmax forward (#30217) · 0a21924a
  由 GaoWei8 提交于 1月 10, 2021
```
* optimize softmax forward
```
  0a21924a
- W
  reduce the occupied size of memory for the fused pattern of elementwise_add... · af80859d
  由 wangchaochaohu 提交于 1月 10, 2021
```
reduce the  occupied size  of memory for the fused pattern of elementwise_add Op and activation Op(relu Op for example) (#29885)
```
  af80859d
09 1月, 2021 2 次提交
- Z
  
  enhance error message, test=develop (#30220) · 5932fee6
  由 zhang wenhui 提交于 1月 09, 2021
  
  5932fee6
- J
  [oneDNN] Added UT for testing elementwise_mul caching (#30203) · 4aba17b5
  由 Jacek Czaja 提交于 1月 09, 2021
```
* - Added UT for testing elementwise_mul caching

* lint fixes
```
  4aba17b5
08 1月, 2021 6 次提交

Support pure fp16 training for AMP API. (#29544) · 7f7dfccf

由 Zhen Wang 提交于 1月 08, 2021

* add cast ops before and after unsupported fp16 ops.

* Keep partial net in FP32 pattern.

* Support check_finite_and_unscale and update_loss_scaling for FP16 calculation mode.

* Add fp16 support for adam op.

* add multi precision attr for adam.

* Fix the bug of test_multi_precision_fp16_train UT.

* Code format for CI.

* Fix the redefine error about MPTypeTrait on windows.

* fix bugs of the _create_accumulators func in Momentum.

* fix bug when inserting post cast op.

* Add the update_loss_scaling op in allow_set of UnusedVarCheck.

* Update for ci coverage.

* Add some doc for OptimizerWithMixedPrecision.

* Fix the code style.

* Imporve the doc of `amp_init`.

* Change for fp16 testing if users have the infer program defined in separate way.

7f7dfccf

L

use cuda generator in bernoulli cuda kernel (#30199) · 789743e1
由 Leo Chen 提交于 1月 08, 2021

789743e1

Fix dtype of ungenerated grad var (#28511) · 8696335f

由 Leo Chen 提交于 1月 08, 2021

* fix dtype of ungenerated grad var

* update ut

* refine code

* set default dtype

* fix could_use_cudnn bug

* remove debug code

* re-implement

* fix bug

8696335f

W

shape op support int8 and uint8 tensor (#30201) · 609c0222
由 Wilber 提交于 1月 08, 2021

609c0222
R

Add version checking, test=op_version (#30129) · e42e1e80
由 ruri 提交于 1月 08, 2021

e42e1e80
C
【Paddle.Fleet】Fix tensor table (#30075) · 528e03fc
由 Chengmo 提交于 1月 08, 2021
```
* add tensor table
```
528e03fc

07 1月, 2021 7 次提交
- L
  enhance error message of nll_loss op test=develop (#30125) · 2dc7ee27
  由 lijianshe02 提交于 1月 07, 2021
```
* enhance error message of nll_loss op test=develop
```
  2dc7ee27
- H
  Refine PADDLE_ENFORCE Error Messages. test=develop (#30149) · 54bf3f5a
  由 Huihuang Zheng 提交于 1月 07, 2021
```
Improve some error messages in parallel_executor.cc, conditional_block_op.cc, recurrent_op.cc
```
  54bf3f5a
- 1
  Improve Index select cuda kernel (#30139) · c5b415bf
  由 123malin 提交于 1月 07, 2021
```
* test=develop, add index_select_cuda kernel
```
  c5b415bf
- W
  
  refine the paddle place support using str (#28769) · 7dd551e0
  由 wangchaochaohu 提交于 1月 07, 2021
  
  7dd551e0
- W
  enhance error info for py_func (#30138) · 91a8a257
  由 Wilber 提交于 1月 07, 2021
```
* enhance error info for py_func

* update
```
  91a8a257
- L
  
  fix assign_op_xpu concat_op_xpu warining (#30120) · 15fac5e7
  由 liuyuhui 提交于 1月 07, 2021
  
  15fac5e7
- J
  
  fix enforce msg of sum xpu op (#30113) · f5428eca
  由 Jack Zhou 提交于 1月 07, 2021
  
  f5428eca
06 1月, 2021 6 次提交
- S
  
  fix error message (#30135) · becf99d2
  由 ShenLiang 提交于 1月 06, 2021
  
  becf99d2
- W
  
  fix error message for distribute_fpn_proposals_op (#30116) · 69839f8a
  由 wangguanzhong 提交于 1月 06, 2021
  
  69839f8a
- Q
  add aarch64 and sunway kunlun lib (#30027) · 8e1c3ddf
  由 QingshuChen 提交于 1月 06, 2021
```
* add aarch64 and sunway kunlun lib

* minor

* optimize elementwise_add for kunlun

* update kunlun dependence

* minor

* minor
```
  8e1c3ddf
- 石
  
  fix a bug in op_version_registry, test=develop, test=op_version (#29994) · 53bb1265
  由石晓伟提交于 1月 06, 2021
  
  53bb1265
- X
  
  Optimize the error message of framework. (#30134) · 3e0c4929
  由 xiemoyuan 提交于 1月 06, 2021
  
  3e0c4929
- L
  Fix bug: In dynamic mode, if start or end is negetive, __getitem__ return wrong result(#30003) · 9922bd41
  由 liym27 提交于 1月 06, 2021
```
1. when slice_item is a slice: 
 1) the start of __getitem__ should be std::max(start, 0) if slice
 2) the start of __getitem__ should be std::min(end, dim) 
2. when slice_item is an integer, it should be in [-dim_len, dim_len) 
3. Fix error message to use accurate data
```
  9922bd41
05 1月, 2021 4 次提交
- C
  
  change the kron gradient when complex types (#29995) · 666e6651
  由 chentianyu03 提交于 1月 05, 2021
  
  666e6651
- C
  add trace op_register_version and fix version bug; test=op_version (#30000) · a5e422c8
  由 chentianyu03 提交于 1月 05, 2021
```
* add trace op_register_version and fix defaulf bug; test=op_version

* add trace op_register_version; test=op_version

* add trace op_register_version; test=op_version

* add trace op_register_version; test=op_version

* fix missing the template bug of vector; test=op_version
```
  a5e422c8
- C
  Fix the formate of raising error in randperm op (#30108) · 9f34374b
  由 cc 提交于 1月 05, 2021
```
* fix the formate of raising error in randperm op
```
  9f34374b
- W
  
  fix the compiler error when gcc4 cuda9.0 (#29997) · d0a56205
  由 wangchaochaohu 提交于 1月 05, 2021
  
  d0a56205
04 1月, 2021 7 次提交
- W
  
  Optimization grad merge performance (#29784) · ee16006b
  由 WangXi 提交于 1月 04, 2021
  
  ee16006b
- Add p_norm op version info (#30042) · e891f4da
  由 myq406450149 提交于 1月 04, 2021
```
* p_norm fix op version info. test=develop
```
  e891f4da
- T
  for inference checkpoint (#30081) · 7d1c149e
  由 tangwei12 提交于 1月 04, 2021
```
* for inference checkpoint

Change-Id: I36c979240ffa55bf1ef0c9315402960762af6be4

* for inference checkpoint

Change-Id: I82025365d5b792cbea1ead506df685aecc8ac198
```
  7d1c149e
- W
  
  Add version checking (#30040) · 1b999d2b
  由 whs 提交于 1月 04, 2021
  
  1b999d2b
- C
  register ModifyAttr for instance_norm, test=op_version (#30065) · 85b2f05a
  由 ceci3 提交于 1月 04, 2021
```
* register instance norm, test=op_version
```
  85b2f05a
- C
  fix op_register_version for compare ops, test=op_version (#30007) · ddcff254
  由 channings 提交于 1月 04, 2021
```
Co-authored-by: Nzhoushunjie <zhoushunjie@baidu.com>
```
  ddcff254
- G
  
  add REGISTER_OP_VERSION for LSTM (#30038) · a6482258
  由 GaoWei8 提交于 1月 04, 2021
  
  a6482258

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功