Commits · 8700a7bd908d97f52cdfdfff4cfdc070bc05beb8 · Crayon鑫 / Paddle

11 Jan, 2021 · 1 commit
- just add the op error message for the matmul xpu (#30246) · fee42441
  Committed by wawltor on Jan 11, 2021
```
 add the op error message for the matmul xpu 
```
  fee42441
10 Jan, 2021 · 2 commits
- optimize softmax forward (#30217) · 0a21924a
  Committed by GaoWei8 on Jan 10, 2021
```
* optimize softmax forward
```
  0a21924a
- reduce the occupied size of memory for the fused pattern of elementwise_add... · af80859d
  Committed by wangchaochaohu on Jan 10, 2021
```
reduce the  occupied size  of memory for the fused pattern of elementwise_add Op and activation Op(relu Op for example) (#29885)
```
  af80859d
09 Jan, 2021 · 3 commits

enhance error message, test=develop (#30220) · 5932fee6
Committed by zhang wenhui on Jan 09, 2021

5932fee6

add View(reuse allocation) strategy on squeeze, unsqueeze, reshape, flatten op (#29913) · da16b33f

Committed by pangyoki on Jan 09, 2021

* add view strategy on squeeze,unsqueeze,reshape,flatten

* add squeeze unittest

* add unittests

* use View strategy as name rather than Reuse Allacation

* fix view api doc

* fix format

* use core.ops when input of reshape2 is Tensor

* fix test_cross_entropy_loss error because of reshape2

* delete selected_rows

* change op_function

* little change

* solve HandleViewBetweenInputAndOutput

da16b33f

[oneDNN] Added UT for testing elementwise_mul caching (#30203) · 4aba17b5
Committed by Jacek Czaja on Jan 09, 2021
```
* - Added UT for testing elementwise_mul caching

* lint fixes
```
4aba17b5

08 Jan, 2021 · 10 commits

Support pure fp16 training for AMP API. (#29544) · 7f7dfccf

Committed by Zhen Wang on Jan 08, 2021

* add cast ops before and after unsupported fp16 ops.

* Keep partial net in FP32 pattern.

* Support check_finite_and_unscale and update_loss_scaling for FP16 calculation mode.

* Add fp16 support for adam op.

* add multi precision attr for adam.

* Fix the bug of test_multi_precision_fp16_train UT.

* Code format for CI.

* Fix the redefine error about MPTypeTrait on windows.

* fix bugs of the _create_accumulators func in Momentum.

* fix bug when inserting post cast op.

* Add the update_loss_scaling op in allow_set of UnusedVarCheck.

* Update for ci coverage.

* Add some doc for OptimizerWithMixedPrecision.

* Fix the code style.

* Imporve the doc of `amp_init`.

* Change for fp16 testing if users have the infer program defined in separate way.

7f7dfccf

use cuda generator in bernoulli cuda kernel (#30199) · 789743e1
Committed by Leo Chen on Jan 08, 2021

789743e1

Fix dtype of ungenerated grad var (#28511) · 8696335f

Committed by Leo Chen on Jan 08, 2021

* fix dtype of ungenerated grad var

* update ut

* refine code

* set default dtype

* fix could_use_cudnn bug

* remove debug code

* re-implement

* fix bug

8696335f

shape op support int8 and uint8 tensor (#30201) · 609c0222
Committed by Wilber on Jan 08, 2021

609c0222

fix windows compile when WITH_PYTHON=ON and WITH_TENSORRT=ON (#30194) · 01a287bf
Committed by Wilber on Jan 08, 2021

01a287bf

Add version checking, test=op_version (#30129) · e42e1e80
Committed by ruri on Jan 08, 2021

e42e1e80

Add callback after TensorCopy (#30123) · 1f97d61c

Committed by Leo Chen on Jan 08, 2021

* change to tensor copy sync

* change to tensor copy sync

* make copy_to safe when use TensorCopy

* refine code

* add ut

* add cudapinned garbagecollector

* add testcase: cpu place -> cuda pinned place

1f97d61c

【Paddle.Fleet】Fix tensor table (#30075) · 528e03fc
Committed by Chengmo on Jan 08, 2021
```
* add tensor table
```
528e03fc

disable mkldnn inplace pass on windows (#30164) · ade24494
Committed by Wilber on Jan 08, 2021

ade24494

Fix analysis predictor test (#30191) · 907262ee
Committed by joanna.wozna.intel on Jan 08, 2021
```
* Add a necessary condition

* Remove test for white list and add header
```
907262ee

07 Jan, 2021 · 11 commits
- enhance error message of nll_loss op test=develop (#30125) · 2dc7ee27
  Committed by lijianshe02 on Jan 07, 2021
```
* enhance error message of nll_loss op test=develop
```
  2dc7ee27
- Refine PADDLE_ENFORCE Error Messages. test=develop (#30149) · 54bf3f5a
  Committed by Huihuang Zheng on Jan 07, 2021
```
Improve some error messages in parallel_executor.cc, conditional_block_op.cc, recurrent_op.cc
```
  54bf3f5a
- [Complex] Simplify prepared op impl to improve performance (#30153) · d0fb06b2
  Committed by Chen Weihang on Jan 07, 2021
```
* simplify prepared op impl to improve performance

* fix kunlun compile error

* continue fix kunlun compile error

* only transform diff place when dtype diff

* fix failed unittests

* remove useless file

* polish impl by review comment
```
  d0fb06b2
- Improve Index select cuda kernel (#30139) · c5b415bf
  Committed by 123malin on Jan 07, 2021
```
* test=develop, add index_select_cuda kernel
```
  c5b415bf
- refine the paddle place support using str (#28769) · 7dd551e0
  Committed by wangchaochaohu on Jan 07, 2021
  
  7dd551e0
- Add detailed error message for curandStatus_t, cublasStatus_t, cusolverStatus_t (#30161) · 404c1676
  Committed by WeiXin on Jan 07, 2021
  
  404c1676
- enhance error info for py_func (#30138) · 91a8a257
  Committed by Wilber on Jan 07, 2021
```
* enhance error info for py_func

* update
```
  91a8a257
- [XPU] Remove lite_xpu ut lite_resnet50_test since fusion pass changes... · b8207af6
  Committed by weihaoji on Jan 07, 2021
```
[XPU] Remove lite_xpu ut lite_resnet50_test since fusion pass changes introduced precision diff. test=develop (#30122)
```
  b8207af6
- fix assign_op_xpu concat_op_xpu warining (#30120) · 15fac5e7
  Committed by liuyuhui on Jan 07, 2021
  
  15fac5e7
- fix enforce msg of sum xpu op (#30113) · f5428eca
  Committed by Jack Zhou on Jan 07, 2021
  
  f5428eca
- Add Lookahead and ModelAverage Optimizer (#30004) · 198fbdfb
  Committed by 123malin on Jan 07, 2021
```
* test=develop, add model_average and lookahead
```
  198fbdfb
06 Jan, 2021 · 10 commits
- add dispenable input for core.ops.reshape2/expand/slice (#30072) · adac38c5
  Committed by Leo Chen on Jan 06, 2021
```
* add dispenable input 'shape' for core.ops.reshape2

* add dispenable inputs for core.ops.reshape2/expand/slice

* add ut
```
  adac38c5
- fix error message (#30135) · becf99d2
  Committed by ShenLiang on Jan 06, 2021
  
  becf99d2
- Polish and Optimize the print/repr information of Layer (#29998) · 30888ca3
  Committed by Zhou Wei on Jan 06, 2021
```
* Polish and Optimize the print/repr message of all layer

* fix some code format
```
  30888ca3
- fix unittest failed on windows (#29837) · 9c99d379
  Committed by Zhou Wei on Jan 06, 2021
  
  9c99d379
- fix error message for distribute_fpn_proposals_op (#30116) · 69839f8a
  Committed by wangguanzhong on Jan 06, 2021
  
  69839f8a
- add aarch64 and sunway kunlun lib (#30027) · 8e1c3ddf
  Committed by QingshuChen on Jan 06, 2021
```
* add aarch64 and sunway kunlun lib

* minor

* optimize elementwise_add for kunlun

* update kunlun dependence

* minor

* minor
```
  8e1c3ddf
- add inference api： DisableTensorRtOps (#30109) · 05b27695
  Committed by Shang Zhizhou on Jan 06, 2021
```
* snap

* add inference api: DisableTensorRtOPs

* fix code style

* update api to experimental

* update variable name
```
  05b27695
- fix a bug in op_version_registry, test=develop, test=op_version (#29994) · 53bb1265
  Committed by 石晓伟 on Jan 06, 2021
  
  53bb1265
- Optimize the error message of framework. (#30134) · 3e0c4929
  Committed by xiemoyuan on Jan 06, 2021
  
  3e0c4929
- Fix bug: In dynamic mode, if start or end is negetive, __getitem__ return wrong result(#30003) · 9922bd41
  Committed by liym27 on Jan 06, 2021
```
1. when slice_item is a slice: 
 1) the start of __getitem__ should be std::max(start, 0) if slice
 2) the start of __getitem__ should be std::min(end, dim) 
2. when slice_item is an integer, it should be in [-dim_len, dim_len) 
3. Fix error message to use accurate data
```
  9922bd41
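
The bounds rules listed in the commit message above (#30003) can be illustrated with a small, framework-free sketch. This is not Paddle's implementation: the function names `normalize_slice` and `normalize_index` are hypothetical, item 1.2 is read here as describing the end bound (the message itself says "start"), and the extra clamping of `start` from above and `end` from below is an assumption added so that the result matches plain Python list slicing with step 1.

```python
def normalize_slice(start, end, dim_len):
    """Map possibly negative slice bounds onto [0, dim_len] for one axis.

    Hypothetical helper; assumes step == 1.
    """
    # Negative bounds count from the end of the axis.
    if start < 0:
        start += dim_len
    if end < 0:
        end += dim_len
    # Rule 1.1 from the commit message: start = max(start, 0).
    # The upper clamp on start is an assumption added here.
    start = min(max(start, 0), dim_len)
    # Rule 1.2 (read as the end bound): end = min(end, dim_len).
    # The lower clamp on end is an assumption added here.
    end = max(min(end, dim_len), 0)
    return start, end


def normalize_index(index, dim_len):
    """Rule 2: an integer index must lie in [-dim_len, dim_len)."""
    if not -dim_len <= index < dim_len:
        raise IndexError(
            f"index {index} is out of range for an axis of length {dim_len}"
        )
    return index + dim_len if index < 0 else index


print(normalize_slice(-2, 100, 5))  # (3, 5)
print(normalize_index(-1, 5))       # 4
```

With these helpers, `range(*normalize_slice(start, end, n))` enumerates the same positions as `list(range(n))[start:end]`, which is a convenient way to sanity-check the clamping against Python's own behaviour.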
05 Jan, 2021 · 3 commits
- change the kron gradient when complex types (#29995) · 666e6651
  Committed by chentianyu03 on Jan 05, 2021
  
  666e6651
- add trace op_register_version and fix version bug; test=op_version (#30000) · a5e422c8
  Committed by chentianyu03 on Jan 05, 2021
```
* add trace op_register_version and fix defaulf bug; test=op_version

* add trace op_register_version; test=op_version

* add trace op_register_version; test=op_version

* add trace op_register_version; test=op_version

* fix missing the template bug of vector; test=op_version
```
  a5e422c8
- Fix the formate of raising error in randperm op (#30108) · 9f34374b
  Committed by cc on Jan 05, 2021
```
* fix the formate of raising error in randperm op
```
  9f34374b

Crayon鑫 / Paddle is in sync with its fork source project