提交 · af80859dd61ab3fe1d91ef6e54451ec01cfd6759 · Crayon鑫 / Paddle

10 1月, 2021 1 次提交
- W
  reduce the occupied size of memory for the fused pattern of elementwise_add... · af80859d
  由 wangchaochaohu 提交于 1月 10, 2021
```
reduce the  occupied size  of memory for the fused pattern of elementwise_add Op and activation Op(relu Op for example) (#29885)
```
  af80859d
09 1月, 2021 3 次提交

Z

enhance error message, test=develop (#30220) · 5932fee6
由 zhang wenhui 提交于 1月 09, 2021

5932fee6

add View(reuse allocation) strategy on squeeze, unsqueeze, reshape, flatten op (#29913) · da16b33f

由 pangyoki 提交于 1月 09, 2021

* add view strategy on squeeze,unsqueeze,reshape,flatten

* add squeeze unittest

* add unittests

* use View strategy as name rather than Reuse Allacation

* fix view api doc

* fix format

* use core.ops when input of reshape2 is Tensor

* fix test_cross_entropy_loss error because of reshape2

* delete selected_rows

* change op_function

* little change

* solve HandleViewBetweenInputAndOutput

da16b33f

J
[oneDNN] Added UT for testing elementwise_mul caching (#30203) · 4aba17b5
由 Jacek Czaja 提交于 1月 09, 2021
```
* - Added UT for testing elementwise_mul caching

* lint fixes
```
4aba17b5

08 1月, 2021 19 次提交
- H
  
  fix windows bug (#29993) · be5c2e60
  由 huangxu96 提交于 1月 08, 2021
  
  be5c2e60
- C
  
  remove distributed prepare context (#30219) · 3016ba85
  由 Chen Weihang 提交于 1月 08, 2021
  
  3016ba85
- Z
  Support pure fp16 training for AMP API. (#29544) · 7f7dfccf
  由 Zhen Wang 提交于 1月 08, 2021
```
* add cast ops before and after unsupported fp16 ops.

* Keep partial net in FP32 pattern.

* Support check_finite_and_unscale and update_loss_scaling for FP16 calculation mode.

* Add fp16 support for adam op.

* add multi precision attr for adam.

* Fix the bug of test_multi_precision_fp16_train UT.

* Code format for CI.

* Fix the redefine error about MPTypeTrait on windows.

* fix bugs of the _create_accumulators func in Momentum.

* fix bug when inserting post cast op.

* Add the update_loss_scaling op in allow_set of UnusedVarCheck.

* Update for ci coverage.

* Add some doc for OptimizerWithMixedPrecision.

* Fix the code style.

* Imporve the doc of `amp_init`.

* Change for fp16 testing if users have the infer program defined in separate way.
```
  7f7dfccf
- L
  
  use cuda generator in bernoulli cuda kernel (#30199) · 789743e1
  由 Leo Chen 提交于 1月 08, 2021
  
  789743e1
- L
  Fix dtype of ungenerated grad var (#28511) · 8696335f
  由 Leo Chen 提交于 1月 08, 2021
```
* fix dtype of ungenerated grad var

* update ut

* refine code

* set default dtype

* fix could_use_cudnn bug

* remove debug code

* re-implement

* fix bug
```
  8696335f
- A
  Skip convert tensor shape while using Paddle.shape (#30223) · 03e07273
  由 Aurelius84 提交于 1月 08, 2021
```
* fix tensor shape bug

* fix op_num

* clean code
```
  03e07273
- L
  In creation.assgin, reuse implamention code of layers.tensor.assign to avoid... · 49411a20
  由 liym27 提交于 1月 08, 2021
```
In creation.assgin, reuse implamention code of layers.tensor.assign to avoid maintain two code (#30227)
```
  49411a20
- L
  
  fix pad (#30222) · e03171b7
  由 littletomatodonkey 提交于 1月 08, 2021
  
  e03171b7
- W
  
  shape op support int8 and uint8 tensor (#30201) · 609c0222
  由 Wilber 提交于 1月 08, 2021
  
  609c0222
- W
  
  fix windows compile when WITH_PYTHON=ON and WITH_TENSORRT=ON (#30194) · 01a287bf
  由 Wilber 提交于 1月 08, 2021
  
  01a287bf
- R
  
  Add version checking, test=op_version (#30129) · e42e1e80
  由 ruri 提交于 1月 08, 2021
  
  e42e1e80
- L
  
  [Dy2Stat] Use Paddle2.0 api paddle.tensor.array_* (#30156) · 31ed9a5e
  由 liym27 提交于 1月 08, 2021
  
  31ed9a5e
- L
  [Dy2Stat] Don't convert to paddle.shape if var_x.shape is not negetive (#29965) · ad55f609
  由 liym27 提交于 1月 08, 2021
```
1. When x is Variable, call nn.shape(x) only in following cases:
 1）The shape of x is used in control flow condition.
 2）The dim to be used is negetive
2. When x is Variable, but x.shape or x.shape[idx] doesn't contain negetive value, don't convert to paddle.shape()
```
  ad55f609
- L
  Add callback after TensorCopy (#30123) · 1f97d61c
  由 Leo Chen 提交于 1月 08, 2021
```
* change to tensor copy sync

* change to tensor copy sync

* make copy_to safe when use TensorCopy

* refine code

* add ut

* add cudapinned garbagecollector

* add testcase: cpu place -> cuda pinned place
```
  1f97d61c
- L
  Fix test_slice: avoid unnecessary copying of TensorArray from subblock to parent block(#30168) · b2483d78
  由 liym27 提交于 1月 08, 2021
```
In control flow, don't copy TensorArray from subblock to parent block when TensorArray is created in parent block.
```
  b2483d78
- C
  【Paddle.Fleet】Fix tensor table (#30075) · 528e03fc
  由 Chengmo 提交于 1月 08, 2021
```
* add tensor table
```
  528e03fc
- G
  Quantization supports 2.0 APIs (#30036) · 1bdf9242
  由 guofei 提交于 1月 08, 2021
```
* Quantization supports 2.0 APIs

* Fix the error of save_quantized_model
```
  1bdf9242
- W
  
  disable mkldnn inplace pass on windows (#30164) · ade24494
  由 Wilber 提交于 1月 08, 2021
  
  ade24494
- J
  Fix analysis predictor test (#30191) · 907262ee
  由 joanna.wozna.intel 提交于 1月 08, 2021
```
* Add a necessary condition

* Remove test for white list and add header
```
  907262ee
07 1月, 2021 17 次提交
- L
  enhance error message of nll_loss op test=develop (#30125) · 2dc7ee27
  由 lijianshe02 提交于 1月 07, 2021
```
* enhance error message of nll_loss op test=develop
```
  2dc7ee27
- H
  Refine PADDLE_ENFORCE Error Messages. test=develop (#30149) · 54bf3f5a
  由 Huihuang Zheng 提交于 1月 07, 2021
```
Improve some error messages in parallel_executor.cc, conditional_block_op.cc, recurrent_op.cc
```
  54bf3f5a
- C
  [Complex] Simplify prepared op impl to improve performance (#30153) · d0fb06b2
  由 Chen Weihang 提交于 1月 07, 2021
```
* simplify prepared op impl to improve performance

* fix kunlun compile error

* continue fix kunlun compile error

* only transform diff place when dtype diff

* fix failed unittests

* remove useless file

* polish impl by review comment
```
  d0fb06b2
- C
  
  try multi times for sys.exit (#30188) · e5034707
  由 Chen Weihang 提交于 1月 07, 2021
  
  e5034707
- W
  
  fix adamw apply gradient (#30130) · 619c62bb
  由 WangXi 提交于 1月 07, 2021
  
  619c62bb
- T
  
  down openssl (#29958) · 7564d43b
  由 tianshuo78520a 提交于 1月 07, 2021
  
  7564d43b
- L
  
  fix paddle.pow doc, test=document_fix (#30159) · 1ff69f58
  由 LutaoChu 提交于 1月 07, 2021
  
  1ff69f58
- 1
  Improve Index select cuda kernel (#30139) · c5b415bf
  由 123malin 提交于 1月 07, 2021
```
* test=develop, add index_select_cuda kernel
```
  c5b415bf
- W
  
  refine the paddle place support using str (#28769) · 7dd551e0
  由 wangchaochaohu 提交于 1月 07, 2021
  
  7dd551e0
- W
  
  Add detailed error message for curandStatus_t, cublasStatus_t, cusolverStatus_t (#30161) · 404c1676
  由 WeiXin 提交于 1月 07, 2021
  
  404c1676
- W
  enhance error info for py_func (#30138) · 91a8a257
  由 Wilber 提交于 1月 07, 2021
```
* enhance error info for py_func

* update
```
  91a8a257
- Z
  
  open normal unittest on windows (#30167) · 6aa82e03
  由 Zhou Wei 提交于 1月 07, 2021
  
  6aa82e03
- W
  [XPU] Remove lite_xpu ut lite_resnet50_test since fusion pass changes... · b8207af6
  由 weihaoji 提交于 1月 07, 2021
```
[XPU] Remove lite_xpu ut lite_resnet50_test since fusion pass changes introduced precision diff. test=develop (#30122)
```
  b8207af6
- L
  
  fix assign_op_xpu concat_op_xpu warining (#30120) · 15fac5e7
  由 liuyuhui 提交于 1月 07, 2021
  
  15fac5e7
- J
  
  fix enforce msg of sum xpu op (#30113) · f5428eca
  由 Jack Zhou 提交于 1月 07, 2021
  
  f5428eca
- C
  Simplify the options of spawn based on fleetrun (#30144) · 8020e34e
  由 Chen Weihang 提交于 1月 06, 2021
```
* Simplify the options of spawn based on fleetrun

* polish details

* polish doc details
```
  8020e34e
- T
  pre padding in dygraph (#30163) · 4763e6bc
  由 tangwei12 提交于 1月 07, 2021
```
Change-Id: Ia5279b0cbb6a5b3970aff66e9510e0d85efa70ce
```
  4763e6bc

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致