Commits · 025053b46f3711981181469714be143c829b6dd7 · Crayon鑫 / Paddle

24 Nov, 2021 10 commits

Adapt auto search (#37490) · 025053b4
Committed by zhaoyingli on Nov 24, 2021
```
* adapt auto search

* adapt auto search

* fix matmulv2 compatible

* del debug
```
Fix op-benchmark CI (#37487) · 5ff1ff5a
Committed by tianshuo78520a on Nov 24, 2021
```
Fix op-benchmark CI
```
[Auto Parallel] Add the unified cluster representation (#37091) · db727551
Committed by Yulong Ao on Nov 24, 2021
```
* [Auto Parallel]  Add the unified cluster representation

* Add the local id for devices

* Add some comments
```
[NewExe] Support HandleComplexGradToRealGrad to cast complex into Real (#37450) · 8b87d5eb
Committed by Aurelius84 on Nov 24, 2021

[PTen] Standardized unittest namespace (#37456) · 1c969d20
Committed by Chen Weihang on Nov 23, 2021
```
* standarded unittest namespace

* fix detail error
```

[Dy2stat]support pure fp16 for dy2stat (#36944) · 52edad6a

Committed by 0x45f on Nov 24, 2021

* run dy2stat pure fp16 in Linear model

* no use self._pure_fp16_inputs

* add test and fix Adam error in dy2stat pure fp16 training

* use paddle.optimizer.Adam

* run test in gpu

* change test time for CI

* enlarge atol for test_resnet_pure_fp16

* refine code and enlarge atol

* make custom_white_list and custom_black_list take effect for AMP and pure fp16

* check tracer is not None

* use default atol

* change filter_size

* change atol and add some NOTE

fix lite with xpu or nnadapter (#37449) · 93aefceb
Committed by zhupengyang on Nov 24, 2021

fix:transform the data from cpu to gpu when trt is used (#37427) · 49366a63
Committed by feng_shuai on Nov 24, 2021

[fleet_executor] Complete compute interceptor (#37485) · be3b7740
Committed by WangXi on Nov 24, 2021

Refactor dygraph to eager -- TensorWrapper, EagerUtils, GlobalUtils (#37466) · 1799c032

Committed by Jiabin Yang on Nov 24, 2021

* Add EagerTensor and tests

* remove useless enforce

* remove comment in cmake

* support autograd meta

* support grad node info test

* support grad_node_info

* add more edge test

* remove Python.h

* add tensor wrapper with tests

* support compute require grad and stop gradient

* support sync methods and global utils

* support pure cpu test

* refine error msg

* refine error msg

* refine error info

* fix npu error

23 Nov, 2021 28 commits
- fix inplace bug when the first grad_var(loss_grad) is inplace var (#37420) · ee1e1642
  Committed by pangyoki on Nov 23, 2021
```
* fix inplace bug

* fix custom grad input error

* add unittest

* fix inplace bug
```
- [XPU] Reorganize xpu device codes in platform, test=develop (#37428) · 79800978
  Committed by Qi Li on Nov 23, 2021
```
* [XPU] Reorganize xpu device codes in platform, test=develop

* fix xpu_header.h, test=develop
```
- Add support bias is none for fused_attention op. (#37411) · 1a8786cf
  Committed by Li Min on Nov 23, 2021
```
Add support for bias is none for fused_attention op.
```
- set feed var skip inplace, test=develop (#37467) · 4812eda5
  Committed by wanghuancoder on Nov 23, 2021
  
- [fleet_executor] Update with collective (#37462) · df14dbf0
  Committed by Yuang Liu on Nov 23, 2021
  
- test=document_fix (#37477) · 38f1ef50
  Committed by tianshuo78520a on Nov 23, 2021
  
- use ShareBufferWith instead of ShareDataWith for ops with view mechanism (#37464) · 81349970
  Committed by Feiyu Chan on Nov 23, 2021
  
- fix problem of dcnv2 trt (#37345) · e91141fb
  Committed by wangxinxin08 on Nov 23, 2021
```
* modify code about fp16 of dcnv2 trt
```
- Removed debug code (#37447) · 586bafbd
  Committed by Zhanlue Yang on Nov 23, 2021
  
- Speedup download uncompress function (#37311) · 467099f0
  Committed by CtfGo on Nov 23, 2021
```
`paddle.utils.download`: change to call `extractall` on tar/zip compressed files to speed up the uncompress process when they include many files

--- result of decompression speed comparison ---
1. dataset: https://paddlenlp.bj.bcebos.com/datasets/cnn_dailymail/cnn_stories.tgz, decompression time: 5m50s vs 20s
2. dataset: https://paddlenlp.bj.bcebos.com/datasets/cnn_dailymail/dailymail_stories.tgz, decompression time: 33m20s vs 47s
```
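The change described in this commit message can be illustrated with a minimal sketch. This is not the actual `paddle.utils.download` code. It assumes the slow path extracted members one at a time by name, which makes `tarfile` look up each member again, whereas a single `extractall` call walks the archive once. The helper names `extract_per_member`, `extract_all`, `make_demo_archive`, and `list_files` are hypothetical and exist only for this illustration.

```python
import os
import tarfile


def extract_per_member(archive_path, dest_dir):
    """Assumed slow pattern: one extract() call per member name."""
    os.makedirs(dest_dir, exist_ok=True)
    with tarfile.open(archive_path) as tar:
        for name in tar.getnames():
            # Each call resolves the name against the member list again.
            tar.extract(name, path=dest_dir)


def extract_all(archive_path, dest_dir):
    """Pattern named in the commit: a single extractall() call."""
    os.makedirs(dest_dir, exist_ok=True)
    with tarfile.open(archive_path) as tar:
        tar.extractall(path=dest_dir)


def make_demo_archive(work_dir, n_files=5):
    """Build a small .tgz with n_files text files under 'stories/'."""
    src = os.path.join(work_dir, "stories")
    os.makedirs(src, exist_ok=True)
    for i in range(n_files):
        with open(os.path.join(src, f"story_{i}.txt"), "w") as f:
            f.write(f"story {i}\n")
    archive_path = os.path.join(work_dir, "stories.tgz")
    with tarfile.open(archive_path, "w:gz") as tar:
        tar.add(src, arcname="stories")
    return archive_path


def list_files(root):
    """Return sorted file paths under root, relative to root."""
    found = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for filename in filenames:
            full = os.path.join(dirpath, filename)
            found.append(os.path.relpath(full, root))
    return sorted(found)
```

Both functions produce the same files on disk; only the number of passes over the archive differs, which is where archives with very many small files would be expected to show the gap reported above.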
- [new-exec] skip compiled program with places > 1 (#37457) · 2dfcdf21
  Committed by Leo Chen on Nov 23, 2021
```
* skip compiled program with places > 1

* fix corner case and add ut
```
- [new-exec] sync scope and variable_scope when init executor (#37445) · 33653195
  Committed by Leo Chen on Nov 23, 2021
```
* sync scope and variable_scope when init executor

* set var_desc for new var
```
- test on_infer off problem in windows (#37366) · 30dbdbaa
  Committed by Sing_chan on Nov 23, 2021
```
* test on_infer off problem in windows

* turn off on infer in windows-ci
```
- Fix PR-CI-Static-Check (#37417) · 56283952
  Committed by tianshuo78520a on Nov 23, 2021
```
Fix PR-CI-Static-Check
```
- fix test_egr_ds_auotgrad_meta compile failed (#37459) · 399ddf99
  Committed by Chen Weihang on Nov 23, 2021
  
- fix ut retry (#37301) · c0443835
  Committed by zhangchunle on Nov 23, 2021
```
* fix ut retry
```
- [Paddle Inference] Fix_nearest: align_corners != true (#37368) · bc150edc
  Committed by Wangzheee on Nov 23, 2021
```
* fix_nearest

* fix_nearest

* fix_nearest

* fix_nearest
```
- fix CMakeLists. test=develop (#37454) · ccad31f5
  Committed by zmx on Nov 23, 2021
  
- modify for wincheck-inference case (#37292) · 3dbabc4d
  Committed by Sing_chan on Nov 23, 2021
```
* for pure fp16

* opt topk

* modify for wincheck-inference case

* modify according to zhouwei's comment

* modify according to zhouwei's comment 2nd time
Co-authored-by: Nzhangkaihuo <zhangkaihuo@baidu.com>
```
- Enhance the error message of scatter op (#37429) · 11b17c88
  Committed by sneaxiy on Nov 23, 2021
```
* enhance scatter err msg check

* fix ci error
```
- [PTen]Elementwise_div Kernel Refactor (#37418) · 32d9beef
  Committed by YuanRisheng on Nov 23, 2021
```
* elementwise_div refactor

* fix compile bugs in windows ci
```
- Refactor dygraph to eager -- Autograd info (#37406) · c5ad3d06
  Committed by Jiabin Yang on Nov 23, 2021
```
* Add EagerTensor and tests

* remove useless enforce

* remove comment in cmake

* support autograd meta

* support grad node info test

* support grad_node_info

* add more edge test

* remove Python.h

* refine error code

* add error type in error msg

* given default null name for tensor
```
- [NPU] Added HCCL backend support in dygraph mode (#36285) · 83e55cff
  Committed by ronnywang on Nov 23, 2021
```
* Added HCCL backend support in dynamic graph mode

* fix segmentation fault

* add ut
```
- Bug fix for snapshotting VariableWrapper with initialized tensor but e… (#37410) · e58ac121
  Committed by Zhanlue Yang on Nov 23, 2021
```
* Bug fix for snapshoting VariableWrapper with initialized tensor but empty allocation

* Added unittest for inplace&clear_gradient
```
- [PTen] Add cast method for Tensor and rename to method to copy_to (#37423) · 90dad8b2
  Committed by Chen Weihang on Nov 22, 2021
```
* rename to api to copy_to

* support cast method for tensor

* fix compile failed
```
- [PTen] Adapt to inference api dir for pten (#37415) · 73f4601d
  Committed by Chen Weihang on Nov 22, 2021
```
* adapt to inference api dir for pten

* fix conflit with develop

* fix test_egr_ds_eager_tensor compile failed
```
- [NewExe] Support layout/dtype transform by adding transfer_layout/transfer_dtype op (#37299) · 2a1f009e
  Committed by Aurelius84 on Nov 23, 2021
```
* Add transfer_layout/dtype op

* clean useless codes

* fix unused var

* add optest in white.txt

* split into data_transfer.cc

* fix cmake

* modify according reviewer comment

* replace cast_op with transfer_dtype_op
```
- add pten allocation places, test=develop (#37369) · 684de4b3
  Committed by 石晓伟 on Nov 23, 2021
  
22 Nov, 2021 2 commits

disable copying of datatype when sharing buffer between two tensors. (#37247) · 9ec1432d

Committed by Feiyu Chan on Nov 22, 2021

* disable copying of datatype when sharing buffer between two tensors.
* fix for mkldnn operator kernels (elementwise_add, sum, softplus, softmax, scale, activation), mannually set the data type when reusing memory by ShareBufferWith.

fix autoconvert (#37347) · 693c3c14
Committed by zhaoyingli on Nov 22, 2021
```
* fix autoconvert

* fix merge parameter
```

Crayon鑫 / Paddle is in sync with the fork's source project