Commits · cc47c83caecccd2b660991bfaa09552017cbc0bf · PaddlePaddle / Paddle

01 Dec, 2021 2 commits
- J
  fix fc_fuse pass (#37694) · cc47c83c
  Committed by Jason on Dec 01, 2021
```
* fix fc_fuse

* modify cmake notest,test=windows_ci

* retrigger all the ci
```
  cc47c83c
- H
  Modify ShareTensorWithCinnBuffer by callback to save memory (#37493) · 661dbdbe
  Committed by Huihuang Zheng on Dec 01, 2021
```
Modify ShareTensorWithCinnBuffer by callback to save memory
```
  661dbdbe
30 Nov, 2021 2 commits
- Z
  
  pscore global shuffle&default accessor config (#37626) · 1514eec6
  Committed by zhaocaibei123 on Nov 30, 2021
  
  1514eec6
- X
  Fix test calc gradient (#37672) · a0631364
  Committed by xiongkun on Nov 30, 2021
```
* add scope_guard

* 1. fix control flow cases 2. fix calc_gradient
```
  a0631364
29 Nov, 2021 8 commits
- Z
  
  Refactored eager legacy namespace (#37659) · 74fdba7c
  Committed by Zhanlue Yang on Nov 29, 2021
  
  74fdba7c
- W
  
  continue if transform not support dtype, test=develop (#37661) · 1b00fc48
  Committed by wanghuancoder on Nov 29, 2021
  
  1b00fc48
- W
  Support fetch lodtensor array (#37580) · a0678eb1
  Committed by wanghuancoder on Nov 29, 2021
```
* suport fetch lodtensor array, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop
```
  a0678eb1
- T
  [HeterPs] fix allocation (#37476) · 27a5f52b
  Committed by Thunderbrook on Nov 29, 2021
```
* auc temp

* cuballocator

* code format

* code format
```
  27a5f52b
- A
  
  [NPU] fix compile (#37648) · b6307742
  Committed by Aganlengzi on Nov 29, 2021
  
  b6307742
- X
  
  delete gloo connect retry (#37616) · 41564184
  Committed by xiaoxiao-luomu on Nov 29, 2021
  
  41564184
- Z
  
  Enabled AutoCodeGen for Eager Dygraph (#37639) · d50ae7ec
  Committed by Zhanlue Yang on Nov 29, 2021
  
  d50ae7ec
- W
  
  [ut] Update skip concept to ignore. (#37635) · ae544242
  Committed by Wilber on Nov 29, 2021
  
  ae544242
27 Nov, 2021 2 commits

J

fix save inference model conditional op (#37579) · fd41456f
Committed by JingZhuangzhuang on Nov 27, 2021

fd41456f

[NPU] reorganization for device API abstraction (#37110) · 72241a6a

Committed by Aganlengzi on Nov 27, 2021

* [NPU] reorganization for device API abstraction

* [NPU] delete old files

* [NPU] fix npu_collective_helper

* [NPU] fix collective_helper

* [NPU] fix ut

* [NPU] mod memory allocation and hccl_helper

* [NPU] fix place_type

* [NPU] split enfoce.h

* move acl* call into npu_info

* merge conflict

* fix merge

* merge conflict

* merge conflict

72241a6a

26 Nov, 2021 3 commits

W
clear local scope every setp (#37569) · 641038dc
Committed by wanghuancoder on Nov 26, 2021
```
* clear local scope every setp, test=develop

* refine,test=develop

* refine, test=develop
```
641038dc

TDM2 (#37044) · 4826167c

Committed by wangzhen38 on Nov 26, 2021

* add tdm sample

* add tdm sample in c++

* update tdm sample

* modify sample count

* fix conflict

* add set_date

* fix cmake error

* fix bug of proto

* update index_dataset proto

* update cmake

* fix error cmake

* fix cmake mkldnn

* fix cmake proto

* update cmake proto

* update cmake

* update rec

* update dataset

* update dataset

* update dataset

* updata dataset

* updata dataset

* updata coverage

* updata ci

* goback4

* fix npu ci

* add xxhash dep

4826167c

Z

fix bug of slice_grad using use_mkldnn attr (#37571) · e2fdb080
Committed by zyfncg on Nov 26, 2021

e2fdb080

25 Nov, 2021 5 commits

【PTen】Add fill_constant kernel using ScalarArray in pten (#37481) · a0d465f8

Committed by zyfncg on Nov 25, 2021

* add scalar and scalar_array

* remove DenseTensor include from Scalar and ScalarArray

* remove inner header from scalar_array

* refactor the method of fill_constant and add some comment

* add fill_constant kernel using ScalarArray

* modify some prompt

* remove fill_constant kernel with no shape

a0d465f8

Z

Pass the stream created by Paddle to CINN. (#37337) · c249556d
Committed by Zhen Wang on Nov 25, 2021

c249556d
W

fix pass_desc.proto compilation error, test=develop (#37536) · a4ef88ed
Committed by wuhuanzhou on Nov 25, 2021

a4ef88ed

[cherry-pick 2.2 heterps]bug fix for launch_utils.py (#37521) · 8bb1038c

Committed by zmx on Nov 25, 2021

* fix. test=develop

* fix. test=develop

* fix. test=develop

* fix. test=develop

* fix. test=develop

* [heterps]bug fix for _run_from_dataset

* fix heter_server.cc

* fix launch_utils.py

* fix heter_section_worker.cc

* fix. test=develop

* fix. test=develop

8bb1038c

X
Fix test rnn memory helper op (#37474) · e4791d88
Committed by xiongkun on Nov 25, 2021
```
* clear LoDTensorArray

* fix  bugs

* fix

* fix gpu
```
e4791d88

24 Nov, 2021 5 commits
- P
  Changed second batch of deprecated mkldnn header and function names to new oneDNN names (#37351) · 7db7a0ec
  Committed by piotrekobiIntel on Nov 24, 2021
```
* Add second batch of deprecated mkldnn namespace and macro changes

* Unlock CI

* Fix temporary namespace alias placing
```
  7db7a0ec
- A
  
  Fix lod in fetch_v2 (#37514) · acbf9974
  Committed by Aurelius84 on Nov 24, 2021
  
  acbf9974
- L
  
  [new-exec] support skipping infershape (#37510) · e76b601b
  Committed by Leo Chen on Nov 24, 2021
  
  e76b601b
- Z
  Adapt auto search (#37490) · 025053b4
  Committed by zhaoyingli on Nov 24, 2021
```
* adapt auto search

* adapt auto search

* fix matmulv2 compatible

* del debug
```
  025053b4
- A
  
  [NewExe] Support HandleComplexGradToRealGrad to cast complex into Real (#37450) · 8b87d5eb
  Committed by Aurelius84 on Nov 24, 2021
  
  8b87d5eb
23 Nov, 2021 6 commits
- Q
  [XPU] Reorganize xpu device codes in platform, test=develop (#37428) · 79800978
  Committed by Qi Li on Nov 23, 2021
```
* [XPU] Reorganize xpu device codes in platform, test=develop

* fix xpu_header.h, test=develop
```
  79800978
- W
  
  set feed var skip inplace, test=develop (#37467) · 4812eda5
  Committed by wanghuancoder on Nov 23, 2021
  
  4812eda5
- L
  [new-exec] sync scope and variable_scope when init executor (#37445) · 33653195
  Committed by Leo Chen on Nov 23, 2021
```
* sync scope and variable_scope when init executor

* set var_desc for new var
```
  33653195
- Z
  
  fix CMakeLists. test=develop (#37454) · ccad31f5
  Committed by zmx on Nov 23, 2021
  
  ccad31f5
- C
  [PTen] Adapt to inference api dir for pten (#37415) · 73f4601d
  Committed by Chen Weihang on Nov 22, 2021
```
* adapt to inference api dir for pten

* fix conflit with develop

* fix test_egr_ds_eager_tensor compile failed
```
  73f4601d
- A
  [NewExe] Support layout/dtype transform by adding transfer_layout/transfer_dtype op (#37299) · 2a1f009e
  Committed by Aurelius84 on Nov 23, 2021
```
* Add transfer_layout/dtype op

* clean useless codes

* fix unused var

* add optest in white.txt

* split into data_transfer.cc

* fix cmake

* modify according reviewer comment

* replace cast_op with transfer_dtype_op
```
  2a1f009e
22 Nov, 2021 3 commits

disable copying of datatype when sharing buffer between two tensors. (#37247) · 9ec1432d

Committed by Feiyu Chan on Nov 22, 2021

* disable copying of datatype when sharing buffer between two tensors.
* fix for mkldnn operator kernels (elementwise_add, sum, softplus, softmax, scale, activation), mannually set the data type when reusing memory by ShareBufferWith.

9ec1432d

[PTen] Add variable transform to/from ptenTensor and add cast kernel (#36916) · 5caa6fc5

Committed by chentianyu03 on Nov 22, 2021

* add cast kernel

* add cast cuda kernel

* add cast kernel

* make cast kernel output dtype undefined

* get cast dtype from vardesc

* move cast to manipulation and add test case

* add castinfershape

* avoid reinitilaze variable

* InitializeVariable support datatype

* merge develop branch

* fix merge bug

* revert modify initializeVariable

* revert modify on InitializeVariable

* revert modify on InitializeVariable

* mutable support reset dtype

* enable make pten tensor from variable when def_arg.type is undefined

* fix build pten ctx start_idx error

* copy pten out tensor to variable

* merge develop branch

* fix non pten kernel cast failed

* add reset allocation place for remake tensor

* fix inplace realloc error

* add mutable on pten kernles and remove unused cast files

* rename function names

* fix output type error

* fix conflict with develop branch

* set data type to variable with pten's dtype

* fix test_cast_api type mismatch

* densorTensro mutable_data support 0 bytes value

* fix the inplace bug of reshape kernel

* fix pten.backend != variable.place when moving storage, palce mismatch bug

* fix conflict with develop branch

* Fix bug of paddle::experimental::MovesStorage

* fix ReMakePtenDenseTensor place mismatch bug

* Revert "fix ReMakePtenDenseTensor place mismatch bug"

This reverts commit 86336032f60b8a15eacd2c1ff2fa513f5d8dfd1a.

* fix ReMakePtenDenseTensor place mismatch bug

* reverts the set_lod interface, test=develop

* modify by the review options

* modify error message

* add & for const input arguments

* add reference in params

* elementwise_sub add mutable_data

* fix ResetHolderWithType check size bug

* add dependence pten_tensor to test_cast_api object

* remove unused code to pass ci coverage
Co-authored-by: Chen Weihang <chenweihang@baidu.com>
Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
Co-authored-by: shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>

5caa6fc5

L

[new feature] add local scope for interpretercore (#37379) · 1f0512be
Committed by Leo Chen on Nov 22, 2021

1f0512be

19 Nov, 2021 2 commits

J
Optimize cinn_cache_key by replace GraphToProgram to Dot string (#37317) · edc3496f
Committed by jiangcheng on Nov 19, 2021
```
* optimize cache-key by replace GraphToProgram to Dot string

* fix compile failure bug
```
edc3496f

Add fuse_resnet_unit pass (#36818) · 3cd3bf29

Committed by wuhuanzhou on Nov 19, 2021

* GeneratePass support attr condition and mapping, test=develop

* fix coverage, test=develop

* Add fuse_resnet_unit pass, test=develop

* fix CI errors, test=develop

* fix CI errors, test=develop

* fix unittest error when compiling without CUDA, test=develop

* fix static ci error, test=develop

* limit kernel size must equal 1, test=develop

3cd3bf29

18 Nov, 2021 1 commit

Add the `GetFetchNames` method in CinnGraphSymbolization. (#37218) · 3ad495e8

Committed by Zhen Wang on Nov 18, 2021

* Add the `GetFetchNames` method in CinnGraphSymbolization.

* Use unordered_set instead vector as the type of fetch_var_names.

* Reuse the definition of kCompilationKey.

* Use CompileOptions to set fetch_var_ids.

* Update the argument passing of GraphCompiler.Build.

* Fix some bugs in CinnGraphSymbolization::GetFetchIds.

3ad495e8

17 Nov, 2021 1 commit
- L
  [new-exec] Refine standalone executor (#37278) · 6d6642c8
  Committed by Leo Chen on Nov 17, 2021
```
* init

* add feed ops in python side

* import LRScheduler

* update_feed

* refine code format
```
  6d6642c8

PaddlePaddle / Paddle: synced successfully about 1 year ago