提交 · 29c211ee079c03b14929f9354002ade6752e2238 · PaddlePaddle / Paddle

10 1月, 2022 2 次提交

[Unify Tensors PR ] framework::Tensor inherits from DenseTensor,test=allcases (#38632) · 5c73a6ea

由 Zhanlue Yang 提交于 1月 10, 2022

* Added shared_ptr<Allocation> member & corresponding interfaces to Storage

* Removed original pten::Allocation from Storage and adjusted the interfaces accordingly

* Fixed issues with storage offset

* Used place to malloc allocation for TensorStorage

* [Unify Tensors PR #3]Ported framework::Tensor interfaces to pten::DenseTensor

* Fixed issues with place

* Added comments

* Moved mutable_data with stream argument to DenseTensor

* Added set_offset interface

* Fixed CI issues,test=allcases

* [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor

* Removed friend class EigenTensor/EigenMatrix/EigenVector from Tensor

* Modified framework::Tensor to inherit from DenseTensor

* Reverted changes too pten_layout() interface

* Removed friend classes

* Rearranged cfunction calls from tensor.data<void>() to tensor.data()

* Fixed CI issues

* Fixed lite issues

* Fixed data() interface issues,test=allcases

* Resolved IsInitialized() issues

* Fixed ResetHolder() issues

* Fixed MKLDNN & Storage issues

* Resolved ShareBufferWith() issues

* Fixed LoD issues

5c73a6ea

Support setting infershape function for custom grad op (#38776) · 046553c7

由 Chen Weihang 提交于 1月 10, 2022

* unify infer_shape func calling

* support set grad infer shape fn for custom op

* unify infershape in new executor and eager

* remove todo comment

* revert infershape in operator

046553c7

04 1月, 2022 1 次提交

[Pten]Move CPU_implementation of elementwise kernel in new directory (#38651) · 7c020c71

由 YuanRisheng 提交于 1月 04, 2022

* change 'math' to 'math_kernel'

* fix compile bugs

* merge develop

* fix compile bugs

* move cpu_impl of elementwise kernel to new directory

7c020c71

31 12月, 2021 2 次提交
- C
  [PTen] Unify data layout of pten and fluid (#38583) · 8d32cef8
  由 Chen Weihang 提交于 12月 31, 2021
```
* unify data layout

* fix test_transfer_layout error
```
  8d32cef8
- Y
  [Pten]Move math to new directory and change 「math」 to 「math_kernel」 (#38604) · e76087ad
  由 YuanRisheng 提交于 12月 31, 2021
```
* change 'math' to 'math_kernel'

* fix compile bugs

* merge develop

* fix compile bugs
```
  e76087ad
30 12月, 2021 1 次提交

[PTen] Remove offset in storage (#38472) · a504ff3f

由 Chen Weihang 提交于 12月 29, 2021

* remove offset in storage

* revert api change

* fix custom op slice bug

* fix mutable_data error

a504ff3f

28 12月, 2021 4 次提交

Support test basic of Var and Layer (#38426) · 1fb80a6a

由 Jiabin Yang 提交于 12月 28, 2021

* Rearranged Eager AutoCodeGen directory structure

* Removed USE_OP in Eager AutoCodeGen

* Enabled generation for Operators without Grad/Inputs/Outputs

* Resolved operators without input

* Fixed merge conflicts

* Enabled Eager AutoCodeGen for 10+ more operators

* Refactored Eager AutoCodeGen with more organized helper objects

* Enabled Eager AutoCodeGen for operators with multiple OpBases

* Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument

* Handled Dispensable Inputs/Outputs in Eager AutoCodeGen

* Adjusted function generation/call between Python-C API & Dygraph API

* Synchronized auto-generated Python-C API with Dygraph Forward Functions

* support more eager tensor api

* fix merge compile error

* fix compile error and fit develop code

* support pure CPU

* fix some logic error in eager_mode

* support _varbase_creator in eager mode

* Added safe_initialized interface to EagerTensor for use in processing dispensable inputs

* for eager mode

* refine

* support multiple constructor for eager tensor

* add place related code

* polish code

* specific randint with dtype of int64

* Support pure cpu test

* eager logic

* refine test in pure cpu

* eager logic

* eager logic

* eager logic, test=develop

* skip core.eager when in inference, test=develop

* refine, test=develop

* refine, test=develop

* call RetainGrad after run forward kernel, test=develop

* refine, test=develop

* support dygraph util, meta, guard test

* support inference test

* refine test and fix initializer failed

* support create varbase and fix retain grad error

* fix windows error

* support test code coverage

* support test code coverage

* support test code coverage
Co-authored-by: Njim19930609 <jim19930609@gmail.com>
Co-authored-by: NWang Huan <wanghuan29@baidu.com>

1fb80a6a

Z
refactor matmul directory in pten (#38227) · 982bf444
由 zyfncg 提交于 12月 28, 2021
```
* refactor matmul directory in pten

* fix merge conflict
```
982bf444

[pten] remove in_type arg in cast kernel (#38486) · 0637b9a6

由 chentianyu03 提交于 12月 28, 2021

* remove intype arg in cast kernel

* modify conj config in api.yaml by dictionary order

* rm unused code in cast_kernel.cu

0637b9a6

Z

Fixed issue with offset,test=allcases (#38506) · dc30ad1d
由 Zhanlue Yang 提交于 12月 28, 2021

dc30ad1d

27 12月, 2021 2 次提交
- Y
  [PTen]move reshape kernel according to new directory (#38432) · 49216134
  由 YuanRisheng 提交于 12月 27, 2021
```
* move reshape

* fix compile bugs

* delete manipulation file

* fix compile bugs
```
  49216134
- C
  [PTen] Move cast kernel impl (#38382) · 1fb734d7
  由 Chen Weihang 提交于 12月 26, 2021
```
* rename to api to copy_to

* revert needless change

* polish format
```
  1fb734d7
24 12月, 2021 3 次提交

C

add is dense tensor method (#38424) · 6ff3596e
由 Chen Weihang 提交于 12月 24, 2021

6ff3596e

[pten] combine reduce_cuda codes (#38328) · 08941eda

由 chentianyu03 提交于 12月 24, 2021

* combine reduce_cuda codes

* support float16 in pten redcue_mean

* replace ReduceCudaKernel impl with pten reduce impl

* mv reduce funcs into reduce_cuda_impl

* rm unsed codes and headers

* mv GetReduceDim into reduce_cuda_impl

* recover GetReduceDim in reduce_op.h

* add new dispatch macro

* fix pool op output not inited and cause transform to pten::denseTensor error

* fix output tensor not initialized error

* rename new dispatch macro and format code style

* rm reduce_functor_op.h file

08941eda

[Unify Tensors PR ] Replaced pten::Allocation with... · 42cf2bee

由 Zhanlue Yang 提交于 12月 24, 2021

[Unify Tensors PR #1] Replaced pten::Allocation with shared_ptr<memory::Allocation> for Storage (#38301)

* Added shared_ptr<Allocation> member & corresponding interfaces to Storage

* Removed original pten::Allocation from Storage and adjusted the interfaces accordingly

* Fixed issues with storage offset

* Used place to malloc allocation for TensorStorage

42cf2bee

23 12月, 2021 5 次提交
- C
  
  move conj kernel impl (#38365) · 8da9eff4
  由 Chen Weihang 提交于 12月 23, 2021
  
  8da9eff4
- Z
  【PTen】Add empty and empty_like kernel in pten (#38334) · 4221cd33
  由 zyfncg 提交于 12月 23, 2021
```
* add empty and empty_like kernel in pten

* add empty dev_api
```
  4221cd33
- C
  
  move sign kernel impl (#38363) · bb38b6aa
  由 Chen Weihang 提交于 12月 22, 2021
  
  bb38b6aa
- C
  [PTen] Move dot kernel impl (#38359) · 0a4ffbc7
  由 Chen Weihang 提交于 12月 22, 2021
```
* move dot kernel impl

* remove needless cmake items
```
  0a4ffbc7
- 石
  updates the pten allocation, test=develop (#38355) · 4d5a6064
  由石晓伟提交于 12月 23, 2021
```
* updates the pten allocation, test=develop

* avoids an error message, test=develop
```
  4d5a6064
22 12月, 2021 2 次提交

[PTen] Add cmake function for kernels (#38311) · e6310dbd

由 Chen Weihang 提交于 12月 22, 2021

* add pten kernel cmake

* add pten kernel cmake function

* fix compile error

* add enforce include for full kernel

* fix compile failed

* change cuda to gpu

* fix cmake function error

e6310dbd

Y
[PTen]Move flatten kernel to new directory (#38255) · 4d1ce184
由 YuanRisheng 提交于 12月 22, 2021
```
* move flatten

* fix bugs of test

* modify header file

* add copy declare

* fix compile bugs
```
4d1ce184

21 12月, 2021 2 次提交
- C
  [PTen] Rename cuda dir and context to gpu (#38296) · dc7597e3
  由 Chen Weihang 提交于 12月 21, 2021
```
* rename cuda to gpu

* revert CMake change

* resolve conflit

* rename other cuda to gpu

* poish details
```
  dc7597e3
- C
  [PTen] Remove eigen and blas directory (#38291) · d9fcdc3a
  由 Chen Weihang 提交于 12月 20, 2021
```
* remove eigen and blas dir

* fix declare error
```
  d9fcdc3a
20 12月, 2021 1 次提交

[pten]add pten conj kernel (#38247) · a2793e5e

由 chentianyu03 提交于 12月 20, 2021

* add pten conj kernel

* modify conj_kernel file path

* add defined cuda macro to cuda/conj_kernel.h

a2793e5e

17 12月, 2021 2 次提交

Support multi place constructor (#38171) · 6f439e5a

由 Jiabin Yang 提交于 12月 17, 2021

* support more eager tensor api

* support multiple constructor for eager tensor

* add place related code

* polish code

* specific randint with dtype of int64

* Support pure cpu test

* refine test in pure cpu

* refine test in pure cpu

6f439e5a

[pten] modify reduce_sum reduce_mean args (#38216) · eaa2363e

由 chentianyu03 提交于 12月 17, 2021

* modify sum mean args

* add GetExpectedPtenKernelArgs for redcue_op

* modify kernel args number

* modify kernel args number

eaa2363e

16 12月, 2021 2 次提交

[PTen] Add register_ctx_kernel marco and move scale kernel (#38121) · af498677

由 Chen Weihang 提交于 12月 16, 2021

* add register_ctx_kernel and move scale kernel

* polish details by reviewer comment

* fix xpu compile failed

* fix cmake error

af498677

[PTen] Unify device context entrance in pten part 1 (#38172) · 047ee26c

由 Chen Weihang 提交于 12月 15, 2021

* unify device context entrance

* move all_context include to header

* polish cmake relay for device_context

* fix npu compile failed

* fix npu compile failed

* revert part of change

047ee26c

15 12月, 2021 3 次提交
- C
  
  replace moves_storage and alloc_construct (#38134) · e78eb3f4
  由 Chen Weihang 提交于 12月 14, 2021
  
  e78eb3f4
- C
  
  revert attr type change (#38129) · 038ca68d
  由 Chen Weihang 提交于 12月 14, 2021
  
  038ca68d
- C
  
  move tensor using to single header (#38142) · c23afce1
  由 Chen Weihang 提交于 12月 14, 2021
  
  c23afce1
14 12月, 2021 3 次提交
- C
  [PTen] Polish kernel register marco design (#38078) · c9da845f
  由 Chen Weihang 提交于 12月 14, 2021
```
* polish register marco

* resolve compile failed

* revert needless change

* revert eager related change

* revert eager related change

* change register marco name

* polish deetails
```
  c9da845f
- Y
  
  remove KernelName (#38082) · 8198cad7
  由 YuanRisheng 提交于 12月 14, 2021
  
  8198cad7
- Y
  [PTen] Reduce reshape kernel functions in pten (#38055) · a3c8abc7
  由 YuanRisheng 提交于 12月 14, 2021
```
* Reduce reshape kernel functions in pten

* delete notes

* fix bugs when compile
```
  a3c8abc7
13 12月, 2021 2 次提交

C

fix custom op infershape error (#38045) · 3a339cc0
由 Chen Weihang 提交于 12月 13, 2021

3a339cc0

【PTen】Add variadic args kernel for PTen API to replace KernelContext (#37942) · b76ef045

由 zyfncg 提交于 12月 13, 2021

* add variadic_args kernel in pten

* merge develop code

* add variadic_args kernel and benchmark

* change dynamic_cast to static_cast for DeviceContext

* merge the code

* modify code format

* refactor variadic kernel function

b76ef045

10 12月, 2021 1 次提交
- Z
  
  fix cmake bug when WITH_PYTHON=OFF (#38015) · 7ccf67e5
  由 zyfncg 提交于 12月 10, 2021
  
  7ccf67e5
09 12月, 2021 1 次提交

[PTen] Refine Kernel Registrar Writing (#37977) · b199ba85

由 Chen Weihang 提交于 12月 09, 2021

* refine the kernel register impl

* fix cmake and symbol error

* remove overload marco

* polish details

b199ba85

07 12月, 2021 1 次提交
- Z
  
  add cmake depend for api_gen.py (#37900) · 7e831b5a
  由 zyfncg 提交于 12月 07, 2021
  
  7e831b5a

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功