提交 · be817719982f1821ab0519ceab85ec238bf99d43 · Crayon鑫 / Paddle

11 1月, 2022 1 次提交

【PTen】Add dot and matmul grad kernel in pten (#38713) · be817719

由 zyfncg 提交于 1月 11, 2022

* refactor matmul directory in pten

* fix merge conflict

* add dot_grad kernel

* add dot_grad kernel in pten

* add matmul_grad kernel

* update the code

* delete useless code in fluid

* fix some bug of running matmul grad kernel

* fix merge conflict

* refactor some code

* refactor code

be817719

10 1月, 2022 2 次提交

C

move get expected kernel args into pten (#38825) · 3a23c1a2
由 Chen Weihang 提交于 1月 10, 2022

3a23c1a2

[Unify Tensors PR #5] framework::Tensor inherits from DenseTensor,test=allcases (#38632) · 5c73a6ea

由 Zhanlue Yang 提交于 1月 10, 2022

* Added shared_ptr<Allocation> member & corresponding interfaces to Storage

* Removed original pten::Allocation from Storage and adjusted the interfaces accordingly

* Fixed issues with storage offset

* Used place to malloc allocation for TensorStorage

* [Unify Tensors PR #3]Ported framework::Tensor interfaces to pten::DenseTensor

* Fixed issues with place

* Added comments

* Moved mutable_data with stream argument to DenseTensor

* Added set_offset interface

* Fixed CI issues,test=allcases

* [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor

* Removed friend class EigenTensor/EigenMatrix/EigenVector from Tensor

* Modified framework::Tensor to inherit from DenseTensor

* Reverted changes too pten_layout() interface

* Removed friend classes

* Rearranged cfunction calls from tensor.data<void>() to tensor.data()

* Fixed CI issues

* Fixed lite issues

* Fixed data() interface issues,test=allcases

* Resolved IsInitialized() issues

* Fixed ResetHolder() issues

* Fixed MKLDNN & Storage issues

* Resolved ShareBufferWith() issues

* Fixed LoD issues

5c73a6ea

07 1月, 2022 1 次提交

[PTen]Refactor flatten_grad kernel (#38712) · 5cf0bb79

由 YuanRisheng 提交于 1月 07, 2022

* refactor flatten grad kernel

* fix bugs when run ci unittest

* fix bugs when use default GetExpectedPtenKernelArgs

* xshape sometimes is has null holder ,fix this bugs

5cf0bb79

04 1月, 2022 1 次提交

[Unify Tensors PR #3]Port framework::Tensor members & interfaces to... · dfdc9960

由 Zhanlue Yang 提交于 1月 04, 2022

[Unify Tensors PR #3]Port framework::Tensor members & interfaces to pten::DenseTensor, test=allcases (#38473)

* Added shared_ptr<Allocation> member & corresponding interfaces to Storage

* Removed original pten::Allocation from Storage and adjusted the interfaces accordingly

* Fixed issues with storage offset

* Used place to malloc allocation for TensorStorage

* [Unify Tensors PR #3]Ported framework::Tensor interfaces to pten::DenseTensor

* Fixed issues with place

* Added comments

* Moved mutable_data with stream argument to DenseTensor

* Added set_offset interface

* Fixed CI issues,test=allcases

* [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor

* Reverted changes too pten_layout() interface

* Removed friend classes

dfdc9960

31 12月, 2021 1 次提交
- C
  [PTen] Unify data layout of pten and fluid (#38583) · 8d32cef8
  由 Chen Weihang 提交于 12月 31, 2021
```
* unify data layout

* fix test_transfer_layout error
```
  8d32cef8
30 12月, 2021 1 次提交

[PTen] Remove offset in storage (#38472) · a504ff3f

由 Chen Weihang 提交于 12月 29, 2021

* remove offset in storage

* revert api change

* fix custom op slice bug

* fix mutable_data error

a504ff3f

27 12月, 2021 1 次提交
- C
  
  remove npu related impl (#38428) · f1d56b77
  由 Chen Weihang 提交于 12月 26, 2021
  
  f1d56b77
26 12月, 2021 1 次提交
- Z
  [Unify Tensors PR #2] Replaced pten::LoD with paddle::framework::LoD (#38275) · bbe879fc
  由 Zhanlue Yang 提交于 12月 26, 2021
```
* Replaced pten::LoD with paddle::framework::LoD

* Overrided CPUVector with CUDAVector

* Refactored paddle::framework::Vector
```
  bbe879fc
24 12月, 2021 2 次提交

[Unify Tensors PR ] Replaced pten::Allocation with... · 42cf2bee

由 Zhanlue Yang 提交于 12月 24, 2021

[Unify Tensors PR #1] Replaced pten::Allocation with shared_ptr<memory::Allocation> for Storage (#38301)

* Added shared_ptr<Allocation> member & corresponding interfaces to Storage

* Removed original pten::Allocation from Storage and adjusted the interfaces accordingly

* Fixed issues with storage offset

* Used place to malloc allocation for TensorStorage

42cf2bee

C

add register general kernel marco (#38409) · fc0a50aa
由 Chen Weihang 提交于 12月 23, 2021

fc0a50aa

23 12月, 2021 1 次提交
- 石
  updates the pten allocation, test=develop (#38355) · 4d5a6064
  由石晓伟提交于 12月 23, 2021
```
* updates the pten allocation, test=develop

* avoids an error message, test=develop
```
  4d5a6064
22 12月, 2021 2 次提交
- C
  [PTen] Change functions to funcs (#38340) · 64e2f670
  由 Chen Weihang 提交于 12月 22, 2021
```
* change functions to funcs

* remove useless code
```
  64e2f670
- C
  
  add copy constructor for densetensor (#38319) · fabc058b
  由 Chen Weihang 提交于 12月 21, 2021
  
  fabc058b
21 12月, 2021 1 次提交
- C
  [PTen] Rename cuda dir and context to gpu (#38296) · dc7597e3
  由 Chen Weihang 提交于 12月 21, 2021
```
* rename cuda to gpu

* revert CMake change

* resolve conflit

* rename other cuda to gpu

* poish details
```
  dc7597e3
16 12月, 2021 2 次提交

[PTen] Add register_ctx_kernel marco and move scale kernel (#38121) · af498677

由 Chen Weihang 提交于 12月 16, 2021

* add register_ctx_kernel and move scale kernel

* polish details by reviewer comment

* fix xpu compile failed

* fix cmake error

af498677

[PTen] Unify device context entrance in pten part 1 (#38172) · 047ee26c

由 Chen Weihang 提交于 12月 15, 2021

* unify device context entrance

* move all_context include to header

* polish cmake relay for device_context

* fix npu compile failed

* fix npu compile failed

* revert part of change

047ee26c

14 12月, 2021 2 次提交
- C
  [PTen] Polish kernel register marco design (#38078) · c9da845f
  由 Chen Weihang 提交于 12月 14, 2021
```
* polish register marco

* resolve compile failed

* revert needless change

* revert eager related change

* revert eager related change

* change register marco name

* polish deetails
```
  c9da845f
- Y
  
  remove KernelName (#38082) · 8198cad7
  由 YuanRisheng 提交于 12月 14, 2021
  
  8198cad7
13 12月, 2021 1 次提交

【PTen】Add variadic args kernel for PTen API to replace KernelContext (#37942) · b76ef045

由 zyfncg 提交于 12月 13, 2021

* add variadic_args kernel in pten

* merge develop code

* add variadic_args kernel and benchmark

* change dynamic_cast to static_cast for DeviceContext

* merge the code

* modify code format

* refactor variadic kernel function

b76ef045

10 12月, 2021 1 次提交
- Y
  [PTen]Add alias name for matmul and remove redundant member in kernel factory (#38011) · c5a7da4b
  由 YuanRisheng 提交于 12月 10, 2021
```
* add alias kernel name

* modify code as suggestions

* add alias name for matmul and remove redundant member in kernel factory
```
  c5a7da4b
09 12月, 2021 2 次提交
- C
  [PTen] Refine Kernel Registrar Writing (#37977) · b199ba85
  由 Chen Weihang 提交于 12月 09, 2021
```
* refine the kernel register impl

* fix cmake and symbol error

* remove overload marco

* polish details
```
  b199ba85
- C
  
  fix make error by alias name change (#37971) · e3f68f42
  由 Chen Weihang 提交于 12月 08, 2021
  
  e3f68f42
08 12月, 2021 1 次提交
- Y
  [PTen]Add alias kernel name (#37881) · ff6507db
  由 YuanRisheng 提交于 12月 08, 2021
```
* add alias kernel name

* modify code as suggestions
```
  ff6507db
07 12月, 2021 2 次提交

[Eager] fix cmake generate error, and fix circular import (#37871) · 79c25979

由 wanghuancoder 提交于 12月 07, 2021

* refine a test case, test=develop

* rm python, test=develop

* refine, test=develop

* fix cmake generate error, and fix circular import, test=develop

79c25979

[Pten]Move func from kernel_context.h into kernel_context.cc (#37804) · bfa0d7f3

由 YuanRisheng 提交于 12月 07, 2021

* add inplace op adaptation

* optimize inplace logic and fix bugs when run kernel that has args of vector<DenseTensor>

* move func in kernel_context.h into kernel_context.cc

* refactor logic that transform variable to densetensor

* fix bugs when compile

* update func name

* fix bugs when run windows-ci

bfa0d7f3

03 12月, 2021 2 次提交

R
refine structure for cuda and rocm (#37202) · a6d2fddb
由 ronnywang 提交于 12月 03, 2021
```
* refine structure for cuda and rocm

* update

* update

* update

* update
```
a6d2fddb

[Eager] publish python c api for eager (#37550) · 07b4fe93

由 wanghuancoder 提交于 12月 03, 2021

* refine a test case, test=develop

* publish python c api for eager, test=develop

* revert modify about test_allclose_layer.py, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* delete numpy includes, use pybind11 numpy.h, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* suport eager error msg, and add grad test case, test=develop

* refine, test=develop

* refine, test=develop

07b4fe93

02 12月, 2021 1 次提交

[PTen]Make inplace_op and vector<DenseTensor> input compatible with old architecture (#37674) · c1fd1b1c

由 YuanRisheng 提交于 12月 02, 2021

* add inplace op adaptation

* optimize inplace logic and fix bugs when run kernel that has args of vector<DenseTensor>

* refactor logic that transform variable to densetensor

* update func name

c1fd1b1c

24 11月, 2021 1 次提交

【PTen】Add Scalar and ScalarArray in pten (#37409) · 0f24de83

由 zyfncg 提交于 11月 24, 2021

* add scalar and scalar_array

* remove DenseTensor include from Scalar and ScalarArray

* remove inner header from scalar_array

* refactor the method of fill_constant and add some comment

0f24de83

22 11月, 2021 1 次提交

[PTen] Add variable transform to/from ptenTensor and add cast kernel (#36916) · 5caa6fc5

由 chentianyu03 提交于 11月 22, 2021

* add cast kernel

* add cast cuda kernel

* add cast kernel

* make cast kernel output dtype undefined

* get cast dtype from vardesc

* move cast to manipulation and add test case

* add castinfershape

* avoid reinitilaze variable

* InitializeVariable support datatype

* merge develop branch

* fix merge bug

* revert modify initializeVariable

* revert modify on InitializeVariable

* revert modify on InitializeVariable

* mutable support reset dtype

* enable make pten tensor from variable when def_arg.type is undefined

* fix build pten ctx start_idx error

* copy pten out tensor to variable

* merge develop branch

* fix non pten kernel cast failed

* add reset allocation place for remake tensor

* fix inplace realloc error

* add mutable on pten kernles and remove unused cast files

* rename function names

* fix output type error

* fix conflict with develop branch

* set data type to variable with pten's dtype

* fix test_cast_api type mismatch

* densorTensro mutable_data support 0 bytes value

* fix the inplace bug of reshape kernel

* fix pten.backend != variable.place when moving storage, palce mismatch bug

* fix conflict with develop branch

* Fix bug of paddle::experimental::MovesStorage

* fix ReMakePtenDenseTensor place mismatch bug

* Revert "fix ReMakePtenDenseTensor place mismatch bug"

This reverts commit 86336032f60b8a15eacd2c1ff2fa513f5d8dfd1a.

* fix ReMakePtenDenseTensor place mismatch bug

* reverts the set_lod interface, test=develop

* modify by the review options

* modify error message

* add & for const input arguments

* add reference in params

* elementwise_sub add mutable_data

* fix ResetHolderWithType check size bug

* add dependence pten_tensor to test_cast_api object

* remove unused code to pass ci coverage
Co-authored-by: NChen Weihang <chenweihang@baidu.com>
Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
Co-authored-by: Nshixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>

5caa6fc5

19 11月, 2021 1 次提交

【PTen】Rename TensorMeta member type to dtype (#37277) · c13edf66

由 zyfncg 提交于 11月 19, 2021

* rename TensorBase interface data_type() to dtype()

* rename type to dtype of TensorMeta

* merge the code

* merge the code

* fix the problem when merge conflict

c13edf66

17 11月, 2021 2 次提交
- 石
  
  change the meta modification rules, test=develop (#37255) · 8c44ad47
  由石晓伟提交于 11月 17, 2021
  
  8c44ad47
- Z
  
  rename TensorBase interface data_type() to dtype() (#37257) · 1e9b3a3d
  由 zyfncg 提交于 11月 17, 2021
  
  1e9b3a3d
16 11月, 2021 2 次提交

Add API and unit test for reshape (#37232) · 79b49c20

由 YuanRisheng 提交于 11月 16, 2021

* reshape kernel refactor

* fix compile bugs when run ci

* support xpu for reshape

* fix bugs when run unittest in kunlun ci

* fix compile bugs when run kunlun

* perfect code according to suggestion

* add api and unit test for reshape

79b49c20

石

supports the slice of upper tensor, test=develop (#37215) · c5ccff73
由石晓伟提交于 11月 16, 2021

c5ccff73

15 11月, 2021 1 次提交

[Pten] Refactor the implementation of custom operator (#37122) · 1e598f1a

由 Chen Weihang 提交于 11月 15, 2021

* move extension into pten [no-verify]

* append tensor methods by ext_tensor [no-verify]

* append other tensor methods [no-verify]

* ext related files tidy [no-verify]

* include relation tidy [no-verify]

* add pten tensor test [no-verify]

* replace tensor in custom op & compile success

* refine tensor constructor for unittest

* custom relu jit run success

* fix all custom op unittests

* add inference cmake adapt [no-verify]

* fix failed unittests

* fix windows failed unittests

* try to fix kunlun and inference failed

* fix test_elementwise_api error

* try to fix win compile failed

* fix kunlun fp16 type error

* remove useless haddle error macro

* add custom linear op test

* fix compile failed & add win symbols

* fix non pten kernel cast failed

* add dll decl for api

* polish several deetails

* polish details by review comment

* add dll_decl for register

1e598f1a

14 11月, 2021 1 次提交

[PTen]Reshape Kernel Refactor (#37164) · 895692e3

由 YuanRisheng 提交于 11月 14, 2021

* reshape kernel refactor

* fix compile bugs when run ci

* support xpu for reshape

* fix bugs when run unittest in kunlun ci

* fix compile bugs when run kunlun

* perfect code according to suggestion

895692e3

12 11月, 2021 2 次提交
- 石
  
  add the shallow clone member func of the dense tensor, test=develop (#37146) · 9303b095
  由石晓伟提交于 11月 12, 2021
  
  9303b095
- 石
  
  adjust the COLUMNS=128; (#37120) · 4d536678
  由石晓伟提交于 11月 12, 2021
  
  4d536678

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致