提交 · 3a23c1a224f4e51003ff106d8114a343ec6ecc23 · Crayon鑫 / Paddle

10 1月, 2022 3 次提交

C

move get expected kernel args into pten (#38825) · 3a23c1a2
由 Chen Weihang 提交于 1月 10, 2022

3a23c1a2

[Unify Tensors PR #5] framework::Tensor inherits from DenseTensor,test=allcases (#38632) · 5c73a6ea

由 Zhanlue Yang 提交于 1月 10, 2022

* Added shared_ptr<Allocation> member & corresponding interfaces to Storage

* Removed original pten::Allocation from Storage and adjusted the interfaces accordingly

* Fixed issues with storage offset

* Used place to malloc allocation for TensorStorage

* [Unify Tensors PR #3]Ported framework::Tensor interfaces to pten::DenseTensor

* Fixed issues with place

* Added comments

* Moved mutable_data with stream argument to DenseTensor

* Added set_offset interface

* Fixed CI issues,test=allcases

* [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor

* Removed friend class EigenTensor/EigenMatrix/EigenVector from Tensor

* Modified framework::Tensor to inherit from DenseTensor

* Reverted changes too pten_layout() interface

* Removed friend classes

* Rearranged cfunction calls from tensor.data<void>() to tensor.data()

* Fixed CI issues

* Fixed lite issues

* Fixed data() interface issues,test=allcases

* Resolved IsInitialized() issues

* Fixed ResetHolder() issues

* Fixed MKLDNN & Storage issues

* Resolved ShareBufferWith() issues

* Fixed LoD issues

5c73a6ea

Support setting infershape function for custom grad op (#38776) · 046553c7

由 Chen Weihang 提交于 1月 10, 2022

* unify infer_shape func calling

* support set grad infer shape fn for custom op

* unify infershape in new executor and eager

* remove todo comment

* revert infershape in operator

046553c7

07 1月, 2022 2 次提交
- Y
  [PTen]Refactor flatten_grad kernel (#38712) · 5cf0bb79
  由 YuanRisheng 提交于 1月 07, 2022
```
* refactor flatten grad kernel

* fix bugs when run ci unittest

* fix bugs when use default GetExpectedPtenKernelArgs

* xshape sometimes is has null holder ,fix this bugs
```
  5cf0bb79
- N
  
  Fix a bug when reduce_num = 1 in Reduce Op (#38771) · f634c0b1
  由 niuliling123 提交于 1月 07, 2022
  
  f634c0b1
06 1月, 2022 5 次提交
- L
  
  [pten] fix typo of device (#38760) · 42cfd15e
  由 Leo Chen 提交于 1月 06, 2022
  
  42cfd15e
- Y
  [PTen]Move manipulation mid to new directory and rename flatten/reshape kernel (#38730) · 3d3bc681
  由 YuanRisheng 提交于 1月 06, 2022
```
* move mid api and rename kernel

* use empty kernel
```
  3d3bc681
- C
  [pten]move reduce files and dev_api (#38715) · c48bd3ff
  由 chentianyu03 提交于 1月 06, 2022
```
* move eigen/reduce.h imple into cpu/reduce.h

* ctx to dev_ctx
```
  c48bd3ff
- Z
  【PTen】Adjust the format of full kernel (#38596) · 0c02d2ed
  由 zyfncg 提交于 1月 06, 2022
```
* adjust the full kernel

* remove creation.h

* use Empty to create tensor in full
```
  0c02d2ed
- Y
  [Pten]Move GPU_implementation of elementwise kernel in new directory (#38696) · c1adced7
  由 YuanRisheng 提交于 1月 06, 2022
```
* move gpu_impl of elementwise kernel

* change copyright to 2022
```
  c1adced7
05 1月, 2022 2 次提交

[pten]Move reduce code new (#38648) · 7a4a512d

由 chentianyu03 提交于 1月 05, 2022

* change 'math' to 'math_kernel'

* fix compile bugs

* merge develop

* fix compile bugs

* fix compile bugs

* move reduce files by new rule

* add set header

* format code style

* merge develop and fix conflict

* merge develop and fix conflict
Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>

7a4a512d

C
[PTen] Polish infermeta filename (#38695) · d6df5bd9
由 Chen Weihang 提交于 1月 05, 2022
```
* polish infermeta filename

* polish infermeta filename
```
d6df5bd9

04 1月, 2022 4 次提交

N
Add OpFunctor and replace cast, scale, clip, bce_loss and abs_grad with... · 6eac06e3
由 niuliling123 提交于 1月 04, 2022
```
Add OpFunctor and replace cast, scale, clip, bce_loss and abs_grad with elementwise_no_broadcast (#38500)
```
6eac06e3

[Pten]Move CPU_implementation of elementwise kernel in new directory (#38651) · 7c020c71

由 YuanRisheng 提交于 1月 04, 2022

* change 'math' to 'math_kernel'

* fix compile bugs

* merge develop

* fix compile bugs

* move cpu_impl of elementwise kernel to new directory

7c020c71

[Unify Tensors PR #3]Port framework::Tensor members & interfaces to... · dfdc9960

由 Zhanlue Yang 提交于 1月 04, 2022

[Unify Tensors PR #3]Port framework::Tensor members & interfaces to pten::DenseTensor, test=allcases (#38473)

* Added shared_ptr<Allocation> member & corresponding interfaces to Storage

* Removed original pten::Allocation from Storage and adjusted the interfaces accordingly

* Fixed issues with storage offset

* Used place to malloc allocation for TensorStorage

* [Unify Tensors PR #3]Ported framework::Tensor interfaces to pten::DenseTensor

* Fixed issues with place

* Added comments

* Moved mutable_data with stream argument to DenseTensor

* Added set_offset interface

* Fixed CI issues,test=allcases

* [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor

* Reverted changes too pten_layout() interface

* Removed friend classes

dfdc9960

C
[PTen] Move inner empty and cast api to kernel.h (#38587) · 64538c8d
由 Chen Weihang 提交于 1月 04, 2022
```
* move inner cast api to cast_kernel.h

* resolve conflit
```
64538c8d

31 12月, 2021 3 次提交
- C
  
  replace contextt to context (#38619) · f1366d58
  由 Chen Weihang 提交于 12月 31, 2021
  
  f1366d58
- C
  [PTen] Unify data layout of pten and fluid (#38583) · 8d32cef8
  由 Chen Weihang 提交于 12月 31, 2021
```
* unify data layout

* fix test_transfer_layout error
```
  8d32cef8
- Y
  [Pten]Move math to new directory and change 「math」 to 「math_kernel」 (#38604) · e76087ad
  由 YuanRisheng 提交于 12月 31, 2021
```
* change 'math' to 'math_kernel'

* fix compile bugs

* merge develop

* fix compile bugs
```
  e76087ad
30 12月, 2021 2 次提交
- S
  
  try to expose cast with ptr function (#38598) · 15cbf81b
  由 sneaxiy 提交于 12月 30, 2021
  
  15cbf81b
- C
  [PTen] Remove offset in storage (#38472) · a504ff3f
  由 Chen Weihang 提交于 12月 29, 2021
```
* remove offset in storage

* revert api change

* fix custom op slice bug

* fix mutable_data error
```
  a504ff3f
29 12月, 2021 3 次提交
- C
  
  unify infermeta target (#38580) · 458365cf
  由 Chen Weihang 提交于 12月 29, 2021
  
  458365cf
- S
  
  fix reduce_max/reduce_min bug (#38476) · 995332ef
  由 Shang Zhizhou 提交于 12月 29, 2021
  
  995332ef
- L
  
  code clean (#38550) · 206a8f6c
  由 limingshu 提交于 12月 29, 2021
  
  206a8f6c
28 12月, 2021 5 次提交

L
Support multi-output feature for elementwise (#38410) · 48f061fb
由 limingshu 提交于 12月 28, 2021
```
* first commit

* pass ctest of  elementwise_div_grad
```
48f061fb

Support test basic of Var and Layer (#38426) · 1fb80a6a

由 Jiabin Yang 提交于 12月 28, 2021

* Rearranged Eager AutoCodeGen directory structure

* Removed USE_OP in Eager AutoCodeGen

* Enabled generation for Operators without Grad/Inputs/Outputs

* Resolved operators without input

* Fixed merge conflicts

* Enabled Eager AutoCodeGen for 10+ more operators

* Refactored Eager AutoCodeGen with more organized helper objects

* Enabled Eager AutoCodeGen for operators with multiple OpBases

* Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument

* Handled Dispensable Inputs/Outputs in Eager AutoCodeGen

* Adjusted function generation/call between Python-C API & Dygraph API

* Synchronized auto-generated Python-C API with Dygraph Forward Functions

* support more eager tensor api

* fix merge compile error

* fix compile error and fit develop code

* support pure CPU

* fix some logic error in eager_mode

* support _varbase_creator in eager mode

* Added safe_initialized interface to EagerTensor for use in processing dispensable inputs

* for eager mode

* refine

* support multiple constructor for eager tensor

* add place related code

* polish code

* specific randint with dtype of int64

* Support pure cpu test

* eager logic

* refine test in pure cpu

* eager logic

* eager logic

* eager logic, test=develop

* skip core.eager when in inference, test=develop

* refine, test=develop

* refine, test=develop

* call RetainGrad after run forward kernel, test=develop

* refine, test=develop

* support dygraph util, meta, guard test

* support inference test

* refine test and fix initializer failed

* support create varbase and fix retain grad error

* fix windows error

* support test code coverage

* support test code coverage

* support test code coverage
Co-authored-by: Njim19930609 <jim19930609@gmail.com>
Co-authored-by: NWang Huan <wanghuan29@baidu.com>

1fb80a6a

Z
refactor matmul directory in pten (#38227) · 982bf444
由 zyfncg 提交于 12月 28, 2021
```
* refactor matmul directory in pten

* fix merge conflict
```
982bf444

[pten] remove in_type arg in cast kernel (#38486) · 0637b9a6

由 chentianyu03 提交于 12月 28, 2021

* remove intype arg in cast kernel

* modify conj config in api.yaml by dictionary order

* rm unused code in cast_kernel.cu

0637b9a6

Z

Fixed issue with offset,test=allcases (#38506) · dc30ad1d
由 Zhanlue Yang 提交于 12月 28, 2021

dc30ad1d

27 12月, 2021 4 次提交

Y
[PTen]move reshape kernel according to new directory (#38432) · 49216134
由 YuanRisheng 提交于 12月 27, 2021
```
* move reshape

* fix compile bugs

* delete manipulation file

* fix compile bugs
```
49216134

Support multi-outputs feature for broadcast ops (#38329) · 89d38f55

由 limingshu 提交于 12月 27, 2021

* No harm to KP

* Pass the compile stage

* change the WriteData function

* fix template bugs and pass ctest of current elementwise

* for passing partial template specialization of tempalte function in CI-ROCm

* To make 'WriteData' funtion flexible.

* a less harmful way to support multi-output

* a less harmful way to support multi-output

89d38f55

C

remove npu related impl (#38428) · f1d56b77
由 Chen Weihang 提交于 12月 26, 2021

f1d56b77
C
[PTen] Move cast kernel impl (#38382) · 1fb734d7
由 Chen Weihang 提交于 12月 26, 2021
```
* rename to api to copy_to

* revert needless change

* polish format
```
1fb734d7

26 12月, 2021 2 次提交

[PTen] Move copy kernel impl (#38421) · 73819658

由 Chen Weihang 提交于 12月 26, 2021

* add register general kernel marco

* move copy kernel impl

* revert needless change

* polish details

* fix xpu compil faild

* fix xpu compile failed

* polish format

73819658

Z
[Unify Tensors PR #2] Replaced pten::LoD with paddle::framework::LoD (#38275) · bbe879fc
由 Zhanlue Yang 提交于 12月 26, 2021
```
* Replaced pten::LoD with paddle::framework::LoD

* Overrided CPUVector with CUDAVector

* Refactored paddle::framework::Vector
```
bbe879fc

24 12月, 2021 4 次提交

C

add is dense tensor method (#38424) · 6ff3596e
由 Chen Weihang 提交于 12月 24, 2021

6ff3596e

[pten] combine reduce_cuda codes (#38328) · 08941eda

由 chentianyu03 提交于 12月 24, 2021

* combine reduce_cuda codes

* support float16 in pten redcue_mean

* replace ReduceCudaKernel impl with pten reduce impl

* mv reduce funcs into reduce_cuda_impl

* rm unsed codes and headers

* mv GetReduceDim into reduce_cuda_impl

* recover GetReduceDim in reduce_op.h

* add new dispatch macro

* fix pool op output not inited and cause transform to pten::denseTensor error

* fix output tensor not initialized error

* rename new dispatch macro and format code style

* rm reduce_functor_op.h file

08941eda

[Unify Tensors PR ] Replaced pten::Allocation with... · 42cf2bee

由 Zhanlue Yang 提交于 12月 24, 2021

[Unify Tensors PR #1] Replaced pten::Allocation with shared_ptr<memory::Allocation> for Storage (#38301)

* Added shared_ptr<Allocation> member & corresponding interfaces to Storage

* Removed original pten::Allocation from Storage and adjusted the interfaces accordingly

* Fixed issues with storage offset

* Used place to malloc allocation for TensorStorage

42cf2bee

C

add register general kernel marco (#38409) · fc0a50aa
由 Chen Weihang 提交于 12月 23, 2021

fc0a50aa

23 12月, 2021 1 次提交
- C
  
  move conj kernel impl (#38365) · 8da9eff4
  由 Chen Weihang 提交于 12月 23, 2021
  
  8da9eff4

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致