Commits · 6827adbdf0400c32a6a21cc724aa45c4d792cebb · Crayon鑫 / Paddle

27 Dec, 2021 · 4 commits

Revert "[Unify Tensors PR ] Replaced pten::Allocation with... · 6827adbd

Committed by Zhanlue Yang on Dec 27, 2021

Revert "[Unify Tensors PR #1] Replaced pten::Allocation with shared_ptr<memory::Allocation> for Storage (#38301)"

This reverts commit 42cf2bee.


Support multi-outputs feature for broadcast ops (#38329) · 89d38f55

Committed by limingshu on Dec 27, 2021

* No harm to KP

* Pass the compile stage

* change the WriteData function

* fix template bugs and pass ctest of current elementwise

* pass partial template specialization of template functions in CI-ROCm

* make the 'WriteData' function flexible

* a less harmful way to support multi-output

* a less harmful way to support multi-output


remove npu related impl (#38428) · f1d56b77
Committed by Chen Weihang on Dec 26, 2021

[PTen] Move cast kernel impl (#38382) · 1fb734d7
Committed by Chen Weihang on Dec 26, 2021
```
* rename the 'to' api to copy_to

* revert needless change

* polish format
```

26 Dec, 2021 · 2 commits

[PTen] Move copy kernel impl (#38421) · 73819658

Committed by Chen Weihang on Dec 26, 2021

* add register general kernel macro

* move copy kernel impl

* revert needless change

* polish details

* fix xpu compile failed

* fix xpu compile failed

* polish format

[Unify Tensors PR #2] Replaced pten::LoD with paddle::framework::LoD (#38275) · bbe879fc
Committed by Zhanlue Yang on Dec 26, 2021
```
* Replaced pten::LoD with paddle::framework::LoD

* Overrode CPUVector with CUDAVector

* Refactored paddle::framework::Vector
```

24 Dec, 2021 · 4 commits


add is dense tensor method (#38424) · 6ff3596e
Committed by Chen Weihang on Dec 24, 2021


[pten] combine reduce_cuda codes (#38328) · 08941eda

Committed by chentianyu03 on Dec 24, 2021

* combine reduce_cuda codes

* support float16 in pten reduce_mean

* replace ReduceCudaKernel impl with pten reduce impl

* mv reduce funcs into reduce_cuda_impl

* rm unused codes and headers

* mv GetReduceDim into reduce_cuda_impl

* recover GetReduceDim in reduce_op.h

* add new dispatch macro

* fix pool op output not being initialized, which caused an error when transforming to pten::DenseTensor

* fix output tensor not initialized error

* rename new dispatch macro and format code style

* rm reduce_functor_op.h file


[Unify Tensors PR ] Replaced pten::Allocation with... · 42cf2bee

Committed by Zhanlue Yang on Dec 24, 2021

[Unify Tensors PR #1] Replaced pten::Allocation with shared_ptr<memory::Allocation> for Storage (#38301)

* Added shared_ptr<Allocation> member & corresponding interfaces to Storage

* Removed original pten::Allocation from Storage and adjusted the interfaces accordingly

* Fixed issues with storage offset

* Used place to malloc allocation for TensorStorage


add register general kernel macro (#38409) · fc0a50aa
Committed by Chen Weihang on Dec 23, 2021


23 Dec, 2021 · 5 commits
- move conj kernel impl (#38365) · 8da9eff4
  Committed by Chen Weihang on Dec 23, 2021
- 【PTen】Add empty and empty_like kernel in pten (#38334) · 4221cd33
  Committed by zyfncg on Dec 23, 2021
```
* add empty and empty_like kernel in pten

* add empty dev_api
```
- move sign kernel impl (#38363) · bb38b6aa
  Committed by Chen Weihang on Dec 22, 2021
- [PTen] Move dot kernel impl (#38359) · 0a4ffbc7
  Committed by Chen Weihang on Dec 22, 2021
```
* move dot kernel impl

* remove needless cmake items
```
- updates the pten allocation, test=develop (#38355) · 4d5a6064
  Committed by 石晓伟 on Dec 23, 2021
```
* updates the pten allocation, test=develop

* avoids an error message, test=develop
```

22 Dec, 2021 · 5 commits
- [PTen] Change functions to funcs (#38340) · 64e2f670
  Committed by Chen Weihang on Dec 22, 2021
```
* change functions to funcs

* remove useless code
```
- [PTen] Add cmake function for kernels (#38311) · e6310dbd
  Committed by Chen Weihang on Dec 22, 2021
```
* add pten kernel cmake

* add pten kernel cmake function

* fix compile error

* add enforce include for full kernel

* fix compile failed

* change cuda to gpu

* fix cmake function error
```
- add copy constructor for densetensor (#38319) · fabc058b
  Committed by Chen Weihang on Dec 21, 2021
- [PTen]Move flatten kernel to new directory (#38255) · 4d1ce184
  Committed by YuanRisheng on Dec 22, 2021
```
* move flatten

* fix bugs of test

* modify header file

* add copy declare

* fix compile bugs
```
- Rename full infer_meta (#38332) · abb07f35
  Committed by zyfncg on Dec 22, 2021
```
* rename full infer_meta

* fix merge problem
```

21 Dec, 2021 · 3 commits
- [PTen] Rename cuda dir and context to gpu (#38296) · dc7597e3
  Committed by Chen Weihang on Dec 21, 2021
```
* rename cuda to gpu

* revert CMake change

* resolve conflict

* rename other cuda to gpu

* polish details
```
- [pten] fix when out_dtype is same with x.dtype and still transform type error (#38285) · e0fd3bbf
  Committed by chentianyu03 on Dec 21, 2021
```
* fix when out_dtype is same with x.dtype and still transform type error

* fix spell error
```
- [PTen] Remove eigen and blas directory (#38291) · d9fcdc3a
  Committed by Chen Weihang on Dec 20, 2021
```
* remove eigen and blas dir

* fix declare error
```

20 Dec, 2021 · 3 commits
- [pten]add pten conj kernel (#38247) · a2793e5e
  Committed by chentianyu03 on Dec 20, 2021
```
* add pten conj kernel

* modify conj_kernel file path

* add defined cuda macro to cuda/conj_kernel.h
```
- changes the call AllocShared to Alloc, test=develop (#38258) · bb0713b2
  Committed by 石晓伟 on Dec 20, 2021
- move the directory of fill kernels in pten (#38219) · 06128b9f
  Committed by zyfncg on Dec 20, 2021

17 Dec, 2021 · 5 commits
- Support multi place constructor (#38171) · 6f439e5a
  Committed by Jiabin Yang on Dec 17, 2021
```
* support more eager tensor api

* support multiple constructor for eager tensor

* add place related code

* polish code

* specify randint with dtype of int64

* Support pure cpu test

* refine test in pure cpu

* refine test in pure cpu
```
- add scale lost deps (#38237) · 66a9d71a
  Committed by Chen Weihang on Dec 17, 2021
- [pten] modify reduce_sum reduce_mean args (#38216) · eaa2363e
  Committed by chentianyu03 on Dec 17, 2021
```
* modify sum mean args

* add GetExpectedPtenKernelArgs for reduce_op

* modify kernel args number

* modify kernel args number
```
- fix detail error for scale (#38213) · 20b7c99c
  Committed by Chen Weihang on Dec 16, 2021
- [BugFix]: Elementwise branch selection and Broadcast dimension merge (#38204) · e097a748
  Committed by limingshu on Dec 17, 2021
```
* fix_bugs_for_elementwise_branch_selection

* fix merge_dims bugs

* fix all influenced file
```

16 Dec, 2021 · 4 commits

[PTen] Unify device context entrance in pten part 2 (#38182) · e02537f9

Committed by Chen Weihang on Dec 16, 2021

* unify device context entrance

* move all_context include to header

* polish cmake relay for device_context

* fix npu compile failed

* fix npu compile failed


[PTen] Add register_ctx_kernel macro and move scale kernel (#38121) · af498677

Committed by Chen Weihang on Dec 16, 2021

* add register_ctx_kernel and move scale kernel

* polish details by reviewer comment

* fix xpu compile failed

* fix cmake error


[Pten]Modify registered kernel name (#38109) · be874c08

Committed by YuanRisheng on Dec 16, 2021

* Reduce reshape kernel functions in pten

* delete notes

* fix bugs when compile

* modify register name

* fix compile bugs


[PTen] Unify device context entrance in pten part 1 (#38172) · 047ee26c

Committed by Chen Weihang on Dec 15, 2021

* unify device context entrance

* move all_context include to header

* polish cmake relay for device_context

* fix npu compile failed

* fix npu compile failed

* revert part of change


15 Dec, 2021 · 4 commits
- Change a comment in pten header to avoid disturbing the op benchmark CI. (#38165) · cecea8e6
  Committed by Yiqun Liu on Dec 15, 2021
```
test=document_fix
```
- replace moves_storage and alloc_construct (#38134) · e78eb3f4
  Committed by Chen Weihang on Dec 14, 2021
- revert attr type change (#38129) · 038ca68d
  Committed by Chen Weihang on Dec 14, 2021
- move tensor using to single header (#38142) · c23afce1
  Committed by Chen Weihang on Dec 14, 2021

14 Dec, 2021 · 1 commit

[PTen] Polish kernel register macro design (#38078) · c9da845f

Committed by Chen Weihang on Dec 14, 2021

* polish register macro

* resolve compile failed

* revert needless change

* revert eager related change

* revert eager related change

* change register macro name

* polish details

