提交 · 6554cc106c0c9ffafb21251467cc3f36182ca033 · BaiXuePrincess / Paddle

24 12月, 2021 3 次提交

[pten] combine reduce_cuda codes (#38328) · 08941eda

由 chentianyu03 提交于 12月 24, 2021

* combine reduce_cuda codes

* support float16 in pten redcue_mean

* replace ReduceCudaKernel impl with pten reduce impl

* mv reduce funcs into reduce_cuda_impl

* rm unsed codes and headers

* mv GetReduceDim into reduce_cuda_impl

* recover GetReduceDim in reduce_op.h

* add new dispatch macro

* fix pool op output not inited and cause transform to pten::denseTensor error

* fix output tensor not initialized error

* rename new dispatch macro and format code style

* rm reduce_functor_op.h file

08941eda

[Unify Tensors PR #1] Replaced pten::Allocation with... · 42cf2bee

由 Zhanlue Yang 提交于 12月 24, 2021

[Unify Tensors PR #1] Replaced pten::Allocation with shared_ptr<memory::Allocation> for Storage (#38301)

* Added shared_ptr<Allocation> member & corresponding interfaces to Storage

* Removed original pten::Allocation from Storage and adjusted the interfaces accordingly

* Fixed issues with storage offset

* Used place to malloc allocation for TensorStorage

42cf2bee

C

add register general kernel marco (#38409) · fc0a50aa
由 Chen Weihang 提交于 12月 23, 2021

fc0a50aa

23 12月, 2021 5 次提交
- C
  
  move conj kernel impl (#38365) · 8da9eff4
  由 Chen Weihang 提交于 12月 23, 2021
  
  8da9eff4
- Z
  【PTen】Add empty and empty_like kernel in pten (#38334) · 4221cd33
  由 zyfncg 提交于 12月 23, 2021
```
* add empty and empty_like kernel in pten

* add empty dev_api
```
  4221cd33
- C
  
  move sign kernel impl (#38363) · bb38b6aa
  由 Chen Weihang 提交于 12月 22, 2021
  
  bb38b6aa
- C
  [PTen] Move dot kernel impl (#38359) · 0a4ffbc7
  由 Chen Weihang 提交于 12月 22, 2021
```
* move dot kernel impl

* remove needless cmake items
```
  0a4ffbc7
- 石
  updates the pten allocation, test=develop (#38355) · 4d5a6064
  由石晓伟提交于 12月 23, 2021
```
* updates the pten allocation, test=develop

* avoids an error message, test=develop
```
  4d5a6064
22 12月, 2021 5 次提交
- C
  [PTen] Change functions to funcs (#38340) · 64e2f670
  由 Chen Weihang 提交于 12月 22, 2021
```
* change functions to funcs

* remove useless code
```
  64e2f670
- C
  [PTen] Add cmake function for kernels (#38311) · e6310dbd
  由 Chen Weihang 提交于 12月 22, 2021
```
* add pten kernel cmake

* add pten kernel cmake function

* fix compile error

* add enforce include for full kernel

* fix compile failed

* change cuda to gpu

* fix cmake function error
```
  e6310dbd
- C
  
  add copy constructor for densetensor (#38319) · fabc058b
  由 Chen Weihang 提交于 12月 21, 2021
  
  fabc058b
- Y
  [PTen]Move flatten kernel to new directory (#38255) · 4d1ce184
  由 YuanRisheng 提交于 12月 22, 2021
```
* move flatten

* fix bugs of test

* modify header file

* add copy declare

* fix compile bugs
```
  4d1ce184
- Z
  Rename full infer_meta (#38332) · abb07f35
  由 zyfncg 提交于 12月 22, 2021
```
* rename full infer_meta

* fix merge problem
```
  abb07f35
21 12月, 2021 3 次提交
- C
  [PTen] Rename cuda dir and context to gpu (#38296) · dc7597e3
  由 Chen Weihang 提交于 12月 21, 2021
```
* rename cuda to gpu

* revert CMake change

* resolve conflit

* rename other cuda to gpu

* poish details
```
  dc7597e3
- C
  [pten] fix when out_dtype is same with x.dtype and still transform type error (#38285) · e0fd3bbf
  由 chentianyu03 提交于 12月 21, 2021
```
* fix when out_dtype is same with x.dtype and still transform type error

* fix spell error
```
  e0fd3bbf
- C
  [PTen] Remove eigen and blas directory (#38291) · d9fcdc3a
  由 Chen Weihang 提交于 12月 20, 2021
```
* remove eigen and blas dir

* fix declare error
```
  d9fcdc3a
20 12月, 2021 3 次提交
- C
  [pten]add pten conj kernel (#38247) · a2793e5e
  由 chentianyu03 提交于 12月 20, 2021
```
* add pten conj kernel

* modify conj_kernel file path

* add defined cuda macro to cuda/conj_kernel.h
```
  a2793e5e
- 石
  
  changes the call AllocShared to Alloc, test=develop (#38258) · bb0713b2
  由石晓伟提交于 12月 20, 2021
  
  bb0713b2
- Z
  
  move the directory of fill kernels in pten (#38219) · 06128b9f
  由 zyfncg 提交于 12月 20, 2021
  
  06128b9f
17 12月, 2021 5 次提交
- J
  Support multi place constructor (#38171) · 6f439e5a
  由 Jiabin Yang 提交于 12月 17, 2021
```
* support more eager tensor api

* support multiple constructor for eager tensor

* add place related code

* polish code

* specific randint with dtype of int64

* Support pure cpu test

* refine test in pure cpu

* refine test in pure cpu
```
  6f439e5a
- C
  
  add scale lost deps (#38237) · 66a9d71a
  由 Chen Weihang 提交于 12月 17, 2021
  
  66a9d71a
- C
  [pten] modify reduce_sum reduce_mean args (#38216) · eaa2363e
  由 chentianyu03 提交于 12月 17, 2021
```
* modify sum mean args

* add GetExpectedPtenKernelArgs for redcue_op

* modify kernel args number

* modify kernel args number
```
  eaa2363e
- C
  
  fix detail error for scale (#38213) · 20b7c99c
  由 Chen Weihang 提交于 12月 16, 2021
  
  20b7c99c
- L
  [BugFix]: Elementwise branch selection and Broadcast dimension merge (#38204) · e097a748
  由 limingshu 提交于 12月 17, 2021
```
* fix_bugs_for_elementwise_branch_selection

* fix merge_dims bugs

* fix all influenced file
```
  e097a748
16 12月, 2021 4 次提交

[PTen] Unify device context entrance in pten part 2 (#38182) · e02537f9

由 Chen Weihang 提交于 12月 16, 2021

* unify device context entrance

* move all_context include to header

* polish cmake relay for device_context

* fix npu compile failed

* fix npu compile failed

e02537f9

[PTen] Add register_ctx_kernel marco and move scale kernel (#38121) · af498677

由 Chen Weihang 提交于 12月 16, 2021

* add register_ctx_kernel and move scale kernel

* polish details by reviewer comment

* fix xpu compile failed

* fix cmake error

af498677

[Pten]Modify registered kernel name (#38109) · be874c08

由 YuanRisheng 提交于 12月 16, 2021

* Reduce reshape kernel functions in pten

* delete notes

* fix bugs when compile

* modify register name

* fix compile bugs

be874c08

[PTen] Unify device context entrance in pten part 1 (#38172) · 047ee26c

由 Chen Weihang 提交于 12月 15, 2021

* unify device context entrance

* move all_context include to header

* polish cmake relay for device_context

* fix npu compile failed

* fix npu compile failed

* revert part of change

047ee26c

15 12月, 2021 4 次提交
- Y
  Change a comment in pten header to avoid the disturb to op benchmark ci. (#38165) · cecea8e6
  由 Yiqun Liu 提交于 12月 15, 2021
```
test=document_fix
```
  cecea8e6
- C
  
  replace moves_storage and alloc_construct (#38134) · e78eb3f4
  由 Chen Weihang 提交于 12月 14, 2021
  
  e78eb3f4
- C
  
  revert attr type change (#38129) · 038ca68d
  由 Chen Weihang 提交于 12月 14, 2021
  
  038ca68d
- C
  
  move tensor using to single header (#38142) · c23afce1
  由 Chen Weihang 提交于 12月 14, 2021
  
  c23afce1
14 12月, 2021 3 次提交
- C
  [PTen] Polish kernel register marco design (#38078) · c9da845f
  由 Chen Weihang 提交于 12月 14, 2021
```
* polish register marco

* resolve compile failed

* revert needless change

* revert eager related change

* revert eager related change

* change register marco name

* polish deetails
```
  c9da845f
- Y
  
  remove KernelName (#38082) · 8198cad7
  由 YuanRisheng 提交于 12月 14, 2021
  
  8198cad7
- Y
  [PTen] Reduce reshape kernel functions in pten (#38055) · a3c8abc7
  由 YuanRisheng 提交于 12月 14, 2021
```
* Reduce reshape kernel functions in pten

* delete notes

* fix bugs when compile
```
  a3c8abc7
13 12月, 2021 3 次提交
- C
  
  fix custom op infershape error (#38045) · 3a339cc0
  由 Chen Weihang 提交于 12月 13, 2021
  
  3a339cc0
- Z
  【PTen】Add variadic args kernel for PTen API to replace KernelContext (#37942) · b76ef045
  由 zyfncg 提交于 12月 13, 2021
```
* add variadic_args kernel in pten

* merge develop code

* add variadic_args kernel and benchmark

* change dynamic_cast to static_cast for DeviceContext

* merge the code

* modify code format

* refactor variadic kernel function
```
  b76ef045
- S
  fix reduce_max bug (#38026) · 512e4339
  由 Shang Zhizhou 提交于 12月 13, 2021
```
* fix reduce_max bug

* add unittest
```
  512e4339
10 12月, 2021 2 次提交
- C
  
  rename TensoCopy (#38036) · 8f2b0860
  由 chentianyu03 提交于 12月 10, 2021
  
  8f2b0860
- Y
  [PTen]Add alias name for matmul and remove redundant member in kernel factory (#38011) · c5a7da4b
  由 YuanRisheng 提交于 12月 10, 2021
```
* add alias kernel name

* modify code as suggestions

* add alias name for matmul and remove redundant member in kernel factory
```
  c5a7da4b

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致