Commits · 5631da9cc2ca284727c512c663e23f426cbcc8cf · 机器未来 / Paddle

27 Jan, 2022 7 commits

[PTen]Support AllocateFrom in Tensor and Alloc/HostAlloc in Context (#39022) · 5631da9c

Authored by Aurelius84 on Jan 27, 2022

* Support allocate_from in Tensor and allocate_data in Context

* fix #ifdef CUDA

* fix cycle depends

* fix test_xxx_dev_api failed

* fix windows compiling error

* fix unittest

* modify into PImpl

* fix selected rows

* add TODO comment

* refine interface according to reviewer

[PTen] Add infermeta registry (#39204) · f3f16126

Authored by Chen Weihang on Jan 27, 2022

* add infermeta registry

* add infermeta registry

* add unittest

* polish details

[PluggableDevice] Add custom kernel support based on pten kernel management (#38848) · a8879215

Authored by Aganlengzi on Jan 27, 2022

* [Demo] custom kernel based on pten kernel

* merge and npu custom work well

* del comments

* delete other code

* fix CUDAContext

* fix not found small_vector.h

* support NPU

* fix NPUContext

* fix DeviceContext support

* add UT

* fix call

* add UT

* fix

* fix for comments and ut

* add MACRO control

* fix multi input output

* support env CUSTOM_DEVICE_ROOT

* deal with special cases

* fix for Windows

* try coverage with test_custom_kernel_dot.py

* fix test_custom_kernel_dot

* fix test_custom_kernel_dot

* fix merge

* fix merge

* fix CI

* update

* merge and fix

* remove WITH_CUSTOM_KERNEL

* fix merge

* merge and fix

* fix ut

* fix ut for mac

* add more UT

* add more UT

* fix

[pten] add full xpu kernel (#39172) · 93839717

Authored by chentianyu03 on Jan 27, 2022

* add full_kernel xpu

* fix full xpu register device type error

* fix full kernel bug

* add fulllike kernel impl and replace with raw kernel

* fix dev_ctx convert template args error

* modify namespace and header file

* add isinf check

* fix input type args in TensorSetConstantXPU error

optimize kunlun/xpu softmax_with_cross_entropy add add unitest (#39180) · 2b9bb8bb

Authored by QingshuChen on Jan 27, 2022

* optimize kunlun/xpu softmax_with_cross_entropy add add unitest
*test=kunlun

* minor
*test=kunlun

* minor
*test=kunlun

* minor
*test=kunlun

* minor
*test=kunlun

Add SparseCooTensor and SparseCsrTensor (#38906) · a7edb3f3

Authored by zhangkaihuo on Jan 27, 2022

* fix bug:
1. atten: set the default value of attn_dropout_rate to None
2. ffn: add activation parameter

* for pure fp16

* Add a SparseCsrTensor

* remove unused functional

* remove const

* remove SetMemoberTensor

* remove non_zero_nums_, the number of non zero elements of each batch can be obtained from the crows

* SparseCooTensor

* add SetMember

* merge upstream; add SetMember

* merge upstream

* merge upstream; add newline at end of file

* add newline at end of file

* remove newline at end of file

* remove newline at end of file

* stash

* use pten::framework::make_ddim

* use pten::framework::make_ddim

* merge upstream; use the latest mutable_data

* merge upstream; use the latest mutable_data

* return mutable dense tensor

move math_cuda_utils.h to pten/kernels/funcs (#39246) · 809a10b6
Authored by Feiyu Chan on Jan 27, 2022

26 Jan, 2022 9 commits

[pten] remove deprecated fluid op kernel for pten (#38842) · 3ab9aef1

Authored by Leo Chen on Jan 26, 2022

* update cmake file to remove fluid kernel

* add pten declaration.h to where pybind.h used

* fix sync_bn and tensorrt_engine

* refine detection_library

* fix interpreter_core

* support eager legacy

* fit eager legacy for pten

* fall back to cpu if not found kernel

* fix compile problem

* fix compile problem

* refine fallback logic

* fit operator.run()

* fix xpu compile

* fit for new_exec

* add REGISTER_OP_WITHOUT_GRADIENT

* un-cache pt_kernel_context

* fix compile

* fix cudnn

* fix compiling with on_infer

* fix mkldnn

* fix isfinite_v2

* fix xpu problem

* fix op_device

* refine fallback for xpu

* fix xpu compile

* merge develop

* refine code format

* fix compile

* fix compile

* add data_transfer

* fix PreparePtenData

* fix cpu context

* merge develop

* fix compile

* fix error device context

* fix xpu

* fix dev_ctx

[pten] Cast xpu kernel (#39179) · 93d2f0a6

Authored by chentianyu03 on Jan 26, 2022

* cast xpu kernel init

* cast xpu kernel

* replace with raw cast xpu kernel

* fix cast kernel bug

* add the missing break

* modify namespace and header file

add dependences of enforce (#39237) · 2c0160e5
Authored by xiongkun on Jan 26, 2022

[Move selected_rows PR #5] VisitDataType use Pten::DataType (#39236) · 42a0947e

Authored by Weilong Wu on Jan 26, 2022

* Added selected_rows and rw_lock to pten

* Renamed the unit test target to fix CI

* Removed Class SelectedRows in Fluid, changed include/cmake relationship, use pten::SelectedRows in Fluid

* Remove rw_lock.h,rw_lock_test.cc in fluid

* Use pten::RWLock and pten::AutoRDLock, fix CI

* Use pten::SelectedRows

* Use pten::SelectedRows

* Fix to pass NPU CI

* Selected_Rows inherits from TensorBase

* Use pten::SelectedRows, to pass NPU CI

* To fix NPU CI

* To fix NPU CI again

* Use paddle/pten/core/enforce and polish code

* Use pten::DataType instead of using proto_type

* Move part of data_type to pten

* Polish Code

[Pten]Move kernel_primitives lib to Pten directory (#39169) · 452bcbe2

Authored by YuanRisheng on Jan 26, 2022

* move kernel_primitives

* use pten's errors

[PTEN] cpu_context add eigen deps (#39234) · bd5c962d

Authored by Wilber on Jan 26, 2022

* add eigen deps

* update

[Move selected_rows PR #4] SelectedRows inherits from TensorBase. (#39162) · 3e80253a

Authored by Weilong Wu on Jan 26, 2022

* Added selected_rows and rw_lock to pten

* Renamed the unit test target to fix CI

* Removed Class SelectedRows in Fluid, changed include/cmake relationship, use pten::SelectedRows in Fluid

* Remove rw_lock.h,rw_lock_test.cc in fluid

* Use pten::RWLock and pten::AutoRDLock, fix CI

* Use pten::SelectedRows

* Use pten::SelectedRows

* Fix to pass NPU CI

* Selected_Rows inherits from TensorBase

* Use pten::SelectedRows, to pass NPU CI

* To fix NPU CI

* To fix NPU CI again

* Use paddle/pten/core/enforce and polish code

[Refactoring Tensor PR #7] differentiate deprecated interfaces (#39228) · 30470853
Authored by 石晓伟 on Jan 26, 2022

[PTen] Unify InferMeta(Shape) Function in pten and fluid op (#38976) · b75507d3

Authored by Chen Weihang on Jan 26, 2022

* infermeta context init design

* support infermeta called in fluid op

* add hasattr and attr methods

* add dygraph GetVarPtrs support

* rename arg_map_context to arg_map_utils

* add registry for arg map func

* resolve conflict

* refactor op utils design

* polish meta config

* fix details

* remove hasattr method

* resolve conflict

* revert cmake order change

* revert some change

* change init pos

* fix compile failed

* fix typo

* fix inference failed

* fix windows compile failed

* polish format

Co-authored-by: Wang Huan <wanghuan29@baidu.com>

25 Jan, 2022 9 commits
- fix compile problem cause by api code_gen (#39199) · 39238275
  Authored by zyfncg on Jan 25, 2022
- change infermeta and remove makePtenTenosr in reshape (#39186) · 7613129e
  Authored by YuanRisheng on Jan 25, 2022
- Revert "Replace EigenBroadcast with ElementwiseBroadcast in ReduceGrad (#38959)" (#39205) · 978558be
  Authored by niuliling123 on Jan 25, 2022

  This reverts commit 9059ef69.
- fix custom ops, test=develop (#39153) · 712ccfbf
  Authored by 石晓伟 on Jan 25, 2022
- Replace EigenBroadcast with ElementwiseBroadcast in ReduceGrad (#38959) · 9059ef69
  Authored by niuliling123 on Jan 25, 2022
- [Move selected_rows PR #3] Change the relationship of [include/Cmake]. (#39128) · 2bafd338
  Authored by Weilong Wu on Jan 25, 2022

  * Added selected_rows and rw_lock to pten

  * Renamed the unit test target to fix CI

  * Removed Class SelectedRows in Fluid, changed include/cmake relationship, use pten::SelectedRows in Fluid

  * Remove rw_lock.h,rw_lock_test.cc in fluid

  * Use pten::RWLock and pten::AutoRDLock, fix CI

  * Use pten::SelectedRows

  * Use pten::SelectedRows

  * Fix to pass NPU CI

  * Use pten::SelectedRows, to pass NPU CI

  * To fix NPU CI

  * To fix NPU CI again
- [PTEN] Add xpu context. (#39098) · c1e5a393
  Authored by Wilber on Jan 25, 2022
- fix the bug of SetAllocationForOutputTenosr (#39174) · 0c3657ad
  Authored by zyfncg on Jan 25, 2022
- [PTen] Migrate string tinyformat errors and part of enforce into pten (#39051) · 6ca49164
  Authored by xiongkun on Jan 25, 2022

  * transfer: string tinyformat errors and part of enforce into pten

  * remove comment

  * fix by code review

  * assert is not compile in -DNDEBUG

  * add string as dependences of paddle_inference
24 Jan, 2022 8 commits

[pten] add Scale xpu kernel (#39092) · 7874d0a5

Authored by chentianyu03 on Jan 24, 2022

* add scale xpu kernel

* add scale xpu kernel

* add scale xpu kernel

* replace with pten scale kernel

* change dev_ctx

* modify float16 head path

* remove unused xpu header

[Pten]Refactor elementwise_add grad / double grad / triple grad Kernel and move them to pten (#39048) · 3bf3a6ee

Authored by YuanRisheng on Jan 24, 2022

* refactor elementwise add grad

* fix compile bugs

* fix unit test bugs

* fix file conflicts

* fix bugs when buildPtenContext

[Pten] Migration of eigen numeric extensions and functors in paddle/fluid/operatos/eigen (#39124) · a1e40dc6

Authored by Feiyu Chan on Jan 24, 2022

* migration of functors in paddle/fluid/operators/eigen and paddle/fluid/platform/eigen_ext.h
* update path of data types like float16.h in includes in extensions.h

Move dim_test/ddim_test into tests directory (#39111) · f783b846

Authored by Aurelius84 on Jan 24, 2022

Co-authored-by: Chen Weihang <chenweihang@baidu.com>

[PTEN] Move dynload from fluid to pten. (#39120) · 3c1dc6f6

Authored by Wilber on Jan 24, 2022

* move dynload from fluid to pten.

* fix ci compile

* fix windows ci compile.

* update

* update

* fix compile error

[Refactoring Tensor PR #5] replace storage with pten allocation (#39085) · a56e16a7

Authored by 石晓伟 on Jan 24, 2022

* updates callers, test=develop

* updates tensor, test=develop

* fixes errors, test=develop

* remove some dtypes, test=develop

* fix errors in the base storage modification, test=develop

* fixes a bug, test=develop

* fixes the bugs in push the whole, test=develop

* updates, test=develop

* update

* update, test=develop

* fixes the mac-py3 CI, test=develop

* remove the storage impl, test=develop

* updates some codes, test=develop

* update, test=develop

* updates pten allocation, test=develop

Fixed ResizeAndAllocate issues (#39101) · 9cfa811e
Authored by Zhanlue Yang on Jan 24, 2022

Backward C++ API Code-Generation (#39057) · f83d1c0b

Authored by zyfncg on Jan 24, 2022

* add config of backward-api auto-gene

* fix compile bug

* remove wrong header

* rename grad_api to backward_api

* modify .gitignore

23 Jan, 2022 1 commit
- [PTen] Add infermeta utils for register infermeta funtion (#39135) · 85334b04
  Authored by Chen Weihang on Jan 23, 2022

  * add infermeta utils for register infermeta

  * polish license format
22 Jan, 2022 5 commits
- remove useless cmake list (#39141) · 60df9254
  Authored by Chen Weihang on Jan 22, 2022
- [PTen] Add attr method for ArgumentMappingContext (#39130) · 7ac2f80f
  Authored by Chen Weihang on Jan 22, 2022

  * add attr for arg map context

  * add argument fn declare

  * add attr test for get attr value method

  * polish details
- [Move selected_rows PR #2] Added Selected_Rows and rw_lock to Pten (#39087) · ff7f9d06
  Authored by Weilong Wu on Jan 22, 2022

  * Renamed selected_rows.* -> selected_rows_utils.*

  * Added selected_rows and rw_lock to pten

  * Removed useless header

  * Renamed the unit test target to fix CI

  * Use pten::framework::DDim

  * Set selceted_rows_test properties timeout

  * Polish code to pten style

  Co-authored-by: Chen Weihang <chenweihang@baidu.com>
- add meta tensor for unify infershape (#39131) · 09f6f17c
  Authored by Chen Weihang on Jan 22, 2022
- [PTen] Auto generate include headers (#39123) · e92b3040
  Authored by Chen Weihang on Jan 22, 2022

  * auto gen include headers

  * move to pten.cmake
21 Jan, 2022 1 commit
- [pten] fix test concat dev api build failed (#39117) · a14dc688
  Authored by chentianyu03 on Jan 21, 2022

  * fix test concat dev api build failed

  * fix conflict

  * fix conflict

机器未来 / Paddle: in sync with the fork source project
