提交 · 3e7825f375ce0a3e91d11979b883acfbfa7556f1 · PaddlePaddle / Paddle

15 2月, 2022 10 次提交

[PluggableDevice] Add custom runtime support (#38740) · 3e7825f3

由 ronnywang 提交于 2月 15, 2022

* [CustomRuntime] Add DeviceManager

* [CustomRuntime] Add DeviceInterface

* [CustomRuntime] Add Stream, Event, DeviceGuard, CallbackManager

* [CustomRuntime] Add plug-in device

* [CustomRuntime] Memory module support PluggableDevice

* [CustomRuntime] Add WITH_PLUGGABLE_DEVICE cmake option

* update

* [API] update API doc based on comments, test=develop
Co-authored-by: Nqili93 <qili93@qq.com>

3e7825f3

F
[Pten] move paddle/operators/math/functors.h and compound_functors.h (#39514) · 0d46a108
由 Feiyu Chan 提交于 2月 15, 2022
```
* move paddle/operators/math/functors.h
* move paddle/operators/math/compound_functors.h
```
0d46a108

move histogram to pten (#39496) · 556f6eb0

由 hong 提交于 2月 15, 2022

* move histogram to pten; test=develop

* fix format error; test=develop

* fix histogram kernel format; test=develop

556f6eb0

Move Abs OP to pten (#39492) · fb473067

由 From00 提交于 2月 15, 2022

* Move Abs op to pten

* Fix NPU compilation error

* Fix CI error

* Use LaunchSameDimsElementwiseCudaKernel in pten

fb473067

[Pten] Support SelectedRows in C++ API (#39497) · 5bb3b668

由 zyfncg 提交于 2月 15, 2022

* add data_transform in pten api

* support GetKernelTypeForVar

* fix complie problem of bfloat16

* add scale_sr in api

* suppport select_row in C++ api

* merge code

5bb3b668

C
[PTen] Fix single dtype register errror (#39506) · 9fd67ffe
由 Chen Weihang 提交于 2月 15, 2022
```
* fix single dtype reg errror

* fix windows failed
```
9fd67ffe

move algorithm.h (#39502) · 7eb9593e

由 Feiyu Chan 提交于 2月 15, 2022

Move paddle/fluid/operators/math/algorithm.h to paddle/pten/kernels/funcs and rename all references to symbols in it.

7eb9593e

[Pten]Move expand_v2 to pten (#39471) · 2d16d69b

由 Linjie Chen 提交于 2月 15, 2022

* move expand to pten

* move expand_v2 to pten

* move expand_v2 to pten

* fix grad register

* fix grad register

* fix tensorcpry

* fix tensorcopy

* fix tensorcopy

* fix tensorcopy

* fix tensorcopy

* fix ci

* fix tensorcopy

2d16d69b

C
[PTen] Polish trace moving (#39510) · ab866777
由 Chen Weihang 提交于 2月 15, 2022
```
* polish trace moving

* remove useless header
```
ab866777

[PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404

由 Aurelius84 提交于 2月 15, 2022

* #1 migrate dist-related type()-> dtype()

* move datatype function from pten -> fluid/framework

* change type() in imperative into convert(dtype())

* modify xx_tensor->type into xx_tensor->dtype

* change the set_type interface and the caller

* modify xx_tensor.type into xx_tensor.dtype

* fix mutable_data(place, dtype())

* change caller of mutable_data in pten and distributed

* change the caller of mutable_data in fluid/framework

* change the caller of mutable_data in imperative directory

* mutable_data: inference

* update the call of mutable_data

* transfer MakePenScalarArray MakePtenScalar ResetHolderWithType

* pass the compile. the next step is remove VarType in Pten

* fix all and remove VarType from pten. success in linux. Next task is other platform

* fix conflict with develop

* fix compiled error

* Fix reset conversion

* fix conflict

* fix compiled problem

* fix typo

* Fix << in tensor_utils.cc

* fix type->dtype

* fix unittest

* fix tensor init constructor

* fix DataTypeSize for BFloat16

* fix code style

* fix npu compiled error

* fix npu

* compile npu sucessfully

* fix conflict

* fix conflict
Co-authored-by: Nxiongkun <xiongkun03@baidu.com>

7e7e9404

14 2月, 2022 4 次提交

C

[pten] add CI check for using DenseTensor::mutable_data() in pten directions (#39467) · 14049ae5
由 chentianyu03 提交于 2月 14, 2022

14049ae5
W
context add generator (#39475) · 463e31f4
由 Wilber 提交于 2月 14, 2022
```
* context add generator

* update
```
463e31f4
C
[PTen] Add HasAttr for ArgumentMappingContext (#39464) · ddb1e23f
由 Chen Weihang 提交于 2月 14, 2022
```
* add has_attr for arg map context

* skip useless attr now

* skip attr if not exists

* fix typo
```
ddb1e23f

[pten] add split kernel (#39060) · d0df5632

由 chentianyu03 提交于 2月 14, 2022

* add split kernel

* add split kernel signature

* fix split bug

* modify MakePtenScalarArrayFromVarList

* modify MakePtenScalarArrayFromVarList

* fix split windows register error

* add test case for split kernel

* replace raw split kernel with pten kernel

* fix makeScalar/ScalarArray bug

* remove debug log

* remove int64_t type in buildPtcontext

* update by code review

* fix split dev test failed

* change DenseTensorMeta to MetaTensor

* change split api code from auto gen to manual

* split cuda kernel support bfloat16 type

* fix conflict

* rm raw split kernel

* merge develop branch

* change to pten::errors

d0df5632

13 2月, 2022 1 次提交

[Pten] Generate Wrapped InferMeta by Yaml (#39482) · 74a150fe

由 zyfncg 提交于 2月 13, 2022

* generate wrapped_infer_meta

* add test for wrapped_infer_meta

* Update test_meta_fn_utils.cc

* change the dir of generated file
Co-authored-by: NChen Weihang <chenweihang@baidu.com>
Co-authored-by: NChen Weihang <chenwhpro@163.com>

74a150fe

12 2月, 2022 1 次提交
- C
  
  unify naming style (#39481) · bdeb479c
  由 Chen Weihang 提交于 2月 12, 2022
  
  bdeb479c
11 2月, 2022 8 次提交

C

move memcpy.h into cc file (#39469) · 575fa0fe
由 Chen Weihang 提交于 2月 11, 2022

575fa0fe
F
[Pten] move operators/math/math_function_* to pten/kernels/func (#39300) · d25a7f9e
由 Feiyu Chan 提交于 2月 11, 2022
```
* move operators/math/math_function_* to pten/kernels/func
* namespace from `paddle::operators::math` to `pten::funcs`
```
d25a7f9e

[PTen] Remove pten core's dependency on fluid xxx_info.h (#39401) · d763a91a

由 Chen Weihang 提交于 2月 11, 2022

* ermove xxx_info include

* fix namespace error

* resolve conflict

* skip xpu context in registry

* fix macro error

* resolve conflict

* resolve conflict

* revert xpu convert

* remove trans to fluid place

* remove useless headers

d763a91a

Z

fix compilation warning on mac (#39438) · be8ab0ea
由 zhangkaihuo 提交于 2月 11, 2022

be8ab0ea

[PTen] Move grad GetExpectedPtenKernelArgs into pten (#39418) · 667bd962

由 Chen Weihang 提交于 2月 11, 2022

* move grad get expected pten kernel args

* fix reduce sum error

* fix element_sub_grad failed

* revert kernel judge change

667bd962

add print pten kernel tool (#39371) · 8803f6bb

由 Shang Zhizhou 提交于 2月 11, 2022

* test=document_fix;add print pten kernel tool

* test=document_fix

* test=document_fix

* test=document_fix

* test=document_fix

* add print_pten_kernels tool

* add print_pten_kernels tool

* fix windows complie

* notest,test=rocm_ci

* add merge tool

* add comments

8803f6bb

Z
Support different dtypes of inputs for elementwise ops (#38859) · bf305033
由 Zhang Ting 提交于 2月 11, 2022
```
* improve backward performance

* support different dtypes for elementwise ops
```
bf305033

【Pten】Auto-Generate InterMeta register (#39436) · 7d6096ff

由 zyfncg 提交于 2月 11, 2022

* fix code conflict

* generate inter_meta register

* clear cache

* just try

* add sign c++ api

* polish some code

7d6096ff

10 2月, 2022 8 次提交
- Z
  
  fix check error of ResetHolder (#39439) · f7a3389e
  由 zyfncg 提交于 2月 10, 2022
  
  f7a3389e
- H
  move Masked select to pten (#39193) · e2ad433b
  由 hong 提交于 2月 10, 2022
```
* move masked select cpu kernel

* add masked selected gpu kernel; test=develop

* fix bugs; test=develop

* bug fix; test=develop

* bug fix; test=develop

* add namespace to set mask array; test=develop

* fix bug; test=develop

* fix bugs; test=develop

* fix ddim bug; test=develop

* fix npu op bug; test=develop

* fix xpu dependecy bug; test=develop

* move kernel args to sig.cc; test=develop
```
  e2ad433b
- W
  
  fix compile error on jetson (#39441) · 8b58862a
  由 Wilber 提交于 2月 10, 2022
  
  8b58862a
- Z
  【Pten】Refactor C++ API code-gen (#39408) · 7b70b792
  由 zyfncg 提交于 2月 10, 2022
```
* refactor C++ API code-gen

* fix windows problem of C++ API
```
  7b70b792
- Z
  [bf16] add bf16 kernel: dropout & reshape & slice (#39395) · e8ac7fc3
  由 zhangbo9674 提交于 2月 10, 2022
```
* add dropout

* add reshape

* add slice

* refien slice unittest

* refine slice unittest

* add cpu bf16 kernel
```
  e8ac7fc3
- C
  [PTen] Add standard kernel suffix set (#39404) · c7c1db33
  由 Chen Weihang 提交于 2月 10, 2022
```
* add standard_suffix_set_and_remove_reshape_with_xshape

* revert reshape change

* polish reduce name
```
  c7c1db33
- A
  
  [PluggableDevice] custom kernel supports multi cpp_dtype registering (#39385) · 63d2333e
  由 Aganlengzi 提交于 2月 10, 2022
  
  63d2333e
- Z
  Fix code conflict of empty dev_api (#39430) · 2a5d858c
  由 zyfncg 提交于 2月 10, 2022
```
* fix code conflict

* clear cache

* just try
```
  2a5d858c
09 2月, 2022 8 次提交

Z
【Pten】Adjust the Empyt dev_api (#39143) · 9d4d0c3b
由 zyfncg 提交于 2月 09, 2022
```
* adjust the Empyt dev_api

* fix merge conflict

* fix sparse_utils_kernel
```
9d4d0c3b

Fix trace conflict (#39421) · 87f4a681

由 hong 提交于 2月 09, 2022

* add trace op

* bug fix

* bug fix; test=develop

* thrust bug fix; test=develop

* remove useless register; test=develop

* fix bug; test=develop

* update trace kernel; test=develop

* move kernel args to trace_sig; test=develop

* try to fix trace kernel conflict; test=develop

87f4a681

N

Replace EigenBroadcast with ElementwiseBroadcast in ReduceGrad (#39255) · 772be4f5
由 niuliling123 提交于 2月 09, 2022

772be4f5

Replace EagerTensor with Tensor (#39376) · 945a3ce9

由 Jiabin Yang 提交于 2月 09, 2022

* merge legacy to fluid

* Remove legacy code

* Remove legacy code

* Remove DataType test

* Using Tensor directly instead of using EagerTensor

* support gradient_accumulation

* make test_imperative_lod_tensor_to_selected_rows longer

* make test_imperative_lod_tensor_to_selected_rows longer

945a3ce9

Add a Sparse Op to_dense (#39335) · aca86470

由 zhangkaihuo 提交于 2月 09, 2022

* implement AllocateFrom

* dense_to_sparse_coo

* optimize unit testing; support rocm

* 1. delete fluid related header file
2. update the copyright

* fix hipMemcpy

* update dense_to_sparsecoo

* add namespace sparse

* sparse_csr_to_dense

* test to_sparse_coo: csr_to_coo

* fix writing error

* to_sparse_csr: dense_to_sparse_csr and sparse_coo_to_csr

* fix check shape

* fix unit test

* to_dense: sparse_coo_to_dense, sparse_csr_to_dense

* replace CUDADeviceContext by GPUContext

aca86470

Y

Rename partial function name TensorReduceFunctorImpl to TensorReduceImpl. (#39387) · 6354f81c
由 Yiqun Liu 提交于 2月 09, 2022

6354f81c

Move trace op to pten (#39227) · d7dddf94

由 hong 提交于 2月 09, 2022

* add trace op

* bug fix

* bug fix; test=develop

* thrust bug fix; test=develop

* remove useless register; test=develop

* fix bug; test=develop

* update trace kernel; test=develop

* move kernel args to trace_sig; test=develop

d7dddf94

C
[CustomOp] Fix slice bug of custom op (#39393) · 91b074a2
由 Chen Weihang 提交于 2月 09, 2022
```
* fix slice bug of cusstom op

* add offset in check
```
91b074a2

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功