Commits · 5c66338f4e9678d1a1254c6f1adb5d124a15512c · 机器未来 / Paddle

Feb 18, 2022 · 6 commits
- [pten] trans diagonal kernel into pten (#39575) · 5c66338f
  Committed by xiongkun on Feb 18, 2022
```
* trans diagonal kernel into pten

* fix by code review
```
  5c66338f
- [AMP] support GPU BF16 amp for dygraph (#39029) · 7d6d3848
  Committed by zhangbo9674 on Feb 18, 2022
```
* support dtype param for auto_cast

* add amp_dtype for tracer

* add unsupported bf16 list

* support bf16 amp for O2

* refine python interface for bfloat16

* refine code

* refine code

* refine unittest

* refine code

* refine code

* add bf16 o1

* refine code by comment

* add gradient accumulator

* add recompute
```
  7d6d3848
- [CustomDevice]Improved custom device initialization (#39634) · 7e4ed848
  Committed by ronnywang on Feb 18, 2022
  
  7e4ed848
- [CustomRuntime] add pten::Backend support (#39606) · d6d0820e
  Committed by ronnywang on Feb 18, 2022
  
  d6d0820e
- [Pten] Support inplace and intermediate in C++ API (#39651) · 638aab6e
  Committed by zyfncg on Feb 18, 2022
```
* support inplace and intermediate in yaml

* add cmake for dygraph_api
```
  638aab6e
- [pten]add T, remove default value of DataType in DeviceContext::Alloc (#39620) · 8363406a
  Committed by chentianyu03 on Feb 18, 2022
```
* add T to Alloc and remove default value of DataType in DeviceContext::Alloc

* add dtype
```
  8363406a
Feb 17, 2022 · 10 commits
- avoid custom kernel deps on pten_function_api (#39661) · cbce0e60
  Committed by Leo Chen on Feb 17, 2022
```
* pten matmul cuda kernel support bf16

* avoid custom kernel deps on pten_function_api

* Revert "pten matmul cuda kernel support bf16"

This reverts commit 5d520845b9a189375677276efb673235ed8e5ee0.

* refine code

* fix compile

* fix test_split_api
```
  cbce0e60
- [pten] move bernoulli kernel to pten (#39590) · f86073c4
  Committed by Leo Chen on Feb 17, 2022
```
* move bernoulli kernel to pten

* follow comments
```
  f86073c4
- fix selected_rows bug in C++ API (#39658) · b72d4cb4
  Committed by zyfncg on Feb 17, 2022
  
  b72d4cb4
- move trunc to pten (#39543) · 4501abd6
  Committed by Sing_chan on Feb 17, 2022
```
* move trunc to pten

* modify according to YuanRisheng's comment
```
  4501abd6
- [PTen] Clean useless header in pten core (#39560) · c05cd7ed
  Committed by Chen Weihang on Feb 17, 2022
```
* clean useless header in pten core

* fix compiled failed

* fix cmake target

* fix typo

* resolve conflict
```
  c05cd7ed
- change classes to pten, test=develop (#39643) · 8f2d14ad
  Committed by 石晓伟 on Feb 17, 2022
  
  8f2d14ad
- move trace infer shape (#39517) · 1c9b2483
  Committed by Chen Weihang on Feb 17, 2022
  
  1c9b2483
- support set fp32 input for fp16 kernel (#39625) · 5fb9cf60
  Committed by Chen Weihang on Feb 17, 2022
  
  5fb9cf60
- [PTen] Remove fluid device context deps (#39604) · d63ece1f
  Committed by Chen Weihang on Feb 17, 2022
```
* remove fluid device context deps

* fix compile failed
```
  d63ece1f
- Modified distribution kernel with Kernel Primitive API (#39563) · 1354652b
  Committed by niuliling123 on Feb 17, 2022
  
  1354652b
Feb 16, 2022 · 8 commits

Move lerp OP to pten (#39524) · d480d7b1
Committed by 0x45f on Feb 16, 2022
```
* move lerp to pten

* refine include

* move files

* refine code
```
d480d7b1

[bf16] pten matmul cuda kernel support bf16 (#39485) · d5a0d31a

Committed by Leo Chen on Feb 16, 2022

* pten matmul cuda kernel support bf16

* fix pten kernel name

* add matmul_grad bf16 kernel

* add emptylike bf16 kernel

* fix compile

* support rocm

* fix error

* fix rocm

* add bf16 header file

* fix compile

d5a0d31a

EagerTensor to EagerVariable (#39447) · 831fd86e

Committed by Jiabin Yang on Feb 16, 2022

* merge legacy to fluid

* Remove legacy code

* Remove legacy code

* Remove DataType test

* Using Tensor directly instead of using EagerTensor

* support gradient_accumulation

* make test_imperative_lod_tensor_to_selected_rows longer

* make test_imperative_lod_tensor_to_selected_rows longer

* refine code

* Rename all EagerTensor to Tensor

* Rename some EagerTensor to Tensor

* rename EagerTensor to EagerVariable

* add more test

* merge develop and refine code

831fd86e

[PTen] Add attr support for infershape utils (#39513) · 6eb95caf

Committed by Chen Weihang on Feb 16, 2022

* add attr support for infershape

* add unittest for coverage

* add unittest for coverage

* polish unittest detail

* fix windows test failed

6eb95caf

[Pten] move complex_functors.h (#39558) · 5b5656d0
Committed by Feiyu Chan on Feb 16, 2022
```
* move complex_functors.h and update all references to symbols within it
```
5b5656d0
[PTen] Rename general grad infermeta func (#39578) · 12ca438e
Committed by Chen Weihang on Feb 16, 2022
```
* rename general grad infermeta func

* remove useless code
```
12ca438e
[Pten]Modify framework::VisitDataType into Pten::VisitDataType (#39550) · 6b756fb7
Committed by Aurelius84 on Feb 16, 2022
```
* Modify framework::VisitDataType into Pten::VisitDataType

* migrate unittest
```
6b756fb7

[Pten]Remove reshape and elementwise_add's registry code in Fluid (#39317) · c6478270

Committed by YuanRisheng on Feb 16, 2022

* remove reshape and elementwise_add registry

* delete code

* fix bugs when run ci ut

* remove log

* fix bugs when run unit test

* fix bugs when run unit test

* fix bugs when run cinn

* fix bugs when run ci-mac-python3

* fix compile bugs

* fix compile bugs

* fix compile bugs

* fix bugs when run kunlun

* fix bugs when compile

* update code according comment

c6478270

Feb 15, 2022 · 10 commits

[PluggableDevice] Add custom runtime support (#38740) · 3e7825f3

Committed by ronnywang on Feb 15, 2022

* [CustomRuntime] Add DeviceManager

* [CustomRuntime] Add DeviceInterface

* [CustomRuntime] Add Stream, Event, DeviceGuard, CallbackManager

* [CustomRuntime] Add plug-in device

* [CustomRuntime] Memory module support PluggableDevice

* [CustomRuntime] Add WITH_PLUGGABLE_DEVICE cmake option

* update

* [API] update API doc based on comments, test=develop
Co-authored-by: qili93 <qili93@qq.com>

3e7825f3

[Pten] move paddle/operators/math/functors.h and compound_functors.h (#39514) · 0d46a108
Committed by Feiyu Chan on Feb 15, 2022
```
* move paddle/operators/math/functors.h
* move paddle/operators/math/compound_functors.h
```
0d46a108

move histogram to pten (#39496) · 556f6eb0

Committed by hong on Feb 15, 2022

* move histogram to pten; test=develop

* fix format error; test=develop

* fix histogram kernel format; test=develop

556f6eb0

Move Abs OP to pten (#39492) · fb473067

Committed by From00 on Feb 15, 2022

* Move Abs op to pten

* Fix NPU compilation error

* Fix CI error

* Use LaunchSameDimsElementwiseCudaKernel in pten

fb473067

[Pten] Support SelectedRows in C++ API (#39497) · 5bb3b668

Committed by zyfncg on Feb 15, 2022

* add data_transform in pten api

* support GetKernelTypeForVar

* fix compile problem of bfloat16

* add scale_sr in api

* support select_row in C++ api

* merge code

5bb3b668

[PTen] Fix single dtype register errror (#39506) · 9fd67ffe
Committed by Chen Weihang on Feb 15, 2022
```
* fix single dtype reg error

* fix windows failed
```
9fd67ffe

move algorithm.h (#39502) · 7eb9593e

Committed by Feiyu Chan on Feb 15, 2022

Move paddle/fluid/operators/math/algorithm.h to paddle/pten/kernels/funcs and rename all references to symbols in it.

7eb9593e

[Pten]Move expand_v2 to pten (#39471) · 2d16d69b

Committed by Linjie Chen on Feb 15, 2022

* move expand to pten

* move expand_v2 to pten

* move expand_v2 to pten

* fix grad register

* fix grad register

* fix tensorcopy

* fix tensorcopy

* fix tensorcopy

* fix tensorcopy

* fix tensorcopy

* fix ci

* fix tensorcopy

2d16d69b

[PTen] Polish trace moving (#39510) · ab866777
Committed by Chen Weihang on Feb 15, 2022
```
* polish trace moving

* remove useless header
```
ab866777

[PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404

Committed by Aurelius84 on Feb 15, 2022

* #1 migrate dist-related type()-> dtype()

* move datatype function from pten -> fluid/framework

* change type() in imperative into convert(dtype())

* modify xx_tensor->type into xx_tensor->dtype

* change the set_type interface and the caller

* modify xx_tensor.type into xx_tensor.dtype

* fix mutable_data(place, dtype())

* change caller of mutable_data in pten and distributed

* change the caller of mutable_data in fluid/framework

* change the caller of mutable_data in imperative directory

* mutable_data: inference

* update the call of mutable_data

* transfer MakePenScalarArray MakePtenScalar ResetHolderWithType

* pass the compile. the next step is remove VarType in Pten

* fix all and remove VarType from pten. success in linux. Next task is other platform

* fix conflict with develop

* fix compiled error

* Fix reset conversion

* fix conflict

* fix compiled problem

* fix typo

* Fix << in tensor_utils.cc

* fix type->dtype

* fix unittest

* fix tensor init constructor

* fix DataTypeSize for BFloat16

* fix code style

* fix npu compiled error

* fix npu

* compile npu successfully

* fix conflict

* fix conflict
Co-authored-by: xiongkun <xiongkun03@baidu.com>

7e7e9404

Feb 14, 2022 · 4 commits

[pten] add CI check for using DenseTensor::mutable_data() in pten directions (#39467) · 14049ae5
Committed by chentianyu03 on Feb 14, 2022

14049ae5
context add generator (#39475) · 463e31f4
Committed by Wilber on Feb 14, 2022
```
* context add generator

* update
```
463e31f4
[PTen] Add HasAttr for ArgumentMappingContext (#39464) · ddb1e23f
Committed by Chen Weihang on Feb 14, 2022
```
* add has_attr for arg map context

* skip useless attr now

* skip attr if not exists

* fix typo
```
ddb1e23f

[pten] add split kernel (#39060) · d0df5632

Committed by chentianyu03 on Feb 14, 2022

* add split kernel

* add split kernel signature

* fix split bug

* modify MakePtenScalarArrayFromVarList

* modify MakePtenScalarArrayFromVarList

* fix split windows register error

* add test case for split kernel

* replace raw split kernel with pten kernel

* fix makeScalar/ScalarArray bug

* remove debug log

* remove int64_t type in buildPtcontext

* update by code review

* fix split dev test failed

* change DenseTensorMeta to MetaTensor

* change split api code from auto gen to manual

* split cuda kernel support bfloat16 type

* fix conflict

* rm raw split kernel

* merge develop branch

* change to pten::errors

d0df5632

Feb 13, 2022 · 1 commit

[Pten] Generate Wrapped InferMeta by Yaml (#39482) · 74a150fe

Committed by zyfncg on Feb 13, 2022

* generate wrapped_infer_meta

* add test for wrapped_infer_meta

* Update test_meta_fn_utils.cc

* change the dir of generated file
Co-authored-by: Chen Weihang <chenweihang@baidu.com>
Co-authored-by: Chen Weihang <chenwhpro@163.com>

74a150fe

Feb 12, 2022 · 1 commit
- unify naming style (#39481) · bdeb479c
  Committed by Chen Weihang on Feb 12, 2022
  
  bdeb479c
