Commits · 6b756fb76454866f26f77d0c136941fb78650da1 · Crayon鑫 / Paddle

16 Feb, 2022 4 commits

[Pten]Modify framework::VisitDataType into Pten::VisitDataType (#39550) · 6b756fb7
Committed by Aurelius84 on Feb 16, 2022
```
* Modify framework::VisitDataType into Pten::VisitDataType

* migrate unittest
```

[infrt] add infrt dialect ir. test=develop (#39455) · 2c7f6e6d
Committed by 王明冬 on Feb 16, 2022

[Pten]Remove reshape and elementwise_add's registry code in Fluid (#39317) · c6478270

Committed by YuanRisheng on Feb 16, 2022

* remove reshape and elementwise_add registry

* delete code

* fix bugs when run ci ut

* remove log

* fix bugs when run unit test

* fix bugs when run unit test

* fix bugs when run cinn

* fix bugs when run ci-mac-python3

* fix compile bugs

* fix compile bugs

* fix compile bugs

* fix bugs when run kunlun

* fix bugs when compile

* update code according comment

Add profiler tree dump code (#39519) · 40d2b7c6
Committed by chenjian on Feb 16, 2022
```
* add profiler tree dump code

* add CMakeLists item

* reduce some contents for pr
```

15 Feb, 2022 19 commits

disabled unnecessary int reorders profiling (#39498) · 3581c075
Committed by jakpiase on Feb 15, 2022

[Paddle-Inference] support preln_ernie: add... · 2bc91cc5

Committed by Wangzheee on Feb 15, 2022

[Paddle-Inference] support preln_ernie: add preln_embedding_eltwise_layernorm_fuse_pass, preln_skip_layernorm_fuse_pass (#39508)

* support preln_ernie

* support preln_ernie

[PluggableDevice] Add custom runtime support (#38740) · 3e7825f3

Committed by ronnywang on Feb 15, 2022

* [CustomRuntime] Add DeviceManager

* [CustomRuntime] Add DeviceInterface

* [CustomRuntime] Add Stream, Event, DeviceGuard, CallbackManager

* [CustomRuntime] Add plug-in device

* [CustomRuntime] Memory module support PluggableDevice

* [CustomRuntime] Add WITH_PLUGGABLE_DEVICE cmake option

* update

* [API] update API doc based on comments, test=develop
Co-authored-by: qili93 <qili93@qq.com>

[Pten] move paddle/operators/math/functors.h and compound_functors.h (#39514) · 0d46a108
Committed by Feiyu Chan on Feb 15, 2022
```
* move paddle/operators/math/functors.h
* move paddle/operators/math/compound_functors.h
```

Add cinn_instruction_run_op for launching execution of a cinn instruction (#39435) · 9d0baeab

Committed by TeFeng Chen on Feb 15, 2022

* add cinn_instruction_run_op for launching execution of a cinn instruction

* fix multi definition compilation error

* update cmake

* fix bug at infershape

* fix compile error due to lacking header file

pool2d_coonvert_ut (#39545) · cf8a5573
Committed by feng_shuai on Feb 15, 2022

fix bug when openblas ci use ninja (#39472) · a558d386
Committed by Sing_chan on Feb 15, 2022

[Paddle-TRT] Replace GeLU plugin with TensorRT built-in layer for TensorRT 7.0. (#38399) · a3689d8c
Committed by Leo Chen on Feb 15, 2022
```
* Replace GeLU plugin with TRT built-in layers for approximate GeLU

* Add TensorRT built-in layer for nonapproximate GeLU
```

move histogram to pten (#39496) · 556f6eb0

Committed by hong on Feb 15, 2022

* move histogram to pten; test=develop

* fix format error; test=develop

* fix histogram kernel format; test=develop

Move Abs OP to pten (#39492) · fb473067

Committed by From00 on Feb 15, 2022

* Move Abs op to pten

* Fix NPU compilation error

* Fix CI error

* Use LaunchSameDimsElementwiseCudaKernel in pten

[Eager] Support SellectedRows MergeAdd case (#39449) · 6549a041

Committed by Weilong Wu on Feb 15, 2022


* Refactor SelectedRows MergeAdd func by using template

* Add GetInnerMutable func instead of modify GetInnerMutableTensor

* Updated PADDLE_ENFORCE statement

* Remove useless PADDLE_ENFORCE statement

* Polish Code

[Pten] Support SelectedRows in C++ API (#39497) · 5bb3b668

Committed by zyfncg on Feb 15, 2022

* add data_transform in pten api

* support GetKernelTypeForVar

* fix complie problem of bfloat16

* add scale_sr in api

* suppport select_row in C++ api

* merge code

[PTen] Fix single dtype register errror (#39506) · 9fd67ffe
Committed by Chen Weihang on Feb 15, 2022
```
* fix single dtype reg errror

* fix windows failed
```

add dropout fp32 (#39501) · b81358d1
Committed by sneaxiy on Feb 15, 2022

delete mish_convert_ut skip (#39432) · 8cedcd3e
Committed by feng_shuai on Feb 15, 2022

move algorithm.h (#39502) · 7eb9593e

Committed by Feiyu Chan on Feb 15, 2022

Move paddle/fluid/operators/math/algorithm.h to paddle/pten/kernels/funcs and rename all references to symbols in it.

[Pten]Move expand_v2 to pten (#39471) · 2d16d69b

Committed by Linjie Chen on Feb 15, 2022

* move expand to pten

* move expand_v2 to pten

* move expand_v2 to pten

* fix grad register

* fix grad register

* fix tensorcpry

* fix tensorcopy

* fix tensorcopy

* fix tensorcopy

* fix tensorcopy

* fix ci

* fix tensorcopy

[PTen] Polish trace moving (#39510) · ab866777
Committed by Chen Weihang on Feb 15, 2022
```
* polish trace moving

* remove useless header
```

[PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404

Committed by Aurelius84 on Feb 15, 2022

* #1 migrate dist-related type()-> dtype()

* move datatype function from pten -> fluid/framework

* change type() in imperative into convert(dtype())

* modify xx_tensor->type into xx_tensor->dtype

* change the set_type interface and the caller

* modify xx_tensor.type into xx_tensor.dtype

* fix mutable_data(place, dtype())

* change caller of mutable_data in pten and distributed

* change the caller of mutable_data in fluid/framework

* change the caller of mutable_data in imperative directory

* mutable_data: inference

* update the call of mutable_data

* transfer MakePenScalarArray MakePtenScalar ResetHolderWithType

* pass the compile. the next step is remove VarType in Pten

* fix all and remove VarType from pten. success in linux. Next task is other platform

* fix conflict with develop

* fix compiled error

* Fix reset conversion

* fix conflict

* fix compiled problem

* fix typo

* Fix << in tensor_utils.cc

* fix type->dtype

* fix unittest

* fix tensor init constructor

* fix DataTypeSize for BFloat16

* fix code style

* fix npu compiled error

* fix npu

* compile npu sucessfully

* fix conflict

* fix conflict
Co-authored-by: xiongkun <xiongkun03@baidu.com>

14 Feb, 2022 13 commits

add gitignore for eager tmp file, test=document_fix (#39512) · 9c2cee1c
Committed by Chen Weihang on Feb 14, 2022

[pten] add CI check for using DenseTensor::mutable_data() in pten directions (#39467) · 14049ae5
Committed by chentianyu03 on Feb 14, 2022

[Bug fix] prevent squashing pair u8 dequantize -> s8 quantize (#39346) · 66b5348e

Committed by Sylwester Fraczek on Feb 14, 2022

* prevent squashing pair u8 dequantize -> s8 quantize

* add relu op to check for uint8

* fix ptq fc attr name fuse_activation->activation_type

* fix

* add unit test

* remove unused variable

* test fix unsuccessful

* fix test and logic

* multiline comment

* remove cout

* Revert "fix ptq fc attr name fuse_activation->activation_type"

This reverts commit ffd023353a5e9b0fd15e81b9e9f9fe1794035017.

* fix ptq fc attr name fuse_activation->activation_type

context add generator (#39475) · 463e31f4
Committed by Wilber on Feb 14, 2022
```
* context add generator

* update
```

Add cuda tracer (#39488) · 0790f949

Committed by liutiexing on Feb 14, 2022

* add align for WorkQueue

* add spinlock

* merge develop

* merge

* Add EventsWaiter

* Revert "Add EventsWaiter"

This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.

* add log for Executor

* Add CudaTracer to trace CUDA events
Co-authored-by: liutiexing <liutiexing@google.com>

[NewExe] Ignore eof Exception(#39487) · 2f642159

Committed by liutiexing on Feb 14, 2022

* add align for WorkQueue

* add spinlock

* merge develop

* merge

* Add EventsWaiter

* Revert "Add EventsWaiter"

This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.

* add log for Executor

* Avoid thread reconsruction when EOF
Co-authored-by: liutiexing <liutiexing@google.com>

[PTen] Add HasAttr for ArgumentMappingContext (#39464) · ddb1e23f
Committed by Chen Weihang on Feb 14, 2022
```
* add has_attr for arg map context

* skip useless attr now

* skip attr if not exists

* fix typo
```

[pten] add split kernel (#39060) · d0df5632

Committed by chentianyu03 on Feb 14, 2022

* add split kernel

* add split kernel signature

* fix split bug

* modify MakePtenScalarArrayFromVarList

* modify MakePtenScalarArrayFromVarList

* fix split windows register error

* add test case for split kernel

* replace raw split kernel with pten kernel

* fix makeScalar/ScalarArray bug

* remove debug log

* remove int64_t type in buildPtcontext

* update by code review

* fix split dev test failed

* change DenseTensorMeta to MetaTensor

* change split api code from auto gen to manual

* split cuda kernel support bfloat16 type

* fix conflict

* rm raw split kernel

* merge develop branch

* change to pten::errors

fix gather_nd, *test=kunlun (#39283) · d12c3636
Committed by TTerror on Feb 14, 2022

update xpu test build script and fix get_test_cover_info, *test=kunlun (#39235) · 9ba3f429
Committed by TTerror on Feb 14, 2022

[MLU] add mlu kernel for c_broadcast op (#39470) · 1b9e6790
Committed by mhhhh1 on Feb 14, 2022

Fixed get_tensor method for EagerTensor (#39414) · 97229944
Committed by Zhanlue Yang on Feb 14, 2022
```
* Enabled Eager OpTest #1

* Enabled Eager OpTest #1

* Fixed get_tensor method for EagerTensor
```

Adjusted python-level trace_op to accomodate final state Eager Dygraph (#39319) · ec8a0c1d

Committed by Zhanlue Yang on Feb 14, 2022

* Removed debug info

* Added automatic code generation for final state Eager Dygraph

* Modified backward yaml

* Added EagerUtils helper functions for final state CodeGen

* Adjusted CMakeFiles to support compilation for final state auto generated codes

* Added python-c code generation for final state Eager Dygraph

* Fixed minor issue

* Fixed yaml.load() method failure

* Fixed minor issues

* Refactored Python-C Attributes Parsing Functions

* Fixed minor issue with Python-C AddFunctions

* Adjusted python-level trace_op to accomodate final state Eager Dygraph

* Added Logs for final state Eager Dygraph

* Fixed merge issues

* Fixed minor issue

13 Feb, 2022 1 commit

[Pten] Generate Wrapped InferMeta by Yaml (#39482) · 74a150fe

Committed by zyfncg on Feb 13, 2022

* generate wrapped_infer_meta

* add test for wrapped_infer_meta

* Update test_meta_fn_utils.cc

* change the dir of generated file
Co-authored-by: Chen Weihang <chenweihang@baidu.com>
Co-authored-by: Chen Weihang <chenwhpro@163.com>

12 Feb, 2022 1 commit

unify naming style (#39481) · bdeb479c
Committed by Chen Weihang on Feb 12, 2022

11 Feb, 2022 2 commits

Fix add profiler node tree implementation cmake error (#39474) · 739da6cb

Committed by chenjian on Feb 11, 2022

* add event node implementation

* modify profiler.stop interface

* fix according to review

* fix file mode

* modify class method name in event_node.cc

* modify LLONG_MAX to ULLONG_MAX

* fix ci error

* fix ci error

* fix dependency error

Add TensorRT inspector into Paddle-TRT (#38362) · 69793a27
Committed by Leo Chen on Feb 11, 2022

Crayon鑫 / Paddle is in sync with the fork source project