Commits · 50fb57c9b33f51a29a28c58591fe1d6bf754eef7 · BaiXuePrincess / Paddle

18 Feb, 2022 · 4 commits

add flatten op for mlu (#39530) · 4c5cec5c
Committed by joeqiao12 on Feb 18, 2022

[MLU]add sync stream ops and broadcast pytest (#39518) · d2bd05b9
Committed by zn on Feb 18, 2022
```
* [MLU]add sync stream ops and broadcast pytest

* [MLU]fix broadcast pytest to add data type
```

[Bug Fix]Fix gradient accumulator (#39577) · a7cbd3ef

Committed by Jiabin Yang on Feb 18, 2022

* merge legacy to fluid

* Remove legacy code

* Remove legacy code

* Remove DataType test

* Using Tensor directly instead of using EagerTensor

* support gradient_accumulation

* make test_imperative_lod_tensor_to_selected_rows longer

* make test_imperative_lod_tensor_to_selected_rows longer

* refine code

* Rename all EagerTensor to Tensor

* Rename some EagerTensor to Tensor

* rename EagerTensor to EagerVariable

* add more test

* fix different device gradient_accumulator bug

* merge develop

* remove useless tests

[Eager] Support GradientHook before running separate GradNode (#39638) · adf4b98f
Committed by Weilong Wu on Feb 18, 2022
```
* [Eager] Support GradientHook before running separate GradNode

* Fix CI issue

* Fix CI issue
```

17 Feb, 2022 · 12 commits
- [pten] move bernoulli kernel to pten (#39590) · f86073c4
  Committed by Leo Chen on Feb 17, 2022
```
* move bernoulli kernel to pten

* follow comments
```
- [new-exec] refactor code of interpretercore gc (#39617) · c3135426
  Committed by Leo Chen on Feb 17, 2022
```
* relocate code of interpretercore gc
```
- [bugfix] to concat input squash (#39593) · f29da150
  Committed by Sylwester Fraczek on Feb 17, 2022
```
* fix and add more tests

* remove unwanted changes

* check only concat and elementwise

* move check to a function

* add todo comment

* Revert "fix ptq fc attr name fuse_activation->activation_type"

This reverts commit ffd023353a5e9b0fd15e81b9e9f9fe1794035017.
```
- add reshape2 op for mlu (#39562) · 2d2f11d1
  Committed by joeqiao12 on Feb 17, 2022
  
- save the name lists of variables of a cinn subgraph as its attributes (#39622) · a1ad003c
  Committed by TeFeng Chen on Feb 17, 2022
```
* save the name lists of the input,internal and output variables of a subgraph as its attribute

* fix compile error
```
- move trunc to pten (#39543) · 4501abd6
  Committed by Sing_chan on Feb 17, 2022
```
* move trunc to pten

* modify according to YuanRisheng's comment
```
- add softplus op for kunlun2. test=kunlun (#39555) · 9f99b591
  Committed by houj04 on Feb 17, 2022
```
* add softplus op for kunlun2. test=kunlun

* add softplus op for kunlun2. test=kunlun

* fix code style. test=kunlun

* fix code style. test=kunlun

* add more test cases. test=kunlun
```
- adaptive pool2d pass fix (#39600) · c1c5c1fc
  Committed by wenbin on Feb 17, 2022
```
* first commit

* teller fix

* bug fix

* enable for pool2d only

* fix global_pooling issue

* pooling_type

* fix test
```
- [Pten] Remove register of matmul_v2 kernel (#39542) · db43b541
  Committed by zyfncg on Feb 17, 2022
```
* remove register of matmul_v2 kernel

* delete matmul_v2 grad register in fluid
```
- move trace infer shape (#39517) · 1c9b2483
  Committed by Chen Weihang on Feb 17, 2022
  
- update inference ut to support nhwc format (#39551) · b4d3597a
  Committed by baoachun on Feb 17, 2022
```
* update inference ut to support nhwc format

* update ut and pass OpCompat

* update ut

* update ut
```
- Modified distribution kernel with Kernel Primitive API (#39563) · 1354652b
  Committed by niuliling123 on Feb 17, 2022
  
16 Feb, 2022 · 17 commits

[MLU] fix TensorAdd for mlu (#39523) · 24b8f63e
Committed by fwenguang on Feb 16, 2022

[fleet exe] Update comm init for dist model (#39603) · 7d53a288
Committed by Yuang Liu on Feb 16, 2022

optimize prior_box for kunlun, *test=kunlun (#39477) · e254e7c6
Committed by TTerror on Feb 16, 2022

[MLU] support adaptive pooling (#39500) · f138371c
Committed by fwenguang on Feb 16, 2022

Move lerp OP to pten (#39524) · d480d7b1
Committed by 0x45f on Feb 16, 2022
```
* move lerp to pten

* refine include

* move files

* refine code
```
Add ConditionalBlockGradInferVarType (#39585) · ff7e3590
Committed by Aurelius84 on Feb 16, 2022

[bf16] pten matmul cuda kernel support bf16 (#39485) · d5a0d31a

Committed by Leo Chen on Feb 16, 2022

* pten matmul cuda kernel support bf16

* fix pten kernel name

* add matmul_grad bf16 kernel

* add emptylike bf16 kernel

* fix compile

* support rocm

* fix error

* fix rocm

* add bf16 header file

* fix compile

[Paddle-Inference] support preln-ernie: add preln_emb_eltwise_layernorm_op,... · f31c2426

Committed by Wangzheee on Feb 16, 2022

[Paddle-Inference] support preln-ernie: add preln_emb_eltwise_layernorm_op, preln_skip_layernorm_op (#39570)

* support preln_ernie: add preln_emb_eltwise_layernorm_op, preln_skip_layernorm_op

* support preln_ernie: add preln_emb_eltwise_layernorm_op, preln_skip_layernorm_op

Support GetGradAccumulator for reducer (#39537) · ae92da87
Committed by Jiabin Yang on Feb 16, 2022

EagerTensor to EagerVariable (#39447) · 831fd86e

Committed by Jiabin Yang on Feb 16, 2022

* merge legacy to fluid

* Remove legacy code

* Remove legacy code

* Remove DataType test

* Using Tensor directly instead of using EagerTensor

* support gradient_accumulation

* make test_imperative_lod_tensor_to_selected_rows longer

* make test_imperative_lod_tensor_to_selected_rows longer

* refine code

* Rename all EagerTensor to Tensor

* Rename some EagerTensor to Tensor

* rename EagerTensor to EagerVariable

* add more test

* merge develop and refine code

[PTen] Add attr support for infershape utils (#39513) · 6eb95caf

Committed by Chen Weihang on Feb 16, 2022

* add attr support for infershape

* add unittest for coverage

* add unittest for coverage

* polish unittest detail

* fix windows test failed

Support nce in eager mode (#39589) · 672def6c
Committed by Weilong Wu on Feb 16, 2022

[Pten] move complex_functors.h (#39558) · 5b5656d0
Committed by Feiyu Chan on Feb 16, 2022
```
* move complex_functors.h and update all references to symbols within it
```
[PTen] Rename general grad infermeta func (#39578) · 12ca438e
Committed by Chen Weihang on Feb 16, 2022
```
* rename general grad infermeta func

* remove useless code
```
[Pten]Modify framework::VisitDataType into Pten::VisitDataType (#39550) · 6b756fb7
Committed by Aurelius84 on Feb 16, 2022
```
* Modify framework::VisitDataType into Pten::VisitDataType

* migrate unittest
```

[Pten]Remove reshape and elementwise_add's registry code in Fluid (#39317) · c6478270

Committed by YuanRisheng on Feb 16, 2022

* remove reshape and elementwise_add registry

* delete code

* fix bugs when run ci ut

* remove log

* fix bugs when run unit test

* fix bugs when run unit test

* fix bugs when run cinn

* fix bugs when run ci-mac-python3

* fix compile bugs

* fix compile bugs

* fix compile bugs

* fix bugs when run kunlun

* fix bugs when compile

* update code according to comment

Add profiler tree dump code (#39519) · 40d2b7c6
Committed by chenjian on Feb 16, 2022
```
* add profiler tree dump code

* add CMakeLists item

* reduce some contents for pr
```

15 Feb, 2022 · 7 commits

disabled unnecessary int reorders profiling (#39498) · 3581c075
Committed by jakpiase on Feb 15, 2022

[Paddle-Inference] support preln_ernie: add... · 2bc91cc5

Committed by Wangzheee on Feb 15, 2022

[Paddle-Inference] support preln_ernie: add preln_embedding_eltwise_layernorm_fuse_pass, preln_skip_layernorm_fuse_pass (#39508)

* support preln_ernie

* support preln_ernie

[PluggableDevice] Add custom runtime support (#38740) · 3e7825f3

Committed by ronnywang on Feb 15, 2022

* [CustomRuntime] Add DeviceManager

* [CustomRuntime] Add DeviceInterface

* [CustomRuntime] Add Stream, Event, DeviceGuard, CallbackManager

* [CustomRuntime] Add plug-in device

* [CustomRuntime] Memory module support PluggableDevice

* [CustomRuntime] Add WITH_PLUGGABLE_DEVICE cmake option

* update

* [API] update API doc based on comments, test=develop
Co-authored-by: qili93 <qili93@qq.com>

[Pten] move paddle/operators/math/functors.h and compound_functors.h (#39514) · 0d46a108
Committed by Feiyu Chan on Feb 15, 2022
```
* move paddle/operators/math/functors.h
* move paddle/operators/math/compound_functors.h
```

Add cinn_instruction_run_op for launching execution of a cinn instruction (#39435) · 9d0baeab

Committed by TeFeng Chen on Feb 15, 2022

* add cinn_instruction_run_op for launching execution of a cinn instruction

* fix multi definition compilation error

* update cmake

* fix bug at infershape

* fix compile error due to lacking header file

pool2d_coonvert_ut (#39545) · cf8a5573
Committed by feng_shuai on Feb 15, 2022

[Paddle-TRT] Replace GeLU plugin with TensorRT built-in layer for TensorRT 7.0. (#38399) · a3689d8c
Committed by Leo Chen on Feb 15, 2022
```
* Replace GeLU plugin with TRT built-in layers for approximate GeLU

* Add TensorRT built-in layer for nonapproximate GeLU
```

BaiXuePrincess / Paddle is up to date with the fork source project