Commits · eaacf8bfee5c9583f7ebf0deff20b90db9d73478 · 机器未来 / Paddle

Mar 03, 2022 · 13 commits

fix save_vars bugs (#40062) · eaacf8bf
Committed by YuanRisheng on Mar 03, 2022

eaacf8bf

move eye, lerp infershape to phi (#40105) · 1c205883
Committed by 0x45f on Mar 03, 2022

1c205883

cinn_launch_op: switch to execution by PE (#39911) · 167d511f

Committed by TeFeng Chen on Mar 03, 2022

* switch to PE execution in cinn launch

* fix outer variables erased

* skip the map bug temporarily for test

* temporary solution for batch_norm bug

* update comment

* fix compile error

* cinn_instruction_run_op_test: update code to skip external alloc/free instructions generated

167d511f

Move compare OPs to phi (#39970) · 0969a4eb

Committed by From00 on Mar 03, 2022

* Move compare OPs to phi

* Fix bug

* Use BroadcastKernel and ElementwiseKernel in phi

0969a4eb

modify infershape of multiclass nms (#40059) · 756af9ff
Committed by wangxinxin08 on Mar 03, 2022
```
* modify infershape of multiclass nms
```
756af9ff
[Phi]Delete kernel registry of elementwise_sub op in Fluid (#40039) · cac00e0b
Committed by YuanRisheng on Mar 03, 2022
```
* delete elementwise_sub kernel registry

* fix compile bugs in xpu ci

* fix bugs when run inference ci
```
cac00e0b
EmbEltwiseLayernorm fix (#40015) · c3f3643b
Committed by wenbin on Mar 03, 2022
```
* emb fix

* fix trt6 compile

* fix half

* absolute error fix
```
c3f3643b

Modified sigmoid by the elementwise interface. (#39898) · 5d9e11a4

Committed by huangxu96 on Mar 03, 2022

* Modified sigmoid by elementwise interface.

* using TensorReduceImpl to replace Sum function

* using reduceimpl to calculate the norm variable

* Removed useless code

5d9e11a4

Add support of int16 for gather op. (#40052) · 3e56e816

Committed by Li Min on Mar 03, 2022

* add support of int16 for gather op.

* Recover formats.

* Recover formats.

* fix.

* Fix format.

* Fix format.

3e56e816

[phi] transfer pad kernel into phi and pass the test_pad_op (#40012) · 9f74b84e
Committed by xiongkun on Mar 03, 2022
```
* add pad forward

* fix error

* transfer pad and pass the test_pad_op
```
9f74b84e
move gather_tree infer shape (#40082) · 3779e807
Committed by crystal on Mar 03, 2022

3779e807
[Phi] move gaussian_random (#39932) · 00bbb8c5
Committed by furnace on Mar 03, 2022
```
[Phi] move gaussian_random kernel
```
00bbb8c5

Move bn to pten (#39347) · ebd0f512

Committed by hong on Mar 03, 2022

* add bn cpu version; test=develop

* move batch norm to pten

* move batch norm to pten; test=develop

* fix bug; test=develop

* fix func::transpose depend bug; test=develop

* fix compile bugs; test=develop

* fix use_op batch_norm bug; test=develop

* fix cudnn bn add relu test; test=develop

* fix pten context build and double grad bug; test= develop

* remove useless code; test=develop

* add batch norm gpu fp16 support; test=develop

* fix test bn op bug; test=develop

* remove output dtype set; test=develop

* fix bug; test=develop

* fix bug; test=develop

* fix apply pass to program bug; test=develop

* revert to develop; test=develop

* fix rocm bug; test=develop

* revert operator to develop; test=develop

* fix pre_commit; test=develop

* fix static check error; test=develop

* resolve conflict; test=develop

* ana batch norm bug;

* revert batch norm op

* resolve conflict

* fix nan inf and speed bug; test=develop

* fix bug; test=develop

* fix error; test=develop

* test expand op; test=develop

* fix bug; test=develop

* resolve conflict

* resolve conflict; test=develop

* polish code; test=develop

* polish code; test=develop

* change mutable data to ctx alloc; test=develop

* make format same with ci; test=develop

* fix format error with ci; test=develop

ebd0f512

Mar 02, 2022 · 17 commits

Replacing dropout eval eigen usage by cuda kernel (#40053) · 272b32fd
Committed by Li Min on Mar 02, 2022
```
* Replacing dropout eval eigen usage by cuda kernel
```
272b32fd
[MLU] add mlu ci script (#39805) · a8e02ef1
Committed by fwenguang on Mar 02, 2022
```
* [MLU] add mlu ci script

* Update CMakeLists.txt
```
a8e02ef1

Move sgd to phi (#40045) · f3d54e2e

Committed by hong on Mar 02, 2022

* move sgd to phi; test=develop

* update

* add sgd kernel; test=develop

f3d54e2e

modify infershape of yolo_box (#40056) · ebc6959c
Committed by wangxinxin08 on Mar 02, 2022
```
* modify infershape of yolo_box
```
ebc6959c
Move gather.h/gather.cu.h/scatter.h/scatter.cu.h to the phi library (#40043) · 09258040
Committed by sneaxiy on Mar 02, 2022
```
* move gather.h gather.cu.h scatter.h scatter.cu.h to phi library

* fix CI

* fix rocm ci
```
09258040
vec scale kernel (#40011) · 2e6548a9
Committed by sneaxiy on Mar 02, 2022

2e6548a9
[Phi]Move elementwise function to funcs directory (#39986) · 5898e9ab
Committed by YuanRisheng on Mar 02, 2022
```
* move elementwise function to funcs directory

* fix compile bugs

* modify according to comment
```
5898e9ab

Move transpose to pten (#39327) · 7a857924

Committed by hong on Mar 02, 2022

* immigrate_transpose_to_pten cpu kernel only; test=develop

* fix bug; test=develop

* add transpose cuda api

* bug fix;

* fix bugs

* fix bugs; test=develop

* bug fix;

* move transpose to pten; test=develop

* fix bug; test=develop

* fix bugs; test=develop

* add transpose grad fp16 support; test=develop

* fix bug; test=develop

* fix npu bug; test=develop

* fix numel = 0 bug; test=develop

* add fp16 support; test=develop

* fix data type register bug; test=develop

* fix transpose bug; test=develop

* update transpose

* fix transpose bug; test=develop

* remove useless code; test=develop

* remove useless code; test=develop

* fix transpose alias bug; test=develop

* polish code; test=develop

* resolve conflict; test=develop

* resolve conflict; test=develop

* recover prepared operator; test=develop

* fix bug; test=develop

* polish code; test=develop

* fix bug; test=develop

* fix bug; test=develop

7a857924

Move BroadcastTensors OP to phi (#40047) · 2a5590a1

Committed by From00 on Mar 02, 2022

* Move BroadcastTensors OP to phi

* Remove mutable_data in impl

* Move BilinearTensorProductInferMeta to multiary.h/cc

2a5590a1

[bf16] add bf16 kernel: softmax & log_softmax (#39999) · 4a4215ff
Committed by zhangbo9674 on Mar 02, 2022
```
* add softmax log_softmax

* refine rocm

* refine unittest
```
4a4215ff
[phi] migrate gather_tree, reduce_prod to phi (#39844) · 6af2729e
Committed by crystal on Mar 02, 2022
```
* move to phi

* migrate gather_tree_op into phi

* move reduce_prod to phi

* optimize code
```
6af2729e
add logic kernel for mlu (#39940) · bc113e10
Committed by joeqiao12 on Mar 02, 2022

bc113e10
[KP] Activation op registration for XPU2. part 1/2 (#40002) · 90ab7403
Committed by Lijunhui on Mar 02, 2022

90ab7403
[Phi] Unify complex type trait and fix real imag bug (#40036) · 0764fda2
Committed by Chen Weihang on Mar 02, 2022
```
* unify complex type trait and fix real imag bug

* add unittest for type traits
```
0764fda2
[MLU] adapt matmul op (#39727) · b4d931e8
Committed by qipengh on Mar 02, 2022
```
* [MLU] adapt matmul op

* [MLU] fix phi namespace
```
b4d931e8
[MLU] add transpose2 mlu kernel (#39994) · 4cab812e
Committed by fwenguang on Mar 02, 2022

4cab812e

[Pten] Gru lstm migration (#39729) · e4dba69a

Committed by Feiyu Chan on Mar 02, 2022

* move sequence2batch

* move lstm and gru

* Add phi/kernels directory into exclusion to stop using hipcc to compile non .cu files in it.

e4dba69a

Mar 01, 2022 · 10 commits

[Phi]rm reduce infershape (#39820) · 09039636

Committed by chentianyu03 on Mar 01, 2022

* modify infershape utils and rm reduce infershape

* merge develop

* fix infermeta bug

* add IsForInferShape func in ArgumentMappingContext

* add reduce_mean infermeta

* modify annotation

* add default dims

09039636

[phi] tranfer the selu_op and pass the CI (#39819) · 197da15a

Committed by xiongkun on Mar 01, 2022

* tranfer the selu_op and pass the CI

* add sig files

* fix code

* fix by code review

* remove TODO

* change the include position

* change the head position

197da15a

[bf16] add bf16 kernel: layer_norm p_norm reduce_sum (#39843) · ce8ed978

Committed by zhangbo9674 on Mar 01, 2022

* add layer norm

* add p norm

* add reduce sum

* refine layer norm register bf16 for cudnn811

* add bf16 cast for hip

* add unittest

* refine rocm

* refine layer_norm unittest

* refine reduce op

* refine unittest

* enhance atol for reduce unittest

ce8ed978

[bf16] add bf16 kernel: scale gather sum (#39683) · 6d26b332

Committed by zhangbo9674 on Mar 01, 2022

* add scale gather sum

* refine CUDA_ATOMIC_WRAPPER ADD for bf16

* add gather unittest

* solve conflict

* add scale unittest

* add sum unittest

* solve conflict

* refine gather unittest

* refine unittest

6d26b332

[phi] migrate where kernel into phi (#39811) · 468a2a17
Committed by ronnywang on Mar 01, 2022

468a2a17
[phi] move uniform_random to phi (#39937) · b3466387
Committed by Leo Chen on Mar 01, 2022
```
* move uniform_random to phi

* fit selected_rows

* replace mutable_data
```
b3466387

optimize mergeadd for sparse_adam,*test=kunlun (#39966) · d4911594

Committed by z8hanghuan on Mar 01, 2022

* optimize mergeadd for sparse_adam,*test=kunlun

* optimize mergeadd for sparse_adam,*test=kunlun

* optimize mergeadd for sparse_adam, *test=kunlun

d4911594

[PHI] Support Multi Input and Output for InferShape (#39870) · e8d45583

Committed by zyfncg on Mar 01, 2022

* add multi input for infer_shape

* support multi output for infershape

* fix split bug

* fix bug of concat

* support vector<MetaTensor*> in infrt

* fix bug

e8d45583

[Phi] Migrate logical_and/or/not/xor into Phi (#39942) · 8c237973
Committed by Aurelius84 on Mar 01, 2022
```
* [Phi] Migrate logical_and/or/not/xor into Phi

* fix unittest

* fix function name
```
8c237973

Optimize group_norm op forward (#39596) · 657dd5a9

Committed by crystal on Mar 01, 2022

* optimize group norm forward

* use vectorized optimization

* add scalar calculation code

* optimize code

657dd5a9

机器未来 / Paddle is in sync with its fork source project