提交 · d09962a1403f6ed832b0c6a7b378b22b5f2bab52 · BaiXuePrincess / Paddle

06 2月, 2023 1 次提交
- E
  
  phi move ReshapeToMatrix & GetValue (#50139) · d09962a1
  由 engineer1109 提交于 2月 06, 2023
  
  d09962a1
07 12月, 2022 1 次提交
- 张
  
  [phi::DenseTensor] Replace Tensor with phi::DenseTensor (#48682) · 65420271
  由张春乔提交于 12月 07, 2022
  
  65420271
28 11月, 2022 1 次提交
- W
  fix: multihead matmul biasqk broadcast support for [1,1,seq,seq] shape (#47975) · 11b9d85f
  由 Wang Bojun 提交于 11月 28, 2022
```
* add trt support
```
  11b9d85f
31 10月, 2022 1 次提交

由 feng_shuai 提交于 10月 31, 2022

* optimize: vit 384

* fix:bug

* fix:bug

* fix:supoort rocm complie

* refactor:name

* fix:support rocm

* fix:__HIP_NO_HALF_CONVERSIONS__

* optimize: delete scalar

* fix:rocm can't support

* fix:ernie error

520adc0e

28 9月, 2022 1 次提交

Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e

由 Chen Weihang 提交于 9月 28, 2022

* remove needless using tensor

* remove needless using tensor

* resolve conflict

* replace tensor using

* fix format error

* revert needless changing

* fix rocm and npu compile error

* fix cinn compile error

* fix format error

* fix mkldnn format error

* fix mkldnn format error

* fix cinn compile error

* fix cinn compile error

* fix cinn compile error

* resolve conflict

e12a905e

17 8月, 2022 1 次提交
- W
  fix multi stream error. (#45196) · a79d4a75
  由 Wilber 提交于 8月 17, 2022
```
* fix multi stream error.
```
  a79d4a75
16 8月, 2022 1 次提交

convert multihead to oss (#45019) · f706d95d

由 feng_shuai 提交于 8月 16, 2022

* convert multihead to oss

* fix:bug

* fix:delete const cast

* fix:don't support bias_qk

* add vit pass

* fix:convert bug and add preln_residual_bias

* support length=-1

* add UT for convert

* add no_bias_qk support for gpu_multihead_op

* delete infer_shape depends on bias_qk

* oss just can be used in T4 and A*

* fix:change api for ROCM CI

f706d95d

02 8月, 2022 1 次提交

Multihead matmul fp16 (#44792) · 0fd8ee63

由 Wilber 提交于 8月 02, 2022

* multihead matmul add fp16

* fix windows error

* fix rocm error

* fix rocm error

0fd8ee63

01 8月, 2022 1 次提交

unify gpu context (#44740) · 86763023

由 Leo Chen 提交于 8月 01, 2022

* remove cudaDeviceContext

* remove more template

* fix rocm compile

* remove alias name CUDADeviceContext

* fix compile

* fix tests

* revert changes

86763023

26 6月, 2022 1 次提交
- S
  
  format all files in fluid using new config (#43776) · 576236a0
  由 Sing_chan 提交于 6月 26, 2022
  
  576236a0
05 6月, 2022 1 次提交
- S
  
  【code format check upgrade】 step2：clang-format (#42840) · a3730dc8
  由 Sing_chan 提交于 6月 05, 2022
  
  a3730dc8
20 2月, 2022 1 次提交

[PTen->Phi PR1] Change pten dirname and namespace to phi (#39748) · dcfe1986

由 Chen Weihang 提交于 2月 20, 2022

* rename pten dir to phi

* rename namespace to phi

* rename infrt pten dir to phi

* resolve conflict

* rename pten to phi in cmake

* revert all infrt change

* change needed files

* fix infrt failed

* fix inference failed

dcfe1986

19 2月, 2022 1 次提交

[Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264

由 Aurelius84 提交于 2月 19, 2022

* Unify paddle/pten::framework::ddim into pten::ddim

* fix paddle namespace

* compile sucessfully

* fix npu src file

* fix conflict

* fix conflict

* fix tensorrt compiler error

* fix conflict

* fix conflict

* fix tesst file conflict

* fix conflict

* fix mlu file conflict

* fix mlu file conflict

* fix cinn header file conflict

* fix conflict

* fix conflict

* fix conflict

* fix conflict

2fe04264

18 2月, 2022 1 次提交
- F
  [Pten] blas and lapck migration (#39587) · 8c7ee8c2
  由 Feiyu Chan 提交于 2月 18, 2022
```
* move blas related files
* move lapack related files
```
  8c7ee8c2
17 9月, 2021 1 次提交
- F
  broadcast qkv_op (#35780) · cf9eae4c
  由 feng_shuai 提交于 9月 17, 2021
```
* broadcast qkv_op

* use PADDLE_ENFORCE_GT to replace assert
```
  cf9eae4c
03 3月, 2021 1 次提交
- Q
  [ROCM] update fluid operators for rocm (part3), test=develop (#31213) · 84639b61
  由 Qi Li 提交于 3月 03, 2021
```
* [ROCM] update fluid operators for rocm (part3), test=develop

* fix clang format error, test=develop
```
  84639b61
04 4月, 2020 1 次提交

Delete Ref & VectorRef and add GetDataSafely (#22997) · 16315d3d

由 Chen Weihang 提交于 4月 04, 2020

* delete invalid check inferface Ref & VectorRef, test=develop

* fix vector ref delete error, test=develop

* try the new check inferface, test=develop

* change all related code with new check macro, test=develop

* remove static assert, test=develop

* polish detail, test=develop

* skip coverage problem, test=develop

* add new check macro, test=develop

16315d3d

26 3月, 2020 1 次提交

[Paddle-TRT]: Ernie Dynamic shape support. (#23138) · 430b0099

由 Zhaolong Xing 提交于 3月 26, 2020

* add dynamic plugin support.
test=develop

* change emb eltwise layernorm to math function
test=develop

* add emb eltwise layernorm
test=develop

* can run dynamic shape ernie
test=develop

* fix ci
test=develop

* add ut for trt ernie dynamic

test=develop

* refine dynamic shape c++ interface.
test=develop

* fix comments
test=develop

* fix comments
test=develop

430b0099

19 3月, 2020 1 次提交
- Z
  fix align error (#23090) · 8c6fde9e
  由 Zhaolong Xing 提交于 3月 19, 2020
```
test=develop
```
  8c6fde9e
05 3月, 2020 1 次提交
- Z
  [BUG]: Multihead matmul op's ouput size should be BxSx(N*H) (#22848) · 1a533ed2
  由 Zhaolong Xing 提交于 3月 05, 2020
```
test=develop
```
  1a533ed2
13 2月, 2020 1 次提交

[Ernie GPU Optim]: Fuse three fc to multihtead matmul (#22486) · 8acd745c

由 Zhaolong Xing 提交于 2月 13, 2020

* 1. optim multihead matmul: fuse three fc to multihtead matmul

test=develop

* fix conflict
test=develop

* fix comments
test=develop

8acd745c

27 12月, 2019 1 次提交

Refine multihead kernel, align block to 32 (#21961) · 8859ddd6

由 zhaoyuchen2018 提交于 12月 27, 2019

* Refine multihead kernel, align block to 32

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Refine log comments

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

8859ddd6

16 12月, 2019 1 次提交
- Z
  Fix softmax cuda bug (#21720) · a5a8d144
  由 zhaoyuchen2018 提交于 12月 16, 2019
```
* Fix softmax cuda bug

* Refine multihead log and softmax logic
```
  a5a8d144
29 11月, 2019 1 次提交

Add dscending for argsort (#21400) · b1627455

由 zhaoyuchen2018 提交于 11月 29, 2019

* Add ascending for argsort

* Refine api doc description.

* Refine descending description

* Add int32 logic to speedup when data is small size.

* Remove int32 opt as not support in python

b1627455

26 11月, 2019 1 次提交
- Z
  Fix ernie python infer diff (#21311) · afb13484
  由 zhaoyuchen2018 提交于 11月 26, 2019
```
* Fix ernie pythoin infer diff
* Refine mask

test=develop
```
  afb13484
30 10月, 2019 1 次提交

Move the codes of fused operators to operators/fused directory. (#20881) · 03ba0fda

由 Yiqun Liu 提交于 10月 30, 2019

* Move the codes of fused operators to operators/fused directory.
test=develop

* Correct the op name in cmake.

* Change the use of PADDLE_ENFORCE.
test=develop

03ba0fda

23 10月, 2019 1 次提交

Fix multihead op bug. (#20783) · 6e6eab07

由 zhaoyuchen2018 提交于 10月 23, 2019

The op should handle k=1024

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

6e6eab07

02 10月, 2019 1 次提交

Add multihead op for ernie opt (#19933) · e8673668

由 zhaoyuchen2018 提交于 10月 02, 2019

* Add multihead op for ernie opt

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Refine code

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Refine code

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Refine code

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Refine softmax

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Refine kernel.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Refine code

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Refine code

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Refine code

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Refine cuda kernel

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Refine cuda version

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Refine code

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Refine cmake

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

e8673668

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致