提交 · 636dc2ff48ca7bd421a39dd487f421da77d8207a · PaddlePaddle / Paddle

25 8月, 2023 1 次提交
- X
  [Paddle Inference] Add bias input of mmha and simplify mmha. (#56411) · 636dc2ff
  由 xiaoxiaohehe001 提交于 8月 25, 2023
```
* add_bias_and_simplify_mmha
```
  636dc2ff
23 8月, 2023 1 次提交
- W
  [IR] Ir fill constant (#56520) · e914f7fc
  由 wanghuancoder 提交于 8月 23, 2023
```
* support ir fill constant
```
  e914f7fc
22 8月, 2023 1 次提交
- [Paddle Inference] refactor linear_compress (#55490) · ffff3da0
  由 FormlessUnit 提交于 8月 22, 2023
```
* Modify kernels to support quantized_matmul

---------
Co-authored-by: Nsuperxf <1208713646@qq.com>
```
  ffff3da0
21 8月, 2023 1 次提交
- R
  
  fix dynamic to static when export LLM inference model (#56390) · 95c4bb41
  由 RichardWooSJTU 提交于 8月 21, 2023
  
  95c4bb41
18 8月, 2023 1 次提交
- H
  
  move dgc_momentum InferShape to phi (#56358) · a533dae3
  由 huangjiyi 提交于 8月 18, 2023
  
  a533dae3
15 8月, 2023 1 次提交

[Paddle Inference] Add masked multihead attention kernel and export API. (#55344) · 989c5e87

由 xiaoxiaohehe001 提交于 8月 15, 2023

* support_mmha
* add_python_api
* add_api_doc
* fix_doc_error
* fix_infermeta
* add_infermeta
* add_bf16_cuda_check
* add_bf16_check
* fix_ci_windows
* fix_ci_windows_kernel_register
* fix_test_mmha
* add_cumoffsets
* remove_bias
* delete_mmha_reshape_input_output
* rename_delete_hfile
* remove_fluid

---------
Co-authored-by: Nyangjianfengo1 <yangjianfeng01@baidu.com>

989c5e87

14 8月, 2023 1 次提交

Add rmsnorm residual bias add and quant (#55965) · 2ac6a7e4

由 MarDino 提交于 8月 14, 2023

* add rmsnorm residual bias add and quant

* refine python interface

* add rmsnorm unittest

* Add layernorm

* fix layernorm unittest

* refine unittest

* fix example code

* fix review comment

2ac6a7e4

10 8月, 2023 1 次提交

Add variable_length_memory_efficient_attention (#55400) · 4036c937

由 lzy 提交于 8月 10, 2023

* add variable_length_memory_efficient_attention
* update variable_length_memory_efficient_attention unittest
* update variable_length_mem_eff_attn's docs and unittest
* update variable_length_mem_eff_attn's docs
* Update test_variable_length_memory_efficient_attention.py
* Update variable_length_memory_efficient_attention.cu
* fix codestyle
* fix variable_length_fmha's docs and unittest
* fix variable_length_fmha's docs

4036c937

08 8月, 2023 2 次提交
- W
  move `decayed_adagrad_op` to phi (#55995) · 0d920178
  由 Wang Xin 提交于 8月 08, 2023
```
* move decayed_adagrad_op to phi

* fix bug
```
  0d920178
- F
  
  optimize op structure (#55988) · 6bd7f860
  由 freeliuzc 提交于 8月 08, 2023
  
  6bd7f860
03 8月, 2023 1 次提交
- Y
  
  Optim fused linear grad add (#55927) · 91873469
  由 Yuang Liu 提交于 8月 03, 2023
  
  91873469
26 7月, 2023 1 次提交
- T
  
  add sin and cos optional parameters to fused_rope op (#55415) · 581d05bb
  由 tianhaodongbd 提交于 7月 26, 2023
  
  581d05bb
13 7月, 2023 1 次提交
- F
  [inference] Add FusedBiasActKernel (#55301) · 0a4d1999
  由 freeliuzc 提交于 7月 13, 2023
```
* add init value for CudaSwishFunctor

* add new phi kernel fusedBiasActKernel
```
  0a4d1999
11 7月, 2023 1 次提交
- Linear compress (#55128) · f4290a92
  由 FormlessUnit 提交于 7月 11, 2023
```
* rename weight_only/llm.int8
```
  f4290a92
03 7月, 2023 1 次提交
- add linear_compress API (#54140) · c4d5ec66
  由 FormlessUnit 提交于 7月 03, 2023
```
* add linear_compress API
```
  c4d5ec66
29 6月, 2023 1 次提交

Add fused_rope forward op (#54351) · a215c46a

由 niuliling123 提交于 6月 29, 2023

* style

* more

* update ctest

* Update legacy_backward.yaml

* Update legacy_ops.yaml

* Update legacy_ops.yaml

* update

* update

* update for move

a215c46a

28 6月, 2023 1 次提交
- S
  [BugFix] Fix bug for binary_cross_entropy_with_logits loss (#54869) · bb42d870
  由 Siming Dai 提交于 6月 28, 2023
```
* add pos_weight in kernel

* fix unittest

* fix xpu

* fix bce unittest, change infermeta order
```
  bb42d870
26 6月, 2023 1 次提交

remove ops from OpsWithFluidKernelNeedMoveToPhi set (#54007) · 733eca85

由 Sonder 提交于 6月 26, 2023

* remove ops from OpsWithFluidKernelNeedMoveToPhi set

* open static build flag

* OpsWithFluidKernelNeedMoveToPhi

* open new_executor_static_build

* add infermate for cudnn_lstm

* fix

* update

* fix

* update

* update

* update

* fix pow2 decay

* fix pow2 decay

* recover analysis_predictor.cc

* fix pow2 decay

* fix cudnn lstm

* add output register info for svd

* fix pow2_decay_with_linear_warmup_kernel

* recover test lstm cudnn

* recover svg register codes

* fix register info

* fix reduce sum register info

* add output info for adadelta

* add output info for adadelta

* add output info for adamax

* fix complex abs register info

* add register info for cudnn_lstm_grad

* recover

* fix lstm cudnn

* fix

* fix xpu output registe info

* remove std::cout

* add backend

* remove output info in pow2_decay_with_linear_warmup_kernel

* add judgment in TensorShouldBeFakeInitialized

* recover power_

* close new_executor_static_build

* fix set_value_xpu

733eca85

16 6月, 2023 1 次提交
- Z
  fix lamb optimizer always_adapt (#54654) · 2a56f4b3
  由 zhiboniu 提交于 6月 16, 2023
```
* fix lamb always_adapt

* fix optest

* fix all optests
```
  2a56f4b3
01 6月, 2023 1 次提交

Support static graph code generation for conv2d, conv3d, depthwise_conv2d (#54201) · f3eccb3f

由 huangjiyi 提交于 6月 01, 2023

* update

* update cmake

* update

* update

* update

* update

* Revert "update cmake"

This reverts commit 1e1dc1b2bc9967b725201272607f939260070fd4.

* update

* update

* update

* update

f3eccb3f

23 5月, 2023 1 次提交
- H
  move fusion_group infershape to phi (#53934) · 3dc99088
  由 huangjiyi 提交于 5月 23, 2023
```
* update

* update

* update

* set out dtype
```
  3dc99088
10 5月, 2023 1 次提交

傅

add index_put api (#52886) · f3393f49

由傅剑寒提交于 5月 10, 2023

* add index_put api

* fix value broadcast in backward and add test case in static

* add timeout=120s for index_put

* add op_compat for index_put

* add inplace index_put test

* add test case when index tensor in indices is int32 when indices.size less than x.dims

* add index_put api backward in cpu place

* add backward test case

* refactor code to delete some duplicated code

* replace reshape with resize for decrease extra memcpy

* add datatype flag in backward yaml

* fix bug in documentation

* Update python/paddle/tensor/manipulation.py

---------
Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>

f3393f49

24 4月, 2023 1 次提交
- S
  Add weighted sample (#52013) · 6a8d98e0
  由 Siming Dai 提交于 4月 24, 2023
```
Add paddle.geometric.weighted_sample_neighbors API
```
  6a8d98e0
19 4月, 2023 1 次提交
- Z
  fix graph_reindex (#52930) · e5506be6
  由 zhangyuqin1998 提交于 4月 19, 2023
```
* fix graph_reindex

* fix

* Update op_compat.yaml
```
  e5506be6
11 4月, 2023 2 次提交
- Z
  
  delete remote_prefetch (#52748) · 3951c40d
  由 zhangyuqin1998 提交于 4月 11, 2023
  
  3951c40d
- W
  
  [BUG Fixs] adadelta lr support (#49732) · 23032590
  由 wangzhen38 提交于 4月 11, 2023
  
  23032590
04 4月, 2023 1 次提交
- Z
  rename_bilinear_tensor_product (#52375) · 34069c46
  由 zhangyuqin1998 提交于 4月 04, 2023
```
* rename_bilinear_tensor_product

* fix
```
  34069c46
27 3月, 2023 1 次提交
- Z
  
  edit formate of mea (#52147) · 13baef48
  由 ZhangDY-6483 提交于 3月 27, 2023
  
  13baef48
24 3月, 2023 1 次提交

Memory Efficient Attention (#51867) · e5ad3859

由 ZhangDY-6483 提交于 3月 24, 2023

* first version, notest

* return final rst, notest

* use infinity() instead of max

* ut structure

* start up of ut

* generate lse

* update

* add depense

* reconstruct cmake

* move file

* add memory efficient attention and fix blasimpl

* update

* update cmake

* add namespace

* update cmake

* use .cu

* update for pad3d

* bug fix

* bug fix

* update

* bug fix

* update enforce

* add test case

* merge the lse pad

* fix kernel_fn of backward

* fix PADDLE_ENFORCE_EQ and phi_api

* fix PADDLE_ENFORCE

* fix PADDLE_ENFORCE

* rerun coverage

* fix memory efficient attention test

* rerun ci

* add cuda version condition

* add cuda version condition

* delete WIP test

* replace PADDLE_ENFORCE

* edit the namespace of datatype in multiple.cc

* rerun

* rerun

---------
Co-authored-by: Nliuyuang <liuyuang@baidu.com>

e5ad3859

22 3月, 2023 1 次提交

Add fused_linear_param_grad_add_kernel (#51805) · f59c5d8b

由 sneaxiy 提交于 3月 22, 2023

* add fused_linear_param_grad_add_kernel

* fix compile error

* remove flag

* fix ci compile error

* fix ci compile error

* revert pylayer revision

* fix ci ut

* improve performance

f59c5d8b

08 3月, 2023 1 次提交
- N
  
  Add mult_precision param for adamax op (#49705) · 151ec311
  由 niuliling123 提交于 3月 08, 2023
  
  151ec311
06 3月, 2023 1 次提交
- N
  
  Add multiprecision for adadelta op (#50131) · a8a2b7f4
  由 niuliling123 提交于 3月 06, 2023
  
  a8a2b7f4
03 3月, 2023 1 次提交
- N
  
  Add multi_precision for adagrad op (#50078) · 4779c2c1
  由 niuliling123 提交于 3月 03, 2023
  
  4779c2c1
01 3月, 2023 1 次提交
- N
  
  Add multiprecision for rms op (#50132) · 48060b2e
  由 niuliling123 提交于 3月 01, 2023
  
  48060b2e
17 2月, 2023 1 次提交

Rename MultiTensorAdam To FusedAdam (#50449) · e6af9bd2

由 yuehuayingxueluo 提交于 2月 17, 2023

* rename multi_tensor_adam to fused_adam

* fix some bugs

* fix CI coverage

* rename test_fused_adam.py

* fix some bug

* add test_fused_adam_op.py

* fix some bugs

* fix fused_adam_op.cc

* fix CI bugs

* fix CI bug

* fix CI bug

e6af9bd2

16 2月, 2023 1 次提交
- C
  Add logspace yaml (#49194) · c284d42a
  由 Chen Weihang 提交于 2月 16, 2023
```
* add logspace yaml

* update by comments

* resolve test framework conflicct
```
  c284d42a
09 2月, 2023 1 次提交

Add MultiTenosrAdam OP (#49220) · 10654c77

由 yuehuayingxueluo 提交于 2月 09, 2023

* add multi_tenosr_adam

* update multi_tensor_base.py, test_multi_tensor_adam.py, adamw.py

* fix adam.py optimizer.py

* fix adamw.py

* fix test_multi_tensor_adam.py

* fix CI bug

* fix CI coverage

* fix ci bug

* fix betapow

* fix some bugs

* fix test_adamw_op.py

* fix CI coverage

* fix multi_tensor_adam_kernel.cc

* fix CI bug

* fix multi_tensor_adam_op.cc and test_multi_tensor_adam.py

* fix code style

* update C++ parts

* remove python parts modification temporarily

* add C++ ut

* update betapow copy code logic

* fix ci ut

* fix windows ci

* fix coverage ci

* improve coverage rate

---------
Co-authored-by: Nsneaxiy <sneaxiy@126.com>

10654c77

23 12月, 2022 1 次提交
- H
  add rnn-t loss and api (#49199) · c088f9ec
  由 Hui Zhang 提交于 12月 23, 2022
```
* add warp transducer code
```
  c088f9ec
22 12月, 2022 1 次提交
- X
  
  [Paddle Inference] Add moe phi kernel (#48703) · def2a87f
  由 xiaoxiaohehe001 提交于 12月 22, 2022
  
  def2a87f
09 12月, 2022 1 次提交
- L
  move share_buffer kernel to phi (#48858) · c2e77ba3
  由 Leo Chen 提交于 12月 09, 2022
```
* move share_buffer kernel to phi

* fix ut

* add source file

* fix window links
```
  c2e77ba3

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功