Commits · c958ba740806a5a07e70851e3d778c4a88207f5e · PaddlePaddle / Paddle

06 Sep, 2023 3 commits

[xdoctest][task 248-249,266-267,269] reformat example code with google style... · c958ba74

Committed by 小飞猪 on Sep 06, 2023

[xdoctest][task 248-249,266-267,269] reformat example code with google style in `incubate/distributed/fleet/*`,`incubate/nn/layer/*` (#56772)

* [Doctest]fix No.248-249,266-267,269, test=docs_preview

* fix style

* fix

* add env:DISTRIBUTED

c958ba74

[xdoctest][task 268] reformat example code with google style in... · 9d183662

Committed by 小飞猪 on Sep 06, 2023

[xdoctest][task 268] reformat example code with google style in `/incubate/nn/layer/fused_transformer.py` (#56965)

* [Doctest]fix No.268, test=docs_preview

* Apply suggestions from code review

---------
Co-authored-by: Nyakku Shigure <sigure.qaq@gmail.com>

9d183662

[xdoctest] reformat example code with google style in No. 250-260 (#56541) · 4dbe441c

Committed by cyberslack_lee on Sep 06, 2023

* test=docs_preview

* test=docs_preview

* test=docs_preview

* test=docs_preview

* test=docs_preview

* test=docs_preview

* fix

* test=docs_preview

* test=docs_preview

* fix

* move stmts under imports

---------
Co-authored-by: SigureMo <sigure.qaq@gmail.com>

4dbe441c

05 Sep, 2023 1 commit

[xdoctest] reformat example code with google style in No. 264-265 (#56907) · 8746e230

Committed by KongAKun on Sep 05, 2023

* Fix styles of code

* update the GPU option

* add the GPU setup

* remove the note

* update the code

8746e230

04 Sep, 2023 1 commit

Add rotate_half implementation for fused_rope (#56401) · c089a2af

Committed by tianhaodongbd on Sep 04, 2023

* add rotate_half in fused_rope

* add position_ids in fused_rope

* modified examples about fused_rope

* add set_device in examples

c089a2af

31 Aug, 2023 1 commit

[xdoctest] reformat example code with google style in No.284-285 (#56784) · 3c2276e6

Committed by yuchen202 on Aug 31, 2023

3c2276e6

25 Aug, 2023 1 commit

[Paddle Inference] Add bias input of mmha and simplify mmha. (#56411) · 636dc2ff

Committed by xiaoxiaohehe001 on Aug 25, 2023

* add_bias_and_simplify_mmha

636dc2ff

21 Aug, 2023 1 commit

fix dynamic to static when export LLM inference model (#56390) · 95c4bb41

Committed by RichardWooSJTU on Aug 21, 2023

95c4bb41

16 Aug, 2023 1 commit

Refine FusedNorm comment (#56305) · 12547fb4

Committed by MarDino on Aug 16, 2023

* refine static op return val

12547fb4

15 Aug, 2023 1 commit

[Paddle Inference] Add masked multihead attention kernel and export API. (#55344) · 989c5e87

Committed by xiaoxiaohehe001 on Aug 15, 2023

* support_mmha
* add_python_api
* add_api_doc
* fix_doc_error
* fix_infermeta
* add_infermeta
* add_bf16_cuda_check
* add_bf16_check
* fix_ci_windows
* fix_ci_windows_kernel_register
* fix_test_mmha
* add_cumoffsets
* remove_bias
* delete_mmha_reshape_input_output
* rename_delete_hfile
* remove_fluid

---------
Co-authored-by: yangjianfengo1 <yangjianfeng01@baidu.com>

989c5e87

14 Aug, 2023 1 commit

Add rmsnorm residual bias add and quant (#55965) · 2ac6a7e4

Committed by MarDino on Aug 14, 2023

* add rmsnorm residual bias add and quant

* refine python interface

* add rmsnorm unittest

* Add layernorm

* fix layernorm unittest

* refine unittest

* fix example code

* fix review comment

2ac6a7e4

10 Aug, 2023 1 commit

Add variable_length_memory_efficient_attention (#55400) · 4036c937

Committed by lzy on Aug 10, 2023

* add variable_length_memory_efficient_attention
* update variable_length_memory_efficient_attention unittest
* update variable_length_mem_eff_attn's docs and unittest
* update variable_length_mem_eff_attn's docs
* Update test_variable_length_memory_efficient_attention.py
* Update variable_length_memory_efficient_attention.cu
* fix codestyle
* fix variable_length_fmha's docs and unittest
* fix variable_length_fmha's docs

4036c937

09 Aug, 2023 1 commit

change index's dtype for int to int64 (#55949) · 8d181e37

Committed by niuliling123 on Aug 09, 2023

8d181e37

26 Jul, 2023 2 commits

add sin and cos optional parameters to fused_rope op (#55415) · 581d05bb

Committed by tianhaodongbd on Jul 26, 2023

581d05bb

[Fluid Clean] remove module fluid.layers.control_flow (#55661) · a646e75f

Committed by JYChen on Jul 26, 2023

* remove api staticrnn

* move select_input/output to static/controw flow

* delete some func, only remain Switch

* clean fluid.layers.controw_flow

* remove fluid.layers.controlflow

* fix conditional_block ut

a646e75f

20 Jul, 2023 1 commit

Add fuse_linear_activation (#55420) · fa084e5e

Committed by niuliling123 on Jul 20, 2023

fa084e5e

11 Jul, 2023 1 commit

Integrate rmsnorm kernel (#54998) · 97d3d6ee

Committed by MarDino on Jul 11, 2023

* add rmsnorm kernel
* add static graph test
* fix round type
* use alignas to avoid msvc compile error
* remove redundant headerfile to avoid rocm compile error
* fix rocm compile not found cub
* Add document

97d3d6ee

03 Jul, 2023 1 commit

Update the rope op according to the comments (#54985) · 2401d48d

Committed by niuliling123 on Jul 03, 2023

2401d48d

29 Jun, 2023 1 commit

Add fused_rope forward op (#54351) · a215c46a

Committed by niuliling123 on Jun 29, 2023

* style

* more

* update ctest

* Update legacy_backward.yaml

* Update legacy_ops.yaml

* Update legacy_ops.yaml

* update

* update

* update for move

a215c46a

12 Jun, 2023 1 commit

bump black to 2023 style (#54523) · 44e0393c

Committed by Nyakku Shigure on Jun 12, 2023

44e0393c

09 Jun, 2023 1 commit

bump ruff to 0.0.272 and update config (#54449) · 8f65f72e

Committed by Nyakku Shigure on Jun 09, 2023

* bump ruff to 0.0.271 and update config

* exclude third_party

* bump ruff to 0.0.272

* refine config

8f65f72e

23 May, 2023 1 commit

fix typos(#53967) · c36a000d

Committed by cyberslack_lee on May 23, 2023

c36a000d

22 May, 2023 1 commit

[dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() (#53856) · 3794d171

Committed by Meteor Liu on May 22, 2023

* [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()

* [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()

* [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()

* [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()

* [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()

* [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()

* fixed cyclic reference that caused patial import

* fixed bad change

* fix bad import

* fix bad import

* fix bad import

* fix ut failed caused by change in_dynamic_mode

* fix ut failed caused by change in_dynamic_mode

* fixed usage of in_dynamic_mode() or in_dygraph_mode()

* revert python3 to python in .pre-commit-config.yaml

* fix merge conflicts

3794d171

19 May, 2023 1 commit

Add flash attention to speedup fused_gate_attention. (#52731) · d29c1f8e

Committed by limingshu on May 19, 2023

* Reorganize the forward codes of flash-attention.

* Fix forward.

* Remove some noused codes.

* Simplify codes and fix backward.

* Change all LOG(INFO) to VLOG and fix the backward.

* add scale for AF2 flash_attn, much thanks to xreki and shaojie for debug these codes

* decrease the effect of debug print on performance

* Unify the initialize of flashattn arguments.

* Rewirte the reshape of temp_mask and temp_bias.

* API support use_flash_attn.

* Fix compiling error on CI.

* Try to crop the flash-attention lib.

* Correct the condition of whether can use flash-attn.

* Remove the softmax_out argument.

* Remove is_causal.

* Polish codes.

* Fix qkv_transpose_out's shape and scaling of Q * K.

* Update commit of flash-attention.

---------
Co-authored-by: Liu Yiqun <liuyiqun01@baidu.com>

d29c1f8e

06 May, 2023 1 commit

Add fused_gate_attention API. (#53432) · b7295120

Committed by Yiqun Liu on May 06, 2023

* Add fused_gate_attention API.

* Implement FusedDropout API.

* Fix doc and add unittest.

* Skip for non-gpu device.

* Add unittest.

b7295120

17 Apr, 2023 1 commit

[Fused] controlled randomness for fused dropout add (#52903) · e36f80c6

Committed by Chitsing KUI on Apr 17, 2023

* add random control for fused dropout add

* add __init__

e36f80c6

31 Mar, 2023 1 commit

[CodeStyle][UP030][UP031][UP032] using f-string (#52062) · 40e4f5a5

Committed by 张春乔 on Mar 31, 2023

* autofix
Co-authored-by: Liyulingyue <83450930+Liyulingyue@users.noreply.github.com>

* revert changes in python/paddle/distributed/fleet/utils/hybrid_parallel_util.py

* empty commit, trigger ci

* fix test_slice

---------
Co-authored-by: SigureMo <sigure.qaq@gmail.com>

40e4f5a5

29 Mar, 2023 1 commit

Fix generate_kernels.py in CUDA 12.0 (#52232) · f2c96bc2

Committed by sneaxiy on Mar 29, 2023

* fix generate_kernels.py in CUDA 12.0

* fix attrs bug

f2c96bc2

24 Mar, 2023 1 commit

Memory Efficient Attention (#51867) · e5ad3859

Committed by ZhangDY-6483 on Mar 24, 2023

* first version, notest

* return final rst, notest

* use infinity() instead of max

* ut structure

* start up of ut

* generate lse

* update

* add depense

* reconstruct cmake

* move file

* add memory efficient attention and fix blasimpl

* update

* update cmake

* add namespace

* update cmake

* use .cu

* update for pad3d

* bug fix

* bug fix

* update

* bug fix

* update enforce

* add test case

* merge the lse pad

* fix kernel_fn of backward

* fix PADDLE_ENFORCE_EQ and phi_api

* fix PADDLE_ENFORCE

* fix PADDLE_ENFORCE

* rerun coverage

* fix memory efficient attention test

* rerun ci

* add cuda version condition

* add cuda version condition

* delete WIP test

* replace PADDLE_ENFORCE

* edit the namespace of datatype in multiple.cc

* rerun

* rerun

---------
Co-authored-by: liuyuang <liuyuang@baidu.com>

e5ad3859

23 Mar, 2023 1 commit

[CodeStyle][C408][C409][C410] Fix unnecessary <dict/list/tuple> call and... · cf391b81

Committed by PuQing on Mar 23, 2023

[CodeStyle][C408][C409][C410] Fix unnecessary <dict/list/tuple> call and unnecessary <list/tuple> passed to <list/tupule>() (#51928)

* autofix

* add select config

* autofix C410

* add C410 select

cf391b81

22 Mar, 2023 1 commit

add fused dropout add (#51752) · 6ba0507d

Committed by ShenLiang on Mar 22, 2023

6ba0507d

17 Mar, 2023 1 commit

fluid clean: remove fluid.ir and fluid.io (#51167) · 00877381

Committed by qizhaoaoe on Mar 17, 2023

* fluid clean: remove fluid.ir to framework.ir and some funcs form fluid.layer.io to incubate.

* delete fluid.ir

00877381

10 Mar, 2023 1 commit

Add attn_bias.py of xformers (#51387) · 54331f1a

Committed by sneaxiy on Mar 10, 2023

* add attn_bias.py

* add Python interface

* add license

* add test_attn_bias.py

* fix CPU test error

* fix ci error

54331f1a

22 Feb, 2023 1 commit

Fix some typos. (#50429) · 93b2bf4b

Committed by Shuangchi He on Feb 22, 2023

* Fix some typos.
Signed-off-by: Yulv-git <yulvchi@qq.com>

* pre-commit
Signed-off-by: Yulv-git <yulvchi@qq.com>

---------
Signed-off-by: Yulv-git <yulvchi@qq.com>

93b2bf4b

15 Feb, 2023 1 commit

make FusedMultiTransformer supports variable-lengths. (#49560) · 53df50c7

Committed by lzy on Feb 15, 2023

* make FusedMultiTransformer supports variable-lengths.

* modify ffn2 when cuda_version >= 11.6 because of #49392.

* code style

* delete remove_padding

53df50c7

01 Feb, 2023 1 commit

Fix Python IndexError of case2-3 (#49986) · fd5b8eea

Committed by RedContritio on Feb 01, 2023

* add shape check for fused_multi_head_attention

* use raise for coverage test

* add unittest

* remove unnecessary pass

* add unittest

fd5b8eea

05 Jan, 2023 2 commits

udpate_fused_attention_en_docs, test=document_fix (#49564) · 2811dcd0

Committed by Yuang Liu on Jan 05, 2023

2811dcd0

Add transpose_qkv_wb flags to the fused_attention_op. (#49494) · ec857b85

Committed by Yuang Liu on Jan 05, 2023

ec857b85

23 Dec, 2022 1 commit

make FusedMultiTransformer supports RoPE (#48842) · 644dfc60

Committed by lzy on Dec 23, 2022

644dfc60

22 Dec, 2022 1 commit

[Paddle Inference] Add moe phi kernel (#48703) · def2a87f

Committed by xiaoxiaohehe001 on Dec 22, 2022

def2a87f

PaddlePaddle / Paddle synced successfully about 1 year ago
