Commits · 8f6446d3d98418677f79e239e0e6d4214fb7aec6 · PaddlePaddle / Paddle

14 Aug, 2023 · 3 commits

[NewIR]test new ir op test in gpu (#55857) · 8f6446d3

Committed by kangguangli on Aug 14, 2023

* add ir output check in OpTest

* add ir grad check in op test

* fix bug in output check

* trigger CI

* test gpu ci

* trigger CI

* trigger CI

* add white list to relax precision check for some tests

* relax timeout of test_concat_op

* relax timeout of test_concat_op

8f6446d3

[Fluid] Move fused_softmax_mask_upper_triangle to phi (#55769) · 6e40fc1d

Committed by Sonder on Aug 14, 2023

6e40fc1d

Add rmsnorm residual bias add and quant (#55965) · 2ac6a7e4

Committed by MarDino on Aug 14, 2023

* add rmsnorm residual bias add and quant

* refine python interface

* add rmsnorm unittest

* Add layernorm

* fix layernorm unittest

* refine unittest

* fix example code

* fix review comment

2ac6a7e4

11 Aug, 2023 · 5 commits

remove the optimizer base and learning rate base (#56099) · 6eaed2da

Committed by LoneRanger on Aug 11, 2023

* remove the optimizer base and learning rate base

* fix bug

* fix bug

6eaed2da

Fix the shape of input sin and cos for fused_rope. (#56132) · f60c698f

Committed by Yiqun Liu on Aug 11, 2023

* Fix the shape of input sin and cos for fused_rope.

* Update shape in unittest.

f60c698f

replace fluid.io.load_inference_model, fluid.io.save_inference_model in fluid with 2.0 version (#55345) · bfc64801

Committed by Difer on Aug 11, 2023

* replace fluid.io.load_inference_model

* replace fluid.io.save_inference_model

* fix some bug

* fix some bugs of load & save model

* fix some bug

* fix test_inference_model_io bug

* fix word2vec_inference_model bug

* fix some bug

* fix valueError bug

* fix some bug

* fix a warning error

* for debug

* for debug

* fix io error

* fix test_wordvec_book error

* remove debug print

* fix load_var bug

* for debug cinn test

* revert cinn & fix inference_pass_test in windows

* fix some bugs

* revert cinn & fix inference_pass_test in windows

* for debug vars

* for debug

* fix quant_dequant_test

* fix some path errors

* remove fluid save/load

* fix incubate-fleet save

* move some from fluid.io to static.io

bfc64801

move some fluid apis (#55986) · eafc9889

Committed by Difer on Aug 11, 2023

* move fluid apis

* fix type error

* remove static exponential_decay

* fix some import error

* remove nn.py

* fix some error

* fix type error

eafc9889

[Prim] Fix get var in prim when list of single tensor (#56114) · 1e5fec39

Committed by cyber-pioneer on Aug 11, 2023

* fix get var in prim

* fix stack test case

1e5fec39

10 Aug, 2023 · 4 commits

Canceled the inplace_check of test_group_norm_op · b546b923

Committed by yangjianfengo1 on Aug 10, 2023

b546b923

fix trainable (#56104) · 292fd200

Committed by JYChen on Aug 10, 2023

292fd200

Add variable_length_memory_efficient_attention (#55400) · 4036c937

Committed by lzy on Aug 10, 2023

* add variable_length_memory_efficient_attention
* update variable_length_memory_efficient_attention unittest
* update variable_length_mem_eff_attn's docs and unittest
* update variable_length_mem_eff_attn's docs
* Update test_variable_length_memory_efficient_attention.py
* Update variable_length_memory_efficient_attention.cu
* fix codestyle
* fix variable_length_fmha's docs and unittest
* fix variable_length_fmha's docs

4036c937

fix A100 fused linear grad add ut bug (#56136) · b561a05e

Committed by Yuang Liu on Aug 10, 2023

b561a05e

09 Aug, 2023 · 6 commits

[Paddle Inference] Set softmax op use_cudnn default true. (#56036) · 4f2cf7fb

Committed by xiaoxiaohehe001 on Aug 09, 2023

* fix_softmax_eigen

* fix_ctest_seresnet

* fix_ci_error

4f2cf7fb

Add FP16 & BF16 for nanmedian (#56056) · 4ae9945b

Committed by cyberslack_lee on Aug 09, 2023

4ae9945b

Fix select sdp for FA-2 (#56045) · 08e46d6f

Committed by umiswing on Aug 09, 2023

08e46d6f

change index's dtype for int to int64 (#55949) · 8d181e37

Committed by niuliling123 on Aug 09, 2023

8d181e37

[NewIR] minor fix about new ir test (#56075) · a127d7c8

Committed by kangguangli on Aug 09, 2023

* fix bugs about new ir test

* enable dy2st newir test in all cases

* fix

a127d7c8

remove the AdamOptimizer, SGDOptimizer, MomentumOptimizer, ModelAverage, LookaheadOptimizer, FtrlOptimizer, DecayedAdagradOptimizer, DpsgdOptimizer in fluid and relocate the ExponentialMovingAverage, PipelineOptimizer, GradientMergeOptimizer and change optimizer base for LarsMomentumOptimizer and RecomputeOptimizer (#55970) · 723c6f77

Committed by LoneRanger on Aug 09, 2023

* change the optimizer base for SGDOptimizer

* change the optimizer base for SGDOptimizer

* replace the SGDOptimizer with SGD

* fix bug of sgd

* change the optimizer base for MomentumOptimizer

* fix the remaining tests

* remove the Momentum in fluid/optimizer.py

* fix bug

* fix bug

* fix bug

* fix bug

* Update test_resnet_cinn.py

* Update test_resnet_prim_cinn.py

* fix bug

* fix bug

* fix bug

* remove the ModelAverage in fluid

* remove the LookaheadOptimizer in fluid

* fix bug

* remove AdamOptimizer in fluid

* Update test_image_classification_fp16.py

* fix bug

* relocate the ExponentialMovingAverage in fluid

* restore the static api

* remove the FtrlOptimizer in fluid

* remove the DecayedAdagradOptimizer in fluid

* remove the DpsgdOptimizer in fluid

* fix bug

* fix codestyle

* fix bug

* fix bug

* relocate the PipelineOptimizer

* relocate the GradientMergeOptimizer

* fix bug

* fix bug

* fix bug

* fix doc

* Update __init__.py

* Update test_fleet_qat_meta_optimizer.py

* change optimizer base for LarsMomentumOptimizer

* fix bug

* fix conflict

* fix code-style

* fix sample codes

* fix bug

* fix bug

* fix cinn bug

* fix bug

* fix bug

* Update qat_optimizer.py

* Update __init__.py

* fix bug

* change optimizer base for RecomputeOptimizer

* fix bug

* fix bug

* Update test_imperative_optimizer_v2.py

723c6f77

08 Aug, 2023 · 3 commits

move `decayed_adagrad_op` to phi (#55995) · 0d920178

Committed by Wang Xin on Aug 08, 2023

* move decayed_adagrad_op to phi

* fix bug

0d920178

optimize op structure (#55988) · 6bd7f860

Committed by freeliuzc on Aug 08, 2023

6bd7f860

Fix test_lu_op (#55896) · 4728d58d

Committed by Tian Zheng on Aug 08, 2023

4728d58d

07 Aug, 2023 · 6 commits

Add attn_mask supported for FlashAttnKernel. (#55969) · 42e0c6b8

Committed by yin wei on Aug 07, 2023

* add mask

* add backward

* add enforce info

* update scale

* integrate code

* update enforce

* add enforce eq

* add error type

* update enforce

* add test_flash_attention

* Polish codes and fix compiling errors.

* Set num_splits to 0 for flash-attn with tensor mask.

* Fix the compiling error for non flash-attn case.

---------
Co-authored-by: Liu Yiqun <liuyiqun01@baidu.com>

42e0c6b8

Fix typos (#56008) · 4d094b0c

Committed by co63oc on Aug 07, 2023

4d094b0c

Test Del paddle_bfloat (#55904) · 8fc2366c

Committed by tianshuo78520a on Aug 07, 2023

* Test Del paddle_bfloat

* Del paddle_bfloat test

8fc2366c

Increase absolute error of test_group_norm_op (#55992) · 496de7f3

Committed by yangjianfengo1 on Aug 07, 2023

* inplace tol

* code style

496de7f3

Fix typos, test=document_fix (#56005) · f55f601e

Committed by co63oc on Aug 07, 2023

f55f601e

[WIP] Integration flash attention 2 (#55758) · 0473369f

Committed by umiswing on Aug 07, 2023

* Work for fa-2 padded fwd. Code to be cleaned.

* Work for fa2 unpadded fwd.

* Work for padded-bwd, dk get small diff on np.random.seed(0)

* Anyway I pass paddle's utest, except return softmax without dropout.

* Clean code.

* Modify interface.

* Clean code and add some check.

* Easy compile for dev.

* Fix ci.

* Fix ci-build.

* Add std c++17 option again.

* Limit max job when compiling fa2.

* Remove const_cast

* Add fwd params, to be cleaned.

* Clean code.

* Add bwd params.

* Clean code.

* Add enforce.

* Use v2.0.4

* Pass RNG state to fa2 capi

* Fix review.

* Add assert

* Skip compile for sm less than 80.

0473369f

04 Aug, 2023 · 4 commits

replace embedding in fluid with 2.0 version (#55757) · 2d91a9bd

Committed by Difer on Aug 04, 2023

* replace embedding

* replace sparse_embedding

* fix some bugs

* del embedding

* replace layers.embedding

* fix type error

2d91a9bd

[NewIR]New ir aot placement refactor (#55810) · dd1379e4

Committed by hong on Aug 04, 2023

* refactor aot

* update

* fix bugs

* remove some test

* fix bug

* fix bug

* fix bug

* fix bug

* update

dd1379e4

Support Combined indexing for __getitem__ and __setitem__ (#55211) · 697c712f

Committed by JYChen on Aug 04, 2023

* WIP: start writing combined indexing get

* list/tuple/Variable

* getitem 80%

* add setitem

* add some unittest for setitem

* lazy import

* fix some setitem error

* fix advance indexing with decreasing axes; fix strided_slice input name

* combine int-tensor getitem is ok (without boolean support & broadcast); add getitem unittest for static

* add broadcast & parse bool tensor for __getitem

* [change getitem] _getitem_impl_ to _getitem_static, not deleting the former one

* refine new getitem; fix ut in variable/var_base

* add __getitem__ ut in dygraph

* re-dispatch getitem for Py/CPP; fix strided_slice decrease axes error in dygraph

* fix ut; support tensor in slice

* [change setitem] _setitem_impl_ to _setitem_static, not deleting the former one

* remove some UT (for some, temporarily)

* add IndexError to solve timeout problem in static-mode

* 1. temporarily forbid all-False bool-indexput; 2. setitem_static will return new variable

* xpu uses old strategy

* rename dy2st setitem ut to avoid same-name problem

* dy2st for new combined index

* ut case for combine-index with dy2st

* open ut with all-false-bool setitem

* remove useless doc and _getitem_impl_

* change static res

* fix static xpu

697c712f

【PaddlePaddle Hackathon 4】No.63 : add embedding fp16 test (#51321) · 9f2d88e9

Committed by LoneRanger on Aug 04, 2023

9f2d88e9

03 Aug, 2023 · 2 commits

Increase relative error of test_group_norm_op unittest (#55943) · 84445499

Committed by yangjianfengo1 on Aug 03, 2023

* fix fp 16

* bf16 rtol

* fixed input

* code style

84445499

Fix test_eig_op_static_build (#55897) · 128f5df8

Committed by Tian Zheng on Aug 03, 2023

128f5df8

02 Aug, 2023 · 5 commits

[Inference] Replace groupNorm when data types are bf16 and fp16, and data format is NHWC implementation. (#55399) · e61d892a

Committed by yangjianfengo1 on Aug 02, 2023

* finish

* cpergroup odd

* fix bf16

* single channel

* code style

* precision alignment

* add head_file

* add bf16 head file

* bf16 2

* bf16

* bf16 head

* bf16 compile

* py test

* bf16 compile

* bf16 compile

* unset py test

* nhwc

* test

* mean var

* bf16 success

* su

* ctest success

* use is_same_as

* is_same

* use is_same

* rtol

* gpu_stream

* del sigmod

* fix bfloat16 type

* use cuda_bf16_hpp

* use_cuda_arch

* bfloat162float2

* del inplace_tol

* del max_releative_tol

* temp store

* precision alignment

* temp store

* plugin

* precision alignment

* alignment

* include cuda.h

* del half

* half single

* ci

* add const

* ci

* cudamemset

* del printf

* fp16 test

* add half compute

* del br16 ci

* del ci

* ci approve

* del fluid include

e61d892a

fix security bug (#55866) · 92aa92fa

Committed by wanghuancoder on Aug 02, 2023

* fix security bug

92aa92fa

Add FP16 & BF16 for erfinv (#55287) · 6d7efd09

Committed by cyberslack_lee on Aug 02, 2023

6d7efd09

Fix bugs of windows CI: skip the unit tests related to devices (#55889) · db700d10

Committed by xuxinyi389 on Aug 02, 2023

db700d10

Add scaled_dot_product_attention api (#55242) · b19dfb8c

Committed by zhenhailiu on Aug 02, 2023

b19dfb8c

01 Aug, 2023 · 1 commit

[CodeStyle] replace `assert np.allclose` with `np.testing.assert_allclose` and `assert np.array_equal` with `np.testing.assert_array_equal` (#55385) · 744e1eaf

Committed by Zhan Rongrui on Aug 01, 2023

744e1eaf
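
The substitution named in the [CodeStyle] commit title can be illustrated with a small sketch. This is not code from the commit; the helper names (`check_with_plain_assert`, `check_with_testing`, `failure_message`) and the sample arrays are hypothetical, chosen only to contrast what each style reports when a comparison fails.

```python
import numpy as np


def check_with_plain_assert(actual, expected, rtol=1e-5):
    # Old style: a bare assert raises AssertionError with no message,
    # and the check is skipped entirely when Python runs with -O.
    assert np.allclose(actual, expected, rtol=rtol)


def check_with_testing(actual, expected, rtol=1e-5):
    # New style: on failure, NumPy builds a message describing the
    # tolerance used and the mismatching elements.
    np.testing.assert_allclose(actual, expected, rtol=rtol)


def failure_message(check, actual, expected):
    # Hypothetical helper: run a check and return the text of the
    # AssertionError it raises, or None if the check passes.
    try:
        check(actual, expected)
    except AssertionError as exc:
        return str(exc)
    return None


expected = np.array([1.0, 2.0, 3.0])
actual = np.array([1.0, 2.0, 3.5])  # last element is deliberately wrong

plain_msg = failure_message(check_with_plain_assert, actual, expected)
detailed_msg = failure_message(check_with_testing, actual, expected)

print(repr(plain_msg))  # empty string: the bare assert carries no detail
print(detailed_msg)     # multi-line report produced by NumPy
```

The two calls are not exact drop-in equivalents: `np.allclose` defaults to `rtol=1e-5, atol=1e-8`, while `np.testing.assert_allclose` defaults to `rtol=1e-7, atol=0`, so a migration of this kind may need explicit tolerances to keep existing tests passing.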

31 Jul, 2023 · 1 commit

[BugFix] fix bug of UserWarnings in test_layer_norm_op.py (#55762) · 4df4b9fe

Committed by RedContritio on Jul 31, 2023

* update TestAPI arguments to enable param_attr and bias_attr in test_layer_norm_op

* add bf16 condition in test_layer_norm_op

* add fast_math condition

4df4b9fe

PaddlePaddle / Paddle · synced successfully about 1 year ago