提交 · c5fc413a03e43c34dc76bf7ff35716842e4fbd6a · PaddlePaddle / Paddle

28 8月, 2023 2 次提交

【inplace api】Batch add inplace api gt_, ge_, lt_, le_, eq_, not_equal_,... · c5fc413a

由 GGBond8488 提交于 8月 28, 2023

【inplace api】Batch add inplace api gt_, ge_, lt_, le_, eq_, not_equal_, logical_and_, logical_or_, logical_xor_, logical_not_, divide_, floor_divide_, bitwise_and_ , bitwise_or_, bitwise_xor_, bitwise_not_ (#55509)

* tmp commit

* add atan2

* add inplace api

* fix error

* add inpalce divide

* add inplace api

* add more inplace

* add more inpalce

* fix logical_not error

* support sinh and cosh in cpu

* support asin, acos, atan, asinh, acosh, atanh in cpu

* fix typro

* fix typro

* mv out atan2 ldexp

* mv out atan2 ldexp

* support sinh and cosh in gpu

* support asin, acos, atan, asinh, acosh, atanh in gpu

* fix ge error

* fix dygraph commpare error

* fix dygraph commpare error

* check complex in python

* fix cast inpalce error

* open inplace test

* fix ops.yaml error

* mv cast inpalce to python

* fix coverage ci

* add last inplace

* fix inplace error

* fix cast error

* fix error

* add nan_to_num_

* fix typro

* fix sparse cast error

* remove gpu 4

* fix static cast error

* tmp commit

* add atan2

* add inplace api

* fix error

* add inpalce divide

* add inplace api

* add more inplace

* add more inpalce

* fix logical_not error

* fix typro

* fix typro

* mv out atan2 ldexp

* mv out atan2 ldexp

* fix ge error

* fix dygraph commpare error

* fix dygraph commpare error

* fix cast inpalce error

* open inplace test

* fix ops.yaml error

* mv cast inpalce to python

* fix coverage ci

* add last inplace

* fix inplace error

* fix cast error

* fix error

* add nan_to_num_

* fix typro

* fix sparse cast error

* remove gpu 4

* fix static cast error

* fix cast error

* fix

* Revert "check complex in python"

This reverts commit c822064261d774dd58ad46a4f90ba8b467700a05.

* add renorm , fix error

* add coverage

* fix cumsum inpalce version error

* add cast inpalce impl

* rm test.log

* fix multiply_dyfunction and add multiply_backward test

* add and use is_same_tensor

* fix typro

* fix sone error

* fix typro

---------
Co-authored-by: NScotty <jmhgchn@gmail.com>
Co-authored-by: NScotty <527407973@qq.com>

c5fc413a

[Phi] move shuffle_batch to phi (#56547) · 30708028

由 Sonder 提交于 8月 28, 2023

* move shuffle_batch to phi

* remove useless codes

* add test_shuffle_batch_op to STATIC_BUILD_TESTS

* move shuffle_batch_kernel.cc to cpu folder

* move shuffle_batch_grad to phi

* rm shuffle_batch_op.h

* change year at file head

30708028

25 8月, 2023 1 次提交
- L
  [Reshard] Support create shard tensor and non-zero dim reshard (#56553) · 99795a13
  由 LiYuRio 提交于 8月 25, 2023
```
* support create shard dist tesnor

* support non-zero shard to replicated

* change reshard signature
```
  99795a13
24 8月, 2023 2 次提交
- N
  
  Add enable/disable_model_check_nan_inf op (#54081) · 1c0db09a
  由 niuliling123 提交于 8月 24, 2023
  
  1c0db09a
- W
  
  refine fill with tensor (#56568) · 9ad06e06
  由 wanghuancoder 提交于 8月 24, 2023
  
  9ad06e06
23 8月, 2023 2 次提交
- W
  
  move c_identity to phi (#56215) · 9ed58bff
  由 Wang Xin 提交于 8月 23, 2023
  
  9ed58bff
- W
  [IR] Ir fill constant (#56520) · e914f7fc
  由 wanghuancoder 提交于 8月 23, 2023
```
* support ir fill constant
```
  e914f7fc
22 8月, 2023 2 次提交
- R
  
  [Fluid] NO.4 Migrate c_split to PHI (#56327) · 5dc7ff04
  由 Ruibin Cheung 提交于 8月 22, 2023
  
  5dc7ff04
- [Paddle Inference] refactor linear_compress (#55490) · ffff3da0
  由 FormlessUnit 提交于 8月 22, 2023
```
* Modify kernels to support quantized_matmul

---------
Co-authored-by: Nsuperxf <1208713646@qq.com>
```
  ffff3da0
21 8月, 2023 3 次提交
- I
  
  bug fix of operator "interp_linear" · 752f29a1
  由 idontkonwher 提交于 8月 21, 2023
  
  752f29a1
- J
  
  bugfix, read and write race at fast_ln_fwd (#56435) · 1f987a75
  由 Jeng Bai-Cheng 提交于 8月 21, 2023
  
  1f987a75
- R
  【Complex op】add complex support for numel (#56412) · f8cba26d
  由 Ryan 提交于 8月 21, 2023
```
* add complex numel

* change test && add doc
```
  f8cba26d
16 8月, 2023 3 次提交
- H
  move dgc_momentum kernel to phi (#56158) · baa4fb42
  由 huangjiyi 提交于 8月 16, 2023
```
* update

* update
```
  baa4fb42
- S
  
  [Fluid] move assign_pos to phi (#55794) · 9d899273
  由 Sonder 提交于 8月 16, 2023
  
  9d899273
- R
  [Fluid] NO.1 Migrate c_embedding to PHI (#56129) · 7c9abfb2
  由 Ruibin Cheung 提交于 8月 16, 2023
```
* [Fluid] Migrate c_embedding to PHI

* fix

* add python_api

* fix ut

* migrate xpu kernel

* fix windows compile error
```
  7c9abfb2
15 8月, 2023 5 次提交
- Y
  Add flash attention backward grad check (#56249) · 1509a036
  由 yinwei 提交于 8月 15, 2023
```
---------
Co-authored-by: Ntianhaodongbd <tianhaodong@baidu.com>
```
  1509a036
- [dtype] add fp16 support for dist_kernel (#56184) · ea590ef6
  由 iSerendipity 提交于 8月 15, 2023
```
* [dtype] add fp16 support for dist_kernel

* fix typo

* fix CE

* fix CE

* fix CE

* fix CE

* fix CE

* refactor

* fix CE

* fix CE

* fix varname

* add bf16

* add ut for bf16

* fix CE
```
  ea590ef6
- Z
  
  disable fast_layer_norm (#56263) · bf0ef606
  由 zhaoyingli 提交于 8月 15, 2023
  
  bf0ef606
- R
  [Fluid] NO.12 Migrate number_count to PHI (#56128) · c27bd049
  由 Ruibin Cheung 提交于 8月 15, 2023
```
* [Fluid] Migrate number_count to PHI

* fix out alloc

* fix ut (add python_api)
```
  c27bd049
- L
  
  Merge reduce type of auto_parallel and phi kernel (#56202) · 786c6e99
  由 LiYuRio 提交于 8月 15, 2023
  
  786c6e99
14 8月, 2023 2 次提交
- write the common functions p_norm_kernel.cu and p_norm_grad_kernel.cu to p_norm_utils.h (#56191) · 7f0bdf07
  由周波涛提交于 8月 14, 2023
  
  7f0bdf07
- Add rmsnorm residual bias add and quant (#55965) · 2ac6a7e4
  由 MarDino 提交于 8月 14, 2023
```
* add rmsnorm residual bias add and quant

* refine python interface

* add rmsnorm unittest

* Add layernorm

* fix layernorm unittest

* refine unittest

* fix example code

* fix review comment
```
  2ac6a7e4
10 8月, 2023 1 次提交
- L
  
  Implement reshard from s to r with same process_mesh (#56039) · 4569ae13
  由 LiYuRio 提交于 8月 10, 2023
  
  4569ae13
09 8月, 2023 1 次提交
- C
  
  Add FP16 & BF16 for nanmedian (#56056) · 4ae9945b
  由 cyberslack_lee 提交于 8月 09, 2023
  
  4ae9945b
08 8月, 2023 3 次提交
- W
  move `decayed_adagrad_op` to phi (#55995) · 0d920178
  由 Wang Xin 提交于 8月 08, 2023
```
* move decayed_adagrad_op to phi

* fix bug
```
  0d920178
- H
  
  move dgc kernel to phi (#56003) · 3c03ade8
  由 huangjiyi 提交于 8月 08, 2023
  
  3c03ade8
- H
  
  add data op data type (#56033) · 7472057c
  由 hong 提交于 8月 08, 2023
  
  7472057c
07 8月, 2023 3 次提交

Add attn_mask supported for FlashAttnKernel. (#55969) · 42e0c6b8

由 yin wei 提交于 8月 07, 2023

* add mask

* add backword

* add enforce info

* update scale

* integrate code

* update enforce

* add enforce eq

* add error type

* update enforce

* add test_flash_attention

* Polish codes and fix compiling errors.

* Set num_splits to 0 for flash-attn with tensor mask.

* Fix the compiling error for non flash-attn case.

---------
Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>

42e0c6b8

R

[clang-tidy] enable modernize-use-equals-default (#55983) · 30a02d27
由 Ruibin Cheung 提交于 8月 07, 2023

30a02d27

[WIP] Integration flash attention 2 (#55758) · 0473369f

由 umiswing 提交于 8月 07, 2023

* Work for fa-2 padded fwd. Code to be cleaned.

* Work for fa2 unpadded fwd.

* Work for padded-bwd, dk get small diff on np.random.seed(0)

* Anyway I pass paddle's utest, except return softmax without dropout.

* Clean code.

* Modify interface.

* Clean code and add some check.

* Easy compile for dev.

* Fix ci.

* Fix ci-build.

* Add std c++17 option again.

* Limit max job when compiling fa2.

* Remove const_cast

* Add fwd params, to be cleaned.

* Clean code.

* Add bwd params.

* Clean code.

* Add enforce.

* Use v2.0.4

* Pass RNG state to fa2 capi

* Fix review.

* Add assert

* Skip compile for sm less than 80.

0473369f

04 8月, 2023 1 次提交

[NewIR] Rename feed with place to data (#55778) · 274e5e54

由 kangguangli 提交于 8月 04, 2023

* fix bug: feed_with_place should consider variable existence

* fix

* fix build scope

* change method to set feed var name

* remove feed_with_place to placeholder

* fix

* rename to data

* fix

* fix

274e5e54

03 8月, 2023 2 次提交
- Y
  
  FLUID: move limit_by_capacity to PHI (#55948) · 230c6ce1
  由 yangguohao 提交于 8月 03, 2023
  
  230c6ce1
- W
  
  [clang-tidy] [No.4] enable `modernize-loop-convert` (#55704) · 81ccd99e
  由 Wang Xin 提交于 8月 03, 2023
  
  81ccd99e
02 8月, 2023 3 次提交

[Inference] Replace groupNorm when data types are bf16 and fp16, and data... · e61d892a

由 yangjianfengo1 提交于 8月 02, 2023

[Inference] Replace groupNorm when data types are bf16 and fp16, and data format is NHWC implementation. (#55399)

* finish

* cpergroup odd

* fix bf16

* single channel

* code style

* jingdu duiqi

* add head_file

* add bf16 head file

* bf16 2

* bf16

* bf16 head

* bf16 compile

* py test

* bf16 compile

* bf16 compile

* unset py test

* nhwc

* test

* mean var

* bf16 success

* su

* ctest success

* use is_same_as

* is_same

* use is_same

* rtol

* gpu_stream

* del sigmod

* fix bfloat16 type

* use cuda_bf16_hpp

* use_cuda_arch

* bfloat162float2

* del inplace_tol

* del max_releative_tol

* temp store

* jingdu duiqi

* temp store

* plugin

* jingdu duiqi

* duiqi

* include cuda.h

* del half

* half single

* ci

* add const

* ci

* cudamemset

* del printf

* fp16 test

* add half compute

* del br16 ci

* del ci

* ci approve

* del fluid include

e61d892a

C

Add FP16 & BF16 for erfinv (#55287) · 6d7efd09
由 cyberslack_lee 提交于 8月 02, 2023

6d7efd09
W
fix security bug (#55782) · 19da5c0c
由 wanghuancoder 提交于 8月 02, 2023
```
* fix security bug
```
19da5c0c

01 8月, 2023 3 次提交
- S
  move prune_gate_by_capacity to phi (#55780) · 6b93ba0a
  由 Sonder 提交于 8月 01, 2023
```
* move prune_gate_by_capacity to phi

* fix

* fix registe info

* remove useless codes
```
  6b93ba0a
- G
  
  [phi] move nop to phi (#55816) · 719b1ed3
  由 gouzil 提交于 8月 01, 2023
  
  719b1ed3
- H
  [NewIR]New ir support print op (#55648) · 75c29ac1
  由 hong 提交于 8月 01, 2023
```
* new ir support print op

* fix gpu bug

* fix bug

* update

* remove layout to string

* remove usless header

* polish code

* fix bug

* posolis code
```
  75c29ac1
31 7月, 2023 1 次提交
- H
  [NewIR]fix new ir shadow typo (#55706) · 2265d63c
  由 hong 提交于 7月 31, 2023
```
* fix new ir shadow typo

* update
```
  2265d63c

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功