提交 · bbca66f2fada26a03cb82ef851a5e53f75771b82 · PaddlePaddle / Paddle

02 3月, 2023 11 次提交
- J
  【Prim】Fix slice error and eager comp (#51086) · bbca66f2
  由 Jiabin Yang 提交于 3月 02, 2023
```
* fix attrs copy error

* fix bert by fix slice error

* fix op test
```
  bbca66f2
- X
  [dy2static] bugfix: make stop_gradient a cache key (#50883) · 5e1185de
  由 xiongkun 提交于 3月 02, 2023
```
* [dy2static] bugfix: make stop_gradient a cache key
1. make stop_gradient cache key in dy2static.

* fix ci errors

* fix ci error

* fix ci error

* fix ci error
```
  5e1185de
- W
  
  [XPU] add smallest mode for top_k (#51053) · 0fd6e2a1
  由 wangshengxiang 提交于 3月 02, 2023
  
  0fd6e2a1
- fluid clean: remove dygraph_utils._append_bias_in_dygraph (#51021) · 8ac05c09
  由 easywaytolifebelief 提交于 3月 02, 2023
```
* fluid clean: remove dygraph_utils._append_bias_in_dygraph

* fix func name and imports
```
  8ac05c09
- L
  [AMP OP&Test] register fp16 and bf16 kernel for uniform_random (#50993) · 72f34450
  由 Leo Chen 提交于 3月 02, 2023
```
* register fp16 and bf16 kernel for uniform_random

* fix compile

* support selected_rows

* add ut

* revert cpu

* fp16 test skip cpu
```
  72f34450
- W
  Add concat grad cinn (#50972) · a4689c90
  由 wangzhen38 提交于 3月 02, 2023
```
* [cinn] concat_grad

* [cinn] concat_grad

* [cinn] concat_grad build success

* [Add PGLBOX] fix unnitest

* [Add PGLBOX] fix unnitest

* [Add PGLBOX] fix codestyle

* [cinn] update by comments

* [cinn] update by comment

* [cinn] add axis check
```
  a4689c90
- L
  
  [fp16] suppot fp16 in std (#50936) · d1dd7302
  由 LoneRanger 提交于 3月 02, 2023
  
  d1dd7302
- G
  
  [Hackathon NO.74] 为 Paddle-TRT 添加 grid_sampler 算子 (#50934) · 8f156fd7
  由 gaoziyuan 提交于 3月 02, 2023
  
  8f156fd7
- R
  Comp hardswish (#51003) · 51331098
  由 Roc 提交于 3月 02, 2023
```
* add composite op hard swish

* add test grad

* update apis calling

* update date range

* add ut

* tune off cinn for 0-d shape

* skip cinn
```
  51331098
- J
  
  [CINN] reopen some prim with cinn single test (#51081) · bb5dd203
  由 jiangcheng 提交于 3月 02, 2023
  
  bb5dd203
- V
  
  fix bug calculate_output in eagerChecker (#51069) · ff7ce2ff
  由 Vvsmile 提交于 3月 02, 2023
  
  ff7ce2ff
01 3月, 2023 12 次提交

Integration flash attention (#49869) · 61611786

由 Chitsing KUI 提交于 3月 01, 2023

* flash attn

* seed

* almost

* softmax

* fix workspace

* add unitest; linux only

* fix setup

* fix datatype include

* fix setup typo

* fix def scope

* new error api

* use paddle fork

* fix attr bug; complete ut

* update flash hash

* fix rng reset

* fix offset

* fix comments

61611786

[Tensor Operants & Prim-Relevant] Tensor supports logical operants (#50983) · 1794927b

由 HongyuJia 提交于 3月 01, 2023

* Add comments for #50886

* [Tensor Operants & Prim-Relevant] Tensor supports logical operants

* add prim dynamic unit test

* add prim static unit test

1794927b

add topk prim backward (#50679) · 296b3ff0

由 zqw_1997 提交于 3月 01, 2023

* tmp gather vjp

* support gather

* remove useless code

* fix compiling error

* fix ut

* add eager test

* add eager test

* add seed

* small change

* fix cpu error

* fix transpose op compat

* remove tensor index case

* fix prim_cinn

* small commit

* add cumsum prim backward

* small commit

* skip aixs=None test case

* fix op generante eror

* fix static test error

* remove unused code

* fix static test error

* small commit

* skip cpu float16 test case

* skip eager cpu cumsum float16 test case

* add eager and static UT

* fix ut

* add composite backward rule

* fix error

* fix type error and format error

* add try cpu+float16 test

* fix test bugs

* remove test for cpu+float16 and make y[0] be the grad arg

* add cinn test

* fix UT

* fix the wrong dim of v in test cases

* change y[0] to y[1] for grad in UT

* reshape flatten out

* Disable cinn single test

* use scatter_nd_add

* modify the reshape part of topk_grad

* delete useless build file

* to make the syntax right

* modify bug

* try use of put_along_axis

* remove cinn test

* reformat todo

* add silu composite rule

* fix code style.

* add cinn test

* fix composite grad maker code gen

* add prim in cumsum op test

* remove old test

* fix typro

* pass the static test

* fix typro

* modify optest and delete old test files

* remove normal test_top_k_op test

* fix typro

* pass axis=None test case

* buffer comment

* for debug

* add silu fp16 unit test.

* add static guard

* remove forward prim test

* remove same name axis

* modify the test_top_v2_op.py to pass all local tests

* delete the useless testcase

* fix mistake

* add more testcases to test dtype16 and dtype32

---------
Co-authored-by: NJiabinYang <360788950@qq.com>
Co-authored-by: NGGBond8488 <857631483@qq.com>
Co-authored-by: Nzxcd <228587199@qq.com>
Co-authored-by: NCharles-hit <wanghao107@baidu.com>

296b3ff0

[Fluidclean]move fluid.transpiler to distributed.transpiler (#51025) · 51aa2129

由 wangxiaoning 提交于 3月 01, 2023

* remove transpiler

* Revert "remove transpiler"

This reverts commit 46044ccd52011d45d7026786d331f264a6a8f645.

* Revert "Revert "remove transpiler""

This reverts commit 80ad0945401b5b5efebac4baee0ec50a793d4405.

* codestyle

* fix setup

* fix

* fix

51aa2129

Z

fix unit tests random error (#51054) · 9c60c5ec
由 Zhang Ting 提交于 3月 01, 2023

9c60c5ec

[Zero-Dim] Add Expand/Expand_as/Top_k for XPU to support Zero Dim Input. (#50947) · 226b4a95

由 yunyaoXYY 提交于 3月 01, 2023

* Add unitest from shilong

* Add kernel code from shilong

* fix codestyle

* add broadcast_shape test

* fix unitest

* fix unitests

* fix unitest

* add 0D grad support

* add 0D grad support

* add 0D grad support

* fix 0D tensor

* fix 0D

* fix xpu 0D

* fix expand kernel

* fix xpu expand

* Fix 0D kernel

* fix 0D

* fix 0D

* fix 0D

* fix 0D

* fix XPU top_k

* cancel the modify of xpu

* add XPU 0D tensor

* fix 0D

226b4a95

W

fix the backward bug of cumsum (#50997) · 934934d8
由 wawltor 提交于 3月 01, 2023

934934d8
C
fix zero bug of case18: paddle.logsumexp (#51034) · 2f900965
由 chenxiao120660 提交于 3月 01, 2023
```
* fix bug of logsumexp

* fix bug for logsumexp

* fix bug for logsumexp
```
2f900965

Add full_like composite rule (#50794) · 7468bab4

由 Yichen Zhang 提交于 3月 01, 2023

* implement composite full_like and simple unit test

* implement op tests for composite full_like op

* some modification as reviewers suggested
add cinn op test to CMakeLists.txt
fix code style

* fix code style

* modify input args of prim fill_any_like op

* resolve conflicts

* resolve conflicts

* modify python api and unit tests as suggested

* resolve conflicts

* resolve conflicts

* use framework.dtype to convert dtype in Op test

7468bab4

L

[fp16] suppot fp16 in diagflat (#50945) · af149c0c
由 LoneRanger 提交于 3月 01, 2023

af149c0c
N

Add multiprecision for rms op (#50132) · 48060b2e
由 niuliling123 提交于 3月 01, 2023

48060b2e

[XPU] Add kernels for VITDET (#50992) · 798b527c

由 duanyanhui 提交于 3月 01, 2023

* add support of int64 add for xpu

* add transpose support for int64

* add randperm kernel

* fix randperm

* add distribute_fpn_proposal kernel

* fix comment

* add reduce_sum_int32

798b527c

28 2月, 2023 17 次提交

I

Fix some typos (#50914) · 5d8fe822
由 iLeGend 提交于 2月 28, 2023

5d8fe822
H
Rewrite mkldnn fc rnn fuse pass tester (#50265) · eb22391c
由 Hulek 提交于 2月 28, 2023
```
* Added file

* Tests separated and rewritten, fixed fc_lstm_fuse_pass

* Resolve conflicts
```
eb22391c
H
[Extension Operants] Extension supports tensor operants (#50869) · 539293e2
由 HongyuJia 提交于 2月 28, 2023
```
* [Extension Operants] Extension supports tensor operants

* Polish fluid init_tensor_operants
```
539293e2

【prim】Matmul double grad composite api (#50452) · a0c473f4

由 xiaoguoguo626807 提交于 2月 28, 2023

* modify name

* merge develop

* original code

* build modify

* success 2*2

* fused dim=1 failed

* success

* modify static

* success for static except dim=1

* delete log

* tmp modify

* success

* success

* add fp1664

* delete fp16 cpu test

* stop windows test

* review modify

* modify tanh test

* modify tanh

* fix_conflixt

* modift static prim

* fix_conflict

* Update test_static_prim.cc

* update

* bug fix

a0c473f4

J
[Hybrid parallelism] Tensor Parallel Extra Sync (#50676) · 0b25f665
由 JZ-LIANG 提交于 2月 28, 2023
```
* main code

* unitest bug

* revert cmake
```
0b25f665

add cumsum prim backward (#50565) · ca2b6095

由 GGBond8488 提交于 2月 28, 2023

* add cumsum prim backward

* skip aixs=None test case

* fix op generante eror

* fix static test error

* remove unused code

* fix static test error

* skip cpu float16 test case

* skip eager cpu cumsum float16 test case

* add cinn test

* reshape flatten out

* Disable cinn single test

* remove cinn test

* reformat todo

* add prim in cumsum op test

* remove old test

* fix typro

* fix typro

* fix typro

* pass axis=None test case

* remove forward prim test

* remove same name axis

ca2b6095

陈

add float16 to equal (#50933) · 1e02769b
由陈沧夜提交于 2月 28, 2023

1e02769b
I

[fp16] suppot fp16 in argmin (#50858) · 69d49aba
由 Infinity_lee 提交于 2月 28, 2023

69d49aba
C

add static guard (#50971) · 72cbb6da
由 Charles-hit 提交于 2月 28, 2023

72cbb6da
Z

[XPU] support convert fp16 model (#50790) · f265a313
由 zhupengyang 提交于 2月 28, 2023

f265a313
L
[fp16] suppot fp16 in tensordot (#50938) · e4fbb286
由 LoneRanger 提交于 2月 28, 2023
```
* fix fp16 bug of tensordot

* fix fp16 of tensordot

* fix fp16 of tensordot
```
e4fbb286
张
[fp16] support fp16 on LocalResponseNorm (#50918) · b69af7ad
由张春乔提交于 2月 28, 2023
```
* support fp16 on LocalResponseNorm

* add docs in avgpool3d
```
b69af7ad
I

fix fp16 for tile op (#50913) · d841062b
由 Infinity_lee 提交于 2月 28, 2023

d841062b

张

[fp16] suppot fp16 on nn.Dropout2D (#50904) · bf05168c

由张春乔提交于 2月 28, 2023

* add unittest for nn.DropOut2D

* add fp16

* add fp16 in docs of temporal_shift_op.cc

* Update test_dropout_op.py

bf05168c

Z
add silu composite rule (#50838) · 5d70ba6d
由 zxcd 提交于 2月 28, 2023
```
* add silu composite rule

* fix code style.

* add silu fp16 unit test.
```
5d70ba6d

Add flatten composite rule (#50672) · 8220771b

由 xysheng-baidu 提交于 2月 28, 2023

* Add flatten composite rule

* get the right xshape and pass func test

* add cinn unit test

* Remove cinn test, wait for it to be added after repair

* add comp test to test_flatten_contiguous_range_op.py

* remove func test on composite_ops

* Add comments to maybe_wrap_dim func

* remove commented code

* fix the problem with 0D tensor case

* add flatten split rule comment

* fix syntax issues

* block flatten on resnet_prim_cinn

* remove maybe_wrap_dim func

* Use none instead od xshape

8220771b

T

zero-dim support for gcd and lcm (#50950) · c77eb1fd
由 Tao Luo 提交于 2月 28, 2023

c77eb1fd

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功