提交 · ec814cf5eb85bf83dbb1dc93b3a4f7576b4185b2 · PaddlePaddle / Paddle

27 2月, 2023 5 次提交
- C
  
  revert operator.cc (#50895) · ec814cf5
  由 csy0225 提交于 2月 27, 2023
  
  ec814cf5
- J
  [kunlun] support reduce_scatter (#50792) · 6786c012
  由 jameszhang 提交于 2月 27, 2023
```
* [kunlun] support reduce_scatter

* uncomment unittest

* update xccl to 1.0.10
```
  6786c012
- revert reshape 0 represent copy and support perm < 0 for paddle.transpose (#50720) · 3669868d
  由 zhouweiwei2014 提交于 2月 27, 2023
  
  3669868d
- [Bfloat16]register bfloat16 datatype for squared l2 norm (#50908) · 3c121040
  由 shaojie_wang 提交于 2月 26, 2023
```
* register bfloat16 datatype for squared l2 norm

* register bfloat16 datatype for softmax with upper triangular mask

* register bfloat16 for tril triu cuda kernel
```
  3c121040
- W
  [mv fleet] mv fleet to distributed (#50834) · 5d322ced
  由 wangzhen38 提交于 2月 27, 2023
```
* [mv fleet] mv fleet to distributed

* [mv fleet] for ci

* [mv fleet] for ci

* [mv fleet] solve ci of version
```
  5d322ced
26 2月, 2023 1 次提交

Enable matmul + bias fusion in fused_gat_attention. (#50755) · 57f6a469

由 Yiqun Liu 提交于 2月 26, 2023

* Enable matmul + bias fusion in fused_gat_attention.

* Add a variable to control whether using fused matmul + bias.

57f6a469

24 2月, 2023 11 次提交

Z
[Paddle-TRT] allow plugin fall back to fp16 when int8 (#50554) · f24eadd9
由 zhoutianzi666 提交于 2月 24, 2023
```
* allow fall back to fp16 when int8

* refine code

* refine code

* refine code
```
f24eadd9

Fused ops converter (#50751) · 9429936c

由 Sławomir Siwek 提交于 2月 24, 2023

* ConvertToFusedOp

* change static to inline
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

---------
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

9429936c

N

Fix KP operator Kernel selection error (#50178) · 6ef3f2ce
由 niuliling123 提交于 2月 24, 2023

6ef3f2ce

【Prim】Fix prim amp (#50518) · 6664a232

由 Jiabin Yang 提交于 2月 24, 2023

* change amp with to_prim

* fix prim amp

* fix rules

* fix liear

* add amp test

* add test

* disable this test on cpu

* disable this test on cpu

---------
Co-authored-by: Ncyber-pioneer <chenzhuo@tju.edu.cn>

6664a232

C

fix composite grad maker code gen (#50854) · 07c416c8
由 Charles-hit 提交于 2月 24, 2023

07c416c8
Y

Fix libpaddle_inference.so symbol conflicts with other .so (gflags) (#50787) · 041ea14c
由 Yuanle Liu 提交于 2月 24, 2023

041ea14c

support 'backend' in static ops (#50671) · 363825df

由 HappyHeavyRain 提交于 2月 24, 2023

* support 'backend' in static ops

* change bitwise_xx comment in python

* change bitwise_xxx comment in python

* change 'backend' and 'data_type' in GetExpectedKernelType

363825df

【prim】Slice grad (#50771) · f6dea800

由 xiaoguoguo626807 提交于 2月 24, 2023

* support prim test in OpTest

* fix cmake

* fix op test

* fix test_input_spec

* disable cinn in reduce_sum unit test

* add bfloat16 dtype for sum

* add approve rules

* polish code

* add clear jit program function

* convert grad out from tensor to numpy

* remove unnecessary code

* add only_prim flag

* fix flag

* fix op test

* add attr

* fix optest comp inplace error

* fix op test

* fix op test with guard

* add initialization of check_comp flag

* fix comp inplace error in op test

* rename check_comp with check_prim and add bfloat16 dtype convert

* rename comp_op_type to prim_op_type

* rename comp to prim

* remove useless code

* skip ci check for only prim

* add no_grad_vars and grad_outputs in prim test

* fix var_dict

* fix op test for only_prim

* fix dy2static bugs

* polish some code

* temp

* modify op test

* except cinn test

* modify bfp16

* modify pad grad

* add pad_grad dtype

* start cinn part

---------
Co-authored-by: NCharles-hit <wanghao107@baidu.com>

f6dea800

H

[Tensor Operants & Prim] Tensor arithmetic operants support left scalar type (#50840) · 0d956e17
由 HongyuJia 提交于 2月 24, 2023

0d956e17
Z
[Paddle-TRT] Fix QkvToContextPluginDynamic bug (#50715) · 612d5da0
由 zhoutianzi666 提交于 2月 24, 2023
```
* fix multihead

* fix multihead
```
612d5da0

[CINN]Enhance CacheKey hash logic by considering input dtypes (#50557) · 21c6eccf

由 Aurelius84 提交于 2月 24, 2023

* [CINN]Enhance CacheKey hash logic by considering input dtypes

* add unittest

* fix typo

* fix typo

* fix map.at

* fix find

* fix test

* fix cinn cache key structure realize

* using ordered map for attributes

* add test by review advice

---------
Co-authored-by: Njiangcheng <thisjiang@qq.com>

21c6eccf

23 2月, 2023 9 次提交

C

[XPU] Migrate xpu_embedding_with_eltwise_add_fuse_pass (#50590) · 8d325d82
由 csy0225 提交于 2月 23, 2023

8d325d82

[Tensor API & Prim-Relevant] Unsupport prob Tensor API (#50756) · d7673e2f

由 HongyuJia 提交于 2月 23, 2023

* change phi tensor_gen->tensor_operants_gen

* [Tensor API] Support multiple Tensor C++ api

* [Tensor API] Unsupport prob Tensor API

* accept reviewers comment of #50731

* delete tensor_api.yaml

d7673e2f

[phi decoupling] move generator implementation from fluid to phi (#50746) · 4e417409

由 Huang Jiyi 提交于 2月 23, 2023

* move fluid generator to phi

* move fluid generator to phi

* update .gitignore

* fix bugs

* fix cannot find "glog/logging.h" in "generator.h"

* fix bugs

4e417409

R

fix bug that touch __init__.py (#50793) · e1956ab5
由 risemeup1 提交于 2月 23, 2023

e1956ab5

[Tensor Operants & Prim] Tensor arithmetic operants support right scalar type (#50563) · 5f5a2082

由 HongyuJia 提交于 2月 23, 2023

* polish namespace

* change static_tensor_operants

* polish namespace

* support add, subtract, divide

* add unit test

* polish unittest

* fix cmake error

* solve conflicts, merge auto code-gen

* add scalar operator in tensor.h

* tensorbase

* static prim full support more datatype

* fix prim unittest

* polish codes

* fix cmake error

5f5a2082

Y
[PHI Decoupling]Remove Profiler header (Part3) (#50721) · 8476c552
由 YuanRisheng 提交于 2月 23, 2023
```
* move profiler

* fix compile bugs
```
8476c552

Support 'complex promote' in yaml (#50611) · 91a3d159

由 HappyHeavyRain 提交于 2月 23, 2023

* support 'complex promote' in yaml

* change the compplex_promote

* change 'kron' in math.py

* change 'kron' comment in python

* change kron comment in python

* change kron comment in python

91a3d159

Z

[XPU] optimize multi_encoder_xpu_pass (#50759) · 5c9299e5
由 zhupengyang 提交于 2月 23, 2023

5c9299e5

kunlun support c_softmax_with_cross_entropy (#49934) · f43b5fe5

由 jameszhang 提交于 2月 23, 2023

* kunlun support c_softmax_with_cross_entropy

* fix grad calc error

* replace mutable_data() and ShareDataWith()

* update xdnn

* update xpu toolchain to 20230215

* remove fluid from test file

f43b5fe5

22 2月, 2023 6 次提交
- * remove broadcast (#50701) · 2fa91d71
  由 TaoTao Li 提交于 2月 22, 2023
  
  2fa91d71
- H
  [Tensor API] Support multiple Tensor C++ api (#50731) · 652d12cc
  由 HongyuJia 提交于 2月 22, 2023
```
* change phi tensor_gen->tensor_operants_gen

* [Tensor API] Support multiple Tensor C++ api
```
  652d12cc
- [Win]fix compile error due to depend xxhash (#50760) · a35dbc29
  由 zhouweiwei2014 提交于 2月 22, 2023
  
  a35dbc29
- S
  Fix some typos. (#50429) · 93b2bf4b
  由 Shuangchi He 提交于 2月 22, 2023
```
* Fix some typos.
Signed-off-by: Yulv-git <yulvchi@qq.com>

* pre-commit
Signed-off-by: Yulv-git <yulvchi@qq.com>

---------
Signed-off-by: Yulv-git <yulvchi@qq.com>
```
  93b2bf4b
- Z
  
  [XPU] link out_max to x_max between xpu_fusion_ops (#50690) · 1fd1c169
  由 zhupengyang 提交于 2月 22, 2023
  
  1fd1c169
- J
  【Prim】Add gather vjp (#50305) · 4db8e5c7
  由 Jiabin Yang 提交于 2月 22, 2023
```
* tmp gather vjp

* support gather

* remove useless code

* fix compiling error

* fix ut

* add eager test

* add eager test

* add seed

* fix cpu error

* fix transpose op compat

* remove tensor index case

* fix prim_cinn

* fix ut
```
  4db8e5c7
21 2月, 2023 6 次提交

Support bw invoke fw (#50260) · d8845735

由 HappyHeavyRain 提交于 2月 21, 2023

* support bw invoke fw

* fix scale in static_backward.yaml

* fix the bug in tensorrt/convert

* move 'scale','sign' into ops.yaml

* add scale_grad of scale in op_compat.yaml

* change generated_static_op in CMakeLists.txt

d8845735

Q

add c_reduce_sum/unstack/all_reduce_datatype for kunlun (#50606) · 397c9403
由 QingshuChen 提交于 2月 21, 2023

397c9403

[PHI Decoupling]Remove memory header (Part1) (#50419) · 1cfcb71d

由 YuanRisheng 提交于 2月 21, 2023

* decouple_memory

* perfect memory utils

* fix ci bugs

* fix inference bugs

* fix custom test bugs

* fix converage bugs

* modify code according comment

* modify namespace

* deal with compile bugs

1cfcb71d

[phi decoupling] move sequence_padding from fluid to phi (#50639) · 5f443601

由 Huang Jiyi 提交于 2月 21, 2023

* move sequence_padding to phi

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix buga

* fix bugs

* revert and update phi::XPUContext

5f443601

D
[Custom Device] Add static custom back_list (#50666) · d79d5933
由 duanyanhui 提交于 2月 21, 2023
```
* add static custom back_list

* rm comments

* fix log

* fix comment
```
d79d5933

Optimize the ernie inference performance on xpu backend. (#50357) · b39afb13

由 csy0225 提交于 2月 21, 2023

* Optimize the ernie inference performance on xpu

* fix enable runtime cache logic

* when op's input shape has changed, should create a new runtime context

* fix

* set flag when input shape has changed

b39afb13

20 2月, 2023 2 次提交
- J
  
  share_data interface support paddle.Tensor type (#50240) · 8ad635d5
  由 JingZhuangzhuang 提交于 2月 20, 2023
  
  8ad635d5
- S
  
  [XPU] fix fc_xpu_fuse_pass (#50569) · 77606f5d
  由 shentanyue 提交于 2月 20, 2023
  
  77606f5d

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功