提交 · f706d95dfe9301e18ee6575c3e58e7ba37d6e78a · BaiXuePrincess / Paddle

16 8月, 2022 4 次提交

convert multihead to oss (#45019) · f706d95d

由 feng_shuai 提交于 8月 16, 2022

* convert multihead to oss

* fix:bug

* fix:delete const cast

* fix:don't support bias_qk

* add vit pass

* fix:convert bug and add preln_residual_bias

* support length=-1

* add UT for convert

* add no_bias_qk support for gpu_multihead_op

* delete infer_shape depends on bias_qk

* oss just can be used in T4 and A*

* fix:change api for ROCM CI

f706d95d

A

support fp16 softmax on custom place (#45177) · a0bbfbd4
由 Aganlengzi 提交于 8月 16, 2022

a0bbfbd4
F
Fix problem that the shape of tensor is not inited correctly when backward in static graph (#45030) · e26f80ad
由 feifei-111 提交于 8月 16, 2022
```
* fix_shape

* code style

* fix assert

* fix to_tensor badreturn
```
e26f80ad

【autograd】add select_p、eq_p、pow_p primitive operator for new autograd (#44813) · b681c88c

由 Sing_chan 提交于 8月 16, 2022

* add select_p

* fix bugs

* add custom test for select_p; modify select_p primrules

* modify according to xiaoxu's comment

* add eq_p, select_p, pow_p, use autograd to test grad of high order

* add requirement of autograd, modify expected type of eq

* modify according to xiaoxu's comment

* import primops to use primops.pow

b681c88c

15 8月, 2022 4 次提交
- Y
  
  fused_embedding_eltwise_layernorm_op and skip_layernorm_op support fp16 (#44969) · ac0553a0
  由 Yuanle Liu 提交于 8月 15, 2022
  
  ac0553a0
- Z
  
  add mish and mish_grad for XPU, test=kunlun (#45098) · 6815c8ab
  由 zhangyikun02 提交于 8月 15, 2022
  
  6815c8ab
- H
  [XPU] add some collective ops. (#45049) · 7e2a20d5
  由 houj04 提交于 8月 15, 2022
```
* [XPU] add some collective ops. test=kunlun

* use XPUOpTestWrapper. test=kunlun

* skip kl1 for collective ops. fix typo: deivce -> device. test=kunlun
```
  7e2a20d5
- W
  convert_fp16 support multi block (#45050) · 9aecf286
  由 Wilber 提交于 8月 15, 2022
```
* convert_fp16 support multi block

* update

* update
```
  9aecf286
12 8月, 2022 6 次提交

Offload calculations from matmul op to fuse pass (#44941) · acb78ea2

由 Sławomir Siwek 提交于 8月 12, 2022

* remove v2_transpose_reshape

* matmul_transpose_reshape

* reshape_transpose_matmul

* Add int8 support for matmulV2

* restore ut

* adjust old ut

* restore parallel UT ruels

* remove mkldnn code from base ops

* move enforces to pass

* remove duplicated functions

* delete duplicated enforces

* feedback from review

* add comments to variables

* enable eltwise support

* dynamic attribute

* remove fusepass tests from op test

* remove fuse pass cases from op test

* revert introduction of dynamic attributes

* style
Co-authored-by: Nwozna <joanna.wozna@intel.com>

acb78ea2

transfer memcpy_h2d from fluid to phi (#44932) · 7bc57d35

由 kangguangli 提交于 8月 12, 2022

* transfer memcpy_h2d from fluid to phi

* use UnchangedInferMeta instead

* restore test_standalone_executor

* add newline to fix codestyle check

* rename pt -> phi

* simplify logic and add check

* make the comment more clear

* remove useless comment

* refine code

7bc57d35

Y
trt engine input data type should be consistent with trt input bindin… (#45103) · a3eb341e
由 Yuanle Liu 提交于 8月 12, 2022
```
* trt engine input data type should be consistent with trt input bindings type

* fix some bugs

* fix some bugs

* fix some bugs
```
a3eb341e
D
enhance grid_sampler to support 3d input (#45015) · 1773fbba
由 duanyanhui 提交于 8月 12, 2022
```
* enhance grid_sampler to support 3d input
```
1773fbba
Z

fix extra output of kernels for inference (#45048) · 1cb883da
由 zyfncg 提交于 8月 12, 2022

1cb883da

[geometric]Add paddle.geometric.send_ue_recv API (#43174) · 615b15a3

由 Siming Dai 提交于 8月 12, 2022

* add init file

* add op definition and infermeta

* add kernel definition funcs

* add broadcast infer shape

* add gpu forward kernel

* delete SUB and DIV

* add x_grad

* add template

* add e_grad for min and max

* fix small bug

* temp commit

* temp commit

* add e_grad for sum and mean

* fix some compile bug

* fix compile bugs

* fix compile problem

* add sum forward unittest

* fix broadcast error, add kernel sig, register e_grad, change unit test

* fix grad

* add temp grad fix

* temp commit

* add min max unittest

* add max, min unittest, fix mul bug

* add cpu forward sum and mean

* add forward min max, fix mean unittest

* add cpu backward min max

* fix code-style

* add backward sum mean

* fix rocm ci

* set uniitest timeout

* fix bug of x broadcast to e, gpu grad

* fix bug of x broadcast to e, cpu grad

* rename BOOST_GET_CONST macro

* fix rocm ci

* mv graph_send_e_recv to graph_send_ue_recv

* move out_size to IntArray

* add eager op test

* fix max pool type bug, add unittest for api

* revise api doc

* add fp16 for atomic min and max, add unittest

* add unittest

* add fp16 support for graph_send_recv

* fix unittest fp16 bug

* change OutSizeTensor to Out_size

* move E to Y

* add copyright, fix comment

* review code

* fix thread block size

* fix thread block size

* change api attribute name: pool_type to reduce_op, compute_type to message_op

* change api attribute name, move pool_type to reduce_op, move compute_type to message_op

615b15a3

11 8月, 2022 1 次提交
- C
  make affine_grid_op support 5d input_dim on cpu and gpu (#45012) · 7812522c
  由 carryyu 提交于 8月 11, 2022
```
* make affine_grid_op support 5d_input on cpu and gpu
```
  7812522c
10 8月, 2022 4 次提交
- Y
  
  fix mkldnn interpolate ops (#45008) · 3f49817a
  由 yeliang2258 提交于 8月 10, 2022
  
  3f49817a
- D
  [phi] migration of class center sample infermeta (#45025) · b1e33bea
  由 duanboqiang 提交于 8月 10, 2022
```
* add class center sample infershape

* add yaml

* modify unittest

* modify unittest

* remove comment
```
  b1e33bea
- fix bug of adaptive pool2d_grad, *test=kunlun (#45031) · 01d05bc0
  由 z8hanghuan 提交于 8月 10, 2022
```
* fix bug of adaptive pool2d_grad, *test=kunlun

* fix bug of adaptive pool2d_grad, *test=kunlun

* fix bug of adaptive pool2d_grad, *test=kunlun
```
  01d05bc0
- A
  [OpAttr]Support VarDesc* and vector<VarDesc*> in Attribute (#44737) · 81d6fa6c
  由 Aurelius84 提交于 8月 10, 2022
```
* [OpAttr]Support VarDesc* and vector<VarDesc*> in Attribute

* add unittest for inference predictor
```
  81d6fa6c
09 8月, 2022 7 次提交
- S
  [geometric]Add paddle.geometric.send_u_recv API (#44580) · 34b43555
  由 Siming Dai 提交于 8月 09, 2022
```
* change out_size to INTArray

* fix out_size eager bug

* add unittest for out_size tensor

* add deprecated for paddle.incubate.graph_send_recv, add paddle.geometric.send_u_recv and unittests

* fix lowest bug

* fix according review comment

* add default value in yaml

* change api file name

* change name
```
  34b43555
- C
  move api(erfinv) from legacy_api.yaml to api.yaml (#44987) · 76e0926c
  由 Charles-hit 提交于 8月 09, 2022
```
* move api(erfinv) from legacy_api.yaml to api.yaml

* change inplace_map key
```
  76e0926c
- D
  [phi]migrate class center sample kernel (#44949) · a46d7fe6
  由 duanboqiang 提交于 8月 09, 2022
```
* migrate class center sample kernel

* fix Resize ddim error

* set buffer ptr

* add header

* add header

* remove comment

* remove header
```
  a46d7fe6
- Y
  
  fix vol2col (#44998) · ecc3098e
  由 yeliang2258 提交于 8月 09, 2022
  
  ecc3098e
- D
  [phi] migrate margin infer shape and yaml (#44940) · 6d5744b4
  由 duanboqiang 提交于 8月 09, 2022
```
* add margin infer

* migrate yaml

* modify unittests script
```
  6d5744b4
- Y
  Fix a bug in transpose2 when run native cpu (#44659) · 8185cecd
  由 yeliang2258 提交于 8月 09, 2022
```
* fix a bug in transpose2 about mkldnn

* fix bug
```
  8185cecd
- A
  
  fix format for paddle/phi/api/lib/tensor.cc (#44972) · b54abbe8
  由 Allen Guo 提交于 8月 09, 2022
  
  b54abbe8
08 8月, 2022 6 次提交

【autograd】add log_p primitive operator for new autograd (#44779) · 463fc15e

由 Sing_chan 提交于 8月 08, 2022

* add log_p for auto_grad

* add log_p_op.cc in prim_op_test srcs

* fix bug of wrong op name; add test in test_primops

* add test case of log in testprimapi

* fix bug of test_without_guard

* no need to fix test_without_guard

463fc15e

[phi] Transfer fluid fill_any to PHI fill (#44879) · ad716551

由 HongyuJia 提交于 8月 08, 2022

* transfer kernel, make complete

* add fill_sig file

* fix code style

* fix fill_sig, add yaml, modify python API

* fix inplace, add inplace testcase

* deprecated_op_names append fill

* resolve comments, add test_backward

ad716551

Lml/fix utf8 bug windows (#44945) · cf5742ac

由 levi131 提交于 8月 08, 2022

* for test

* Revert "for test"

This reverts commit baf58738ca485a06073d771e20e3644d8811bf31.

* fix utf8 bug on windows

cf5742ac

T

move lamb_op to phi (#44899) · 4a7aa7c3
由 Thomas Young 提交于 8月 08, 2022

4a7aa7c3
F

[MLU] fix bn_grad and hard_sigmoid_grad error (#44919) · 8573ca54
由 fwenguang 提交于 8月 08, 2022

8573ca54
L
clean includes of tensor.h (#44928) · ee9ea48d
由 Leo Chen 提交于 8月 08, 2022
```
* clean tensor.h

* fix gather_nd
```
ee9ea48d

05 8月, 2022 7 次提交

fix 5 operator makers with typos which pass string literal to argument... · ce9d2a9e

由 Feiyu Chan 提交于 8月 05, 2022

fix 5 operator makers with typos which pass string literal to argument 'generated', remove generated as parameter of AddAttr (#44935)

ce9d2a9e

[MKLDNN]Move mkldnn activation kernel to phi (#44365) · 2dfa88d2

由 YuanRisheng 提交于 8月 05, 2022

* move mkldnn activation kernel

* fix compile bugs

* fix compile bugs

* deal with conflict

* fix compile bugs

* fix windows compile bugs

* mkldnn unittest fix

* change mutable to alloc

* fix unittest bugs

* modify code according comment

2dfa88d2

J

Add int8 support for matmulV2 (#44908) · f3c14762
由 joanna.wozna.intel 提交于 8月 05, 2022

f3c14762

migrate kernel (#44841) · 62a98130

由 duanboqiang 提交于 8月 05, 2022

* migrate kernel

* fix sig order

* remove header files

* remove header

* remove header

* modify logits grad

62a98130

C
enhance fused_multi_transformer_op(post_layer_norm) (#44789) · 643c94e4
由 carryyu 提交于 8月 05, 2022
```
* add fused_multi_transformer post_layer_norm

* add test post_layer_norm
```
643c94e4

update trt workspace size param (#44469) · bdce552b

由 Zhang Jun 提交于 8月 05, 2022

* update trt workspace size param

* update

* update

* update

* use int64_t

* use int64_t

* upate

* update

bdce552b

move fft kernels to phi (#44714) · 153f1138

由 Feiyu Chan 提交于 8月 05, 2022

* move fft kernels to phi, done with cufft, pocketfft, mkl_cdft, hipfft
* make stft_op use fft from phi/kernels/funcs, clean code

153f1138

04 8月, 2022 1 次提交

Matmuls with activation and elementwise_add fuses (#44655) · 0420d514

由 Sławomir Siwek 提交于 8月 04, 2022

* Add unit tests

* matmul_v2 + activation

* matmuls + elementwise_add

* matmul_v2 postops

* transform matmul to v2

* opcompat

* fix fusing matmul with multipe outs

* add shape constraints

* remove unused vars

* change pass order

* - Unit tests to be debugged

- fix

- refactor

- diagnostic

- more diagnostic

- fix

- Fix number two

- fix

- fix

- fix

- alpha added

- more fixes

- compilation fix

- removed diagnostic code

- cosmetic fixes

* lint

* add alpha constraint

* merge matmul refactor

* trigger CI

* - fix

* - another fix

* code style

* add support for matmul+elementwise_add+activation

* code style

* fix bfloat16 bugs

* change append_binary to append_sum
Co-authored-by: NJacek Czaja <jacek.czaja@intel.com>

0420d514

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致