Commits · 88d42398557899d44e2e1a1e1ea3df5af64c2dfb · PaddlePaddle / Paddle

13 Mar, 2023 · 1 commit
- [PHI] Add reduce_min_grad xpu op and the corresponding unittest (#51431) · 88d42398
  Committed by RuohengMa on Mar 13, 2023
```
* [XPU] add reduce_min_grad XPU kernel

* add unittest for reduce_min xpu op
```
10 Mar, 2023 · 5 commits
- [XPU] EmbeddingWithEltwiseAddXpuKernel support FP16 (#51426) · 1a8cc15e
  Committed by shentanyue on Mar 10, 2023
- support c_embedding_grad for kunlun (#51399) · cb7fd370
  Committed by QingshuChen on Mar 10, 2023
- add xpu tile and concat kernel int64, test=kunlun (#51349) · 04f56338
  Committed by ykkk2333 on Mar 10, 2023
- add index_select_grad for xpu (#51342) · b33673be
  Committed by zhangyikun02 on Mar 10, 2023
- Xpu ernie3: support fp16 for xpu kernels: full_like/stack/where (#51271) · 374e757f
  Committed by mayang002 on Mar 10, 2023
```
* [xpu-ernie3] support fp16 for full_like/stack/where xpu kernels

* [xpu-ernie3] support fp16 for full_like/stack/where xpu kernels
```
01 Mar, 2023 · 1 commit
- [XPU] Add kernels for VITDET (#50992) · 798b527c
  Committed by duanyanhui on Mar 01, 2023

* add support of int64 add for xpu

* add transpose support for int64

* add randperm kernel

* fix randperm

* add distribute_fpn_proposal kernel

* fix comment

* add reduce_sum_int32

28 Feb, 2023 · 2 commits
- [XPU] support convert fp16 model (#50790) · f265a313
  Committed by zhupengyang on Feb 28, 2023
- xpu gaussian_random support fp16 (#50881) · 569b018e
  Committed by shentanyue on Feb 28, 2023
27 Feb, 2023 · 2 commits
- [XPU] add fp16 support for shape and lookup_table_v2 op. (#50773) · d2a0577a
  Committed by houj04 on Feb 27, 2023
```
* [XPU] add fp16 support for shape op.

* [XPU] add fp16 support for lookup_table_v2 op.

* update approval list: add qingshu's id.
```
- xpu: bind op scatter_nd_add. add data type for transpose2, clip & assign_value (#50825) · 0d12afea
  Committed by wangshengxiang on Feb 27, 2023
```
* [XPU] bind op scatter_nd_add

* [XPU] add more data type for op: clip, transpose2 & assign_value
```
24 Feb, 2023 · 1 commit
- [XPU] add expand_grad, isnan, meshgrid kernels (#50774) · 7271de88
  Committed by ronnywang on Feb 24, 2023
```
* [XPU] add expand_grad, isnan, meshgrid kernels

* update
```
23 Feb, 2023 · 2 commits
- [XPU] Migrate xpu_embedding_with_eltwise_add_fuse_pass (#50590) · 8d325d82
  Committed by csy0225 on Feb 23, 2023
- kunlun support c_softmax_with_cross_entropy (#49934) · f43b5fe5
  Committed by jameszhang on Feb 23, 2023
```
* kunlun support c_softmax_with_cross_entropy

* fix grad calc error

* replace mutable_data() and ShareDataWith()

* update xdnn

* update xpu toolchain to 20230215

* remove fluid from test file
```
22 Feb, 2023 · 1 commit
- [XPU] add fp16 support for assign. update xccl to 1.0.9. (#50702) · 613a3ffe
  Committed by houj04 on Feb 22, 2023
21 Feb, 2023 · 2 commits
- add c_reduce_sum/unstack/all_reduce_datatype for kunlun (#50606) · 397c9403
  Committed by QingshuChen on Feb 21, 2023
- pad3d support fp16 for xpu (#50653) · e84fa263
  Committed by zhangyikun02 on Feb 21, 2023
20 Feb, 2023 · 2 commits
- [XPU] add fp16 support for tril_triu. add index_sample op. (#50655) · 47306c58
  Committed by houj04 on Feb 20, 2023
- [XPU] add fp16 support for top_k_v2, squeeze2 and argsort. (#50614) · 689de12c
  Committed by houj04 on Feb 20, 2023
17 Feb, 2023 · 2 commits
- [XPU] add fp16 support for cumsum and log (#50599) · 3027c58a
  Committed by houj04 on Feb 17, 2023
```
* [XPU] add fp16 support for cumsum and log.

* [XPU] add fp16 support for cumsum and log.
```
- [XPU] add multi_encoder_xpu_slice_fuse_pass, generate_sequence_xpu_fuse_pass,... · 61469eec
  Committed by zhupengyang on Feb 17, 2023
```
[XPU] add multi_encoder_xpu_slice_fuse_pass, generate_sequence_xpu_fuse_pass, generate_sequence_xpu kernel (#50570)
```
16 Feb, 2023 · 3 commits
- [XPU][Fleet] Support multi-card infer for xpu (#50490) · 517d8074
  Committed by shentanyue on Feb 16, 2023
```
* support xpu multi-card infer

* add ut

* clean code

* clean code

* fix

* fix

* fix

* fix
```
- [XPU] add group_norm, sin, cos, linspace, randint kernels (#50465) · c86a5140
  Committed by ronnywang on Feb 16, 2023
```
* [XPU] add group_norm kernel

* update

* add xpu sin, cos, randint, linspace kernels

* update

* update
```
- [XPU] fix dropout pass; add multi_encoder_xpu_fuse_pass & multi_encoder_xpu kernel (#50499) · c8aa6405
  Committed by zhupengyang on Feb 16, 2023
15 Feb, 2023 · 2 commits
- add gather_nd_grad op and where_grad support zero_dim for xpu (#50454) · 055d0c2d
  Committed by zhangyikun02 on Feb 15, 2023
- remove duplicated op in xpu2_op_list (#50450) · 47c23ccb
  Committed by QingshuChen on Feb 15, 2023
13 Feb, 2023 · 1 commit
- add xpu pool3d kernels (#50233) · 1281b612
  Committed by ykkk2333 on Feb 13, 2023

* add xpu adagrad and where_grad kernels, test=kunlun

* add xpu pool3d kernels, test=kunlun

10 Feb, 2023 · 3 commits
- Fix bugs and add unit tests in instance_norm_grad_kernel when d_scale and (#50394) · 4c373e6b
  Committed by Leo Guo on Feb 10, 2023
```
d_bias are nullptr. Modify the code style of full_kernel.cc. Add new data
type for concat, elementwise_add, gather, scale, scatter ops. test=kunlun
```
- [XPU] add fc_xpu op&pass to optimize ernie model (#50277) · 945f918c
  Committed by zhupengyang on Feb 10, 2023
- [XPU] bind op: atan & deformable_conv_v1 (#50373) · e15ef948
  Committed by wangshengxiang on Feb 10, 2023
09 Feb, 2023 · 2 commits
- Modify full kernel for xpu. test=kunlun (#50209) · 18e0e01d
  Committed by Leo Guo on Feb 09, 2023
- add logical_and, logical_or and logical_xor for xpu (#50228) · 0036316e
  Committed by zhangyikun02 on Feb 09, 2023
01 Feb, 2023 · 1 commit
- support grid_sampler_grad op for XPU (#49857) · 520f48d6
  Committed by zhangyikun02 on Feb 01, 2023
31 Jan, 2023 · 1 commit
- bind pixel_shuffle & pixel_shuffle_grad op for xpu (#50090) · a5f2e1f7
  Committed by wangshengxiang on Jan 31, 2023
19 Jan, 2023 · 1 commit
- [KUNLUN] add op: maxpool_with_index (#49505) · f71f77e9
  Committed by jameszhang on Jan 19, 2023

* [KUNLUN] add op: maxpool_with_index

* use DeviceContext::Alloc() instead of DenseTensor::mutable_data()

* fix file format

* solve clip unittest failure

* minor fix

* Revert "solve clip unittest failure" since the issue is fixed
in #49535

This reverts commit 1127adc66e79afe35ac3c00bb34e6aaa7cd7d78b.

* align with xdnn on the definition of mask in max_pool_with_index

* minor

18 Jan, 2023 · 2 commits
- [PHI] remove bitwise and, or, xor (#49916) · 9056cc8b
  Committed by RuohengMa on Jan 18, 2023

* add reduce_sum_int64 and reduce_sum_int8 xpu kernels

* [PHI] add clip grad kernel with support type float32 and int32

* [PHI unittest] add clip_grad unit test

* adapt code to clang-format

* update xpu api output with clip_grad api

* remove int8 support of reduce_sum xpu kernel since it can not pass unit tests

* adapt license date, add code for XPUDataType convertion

* add int8 support of reduce_sum

* add reduce_sum unit tests for dtype int64, int8, and add more test cases

* update license date

* remove buggy bitwise and, or and xor xpu kernels, refine bitwise not xpu kernel

* change license date

- [XPU] add logical_not op. (#49911) · 60d1199a
  Committed by houj04 on Jan 18, 2023

16 Jan, 2023 · 1 commit
- add prod for kunlun (#49816) · bd03652f
  Committed by QingshuChen on Jan 16, 2023
13 Jan, 2023 · 2 commits
- kunlun add support for c_concat and c_split (#49757) · a09b9a3f
  Committed by jameszhang on Jan 13, 2023
```
* kunlun add support for c_concat and c_split

* replace mutable_data() and ShareDataWith()
```
- add xpu adagrad and where_grad kernels (#49701) · a99c3cd4
  Committed by ykkk2333 on Jan 13, 2023

PaddlePaddle / Paddle: last synced successfully about 1 year ago
