提交 · a36cdd6b6cd95b0a4c708661dc22782481e38e7e · PaddlePaddle / Paddle

24 2月, 2023 1 次提交
- R
  [XPU] add expand_grad, isnan, meshgrid kernels (#50774) · 7271de88
  由 ronnywang 提交于 2月 24, 2023
```
* [XPU] add expand_grad, isnan, meshgrid kernels

* update
```
  7271de88
23 2月, 2023 2 次提交
- C
  
  [XPU] Migrate xpu_embedding_with_eltwise_add_fuse_pass (#50590) · 8d325d82
  由 csy0225 提交于 2月 23, 2023
  
  8d325d82
- J
  kunlun support c_softmax_with_cross_entropy (#49934) · f43b5fe5
  由 jameszhang 提交于 2月 23, 2023
```
* kunlun support c_softmax_with_cross_entropy

* fix grad calc error

* replace mutable_data() and ShareDataWith()

* update xdnn

* update xpu toolchain to 20230215

* remove fluid from test file
```
  f43b5fe5
22 2月, 2023 1 次提交
- H
  
  [XPU] add fp16 support for assign. update xccl to 1.0.9. (#50702) · 613a3ffe
  由 houj04 提交于 2月 22, 2023
  
  613a3ffe
21 2月, 2023 2 次提交
- Q
  
  add c_reduce_sum/unstack/all_reduce_datatype for kunlun (#50606) · 397c9403
  由 QingshuChen 提交于 2月 21, 2023
  
  397c9403
- Z
  
  pad3d support fp16 for xpu (#50653) · e84fa263
  由 zhangyikun02 提交于 2月 21, 2023
  
  e84fa263
20 2月, 2023 2 次提交
- H
  
  [XPU] add fp16 support for tril_triu. add index_sample op. (#50655) · 47306c58
  由 houj04 提交于 2月 20, 2023
  
  47306c58
- H
  
  [XPU] add fp16 support for top_k_v2, squeeze2 and argsort. (#50614) · 689de12c
  由 houj04 提交于 2月 20, 2023
  
  689de12c
17 2月, 2023 2 次提交
- H
  [XPU] add fp16 support for cumsum and log (#50599) · 3027c58a
  由 houj04 提交于 2月 17, 2023
```
* [XPU] add fp16 support for cumsum and log.

* [XPU] add fp16 support for cumsum and log.
```
  3027c58a
- Z
  [XPU] add multi_encoder_xpu_slice_fuse_pass, generate_sequence_xpu_fuse_pass,... · 61469eec
  由 zhupengyang 提交于 2月 17, 2023
```
[XPU] add multi_encoder_xpu_slice_fuse_pass, generate_sequence_xpu_fuse_pass, generate_sequence_xpu kernel (#50570)
```
  61469eec
16 2月, 2023 3 次提交
- S
  [XPU][Fleet] Support multi-card infer for xpu (#50490) · 517d8074
  由 shentanyue 提交于 2月 16, 2023
```
* support xpu multi-card infer

* add ut

* clean code

* clean code

* fix

* fix

* fix

* fix
```
  517d8074
- R
  [XPU] add group_norm, sin, cos, linspace, randint kernels (#50465) · c86a5140
  由 ronnywang 提交于 2月 16, 2023
```
* [XPU] add group_norm kernel

* update

* add xpu sin, cos, randint, linspace kernels

* update

* update
```
  c86a5140
- Z
  
  [XPU] fix dropout pass; add multi_encoder_xpu_fuse_pass & multi_encoder_xpu kernel (#50499) · c8aa6405
  由 zhupengyang 提交于 2月 16, 2023
  
  c8aa6405
15 2月, 2023 2 次提交
- Z
  
  add gather_nd_grad op and where_grad support zero_dim for xpu (#50454) · 055d0c2d
  由 zhangyikun02 提交于 2月 15, 2023
  
  055d0c2d
- Q
  
  remove duplicated op in xpu2_op_list (#50450) · 47c23ccb
  由 QingshuChen 提交于 2月 15, 2023
  
  47c23ccb
13 2月, 2023 1 次提交

add xpu pool3d kernels (#50233) · 1281b612

由 ykkk2333 提交于 2月 13, 2023

* add xpu adagrad and where_grad kernels, test=kunlun

* add xpu pool3d kernels, test=kunlun

1281b612

10 2月, 2023 3 次提交
- L
  Fix bugs and add unit tests in instance_norm_grad_kernel when d_scale and (#50394) · 4c373e6b
  由 Leo Guo 提交于 2月 10, 2023
```
d_bias are nullptr. Modify the code style of full_kernel.cc. Add new data
type for concat, elementwise_add, gather, scale, scatter ops. test=kunlun
```
  4c373e6b
- Z
  
  [XPU] add fc_xpu op&pass to optimize ernie model (#50277) · 945f918c
  由 zhupengyang 提交于 2月 10, 2023
  
  945f918c
- W
  
  [XPU] bind op: atan & deformable_conv_v1 (#50373) · e15ef948
  由 wangshengxiang 提交于 2月 10, 2023
  
  e15ef948
09 2月, 2023 2 次提交
- L
  
  Modify full kernel for xpu. test=kunlun (#50209) · 18e0e01d
  由 Leo Guo 提交于 2月 09, 2023
  
  18e0e01d
- Z
  
  add logical_and, logical_or and logical_xor for xpu (#50228) · 0036316e
  由 zhangyikun02 提交于 2月 09, 2023
  
  0036316e
01 2月, 2023 1 次提交
- Z
  
  support grid_sampler_grad op for XPU (#49857) · 520f48d6
  由 zhangyikun02 提交于 2月 01, 2023
  
  520f48d6
31 1月, 2023 1 次提交
- W
  
  bind pixel_shuffle & pixel_shuffle_grad op for xpu (#50090) · a5f2e1f7
  由 wangshengxiang 提交于 1月 31, 2023
  
  a5f2e1f7
19 1月, 2023 1 次提交

[KUNLUN] add op: maxpool_with_index (#49505) · f71f77e9

由 jameszhang 提交于 1月 19, 2023

* [KUNLUN] add op: maxpool_with_index

* use DeviceContext::Alloc() instead of DenseTensor::mutable_data()

* fix file format

* solve clip unittest failure

* minor fix

* Revert "solve clip unittest failure" since the issue is fixed
in #49535

This reverts commit 1127adc66e79afe35ac3c00bb34e6aaa7cd7d78b.

* align with xdnn on the definition of mask in max_pool_with_index

* minor

f71f77e9

18 1月, 2023 2 次提交

[PHI] remove bitwise and, or, xor (#49916) · 9056cc8b

由 RuohengMa 提交于 1月 18, 2023

* add reduce_sum_int64 and reduce_sum_int8 xpu kernels

* [PHI] add clip grad kernel with support type float32 and int32

* [PHI unittest] add clip_grad unit test

* adapt code to clang-format

* update xpu api output with clip_grad api

* remove int8 support of reduce_sum xpu kernel since it can not pass unit tests

* adapt license date, add code for XPUDataType convertion

* add int8 support of reduce_sum

* add reduce_sum unit tests for dtype int64, int8, and add more test cases

* update license date

* remove buggy bitwise and, or and xor xpu kernels, refine bitwise not xpu kernel

* change license date

9056cc8b

H

[XPU] add logical_not op. (#49911) · 60d1199a
由 houj04 提交于 1月 18, 2023

60d1199a

16 1月, 2023 1 次提交
- Q
  
  add prod for kunlun (#49816) · bd03652f
  由 QingshuChen 提交于 1月 16, 2023
  
  bd03652f
13 1月, 2023 4 次提交
- J
  kunlun add support for c_concat and c_split (#49757) · a09b9a3f
  由 jameszhang 提交于 1月 13, 2023
```
* kunlun add support for c_concat and c_split

* replace mutable_data() and ShareDataWith()
```
  a09b9a3f
- Y
  
  add xpu adagrad and where_grad kernels (#49701) · a99c3cd4
  由 ykkk2333 提交于 1月 13, 2023
  
  a99c3cd4
- J
  fix xpu unittest issue (#49760) · ddc8a726
  由 jameszhang 提交于 1月 13, 2023
```
* fix xpu unittest issue: zero_dim_tensor

* deal with leftout issue introduced by #49470
```
  ddc8a726
- W
  
  add prelu & prelu_grad op for xpu (#49672) · 8d512b8f
  由 wangshengxiang 提交于 1月 13, 2023
  
  8d512b8f
12 1月, 2023 3 次提交
- Y
  
  deal with conflict (#49766) · 27aec62b
  由 YuanRisheng 提交于 1月 12, 2023
  
  27aec62b
- L
  Fix the bugs of set_value and set_value_grad ops and add register in (#49750) · 438975fd
  由 Leo Guo 提交于 1月 12, 2023
```
xpu2_op_list.cc. test=kunlun
```
  438975fd
- Y
  [PHI]Rename some PHI Kernel (#49470) · 30f5e39b
  由 YuanRisheng 提交于 1月 12, 2023
```
* rename kernel

* delete sig

* modify code according comment

* fix ci bugs
```
  30f5e39b
09 1月, 2023 2 次提交
- Q
  
  add fill/fill_any for kunlun (#49645) · 31ea3231
  由 QingshuChen 提交于 1月 09, 2023
  
  31ea3231
- Y
  [XPU] add einsum fill diagonal and diagonal kernels (#49465) · a5bf156b
  由 ykkk2333 提交于 1月 09, 2023
```
* migrate shaple sgd, split,sign xpu kernels to phi, test=kunlun

* fix dlrm throughput problem, test=kunlun

* add xpu einsum, fill_diagonal, and diagonal kernels, test=kunlun
```
  a5bf156b
06 1月, 2023 1 次提交

Dev (#49591) · 07db4a9f

由 RuohengMa 提交于 1月 06, 2023

* add bitwise and, bitwise not, bitwise or and bitwise xor

* correct typo

07db4a9f

27 12月, 2022 1 次提交
- Z
  
  add unbind op for xpu (#49356) · 16931039
  由 zhangyikun02 提交于 12月 27, 2022
  
  16931039
26 12月, 2022 1 次提交

fix dlrm qpsproblem (#49171) · c8f76337

由 ykkk2333 提交于 12月 26, 2022

* migrate shaple sgd, split,sign xpu kernels to phi, test=kunlun

* fix dlrm throughput problem, test=kunlun

c8f76337

23 12月, 2022 1 次提交
- H
  
  square_grad support fp16 *test=kunlun (#48847) · ae544586
  由 haosicheng 提交于 12月 23, 2022
  
  ae544586

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功