Commits · de436f07904b09ea401de72c82010709df8084fe · BaiXuePrincess / Paddle

29 Aug, 2022 2 commits
- Move matmul_v2 kernel of xpu from fluid to phi (#45446) · de436f07
  Committed by zyfncg on Aug 29, 2022
```
* move matmul_v2 kernel of xpu from fluid to phi, test=kunlun

* fix compile bug, test=kunlun

* fix compile bug, test=kunlun

* fix compile bug, test=kunlun
```
- [XPU] migrate bce_loss to phi;test=kunlun (#45459) · d3ec3fe3
  Committed by Weilong Wu on Aug 29, 2022
  
26 Aug, 2022 4 commits
- Move conv2d_transpose_grad XPU kernel to PHI, test=kunlun (#45466) · a635a8a5
  Committed by Ruibiao Chen on Aug 26, 2022
  
- Move grid_sample XPU kernel to PHI, test=kunlun (#45425) · 2c89bccb
  Committed by Ruibiao Chen on Aug 26, 2022
  
- Move conv2d_transpose XPU kernel to PHI, test=kunlun (#45419) · 1f1a7835
  Committed by Ruibiao Chen on Aug 26, 2022
  
- Transfer transfer_layout from fluid to phi (#45261) · 985f2a4a
  Committed by kangguangli on Aug 26, 2022
```
* remove fluid kernel and activate phi kernel

* fix parameter error

* transfer mkldnn part

* modify header file path

* fix compile error

* transfer special case

* fix lod setting and special case for layout setting

* add testcase and refine code
```
25 Aug, 2022 10 commits

Enable OMP multithreading in lookup_table_v2 (#45249) · 0c363de8

Committed by piotrekobi on Aug 25, 2022

* Add omp parallel for directives

* Revert "Add omp parallel for directives"

This reverts commit f4e4f8ddb12454018d9c1e49c074af2543659de6.

* Add #pragma omp parallel for to correct file

* Add check for _OPENMP definition

* Disable omp on gpu

* Trigger CI

* Readd check for _OPENMP definition

* Change macro disabling changes on GPU

* Improve macro readability
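
The `_OPENMP` guard mentioned in the bullets above can be sketched as follows. This is an illustrative sketch only, not the code from #45249: `LookupRows`, its parameters, and the row-major table layout are hypothetical names chosen for the example.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical helper, not Paddle's kernel: copies row ids[i] of a
// row-major table (num_rows x width) into row i of the output.
std::vector<float> LookupRows(const std::vector<float>& table,
                              std::size_t width,
                              const std::vector<std::int64_t>& ids) {
  assert(width > 0 && table.size() % width == 0);
  const std::int64_t num_rows = static_cast<std::int64_t>(table.size() / width);
  const std::int64_t n = static_cast<std::int64_t>(ids.size());
  std::vector<float> out(ids.size() * width);

  // Each iteration writes a disjoint output row, so iterations are
  // independent and the loop can be parallelized. The guard keeps the
  // pragma out of builds that do not enable OpenMP.
#ifdef _OPENMP
#pragma omp parallel for
#endif
  for (std::int64_t i = 0; i < n; ++i) {
    assert(ids[i] >= 0 && ids[i] < num_rows);
    const std::size_t src_row = static_cast<std::size_t>(ids[i]);
    const std::size_t dst_row = static_cast<std::size_t>(i);
    std::memcpy(out.data() + dst_row * width,
                table.data() + src_row * width,
                width * sizeof(float));
  }
  return out;
}
```

In a build without OpenMP the pragma is compiled out and the loop simply runs serially, so the result is the same either way.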

[OpAttr]axis of Reverse Support Tensor type (#45391) · 91110661
Committed by Aurelius84 on Aug 25, 2022
```
* [OpAttr]axis of Reverse Support Tensor type

* fix coverage

* fix unittest
```
[OpAttr]min/max of uniform_random support Tensor type (#45417) · c8955d0d
Committed by Aurelius84 on Aug 25, 2022
```
* [OpAttr]min/max of Uniform_rand support Tensor type

* fix typo
```
Transfer memcpy d2h from fluid to phi (#45150) · 0d14e74a

Committed by kangguangli on Aug 25, 2022

* transfer memcpy_d2h from fluid to phi

* refine arg check and add comment

* fix cannot fallback to phi kernel

* fix gpu_context host alloc when tensor size = 0

* add kernel for std::vector<DenseTensor> args

* fix bugs in MemcpyD2HMultiIOKernel

* remove useless header file

* polish format

* fix typo

* add testcase for cudapinned place

* refine check condition in test

* polish error message

* polish error message

* remove header in fluid directory

* merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation

* fix code style check

make full_like support double_max in dygraph (#45385) · edd66f2e
Committed by Sing_chan on Aug 25, 2022
```
* make full_like support double_max in dygraph

* fix bug
```
[Eager] sync_batch_norm_grad delete mean and variance (#45411) · 5df464fe
Committed by wanghuancoder on Aug 25, 2022
```
* sync_batch_norm_grad delete mean and variance
```
optimize conv algo cache (#41891) · 1cd7e68b

Committed by hong on Aug 25, 2022

* optimize conv algo speed

* code polish

* remove useless code

* fix compile error

* fix cpu compile error

* not use cudnn algo t

* add search cache max number

* polish code

* fix cache test bug

* add groups data format to conv args

* fix cache test bug

* fix cudnn_deterministic bug

* fix test switch auto tune bug

* fix test switch autotune bug

* fix conv cache bug

* fix cache test error

* fix cache test bug

* fix windows mac compile error

* fix workspace search error

* update cudnn cache

* fix cache test bug; test=develop

* fix autotune switch test error

* polish code

* polish code

[triu_indices] add triu_indices_op (#45168) · a410c397
Committed by Rayman on Aug 25, 2022

Fix unique_kernel bugs (#45032) · ea1f4702
Committed by sprouteer on Aug 25, 2022
```
* fix unique_kernel bugs

* fix unique kernel cu bugs
```
add temporal shift and grad *test=kunlun (#45300) · 63d9a175
Committed by haosicheng on Aug 25, 2022

24 Aug, 2022 4 commits

make tensor_util contains no cuda code (#45256) · 78916a7a

Committed by Leo Chen on Aug 24, 2022

* make tensor_util contains no cuda code

* refine isfinite

* revert ut

* move isfinite function to its op

* fix test

* fix compile

* std::isnan is not defined for int type on Windows

* fix windows compile

* fix fp16

* fix rocm compile

* revert gradient node
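
One common way to handle the `std::isnan` bullet above is to dispatch on the value type so that integer inputs never reach `std::isnan`. The sketch below assumes that approach; `IsNan` is a hypothetical helper, not the function used in #45256.

```cpp
#include <cmath>
#include <type_traits>

// Hypothetical helper, not Paddle's code. Integer values can never be NaN,
// so this overload answers directly instead of calling std::isnan.
template <typename T>
typename std::enable_if<std::is_integral<T>::value, bool>::type
IsNan(T /*value*/) {
  return false;
}

// Floating-point values are forwarded to std::isnan unchanged.
template <typename T>
typename std::enable_if<std::is_floating_point<T>::value, bool>::type
IsNan(T value) {
  return std::isnan(value);
}
```

Because the overload is selected at compile time, `std::isnan` is only ever instantiated with `float`, `double`, or `long double` arguments.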

Adapt tensor axis for cumsum (#45372) · 7f49b9ba
Committed by WangZhen on Aug 24, 2022

[Hackathon No.34] Optimize poisson op (#45160) · 3c14b094

Committed by Rayman on Aug 24, 2022

* [Hackathon No.34] Optimize poisson op

* [poisson] code style fix

* modify code style

* prevent from big number

* modify code style

* modify code style

* modify import

* modify import

* modify code style

[OpAttr]Adapt tensor minlength for bincount (#45342) · 12917c8c
Committed by WangZhen on Aug 24, 2022
```
* Adapt minlength attr for bincount
```
23 Aug, 2022 6 commits
- Delete the template parameter BLockSize in Kernel Primitive API (#45220) · 1a0cd447
  Committed by niuliling123 on Aug 23, 2022
  
- [Sparse]Use shorter function names (#45325) · 3a7b1810
  Committed by zhangkaihuo on Aug 23, 2022
```
* rename the member function of SparseTensor

* use shorter function names
```
- first commit (#45253) · b5d8bd2f
  Committed by limingshu on Aug 23, 2022
  
- [Geometric] Fix cuda configuration error for message_passing api (#45315) · 03ef0bdc
  Committed by Siming Dai on Aug 23, 2022
  
- [PaddlePaddle Hackathon 3 No.33] Optimize the GPU computing performance of the erfinv op for Paddle (#45057) · 0e384ade
  Committed by thunder95 on Aug 23, 2022
```
* erfinv

* fix some tiny issues
```
- [Phi]Move distribute_fpn_proposals to PHI (#45212) · 8f8ed7de
  Committed by YuanRisheng on Aug 23, 2022
```
* move distribute_fpn_proposals

* fix some code

* fix yaml bugs

* add set dtype

* move proposal_impl to funcs

* fix compile bugs
```
22 Aug, 2022 3 commits
- [Eager] some python c api use final state (#45221) · d2ef888b
  Committed by wanghuancoder on Aug 22, 2022
```
some python c api use final state
```
- rename the member function of SparseTensor (#45291) · 016b94c2
  Committed by zhangkaihuo on Aug 22, 2022
  
- fix infershape in compile time (#45156) · ed57237e
  Committed by shangliang Xu on Aug 22, 2022
  
19 Aug, 2022 1 commit
- Trt groupnorm dynamic plugin (#44911) · 1aa6adb1
  Committed by Wang Bojun on Aug 19, 2022
```
* add group_norm dynamic plugin
```
18 Aug, 2022 3 commits

[phi] Transfer fluid trilinear_interp_v2 to phi trilinear_interp (add yaml) (#45145) · 6150fade

Committed by HongyuJia on Aug 18, 2022

* transfer trilinear op to phi, change name from trilinear_interp_v2 to trilinear_interp

* reserve linear_interp param

* change testcase scale if-branch

* testcase test_imperative_case

* fix trilinear testcase

* import paddle in test_trilinear_interp_v2

[OpAttr]Squeeze axes support Tensor (#45189) · c93451f4
Committed by Aurelius84 on Aug 18, 2022
```
* [OpAttr]Squeeze axes support Tensor

* add support_tensor

* fix unittest

* fix coverage
```
[phi] Transfer fluid bilinear_interp_v2 to phi bilinear_interp (add yaml) (#45140) · 2c2137bb

Committed by HongyuJia on Aug 18, 2022

* transfer bilinear op to phi, change name from bilinear_interp_v2 to bilinear_interp

* reserve linear_interp param

* fix cross device import

17 Aug, 2022 4 commits
- Reuse addKernel to replace TensorAdd (#45161) · 0e3b49d4
  Committed by Leo Chen on Aug 17, 2022
```
* use addKernel

* fix compile

* remove elementwiseAddto

* add return

* fix custom place
```
- add instance norm op for xpu (#45097) · 216d25ac
  Committed by ykkk2333 on Aug 17, 2022
```
* xpu unittest grad compute supports more types, *test=kunlun

* add instance norm xpu, *test=kunlun
```
- [phi] Transfer fluid bicubic_interp_v2 to phi bicubic_interp (add yaml) (#45151) · f4da2d4d
  Committed by HongyuJia on Aug 17, 2022
```
* transfer bicubic_interp op to phi, change name from bicubic_interp_v2 to bicubic_interp

* test final_state_bicubic_interp api

* testcase match imperative case
```
- Fix squared_l2_norm wrong stream bug (#45174) · 951010a2
  Committed by sneaxiy on Aug 17, 2022
```
* fix squared_l2_norm bug

* update buffer.h
```
16 Aug, 2022 3 commits

[Phi] Move amp ops into phi (#45079) · b4f67757

Committed by Chen Weihang on Aug 16, 2022

* move check finite and unscale kernel into phi

* move infershape into phi

* move update_loss_scaling kernel into phi

* remove original kernels

* move update loss scaling infershape into phi

* add header for xpu and npu

* solve coverage failed

* fix npu test failed

* remove mutable data in cu file

* fix new executor failed

* add valid check for meta tensor output

[geometric]Add paddle.geometric.send_uv API (#44848) · 88724a53

Committed by Siming Dai on Aug 16, 2022

* initial commit

* fix op maker bug

* fix mul grad bug

* add unittest

* fix add grad bug, add cpu kernel

* add paddle.geometric.message_passing

* add paddle.geometric.send_uv api, add unittest

* add fp16 judgement

* fix file typo, move compute_type to message_op

* add impl file

* fix unittest timeout time

* add review revise

transfer nearest_interp op to phi, change name from nearest_interp_v2 to nearest_interp (#45148) · 6452ab3b
Committed by HongyuJia on Aug 16, 2022
