提交 · a0e3a1754ba7d71309c6e126df1723af31b1f040 · Crayon鑫 / Paddle

31 8月, 2022 6 次提交

X
[XPU] transfer concat kernel (#45463) · a0e3a175
由 xiongkun 提交于 8月 31, 2022
```
* transfer concat kernel

* test=kunlun

* test=kunlun

* test=kunlun

* test=kunlun
```
a0e3a175

[XPU] move pool/pool_grad xpu kernel to phi (#45480) · 13a0ea4c

由 james 提交于 8月 31, 2022

* move pool/pool_grad xpu kernel to phi, test=kunlun

* replace mutable_data() with DeviceContext::Alloc()

* replace PADDLE_ENFORCE_EQ with PADDLE_ENFORCE_XDNN, test=kunlun

* adjust function param name & update include header

* remove pool_op_xpu.cc

* fire r200 test

* minor, test=kunlun

13a0ea4c

R

[ROCM] fix bmm_kernel (#45530) · 4478389c
由 ronnywang 提交于 8月 31, 2022

4478389c

Fix split api bug (#45396) · 4a25b60d

由 Charles-hit 提交于 8月 31, 2022

* fix split bug

* solve function redefine

* fix fluid.layers.split and add unit test

* delete splitInferMeta register in unary.cc

* modify test_split_op GPU unit test

* modify test_split_op GPU unit test place param

* refactor split op and fix infershape bugs

* add () in && and ||

* fix split C++ unit test

* fix split infershape

4a25b60d

Move XPU mean and mean_grad to phi (#45512) · df7600ab

由 WangZhen 提交于 8月 31, 2022

* Move XPU mean and mean_grad to phi, test=kunlun

* Fix stream, test=kunlun

* Replace ENFORCE, test=kunlun

df7600ab

L

Add index add API (#45176) · 45171911
由 Li Min 提交于 8月 31, 2022

45171911

30 8月, 2022 12 次提交
- H
  [phi] Transfer coalesce_tensor to phi (#45478) · cf9d651b
  由 HongyuJia 提交于 8月 30, 2022
```
* add coalesce_tensor kernel

* polist coalesce_tensor kernel

* add sig and InferMeta

* add testcase

* add legacy_api.yaml

* fix infermeta

* fix yaml

* fix kernel implementation

* add compile dependency of phi/kernels

* fix MetaConfig

* add python api

* add and fix testcase

* rnn.py add import

* change _C_ops.coalesce_tensor

* remove useless comments

* add SetBackend

* restore XPU kernel temporarily

* fix code according to PR comments
```
  cf9d651b
- P
  [PHI] move huber_loss/huber_loss_grad xpu kernel to phi (#45521) · f69d2c32
  由 pangyoki 提交于 8月 30, 2022
```
* move huber_loss xpu kernel to phi, test=kunlun

* fix, test=kunlun

* fix paddle_enforce, test=kunlun
```
  f69d2c32
- Z
  
  Move prior_box, softmax and softmax_grad kernel to phi, test=kunlun (#45510) · 6dd13152
  由 zhangyikun02 提交于 8月 30, 2022
  
  6dd13152
- W
  [OpAttr]Adapt tensor axis for argmin/max (#45453) · 6fc15986
  由 WangZhen 提交于 8月 30, 2022
```
* Adapt tensor axis for argmin/max

* Add UT

* Polish UT
```
  6fc15986
- P
  [PHI] move layer_norm/layer_norm_grad xpu kernel to phi (#45524) · 871e3329
  由 pangyoki 提交于 8月 30, 2022
```
* move layer_norm xpu kernel to phi, test=kunlun

* fix, test=kunlun
```
  871e3329
- W
  [OpAttr]Adapt tensor axis for reduce_min/max/mean/sum/prod (#45078) · 32f42e94
  由 WangZhen 提交于 8月 30, 2022
```
* [OpAttr]Adapt tensor axis for reduce_min/max/mean/sum/prod
```
  32f42e94
- A
  [OpAttr]padding_value of Pad support Tensor type (#45514) · db235bf0
  由 Aurelius84 提交于 8月 30, 2022
```
* [OpAttr]padding_value of Pad support Tensor type

* fix unittest

* fix unittest

* fix coverage
```
  db235bf0
- K
  fix memcpy_h2d bug related to cuda stream setting when allocate memory (#45450) · 10abdb8f
  由 kangguangli 提交于 8月 30, 2022
```
* fix memcpy_h2d bug related to cuda stream setting when allocate memory

* add header file

* fix compile error for cpu only
```
  10abdb8f
- L
  move gelu/gelu_grad/generate_proposals_v2 kernel to phi (#45471) · 8b24c795
  由 Leo Chen 提交于 8月 30, 2022
```
* move xpu kernel to phi

* delete fluid file

* fix compile

* add guard, test=kunlun

* xpu set constant

* fix xpu error, test=kunlun
```
  8b24c795
- W
  
  Adapt tensor num_samples for multinomial (#45522) · c857841e
  由 WangZhen 提交于 8月 30, 2022
  
  c857841e
- M
  
  strided_slice grad add fp16 support (#45504) · 51f4291c
  由 ming1753 提交于 8月 30, 2022
  
  51f4291c
- C
  
  rename mod c api name (#45476) · ad96fe2c
  由 Chen Weihang 提交于 8月 29, 2022
  
  ad96fe2c
29 8月, 2022 13 次提交
- Y
  [PHI]Mv xpu elementwise add kernel to phi (#45473) · bb3e4e0c
  由 YuanRisheng 提交于 8月 29, 2022
```
* mv elementwise add to xpu , test=kunlun

* fix ci bugs, test=kunlun

* fix ci bugs , test=kunlun
```
  bb3e4e0c
- S
  [PHI] Migrate relu6 and abs kernels (#45397) · 632bc1f2
  由 Sławomir Siwek 提交于 8月 29, 2022
```
* abs relu6 fwd

* abs bwd

* gaussian_random_kernel and mkldnn-onednn renaming

* scale kernel

* whitespace

* whitespace

* revert scale migration

* whitespaces

* revert changes to gaussian kernel

* whitespaces
```
  632bc1f2
- W
  [XPU] migrate mul to phi (#45502) · 923594de
  由 Weilong Wu 提交于 8月 29, 2022
```
* [XPU] migrate mul to phi;test=kunlun

* rm fluid mul xpu op;test=kunlun
```
  923594de
- C
  Migrate assign xpu kernel into phi (#45467) · 0710f058
  由 Chen Weihang 提交于 8月 29, 2022
```
* migrate assign xpu kernel, test=kunlun

* remove assign_value xpu, test=kunlun
```
  0710f058
- W
  [Phi] gather gather_grad gather_nd gaussian_random xpu to Phi (#45465) · 60e1eccb
  由 wanghuancoder 提交于 8月 29, 2022
```
* gather gather_grad gather_nd gaussian_random xpu to phi
```
  60e1eccb
- Z
  
  refine expand_as_v2 XPU kernel, test=kunlun (#45501) · ca5567e1
  由 zhangbo9674 提交于 8月 29, 2022
  
  ca5567e1
- Z
  Fix HardSwish inf (#35386) · fbd83812
  由 Zhang Ting 提交于 8月 29, 2022
```
* fix hard_swish inf

* skip_check_grad for mkldnn op

* 'fix code style'

* fix unittest
```
  fbd83812
- A
  [OpAttr]num_rows/num_colums of eye support Tensor type (#45427) · b93b710a
  由 Aurelius84 提交于 8月 29, 2022
```
* [OpAttr]num_rows/num_colums of eye support Tensor type

* fix attr cast with long type
```
  b93b710a
- S
  [geometric]Move graph-related incubate api to geometric (#44970) · 8f657f74
  由 Siming Dai 提交于 8月 29, 2022
```
* move incubate to geometric

* add paddle.geometric

* fix unittest bug

* add float16 support for segment op

* change reindex and sample neighbors flag name

* add heter graph reindex

* move sample_neighbors.py to neighbors.py

* delete khop_sampler in geometric

* delete unused code

* change sample_neighbors api input order

* fix en doc

* fix unittest

* fix unittest

* change reindex

* fix division by 0

* delete unnecessary input argument

* delete final_state
```
  8f657f74
- Z
  
  Move expand_as_v2 XPU kernel to PHI, test=kunlun (#45474) · 4b749513
  由 zhangbo9674 提交于 8月 29, 2022
  
  4b749513
- Z
  
  move expand_v2 to phi, test=kunlun (#45469) · 23a79923
  由 zhangbo9674 提交于 8月 29, 2022
  
  23a79923
- Z
  Move matmul_v2 kernel of xpu from fluid to phi (#45446) · de436f07
  由 zyfncg 提交于 8月 29, 2022
```
* move matmul_v2 kernel of xpu from fluid to phi, test=kunlun

* fix complie bug, test=kunlun

* fix complie bug, test=kunlun

* fix complie bug, test=kunlun
```
  de436f07
- W
  
  [XPU] migrate bce_loss to phi;test=kunlun (#45459) · d3ec3fe3
  由 Weilong Wu 提交于 8月 29, 2022
  
  d3ec3fe3
26 8月, 2022 4 次提交
- R
  
  Move conv2d_transpose_grad XPU kernel to PHI, test=kunlun (#45466) · a635a8a5
  由 Ruibiao Chen 提交于 8月 26, 2022
  
  a635a8a5
- R
  
  Move grid_sample XPU kernel to PHI, test=kunlun (#45425) · 2c89bccb
  由 Ruibiao Chen 提交于 8月 26, 2022
  
  2c89bccb
- R
  
  Move conv2d_transpose XPU kernel to PHI, test=kunlun (#45419) · 1f1a7835
  由 Ruibiao Chen 提交于 8月 26, 2022
  
  1f1a7835
- K
  Transfer transfer_layout from fluid to phi (#45261) · 985f2a4a
  由 kangguangli 提交于 8月 26, 2022
```
* remove fluid kernel and activate phi kernel

* fix parameter error

* transfer mkldnn part

* modify header file path

* fix compile error

* transfer special case

* fix lod setting and special case for layout setting

* add testcase and refine code
```
  985f2a4a
25 8月, 2022 5 次提交

Enable OMP multithreading in lookup_table_v2 (#45249) · 0c363de8

由 piotrekobi 提交于 8月 25, 2022

* Add omp parallel for directives

* Revert "Add omp parallel for directives"

This reverts commit f4e4f8ddb12454018d9c1e49c074af2543659de6.

* Add #pragma omp parallel for to correct file

* Add check for _OPENMP definition

* Disable omp on gpu

* Trigger CI

* Readd check for _OPENMP definition

* Change macro disabling changes on GPU

* Improve macro readability

0c363de8

A
[OpAttr]axis of Reverse Support Tensor type (#45391) · 91110661
由 Aurelius84 提交于 8月 25, 2022
```
* [OpAttr]axis of Reverse Support Tensor type

* fix coverage

* fix unittest
```
91110661
A
[OpAttr]min/max of uniform_random support Tensor type (#45417) · c8955d0d
由 Aurelius84 提交于 8月 25, 2022
```
* [OpAttr]min/max of Uniform_rand support Tensor type

* fix typo
```
c8955d0d

Transfer memcpy d2h from fluid to phi (#45150) · 0d14e74a

由 kangguangli 提交于 8月 25, 2022

* transfer memcpy_d2h from fluid to phi

* refine arg check and add comment

* fix cannot fallback to phi kernel

* fix gpu_context host alloc when tensor size = 0

* add kernel for std::vector<DenseTensor> args

* fix bugs in MemcpyD2HMultiIOKernel

* remove useless header file

* polish format

* fix typo

* add testcase for cudapinned place

* refine check condition in test

* polish error message

* polish error message

* remove header in fluid  directory

* merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation

* fix code style check

0d14e74a

S
make full_like support double_max in dygraph (#45385) · edd66f2e
由 Sing_chan 提交于 8月 25, 2022
```
* make full_like support double_max in dygraph

* fix bug
```
edd66f2e

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致