提交 · c8fc33798a1a28354b90d78feac181000b96451b · PaddlePaddle / Paddle

31 10月, 2022 1 次提交
- [Zero-Dim] support input 0D Tensor for reduce_sum/reduce_mean (#47219) · c8fc3379
  由 zhouweiwei2014 提交于 10月 31, 2022
  
  c8fc3379
26 10月, 2022 1 次提交
- L
  [Fix] Fix paddle.pow() Gets Incorrect Result When Broadcasting Is Triggered (#47307) · d8314ff5
  由 Lin Manhui 提交于 10月 26, 2022
```
* Fix paddle.pow() bugs

* Add unittest cases

* Fix ut cases

* Add ut cases on multiple devices
```
  d8314ff5
19 10月, 2022 1 次提交
- W
  
  slice op supports uint8_t (#47067) · 1e1c7275
  由 will-jl944 提交于 10月 19, 2022
  
  1e1c7275
17 10月, 2022 1 次提交
- Y
  [PHI]Modify DataLayout's namespace from paddle::experimental to phi (#46869) · ec749398
  由 YuanRisheng 提交于 10月 17, 2022
```
* namespace modify

* update by comment
```
  ec749398
14 10月, 2022 1 次提交
- W
  TRT pool2d adaptive mode bugfix (#46802) · eb32746a
  由 Wang Bojun 提交于 10月 14, 2022
```
* draft with debug print
```
  eb32746a
13 10月, 2022 1 次提交

由 Zhang Ting 提交于 10月 13, 2022

* Revert "【Hackathon No.56&38】deformable_conv_v1 算子实现 float16 数据类型支持&前向运行加速 (#46111)"

cf9ca61d

12 10月, 2022 2 次提交
- S
  Fix some operators when the tensor.numel() > INT32_MAX (#46767) · e896567e
  由 sneaxiy 提交于 10月 12, 2022
```
* fix some ops for int64 range

* update error message
```
  e896567e
- [Zero-Dim] support input 0D Tensor for some unary api (#45992) · 05c2b9ba
  由 zhouweiwei2014 提交于 10月 12, 2022
```
* [Zero-Dim] support input 0D Tensor for unary api

* fix CI
```
  05c2b9ba
11 10月, 2022 1 次提交
- C
  Remove LoDTensor using in fluid (Part 1) (#46663) · 940d8f25
  由 Chen Weihang 提交于 10月 11, 2022
```
* remove using lodtensor part1

* polish history code format
```
  940d8f25
10 10月, 2022 1 次提交
- R
  【Hackathon No.56&38】deformable_conv_v1 算子实现 float16 数据类型支持&前向运行加速 (#46111) · 5e0614a1
  由 Rayman 提交于 10月 10, 2022
```
support fp16 for deformable conv
```
  5e0614a1
03 10月, 2022 1 次提交
- J
  Requantize to use Memory Desc in Tensors (#46608) · a579e523
  由 Jacek Czaja 提交于 10月 03, 2022
```
* - some more MD changes

* - lint

* - compilation fixes

* - compilation fixes

* - lint

* - fix
```
  a579e523
30 9月, 2022 1 次提交

support pure bfloat16 for more ops (#46364) · b7b231a6

由 sneaxiy 提交于 9月 30, 2022

* support pure bfloat16

* support bf16 linear

* update PR to pass CI

* tiny fix where_grad_kernel.cu

* add bfloat16 to selu_grad to pass CI

* fix selu grad compilation error

b7b231a6

28 9月, 2022 2 次提交

Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e

由 Chen Weihang 提交于 9月 28, 2022

* remove needless using tensor

* remove needless using tensor

* resolve conflict

* replace tensor using

* fix format error

* revert needless changing

* fix rocm and npu compile error

* fix cinn compile error

* fix format error

* fix mkldnn format error

* fix mkldnn format error

* fix cinn compile error

* fix cinn compile error

* fix cinn compile error

* resolve conflict

e12a905e

[NPU] add gpu kernel for transfer layout (#46307) · 526d963e

由 kangguangli 提交于 9月 28, 2022

* add gpu kernel for transfer layout

* comment error throw

* fix: flag setting in testcase; add condition check for raising error

* fix typo

* fix: add error type for PADDLE_THROW

* remove kernel fallback in data_transfer.cc

* remove useless variable definition

526d963e

26 9月, 2022 1 次提交
- L
  
  [Fix] Remove std::trunc() in FloorDivideFunctor and InverseFloorDivideFunctor (#45051) · 091ae705
  由 Lin Manhui 提交于 9月 26, 2022
  
  091ae705
23 9月, 2022 1 次提交
- Y
  
  move selected_rows_functor (#46373) · b6c6f4f9
  由 YuanRisheng 提交于 9月 23, 2022
  
  b6c6f4f9
21 9月, 2022 1 次提交
- Z
  Revert "SparseConv support duplicate coordinates (#44976)" (#45202) · 8fbe97e4
  由 zhangkaihuo 提交于 9月 21, 2022
```
This reverts commit e8de9dfd.
```
  8fbe97e4
20 9月, 2022 4 次提交
- 5
  
  optimization of max_pool3d grad (#45934) · 0e563da6
  由 5u13 提交于 9月 20, 2022
  
  0e563da6
- O
  【PFCC算子性能优化】为Paddle优化adaptive_pooling_op性能 (#45959) · 6d067860
  由 Ouyang Chao 提交于 9月 20, 2022
```
* optimize adaptive_pooling_op (forward)

* fix bug of AdaptiveKernelMaxPool2dWithIdx

* fix bug of AdaptiveKernelPool2D
```
  6d067860
- Y
  
  move reduce func (#46248) · 6b47507d
  由 YuanRisheng 提交于 9月 20, 2022
  
  6b47507d
- J
  [Eager Bug fix]Fix Detection (#46147) · 192e7ccf
  由 Jiabin Yang 提交于 9月 20, 2022
```
* fix linspace error in amp

* fix log

* fix amp error

* Revert "Simplify size op impl (#45808)"

This reverts commit c252b1de.

* fix_seg

* fix detection
Co-authored-by: NChen Weihang <sunny_cwh@163.com>
```
  192e7ccf
19 9月, 2022 2 次提交

Fix wrong eigen header include (#46082) · 59a2a987

由 zyfncg 提交于 9月 19, 2022

* fix wrong eigen header include

* fix complie bug

* fix nan_inf_utils_detail

* fix resource_manager

* fix conv_miopen_helper

59a2a987

Performance fix for broadcast kernel [Part3] (#46071) · 46e4fb2a

由 limingshu 提交于 9月 19, 2022

* first commit

* refine code with template argument

* refine code with template argument

* add ternary broadcast test file

* add ternary broadcast test file

* fix accoriding to ci

* fix op-benchmark ci error

46e4fb2a

16 9月, 2022 1 次提交

Support broadcast elementwise operators with int64 index type (#45741) · 20b5bf84

由 sneaxiy 提交于 9月 16, 2022

* support int64 non-broadcast

* support broadcast case for int64 index

* fix bug

* support more Arity

* remove some codes

* upgrade patchelf to v0.15.0 to pass CI build

* fix bug

* fix patchelf installation

* add debug flags

* remove useless codes

* fix viterbi_decode and set_value op uts

* remove always enable int64

20b5bf84

15 9月, 2022 2 次提交
- W
  Support 0 shapes input Tensor for MKL slice (#45930) · 1d78681d
  由 WangZhen 提交于 9月 15, 2022
```
Support 0 shapes input Tensor for MKL slice kernel
```
  1d78681d
- L
  Performance fix for broadcast kernel [Part3] (#45854) · f48b1264
  由 limingshu 提交于 9月 15, 2022
```
* first commit

* fix some bugs in code

* fix bugs

* to optimize merge one dimension feature
```
  f48b1264
09 9月, 2022 3 次提交
- S
  Fix softmax op when the input shape is larger than INT32_MAX (#45897) · 38edea9a
  由 sneaxiy 提交于 9月 09, 2022
```
* fix softmax int64

* follow comments
```
  38edea9a
- 5
  
  optimization of max_pool3d forward (#45820) · 2632d77d
  由 5u13 提交于 9月 09, 2022
  
  2632d77d
- X
  modify slice op Infershape (#45855) · 97847ae8
  由 xiaoguoguo626807 提交于 9月 09, 2022
```
* modify slice infershape

* code style

* modify slice_unittest
```
  97847ae8
07 9月, 2022 2 次提交
- H
  
  [XPU] move rnn op to phi. (#45822) · 91631492
  由 houj04 提交于 9月 07, 2022
  
  91631492
- L
  Performance fix for broadcast kernel [Part2] (#40051) · 87cba48b
  由 limingshu 提交于 9月 07, 2022
```
* first commit

* merged with develop

* merged with develop

* fix merge sequential one dims bugs
```
  87cba48b
06 9月, 2022 2 次提交
- Y
  [PHI]Add TensorArray for PHI (#45479) · 68f99b78
  由 YuanRisheng 提交于 9月 06, 2022
```
* add tensor array

* fix ci bugs

* fix ci bugs

* fix ci bugs

* fix ci bugs

* update by comment

* update code
```
  68f99b78
- X
  
  elementwise op support fp16 (#45496) · f6d9ec27
  由 xiaohemaikoo 提交于 9月 06, 2022
  
  f6d9ec27
05 9月, 2022 1 次提交
- K
  [Bug Fix] fix compile error in gcc540 (#45702) · fd56f08e
  由 kangguangli 提交于 9月 05, 2022
```
* fix compile error in gcc540
```
  fd56f08e
04 9月, 2022 1 次提交

[PHI] Migrate gaussian_random kernel (#45481) · 4e3d222d

由 Sławomir Siwek 提交于 9月 04, 2022

* gaussian random

* mkldnn to onednn renaming

* fix merge conflicts

* remove fluid code

* onednn renaming

* change header path

* change fluid import to phi

4e3d222d

02 9月, 2022 4 次提交
- K
  
  move onednn file from phi/kernels/funcs/onednn to phi/backends/onednn (#45659) · 6813f41e
  由 kangguangli 提交于 9月 02, 2022
  
  6813f41e
- T
  
  xpu-paddlepaddle-38 [任务] 迁移bilinear_interp，nearest_interp到phi test=kunlun (#45608) · 445fce62
  由 taixiurong 提交于 9月 02, 2022
  
  445fce62
- Y
  
  interpolate (forward grad) op support fp16 on gpu (#45061) · b12c27eb
  由 Yuanle Liu 提交于 9月 02, 2022
  
  b12c27eb
- A
  [XPU]Migrate Adam XPU kernel into Phi (#45572) · cbabbe2e
  由 Aurelius84 提交于 9月 02, 2022
```
* [XPU]Migrate Adam XPU kernel into Phi

* test=kunlun
```
  cbabbe2e
01 9月, 2022 1 次提交

[phi] Migrate uniform_random XPU kernel to PHI (#45583) · ded33b58

由 HongyuJia 提交于 9月 01, 2022

* copy kernel file to phi

* delete some code

* migrate uniform_random, test=kunlun

* fix input error, test=kunlun

* fix gpu register error, test=kunlun

* add include file, test=kunlun

* try fix error from CI, test=kunlun

* polish other PR

* fix CI-coverage error, test=kunlun

ded33b58

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功