提交 · a38fc5e1818657ddc0f82323bd4ff1eaca0de7a1 · PaddlePaddle / Paddle

10 11月, 2022 4 次提交

P
[PHI decoupling] remove "paddle/fluid/platform/device/gpu/gpu_launch_config.h" in phi (#47808) · 40a9b488
由 PuQing 提交于 11月 10, 2022
```
* rm fluid gpu_launch_config

* fix type
```
40a9b488

[PHI Decoupling] remove "paddle/fluid/platform/float16.h" and... · 8164b97a

由 huangjiyi 提交于 11月 10, 2022

[PHI Decoupling] remove "paddle/fluid/platform/float16.h" and "paddle/fluid/platform/for_range.h" in phi. (#47817)

* rm "paddle/fluid/platform/float16.h" in phi

* rm "paddle/fluid/platform/for_range.h" in phi

8164b97a

[PHI Decoupling] remove dependency on "paddle/fluid/platform/errors.h" and... · 4c375454

由 huangjiyi 提交于 11月 10, 2022

[PHI Decoupling] remove dependency on "paddle/fluid/platform/errors.h" and "paddle/fluid/platform/fast_divmod.h" in phi. (#47815)

* rm "paddle/fluid/platform/errors.h" in phi

* rm "paddle/fluid/platform/fast_divmod.h" in phi

4c375454

XPU multi-card support eager mode (#47445) · 3b91f8f3

由 james 提交于 11月 10, 2022

* XPU support eager mode

* add unittest for XPU eager mode

* minor bugfix

* minor bugfix, test=kunlun

* correct copyright info

* 1. remove unsed vars/funcs
2. ProcessGroupBKCL inherit from ProcessGroupStream

* bugfix for fp16 in eager mode multi-card, test=kunlun

* rebase & fix a few issues

* use new processgroup interface, test=kunlun

* fix compile issue, test=kunlun

3b91f8f3

09 11月, 2022 5 次提交
- H
  [PHI decoupling] remove "paddle/fluid/platform/dynload/xxx.h" in phi (#47787) · 7c302538
  由 huangjiyi 提交于 11月 09, 2022
```
* rm "paddle/fluid/platform/dynload/cudnn.h" in phi

* rm "paddle/fluid/platform/dynload/mklml.h" in phi

* rm "paddle/fluid/platform/dynload/rocblas.h" in phi

* replace "paddle::platform::dynload::" with "phi::dynload::" in phi

* revert "blas_impl.cu.h"
```
  7c302538
- W
  [PHI decoupling] remove framework/data_type.h from phi (#47776) · 1631836f
  由 Wang Xin 提交于 11月 09, 2022
```
* remove framework/data_type.h from phi

* fix CI fail: map proto::VarType to phi::DataType

* refactor code to add more detailed comments
```
  1631836f
- H
  
  rm "paddle/fluid/platform/dynload/cublas.h" in phi (#47778) · 692a9632
  由 huangjiyi 提交于 11月 09, 2022
  
  692a9632
- C
  
  add sin triple grad operator (#47753) · 267b218f
  由 cyber-pioneer 提交于 11月 09, 2022
  
  267b218f
- Z
  
  [Sparse]optimize sparse convolution and fix MaskHelper bug (#47703) · 1aa64d13
  由 zhangkaihuo 提交于 11月 09, 2022
  
  1aa64d13
08 11月, 2022 1 次提交
- J
  removing dependent to fluid/framework/eigen.h in phi (#47675) · c7cd8d98
  由 jzhang533 提交于 11月 08, 2022
```
* removing dependent to fluid/framework/eigen.h in phi

* more fix according to PR-CI-Py3 fail
```
  c7cd8d98
04 11月, 2022 1 次提交

Add sin double grad operator. (#47543) · 297f5efe

由 cyber-pioneer 提交于 11月 04, 2022

* add sin double grad operator

* add sin double grad test example

* move sindoublegradopmaker to backward.yaml

* fix sindoublegrad code

* simplify sindoublegrad functor

297f5efe

03 11月, 2022 2 次提交
- sparse attention kernel is used from 11.8 (#47594) · 7648f429
  由 zhouweiwei2014 提交于 11月 03, 2022
  
  7648f429
- S
  
  fix gemm compute_type (#47613) · 954be40d
  由 sneaxiy 提交于 11月 03, 2022
  
  954be40d
02 11月, 2022 2 次提交
- T
  
  fix amax/amin/max/min write overflow (#47570) · 6f7a80c3
  由 Tao Luo 提交于 11月 02, 2022
  
  6f7a80c3
- [Zero-Dim] support input 0D Tensor for some binary api (#46909) · cad2e68d
  由 zhouweiwei2014 提交于 11月 02, 2022
  
  cad2e68d
31 10月, 2022 2 次提交
- Y
  [PHI]Standardise some C++ API (#47385) · 60e0c506
  由 YuanRisheng 提交于 10月 31, 2022
```
* standard api

* fix ci bugs

* fix ci bugs

* fix ce bugs
```
  60e0c506
- [Zero-Dim] support input 0D Tensor for reduce_sum/reduce_mean (#47219) · c8fc3379
  由 zhouweiwei2014 提交于 10月 31, 2022
  
  c8fc3379
26 10月, 2022 1 次提交
- L
  [Fix] Fix paddle.pow() Gets Incorrect Result When Broadcasting Is Triggered (#47307) · d8314ff5
  由 Lin Manhui 提交于 10月 26, 2022
```
* Fix paddle.pow() bugs

* Add unittest cases

* Fix ut cases

* Add ut cases on multiple devices
```
  d8314ff5
19 10月, 2022 1 次提交
- W
  
  slice op supports uint8_t (#47067) · 1e1c7275
  由 will-jl944 提交于 10月 19, 2022
  
  1e1c7275
17 10月, 2022 1 次提交
- Y
  [PHI]Modify DataLayout's namespace from paddle::experimental to phi (#46869) · ec749398
  由 YuanRisheng 提交于 10月 17, 2022
```
* namespace modify

* update by comment
```
  ec749398
14 10月, 2022 1 次提交
- W
  TRT pool2d adaptive mode bugfix (#46802) · eb32746a
  由 Wang Bojun 提交于 10月 14, 2022
```
* draft with debug print
```
  eb32746a
13 10月, 2022 1 次提交

Revert #46111 (#46961) · cf9ca61d

由 Zhang Ting 提交于 10月 13, 2022

* Revert "【Hackathon No.56&38】deformable_conv_v1 算子实现 float16 数据类型支持&前向运行加速 (#46111)"

cf9ca61d

12 10月, 2022 2 次提交
- S
  Fix some operators when the tensor.numel() > INT32_MAX (#46767) · e896567e
  由 sneaxiy 提交于 10月 12, 2022
```
* fix some ops for int64 range

* update error message
```
  e896567e
- [Zero-Dim] support input 0D Tensor for some unary api (#45992) · 05c2b9ba
  由 zhouweiwei2014 提交于 10月 12, 2022
```
* [Zero-Dim] support input 0D Tensor for unary api

* fix CI
```
  05c2b9ba
11 10月, 2022 1 次提交
- C
  Remove LoDTensor using in fluid (Part 1) (#46663) · 940d8f25
  由 Chen Weihang 提交于 10月 11, 2022
```
* remove using lodtensor part1

* polish history code format
```
  940d8f25
10 10月, 2022 1 次提交
- R
  【Hackathon No.56&38】deformable_conv_v1 算子实现 float16 数据类型支持&前向运行加速 (#46111) · 5e0614a1
  由 Rayman 提交于 10月 10, 2022
```
support fp16 for deformable conv
```
  5e0614a1
03 10月, 2022 1 次提交
- J
  Requantize to use Memory Desc in Tensors (#46608) · a579e523
  由 Jacek Czaja 提交于 10月 03, 2022
```
* - some more MD changes

* - lint

* - compilation fixes

* - compilation fixes

* - lint

* - fix
```
  a579e523
30 9月, 2022 1 次提交

support pure bfloat16 for more ops (#46364) · b7b231a6

由 sneaxiy 提交于 9月 30, 2022

* support pure bfloat16

* support bf16 linear

* update PR to pass CI

* tiny fix where_grad_kernel.cu

* add bfloat16 to selu_grad to pass CI

* fix selu grad compilation error

b7b231a6

28 9月, 2022 2 次提交

Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e

由 Chen Weihang 提交于 9月 28, 2022

* remove needless using tensor

* remove needless using tensor

* resolve conflict

* replace tensor using

* fix format error

* revert needless changing

* fix rocm and npu compile error

* fix cinn compile error

* fix format error

* fix mkldnn format error

* fix mkldnn format error

* fix cinn compile error

* fix cinn compile error

* fix cinn compile error

* resolve conflict

e12a905e

[NPU] add gpu kernel for transfer layout (#46307) · 526d963e

由 kangguangli 提交于 9月 28, 2022

* add gpu kernel for transfer layout

* comment error throw

* fix: flag setting in testcase; add condition check for raising error

* fix typo

* fix: add error type for PADDLE_THROW

* remove kernel fallback in data_transfer.cc

* remove useless variable definition

526d963e

26 9月, 2022 1 次提交
- L
  
  [Fix] Remove std::trunc() in FloorDivideFunctor and InverseFloorDivideFunctor (#45051) · 091ae705
  由 Lin Manhui 提交于 9月 26, 2022
  
  091ae705
23 9月, 2022 1 次提交
- Y
  
  move selected_rows_functor (#46373) · b6c6f4f9
  由 YuanRisheng 提交于 9月 23, 2022
  
  b6c6f4f9
21 9月, 2022 1 次提交
- Z
  Revert "SparseConv support duplicate coordinates (#44976)" (#45202) · 8fbe97e4
  由 zhangkaihuo 提交于 9月 21, 2022
```
This reverts commit e8de9dfd.
```
  8fbe97e4
20 9月, 2022 4 次提交
- 5
  
  optimization of max_pool3d grad (#45934) · 0e563da6
  由 5u13 提交于 9月 20, 2022
  
  0e563da6
- O
  【PFCC算子性能优化】为Paddle优化adaptive_pooling_op性能 (#45959) · 6d067860
  由 Ouyang Chao 提交于 9月 20, 2022
```
* optimize adaptive_pooling_op (forward)

* fix bug of AdaptiveKernelMaxPool2dWithIdx

* fix bug of AdaptiveKernelPool2D
```
  6d067860
- Y
  
  move reduce func (#46248) · 6b47507d
  由 YuanRisheng 提交于 9月 20, 2022
  
  6b47507d
- J
  [Eager Bug fix]Fix Detection (#46147) · 192e7ccf
  由 Jiabin Yang 提交于 9月 20, 2022
```
* fix linspace error in amp

* fix log

* fix amp error

* Revert "Simplify size op impl (#45808)"

This reverts commit c252b1de.

* fix_seg

* fix detection
Co-authored-by: NChen Weihang <sunny_cwh@163.com>
```
  192e7ccf
19 9月, 2022 2 次提交

Fix wrong eigen header include (#46082) · 59a2a987

由 zyfncg 提交于 9月 19, 2022

* fix wrong eigen header include

* fix complie bug

* fix nan_inf_utils_detail

* fix resource_manager

* fix conv_miopen_helper

59a2a987

Performance fix for broadcast kernel [Part3] (#46071) · 46e4fb2a

由 limingshu 提交于 9月 19, 2022

* first commit

* refine code with template argument

* refine code with template argument

* add ternary broadcast test file

* add ternary broadcast test file

* fix accoriding to ci

* fix op-benchmark ci error

46e4fb2a

16 9月, 2022 1 次提交

Support broadcast elementwise operators with int64 index type (#45741) · 20b5bf84

由 sneaxiy 提交于 9月 16, 2022

* support int64 non-broadcast

* support broadcast case for int64 index

* fix bug

* support more Arity

* remove some codes

* upgrade patchelf to v0.15.0 to pass CI build

* fix bug

* fix patchelf installation

* add debug flags

* remove useless codes

* fix viterbi_decode and set_value op uts

* remove always enable int64

20b5bf84

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功