提交 · d1e8b1e2ca84e4e458e47da9e5a95a96bcf5f330 · PaddlePaddle / Paddle

11 4月, 2023 1 次提交

Cherry pick for fix of operator precision. (#52705) · d1e8b1e2

由 Yiqun Liu 提交于 4月 11, 2023

* Fix scale kernel for low precision, cherry pick #50998.

* Fix the FP16 precision problem of add_n. (#50129)

* Change squared_l2_norm to reuse ReduceKernel, and register fp16 and bf16 kernel, which is cherry pick #48315.

* Cherry-pick the fix of MPTypeTrait in KP, which is implemented in #50993.

* Cherry-pick the multi-precision support of AdamW for bf16, #48041.

* Fix compiling error.

* Cherry-pick the fix of CubTensorReduceImpl for bfloat16 in #50993.

* Fix unittest.

---------
Co-authored-by: Nliuruyan <44316842+liuruyan@users.noreply.github.com>

d1e8b1e2

05 8月, 2022 1 次提交

move fft kernels to phi (#44714) · 153f1138

由 Feiyu Chan 提交于 8月 05, 2022

* move fft kernels to phi, done with cufft, pocketfft, mkl_cdft, hipfft
* make stft_op use fft from phi/kernels/funcs, clean code

153f1138

21 6月, 2022 1 次提交
- S
  resort .cu headers, set clang-format not sort include block and consider .cu... · 829723f2
  由 Sing_chan 提交于 6月 21, 2022
```
resort .cu headers, set clang-format not sort include block and consider .cu as main source file (#43633)
```
  829723f2
05 6月, 2022 1 次提交
- S
  
  【code format check upgrade】 step2：clang-format (#42840) · a3730dc8
  由 Sing_chan 提交于 6月 05, 2022
  
  a3730dc8
12 3月, 2022 1 次提交
- Z
  [PHI] Move forward kernel of roi_align into phi (#40382) · 39de9b8a
  由 zyfncg 提交于 3月 12, 2022
```
* move roi_align kernel to phi

* fix bug of roi_align xpu
```
  39de9b8a
01 3月, 2022 1 次提交

[bf16] add bf16 kernel: scale gather sum (#39683) · 6d26b332

由 zhangbo9674 提交于 3月 01, 2022

* add scale gather sum

* refine CUDA_ATOMIC_WRAPPER ADD for bf16

* add gather unittest

* solve conflict

* add scale uinttest

* add sum unittest

* solve conflict

* refine gather unittest

* refine unittest

6d26b332

22 2月, 2022 1 次提交
- C
  [PTen->Phi PR2] Rename PT_REGISTER macro to PD_REGISTER (#39790) · 4a338796
  由 Chen Weihang 提交于 2月 22, 2022
```
* unify register macro

* rename declare macro

* fix infrt error
```
  4a338796
20 2月, 2022 2 次提交
- C
  [PTen->Phi PR1] Change pten dirname and namespace to phi (#39748) · dcfe1986
  由 Chen Weihang 提交于 2月 20, 2022
```
* rename pten dir to phi

* rename namespace to phi

* rename infrt pten dir to phi

* resolve conflict

* rename pten to phi in cmake

* revert all infrt change

* change needed files

* fix infrt failed

* fix inference failed
```
  dcfe1986
- Y
  
  Rename the general elementwise and broadcast functions. (#39623) · 553afc07
  由 Yiqun Liu 提交于 2月 20, 2022
  
  553afc07
11 2月, 2022 1 次提交
- Z
  Support different dtypes of inputs for elementwise ops (#38859) · bf305033
  由 Zhang Ting 提交于 2月 11, 2022
```
* improve backward performance

* support different dtypes for elementwise ops
```
  bf305033
08 2月, 2022 1 次提交
- N
  Replace clip, bce_loss, full and full_like with elementwise (#39197) · 424700ff
  由 niuliling123 提交于 2月 08, 2022
```
* Replace clip, bce_loss, full and full_like with elementwise
```
  424700ff
28 1月, 2022 1 次提交
- Y
  [PTen]Refactor scale kernel that has selected_rows input (#39278) · abfc2fe9
  由 YuanRisheng 提交于 1月 28, 2022
```
* refactor scale kernel that its input is selected_rows

* complement upload file
```
  abfc2fe9
27 1月, 2022 1 次提交

[PTen]Support AllocateFrom in Tensor and Alloc/HostAlloc in Context (#39022) · 5631da9c

由 Aurelius84 提交于 1月 27, 2022

* Support allocate_from in Tensor and allocate_data in Context

* fix #ifdef CUDA

* fix cycle depends

* fix test_xxx_dev_api failed

* fix windows compiling error

* fix unittest

* modify into PImpl

* fix selected rows

* add TODO comment

* refine interface according reviewer

5631da9c

24 1月, 2022 1 次提交

石

[Refactoring Tensor PR ] replace storage with pten allocation (#39085) · a56e16a7

由石晓伟提交于 1月 24, 2022

* updates callers, test=develop

* updates tensor, test=develop

* fixes errors, test=develop

* remove some dtypes, test=develop

* fix errors in the base storage modification, test=develop

* fixes a bug, test=develop

* fixes the bugs in push the whole, test=develop

* updates, test=develop

* update

* update, test=develop

* fixes the mac-py3 CI, test=develop

* remove the storage impl, test=develop

* updates some codes, test=develop

* update, test=develop

* updates pten allocation, test=develop

a56e16a7

20 1月, 2022 1 次提交
- A
  [Pten] Migrate bfloat16/float16/complex from paddle::platform into pten::common (#39044) · f1143f0c
  由 Aurelius84 提交于 1月 20, 2022
```
* Migrate bfloat16/float16/complex from platform into pten::common

* fix typo

* fix code style
```
  f1143f0c
18 1月, 2022 2 次提交

[Unify Tensors PR ] Merged Tensor into DenseTensor, test=allcases (#38914) · 2052f1e3

由 Zhanlue Yang 提交于 1月 18, 2022

* Merged LoDTensor with Tensor,test=allcases

* Patched python level LoDTensor

* Patched python level LoDTensor

* Merge Tensor into DenseTensor

* Fixed namespace issues,test=allcases

* Fixed merge issues

* Fixed inference issues

* Fixed NPU test issues

* Fixed merge issues

2052f1e3

Y

break the circular dependency between reduce and elementwise (#38951) · a1980d9c
由 YuanRisheng 提交于 1月 18, 2022

a1980d9c

15 1月, 2022 1 次提交
- C
  
  replace last contextT (#38971) · 1053b1d5
  由 Chen Weihang 提交于 1月 15, 2022
  
  1053b1d5
13 1月, 2022 2 次提交

C
[PTen] Rename kernel register marco (#38861) · 158bf13f
由 Chen Weihang 提交于 1月 13, 2022
```
* rename register marco

* fix error changing

* fix format error
```
158bf13f

[pten]Remove pten/include dir files (#38878) · 7e0292ea

由 chentianyu03 提交于 1月 13, 2022

* move dot_dev api into dot_kernel.h

* add infermate header

* modify to dotkerel in dot_op.h

* mvoe conj dev api into complex_kernel.h

* move sign dev api into  sign_kernel.h

* move scale dev api into kernel.h and remove infermete.h

* rm paddle/pten/include/math.h

* rm paddle/pten/include/math.h

* rm include dir

* rm paddle/pten/include/math.h

* fix conflict with develop branch

* rm devContext in conj_op.h

* add the missing complex_kernel header

7e0292ea

12 1月, 2022 1 次提交
- Z
  
  [part 1]change type of function args (#38885) · df5d55bb
  由 Zhang Ting 提交于 1月 12, 2022
  
  df5d55bb
04 1月, 2022 1 次提交
- N
  Add OpFunctor and replace cast, scale, clip, bce_loss and abs_grad with... · 6eac06e3
  由 niuliling123 提交于 1月 04, 2022
```
Add OpFunctor and replace cast, scale, clip, bce_loss and abs_grad with elementwise_no_broadcast (#38500)
```
  6eac06e3
21 12月, 2021 2 次提交
- C
  [PTen] Rename cuda dir and context to gpu (#38296) · dc7597e3
  由 Chen Weihang 提交于 12月 21, 2021
```
* rename cuda to gpu

* revert CMake change

* resolve conflit

* rename other cuda to gpu

* poish details
```
  dc7597e3
- C
  [PTen] Remove eigen and blas directory (#38291) · d9fcdc3a
  由 Chen Weihang 提交于 12月 20, 2021
```
* remove eigen and blas dir

* fix declare error
```
  d9fcdc3a
20 12月, 2021 1 次提交
- Z
  
  move the directory of fill kernels in pten (#38219) · 06128b9f
  由 zyfncg 提交于 12月 20, 2021
  
  06128b9f

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功