Commits · f469f176d66b3e65df9756ff3dc8d98b301a0f63 · PaddlePaddle / Paddle

20 Jun, 2023 1 commit
- Remove redundant definition of MPTypeTrait. (#54756) · f469f176
  Committed by Yiqun Liu on Jun 20, 2023
  
  f469f176
19 Jun, 2023 1 commit
- Fix incorrect size of grid dimension in index_select (#54660) · 20bf9592
  Committed by Leo Chen on Jun 19, 2023
  
  20bf9592
16 Jun, 2023 2 commits
- fix batch_norm grad kernel nhwc error (#54703) · 4c6f77d8
  Committed by cyber-pioneer on Jun 16, 2023
  
  4c6f77d8
- fix batch_norm cuda grad kernel test mode bug (#54681) · eb9d07e5
  Committed by cyber-pioneer on Jun 16, 2023
  
  eb9d07e5
15 Jun, 2023 1 commit

exp/expm1 support int32/int64/float16 forward (#54556) · 58ae8c7c

Committed by Hui Zhang on Jun 15, 2023

* fix for log xxx

* add int32/int64 for cpu/gpu; add float16/bfloat16 for cpu forward

* fix docstring

* fix bug

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bug

* using cast

* fix test

* fix api

* fix other bugs

* fix ci bug for not using dygraph guard

* add bfloat16 test

* fix ut

* bf16

* exp/expm1 support int32/int64

* fix ut

* fix ut

* fix ut

58ae8c7c

14 Jun, 2023 3 commits
- support group_norm and cumsum prim ops bf16 dtype (#54580) · f7eb03c6
  Committed by Charles-hit on Jun 14, 2023
  
  f7eb03c6
- [Zero-Dim] paddle.nanmedian/nanquantile support 0D Tensor (#54500) · 3d4d995f
  Committed by zhouweiwei2014 on Jun 14, 2023
```
* [Zero-Dim] paddle.nanmedian support 0D Tensor

* fix CI
```
  3d4d995f
- Fix A100 CUDA12 ut (#54487) · a96c6dc7
  Committed by sneaxiy on Jun 14, 2023
```
* fix A100 CUDA12 ut

* fix ci uts

* fix test_sync_batch_norm_op

* fix sync bn op ut again by separating 2 files

* fix codestyle ci

* combine other PRs

* fix codestyle

* fix codestyle ci
```
  a96c6dc7
13 Jun, 2023 1 commit
- 【Hackathon 4 No.17】Add cummax / cummin API to Paddle (#53546) · 3a3fb1fe
  Committed by NetPunk on Jun 13, 2023
  
  3a3fb1fe
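
For readers unfamiliar with the operation named in this commit, here is a minimal pure-Python sketch of what a cumulative max / min computes over a 1-D sequence. It is an illustration only and not Paddle's kernel: the helper name `_cumulative_extreme` is hypothetical, and keeping the first index on ties is an assumption of this sketch, not documented behaviour of the Paddle API.

```python
import operator


def _cumulative_extreme(values, better):
    """Return (running extreme, index of that extreme) for each prefix.

    `better(a, b)` decides whether `a` should replace the current extreme `b`.
    Assumption of this sketch: on ties the earliest index is kept.
    """
    out, idx = [], []
    best_i = None
    for i, v in enumerate(values):
        if best_i is None or better(v, values[best_i]):
            best_i = i
        out.append(values[best_i])
        idx.append(best_i)
    return out, idx


def cummax(values):
    # Running maximum and the position where each maximum was found.
    return _cumulative_extreme(values, operator.gt)


def cummin(values):
    # Running minimum and the position where each minimum was found.
    return _cumulative_extreme(values, operator.lt)
```

Calling `cummax([1, 3, 2, 5, 4])` yields the running maxima together with the positions at which they occurred.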
12 Jun, 2023 1 commit

log/log10/log2/log1p support int32/int64/float16/bfloat16 forward (#54089) · 2ddd0473

Committed by Hui Zhang on Jun 12, 2023

* fix for log xxx

* add int32/int64 for cpu/gpu; add float16/bfloat16 for cpu forward

* fix docstring

* fix bug

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bug

* using cast

* fix test

* fix api

* fix other bugs

* fix ci bug for not using dygraph guard

* add bfloat16 test

* fix ut

* bf16

2ddd0473

08 Jun, 2023 1 commit

[AMP] Add check_numerics API. (#54301) · a5444592

Committed by Yiqun Liu on Jun 08, 2023

* Add outputs to check_numerics_kernel.

* Add check_numerics to yaml.

* Add API and unittest.

* Add check_nan_inf_level as argument of check_numerics_kernel.

* Add more unittests.

* Fix static API implementation and unittest.

* Move the implementation of check_numerics to paddle.amp.

* Fix import error.

a5444592
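
The commit above concerns checking tensors for NaN / Inf values. As a rough illustration of that idea, and not of the Paddle kernel or its API, the sketch below counts non-finite values in a flat list of floats. The function names `count_non_finite` and `assert_finite` are hypothetical.

```python
import math


def count_non_finite(values):
    """Return (number of NaN values, number of +/-Inf values)."""
    num_nan = sum(1 for v in values if math.isnan(v))
    num_inf = sum(1 for v in values if math.isinf(v))
    return num_nan, num_inf


def assert_finite(values, name="tensor"):
    """Raise FloatingPointError if any value is NaN or Inf."""
    num_nan, num_inf = count_non_finite(values)
    if num_nan or num_inf:
        raise FloatingPointError(
            f"{name}: found {num_nan} NaN and {num_inf} Inf value(s)"
        )
```

A debugging level, such as the `check_nan_inf_level` argument mentioned in the commit message, could decide whether to raise or only report; that policy is left out of this sketch.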

05 Jun, 2023 3 commits

【Hackathon 4 No.19】Add polygamma API to Paddle (#53791) · ed604569

Committed by PommesPeter on Jun 05, 2023

* feat: added polygamma init code

* feat: added polygamma unittest code

* test: added more test cases

* refactor: added forward impl

* refactor: added backward impl

* test: updated cases

* refactor: updated test cases

* refactor: added more case and fixed some bugs

* test: updated ref func

* refactor: updated code style

* refactor: move the code

* refactor: updated test

* refactor: updated test

* docs: updated en doc
Co-authored-by: Nzachary sun <70642955+sunzhongkai588@users.noreply.github.com>

* docs: updated math eq

---------
Co-authored-by: Nzachary sun <70642955+sunzhongkai588@users.noreply.github.com>

ed604569

[bug fix] group norm backward (#54341) · d338b2f8
Committed by wangzhen38 on Jun 05, 2023

d338b2f8

optimize logsumexp in small data scale (#52952) · 93e1bb98

Committed by Asthestarsfalll on Jun 05, 2023

* optimize logsumexp in small data scale

* fix

* fix

* add #pragma once

* switch to using aligned_vector and support arbitrary shapes

* fix store

* fix store

* refine for special cases

* try

* fix

* update

* fix

* fix all_reduce

* try

* fix rocm bug

* fix rocm bug

* fix rocm bug

* fix rocm bug

* fix rocm bug

* fix rocm bug

* fix rocm bug

* fix rocm bug

93e1bb98
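
As background for the operation this commit optimizes, the sketch below shows the standard numerically stable way to compute log-sum-exp, by subtracting the maximum before exponentiating. It is a plain-Python illustration over a non-empty list and says nothing about how the optimized GPU kernel is organized.

```python
import math


def logsumexp(xs):
    """Compute log(sum(exp(x) for x in xs)) without overflow.

    Uses the identity logsumexp(x) = m + log(sum(exp(x - m))) with m = max(x),
    so the largest exponent is exp(0) = 1.
    """
    m = max(xs)  # raises ValueError on an empty input
    if math.isinf(m):
        # All entries are -inf, or at least one is +inf: the result is m itself.
        return m
    return m + math.log(sum(math.exp(x - m) for x in xs))
```

A naive `math.log(sum(math.exp(x) for x in xs))` would overflow for inputs such as `[1000.0, 1000.0]`, while the shifted form stays finite.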

02 Jun, 2023 2 commits
- 【PaddlePaddle Hackathon 4】No.56: add fp and bf16 for bernoulli (#54232) · 85d5f26d
  Committed by Difer on Jun 02, 2023
```
* add fp&bf16 bernoulli

* add check_dtype & fix error

* fix rocm error
```
  85d5f26d
- [AMP] support master_grad for adam and momentum (#54240) · 703a64a3
  Committed by Zhang Ting on Jun 02, 2023
```
* support master_grad for adam and momentum

Co-authored-by: zhangting_2017@163.com <zhangting2020>
```
  703a64a3
30 May, 2023 1 commit

[AMP] Reimplement check_nan_inf as check_numerics_kernel. (#52245) · 44bd5927

Committed by Yiqun Liu on May 30, 2023

* Reimplement the check_nan_inf function as check_numerics kernel.

* Move the cpu implementation to phi.

* Add ifdef for the including of omp.h.

* Move the use of FLAGS_check_nan_inf_level out of header file.

* Implement a common PrintAndThrowError function.

* Fix the incorrect use of __NVCC__, which should be replaced with __CUDA_ARCH__.

* Add dependency of phi.

* Polish codes and unittest.

44bd5927

26 May, 2023 1 commit

[PHI Decoupling]Create PHI shared lib (#53735) · da50a009

Committed by YuanRisheng on May 26, 2023

* create phi so

* fix ci bugs

* fix py3 bugs

* add file

* fix py3 bugs

* fix windows bugs

* perfect so

* fix py3 bugs

* delete all static target in phi

* fix windows bugs

* fix py3 bugs

* fix ci bugs

* fix windows bugs

* fix bugs: gflags can't be linked by dynamic and static lib

* fix bugs that can not load 3rd party

* fix ci bugs

* fix compile bugs

* fix py3 bugs

* fix conflict

* fix xpu bugs

* fix mac compile bugs

* fix psgpu bugs

* fix inference failed

* deal with conflict

* fix LIBRARY_PATH bug

* fix windows bugs

* fix onednn error

* fix windows compile bugs

* fix windows compile bugs

* fix test_cuda_graph_static_mode_error aborted

* fix windows bugs

* fix mac-python3 error

* fix hip compile bugs

* change mode to static

* change to static mode

* fix ci bugs

* fix py3 bugs

* fix windows bugs

* fix bugs

* add static flag

* add PADDLE_API

* change position of PADDLE_API

* fix windows bugs

* change mode to dynamic lib

* fix windows static bugs

* deal with conflict

* fix windows unit bug

* fix coverage

* deal with conflict

* fix windows-inference

* fix py3 bugs

* fix bugs when compile type_info

* fix compile bugs

* fix py3 bugs

* fix windows bugs

* fix windows openblas

* fix xpu bugs

* fix enforce_test in windows

* update code according to comments

* fix windows cmake bug

* fix windows bugs

* fix windows bugs

* delete cinn unittest

* fix cinn bugs

---------
Co-authored-by: lzydev <1528794076@qq.com>

da50a009

25 May, 2023 1 commit
- Using a sorting method may achieve better performance. (#54045) · 6d1292ef
  Committed by zhangkaihuo on May 25, 2023
  
  6d1292ef
24 May, 2023 1 commit
- Update lerp_kernel.cu (#54071) · a299797d
  Committed by Winters Montagne on May 24, 2023
```
Removed unnecessary header files introduced
```
  a299797d
23 May, 2023 2 commits
- [AMP OP&Test] Support float16 in selu (#54030) · 6133ca4e
  Committed by Zhang Zheng on May 23, 2023
```
* [AMP OP&Test] Support float16 in selu

* fix
```
  6133ca4e
- fix typos (#53967) · c36a000d
  Committed by cyberslack_lee on May 23, 2023
  
  c36a000d
22 May, 2023 1 commit

Add multiclass_nms3 GPU kernel (#52401) · f71c805e

Committed by Tian Zheng on May 22, 2023

* Add GPU kernel for multiclass_nms3 op

* Make multiclass_nms3 gpu kernel output consistent with cpu kernel

* Fix API incompatibility

* Fix unittests on builds without CUDA

* Fix ROCM build

* Remove fluid headers; Use default atol for unittest

* Change function and variable naming

* Add comments; Reduce redundant code

* Use paddle test framework

f71c805e

19 May, 2023 3 commits
- test,test=develop (#53811) · 10758725
  Committed by Galaxy1458 on May 19, 2023
  
  10758725
- test,test=develop (#53843) · c1f4005a
  Committed by Galaxy1458 on May 19, 2023
  
  c1f4005a
- delete bf16 of cross entropy (#53922) · 69d3f4e3
  Committed by Danyang Zhang on May 19, 2023
```
* delete bf16 of cross entropy

* delete bf16 of cross entropy
```
  69d3f4e3
18 May, 2023 3 commits
- [AMP OP&Test] support prod, meshgrid, expand_as bf16 dtype (#53865) · 706503d0
  Committed by Charles-hit on May 18, 2023
```
* add meshgrid,expand_as, prod and grad bf16 kernel

* fix bf16 for optest

* modify code style

* fix amp test
```
  706503d0
- Add segment_pool tests (#53785) · 0bed2203
  Committed by co63oc on May 18, 2023
  
  0bed2203
- add fp16 and bf16 for trunc (#53876) · d8407c51
  Committed by LoneRanger on May 18, 2023
  
  d8407c51
17 May, 2023 1 commit
- 【Hackathon 4 No.21】Add i1 / i1e to paddle (#53210) · a63fb4c8
  Committed by LyndonKong on May 17, 2023
```
* Add i1 and i1e op

* resolve merge conflicts
```
  a63fb4c8
16 May, 2023 5 commits

Add huber_loss tests (#53535) · 74b91bce
Committed by co63oc on May 16, 2023

74b91bce

【Hackathon No57】add bf16 for mode (#53195) · 640cff0a

Committed by Difer on May 16, 2023

* add bf16 for mode

* remove random seed 666

* try to fix op_type error

* test for me

* try to fix op_type

* fix redundancy code

* add fp,bf for lastdim

* fix some error

* simplify code

* fix shape error

* optype error

* fix skipif bf16

640cff0a

【PaddlePaddle Hackathon 4 No.34】Optimize the GPU performance of the Lerp OP for Paddle (#53154) · e592534a

Committed by Winters Montagne on May 16, 2023

* modify lerp_kernel.cu

* pre-commit

* fix some CI issues

* fix some CI issues

* fix some CI issues

* fix some CI issues

* fix some CI issues

* fix some CI issues

* fix some CI issues

* fix some CI issues

* Add files via upload

fix some CI issues

e592534a

move cudnn_lstm kernel to phi (#53730) · 52889e38

Committed by huangjiyi on May 16, 2023

* update

* fix bug

* test

* test

* update

* update mutable_data

* fix bug

* update

* fix bug

* update output type reg

* update

* update

52889e38

[phi] move stft to phi - Step 1 (#53517) · 00c21abc

Committed by gouzil on May 16, 2023

* [phi]mv StftKernel to phi

* [phi] fix KernelSignature

* [phi]fix arr error

* [phi] Disable check_dygraph

* [phi]fix include

* [phi] rewrite mutable_data, add output register

* [phi] fix  Alloc

* [phi] fix Alloc again

* [phi] fix mutable_data

* [phi] fix onesided_out Resize

00c21abc

15 May, 2023 3 commits
- [BUG] fix windows kernel dispatch of _lzcnt bug (#53728) · 972daa46
  Committed by zhouweiwei2014 on May 15, 2023
  
  972daa46
- Transpose layout (#53351) · 3dce9f0a
  Committed by niuliling123 on May 15, 2023
```
* update

* Update backward.h

* Update composite_backward_api.h

* Update tensor_utils.cc

* Update backward.cc

* update

* style

* update

* add ctest

* code style
```
  3dce9f0a
- move OneHotRawKernel to legacy (#53200) · 34122e3e
  Committed by zhangyuqin1998 on May 15, 2023
```
* move OneHotRawKernel to legacy

* fix
```
  34122e3e
12 May, 2023 2 commits

【Hackathon 4 No.20】Add i0 / i0e to paddle (#52058) · ce256f75

Committed by PommesPeter on May 12, 2023

* added base code for i0 and i0e

* added grad base code for i0 and i0e

* added i0 and i0e python code

* added ops and backward yaml config

* added i0 and i0e cpu kernel, but not tested.

* added i0 and i0e code and unittest files

* added test files

* added i0/i0e gpu implementation code

* updated code style

* updated code style

* fixed unittest code

* updated i0 with eigen3

* fixed bug and added more test cases

* refactor: fixed static graph bug

* refactor: removed i0 and i0e from op_compat

* refactor: updated code style

* refactor: updated op_compat.yaml

* refactor: updated op_compat.yaml

* refactor: fixed op name mapping and optimize unittest case

* refactor: manually implement i0 / i0e

* refactor: added grad kernel for i0 / i0e, not finished yet

* Update math.py

* refactor: added equation to doc in English and added comments for computing i0 / i0e gradient

* refactor: removed eigen implementation

* refactor: finished i0 / i0e cpu and gpu op

* refactor: updated code style

* fix: found a bug but not fixed yet

* fix: incorrect unittest cases

* update: updated code style and remove my file

* update: updated unittest case

* fix: fixed sign error

* fix: fixed mistakes when merging

* refactor: updated code style

* refactor: remove unused code

* refactor: updated code style

ce256f75
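
For context on the functions this commit adds, i0 is the modified Bessel function of the first kind of order zero, and i0e is its exponentially scaled form, i0e(x) = exp(-|x|) * i0(x). The sketch below evaluates i0 through its power series, sum over k of (x/2)^(2k) / (k!)^2. It is a reference-style illustration for moderate |x| only; the `terms` parameter is an assumption of this sketch, and nothing here reflects how the Paddle CPU or GPU kernels are implemented.

```python
import math


def i0(x, terms=60):
    """Modified Bessel function I0(x) via its power series (moderate |x| only)."""
    half_sq = (x / 2.0) ** 2
    term, total = 1.0, 1.0  # k = 0 term is 1
    for k in range(1, terms):
        # Ratio of consecutive series terms is (x/2)^2 / k^2.
        term *= half_sq / (k * k)
        total += term
    return total


def i0e(x):
    """Exponentially scaled I0: exp(-|x|) * I0(x)."""
    return math.exp(-abs(x)) * i0(x)
```

The scaled form is useful because i0 grows roughly like exp(|x|), so i0e stays in a comfortable numeric range for inputs where i0 itself would be very large.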

fix add_n kernel of large shape (#53749) · 4d39cc7f
Committed by Leo Chen on May 12, 2023

4d39cc7f

PaddlePaddle / Paddle synced successfully about 1 year ago
