Commits · 70120c7f98229df0697657c336c83654db5c185e · Crayon鑫 / Paddle

05 May, 2022 2 commits
- fix unittest of conv2d due to V100 do not support bfloat16 (#42483) · 70120c7f
  Committed by wangxinxin08 on May 05, 2022
  
  70120c7f
- fix the v100 cuda11.2 matmul_v2 and elementwise_div bug (#42477) · 98c3f85e
  Committed by wawltor on May 05, 2022
  
  98c3f85e
03 May, 2022 1 commit

Hotfix Release 2.3 Bug for CUDA 11.2 (#42437) · b0a64800

Committed by Huihuang Zheng on May 03, 2022

This PR hotfixed the `test_cond.py` in CUDA 11.2

The bug is that the `fill_constant` op returns a wrong value in the modified test case `test_extremely_simple_net_with_op_in_condition`. To reproduce it, add `layers.Print(a)` and `layers.Print(b)` to the test case: `fill_constant` returns a value on the order of `e-50` instead of `1.23` and `1.25`.

This PR hotfixes the test by comparing against the value of `b` instead of the literal number, which ensures the `cond` logic is still verified. **However, this PR does not fix `fill_constant`.** The engineers working on that op still need to find and fix the underlying bug.
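
The reasoning behind the hotfix can be sketched in plain Python, without Paddle. Everything here is an illustrative assumption: `fake_fill_constant` and `run_net` are hypothetical stand-ins (not Paddle APIs), and the condition `a - b < -1.0` is assumed only so that the `b` branch is selected. The sketch shows why checking the output against the runtime value of `b` still validates the branch logic when the constant op misbehaves, while a check against the literal does not.

```python
def fake_fill_constant(value, broken=False):
    """Hypothetical stand-in for a constant-filling op.

    With broken=True it returns a tiny garbage value, mimicking the
    'e-50' results described in the commit message.
    """
    return 1e-50 if broken else value


def run_net(broken):
    """Toy network: build two constants and select one with a condition."""
    a = fake_fill_constant(1.23, broken)
    b = fake_fill_constant(1.25, broken)
    # Assumed condition: false for the intended values, so b is selected.
    out = a if (a - b) < -1.0 else b
    return out, b


# Healthy constant op: both styles of check pass.
out_ok, b_ok = run_net(broken=False)
literal_check_ok = (out_ok == 1.25)
value_check_ok = (out_ok == b_ok)

# Broken constant op: the literal check fails, the value check still passes.
out_bad, b_bad = run_net(broken=True)
literal_check_bad = (out_bad == 1.25)
value_check_bad = (out_bad == b_bad)
```

The value-based check isolates the conditional logic from the constant op, which is also why it cannot detect the underlying `fill_constant` problem.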

b0a64800

29 April, 2022 9 commits
- modify reshape to reshape2 in paddle.nn.initializer.dirac (#42396) · eca6638c
  Committed by zhouweiwei2014 on April 29, 2022
  
  eca6638c
- add unit test for batch_norm and leaky_relu (#42369) · dbe189b1
  Committed by YuanRisheng on April 29, 2022
  
  dbe189b1
- Make einsum_v2 support multi-operands (#42327) · 32cae24c
  Committed by xiongkun on April 29, 2022
```
* Extend python einsum interface to make einsum_v2 support multi-operands and switch it to default.

* add opt_einsum dependence

* add yaml and support eager model

* fix by code review
```
  32cae24c
- [Eager] Support test_diff_op switch to eager mode (#42360) · 21d94dd3
  Committed by Weilong Wu on April 29, 2022
  
  21d94dd3
- [Eager] Remove enable_legacy_dygraph setting (#42363) · 05d6be7e
  Committed by Weilong Wu on April 29, 2022
```
* [Eager] Remove enable_legacy_dygraph setting

* Add more tests
```
  05d6be7e
- [Eager] Support test_label_smooth_functional switch to eager mode (#42366) · c3852b08
  Committed by Weilong Wu on April 29, 2022
  
  c3852b08
- [Eager] Support test_eigh_op switch to eager mode (#42379) · 08f07dcb
  Committed by Weilong Wu on April 29, 2022
  
  08f07dcb
- Add some double/triple grad kernel yaml file (#42361) · 24ec6ed0
  Committed by YuanRisheng on April 29, 2022
```
* add double yaml

* add inline func
```
  24ec6ed0
- [Dy2Stat]Fix losting pre/post hook from outermost layer while jit.save (#42273) · 27cf7afb
  Committed by Aurelius84 on April 29, 2022
```
* [Dy2Stat]Fix losting pre/post hook from outermost layer while jit.save

* fix kwargs

* fix unittest
```
  27cf7afb
28 April, 2022 3 commits

Add gradient merge for DistributedFusedLamb optimizer (#40177) · 108aeb28

Committed by sneaxiy on April 28, 2022

* add gradient merge for DistributedFusedLamb

* use master acc gradient

* fix CI ut

* polish

* remove math_function_impl.h change

* fix test_update_loss_scaling_op.py

* try to fix XPU/NPU CI

* add gm ut

108aeb28
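
For readers unfamiliar with the term, gradient merge (gradient accumulation) can be sketched in a few lines of plain Python. This is a generic illustration that swaps in plain SGD on a single scalar parameter; it is not the DistributedFusedLamb implementation from this commit, and the function name `sgd_with_gradient_merge` and its parameters are hypothetical.

```python
def sgd_with_gradient_merge(param, grads, lr, k_steps, average=True):
    """Apply SGD, but only update the parameter every k_steps micro-steps.

    Gradients from intermediate micro-steps are summed into an accumulator;
    on every k-th micro-step the merged gradient is applied and the
    accumulator is reset.
    """
    acc = 0.0
    history = []
    for step, grad in enumerate(grads, start=1):
        acc += grad
        if step % k_steps == 0:
            merged = acc / k_steps if average else acc
            param -= lr * merged
            acc = 0.0
        # Record the parameter value seen after each micro-step.
        history.append(param)
    return param, history


final_param, history = sgd_with_gradient_merge(
    1.0, [0.5, 1.5, 1.0, 3.0], lr=0.1, k_steps=2
)
```

With `k_steps=2`, the parameter is unchanged after the first and third micro-steps and moves only after the second and fourth, which emulates a batch twice as large.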

[CustomDevice]change import way of unpublished file in op_test test=allcases (#42285) · 62c0304b

Committed by Aganlengzi on April 28, 2022

* test op_test test=allcases

* fix

* avoid copy many same file

* fix for win

* test PYTHONPATH

* change path adding way

* fix win

* use old way

* use old way test=allcase

* use old way test=allcase

62c0304b

fix collections.Sequence in python3.10 (#42242) · edb61a52
Committed by pangyoki on April 28, 2022
```
* fix collections.Sequence in python3.10

* fix format
```
edb61a52
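
As background for this fix: the abstract base class aliases such as `collections.Sequence` were removed in Python 3.10, and the portable spelling is `collections.abc.Sequence`. The snippet below is a minimal sketch of that pattern; the helper name `is_sequence_but_not_str` is hypothetical and is not taken from the Paddle code base.

```python
from collections.abc import Sequence  # works on Python 3.3+ including 3.10


def is_sequence_but_not_str(obj):
    """Return True for list/tuple-like objects, excluding str and bytes."""
    return isinstance(obj, Sequence) and not isinstance(obj, (str, bytes))


checks = {
    "list": is_sequence_but_not_str([1, 2, 3]),
    "tuple": is_sequence_but_not_str((1, 2)),
    "str": is_sequence_but_not_str("abc"),
    "dict": is_sequence_but_not_str({"a": 1}),
}
```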

27 April, 2022 7 commits
- Added missing test for shuffle_channel_mkldnn_detect_pass (#42001) · 5134f110
  Committed by jakpiase on April 27, 2022
```
* added test for shuffle_channel_mkldnn_detect_pass

* added UT using new framework

* CI fix
```
  5134f110
- implement autotune python API (#42299) · 2094a584
  Committed by Zhang Ting on April 27, 2022
  
  2094a584
- [CustomDevice] op_test supports custom device (#42227) · 4df02fdf
  Committed by Aganlengzi on April 27, 2022
```
* [DO NOT MERGE] test op_test

* update with more related modifications

* split op_test.py to use test=allcases for testing

* split op_test.py to use test=allcases for testing
```
  4df02fdf
- Adjust the relative error of QR's grad (#42221) · 4c80385a
  Committed by Yulong Ao on April 27, 2022
  
  4c80385a
- [MLU]add dropout op (#42274) · acca0352
  Committed by qipengh on April 27, 2022
  
  acca0352
- add the support for allreduce_prod for new dygraph (#42284) · 89951472
  Committed by lilong12 on April 27, 2022
  
  89951472
- fix multinomial paddle_enforce bug (#42302) · 31c33122
  Committed by zhouweiwei2014 on April 27, 2022
  
  31c33122
26 April, 2022 6 commits

[PaddlePaddle Hackathon 2] No. 29: Add the PixelUnshuffle network-building API to Paddle (#40728) · 5be9b824

Committed by BrilliantYuKaimin on April 26, 2022

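* Add shape inference for PixelUnshuffle

* Add operator registration for PixelUnshuffle

* Add kernels for PixelUnshuffle and its gradient

* Add the operator description for PixelUnshuffle

* Add the operator signature for PixelUnshuffle

* Add PixelUnshuffle at the Python level

* Add unit tests for PixelUnshuffle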

* Update test_pixel_unshuffle.py

* test=document_fix

* Update test_pixel_unshuffle.py

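Add a test for extra_repr

* Fix code formatting

* Update test_pixel_unshuffle.py

Fix the test for extra_repr

* Change where the pixel_unshuffle kernel is implemented

* Fix code formatting

* Improve input checking

* Update test_pixel_unshuffle.py

* Improve input checking for pixel_unshuffle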

* Update pixel_unshuffle_op.cc

* Update unary.cc

* add pixel_unshuffle

* Update test_pixel_unshuffle.py

* Update vision.py

* Adjust code formatting

* Update vision.py

* Delete extra spaces

* Update pixel_unshuffle_sig.cc

* Update vision.py

* Update vision.py

* add PixelUnshuffleGradInferMeta

* remove PixelUnshuffleOpArgumentMapping

* Update pixel_unshuffle_op.cc

* Relocate the implementations of the pixel_unshuffle kernel and its gradient kernel

* Update pixel_unshuffle_op.cc

5be9b824
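
To illustrate what a pixel-unshuffle operation computes, here is a minimal pure-Python sketch on a single `[C, H, W]` nested list. It is not the kernel added by this commit: the function name `pixel_unshuffle_chw` is hypothetical, and the output channel ordering `c * r * r + i * r + j` is an assumption based on the common channels-first convention. Each `r x r` spatial block is moved into `r * r` channels, so a `[C, H, W]` input becomes `[C * r * r, H / r, W / r]`.

```python
def pixel_unshuffle_chw(x, r):
    """Rearrange a [C, H, W] nested list into [C*r*r, H//r, W//r].

    Assumed ordering: output channel c*r*r + i*r + j holds the pixels
    x[c][h*r + i][w*r + j] for every output position (h, w).
    """
    channels = len(x)
    height = len(x[0])
    width = len(x[0][0])
    if height % r != 0 or width % r != 0:
        raise ValueError("H and W must be divisible by the downscale factor")

    out = []
    for c in range(channels):
        for i in range(r):
            for j in range(r):
                plane = [
                    [x[c][h * r + i][w * r + j] for w in range(width // r)]
                    for h in range(height // r)
                ]
                out.append(plane)
    return out


# One 4x4 channel holding the values 0..15 in row-major order.
x = [[[0, 1, 2, 3], [4, 5, 6, 7], [8, 9, 10, 11], [12, 13, 14, 15]]]
y = pixel_unshuffle_chw(x, 2)
```

For the 4x4 example, `y` has four 2x2 channels, one for each offset `(i, j)` inside the 2x2 blocks.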

[Eager] Remove retain_grad_flag in accumulation_nade, add is_new_grad args in operator (#42240) · 2998a7d2
Committed by Weilong Wu on April 26, 2022

2998a7d2

[Eager] Fix final state adam in selected rows case (#42219) · 12311ddc

Committed by Weilong Wu on April 26, 2022

* [Eager] Support final_state_adam when argument grad (position 1) is selected_rows

* Remove needless code

* Add adam_dense_param_sparse_grad kernel

12311ddc

Add fused_multi_transformer op to optimize transformer generation performance (#41814) · 9dadf7df
Committed by WangXi on April 26, 2022

9dadf7df

Add Sparse MaxPool3D (#42130) · 18e9aafb
Committed by zhangkaihuo on April 26, 2022

18e9aafb

Add C++ EinsumOp which support 2 operands einsum. (#42105) · c7302f96

Committed by xiongkun on April 26, 2022

* full api fix

* when out is None, go old dygraph mode

* by static check

* first version: support 2-inputs forwards. TODO: 1. backward  2. BroadCast  3. MultiVariable

* time out -> 120

c7302f96

25 April, 2022 4 commits

[PaddlePaddle Hackathon 2] No. 24: Add the nn.ChannelShuffle network-building API to Paddle (#40743) · bbaaf217

Committed by BrilliantYuKaimin on April 25, 2022

* Add infermeta for ChannelShuffle

* Create channel_shuffle_grad_kernel.h

* Create channel_shuffle_kernel.h

* Create channel_shuffle_sig.cc

* Create channel_shuffle_op.cc

Description of the ChannelShuffle operator

* Create channel_shuffle_kernel_impl.h

Implementation of the ChannelShuffle kernel

* Create channel_shuffle_grad_kernel_impl.h

Implementation of the ChannelShuffle backward kernel

* Add kernel register of channel shuffle and grad

Register the ChannelShuffle kernel and its backward kernel

* add nn.functional.channel_shuffle

* add nn.ChannelShuffle

* Create test_channel_shuffle.py

* Update example of ChannelShuffle in vision.py

* Update test_channel_shuffle.py

* Change where the channel_shuffle kernel is implemented

* Fix code formatting

* Remove extra spaces

* Improve error checking for channel_shuffle

* Update unary.cc

* Update channel_shuffle_op.cc

* Update test_channel_shuffle.py

* Update unary.cc

* add channel_shuffle

* Update test_channel_shuffle.py

* Update vision.py

* Adjust code formatting

* Update channel_shuffle_sig.cc

* Update the documentation for ChannelShuffle

* Update the documentation for channel_shuffle

* remove ChannelShuffleOpArgumentMapping

* add ChannelShuffleGradInferMeta

* Update channel_shuffle_op.cc

* Relocate the channel_shuffle kernel and its gradient kernel

bbaaf217
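
As an illustration of the channel-shuffle idea, the sketch below permutes a flat list of channels in pure Python. It is not the kernel from this commit, and the function name `channel_shuffle` with its `groups` parameter is used here only for illustration. The permutation follows the usual description: view the `C` channels as a `[groups, C / groups]` grid, transpose it, and flatten it again.

```python
def channel_shuffle(channels, groups):
    """Interleave channels across groups.

    Equivalent to reshaping the channel axis to [groups, C // groups],
    transposing to [C // groups, groups], and flattening.
    """
    count = len(channels)
    if count % groups != 0:
        raise ValueError("number of channels must be divisible by groups")
    per_group = count // groups
    return [
        channels[g * per_group + j]
        for j in range(per_group)
        for g in range(groups)
    ]


shuffled_3 = channel_shuffle(list(range(6)), groups=3)
shuffled_2 = channel_shuffle(list(range(6)), groups=2)
```

With six channels and three groups, the groups `[0, 1]`, `[2, 3]`, `[4, 5]` are interleaved so that each consecutive run of output channels draws one channel from every group.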

[Eager] Support div(scalar) in eager mode (#42148) · f4ce8a92

Committed by Weilong Wu on April 25, 2022

* [Eager] Support div scalar in eager mode

* Updated and remove debug logs

* Remove list, use 'or' directly

* Remove useless statement

f4ce8a92

[Eager] Remove redundancy code, fix fp16 case (#42169) · 3b8f8b6c
Committed by Weilong Wu on April 25, 2022

3b8f8b6c

[Eager] Support numpy.ndarry in CastNumpy2Scalar (#42136) · 4a16d5c6
Committed by Weilong Wu on April 25, 2022

4a16d5c6

24 April, 2022 3 commits
- fix python3.10 compile bug on windows (#42140) · 13190707
  Committed by zhouweiwei2014 on April 24, 2022
  
  13190707
- disable unittest failed in eager CI in temporary (#42101) · d6b66924
  Committed by pangyoki on April 24, 2022
```
* test=py3-eager

* test=py3-eager

* test=py3-eager
```
  d6b66924
- refine optest logic for bfloat16 (#42151) · 532c3b4c
  Committed by zhangbo9674 on April 24, 2022
  
  532c3b4c
22 April, 2022 5 commits

Support triple grad check of op in Eager mode (#42131) · 34ac7b74
Committed by YuanRisheng on April 22, 2022
```
* support 3-rd order gradient

* change code format
```
34ac7b74

Ssd sparse table (#41812) · cca57c4a

Committed by zhaocaibei123 on April 22, 2022

* [cherry-pick2.3]fix compile bug of windows cuda11.5 (#41464)

cherry-pick

fix compile bug of windows cuda11.5 #41433

* fix bug of missing boost when compile cache.cc (#41449)

[chery-pick #41430] fix bug of random compile failure, due to incorrect compile order of dependencies

* Fix eager try catch (#41438) (#41477)

[Cherry-Pick]Fix eager try catch (#41438)

* Cherry-pick-PR41407, fix device_id bug for final_state op in multiprocess testcase (#41407) (#41475)

Cherry-pick PR #41407

* [BugFix] Add error hint for one_hot gpu version (#41335) (#41495)

* add one_hot gpu hint

* move allow_out_of_range judgement

* delete useless unittest

* fix bugs of reshape double grad infermeta (#41459) (#41493)

* [cherrypick-2.3] modify infer gpu memory strategy (#41427), remove cudnn_deterministic=True (#41341)  (#41491)
Co-authored-by: JingZhuangzhuang <75348594+JZZ-NOTE@users.noreply.github.com>

* [Cherry-pick][ROCm] fix dcu error in device event base, test=develop (#41523)

Cherry-pick of #41521

* [Cherry-Pick]Cherry pick PR41200, PR41474, PR41382 (#41509)

* Use `self`as a parameter of _hash_with_id function to avoid error caused by hash_id reuse (#41200)

* Add fill_constant_batch_size YAML and UT (#41474)

* Switch some dy2st UT to eager mode (#41382)

* Sitch some dy2st UT to eager mode

* Fix test_lstm and remove test_transformer

* Run test_resnet_v2 in old dy mode

* Unittest recover (#41431)

* update name

* update name

* fix test

* fix fleet bind

* update name

* update name

* fix test

* fix gpups wrapper

* remove Push/Pull/Load/Save with context in client and wrapper base class

* fix

* fix

* remove some interface

* fix

* remove

* code style

* recover

* fix

* remove code unused

* remove some unused table & accessor & CommonDenseTable => MemoryDenseTable

* fix

* fix

* fix

* recover

* remove unused code

* recover unittest

* fix

* remove

* fix

* remove code unuseful

* remove

* fix

* recover

* remove
Co-authored-by: esythan <esythan@126.com>

* add ssd sparse table

* fix

* add cache shuffle

* fix

* fix

* fix

* fix

* fix

* fix

* add unit test

* fix
Co-authored-by: Zhou Wei <1183042833@qq.com>
Co-authored-by: Sing_chan <51314274+betterpig@users.noreply.github.com>
Co-authored-by: 0x45f <23097963+0x45f@users.noreply.github.com>
Co-authored-by: pangyoki <pangyoki@126.com>
Co-authored-by: Siming Dai <908660116@qq.com>
Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
Co-authored-by: Zhang Jun <ewalker@live.cn>
Co-authored-by: JingZhuangzhuang <75348594+JZZ-NOTE@users.noreply.github.com>
Co-authored-by: Qi Li <qili93@qq.com>
Co-authored-by: esythan <esythan@126.com>

cca57c4a

Reduce performance influence by record event in python (#42040) · 4fd190d5
Committed by chenjian on April 22, 2022
```
* optimize performance

* fix

* improve coverage

* fix

* fix
```
4fd190d5

[WIP] Algorithm Cache of cuBlasLt Epilogue (#41010) · 19650d72

Committed by Ming-Xu Huang on April 22, 2022

* Fix leading dimension setting error in fused_gemm_epilogue_grad_op.

* Add dyload to cuBlasLt functions.

* Added cublasLtMatmulAlgoGetHeuristic to improve performance.

* Added FLAGS_cublaslt_exhaustive_search_times to cublasLt epilogue

* Added UTs to FLAGS_cublaslt_exhaustive_search_times

* Added warmup runs in algo searching of Gemm epilogue.

* Update copyright and documents.

* Fixed error handling.

19650d72

Add Sparse BatchNorm and fix two bugs (#42013) · 8a6456db
Committed by zhangkaihuo on April 22, 2022

8a6456db
