提交 · 5fef043dc0950c2884ad538303f8f56ee3b1c86f · PaddlePaddle / Paddle

18 10月, 2022 2 次提交

[cherry-pick 2.4] add sparse api transpose/reshape/is_same_shape (#47076) · 5fef043d
由 zhouweiwei2014 提交于 10月 18, 2022
```
新增sparse.is_same_shape、sparse.reshape、sparse.transpose 三个API
```
5fef043d

Cherry pick for sharding (#47061) · 5b642140

由 Yuang Liu 提交于 10月 18, 2022

* [dygraph sharding] Overlap the reduce and the caculation for sharding stage 2. (#46495)

* [dygraph sharding stage 2] sharding broadcast overlap (#46656)

* Multi groups for broadcast of sharding stage 2 (#46894)

5b642140

17 10月, 2022 4 次提交

[Cherry-pick] Collective communication APIs (#46922) · 5fba2a98

由 Wen Sun 提交于 10月 17, 2022

* Support both use_calc_stream and sync_op in send recv APIs (#46023)

* Support both use_calc_stream and sync_op in allgather API (#46295)

* Support both use_calc_stream and sync_op in collective communication API (#46761)

* Move group and all reduce from collective to communication (#45848)

* Completes bfloat16 dtype for collective api in eager mode (#45844)

* Fix collective APIs cannot be recognized when building docs (#46962)
Co-authored-by: NLiYuRio <63526175+LiYuRio@users.noreply.github.com>

5fba2a98

Z
[cherry-pick]Sparse static graph (#46838) · 10225d22
由 zhangkaihuo 提交于 10月 17, 2022
```
cherry-pick : #46322, #46245
Sparse API 支持静态图
```
10225d22
A

fix ut timeout 2 (#45233) (#46867) · d913bc98
由 Allen Guo 提交于 10月 17, 2022

d913bc98
A

rm fp16 dtype_check (#46739) (#46866) · a1cdbad1
由 Allen Guo 提交于 10月 17, 2022

a1cdbad1

14 10月, 2022 6 次提交
- W
  
  cherry-pick 46942 (#47015) · 82db4993
  由 Wilber 提交于 10月 14, 2022
  
  82db4993
- G
  
  update quantization new format (#46529) · 84333cf5
  由 Guanghua Yu 提交于 10月 14, 2022
  
  84333cf5
- X
  
  Add bmm convert (#47011) · 8f1ac7cf
  由 xiaoxiaohehe001 提交于 10月 14, 2022
  
  8f1ac7cf
- A
  [BUG]Fix expand_as_v2 bug while X and Y with different dtype (#46950) (#46999) · 4b472656
  由 Aurelius84 提交于 10月 14, 2022
```
* [BUG]Fix expand_as_v2 bug while X and Y with different dtype

* fix commit
```
  4b472656
- Z
  [cherry-pick 2.4][inference] fix reshape2 opteller (#46871) · 535d7574
  由 Zhang Jun 提交于 10月 14, 2022
```
* fix reshape2 opteller;
add elementwise min/max register for tensorrt
```
  535d7574
- Z
  
  [Paddle-TRT] support new quant format from slim (#46022) (#46979) · b8677c0d
  由 zhoutianzi666 提交于 10月 14, 2022
  
  b8677c0d
13 10月, 2022 2 次提交

傅
[Cherry-pick] Add fp16 dtype support for set_value op (#46906) · 100a0750
由傅剑寒提交于 10月 13, 2022
```
Fix set_value failure when source tensor is fp16 Dtype and destiny value is a number
(dev PR link:#46801)
```
100a0750

[cherry-pick] [PHI] transpose2_grad op migration (#46139) (#46873) · 0280c0b9

由 Sławomir Siwek 提交于 10月 13, 2022

* Revert pool+grad oneDNN kernel conversion (#45989)

* [PHI] transpose2_grad op migration (#46139)

* op migrated, Copy(OneDNNContext, ...) added

* mutable_data & op registration in fluid removed

* refactoring

* OneDNNGetDataType to uppercase

* missing cpu check added, handler moved to .h file

* name changed to transpose_grad

* Copy changed back to TensorCopy

* Resizing corrected, Copy(OneDNNContext) removed
Co-authored-by: NPiotr Paturej <48731682+piotrekobi@users.noreply.github.com>
Co-authored-by: NPaulina Gacek <paulina.gacek@intel.com>

0280c0b9

12 10月, 2022 2 次提交
- N
  [Cherry-pick]Update layout autotune for module with no modified (#46541) (#46515) (#46880) · 61273c0e
  由 niuliling123 提交于 10月 12, 2022
```
Cherry-pick 46541
保证Reset50 TSM deeplabv3模型零修改下实现Layout自动调优
```
  61273c0e
- R
  cherry pick pr46536 (#46901) · 08d233f9
  由 ronnywang 提交于 10月 12, 2022
```
cherry pick pr46536 
```
  08d233f9
11 10月, 2022 1 次提交
- S
  
  hard_swish grad (#46857) · 2c6bd4ad
  由 Sławomir Siwek 提交于 10月 11, 2022
  
  2c6bd4ad
10 10月, 2022 1 次提交
- F
  Fix gather op convert for Paddle-TensorRT (#46779) (#46825) · a0e03418
  由 feng_shuai 提交于 10月 10, 2022
```
* fix gather op convert to only support int32 index as input.
* add ut
```
  a0e03418
09 10月, 2022 1 次提交

[Dy2Static] refactor the return transformer (#45900) (#46205) · 4282af69

由 xiongkun 提交于 10月 09, 2022

* 1. refactor the return transformer.
2. fix some bugs in return transformer.

* support raise error while return stmt's father is For or while

* fix ci error.

* fix ci error and add some unittest

* code format

* fix ci error

4282af69

29 9月, 2022 2 次提交
- 傅
  [cherry-pick] Add FP16 support for uniform in dygraph mode on Nvidia GPU (#46641) · a58663f3
  由傅剑寒提交于 9月 29, 2022
```
Add FP16 support for uniform in dygraph mode on Nvidia GPU
Dev PR link PR46212
```
  a58663f3
- W
  
  Fix the half precision problem of general plugin (#46580) · d90db9bd
  由 weishengying 提交于 9月 29, 2022
  
  d90db9bd
28 9月, 2022 1 次提交
- Z
  
  remove trt_reshape2_matmul_fuse_pass (#46363) · a77a6f6b
  由 zhoutianzi666 提交于 9月 28, 2022
  
  a77a6f6b
27 9月, 2022 3 次提交

Z

[AutoParallel] fix amp o1 (#46391) (#46481) · 5dab0b0d
由 zhaoyingli 提交于 9月 27, 2022

5dab0b0d

[cherry-pick] clear extra attrs of some ops in OpMaker (#45845, #45984, 46060) (#46218) · 0cc2251f

由 zyfncg 提交于 9月 27, 2022

* Clear extra attrs of elementwise op in OpMaker (#45845)

* clear extra attrs of elementwise op in opmaker

* fix op_debug_string_test

* fix bug of grad_add

* fix sort of runtime attrs

* Clear extra attrs of scale in OpMaker (#45984)

* clear extra attr of scale in opmaker

* fix sum bug

* fix merge conflict

* fix minus

* Clear extra attributes of some Op in OpMaker (Part4) (#46060)

* clear extra attr of some ops in opmaker

* revert clear use_cudnn for pool

* fix test_operator_desc

* fix Attr interface of OperatorBase

* fix code stype

0cc2251f

L

change use_calc_stream to sync_op (#46182) (#46493) · 8089a1fb
由 LiYuRio 提交于 9月 27, 2022

8089a1fb

26 9月, 2022 2 次提交
- F
  
  fix conflict (#46388) · 4a8aa6d8
  由 feifei-111 提交于 9月 26, 2022
  
  4a8aa6d8
- H
  [cherrypick] Fix elementwise_sub sign reverse for mkldnn (#46107) · 6990edfe
  由 Hui Zhang 提交于 9月 26, 2022
```
* fix sub sign reverse for mkldnn

* refactor code as comment

* remove useless
```
  6990edfe
23 9月, 2022 2 次提交
- A
  
  [OpAttr]Fix dropout2d/3d static API (#46434) · 55f73ba5
  由 Aurelius84 提交于 9月 23, 2022
  
  55f73ba5
- A
  [Cherry-Pick][BugFix]Fix reduce_mean/min/sum/prod, cumsum grad_op infershape bug (#46409) · 484377cd
  由 Aurelius84 提交于 9月 23, 2022
```
* [BugFix]Fix reduce_mean/min/sum/prod, cumsum grad_op infershape bug

* fix typo

* fix typo
```
  484377cd
22 9月, 2022 1 次提交

logger manager (#45909) (#46087) · 7eb046c7

由 Roc 提交于 9月 22, 2022

uniform logger manager in FleetAPI.
hidde API under distributed/utils which users don't need.

7eb046c7

21 9月, 2022 2 次提交
- A
  [Cherry-pick][BugFix]Fix pooling output_size bug if encounter list[Tensor] (#46360) · cc3e7cd8
  由 Aurelius84 提交于 9月 21, 2022
```
* [Check]Enhance pooling output_size type check

* add unittest
```
  cc3e7cd8
- G
  
  remove tmp fp32 var for gaussian_random (#46285) · b027652b
  由 Guoxia Wang 提交于 9月 21, 2022
  
  b027652b
20 9月, 2022 8 次提交

Z
[Paddle-TRT][Cherry-Pick]Fix cast bug (#46293) · 230b9a82
由 zhoutianzi666 提交于 9月 20, 2022
```
* fix cast bug
```
230b9a82
H
[PolishComments] Polish some code comments (#46032) (#46261) · 42e56f65
由 HongyuJia 提交于 9月 20, 2022
```
* polish code comments

* polish data_device_transform.cc
```
42e56f65

[Cherry-Pick][AutoParallel] change import way and fix strategy (#46270) · c43ebfcf

由 zhaoyingli 提交于 9月 20, 2022

* [Auto Parallel] Change the import way of Auto Parallel (#46115)

* fix strategy (#46256)

* [Auto Parallel] performance improvement for Sharding-DP hybrid parallelism (#46180)

* remove no need grad allreduce communication when sharding-dp

* remove no need grad allreduce communication when sharding-dp

* bugfix

* bugfix

* bugfix
Co-authored-by: NYulong Ao <aoyulong@baidu.com>
Co-authored-by: NJZ-LIANG <jianzhongliang10@gmail.com>

c43ebfcf

Z
[Paddle-TRT] Support matmul_v2 in Paddle-TensorRT (#46177) · 654807cd
由 zhoutianzi666 提交于 9月 20, 2022
```
* Support matmul_v2 in Paddle-TensorRT converter.
```
654807cd
W
Fix TransDataBackend Error when call unsqueeze using MKL Tensor (#46094) (#46186) · 50340302
由 WangZhen 提交于 9月 20, 2022
```
* Fix TransDataBackend Error when call unsqueeze using MKL Tensor

* Add UT

* Refine UT
```
50340302

[Cherry-pick] Sparse add InferMeta (#46235) · fd8ec4a1

由 zhangkaihuo 提交于 9月 20, 2022

cherry-pick : #46016, #46021, #45974

* [Sparse]Sparse add support gpu (#45974)

* [Sparse]Remove unused code (#46021)

* [Sparse] Add infer meta (#46016)

fd8ec4a1

(cherry-pick)Support some op refuse forward and fix some bugs (#46211) · bc92d5f5

由 Charles-hit 提交于 9月 20, 2022

* support cast op backward refuse forward and fix some bugs (#46173)

* support cast op backward refuse forward

* Fix the bug of high order unit test framework

* support sign op backward refuse forward (#46002)

bc92d5f5

[Cherry-pick] Update layoutautotune for inplace (#45826) (#46226) · c0324e82

由 niuliling123 提交于 9月 20, 2022

cherry-pick from #45826
LayoutAutotune 支持 inplace 类型的OP
 根据 Add eager layout autotune #45409 修改意见调整UseAutotune
将LayoutAutotune判断放到controller中，与AMP 判断保持一致

c0324e82

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功