提交 · d5c51e6291d71f2a7a5734efec9b5685ff309deb · BaiXuePrincess / Paddle

24 11月, 2021 5 次提交

J

fix range op (#37486) · d5c51e62
由 Jiawei Wang 提交于 11月 24, 2021

d5c51e62

[Paddle-Inference] Matmul_int8_convert: tensor*tensor (#37285) · 16590799

由 Wangzheee 提交于 11月 24, 2021

* matmul_convert_int8

* matmul_convert_int8

* matmulconvert_int8

* Matmul_int8_convert: tensor*tensor

* Matmul_int8_convert: tensor*tensor

* Matmul_int8_convert: tensor*tensor

16590799

Z
Adapt auto search (#37490) · 025053b4
由 zhaoyingli 提交于 11月 24, 2021
```
* adapt auto search

* adapt auto search

* fix matmulv2 compatible

* del debug
```
025053b4
Y
[Auto Parallel] Add the unified cluster representation (#37091) · db727551
由 Yulong Ao 提交于 11月 24, 2021
```
* [Auto Parallel]  Add the unified cluster representation

* Add the local id for devices

* Add some comments
```
db727551

[Dy2stat]support pure fp16 for dy2stat (#36944) · 52edad6a

由 0x45f 提交于 11月 24, 2021

* run dy2stat pure fp16 in Linear model

* no use self._pure_fp16_inputs

* add test and fix Adam error in dy2stat pure fp16 training

* use paddle.optimizer.Adam

* run test in gpu

* change test time for CI

* enlarge atol for test_resnet_pure_fp16

* refine code and enlarge atol

* make custom_white_list and custom_black_list take effect for AMP and pure fp16

* check tracer is not None

* use default atol

* change filter_size

* change atol and add some NOTE

52edad6a

23 11月, 2021 8 次提交

P
fix inplace bug when the first grad_var(loss_grad) is inplace var (#37420) · ee1e1642
由 pangyoki 提交于 11月 23, 2021
```
* fix inplace bug

* fix custom grad input error

* add unittest

* fix inplace bug
```
ee1e1642
L
Add support bias is none for fused_attention op. (#37411) · 1a8786cf
由 Li Min 提交于 11月 23, 2021
```
Add support for bias is none for fused_attention op.
```
1a8786cf

Speedup download uncompress function (#37311) · 467099f0

由 CtfGo 提交于 11月 23, 2021

`paddle.utils.download` ：change to call `extractall` on tar/zip compressd file  to speed up the uncompress process when they includes many files

--- result of decompression speed comparison ---
1. dataset：https://paddlenlp.bj.bcebos.com/datasets/cnn_dailymail/cnn_stories.tgz, decompression time
：5m50s vs 20s
2. dataset：https://paddlenlp.bj.bcebos.com/datasets/cnn_dailymail/dailymail_stories.tgz, decompression time：33m20s vs 47s

467099f0

L
[new-exec] skip compiled program with places > 1 (#37457) · 2dfcdf21
由 Leo Chen 提交于 11月 23, 2021
```
* skip compiled program with places > 1

* fix corner case and add ut
```
2dfcdf21
W
[Paddle Inference] Fix_nearest: align_corners != true (#37368) · bc150edc
由 Wangzheee 提交于 11月 23, 2021
```
* fix_nearest

* fix_nearest

* fix_nearest

* fix_nearest
```
bc150edc
R
[NPU] Added HCCL backend support in dygraph mode (#36285) · 83e55cff
由 ronnywang 提交于 11月 23, 2021
```
* Added HCCL backend support in dynamic graph mode

* fix segmentation fault

* add ut
```
83e55cff
Z
Bug fix for snapshotting VariableWrapper with initialized tensor but e… (#37410) · e58ac121
由 Zhanlue Yang 提交于 11月 23, 2021
```
* Bug fix for snapshoting VariableWrapper with initialized tensor but empty allocation

* Added unittest for inplace&clear_gradient
```
e58ac121

[NewExe] Support layout/dtype transform by adding transfer_layout/transfer_dtype op (#37299) · 2a1f009e

由 Aurelius84 提交于 11月 23, 2021

* Add transfer_layout/dtype op

* clean useless codes

* fix unused var

* add optest in white.txt

* split into data_transfer.cc

* fix cmake

* modify according reviewer comment

* replace cast_op with transfer_dtype_op

2a1f009e

22 11月, 2021 13 次提交
- Z
  fix autoconvert (#37347) · 693c3c14
  由 zhaoyingli 提交于 11月 22, 2021
```
* fix autoconvert

* fix merge parameter
```
  693c3c14
- A
  Add isclose op (#37135) · d2200e97
  由 andyjpaddle 提交于 11月 22, 2021
```
* add isclose op, test=develop

* add isclose op, test=develop

* add isclose api, test=develop

* rm useless code

* rm useless code

* update python api of isclose

* add some unittest of isclose op, test=develop
```
  d2200e97
- 0
  [Dy2stat]Allow users to switch eval/train mode when using @to_static to... · eb602398
  由 0x45f 提交于 11月 22, 2021
```
[Dy2stat]Allow users to switch eval/train mode when using @to_static to decorate a function (#37383)

* Allow users to switch eval/train mode when using @to_static to decorate a function

* refine code for train() and eval()
```
  eb602398
- Z
  
  elu support alpha < 0 (#37316) · e3503de8
  由 zhupengyang 提交于 11月 22, 2021
  
  e3503de8
- Z
  Support zero value in dimension for slice (#37313) · e788c7b5
  由 zyfncg 提交于 11月 22, 2021
```
* support zero dim for slice op

* support zero dim Tensor in set_value op

* polish some debug log
```
  e788c7b5
- Z
  
  fix bug of indexing tensor with None (#37400) · de0cb386
  由 zyfncg 提交于 11月 22, 2021
  
  de0cb386
- Z
  
  Add backward function hook to dygraph (#37141) · 31344ab7
  由 Zhanlue Yang 提交于 11月 22, 2021
  
  31344ab7
- W
  shape api should not backward (#37340) · 21957476
  由 Wilber 提交于 11月 22, 2021
```
* shape api should not backward

* fix stop_gradient

* update

* update doc
```
  21957476
- J
  
  Refine autoscan pass (#37363) · 6c4621f1
  由 Jason 提交于 11月 22, 2021
  
  6c4621f1
- W
  Renamed Func and removed ENFORCE statement (#37348) · 2702af21
  由 Weilong Wu 提交于 11月 22, 2021
```
* Removed one ENFORCE statement

* Changed func name to _share_buffer_to

* Improve error reporting information

* Updated the logic of _is_share_buffer_to func
```
  2702af21
- bugfix in fleetrun when launching multiple machines training manually (#37274) · ead89b11
  由 Webbley 提交于 11月 22, 2021
  
  ead89b11
- Z
  [heterps]remove api for heter pipeline ps (#37396) · 0b250a79
  由 zmx 提交于 11月 22, 2021
```
* fix api. test=develop

* fix api. test=develop
```
  0b250a79
- L
  
  [new feature] add local scope for interpretercore (#37379) · 1f0512be
  由 Leo Chen 提交于 11月 22, 2021
  
  1f0512be
19 11月, 2021 8 次提交

W

Add dygraph triple grad test, broadcast case (#37377) · bb2733fa
由 Weilong Wu 提交于 11月 19, 2021

bb2733fa
L

bug fix shard_index (#37042) · b505ff96
由 lilong12 提交于 11月 19, 2021

b505ff96
add new API paddle.nn.initializer.Orthogonal and calculate_gain (#37163) · 62ad3594
由 zhouweiwei2014 提交于 11月 19, 2021
```
* add new API paddle.nn.initializer.Orthogonal and calculate_gain

* fix comment

* fix comment
```
62ad3594

Add fuse_resnet_unit pass (#36818) · 3cd3bf29

由 wuhuanzhou 提交于 11月 19, 2021

* GeneratePass support attr condition and mapping, test=develop

* fix coverage, test=develop

* Add fuse_resnet_unit pass, test=develop

* fix CI errors, test=develop

* fix CI errors, test=develop

* fix unittest error when compiling without CUDA, test=develop

* fix static ci error, test=develop

* limit kernel size must equal 1, test=develop

3cd3bf29

W

fix bug in save_inference_model (#37362) · 77bca4de
由 wangguanqun 提交于 11月 19, 2021

77bca4de

Add paddle.incubate.graph_send_recv API (#37205) · 39012536

由 Siming Dai 提交于 11月 19, 2021

* add cpu version, using set: sum, min, max

* add cpu version: mean

* improve cpu code and fix dynamic memory allcation problem

* fix arg error, add index judge, delete fp16

* fix bug in CudaAtomicMax and CudaAtomicMin

* add CUDA version

* fix grad_op bug for index

* add op test, add correct cpu grad op

* Add correct CUDA Mean grad

* [Add] Successful MEAN and SUM

* [Add] Successful MIN and MAX in CPU

* [Add] Successful MIN and MAX in CUDA

* fix windows dtype ci

* fix ROCM ci by adding HIP flag

* rename fused_gather_scatter to send_recv

* unify name as send and recv

* change zero index return time

* add send_recv incubate api

* fix index data type, add unittest case for API

* delete redundant input tensor

* fix en example and docs, add default value in pool_type

* add shape judge and max grid judge

* fix comment

* fix index type bug

* add const &

* fix en docs

* delete numpy in examples

* add unittest for int input

* fix send_recv comment

* change send_recv to graph_send_recv

39012536

Y

[fleet_executor] Parse pipeline config (#37319) · ca088f92
由 Yuang Liu 提交于 11月 19, 2021

ca088f92
0
[Dy2stat]Support `for i in [1,2,3]` statements in dy2stat (#37259) · d772a9aa
由 0x45f 提交于 11月 19, 2021
```
* support `for i in [1,2,3]` statements in dy2stat

* add test case

* fix ci

* remove wrong code
```
d772a9aa

18 11月, 2021 6 次提交

[heterps]change default executor for heter trainer (#37314) · c98d175d

由 zmx 提交于 11月 18, 2021

* fix pslib. test=develop

* add device to train_from_dataset. test=develop

* refine fleet.stop_worker. test=develop

* fix ut. test=develop

* fix ut. test=develop

* fix executor & ut. test=develop

* fix executor & ut. test=develop

* fix executor & ut. test=develop

c98d175d

Optimize fleet elastic scale in/out (#37177) · 6d34d266

由 xiayanming 提交于 11月 18, 2021

* fleet support elastic train

* fleet support elastic train

* support elastic

* add unittest

* fix unitest bug

* fix unittest bug

* fix unittest bug

* fix unittest coverage

* fix unittest coverage

* fix unittest coverage

* fix unittest coverage

* fix unittest coverage

* fix elastic bug

* fix ci fail

* fix ci fail

* fix elastic bug

* fix elastic bug

* fix joint debugging bug

* fix joint debugging bug

* fix windows ci failed

* fix windows ci failed

* Optimize fleet elastic scale in/out

* elastic support pre hook

* add prehook unittest

6d34d266

Z

Fix Layer.to() of device bug (#37156) · 706a7897
由 zhangbo9674 提交于 11月 18, 2021

706a7897
S

update unittest timeout (#37279) · 34a44d59
由 Shang Zhizhou 提交于 11月 18, 2021

34a44d59
Z

[heterps]add heterps mode judgement (#37298) · dd7189ff
由 zmx 提交于 11月 18, 2021

dd7189ff
Y

[fleet_executor] Parse runtime graph to start carrier (#37282) · f85bd5c9
由 Yuang Liu 提交于 11月 18, 2021

f85bd5c9

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致