Commits · e64829e25b3bd107e4fd6864121bd4f3b4922647 · Crayon鑫 / Paddle

25 Nov, 2021 5 commits
- [new-exec] fix program cache key (#37500) · e64829e2
  Committed by Leo Chen on Nov 25, 2021
```
* fix program cache key

* bug fix

* fix cache problem

* remove unused code
```
- Export task node to python (#37509) · 3f815e76
  Committed by LiYuRio on Nov 25, 2021
- Hot fix for dataloader thread error because of pten (#37520) · ed7a21de
  Committed by Chen Weihang on Nov 24, 2021
```
* hot fix for dataloader thread error

* polish comment

* fix type in comment, test=document_fix
```
- [PaddlePaddle Hackathon] 6. Add ZeroPad2d to Paddle (#37151) · 81861f69
  Committed by Matsumoto GAO on Nov 25, 2021
```
* add zeropad2d v0.1

* add zeropad2d v0.2

* add zeropad2d v0.3

* add zeropad2d v0.3

* add zeropad2d v0.3

* add zeropad2d v0.4

* add zeropad2d v0.5

* add zeropad2d v0.5 codestyle

* add zeropad2d v0.5 codestyle

* add zeropad2d v0.6 functional

* add zeropad2d v0.6 functional

* add zeropad2d v0.6 functional
```
- [new-exec] skip compiled program (#37512) · 171da2ce
  Committed by Leo Chen on Nov 25, 2021
```
* skip compiled program

* fix ut
```
24 Nov, 2021 6 commits

[GpuPs]pybind core (#37287) · d69daed1
Committed by Thunderbrook on Nov 24, 2021
```
* pybind core

* set use psgpu
```
fix range op (#37486) · d5c51e62
Committed by Jiawei Wang on Nov 24, 2021

[Paddle-Inference] Matmul_int8_convert: tensor*tensor (#37285) · 16590799
Committed by Wangzheee on Nov 24, 2021

* matmul_convert_int8

* matmul_convert_int8

* matmulconvert_int8

* Matmul_int8_convert: tensor*tensor

* Matmul_int8_convert: tensor*tensor

* Matmul_int8_convert: tensor*tensor

Adapt auto search (#37490) · 025053b4
Committed by zhaoyingli on Nov 24, 2021
```
* adapt auto search

* adapt auto search

* fix matmulv2 compatible

* del debug
```
[Auto Parallel] Add the unified cluster representation (#37091) · db727551
Committed by Yulong Ao on Nov 24, 2021
```
* [Auto Parallel]  Add the unified cluster representation

* Add the local id for devices

* Add some comments
```
[Dy2stat]support pure fp16 for dy2stat (#36944) · 52edad6a
Committed by 0x45f on Nov 24, 2021

* run dy2stat pure fp16 in Linear model

* no use self._pure_fp16_inputs

* add test and fix Adam error in dy2stat pure fp16 training

* use paddle.optimizer.Adam

* run test in gpu

* change test time for CI

* enlarge atol for test_resnet_pure_fp16

* refine code and enlarge atol

* make custom_white_list and custom_black_list take effect for AMP and pure fp16

* check tracer is not None

* use default atol

* change filter_size

* change atol and add some NOTE

23 Nov, 2021 8 commits

fix inplace bug when the first grad_var(loss_grad) is inplace var (#37420) · ee1e1642
Committed by pangyoki on Nov 23, 2021
```
* fix inplace bug

* fix custom grad input error

* add unittest

* fix inplace bug
```
Add support bias is none for fused_attention op. (#37411) · 1a8786cf
Committed by Li Min on Nov 23, 2021
```
Add support for bias is none for fused_attention op.
```
Speedup download uncompress function (#37311) · 467099f0

Committed by CtfGo on Nov 23, 2021

`paddle.utils.download`: change to call `extractall` on tar/zip compressed files to speed up the uncompress process when they include many files

--- result of decompression speed comparison (before vs after) ---
1. dataset: https://paddlenlp.bj.bcebos.com/datasets/cnn_dailymail/cnn_stories.tgz, decompression time: 5m50s vs 20s
2. dataset: https://paddlenlp.bj.bcebos.com/datasets/cnn_dailymail/dailymail_stories.tgz, decompression time: 33m20s vs 47s

[new-exec] skip compiled program with places > 1 (#37457) · 2dfcdf21
Committed by Leo Chen on Nov 23, 2021
```
* skip compiled program with places > 1

* fix corner case and add ut
```
[Paddle Inference] Fix_nearest: align_corners != true (#37368) · bc150edc
Committed by Wangzheee on Nov 23, 2021
```
* fix_nearest

* fix_nearest

* fix_nearest

* fix_nearest
```
[NPU] Added HCCL backend support in dygraph mode (#36285) · 83e55cff
Committed by ronnywang on Nov 23, 2021
```
* Added HCCL backend support in dynamic graph mode

* fix segmentation fault

* add ut
```
Bug fix for snapshotting VariableWrapper with initialized tensor but e… (#37410) · e58ac121
Committed by Zhanlue Yang on Nov 23, 2021
```
* Bug fix for snapshotting VariableWrapper with initialized tensor but empty allocation

* Added unittest for inplace&clear_gradient
```
[NewExe] Support layout/dtype transform by adding transfer_layout/transfer_dtype op (#37299) · 2a1f009e
Committed by Aurelius84 on Nov 23, 2021

* Add transfer_layout/dtype op

* clean useless codes

* fix unused var

* add optest in white.txt

* split into data_transfer.cc

* fix cmake

* modify according reviewer comment

* replace cast_op with transfer_dtype_op

22 Nov, 2021 13 commits
- fix autoconvert (#37347) · 693c3c14
  Committed by zhaoyingli on Nov 22, 2021
```
* fix autoconvert

* fix merge parameter
```
- Add isclose op (#37135) · d2200e97
  Committed by andyjpaddle on Nov 22, 2021
```
* add isclose op, test=develop

* add isclose op, test=develop

* add isclose api, test=develop

* rm useless code

* rm useless code

* update python api of isclose

* add some unittest of isclose op, test=develop
```
- [Dy2stat]Allow users to switch eval/train mode when using @to_static to... · eb602398
  Committed by 0x45f on Nov 22, 2021
```
[Dy2stat]Allow users to switch eval/train mode when using @to_static to decorate a function (#37383)

* Allow users to switch eval/train mode when using @to_static to decorate a function

* refine code for train() and eval()
```
- elu support alpha < 0 (#37316) · e3503de8
  Committed by zhupengyang on Nov 22, 2021
- Support zero value in dimension for slice (#37313) · e788c7b5
  Committed by zyfncg on Nov 22, 2021
```
* support zero dim for slice op

* support zero dim Tensor in set_value op

* polish some debug log
```
- fix bug of indexing tensor with None (#37400) · de0cb386
  Committed by zyfncg on Nov 22, 2021
- Add backward function hook to dygraph (#37141) · 31344ab7
  Committed by Zhanlue Yang on Nov 22, 2021
- shape api should not backward (#37340) · 21957476
  Committed by Wilber on Nov 22, 2021
```
* shape api should not backward

* fix stop_gradient

* update

* update doc
```
- Refine autoscan pass (#37363) · 6c4621f1
  Committed by Jason on Nov 22, 2021
- Renamed Func and removed ENFORCE statement (#37348) · 2702af21
  Committed by Weilong Wu on Nov 22, 2021
```
* Removed one ENFORCE statement

* Changed func name to _share_buffer_to

* Improve error reporting information

* Updated the logic of _is_share_buffer_to func
```
- bugfix in fleetrun when launching multiple machines training manually (#37274) · ead89b11
  Committed by Webbley on Nov 22, 2021
- [heterps]remove api for heter pipeline ps (#37396) · 0b250a79
  Committed by zmx on Nov 22, 2021
```
* fix api. test=develop

* fix api. test=develop
```
- [new feature] add local scope for interpretercore (#37379) · 1f0512be
  Committed by Leo Chen on Nov 22, 2021
19 Nov, 2021 8 commits

Add dygraph triple grad test, broadcast case (#37377) · bb2733fa
Committed by Weilong Wu on Nov 19, 2021

bug fix shard_index (#37042) · b505ff96
Committed by lilong12 on Nov 19, 2021

add new API paddle.nn.initializer.Orthogonal and calculate_gain (#37163) · 62ad3594
Committed by zhouweiwei2014 on Nov 19, 2021
```
* add new API paddle.nn.initializer.Orthogonal and calculate_gain

* fix comment

* fix comment
```
Add fuse_resnet_unit pass (#36818) · 3cd3bf29
Committed by wuhuanzhou on Nov 19, 2021

* GeneratePass support attr condition and mapping, test=develop

* fix coverage, test=develop

* Add fuse_resnet_unit pass, test=develop

* fix CI errors, test=develop

* fix CI errors, test=develop

* fix unittest error when compiling without CUDA, test=develop

* fix static ci error, test=develop

* limit kernel size must equal 1, test=develop

fix bug in save_inference_model (#37362) · 77bca4de
Committed by wangguanqun on Nov 19, 2021

Add paddle.incubate.graph_send_recv API (#37205) · 39012536
Committed by Siming Dai on Nov 19, 2021

* add cpu version, using set: sum, min, max

* add cpu version: mean

* improve cpu code and fix dynamic memory allocation problem

* fix arg error, add index judge, delete fp16

* fix bug in CudaAtomicMax and CudaAtomicMin

* add CUDA version

* fix grad_op bug for index

* add op test, add correct cpu grad op

* Add correct CUDA Mean grad

* [Add] Successful MEAN and SUM

* [Add] Successful MIN and MAX in CPU

* [Add] Successful MIN and MAX in CUDA

* fix windows dtype ci

* fix ROCM ci by adding HIP flag

* rename fused_gather_scatter to send_recv

* unify name as send and recv

* change zero index return time

* add send_recv incubate api

* fix index data type, add unittest case for API

* delete redundant input tensor

* fix en example and docs, add default value in pool_type

* add shape judge and max grid judge

* fix comment

* fix index type bug

* add const &

* fix en docs

* delete numpy in examples

* add unittest for int input

* fix send_recv comment

* change send_recv to graph_send_recv
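
The commit list above names four pool types (sum, mean, min, max) applied to messages sent along index pairs. A rough pure-Python sketch of that gather-then-reduce idea follows. It is not the Paddle kernel: the function below, its argument names, the zero row for nodes that receive no message, and the output having one row per input row are all assumptions made for illustration.

```python
def graph_send_recv(x, src_index, dst_index, pool_type="sum"):
    """Send row x[src_index[i]] to node dst_index[i], then reduce per node.

    Hypothetical reference sketch: x is a list of equal-length float rows.
    Nodes with no incoming message get a zero row (an assumption).
    """
    if len(src_index) != len(dst_index):
        raise ValueError("src_index and dst_index must have the same length")
    if pool_type not in ("sum", "mean", "min", "max"):
        raise ValueError("unknown pool_type: " + pool_type)

    width = len(x[0])
    # inbox[d] collects every row sent to destination node d.
    inbox = [[] for _ in x]
    for s, d in zip(src_index, dst_index):
        inbox[d].append(x[s])

    out = []
    for msgs in inbox:
        if not msgs:
            out.append([0.0] * width)
        elif pool_type == "sum":
            out.append([sum(col) for col in zip(*msgs)])
        elif pool_type == "mean":
            out.append([sum(col) / len(msgs) for col in zip(*msgs)])
        elif pool_type == "max":
            out.append([max(col) for col in zip(*msgs)])
        else:  # "min"
            out.append([min(col) for col in zip(*msgs)])
    return out


x = [[0.0, 2.0, 3.0], [1.0, 4.0, 5.0], [2.0, 6.0, 7.0]]
src = [0, 1, 2, 0]
dst = [1, 2, 1, 0]
summed = graph_send_recv(x, src, dst, "sum")
```

In this example node 1 receives rows 0 and 2, so its summed row is `[2.0, 8.0, 10.0]`, while nodes 0 and 2 each receive a single row unchanged.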

[fleet_executor] Parse pipeline config (#37319) · ca088f92
Committed by Yuang Liu on Nov 19, 2021

[Dy2stat]Support `for i in [1,2,3]` statements in dy2stat (#37259) · d772a9aa
Committed by 0x45f on Nov 19, 2021
```
* support `for i in [1,2,3]` statements in dy2stat

* add test case

* fix ci

* remove wrong code
```