提交 · ff6507db5641ff673292098f4729b8efb2f028ff · PaddlePaddle / Paddle

08 12月, 2021 15 次提交

Y
[PTen]Add alias kernel name (#37881) · ff6507db
由 YuanRisheng 提交于 12月 08, 2021
```
* add alias kernel name

* modify code as suggestions
```
ff6507db

Add paddle.lerp API to do a linear interpolation (#37253) · 1716324c

由 wuhuanzhou 提交于 12月 08, 2021

* save temp

* add unittest, test=develop

* fix ci error, test=develop

* fix grad accuracy error, test=develop

* fix unused error, test=develop

* fix compilation error on Windows, test=develop

* add unittest, test=develop

* modify by review comment and add lerp_

* fix inplace api, test=develop

* fix inplace api, test=develop

* fix coverage error, test=develop

1716324c

C
add update func of auto search (#37867) · 46212b80
由 caozhou 提交于 12月 08, 2021
```
* add update func of auto search

* update unitest
```
46212b80
W

[fleet_executor] Add interceptor gc (#37889) · 6b48dfe9
由 WangXi 提交于 12月 08, 2021

6b48dfe9
Y

[fleet_executor] bug fix for fleet_executor, test=allcase (#37934) · 55b87742
由 Yuang Liu 提交于 12月 08, 2021

55b87742
C
implementation of broadcast sub backward by reduce (#37754) · 567e6bbc
由 crystal 提交于 12月 08, 2021
```
* add boardcast_sub

* add boardcast_sub
```
567e6bbc

Fix CUDAGraphAllocator bug for StreamSafeCUDAAllocator (#37821) · b4a67491

由 From00 提交于 12月 08, 2021

* Fix CUDAGraph bug for StreamSafeCUDAAllocator

* Add CUDAGrapthAllocator check in multi-stream interface

* Set FLAGS_use_stream_safe_cuda_allocator defaulted to false

* Fix environment error for cmake

* Fix cmake error

* Add UT of GetAllocatorInterfaceTest

* Add UT of CUDAGraphExceptionTest

* Enhance CUDAGraphExceptionTest

b4a67491

C

add check whether tensor is inplace and leaf when calcute gradient (#37931) · 2c02a580
由 chentianyu03 提交于 12月 08, 2021

2c02a580
F
fix: when ceil_model==true && Padding_algo!=SAME, (x-size)/stride != int, this... · d1ab323f
由 feng_shuai 提交于 12月 08, 2021
```
fix: when ceil_model==true && Padding_algo!=SAME, (x-size)/stride != int, this convert is wrong (#37929)
```
d1ab323f

[Eager] generate eager core ops, only 4 ops (#37813) · 52f63cd2

由 wanghuancoder 提交于 12月 08, 2021

* refine a test case, test=develop

* publish python c api for eager, test=develop

* revert modify about test_allclose_layer.py, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* delete numpy includes, use pybind11 numpy.h, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* suport eager error msg, and add grad test case, test=develop

* refine, test=develop

* refine, test=develop

* generate eager core ops, only 4 ops, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

52f63cd2

Enabled Eager AutoCodeGen for 40+ more operators (#37910) · cf873c39

由 Zhanlue Yang 提交于 12月 08, 2021

* Rearranged Eager AutoCodeGen directory structure

* Removed USE_OP in Eager AutoCodeGen

* Enabled generation for Operators without Grad/Inputs/Outputs

* Resolved operators without input

* Fixed merge conflicts

* Enabled Eager AutoCodeGen for 10+ more operators

cf873c39

Y

fix softmax max dim (#37901) · b5dd12fb
由 Yanxing Shi 提交于 12月 08, 2021

b5dd12fb
S

add pyyaml needed by python\paddle\utils\code_gen\api_gen.py (#37897) · a8f009e4
由 Sing_chan 提交于 12月 08, 2021

a8f009e4
S
Fix CUDA Graph H2D bug by restore host memory (#37774) · a1ad3a63
由 sneaxiy 提交于 12月 08, 2021
```
* fix CUDA Graph H2D bug again

* fix no return bug
```
a1ad3a63
Y

bug fix for adamw (#37905) · 9a2d327c
由 Yuang Liu 提交于 12月 08, 2021

9a2d327c

07 12月, 2021 25 次提交

L

[Fleet Executor] Add feed, fetch and check correctness (#37824) · b8793f70
由 LiYuRio 提交于 12月 07, 2021

b8793f70

introduce INF-RT (#37669) · 70dea138

由 Yan Chunwei 提交于 12月 07, 2021

* add infrt code

refined with Paddle's code style.

* rename CinnRtConfig to InfRtConfig

* rename CinnRt to InfRt of some code

* rename CINNRT to INFRT

* remove unnecessary code

* replace CINN to INFRT in the source code

* replace all "cinn" in code to "infrt"

* remove some const_cast

70dea138

X
add maxunpool2d in __all__ (#37698) · 890bd626
由 xiaoting 提交于 12月 07, 2021
```
* add maxunpool2d in __all__

* fix MaxUnPool2D example
```
890bd626
S

block MASM : warning A4018 when building cryptopp in windows with ninja (#37890) · ca6ff1f6
由 Sing_chan 提交于 12月 07, 2021

ca6ff1f6

Buf fix for reset grad inplace version (#37811) · cf586021

由 Zhanlue Yang 提交于 12月 07, 2021

* Debug

* Fixed issue with reset_grad_inplace_version when used with clear_gradient & cross-batch accumulation

* Rearranged interfaces

* Fixed ci issues

cf586021

S
update logsumexp doc (#37883) · 723cbe9f
由 Shang Zhizhou 提交于 12月 07, 2021
```
* update logsumexp doc

* update api doc

* update api doc
```
723cbe9f
Z

add cmake depend for api_gen.py (#37900) · 7e831b5a
由 zyfncg 提交于 12月 07, 2021

7e831b5a
T
Fix static git diff (#37914) · a754d907
由 tianshuo78520a 提交于 12月 07, 2021
```
* fix static git diff check

* test=document_fix
```
a754d907
W

ut support block (#37909) · cf5de26f
由 Wilber 提交于 12月 07, 2021

cf5de26f
D

fix filter_by_instag op for lod_level=0 without lod;test=develop (#37834) · b48545ee
由 danleifeng 提交于 12月 07, 2021

b48545ee
S
make some non_parallel unittest parallel execute (#37805) · b154110a
由 Sing_chan 提交于 12月 07, 2021
```
* make some non_parallel unittest parallel execute

* delete duplicate ut
```
b154110a
J
multithread memory optimize error fix (#37894) · 6b7b7677
由 JingZhuangzhuang 提交于 12月 07, 2021
```
* multithread_memory_optimize
```
6b7b7677
H
Set runtime_include_dir in Paddle.__init__.py (#37886) · e3cca8ac
由 Huihuang Zheng 提交于 12月 07, 2021
```
Paddle don't have to set runtime_include_dir during run CINN.
```
e3cca8ac
0
[Dy2Stat]Polish for zip in dy2stat (#37846) · 4e63d69b
由 0x45f 提交于 12月 07, 2021
```
* polish for zip in dy2stat

* polish comment

* polish is_builtin_len

* fix comment
```
4e63d69b
T
add some op to xpu2 op list && format xpu op list (#37832) · efd7a229
由 TTerror 提交于 12月 07, 2021
```
* format xpu op list

* format xpu op list

* update xpu1 op list
```
efd7a229

[Eager] fix cmake generate error, and fix circular import (#37871) · 79c25979

由 wanghuancoder 提交于 12月 07, 2021

* refine a test case, test=develop

* rm python, test=develop

* refine, test=develop

* fix cmake generate error, and fix circular import, test=develop

79c25979

[Auto para] Relaunch with auto mapping function (#37326) · 506e79d1

由 Yulong Ao 提交于 12月 07, 2021

* [Auto Parallel]  Add the unified cluster representation

* [Auto Parallel] Add the graph class for physical mapping

* [Auto Parallel] Add the simple physical mapper

* Set the timeout of the mapper

* Merge the upstream develop unittests cmake files

* Fix a bug of the process group

* Remove mapper unittest from platforms which is not GPU

* Move the instantiation of process group after resharding

* Add the local id for devices

* Update the rank mapping format

* [Auto Parallel] Relaunch with the rank mapping file

* Remove the unnecessary json file

* Avoid entering get_device_proc_info for auto mapping

* Correct the mapper unit test

* Add some comments

* Remove the related files about mapping

* Update the unittest for auto mapping

* Remove unused rank_mapping unittest

* Improve the unittest coverage

* Improve the unittest coverage

* Improve the unittest of relaunch

* Fix the unittest problem in CI

* Improve the unittest of relaunch

* Remove unnecessary statements

* Update the unittest cmakefile

* Correct the cmakefile of auto parallel unittests

* Modify codes based on the new elastic change

* Use the GPUs exclusively in the unittest

* Correct the cmakefile

* Set the timeout of the unittest

506e79d1

[Pten]Move func from kernel_context.h into kernel_context.cc (#37804) · bfa0d7f3

由 YuanRisheng 提交于 12月 07, 2021

* add inplace op adaptation

* optimize inplace logic and fix bugs when run kernel that has args of vector<DenseTensor>

* move func in kernel_context.h into kernel_context.cc

* refactor logic that transform variable to densetensor

* fix bugs when compile

* update func name

* fix bugs when run windows-ci

bfa0d7f3

Z
[heterps]fix heter service (#37860) · b3185296
由 zmxdream 提交于 12月 07, 2021
```
* fix heter service. test=develop

* fix heter section worker in debug mode
```
b3185296
W
don't exit if requested_size < size (#37880) · 4035bd2b
由 wenbin 提交于 12月 07, 2021
```
don't exit if requested_size < size
```
4035bd2b
Z

fix pyyaml dependence problem for api-gen (#37879) · 508b756a
由 zyfncg 提交于 12月 07, 2021

508b756a
T

Add ce framework dockerfile (#37762) · 0372883e
由 tianshuo78520a 提交于 12月 07, 2021

0372883e
Z
Quantize slice op (#37630) · 2bd0f3c7
由 Zuza 提交于 12月 07, 2021
```
* quantize slice op

* correct test

* fix code formatting
```
2bd0f3c7
J

add ipu device p1 (#37841) · c9a3c669
由 jianghaicheng 提交于 12月 07, 2021

c9a3c669
Z

Enabled generation for special operators, the GradNode/Inputs/Outputs of which are empty (#37837) · de874cdd
由 Zhanlue Yang 提交于 12月 07, 2021

de874cdd

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功