提交 · 13f1641dcaba539cda08c7bd176428d3f8422e3d · PaddlePaddle / Paddle

30 3月, 2022 2 次提交

suppor inplace in tensor_method_setitem (#40915) · 7170c687

由 pangyoki 提交于 3月 30, 2022

* suppor inplace in tensor_method_setitem

* delete bump_inplace_version

* optimize inplace unittest

* fix

* fix setitem bug

* update eager_generator

* optimize inplace unittest

* little change

7170c687

[Eager] Pylayer (#39989) · 157c1a28

由 wanghuancoder 提交于 3月 30, 2022

* Supported Complex2Real Conversion for Eager Dygraph

* Supported Complex2Real Conversion for Eager Dygraph

* Enabled complex type promotion test for matmul_v2

* pylayer, test=develop

* Fix CI issues

* Support initializing specific grad tensors to zero for selected operators

* finish forward, test=develop

* create grad node finish, test=develop

* Merged adj_edges_ with GradSlotMeta

* Fixed monir issue

* backward finish, start dbg, test=develop

* Adjusted num runs

* Recovered Eager performance tests configurations

* Recovered Eager performance tests configurations

* finish, test=develop

* polish, test=develop

* polish, test=develop

* refine, test=develop

* eager, test=develop

* Adjusted performance tests configurations

* Fixed Minor Issues with performance tests

* [Phi] Fix macro name typo

* support set_materialize_grads, test=develop

* suppotr mark_non_differentiable, test=develop

* support once_differentiable, test=develop

* refine, test=develop

* refine, test=develop

* Moved out Edge from GradSlotMeta

* Fixed issues from merge

* Fixed typo

* Addressed review comments

* Fixed merge issues

* Fixed minor issues

* Fixed minor issue

* refine, test=develop

* refine, test=develop

* refine, test=develop

* Fixed major issues and enabled auto_prune test cases

* Fixed issues from merge

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop
Co-authored-by: Njim19930609 <jim19930609@gmail.com>
Co-authored-by: NAurelius84 <zhangliujie@baidu.com>

157c1a28

29 3月, 2022 1 次提交

Use _C_ops.yolov3_loss in eager mode for test_yolov3.py (#40831) · 3b381aac

由 0x45f 提交于 3月 29, 2022

* Use _C_ops.yolov3_loss in eager mode for test_yolov3.py

* fix code for test_yolov3_loss_op

* remove useless import

* Fix dygraph_mode flag

3b381aac

28 3月, 2022 3 次提交
- T
  [HeterPS] So Parser (#40750) · cadc4e6a
  由 Thunderbrook 提交于 3月 28, 2022
```
* So Parser

* add macro

* add macro

* slotrecord

* add macro

* code format
```
  cadc4e6a
- W
  [Eager] Support SelectedRows in eager mode (#40858) · 5c5a2a83
  由 Weilong Wu 提交于 3月 28, 2022
```
* [Eager] Support SelectedRows in eager mode

* Remove unnecessary codes

* Adapt new dygraph flag
```
  5c5a2a83
- 0
  Refine test_lac.py for eager mode (#40951) · c03186f9
  由 0x45f 提交于 3月 28, 2022
```
* Refine test_lac.py for eager mode

* refine code

* Fix test_program_translator for eager
```
  c03186f9
25 3月, 2022 2 次提交

update eager code gen (#40924) · afe2fdd1

由 hong 提交于 3月 25, 2022

* update

* remove useless code

* remove label smooth test

* polish code

* polish code

* polish code

* remove _in_eager_mode error;

afe2fdd1

Refactor Dygraph Flags (#40786) · 3085d5e4

由 Jiabin Yang 提交于 3月 25, 2022

* refactor eager flags

* fix flags error when we switch from eager to dygraph

* fix ci problem

* fix ci

* fix ci

* merge develop and fix code style

* merge develop and fix code style

* fix op test error

* fix op test error

* fix op test error

* fix op test error

* fix op test error

* merge develop

3085d5e4

24 3月, 2022 3 次提交

[AMP] Support amp for Intermediate_dygraph (#40623) · c12f7d48

由 zhangbo9674 提交于 3月 24, 2022

* approve amp for intermediate_dygraph

* add amp_utils for intermediate_dygraph

* add amp needcast check for mlu & npu

* test unittest

* add SetGradNode for set_stop_gradient && add checktensor for GradientHooks

* refine code

* refien unittest of imperative_amp for new dygraph

* inplace api skip amp

* add test_imperative_qat_amp for intermediate amp

* refine code

* refine test_amp ci strategy

* refine unittest code

* refine amp_utils code

* refine amp getpromotetype for some special op

* refine unittest code

c12f7d48

Z

Add sparse convertion api and sparse creation api (#40780) · a8f86600
由 zhangkaihuo 提交于 3月 24, 2022

a8f86600

Refine eager run_program OP for dy2st UT (#40768) · 4ccd5cb8

由 0x45f 提交于 3月 24, 2022

* Refine eager run_program OP for dy2st UT

* append run_program error string and refine run_program_grad

* remove some comments

* refine ConstructXGradTensors

4ccd5cb8

23 3月, 2022 8 次提交

two-phase training for ps (#40762) · b1a4668c

由 zhaocaibei123 提交于 3月 23, 2022

* fix benchmark and communicator config

* fix bugs of the_one_ps

* multi program and fix bug in optimizer

* multi program in the_one_ps

* public commcontext

* ps optimizer multi programs

* cvm & datanorm backend

* fix dim

* fix unittest

* fix

* the one ps merge

* remove comm

* add DownpourLiteWorker

* all

* fix

* fix

* device worker downpour lite

* fix

* fix bug in global shuffle

* save inference model

* fix & add log

* fix

* remove log

* fix

* fix save summary

* fix

* fix pscore

* fix

* fix

* fix

* fix

* fix

* remove logs

* fix

* fix

* fix

* fix

* fix

* add some comments

* fix
Co-authored-by: Nesythan <esythan@126.com>

b1a4668c

[Eager Hook + Inplace] Refactor register_hook and test with inplace operation (#40778) · ff7cbaae

由 Weilong Wu 提交于 3月 23, 2022

* disable scatter case in test_inplace_eager_fluid

* Update register_hook logic

* Add register_hook test cases
Co-authored-by: Npangyoki <pangyoki@126.com>

ff7cbaae

Support sharding (#40637) · fe291daf

由 Jiabin Yang 提交于 3月 23, 2022

* suppor sharding api

* support multi api for sharding in eager

* support multi api for sharding in eager

* fix test

* fix test coverage

fe291daf

[Phi] Move deformable_conv and deformable_conv_v1 to phi (#40794) · 7e3752bb

由 zyfncg 提交于 3月 23, 2022

* move deformable_conv_grad to phi

* move infershape of deformable_conv to phi

* adjust some code format

* move deformable_conv_v1 to phi

7e3752bb

Add yaml config part2 (#40742) · f4075db8

由 hong 提交于 3月 23, 2022

* fix error; test=develop

* update

* close some yaml

* fix backward attrite error; test=develop

* add div test

* polish code; test=develop

* remove none gbk charactor;

* remove some yaml;

* fix optional bug

* recover yaml config

* resolve confilct; test=develop

* close div; test=develop

f4075db8

[Eager] Slice (#40587) · b07d239c

由 wanghuancoder 提交于 3月 23, 2022

* fix some slice bug, test=develop

* eager slice, test=develop

* eager slice, test=develop

* refine, test=develop

* refine, test=develop

* fix bug, test=develop

* refine, test=develop

* rename function name, test=develop

b07d239c

Add profiler features (#40357) · c15e3823

由 chenjian 提交于 3月 23, 2022

* add event record for model profiling

* fix format

* fix format

* fix code example bug

* no

* add profiler statistic

* add profiler feature

* fix bug

* fix bug

* fix bug

* fix bug

* required: gpu

* required: gpu

* fix bug

* required: gpu

* fix ci bug

* fix ci error

* fix ci error

* upgrade document

* fix doc

* fix ci bug

* add doc and fix bug

* nothing

* fix bug

* fix format bug

* modify format

* add deprecated description for old profiler

* fix bug

* fix bug

* fix

* add load_profiler_reuslt doc

* add load_profiler_reuslt doc

* add load_profiler_reuslt doc

* help fix old profiler sample code

* add api doc

* fix format

* fix api doc

* fix api doc format

* fix api doc format

* fix api doc c format

* fix api doc format

c15e3823

W

Support test_layers(group_norm,while_loop) with eager mode (#40816) · db41e39e
由 Weilong Wu 提交于 3月 23, 2022

db41e39e

22 3月, 2022 2 次提交

[new-exec] async prepare deps (#40713) · 814f7211

由 Leo Chen 提交于 3月 22, 2022

* async prepare deps

* fix bug that std::future is not set

* add ut

* refine code

* fix standalone ut

* disable prof

814f7211

Z
[Phi] Replace Backend by Place in C++ API (#40732) · 5b7fadec
由 zyfncg 提交于 3月 22, 2022
```
* replace Backend by Place in C++ API

* fix left code

* fix test_to_api bug
```
5b7fadec

21 3月, 2022 5 次提交

Refine to_tensor for eager mode and support gpu_pinned (#40535) · 45d1fb8d

由 0x45f 提交于 3月 21, 2022

* Refine to_tensor for eager mode

* support gpu_pinned

* refine code

* support gpu_pinned copy_to

* fix layer.__setattr__

* support to_tensor for gpu_pinned

* fix unit test

* refine gpu_pinned

* restore the original code

* add is_gup_pinned() and refine eager.Tensor._copy_to()

45d1fb8d

H
remove duplicate code (#40758) · 4ff9fe43
由 hong 提交于 3月 21, 2022
```
* remove duplicate code;

* add some line; test=document_fix
```
4ff9fe43

Merge some test bug (#40543) · 56c43ccd

由 hong 提交于 3月 21, 2022

* switch eager mode and change it

* set default is eager

* set default is eager

* fix error; test=develop

* fix some error; test=develop

* update

* upd

* update code; test=develop

* update

* fix some bug; test=develop

* fix bug; test=develop

* fix bug; test=develop

* fix bug; test=develop

* fix error; test=develop

* format; test=develop
Co-authored-by: NJiabinYang <360788950@qq.com>

56c43ccd

[IPU] update ipu_backend (#40685) · d67fe921

由 Allen Guo 提交于 3月 21, 2022

* sync changes

* copy sOpNamescope

* fix UTs

* add authors
Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai>
Co-authored-by: NAllen Guo <alleng@graphcore.ai>
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai>
Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>

* fix code-format

* fix compile error

* add comments for feed_op
Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai>
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai>
Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>

d67fe921

Add yaml config part0 (#40020) · cc853e95

由 hong 提交于 3月 21, 2022

* add add yaml

* add elementwise add yaml; test=develop

* add norm

* update

* add some yaml config; test=develop

* fix bug; test=develop

* fix compare error; test=develop

* revert erger_gen.py

* update; test=deveop

* remove usless code; test=deveop

* fix bug; test=develop

* fix test error; test=develop

* remove int_type; test=develop

* fix type error; test=develop

* format; test=develop

* remove type register; test=develop

* polish code; test=develop

* fix ci error; test=develop

cc853e95

19 3月, 2022 3 次提交

Z
Call sparse op from python (#40608) · 95fbbc5b
由 zhangkaihuo 提交于 3月 19, 2022
```
* call sparse api from python
```
95fbbc5b
C

fix python hook mem leak (#40716) · c46f2ddb
由 Chen Weihang 提交于 3月 19, 2022

c46f2ddb

support inplace in dygraph eager_fluid state (#40400) · 8e612903

由 pangyoki 提交于 3月 19, 2022

* [Eager] Support eager grad interface, draft version

* Support eager grad interface with allow_unused and multi startup_op

* Fix code format

* Fix allow_unused case, return PyNone if tensor not initialize

* Support output's stop_gradient related to create_graph

* Support grad exception case in eager mode, fix coverage CI

* Update ToPyObject, return PyNone if not initialize

* AccumulationNode add FLAGS_retain_grad_for_all_tensor

* Fix ci issue

* Fix CI issue

* fix, use core.eager.Tensor

* Add func SetBufferSlotRankZeros for GradTensorHolder

* Support retain_graph by using ClearTensorWrappers

* Support retain_graph by using ClearTensorWrappers

* Update retain_graph and no_grad_vars related test case

* Update code gen logic for ClearTensorWrappers

* Fix by override statement

* fix override func args

* Support retain_graph, update unit tests

* Updated ClearTensorWrappers logic

* fix grad python interface

* Use deep copy and update unit tests

* Polish code

* Polish code

* Fix CI issue, Deep copy only use when user set grad_tensors

* Fix CI, use Backward instead RunBackward

* Fix CI, Declare kernel explicitly in test file

* Polish, remove vector of TensorWrapper

* Refactor the logic of grad/backward, polish codes

* Update code after merge upstream develop

* Polish after merge upstream develop

* Update to adapt new GradNodeBase superclass

* Fix error introduced during conflict resolution

* support inplace strategy in eager_fluid state

* solve conflict

* nothing

* Update purify potential_startup_nodes logic

* Fix errors

* Polish code

* Remove useless args for ToPyObject

* Remove useless TensorWrappersSet

* fix record conflict

* Fix code-format, re-install pre-commit

* fix tensor_wrapper bug

* Fix pre-process logic for potential_startup_ops

* Update unit tests, use eager mode

* Fix conflicts

* fix unittest timeout

* little change
Co-authored-by: NWeilong Wu <veyron_wu@163.com>

8e612903

18 3月, 2022 4 次提交

Supported Complex2Real Conversion for Eager Dygraph (#39878) · e3b2a035

由 Zhanlue Yang 提交于 3月 18, 2022

* Supported Complex2Real Conversion for Eager Dygraph

* Supported Complex2Real Conversion for Eager Dygraph

* Enabled complex type promotion test for matmul_v2

* Fix CI issues

* Merged adj_edges_ with GradSlotMeta

* Fixed monir issue

* Adjusted num runs

* Recovered Eager performance tests configurations

* Recovered Eager performance tests configurations

* Adjusted performance tests configurations

* Fixed Minor Issues with performance tests

* Moved out Edge from GradSlotMeta

* Fixed issues from merge

* Fixed typo

* Addressed review comments

* Fixed minor issues

e3b2a035

S
[DataParallel]Support control flow in new DP (#40593) · 984eacb3
由 ShenLiang 提交于 3月 18, 2022
```
* fix bug

* fix bug
```
984eacb3
L

Use store for gloo process group (#40629) · bb2cb762
由 lilong12 提交于 3月 18, 2022

bb2cb762

王

[infrt] rename pd dialect from mlir to infrt. (#40651) · ef4ef154

由王明冬提交于 3月 18, 2022

* [infrt] rename pd dialect from mlir to infrt. test=develop

* [infrt] fix the kernel signature generator bug.

ef4ef154

17 3月, 2022 4 次提交

S
merge cpu and gpu graph engines (#40597) · 31776199
由 seemingwang 提交于 3月 17, 2022
```
* extract sub-graph

* graph-engine merging

* fix

* fix

* fix heter-ps config
```
31776199
B

support gpu mixed precision inference (#40531) · 06fee998
由 baoachun 提交于 3月 17, 2022

06fee998

[Eager Grad] Support eager grad interface (#40170) · 4db8cf24

由 Weilong Wu 提交于 3月 17, 2022

* [Eager] Support eager grad interface, draft version

* Support eager grad interface with allow_unused and multi startup_op

* Fix code format

* Fix allow_unused case, return PyNone if tensor not initialize

* Support output's stop_gradient related to create_graph

* Support grad exception case in eager mode, fix coverage CI

* Update ToPyObject, return PyNone if not initialize

* AccumulationNode add FLAGS_retain_grad_for_all_tensor

* Fix ci issue

* Fix CI issue

* fix, use core.eager.Tensor

* Add func SetBufferSlotRankZeros for GradTensorHolder

* Support retain_graph by using ClearTensorWrappers

* Support retain_graph by using ClearTensorWrappers

* Update retain_graph and no_grad_vars related test case

* Update code gen logic for ClearTensorWrappers

* Fix by override statement

* fix override func args

* Support retain_graph, update unit tests

* Updated ClearTensorWrappers logic

* fix grad python interface

* Use deep copy and update unit tests

* Polish code

* Polish code

* Fix CI issue, Deep copy only use when user set grad_tensors

* Fix CI, use Backward instead RunBackward

* Fix CI, Declare kernel explicitly in test file

* Polish, remove vector of TensorWrapper

* Refactor the logic of grad/backward, polish codes

* Update code after merge upstream develop

* Polish after merge upstream develop

* Update to adapt new GradNodeBase superclass

* Fix error introduced during conflict resolution

* Update purify potential_startup_nodes logic

* Fix errors

* Polish code

* Remove useless args for ToPyObject

* Remove useless TensorWrappersSet

* Fix code-format, re-install pre-commit

* Fix pre-process logic for potential_startup_ops

* Update unit tests, use eager mode

4db8cf24

J
fix copy_ problem by doing it with phi copy (#40521) · c1931beb
由 Jiabin Yang 提交于 3月 17, 2022
```
* fix copy_ problem by doing it with phi copy

* improve test coverage

* refactor copy with sr kernel
```
c1931beb

16 3月, 2022 2 次提交

R

clean up DeviceManager in advance manually (#40504) · 23c036d6
由 ronnywang 提交于 3月 16, 2022

23c036d6

[Auto Parallel] Add the support for the auto completion of while_op (#39939) · ec6b8fbd

由 Yulong Ao 提交于 3月 16, 2022

* [Auto Parallel] Support the auto completion of while_op

* [Auto Parallel] Improve the completion algorithms

* [Auto Parallel] Fix bugs for ernie inference

* [Auto Parallel] Remove attrs which cannot be pickled

* [Auto Parallel] make the dims_mappings of LodTensorArray vars empty

* [Auto Parallel] Fix bugs for the ernie inference in the pipeline parallel

* [Auto Parallel] Remove unncessary comments

* [Auto Parallel] Fix a bug of the CMakeLists

* [Auto Parallel] Use the newest APIs to write the unit test

* [Auto Parallel] Remove unnecessary statements

ec6b8fbd

15 3月, 2022 1 次提交
- X
  run python api in eager model and filter the out in argument list (#40523) · 4d886f75
  由 xiongkun 提交于 3月 15, 2022
```
* run python api in eager model and filter the out in argument list

* fix code
```
  4d886f75

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功