提交 · c15e3823346e323d2d84f1fc7c174ae09790ccec · PaddlePaddle / Paddle

23 3月, 2022 4 次提交

Add profiler features (#40357) · c15e3823

由 chenjian 提交于 3月 23, 2022

* add event record for model profiling

* fix format

* fix format

* fix code example bug

* no

* add profiler statistic

* add profiler feature

* fix bug

* fix bug

* fix bug

* fix bug

* required: gpu

* required: gpu

* fix bug

* required: gpu

* fix ci bug

* fix ci error

* fix ci error

* upgrade document

* fix doc

* fix ci bug

* add doc and fix bug

* nothing

* fix bug

* fix format bug

* modify format

* add deprecated description for old profiler

* fix bug

* fix bug

* fix

* add load_profiler_reuslt doc

* add load_profiler_reuslt doc

* add load_profiler_reuslt doc

* help fix old profiler sample code

* add api doc

* fix format

* fix api doc

* fix api doc format

* fix api doc format

* fix api doc c format

* fix api doc format

c15e3823

change CUDA implementation of multinomial OP (#40752) · 58970995
由 zhouweiwei2014 提交于 3月 23, 2022

58970995
W

Support test_layers(group_norm,while_loop) with eager mode (#40816) · db41e39e
由 Weilong Wu 提交于 3月 23, 2022

db41e39e
K

enable continuous log; update doc (#40782) · fdafbc7b
由 kuizhiqing 提交于 3月 23, 2022

fdafbc7b

22 3月, 2022 6 次提交

[new-exec] async prepare deps (#40713) · 814f7211

由 Leo Chen 提交于 3月 22, 2022

* async prepare deps

* fix bug that std::future is not set

* add ut

* refine code

* fix standalone ut

* disable prof

814f7211

polish python api logic and add backward python api check (#40666) · c29f85b6

由 xiongkun 提交于 3月 22, 2022

* 1. add the python api grad 2. add final and intermediate state vlog 3. change the python_api error logic

* add python api or close the check_eager=True

* fix the compatibility

* matmul

* disable unittests: test_elementwise_add_op test_scatter_nd_op test_gather_nd_op test_scatter_op test_index_sample_op test_elementwise_add_mkldnn_op

c29f85b6

W

Update unit tests by using _test_eager_guard (#40760) · 3b8bcd5a
由 Weilong Wu 提交于 3月 22, 2022

3b8bcd5a
Z

Add more annotations to test_cholesky_solve_op.py, make it an example in hackson guide · 64c268b2
由 zhiboniu 提交于 3月 22, 2022

64c268b2
P

disable scatter case in test_inplace_eager_fluid (#40756) · 72a2bfe2
由 pangyoki 提交于 3月 22, 2022

72a2bfe2

[phi] Update graph_send_recv OP (#40509) · 67b46e45

由 Siming Dai 提交于 3月 22, 2022

* add out_size shape for graph_send_recv

* fix bug in register kernel: no const int& support

* add out_size in infermeta

* change unittest

* fix unittest

* fix out_size default value

* fix doc

* delete arg mapping

* add sig

* move -1 to 0

* move -1 to 0

67b46e45

21 3月, 2022 9 次提交

Refine to_tensor for eager mode and support gpu_pinned (#40535) · 45d1fb8d

由 0x45f 提交于 3月 21, 2022

* Refine to_tensor for eager mode

* support gpu_pinned

* refine code

* support gpu_pinned copy_to

* fix layer.__setattr__

* support to_tensor for gpu_pinned

* fix unit test

* refine gpu_pinned

* restore the original code

* add is_gup_pinned() and refine eager.Tensor._copy_to()

45d1fb8d

K

fleetrun launch in legacy mode (#40568) · c54c60de
由 kuizhiqing 提交于 3月 21, 2022

c54c60de

Merge some test bug (#40543) · 56c43ccd

由 hong 提交于 3月 21, 2022

* switch eager mode and change it

* set default is eager

* set default is eager

* fix error; test=develop

* fix some error; test=develop

* update

* upd

* update code; test=develop

* update

* fix some bug; test=develop

* fix bug; test=develop

* fix bug; test=develop

* fix bug; test=develop

* fix error; test=develop

* format; test=develop
Co-authored-by: NJiabinYang <360788950@qq.com>

56c43ccd

Z

conv2d support FP16 on xpu and update unittest for conv2d, test=kunlun (#40395) · 276017bb
由 zhangyikun02 提交于 3月 21, 2022

276017bb

[IPU] add more ops (#40691) · df3ae18a

由 Allen Guo 提交于 3月 21, 2022

* add more ops

* add authors
Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai>
Co-authored-by: NAllen Guo <alleng@graphcore.ai>
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai>
Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>

* rm ipu_strategy.check()

* fix UT fail

* fix typo
Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai>
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai>
Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>

df3ae18a

L

add _init_parallel_env and _new_group (#40579) · b8dc673d
由 lilong12 提交于 3月 21, 2022

b8dc673d

[IPU] update ipu_backend (#40685) · d67fe921

由 Allen Guo 提交于 3月 21, 2022

* sync changes

* copy sOpNamescope

* fix UTs

* add authors
Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai>
Co-authored-by: NAllen Guo <alleng@graphcore.ai>
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai>
Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>

* fix code-format

* fix compile error

* add comments for feed_op
Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai>
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai>
Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>

d67fe921

[Eager grad] Refactor partial grad logic (#40693) · facda828

由 Weilong Wu 提交于 3月 21, 2022

* Refactor partial_grad/backward logic

* Add DuplicateCheck and polish code

* Refactor partial_grad/backward more clearly

* Refactor GeneralGrad by SingleInstance

facda828

Add yaml config part0 (#40020) · cc853e95

由 hong 提交于 3月 21, 2022

* add add yaml

* add elementwise add yaml; test=develop

* add norm

* update

* add some yaml config; test=develop

* fix bug; test=develop

* fix compare error; test=develop

* revert erger_gen.py

* update; test=deveop

* remove usless code; test=deveop

* fix bug; test=develop

* fix test error; test=develop

* remove int_type; test=develop

* fix type error; test=develop

* format; test=develop

* remove type register; test=develop

* polish code; test=develop

* fix ci error; test=develop

cc853e95

19 3月, 2022 3 次提交

Z
Call sparse op from python (#40608) · 95fbbc5b
由 zhangkaihuo 提交于 3月 19, 2022
```
* call sparse api from python
```
95fbbc5b

Add infer meta (#40544) · 8e4e19ab

由 hong 提交于 3月 19, 2022

* add infer meta; test=develop

* add histogram infer meta; test=develop

* fix unitest bug; test=develop

* format; test=develop

* format; test=develop

* bn not use new infer meta; test=develop

* add infer meta; test=develop

* fixbug; test=develop

* fix bug;

* recover unitest; test=develop

8e4e19ab

support inplace in dygraph eager_fluid state (#40400) · 8e612903

由 pangyoki 提交于 3月 19, 2022

* [Eager] Support eager grad interface, draft version

* Support eager grad interface with allow_unused and multi startup_op

* Fix code format

* Fix allow_unused case, return PyNone if tensor not initialize

* Support output's stop_gradient related to create_graph

* Support grad exception case in eager mode, fix coverage CI

* Update ToPyObject, return PyNone if not initialize

* AccumulationNode add FLAGS_retain_grad_for_all_tensor

* Fix ci issue

* Fix CI issue

* fix, use core.eager.Tensor

* Add func SetBufferSlotRankZeros for GradTensorHolder

* Support retain_graph by using ClearTensorWrappers

* Support retain_graph by using ClearTensorWrappers

* Update retain_graph and no_grad_vars related test case

* Update code gen logic for ClearTensorWrappers

* Fix by override statement

* fix override func args

* Support retain_graph, update unit tests

* Updated ClearTensorWrappers logic

* fix grad python interface

* Use deep copy and update unit tests

* Polish code

* Polish code

* Fix CI issue, Deep copy only use when user set grad_tensors

* Fix CI, use Backward instead RunBackward

* Fix CI, Declare kernel explicitly in test file

* Polish, remove vector of TensorWrapper

* Refactor the logic of grad/backward, polish codes

* Update code after merge upstream develop

* Polish after merge upstream develop

* Update to adapt new GradNodeBase superclass

* Fix error introduced during conflict resolution

* support inplace strategy in eager_fluid state

* solve conflict

* nothing

* Update purify potential_startup_nodes logic

* Fix errors

* Polish code

* Remove useless args for ToPyObject

* Remove useless TensorWrappersSet

* fix record conflict

* Fix code-format, re-install pre-commit

* fix tensor_wrapper bug

* Fix pre-process logic for potential_startup_ops

* Update unit tests, use eager mode

* Fix conflicts

* fix unittest timeout

* little change
Co-authored-by: NWeilong Wu <veyron_wu@163.com>

8e612903

18 3月, 2022 7 次提交

F
[NPU] fix fp16 (PART I) (#40259) · aaa71ea4
由 furnace 提交于 3月 18, 2022
```
[NPU] fix fp16 (PART I)
```
aaa71ea4
0
Support assign x.shape to dict['key'] in dy2st (#40611) · 6e1fe4f1
由 0x45f 提交于 3月 18, 2022
```
* support assign x.shape to dict['key'] in dy2st

* remove replace_dot

* refine unit test
```
6e1fe4f1
Z

update unittests for tile op and silce op on XPU, test=kunlun (#40227) · 7f93e2b0
由 zhangyikun02 提交于 3月 18, 2022

7f93e2b0

Supported Complex2Real Conversion for Eager Dygraph (#39878) · e3b2a035

由 Zhanlue Yang 提交于 3月 18, 2022

* Supported Complex2Real Conversion for Eager Dygraph

* Supported Complex2Real Conversion for Eager Dygraph

* Enabled complex type promotion test for matmul_v2

* Fix CI issues

* Merged adj_edges_ with GradSlotMeta

* Fixed monir issue

* Adjusted num runs

* Recovered Eager performance tests configurations

* Recovered Eager performance tests configurations

* Adjusted performance tests configurations

* Fixed Minor Issues with performance tests

* Moved out Edge from GradSlotMeta

* Fixed issues from merge

* Fixed typo

* Addressed review comments

* Fixed minor issues

e3b2a035

S
[DataParallel]Support control flow in new DP (#40593) · 984eacb3
由 ShenLiang 提交于 3月 18, 2022
```
* fix bug

* fix bug
```
984eacb3
L

Use store for gloo process group (#40629) · bb2cb762
由 lilong12 提交于 3月 18, 2022

bb2cb762
F
[NPU] fix fp16 (PART II) (#40537) · 1a13fa0f
由 furnace 提交于 3月 18, 2022
```
[NPU] fix fp16 (PART II)
```
1a13fa0f

17 3月, 2022 5 次提交

T

modify sequence_conv_xpu op test. test=kunlun (#40347) · 96d2f337
由 tanzhipeng 提交于 3月 17, 2022

96d2f337

Move layer norm to phi (#40193) · 681a6865

由 hong 提交于 3月 17, 2022

* update

* fix bugs; test=develop

* update; test=develop

* fix test compile error; test=develop

* fix cpu compile error; test=develop

* fix test error; test=develo

* fix layer_norm_op plugin error; test=develop

* fix error; test=develop

* fix test bug; test=develop

* update; test=develop

* polish code; test=develop

* fix bugs; test=develop

* remove unused depency; test=develop

* polish code; test=develop

681a6865

H

add time of unittests for dataparallel in dygraph mode (#40639) · e3a67782
由 Haohongxiang 提交于 3月 17, 2022

e3a67782

[Eager Grad] Support eager grad interface (#40170) · 4db8cf24

由 Weilong Wu 提交于 3月 17, 2022

* [Eager] Support eager grad interface, draft version

* Support eager grad interface with allow_unused and multi startup_op

* Fix code format

* Fix allow_unused case, return PyNone if tensor not initialize

* Support output's stop_gradient related to create_graph

* Support grad exception case in eager mode, fix coverage CI

* Update ToPyObject, return PyNone if not initialize

* AccumulationNode add FLAGS_retain_grad_for_all_tensor

* Fix ci issue

* Fix CI issue

* fix, use core.eager.Tensor

* Add func SetBufferSlotRankZeros for GradTensorHolder

* Support retain_graph by using ClearTensorWrappers

* Support retain_graph by using ClearTensorWrappers

* Update retain_graph and no_grad_vars related test case

* Update code gen logic for ClearTensorWrappers

* Fix by override statement

* fix override func args

* Support retain_graph, update unit tests

* Updated ClearTensorWrappers logic

* fix grad python interface

* Use deep copy and update unit tests

* Polish code

* Polish code

* Fix CI issue, Deep copy only use when user set grad_tensors

* Fix CI, use Backward instead RunBackward

* Fix CI, Declare kernel explicitly in test file

* Polish, remove vector of TensorWrapper

* Refactor the logic of grad/backward, polish codes

* Update code after merge upstream develop

* Polish after merge upstream develop

* Update to adapt new GradNodeBase superclass

* Fix error introduced during conflict resolution

* Update purify potential_startup_nodes logic

* Fix errors

* Polish code

* Remove useless args for ToPyObject

* Remove useless TensorWrappersSet

* Fix code-format, re-install pre-commit

* Fix pre-process logic for potential_startup_ops

* Update unit tests, use eager mode

4db8cf24

Refine io for test_mnist.py (#40496) · 1e045cae

由 0x45f 提交于 3月 17, 2022

* for test_mnist.py

* remove comments

* using type() replace isinstance()

* valid vars for run program OP in io.py

* open test_mnist in eager_gurad for coverage

1e045cae

16 3月, 2022 6 次提交

L
[KP] Fix registry and add UT for thresholded_relu & softshrink (#40524) · bef6f2e1
由 Lijunhui 提交于 3月 16, 2022
```
* init commit

* correct namespace
```
bef6f2e1

Refactor elementwise op grad classes (#40187) · 7004f65c

由 piotrekobi 提交于 3月 16, 2022

* Refactor elementwise op grad classes

* Add more refactor changes

* Revert set layout and format deletion

* Fix failing elementwise test

7004f65c

[PHI] Migrate index_select op (#40260) · 99452af7

由 chenenquan 提交于 3月 16, 2022

* [PHI] Migrate index_select op

* [PHI] Fix bug in test_variable

* [PHI] migrate index_select op

99452af7

M

Add Support Layer List to ASP (#40253) · c040bbd7
由 Ming-Xu Huang 提交于 3月 16, 2022

c040bbd7
T

fix xpu op test, *test=kunlun (#40409) · d1a98f0b
由 TTerror 提交于 3月 16, 2022

d1a98f0b

[Auto Parallel] Add the support for the auto completion of while_op (#39939) · ec6b8fbd

由 Yulong Ao 提交于 3月 16, 2022

* [Auto Parallel] Support the auto completion of while_op

* [Auto Parallel] Improve the completion algorithms

* [Auto Parallel] Fix bugs for ernie inference

* [Auto Parallel] Remove attrs which cannot be pickled

* [Auto Parallel] make the dims_mappings of LodTensorArray vars empty

* [Auto Parallel] Fix bugs for the ernie inference in the pipeline parallel

* [Auto Parallel] Remove unncessary comments

* [Auto Parallel] Fix a bug of the CMakeLists

* [Auto Parallel] Use the newest APIs to write the unit test

* [Auto Parallel] Remove unnecessary statements

ec6b8fbd

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功