提交 · 3085d5e434f45f4bb6c94b42a3772b192dce791c · PaddlePaddle / Paddle

25 3月, 2022 6 次提交

Refactor Dygraph Flags (#40786) · 3085d5e4

由 Jiabin Yang 提交于 3月 25, 2022

* refactor eager flags

* fix flags error when we switch from eager to dygraph

* fix ci problem

* fix ci

* fix ci

* merge develop and fix code style

* merge develop and fix code style

* fix op test error

* fix op test error

* fix op test error

* fix op test error

* fix op test error

* merge develop

3085d5e4

T

fix xpu op test, *test=kunlun (#40862) · 1db9cd46
由 TTerror 提交于 3月 25, 2022

1db9cd46

[OpTest] Polish optest (#40879) · d43e8433

由 xiongkun 提交于 3月 25, 2022

* 1. add the python api grad 2. add final and intermediate state vlog 3. change the python_api error logic

* add python api or close the check_eager=True

* fix the compatibility

* matmul

* disable unittests: test_elementwise_add_op test_scatter_nd_op test_gather_nd_op test_scatter_op test_index_sample_op test_elementwise_add_mkldnn_op

* refine the logic of prepara_parameter logic

* fix Tensor(gpu) 2 Scalar segment fault.

* add multi-attribute. (test_unsqueeze_op); add python_sig_out for customizing op sig out

* fix some bugs, support python_out_sig

d43e8433

A
[NPU] add merged_momentum (#40875) · 2b74b739
由 Aganlengzi 提交于 3月 25, 2022
```
* [NPU] add merged_momentum

* fix

* fix device
```
2b74b739
Z

modify unit test in bn, stack and split. *test=kunlun (#40880) · 139a30ec
由 Zhangjingyu06 提交于 3月 25, 2022

139a30ec

support get_item where the index is a bool scalar tensor (#40829) · 0f5e90a2

由 FlyingQianMM 提交于 3月 25, 2022

* support get_item where the index is a bool scalar tensor

* add unittests for supporting get_item where the index is a bool scalar tensor

0f5e90a2

24 3月, 2022 11 次提交

[AMP] Support amp for Intermediate_dygraph (#40623) · c12f7d48

由 zhangbo9674 提交于 3月 24, 2022

* approve amp for intermediate_dygraph

* add amp_utils for intermediate_dygraph

* add amp needcast check for mlu & npu

* test unittest

* add SetGradNode for set_stop_gradient && add checktensor for GradientHooks

* refine code

* refien unittest of imperative_amp for new dygraph

* inplace api skip amp

* add test_imperative_qat_amp for intermediate amp

* refine code

* refine test_amp ci strategy

* refine unittest code

* refine amp_utils code

* refine amp getpromotetype for some special op

* refine unittest code

c12f7d48

[MoE]Assign pos op (#40580) · 305f32d1

由 Roc 提交于 3月 24, 2022

* # This is a combination of 10 commits.
# The first commit's message is:
add expert count op

add ut for expert_count

# This is the 2nd commit message:

update UT only for cuda

# This is the 3rd commit message:

fix for rocm

# This is the 4th commit message:

update ut

# This is the 5th commit message:

add moe module

# This is the 6th commit message:

add expert count op

add ut for expert_count

# This is the 7th commit message:

update UT only for cuda

# This is the 8th commit message:

update ut

# This is the 9th commit message:

add moe module

# This is the 10th commit message:

make expert count private

* add assign pos op

* fix upper num name

* add api _assign pos

* add ut for assign pos op

* update date

* fix for win

* update for test (timeout)

* fix ut

* update

* fix ut for number count
Co-authored-by: Nhlygit66666 <2570058140@qq.com>

305f32d1

L

Wrap dist api for dygraph mode (#40408) · 9d8cfc1b
由 lilong12 提交于 3月 24, 2022

9d8cfc1b
G

support dp for class_center_sample and margin_cross_entropy (#39852) · bff9e28e
由 Guoxia Wang 提交于 3月 24, 2022

bff9e28e
X
[Auto Parallel] Gradient merge pass support dist attribute (#40737) · 0443c6f4
由 xiayanming 提交于 3月 24, 2022
```
* [Auto Parallel] gradient merge pass support dist attribute
```
0443c6f4
Z

Add sparse convertion api and sparse creation api (#40780) · a8f86600
由 zhangkaihuo 提交于 3月 24, 2022

a8f86600
Z

modify communicator api (#40881) · f95f3a65
由 zhaocaibei123 提交于 3月 24, 2022

f95f3a65
K

fix device id env (#40844) · 8562668e
由 kuizhiqing 提交于 3月 24, 2022

8562668e

Polish optest: refine the optest parameter logic. support name, dtype, out,... · a8df3901

由 xiongkun 提交于 3月 24, 2022

Polish optest: refine the optest parameter logic. support name, dtype, out, output in arbitrary position (#40824)

* 1. add the python api grad 2. add final and intermediate state vlog 3. change the python_api error logic

* add python api or close the check_eager=True

* fix the compatibility

* matmul

* disable unittests: test_elementwise_add_op test_scatter_nd_op test_gather_nd_op test_scatter_op test_index_sample_op test_elementwise_add_mkldnn_op

* refine the logic of prepara_parameter logic

* fix Tensor(gpu) 2 Scalar segment fault.

a8df3901

Refine eager run_program OP for dy2st UT (#40768) · 4ccd5cb8

由 0x45f 提交于 3月 24, 2022

* Refine eager run_program OP for dy2st UT

* append run_program error string and refine run_program_grad

* remove some comments

* refine ConstructXGradTensors

4ccd5cb8

C
[Auto Parallel] Update cost model (#40457) · c1c9368f
由 caozhou 提交于 3月 24, 2022
```
* refactor cost model
```
c1c9368f

23 3月, 2022 14 次提交

J
Added support for BF16 datatype for all oneDNN activation kernels (#40721) · 8e67629c
由 jakpiase 提交于 3月 23, 2022
```
* added missing BF16 activations

* added softplus bf16

* minor change

* disabled tests for GPU
```
8e67629c

[NPU] add npu support for conv3d and conv3d_grad (#38480) · ff568afa

由 furnace 提交于 3月 23, 2022

* [NPU] add npu support for conv3d and conv3d_grad

* [NPU] delete failed unittests due to Ascend not support

* [NPU] delete debug codes

* [NPU] optimize codes, notest

* [NPU] remove const_cast

* [NPU] optimize for remove const_cast

* [NPU] fix written errors

ff568afa

two-phase training for ps (#40762) · b1a4668c

由 zhaocaibei123 提交于 3月 23, 2022

* fix benchmark and communicator config

* fix bugs of the_one_ps

* multi program and fix bug in optimizer

* multi program in the_one_ps

* public commcontext

* ps optimizer multi programs

* cvm & datanorm backend

* fix dim

* fix unittest

* fix

* the one ps merge

* remove comm

* add DownpourLiteWorker

* all

* fix

* fix

* device worker downpour lite

* fix

* fix bug in global shuffle

* save inference model

* fix & add log

* fix

* remove log

* fix

* fix save summary

* fix

* fix pscore

* fix

* fix

* fix

* fix

* fix

* remove logs

* fix

* fix

* fix

* fix

* fix

* add some comments

* fix
Co-authored-by: Nesythan <esythan@126.com>

b1a4668c

Z
[AutoParallel] engine & dist_saver (#40528) · 3980e222
由 zhaoyingli 提交于 3月 23, 2022
```
* add dist_saver and update engine

* add dist_saver and update engine
```
3980e222

[Eager Hook + Inplace] Refactor register_hook and test with inplace operation (#40778) · ff7cbaae

由 Weilong Wu 提交于 3月 23, 2022

* disable scatter case in test_inplace_eager_fluid

* Update register_hook logic

* Add register_hook test cases
Co-authored-by: Npangyoki <pangyoki@126.com>

ff7cbaae

Support sharding (#40637) · fe291daf

由 Jiabin Yang 提交于 3月 23, 2022

* suppor sharding api

* support multi api for sharding in eager

* support multi api for sharding in eager

* fix test

* fix test coverage

fe291daf

Add yaml config part2 (#40742) · f4075db8

由 hong 提交于 3月 23, 2022

* fix error; test=develop

* update

* close some yaml

* fix backward attrite error; test=develop

* add div test

* polish code; test=develop

* remove none gbk charactor;

* remove some yaml;

* fix optional bug

* recover yaml config

* resolve confilct; test=develop

* close div; test=develop

f4075db8

[Eager] Slice (#40587) · b07d239c

由 wanghuancoder 提交于 3月 23, 2022

* fix some slice bug, test=develop

* eager slice, test=develop

* eager slice, test=develop

* refine, test=develop

* refine, test=develop

* fix bug, test=develop

* refine, test=develop

* rename function name, test=develop

b07d239c

Support initializing specific grad tensors to zero for selected operators (#39963) · 2f50ae99

由 Zhanlue Yang 提交于 3月 23, 2022

* Supported Complex2Real Conversion for Eager Dygraph

* Supported Complex2Real Conversion for Eager Dygraph

* Enabled complex type promotion test for matmul_v2

* Fix CI issues

* Support initializing specific grad tensors to zero for selected operators

* Merged adj_edges_ with GradSlotMeta

* Fixed monir issue

* Adjusted num runs

* Recovered Eager performance tests configurations

* Recovered Eager performance tests configurations

* Adjusted performance tests configurations

* Fixed Minor Issues with performance tests

* Moved out Edge from GradSlotMeta

* Fixed issues from merge

* Fixed typo

* Addressed review comments

* Fixed merge issues

* Fixed minor issues

* Fixed minor issue

* Fixed major issues and enabled auto_prune test cases

* Fixed issues from merge

2f50ae99

Add complex type compatibility for stft api and stft op. (#40113) · 319f95d0

由 KP 提交于 3月 23, 2022

* Add stft_op.

* Add stft_grad_op.

* Add stft_op unittest.

* [DLTP-45176] Add complex compatibility in static mode for stft api.

* [DLTP-45176] Add complex compatibility in static mode for stft api.

* Add doc.

* Update unitests of stft op.

* Update spectral helper.

* fix coding style.

319f95d0

Add profiler features (#40357) · c15e3823

由 chenjian 提交于 3月 23, 2022

* add event record for model profiling

* fix format

* fix format

* fix code example bug

* no

* add profiler statistic

* add profiler feature

* fix bug

* fix bug

* fix bug

* fix bug

* required: gpu

* required: gpu

* fix bug

* required: gpu

* fix ci bug

* fix ci error

* fix ci error

* upgrade document

* fix doc

* fix ci bug

* add doc and fix bug

* nothing

* fix bug

* fix format bug

* modify format

* add deprecated description for old profiler

* fix bug

* fix bug

* fix

* add load_profiler_reuslt doc

* add load_profiler_reuslt doc

* add load_profiler_reuslt doc

* help fix old profiler sample code

* add api doc

* fix format

* fix api doc

* fix api doc format

* fix api doc format

* fix api doc c format

* fix api doc format

c15e3823

change CUDA implementation of multinomial OP (#40752) · 58970995
由 zhouweiwei2014 提交于 3月 23, 2022

58970995
W

Support test_layers(group_norm,while_loop) with eager mode (#40816) · db41e39e
由 Weilong Wu 提交于 3月 23, 2022

db41e39e
K

enable continuous log; update doc (#40782) · fdafbc7b
由 kuizhiqing 提交于 3月 23, 2022

fdafbc7b

22 3月, 2022 6 次提交

[new-exec] async prepare deps (#40713) · 814f7211

由 Leo Chen 提交于 3月 22, 2022

* async prepare deps

* fix bug that std::future is not set

* add ut

* refine code

* fix standalone ut

* disable prof

814f7211

polish python api logic and add backward python api check (#40666) · c29f85b6

由 xiongkun 提交于 3月 22, 2022

* 1. add the python api grad 2. add final and intermediate state vlog 3. change the python_api error logic

* add python api or close the check_eager=True

* fix the compatibility

* matmul

* disable unittests: test_elementwise_add_op test_scatter_nd_op test_gather_nd_op test_scatter_op test_index_sample_op test_elementwise_add_mkldnn_op

c29f85b6

W

Update unit tests by using _test_eager_guard (#40760) · 3b8bcd5a
由 Weilong Wu 提交于 3月 22, 2022

3b8bcd5a
Z

Add more annotations to test_cholesky_solve_op.py, make it an example in hackson guide · 64c268b2
由 zhiboniu 提交于 3月 22, 2022

64c268b2
P

disable scatter case in test_inplace_eager_fluid (#40756) · 72a2bfe2
由 pangyoki 提交于 3月 22, 2022

72a2bfe2

[phi] Update graph_send_recv OP (#40509) · 67b46e45

由 Siming Dai 提交于 3月 22, 2022

* add out_size shape for graph_send_recv

* fix bug in register kernel: no const int& support

* add out_size in infermeta

* change unittest

* fix unittest

* fix out_size default value

* fix doc

* delete arg mapping

* add sig

* move -1 to 0

* move -1 to 0

67b46e45

21 3月, 2022 3 次提交

Refine to_tensor for eager mode and support gpu_pinned (#40535) · 45d1fb8d

由 0x45f 提交于 3月 21, 2022

* Refine to_tensor for eager mode

* support gpu_pinned

* refine code

* support gpu_pinned copy_to

* fix layer.__setattr__

* support to_tensor for gpu_pinned

* fix unit test

* refine gpu_pinned

* restore the original code

* add is_gup_pinned() and refine eager.Tensor._copy_to()

45d1fb8d

K

fleetrun launch in legacy mode (#40568) · c54c60de
由 kuizhiqing 提交于 3月 21, 2022

c54c60de

Merge some test bug (#40543) · 56c43ccd

由 hong 提交于 3月 21, 2022

* switch eager mode and change it

* set default is eager

* set default is eager

* fix error; test=develop

* fix some error; test=develop

* update

* upd

* update code; test=develop

* update

* fix some bug; test=develop

* fix bug; test=develop

* fix bug; test=develop

* fix bug; test=develop

* fix error; test=develop

* format; test=develop
Co-authored-by: NJiabinYang <360788950@qq.com>

56c43ccd

PaddlePaddle / Paddle 大约 2 年 前同步成功

PaddlePaddle / Paddle
大约 2 年前同步成功