提交 · 3d5a27f03968cd0cbdae1768e414fbf50a252529 · 机器未来 / Paddle

28 3月, 2022 5 次提交
- G
  add adaround post-quant method (#38460) · 3d5a27f0
  由 Guanghua Yu 提交于 3月 28, 2022
```
* add adaround post-quant method
```
  3d5a27f0
- Z
  Enabled eager_mode for complex unit tests, except for test_complex_op.py and... · 56dc8c79
  由 Zhanlue Yang 提交于 3月 28, 2022
```
Enabled eager_mode for complex unit tests, except for test_complex_op.py and test_complex_view_op.py (#40887)
```
  56dc8c79
- A
  [Dy2Stat] Fix ForLoop Transformation with single return (#40683) · 287cbde8
  由 Aurelius84 提交于 3月 28, 2022
```
* [Dy2Stat] Fix ForLoop Transformation with single return

* [Dy2Stat] Fix ForLoop Transformation with single return
```
  287cbde8
- 0
  Refine test_lac.py for eager mode (#40951) · c03186f9
  由 0x45f 提交于 3月 28, 2022
```
* Refine test_lac.py for eager mode

* refine code

* Fix test_program_translator for eager
```
  c03186f9
- A
  Fix bug while specifying target grad in high order gradient (#40940) · 0d0d76eb
  由 Aurelius84 提交于 3月 28, 2022
```
* Fix bug while specifying target grad in high order gradient

* add more unittest

* add more unittest
```
  0d0d76eb
27 3月, 2022 4 次提交

X
[ Optest ] refactor optest check_output_with_place logic (#40928) · 37f914c8
由 xiongkun 提交于 3月 27, 2022
```
* first version, maybe many errors

* refactor op_test

* fix compare list

* fix bg

* fix bugs
```
37f914c8

[new-exec] fit for mkldnn and inplace op (#40955) · afa0e82c

由 Leo Chen 提交于 3月 27, 2022

* fit for mkldnn and inplace op

* fix compile

* refine ut

* register op version

* fix inplace op

* fix transfer_layout

afa0e82c

Move slice to phi (#40736) · b8236b7b

由 hong 提交于 3月 27, 2022

* move slice to pten

* merge develop; test=develop

* fix slice bug;

* update

* update

* fix error

* update

* fix bug

* polish code

* polish code

* polish code

* try to fix windows bug

* add gpu compile flag;

* try to fix

* remov template;

* polish code;

* fix npu bug;

* fix npu bug

* fix npu bug; test=develop

* fix slice bug;

* remove no need dep

b8236b7b

A
[NPU] fix npu cast ut (#40982) · f6b6b057
由 Aganlengzi 提交于 3月 27, 2022
```
* [NPU] fix npu cast ut

* [NPU] fix npu cast ut
```
f6b6b057

26 3月, 2022 2 次提交
- C
  
  Move the redundant numpy() (#40931) · 7e05680c
  由 crystal 提交于 3月 26, 2022
  
  7e05680c
- C
  
  add double grad op example (#40963) · 0ee76f92
  由 Chen Weihang 提交于 3月 26, 2022
  
  0ee76f92
25 3月, 2022 15 次提交

update eager code gen (#40924) · afe2fdd1

由 hong 提交于 3月 25, 2022

* update

* remove useless code

* remove label smooth test

* polish code

* polish code

* polish code

* remove _in_eager_mode error;

afe2fdd1

D
fix lars optitmizer bug (#40892) · c006a609
由 duanboqiang 提交于 3月 25, 2022
```
* fix lars optitmizer bug

* Update optimizer.py
```
c006a609
Z

fix sync_bn error in fp16 amp-o2 (#40943) · 9ab3c76b
由 zhangbo9674 提交于 3月 25, 2022

9ab3c76b
Z

[MLU]add allreduce max/prod/min mlu kernel (#40792) · 9261dff4
由 zn 提交于 3月 25, 2022

9261dff4
0

Fix param@grad type error for amp in run_program (#40938) · 54632b5c
由 0x45f 提交于 3月 25, 2022

54632b5c
J
Fix in dygraph mode doc (#40942) · 09e5b00c
由 Jiabin Yang 提交于 3月 25, 2022
```
* fix doc for enable api

* test=document_fix
```
09e5b00c

add cast_grad phi kernel (#40798) · b79c6a9b

由 zhangbo9674 提交于 3月 25, 2022

* add cast_grad phi kernel

* refie unittest

* refien unittest

* refine unittest

* refine include header path

* refien xpu cast unittest

* refine code

b79c6a9b

support multi_dims for tril_triu, *test=kunlun (#40712) · 9ffedcfd

由 z8hanghuan 提交于 3月 25, 2022

* support multi_dims for tril_triu, *test=kunlun

* support multi_dims for tril_triu, *test=kunlun

* support multi_dims for tril_triu, *test=kunlun

* update xpu.cmake date, support multi_dims for tril_triu, *test=kunlun

9ffedcfd

change CUDA implementation of dropout OP (#40874) · 1c01d1cc
由 zhouweiwei2014 提交于 3月 25, 2022

1c01d1cc

Refactor Dygraph Flags (#40786) · 3085d5e4

由 Jiabin Yang 提交于 3月 25, 2022

* refactor eager flags

* fix flags error when we switch from eager to dygraph

* fix ci problem

* fix ci

* fix ci

* merge develop and fix code style

* merge develop and fix code style

* fix op test error

* fix op test error

* fix op test error

* fix op test error

* fix op test error

* merge develop

3085d5e4

T

fix xpu op test, *test=kunlun (#40862) · 1db9cd46
由 TTerror 提交于 3月 25, 2022

1db9cd46

[OpTest] Polish optest (#40879) · d43e8433

由 xiongkun 提交于 3月 25, 2022

* 1. add the python api grad 2. add final and intermediate state vlog 3. change the python_api error logic

* add python api or close the check_eager=True

* fix the compatibility

* matmul

* disable unittests: test_elementwise_add_op test_scatter_nd_op test_gather_nd_op test_scatter_op test_index_sample_op test_elementwise_add_mkldnn_op

* refine the logic of prepara_parameter logic

* fix Tensor(gpu) 2 Scalar segment fault.

* add multi-attribute. (test_unsqueeze_op); add python_sig_out for customizing op sig out

* fix some bugs, support python_out_sig

d43e8433

A
[NPU] add merged_momentum (#40875) · 2b74b739
由 Aganlengzi 提交于 3月 25, 2022
```
* [NPU] add merged_momentum

* fix

* fix device
```
2b74b739
Z

modify unit test in bn, stack and split. *test=kunlun (#40880) · 139a30ec
由 Zhangjingyu06 提交于 3月 25, 2022

139a30ec

support get_item where the index is a bool scalar tensor (#40829) · 0f5e90a2

由 FlyingQianMM 提交于 3月 25, 2022

* support get_item where the index is a bool scalar tensor

* add unittests for supporting get_item where the index is a bool scalar tensor

0f5e90a2

24 3月, 2022 11 次提交

[AMP] Support amp for Intermediate_dygraph (#40623) · c12f7d48

由 zhangbo9674 提交于 3月 24, 2022

* approve amp for intermediate_dygraph

* add amp_utils for intermediate_dygraph

* add amp needcast check for mlu & npu

* test unittest

* add SetGradNode for set_stop_gradient && add checktensor for GradientHooks

* refine code

* refien unittest of imperative_amp for new dygraph

* inplace api skip amp

* add test_imperative_qat_amp for intermediate amp

* refine code

* refine test_amp ci strategy

* refine unittest code

* refine amp_utils code

* refine amp getpromotetype for some special op

* refine unittest code

c12f7d48

[MoE]Assign pos op (#40580) · 305f32d1

由 Roc 提交于 3月 24, 2022

* # This is a combination of 10 commits.
# The first commit's message is:
add expert count op

add ut for expert_count

# This is the 2nd commit message:

update UT only for cuda

# This is the 3rd commit message:

fix for rocm

# This is the 4th commit message:

update ut

# This is the 5th commit message:

add moe module

# This is the 6th commit message:

add expert count op

add ut for expert_count

# This is the 7th commit message:

update UT only for cuda

# This is the 8th commit message:

update ut

# This is the 9th commit message:

add moe module

# This is the 10th commit message:

make expert count private

* add assign pos op

* fix upper num name

* add api _assign pos

* add ut for assign pos op

* update date

* fix for win

* update for test (timeout)

* fix ut

* update

* fix ut for number count
Co-authored-by: Nhlygit66666 <2570058140@qq.com>

305f32d1

L

Wrap dist api for dygraph mode (#40408) · 9d8cfc1b
由 lilong12 提交于 3月 24, 2022

9d8cfc1b
G

support dp for class_center_sample and margin_cross_entropy (#39852) · bff9e28e
由 Guoxia Wang 提交于 3月 24, 2022

bff9e28e
X
[Auto Parallel] Gradient merge pass support dist attribute (#40737) · 0443c6f4
由 xiayanming 提交于 3月 24, 2022
```
* [Auto Parallel] gradient merge pass support dist attribute
```
0443c6f4
Z

Add sparse convertion api and sparse creation api (#40780) · a8f86600
由 zhangkaihuo 提交于 3月 24, 2022

a8f86600
Z

modify communicator api (#40881) · f95f3a65
由 zhaocaibei123 提交于 3月 24, 2022

f95f3a65
K

fix device id env (#40844) · 8562668e
由 kuizhiqing 提交于 3月 24, 2022

8562668e

Polish optest: refine the optest parameter logic. support name, dtype, out,... · a8df3901

由 xiongkun 提交于 3月 24, 2022

Polish optest: refine the optest parameter logic. support name, dtype, out, output in arbitrary position (#40824)

* 1. add the python api grad 2. add final and intermediate state vlog 3. change the python_api error logic

* add python api or close the check_eager=True

* fix the compatibility

* matmul

* disable unittests: test_elementwise_add_op test_scatter_nd_op test_gather_nd_op test_scatter_op test_index_sample_op test_elementwise_add_mkldnn_op

* refine the logic of prepara_parameter logic

* fix Tensor(gpu) 2 Scalar segment fault.

a8df3901

Refine eager run_program OP for dy2st UT (#40768) · 4ccd5cb8

由 0x45f 提交于 3月 24, 2022

* Refine eager run_program OP for dy2st UT

* append run_program error string and refine run_program_grad

* remove some comments

* refine ConstructXGradTensors

4ccd5cb8

C
[Auto Parallel] Update cost model (#40457) · c1c9368f
由 caozhou 提交于 3月 24, 2022
```
* refactor cost model
```
c1c9368f

23 3月, 2022 3 次提交

J
Added support for BF16 datatype for all oneDNN activation kernels (#40721) · 8e67629c
由 jakpiase 提交于 3月 23, 2022
```
* added missing BF16 activations

* added softplus bf16

* minor change

* disabled tests for GPU
```
8e67629c

[NPU] add npu support for conv3d and conv3d_grad (#38480) · ff568afa

由 furnace 提交于 3月 23, 2022

* [NPU] add npu support for conv3d and conv3d_grad

* [NPU] delete failed unittests due to Ascend not support

* [NPU] delete debug codes

* [NPU] optimize codes, notest

* [NPU] remove const_cast

* [NPU] optimize for remove const_cast

* [NPU] fix written errors

ff568afa

two-phase training for ps (#40762) · b1a4668c

由 zhaocaibei123 提交于 3月 23, 2022

* fix benchmark and communicator config

* fix bugs of the_one_ps

* multi program and fix bug in optimizer

* multi program in the_one_ps

* public commcontext

* ps optimizer multi programs

* cvm & datanorm backend

* fix dim

* fix unittest

* fix

* the one ps merge

* remove comm

* add DownpourLiteWorker

* all

* fix

* fix

* device worker downpour lite

* fix

* fix bug in global shuffle

* save inference model

* fix & add log

* fix

* remove log

* fix

* fix save summary

* fix

* fix pscore

* fix

* fix

* fix

* fix

* fix

* remove logs

* fix

* fix

* fix

* fix

* fix

* add some comments

* fix
Co-authored-by: Nesythan <esythan@126.com>

b1a4668c

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致