提交 · 9c2a9afd0dd688f99d9ec8d22cafcd3f6ce0bb44 · PaddlePaddle / Paddle

01 4月, 2022 3 次提交

Z

add framework._non_static_mode temporarily for hackson; test=document_fix (#41220) · e6a19aea
由 zhiboniu 提交于 4月 01, 2022

e6a19aea

[Phi] Add shape and strided_slice yaml & Adapt eager mode (#41131) · 9b6a02d4

由 Chen Weihang 提交于 4月 01, 2022

* add several yaml

* polish strided slice kernel & add yaml

* reorder yaml

* add several yaml

* revert yaml config change

* resolve conflict

* Update test_strided_slice_op.py

9b6a02d4

Add basic yaml backward (#40751) · 98303291

由 hong 提交于 4月 01, 2022

* fix error; test=develop

* update

* close some yaml

* fix backward attrite error; test=develop

* add div test

* polish code; test=develop

* update

* update

* fix bug

* update bitwise code; test=develop

* update

* update

* fix some bug

* update

* revert cmakelist

* fix optional bug;

* fix bug

* fix bug;

* add backward test

* open bn

* update

* update

* revert eager_gen

* polish code

* fix topk error

* update

* update

* fix bug;

* move label smooth, nll loss

* revert topk

* fix topk label smooth bug;

* remove batch_norm

* remove topk

* change flip infer meta

* fix flip bug

* update yaml

* close abs

* fix histogram bug

* fix histogram bug

* add abs

* fix histogram kernel

* remove expand

98303291

31 3月, 2022 22 次提交

C

remove comment yamls, test=document_fix (#41221) · 3a7761a0
由 Chen Weihang 提交于 3月 31, 2022

3a7761a0
0

Switch some dy2st UT to eager (#41175) · 2003610e
由 0x45f 提交于 3月 31, 2022

2003610e
Z
[Phi] Rename ScalarArray to IntArray (#40975) · e559fe41
由 zyfncg 提交于 3月 31, 2022
```
* rename scalar_array to int_array

* update cmake

* fix conflict

* remove useless log
```
e559fe41
A
[Yaml] Migrate sqrt/square/reciprocal yaml (#41164) · 2d69abd2
由 Aurelius84 提交于 3月 31, 2022
```
* [Yaml] Migrate sqrt/square/reciprocal yaml

* clean file

* fix unittest error
```
2d69abd2
0

Fix `parent_block.var(name)` error in static mode (#41162) · a54ec5a8
由 0x45f 提交于 3月 31, 2022

a54ec5a8

Enhance eigh, eigvalsh unit tests (#40699) · a8be9b6d

由 zlsh80826 提交于 3月 31, 2022

* Enhance test_eigh_op

* Use eigen decomposition to validate eigen values and vectors
* Fix that TestEighBatchAPI didn't run the batched input

* Enhance test_eigvalsh_op

* Align cusolver tolerance to validate eigenvalues
* Fix that BatchAPI didn't run the batched input

* Add abs for |d_ref|

* Remove comment

a8be9b6d

W

fix some bug, test=develop (#41144) · eac23db1
由 wanghuancoder 提交于 3月 31, 2022

eac23db1
W
add multiclass nms3 trt converter (#41181) · 08c3edb3
由 wangxinxin08 提交于 3月 31, 2022
```
* add multiclass_nms3 converter
```
08c3edb3

add flatten2,reshape2,squueze2_trt_fuse_pass test cast (#41031) · 7ef69202

由 heliqi 提交于 3月 31, 2022

* add flatten2,reshape2,squueze2_trt_fuse_pass  test cast

* add flatten2,reshape2,squueze2_trt_fuse_pass  test cast

* add flatten2,reshape2,squueze2_trt_fuse_pass  test cast

7ef69202

[New API]: miminize_bfgs and miminize_lbfgs (#40710) · e7928a06

由 Sing_chan 提交于 3月 31, 2022

* [New API]: miminize_bfgs and miminize_lbfgs

* modify for python module call correctly

* add functional package, add error raise in static_graph, change assign to set_value

* unify static_graph and dygraph, fix bug when x or H0 is float64

* now only accept input is tensor, put check args in utils.py, put exception test together

* temp

* add more detailed algorithm illustration and comment, reduce test case to limit test time in 15s

* change in_dygraph_mode to in_dynamic_mode

* fix bug of sample code; reduce test case to reduce test time

* change dir to incubate

e7928a06

0

Fix test_run_program_op.py (#41141) · 7c555f4e
由 0x45f 提交于 3月 31, 2022

7c555f4e

fix load bug and add distributed strategy from pslib (#40883) · 47383dca

由 wangguanqun 提交于 3月 31, 2022

* fix load bug and add distributed strategy from pslib

* add unittest

* use cvm config

* trainer and worker config

* add unittest

* add unittest

* add test

* code style

47383dca

L
add depend when doing fuse_all_optimizer on program (#41178) · 3b00dc92
由 Leo Chen 提交于 3月 31, 2022
```
* fix dependency of fused optimizer

* add ut
```
3b00dc92
C
Fix operator summary table (#41157) · 4e3c7338
由 chenjian 提交于 3月 31, 2022
```
* no

* fix operator summary table

* update unit test
```
4e3c7338

Add probability distribution transformation APIs (#40536) · 6735a37a

由 Xiaoxu Chen 提交于 3月 31, 2022

* add random varaiable transformations API for paddle's distribution package

* add TransformedDistribution API for paddle's probability distribution package

* add random variable transformation unitests for static graph

* replace math.prod which not support python3.7 with functools.reduce

* add Independent and TransformedDistribution distribution

* add unittests for constraint

* fix typo and AffineTransform sample code error

* add mean,variance,rsample abstract method for Distribution

6735a37a

Add time range duration display (#41029) · 6744754f

由 chenjian 提交于 3月 31, 2022

* no

* fix bugs

* fix doc according to review

* fix api doc format

* fix api doc according to review

* fix bug and add unit test

* fix record event bug

* optimize chrome tracing display

* fix bug

* add comment

* add unit test

* fix a bug

* fix

* fix

* fix format

6744754f

Z

Opt the compilation of sparse kernel (#41086) · b9da48da
由 zhangkaihuo 提交于 3月 31, 2022

b9da48da
Y

update elementwise unittest style, *test=kunlun (#40779) · 23a69bc7
由 ykkk2333 提交于 3月 31, 2022

23a69bc7
Z

fix adam is_sparse bug in final state dygraph (#41125) · 0d5c27b2
由 zhangbo9674 提交于 3月 31, 2022

0d5c27b2

support view strategy in eager_fluid state (#40830) · 2f1c1ae5

由 pangyoki 提交于 3月 31, 2022

* support view strategy in eager_fluid state

* little change

* little change

* optimize unittest

* fix

2f1c1ae5

P

fix eager_gen node bug (#41165) · 56493c9e
由 pangyoki 提交于 3月 31, 2022

56493c9e

Support inplace strategy for pylayer (#41043) · 11d1a51a

由 pangyoki 提交于 3月 31, 2022

* Supported Complex2Real Conversion for Eager Dygraph

* Supported Complex2Real Conversion for Eager Dygraph

* Enabled complex type promotion test for matmul_v2

* pylayer, test=develop

* Fix CI issues

* Support initializing specific grad tensors to zero for selected operators

* finish forward, test=develop

* create grad node finish, test=develop

* Merged adj_edges_ with GradSlotMeta

* Fixed monir issue

* backward finish, start dbg, test=develop

* Adjusted num runs

* Recovered Eager performance tests configurations

* Recovered Eager performance tests configurations

* finish, test=develop

* polish, test=develop

* polish, test=develop

* refine, test=develop

* eager, test=develop

* Adjusted performance tests configurations

* Fixed Minor Issues with performance tests

* [Phi] Fix macro name typo

* support set_materialize_grads, test=develop

* suppotr mark_non_differentiable, test=develop

* support once_differentiable, test=develop

* refine, test=develop

* refine, test=develop

* Moved out Edge from GradSlotMeta

* Fixed issues from merge

* Fixed typo

* Addressed review comments

* Fixed merge issues

* Fixed minor issues

* Fixed minor issue

* refine, test=develop

* refine, test=develop

* refine, test=develop

* Fixed major issues and enabled auto_prune test cases

* Fixed issues from merge

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* support inplace for pylayer
Co-authored-by: Njim19930609 <jim19930609@gmail.com>
Co-authored-by: NWang Huan <wanghuan29@baidu.com>
Co-authored-by: NAurelius84 <zhangliujie@baidu.com>

11d1a51a

30 3月, 2022 15 次提交

Z
py36 Import error bug fix (#41135) · d006c7ff
由 ziyoujiyi 提交于 3月 30, 2022
```
* lazy import

* log error
```
d006c7ff
F

Fix bug for UT test_calc_gradient (#41130) · 4d6a3b9f
由 From00 提交于 3月 30, 2022

4d6a3b9f
0

Fix test_jit_save_load (#41114) · 4b61918d
由 0x45f 提交于 3月 30, 2022

4b61918d
Z
[AutoParallel] fix converter when sliced_shape is 1 (#41103) · 59c4fdac
由 zhaoyingli 提交于 3月 30, 2022
```
* fix converter when sliced_shape is 1

* update unittest
```
59c4fdac

delete ps env (#41079) · a0e961c0

由 ziyoujiyi 提交于 3月 30, 2022

* back fl

* delete ssl cert

* .

* make warning

* .

* unittest paral degree

* solve unittest

* heter & multi cloud commm ready

* correct pass not regisiter

* back

* back

* .

* .

a0e961c0

P

dsiable scatter case in test_inplace, test=document_fix (#41152) · 5f7d129a
由 pangyoki 提交于 3月 30, 2022

5f7d129a

[MoE] Moe apis (#41092) · aac7879a

由 Roc 提交于 3月 30, 2022

* add random routing op

add _random_routing api in utils

add random routing ut

* # This is a combination of 10 commits.
# The first commit's message is:
add expert count op

add ut for expert_count

# This is the 2nd commit message:

update UT only for cuda

# This is the 3rd commit message:

fix for rocm

# This is the 4th commit message:

update ut

# This is the 5th commit message:

add moe module

# This is the 6th commit message:

add expert count op

add ut for expert_count

# This is the 7th commit message:

update UT only for cuda

# This is the 8th commit message:

update ut

# This is the 9th commit message:

add moe module

# This is the 10th commit message:

make expert count private

* add assign pos op

* fix upper num name

* add api _assign pos

* add ut for assign pos op

* update date

* add op about moe gate

update utils

add limit by capacity op

add ut for limit_by_capacity

add ut for prune_gate_by_capacity

add ut for limit_by_capacity

add ut for prune_gate_by_capacity

* fix for win

* fix bugs in test_limit_by_capacity_op

* update ut

* update for test (timeout)

* fix ut

* update

* update(fix) ut for win

* moe apis in incubate

* # This is a combination of 10 commits.
# The first commit's message is:
add expert count op

add ut for expert_count

# This is the 2nd commit message:

update UT only for cuda

# This is the 3rd commit message:

fix for rocm

# This is the 4th commit message:

update ut

# This is the 5th commit message:

add moe module

# This is the 6th commit message:

add expert count op

add ut for expert_count

# This is the 7th commit message:

update UT only for cuda

# This is the 8th commit message:

update ut

# This is the 9th commit message:

add moe module

# This is the 10th commit message:

make expert count private

* add assign pos op

* fix upper num name

* add api _assign pos

* add ut for assign pos op

* update date

* fix for win

* update for test (timeout)

* fix ut

* update

* fix ut for number count

* add apis and utils

* add gate apis

* add moe and grad clip apis

* update moe apis

* add ops for moe gate

* fix

* update for base moe layer api

* add random routing op

add _random_routing api in utils

add random routing ut

* fix for dygraph

* update with ranodm routing

* update

* fix ut for limit by capacity

* update

* update limit by capacity for easily to switch to single thread mode

* update api docs
Co-authored-by: Nhlygit66666 <2570058140@qq.com>

aac7879a

H
[Op] Fix uncontrolled randomness of index_select op (#41078) · 8f7c02f2
由 Haohongxiang 提交于 3月 30, 2022
```
* fix uncontrolled randomness of op

* fix bugs
```
8f7c02f2

Add new APIs for GPU memory monitoring (max_memory_allocated,... · afe02e9d

由 From00 提交于 3月 30, 2022

Add new APIs for GPU memory monitoring (max_memory_allocated, max_memory_reserved, memory_allocated, memory_reserved) (#38657)

* Add new API memory_reserved

* Add memory_allocated, max_memory_reserved and max_memory_allocater

* Fix CI error

* Fix CI error

* Enhance UT

* Add FLAGS_memory_stats_opt

* Add STATS macro functions

* Add StatAllocator

* Fix CI errors

* Add UT

* Fix CI errors

afe02e9d

C

fix reshard bug (#41106) · e494b73b
由 caozhou 提交于 3月 30, 2022

e494b73b
H
Revert "Revert "Move some activation to phi (#40727)" (#41056)" (#41095) · 91bb52cd
由 hong 提交于 3月 30, 2022
```
This reverts commit 05f3d48e.
```
91bb52cd

[DoubleGrad PR ] Supported higher-order GradNode generation (#41051) · abd2df4c

由 Zhanlue Yang 提交于 3月 30, 2022

* [Refactor] refactored eager_gen.py PR #2

* [DoubleGrad PR #1] Decoupled code generation logics for Dygraph ForwardFunctions and GradNodes

* Fixed minor issue

* Adjusted logics of GenerateNodeCreationCodes and GenerateForwardDefinition

* Fixed issues

* Supported higher-order grad node generation

* [DoubleGrad PR #4] Supported higher-order GradNode generation

* Fixed yaml typo

abd2df4c

fix cross_entropy when run static graph mode of mlu and npu (#40621) · 489a64ef
由努力努力在努力丶提交于 3月 30, 2022

489a64ef
P

add _reset_grad_inplace_version (#41101) · cb8afc24
由 pangyoki 提交于 3月 30, 2022

cb8afc24
0
Switch some dy2st UT to eager mode (#41052) · a5bfa797
由 0x45f 提交于 3月 30, 2022
```
* Switch some dy2st UT to eager mode

* Add UT
```
a5bfa797

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功