提交 · ab786715fb55cefeec20f7acefeeba896ef7c5e1 · PaddlePaddle / Paddle

28 12月, 2022 15 次提交

S

fix unique_kernel support axis=-1 (#49385) · ab786715
由 sprouteer 提交于 12月 28, 2022

ab786715

remove fluid.contrib.fused_elemwise_activation, sequence_topk_avg_pooling,... · da357615

由 zqw_1997 提交于 12月 28, 2022

remove fluid.contrib.fused_elemwise_activation, sequence_topk_avg_pooling, var_conv_2d, match_matrix_tensor and tree_conv (#49331)

da357615

[new-exec] Ahead-Of-Time choosing kernel (#48789) · 63d2d722

由 Leo Chen 提交于 12月 28, 2022

* add skip run

* alloc minimum memory

* skip check_size in Alloc

* skip check_size in Alloc

* skip check_size in Alloc

* fix cases when tensor is initialized or empty

* alloc empty output for place info

* add test

* increase timeout

* format code

* skip cpu

* add cudnn_deterministic

* fit for hostAlloc

* follow comments

* change check_size to fake_alloc

63d2d722

generate the static graph code of some ops (#49212) · 1804f834

由 HappyHeavyRain 提交于 12月 28, 2022

* generate the static op of some ops

* add the VERSION of pixel_shuffle

* change the API doc of isclose

* change the API doc of isclose

* fix the isclose op comment

1804f834

[ 0d-Tensor ] einsum support 0d tensor. (#49177) · 71bde066

由 xiongkun 提交于 12月 28, 2022

* einsum support 0d tensor.
1. support 0d tensor in multi-operands.
2. add 9 unittests for einsum 0d tensor.

* override NVIDIA_TF32_OVERRIDE to avoid accuracy problem in 11.2 and 11.8

71bde066

X

fix_moe (#49353) · 04511cf9
由 xiaoxiaohehe001 提交于 12月 28, 2022

04511cf9

[CodeStyle][py36] update pypi doc (#48640) · a221158f

由 Matsumoto Ruko 提交于 12月 28, 2022

* update pypi doc

* update pypi doc

* update pypi doc

* empty commit, re-trigger all ci
Co-authored-by: NSigureMo <sigure.qaq@gmail.com>

a221158f

H

fix bugs of paddle.multiplex API (#49368) · f6f0c562
由 Haohongxiang 提交于 12月 28, 2022

f6f0c562

[AutoParallel] adapt for clip (#49249) · df944772

由 zhaoyingli 提交于 12月 28, 2022

* [AutoParallel] adapt for clip

* fix unittest

* enable_static

* fix dist_fill_constant_batch_size_like

* fix process_mesh.shape

* update cond of modifying shape_list

df944772

Z

[AutoParallel] fix process_mesh's member (#49371) · 836a662c
由 zhaoyingli 提交于 12月 28, 2022

836a662c
Y

update some trt log (#49330) · 02019804
由 Yuanle Liu 提交于 12月 28, 2022

02019804
W

Fix misspelled words in comments (#49366) · e2b2f7d0
由 WangZhen 提交于 12月 28, 2022

e2b2f7d0

姜

rm legacy fluid part4 (#49281) · f1072973

由姜永久提交于 12月 28, 2022

* rm legacy fluid part4

* rm non_static_mode

* minor change

* modify initializer

* rm legacy for initializer

* fix dataloader test

f1072973

Fix CUDA11.8 Unittest Accuracy (#49373) · 76f43f6d

由 Huihuang Zheng 提交于 12月 28, 2022

This PR increased the delta in unit test for CUDA 11.8. The reason of this fix:
(1) It seems CUDA 11.8 has higher delta in accuracy result. Our other targets for seresnext under parallel executor have already added delta such as CPU, all reduce test cases, so we did same for GPU base case with CUDA 11.8
(2) A new executor is under developing in PaddlePaddle team, so the unit test for old executor can be relaxed.

76f43f6d

W
delete old dygraph pylayer (#49339) · 0b60b784
由 wanghuancoder 提交于 12月 28, 2022
```
* delete old dygraph pylayer
```
0b60b784

27 12月, 2022 17 次提交
- Y
  
  update jetson ampere sm (#49363) · 941811b2
  由 Yuanle Liu 提交于 12月 27, 2022
  
  941811b2
- fux bug of UT test_version (#49349) · 8a4e67a1
  由 zhouweiwei2014 提交于 12月 27, 2022
  
  8a4e67a1
- J
  fix CINN should add float16.h may install bug (#49324) · 0531a48b
  由 jiangcheng 提交于 12月 27, 2022
```
* fix CINN should add float16.h may install bug

* reupdate setuppy support float16

* add only if float16.h file exists
```
  0531a48b
- Z
  
  add unbind op for xpu (#49356) · 16931039
  由 zhangyikun02 提交于 12月 27, 2022
  
  16931039
- R
  fix run_setup problem (#49358) · 746a4ddb
  由 risemeup1 提交于 12月 27, 2022
```
* fix run_setup problem

* test
```
  746a4ddb
- X
  fix fold for large bs (#49337) · 9dde26f6
  由 xiaoting 提交于 12月 27, 2022
```
* fix fold for large bs

* fix fold for large bs
```
  9dde26f6
- X
  Revert "make bilinear interpolate stable. (#48644)" (#49307) · 17ec1620
  由 xiongkun 提交于 12月 27, 2022
```
This reverts commit e1e8bf72.
```
  17ec1620
- Z
  [AutoParallel] fix input order (#49329) · a9533953
  由 zhaoyingli 提交于 12月 27, 2022
```
* fix input order

* add unittest

* update cmakelist
```
  a9533953
- L
  
  fit for ninja generator (#49303) · 4634f0ff
  由 Leo Chen 提交于 12月 27, 2022
  
  4634f0ff
- Z
  [AutoParallel] quantization pass support export (#48072) · 27ce06aa
  由 zhaoyingli 提交于 12月 27, 2022
```
* [AutoParallel] quantization pass support export

* support subgraph

* move_presist_var_to_global_block

* update unittest

* fix ci-coverage

* fix codestyle

* fix fake_dequantize_op

* remove unused var

* fix ci error and aprroval error

* add unittest for fp16 in test_dequant_linear

* replace mutable data

* fix unittest in non-cuda-core

* fix unittest
Co-authored-by: Ncarryyu <569782149@qq.com>
Co-authored-by: Nwufeisheng <wfs1997@163.com>
```
  27ce06aa
- W
  
  delete old dygraph pylayer recompute (#49338) · 522c2bc0
  由 wanghuancoder 提交于 12月 27, 2022
  
  522c2bc0
- W
  delete old dygraph sharding (#49334) · 2bbdc47a
  由 wanghuancoder 提交于 12月 27, 2022
```
* delete old dygraph sharding
```
  2bbdc47a
- Z
  [new executor]Support CINN use InterpreterCore (#48911) · 2ca3d3f7
  由 zhangbo9674 提交于 12月 27, 2022
```
* cinn use interpretercore

* fix bug

* fix compile bug

* fix scope bug

* refine code

* refine code by comment

* refine code by comment
```
  2ca3d3f7
- R
  Support priority scheduling for standalone executor (#49275) · 0839bba3
  由 Ruibiao Chen 提交于 12月 27, 2022
```
* Support priority scheduling for standalone executor

* Add CPU test
```
  0839bba3
- 姜
  
  rm _in_legacy part3 (#49264) · 0a837cb2
  由姜永久提交于 12月 27, 2022
  
  0a837cb2
- 姜
  rm in_legacy_dygraph python/paddle/nn/functional/ part1 (#49258) · 140d786d
  由姜永久提交于 12月 27, 2022
```
* rm in_legacy_dygraph nn part1

* rm non_static_mode

* modify rrelu
```
  140d786d
- W
  delete legacy dygraph code in python/paddle/tensor (#49286) · 861fef52
  由 wanghuancoder 提交于 12月 27, 2022
```
* delete _in_legacy_dygraph
```
  861fef52
26 12月, 2022 8 次提交

Z

add float16 to index_sample eng doc (#49317) · ea741aff
由 zmxdream 提交于 12月 26, 2022

ea741aff

Add collective communication APIs to improve completeness (#49252) · dec67d6d

由 Wen Sun 提交于 12月 26, 2022

* feat: broadcast_object_list & scatter_object_list

* chore: update ut conf

* get_backend & is_available

* docs: update requirements

* fix: resolve conflicts
Co-authored-by: NLiYuRio <liyuruijx@163.com>

dec67d6d

姜
rm legacy unittest part5 (#49282) · a72a0da0
由姜永久提交于 12月 26, 2022
```
* rm legacy unittest part5

* add custom op
```
a72a0da0

fix dlrm qpsproblem (#49171) · c8f76337

由 ykkk2333 提交于 12月 26, 2022

* migrate shaple sgd, split,sign xpu kernels to phi, test=kunlun

* fix dlrm throughput problem, test=kunlun

c8f76337

[fluid clean]replace fliud.io.load_inference_model from util_factory.py (#49156) · 3f896dce

由 wangxiaoning 提交于 12月 26, 2022

* add index sample fp16 support

* remove fluid APIs in distributed_strategy.py and role_maker.py

* Revert "remove fluid APIs in distributed_strategy.py and role_maker.py"

This reverts commit 223bbee990d3bf69e252fc3c0f19e3873550a264.

* move load_inference_model to distributed

* fix origin develop codes diff

* move _endpoints_replacement

* delete line

* reset line

* add unittest case of load_inference_model

* fix unittest

* fix unittest

* fix coverage

* fix coverage

3f896dce

R

Revert params in paddle.nn.SpectralNorm and paddle.nnFlatten.forward (#49311) · 945f777f
由 Roc 提交于 12月 26, 2022

945f777f
R
[0d Tensor] update scatter for zero-dimension tensor (#49279) · 73aa98cf
由 Roc 提交于 12月 26, 2022
```
* revert concat and change concat to stack

* let stack kernel support int8, uint8 and bool type
```
73aa98cf

[Auto Parallel] Merge the python and c++ impls of ProcessMesh (#47503) · 1c0afa79

由 Yulong Ao 提交于 12月 26, 2022

* [Auto Parallel] Rename methods of ProcessMesh

* [Auto Parallel] Impl the python process_mesh by the c++ one

* [Auto Parallel] Add some minor modifications

* [Auto Parallel] Rename some methods

* [Auto Parallel] Remove unnecessary codes

* [Auto Parallel] Add back some removed files

* [Auto Parallel] Fix bugs

* [Auto Parallel] Fix a bug

* Update process_mesh.cc

* [Auto Parallel] Fix a bug

1c0afa79

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功