提交 · 822ea0f9cdf3c6e9c9e1a48262166e996129cc85 · BaiXuePrincess / Paddle

03 1月, 2023 2 次提交
- S
  
  Add not_equal trt converter (#49393) · 822ea0f9
  由 Sanbu 提交于 1月 03, 2023
  
  822ea0f9
- J
  [Auto Parallel] Add All Relu Flops (#48083) · c5137b22
  由 Jianghai 提交于 1月 03, 2023
```
* relu flops all

* add annotations and tests

* revision for codestyle
```
  c5137b22
02 1月, 2023 1 次提交
- H
  
  Scale Matmul Fuse pass rewritten (#49105) · 18c0a002
  由 Hulek 提交于 1月 02, 2023
  
  18c0a002
31 12月, 2022 1 次提交
- C
  
  support flip 0D (#49460) · cb22a5c7
  由 caozhou 提交于 12月 31, 2022
  
  cb22a5c7
30 12月, 2022 10 次提交

X
[ bugfix ] fix bugs in Indexable and support LayerDict (#49409) · 291cf821
由 xiongkun 提交于 12月 30, 2022
```
* bugfix: fix bugs in Indexable and support LayerDict

* fix bugs.
```
291cf821
W
check weight shape of conv1d_transpose (#49417) · 5c4adfae
由 wangxinxin08 提交于 12月 30, 2022
```
* check weight shape of conv1d_transpose

* add unittest case
```
5c4adfae

[Custom device] Add custom_cpu testcase of custom_relu (#49300) · 69c7edcf

由 HongyuJia 提交于 12月 30, 2022

* add custom_cpu testcase

* update test_custom_device_setup

* update path to custom_runtime

* fix cmd wait

* test Linux only

* setup once

* integrate to one run_cmd

* add pip install

* change timeout

* add debug string

* add debug string

* add debug string

* use os.system and change module name

* add runtime

* add more debug message

* continue debug

* timestamp

* fix testcase import bug

* remove error message

* set TIMEOUT property

69c7edcf

delete batch_norm (#49396) · 0111d012

由 risemeup1 提交于 12月 30, 2022

* delete batch_norm

* test

* test

* test

* test

* test

* recover cmake_gen

* debug

0111d012

R

unit test of reduce with zero dim (#49436) · b2f41825
由 Roc 提交于 12月 30, 2022

b2f41825

[Custom Extension] Polish xpu testcase (#49158) · 9f5afa62

由 HongyuJia 提交于 12月 30, 2022

* clean custom_xpu testcase test_static_pe

* use assert_allclose to solve precision error

* adjust precision

* flatten tensor

* fix flatten

9f5afa62

Z

[clean fluid api] Move fluid/contrib/slim and remove fluid api. (#48717) · 72973d5a
由 zhouzj 提交于 12月 30, 2022

72973d5a

在文档中统一静态图模式与动态图模式的英文翻译 (#49170) · a186e60d

由 Sanbu 提交于 12月 30, 2022

* 1219

* temporarily change the num_diff_files limit, test=document_fix

* Revert "temporarily change the num_diff_files limit, test=document_fix"

This reverts commit 8e70f00ef468d2dad0e38b3da06295ed62990d20.

* for codestyle

* remove duplicate license

* `static mode` -> `static graph mode`

* Update hybrid_parallel_inference.py

* Update layer_function_generator.py

* Update manipulation.py

* reset
Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
Co-authored-by: NSigureMo <sigure.qaq@gmail.com>

a186e60d

W
Fix default GetExpectedKernelType for ops supported tensor attrs (#49414) · 8a859554
由 WangZhen 提交于 12月 30, 2022
```
* Fix default GetExpectedKernelType for ops supported tensor attrs
```
8a859554
姜
Yj/rm legacy part 0 (#49424) · 3ffcd693
由姜永久提交于 12月 30, 2022
```
* rm legacy

* clear in_legacy

* fix tracer
```
3ffcd693

29 12月, 2022 7 次提交
- W
  [fluid remove] rawconv (#49395) · 9e6007f0
  由 wangzhen38 提交于 12月 29, 2022
```
* [fluid remove] rawconv
```
  9e6007f0
- A
  [D2SCinn]Support deliver skip_gc_vars into Graph (#49411) · ffa32e44
  由 Aurelius84 提交于 12月 29, 2022
```
* [D2SCinn]Support deliver skip_gc_vars into Graph

* fix unittest

* fix copy
```
  ffa32e44
- L
  
  Add scale and floor_divide ut cases (#49418) · a30e3602
  由 Lin Manhui 提交于 12月 29, 2022
  
  a30e3602
- X
  auto parallel bf16 (#49079) · 418edae5
  由 xu98bin 提交于 12月 29, 2022
```
* auto parallel bf16
```
  418edae5
- 姜
  rm legacy dygraph part7 (#49285) · df3f74df
  由姜永久提交于 12月 29, 2022
```
* rm legacy dygraph part7

* rm non_static_mode

* modify

* modify

* add static test

* set static for lstm_cudnn test

* reset tracer

* reset varbase

* fix
```
  df3f74df
- W
  fused_attention_op paratmers stop grad support (#49351) · 0bb999b6
  由 Wang Bojun 提交于 12月 29, 2022
```
* fusedAttenGrad_noGrad

* code style fix

* add ut

* remove unnecessary log
```
  0bb999b6
- 姜
  rm in_legacy part8 (#49386) · 1c7ae954
  由姜永久提交于 12月 29, 2022
```
* rm legacy layers part6

* rm non_static_mode

* modify non_static

* minor change

* rm loss

* rm in_legacy part8

* minor change
```
  1c7ae954
28 12月, 2022 12 次提交

R

skip this ut when cuda < 11.2 && cuda_arch < 8 (#49313) · 0c52e8a8
由 RichardWooSJTU 提交于 12月 28, 2022

0c52e8a8

姜

rm legacy nn part2 (#49259) · 69e51c77

由姜永久提交于 12月 28, 2022

* rm legacy nn part2

* rm _non_static_mode

* modify

* modify unpool test

* modify unpool test

* modify loss

* keep legacy for layer_norm

69e51c77

remove fluid.contrib.fused_elemwise_activation, sequence_topk_avg_pooling,... · da357615

由 zqw_1997 提交于 12月 28, 2022

remove fluid.contrib.fused_elemwise_activation, sequence_topk_avg_pooling, var_conv_2d, match_matrix_tensor and tree_conv (#49331)

da357615

[new-exec] Ahead-Of-Time choosing kernel (#48789) · 63d2d722

由 Leo Chen 提交于 12月 28, 2022

* add skip run

* alloc minimum memory

* skip check_size in Alloc

* skip check_size in Alloc

* skip check_size in Alloc

* fix cases when tensor is initialized or empty

* alloc empty output for place info

* add test

* increase timeout

* format code

* skip cpu

* add cudnn_deterministic

* fit for hostAlloc

* follow comments

* change check_size to fake_alloc

63d2d722

generate the static graph code of some ops (#49212) · 1804f834

由 HappyHeavyRain 提交于 12月 28, 2022

* generate the static op of some ops

* add the VERSION of pixel_shuffle

* change the API doc of isclose

* change the API doc of isclose

* fix the isclose op comment

1804f834

[ 0d-Tensor ] einsum support 0d tensor. (#49177) · 71bde066

由 xiongkun 提交于 12月 28, 2022

* einsum support 0d tensor.
1. support 0d tensor in multi-operands.
2. add 9 unittests for einsum 0d tensor.

* override NVIDIA_TF32_OVERRIDE to avoid accuracy problem in 11.2 and 11.8

71bde066

[CodeStyle][py36] update pypi doc (#48640) · a221158f

由 Matsumoto Ruko 提交于 12月 28, 2022

* update pypi doc

* update pypi doc

* update pypi doc

* empty commit, re-trigger all ci
Co-authored-by: NSigureMo <sigure.qaq@gmail.com>

a221158f

[AutoParallel] adapt for clip (#49249) · df944772

由 zhaoyingli 提交于 12月 28, 2022

* [AutoParallel] adapt for clip

* fix unittest

* enable_static

* fix dist_fill_constant_batch_size_like

* fix process_mesh.shape

* update cond of modifying shape_list

df944772

Z

[AutoParallel] fix process_mesh's member (#49371) · 836a662c
由 zhaoyingli 提交于 12月 28, 2022

836a662c

姜

rm legacy fluid part4 (#49281) · f1072973

由姜永久提交于 12月 28, 2022

* rm legacy fluid part4

* rm non_static_mode

* minor change

* modify initializer

* rm legacy for initializer

* fix dataloader test

f1072973

Fix CUDA11.8 Unittest Accuracy (#49373) · 76f43f6d

由 Huihuang Zheng 提交于 12月 28, 2022

This PR increased the delta in unit test for CUDA 11.8. The reason of this fix:
(1) It seems CUDA 11.8 has higher delta in accuracy result. Our other targets for seresnext under parallel executor have already added delta such as CPU, all reduce test cases, so we did same for GPU base case with CUDA 11.8
(2) A new executor is under developing in PaddlePaddle team, so the unit test for old executor can be relaxed.

76f43f6d

W
delete old dygraph pylayer (#49339) · 0b60b784
由 wanghuancoder 提交于 12月 28, 2022
```
* delete old dygraph pylayer
```
0b60b784

27 12月, 2022 7 次提交
- fux bug of UT test_version (#49349) · 8a4e67a1
  由 zhouweiwei2014 提交于 12月 27, 2022
  
  8a4e67a1
- Z
  
  add unbind op for xpu (#49356) · 16931039
  由 zhangyikun02 提交于 12月 27, 2022
  
  16931039
- X
  fix fold for large bs (#49337) · 9dde26f6
  由 xiaoting 提交于 12月 27, 2022
```
* fix fold for large bs

* fix fold for large bs
```
  9dde26f6
- Z
  [AutoParallel] fix input order (#49329) · a9533953
  由 zhaoyingli 提交于 12月 27, 2022
```
* fix input order

* add unittest

* update cmakelist
```
  a9533953
- Z
  [AutoParallel] quantization pass support export (#48072) · 27ce06aa
  由 zhaoyingli 提交于 12月 27, 2022
```
* [AutoParallel] quantization pass support export

* support subgraph

* move_presist_var_to_global_block

* update unittest

* fix ci-coverage

* fix codestyle

* fix fake_dequantize_op

* remove unused var

* fix ci error and aprroval error

* add unittest for fp16 in test_dequant_linear

* replace mutable data

* fix unittest in non-cuda-core

* fix unittest
Co-authored-by: Ncarryyu <569782149@qq.com>
Co-authored-by: Nwufeisheng <wfs1997@163.com>
```
  27ce06aa
- W
  
  delete old dygraph pylayer recompute (#49338) · 522c2bc0
  由 wanghuancoder 提交于 12月 27, 2022
  
  522c2bc0
- W
  delete old dygraph sharding (#49334) · 2bbdc47a
  由 wanghuancoder 提交于 12月 27, 2022
```
* delete old dygraph sharding
```
  2bbdc47a

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致