提交 · c0c18cf963b82ff5d0b61334585ac9db72ccc1ec · PaddlePaddle / Paddle

03 1月, 2023 10 次提交
- G
  
  fix unsqueeze2+conv2d quantization (#49164) · c0c18cf9
  由 Guanghua Yu 提交于 1月 03, 2023
  
  c0c18cf9
- Z
  [Paddle Inference] Implement conv2d_fusion NHWC format using cutlass (#47989) · c123dd1e
  由 zhoutianzi666 提交于 1月 03, 2023
```
* Implement conv2d_fusion NHWC format using CUTLASS
* Add unit testing for CUTLASS Conv in inference
* Add experimental API for CUTLASS.
```
  c123dd1e
- A
  [OpAttr]Fix Ignore AttriteTensor in IndicateDataType bug in grad_op (#49472) · 5ac96468
  由 Aurelius84 提交于 1月 03, 2023
```
* [OpAttr]Fix Ignore AttriteTensor in IndicateDataType bug in grad_op

* add GetExpectedKernelType
```
  5ac96468
- Y
  Use BroadcastKernel and ReduceKernel to optimize expand and expand_grad. (#49419) · c4604025
  由 Yiqun Liu 提交于 1月 03, 2023
```
* Use BroadcastKernel and ReduceKernel to optimize expand and expand_grad.

* Correct the axis when there is only 1 input in BroadcastKernel.

* Add the calculate of output's shape.
```
  c4604025
- Z
  [Zero-Dim] reshape/reshape_/reverse 0D support (#49357) · 347d2123
  由 zhaoyingli 提交于 1月 03, 2023
```
* [Zero-Dim] reshape/reshape_/reverse 0D support

* rm comment

* change paddle.to_tensor to paddle.full

* fix docs

* update paddle.full
```
  347d2123
- Z
  
  forbid ops who have 1D intermediate tensor entering Paddle-TRT (#49378) · 021085e3
  由 zhoutianzi666 提交于 1月 03, 2023
  
  021085e3
- L
  
  fix nvcc_lazy error on ninja build (#49448) · 121eaea7
  由 Leo Chen 提交于 1月 03, 2023
  
  121eaea7
- 骑
  
  [FluidAPI]remove clip api (#48946) · fe0dc40d
  由骑马小猫提交于 1月 03, 2023
  
  fe0dc40d
- S
  
  Add not_equal trt converter (#49393) · 822ea0f9
  由 Sanbu 提交于 1月 03, 2023
  
  822ea0f9
- J
  [Auto Parallel] Add All Relu Flops (#48083) · c5137b22
  由 Jianghai 提交于 1月 03, 2023
```
* relu flops all

* add annotations and tests

* revision for codestyle
```
  c5137b22
02 1月, 2023 1 次提交
- H
  
  Scale Matmul Fuse pass rewritten (#49105) · 18c0a002
  由 Hulek 提交于 1月 02, 2023
  
  18c0a002
01 1月, 2023 1 次提交
- G
  
  memorty_optimize remove inplace op (#49431) · aa96ddc3
  由 gem5 提交于 1月 01, 2023
  
  aa96ddc3
31 12月, 2022 1 次提交
- C
  
  support flip 0D (#49460) · cb22a5c7
  由 caozhou 提交于 12月 31, 2022
  
  cb22a5c7
30 12月, 2022 18 次提交

Z

speedup lcov (#49476) · 4458a1e5
由 zhangbo9674 提交于 12月 30, 2022

4458a1e5
X
[ bugfix ] fix bugs in Indexable and support LayerDict (#49409) · 291cf821
由 xiongkun 提交于 12月 30, 2022
```
* bugfix: fix bugs in Indexable and support LayerDict

* fix bugs.
```
291cf821
W
check weight shape of conv1d_transpose (#49417) · 5c4adfae
由 wangxinxin08 提交于 12月 30, 2022
```
* check weight shape of conv1d_transpose

* add unittest case
```
5c4adfae
Z
[CI-Precision] Optimize precision test logic (#49441) · 3e8cec85
由 zhangbo9674 提交于 12月 30, 2022
```
* speedup getFNDAFile

* add fnda_base for c++ ut cc file

* fix bug

* fix bug

* fix bug

* fix bug
```
3e8cec85

[Custom device] Add custom_cpu testcase of custom_relu (#49300) · 69c7edcf

由 HongyuJia 提交于 12月 30, 2022

* add custom_cpu testcase

* update test_custom_device_setup

* update path to custom_runtime

* fix cmd wait

* test Linux only

* setup once

* integrate to one run_cmd

* add pip install

* change timeout

* add debug string

* add debug string

* add debug string

* use os.system and change module name

* add runtime

* add more debug message

* continue debug

* timestamp

* fix testcase import bug

* remove error message

* set TIMEOUT property

69c7edcf

Z
Fix test_conv_bn_fuse_pass_cc on Windows System (#49446) · a4b4343f
由 zyfncg 提交于 12月 30, 2022
```
* fix test_conv_bn_fuse_pass_cc

* remove comment
```
a4b4343f
Z
[inference][trt] update Convolution to ConvolutionNd (#47653) · 6e5917e4
由 Zhang Jun 提交于 12月 30, 2022
```
* update conv to convNd

* trigger ci
```
6e5917e4
L

revert phi_static (#49433) · 802c5797
由 Leo Chen 提交于 12月 30, 2022

802c5797

delete batch_norm (#49396) · 0111d012

由 risemeup1 提交于 12月 30, 2022

* delete batch_norm

* test

* test

* test

* test

* test

* recover cmake_gen

* debug

0111d012

R

unit test of reduce with zero dim (#49436) · b2f41825
由 Roc 提交于 12月 30, 2022

b2f41825

Support static graph code-gen for squeeze and unsqueeze op (#49430) · 23c1ac2c

由 zyfncg 提交于 12月 30, 2022

* support static graph code-gen for squeeze op

* generate static graph code of unsqueeze

* refine op name

* add extra output in op_compat

* remove debug log

23c1ac2c

H

fix possible bug (#49367) · 18f0ab86
由 HongyuJia 提交于 12月 30, 2022

18f0ab86

[Custom Extension] Polish xpu testcase (#49158) · 9f5afa62

由 HongyuJia 提交于 12月 30, 2022

* clean custom_xpu testcase test_static_pe

* use assert_allclose to solve precision error

* adjust precision

* flatten tensor

* fix flatten

9f5afa62

Z

[clean fluid api] Move fluid/contrib/slim and remove fluid api. (#48717) · 72973d5a
由 zhouzj 提交于 12月 30, 2022

72973d5a

在文档中统一静态图模式与动态图模式的英文翻译 (#49170) · a186e60d

由 Sanbu 提交于 12月 30, 2022

* 1219

* temporarily change the num_diff_files limit, test=document_fix

* Revert "temporarily change the num_diff_files limit, test=document_fix"

This reverts commit 8e70f00ef468d2dad0e38b3da06295ed62990d20.

* for codestyle

* remove duplicate license

* `static mode` -> `static graph mode`

* Update hybrid_parallel_inference.py

* Update layer_function_generator.py

* Update manipulation.py

* reset
Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
Co-authored-by: NSigureMo <sigure.qaq@gmail.com>

a186e60d

R
fix_mac_build_problem (#49435) · 162f8fe2
由 risemeup1 提交于 12月 30, 2022
```
* fix_mac_build_problem

* fix_mac_build_problem

* fix_mac_build_problem
```
162f8fe2
W
Fix default GetExpectedKernelType for ops supported tensor attrs (#49414) · 8a859554
由 WangZhen 提交于 12月 30, 2022
```
* Fix default GetExpectedKernelType for ops supported tensor attrs
```
8a859554
姜
Yj/rm legacy part 0 (#49424) · 3ffcd693
由姜永久提交于 12月 30, 2022
```
* rm legacy

* clear in_legacy

* fix tracer
```
3ffcd693

29 12月, 2022 9 次提交
- R
  
  fix_bug (#49390) · 839e1499
  由 risemeup1 提交于 12月 29, 2022
  
  839e1499
- R
  fix_static_problem (#49439) · 3481ff55
  由 risemeup1 提交于 12月 29, 2022
```
* fix_static_problem

* test

* fix_static_problem,test=document_fix
```
  3481ff55
- W
  [fluid remove] rawconv (#49395) · 9e6007f0
  由 wangzhen38 提交于 12月 29, 2022
```
* [fluid remove] rawconv
```
  9e6007f0
- A
  [D2SCinn]Support deliver skip_gc_vars into Graph (#49411) · ffa32e44
  由 Aurelius84 提交于 12月 29, 2022
```
* [D2SCinn]Support deliver skip_gc_vars into Graph

* fix unittest

* fix copy
```
  ffa32e44
- L
  
  Add scale and floor_divide ut cases (#49418) · a30e3602
  由 Lin Manhui 提交于 12月 29, 2022
  
  a30e3602
- Y
  
  xpu kernels support api int64 vector inputs, test=kunlun (#49336) · 3c2420a3
  由 ykkk2333 提交于 12月 29, 2022
  
  3c2420a3
- X
  auto parallel bf16 (#49079) · 418edae5
  由 xu98bin 提交于 12月 29, 2022
```
* auto parallel bf16
```
  418edae5
- Z
  [pglbox2.0]fix load into memory (#49389) · 1078e064
  由 zmxdream 提交于 12月 29, 2022
```
* fix load into memory

* fix load into memory

* fix code style
```
  1078e064
- 姜
  rm legacy dygraph part7 (#49285) · df3f74df
  由姜永久提交于 12月 29, 2022
```
* rm legacy dygraph part7

* rm non_static_mode

* modify

* modify

* add static test

* set static for lstm_cudnn test

* reset tracer

* reset varbase

* fix
```
  df3f74df

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功