提交 · 922d2481e05c1fbc047b723cd763f2d17f11cf95 · PaddlePaddle / Paddle

17 7月, 2023 9 次提交
- B
  [Inference][trt] fix trt ci and timeout (#55441) · 8fe94d44
  由 bukejiyu 提交于 7月 17, 2023
```
* unary bitwise_not adapter tensorRT8.6 in Paddle-TensorRT
* Update test_trt_inspector.py
* test_trt_convert_conv2d_transpose
* Update test_trt_convert_conv2d_transpose.py
```
  8fe94d44
- I
  [Paddle-TRT] Support conv2d op enter into trt when filter is not a persistable tensor (#55246) · 74206917
  由 iamsonderr 提交于 7月 17, 2023
```
* support_conv2d

* remove comment

* check code style

* add former Test

* check code style

* add unittest

* fix log

* change unittest

---------
Co-authored-by: zhoutianzi666 <17801055074@163.com>
```
  74206917
- Z
  Support more dtype for any/all API. (#55253) · 7b19efe4
  由 zxcd 提交于 7月 17, 2023
```
* add more data type for all/any.

* remove xpu fix.

* add test unit.

* fix typename name.

* fix output data type.
```
  7b19efe4
- M
  [Paddle-TRT] add assign op (#55426) · d778737e
  由 ming1753 提交于 7月 17, 2023
```
* [Paddle-TRT] add assign op
```
  d778737e
- W
  
  [IR] finetune the StrAttribute interface. (#55439) · 896d7cfa
  由 winter-wang 提交于 7月 17, 2023
  
  896d7cfa
- H
  
  [0D-Tensor] CINN supports cast and relu6 (#55442) · 5464b5c4
  由 HongyuJia 提交于 7月 17, 2023
  
  5464b5c4
- H
  
  [0D-Tensor] CINN supports reverse, fix infershape (#55337) · a771c1e1
  由 HongyuJia 提交于 7月 17, 2023
  
  a771c1e1
- H
  
  [0D-Tensor] CINN supports unsqueeze, delete hack in Paddle's pass (#55336) · f736f151
  由 HongyuJia 提交于 7月 17, 2023
  
  f736f151
- K
  
  add whitelist unittest for ir op test (#55433) · 14551c85
  由 kangguangli 提交于 7月 17, 2023
  
  14551c85
14 7月, 2023 6 次提交
- Z
  [IR] Refine BuildScope in phi_kernel_util (#55423) · f00a06d8
  由 zhangbo9674 提交于 7月 14, 2023
```
* add code

* fix bug

* refine code

* refine code

* fix bug
```
  f00a06d8
- G
  
  Fix test_cholesky_op.py (#55331) · 5de773d1
  由 Guo Sheng 提交于 7月 14, 2023
  
  5de773d1
- H
  
  [0D-Tensor] CINN supports reshape (#55326) · f5e4a316
  由 HongyuJia 提交于 7月 14, 2023
  
  f5e4a316
- H
  
  [0D-Tensor] CINN supports transpose, add special case to expand_zero_dim_pass (#55379) · d1b74ba5
  由 HongyuJia 提交于 7月 14, 2023
  
  d1b74ba5
- Z
  [IR] Reconstruct the Instruction for NewIrInterpreter (#55239) · 69e9f03e
  由 zhangbo9674 提交于 7月 14, 2023
```
* add inplace interface

* support inplace

* refine code

* fix bug

* fix bug

* refien code

* add file

* add interface

* refine code

* refine code

* add phi kernel instruction

* refine code

* add test

* delete unuse code

* add test

* add test

* add deps

* delete unused code

* fix bug

* fix bug
```
  69e9f03e
- T
  Update CUDNN Frontend API to v0.9.1 (#54949) · 76b77d81
  由 Tian Zheng 提交于 7月 14, 2023
```
* Update CUDNN Frontend API to v0.9.1
- Remove old patches
- Remove workarounds that are no longer needed

* Fix test_switch_autotune
```
  76b77d81
13 7月, 2023 11 次提交
- F
  [inference] Add FusedBiasActKernel (#55301) · 0a4d1999
  由 freeliuzc 提交于 7月 13, 2023
```
* add init value for CudaSwishFunctor

* add new phi kernel fusedBiasActKernel
```
  0a4d1999
- Y
  
  fix the bug of test_fused_multi_transformer_op on cuda12 (#55296) · d12837d3
  由 Yichen Zhang 提交于 7月 13, 2023
  
  d12837d3
- C
  【AMP Prim OP】support instance_norm prim ops for fp16 and bf16 dtype (#55368) · 65950324
  由 Charles-hit 提交于 7月 13, 2023
```
* [prim]support fp16 for instance_norm and instance_norm_grad

* support fp16 and bfp16 dtype for instance_norm prim rules

* fix new ir test

---------
Co-authored-by: Ncxxly <chenxx_id@163.com>
```
  65950324
- add phi operator c_concat and ut (#55320) · 788be26d
  由 lil-Xing 提交于 7月 13, 2023
```
* add phi operator c_concat and ut

* update create_var use

* update copyright
```
  788be26d
- H
  [NewIR]new ir support builtin slice op (#55381) · 4b6d2f5f
  由 hong 提交于 7月 13, 2023
```
* new ir support builtin slice op

* fix phi kernel adaptor bug
```
  4b6d2f5f
- Z
  Move compare_raw_kernel to legacy (#53928) · 1dd8770a
  由 zhangyuqin1998 提交于 7月 13, 2023
```
* Move compare_raw_kernel to legacy

* fix

* Update compare_kernel.cc

* Move compare_raw_kernel to legacy
```
  1dd8770a
- L
  Integrate QAT into distributed optimizer (#54241) · aaf021c9
  由 Leo Chen 提交于 7月 13, 2023
```
* Support AMP program for onnx QAT API

* Integrate QAT into distributed optimizer

* Reduce the size of test data and increase time limit

* Use logger and reduce time limit of unittests

* Rename and move unittest into fleet test

* Test qat_init API
```
  aaf021c9
- H
  
  [0D-Tensor] Support matmul, fix infershape (#55316) · ce8455c0
  由 HongyuJia 提交于 7月 13, 2023
  
  ce8455c0
- R
  Add matmul_int8 op (#55228) · 27cc0df5
  由 RichardWooSJTU 提交于 7月 13, 2023
```
* add matmul int8
```
  27cc0df5
- H
  [NewIR]fix new ir edit distance bug (#55294) · 2194e4c1
  由 hong 提交于 7月 13, 2023
```
* fix edit distance bug

* add op define kernel data type

* fix bug

* update

* add header

* add op test to cmake
```
  2194e4c1
- Q
  Modify bf16 and fix the elementwise_max (#54799) · 6f7ceca0
  由 Qi Shao 提交于 7月 13, 2023
```
* modify the accuracy checking framework of bf16 optest, including both of forward and backward
```
  6f7ceca0
12 7月, 2023 8 次提交

H

[0D-Tensor] CINN supports broadcast_to, fix infershape (#55321) · 276c159d
由 HongyuJia 提交于 7月 12, 2023

276c159d

[Semi Auto] Softmax SPMD Rule (#55196) · 885d1aec

由 JZ-LIANG 提交于 7月 12, 2023

* resolute input sharding conflict maybe

* fixed comment

---------
Co-authored-by: NYichen Zhang <zhangyichen03@baidu.com>
Co-authored-by: Nzhiqiu <chenqiuliang@baidu.com>

885d1aec

H

[0D-Tensor] CINN supports squeeze, fix infershape and GetPositiveAxes (#55333) · bb0df468
由 HongyuJia 提交于 7月 12, 2023

bb0df468
Y
[Inference] rewrite identity_op_clean_pass (#55240) · 2363e623
由 Yuanle Liu 提交于 7月 12, 2023
```
* rewrite identity_op_clean_pass

* fix

* adjust identity_op_clean_pass order in gpu passes

* fix ut
```
2363e623
W

[bug fix] gpups ci (#55314) · 766fcdf0
由 wangzhen38 提交于 7月 12, 2023

766fcdf0

Support selected rows new ir (#54987) · fc66b5d7

由 hong 提交于 7月 12, 2023

* refine program translator

* fix warning: not override

* fix bug

* merge new modifications

* modify by reviews

* resolve conflicts

* resolve conflicts

* fix

* fix

* update

* support selected rows

* update

* add selectrows

* fix bug

* add ut

* refine code

* refien code

* update

* update

* support selected rows

* support selected rows

* support dense tensor

* remove useless code

* polish code

* remote standalone executor test

---------
Co-authored-by: Nkangguangli <kangguangli@hotmail.com>
Co-authored-by: Nzhangbo9674 <zhangbo54@baidu.com>

fc66b5d7

[ONEDNN] Upgrade oneDNN version to v3.1 (#52463) · cfa513f7

由 YangQun 提交于 7月 12, 2023

* squash pick the poc code
* fix build after rebase
* fix int8 conv and fc uts
* Fix and clean-up Get_SRC_Scale_Memory
* fix floating point fc uts
* fix test_analyzer_int8_googlenet
* test_analyzer_int8_mobilenetv1
* fix int8 mobilenet v2 and v3
* fix build error after rebase
* [oneDNN] rename library version
* fix conv bias datatype
* try to fix import error
* fix rebase error
* [oneDNN] pack library into python wheel
* add MKLDNN_SHARED_LIB_3 to env_dict
* fix test_analyzer_bert
* fix fill_constant op kernel
* fix ernie and matmul op ut
* fix softplus ut
* fix conv+relu6 fusion ut
* fix hardswish fusion
* fix quant+transpose fusion ut
* fixsgd ut
* fix int8 matmul with flatten
* fix fc+scale fusion
* fix conv/matmul+gelu fusion uts
* fix rebase error
* Revert "fix conv/matmul+gelu fusion uts"
This reverts commit 47eb5e49972bd8f7271a233def9bfb3e98ce78e1.
* upgrade to onednn v3.1
* remove older version onednn
* use densetensor::data() for achieving mean and var in layernorm impl
* comments for atol of integer tests
* fix clang-format
* Revert "remove older version onednn"
This reverts commit 783e57ddfd4401254596eae7d47adb9b03590c09.
* improve binary handle
* fix expand kernel
* Revert "use densetensor::data() for achieving mean and var in layernorm impl"
* always use forward_inference for conv
* remove activation scales
* rollback changes to mkldnn.cmake
* address comments
* port changes to dequantize kernel
* fix merge error
* fix fused_elementwise_kernel
* upgrade onednn version to v3.1.1
* fix some approval error
* fix error msg format
* remove old onednn libs
* try to fix symbolic link issue
* fix cinn test case segfault
* do not explicit link test with onednn
* remove unnecessary changes
* integrate CINN with onednn v3
* link with mkldnn project
* fix cinn build file

---------
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
Co-authored-by: NChen, Xinyu1 <xinyu1.chen@intel.com>
Co-authored-by: Ntianshuo78520a <707759223@qq.com>

cfa513f7

[clang-tidy] enable `readability-container-size-empty` check (#55279) · be3a6fa7

由 Wang Xin 提交于 7月 12, 2023

* [clang-tidy] enable readability-container-size-empty check

* fix test_custom_kernel Failed

* add clang-tid-10 in dockerfile

* add clang-tidy in dockerfile

* fix bug

be3a6fa7

11 7月, 2023 6 次提交

support sharding parallel (#54634) · b7a05057

由 pangengzheng 提交于 7月 11, 2023

* support sharding parallel

* fix name

* fix

* update

* test amp for sharding

---------

Co-authored-by: pangengzheng <pangengzheng.baidu.com>

b7a05057

replace the AdagradOptimizer... · 94365855

由 LoneRanger 提交于 7月 11, 2023

replace the AdagradOptimizer 、adamaxOptimizer、AdadeltaOptimizer、RMSPropOptimizer、LambOptimizer and Momentum (#54152)

* replace the AdadeltaOptimizer with Adadelta

* replace the RMSPropOptimizer with RMSProp

* replace the LambOptimizer with lamb

* replace the momentum in contrib/optimizer.py with Momentum in python/paddle/optimizer/momentum.py

* fix bug

* fix bug

* fix bug

* fix bug of Lamp

* fix bug of Lamp

* fix bug of import

* replace the AdamaxOptimizer with Admax and change the optimizer base for AdagradOptimizer

* fix bug

* fix bug

* Update optimizer.py

* fix bug

* fix bug

94365855

Integrate rmsnorm kernel (#54998) · 97d3d6ee

由 MarDino 提交于 7月 11, 2023

* add rmsnorm kernel
* add static graph test
* fix round type
* use alignas to avoid msvc compile error
* remove redundant headerfile to avoid rocm compile error
* fix rocm compile not found cub
* Add document

97d3d6ee

张

[CodeStyle][CINN] ruff F403 in test/cinn (#55255) · f4bdfa60
由张春乔提交于 7月 11, 2023

f4bdfa60
Z
[IR] Add op compat info for grad op (#55277) · b4d7e1e0
由 zhangbo9674 提交于 7月 11, 2023
```
* fix bug

* fix bug

* fix bug
```
b4d7e1e0
H

[0D-Tensor] Support isclose and polish codes (#55292) · 036c0ae1
由 HongyuJia 提交于 7月 11, 2023

036c0ae1

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功