提交 · 7a705727fb1b6677cd47c697ea35507808867e8a · PaddlePaddle / Paddle

12 7月, 2023 5 次提交

R
[CustomDevice] fix release error in process_group_custom (#55293) · 7a705727
由 ronnywang 提交于 7月 12, 2023
```
* [CustomDevice] fix release error for process_group_custom

* update
```
7a705727
W

[bug fix] gpups ci (#55314) · 766fcdf0
由 wangzhen38 提交于 7月 12, 2023

766fcdf0

Support selected rows new ir (#54987) · fc66b5d7

由 hong 提交于 7月 12, 2023

* refine program translator

* fix warning: not override

* fix bug

* merge new modifications

* modify by reviews

* resolve conflicts

* resolve conflicts

* fix

* fix

* update

* support selected rows

* update

* add selectrows

* fix bug

* add ut

* refine code

* refien code

* update

* update

* support selected rows

* support selected rows

* support dense tensor

* remove useless code

* polish code

* remote standalone executor test

---------
Co-authored-by: Nkangguangli <kangguangli@hotmail.com>
Co-authored-by: Nzhangbo9674 <zhangbo54@baidu.com>

fc66b5d7

[ONEDNN] Upgrade oneDNN version to v3.1 (#52463) · cfa513f7

由 YangQun 提交于 7月 12, 2023

* squash pick the poc code
* fix build after rebase
* fix int8 conv and fc uts
* Fix and clean-up Get_SRC_Scale_Memory
* fix floating point fc uts
* fix test_analyzer_int8_googlenet
* test_analyzer_int8_mobilenetv1
* fix int8 mobilenet v2 and v3
* fix build error after rebase
* [oneDNN] rename library version
* fix conv bias datatype
* try to fix import error
* fix rebase error
* [oneDNN] pack library into python wheel
* add MKLDNN_SHARED_LIB_3 to env_dict
* fix test_analyzer_bert
* fix fill_constant op kernel
* fix ernie and matmul op ut
* fix softplus ut
* fix conv+relu6 fusion ut
* fix hardswish fusion
* fix quant+transpose fusion ut
* fixsgd ut
* fix int8 matmul with flatten
* fix fc+scale fusion
* fix conv/matmul+gelu fusion uts
* fix rebase error
* Revert "fix conv/matmul+gelu fusion uts"
This reverts commit 47eb5e49972bd8f7271a233def9bfb3e98ce78e1.
* upgrade to onednn v3.1
* remove older version onednn
* use densetensor::data() for achieving mean and var in layernorm impl
* comments for atol of integer tests
* fix clang-format
* Revert "remove older version onednn"
This reverts commit 783e57ddfd4401254596eae7d47adb9b03590c09.
* improve binary handle
* fix expand kernel
* Revert "use densetensor::data() for achieving mean and var in layernorm impl"
* always use forward_inference for conv
* remove activation scales
* rollback changes to mkldnn.cmake
* address comments
* port changes to dequantize kernel
* fix merge error
* fix fused_elementwise_kernel
* upgrade onednn version to v3.1.1
* fix some approval error
* fix error msg format
* remove old onednn libs
* try to fix symbolic link issue
* fix cinn test case segfault
* do not explicit link test with onednn
* remove unnecessary changes
* integrate CINN with onednn v3
* link with mkldnn project
* fix cinn build file

---------
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
Co-authored-by: NChen, Xinyu1 <xinyu1.chen@intel.com>
Co-authored-by: Ntianshuo78520a <707759223@qq.com>

cfa513f7

[clang-tidy] enable `readability-container-size-empty` check (#55279) · be3a6fa7

由 Wang Xin 提交于 7月 12, 2023

* [clang-tidy] enable readability-container-size-empty check

* fix test_custom_kernel Failed

* add clang-tid-10 in dockerfile

* add clang-tidy in dockerfile

* fix bug

be3a6fa7

11 7月, 2023 14 次提交

support sharding parallel (#54634) · b7a05057

由 pangengzheng 提交于 7月 11, 2023

* support sharding parallel

* fix name

* fix

* update

* test amp for sharding

---------

Co-authored-by: pangengzheng <pangengzheng.baidu.com>

b7a05057

M
DOCS: Adding imformation about datatype in math.py (#55297) · ab73b8c6
由 Muhammad Ishaque Nizamani 提交于 7月 11, 2023
```
* DOCS: Adding imformation about datatype in math.py

* replaced uint16 with bfloat16.
```
ab73b8c6

Pipeline pass base (#55174) · 5434560a

由 Wennie396 提交于 7月 11, 2023

* format correction

* variable names adjustment

* variable names adjustment, name-->type, value-->sub_program

5434560a

replace the AdagradOptimizer... · 94365855

由 LoneRanger 提交于 7月 11, 2023

replace the AdagradOptimizer 、adamaxOptimizer、AdadeltaOptimizer、RMSPropOptimizer、LambOptimizer and Momentum (#54152)

* replace the AdadeltaOptimizer with Adadelta

* replace the RMSPropOptimizer with RMSProp

* replace the LambOptimizer with lamb

* replace the momentum in contrib/optimizer.py with Momentum in python/paddle/optimizer/momentum.py

* fix bug

* fix bug

* fix bug

* fix bug of Lamp

* fix bug of Lamp

* fix bug of import

* replace the AdamaxOptimizer with Admax and change the optimizer base for AdagradOptimizer

* fix bug

* fix bug

* Update optimizer.py

* fix bug

* fix bug

94365855

R

[ROCM] reduce build log (#55097) · a1396a80
由 ronnywang 提交于 7月 11, 2023

a1396a80

Integrate rmsnorm kernel (#54998) · 97d3d6ee

由 MarDino 提交于 7月 11, 2023

* add rmsnorm kernel
* add static graph test
* fix round type
* use alignas to avoid msvc compile error
* remove redundant headerfile to avoid rocm compile error
* fix rocm compile not found cub
* Add document

97d3d6ee

[NewIR] Fix new ir unsqueeze op bug (#55212) · 852d7a12

由 hong 提交于 7月 11, 2023

* suport optional input in new_ir

* polish code

* add coverate test

* update

* update

* add unitest

* remove reduplicate code

* udpate

* fix assign error

* revert test arg min max

* update

* fix bug

* polish code

* update

* fix unique and close op bug

* update

* update

* revert test code

* revert unique test

* polish code

* remove useless code

---------
Co-authored-by: Nzhangbo9674 <zhangbo54@baidu.com>

852d7a12

张

[CodeStyle][CINN] ruff F403 in test/cinn (#55255) · f4bdfa60
由张春乔提交于 7月 11, 2023

f4bdfa60
Z
[IR] Add op compat info for grad op (#55277) · b4d7e1e0
由 zhangbo9674 提交于 7月 11, 2023
```
* fix bug

* fix bug

* fix bug
```
b4d7e1e0
H

[0D-Tensor] Support isclose and polish codes (#55292) · 036c0ae1
由 HongyuJia 提交于 7月 11, 2023

036c0ae1
Linear compress (#55128) · f4290a92
由 FormlessUnit 提交于 7月 11, 2023
```
* rename weight_only/llm.int8
```
f4290a92

赛题七-开发grad_fn、next_functions两个API 并暴露到python端-v1 (#54838) · ab46b14c

由 qiuwenbo 提交于 7月 11, 2023

* [尝试] 给tensor增加一个属性, 这个属性是一个定值 1

* 暴露gradnode 并构建gradnode新的方法(用来测试)进行暴露给python python端可以访问

* 开发grad_fn、next_functions两个API 并暴露到python端- 做一些规范化处理

* 增加一个单元测试

* 优化 code-style

ab46b14c

H

fix new ir sigmoid cross entropy op (#55284) · 22c49634
由 hong 提交于 7月 11, 2023

22c49634
A
[NewIR]Refine IrPrinter and basic Concept Interface for const Object (#55209) · 4fa3e149
由 Aurelius84 提交于 7月 11, 2023
```
* [NewIR]Refine IrPrinter and basic Concept Interface for const Object
```
4fa3e149

10 7月, 2023 11 次提交
- D
  
  fix generator pickle for custom device (#55247) · b20d22df
  由 duanyanhui 提交于 7月 10, 2023
  
  b20d22df
- H
  Fix new ir polygama op bug (#55278) · 31bf1e88
  由 hong 提交于 7月 10, 2023
```
* update polygama

* udpate compact

* format yaml file
```
  31bf1e88
- L
  [SemiAuto] move ut of auto_parallel (#55217) · 89600fa1
  由 Leo Chen 提交于 7月 10, 2023
```
* move ut of auto_parallel

* fix ut
```
  89600fa1
- K
  [NewIR] add stop_gradient attribute for defining op (#55235) · c5a191bb
  由 kangguangli 提交于 7月 10, 2023
```
* add stop_gradient attribute for defining op

* modify by reviews

* fix
```
  c5a191bb
- Y
  
  [PASS] add constant folding pass (#55099) · 4905a247
  由 Yuanle Liu 提交于 7月 10, 2023
  
  4905a247
- R
  
  [CustomDevice] add custom device support for Variable.set_value (#55272) · df311526
  由 ronnywang 提交于 7月 10, 2023
  
  df311526
- R
  
  [CustomDevice] fix get_paddle_place (#55225) · 33730ae7
  由 ronnywang 提交于 7月 10, 2023
  
  33730ae7
- H
  [NewIR] fix new ir affine grid bug (#55244) · df21f815
  由 hong 提交于 7月 10, 2023
```
* fix affine grid bug

* revert cummax
```
  df21f815
- K
  
  update white_list and remove warning (#55243) · a8cd12d2
  由 kangguangli 提交于 7月 10, 2023
  
  a8cd12d2
- Z
  [IR] Support inplace execute logic for NewIrInterpreter (#55210) · e8cba1cb
  由 zhangbo9674 提交于 7月 10, 2023
```
* add inplace interface

* support inplace

* refine code

* fix bug

* fix bug

* refien code
```
  e8cba1cb
- W
  
  Fix load fine tune error (#55233) · 5f00305d
  由 WangZhen 提交于 7月 10, 2023
  
  5f00305d
08 7月, 2023 1 次提交
- 张
  [CodeStyle][CINN] ruff F401 and F403 in python/cinn (#55182) · 32bc8b88
  由张春乔提交于 7月 08, 2023
```
Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
```
  32bc8b88
07 7月, 2023 9 次提交
- H
  [NewIR] Fix new ir cross op (#55222) · 832d6516
  由 hong 提交于 7月 07, 2023
```
* fix exception bug

* update

* fix attribute translator bug

* remove usless code
```
  832d6516
- Y
  [Semi-Auto] Add reduction spmd rule (#54991) · 35b72e87
  由 Yichen Zhang 提交于 7月 07, 2023
```
* add reduction spmd rule for auto parallel

* fix the logic of handling partial

* fix code style

* fix the partial handling
```
  35b72e87
- G
  修改COPY-FROM No. 2 jit (#54920) · a02e6dbd
  由 gouzil 提交于 7月 07, 2023
```
* [jit] add copy-from; test=document_fix

* [jit] add copy-from; test=document_fix

* fix TracedLayer
```
  a02e6dbd
- X
  
  [fix] move exception throw out of omp parallel for loop (#55064) · 9ed8bafd
  由 xiaoye 提交于 7月 07, 2023
  
  9ed8bafd
- W
  
  [XPU] Add layernorm fuse pass (#55154) · eb12739e
  由 wz1qqx 提交于 7月 07, 2023
  
  eb12739e
- R
  
  [CustomDevice] fix resource_pool release bug (#55229) · 6af85a81
  由 ronnywang 提交于 7月 07, 2023
  
  6af85a81
- W
  
  [XPU] Eliminate small ops (#55193) · b8f265d2
  由 wz1qqx 提交于 7月 07, 2023
  
  b8f265d2
- M
  
  add odd rules for getting kernels (#55178) · 0bcbfe83
  由 ming1753 提交于 7月 07, 2023
  
  0bcbfe83
- M
  [IR&PASS] add conv + elementwise_add fuse pattern (#55176) · 463a4f25
  由 ming1753 提交于 7月 07, 2023
```
* [IR&PASS] add conv + elementwise_add fuse pattern

* add conv2dAddPattern to pass
```
  463a4f25

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功