Commits · 9c24a4acce07307c2bd35bd6ac6fa3c4e2578341 · PaddlePaddle / Paddle

13 Feb, 2023 · 1 commit

Fix div 0 error of case25: paddle.dot (#50014) · 5c5536cb

Committed by RedContritio on Feb 13, 2023

* support size 0 dot input

* prevent div 0 in grad

* add unittest

* remove unnecessary vlog

* add unittests

5c5536cb
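
The commit above mentions supporting size-0 inputs to `paddle.dot` and avoiding a division by zero in the gradient. The sketch below only illustrates the expected math on plain Python lists; it is not Paddle's kernel, and the helper names `dot` and `dot_grad` are hypothetical. The assumption is that an empty dot product is an empty sum (0) and that its gradients are empty as well.

```python
def dot(x, y):
    """Dot product of two equal-length 1-D sequences (illustrative only)."""
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    # sum() over an empty iterable returns 0, so size-0 inputs need no special case.
    return sum(a * b for a, b in zip(x, y))


def dot_grad(x, y, grad_out):
    """Gradients of dot(x, y) with respect to x and y for a scalar upstream grad."""
    grad_x = [grad_out * b for b in y]  # d(dot)/dx_i = y_i
    grad_y = [grad_out * a for a in x]  # d(dot)/dy_i = x_i
    return grad_x, grad_y
```

Because neither function divides by the input length, empty inputs pass through without any special handling.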

12 Feb, 2023 · 1 commit
- [prim] generate static prim api (#50315) · 82cf1fad
  Committed by Xiaoxu Chen on Feb 12, 2023
  
  82cf1fad
11 Feb, 2023 · 1 commit

[Tensor Operator] Overload Tensor Operator (#50098) · 14e45f6b

Committed by HongyuJia on Feb 11, 2023

* init commit

* fix tensor operator*

* fix compile bug

* bug reproduce

* update commit

* polish codes

* fix compile bug

* test begin

* test begin

* compile finish

* restore origin composite_backward_api

* pass local CI

* fix merge error

* fix merge error

* change py_test from GPU->CPU, test custom op

* polish codes, modify prim unittest

* modify prim unittest

* determine phi_tensor_operants location

* polish codes

* add header file

* solve windows unresolved symbol

* fix some CI error

* add overload definition

* fix CI inference and Windows

* polish codes according to reviewers' opinion

* polish codes according to reviewers' opinion

14e45f6b

10 Feb, 2023 · 9 commits
- remove if constexpr(), which is not supported on gcc54 (#50395) · 22bcb75a
  Committed by umiswing on Feb 10, 2023
  
  22bcb75a
- Fix bugs and add unit tests in instance_norm_grad_kernel when d_scale and (#50394) · 4c373e6b
  Committed by Leo Guo on Feb 10, 2023
```
d_bias are nullptr. Modify the code style of full_kernel.cc. Add new data
type for concat, elementwise_add, gather, scale, scatter ops. test=kunlun
```
  4c373e6b
- add xpu batch norm ncdhw layout, test=kunlun (#50384) · ca520280
  Committed by ykkk2333 on Feb 10, 2023
  
  ca520280
- fix stackoverflow case13 gather (#50243) · bf80664c
  Committed by Infinity_lee on Feb 10, 2023
  
  bf80664c
- Fix UFA illegal address access of case2: paddle.scatter (#50025) · fb228c4a
  Committed by RedContritio on Feb 10, 2023
```
* add dim check in scatter

* add check in scatter.cu

* add unittest

* remove unnecessary log and comment

---------

Co-authored-by: RedContritio <>
```
  fb228c4a
- [XPU] add fc_xpu op&pass to optimize ernie model (#50277) · 945f918c
  Committed by zhupengyang on Feb 10, 2023
  
  945f918c
- [phi decoupling] remove AllocatorFacade in phi (#50380) · d1bfb4b7
  Committed by Huang Jiyi on Feb 10, 2023
```
* remove AllocatorFacade in phi

* fix include

* fix bugs
```
  d1bfb4b7
- [phi decoupling] rm gradient_accumulator in phi (#50385) · 13f57ec0
  Committed by Huang Jiyi on Feb 10, 2023
```
* rm gradient_accumulator in phi

* update
```
  13f57ec0
- [XPU] bind op: atan & deformable_conv_v1 (#50373) · e15ef948
  Committed by wangshengxiang on Feb 10, 2023
  
  e15ef948
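
The `paddle.scatter` fix listed above (#50025) adds dimension and index checks to prevent an illegal address access. As a rough illustration of why such checks matter, here is a minimal sketch of an overwrite-style 1-D scatter on plain Python lists. The function name `scatter_1d` and the exact checks are assumptions for illustration, not the checks the commit actually added.

```python
def scatter_1d(x, index, updates):
    """Return a copy of x with x[index[i]] overwritten by updates[i]."""
    # Shape check: one update per index.
    if len(index) != len(updates):
        raise ValueError("index and updates must have the same length")
    out = list(x)
    for i, u in zip(index, updates):
        # Bounds check: reject indices that would write outside the buffer.
        if not 0 <= i < len(out):
            raise IndexError(f"index {i} is out of range for size {len(out)}")
        out[i] = u
    return out
```

In a native kernel, skipping the bounds check would turn the same mistake into an out-of-bounds write rather than a Python exception.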
09 Feb, 2023 · 6 commits

Modify full kernel for xpu. test=kunlun (#50209) · 18e0e01d

Committed by Leo Guo on Feb 09, 2023

18e0e01d

[PHI decoupling] move strided_memcpy.h to phi (#50346) · 17318c1a

Committed by Huang Jiyi on Feb 09, 2023

* decouple strided_memcpy

* move strided_memcpy

* move strided_memcpy to phi

* fix namespace

* update

* fix gpu compile bugs

17318c1a

remove layout_utils in phi (#50355) · 90650534

Committed by Huang Jiyi on Feb 09, 2023

90650534

Add MultiTenosrAdam OP (#49220) · 10654c77

Committed by yuehuayingxueluo on Feb 09, 2023

* add multi_tenosr_adam

* update multi_tensor_base.py, test_multi_tensor_adam.py, adamw.py

* fix adam.py optimizer.py

* fix adamw.py

* fix test_multi_tensor_adam.py

* fix CI bug

* fix CI coverage

* fix ci bug

* fix betapow

* fix some bugs

* fix test_adamw_op.py

* fix CI coverage

* fix multi_tensor_adam_kernel.cc

* fix CI bug

* fix multi_tensor_adam_op.cc and test_multi_tensor_adam.py

* fix code style

* update C++ parts

* remove python parts modification temporarily

* add C++ ut

* update betapow copy code logic

* fix ci ut

* fix windows ci

* fix coverage ci

* improve coverage rate

---------
Co-authored-by: sneaxiy <sneaxiy@126.com>

10654c77

add logical_and, logical_or and logical_xor for xpu (#50228) · 0036316e

Committed by zhangyikun02 on Feb 09, 2023

0036316e

fix set_value_65965 (#50340) · b3f60f39

Committed by 傅剑寒 on Feb 09, 2023

b3f60f39

08 Feb, 2023 · 7 commits
- fuse quantize+transpose and transpose+dequantize (#49509) · 197a4ffe
  Committed by Paulina Gacek on Feb 08, 2023
```
* QuantTranpose pattern is being found by pass

* quant + transpose fuse

* code style changes

* UT written, reorder fixed

* Dequantize + transpose2 fuse  added

* pass name changed

* UT added & shift corrected

* got rid of redundancy

* review changes

* AsIntermediate corrected

* compat added
```
  197a4ffe
- Use inference, save construct time (#50163) · 7a82b6de
  Committed by HongyuJia on Feb 08, 2023
  
  7a82b6de
- Fix bn performance degradation (#50287) · 6f1ec935
  Committed by zhangkaihuo on Feb 08, 2023
```
* fix bn performance degradation
```
  6f1ec935
- [Tensor Support unsigned] Tensor::data() supports unsigned int and bfloat16 (#50257) · 80dc81c5
  Committed by HongyuJia on Feb 08, 2023
```
* support unsigned int and bfloat16

* update unit test

* update DenseTensor datatype

* unsupport more datatype of mutable_data(Place)

* fix unittest
```
  80dc81c5
- [Zero-Dim] Fix 0d axis support for argmin/argmax (#50293) · aec1e4ce
  Committed by Zhong Hui on Feb 08, 2023
  
  aec1e4ce
- move mixed_vector (#50282) · 35d7d1f0
  Committed by Huang Jiyi on Feb 08, 2023
  
  35d7d1f0
- [PHI]Unify Fluid and PHI kernel (#49328) · e92e3aab
  Committed by YuanRisheng on Feb 08, 2023
```
* unify_kernel

* fix compile bugs

* modify macro name

* perfect code according comment

* fix compile bugs

* fix compile bugs

* fix ci bugs

* fix ci bug

* fix ci bugs

* fix ci bugs

* modify code according comment

* rm conv_fusion_op
```
  e92e3aab
07 Feb, 2023 · 5 commits
- Remove axis in some elementwise api (#50190) · 1dedaada
  Committed by zyfncg on Feb 07, 2023
```
* remove axis in some elementwise api

* fix inplace bug eager-gen

* fix bug

* revert change for CheckInplace

* polish code
```
  1dedaada
- fix div 0 error in conv1/2/3 (#49999) · 7a0fdeb9
  Committed by 张春乔 on Feb 07, 2023
  
  7a0fdeb9
- add batch_norm composite rule (#49894) · 9b3a41b1
  Committed by cyber-pioneer on Feb 07, 2023
```
move composite test case

remove unuseful var

add composite op blacklist
```
  9b3a41b1
- Support build with gcc12 for CUDA less than 12.0 (#50106) · 755049f2
  Committed by chalsliu on Feb 07, 2023
  
  755049f2
- Fix gather, scatter op 0d tenor GPU error. (#50271) · 05c9c0a5
  Committed by Yuang Liu on Feb 07, 2023
  
  05c9c0a5
06 Feb, 2023 · 7 commits
- remove profiler (#50191) · 5a13280a
  Committed by YuanRisheng on Feb 06, 2023
  
  5a13280a
- Delete extra input (Bias, ResidualData) in OpMaker of conv2d (#49121) · 2deada9a
  Committed by zyfncg on Feb 06, 2023
```
* remove extra input of conv2d

* fix bug

* fix unittest bug

* adjust conv2d.pbtxt

* fix cpu_quantize_pass_tester

* revert use_addto of conv2d

* fix runtime attribute

* fix bug

* recover force_fp32_output in conv2d

* refine error info

* fix bug
```
  2deada9a
- fix div 0 error of split (#49958) · e12c9221
  Committed by 张春乔 on Feb 06, 2023
  
  e12c9221
- [XPU] add int type for concat and split functor (#50200) · b3e5b0c4
  Committed by houj04 on Feb 06, 2023
  
  b3e5b0c4
- unique_consecutive add 0d (#50213) · eb8353a4
  Committed by duanboqiang on Feb 06, 2023
  
  eb8353a4
- phi move ReshapeToMatrix & GetValue (#50139) · d09962a1
  Committed by engineer1109 on Feb 06, 2023
  
  d09962a1
- fix gcc12 error: mismatched-new-delete error in custom_device.cc (#47466) · 6d70761e
  Committed by risemeup1 on Feb 06, 2023
  
  6d70761e
03 Feb, 2023 · 3 commits

Fix stack overflow of case8: paddle.unique_consecutive (#49983) · 83077f6f
Committed by RedContritio on Feb 03, 2023
```
* support negative index in unique_consecutive

* add unittest

* add unittest
```
83077f6f
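
The commit above adds support for negative indices in `paddle.unique_consecutive`. A common way to handle a negative axis is to normalize it against the number of dimensions before use; the sketch below shows that convention together with a 1-D consecutive-deduplication loop. Both helper names are hypothetical and the code runs on plain Python lists. It is not Paddle's implementation.

```python
def normalize_axis(axis, ndim):
    """Map an axis in [-ndim, ndim) to its non-negative equivalent."""
    if not -ndim <= axis < ndim:
        raise IndexError(f"axis {axis} is out of range for {ndim} dimensions")
    return axis + ndim if axis < 0 else axis


def unique_consecutive_1d(values):
    """Collapse runs of equal adjacent elements, keeping the first of each run."""
    out = []
    for v in values:
        # Only compare with the previous kept element, not with the whole output.
        if not out or out[-1] != v:
            out.append(v)
    return out
```

Validating the axis up front means an out-of-range value fails with a clear error instead of being used to index memory.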

Replace matmul(v2) with fused_matmul during oneDNN fuse passes (#49515) · 5cfe1645

Committed by Sławomir Siwek on Feb 03, 2023

* replace matmul with matmul_v2 in fuse passes

* Remove fusion logic from matmul

* removing fusion methods

* add proper name

* adjust namespaces

* clean attrs in python tests

* delete checkpoint and restore matmul version

* remove unused code

* matmul and reshape/transpose fuses migrated

* split MatmulOneDNN headers

* fuse activation and eltwise_add

* add fuse_activation

* matmul_transpose_reshape/reshape_transpose_matmul

* matmul + elementwise_add (fused)

* activation temporary modification

* merge newest develop

* remove dependency from other PR

* revert pbtxt

* remove placeholders from matmul_v2

* add description in OPMaker

* remove matmul_v2_op.h and all dependencies

* remove dims changing in base op

* add possibility to fuse already fused_matmul

* restart broken CI

* Empty-Commit

* revert matmul_utils.h

* codestyle

* adjust imports

* add pbtxt file

* 100% matmul unit tests coverage

* trigger CI with minimal changes to develop

* adjust changes to develop

* add fused_matmul op

* inherit base ops

* add "v2"

* move OPMaker

* Gradually add fused_matmul files

* second batch of fused_matmul changes

* split infershapes of matmul_v2 and fused_matmul

* inherit fused_matmul from matmul_v2

* Update paddle/phi/backends/onednn/onednn_reuse.h
Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>

* Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc
Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>

---------
Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>

5cfe1645

Fix div 0 error of case20: paddle.min (#50013) · 50c43dd3

Committed by RedContritio on Feb 03, 2023

50c43dd3

PaddlePaddle / Paddle · synced successfully more than 1 year ago
