提交 · df0ed4d6f9eef3434664025fab7d9763ea9cc41d · PaddlePaddle / Paddle

15 2月, 2023 5 次提交
- C
  fix composite op map (#50397) · ff86aeab
  由 cyber-pioneer 提交于 2月 15, 2023
```
* map output from composite rule to origin op

add mean layer_norm dropout op map

add input map check

composite softmax support input shape []

* composite softmax support shape []

* polish log

* solve conflict

* polish code

* polish op map output

* add check dtype
```
  ff86aeab
- Y
  [PHI Decoupling]Remove Profiler header (Part2) (#50183) · 8fabca11
  由 YuanRisheng 提交于 2月 15, 2023
```
* move profiler

* add file

* fix mac compile bugs

* fix ci bugs

* fix mac bugs

* fix ci bugs

* fix compile bugs

* perfect code according comment
```
  8fabca11
- Z
  
  add gather_nd_grad op and where_grad support zero_dim for xpu (#50454) · 055d0c2d
  由 zhangyikun02 提交于 2月 15, 2023
  
  055d0c2d
- Q
  
  remove duplicated op in xpu2_op_list (#50450) · 47c23ccb
  由 QingshuChen 提交于 2月 15, 2023
  
  47c23ccb
- Y
  [CUSTOM]custom device add black_list (#50409) · 66d3c56e
  由 YuhangLi 提交于 2月 15, 2023
```
* [CUSTOM]custom device add black_list

* change log level

* fix some issues
```
  66d3c56e
14 2月, 2023 6 次提交
- E
  decouple tensor_utils (#50264) · 057cdb95
  由 engineer1109 提交于 2月 14, 2023
```
fix X

remove TensorCopy

codestyle

add fluid memory header

fix symbol

fix cmake

fix cmake

fix context

fix header

fix place

fix context

fix context

fix context

fix code

fix custom context

fix custom context

fix copy

fix data_transform

fix style

remove changes of custom

fix scalar
```
  057cdb95
- D
  Expand mixed_precision to custom device (#50378) · fcb746cb
  由 duanyanhui 提交于 2月 14, 2023
```
* expand mix_precision to custom_device

* fix bug

* fix bug

* fix comment

* fix DEFINE bug
```
  fcb746cb
- H
  
  fix operants_manager.cc compile error (#50492) · 4a7d9cd8
  由 HongyuJia 提交于 2月 14, 2023
  
  4a7d9cd8
- H
  [Polish Namespace] Polish operants namespace (#50420) · 61a933ac
  由 HongyuJia 提交于 2月 14, 2023
```
* polish namespace

* change static_tensor_operants

* polish namespace
```
  61a933ac
- S
  
  support int8 for embedding (#50413) · 78eb2d87
  由 seemingwang 提交于 2月 14, 2023
  
  78eb2d87
- L
  Decrease usage of GetVecSize for optimizing host computation efficiency (#50353) · 976606fe
  由 limingshu 提交于 2月 14, 2023
```
* first commit.

* a little changes

* add some changes for get vec_size efficiently

* fix bugs

---------
Co-authored-by: Nzhangbopd <1299246947@qq.com>
```
  976606fe
13 2月, 2023 5 次提交
- Z
  Delete axis of fmin kernel (#50358) · 8df8cb10
  由 zyfncg 提交于 2月 13, 2023
```
* delete axis of fmin

* fix bug
```
  8df8cb10
- H
  
  Fix compile error of operants_manager.cc (#50442) · 615d9f53
  由 HongyuJia 提交于 2月 13, 2023
  
  615d9f53
- H
  [Tensor data()] Tensor support `void* data()` function (#50262) · 8907cdca
  由 HongyuJia 提交于 2月 13, 2023
```
* Tensor support void* data() function

* add unittest

* add selectedRows unittest

* polish unittest

* polish unittest

* polish unittest

* polish unittest
```
  8907cdca
- Y
  add xpu pool3d kernels (#50233) · 1281b612
  由 ykkk2333 提交于 2月 13, 2023
```
* add xpu adagrad and where_grad kernels, test=kunlun

* add xpu pool3d kernels, test=kunlun
```
  1281b612
- R
  Fix div 0 error of case25: paddle.dot (#50014) · 5c5536cb
  由 RedContritio 提交于 2月 13, 2023
```
* support size 0 dot input

* prevent div 0 in grad

* add unittest

* remove unnecessary vlog

* add unittests
```
  5c5536cb
12 2月, 2023 1 次提交
- X
  
  [prim] generate static prim api (#50315) · 82cf1fad
  由 Xiaoxu Chen 提交于 2月 12, 2023
  
  82cf1fad
11 2月, 2023 1 次提交

[Tensor Operator] Overload Tensor Operator (#50098) · 14e45f6b

由 HongyuJia 提交于 2月 11, 2023

* init commit

* fix tensor operator*

* fix compile bug

* bug reproduce

* update commit

* polish codes

* fix compile bug

* test begin

* test begin

* compile finish

* restore origin composite_backward_api

* pass local CI

* fix merge error

* fix merge error

* change py_test from GPU->CPU, test custom op

* polish codes, modify prim unittest

* modify prim unittest

* determine phi_tensor_operants location

* polish codes

* add header file

* solve windows unresolved symbol

* fix some CI error

* add overload defination

* fix CI inference and Windows

* polish codes according to reviewers' opinion

* polish codes according to reviewers' opinion

14e45f6b

10 2月, 2023 9 次提交
- U
  
  remove if constexpr(), which is not supported on gcc54 (#50395) · 22bcb75a
  由 umiswing 提交于 2月 10, 2023
  
  22bcb75a
- L
  Fix bugs and add unit tests in instance_norm_grad_kernel when d_scale and (#50394) · 4c373e6b
  由 Leo Guo 提交于 2月 10, 2023
```
d_bias are nullptr. Modify the code style of full_kernel.cc. Add new data
type for concat, elementwise_add, gather, scale, scatter ops. test=kunlun
```
  4c373e6b
- Y
  
  add xpu batch norm ncdhw layout, test=kunlun (#50384) · ca520280
  由 ykkk2333 提交于 2月 10, 2023
  
  ca520280
- I
  
  fix stackoverflow case13 gather (#50243) · bf80664c
  由 Infinity_lee 提交于 2月 10, 2023
  
  bf80664c
- R
  Fix UFA非法地址访问(UFA illegal address access) of case2: paddle.scatter (#50025) · fb228c4a
  由 RedContritio 提交于 2月 10, 2023
```
* add dim check in scatter

* add check in scatter.cu

* add unittest

* remove unnecessary log and comment

---------

Co-authored-by: RedContritio <>
```
  fb228c4a
- Z
  
  [XPU] add fc_xpu op&pass to optimize ernie model (#50277) · 945f918c
  由 zhupengyang 提交于 2月 10, 2023
  
  945f918c
- H
  [phi decoupling] remove AllocatorFacade in phi (#50380) · d1bfb4b7
  由 Huang Jiyi 提交于 2月 10, 2023
```
* remove AllocatorFacade in phi

* fix include

* fix bugs
```
  d1bfb4b7
- H
  [phi decoupling] rm gradient_accumulator in phi (#50385) · 13f57ec0
  由 Huang Jiyi 提交于 2月 10, 2023
```
* rm gradient_accumulator in phi

* update
```
  13f57ec0
- W
  
  [XPU] bind op: atan & deformable_conv_v1 (#50373) · e15ef948
  由 wangshengxiang 提交于 2月 10, 2023
  
  e15ef948
09 2月, 2023 6 次提交

L

Modify full kernel for xpu. test=kunlun (#50209) · 18e0e01d
由 Leo Guo 提交于 2月 09, 2023

18e0e01d

[PHI decoupling] move strided_memcpy.h to phi (#50346) · 17318c1a

由 Huang Jiyi 提交于 2月 09, 2023

* decouple strided_memcpy

* move strided_memcpy

* move strided_memcpy to phi

* fix namespace

* update

* fix gpu compile bugs

17318c1a

H

remove layout_utils in phi (#50355) · 90650534
由 Huang Jiyi 提交于 2月 09, 2023

90650534

Add MultiTenosrAdam OP (#49220) · 10654c77

由 yuehuayingxueluo 提交于 2月 09, 2023

* add multi_tenosr_adam

* update multi_tensor_base.py, test_multi_tensor_adam.py, adamw.py

* fix adam.py optimizer.py

* fix adamw.py

* fix test_multi_tensor_adam.py

* fix CI bug

* fix CI coverage

* fix ci bug

* fix betapow

* fix some bugs

* fix test_adamw_op.py

* fix CI coverage

* fix multi_tensor_adam_kernel.cc

* fix CI bug

* fix multi_tensor_adam_op.cc and test_multi_tensor_adam.py

* fix code style

* update C++ parts

* remove python parts modification temporarily

* add C++ ut

* update betapow copy code logic

* fix ci ut

* fix windows ci

* fix coverage ci

* improve coverage rate

---------
Co-authored-by: Nsneaxiy <sneaxiy@126.com>

10654c77

Z

add logical_and, logical_or and logical_xor for xpu (#50228) · 0036316e
由 zhangyikun02 提交于 2月 09, 2023

0036316e
傅

fix set_value_65965 (#50340) · b3f60f39
由傅剑寒提交于 2月 09, 2023

b3f60f39

08 2月, 2023 7 次提交
- P
  fuse quantize+transpose and transpose+dequantize (#49509) · 197a4ffe
  由 Paulina Gacek 提交于 2月 08, 2023
```
* QuantTranpose pattern is being found by pass

* quant + transpose fuse

* code style changes

* UT written, reorder fixed

* Dequantize + transpose2 fuse  added

* pass name changed

* UT added & shift corrected

* got rid of redundancy

* review changes

* AsIntermediate corrected

* compat added
```
  197a4ffe
- H
  
  Use inference, save construct time (#50163) · 7a82b6de
  由 HongyuJia 提交于 2月 08, 2023
  
  7a82b6de
- Z
  Fix bn performance degradation (#50287) · 6f1ec935
  由 zhangkaihuo 提交于 2月 08, 2023
```
* fix bn performance degradation
```
  6f1ec935
- H
  [Tensor Support unsigned] Tensor::data() supports unsigned int and bfloat16 (#50257) · 80dc81c5
  由 HongyuJia 提交于 2月 08, 2023
```
* support unsigned int and bfloat16

* update unit test

* update DenseTensor datatype

* unsupport more datatype of mutable_data(Place)

* fix unittest
```
  80dc81c5
- Z
  
  [Zero-Dim] Fix 0d axis support for argmin/argmax (#50293) · aec1e4ce
  由 Zhong Hui 提交于 2月 08, 2023
  
  aec1e4ce
- H
  
  move mixed_vector (#50282) · 35d7d1f0
  由 Huang Jiyi 提交于 2月 08, 2023
  
  35d7d1f0
- Y
  [PHI]Unify Fluid and PHI kernel (#49328) · e92e3aab
  由 YuanRisheng 提交于 2月 08, 2023
```
* unify_kernel

* fix compile bugs

* modify macro name

* perfect code according comment

* fix compile bugs

* fix compile bugs

* fix ci bugs

* fix ci bug

* fix ci bugs

* fix ci bugs

* modify code according comment

* rm conv_fusion_op
```
  e92e3aab

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功