提交 · 4f307a7e1589ab3d3242bdd1d5637baffc69552f · PaddlePaddle / Paddle

20 7月, 2023 6 次提交

Z

rename hard_sigmoid to hardsigmoid for kernel name (#55559) · c3080386
由 zyfncg 提交于 7月 20, 2023

c3080386

[XPU][PHI Kernels] bind reduce_max_int64 set_value_bool sin_grad_fp32... · ab00c96c

由 lijin23 提交于 7月 20, 2023

[XPU][PHI Kernels] bind reduce_max_int64 set_value_bool sin_grad_fp32 cos_grad_fp32 for XPU (#55375)

* bind kernels for xpu

* format code

* format code

* 0d support for set value

* refine set_value

ab00c96c

M

fix bug of constant folding pass (#55556) · bc61c796
由 ming1753 提交于 7月 20, 2023

bc61c796

[Kunlun] Modify some legacy code on distributed training (#55515) · 806f8d2b

由 XiaociZhang 提交于 7月 20, 2023

* [Kunlun] Mofify some legacy code on distributed training

There were limitations on XPUs before, such as concat/split is not
supported, and c_broadcast only support fp32. These limitations are
lifted recently.

Multi-device profiling on XPU will also be supported by this PR.
Without this PR, a hanging broadcast will be issued by devices that
enables profiling, eventually lead to kernel timeout error.

* fix typo

806f8d2b

[Semi Auto] Entropy SPMD Rule (#55394) · 5f376f00

由 JZ-LIANG 提交于 7月 20, 2023

* base rule

* add sharidng merge

* add sharidng axis merge

* define unified data class for inferencing dist_attr

* test wrap DistTensorSpec in dygraph mode

* matmul main logic done

* shape int64

* common cc

* define unified data class for inferencing dist_attr

* test wrap DistTensorSpec in dygraph mode

* define python api and wrap function in static mode for DistTensorSpec

* revise syntax

* map bugfix

* broadcast func

* compile 1

* add unitest

* add registry

* update unitest

* bugfix

* bugfix

* add pybind

* bugfix

* bugfix macro gloabl name space

* bugfix macro gloabl name space

* pybind

* pybind test

* pybind bugfixed1

* pybind bugfixed2

* pybind unitest

* merge dev

* merge dev

* merge dev

* fixed cmake conflict

* fixed cmake conflict

* rename get method

* revise inferforward output type

* revise comment

* replicated rule

* replicated rule 2

* revert bug deps

* add rule

* add unitest

* add rule

* add unitest

* move ut of auto_parallel

* fix ut

* bugfix

* bugfix

* bugfix

* bugfix

* bugfix

* bugfix

* bugfix

* resolute input sharding conflict maybe

* fixed comment

* add rule

* add unitest

* fixed typoes

---------
Co-authored-by: NYichen Zhang <zhangyichen03@baidu.com>
Co-authored-by: Nzhiqiu <chenqiuliang@baidu.com>

5f376f00

Z
[IR] Add variable name prefix for BuildScope (#55536) · 44f409cf
由 zhangbo9674 提交于 7月 20, 2023
```
* add interface

* add code

* add code

* add code

* add code

* fix bug

* fix bug

* add var prefix
```
44f409cf

19 7月, 2023 16 次提交
- R
  
  [CustomPass] add register_pass api (#55511) · 6216beb3
  由 ronnywang 提交于 7月 19, 2023
  
  6216beb3
- R
  
  [PHI CAPI] Add support for registering a new operator, PART1 (#55532) · 3f17596a
  由 ronnywang 提交于 7月 19, 2023
  
  3f17596a
- 张
  [OpCompat] add and update fill in op_compat.yaml (#55517) · 77032f0e
  由张春乔提交于 7月 19, 2023
```
* Update op_compat.yaml

* Update op_compat.yaml
```
  77032f0e
- K
  
  fix put_along_axis (#55513) · 5a69dbcb
  由 kangguangli 提交于 7月 19, 2023
  
  5a69dbcb
- K
  [NewIR] fix one_hot_v2 compat (#55317) · 1dcc3bf7
  由 kangguangli 提交于 7月 19, 2023
```
* fix

* fix

* fix

* fix

* fix

* fix coverage ci

* add test case
```
  1dcc3bf7
- A
  
  [NewIR]Replace frontend::Program & hlir::Graph with ::ir::Program in CINN (#55186) · 72a910e4
  由 Aurelius84 提交于 7月 19, 2023
  
  72a910e4
- Z
  [IR] Add Dependency build for new ir interpretercore (#55468) · fd192303
  由 zhangbo9674 提交于 7月 19, 2023
```
* add interface

* add code

* add code

* add code

* add code

* fix bug

* fix bug
```
  fd192303
- C
  
  add TRT op unbind (#55476) · 4a55f5e7
  由 chen 提交于 7月 19, 2023
  
  4a55f5e7
- L
  
  skip first time reset (#55498) · 89e54d69
  由 Leo Chen 提交于 7月 19, 2023
  
  89e54d69
- H
  [NewIR]Add feed with place op (#55343) · 8e9e0659
  由 hong 提交于 7月 19, 2023
```
* add feed with place op

* remove useless unitest

* udpate mkldnn

* update

* add enable_static

* remove useless test case

* register int and doubel type

* fix bug
```
  8e9e0659
- C
  [AutoParallel] Polish DistTensor details (#55436) · 927c0d50
  由 Chen Weihang 提交于 7月 19, 2023
```
* polish dist_tensor details

* add unittest for coverage

* revert uselesss change

* skip test without dist
```
  927c0d50
- C
  
  Delete repeat ops add gather squeeze unsqueeze (#55371) · 552ed8d8
  由 csy0225 提交于 7月 19, 2023
  
  552ed8d8
- Y
  
  Sharding stage 1 tensor fusion (#55427) · 4c4d3185
  由 Yuang Liu 提交于 7月 19, 2023
  
  4c4d3185
- T
  Add gpups script (#55479) · f7cbfc4c
  由 tianshuo78520a 提交于 7月 19, 2023
```
* Add gpups ci test script

* test=gpups

* test=gpups

* test=gpups

* test=gpups
```
  f7cbfc4c
- Z
  delete relu6_raw (#55383) · 56d46ccc
  由 zhangyuqin1998 提交于 7月 19, 2023
```
* delete relu6_raw

* fix codestyle

* Update test_mkldnn_matmul_activation_fuse_pass.py

* fix

* Update backward.yaml

* Update ops.yaml

* Update backward.yaml
```
  56d46ccc
- S
  Fix mea segmentation fault error (#55408) · cc262c55
  由 sneaxiy 提交于 7月 19, 2023
```
* fix mea seg fault develop

* fix bias_grad seg fault
```
  cc262c55
18 7月, 2023 9 次提交

Clarify cinn/ir dirs [Part1] (#55121) · 3624723f

由 limingshu 提交于 7月 18, 2023

* Clarify cinn/ir dirs [Part1]

* addition of cinn/ir/op dir

* change header inludsion of ir/ir_operator.h to ir/op/ir_operator.h

* merge with develop changes

* relocate libschedule_desc_proto.a

* remove extra ir_schedule_error.cc

* addition for schedule/ir_schedule_error files

3624723f

batch add inpalce api (#55078) · 19302938

由 GGBond8488 提交于 7月 18, 2023

* batch add inpalce api

* fix inplace fn generate

* add test for  new inpalce api

* fix typro

* fix typro

* fix typro

* fix test error

* fix atan2

* remove atan2

* auto genereate inpalce api

* fix inplace generate fn error

* fix windows error

* fix test error

* fix test error

* fix windows ci error

* fix test error

* fix test_error

* fix test error

* fix eigen aliasing error in inplace

* remove elementwise_pow inplace

* fix doc error

* fix test error

19302938

[NewIR]Fix new ir concat split bug (#55419) · 5e6645d7

由 hong 提交于 7月 18, 2023

* fix new ir concat op bug

* fix bug

* using add_n_with_kernel instead of add_n impl

* fix pd_op yaml bug

* fix bug

5e6645d7

K
[NewIR] fix hsigmoid_loss (#55483) · 38782dc3
由 kangguangli 提交于 7月 18, 2023
```
* fix hsigmoid_loss

* add test into whitelist
```
38782dc3
L

fix typo: thream->stream (#55445) · 2558364c
由 Leo Chen 提交于 7月 18, 2023

2558364c
H
[0D-Tensor] CINN supports argmax, fix infershape (#55489) · 015285fd
由 HongyuJia 提交于 7月 18, 2023
```
* [0D-Tensor] CINN supports argmax, fix infershape

* [0D-Tensor] CINN supports argmax, fix infershape
```
015285fd
H

[0D-Tensor] CINN supports softmax and flip, fix infershape (#55470) · c7ba0312
由 HongyuJia 提交于 7月 18, 2023

c7ba0312
G
[OpCompat] add cast and repeat_interleave in op_compat.yaml (#55467) · 922d2481
由 gouzil 提交于 7月 18, 2023
```
* add cast and repeat_interleave

* fix
```
922d2481
K
[NewIR] support custom verify in op definition generation (#55428) · 7bd50187
由 kangguangli 提交于 7月 18, 2023
```
* support custom verify

* fix

* fix

* fix

* fix coverage ci

* remove custom verify in assert
```
7bd50187

17 7月, 2023 9 次提交
- Z
  
  update slice in op_compat.yaml (#55432) · c16ab557
  由 Zhenghai Zhang 提交于 7月 17, 2023
  
  c16ab557
- Z
  
  fix bug (#55471) · e9b8feac
  由 zhangbo9674 提交于 7月 17, 2023
  
  e9b8feac
- W
  
  [IR] optimize the error log. (#55465) · 1d2a91c6
  由 winter-wang 提交于 7月 17, 2023
  
  1d2a91c6
- I
  [Paddle-TRT] Support conv2d op enter into trt when filter is not a persistable tensor (#55246) · 74206917
  由 iamsonderr 提交于 7月 17, 2023
```
* support_conv2d

* remove comment

* check code style

* add former Test

* check code style

* add unittest

* fix log

* change unittest

---------
Co-authored-by: zhoutianzi666 <17801055074@163.com>
```
  74206917
- Z
  Support more dtype for any/all API. (#55253) · 7b19efe4
  由 zxcd 提交于 7月 17, 2023
```
* add more data type for all/any.

* remove xpu fix.

* add test unit.

* fix typename name.

* fix output data type.
```
  7b19efe4
- Z
  TensorSetConstantXPU support to use xpu::constant when T is float/float16 (#55122) · 6692dc9a
  由 zhangyikun02 提交于 7月 17, 2023
```
* TensorSetConstantXPU support to use xpu::constant when T is float/float16

* add xpu_wait for TensorSetConstantXPU
```
  6692dc9a
- H
  Remove Old Schedules in Ops (#55391) · 70183c4b
  由 Huihuang Zheng 提交于 7月 17, 2023
```
Remove old schedules.
```
  70183c4b
- Z
  
  Delete unused code (#55413) · db1f2c42
  由 Zhang Zheng 提交于 7月 17, 2023
  
  db1f2c42
- R
  
  update transpose in op_compat.yaml (#55458) · f66a705f
  由 RedContritio 提交于 7月 17, 2023
  
  f66a705f

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功