提交 · 69436bf57047454ae2a43f1c158cae570a4b87b2 · PaddlePaddle / Paddle

01 4月, 2023 2 次提交
- J
  Delete the /paddle/fluid/platform/device/npu directory (#52384) · 69436bf5
  由 jjyaoao 提交于 4月 01, 2023
```
* Delete the /paddle/fluid/platform/device/npu directory

* clear Cmakelists

* Try removing npu in the header file
```
  69436bf5
- F
  
  enable setting double attribute into opdesc (#52406) · 8a4aee18
  由 Feiyu Chan 提交于 4月 01, 2023
  
  8a4aee18
31 3月, 2023 17 次提交

由 zhenhailiu 提交于 3月 31, 2023

* gather with doc

* resolve comment

* polish

* polish

* code style

* polish doc

* add_test

* polish

* polish

* add test check

* add test check

* polish

* polish

* polish

* polish

* fix_time_out

* polish

* fix timeout

* fix_timeout

* polish

* polish

* polish

* polish

* polish

77d24854

R

support auto generate static for eye (#52370) · 20ee0d7f
由 RedContritio 提交于 3月 31, 2023

20ee0d7f

由 huangjiyi 提交于 3月 31, 2023

* update bipartite_match

* update

* fix bug

* fix test

* fix bug

* fix Kunlun-KP-Build

* Revert "fix Kunlun-KP-Build"

This reverts commit ceab63cc23079fd6839c826bb52db893fb056355.

* update

d05b73e4

FIX_LINUX_Wternimate (#52307) · ffff133b

由 Galaxy1458 提交于 3月 31, 2023

* this is a test pr, test=develop

* solve the four [-Wterminate] warning, test=develop

* solve the four [-Wterminate] warning, test=develop

* new fix [-Wterminate], test=delelop

* new fix [-Wterminate], test=delelop

* new fix [-Wterminate], test=delelop

* new , test = develop

* new , test = develop

* new , test = develop

* new , test = develop

* new , test = develop

* new , test = develop

ffff133b

陈

删除paddle/fluid/platform/device/mlu目录 (#52382) · d972de56
由陈沧夜提交于 3月 31, 2023

d972de56
J
[kunlun] prevent overflow in collective softmax_with_ce (#52356) · fb276f23
由 jameszhang 提交于 3月 31, 2023
```
* [kunlun] prevent numerical overflow in collective softmax_with_ce

* add fix in another branch
```
fb276f23

[Prim] Add prod backward composite rule (#51238) · a0069278

由 chenjian 提交于 3月 31, 2023

* first commit

* add registry

* add unit test

* fix format

* add unit test

* fix  bug

* replace unsuqeeze to reshape

* fix

* fix unit test

* update test

* update test

* fix unit test

* fix

* fix

a0069278

Add Yaml config for some op (#52347) · 967dee45

由 zyfncg 提交于 3月 31, 2023

* add yaml for some op

* fix inplace_abn

* fix test_leaky_relu_grad_grad_functor

* fix yaml

* fix typo

967dee45

L

fix bug in op_desc (#52396) · 07c7926f
由 Leo Chen 提交于 3月 31, 2023

07c7926f

[CustomOP Optional Inplace] Custom op supports inplace optional tensor (#52216) · fcd77346

由 HongyuJia 提交于 3月 31, 2023

* [CustomOP Inplace] Automap inplace dtype and shape, prepare for vector<Tensor> output

* delete custom_inplace_setup.py

* [CustomOP Optional Inplace] Custom operator supports inplace optional Tensor input

* fix bug for vector<Tensor> inplace test

fcd77346

Y

fix bugs (#52377) · 3ebb7e4d
由 YuanRisheng 提交于 3月 31, 2023

3ebb7e4d
Y
[PHI Decoupling]Remove distribute header (#52202) · e923642e
由 YuanRisheng 提交于 3月 31, 2023
```
* remove distribute

* fix py3 bugs

* fix gpu-ps bugs

* fix compile bugs

* fix unittest bugs
```
e923642e
W
[Paddle-TRT] fix skiplayernorm, add trt_version check (#52342) · 4e23af72
由 Wangzheee 提交于 3月 31, 2023
```
* fix skiplayernorm, add trt_version check
```
4e23af72
E
[GCC9][Werror]fix -Werror=maybe-uninitialized (#52265) · 74d87a61
由 engineer1109 提交于 3月 31, 2023
```
fix with auto&
```
74d87a61

【prim】 optimize layer_norm_grad rules (#52308) · 1da67779

由 xiaoguoguo626807 提交于 3月 31, 2023

* add to sub & delete full scale

* decrease 1_div_shape_2 compute

* x_sub_mean_mul_sqrt_var_1

* delete log

* add mean var test

* nothing

1da67779

张

[CodeStyle][UP030][UP031][UP032] using f-string (#52062) · 40e4f5a5

由张春乔提交于 3月 31, 2023

* autofix
Co-authored-by: NLiyulingyue <83450930+Liyulingyue@users.noreply.github.com>

* revert changes in python/paddle/distributed/fleet/utils/hybrid_parallel_util.py

* empty commit, trigger ci

* fix test_slice

---------
Co-authored-by: NSigureMo <sigure.qaq@gmail.com>

40e4f5a5

Y

use int64 for c split (#52279) (#52340) · 9fd4fd5f
由 Yuang Liu 提交于 3月 31, 2023

9fd4fd5f

30 3月, 2023 21 次提交

[Bug-fix] fix bug of Tensor.item() when CUDAPinnedPlace (#52322) · 0f9ec013
由 zhouweiwei2014 提交于 3月 30, 2023

0f9ec013
Z

[XPU] add delete_cast_op_pass (#52305) · 8b622d58
由 zhupengyang 提交于 3月 30, 2023

8b622d58
K
mv paddle/fluid/platform/device/xpu/tests 2 test/xpu/cpp (#52243) · bc5bae16
由 Kim 提交于 3月 30, 2023
```
* mv paddle/fluid/platform/device/xpu/tests 2 test/xpu/cpp

* add missing cmake
```
bc5bae16
Z
[Move Test] Move prim cpp (#52173) · a445466f
由 Zheng-Bicheng 提交于 3月 30, 2023
```
* update

* update

* update
```
a445466f

support complex data types for libpaddle.Tensor's element get and set (#52324) · 13b12457

由 Feiyu Chan 提交于 3月 30, 2023

1. add type caster for paddle's complex type, to allow pybind to automatically cast it with python's complex type;
2. add complex64 and complex128 data type for `libpaddle.Tensor`'s element get and set(which is required to perturb an element to get the numerical derivative)
3. add support for cuda pinned place in `libpaddle.Tensor` element get and set

---
4. fix a bug in op code generation.(Creation of output folder in concurrent with parsing op yamls.)

13b12457

[AMP] Add python API for collecting operator stats. (#52215) · 73544322

由 Yiqun Liu 提交于 3月 30, 2023

* [AMP] Add python API for collecting operator stats.

* Fix import and polish codes.

* Add more unittest.

* Add doc for the new APIs.

73544322

W
add autogen code support for spectral_norm (#52145) · 28927209
由 Wang Xin 提交于 3月 30, 2023
```
* add autogen code support for spectral_norm

* bug fixed

* fix PR-CI-Static-Check fail
```
28927209

Speedup worker (#51760) · 8ca86d72

由 pangengzheng 提交于 3月 30, 2023

* support run haokanctr model in heterps-models

* polish setup.py

* polish JVM_LIB in evn_dict

* align infer auc with DistPsArch pre-stable

* async and multi thread data feed

* rewrite dense tensor intialization

* async infer shape and reuse memory

8ca86d72

Y

adjust binding order (#52225) · 16ec22c4
由 Yuanle Liu 提交于 3月 30, 2023

16ec22c4

add scatter composite rule. (#52005) · e16eb22c

由 zxcd 提交于 3月 30, 2023

* add scatter composite rule.

* add public_python_api

* add python unit16 support.

* fix code style.

* add cinn to makelist

* cinn unsupport uint16, forbidden cinn when dtype==uint16.

e16eb22c

由 huangjiyi 提交于 3月 30, 2023

* update assign_pos

* update attention_lstm

* update barrier

* update batch_fc

* update beam_search

* update beam_search_decode

* update bilateral_slice

* fix bug

* Handle Structure kernel for InterpreterCore::RunOperator

* fix bug

* fix rocm compile

* fix rocm compile

* Revert "fix rocm compile"

* test

* revert test and update cmake

---------
Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>

93d01787

Z

[XPU] add delete_concat_op_pass (#52304) · 70ebef81
由 zhupengyang 提交于 3月 30, 2023

70ebef81

Fix bug of c_softmax_with_cross_entropy_op_xpu_op (#52296) · 8ef97088

由 Ghost Screaming 提交于 3月 30, 2023

* Support ignore_index for c_softmax_with_cross_entropy_op.

* Polish code. Remove useless comments and add Testcase.

* Polish code for TestCase.

* Polish code.

* Polish code style.

* Polish code.

* Change loss calculation formula and ignore_index dtype.

* Polish TestCase.

* Fix bug of c_softmax_with_cross_entropy_op_xpu_op. Attribute 'ignore_index'
dtype is int64_t.

8ef97088

F

rename Scalar related utility functions(use CamelCase) (#52280) · e5a0dc31
由 Feiyu Chan 提交于 3月 30, 2023

e5a0dc31

support auto generate for prelu (#51913) · d1c7b386

由 Ainavo 提交于 3月 30, 2023

* support auto generate for prelu

* op_compat 中增加输入参数

* del attrs ; add kernel data_type

* add PreluGradInferMeta

d1c7b386

Z

[AMP] use promote dtype when amp_level=O2 (#51063) · 6f8ab1fa
由 Zhang Ting 提交于 3月 30, 2023

6f8ab1fa
R

fix gcc12 error (#52318) · 77b7765f
由 risemeup1 提交于 3月 30, 2023

77b7765f
G
add autogen code support for sigmoid_cross_entropy_with_logits (#52263) · 710c13ed
由 gouzil 提交于 3月 30, 2023
```
* add autogen code support for sigmoid_cross_entropy_with_logits

* add inplace
```
710c13ed
W
add autogen code support for merge_selected_rows (#52274) · 6cd3575c
由 Wang Xin 提交于 3月 30, 2023
```
* add autogen code support for merge_selected_rows

* bug fixed
```
6cd3575c
W
force sync batch norm grad sequential (#52268) · 336160cf
由 wanghuancoder 提交于 3月 30, 2023
```
* force sync batch norm grad sequential
```
336160cf
R

Skip device transfer when arg-defs is set to Allbackend (#52294) · 54497c47
由 Ruibiao Chen 提交于 3月 30, 2023

54497c47

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功