提交 · fcd77346d37f6b4f54b5884eccf08114d7e5fd15 · PaddlePaddle / Paddle

31 3月, 2023 12 次提交
- H
  [CustomOP Optional Inplace] Custom op supports inplace optional tensor (#52216) · fcd77346
  由 HongyuJia 提交于 3月 31, 2023
```
* [CustomOP Inplace] Automap inplace dtype and shape, prepare for vector<Tensor> output

* delete custom_inplace_setup.py

* [CustomOP Optional Inplace] Custom operator supports inplace optional Tensor input

* fix bug for vector<Tensor> inplace test
```
  fcd77346
- Y
  
  fix bugs (#52377) · 3ebb7e4d
  由 YuanRisheng 提交于 3月 31, 2023
  
  3ebb7e4d
- Y
  [PHI Decoupling]Remove distribute header (#52202) · e923642e
  由 YuanRisheng 提交于 3月 31, 2023
```
* remove distribute

* fix py3 bugs

* fix gpu-ps bugs

* fix compile bugs

* fix unittest bugs
```
  e923642e
- R
  
  [CustomDevice] fix set_constant (#52360) · f22b9666
  由 ronnywang 提交于 3月 31, 2023
  
  f22b9666
- W
  [Paddle-TRT] fix skiplayernorm, add trt_version check (#52342) · 4e23af72
  由 Wangzheee 提交于 3月 31, 2023
```
* fix skiplayernorm, add trt_version check
```
  4e23af72
- E
  [GCC9][Werror]fix -Werror=maybe-uninitialized (#52265) · 74d87a61
  由 engineer1109 提交于 3月 31, 2023
```
fix with auto&
```
  74d87a61
- X
  【prim】 optimize layer_norm_grad rules (#52308) · 1da67779
  由 xiaoguoguo626807 提交于 3月 31, 2023
```
* add to sub & delete full scale

* decrease 1_div_shape_2 compute

* x_sub_mean_mul_sqrt_var_1

* delete log

* add mean var test

* nothing
```
  1da67779
- H
  
  [XPU] register bmm fp16 (#52354) · 53f5edbd
  由 houj04 提交于 3月 31, 2023
  
  53f5edbd
- 张
  [CodeStyle][UP030][UP031][UP032] using f-string (#52062) · 40e4f5a5
  由张春乔提交于 3月 31, 2023
```
* autofix
Co-authored-by: NLiyulingyue <83450930+Liyulingyue@users.noreply.github.com>

* revert changes in python/paddle/distributed/fleet/utils/hybrid_parallel_util.py

* empty commit, trigger ci

* fix test_slice

---------
Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
```
  40e4f5a5
- Z
  
  fix xpu fp16 lod_reset (#52346) · b4137338
  由 zhupengyang 提交于 3月 31, 2023
  
  b4137338
- S
  
  fix copyright date in scope_guard.h, test=document_fix (#52026) · 50c949f0
  由 sneaxiy 提交于 3月 31, 2023
  
  50c949f0
- Y
  
  use int64 for c split (#52279) (#52340) · 9fd4fd5f
  由 Yuang Liu 提交于 3月 31, 2023
  
  9fd4fd5f
30 3月, 2023 28 次提交
- Z
  move elementwise_raw_kernel to new dir (#51965) · 49461a02
  由 zhangyuqin1998 提交于 3月 30, 2023
```
* move elementwise raw

* fix

* fix
```
  49461a02
- [Zero-Dim] Support broadcast_tensors input 0D and distribution API output 0D (#51721) · 2bd0a946
  由 zhouweiwei2014 提交于 3月 30, 2023
  
  2bd0a946
- [Bug-fix] fix bug of Tensor.item() when CUDAPinnedPlace (#52322) · 0f9ec013
  由 zhouweiwei2014 提交于 3月 30, 2023
  
  0f9ec013
- Z
  
  [Sparse]Fix the bug of elementwise_grad (#52102) · aeb8c2e2
  由 zhangkaihuo 提交于 3月 30, 2023
  
  aeb8c2e2
- Z
  
  [XPU] add delete_cast_op_pass (#52305) · 8b622d58
  由 zhupengyang 提交于 3月 30, 2023
  
  8b622d58
- K
  mv paddle/fluid/platform/device/xpu/tests 2 test/xpu/cpp (#52243) · bc5bae16
  由 Kim 提交于 3月 30, 2023
```
* mv paddle/fluid/platform/device/xpu/tests 2 test/xpu/cpp

* add missing cmake
```
  bc5bae16
- Z
  [Move Test] Move prim cpp (#52173) · a445466f
  由 Zheng-Bicheng 提交于 3月 30, 2023
```
* update

* update

* update
```
  a445466f
- F
  support complex data types for libpaddle.Tensor's element get and set (#52324) · 13b12457
  由 Feiyu Chan 提交于 3月 30, 2023
```
1. add type caster for paddle's complex type, to allow pybind to automatically cast it with python's complex type;
2. add complex64 and complex128 data type for `libpaddle.Tensor`'s element get and set(which is required to perturb an element to get the numerical derivative)
3. add support for cuda pinned place in `libpaddle.Tensor` element get and set

---
4. fix a bug in op code generation.(Creation of output folder in concurrent with parsing op yamls.)
```
  13b12457
- R
  
  [AMP OP&Test] add fp16 test for linspace (#52161) · 40b30f50
  由 Roc 提交于 3月 30, 2023
  
  40b30f50
- Y
  [AMP] Add python API for collecting operator stats. (#52215) · 73544322
  由 Yiqun Liu 提交于 3月 30, 2023
```
* [AMP] Add python API for collecting operator stats.

* Fix import and polish codes.

* Add more unittest.

* Add doc for the new APIs.
```
  73544322
- W
  add autogen code support for spectral_norm (#52145) · 28927209
  由 Wang Xin 提交于 3月 30, 2023
```
* add autogen code support for spectral_norm

* bug fixed

* fix PR-CI-Static-Check fail
```
  28927209
- P
  Speedup worker (#51760) · 8ca86d72
  由 pangengzheng 提交于 3月 30, 2023
```
* support run haokanctr model in heterps-models

* polish setup.py

* polish JVM_LIB in evn_dict

* align infer auc with DistPsArch pre-stable

* async and multi thread data feed

* rewrite dense tensor intialization

* async infer shape and reuse memory
```
  8ca86d72
- Y
  
  adjust binding order (#52225) · 16ec22c4
  由 Yuanle Liu 提交于 3月 30, 2023
  
  16ec22c4
- Z
  add scatter composite rule. (#52005) · e16eb22c
  由 zxcd 提交于 3月 30, 2023
```
* add scatter composite rule.

* add public_python_api

* add python unit16 support.

* fix code style.

* add cinn to makelist

* cinn unsupport uint16, forbidden cinn when dtype==uint16.
```
  e16eb22c
- Y
  
  add xpu cumprod, group norm grad (#52089) · fb16bdc7
  由 ykkk2333 提交于 3月 30, 2023
  
  fb16bdc7
- H
  register fluid kerenls to phi [part 1] (#52014) · 93d01787
  由 huangjiyi 提交于 3月 30, 2023
```
* update assign_pos

* update attention_lstm

* update barrier

* update batch_fc

* update beam_search

* update beam_search_decode

* update bilateral_slice

* fix bug

* Handle Structure kernel for InterpreterCore::RunOperator

* fix bug

* fix rocm compile

* fix rocm compile

* Revert "fix rocm compile"

* test

* revert test and update cmake

---------
Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>
```
  93d01787
- Z
  
  [XPU] add delete_concat_op_pass (#52304) · 70ebef81
  由 zhupengyang 提交于 3月 30, 2023
  
  70ebef81
- G
  Fix bug of c_softmax_with_cross_entropy_op_xpu_op (#52296) · 8ef97088
  由 Ghost Screaming 提交于 3月 30, 2023
```
* Support ignore_index for c_softmax_with_cross_entropy_op.

* Polish code. Remove useless comments and add Testcase.

* Polish code for TestCase.

* Polish code.

* Polish code style.

* Polish code.

* Change loss calculation formula and ignore_index dtype.

* Polish TestCase.

* Fix bug of c_softmax_with_cross_entropy_op_xpu_op. Attribute 'ignore_index'
dtype is int64_t.
```
  8ef97088
- Y
  [AMP OP&Test] Register FP16 for multinomial. (#52107) · 7788b65e
  由 yunyaoXYY 提交于 3月 30, 2023
```
* add FP16 for multinomial

* fix input data

* update code

* fix FP16

* fix code
```
  7788b65e
- F
  
  rename Scalar related utility functions(use CamelCase) (#52280) · e5a0dc31
  由 Feiyu Chan 提交于 3月 30, 2023
  
  e5a0dc31
- A
  support auto generate for prelu (#51913) · d1c7b386
  由 Ainavo 提交于 3月 30, 2023
```
* support auto generate for prelu

* op_compat 中增加输入参数

* del attrs ; add kernel data_type

* add PreluGradInferMeta
```
  d1c7b386
- Z
  
  [AMP] use promote dtype when amp_level=O2 (#51063) · 6f8ab1fa
  由 Zhang Ting 提交于 3月 30, 2023
  
  6f8ab1fa
- W
  [AMP OP&Test] Strided slice fp16 and bf16 unitest (#52220) · 5cdd9f2c
  由 Wang Xinyu 提交于 3月 30, 2023
```
* stride slice fp16 and bf16 unitest

* fix code style

* add self.dtype
```
  5cdd9f2c
- R
  
  fix gcc12 error (#52318) · 77b7765f
  由 risemeup1 提交于 3月 30, 2023
  
  77b7765f
- G
  add autogen code support for sigmoid_cross_entropy_with_logits (#52263) · 710c13ed
  由 gouzil 提交于 3月 30, 2023
```
* add autogen code support for sigmoid_cross_entropy_with_logits

* add inplace
```
  710c13ed
- W
  add autogen code support for merge_selected_rows (#52274) · 6cd3575c
  由 Wang Xin 提交于 3月 30, 2023
```
* add autogen code support for merge_selected_rows

* bug fixed
```
  6cd3575c
- W
  force sync batch norm grad sequential (#52268) · 336160cf
  由 wanghuancoder 提交于 3月 30, 2023
```
* force sync batch norm grad sequential
```
  336160cf
- J
  
  [Test Mv] remove infrt (#52270) · 551ff882
  由 jjyaoao 提交于 3月 30, 2023
  
  551ff882

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功