Commits · b413733824452d624a06c0fa74aec1035b4af253 · PaddlePaddle / Paddle

31 Mar, 2023 3 commits
- fix xpu fp16 lod_reset (#52346) · b4137338
  Authored by zhupengyang on Mar 31, 2023
  
  b4137338
- fix copyright date in scope_guard.h, test=document_fix (#52026) · 50c949f0
  Authored by sneaxiy on Mar 31, 2023
  
  50c949f0
- use int64 for c split (#52279) (#52340) · 9fd4fd5f
  Authored by Yuang Liu on Mar 31, 2023
  
  9fd4fd5f
30 Mar, 2023 35 commits
- move elementwise_raw_kernel to new dir (#51965) · 49461a02
  Authored by zhangyuqin1998 on Mar 30, 2023
```
* move elementwise raw

* fix

* fix
```
  49461a02
- [Zero-Dim] Support broadcast_tensors input 0D and distribution API output 0D (#51721) · 2bd0a946
  Authored by zhouweiwei2014 on Mar 30, 2023
  
  2bd0a946
- [Bug-fix] fix bug of Tensor.item() when CUDAPinnedPlace (#52322) · 0f9ec013
  Authored by zhouweiwei2014 on Mar 30, 2023
  
  0f9ec013
- [Sparse]Fix the bug of elementwise_grad (#52102) · aeb8c2e2
  Authored by zhangkaihuo on Mar 30, 2023
  
  aeb8c2e2
- [XPU] add delete_cast_op_pass (#52305) · 8b622d58
  Authored by zhupengyang on Mar 30, 2023
  
  8b622d58
- mv paddle/fluid/platform/device/xpu/tests 2 test/xpu/cpp (#52243) · bc5bae16
  Authored by Kim on Mar 30, 2023
```
* mv paddle/fluid/platform/device/xpu/tests 2 test/xpu/cpp

* add missing cmake
```
  bc5bae16
- [Move Test] Move prim cpp (#52173) · a445466f
  Authored by Zheng-Bicheng on Mar 30, 2023
```
* update

* update

* update
```
  a445466f
- support complex data types for libpaddle.Tensor's element get and set (#52324) · 13b12457
  Authored by Feiyu Chan on Mar 30, 2023
```
1. add type caster for paddle's complex type, to allow pybind to automatically cast it with python's complex type;
2. add complex64 and complex128 data type for `libpaddle.Tensor`'s element get and set(which is required to perturb an element to get the numerical derivative)
3. add support for cuda pinned place in `libpaddle.Tensor` element get and set

---
4. fix a bug in op code generation.(Creation of output folder in concurrent with parsing op yamls.)
```
  13b12457
- [AMP OP&Test] add fp16 test for linspace (#52161) · 40b30f50
  Authored by Roc on Mar 30, 2023
  
  40b30f50
- [AMP] Add python API for collecting operator stats. (#52215) · 73544322
  Authored by Yiqun Liu on Mar 30, 2023
```
* [AMP] Add python API for collecting operator stats.

* Fix import and polish codes.

* Add more unittest.

* Add doc for the new APIs.
```
  73544322
- add autogen code support for spectral_norm (#52145) · 28927209
  Authored by Wang Xin on Mar 30, 2023
```
* add autogen code support for spectral_norm

* bug fixed

* fix PR-CI-Static-Check fail
```
  28927209
- Speedup worker (#51760) · 8ca86d72
  Authored by pangengzheng on Mar 30, 2023
```
* support run haokanctr model in heterps-models

* polish setup.py

* polish JVM_LIB in evn_dict

* align infer auc with DistPsArch pre-stable

* async and multi thread data feed

* rewrite dense tensor intialization

* async infer shape and reuse memory
```
  8ca86d72
- adjust binding order (#52225) · 16ec22c4
  Authored by Yuanle Liu on Mar 30, 2023
  
  16ec22c4
- add scatter composite rule. (#52005) · e16eb22c
  Authored by zxcd on Mar 30, 2023
```
* add scatter composite rule.

* add public_python_api

* add python unit16 support.

* fix code style.

* add cinn to makelist

* cinn unsupport uint16, forbidden cinn when dtype==uint16.
```
  e16eb22c
- add xpu cumprod, group norm grad (#52089) · fb16bdc7
  Authored by ykkk2333 on Mar 30, 2023
  
  fb16bdc7
- register fluid kerenls to phi [part 1] (#52014) · 93d01787
  Authored by huangjiyi on Mar 30, 2023
```
* update assign_pos

* update attention_lstm

* update barrier

* update batch_fc

* update beam_search

* update beam_search_decode

* update bilateral_slice

* fix bug

* Handle Structure kernel for InterpreterCore::RunOperator

* fix bug

* fix rocm compile

* fix rocm compile

* Revert "fix rocm compile"

* test

* revert test and update cmake

---------
Co-authored-by: chenruibiao <chenruibiao@baidu.com>
```
  93d01787
- [XPU] add delete_concat_op_pass (#52304) · 70ebef81
  Authored by zhupengyang on Mar 30, 2023
  
  70ebef81
- Fix bug of c_softmax_with_cross_entropy_op_xpu_op (#52296) · 8ef97088
  Authored by Ghost Screaming on Mar 30, 2023
```
* Support ignore_index for c_softmax_with_cross_entropy_op.

* Polish code. Remove useless comments and add Testcase.

* Polish code for TestCase.

* Polish code.

* Polish code style.

* Polish code.

* Change loss calculation formula and ignore_index dtype.

* Polish TestCase.

* Fix bug of c_softmax_with_cross_entropy_op_xpu_op. Attribute 'ignore_index'
dtype is int64_t.
```
  8ef97088
- [AMP OP&Test] Register FP16 for multinomial. (#52107) · 7788b65e
  Authored by yunyaoXYY on Mar 30, 2023
```
* add FP16 for multinomial

* fix input data

* update code

* fix FP16

* fix code
```
  7788b65e
- rename Scalar related utility functions(use CamelCase) (#52280) · e5a0dc31
  Authored by Feiyu Chan on Mar 30, 2023
  
  e5a0dc31
- support auto generate for prelu (#51913) · d1c7b386
  Authored by Ainavo on Mar 30, 2023
```
* support auto generate for prelu

* add input parameters in op_compat

* del attrs ; add kernel data_type

* add PreluGradInferMeta
```
  d1c7b386
- [AMP] use promote dtype when amp_level=O2 (#51063) · 6f8ab1fa
  Authored by Zhang Ting on Mar 30, 2023
  
  6f8ab1fa
- [AMP OP&Test] Strided slice fp16 and bf16 unitest (#52220) · 5cdd9f2c
  Authored by Wang Xinyu on Mar 30, 2023
```
* stride slice fp16 and bf16 unitest

* fix code style

* add self.dtype
```
  5cdd9f2c
- fix gcc12 error (#52318) · 77b7765f
  Authored by risemeup1 on Mar 30, 2023
  
  77b7765f
- add autogen code support for sigmoid_cross_entropy_with_logits (#52263) · 710c13ed
  Authored by gouzil on Mar 30, 2023
```
* add autogen code support for sigmoid_cross_entropy_with_logits

* add inplace
```
  710c13ed
- add autogen code support for merge_selected_rows (#52274) · 6cd3575c
  Authored by Wang Xin on Mar 30, 2023
```
* add autogen code support for merge_selected_rows

* bug fixed
```
  6cd3575c
- force sync batch norm grad sequential (#52268) · 336160cf
  Authored by wanghuancoder on Mar 30, 2023
```
* force sync batch norm grad sequential
```
  336160cf
- [Test Mv] remove infrt (#52270) · 551ff882
  Authored by jjyaoao on Mar 30, 2023
  
  551ff882
- Skip device transfer when arg-defs is set to Allbackend (#52294) · 54497c47
  Authored by Ruibiao Chen on Mar 30, 2023
  
  54497c47
- [BugFix]Fix segment fault in order setting (#52293) · d2cdc7e3
  Authored by ShenLiang on Mar 29, 2023
```
* fix bug in proto

* add utest
```
  d2cdc7e3
- fix the compare in PD_MEA_CHECK_OVERFLOW (#52300) · 155018ee
  Authored by Danyang Zhang on Mar 30, 2023
  
  155018ee
- Change some op with xpu control (#52067) · 1faa06f0
  Authored by lzydev on Mar 30, 2023
```
* change op with xpu

* change range yaml

* fix bug in generate_op.py
```
  1faa06f0
- [CINN] pass global seed to CINN (#52078) · 94aea284
  Authored by jiangcheng on Mar 30, 2023
```
* [CINN] pass global seed to CINN

* fix cu not include cinn/runtime/flags.h bug

* fix DefaultCUDAGenerator should has device id bug
```
  94aea284
- [CodeStyle][C416][C417] rewrite unnecessary comprehension with function call... · 929892c3
  Authored by cyberslack_lee on Mar 30, 2023
```
[CodeStyle][C416][C417] rewrite unnecessary comprehension with function call and use generator instead of map (#52140)

* codestyle c416 c417

* fix error

* fix inc

* unify all C4 rules into one

* fix inc

---------
Co-authored-by: SigureMo <sigure.qaq@gmail.com>
```
  929892c3
- Add Gloo SendRecv Function (#52221) · b8850521
  Authored by yuehuayingxueluo on Mar 30, 2023
```
* add gloo  send_recv

* fix code_stype

* fix CI bug

* fix send_recv.cc

* add send_recv without sync_op

* fix send_recv test

* fix gather.cc
```
  b8850521
29 Mar, 2023 2 commits

[AMP OP&Test] pad3d add unittests of fp16 and bf16 (#51015) · f86d0be7

Authored by zengshao0622 on Mar 29, 2023

* pad3d add unittests of fp16 and bf16

* pad3d add unittests of fp16 and bf16

* fix cuda place

* fix random to uniform

* fix class name

* fix fp16 max relative error to 1.5e-3

* add dytpe register for onednn

* add pad uint16 check of common.py

* remove check_eager

* test_check_grad --> test_check_grad_normal

f86d0be7

Clear the infrt-related code (#52273) · da5a2584
Authored by jjyaoao on Mar 29, 2023
```
* Clear the infrt-related code

* remove tools/infrt
```
da5a2584

PaddlePaddle / Paddle synced successfully about 1 year ago