Commits · a4644c507e65ec3f38a790d9057cd82b0ee4c6cf · PaddlePaddle / Paddle

Jul 26, 2023 · 13 commits
- [BUG] fix bug of float/int/long/index Tensor (#55568) · a4644c50
  Committed by zhouweiwei2014 on Jul 26, 2023
  
  a4644c50
- [New IR]Bind core structure (#55665) · ee506c2f
  Committed by YuanRisheng on Jul 26, 2023
```
* bind ir core

* perfect code

* deal with conflict
```
  ee506c2f
- skip CopyOrAdd when tmp grad is None (#55679) · e838a4b4
  Committed by Yuang Liu on Jul 26, 2023
  
  e838a4b4
- [NewIR]Add ConvertIRType and fix some TODO for IR+CINN (#55691) · 2ade1f92
  Committed by Aurelius84 on Jul 26, 2023
```
* [NewIR]Add ConvertIRType and fix some TODO for IR+CINN

* modify into GPUPlace
```
  2ade1f92
- [Reshard] Implement replicated to split with same placement (#55552) · 9f3b5f15
  Committed by LiYuRio on Jul 26, 2023
```
* Implement replicated to split reshard function

* fix link error in clang

* refine split functor

* simplify reshard code
```
  9f3b5f15
- [0D-Tensor] CINN supports `fill_constant`, fix infershape and pass (#55563) · f5830c05
  Committed by HongyuJia on Jul 26, 2023
```
* [0D-Tensor] CINN supports fill_constant, fix infershape and pass

* fix infershape of fill_constant

* add back fill_constant to zero_tensor_trick_pass
```
  f5830c05
- Add py3.10 (#55286) · 97ec1d84
  Committed by tianshuo78520a on Jul 26, 2023
```
* Add py3.10;test=py3-ninja

* Add py3.10;test=py3-ninja

* test=py3-ninja

* test=py3-ninja

* test=py3-ninja

* test=py3-ninja

* test=py3-ninja

* Fix test error

* Fix build docker error

* Fix build docker error
```
  97ec1d84
- [PHI CAPI] support get & set random seed (#55659) · ba6dcbc9
  Committed by ronnywang on Jul 26, 2023
  
  ba6dcbc9
- New ir support save combine (#55538) · a88d36aa
  Committed by hong on Jul 26, 2023
```
* new ir support save combine

* update

* polish code
```
  a88d36aa
- Cinn error refactor (#55544) · 74266762
  Committed by Zhang Zheng on Jul 26, 2023
```
* Refactor the error message system

* fix header

* fix compile
```
  74266762
- add modernize-redundant-void-arg check (#55652) · 12fb18dd
  Committed by gouzil on Jul 26, 2023
  
  12fb18dd
- [CustomDevice] fix SplitDenseTensor (#55615) · 6c675ed9
  Committed by ronnywang on Jul 26, 2023
  
  6c675ed9
- update security advisory, test=document_fix (#55690) · f9b9b8b6
  Committed by Vigi Zhang on Jul 26, 2023
  
  f9b9b8b6
Jul 25, 2023 · 16 commits

fix a bug caused by hipcc lambda value capture (#55612) · 8db3ff1f
Committed by lishicheng1996 on Jul 25, 2023

8db3ff1f

Bugfix, fast layer norm, OOB (#55639) · 017a6164

Committed by Jeng Bai-Cheng on Jul 25, 2023

* Fix LayerNormForward perf issue

* Bugfix, fast_layer_norm OOB

* apply pre-commit

---------
Co-authored-by: Shijie Wang <jaywan@nvidia.com>

017a6164

[NewIR]Support Instruction.Run in CINN for Runtime::Program (#55680) · f9e1b2d2
Committed by Aurelius84 on Jul 25, 2023

f9e1b2d2

add all false bool indices support for index_put (#55655) · c737f0ae
Committed by 傅剑寒 on Jul 25, 2023

c737f0ae

remove fluid allreduce op (#55672) · 7da1ffbe
Committed by TaoTao Li on Jul 25, 2023

7da1ffbe

fix bugs in rnn op (#55656) · 0cd422b6
Committed by Lucas on Jul 25, 2023

0cd422b6

fix div 0 bug (#55644) · 690ffe81
Committed by wanghuancoder on Jul 25, 2023

690ffe81

Update ccache (#55136) · 6093a7ed
Committed by tianshuo78520a on Jul 25, 2023
```
* Update ccache

* del 3.7.9

* fix error
```
6093a7ed

[NewIR]new ir dygraph to static support gpu (#55620) · fb9bec5d

Committed by hong on Jul 25, 2023

* add kernel dialect

* change DenseTensorTypeStorage to DenseTensorType

* add test case

* add first pd_op to kernel dialect

* lower pd op to kernel dialect

* update

* update

* remove useless code

* add attribute print test

* fix bug

* update

* update

* update

* update

* polish code

* fix bug

* polish code and add python test

* add test

* fix test error

* relax constraint when inserting get_parameter

* add env flag

* fix bug

* dygraph2static support new ir

* fix bug

* revert test env

* change cc_test_old to cc_test

* update

* fix build_static bug

* update test

* fix type test error

* update cmake

* disable test in windows

* fix inference compile

* fix program translator error

* only run on cpu, not support gpu yet

* fix conflict

* polish code

* fix bug

* add feed with place op

* update

* remove useless unittest

* update mkldnn

* update

* update

* align mkldnn version

* new ir support builtin slice op

* fix bug

* fix phi kernel adaptor bug

* add enable static

* add enable_static

* remove useless test case

* change feed list to single variable

* update

* add feed with place and shadow output op

* fix bug

* remove useless code

* support gpu

* fix bug

* fix bug

* remove template

* add more data type

* fix compile bug

* update

* remove useless code

* revert dygraph2st test

* remove useless code

* revert op

* fix bug

* new ir dygraph2static support gpu

* remove useless code

* code polish

* add const

* revert code and remove useless code

* revert code

* revert legacy op yaml

* remove useless code

* delete std::move

---------
Co-authored-by: kangguangli <kangguangli@hotmail.com>

fb9bec5d

Call multiply_ instead of scale_ to avoid multiple DtoH copy. (#55589) · 05720257

Committed by Yiqun Liu on Jul 25, 2023

* Call multiply_ instead of scale_ to avoid multiple DtoH copy.

* Call _squared_l2_norm to calculate grad_clip.

* Fix import error.

05720257

[BugFix] fix random fail of test_bilinear_interp_v2_op (#55643) · 98c7a3e0
Committed by kangguangli on Jul 25, 2023
```
* fix random fail of test_bilinear_interp_v2_op

* reset if compiledProgram
```
98c7a3e0

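Fix memory errors caused by the grad_fn next_functions API (#55627) · 03a2f187

Committed by qiuwenbo on Jul 25, 2023

* [Experiment] Add an attribute to tensor whose value is the constant 1

* Expose gradnode and add a new gradnode method (for testing), exposed to Python so it can be accessed from the Python side

* Implement the grad_fn and next_functions APIs and expose them to Python; do some normalization

* Add a unit test

* Improve code-style

* Move the unit test file to the correct location

* Improve code-style

* Remove useless comments

* Fix "__main__ has no attribute"

* Modify the unit test file

* Modify the unit test script (temp)

* Fix memory errors caused by the grad_fn next_functions API

* Modify the unit test content

* Fix code-style issues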

03a2f187

[0D-Tensor] Fix test_elementwise_max_op unittest (#55674) · 05a40691
Committed by HongyuJia on Jul 25, 2023

05a40691

Fix reduce_ops for mixed-precision FP16 support (#55573) · ca72aa2a
Committed by jiangfan06 on Jul 25, 2023

ca72aa2a

[XPU] Add FP16 support for arg_min_max (#55642) · 14094aad
Committed by jiangfan06 on Jul 25, 2023

14094aad

add vjp interface (#55660) · a7567cd0
Committed by zhangbo9674 on Jul 25, 2023

a7567cd0

Jul 24, 2023 · 11 commits
- [AutoParallel] Simplify DistTensor namespace path (#55593) · ae2d8ba1
  Committed by Chen Weihang on Jul 24, 2023
```
* simplify dist tensor namespace path

* fix tensor dist attr decl error
```
  ae2d8ba1
- [Paddle-TRT] Convert 0D tensor to 1D tensor, increase the shape tensor's... · a3cf25e3
  Committed by chen on Jul 24, 2023
```
[Paddle-TRT] Convert 0D tensor to 1D tensor, increase the shape tensor's number count when collecting shape (#55503)

* make 0-D tensor to 1-D tensor to support Grounding-SAM and add shape check

* recover identity_op_clean_pass.cc
```
  a3cf25e3
- add IndexPutGradInfermeta to fix backward error in static-mode (#55602) · 76530a2a
  Committed by JYChen on Jul 24, 2023
```
* add IndexPutGradInfermeta to fix backward error in static-mode

* codestyle
```
  76530a2a
- Modify COPY-FROM No.13 distributed (#55236) · 38fbbe6b
  Committed by jjyaoao on Jul 24, 2023
```
Signed-off-by: jjyaoao <jjyaoao@126.com>
```
  38fbbe6b
- [Bug Fix] convert environment variables' types (#55586) · 0f0dfe9a
  Committed by Windfarer on Jul 24, 2023
  
  0f0dfe9a
- [Semi-Auto] Add transpose spmd rule (#55350) · f6161d1e
  Committed by Yichen Zhang on Jul 24, 2023
```
* [Semi-Auto] Add transpose spmd rule

* add unit test in cmake file

* log perm info
```
  f6161d1e
- [sharding stage 1 optim] Sharding comm overlap with backward (#55598) · a9f877ff
  Committed by Yuang Liu on Jul 24, 2023
  
  a9f877ff
- [PHI] add fused_softmax_mask and fused_softmax_mask_grad for CPU. (#55616) · b10b899c
  Committed by houj04 on Jul 24, 2023
  
  b10b899c
- [CINN] Remove threshold in op mapper relu6 (#55611) · 81bd57c7
  Committed by Fisher on Jul 24, 2023
```
* Just set threshold to 6 in op mapper relu6

* Remove attrs in op mapper relu6
```
  81bd57c7
- onednn: remove fc_elementwise_add fusion (#55504) · bea1f04c
  Committed by Xinyu Chen on Jul 24, 2023
```
* onednn: remove fc+eltwiseadd fusion pass
* onednn: remove post-sum fusion in fc kernel
* onednn: tests: make unfused add run into f32
```
  bea1f04c
- delete modification on pre-commit (#55519) · 5b8f0637
  Committed by 傅剑寒 on Jul 24, 2023
  
  5b8f0637

PaddlePaddle / Paddle · synced successfully almost 2 years ago
