提交 · 86858a5acabf666ea163ec1f9ce76f39e4012585 · PaddlePaddle / Paddle

28 6月, 2023 21 次提交
- L
  
  fix test_conv3d_transpose_op A100 test fail (#54913) · 86858a5a
  由 LokeZhou 提交于 6月 28, 2023
  
  86858a5a
- J
  
  revert bug deps (#54901) · 6fc5b7a5
  由 JZ-LIANG 提交于 6月 28, 2023
  
  6fc5b7a5
- R
  
  support auto generate for static op reduce_prod (#54316) · ac94b135
  由 RedContritio 提交于 6月 28, 2023
  
  ac94b135
- W
  
  [IR] add verify api for new ir (#54922) · add77ccb
  由 winter-wang 提交于 6月 28, 2023
  
  add77ccb
- Z
  add gc for multi jobs (#54897) · fcffd84d
  由 zhaoyingli 提交于 6月 28, 2023
```
* add gc for multi jobs

* fix job.h

* update OpInfo to OpInOutInfo

* update get_skip_gc_vars algo order
```
  fcffd84d
- L
  [XPU][PHI Kernels] add int_with_ll quantization for conv kernels (#54827) · bd67209f
  由 lijin23 提交于 6月 28, 2023
```
* add int_with_ll to conv

* fix bugs when output_size is specified for conv2d_transpose
```
  bd67209f
- H
  Fix output vector type bug (#54865) · 9c2dae1a
  由 hong 提交于 6月 28, 2023
```
* add fetch kernel

* support fetch var in new ir

* fix bug

* polish code

* change array equal to np.testing

* support feed in new ir

* fix bug

* try to hack combine op

* add scope guard

* revert atan2 op

* polish code

* fix vector type bug

* modify feed data type
```
  9c2dae1a
- Z
  Add set_lr_scheduler api (#54752) · 99c593bc
  由 zqw_1997 提交于 6月 28, 2023
```
* demo1

* add test cases

* modify the usage of StepDecay

* refine
```
  99c593bc
- L
  replace PiecewiseDecay, StepDecay, MultiStepDecay, LambdaDecay with 2.0 version (#53992) · 63f242b6
  由 LoneRanger 提交于 6月 28, 2023
```
* replace PiecewiseDecay(LearningRateDecay) with PiecewiseDecay(LRScheduler)

* fix bug

* fix bug

* replace the StepDecay,MultiStepDecay,LambdaDecay with 2.0 version
```
  63f242b6
- R
  
  fix cinn compile error (#54899) · 54b86fd4
  由 risemeup1 提交于 6月 28, 2023
  
  54b86fd4
- S
  [BugFix] Fix bug for binary_cross_entropy_with_logits loss (#54869) · bb42d870
  由 Siming Dai 提交于 6月 28, 2023
```
* add pos_weight in kernel

* fix unittest

* fix xpu

* fix bce unittest, change infermeta order
```
  bb42d870
- R
  [ROCM] fix cupti, rccl on rocm (#54807) · 57da105c
  由 ronnywang 提交于 6月 28, 2023
```
* [ROCM] fix cupti, hipcub

* update

* update
```
  57da105c
- 6
  Migrate the CI of CINN (#54890) · 6cfe9bfd
  由 6clc 提交于 6月 28, 2023
```
* test=cinnunit

* test=cinnunit

* sync to develop of cinn

* test=cinnunit

* test=cinnunit
```
  6cfe9bfd
- X
  [XPU] fix the dataloader problem in RDMA env (#54150) · 15c87528
  由 XiaociZhang 提交于 6月 28, 2023
```
* [kunlun] fix the dataloader problem in RDMA env

When running multi-machine training with Paddle DataLoader, an
unexpected segmentfault will be raised in DataLoader Process,
where the traceback goes all back to a runtime error that dataloader
workers exit unexpectedly. Similar problems have been discussed
that lead to a misbehavior of OpenCV working in multiprocessing
environment.
See
https://stackoverflow.com/questions/54013846/pytorch-dataloader-stucked-if-using-opencv-resize-method

* code style

* fix 'RuntimeError: context has already been set'

* Update dataloader_iter.py

spawn method raise error 'Can't pickle local object' in some situations

* code format check

* code style
```
  15c87528
- L
  
  add 1f1b pass (#54787) · 133e05c1
  由 LiYuRio 提交于 6月 28, 2023
  
  133e05c1
- K
  [IR] complement ir type (#54911) · ffc1b027
  由 kangguangli 提交于 6月 28, 2023
```
* complement ir type

* fix ir_printer
```
  ffc1b027
- X
  [XPU] fix compile issue for XPTI (#54800) · 4588892a
  由 XiaociZhang 提交于 6月 28, 2023
```
* [XPU] fix compile issue for XPTI

* bugfix

* bugfix
```
  4588892a
- Y
  
  Support 0-D Tensor for check_numerics_kernel. (#54868) · b7fbd339
  由 Yiqun Liu 提交于 6月 28, 2023
  
  b7fbd339
- B
  [inference][trt]add Einsum op (#54860) · 69bf5ee8
  由 bukejiyu 提交于 6月 28, 2023
```
* add einsum layer
```
  69bf5ee8
- Q
  
  update xpu api date (#54900) · 6da0a24d
  由 QingshuChen 提交于 6月 28, 2023
  
  6da0a24d
- W
  
  [IR] add op_operand api for ir::Operation. (#54875) · c3077ec1
  由 winter-wang 提交于 6月 28, 2023
  
  c3077ec1
27 6月, 2023 19 次提交

W

[IR] rectify the verify api (#54895) · 96652265
由 winter-wang 提交于 6月 27, 2023

96652265
W
[Paddle Inference]Enhance the shape check of trt_embedding_eltwise_layernorm_fuse_pass,… (#54861) · e49c17d2
由 Wangzheee 提交于 6月 27, 2023
```
* Enhance the shape check of trt_embedding_eltwise_layernorm_fuse_pass, embedding_eltwise_layernorm_fuse_pass
```
e49c17d2
C

support zero_dim for some prim ops (#54892) · f8d02146
由 Charles-hit 提交于 6月 27, 2023

f8d02146

[BugFix] fix bugs in DCU unit tests (#54874) · abc1c3d4

由 lishicheng1996 提交于 6月 27, 2023

* block bf16 tests on ROCM

* block more bf16 tests on ROCM

* some unittest cases doesn't have kernels on ROCm

* some unittest cases doesn't have kernels on ROCm

* fix code style

abc1c3d4

Z
delete swish_raw (#54536) · 0cdaafea
由 zhangyuqin1998 提交于 6月 27, 2023
```
* delete swish_raw

* fix

* Update activation_kernel.cc

* fix
```
0cdaafea
X
【prim】modify eular_beam (#54736) · 7c2c965d
由 xiaoguoguo626807 提交于 6月 27, 2023
```
* modify eular_beam

* modify matmul infermeta

* add test

* modify timeout
```
7c2c965d
R

fix compiler error (#54883) · 8dc97857
由 risemeup1 提交于 6月 27, 2023

8dc97857

[Semi-Auto] SPMD Parallel Rule Base (#53863) · 6863e2ae

由 JZ-LIANG 提交于 6月 27, 2023

* base rule

* add sharidng merge

* add sharidng axis merge

* define unified data class for inferencing dist_attr

* test wrap DistTensorSpec in dygraph mode

* matmul main logic done

* define unified data class for inferencing dist_attr

---------
Co-authored-by: NYichen Zhang <zhangyichen03@baidu.com>

6863e2ae

L

fix bug when place 'use_cudnn' in extra (#54766) · 689e27af
由 lzydev 提交于 6月 27, 2023

689e27af
X

add xpu_optimize_cachekv_initialization_pass (#54809) · 610a47dd
由 xinxinZi 提交于 6月 27, 2023

610a47dd
L
replace the CosineDecay in fluid with 2.0 version (#54829) · 5a804830
由 LoneRanger 提交于 6月 27, 2023
```
* remove the CosineDecay in fluid

* Update test_basic_api_transformation.py
```
5a804830

replace NaturalExpDecay, ExponentialDecay, InverseTimeDecay with 2.0 version (#54424) · de60c1d1

由 LoneRanger 提交于 6月 27, 2023

* remove the NaturalExpDecay in fluid

* fix bug

* remove the ExponentialDecay in fluid

* remove the InverseTimeDecay in fluid

* remove the InverseTimeDecay class

* fix bug

de60c1d1

Y

fix xpu retry allocator bug (#54847) · 5bbbf5dd
由 ykkk2333 提交于 6月 27, 2023

5bbbf5dd
add all_to_all phi operator (#54797) · 158b7ae5
由 TaoTao Li 提交于 6月 27, 2023
```
* add all_to_all phi operator, kernel, api

* add all_to_all ut

* tinyfix
```
158b7ae5
周

commit (#54894) · 70288456
由周周周提交于 6月 27, 2023

70288456

Code merge | Merge CINN into Paddle (#54749) · 67c69dca

由 6clc 提交于 6月 27, 2023

* feat(cmake): add cmake of cinn

* feat(cmake): add cmake of cinn python test

* feat(cmake): add jit

* feat(cmake): test/CMakeList.txt

* feat(cmake): rebase to develop

* feat(cmake): remove some flags

* fix(cmake): fix cinn's gflags depends

* feat(cmake): add ci scripts of cinn

* feat(cmake): copy code of cinn

* fix(cmake): fix cinn third_party model path

* gflags dynamic dependce

* fix ci build_demo

* tmp update to c++17 of cinn-only test

* fix cinn only with c++17

67c69dca

W

[IR&PASS] part 3-3: Add PatternRewrite Driver code. (#54738) · 72b8c7c2
由 Wilber 提交于 6月 27, 2023

72b8c7c2
Z

delete_assign_op_pass (#54887) · 813266a2
由 zhupengyang 提交于 6月 27, 2023

813266a2

New ir support data transfer (#54763) · b58869fa

由 hong 提交于 6月 27, 2023

* add kernel dialect

* change DenseTensorTypeStorage to DenseTensorType

* add test case`

* add first pd_op to kernel dialect

* lower pd op to kernel dialect

* update

* update

* remove useless code

* add attrite print test

* fix bug

* update

* update

* update

* update

* polish code

* fix bug

* polish  code  and add python test

* add test

* fix test error

* add env flag

* fix bug

* revert test env

* change cc_test_old to cc_test

* fix build_static bug

* fix type test error

* udpate cmake

* disable test in windows

* update

* update

* fix bug

* split file

* fix conflict

* polish code and fix conflict

* support place transformer

* finish bug

* add gpu flags

* fix with cuda macro

* update

* add scope guard

* polish code

b58869fa

PaddlePaddle / Paddle 大约 2 年 前同步成功

PaddlePaddle / Paddle
大约 2 年前同步成功