提交 · f1873b9018ff1302fb9f2b369d8279a8402d3c74 · PaddlePaddle / Paddle

02 8月, 2022 10 次提交

[Eager] use eager final state instead intermediate state (#44722) · f1873b90

由 Weilong Wu 提交于 8月 02, 2022

* [Eager] call final_state_slice under eager mode

* rm useless comments

* use eager final state instead intermidiate state

* update fill_constant yaml

* update fill_constant yaml

* modify wrapped_infermeta_gen logic to fix special case

* fix slice in manipulation

* use fill_constant_

* modify slice infermeta

* rm final_state_conv2d

* use final_state_slice

* use final_state_slice only

* polish slice, use final state

* add paddle_throw for SplitInferMeta

* rm fill_constant_ temply

* recover array_equal, not allclose

* recover original code

f1873b90

[Phi] Move QR to Phi (#44742) · 2cf2e786

由 Yulong Ao 提交于 8月 02, 2022

* [Phi] Move Qr to the Phi

* [Phi] Regiter the cpu grad kernel for qr

* [Phi] Share the cuda kernels to lstsq

* [Phi] Remove some improper inlcude files

* [Phi] Modify codes based on the reviews

* [Phi] Remove unecessary files and add the cuda_only comment

* [Phi] Remove the unecessary include file

* [Phi] Remove qr_op.cu and lstsq_op.cu

2cf2e786

X
[Eager]Menual fused_gemm_epilogue (#44748) · a2980169
由 xiaoguoguo626807 提交于 8月 02, 2022
```
* manuel_fused_gemm_epilogue
```
a2980169
H
[XPU] fp16 for layer_norm op (#44778) · 4c3e13de
由 houj04 提交于 8月 02, 2022
```
* [XPU] fp16 for layer_norm op. test=kunlun
```
4c3e13de

[Dy2St]Raise TypeError when call to_static to convert a method of a common class (#44781) · c3d4a3d8

由 WangZhen 提交于 8月 02, 2022

* Fix to_static error when call to_static to convert a method of a common class

* raise typerror when class no inherits from layer

* Fix @to_static

c3d4a3d8

X

fix test-einsum-v2 unittest in cuda 11.7 (#44772) · acfdb8b3
由 xiongkun 提交于 8月 02, 2022

acfdb8b3

write trainer_desc file (#44702) · 65a3530c

由 ziyoujiyi 提交于 8月 02, 2022

* back fl

* delete ssl cert

* .

* make warning

* .

* unittest paral degree

* solve unittest

* heter & multi cloud commm ready

* .

* .

* fl-ps v1.0

* .

* support N + N mode

* .

* .

* .

* .

* delete print

* .

* .

* .

* .

* fix bug

* .

* .

* fl-ps with coordinator ready

* merge dev

* update message parse only

* update fl client scheduler

* fix bug

* update multithreads sync

* fix ci errors

* update role_maker.py

* update role_maker.py

* fix ci error: windows py import error

* fix ci error: windows py import error

* fix windows ci pylib import error

* add dump fields & params

* try to fix windows import fleet error

* fix ps FLAGS error

* fix logging risk

* fix logging possible risk

* write trainer_desc file

65a3530c

R
Skip inplace for coalesce_tensor_op outputs (#44795) · bb22e59c
由 Ruibiao Chen 提交于 8月 02, 2022
```
* Skip inplace for coalesce_tensor_op outputs

* Fix typos

* Add UTs

* Fix typos
```
bb22e59c

[phi] add yolov3_loss yaml and unittest (#44476) · c7cf12fc

由 ccrrong 提交于 8月 02, 2022

* add yaml and unittest

* update yaml

* update backward yaml and unittest

* update yaml

* add Yolov3LossGradInferMeta

* update yolov3_loss_op.cc

* fix bug

* code format

c7cf12fc

K

fix ut new_group_api (#44764) · d8fedcb9
由 kuizhiqing 提交于 8月 02, 2022

d8fedcb9

01 8月, 2022 10 次提交

unify gpu context (#44740) · 86763023

由 Leo Chen 提交于 8月 01, 2022

* remove cudaDeviceContext

* remove more template

* fix rocm compile

* remove alias name CUDADeviceContext

* fix compile

* fix tests

* revert changes

86763023

Z

Revert for cmake static library errors on XPU KP #44762 · f15d930a
由 zhiboniu 提交于 8月 01, 2022

f15d930a

GPUGraph merge to develop (#44594) · 798670bb

由 danleifeng 提交于 8月 01, 2022

798670bb

[Sparse] optimize sparse attention (#44743) · 1149a378
由 zhouweiwei2014 提交于 8月 01, 2022

1149a378
R

[CI] CI for Distributed (#44085) · f064ead6
由 Roc 提交于 8月 01, 2022

f064ead6
L

fix all_gather_object with various length, test=allcases (#44718) · e48cb42b
由 LiYuRio 提交于 8月 01, 2022

e48cb42b
Z
Fix test and doc (#44735) · 3e8708bc
由 zhangkaihuo 提交于 8月 01, 2022
```
* fix test and doc
```
3e8708bc

[operator migration] Migrate unstack_op and nms_op (#44424) · 9d2e0ecb

由 Thomas Young 提交于 8月 01, 2022

* update unstack_op

* update unstack_op

* update unstack_op

* fix unstack test

* update unstack

* update with remote

* fix unstack_test.py

* temp_save_change_nms_op

* add nms test

* update nms fix

* update unstack_op

* temp save change

* finish fix nms_op

* pass nms test

* fix CI

* fix ops test

* save change

* fix code style

* fix code style

* fix ci and codestyle

* fix ci
Co-authored-by: NShiningZhang <zhang_liang1991@126.com>

9d2e0ecb

L
migrate overlap_add and overlap_add_grad op (#44739) · 2a8219c1
由 levi131 提交于 8月 01, 2022
```
* update code format

* add ymal and test

* update for comments
```
2a8219c1

[PHI] Move lu_unpack to phi (#44674) · c905a9e9

由 Lin Manhui 提交于 8月 01, 2022

* Add kernel declarations

* Copy kernel implementation code

* Transfer implementation code

* Register new kernels

* Remove old kernels

* Fix code style

* Fix bugs

* mutable_data->HostAlloc

* Transfer infermeta

* Add yaml and update python api

* Add PADDLE_WITH_HIP check

* Update unittests

* Add kernel declarations

* Copy kernel implementation code

* Transfer kernel implementation code

* Register new kernels

* Remove old kernels

* Add lu_unpack_sig

* Fix bugs

* Fix bugs

* Fix bugs

* Optimize directory structure

* Add output checks

* Update include files

* lu_impl.h->lu_kernel_impl.h

* Transfer infermeta

* Add yaml and update python api

* Add check_eager
Co-authored-by: NBobholamovic <linmanhui@baidu.com>

c905a9e9

30 7月, 2022 1 次提交
- Z
  Phi prior box (#44431) · d92b2f2d
  由 zhiboniu 提交于 7月 30, 2022
```
* phi_prior_box

* add float[] support

* phi_prior_box_optest

* update
```
  d92b2f2d
29 7月, 2022 18 次提交

【PaddlePaddle Hackathon 3 No.12】为 Paddle 新增 pairwise_distance (#44161) · 46be6854

由 Ainavo 提交于 7月 29, 2022

* add paddle.nn.functional.pairwise_distance (cattidea/Paddle#273)
* remove the test case for undefined behavior
Co-authored-by: NSigureMo <sigure.qaq@gmail.com>

46be6854

Z
Add sparse SyncBatchNorm (#43520) · 0a2db7c8
由 zhangkaihuo 提交于 7月 29, 2022
```
* add sparse SyncBatchNorm
```
0a2db7c8

[API/OP] Migrate Lstsq op into phi (#44318) · ab2aaf8b

由 Haohongxiang 提交于 7月 29, 2022

* migrate lstsq op

* update

* fix bugs for CIs

* update

* fix bugs

* add uts

* update

* update

* update

* fix bugs of jip

* fix bugs of hip

* update

* update according to review

* update

* update

* update

* update

ab2aaf8b

C

add dist op costs (#44701) · ec1e0d5a
由 caozhou 提交于 7月 29, 2022

ec1e0d5a
Q
add some fp16 op for kunlun resnet50 model (#44672) · fecbc958
由 QingshuChen 提交于 7月 29, 2022
```
* add some fp16 op for kunlun resnet50 model
*test=kunlun

* tmp
*test=kunlun
```
fecbc958
Z

phi_multiclass_nms3 (#44613) · a9919903
由 zhiboniu 提交于 7月 29, 2022

a9919903
A
add FLAGS_enable_api_kernel_fallback (#44706) · e439d735
由 Aganlengzi 提交于 7月 29, 2022
```
* add FLAGS_enable_api_kernel_fallback

* deal with more cases

* add ut for coverage
```
e439d735
T
【PaddlePaddle Hackathon 3 No.15】为 Paddle 新增 count_nonzero (#44169) · a6c50a6c
由 thunder95 提交于 7月 29, 2022
```
* add count_nonzero api

* remove grad test
```
a6c50a6c

Phi softplus migration (#44542) · 05515662

由 Wang Bojun 提交于 7月 29, 2022

* add yaml and utests of phi softplus

add yaml of softplus

fix softplus bug in phi

* update utests

* bug fix

* bug fix for test_layers

* layer api match

* match def and doc in ops.py

* doc polish

* fix unwanted modified of thresholded_relu

* style imporve

05515662

C
skip cast trt convert when input dtype is bool (#44716) · 5d94618d
由 ccrrong 提交于 7月 29, 2022
```
* skip cast trt convert when input dtype is bool
```
5d94618d

[Auto parallel] Optimization Tuning (#43782) · 72f2ed43

由 JZ-LIANG 提交于 7月 29, 2022

* fixed bug for pass & engine

* fixed bug for benchmark GPT-3

* add tuner & profiler

* add algorithms & config

72f2ed43

move CUDAStream to phi (#44529) · da3743fd

由 Leo Chen 提交于 7月 29, 2022

* init

* move CUDAStream to phi

* fix compilation

* merge develop

* add stream_owned_ member

* split cuda_stream.h

* fix cpu compile

* fix constructor

* fix bug

* fix windows compile

* fix inference test_levit

* fix windows tests

da3743fd

A

update to sdk2.6.0 (#44673) · 23ad0cc4
由 Allen Guo 提交于 7月 29, 2022

23ad0cc4
J

Support backward final hook (#44686) · 8c43c0fe
由 Jiabin Yang 提交于 7月 29, 2022

8c43c0fe
F

[MLU] add pytest for mlu strided_slice kernel (#44523) · b7496bcb
由 fwenguang 提交于 7月 29, 2022

b7496bcb

[PHI] Move lu to phi (#44605) · 3d88816e

由 Lin Manhui 提交于 7月 29, 2022

* Add kernel declarations

* Copy kernel implementation code

* Transfer implementation code

* Register new kernels

* Remove old kernels

* Fix code style

* Fix bugs

* mutable_data->HostAlloc

* Transfer infermeta

* Add yaml and update python api

* Add PADDLE_WITH_HIP check

* Update unittests

* Fix bugs

* Fix bugs

* Optimize directory structure

* Add output checks

* lu_impl.h->lu_kernel_impl.h
Co-authored-by: NBobholamovic <linmanhui@baidu.com>

3d88816e

[Phi] Add yaml for assign_value (#44596) · 88584396

由 Yulong Ao 提交于 7月 29, 2022

* [Phi] Add yaml for assign_value

* [Phi] Fix the bug of the assign api and modify the unittest

* [Phi] Fix the bug when the tensor does not have the backend info

* [Phi] Replace the functional-style cast init by the brace-init

* [Phi] Cast the data explicitly

88584396

H

[XPU] add sampling_id op, add top_k op, update xdnn api. test=kunlun (#44704) · e61f48c1
由 houj04 提交于 7月 29, 2022

e61f48c1

28 7月, 2022 1 次提交
- R
  
  Skip CUDA Graph case for standalone executor (#44693) · e9b92018
  由 Ruibiao Chen 提交于 7月 28, 2022
  
  e9b92018

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功