Commits · e3766da649244ab9290eeb4fc475b693e0ef3227 · Crayon鑫 / Paddle

Jul 29, 2022 · 11 commits

[LAUNCH] fix set args bug (#44717) · e3766da6
Committed by kuizhiqing on Jul 29, 2022

e3766da6
skip cast trt convert when input dtype is bool (#44716) · 5d94618d
Committed by ccrrong on Jul 29, 2022
```
* skip cast trt convert when input dtype is bool
```
5d94618d

[Auto parallel] Optimization Tuning (#43782) · 72f2ed43

Committed by JZ-LIANG on Jul 29, 2022

* fixed bug for pass & engine

* fixed bug for benchmark GPT-3

* add tuner & profiler

* add algorithms & config

72f2ed43

move CUDAStream to phi (#44529) · da3743fd

Committed by Leo Chen on Jul 29, 2022

* init

* move CUDAStream to phi

* fix compilation

* merge develop

* add stream_owned_ member

* split cuda_stream.h

* fix cpu compile

* fix constructor

* fix bug

* fix windows compile

* fix inference test_levit

* fix windows tests

da3743fd

update to sdk2.6.0 (#44673) · 23ad0cc4
Committed by Allen Guo on Jul 29, 2022

23ad0cc4
Support backward final hook (#44686) · 8c43c0fe
Committed by Jiabin Yang on Jul 29, 2022

8c43c0fe
[MLU] add pytest for mlu strided_slice kernel (#44523) · b7496bcb
Committed by fwenguang on Jul 29, 2022

b7496bcb

[PHI] Move lu to phi (#44605) · 3d88816e

Committed by Lin Manhui on Jul 29, 2022

* Add kernel declarations

* Copy kernel implementation code

* Transfer implementation code

* Register new kernels

* Remove old kernels

* Fix code style

* Fix bugs

* mutable_data->HostAlloc

* Transfer infermeta

* Add yaml and update python api

* Add PADDLE_WITH_HIP check

* Update unittests

* Fix bugs

* Fix bugs

* Optimize directory structure

* Add output checks

* lu_impl.h->lu_kernel_impl.h

Co-authored-by: Bobholamovic <linmanhui@baidu.com>

3d88816e

[Phi] Add yaml for assign_value (#44596) · 88584396

Committed by Yulong Ao on Jul 29, 2022

* [Phi] Add yaml for assign_value

* [Phi] Fix the bug of the assign api and modify the unittest

* [Phi] Fix the bug when the tensor does not have the backend info

* [Phi] Replace the functional-style cast init by the brace-init

* [Phi] Cast the data explicitly

88584396

fused_fc_elementwise_layernorm_op support fp16 (#44710) · 856f741a
Committed by ming1753 on Jul 29, 2022
```
* fused_fc_elementwise_layernorm support fp16

* fused_fc_elementwise_layernorm support double
```
856f741a
[XPU] add sampling_id op, add top_k op, update xdnn api. test=kunlun (#44704) · e61f48c1
Committed by houj04 on Jul 29, 2022

e61f48c1

Jul 28, 2022 · 23 commits

clone ort_predictor reuse session (#44703) · 72b65d6b
Committed by heliqi on Jul 28, 2022

72b65d6b
[Eager] fix lerp grad kernel logic (#44705) · bd813d35
Committed by Weilong Wu on Jul 28, 2022

bd813d35
Skip CUDA Graph case for standalone executor (#44693) · e9b92018
Committed by Ruibiao Chen on Jul 28, 2022

e9b92018

fix logging debug level (#44684) · 8aa286dd

Committed by ziyoujiyi on Jul 28, 2022

* back fl

* delete ssl cert

* .

* make warning

* .

* unittest paral degree

* solve unittest

* heter & multi cloud comm ready

* .

* .

* fl-ps v1.0

* .

* support N + N mode

* .

* .

* .

* .

* delete print

* .

* .

* .

* .

* fix bug

* .

* .

* fl-ps with coordinator ready

* merge dev

* update message parse only

* update fl client scheduler

* fix bug

* update multithreads sync

* fix ci errors

* update role_maker.py

* update role_maker.py

* fix ci error: windows py import error

* fix ci error: windows py import error

* fix windows ci pylib import error

* add dump fields & params

* try to fix windows import fleet error

* fix ps FLAGS error

* fix logging risk

* fix logging possible risk

8aa286dd

[Paddle Inference] Support depthwise_conv2d fp16. (#44642) · ed857585
Committed by xiaoxiaohehe001 on Jul 28, 2022
```
* depthwise_fp16

* depthwise_fp16

* depthwise_fp16

* depthwise_fp16
```
ed857585

[phi]move softsign from fluid to phi (#44616) · 20759c30

Committed by HongyuJia on Jul 28, 2022

* test_activation_op unitest error, yaml & activation.py in_dygraph_mode incomplete

* fix test_activation_op unitest error, add yaml and dygraph test

* fix code style with pre-commit

* try to fix namespace error of abs in activation_functor.h

* fix namespace error of abs

20759c30

migrate dirichlet kernel to phi (#44434) · 798a4eac
Committed by Xiaoxu Chen on Jul 28, 2022
```
* migrate dirichlet op kernel to phi

* fix dirichlet sample memory leak
```
798a4eac
fix bugs of lstsq (#44689) · 2781740b
Committed by Haohongxiang on Jul 28, 2022

2781740b
Fix some problem of kernel fallback in C++ API (#44681) · 55aaeb39
Committed by zyfncg on Jul 28, 2022
```
* support auto fallback to cpu kernel for custom device

* fix some problem of kernel fallback
```
55aaeb39
adapt for resnet (#44685) · 2cec4c88
Committed by zhaoyingli on Jul 28, 2022

2cec4c88
[MLU] fix log_softmax mode selection. (#44669) · a9f76d07
Committed by Chenxiao Niu on Jul 28, 2022

a9f76d07
delete elementwise pow in xpu_kp_list (#44661) · dfeb1942
Committed by niuliling123 on Jul 28, 2022

dfeb1942

Move frame kernel to phi (#44615) · 28b4b2f7

Committed by Charles-hit on Jul 28, 2022

* Move frame OP to phi, add frame OP yaml config and supplement single test

* add Header file of in_dygraph_mode

* Modify variable name and FrameGradInferMeta multiplex UnchangedInferMeta

* move seq2col to phi

28b4b2f7

Move api(lgamma) from legacy_api.yaml to api.yaml (#44355) · 511a2c1c

Committed by Charles-hit on Jul 28, 2022

* Move api(lgamma) from legacy_api.yaml to api.yaml

* Move api(lgamma) from legacy_api.yaml to api.yaml

* Move api(lgamma) from legacy_api.yaml to api.yaml

* modify code style

* add x to X mapping

* add definition of lgamma

* delete redundant lgamma definitions

* Modify code comments

* Modify ops.py code format

* add lgamma  single test and lgamma api in fluid

* Optimized lgamma unittest

511a2c1c

[LAUNCH] add distributed launch check tools (#44495) · 9a3e1bce
Committed by kuizhiqing on Jul 28, 2022
```
* add launch test

* launch test for cpu

* bs 1
```
9a3e1bce
support log_grad op, *test=kunlun (#44662) · 067107ad
Committed by z8hanghuan on Jul 28, 2022

067107ad
[Eager] refactor general_grad and fix some bugs (#44611) · acde295c
Committed by Weilong Wu on Jul 28, 2022
```
* refactor general_grad and fix some bugs

* add TODO: support prune logic deeper
```
acde295c
Complete the dtypes for all_gather, add all_gather_object api (#44417) · d4cf02bc
Committed by LiYuRio on Jul 28, 2022

d4cf02bc

[PHI] Move spectral_norm to phi (#44577) · 768e50c9

Committed by Lin Manhui on Jul 28, 2022

* Add kernel declarations

* Copy kernel implementation code

* Transfer implementation code

* Fix: Move out_grad to first

* Register new kernels

* Remove old kernels

* Move out_grad to last

* Fix bugs

* Transfer infermeta

* Add yaml files

* Add blank line

* Fix code style

* Optimize directory structure

Co-authored-by: Bobholamovic <linmanhui@baidu.com>

768e50c9

Support broadcast tensor in phi system (#44590) · a90b8dc1
Committed by Jiabin Yang on Jul 28, 2022

a90b8dc1

[XPU] add top_k op (#44656) · acf07c74

Committed by houj04 on Jul 28, 2022

* [XPU] add top_k op. test=kunlun

* [XPU] add top_k op. test=kunlun

* use PADDLE_ENFORCE_XDNN_NOT_NULL to check pointer. test=kunlun

acf07c74

Change the way to set attributes for grad op maker (#44514) · 8ee9140b
Committed by Feiyu Chan on Jul 28, 2022
```
* fix typos in template for codegen of operators
* change the way to set attributes for grad op maker
```
8ee9140b
[auto parallel] bug fix for op has sub_block attr created with copy_from (#44664) · 822e42d7
Committed by Yuang Liu on Jul 28, 2022

822e42d7

Jul 27, 2022 · 6 commits
- add matrix_nms in python/paddle/vision/ops.py (#44357) · 8fc1cf60
  Committed by shangliang Xu on Jul 27, 2022
  
  8fc1cf60
- [Eager] Add hierarchical_sigmoid yaml (#44638) · ea91ca2f
  Committed by Zhong Hui on Jul 27, 2022
  
  ea91ca2f
- xpu unittest grad compute supports more types, *test=kunlun (#44606) · ae25ab56
  Committed by ykkk2333 on Jul 27, 2022
  
  ae25ab56
- retain dist op returns (#44634) · 5be7a1ff
  Committed by zhaoyingli on Jul 27, 2022
  
  5be7a1ff
- [MLU]fix sync_batch_norm and concat_grad op (#44586) · f49b0cb9
  Committed by qipengh on Jul 27, 2022
  
  f49b0cb9
- Strided slice fp16 (#44653) · 84d595fa
  Committed by ming1753 on Jul 27, 2022
  
  84d595fa

Crayon鑫 / Paddle is in sync with the fork's source project