Commits · 6f41e177626919c6b8e76797734eac7959c80c9c · PaddlePaddle / Paddle

14 Apr, 2023 · 24 commits

[Zero-Dim] support 0-D tensor for... · 6f41e177

Committed by YangQun on Apr 14, 2023

[Zero-Dim] support 0-D tensor for reduce/reshape/stack/prelu/expand_v2/gaussian onednn kernels (#52185)

* support 0-D tensor for reduce/reshape/stack/prelu/expand_v2/gaussian ops

* fix gaussian random mkldnn op ut

[Decouple enforce.h] Move LOG from enforce.h to enforce.cc (#52883) · b33f95b0

Committed by HongyuJia on Apr 14, 2023

* [Decouple enforce.h] Move LOG from enforce.h to enforce.cc

* update cmake of device_context.cc, solve cuda_device_context_allocator.h compile error

* add namespace inside macro

[CustomOP Unittest] Optimize unit test, save setUp time (#52889) · b66c833f
Committed by HongyuJia on Apr 14, 2023

1. modify set_value op, use Scalars to represent attr `values`, instead of a... · dd2a749a

Committed by Feiyu Chan on Apr 14, 2023

1. modify the set_value op to use Scalars to represent the attr `values`, instead of a set of attributes of various types (#52408);
2. add a program converter, with the set_value op as an example, which converts `paddle::framework::ProgramDesc` between the old and new formats (the differences are mainly operators whose definitions received incompatible updates);
3. the program version and operator version map are now always saved when serializing `paddle::framework::ProgramDesc`, to identify the version;
4. provide an option `legacy_format=false` when serializing `paddle::framework::ProgramDesc`; it decides whether to convert the ProgramDesc back to a legacy format that Paddle 2.4.2 and earlier versions can load and execute;
5. deserialization of `paddle::framework::ProgramDesc` now automatically detects whether the bytes it receives are in the legacy format (they contain any of the operators that have been incompatibly updated and have any attribute of type `Scalar`) and converts them to the new format. For a faithful deserialization without the automatic conversion, use protobuf's deserialization instead; this is not recommended, but it can be useful for testing.
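The conversion flow described in this commit message can be illustrated with a small, self-contained Python sketch. It is not Paddle's converter: the dictionary layout, the version numbers, the legacy attribute names, and the functions `to_legacy`, `to_new`, `serialize`, and `deserialize` are all assumptions made for illustration. Only the `legacy_format` option name and the idea of a single `values` attribute come from the message above.

```python
import json

# All names below are hypothetical; this is not Paddle's converter.
LEGACY_KEYS = {bool: "bool_values", int: "int64_values", float: "fp32_values"}


def to_legacy(op):
    """Split a unified `values` list into one typed legacy attribute."""
    attrs = dict(op["attrs"])
    values = attrs.pop("values", None)
    if op["type"] != "set_value" or values is None:
        return op
    # Assumes all values share one type; an empty list defaults to float.
    key = LEGACY_KEYS[type(values[0])] if values else "fp32_values"
    attrs[key] = values
    return {"type": op["type"], "attrs": attrs}


def to_new(op):
    """Fold any typed legacy attribute back into a single `values` list."""
    if op["type"] != "set_value":
        return op
    attrs = dict(op["attrs"])
    values = attrs.pop("values", [])
    for key in LEGACY_KEYS.values():
        values = values + attrs.pop(key, [])
    attrs["values"] = values
    return {"type": op["type"], "attrs": attrs}


def serialize(ops, legacy_format=False):
    """Always record a version; optionally emit the legacy layout."""
    if legacy_format:
        ops = [to_legacy(op) for op in ops]
    return json.dumps({"version": 1 if legacy_format else 2, "ops": ops})


def deserialize(data):
    """Load a program and convert any legacy ops to the new layout."""
    program = json.loads(data)
    return [to_new(op) for op in program["ops"]]
```

In this sketch, a program written with `legacy_format=True` and read back through `deserialize` yields the same ops as one written in the new layout, which mirrors the round trip the commit message describes.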

[phi] move sequence_pool to phi - Step 2 : sequence_pool_op (#52750) · b281b221

Committed by gouzil on Apr 14, 2023

* [phi] move sequence_pool kernel to phi

* [phi] mv sequence_pooling to phi funcs

* [phi] mv sequence_pooling_test

* [phi] RollBACK `paddle/fluid/operators/sequence_ops/sequence_pool_op.cc`

* [phi][funcs] fix mutable_data

* [phi][funcs] fix mutable_data

Move fused_attention op to phi [migrate backward GPU OpKernel] (#51909) · 3bac6264

Committed by Sonder on Apr 14, 2023

* add kernel functions

* update kernel functions

* update func parameters' name

* create codes for gpu device

* adjust file locations

* fix include error

* remove dependent files to phi/

* restore fused_attention_op.cu

* fix dependence errors

* fix dependence errors

* fix include error

* fix all dependence errors [build success]

* remove useless include

* recover useless include

* use phi::ToNCCLDataType

* fix namespace

* update new register code

* fix error in fused_gemm_epilogue_utils

* fix error in FusedAttentionKernel parm

* finish fused_attention register code [build success]

* add paddle::optional

* add sig file

* fix build error

* fix a include error

* restore forward code

* update CMakeLists

* trans Compute function to phi [build success]

* add register code and fix include error [build success]

* fix parameter sequence

* add include file

* update #if before include

* update #if before include

* fix grammar error

* update codes for DropoutParam

* remove const cast

* trans some fluid api to phi api

* remove const cast

* trans some fluid api to phi api

* add #if

* update test code

* update test codes

* recover test codes

* fix namespace and remove fluid include

* recover random seed

* remove fluid quant_helper

* fix include error

* include utils in funcs

* change include file

* move grad codes back to fluid folder

* move grad codes back to fluid folder

* fix sig file error

* update include

* recover codes to develop

* update register codes

* fix build error

* recover fluid include

* remove some fluid include

* remove some fluid include

* Update fused_attention_op.cu

* remove fluid include

* add some fluid include

* Update fused_attention_op.cu

* Update fused_attention_op.cu

* Update fused_attention_op.cu

* Update fused_attention_op.cu

* remove useless include

fix some [-Wunused-function] and [-Wunused-function] warning (#52868) · ab163063
Committed by Galaxy1458 on Apr 14, 2023
```
* test,test=develop

* test,test=develop

* test,test=develop
```

【Prim】Add more infer var type (#52818) · 630d14f5

Committed by Jiabin Yang on Apr 14, 2023

* add more infer var type

* fix split error

* fix ut

* fix top_k infer vartype

* fix top_k infer vartype

add backend config to select kernel (#52907) · 1ab7e77a
Committed by lzydev on Apr 14, 2023

fix win cu116 compile error (#52894) · 60ba559a
Committed by sneaxiy on Apr 14, 2023

delete cast if lookup_table_v2 support fp16; delete repeated ops (#52888) · 7aafeb45
Committed by zhupengyang on Apr 14, 2023

add npu to device_guard (#52774) · 64b4aaba
Committed by duanyanhui on Apr 14, 2023

[Function optimization] support uint16 python op in d2s (#52809) · 6d231b02
Committed by 骑马小猫 on Apr 14, 2023
```
* support uint16 python op in d2s

* convert uint16 -> bfloat16 in docstring
```
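The second bullet suggests that uint16 acts as the storage type for bfloat16 values. Assuming that reading, the sketch below shows how a float32 value maps to a 16-bit bfloat16 pattern and back using only the Python standard library. The function names are hypothetical and are not part of Paddle, and NaN inputs are not handled.

```python
import struct


def float_to_bf16_bits(x):
    """Return the bfloat16 bit pattern of x as an int in [0, 65535]."""
    (bits,) = struct.unpack("<I", struct.pack("<f", x))
    # Round to nearest even on the 16 low bits that are dropped.
    bits += 0x7FFF + ((bits >> 16) & 1)
    return (bits >> 16) & 0xFFFF


def bf16_bits_to_float(b):
    """Expand a 16-bit bfloat16 pattern back to a Python float."""
    (x,) = struct.unpack("<f", struct.pack("<I", b << 16))
    return x
```

Because bfloat16 keeps the float32 exponent and only the top 7 mantissa bits, values such as 1.0 round-trip exactly while others lose precision.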

rem cncl (#52434) · 25bd5ed8
Committed by Kim Yann on Apr 14, 2023

[AMP] Unify the static amp codes of fp16 and bf16. (#52694) · dfcba7f4

Committed by Yiqun Liu on Apr 14, 2023

* Unify the static amp codes of fp16 and bf16.

* Polish apis and add unittest.

* Add operator stats collecting tools for program.

* Add a check of the number of bfloat16 operators in unittest.

* Add warning for operator not supported for amp.

* Add testing of BF16 O1 and O2.

[CustomDevice] add model parallel support for custom device (#52872) · f8d09011
Committed by ronnywang on Apr 14, 2023

[IR] Move paddle_ir_test to test_ir (#52877) · 6b756e8c
Committed by zhangbo9674 on Apr 14, 2023
```
* move paddle_ir_test to test_ir

* fix bug

* fix bug
```

update (#52875) · ce6978c6
Committed by huangjiyi on Apr 14, 2023

update (#52880) · 2f499713
Committed by huangjiyi on Apr 14, 2023

delete unused param from swish_grad and relu6_grad (#52805) · 54e4360a
Committed by zhangyuqin1998 on Apr 14, 2023

update (#52879) · b1bb7484
Committed by huangjiyi on Apr 14, 2023

update (#52878) · e93e8a3f
Committed by huangjiyi on Apr 14, 2023

Fix test full name usage (#52790) · aac8da90
Committed by risemeup1 on Apr 14, 2023
```
* test

* fix test error

* fix test error

* fix test error
```

add ci reviewer for inference size (#52159) · 3fed97f4
Committed by 石晓伟 on Apr 14, 2023

13 Apr, 2023 · 16 commits
- remove need cpp14 support (#52867) · d9c3abe6
  Committed by Yuanle Liu on Apr 13, 2023
- [Paddle-Trt] Replace fc mul matmul matmul_v2 with matrix_multiply (#52222) · ef734e84
  Committed by Wangzheee on Apr 13, 2023
```
* Paddle-Trt: Replace fc mul matmul matmul_v2 with matrix_multiply
```
- remove code with PADDLE_WITH_ASCEND (#52830) · acf55016
  Committed by jjyaoao on Apr 13, 2023
```
* remove code with PADDLE_WITH_ASCEND

* try pass codestyle
```
- delete WITH_ASCEND_CL (#52825) · 4a374c60
  Committed by jjyaoao on Apr 13, 2023
```
* delete WITH_ASCEND_CL

* delete NPU/ and WITH_MLU
```
- 【Hackathon No.55】 add channel_shuffle FP16/BF16 support and tests (#51884) · 48ccb785
  Committed by superwinner1 on Apr 13, 2023
```
* No55 add channel_shuffle FP16/BF16 support and tests
```
- 【Hackathon No57】add_fp16_bf16_for_dot & bf16_for_cross (#52426) · 205094f0
  Committed by Difer on Apr 13, 2023
```
* add_fp_bf_for_dot & bf_for_cross

* fix error

* fix some error

* fix some error

* change something

* fix magic number
```
- [AMP OP&Test] Support fp16&bf16 in reduce_max (#52862) · e0e044c0
  Committed by Zhang Zheng on Apr 13, 2023
```
* [AMP OP&Test] Support fp16&bf16 in reduce_max
```
- [Paddle-TRT]fix bilinear_interp_v2 && some other bugs in trt 7011 (#52753) · dc8d6a1a
  Committed by zhoutianzi666 on Apr 13, 2023
```
* fix bilinear_interp_v2 && some other bugs in trt 7011

* add version check in test_trt_convert_bilinear_interp_v2.py
```
- Fix the parameter check error in rmsprop_kernel_xpu. (#52866) · 9dc7e5ef
  Committed by Leo Guo on Apr 13, 2023
- Add TensorCheckerConfig for debugging tools (#51906) · 28de4558
  Committed by niuliling123 on Apr 13, 2023
- Add pixel_shuffle pixel_unshuffle fp16/bf16 (#52582) · 2aaed989
  Committed by chenxujun on Apr 13, 2023
- move some function of cuda error from enforce.h to enforce.cc (#52828) · e64ce0bb
  Committed by zyfncg on Apr 13, 2023
- Add GaussianNLLLoss API. (#50843) · 802129b3
  Committed by Zman on Apr 13, 2023
```
* Add GaussianNLLLoss API.

* Change `rotl` `atol`. Check `var` in dynamic graph

* remove assertTrue

* update unittest

* update unittest for ci-coverage. add broadcast with same dim.

* Supply static err print.

* Repair note and example.

* Split unittest.

* empty commit.

* for standard commit.

* for standard commit.

* Add int dynamic graph test.

* Repair parameters name.

* Repair unittest parameters name.

* Repair unittest parameters name

* Repair unittest parameters name

* Repair unittest parameters name

* add square in code-block

* fit few notes.

* fit few notes.

* fit few notes.

* fit few notes.

* add few interpretations.

* add few interpretations.

* add few interpretations.

* fix import.

* fix space.

* empty commit for ci.
```
- add batch_norm cinn case (#52815) · e05df020
  Committed by cyber-pioneer on Apr 13, 2023
- Fix ignore index of c_softmax_with_cross_entropy_op. (#52835) · 4341ebd9
  Committed by Ghost Screaming on Apr 13, 2023
```
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong.

* Remove climits.

* Fix bug of c_softmax_with_cross_entropy_op. Support a negative ignore_index.
```
- Fix delete_isolated_node_pass problem (#52856) · 0f2dc4ca
  Committed by csy0225 on Apr 13, 2023
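For readers unfamiliar with the loss introduced by the GaussianNLLLoss commit (#50843) in the list above, the following is a minimal pure-Python sketch of a Gaussian negative log-likelihood with mean reduction. It uses the standard formula 0.5 * (log(var) + (label - mean)^2 / var), plus an optional constant term. The function name, parameter names, and defaults are illustrative assumptions and should not be read as the signature or implementation merged in that commit.

```python
import math


def gaussian_nll(mean, label, variance, full=False, epsilon=1e-6):
    """Average Gaussian negative log-likelihood over paired samples."""
    total = 0.0
    for mu, y, var in zip(mean, label, variance):
        if var < 0:
            raise ValueError("variance must be non-negative")
        var = max(var, epsilon)  # clamp for numerical stability
        loss = 0.5 * (math.log(var) + (y - mu) ** 2 / var)
        if full:
            loss += 0.5 * math.log(2 * math.pi)  # constant term
        total += loss
    return total / len(mean)
```

With unit variance the loss reduces to half the squared error, so a prediction that is off by 1.0 gives 0.5.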

PaddlePaddle / Paddle · synced successfully more than 1 year ago
