提交 · 281ea2f462a5e146cb7b570cbe00b39bab90da36 · PaddlePaddle / Paddle

14 4月, 2023 8 次提交

1. modify set_value op, use Scalars to represent attr `values`, instead of a... · dd2a749a

由 Feiyu Chan 提交于 4月 14, 2023

1. modify set_value op, use Scalars to represent attr `values`, instead of a bunch of attributs of various types; (#52408)

2. add program converter and set_value op as an example, which provides the functionality to convert `paddle::framework::ProgramDesc` between old and new formats(the differences are mainly some operators with incompatible updates in the definition);
3. program version and operator version map now are always saved when serializing `paddle::framework::ProgramDesc` to identify the version;
3. provide an option `legacy_format=false` in serialization of `paddle::framework::ProgramDesc`, it decided whether to convert ProgramDesc back to a legacy format, which is compatible for paddle 2.4.2 or earlier versions to load and execute;
4. deserialization of `paddle::framework::ProgramDesc` is now automatically detecting whether the bytes it receives is in legacy format(contains any of the operators that has been incompatibly updated and have any attribute of type `Scalar`) and convert it to new format. But if you want a faithful deserialization without the automatic conversion, you can use protobuf's deserialization instead. Though it is not recommended, it can be used for the purpose of testing.

dd2a749a

【Prim】Add more infer var type (#52818) · 630d14f5

由 Jiabin Yang 提交于 4月 14, 2023

* add more infer var type

* fix split error

* fix ut

* fix top_k infer vartype

* fix top_k infer vartype

630d14f5

Z

delete cast if lookup_table_v2 support fp16; delete repeated ops (#52888) · 7aafeb45
由 zhupengyang 提交于 4月 14, 2023

7aafeb45
D

add npu to device_guard (#52774) · 64b4aaba
由 duanyanhui 提交于 4月 14, 2023

64b4aaba
骑
[Function optimization] support uint16 python op in d2s (#52809) · 6d231b02
由骑马小猫提交于 4月 14, 2023
```
* support uint16 python op in d2s

* convert uint16 -> bfloat16 in docstring
```
6d231b02
K

rem cncl (#52434) · 25bd5ed8
由 Kim Yann 提交于 4月 14, 2023

25bd5ed8

[AMP] Unify the static amp codes of fp16 and bf16. (#52694) · dfcba7f4

由 Yiqun Liu 提交于 4月 14, 2023

* Unify the static amp codes of fp16 and bf16.

* Polish apis and add unittest.

* Add operator stats collecting tools for program.

* Add the check of number of bloat16 operators in unittest.

* Add warning for operator not supported for amp.

* Add testing of BF16 O1 and O2.

dfcba7f4

R

[CustomDevice] add model parallel support for custom device (#52872) · f8d09011
由 ronnywang 提交于 4月 14, 2023

f8d09011

13 4月, 2023 20 次提交
- W
  [Paddle-Trt] Replace fc mul matmul matmul_v2 with matrix_multiply (#52222) · ef734e84
  由 Wangzheee 提交于 4月 13, 2023
```
* Paddle-Trt: Replace fc mul matmul matmul_v2 with matrix_multiply
```
  ef734e84
- S
  【Hackathon No.55】 add channel_shuffle FP16/BF16 support and tests (#51884) · 48ccb785
  由 superwinner1 提交于 4月 13, 2023
```
* No55 add channel_shuffle FP16/BF16 support and tests
```
  48ccb785
- D
  【Hackathon No57】add_fp16_bf16_for_dot & bf16_for_cross (#52426) · 205094f0
  由 Difer 提交于 4月 13, 2023
```
* add_fp_bf_for_dot & bf_for_cross

* fix error

* fix some error

* fix some error

* change something

* fix magic number
```
  205094f0
- Z
  [AMP OP&Test] Support fp16&bf16 in reduce_max (#52862) · e0e044c0
  由 Zhang Zheng 提交于 4月 13, 2023
```
* [AMP OP&Test] Support fp16&bf16 in reduce_max
```
  e0e044c0
- Z
  [Paddle-TRT]fix bilinear_interp_v2 && some other bugs in trt 7011 (#52753) · dc8d6a1a
  由 zhoutianzi666 提交于 4月 13, 2023
```
* fix bilinear_interp_v2 && some other bugs in trt 7011

* add version check in test_trt_convert_bilinear_interp_v2.py
```
  dc8d6a1a
- N
  
  Add TensorCheckerConfig for debugging tools (#51906) · 28de4558
  由 niuliling123 提交于 4月 13, 2023
  
  28de4558
- C
  
  Add pixel_shuffle pixel_unshuffle fp16/bf16 (#52582) · 2aaed989
  由 chenxujun 提交于 4月 13, 2023
  
  2aaed989
- Z
  Add GaussianNLLLoss API. (#50843) · 802129b3
  由 Zman 提交于 4月 13, 2023
```
* Add GaussianNLLLoss API.

* Change `rotl` `atol`.Check `var` in dynamic graph

* remove assertTrue

* update unittest

* update unittest for ci-covarage.add broadcast with same dim.

* Supply static err print.

* Repair note and example.

* Split unitest.

* empty commit.

* for standard commit.

* for standard commit.

* Add int dynamic graph test.

* Repair parameters name.

* Repair unitest parameters name.

* Repair unitest parameters name

* Repair unitest parameters name

* Repair unitest parameters name

* add square in code-block

* fit few notes.

* fit few notes.

* fit few notes.

* fit few notes.

* add few interpretations.

* add few interpretations.

* add few interpretations.

* fix import.

* fix space.

* empty commit for ci.
```
  802129b3
- S
  
  Support static graph code-gen for temporal_shift (#52686) · 9246b93c
  由 Sanbu 提交于 4月 13, 2023
  
  9246b93c
- C
  
  Add overlap_add, sign tests (#52667) · cb6de765
  由 chenxujun 提交于 4月 13, 2023
  
  cb6de765
- [Auto Parallel] Add auto parallel tuner options in launch (#52053) · a67d3bb7
  由 TaoTao Li 提交于 4月 13, 2023
```
* add auto parallel tuner options in launch

* add ut for launch in auto_parallel tuner

fix code format

* fix ci-converage
```
  a67d3bb7
- V
  [AMP OP&Test] adjust test_elementwise_sub's tolerance, max_relative_error of grad and (#50953) · 2cff9839
  由 Vvsmile 提交于 4月 13, 2023
```
* adjust test_elementwise_sub's tolerance, max_relative_error of grad and
atol/rtol of output to 1e-3

* fix the dtype in setUp

* fix the elementwise_sub optest

* modify elementwise_sub optest

* fix and add bf16/fp16 to elementwise_sub

* fix elementwise_sub bugs

* fix bugs

* fix elementwise_sub op

* fix the data type

* fix elementwise_sub

* fix elementwise

* fix elementwise_sub

* fix bugs

* fix elementwise sub

* fix elementwise_sub

* remove scalar and vector
```
  2cff9839
- Z
  
  delete useless cast, elementwise_mul (#52831) · 0695fb88
  由 zhupengyang 提交于 4月 13, 2023
  
  0695fb88
- W
  [Dy2St]Fix _param_grad_names when grad name likes 'param@GRAD@GRAD' (#52821) · f4ae3737
  由 WangZhen 提交于 4月 13, 2023
```
* Fix _param_grad_names when like 'param@GRAD@GRAD'
```
  f4ae3737
- G
  
  add uint16 for bfloat16 dtype check in layer_norm under static mode (#52845) · 5a864270
  由 Guoxia Wang 提交于 4月 13, 2023
  
  5a864270
- J
  [CINN] optest add cinn check test (#52205) · edd578a1
  由 jiangcheng 提交于 4月 13, 2023
```
* [CINN] optest add cinn check test

* replace set self.check_cinn to pass check_cinn by function parameter

* fix ci bug

* add cinn atol/rtol
```
  edd578a1
- Z
  
  rename_bilinear_tensor_op (#52745) · eb93b5c9
  由 zhangyuqin1998 提交于 4月 13, 2023
  
  eb93b5c9
- L
  Fix bug in cross_entropy in static mode (#52771) · 16c36465
  由 lzydev 提交于 4月 13, 2023
```
* fix bug in cross_entropy in static mode

* fix ci-coverage
```
  16c36465
- K
  rem cncl in ut & build sh (#52811) · 4c7d5045
  由 Kim Yann 提交于 4月 13, 2023
```
* rem cncl in new test

* rem cncl in build sh

* rem cncl in old test
```
  4c7d5045
- G
  [Hackathon NO.75] 为 Paddle-TRT 添加 expend_as_v2 算子 (#51028) · 94cc1d6b
  由 gaoziyuan 提交于 4月 13, 2023
```
---------
Co-authored-by: NZhang Jun <ewalker@live.cn>
```
  94cc1d6b
12 4月, 2023 12 次提交

S

fix bug of mp (#52789) · 3ece0ece
由 ShenLiang 提交于 4月 12, 2023

3ece0ece

[Auto Parallel] Move some changes or bug fixes from 2.4 to develop (#52721) · cbdba509

由 Yulong Ao 提交于 4月 12, 2023

* [Auto Parallel] Speedup the completion process

* [Auto Parallel] Skip the property of dist_context when deepcopying

* [Auto Parallel] Remove the unnecessary print

* [Auto Parallel] Move some changes from 2.4 branch to develop

* Update engine.py

* [Auto Parallel] Fix a bug

cbdba509

L

Add layer func: float(), half(), bfloat16(). (#51635) · a64d50b7
由 liuruyan 提交于 4月 12, 2023

a64d50b7

张

remove *hccl*.cc (#52798) · 2131ee5c

由张春乔提交于 4月 12, 2023

* remove c_comm_init_hccl_op.cc and c_gen_hccl_id_op.cc

* remove gen_hccl_id_op.cc

2131ee5c

C

[Auto Parallel]Add the single-node topology detection (#52723) · 05fd6d10
由 CHANGer 提交于 4月 12, 2023

05fd6d10
A

[API]Fix paddle.arange infershape always -1 (#52764) · f063074f
由 Aurelius84 提交于 4月 12, 2023

f063074f
Q
fix dtype cast in amp for instance_norm. (#52765) · f650e901
由 qizhaoaoe 提交于 4月 12, 2023
```
* fix dtype cast in amp.

* add test case and update docs.

* remove set_prim.
```
f650e901
G

【Hackathon 78】为Paddle-TRT增加cumsum算子 (#52518) · 2309aa58
由 gaoziyuan 提交于 4月 12, 2023

2309aa58

[AMP OP&Test] add fp16/bf16 unittest for pool2d op (#52288) · f9b155f9

由 Wei Shengyu 提交于 4月 12, 2023

* add bf16 support and bf16/fp16 unittest for pool2d

* add include files

* dbg

* reformat

* reformat

* modify code according to review comment

* remove duplicate code

* remove dup code

* remove useless include

* dbg

f9b155f9

[Move Test] xpu (#52661) · 9a7c83bd

由 RedContritio 提交于 4月 12, 2023

* move python/paddle/fluid/tests/unittests/xpu to test/xpu

* update CMakeLists.txt

* remove xpu in fluid/tests/unittests/

* add path to op_test_xpu

* fix incorrect path

* update test script

* fix test_adadelta_op_xpu error

9a7c83bd

[AMP OP&Test] support bf16 for batch norm (#52407) · 523f8a26

由 Guoxia Wang 提交于 4月 12, 2023

* [AMP OP&Test] support bf16 for batchnorm

* codestyle

* Update batch_norm_grad_kernel.cu

* Update batch_norm_kernel.cu

* fix codestyle

* fix

* fix

* fix

* fix

* fix

* Update batch_norm_kernel.cc

523f8a26

Modify LayerNorm Composite Rule (#52712) · a2060568

由 Huihuang Zheng 提交于 4月 12, 2023

* [Do NOT merge] Expr PR on Composite

* Expr PR on Composite

* Revert some compsite experiment

* Remove unnecessary composite code

* Add rsqrt as sub primitives

a2060568

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功