Commits · d2a0577a10ce577b9592afd873f80020daae5e4d · PaddlePaddle / Paddle

27 Feb, 2023 · 22 commits
- [XPU] add fp16 support for shape and lookup_table_v2 op. (#50773) · d2a0577a
  Committed by houj04 on Feb 27, 2023
```
* [XPU] add fp16 support for shape op.

* [XPU] add fp16 support for lookup_table_v2 op.

* update approval list: add qingshu's id.
```
- [Bug fix] Fix fp16 dtype checking for AvgPool1D op (#50929) · f8ec430e
  Committed by Maple Xie on Feb 27, 2023
```
* Fix fp16 dtype checking for AvgPool1D op

* Update code style for PR-CI-Static-Check
```
- support fp16 on temporal_shift (#50919) · 12075f2a
  Committed by 张春乔 on Feb 27, 2023
- [Tensor Operants & Prim] Tensor pow API uses elementwise_pow (#50886) · 8a097399
  Committed by HongyuJia on Feb 27, 2023
```
* [Tensor Operants & Prim] Tensor pow API uses elementwise_pow

* unittest change to fill_constant+elementwise_pow
```
- [fp16] support fp16 on AvgPool3D (#50920) · 659cede0
  Committed by 张春乔 on Feb 27, 2023
```
* support fp16 on AvgPool3D

* Apply suggestions from code review
```
- support fp16 on AlphaDropout (#50917) · 3678cae2
  Committed by 张春乔 on Feb 27, 2023
- support fp16 on unbind (#50916) · 5f60b597
  Committed by 张春乔 on Feb 27, 2023
- suppot fp16 in gather_nd (#50909) · 336cd205
  Committed by 张春乔 on Feb 27, 2023
- suppot fp16 in flatten (#50906) · 7ffbf7e3
  Committed by 张春乔 on Feb 27, 2023
- suppot fp16 in broadcast (#50905) · 77298931
  Committed by 张春乔 on Feb 27, 2023
- fix fp16 dtype checking for clip op (#50878) · d832a54d
  Committed by haozi on Feb 27, 2023
```
* fix fp16 dtype checking for clip op

* modify the name

* fix type error

* fix check error

* Update test_clip_op.py

fix test error

* Update test_clip_op.py

fix code style

---------
Co-authored-by: Zhang Ting <Douyaer2020@qq.com>
```
- fix fp16 dtype checking for conj op (#50868) · 6b85eb59
  Committed by Infinity_lee on Feb 27, 2023
- [Error Msg] Polish error message when GPU kernel not found (#50880) · 3e9ffaef
  Committed by HongyuJia on Feb 27, 2023
```
* [Error Msg] Polish error message when GPU kernel not found

* Only test in GPU environment
```
- [bug fix] fix fp16 dtype checking for argmax op (#50811) · f3aec871
  Committed by Zhang Ting on Feb 27, 2023
```
* fix fp16 dtype checking for argmax op

* run fp16 test when place is gpu

* Update search.py

fix doc
```
- [fp16] fix fp16 support for nn.PairwiseDistance (#50849) · 587120ec
  Committed by Ainavo on Feb 27, 2023
- fix fp16 dtype checking for paddle.diag API (#50848) · ebea0885
  Committed by 陈沧夜 on Feb 27, 2023
- [fp16] suppot fp16 input in nansum (#50847) · 9951b86f
  Committed by 张春乔 on Feb 27, 2023
```
* add float16 in python/paddle/math

* add unittest for float16

* add float16 support in python.paddle.tensor.search.where

* remove fp16 error cases

* Add NotImplementedError unittest

* fix codestyle

* fluid to paddle.static; add cases with GPU

* Add float16 in English docs
```
- add prim test for sqrt and exp (#50942) · cf209204
  Committed by Charles-hit on Feb 27, 2023
- [kunlun] support reduce_scatter (#50792) · 6786c012
  Committed by jameszhang on Feb 27, 2023
```
* [kunlun] support reduce_scatter

* uncomment unittest

* update xccl to 1.0.10
```
- revert reshape 0 represent copy and support perm < 0 for paddle.transpose (#50720) · 3669868d
  Committed by zhouweiwei2014 on Feb 27, 2023
- xpu: bind op scatter_nd_add. add data type for transpose2, clip & assign_value (#50825) · 0d12afea
  Committed by wangshengxiang on Feb 27, 2023
```
* [XPU] bind op scatter_nd_add

* [XPU] add more data type for op: clip, transpose2 & assign_value
```
- [mv fleet] mv fleet to distributed (#50834) · 5d322ced
  Committed by wangzhen38 on Feb 27, 2023
```
* [mv fleet] mv fleet to distributed

* [mv fleet] for ci

* [mv fleet] for ci

* [mv fleet] solve ci of version
```
25 Feb, 2023 · 2 commits

Support 0D for equal tensor with scalar (#50857) · 7c73910e
Committed by zhouweiwei2014 on Feb 25, 2023

change outputs and grads from fp16-fp16-comparision and fp16-fp32 (#50700) · 2dec64d0

Committed by Vvsmile on Feb 25, 2023

* change outputs and grads from fp16-fp16-comparision and fp16-fp32
comparision

* support grad comparision fp16-fp32

* the change of reference dtype only occured from np.float16 to np.float32

* fix the list type can not infer the dtype by attribute dtype by transfer
the list to array

* adjust the default atol and rtol of float16 to 1e-3

* Polish code

* fix error

* fix

* Polish code

* fix the _is_cal_ref and np.float16

* fix the combination of is_calc_ref and np.float16

* remove unuseful codes in op_test.py

* fix ci

* fix the rtol set in the dygraph checker and eager checker

---------
Co-authored-by: ZzSean <18818272991@163.com>

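The fp16-versus-fp32 comparison that #50700 describes can be illustrated with a small NumPy sketch. This is not the code from op_test.py: the helper name `check_fp16_against_fp32_ref` and the choice of `np.tanh` as the op under test are assumptions made for illustration. Only the 1e-3 default for `atol` and `rtol` comes from the commit message.

```python
import numpy as np


def check_fp16_against_fp32_ref(op, x_fp16, atol=1e-3, rtol=1e-3):
    """Hypothetical helper: run `op` in float16 and compare with a float32 reference."""
    out_fp16 = op(x_fp16)                     # result computed in float16
    ref_fp32 = op(x_fp16.astype(np.float32))  # reference computed in float32
    # Upcast the float16 result so the comparison itself is done in float32.
    np.testing.assert_allclose(
        out_fp16.astype(np.float32), ref_fp32, atol=atol, rtol=rtol
    )
    return out_fp16, ref_fp32


rng = np.random.default_rng(0)
x = rng.uniform(-1.0, 1.0, size=(4, 8)).astype(np.float16)
out, ref = check_fp16_against_fp32_ref(np.tanh, x)
```

Computing the reference in float32 rather than float16 means rounding error in the reference itself does not mask or inflate the error of the kernel under test.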

24 Feb, 2023 · 11 commits

[Zero-Dim] Support 0D Tensor input for topk/broadcast_to/expand/expand_as/broadcast_shape (#50536) · 5041158f
Committed by yunyaoXYY on Feb 24, 2023

Revert grad scale optimization pr (#50839) · 8a503522

Committed by Weilong Wu on Feb 24, 2023

* Revert "fixoptminizer _set_auxiliary_var bug (#50335)"

This reverts commit c44005f0.

* Revert "refine optimizer create accumulators (#50188)"

This reverts commit 244e7546.

* Revert "fix found_inf bug for custom optimizer (#50158)"

This reverts commit 64573f9f.

* Revert "refine amp scaler found_inf (#49864)"

This reverts commit 382e9a06.

* fix code format

* fix conflict


dynamic graph tests (#50572) · 09694f82

Committed by 姜永久 on Feb 24, 2023

* fix

* and others

* more ops

* reset distribute_fpn and precision_recall

* reset fc

* modify arange test

* modify reshape&reduce

* add fill_any and sigmoid_cross_entropy

* reset linear_interp_v2

* reset reduce

* modify

* modify arange

* modify cast


【Prim】Fix prim amp (#50518) · 6664a232

Committed by Jiabin Yang on Feb 24, 2023

* change amp with to_prim

* fix prim amp

* fix rules

* fix liear

* add amp test

* add test

* disable this test on cpu

* disable this test on cpu

---------
Co-authored-by: cyber-pioneer <chenzhuo@tju.edu.cn>


Add bert prim and cinn test (#50545) · bfa217e4
Committed by WangZhen on Feb 24, 2023
```
* Add bert prim and cinn test
```

【prim】Slice grad (#50771) · f6dea800

Committed by xiaoguoguo626807 on Feb 24, 2023

* support prim test in OpTest

* fix cmake

* fix op test

* fix test_input_spec

* disable cinn in reduce_sum unit test

* add bfloat16 dtype for sum

* add approve rules

* polish code

* add clear jit program function

* convert grad out from tensor to numpy

* remove unnecessary code

* add only_prim flag

* fix flag

* fix op test

* add attr

* fix optest comp inplace error

* fix op test

* fix op test with guard

* add initialization of check_comp flag

* fix comp inplace error in op test

* rename check_comp with check_prim and add bfloat16 dtype convert

* rename comp_op_type to prim_op_type

* rename comp to prim

* remove useless code

* skip ci check for only prim

* add no_grad_vars and grad_outputs in prim test

* fix var_dict

* fix op test for only_prim

* fix dy2static bugs

* polish some code

* temp

* modify op test

* except cinn test

* modify bfp16

* modify pad grad

* add pad_grad dtype

* start cinn part

---------
Co-authored-by: Charles-hit <wanghao107@baidu.com>


[Tensor Operants & Prim] Tensor arithmetic operants support left scalar type (#50840) · 0d956e17
Committed by HongyuJia on Feb 24, 2023
[dy2static] bug fix: Lazy initialize bugs (#50785) · 44a32fbd
Committed by xiongkun on Feb 24, 2023
[Prim]fix attrs loss in creating op (#50780) · 016f5ecb
Committed by cyber-pioneer on Feb 24, 2023
```
* fix attrs loss in creating op

* add comment

* add case

* add case

* remove unused case setting
```
[Save/Load]Fix backward op's error when use jit.load (#50744) · 2be69d05
Committed by YuanRisheng on Feb 24, 2023
```
* perfect translated layer

* perfect code according comment
```
[XPU] add expand_grad, isnan, meshgrid kernels (#50774) · 7271de88
Committed by ronnywang on Feb 24, 2023
```
* [XPU] add expand_grad, isnan, meshgrid kernels

* update
```

23 Feb, 2023 · 5 commits

[XPU] Migrate xpu_embedding_with_eltwise_add_fuse_pass (#50590) · 8d325d82
Committed by csy0225 on Feb 23, 2023

[OptionalOptimization]: LayerNorm forward Optimization with Welford (#50362) · 746b774b

Committed by limingshu on Feb 23, 2023

* first commit

* main codes has been developed

* fix all bugs

* add vectorize input&output

* a test for optimization_of_layer_norm_fwd

* add some changes

* fix memory coalesced access for more optimization.

* fix addition ctest error

* fix according to ci-approval

* remove change on slice

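For readers unfamiliar with the Welford method named in #50362, the following is a minimal CPU sketch of the single-pass mean and variance update, followed by a plain normalization step. It is not the GPU kernel from the commit: the function names `welford_mean_var` and `layer_norm_row` are hypothetical, the `eps` default is an assumed value, and the scale and bias terms of LayerNorm are left out.

```python
import numpy as np


def welford_mean_var(values):
    """Single-pass mean and population variance using Welford's update."""
    count = 0
    mean = 0.0
    m2 = 0.0  # running sum of squared deviations from the current mean
    for v in values:
        count += 1
        delta = v - mean           # deviation from the old mean
        mean += delta / count      # update the running mean
        m2 += delta * (v - mean)   # uses both the old and the new mean
    variance = m2 / count if count else 0.0
    return mean, variance


def layer_norm_row(row, eps=1e-5):
    """Normalize one row with the statistics above (no scale or bias)."""
    mean, var = welford_mean_var(row)
    return (np.asarray(row, dtype=np.float64) - mean) / np.sqrt(var + eps)


row = [1.0, 2.0, 3.0, 4.0]
mean, var = welford_mean_var(row)
normalized = layer_norm_row(row)
```

The update reads each element once and avoids the cancellation that the sum-of-squares formula can suffer from when the mean is large relative to the spread.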

add custom_cpu mixed_precision test (#50789) · 7b1f42d3
Committed by duanyanhui on Feb 23, 2023

Fluid clean move dygraph profiler, fluid.input.on_hot and fluid.input.embedding (#50141) · df7cc3a0

Committed by GGBond8488 on Feb 23, 2023

* remove dygraph.profiler

* remove fluid.input.one-hot and move embedding to paddle.static.nn

* fix unitest error

* fix type error

* fix type error

* fix xpu test error

* fxi sample code error

* fxi sample code error

* fix sample code error

* remove test.py

* remove variable in docstr


[Tensor Operants & Prim] Tensor arithmetic operants support right scalar type (#50563) · 5f5a2082

Committed by HongyuJia on Feb 23, 2023

* polish namespace

* change static_tensor_operants

* polish namespace

* support add, subtract, divide

* add unit test

* polish unittest

* fix cmake error

* solve conflicts, merge auto code-gen

* add scalar operator in tensor.h

* tensorbase

* static prim full support more datatype

* fix prim unittest

* polish codes

* fix cmake error


PaddlePaddle / Paddle · synced successfully more than a year ago