Commits · 6fb34e743e19510dd99d15f13bb99efc737cd362 · BaiXuePrincess / Paddle

19 Aug, 2022 9 commits

fix layernormTrt meanVar alloc bug (#45255) · 6fb34e74
Committed by Wang Bojun on Aug 19, 2022
```
* fix layernormTrt meanVar alloc bug
```
6fb34e74
Fix random op dependency and lr_shedule bugs for standalone executor (#45265) · 6d4ae007
Committed by Ruibiao Chen on Aug 19, 2022
```
* Fix random op dependency and lr_shedule bugs for standalone executor

* Fix CI errors

* Fix CI errors

* Fix CI errors
```
6d4ae007
Trt groupnorm dynamic plugin (#44911) · 1aa6adb1
Committed by Wang Bojun on Aug 19, 2022
```
* add group_norm dynamic plugin
```
1aa6adb1

polish default param of XXX_interp_test, the same default value with … (#45258) · 4528ed2a

Committed by HongyuJia on Aug 19, 2022

* polish default param of XXX_interp_test, the same default value with XXX_interp_np

* set default value data_layout=NCHW, because the C++ side treats NCDHW the same way as NCHW

4528ed2a

[XPU] add merged_momentum unittest and change momentum (#45241) · e0f1c9f2

Committed by dongfangshenzhu on Aug 19, 2022

* add merged_momentum *test=kunlun

* add merged_momentum *test=kunlun

* add fp16 to merged_momentum,*test=kunlun

* change dist_model.cc

* add merged_momentum unittest and  change momentum,test=kunlun

* add merged_momentum unittest and  change momentum,test=kunlun

* add merged_momentum unittest and  change momentum,test=kunlun

* add merged_momentum unittest and  change momentum,test=kunlun

e0f1c9f2

[CodeStyle] use np.testing.assert_allclose instead of... · 4e2a3c11
Committed by Nyakku Shigure on Aug 19, 2022
```
[CodeStyle] use np.testing.assert_allclose instead of self.assertTrue(np.allclose(...)) (part 3) (#45251)
```
4e2a3c11
[ Dy2Static ]Modify while interface[python] to fit onnx (#45034) · e654f1e7
Committed by xiongkun on Aug 19, 2022
```
* Make sure that the output of whilep must exist in the input

* insert assign in block(0)

* add unittest.
```
e654f1e7

[CodeStyle] use np.testing.assert_allclose instead of... · 9107b653

Committed by Nyakku Shigure on Aug 19, 2022

[CodeStyle] use np.testing.assert_allclose instead of self.assertTrue(np.allclose(...)) (part 2) (#45213)

* autofix (get ci log)

* retrigger ci

* fix test_gather_nd_op, wrong expected in dygraph

* fix test_activation_op, unpack static graph result

* fix test_auc_op, unpack static graph result

* fix test_bce_loss, unpack static graph result

* fix test_bce_with_logits_loss, unpack static graph result

* fix test_cond, unpack static graph result

* fix test_dygraph_weight_norm, wrong numpy reference when `axis=None`

* fix test_einsum, wrong matmul inputs

* fix test_elementwise_heaviside_op, unpack static graph result

* fix test_frac_api, unpack static graph result

* skip test_group_norm_op_v2, probably the wrong numpy reference

* fix test_imperative_double_grad, wrong subscript

* skip test_imperative_tensor_clear_gradient, ???

* skip test_layer_norm_op, probably the wrong numpy reference

* fix test_math_op_patch, unpack static graph results

* fix test_masked_select_op, unpack static graph results

* fix test_mse_loss, unpack static graph results

* fix test_multi_label_soft_margin_loss, unpack static graph results

* fix test_multi_dot_op, unpack static graph results

* fix test_nll_loss, unpack static graph results

* fix test_normalization_wrapper, unpack static graph results

* fix test_pass_builder, unpack static graph results

* fix test_prelu_op, possibly an extra comma

* fix test_psroi_pool_op, unpack static graph results

* fix test_queue, unpack static graph results

* fix test_reorder_lod_tensor, compare an item with a list

* fix test_rrelu_op, unpack static graph results

* fix test_searchsorted_op, unpack static graph results

* fix test_sigmoid_focal_loss, unpack static graph results

* fix test_smooth_l1_loss, unpack static graph results

* fix test_soft_margin_loss, unpack static graph results

* fix test_softmax2d, unpack static graph results

* fix test_square_error_cost, unpack static graph results

* fix test_tril_indices_op, unpack static graph results

* fix test_unsqueeze_op, mismatch numpy reference (axis)

* skip test_layers, `static_rlt` is missing an axis

* fix test_mnist, unpack PredictorTools result (also a list)

* fix test_build_strategy, unpack PredictorTools result

* fix test_mobile_net, unpack PredictorTools result

* fix test_resnet_v2, unpack PredictorTools result

* revert some changes

revert test_layers

revert test_group_norm_op_v2

revert test_layer_norm_op

revert test_imperative_tensor_clear_gradient

* fix test_normal, use flatten instead of reshape, (PR-CI-Windows-OPENBLAS)

* empty commit, trigger CI

9107b653
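
The pattern named in this commit title, and a plausible reason so many of its fixes read "unpack static graph result", can be sketched as follows. This is a minimal illustration using only NumPy; the `static_result` list is a hypothetical stand-in for a result that comes back wrapped in a list, and nothing here is taken from the Paddle test files themselves.

```python
import numpy as np

actual = np.array([1.0, 2.0, 3.0])
expected = np.array([1.0, 2.0, 3.0 + 1e-9])

# Old style: a failure only reports "False is not true".
#   self.assertTrue(np.allclose(actual, expected))
# New style: a failure reports the mismatched elements and the max error.
np.testing.assert_allclose(actual, expected, rtol=1e-5, atol=1e-8)

# Assumption: a static-graph run hands its results back as a list of arrays.
static_result = [actual]  # hypothetical stand-in for such a list

# np.allclose broadcasts shape (1, 3) against (3,), so the wrapped list passes.
loose_ok = bool(np.allclose(static_result, expected))

# assert_allclose also compares shapes, so the wrapped list is rejected.
try:
    np.testing.assert_allclose(static_result, expected)
    strict_ok = True
except AssertionError:
    strict_ok = False

# Unpacking the single result restores the intended comparison.
(unpacked,) = static_result
np.testing.assert_allclose(unpacked, expected, rtol=1e-5, atol=1e-8)
```

Under these assumptions the old assertion silently accepts the extra list dimension, while the new one fails until the result is unpacked.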

[CustomDevice] support scalar (#45244) · dc331231
Committed by Aganlengzi on Aug 19, 2022

dc331231

18 Aug, 2022 9 commits
- [API]Support static branch in paddle.to_tensor (#45164) · 30122212
  Committed by feifei-111 on Aug 18, 2022
```
* fix_shape
```
  30122212
- [AutoParallel] support ClipGradByGlobalNorm (#45205) · bb6bd223
  Committed by zhaoyingli on Aug 18, 2022
```
* add clip_grad

* fix comments

* add unittest

* update logger
```
  bb6bd223
- [Eager] Add get_tensor_from_selected_rows (#45227) · d257acc6
  Committed by Weilong Wu on Aug 18, 2022
```
* [Eager] add get_tensor_from_selected_rows

* add PADDLE_ENFORCE to check SelectedRows

* use _ prefix in temp
```
  d257acc6
- [phi] Transfer fluid trilinear_interp_v2 to phi trilinear_interp (add yaml) (#45145) · 6150fade
  Committed by HongyuJia on Aug 18, 2022
```
* transfer trilinear op to phi, change name from trilinear_interp_v2 to trilinear_interp

* reserve linear_interp param

* change testcase scale if-branch

* testcase test_imperative_case

* fix trilinear testcase

* import paddle in test_trilinear_interp_v2
```
  6150fade
- apply buffer_shared_inplace_pass and inplace_addto_op_pass pass to program in... · d8d124b6
  Committed by pangyoki on Aug 18, 2022
```
apply buffer_shared_inplace_pass and inplace_addto_op_pass pass to program in Standalone Executor (#45085)

* apply inplace addto in python apply_pass

* fix

* apply inplace pass for program

* skip feed and fetch var

* fix block_desc.move_from

* fix block desc

* alltoall remove inplace

* fix
```
  d8d124b6
- [OpAttr]Squeeze axes support Tensor (#45189) · c93451f4
  Committed by Aurelius84 on Aug 18, 2022
```
* [OpAttr]Squeeze axes support Tensor

* add support_tensor

* fix unittest

* fix coverage
```
  c93451f4
- Add a tool to manage unit tests. (#45147) · b6a4db1d
  Committed by Roc on Aug 18, 2022
  
  b6a4db1d
- [phi] Transfer fluid bilinear_interp_v2 to phi bilinear_interp (add yaml) (#45140) · 2c2137bb
  Committed by HongyuJia on Aug 18, 2022
```
* transfer bilinear op to phi, change name from bilinear_interp_v2 to bilinear_interp

* reserve linear_interp param

* fix cross device import
```
  2c2137bb
- support selected_rows kernel for multiply in dygraph (#45217) · bcbb7a97
  Committed by zyfncg on Aug 18, 2022
  
  bcbb7a97
17 Aug, 2022 7 commits

fix timeout for test_activation_ops_ipu (#45181) · e51ea538
Committed by Allen Guo on Aug 17, 2022

e51ea538

[CodeStyle][NPU] use np.testing.assert_allclose instead of... · 2de0d676

Committed by Nyakku Shigure on Aug 17, 2022

[CodeStyle][NPU] use np.testing.assert_allclose instead of self.assertTrue(np.allclose(...)) (part 1) (#44988)

* autofix

* try resolve precision issues

* revert some changes

* clean some `err_msg`

* 0.0001 -> 1e-4

* update commented assert code

* try to fix some shape errors

* `numpy` -> `np`

* empty commit, trigger kunlun ci, test=kunlun

* empty commit, retrigger kunlun ci, test=kunlun

* empty commit, trigger kunlun ci, try fix npu memcpy_h2d, test=kunlun

* try fix npu import error, test=kunlun

2de0d676

[OpAttr]Add SupportTensor for OpMaker with whitelist mechanism (#45084) · 2594935a

Committed by Aurelius84 on Aug 17, 2022

* [OpAttr]Add SupportTensor for OpMaker

* fix typo

* fix code style

* add SupportTensor for concat op

* add unittest for register Tensor

* add shape checker and split attribute

2594935a

[Eager]Support Lazy initialization for nn.Layer (#44990) · f59c666c
Committed by Aurelius84 on Aug 17, 2022
```
* [Eager]Support Lazy initialization for nn.Layer
```
f59c666c

add instance norm op for xpu (#45097) · 216d25ac

Committed by ykkk2333 on Aug 17, 2022

* xpu unittest grad compute supports more types, *test=kunlun

* add instance norm xpu, *test=kunlun

216d25ac

[phi] Transfer fluid bicubic_interp_v2 to phi bicubic_interp (add yaml) (#45151) · f4da2d4d

Committed by HongyuJia on Aug 17, 2022

* transfer bicubic_interp op to phi, change name from bicubic_interp_v2 to bicubic_interp

* test final_state_bicubic_interp api

* testcase match imperative case

f4da2d4d

Optimize performance of amp (#45188) · 5e1a20bf
Committed by Zhang Zheng on Aug 17, 2022

5e1a20bf

16 Aug, 2022 8 commits

[Phi] Move amp ops into phi (#45079) · b4f67757

Committed by Chen Weihang on Aug 16, 2022

* move check finite and unscale kernel into phi

* move infershape into phi

* move update_loss_scaling kernel into phi

* remove original kernels

* move update loss scaling infershape into phi

* add header for xpu and npu

* solve coverage failed

* fix npu test failed

* remove mutable data in cu file

* fix new executor failed

* add valid check for meta tensor output

b4f67757

[geometric]Add paddle.geometric.send_uv API (#44848) · 88724a53

Committed by Siming Dai on Aug 16, 2022

* initial commit

* fix op maker bug

* fix mul grad bug

* add unittest

* fix add grad bug, add cpu kernel

* add paddle.geometric.message_passing

* add paddle.geometric.send_uv api, add unittest

* add fp16 judgement

* fix file typo, move compute_type to message_op

* add impl file

* fix unittest timeout time

* add review revise

88724a53

[Auto Parallel]Add reshard cost and update estimator (#45118) · 6a15d407

Committed by caozhou on Aug 16, 2022

* update reshard cost and cost estimator

* add unittest

* add dropout cost

* fix import error

* fix reshard code style error

* improve unittest coverage

6a15d407

convert multihead to oss (#45019) · f706d95d

Committed by feng_shuai on Aug 16, 2022

* convert multihead to oss

* fix:bug

* fix:delete const cast

* fix:don't support bias_qk

* add vit pass

* fix:convert bug and add preln_residual_bias

* support length=-1

* add UT for convert

* add no_bias_qk support for gpu_multihead_op

* delete infer_shape depends on bias_qk

* oss can only be used on T4 and A*

* fix:change api for ROCM CI

f706d95d

transfer nearest_interp op to phi, change name from nearest_interp_v2 to nearest_interp (#45148) · 6452ab3b
Committed by HongyuJia on Aug 16, 2022

6452ab3b
[XPU] add truncated_gaussian_random op. (#45152) · 5bcabf78
Committed by houj04 on Aug 16, 2022

5bcabf78

[autograd] add select_p, eq_p, pow_p primitive operators for new autograd (#44813) · b681c88c

Committed by Sing_chan on Aug 16, 2022

* add select_p

* fix bugs

* add custom test for select_p; modify select_p primrules

* modify according to xiaoxu's comment

* add eq_p, select_p, pow_p, use autograd to test grad of high order

* add requirement of autograd, modify expected type of eq

* modify according to xiaoxu's comment

* import primops to use primops.pow

b681c88c

add strongly typed functions to set attributes to avoid unexpected type conversions. (#45107) · 307801d5
Committed by Feiyu Chan on Aug 16, 2022

307801d5

15 Aug, 2022 7 commits

modify atol and rtol to solve unittest failure (#45139) · f30c7bd6
Committed by RichardWooSJTU on Aug 15, 2022
```
Co-authored-by: NminghaoBD <liminghao03@baidu.com>
```
f30c7bd6
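
The commit does not say which tolerance values were changed. As background, the sketch below shows how `rtol` and `atol` combine in the criterion documented for `np.testing.assert_allclose`, `|actual - desired| <= atol + rtol * |desired|`. The helper name `within_tolerance` and all numbers are illustrative assumptions, not values from the Paddle tests.

```python
def within_tolerance(actual, desired, rtol=1e-7, atol=0.0):
    """Return True when |actual - desired| <= atol + rtol * |desired|."""
    return abs(actual - desired) <= atol + rtol * abs(desired)


reference = 1000.0
measured = 1000.02  # absolute error of about 0.02

# rtol scales with the reference: 1e-5 * 1000 allows 0.01, 1e-4 allows 0.1.
tight = within_tolerance(measured, reference, rtol=1e-5, atol=1e-8)
loose = within_tolerance(measured, reference, rtol=1e-4, atol=1e-8)

# Near zero the relative term vanishes, so only atol can absorb the error.
near_zero_rtol_only = within_tolerance(1e-6, 0.0, rtol=1e-4, atol=0.0)
near_zero_with_atol = within_tolerance(1e-6, 0.0, rtol=1e-4, atol=1e-5)
```

In this sketch, loosening `rtol` helps when the compared values are large, and raising `atol` helps when they sit near zero.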

[phi] change op name linear_interp_v2 to linear_interp (#45128) · 6de3bdb3

Committed by HongyuJia on Aug 15, 2022

* change name linear_interp_v2 to linear_interp

* fix deprecated_op_names

* deprecated_op_names add linear_interp_grad

6de3bdb3

Refine TRT unit test (#45102) · 3512bf11

Committed by zlsh80826 on Aug 15, 2022

* Reduce pool2d test configuration

* Reduce depthwise_conv2d test configuration

* Reduce trt_convert_conv2d_fusion test configuration

* Reduce trt_convert_conv2d test configuration

* Reduce trt_convert_conv2d_transpose test configuration

* Reduce trt_convert_hard_swish test configuration

* Enhance trt auto scan test error message and mechanism

* Increase FP16 trt ut tolerance

3512bf11

add mish and mish_grad for XPU, test=kunlun (#45098) · 6815c8ab
Committed by zhangyikun02 on Aug 15, 2022

6815c8ab
[AutoParallel] add collate_fn for dist_loader (#45053) · 3649099f
Committed by zhaoyingli on Aug 15, 2022
```
* add collate_fn

* fix number of inputs
```
3649099f
[jit] rm useless property pybind (#44962) · 8788513b
Committed by Hui Zhang on Aug 15, 2022
```
* rm useless pybind

* rm useless ut
```
8788513b

[Auto Parallel] Move the distributed info from python to c++ (#44510) · a52357fe

Committed by Yulong Ao on Aug 15, 2022

* [Auto Parallel] Move the distributed info from python to c++

* [Auto Parallel] Add dist_attrs for VarDesc and OpDesc

* [Auto Parallel] Add the lost file

* [Auto Parallel] Make the dist attr be unique_ptr

* [Auto Parallel] Add the proto conversion

* [Auto Parallel] Improve the proto support

* [Auto Parallel] Fix the bugs for adding a device or a link

* [Auto Parallel] Add the C++ ProcessMesh and DistributedMapper

* [Auto Parallel] Improve the impl of these dist attrs

* [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh

* [Auto Parallel] Fix the unittest problem

* [Auto Parallel] Explicitly add the src file for auto_parallel target

* [Auto Parallel] Add the proto dependency explicitly

* [Auto Parallel] Fix the cmake bug on windows and mac

* [Auto Parallel] Remove the pybind11 header file in process_mesh.h

* [Auto Parallel] Remove unused codes

* [Auto Parallel] Check whether the dist attr is null

* [Auto Parallel] Implement the assign operator for OpDesc explicitly

a52357fe

BaiXuePrincess / Paddle is in sync with the fork's source project