- 21 Dec 2021, 7 commits
-
-
Committed by Guoxia Wang
-
Committed by baoachun
* add seqpool_cvm_concat_fuse_pass ut
* rename ut name
-
Committed by sneaxiy
* mean first version
* fix scalar mean
* add fp16 dtype for api
-
Committed by yeliang2258
* fix timeout bug
* update
-
Committed by baoachun
* update repeated_fc_relu_fuse_pass ut
* update ut
-
Committed by Haohongxiang
* update
* fix bugs
* modify code style
* fix bugs of _get_global_group
-
Committed by heliqi
* add timeout
* add timeout
* PassAutoScan base_line use same config
* try run base_line
* fix dropout Mask of output attr error
* fix dropout Mask of output attr error
-
- 20 Dec 2021, 10 commits
-
-
Committed by sneaxiy
-
Committed by baoachun
* add mkldnn conv_transpose_bias fuse pass ut
* update conv_transpose_bias_mkldnn_fuse_pass ut
* update conv_transpose_bias_mkldnn_fuse_pass ut
* update conv_transpose_bias_mkldnn_fuse_pass ut
* restrict conv2d data_format in conv_transpose_bias_mkldnn_fuse_pass
* update ut timeout setting
* update ut
-
Committed by chentianyu03
* add pten conj kernel
* modify conj_kernel file path
* add defined cuda macro to cuda/conj_kernel.h
-
Committed by sneaxiy
* support FP16 for more ops
* add amp list tests
* refine reduce_mean_grad
* fix OP benchmark ci
* fix fp16 reduce_mean
* update ut, but still have some problems
* remove mean/reduce_mean fp16 kernel
-
Committed by heliqi
* add matmul_scale matmul_v2_scale fuse pass
* add scaletensor judge
* modify var name
* add timeout notest;test=coverag
* fix error commit
* fix use_mkldnn attr
* fix use_mkldnn attr
-
Committed by kuizhiqing
-
Committed by 0x45f
-
Committed by zhangbo9674
* add multi_tensor for momentum and clear_grads for optimizer
* fix bug for dygraph
* add unittest
* refine comment
* add param_group
* refine regularization logic
* del clear_grads
* add clear_grads
* add dispensable check of None
* refine clear_grad
* fix build bug
* refine code by comment
* refine code
* add multi tensor check
* refine param_group update
* add multi tensor for static mode
* refine comments
* delete useless comma for momentum
* refine comment for momentum
* refine code by comment
-
Committed by Yuang Liu
-
Committed by Feiyu Chan
-
- 19 Dec 2021, 1 commit
-
-
Committed by Baibaifan
-
- 18 Dec 2021, 2 commits
-
-
Committed by yeliang2258
* add test_conv_act_mkldnn_fuse_pass
* update cmakelist
* fix cmakelist
* fix timeout
* fix timeout
* fix timeout
* fix
-
Committed by Feiyu Chan
* add complex op and `paddle.complex`.
-
- 17 Dec 2021, 12 commits
-
-
Committed by caozhou
* add planner
* add planner
* add cost model update
* add relaunch update
* update process_group
* fix error
* add unitest
* update unitest
* update cost model
* avoid api problem
-
Committed by Jiabin Yang
* support more eager tensor api
* support multiple constructor for eager tensor
* add place related code
* polish code
* specific randint with dtype of int64
* Support pure cpu test
* refine test in pure cpu
* refine test in pure cpu
-
Committed by sneaxiy
* add compile_dir
* follow comments
-
Committed by sneaxiy
* support multi precision update for LAMB
* hide some api
* fix ci uts
* fix lamb output of dygraph
* remove some changes to some PR
* try to fix Py3 CI compile error
* fix test_imperative_optimizer, add lars ut, add layer_norm ut
* fix ut, fix format
* fix ut
* fix windows ci
-
Committed by feng_shuai
-
Committed by chentianyu03
* modify sum mean args
* add GetExpectedPtenKernelArgs for reduce_op
* modify kernel args number
* modify kernel args number
-
Committed by kuizhiqing
-
Committed by heliqi
* add timeout
* add timeout
-
Committed by Aurelius84
* Add RWLock to protect loading module under multi-thread
* refine code
* remove import statement
-
Committed by zhaoyingli
* add gpt modeling
* update file name
-
Committed by Yuang Liu
-
Committed by WangXi
-
- 16 Dec 2021, 8 commits
-
-
Committed by Sing_chan
-
Committed by xiaoting
* add activation
* update activation_op
* add unitest for activation
* fix acosh for init, test=develop
-
Committed by feng_shuai
* conv_transpose_eltwiseadd_bn_fuse_pass
* change timeout
* add TIMEOUT
* add random num for group and dilation
* change PassCompat
-
Committed by yeliang2258
* add test for conv_elementwise_add2_act_fuse_pass and conv_elementwise_add_act_fuse_pass
* Add conv_eltwiseadd_bn_fuse_pass test and fix test_conv_elementwise_addX_act_fuse_pass
* add tests for conv_act_mkldnn_fuse_pass
* add test for conv_bias_mkldnn_fuse_pass
* update code
* add conv_act_mkldnn_fuse_pass for relu, relu6, swish, leaky_relu
* update test
* update
* update bug
* update
* update pattern_detector
* fix test_conv_eltwiseadd_bn_fuse_pass
* add diff display notest;test=windows_ci_inference
* fix
* remove test_conv_act_mkldnn_fuse_pass.py
* fix
-
Committed by wuhuanzhou
-
Committed by Jiabin Yang
* support eager switch system
* polish code
-
Committed by LJQ❤️
Add elementwise_fmax and elementwise_fmin operators
-
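The names fmax and fmin conventionally denote NaN-ignoring maximum and minimum (as in C's `fmax` or NumPy's `np.fmax`): if one operand is NaN, the other is returned. A minimal scalar sketch of that convention, assuming the new operators follow it; this is not Paddle's implementation:

```python
import math

def fmax(x, y):
    # NaN-ignoring maximum: if one operand is NaN, return the other
    if math.isnan(x):
        return y
    if math.isnan(y):
        return x
    return max(x, y)

def fmin(x, y):
    # NaN-ignoring minimum, symmetric to fmax
    if math.isnan(x):
        return y
    if math.isnan(y):
        return x
    return min(x, y)
```

By contrast, a plain elementwise max would propagate NaN (or pick an arbitrary operand) when one input is NaN.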
Committed by Liu-xiandong
Add key_padding_mask and attn_mask in the sparse_attention API.
1. The key padding mask is a tensor with dimensions [batch_size, seq_len], and the attention mask is a tensor with dimensions [seq_len, seq_len]. The data types of the two masks are consistent with Q, K, and V, which are float32 or float64. If a value in a mask is 0, the corresponding position is masked.
2. The changed files are mainly paddle/fluid/operators/sparse_attention_op.cu and python/paddle/fluid/tests/unittests/test_sparse_attention_op.py. sparse_attention has three parts: sddmm, softmax, and dsd. Adding the mask operation only requires modifying the softmax; it has no effect on the other two parts. In addition, related tests have been added to cover the mask function.
-
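The mask semantics described in the commit above (mask value 0 means the position is excluded from attention, applied only in the softmax step) can be illustrated with a dense NumPy sketch. This is purely illustrative of the shapes and masking rule; it is not the sparse CUDA kernel, and `masked_softmax` is a hypothetical helper name:

```python
import numpy as np

def masked_softmax(scores, key_padding_mask, attn_mask):
    """Apply both masks to attention logits, then softmax.

    scores:           [batch_size, seq_len, seq_len] logits (QK^T)
    key_padding_mask: [batch_size, seq_len]; 0 marks a padded key
    attn_mask:        [seq_len, seq_len]; 0 marks a masked pair
    """
    neg_inf = -1e9  # large negative stand-in for -inf
    # Broadcast the key padding mask over the query dimension.
    masked = np.where(key_padding_mask[:, None, :] == 0, neg_inf, scores)
    masked = np.where(attn_mask[None, :, :] == 0, neg_inf, masked)
    e = np.exp(masked - masked.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

scores = np.zeros((1, 3, 3), dtype=np.float32)
key_padding_mask = np.array([[1, 1, 0]], dtype=np.float32)  # last key is padding
attn_mask = np.ones((3, 3), dtype=np.float32)               # nothing else masked
probs = masked_softmax(scores, key_padding_mask, attn_mask)
# the padded key gets zero weight; the remaining keys renormalize to 0.5 each
```

Since only the softmax sees the masks, the sddmm and dsd stages are unchanged, which matches the commit's note that only the softmax part needed modification.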