- 14 4月, 2023 1 次提交
-
-
由 zhupengyang 提交于
-
- 13 4月, 2023 2 次提交
-
-
由 Wangzheee 提交于
* Paddle-Trt: Replace fc mul matmul matmul_v2 with matrix_multiply
-
由 zhupengyang 提交于
-
- 12 4月, 2023 1 次提交
-
-
由 Yuanle Liu 提交于
-
- 10 4月, 2023 1 次提交
-
-
由 xiaoxiaohehe001 提交于
* Support two inputs of multihead attention named qk_multihead
-
- 03 4月, 2023 1 次提交
-
-
由 wz1qqx 提交于
-
- 31 3月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* remove distribute * fix py3 bugs * fix gpu-ps bugs * fix compile bugs * fix unittest bugs
-
- 30 3月, 2023 2 次提交
-
-
由 zhupengyang 提交于
-
由 zhupengyang 提交于
-
- 29 3月, 2023 2 次提交
-
-
由 zhupengyang 提交于
-
由 yuehuayingxueluo 提交于
* add fuse adamw pass * fix some bugs * fix CIbug * change chunk_size * fix CI bug * rm test_fused_adam_op.py * fix CI bugs * fix fuse_adamw_op_pass.cc * change code style * fix CI bug * fix ut bug and use_adamw_op_pass.cc * fix test_fuse_adamw_pass.py * fix CI bug * remove fluid * fix ci bug * fix CI bug
-
- 22 3月, 2023 2 次提交
-
-
由 Ghost Screaming 提交于
* Add fused_feed_forward pass for semi-automatic static graph training. * Add fused_feedforward property in parallel_executor.cc * Polish code. * Polish fused feed_forward pass code. Support use_dropout1 and use_dropout2 option. * Support model parallel in fused_feedforward pass.
-
由 zhupengyang 提交于
-
- 20 3月, 2023 1 次提交
-
-
由 mayang002 提交于
-
- 16 3月, 2023 1 次提交
-
-
由 wenbin 提交于
* split pass * fix compile * fix ut * more time * modify ut * reduce dim * fix compile * reshape weight * tensor * remove enforce * static shape ut * batchsize * reorder pass * minus test cases * windows timeout * windows time out * remove test for windows * correct * sssss * xxx
-
- 13 3月, 2023 1 次提交
-
-
由 Sławomir Siwek 提交于
* mkldnn->onednn * fused softplus op + kernel * remove extra attributes * add missing handler * change var name
-
- 07 3月, 2023 1 次提交
-
-
由 zhupengyang 提交于
-
- 06 3月, 2023 1 次提交
-
-
由 Paulina Gacek 提交于
* first approach * test finished * cpp test deleted * CmakeList corrected * multi_gru_seq_fuse_pass rewritten * dummy cout deleted * review changes * timeout extended
-
- 01 3月, 2023 1 次提交
-
-
由 zhupengyang 提交于
-
- 28 2月, 2023 2 次提交
-
-
由 Hulek 提交于
* Added file * Tests separated and rewritten, fixed fc_lstm_fuse_pass * Resolve conflicts
-
由 zhupengyang 提交于
-
- 23 2月, 2023 1 次提交
-
-
由 csy0225 提交于
-
- 22 2月, 2023 1 次提交
-
-
由 zhupengyang 提交于
-
- 17 2月, 2023 1 次提交
-
-
由 zhupengyang 提交于
[XPU] add multi_encoder_xpu_slice_fuse_pass, generate_sequence_xpu_fuse_pass, generate_sequence_xpu kernel (#50570)
-
- 16 2月, 2023 2 次提交
-
-
由 Hulek 提交于
* New onednn test * checkopoint * added new test, fixed issue with onednn bias * fix bias check * remove prints, refactor code * delete old test * update python tests cmake * Delete depracated conv bias * Delete outdated bias from convolution test
-
由 zhupengyang 提交于
-
- 15 2月, 2023 2 次提交
-
-
由 Hulek 提交于
* Done * Deleted old python test, fixed new python test, changed names in parallel_UT * Revert parallel UT changes * Revert parallel UT changes v2 * Review fixes and simplification of conv output shape calculation, disabled sqrt from conv_act_duse_pass * delete sqrt from possible activations from conv_concat_relu test * review refactor * merge main * delete sqrt from list of compatible activations * Test with no outdated inputs
-
由 YuanRisheng 提交于
* move profiler * add file * fix mac compile bugs * fix ci bugs * fix mac bugs * fix ci bugs * fix compile bugs * perfect code according comment
-
- 11 2月, 2023 1 次提交
-
-
由 Wang Bojun 提交于
* eleadd_trans first version log fix * refine code for linear format, add pass check * linear format refine and ut fix * fix ut * windows ut * windows ut 2 * move tensorMeta and alloc to configure
-
- 10 2月, 2023 1 次提交
-
-
由 zhupengyang 提交于
-
- 09 2月, 2023 1 次提交
-
-
由 Wang Bojun 提交于
* trans_layernorm
-
- 08 2月, 2023 1 次提交
-
-
由 Paulina Gacek 提交于
* QuantTranpose pattern is being found by pass * quant + transpose fuse * code style changes * UT written, reorder fixed * Dequantize + transpose2 fuse added * pass name changed * UT added & shift corrected * got rid of redundancy * review changes * AsIntermediate corrected * compat added
-
- 03 2月, 2023 1 次提交
-
-
由 Paulina Gacek 提交于
* conv_bias_mkldnn_fuse_pass_tester rewritten * conv_concat_relu_mkldnn_fuse_pass_tester rewritten * conv_elementwise_add_fuse_pass_tester rewritten * mkldnn changed to onednn * tests added to cmakeLists, style fix * got rid of unnecessary UT, some style changes * changes in naming convention * max_examples reduced * time out added
-
- 31 1月, 2023 1 次提交
-
-
由 wenbin 提交于
* gn_silu * add ut * set TIMEOUT * correct comments * comments * disable windows ut * rename parameter
-
- 17 1月, 2023 1 次提交
-
-
由 Paulina Gacek 提交于
* reshape_transpose_matmul_pass_tester rewritten * matmul_transpose_reshape_pass_tester rewritten * mkldnn to onednn
-
- 16 1月, 2023 1 次提交
-
-
由 Yuanle Liu 提交于
* add trt_support_nhwc_pass
-
- 13 1月, 2023 1 次提交
-
-
由 Wang Bojun 提交于
* add fmha_flashattention oss plugin
-
- 10 1月, 2023 1 次提交
-
-
由 Yuang Liu 提交于
-
- 09 1月, 2023 2 次提交