- 09 2月, 2023 1 次提交
-
-
由 Wang Bojun 提交于
* trans_layernorm
-
- 08 2月, 2023 1 次提交
-
-
由 Paulina Gacek 提交于
* QuantTranpose pattern is being found by pass * quant + transpose fuse * code style changes * UT written, reorder fixed * Dequantize + transpose2 fuse added * pass name changed * UT added & shift corrected * got rid of redundancy * review changes * AsIntermediate corrected * compat added
-
- 03 2月, 2023 1 次提交
-
-
由 Paulina Gacek 提交于
* conv_bias_mkldnn_fuse_pass_tester rewritten * conv_concat_relu_mkldnn_fuse_pass_tester rewritten * conv_elementwise_add_fuse_pass_tester rewritten * mkldnn changed to onednn * tests added to cmakeLists, style fix * got rid of unnecessary UT, some style changes * changes in naming convention * max_examples reduced * time out added
-
- 31 1月, 2023 1 次提交
-
-
由 wenbin 提交于
* gn_silu * add ut * set TIMEOUT * correct comments * comments * disable windows ut * rename parameter
-
- 17 1月, 2023 1 次提交
-
-
由 Paulina Gacek 提交于
* reshape_transpose_matmul_pass_tester rewritten * matmul_transpose_reshape_pass_tester rewritten * mkldnn to onednn
-
- 16 1月, 2023 1 次提交
-
-
由 Yuanle Liu 提交于
* add trt_support_nhwc_pass
-
- 13 1月, 2023 1 次提交
-
-
由 Wang Bojun 提交于
* add fmha_flashattention oss plugin
-
- 10 1月, 2023 1 次提交
-
-
由 Yuang Liu 提交于
-
- 09 1月, 2023 3 次提交
- 05 1月, 2023 1 次提交
-
-
由 Wilber 提交于
-
- 03 1月, 2023 1 次提交
-
-
由 zhoutianzi666 提交于
* Implement conv2d_fusion NHWC format using CUTLASS * Add unit testing for CUTLASS Conv in inference * Add experimental API for CUTLASS.
-
- 02 1月, 2023 1 次提交
-
-
由 Hulek 提交于
-
- 14 12月, 2022 2 次提交
-
-
由 Yuanle Liu 提交于
-
由 Hulek 提交于
* Deleted mkldnn_inplace_pass code * Fixed error with cmake * Resolve conflicts
-
- 08 12月, 2022 2 次提交
-
-
由 RichardWooSJTU 提交于
* rewrite delete_weight_deqquant_linear_op_encoder/decoder pass
-
由 Wangzheee 提交于
* general optimization no_varlen embedding layernorm
-
- 06 12月, 2022 1 次提交
-
-
由 Yuanle Liu 提交于
-
- 05 12月, 2022 1 次提交
-
-
由 Wang Bojun 提交于
* pass * pass * draft version * share mem opt * remove sharemem * add pattern for the case with circle_shift=0 * add UT * pass opt * test_fix * code-commit * code-style * code style * code-style * ut-fix * op teller refine * resolve conflict * adjust position op_teller list and pass order for swin * ut code style update * adjust paddle pass order * refine pass order * refine pass order * refine pass order
-
- 30 11月, 2022 1 次提交
-
-
由 RichardWooSJTU 提交于
* delete unnecessary shape and slice op Co-authored-by: NYour Name <you@example.com>
-
- 23 11月, 2022 2 次提交
-
-
由 Wilber 提交于
-
由 Yuanle Liu 提交于
-
- 15 11月, 2022 1 次提交
-
-
由 jakpiase 提交于
* optimization for ln * fix * added output to gpd * added formatting * fix
-
- 11 11月, 2022 1 次提交
-
-
由 zyfncg 提交于
* generate static graph code for some ops by yaml * remove deleted files * update cmake * update cmake * udpate cmake
-
- 10 11月, 2022 2 次提交
-
-
由 zhangxin81 提交于
* add roformer pass&&plugin(novarlen)
-
由 RichardWooSJTU 提交于
* add fuse_multi_transformer_layer_pass
-
- 08 11月, 2022 1 次提交
-
-
由 Sławomir Siwek 提交于
* softplus+activation * fc + elementwise_add test refactored * rename MKLDNN to OneDNN * fc+activation tests refactored * remove softplus ut * whitespace * whitespace * codestyle * codestyle * add more cases to fc+act * remove softplus+hard_sigmoid pass * remove softplus + hard_sigmoid UT * add approximate for gelu * swish beta range * new codestyle * reduce number of tests
-
- 07 11月, 2022 1 次提交
-
-
由 Hui Zhang 提交于
* suqeeze2 transpose2 fuse onednn * format * fix output shape * fix conflict * format * format * remove useless * remove log * simply pass * fix comment * fix * fix msg * fix error msg * format
-
- 04 11月, 2022 1 次提交
-
-
由 jakpiase 提交于
* tmp save * minor chnage * CI fix * added FC optimizations * latest update * CI fix * fixed bug with fusing fc
-
- 27 10月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* make all cpp tests dynamic linked to libpaddle.so * add comments * keep old cc_test for some tests * fix some ut * make some ut use cc_test_old * fix typos and fit for win32 * fix lib path * fix some tests * skip lite test * fit for rocm * fit for cinn * fit for mac * fit for win32 * skip inference ut * skip windows * fix coverage
-
- 26 10月, 2022 2 次提交
-
-
由 wenbin 提交于
* prelnlayernorm_shift * add ut * remove paddle_enforce * remove useless * add UT * remove UT * add UT * set timeout
-
由 Sławomir Siwek 提交于
* fc/matmuls + scale fuse pass * remove double-extension * add unit tests * comments from review * codestyle * add pass to int8 list * new codestyle * attr name typo
-
- 20 10月, 2022 1 次提交
-
-
由 Kaipeng Deng 提交于
* add fused_multi_transformer_encoder/decoder pass, run GPT-3 success
-
- 18 10月, 2022 1 次提交
-
-
由 Wang Bojun 提交于
* first version, accuracy corrected * disable debug print * use blockReduceSum in phi * add UT * add opCompat * code style * code refine * bug fix * code refine * test fix * bugfix * codesytle fix * code style * code-style * code-style * code-style
-
- 17 10月, 2022 1 次提交
-
- 16 10月, 2022 1 次提交
-
-
由 ZeKai Zhou 提交于
-
- 27 9月, 2022 1 次提交
-
-
由 Wangzheee 提交于
* [Paddle Inference]support n lookup_tables fuse to embeddinglayernorm(3)
-
- 22 9月, 2022 1 次提交
-
-
由 Leo Chen 提交于
-
- 21 9月, 2022 1 次提交
-
-
由 Zhen Wang 提交于
* use cinn in the paddle inference * fix some cmake errors * Avoid division by zero in the arange_kernel. * Avoid dynamic ops. * Remove some useless codes. * Use OpTransInfo to encapsulate some codes used in the build_cinn_pass.
-