- 21 12月, 2022 1 次提交
-
-
由 zyfncg 提交于
* refactor conv_activation_mkldnn_fuse_pass * refactor conv_affine_channel_mkldnn_fuse_pass * fix conv_activation_mkldnn_fuse_pass * fix mkldnn unittest * refactor int8_scale_calculation_mkldnn_pass and params_quantization_mkldnn_pass * refactor conv_elementwise_add_mkldnn_fuse_pass * fix quant * refactor conv_bn_fuse_pass * fix conv_bn_fuse_pass * refactor depthwise_conv_bn_fuse_pass * fix unittest * fix conv_bn_fuse_pass * remove redundant conv2d in params_quantization_mkldnn_pass * fix params_quantization_mkldnn_pass_tester
-
- 19 12月, 2022 4 次提交
- 15 12月, 2022 1 次提交
-
-
由 Sławomir Siwek 提交于
* fix wrong handler name * mkldnn_engine -> onednn_engine * remove fluid/errors.h imports * remove fluid/enforce.h imports * remove note and unnecessary import * remove fluid/pretty_log.h imports * remove fluid/place.h imports * remove fluid/data_layout_transform.h imports * remove fluid/device_context.h imports * remove mkldnn_helper code * remove fluid/mkldnn_reuse.h imports * pretty_log import
-
- 14 12月, 2022 3 次提交
-
-
由 Yuanle Liu 提交于
-
由 Hulek 提交于
* Deleted mkldnn_inplace_pass code * Fixed error with cmake * Resolve conflicts
-
由 Yiqun Liu 提交于
-
- 13 12月, 2022 1 次提交
-
-
由 joanna.wozna.intel 提交于
-
- 12 12月, 2022 4 次提交
-
-
由 RichardWooSJTU 提交于
* fit paddle.matmul in fleetx.gpt
-
由 YuanRisheng 提交于
* add new tensor * fix windows compile bugs * fix ci bugs * fix ci bugs * fix ci bugs * perfect according comment * fix ci compile bugs * add raw tensor * fix ci bugs * modify code by comment * delete String
-
由 Yiqun Liu 提交于
* Enable to print device info. * Enhance the nan and inf checking for cpu. * Implement a common print function. * Unify the check of complex numbers. * Rewrite the omp method. * Count and print the number of nan and inf. * Change the print content. * Add unittest.
-
由 Ruibiao Chen 提交于
* Add UT * Support cross-step stream synchronization for standalone executor * Fix typos * Fix typos * Update UTs
-
- 09 12月, 2022 5 次提交
-
-
由 Yuanle Liu 提交于
* clean ir_pass_manager and fix map_depthwise_conv_to_conv_pass * fix unitest timeout
-
由 HongyuJia 提交于
* support custom_xpu * update cmake to test xpu * support custom_xpu, verify mechanism * fix test_custom_relu_op_xpu_setup.py, test=kunlun * fix FLAGS_init_allocated_mem * cancel TIMEOUT property * reset FLAGS_init_allocated_mem property
-
由 zhoutianzi666 提交于
* add cutlass act set in conv_elementwise_add_act_fuse_pass
-
由 Leo Chen 提交于
* move share_buffer kernel to phi * fix ut * add source file * fix window links
-
由 PuQing 提交于
-
- 08 12月, 2022 6 次提交
-
-
由 jiangcheng 提交于
-
由 RichardWooSJTU 提交于
* rewrite delete_weight_deqquant_linear_op_encoder/decoder pass
-
由 Wangzheee 提交于
* general optimization no_varlen embedding layernorm
-
由 Wilber 提交于
-
由 Ruibiao Chen 提交于
-
由 Wilber 提交于
-
- 06 12月, 2022 4 次提交
-
-
由 zyfncg 提交于
* delete Bias and ResidualData in OpMaker of conv2d * delete extra input of conv3d * refactor pass of conv_bias_fusion * fix mkldnn dependency * fix mkldnn compile * fix test_conv_bias_mkldnn_fuse_pass * police some code * remove useless log * fix analyzer_vit_ocr_tester * fix conv_activation_mkldnn_fuse_pass * fix test_analyzer_ocr * add fused_conv_sig * fix performence regression * fix performance regression
-
由 QingshuChen 提交于
*test=kunlun
-
由 Sławomir Siwek 提交于
* remove fluid code * init * typo * fix merge conflicts
-
由 Yuanle Liu 提交于
-
- 05 12月, 2022 5 次提交
-
-
由 YuanRisheng 提交于
-
由 Wang Bojun 提交于
* pass * pass * draft version * share mem opt * remove sharemem * add pattern for the case with circle_shift=0 * add UT * pass opt * test_fix * code-commit * code-style * code style * code-style * ut-fix * op teller refine * resolve conflict * adjust position op_teller list and pass order for swin * ut code style update * adjust paddle pass order * refine pass order * refine pass order * refine pass order
-
由 Wilber 提交于
-
由 huangjiyi 提交于
-
由 xiaoxiaohehe001 提交于
-
- 02 12月, 2022 1 次提交
-
-
由 Shijie 提交于
* Fix fuse_gemm_epilogue * update tests * Update CMakeLists.txt * Update CMakeLists.txt * Update CMakeLists.txt * fix random seed * use assert_allclose * Update test_dist_fuse_gemm_epilogue_pass.py * Update cpp_pass.py * Update test_dist_fuse_gemm_epilogue_pass.py * fix codestyle * update seed and atol
-
- 01 12月, 2022 5 次提交
-
-
由 Wangzheee 提交于
* general optimization for no_varlen multihead
-
由 Wilber 提交于
* update memory_optimize pass
-
由 zhoutianzi666 提交于
* remove conv_act_set from graph_pattern_detector.cc
-
由 minghaoBD 提交于
* fuse-mt passes compatible with structured pruning
-
由 HongyuJia 提交于
* fix typo error * pass CI-coverage
-