- 08 Dec, 2022 (4 commits)
-
-
Committed by Wangzheee
* general optimization no_varlen embedding layernorm
-
Committed by Wilber
-
Committed by Ruibiao Chen
-
Committed by Wilber
-
- 06 Dec, 2022 (4 commits)
-
-
Committed by zyfncg
* delete Bias and ResidualData in OpMaker of conv2d * delete extra input of conv3d * refactor pass of conv_bias_fusion * fix mkldnn dependency * fix mkldnn compile * fix test_conv_bias_mkldnn_fuse_pass * polish some code * remove useless log * fix analyzer_vit_ocr_tester * fix conv_activation_mkldnn_fuse_pass * fix test_analyzer_ocr * add fused_conv_sig * fix performance regression * fix performance regression
-
Committed by QingshuChen
*test=kunlun
-
Committed by Sławomir Siwek
* remove fluid code * init * typo * fix merge conflicts
-
Committed by Yuanle Liu
-
- 05 Dec, 2022 (5 commits)
-
-
Committed by YuanRisheng
-
Committed by Wang Bojun
* pass * pass * draft version * share mem opt * remove sharemem * add pattern for the case with circle_shift=0 * add UT * pass opt * test_fix * code-commit * code-style * code style * code-style * ut-fix * op teller refine * resolve conflict * adjust position op_teller list and pass order for swin * ut code style update * adjust paddle pass order * refine pass order * refine pass order * refine pass order
-
Committed by Wilber
-
Committed by huangjiyi
-
Committed by xiaoxiaohehe001
-
- 02 Dec, 2022 (1 commit)
-
-
Committed by Shijie
* Fix fuse_gemm_epilogue * update tests * Update CMakeLists.txt * Update CMakeLists.txt * Update CMakeLists.txt * fix random seed * use assert_allclose * Update test_dist_fuse_gemm_epilogue_pass.py * Update cpp_pass.py * Update test_dist_fuse_gemm_epilogue_pass.py * fix codestyle * update seed and atol
-
- 01 Dec, 2022 (5 commits)
-
-
Committed by Wangzheee
* general optimization for no_varlen multihead
-
Committed by Wilber
* update memory_optimize pass
-
Committed by zhoutianzi666
* remove conv_act_set from graph_pattern_detector.cc
-
Committed by minghaoBD
* fuse-mt passes compatible with structured pruning
-
Committed by HongyuJia
* fix typo error * pass CI-coverage
-
- 30 Nov, 2022 (5 commits)
-
-
Committed by Netpunk
* migrate transpose_op.cu.h and gpu_utils.h * format code style * fix some problems * format code * reset tranpose_op.cc * test commit * recover transpose_op.h * delete transpose_op.h * adjust header files order in transpose_op.cc
-
Committed by Yuanle Liu
-
Committed by zhangbo9674
* add fuse act add grad pass * polish code * refine code * add test * refine code
-
Committed by zyfncg
* fix bug of eigen_dependency * fix xpu compile
-
Committed by RichardWooSJTU
* delete unnecessary shape and slice op Co-authored-by: Your Name <you@example.com>
-
- 29 Nov, 2022 (2 commits)
-
-
Committed by kangguangli
* fix:add no support for cuda_arch<700 * replace Executor in while op with InterpreterCore * cache InterpreterCore as the member of WhileOp * fix bug: tensor place changed because of assign op in while loop * refine code * refine code * refine code * hot fix * fix compile * merge develop * follow comments * add log for test * remove LoDTensor * set flag control_flow_use_new_executor false Co-authored-by: fengshuai <fengshuai03@baidu.com> Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
-
Committed by Sławomir Siwek
-
- 28 Nov, 2022 (4 commits)
-
-
Committed by Sławomir Siwek
-
Committed by YuanRisheng
* Fix onednn kernel bugs * fix gpu bugs
-
Committed by zhangbo9674
* add trace mode for interpretercore * fix bug * add a ctrl flag * add record for memcpyd2h * polish code * polish code
-
Committed by Ruibiao Chen
* Remove kSyncRun in StreamAnalyzer * Update code
-
- 26 Nov, 2022 (1 commit)
-
-
Committed by Leo Chen
* hot fix * fix compile * merge develop * follow comments
-
- 25 Nov, 2022 (2 commits)
-
-
Committed by Chitsing KUI
* attr ready * op ip ready * start dynamic * end2end ok * input shape to map, stat by op * layer wip * first version ready * fix proto depds * fix profiler deps * fix flops typo, rm tuple shape
-
Committed by Ruibiao Chen
* Move stream_anayzer to interpreter * Refactor StreamAnalyzer * Refactor RunNextInstructionList * Remove no_data_transform_index * Fix typos * Fix data_transfer OpFuncType error * Add event for depend_op * Update transfer OpFuncType for heter place
-
- 24 Nov, 2022 (3 commits)
-
-
Committed by tianshuo78520a
-
Committed by huangjiyi
* rm dependence to "convert_utils.h" in some files * fix bugs * replace DataType2String with DataTypeToString * replace framework::DataTypeSize with phi::SizeOf * mv convert_function from fluid to phi and rm old map * recommit with pre-commit * replace ProtoVarType with ProtoDataType and update comment. * fix error about include "dnnl.hpp" * revert add dep mkldnn to convert_utils in phi * add mkldnn deps in convert_utils.h in phi * move deps to convert_utils.h in phi
-
Committed by Wangzheee
* optimize token prune
-
- 23 Nov, 2022 (3 commits)
-
-
Committed by Wilber
-
Committed by Yuanle Liu
-
Committed by zhangyikun02
-
- 22 Nov, 2022 (1 commit)
-
-
Committed by HongyuJia
-