- 20 10月, 2022 1 次提交
-
-
由 Sylwester Fraczek 提交于
-
- 19 10月, 2022 2 次提交
-
-
由 Ruibiao Chen 提交于
* Support stream overlap for c_allreduce_sum * Test CI * Add notes * Add SingleStreamGuard for BuildOpFuncList
-
由 WangZhen 提交于
* Fix recurrent op eager deletion pass error in dy2st * Polish code * Refine error message
-
- 18 10月, 2022 2 次提交
-
-
由 Wang Bojun 提交于
* first version, accuracy corrected * disable debug print * use blockReduceSum in phi * add UT * add opCompat * code style * code refine * bug fix * code refine * test fix * bugfix * codesytle fix * code style * code-style * code-style * code-style
-
由 Sławomir Siwek 提交于
* git * style * leave default relu in kernel * style * cleanup FCMKLDNN pattern * merge conflicts * update develop * update develop * add const * rename to oneDNN and adjust attributes * whitespace
-
- 17 10月, 2022 4 次提交
-
-
由 Wang Bojun 提交于
* first version of ln_s_p with s>0 * refine and UT * pass opt draft * pass opt * code refine * code-style * bug fix * fix ci test * code style
-
由 jakpiase 提交于
-
由 pangyoki 提交于
* skip ReplaceAllReduceOp in GraphtoBlock when nccl_ctxs_ is nullptr * update ut * test_dist_allreduce_op failed * fix test_dist_allreduce_op * add ut * fix nccl cpu compile * fix
- 16 10月, 2022 1 次提交
-
-
由 ZeKai Zhou 提交于
-
- 13 10月, 2022 2 次提交
-
-
由 yeliang2258 提交于
* fix immutable op quantize bugs * fix * fix build bug * fix test * notest,test=inference * fix ppyoloe acc drop bugs * fix test * fix test * add test * fix * fix * fix test * fix refined name bug * fix test * bias fix * fix matmul weight dequant bug * re-ci * fix tester * fix test * fix tester * update weight dequantize func * update code * update test for converage * update test * update cmake * update cmakelist * update code * rerun ci * remove useless code
-
由 joanna.wozna.intel 提交于
* Add unsigned int8 propagation * Add or modify unit tests * Correct concat scale checking * Apply review suggestions * Corrections
-
- 12 10月, 2022 1 次提交
-
-
由 weishengying 提交于
-
- 11 10月, 2022 2 次提交
-
-
由 Sylwester Fraczek 提交于
* add logging to fc residual fuse pass * expand logging message to fc residual fuse pass * Add test for fc residual not fusing with activation
-
由 Chen Weihang 提交于
* remove using lodtensor part1 * polish history code format
-
- 10 10月, 2022 3 次提交
-
-
由 Sylwester Fraczek 提交于
* fix fc pattern remove use_bias add residual input switch fix references to pattern * review fixes
-
由 Sylwester Fraczek 提交于
* Add methods that find input or output name by var name * kind of bugfix - initialize variables * ci fix * review fixed
-
由 zhoutianzi666 提交于
-
- 30 9月, 2022 2 次提交
-
-
由 Allen Guo 提交于
* paddle-inference support custom-ops Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> * fix tolower Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
-
由 HongyuJia 提交于
-
- 29 9月, 2022 1 次提交
-
-
由 yeliang2258 提交于
* remove calibration file path * remove useless code
-
- 28 9月, 2022 3 次提交
-
-
由 Chen Weihang 提交于
* remove needless using tensor * remove needless using tensor * resolve conflict * replace tensor using * fix format error * revert needless changing * fix rocm and npu compile error * fix cinn compile error * fix format error * fix mkldnn format error * fix mkldnn format error * fix cinn compile error * fix cinn compile error * fix cinn compile error * resolve conflict
-
由 Ruibiao Chen 提交于
* Convert GradMergeAllReduceOpHandle in GraphToBlock * Set FLAGS_CONVERT_GRAPH_TO_PROGRAM to False
-
由 Leo Chen 提交于
-
- 27 9月, 2022 1 次提交
-
-
由 Wangzheee 提交于
* [Paddle Inference]support n lookup_tables fuse to embeddinglayernorm(3)
-
- 22 9月, 2022 2 次提交
-
-
由 Sławomir Siwek 提交于
* gaussian random * mkldnn to onednn renaming * fix merge conflicts * remove fluid code * onednn renaming * gelu fwd * sort activations * gelu gradient * remove unused macros * merge conflicts * fix merge conflicts * remove extra contraint from gelu op
-
由 Leo Chen 提交于
-
- 21 9月, 2022 2 次提交
-
-
由 Zhen Wang 提交于
* use cinn in the paddle inference * fix some cmake errors * Avoid division by zero in the arange_kernel. * Avoid dynamic ops. * Remove some useless codes. * Use OpTransInfo to encapsulate some codes used in the build_cinn_pass.
-
由 wenbin 提交于
* residual_no_bias * comments * more ut * fix input
-
- 20 9月, 2022 1 次提交
-
-
由 zhoutianzi666 提交于
* fix preln_residual_bias_fuse_pass bug in TNT_small model
-
- 19 9月, 2022 1 次提交
-
-
由 zyfncg 提交于
* fix wrong eigen header include * fix complie bug * fix nan_inf_utils_detail * fix resource_manager * fix conv_miopen_helper
-
- 13 9月, 2022 1 次提交
-
-
由 JingZhuangzhuang 提交于
* add softmax infer kernel
-
- 09 9月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* add operator<< for BuildStrategy * add fake_coalesce * fit allreduce mode for new_exe * remove dubeg code * follow comments
-
- 07 9月, 2022 1 次提交
-
-
由 wenbin 提交于
* first commit * conver done * correct format * layernorm_shift_partition * correct convert * redefine plugin * runable * bug fix * modify ShiftPartitionPattern * correct * add UT * modify ut * compile * modify enforce * modify UT
-
- 06 9月, 2022 1 次提交
-
-
由 zhoutianzi666 提交于
[Paddle Inference] fix bugs in quant_conv2d_dequant_fuse_pass when weight is shared between ops (#45719) * fix_old_format * fix bug in quant_conv2d_dequant * fix bug in quant_conv2d_dequant
-
- 05 9月, 2022 2 次提交
-
-
由 yeliang2258 提交于
* support onnx format quantized model * update code * add test * add test * fix * fix test * fix cmake * update code * change scale file path to calibration file path * update code * update code * fix build bug * fix build bugs * fix * fix
-
由 feng_shuai 提交于
* fix: vit attention pass * reflash CI
-
- 31 8月, 2022 1 次提交
-
-
由 Hui Zhang 提交于
* add del dropout op pass to jit pe enigne * add delete dropout test
-
- 30 8月, 2022 2 次提交
-
-
由 zyfncg 提交于
* add runtime config in phi * add runtime attr for op desc and op * fix no proto error * adjust opdesc set_attr impl * try to remove conv_op extra attrs * add init runtime attr map * change extra header path * fix runtime_attr * fix trace_op * fix bug of pass * fix merge conflict * fix dygraph attrs * fix bug of pass * fix dygraph bug * fix unittest module * delete extra attr default * fix dropout kernel * polish code * fix extra output of instance_norm * fix merge confilct * fix op_desc bug * add extra attr in yaml for conv3d_transpose * don't remove extra input and output * fix save_inference_model * fix bug of batch_norm * revert some change * polish log * polish code * add code comment Co-authored-by: NChen Weihang <chenweihang@baidu.com>
-
由 zhoutianzi666 提交于
add constant folding pass, for some model,it will get less latency;
-