- 07 Sep, 2022 11 commits
-
-
Committed by zyfncg
* clear extra attrs of reduce op in opmaker
* fix reduce_mean
-
Committed by houj04
-
Committed by wenbin
* first commit
* convert done
* correct format
* layernorm_shift_partition
* correct convert
* redefine plugin
* runnable
* bug fix
* modify ShiftPartitionPattern
* correct
* add UT
* modify ut
* compile
* modify enforce
* modify UT
-
Committed by caozhou
* support iterable dataset for auto parallel
* add split_data proto
* fix unittest bug
* fix recompute bug
* update cmake
-
Committed by qipengh
* [MLU] fix sync_bn of mlu and add unittests
* [MLU] remove redundant code of pytest
-
Committed by LiYuRio
-
Committed by limingshu
* first commit
* merged with develop
* merged with develop
* fix merge sequential one dims bugs
-
Committed by Sławomir Siwek
* scale kernel
* endline
* add inplace
* fix merge conflicts
* Merge conflicts
-
Committed by xiongkun
* add compile-time infermeta logic for stack infermeta.
* add unittest for stack infermeta where -1 exists in shapes.
* remove backward changes.
-
Committed by zhangkaihuo
-
Committed by sneaxiy
* fix amp kernel
* update to remove PADDLE_WITH_XPU macro
-
- 06 Sep, 2022 26 commits
-
-
Committed by YuanRisheng
* add tensor array
* fix ci bugs
* fix ci bugs
* fix ci bugs
* fix ci bugs
* update by comment
* update code
-
Committed by danleifeng
-
Committed by Hui Zhang
* using mkldnn
* using with mkldnn macro
* fix use mkldnn
-
Committed by Wilber
-
Committed by jakpiase
* concat workaround
* CI rerun
-
Committed by ykkk2333
-
Committed by Ruibiao Chen
* Enable startup program for standalone executor
* Disable test_py_reader_using_executor
* Fix test_parallel_executor_mnist
* Fix CI errors
* Fix CI errors
-
Committed by chenjian
* update protobuf format
* fix protobuf content
* fix file mode
* fix compiling error when gpu not exists
* fix compiling error when gpu not exists
* fix compiling error when gpu not exists
* fix compiling error when gpu not exists
* support rocm
-
Committed by zhoutianzi666
[Paddle Inference] fix bugs in quant_conv2d_dequant_fuse_pass when weight is shared between ops (#45719)
* fix_old_format
* fix bug in quant_conv2d_dequant
* fix bug in quant_conv2d_dequant
-
Committed by ykkk2333
-
Committed by OccupyMars2025
-
Committed by YuanRisheng
-
Committed by Weilong Wu
[Eager, Performance optimization] reduce_all interface move reduce_all flag from python to C++ (#45744)
* [Eager, Performance optimization] move reduce_all flag from python to c++
* polish reduce_all
* fix ci error
* fix errors
-
Committed by niuliling123
-
Committed by whs
-
Committed by Weilong Wu
* [Eager, Performance optimization] reduce_max / min polish
* polish reduce_max / min
* update min/max kernel reduce_all logic
* fix a mistake
* fix ci errors
* fix errors
-
Committed by xiaohemaikoo
-
Committed by zyfncg
* set use_cudnn=true for conv2d
* clear opmaker of matmul_v2
* fix bug of set_attr
* add extra attr checker in infer_shape
-
Committed by zyfncg
-
Committed by LielinJiang
* add silu converter
-
Committed by LielinJiang
* fix grad error of groupnorm op when cuda version==11.7
-
Committed by Wangzheee
* remove int8 fallback
-
Committed by Chen Weihang
-
Committed by Chen Weihang
-
Committed by Wen Sun
-
Committed by houj04
-
- 05 Sep, 2022 3 commits
-
-
Committed by piotrekobi
* gaussian random
* mkldnn to onednn renaming
* fix merge conflicts
* remove fluid code
* onednn renaming
* Move classes from mkldnn_reuse.h to onednn_reuse.h
* Move more functions from mkldnn_helper.h to onednn_helpper.h
* Change MKLDNN to OneDNN in VLOG message

Co-authored-by: NSilv3S <slawomir.siwek@intel.com>
-
Committed by yeliang2258
* support onnx format quantized model
* update code
* add test
* add test
* fix
* fix test
* fix cmake
* update code
* change scale file path to calibration file path
* update code
* update code
* fix build bug
* fix build bugs
* fix
* fix
-
Committed by kangguangli
* fix compile error in gcc540
-