- 09 12月, 2022 11 次提交
-
-
由 Sławomir Siwek 提交于
* reshape * typo * remove header
-
由 Yuanle Liu 提交于
* clean ir_pass_manager and fix map_depthwise_conv_to_conv_pass * fix unitest timeout
-
由 HongyuJia 提交于
* support custom_xpu * update cmake to test xpu * support custom_xpu, verify mechanism * fix test_custom_relu_op_xpu_setup.py, test=kunlun * fix FLAGS_init_allocated_mem * cancel TIMEOUT property * reset FLAGS_init_allocated_mem property
-
由 Zhang Jun 提交于
* add prelu
-
由 MarDino 提交于
-
由 huangjiyi 提交于
-
由 Weilong Wu 提交于
* mv fused_bias_dropout_residual_ln to fluid manual dir * rm useless comments
-
由 zhoutianzi666 提交于
* add cutlass act set in conv_elementwise_add_act_fuse_pass
-
由 zyfncg 提交于
* add suppport_tensor for code_gen to static graph * support code-gen for int_array * polish code * fix bug of data_type
-
由 Leo Chen 提交于
* move share_buffer kernel to phi * fix ut * add source file * fix window links
-
由 PuQing 提交于
-
- 08 12月, 2022 12 次提交
-
-
由 jiangcheng 提交于
-
由 limingshu 提交于
-
由 Kai Song 提交于
* fix 'BlasAXPBY unimplemented' error with custom device * fix utils CmakeLists bug
-
由 RichardWooSJTU 提交于
* rewrite delete_weight_deqquant_linear_op_encoder/decoder pass
-
由 Wangzheee 提交于
* general optimization no_varlen embedding layernorm
-
由 huangjiyi 提交于
* move cuda_graph from fluid to phi * move device_memory_aligment from fluid to phi * Revert "move device_memory_aligment from fluid to phi" This reverts commit b92fcd39a0a50fdac13278f49be0237a85f3a13f. * update xpu cmake
-
由 Wilber 提交于
-
由 Ruibiao Chen 提交于
-
由 QingshuChen 提交于
*test=kunlun
-
由 六个骨头 提交于
* add onehot trt converter * add unitest * fix bug * opt code * fix bug * fix depth_tensor * fix unitest * fix bug * fix unitest * fix bug * fix bug * fix bug * fix bug
-
由 Wilber 提交于
-
由 wanghuancoder 提交于
-
- 07 12月, 2022 5 次提交
-
-
由 Sławomir Siwek 提交于
* squeeze kernel * squeze fwd * whitespace
-
由 张春乔 提交于
-
由 feng_shuai 提交于
-
由 Qi Li 提交于
-
由 zhangyikun02 提交于
-
- 06 12月, 2022 7 次提交
-
-
由 zyfncg 提交于
* delete Bias and ResidualData in OpMaker of conv2d * delete extra input of conv3d * refactor pass of conv_bias_fusion * fix mkldnn dependency * fix mkldnn compile * fix test_conv_bias_mkldnn_fuse_pass * police some code * remove useless log * fix analyzer_vit_ocr_tester * fix conv_activation_mkldnn_fuse_pass * fix test_analyzer_ocr * add fused_conv_sig * fix performence regression * fix performance regression
-
由 QingshuChen 提交于
*test=kunlun
-
由 Sławomir Siwek 提交于
* remove fluid code * init * typo * fix merge conflicts
-
由 houj04 提交于
-
由 Zhang Jun 提交于
* add reduce max for trt
-
由 Yuanle Liu 提交于
-
由 ykkk2333 提交于
* add stat tool * add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun * add xpu rmsprop centered, test=kunlun
-
- 05 12月, 2022 5 次提交
-
-
由 limingshu 提交于
* first commit * fix bugs according to ci * add some changes * change file name into function.cu.h * remove const_cast
-
由 zhoutianzi666 提交于
-
由 YuanRisheng 提交于
-
由 Wang Bojun 提交于
* pass * pass * draft version * share mem opt * remove sharemem * add pattern for the case with circle_shift=0 * add UT * pass opt * test_fix * code-commit * code-style * code style * code-style * ut-fix * op teller refine * resolve conflict * adjust position op_teller list and pass order for swin * ut code style update * adjust paddle pass order * refine pass order * refine pass order * refine pass order
-
由 Wilber 提交于
-