- 08 Sep, 2022 16 commits
-
-
Committed by tianshuo78520a
-
Committed by HongyuJia
-
Committed by Charles-hit
* support more op for high level
* add unit test for high level op
* remove unnecessary comments
-
Committed by taixiurong
* add gemm_epilogue
* xpu-paddlepaddle-40 [Task] fused_gemm_epilogue support test=kunlun
-
Committed by pangyoki
-
Committed by TeFeng Chen
-
Committed by Aurelius84
-
Committed by OccupyMars2025
-
Committed by Aurelius84
* [OpAttr]Refine Teller logic if encounter OpDesc with Variable type Attribute
* fix iterator
* fix typo
* fix lambda expr
* fix ptr
-
Committed by Leo Chen
-
Committed by xiongkun
* stop pass filter int32/int16/int64/bool inputs in cond_op
* fix bugs: except block 0, the backward vars and forward vars exist in different blocks.
* fix code by review
-
Committed by LiYuRio
-
Committed by Siming Dai
-
Committed by wenbin
-
Committed by Guanghua Yu
-
Committed by sneaxiy
-
- 07 Sep, 2022 24 commits
-
-
Committed by Chen Weihang
* add save kernel
* add save_sr_kernel
* remove original save_op
* add save gpu kernel
* remove combine kernel
* add port.h include
* add save selected rows test
* remove useless kernel.h
-
Committed by Leo Chen
-
Committed by Vigi Zhang
add running untrusted models in security policy
-
Committed by houj04
* [XPU] update xdnn to 0906. test=kunlun
* [XPU] update xdnn to 0907. test=kunlun
-
Committed by Yuang Liu
-
Committed by Chen Weihang
* fix infermeta bug for vector input and output
* add unittest
-
Committed by WangZhen
* Construct exec and ctx only once in cond op to speed up
* Fix construct function error
-
Committed by Wilber
-
Committed by BiynXu
-
Committed by piotrekobi
* gaussian random
* mkldnn to onednn renaming
* fix merge conflicts
* Migrate reduce_op oneDNN kernels to phi
* Remove unnecessary header
* remove fluid code
* onednn renaming
* Change std::vector<int64_t> to IntArray
* Fix code style
* Move classes from mkldnn_reuse.h to onednn_reuse.h
* Move more functions from mkldnn_helper.h to onednn_helpper.h
* Change MKLDNN to OneDNN in VLOG message
* Implement reviewer suggestions
Co-authored-by: NSilv3S <slawomir.siwek@intel.com>
-
Committed by Chen Weihang
* fix test_custom_relu_op_jit windows error
* polish assert format
-
Committed by WangZhen
Adapt tensor output_size for conv2d_transpose and depthwise_conv2d_transpose
-
Committed by Yuang Liu
-
Committed by Charles-hit
* replace fill_zeros_like op with fill_any_like op in backward.py and tensor.py
* Remove unnecessary comments
* modify create op_desc param
-
Committed by Ruibiao Chen
-
Committed by Wilber
* enable memory optimize when fp16.
* optimize params sync between cpu and gpu.
-
Committed by zyfncg
* clear extra attrs of reduce op in opmaker
* fix reduce_mean
-
Committed by houj04
-
Committed by Yuang Liu
-
Committed by wenbin
* first commit
* conver done
* correct format
* layernorm_shift_partition
* correct convert
* redefine plugin
* runnable
* bug fix
* modify ShiftPartitionPattern
* correct
* add UT
* modify ut
* compile
* modify enforce
* modify UT
-
Committed by caozhou
* support iterable dataset for auto parallel
* add split_data proto
* fix unittest bug
* fix recompute bug
* update cmake
-
Committed by qipengh
* [MLU] fix sync_bn of mlu and add unittests
* [MLU] remove redundant code of pytest
-
Committed by LiYuRio
-
Committed by Weilong Wu
-