- 22 9月, 2022 8 次提交
-
-
由 Yuanle Liu 提交于
-
由 Siming Dai 提交于
-
由 Hui Zhang 提交于
* fix sub sign reverse for mkldnn * refactor code as comment * remove useless * format code
-
由 zhoutianzi666 提交于
* fix beta bug in fill_constant_batch_size_like
-
由 Haohongxiang 提交于
* fix bugs of mp * fix bugs of mp * update * update * fix bug
-
由 carryyu 提交于
* Optimize topk's performance when k is small and input_width is large * 修改blockdim设置逻辑 * Update top_k_function_cuda.h
-
由 zyfncg 提交于
-
由 Chenxiao Niu 提交于
-
- 21 9月, 2022 17 次提交
-
-
由 ccrrong 提交于
* add fp16 support * update * update half * code format * fix unittest * fix rocm compile error * code format * code format * fix rocm compile error * fix rocm compile error
-
由 Piotr Paturej 提交于
-
由 Chen Weihang 提交于
-
由 Zhen Wang 提交于
* use cinn in the paddle inference * fix some cmake errors * Avoid division by zero in the arange_kernel. * Avoid dynamic ops. * Remove some useless codes. * Use OpTransInfo to encapsulate some codes used in the build_cinn_pass.
-
由 RichardWooSJTU 提交于
-
由 Xinger 提交于
-
由 ykkk2333 提交于
* migrate sigmoid with cross entropy, and tile xpu kernels to phi, test=kunlun * migrate add_n kernep to phi, test=kunlun
-
由 wuhuachaocoding 提交于
-
由 wenbin 提交于
* residual_no_bias * comments * more ut * fix input
-
由 Piotr Paturej 提交于
[PHI] Migrate concat+grad, expand+grad, fill_constant, nearest_interp and bilinear_interp oneDNN kernels (#45863) * Migrate concat+grad, expand+grad, fill_constant, nearest_interp_v2 and bilinear_interp_v2 oneDNN kernels to PHI * Remove old namespace variable * Fix invalid out dims error * Add mutable_data method to concat output * Add check for -1 dim before computing out_dims * Capitalize oneDNNGetDataType function name * Change fill_constant kernel to correct PHI kernel * Attempt to fix dims error * Fix fill_constant (full) kernel
-
由 zhoutianzi666 提交于
* Remove trt_reshape2_matmul_fuse_pass
-
由 Wilber 提交于
-
由 jiahongyu 提交于
-
由 Leo Chen 提交于
-
由 Nyakku Shigure 提交于
* [CodeStyle] remove tabs in cpp files * update comment format
-
由 Weilong Wu 提交于
* [Eager] polish clone interface * rm clone in python, add clone in eager_method.cc
-
由 YangZhou 提交于
* unexpose audio ParameterError * clean audio utils api
-
- 20 9月, 2022 15 次提交
-
-
由 Wangzheee 提交于
* [Paddle Inference]support n lookup_tables fuse to embeddinglayernorm(1): add some funtion for embedding
-
由 Sławomir Siwek 提交于
* init * remove softmaxop * merge dev * correct dir * style
-
由 Piotr Paturej 提交于
* Convert split, pad and pad3d kernels * Convert slice+grad oneDNN fluid kernels to PHI * change out->mutable_data to dev_ctx.Alloc
-
由 jakpiase 提交于
-
由 Paulina Gacek 提交于
* First approach * Shape kernel corrected * Compilation error fixed * Resize corrected * Registered types added * Mistake corrected & types added * sum kernel deleted
-
由 zhoutianzi666 提交于
* Support matmul_v2 in PaddleTensorRT
-
由 Wangzheee 提交于
* [Paddle Inference]support n lookup_tables fuse to embeddinglayernorm(2)
-
由 zhoutianzi666 提交于
* Move ITensor construction for Weight (persistable variable) from OpConvert to TensorRTEngine.
-
由 YuanRisheng 提交于
* move_merge_selected_rows * update code
-
由 Nyakku Shigure 提交于
-
由 WangZhen 提交于
* [JitLayer]Erase out vars to avoid data rewrittinig * Fix code comments
-
由 weishengying 提交于
-
由 jiahongyu 提交于
-
由 jiahongyu 提交于
-
由 ronnywang 提交于
* [NPU] fix run_program_op, test=develop * [NPU] fix matmul_v2 in cann502, test=develop
-