- 13 Mar, 2023 13 commits
-
-
Committed by YuanRisheng
* remove transpose infershape
* fix ci bugs
* fix ci bugs
* delete transpose infershape
* fix ci bugs
* fix ci bugs
-
Committed by iSerendipity
* Replace paddle::experimental::DataType as phi::DataType
* restore custom_device.cc
-
Committed by Zhenghai Zhang
* Add output defs for mode kernel
* fix bug
-
Committed by wangxiaoning
* add fp16/bf16
* add grad bf16
* test name
-
Committed by Huang Jiyi
* platform::CUDAPinnedDeviceContext -> phi::GPUPinnedContext
* replace platform::TraceEventCollector
-
Committed by iSerendipity
* remove fused_matmul from list
* add infermeta for fused matmul
-
Committed by Sławomir Siwek
* mkldnn->onednn
* fused softplus op + kernel
* remove extra attributes
* add missing handler
* change var name
-
Committed by Sanbu
* Add output defs for conv3d_coo distribute_fpn_proposals kernel
* fix
-
Committed by junxiu777
* add register of KthvalueKernel add register of KthvalueKernel
* Update kthvalue_kernel.cc
* Update kthvalue_kernel.cu
-
Committed by zhoutianzi666
* use python to generate cutlass code
* refine CommonConvKernelPart1, CommonConvKernelPart2
* remove useless code in generate_cutlass_code.sh
* add more config in conv2d_residual
* CommonCutlassConvKernelPart1 and CommonCutlassConvKernelPart2
* add group conv support in util.cu
* remove .sh
* refine name
* make name goodgit status!
* add fuse_alpha
* make code easy to understand
* mot fopen generate in py
* use python script to generate conv2d,group=1 cutlass code
* use const &
* use const & && use python script to generate conv2d/group=1 code
-
Committed by Little-chick
* Update interpreter_util.cc
* Update auc_kernel.cc
* Update auc_kernel.cu
* Update auc_kernel.cc
* Update auc_kernel.cu
-
Committed by houj04
* [XPU] add increment op.
* fix ci
-
Committed by RuohengMa
* [XPU] add reduce_min_grad XPU kernel
* add unittest for reduce_min xpu op
-
- 12 Mar, 2023 5 commits
-
-
Committed by hellockx
* Update interpreter_util.cc
* Update bincount_kernel.cc
* Update bincount_kernel.cu
-
Committed by hellolllw
* Update interpreter_util.cc
* Update is_empty_kernel.cc
* Update is_empty_kernel.cc
-
Committed by PPPPzhang
-
Committed by Sanbu
* Add output defs for all_close all_raw kernel
* Update interpreter_util.cc
-
Committed by PPPPzhang
-
- 10 Mar, 2023 13 commits
-
-
Committed by HappyHeavyRain
* Add function node in phi_kernel for MKLDNN
* fix the bug in 'BuildInferVarKernelContext'
* add infer_varkernel_utils.cc
* fix the bug:the first two parametes of 'BuildInferVarKernelContext' can't be template variable
* change the code according to first review
* change the code according to first review
* change the mode of paddle_build.sh
* change 'infer_var_kernel_fn_' to 'get_kerneltype_forvar_fn_'
* add the error information
* fix NotFound infomation warning
* fix NotFound infomation warning
* fix NotFound infomation warning
-
Committed by Infinity_lee
-
Committed by 张春乔
* add register of yolo_loss_kernel.cc
* fix something
* Update paddle/phi/kernels/cpu/yolo_loss_kernel.cc
-
Committed by 张春乔
* delete something in interpreter_util.cc
* rollback
* add REGISTER of angle
* bug fix
* fix something
* Update paddle/phi/kernels/gpu/angle_kernel.cu
* Update paddle/phi/kernels/cpu/angle_kernel.cc
* Update paddle/phi/kernels/gpu/angle_kernel.cu
* Update paddle/phi/kernels/cpu/angle_kernel.cc
-
Committed by 陈沧夜
-
Committed by YuanRisheng
This reverts commit 3f4917f6.
-
Committed by shentanyue
-
Committed by ykkk2333
-
Committed by Ryan
* add numel INT64 register
* del numl
* add svd UNDEFINED register
* remove svd register
-
Committed by Chitsing KUI
-
Committed by iSerendipity
* remove operator.h from blas.h and remove paddle::framework::ExecutionContext
* remove the deps for GetBlas(exe_ctx)
* fix error
-
Committed by zhangyikun02
-
Committed by mayang002
* [xpu-ernie3] support fp16 for full_like/stack/where xpu kernels
* [xpu-ernie3] support fp16 for full_like/stack/where xpu kernels
-
- 09 Mar, 2023 9 commits
-
-
Committed by yangjianfengo1
* AMP arange & Test
* fix arange bfloat16 dtype
* update for review
* update for review2
* fix tile
* update
* fix ci
* r
* f
* fix windows ci
* update bfloat data
* fix bloat16 input
* add print
* Update test_where_op.py
* update kernel
* del repeat
* update review
-
Committed by Yiqun Liu
* Add the collect and print of kernel registry infomation in op benchmark ci.
* Little change to test the ci.
* Remove the reduntant function.
* Move the collect of kernel registry information to the end of ci.
-
Committed by will-jl944
* add softplus double grad
* use constant method
-
Committed by zyfncg
* register custom kernel for all type of custom device
* fix bug
* fix GetKernelInputArgDef
* fix amp bug
* fix TransToPhiPlace
* adapt interpreter_util
-
Committed by iSerendipity
* add output defs for fused_adam kernel
* complete the oters defs for cpu and gpu
* remove register for param_out
-
Committed by iSerendipity
* Add output defs for sgd kernel
* add datatype infer for sgd
* add infer logic
-
Committed by iSerendipity
* add output defs for edit_distance kernel
* change seqnum as output0
-
Committed by Wang Xin
-
Committed by 张春乔
* add REGISTER of float32 in accuracy
* fix something
-