- 14 Mar, 2023 8 commits
-
-
Committed by engineer1109
-
Committed by limingshu
* first commit * fix code bugs in for_loop * fix bugs in cuLoadAddStridedInputs. * optimization for LayerNormBackwardComputeGradInput * add unitest for validating the optimization * fix windows ci error
-
Committed by wangxiaoning
-
Committed by Infinity_lee
-
Committed by zhiboniu
* add fp16 and bf16 test * update
-
Committed by Huang Jiyi
* remove device_context include * fix bug * fix bug
-
Committed by denglianbin
* finish task * add static_check and fix unittest. * add int32/64 * Update test_cross_op.py --------- Co-authored-by: Zhang Ting <Douyaer2020@qq.com>
-
Committed by Infinity_lee
-
- 13 Mar, 2023 14 commits
-
-
Committed by TaoTao Li
* add all_gather and fix conflicts * fix code format * fix ut * fix broadcast ut
-
Committed by YuanRisheng
* remove transpose infershape * fix ci bugs * fix ci bugs * delete transpose infershape * fix ci bugs * fix ci bugs
-
Committed by iSerendipity
* Replace paddle::experimental::DataType as phi::DataType * restore custom_device.cc
-
Committed by Zhenghai Zhang
* Add output defs for mode kernel * fix bug
-
Committed by wangxiaoning
* add fp16/bf16 * add grad bf16 * test name
-
Committed by Huang Jiyi
* platform::CUDAPinnedDeviceContext -> phi::GPUPinnedContext * replace platform::TraceEventCollector
-
Committed by iSerendipity
* remove fused_matmul from list * add infermeta for fused matmul
-
Committed by Sławomir Siwek
* mkldnn->onednn * fused softplus op + kernel * remove extra attributes * add missing handler * change var name
-
Committed by Sanbu
* Add output defs for conv3d_coo distribute_fpn_proposals kernel * fix
-
Committed by junxiu777
* add register of KthvalueKernel add register of KthvalueKernel * Update kthvalue_kernel.cc * Update kthvalue_kernel.cu
-
Committed by zhoutianzi666
* use python to generate cutlass code * refine CommonConvKernelPart1, CommonConvKernelPart2 * remove useless code in generate_cutlass_code.sh * add more config in conv2d_residual * CommonCutlassConvKernelPart1 and CommonCutlassConvKernelPart2 * add group conv support in util.cu * remove .sh * refine name * make name goodgit status! * add fuse_alpha * make code easy to understand * mot fopen generate in py * use python script to generate conv2d,group=1 cutlass code * use const & * use const & && use python script to generate conv2d/group=1 code
-
Committed by Little-chick
* Update interpreter_util.cc * Update auc_kernel.cc * Update auc_kernel.cu * Update auc_kernel.cc * Update auc_kernel.cu
-
Committed by houj04
* [XPU] add increment op. * fix ci
-
Committed by RuohengMa
* [XPU] add reduce_min_grad XPU kernel * add unittest for reduce_min xpu op
-
- 12 Mar, 2023 5 commits
-
-
Committed by hellockx
* Update interpreter_util.cc * Update bincount_kernel.cc * Update bincount_kernel.cu
-
Committed by hellolllw
* Update interpreter_util.cc * Update is_empty_kernel.cc * Update is_empty_kernel.cc
-
Committed by PPPPzhang
-
Committed by Sanbu
* Add output defs for all_close all_raw kernel * Update interpreter_util.cc
-
Committed by PPPPzhang
-
- 10 Mar, 2023 13 commits
-
-
Committed by HappyHeavyRain
* Add function node in phi_kernel for MKLDNN * fix the bug in 'BuildInferVarKernelContext' * add infer_varkernel_utils.cc * fix the bug: the first two parameters of 'BuildInferVarKernelContext' can't be template variable * change the code according to first review * change the code according to first review * change the mode of paddle_build.sh * change 'infer_var_kernel_fn_' to 'get_kerneltype_forvar_fn_' * add the error information * fix NotFound information warning * fix NotFound information warning * fix NotFound information warning
-
Committed by Infinity_lee
-
Committed by 张春乔
* add register of yolo_loss_kernel.cc * fix something * Update paddle/phi/kernels/cpu/yolo_loss_kernel.cc
-
Committed by 张春乔
* delete something in interpreter_util.cc * rollback * add REGISTER of angle * bug fix * fix something * Update paddle/phi/kernels/gpu/angle_kernel.cu * Update paddle/phi/kernels/cpu/angle_kernel.cc * Update paddle/phi/kernels/gpu/angle_kernel.cu * Update paddle/phi/kernels/cpu/angle_kernel.cc
-
Committed by 陈沧夜
-
Committed by YuanRisheng
This reverts commit 3f4917f6.
-
Committed by shentanyue
-
Committed by ykkk2333
-
Committed by Ryan
* add numel INT64 register * del numl * add svd UNDEFINED register * remove svd register
-
Committed by Chitsing KUI
-
Committed by iSerendipity
* remove operator.h from blas.h and remove paddle::framework::ExecutionContext * remove the deps for GetBlas(exe_ctx) * fix error
-
Committed by zhangyikun02
-
Committed by mayang002
* [xpu-ernie3] support fp16 for full_like/stack/where xpu kernels * [xpu-ernie3] support fp16 for full_like/stack/where xpu kernels
-