- 15 3月, 2023 14 次提交
-
-
由 limingshu 提交于
-
由 thunder95 提交于
* untracked files * prelu_perf * remove unused files * upd * fix bug
-
由 umiswing 提交于
-
由 ronnywang 提交于
* [XPU] add int32,fp32 support for conv2d_transpose* * update
-
由 iSerendipity 提交于
* Revert "Revert "【Hackathon No.67】remove operator.h in blas.h (#50989)" (#51467)" This reverts commit b9d91531. * remove cout * add header * fix missing header * fix refer fluid error * fix missing header * 更新 repeat_interleave_grad_kernel_impl.h Change to phi style datatype. * 更新 repeat_interleave_grad_kernel_impl.h Fix missing header * datatype fluid -> phi * paddle::experimental -> phi * fix reference error * fix reference error * fix reference error * fix errors * fix missing FLAGS * fix missing headers * fix missing headers * fix missing headers * fix missing headers * fix missing header * fix missing header * fix errors
-
由 zhangyuqin1998 提交于
* Delete randperm raw op * fix
-
由 RedContritio 提交于
-
由 HappyHeavyRain 提交于
* test_get_kernel * add invoke signature * change reduce_max * change frobenius_norm * reset reduce_max according to composite and change reduce_all * fix the bug when Scalar(*) * fix 'scalar when support_tensor' * change code according to review * change 'keep_signature' to 'manual_signature' and add some erro info
-
由 pangyoki 提交于
-
由 wangxiaoning 提交于
-
由 Yuang Liu 提交于
-
由 Guoxia Wang 提交于
-
由 zhangyuqin1998 提交于
* Delete hardswish_raw op * fix ut
-
由 xiaoguoguo626807 提交于
* modify_yaml * delete default param * add output for matmul_double_grad
-
- 14 3月, 2023 12 次提交
-
-
由 engineer1109 提交于
-
由 ccrrong 提交于
* add split_with_num composite rule * add split_with_num composite rule * add split composite rule * update * update test * update test * delete split_with_num_grad
-
由 limingshu 提交于
* first commit * fix code bugs in for_loop * fix bugs in cuLoadAddStridedInputs. * optimization for LayerNormBackwardComputeGradInput * add unitest for validating the optimization * fix windows ci error
-
由 pangyoki 提交于
* cuda graph support multi-stream for new executor * fix windows compile error * delete create_cuda_graph_stream
-
由 wangxiaoning 提交于
-
由 wangxiaoning 提交于
-
由 Infinity_lee 提交于
-
由 zhiboniu 提交于
* add fp16 and bf16 test * update
-
由 HongyuJia 提交于
-
由 Huang Jiyi 提交于
* remove device_context include * fix bug * fix bug
-
由 denglianbin 提交于
* finish task * add static_check and fix unittest. * add int32/64 * Update test_cross_op.py --------- Co-authored-by: NZhang Ting <Douyaer2020@qq.com>
-
由 Infinity_lee 提交于
-
- 13 3月, 2023 14 次提交
-
-
由 TaoTao Li 提交于
* add all_gather and fix conflicts * fix code format * fix ut * fix broadcast ut
-
由 heyanru 提交于
* refresh * compat * register * testop * fix * fix * fox * cast * cast * fix * type * fix * out * cast * fix * fix * fix * broad * broad * broad * fix * fix * fix * fix * fix * broad * broad * numel * fix * fix * fix * fix * cinn * fix * fix * fix * fix
-
由 YuanRisheng 提交于
* remove transpose infershape * fix ci bugs * fix ci bugs * delete transpose infershape * fix ci bugs * fix ci bugs
-
由 iSerendipity 提交于
* Replace paddle::experimental::DataType as phi::DataType * restore custom_device.cc
-
由 Zhenghai Zhang 提交于
* Add output defs for mode kernel * fix bug
-
由 wangxiaoning 提交于
* add fp16/bf16 * add grad bf16 * test name
-
由 Huang Jiyi 提交于
* platform::CUDAPinnedDeviceContext -> phi::GPUPinnedContext * replace platform::TraceEventCollector
-
由 iSerendipity 提交于
* remove fused_matmul from list * add infermeta for fused matmul
-
由 Huang Jiyi 提交于
* add from_blob * fix test * fix test * fix codestyle * add gpu test * fix test * update * add comment * fix comment * update comment * fix CI bug * add thread_local * update * fix bug * fix bug * fix bug * fix bug * fix bug * fix bug * fix cmake * fix CI-Py3 make * update * use api_reg * fix include * update * update * update * fix bug * fix bug * fix bug * fix bug
-
由 Sławomir Siwek 提交于
* mkldnn->onednn * fused softplus op + kernel * remove extra attributes * add missing handler * change var name
-
由 Sanbu 提交于
* Add output defs for conv3d_coo distribute_fpn_proposals kernel * fix
-
由 junxiu777 提交于
* add register of KthvalueKernel add register of KthvalueKernel * Update kthvalue_kernel.cc * Update kthvalue_kernel.cu
-
由 zhoutianzi666 提交于
* use python to generate cutlass code * refine CommonConvKernelPart1, CommonConvKernelPart2 * remove useless code in generate_cutlass_code.sh * add more config in conv2d_residual * CommonCutlassConvKernelPart1 and CommonCutlassConvKernelPart2 * add group conv support in util.cu * remove .sh * refine name * make name goodgit status! * add fuse_alpha * make code easy to understand * mot fopen generate in py * use python script to generate conv2d,group=1 cutlass code * use const & * use const & && use python script to generate conv2d/group=1 code
-
由 Little-chick 提交于
* Update interpreter_util.cc * Update auc_kernel.cc * Update auc_kernel.cu * Update auc_kernel.cc * Update auc_kernel.cu
-