- 14 4月, 2023 6 次提交
-
-
由 Sonder 提交于
* add kernel functions * update kernel functions * update func parameters' name * create codes for gpu device * 调整文件位置 * fix include error * remove dependent files to phi/ * restore fused_attention_op.cu * fix dependence errors * fix dependence errors * fix include error * fix all depandence errors[build success] * remove useless include * recover useless include * use phi::ToNCCLDataType * fix namespace * update new register code * fix error in fused_gemm_epilogue_utils * fix error in FusedAttentionKernel parm * finish fused_attention registe code[build success] * add paddle::optional * add sig file * fix build error * fix a include error * 恢复正向代码 * update CMkaeList * trans Compute function to phi [build success] * add register code and fix include error [build success] * fix parameter sequence * add include file * update #if before include * update #if before include * fix grammly error * update codes for DropoutParam * remove const cast * trans some fluid api to phi api * remove const cast * trans some fluid api to phi api * add #if * update test code * update test codes * recover test codes * fix namespace and remove fluid include * recover random seed * remove fluid quant_helper * fix include error * include utils in funcs * change include file * move grad codes back to fluid floder * move grad codes back to fluid floder * fix sig file error * update include * recover codes to develop * update register codes * fix build error * recover fluid include * remove some fluid include * remove some fluid include * Update fused_attention_op.cu * remove fluid include * add some fluid include * Update fused_attention_op.cu * Update fused_attention_op.cu * Update fused_attention_op.cu * Update fused_attention_op.cu * remote useless include
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop
-
由 lzydev 提交于
-
由 sneaxiy 提交于
-
由 huangjiyi 提交于
-
由 zhangyuqin1998 提交于
-
- 13 4月, 2023 22 次提交
-
-
由 Yuanle Liu 提交于
-
由 jjyaoao 提交于
* delete WITH_ASCEND_CL * delete NPU/ and WITH_MLU
-
由 superwinner1 提交于
* No55 add channel_shuffle FP16/BF16 support and tests
-
由 Difer 提交于
* add_fp_bf_for_dot & bf_for_cross * fix error * fix some error * fix some error * change something * fix magic number
-
由 Zhang Zheng 提交于
* [AMP OP&Test] Support fp16&bf16 in reduce_max
-
由 Leo Guo 提交于
-
由 chenxujun 提交于
-
由 zyfncg 提交于
-
由 RedContritio 提交于
-
由 Sanbu 提交于
-
由 chenxujun 提交于
-
由 huangjiyi 提交于
* add get_kernels.py * update * update * update * update * update * update test=document_fix --------- Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
-
由 zhangyuqin1998 提交于
* rename PD_REGISTER_GENERAL_KERNEL * Update feed_op.cc * fix * Update strings_empty_kernel.cc
-
由 Wang Xin 提交于
* add autogen code support for expand_as op * bug fixed
-
由 Sanbu 提交于
* Support static graph code-gen for yolo_box * Support static graph code-gen for yolo_box * Support static graph code-gen for yolo_box * Update op_compat.yaml * fix * fix
-
由 HongyuJia 提交于
* [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h * Add logging.h for profiler.cc * Add logging.h for gloo_utils.h * Add logging.h for addmm_kernel_impl.h * Add logging.h for addmm_grad_kernel_impl.h * Add logging.h for p_send_kernel.cu * Add logging.h for determinant_grad_kernel_impl.h * Add logging.h for p_recv_kernel.cu * Add logging.h for elementwise_grad_base.h * Add logging.h for transfer_layout_kernel.cc * Add logging.h for eigvals_kernel.cc and index_select_impl.h * Add logging.h for all files in kernel directory * Add logging.h for xpu_info.cc * Add logging.h for xpu
-
由 zhupengyang 提交于
-
由 umiswing 提交于
-
由 zhangyuqin1998 提交于
-
由 lzydev 提交于
* fix bug in cross_entropy in static mode * fix ci-coverage
-
由 csy0225 提交于
-
由 RedContritio 提交于
-
- 12 4月, 2023 7 次提交
-
-
由 Wang Xin 提交于
* add autogen code for clip_by_norm op * bug fixed
-
由 Zhang Zheng 提交于
* Optimize performance of unique kernel * fix ci
-
由 YepKong 提交于
* add autogen code support for squared_l2_norm_op * Update ops.yaml
-
由 Wei Shengyu 提交于
* add bf16 support and bf16/fp16 unittest for pool2d * add include files * dbg * reformat * reformat * modify code according to review comment * remove duplicate code * remove dup code * remove useless include * dbg
-
由 wangzhen38 提交于
* [DO NOT MERGE] adadelta lr support * [DO NOT MERGE] gpu support * [test] follow torch * fix acc update order * for ci * [bug fix] update master para * [bug fix] update test * [bug fix] for ci test * for ci * fix xpu * [adadelta fix] del fluid head file * for ci * del notes
-
由 jiangcheng 提交于
-
由 Guoxia Wang 提交于
* [AMP OP&Test] support bf16 for batchnorm * codestyle * Update batch_norm_grad_kernel.cu * Update batch_norm_kernel.cu * fix codestyle * fix * fix * fix * fix * fix * Update batch_norm_kernel.cc
-
- 11 4月, 2023 5 次提交
-
-
由 lzydev 提交于
-
由 Yuanle Liu 提交于
-
由 wz1qqx 提交于
-
由 RedContritio 提交于
-
由 RedContritio 提交于
* support auto generate static for randperm * remove enforce in randperm infermeta
-