- 24 4月, 2023 1 次提交
-
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test ,test=develop
-
- 23 4月, 2023 4 次提交
-
-
由 zhangyuqin1998 提交于
* delete overwrite from gather_grad * fix * Update gather_grad_kernel.cc
-
由 houj04 提交于
* [XPU] fc use int_with_ll_t * fix test_unbind_op_xpu
-
由 zhangyuqin1998 提交于
* remove axis from elementwise_grad * Update elementwise_sig.cc
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
- 22 4月, 2023 1 次提交
-
-
由 wangfengsheng1999 提交于
[Zero-Dim] support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy, test=allcase (#52850) * [Zero-Dim] support output 0D for is_empty/as_complex/, test=allcase * [Zero-Dim] support output 0D for is_empty/as_complex/, test=allcase * add test case * modify dot/metric.accuracy/static.accuracy/static.auc * modfiy inner/tensordot bug * test 9 api * [Zero-Dim] support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy, test=allcase * fix bug * support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy * code style * fix bug * fix test_dot_op bug * fix accuracy bug * fix bug * fix bug * fix bug * fix bug * codestyle * fix dot bug * fix dot bug * fix dot bug * code style * fix dot bug * fix dot bug * fix dot bug * fix dot bug * fix dot bug * fix dot bug * modify code
-
- 21 4月, 2023 7 次提交
-
-
由 JYChen 提交于
* support 0-D output and 0-D as indice in __getitem__ * fix tests * fix inference and UT * add unittest for setitem * fix xpu test * fix xpu 0-d
-
由 Shijie 提交于
* add deterministic embedding grad kernel * minor change * minor change * Add new FLAG to enable deterministic embedding * Update embedding deterministic kernel
-
由 co63oc 提交于
-
由 co63oc 提交于
-
由 umiswing 提交于
-
由 Yuang Liu 提交于
-
- 20 4月, 2023 5 次提交
-
-
由 zhangyuqin1998 提交于
* setup * Update elementwise_kernel.cc * Update elementwise_kernel.cc * fix * fix * Update elementwise_kernel.cu * fix * Update elementwise_kernel.cc * Update elementwise_kernel.cc * Update elementwise_kernel.cc * Update elementwise_kernel.cc * Update elementwise_kernel.cc * Update elementwise_kernel.cc
-
由 Chitsing KUI 提交于
* add flash randomness control * fix VLOG undefied
-
由 zhouweiwei2014 提交于
-
由 Wang Xin 提交于
* remove ASCEND* keyword * update docstring * bug fixed * bug fixed
-
由 co63oc 提交于
-
- 19 4月, 2023 5 次提交
-
-
由 Sonder 提交于
* trans fused attention to phi * add optional parm * trans fused_attention_grad to phi * add fused attention grad register info * fix include * test=kunlun * add fused attention to static build list * add remove * update remove
-
由 zhangyikun02 提交于
-
由 houj04 提交于
-
由 limingshu 提交于
* first commit * restruct c++ interface to divide linear from matmulwithcublaslt * finish building in cublaslt impl * fix code bugs * fix host cost * add some changes
-
由 zhangyuqin1998 提交于
* fix graph_reindex * fix * Update op_compat.yaml
-
- 18 4月, 2023 9 次提交
-
-
由 chenxujun 提交于
* Add prelu, clip_by_norm, multi_dot tests * Fix code * Fix code
-
由 Zhang Zheng 提交于
* [AMP OP&Test] Unique support float16&bfloat16 * add test
-
由 zhangyuqin1998 提交于
* reorder MatrixRank * fix * fix * fix * fix * fix
-
由 chenxujun 提交于
-
由 chenxujun 提交于
* Add split, split_with_num tests * Add randperm tests * Fix code
-
由 chenxujun 提交于
-
由 GGBond8488 提交于
* add 0D output support for inalg.slogdet,test=allcase * fix zerom dime test error test=allcase * fix test error test=allcase * add static backward test, test=allcase
-
由 JYChen 提交于
* fix the set_value error in cpu * add a unitest for set_value OP * fix platform::is_gpu_place * add todo note for set_value
-
由 zhangyuqin1998 提交于
* reorder_prior_box * fix
-
- 17 4月, 2023 8 次提交
-
-
由 zhoutianzi666 提交于
* initial commit for cutlass_teller * second commit for cutlass_teller * add conv2d_depthwise python template * add conv2d_depthwise cutlass template * /zhoukangkang/paddle_cutlass/Paddle/paddle/fluid/framework/ir/cutlass_teller.h * refine code in Conv2dFusionCanSupport * add macro in cutlass_teller.h * add 3x3 5x5 teller * add groups not 1 or conv2d_depthwise teller * 只生成ic是8的倍数的conv2d_depthwise 的kernel * add EXPLICIT in cutlass_teller.h * final commit * add split_k_slices in conv2d_depthwise * make stages == 2 * 重构部分代码 * add CutlassFusionType * solve illegal memory * make stride_h=stride_w && make dilation==1 * must check HasAttr(use_cutlass) before GetAttrIfExists * add CONV2D_DEPTHWISE_BIAS_SILU to OpType2String * modify decl.h and util.cu
-
由 Chitsing KUI 提交于
* add random control for fused dropout add * add __init__
-
由 Vvsmile 提交于
* fix multinomial * fix test_elementwise * fix convert_float_to_uint16 * aadd test_multimial_op * fix code style
-
由 thunder95 提交于
* untracked files * bce_loss_fp16 * remove unused files * back max_rel_erro still big * simplify code * upd * fix max_relative_error * restart ci * Update test_bce_loss.py * Update test_bce_loss.py * Update test_bce_loss.py * Update test_bce_loss.py * try to pass test * restore file * remove error value * fix bug --------- Co-authored-by: NZhang Ting <Douyaer2020@qq.com>
-
由 Jiabin Yang 提交于
* fix multiply double grad error * fix multiply dy only kenrel
-
由 Hanchiao 提交于
* Implement optimized kernel for OP-expand_as. * Support fp16. Co-authored-by: Timber-Ye <ye_hanqiao@163.com> Co-authored-by: NBrianQian1999 <brianqianhitsz@gmail.com> * remove fp16 support * remove MAX_RANK_SUPPORTED --------- Co-authored-by: NBrianQian1999 <brianqianhitsz@gmail.com>
-
由 zhangyuqin1998 提交于
-
由 Sonder 提交于
* add register info for eigh and eig_gard * add sync_batch_norm_op.cu register info * add lamb output register info * add unique register info * change type name * change type name * add output register info for check_finite_and_unscale * update cmake and config file * add register info for adagrad * fix build error * add sync to run_unittests.sh * add register info for unique_consecutive * fix build error * add eigh to STATIC_BUILD_TESTS * update eig_kernel.cc * update eig_kernel.cc * fix infer mate error * fix unique register error * fix lamb register info error * fix lamb register info * update lamb register info * fix lamb * remove one Output Register * update static build file * add eigh op to disable_wingpu_test * update run_unittests
-