- 17 4月, 2023 32 次提交
-
-
由 Yulong Ao 提交于
-
由 tianshuo78520a 提交于
* mv ps distributed dir * fix * add del auto_parallel * add auto_parallel * fix ps * fix bug * fix test bug * fix test bug * merge develop fix error * merge develop fix error * merge develop fix error
-
由 zhoutianzi666 提交于
* initial commit for cutlass_teller * second commit for cutlass_teller * add conv2d_depthwise python template * add conv2d_depthwise cutlass template * /zhoukangkang/paddle_cutlass/Paddle/paddle/fluid/framework/ir/cutlass_teller.h * refine code in Conv2dFusionCanSupport * add macro in cutlass_teller.h * add 3x3 5x5 teller * add groups not 1 or conv2d_depthwise teller * 只生成ic是8的倍数的conv2d_depthwise 的kernel * add EXPLICIT in cutlass_teller.h * final commit * add split_k_slices in conv2d_depthwise * make stages == 2 * 重构部分代码 * add CutlassFusionType * solve illegal memory * make stride_h=stride_w && make dilation==1 * must check HasAttr(use_cutlass) before GetAttrIfExists * add CONV2D_DEPTHWISE_BIAS_SILU to OpType2String * modify decl.h and util.cu
-
由 LiYuRio 提交于
* cherry-pick fleet executor from 2.4 * fix test case
-
由 Sanbu 提交于
-
由 sneaxiy 提交于
* fix ipc counter * fix missing std::to_string
-
由 YuanRisheng 提交于
* unify kernel * fix ci bugs * fix py3 bugs * fix py3 bugs * perfect code
-
由 liulinduo 提交于
* [Test Mv] remove rnn * Update test_rnn_cell_api.py
-
由 lzydev 提交于
* fix bug in parse args * fix bug * recover legacy_*.yaml * change 'Out' to Output
-
由 LoneRanger 提交于
-
由 Galaxy1458 提交于
-
由 wangzhen38 提交于
* [CINN] fix concat&pow * update concat * composite_backward_api * for ci * for ci * update test & fix opmaker
-
由 tianshuo78520a 提交于
-
由 Chitsing KUI 提交于
* add random control for fused dropout add * add __init__
-
由 Vvsmile 提交于
* fix multinomial * fix test_elementwise * fix convert_float_to_uint16 * aadd test_multimial_op * fix code style
-
由 thunder95 提交于
* untracked files * bce_loss_fp16 * remove unused files * back max_rel_erro still big * simplify code * upd * fix max_relative_error * restart ci * Update test_bce_loss.py * Update test_bce_loss.py * Update test_bce_loss.py * Update test_bce_loss.py * try to pass test * restore file * remove error value * fix bug --------- Co-authored-by: NZhang Ting <Douyaer2020@qq.com>
-
由 JingZhuangzhuang 提交于
-
由 WangZhen 提交于
-
由 Jiabin Yang 提交于
* fix multiply double grad error * fix multiply dy only kenrel
-
由 Hanchiao 提交于
* Implement optimized kernel for OP-expand_as. * Support fp16. Co-authored-by: Timber-Ye <ye_hanqiao@163.com> Co-authored-by: NBrianQian1999 <brianqianhitsz@gmail.com> * remove fp16 support * remove MAX_RANK_SUPPORTED --------- Co-authored-by: NBrianQian1999 <brianqianhitsz@gmail.com>
-
由 Kim Yann 提交于
-
由 zhangyuqin1998 提交于
-
由 张春乔 提交于
-
由 张春乔 提交于
* remove hccl in .py files * remove ascend in setup.py.in * remove ascend in setup.py
-
由 Sonder 提交于
* add register info for eigh and eig_gard * add sync_batch_norm_op.cu register info * add lamb output register info * add unique register info * change type name * change type name * add output register info for check_finite_and_unscale * update cmake and config file * add register info for adagrad * fix build error * add sync to run_unittests.sh * add register info for unique_consecutive * fix build error * add eigh to STATIC_BUILD_TESTS * update eig_kernel.cc * update eig_kernel.cc * fix infer mate error * fix unique register error * fix lamb register info error * fix lamb register info * update lamb register info * fix lamb * remove one Output Register * update static build file * add eigh op to disable_wingpu_test * update run_unittests
-
由 chenxujun 提交于
-
由 ronnywang 提交于
* [CustomDevice] fix custom cpu unittests failure when multiple python exist in CI * Update test_custom_op_setup.py
-
由 Zhang Zheng 提交于
* [AMP OP&Test] Sync_batch_norm support bfloat16 * fix * fix
-
由 Haohongxiang 提交于
-
由 caozhou 提交于
* add o2 tune * add unittest * fix error * set unittest timeout
-
由 sneaxiy 提交于
- 15 4月, 2023 1 次提交
-
-
由 HongyuJia 提交于
-
- 14 4月, 2023 7 次提交
-
-
由 Zhang Zheng 提交于
-
由 jjyaoao 提交于
* delete SupportNPU(), SupportMLU() * delete npu branch
-
由 cyberslack_lee 提交于
-
由 cyberslack_lee 提交于
-
由 chenxujun 提交于
* Add digamma, dirichlet tests * Fix code
-
由 superwinner1 提交于
* add erf FP16 test
-
由 duanyanhui 提交于
-