- 03 Apr, 2023 6 commits
-
-
Committed by chenxujun
-
Committed by zhangyuqin1998
* add kernel register macro for all backend * fix msvc bug * fix --------- Co-authored-by: zhangyuqin1998 <2368719379@qq.com>
-
Committed by thunder95
-
Committed by LoneRanger
【PaddlePaddle Hackathon 4】No.56 : add fp16 test and bf16 test for diag, diagonal, fill and fill_diagonal_tensor (#51649)
-
Committed by zhangyuqin1998
-
Committed by wz1qqx
-
- 31 Mar, 2023 5 commits
-
-
Committed by zhangyuqin1998
-
Committed by csy0225
-
Committed by YuanRisheng
* remove distribute * fix py3 bugs * fix gpu-ps bugs * fix compile bugs * fix unittest bugs
-
Committed by ronnywang
-
Committed by 张春乔
* autofix Co-authored-by: Liyulingyue <83450930+Liyulingyue@users.noreply.github.com> * revert changes in python/paddle/distributed/fleet/utils/hybrid_parallel_util.py * empty commit, trigger ci * fix test_slice --------- Co-authored-by: SigureMo <sigure.qaq@gmail.com>
-
- 30 Mar, 2023 9 commits
-
-
Committed by zhangyuqin1998
* move elementwise raw * fix * fix
-
Committed by zhouweiwei2014
-
Committed by zhangkaihuo
-
Committed by Roc
-
Committed by ykkk2333
-
Committed by yunyaoXYY
* add FP16 for multinomial * fix input data * update code * fix FP16 * fix code
-
Committed by Wang Xinyu
* stride slice fp16 and bf16 unitest * fix code style * add self.dtype
-
Committed by Danyang Zhang
-
Committed by cyberslack_lee
[CodeStyle][C416][C417] rewrite unnecessary comprehension with function call and use generator instead of map (#52140) * codestyle c416 c417 * fix error * fix inc * unify all C4 rules into one * fix inc --------- Co-authored-by: SigureMo <sigure.qaq@gmail.com>
-
- 29 Mar, 2023 9 commits
-
-
Committed by zengshao0622
* pad3d add unittests of fp16 and bf16 * pad3d add unittests of fp16 and bf16 * fix cuda place * fix random to uniform * fix class name * fix fp16 max relative error to 1.5e-3 * add dytpe register for onednn * add pad uint16 check of common.py * remove check_eager * test_check_grad --> test_check_grad_normal
-
Committed by hjyp
* regist output type for GraphSampleNeighbors and GroupNorm * Update return type * fix return type * update * fix detail
-
Committed by ShenLiang
* fix bg * add utest * add utest
-
Committed by zhupengyang
-
Committed by yuehuayingxueluo
* add fuse adamw pass * fix some bugs * fix CIbug * change chunk_size * fix CI bug * rm test_fused_adam_op.py * fix CI bugs * fix fuse_adamw_op_pass.cc * change code style * fix CI bug * fix ut bug and use_adamw_op_pass.cc * fix test_fuse_adamw_pass.py * fix CI bug * remove fluid * fix ci bug * fix CI bug
-
Committed by zhangyikun02
-
Committed by 傅剑寒
-
Committed by YuhangLi
-
Committed by sneaxiy
* fix generate_kernels.py in CUDA 12.0 * fix attrs bug
-
- 28 Mar, 2023 5 commits
-
-
Committed by sneaxiy
* add overflow check in memory efficient attention * fix ci compile error * fix ci compile error
-
Committed by houj04
* fix int8 support for full kernel * fix ut.
-
Committed by Haohongxiang
-
Committed by wangxinxin08
* add unittest for conv2d/depthwise_conv2d/conv2d_transpose * add bf16 for DWConv and ConvTranspose * fix unitest of conv2d_transpose * modify DWConv2d op and unittest * fix unittest of conv2d_transpose_bf16 * modify unittest name according to review * modify atol of DWConv2D unittest
-
Committed by csy0225
-
- 27 Mar, 2023 6 commits
-
-
Committed by ZhangDY-6483
-
Committed by Xinyu Chen
-
Committed by HappyHeavyRain
* support assign op * support assign infer_var_type * change code according to review * change code according to review * only save 'get_infer_var_type_func' * rest file mode
-
Committed by Leo Chen
* unbind support bool dtype * replace np.array_equal
-
Committed by Leo Guo
instance_norm_grad kernel. Fix bugs that the data type of input is different from output in reduce_sum kernel. test=kunlun
-
Committed by risemeup1
* fix_gcc12_error * fix gcc12 error * fix gcc12 error
-