- 19 10月, 2022 3 次提交
-
-
由 Leo Chen 提交于
* clean unused code: piece.cc/h * clean usage
-
由 wanghuancoder 提交于
-
由 Li-fAngyU 提交于
-
- 18 10月, 2022 10 次提交
-
-
由 weishengying 提交于
-
由 zhoutianzi666 提交于
* Rewrite strided_slice converter using shape tensor * clean code
-
由 Wang Bojun 提交于
* first version, accuracy corrected * disable debug print * use blockReduceSum in phi * add UT * add opCompat * code style * code refine * bug fix * code refine * test fix * bugfix * codesytle fix * code style * code-style * code-style * code-style
-
由 Sławomir Siwek 提交于
* git * style * leave default relu in kernel * style * cleanup FCMKLDNN pattern * merge conflicts * update develop * update develop * add const * rename to oneDNN and adjust attributes * whitespace
-
由 Hui Zhang 提交于
* cond infer apply exec seprate * fix bugs
-
由 Wilber 提交于
-
由 xiaoxiaohehe001 提交于
-
由 Weilong Wu 提交于
-
由 zyfncg 提交于
* support generating code of opmaker for backward op invoke forward op * gsupport code-gen of opmaker for sparse op * refind logic of choose phi kernrel * fix complie budg * fix code_gen bug * fix bug * fix kernel signature code-gen * fix complie bug of VarType * fix complie bug of VarType * fix test_sparse_conv_op * fix test_sparse_norm_op
-
由 HongyuJia 提交于
-
- 17 10月, 2022 11 次提交
-
-
由 Ghost Screaming 提交于
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * Support allow_partial switch, which can be configure in pipeline_configs. If sent tensor are not the same from different hosts, they shouldn't been sent partially and then concated as a whole tensor. * Change name allow_partial to enable_partial_send_recv. * Add global variable _enable_partial_send_recv
-
由 Ghost Screaming 提交于
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * support pure bfloat16 * support bf16 linear * update PR to pass CI * tiny fix where_grad_kernel.cu * Support bfloat16 type for reducer and sharding. * Fix some bug. * Polish code. * Polise code. * Add bfloat16 datatype in fill_grad kernels. Co-authored-by: Nsneaxiy <sneaxiy@126.com>
-
由 YuanRisheng 提交于
* namespace modify * update by comment
-
由 Weilong Wu 提交于
-
由 Wang Bojun 提交于
* first version of ln_s_p with s>0 * refine and UT * pass opt draft * pass opt * code refine * code-style * bug fix * fix ci test * code style
-
由 jakpiase 提交于
-
由 pangyoki 提交于
* skip ReplaceAllReduceOp in GraphtoBlock when nccl_ctxs_ is nullptr * update ut * test_dist_allreduce_op failed * fix test_dist_allreduce_op * add ut * fix nccl cpu compile * fix
-
由 Jiabin Yang 提交于
-
由 HongyuJia 提交于
-
由 Weilong Wu 提交于
-
- 16 10月, 2022 1 次提交
-
-
由 ZeKai Zhou 提交于
-
- 15 10月, 2022 1 次提交
-
-
由 HongyuJia 提交于
-
- 14 10月, 2022 6 次提交
-
-
由 Chen Weihang 提交于
* simplify conv_mkldnn op registration * remove custom type value in conv grad op
-
由 Wang Bojun 提交于
* draft with debug print
-
由 Wilber 提交于
-
由 Zhang Jun 提交于
-
由 WangZhen 提交于
* Add more record event in run program op * Refine code * Restore code * Rename event
-
由 Shijie 提交于
-
- 13 10月, 2022 8 次提交
-
-
由 yeliang2258 提交于
* fix immutable op quantize bugs * fix * fix build bug * fix test * notest,test=inference * fix ppyoloe acc drop bugs * fix test * fix test * add test * fix * fix * fix test * fix refined name bug * fix test * bias fix * fix matmul weight dequant bug * re-ci * fix tester * fix test * fix tester * update weight dequantize func * update code * update test for converage * update test * update cmake * update cmakelist * update code * rerun ci * remove useless code
-
由 xiaoxiaohehe001 提交于
-
由 Leo Chen 提交于
-
由 Wangzheee 提交于
-
由 weishengying 提交于
Add symbolic shape deduction function for unfold, scatter_nd_add, p_norm, grid_sampler, pad3d, etc (#46291)
-
由 zhouweiwei2014 提交于
-
由 Wangzheee 提交于
* test=infer-coverage
-
由 YuanRisheng 提交于
-