- 09 2月, 2023 7 次提交
-
-
由 Huang Jiyi 提交于
* decouple strided_memcpy * move strided_memcpy * move strided_memcpy to phi * fix namespace * update * fix gpu compile bugs
-
由 yuehuayingxueluo 提交于
* add multi_tenosr_adam * update multi_tensor_base.py, test_multi_tensor_adam.py, adamw.py * fix adam.py optimizer.py * fix adamw.py * fix test_multi_tensor_adam.py * fix CI bug * fix CI coverage * fix ci bug * fix betapow * fix some bugs * fix test_adamw_op.py * fix CI coverage * fix multi_tensor_adam_kernel.cc * fix CI bug * fix multi_tensor_adam_op.cc and test_multi_tensor_adam.py * fix code style * update C++ parts * remove python parts modification temporarily * add C++ ut * update betapow copy code logic * fix ci ut * fix windows ci * fix coverage ci * improve coverage rate --------- Co-authored-by: Nsneaxiy <sneaxiy@126.com>
-
由 LiYuRio 提交于
-
由 zhoutianzi666 提交于
* add fmha_flashattention oss plugin * add fmhca * add oss fmhca * code reconstruct and add ut * code style refine * fix ut and enforce check * refine trt version check refine compile fix compile * fix cross ut * code refine * use runtime trt version check * bug fix and code refine * compile fix * merge develop * add GN QDQ kernel * support GN int8 fake kernel * add with_int8 * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 UT * add verison > 8000 in GN int8 UT * add some check in .cu * add stdlib.h in UT * little change in .cu * remove rand_r use rand * remove use rand * setAxis(1) * when int8 is on allow fall back to fp16 --------- Co-authored-by: Nwwbitejotunn <wang_bojun@outlook.com>
-
由 kangguangli 提交于
* fix judgement about scope validation * fix ci bug: same address is not enough for data consistency * remove useless check
-
由 pangengzheng 提交于
-
由 Wang Bojun 提交于
* trans_layernorm
-
- 08 2月, 2023 13 次提交
-
-
由 Paulina Gacek 提交于
* QuantTranpose pattern is being found by pass * quant + transpose fuse * code style changes * UT written, reorder fixed * Dequantize + transpose2 fuse added * pass name changed * UT added & shift corrected * got rid of redundancy * review changes * AsIntermediate corrected * compat added
-
由 Sławomir Siwek 提交于
* add support for bf16 fused_ops * fused_matmul only
-
由 wangxiaoning 提交于
* fix codestyle * fix std
-
由 Zhang Jun 提交于
* update * update * format code * update * Update test_trt_convert_nearest_interp_v2.py
-
由 Yuang Liu 提交于
-
由 zmxdream 提交于
* hidden unzip * fix * fix
-
由 weishengying 提交于
-
由 LiYuRio 提交于
-
由 risemeup1 提交于
-
由 gaoziyuan 提交于
* remove_engine_info * remove_engine_info * remove_engine_info * change trtlayerinformation line to json --------- Co-authored-by: Ngaoziyuan <gaoziyuan@baidu.com>
-
由 pangengzheng 提交于
* fix feature_value.h and feature_value.cu to support pslib * code style * align DistPsArch pre-stable branch --------- Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com> Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> Co-authored-by: Nheliqi <1101791222@qq.com> Co-authored-by: Nzqw_1997 <118182234+zhengqiwen1997@users.noreply.github.com> Co-authored-by: Njameszhang <zhangxiaoci@baidu.com> Co-authored-by: Nxiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com> Co-authored-by: NFeiyu Chan <chenfeiyu@baidu.com> Co-authored-by: NGGBond8488 <33050871+GGBond8488@users.noreply.github.com> Co-authored-by: Nsprouteer <89541335+sprouteer@users.noreply.github.com> Co-authored-by: Njakpiase <jakpia21@gmail.com> Co-authored-by: NJiabin Yang <360788950@qq.com> Co-authored-by: Nlimingshu <61349199+JamesLim-sy@users.noreply.github.com> Co-authored-by: Nzhangbopd <1299246947@qq.com> Co-authored-by: N张春乔 <83450930+Liyulingyue@users.noreply.github.com> Co-authored-by: NLiYuRio <63526175+LiYuRio@users.noreply.github.com> Co-authored-by: N姜永久 <34344716+yjjiang11@users.noreply.github.com> Co-authored-by: NYuang Liu <liuyuang@baidu.com> Co-authored-by: Njiangcheng <thisjiang@qq.com> Co-authored-by: Nronnywang <ronny1996@163.com> Co-authored-by: Nsneaxiy <32832641+sneaxiy@users.noreply.github.com> Co-authored-by: Nhouj04 <35131887+houj04@users.noreply.github.com> Co-authored-by: Nzhangbo9674 <82555433+zhangbo9674@users.noreply.github.com> Co-authored-by: Ngem5 <117625383+linsheng011@users.noreply.github.com> Co-authored-by: Nwanghuancoder <wanghuan29@baidu.com> Co-authored-by: NRyan <44900829+DrRyanHuang@users.noreply.github.com> Co-authored-by: NRuibiao Chen <chenruibiao@baidu.com> Co-authored-by: Nengineer1109 <jialiang.wang@xdxct.com> Co-authored-by: NRedContritio <RedContritio@qq.com> Co-authored-by: Nmjxs <52824616+kk-2000@users.noreply.github.com> Co-authored-by: NYiqun Liu <Xreki@users.noreply.github.com> Co-authored-by: N张正海 <65210872+ccsuzzh@users.noreply.github.com> Co-authored-by: NHongyuJia <jiahongyu@baidu.com> Co-authored-by: Npangyoki <pangyoki@126.com> Co-authored-by: NLoneRanger <836253168@qq.com> Co-authored-by: NTeFeng Chen <ctfeng66@163.com> Co-authored-by: NLeo Guo <58431564+ZibinGuo@users.noreply.github.com> Co-authored-by: Nxiaoting <31891223+tink2123@users.noreply.github.com> Co-authored-by: N201716010711 <87008376+201716010711@users.noreply.github.com> Co-authored-by: Nwangxiaoning <71813629+wangxn12138@users.noreply.github.com> Co-authored-by: NYuanle Liu <yuanlehome@163.com> Co-authored-by: ZZK <359521840@qq.com> Co-authored-by: Nzhangkaihuo <zhangkaihuo@baidu.com> Co-authored-by: NRoc <30228238+sljlp@users.noreply.github.com> Co-authored-by: NPuQing <me@puqing.work> Co-authored-by: NZhang Jun <ewalker@live.cn> Co-authored-by: NCharles-hit <56987902+Charles-hit@users.noreply.github.com> Co-authored-by: Nniuliling123 <51102941+niuliling123@users.noreply.github.com> Co-authored-by: Nwenbin <wang3323032@qq.com> Co-authored-by: Nwangshengxiang <121413869+shengxiangwang@users.noreply.github.com> Co-authored-by: NBo Zhang <105368690+zhangbopd@users.noreply.github.com> Co-authored-by: NAurelius84 <zhangliujie@baidu.com> Co-authored-by: Nzxcd <228587199@qq.com> Co-authored-by: Nzhoutianzi666 <39978853+zhoutianzi666@users.noreply.github.com> Co-authored-by: Ngouzil <66515297+gouzil@users.noreply.github.com> Co-authored-by: Nzhangyikun02 <48021248+zhangyk0314@users.noreply.github.com> Co-authored-by: NHui Zhang <zhtclz@foxmail.com> Co-authored-by: NWang Bojun <105858416+wwbitejotunn@users.noreply.github.com> Co-authored-by: NGuanghua Yu <742925032@qq.com> Co-authored-by: NYUNSHEN XIE <1084314248@qq.com> Co-authored-by: NZhong Hui <zhonghui.net@gmail.com> Co-authored-by: Nrisemeup1 <62429225+risemeup1@users.noreply.github.com> Co-authored-by: Nliuruyan <44316842+liuruyan@users.noreply.github.com> Co-authored-by: NLeo Chen <chenqiuliang@baidu.com> Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com> Co-authored-by: Nwuhuachaocoding <77733235+wuhuachaocoding@users.noreply.github.com> Co-authored-by: NCcc <52520497+juncaipeng@users.noreply.github.com>
-
由 Huang Jiyi 提交于
-
由 YuanRisheng 提交于
* unify_kernel * fix compile bugs * modify macro name * perfect code according comment * fix compile bugs * fix compile bugs * fix ci bugs * fix ci bug * fix ci bugs * fix ci bugs * modify code according comment * rm conv_fusion_op
-
- 07 2月, 2023 9 次提交
-
-
由 xiaoxiaohehe001 提交于
-
由 zyfncg 提交于
* remove axis in some elementwise api * fix inplace bug eager-gen * fix bug * revert change for CheckInplace * polish code
-
由 Chen Weihang 提交于
* fix bad alloc exp error * remove needless files
-
由 张春乔 提交于
* fix the div 0 error of sequence_concat * Update test_sequence_concat.py
-
由 Yuanle Liu 提交于
-
由 chalsliu 提交于
-
由 LiYuRio 提交于
-
由 Ruibiao Chen 提交于
-
由 TeFeng Chen 提交于
* support 0D Tensor for while_loop op * update * clean unit test * revert test_while_loop_op.py * test again * remove invalid check * fix error * change fluid to paddle.static * fix paddle.full * merge forward and backward test * simplify code * add precision check * fix condition var check * add dygraph test
-
- 06 2月, 2023 11 次提交
-
-
由 zmxdream 提交于
* add dump_walk_path (#193) * add dump_walk_path; test=develop * add dump_walk_path; test=develop * add dump_walk_path; test=develop * Add multiple CPU communication, parameter query and merging functions, support batch alignment between multiple cards (#194) * compatible with edge_type of src2dst and src2etype2dst (#195) * do not merge_feature_shard when using metapath_split_opt (#198) * support only load reverse_edge (#199) * refactor GraphTable (#201) * fix * fix * fix code style * fix code style * fix test_dataset * fix hogwild worker * fix code style * fix code style * fix code style * fix code style * fix code style. * fix code style. --------- Co-authored-by: Ndanleifeng <52735331+danleifeng@users.noreply.github.com> Co-authored-by: Nqingshui <qshuihu@gmail.com> Co-authored-by: NWebbley <liwb5@foxmail.com> Co-authored-by: Nhuwei02 <53012141+huwei02@users.noreply.github.com>
-
由 Yuanle Liu 提交于
* disable conv2d_fusion_layout_transfer_pass temporarily * disable conv2d_fusion_layout_transfer_pass temporarily
-
由 wenbin 提交于
-
由 zyfncg 提交于
* remove extra input of conv2d * fix bug * fix unittest bug * adjust conv2d.pbtxt * fix cpu_quantize_pass_tester * revert use_addto of conv2d * fix runtime attribute * fix bug * recover force_fp32_output in conv2d * refine error info * fix bug
-
由 Yuang Liu 提交于
-
由 xiaoxiaohehe001 提交于
* add_hasattri_check * add_hasattri_check
-
由 RedContritio 提交于
* check tensor numel in PyObject_CheckLongOrToLong * add unittest
-
由 houj04 提交于
-
由 Siming Dai 提交于
* fix to_dlpack for loop * fix reference count
-
由 engineer1109 提交于
-
由 jiangcheng 提交于
-