- 08 2月, 2023 7 次提交
-
-
由 Paulina Gacek 提交于
* QuantTranpose pattern is being found by pass * quant + transpose fuse * code style changes * UT written, reorder fixed * Dequantize + transpose2 fuse added * pass name changed * UT added & shift corrected * got rid of redundancy * review changes * AsIntermediate corrected * compat added
-
由 Sławomir Siwek 提交于
* add support for bf16 fused_ops * fused_matmul only
-
由 wangxiaoning 提交于
* fix codestyle * fix std
-
由 Yuang Liu 提交于
-
由 pangengzheng 提交于
* fix feature_value.h and feature_value.cu to support pslib * code style * align DistPsArch pre-stable branch --------- Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com> Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> Co-authored-by: Nheliqi <1101791222@qq.com> Co-authored-by: Nzqw_1997 <118182234+zhengqiwen1997@users.noreply.github.com> Co-authored-by: Njameszhang <zhangxiaoci@baidu.com> Co-authored-by: Nxiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com> Co-authored-by: NFeiyu Chan <chenfeiyu@baidu.com> Co-authored-by: NGGBond8488 <33050871+GGBond8488@users.noreply.github.com> Co-authored-by: Nsprouteer <89541335+sprouteer@users.noreply.github.com> Co-authored-by: Njakpiase <jakpia21@gmail.com> Co-authored-by: NJiabin Yang <360788950@qq.com> Co-authored-by: Nlimingshu <61349199+JamesLim-sy@users.noreply.github.com> Co-authored-by: Nzhangbopd <1299246947@qq.com> Co-authored-by: N张春乔 <83450930+Liyulingyue@users.noreply.github.com> Co-authored-by: NLiYuRio <63526175+LiYuRio@users.noreply.github.com> Co-authored-by: N姜永久 <34344716+yjjiang11@users.noreply.github.com> Co-authored-by: NYuang Liu <liuyuang@baidu.com> Co-authored-by: Njiangcheng <thisjiang@qq.com> Co-authored-by: Nronnywang <ronny1996@163.com> Co-authored-by: Nsneaxiy <32832641+sneaxiy@users.noreply.github.com> Co-authored-by: Nhouj04 <35131887+houj04@users.noreply.github.com> Co-authored-by: Nzhangbo9674 <82555433+zhangbo9674@users.noreply.github.com> Co-authored-by: Ngem5 <117625383+linsheng011@users.noreply.github.com> Co-authored-by: Nwanghuancoder <wanghuan29@baidu.com> Co-authored-by: NRyan <44900829+DrRyanHuang@users.noreply.github.com> Co-authored-by: NRuibiao Chen <chenruibiao@baidu.com> Co-authored-by: Nengineer1109 <jialiang.wang@xdxct.com> Co-authored-by: NRedContritio <RedContritio@qq.com> Co-authored-by: Nmjxs <52824616+kk-2000@users.noreply.github.com> Co-authored-by: NYiqun Liu <Xreki@users.noreply.github.com> Co-authored-by: N张正海 <65210872+ccsuzzh@users.noreply.github.com> Co-authored-by: NHongyuJia <jiahongyu@baidu.com> Co-authored-by: Npangyoki <pangyoki@126.com> Co-authored-by: NLoneRanger <836253168@qq.com> Co-authored-by: NTeFeng Chen <ctfeng66@163.com> Co-authored-by: NLeo Guo <58431564+ZibinGuo@users.noreply.github.com> Co-authored-by: Nxiaoting <31891223+tink2123@users.noreply.github.com> Co-authored-by: N201716010711 <87008376+201716010711@users.noreply.github.com> Co-authored-by: Nwangxiaoning <71813629+wangxn12138@users.noreply.github.com> Co-authored-by: NYuanle Liu <yuanlehome@163.com> Co-authored-by: ZZK <359521840@qq.com> Co-authored-by: Nzhangkaihuo <zhangkaihuo@baidu.com> Co-authored-by: NRoc <30228238+sljlp@users.noreply.github.com> Co-authored-by: NPuQing <me@puqing.work> Co-authored-by: NZhang Jun <ewalker@live.cn> Co-authored-by: NCharles-hit <56987902+Charles-hit@users.noreply.github.com> Co-authored-by: Nniuliling123 <51102941+niuliling123@users.noreply.github.com> Co-authored-by: Nwenbin <wang3323032@qq.com> Co-authored-by: Nwangshengxiang <121413869+shengxiangwang@users.noreply.github.com> Co-authored-by: NBo Zhang <105368690+zhangbopd@users.noreply.github.com> Co-authored-by: NAurelius84 <zhangliujie@baidu.com> Co-authored-by: Nzxcd <228587199@qq.com> Co-authored-by: Nzhoutianzi666 <39978853+zhoutianzi666@users.noreply.github.com> Co-authored-by: Ngouzil <66515297+gouzil@users.noreply.github.com> Co-authored-by: Nzhangyikun02 <48021248+zhangyk0314@users.noreply.github.com> Co-authored-by: NHui Zhang <zhtclz@foxmail.com> Co-authored-by: NWang Bojun <105858416+wwbitejotunn@users.noreply.github.com> Co-authored-by: NGuanghua Yu <742925032@qq.com> Co-authored-by: NYUNSHEN XIE <1084314248@qq.com> Co-authored-by: NZhong Hui <zhonghui.net@gmail.com> Co-authored-by: Nrisemeup1 <62429225+risemeup1@users.noreply.github.com> Co-authored-by: Nliuruyan <44316842+liuruyan@users.noreply.github.com> Co-authored-by: NLeo Chen <chenqiuliang@baidu.com> Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com> Co-authored-by: Nwuhuachaocoding <77733235+wuhuachaocoding@users.noreply.github.com> Co-authored-by: NCcc <52520497+juncaipeng@users.noreply.github.com>
-
由 Huang Jiyi 提交于
-
由 YuanRisheng 提交于
* unify_kernel * fix compile bugs * modify macro name * perfect code according comment * fix compile bugs * fix compile bugs * fix ci bugs * fix ci bug * fix ci bugs * fix ci bugs * modify code according comment * rm conv_fusion_op
-
- 07 2月, 2023 3 次提交
-
-
由 Yuanle Liu 提交于
-
由 chalsliu 提交于
-
由 Ruibiao Chen 提交于
-
- 06 2月, 2023 5 次提交
-
-
由 zmxdream 提交于
* add dump_walk_path (#193) * add dump_walk_path; test=develop * add dump_walk_path; test=develop * add dump_walk_path; test=develop * Add multiple CPU communication, parameter query and merging functions, support batch alignment between multiple cards (#194) * compatible with edge_type of src2dst and src2etype2dst (#195) * do not merge_feature_shard when using metapath_split_opt (#198) * support only load reverse_edge (#199) * refactor GraphTable (#201) * fix * fix * fix code style * fix code style * fix test_dataset * fix hogwild worker * fix code style * fix code style * fix code style * fix code style * fix code style. * fix code style. --------- Co-authored-by: Ndanleifeng <52735331+danleifeng@users.noreply.github.com> Co-authored-by: Nqingshui <qshuihu@gmail.com> Co-authored-by: NWebbley <liwb5@foxmail.com> Co-authored-by: Nhuwei02 <53012141+huwei02@users.noreply.github.com>
-
由 zyfncg 提交于
* remove extra input of conv2d * fix bug * fix unittest bug * adjust conv2d.pbtxt * fix cpu_quantize_pass_tester * revert use_addto of conv2d * fix runtime attribute * fix bug * recover force_fp32_output in conv2d * refine error info * fix bug
-
由 Yuang Liu 提交于
-
由 Siming Dai 提交于
* fix to_dlpack for loop * fix reference count
-
由 engineer1109 提交于
-
- 04 2月, 2023 1 次提交
-
-
由 Huihuang Zheng 提交于
As the title
-
- 03 2月, 2023 4 次提交
-
-
由 Sławomir Siwek 提交于
* replace matmul with matmul_v2 in fuse passes * Remove fusion logic from matmul * removing fusion methods * add proper name * adjust namespaces * clean attrs in python tests * delete checkpoint and restore matmul version * remove unused code * matmul and reshape/transpose fuses migrated * split MatmulOneDNN headers * fuse activation and eltwise_add * add fuse_activation * matmul_transpose_reshape/reshape_transpose_matmul * matmul + elementwise_add (fused) * activation temporary modifciation * merge newest develop * remove depedency from other PR * revert pbtxt * remove placeholders from matmul_v2 * add description in OPMaker * remove matmul_v2_op.h and all depedencies * remove dims changing in base op * add possibility to fuse already fused_matmul * restart broken CI * Empty-Commit * revert matmul_utils.h * codestyle * adjust imports * add pbtxt file * 100% matmul unit tests coverage * trigger CI with minimal changes to develop * adjust changes to develop * add fused_matmul op * inherit base ops * add "v2" * move OPMaker * Gradually add fused_matmul files * second batch of fused_matmul changes * split infershapes of matmul_v2 and fused_matmul * inherit fused_matmul from matmul_v2 * Update paddle/phi/backends/onednn/onednn_reuse.h Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> * Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> --------- Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
-
由 Paulina Gacek 提交于
* conv_bias_mkldnn_fuse_pass_tester rewritten * conv_concat_relu_mkldnn_fuse_pass_tester rewritten * conv_elementwise_add_fuse_pass_tester rewritten * mkldnn changed to onednn * tests added to cmakeLists, style fix * got rid of unnecessary UT, some style changes * changes in naming convention * max_examples reduced * time out added
-
由 Yuang Liu 提交于
-
由 Ruibiao Chen 提交于
* Reduce time cost of BuildOpHappensBefore * Update code * Update code * Improve data struct
-
- 02 2月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* fix bugs * fix ci bugs
-
- 01 2月, 2023 2 次提交
-
-
由 Yuang Liu 提交于
-
由 Wang Bojun 提交于
* preln_residual 2 fused_bias_residual * skip layernorm fix and ut * code refine * code style refine * fix ut * fix output * add trt layer fall back info * refine op teller and ut * DropoutMaskOut output fix
-
- 31 1月, 2023 6 次提交
-
-
由 wenbin 提交于
* gn_silu * add ut * set TIMEOUT * correct comments * comments * disable windows ut * rename parameter
-
由 Zhang Jun 提交于
-
由 niuliling123 提交于
-
由 Charles-hit 提交于
* polish static grad op maker gen * fix some bugs * fix static code gen * solve conflict * modify composite grad maker name * integrate phi and fluid info in static code gen * rename some composite maker * modify static code gen format
-
由 TeFeng Chen 提交于
* support inplaced variable in cinn_launch * fix error hint when compiling * fix inplaced output variable of the subgraph * skip CinnCompiler check * using existed definition * fix namespace reference error * modify error message * update cinn tage * fix namespace * skip enforce check * fix unittest attribute throw
-
由 pangyoki 提交于
-
- 30 1月, 2023 5 次提交
-
-
由 jiangcheng 提交于
-
由 engineer1109 提交于
replace all TensorFromVector & TensorToVector AssignKernel async copy
-
由 Ruibiao Chen 提交于
* Support stream priority for standalone executor * Fix compile error * Fix compile error * Fix compile error * Fix compile error * Fix compile error
-
由 zmxdream 提交于
* add set slot_num for psgpuwraper (#177) * add set slot_num_for_pull_feature for psgpuwarper * Add get_epoch_finish python interface (#182) * add get_epoch_finish interface * add return * delete return * add unzip op (#183) * fix miss key for error dataset (#186) * fix miss key for error dataset * fix miss key for error dataset Co-authored-by: Nyangjunchao <yangjunchao@baidu.com> * add excluded_train_pair and infer_node_type (#187) * support return of degree (#188) * fix task stuck in barrier (#189) Co-authored-by: Nyangjunchao <yangjunchao@baidu.com> * check node/feature format when loading (#190) * check node&feature format when loading * check node&feature format when loading (2£ (2) * degrade log (#191) * [PGLBOX]fix conflict * [PGLBOX]fix conflict * [PGLBOX]replace LodTensor with phi::DenseTensor * [PGLBOX]fix gpu_primitives.h include path * [PGLBOX]from platform::PADDLE_CUDA_NUM_THREADS to phi::PADDLE_CUDA_NUM_THREADS * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip ut * [PGLBOX]fix unzip ut * [PGLBOX]fix code style * [PGLBOX]fix code style * [PGLBOX]fix code style * fix code style * fix code style * fix unzip ut * fix unzip ut * fix unzip ut * fix unzip * fix code stype * add ut * add c++ ut & fix train_mode_ set * fix load into memory * fix c++ ut * fix c++ ut * fix c++ ut * fix c++ ut * fix code style * fix collective * fix unzip_op.cc * fix barrier * fix code style * fix barrier * fix barrier * fix code styple * fix unzip * add unzip.py * add unzip.py * fix unzip.py --------- Co-authored-by: Nchao9527 <33347532+chao9527@users.noreply.github.com> Co-authored-by: NSiming Dai <908660116@qq.com> Co-authored-by: Nhuwei02 <53012141+huwei02@users.noreply.github.com> Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>
-
由 gem5 提交于
-
- 29 1月, 2023 4 次提交
-
-
由 jiangcheng 提交于
-
由 sneaxiy 提交于
* add missing proto file * fix windows ci * fix ci compile error
-
由 jiangcheng 提交于
* [CINN] collect inplace var into cinn op desc's kInplaceVarNames attribute * attr move from op desc to subgraph * GetFetchIds from var_map instead of var_model_to_program_map_
-
由 Yuang Liu 提交于
-
- 18 1月, 2023 2 次提交
-
-
由 Sławomir Siwek 提交于
* extract fuse pass logic to header file * adjust namespaces * Update paddle/fluid/framework/ir/mkldnn/activation_onednn_fuse_pass.h update date Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> * add inline remove static Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
-
由 Leo Chen 提交于
-