- 10 Feb, 2023 1 commit
-
-
Committed by wangshengxiang
-
- 09 Feb, 2023 23 commits
-
-
Committed by Zhang Jun
* update
* support int64 shape tensor as engine input
* add inference_predictor ut
-
Committed by Leo Guo
-
Committed by Roc
Co-authored-by: zhangxiaoci <zhangxiaoci@baidu.com>
-
Committed by xiongkun
-
Committed by joanna.wozna.intel
* Adjust mkldnn_placement_pass to check library type and data type
* Check if var has inputs
* Remove unrelated test
* Refactor
-
Committed by zqw_1997
* remove dygraph.parallel.ParallelEnv
* logger.py error: AttributeError: module 'paddle' has no attribute 'distributed'
* move the implementation to the root folder
* logger.py import ParallelEnv from paddle.parallel to avoid circular import
* add the comment of why import ParallelEnv from paddle.parallel in logger.py and remove the api interface in the paddle/parallel.py
* outdated Env and note removed
* decouple the logger.py and ParallelEnv
* remove another ref of parallel in init.py
-
Committed by Huang Jiyi
* decouple strided_memcpy
* move strided_memcpy
* move strided_memcpy to phi
* fix namespace
* update
* fix gpu compile bugs
-
Committed by Huang Jiyi
-
Committed by yuehuayingxueluo
* add multi_tensor_adam
* update multi_tensor_base.py, test_multi_tensor_adam.py, adamw.py
* fix adam.py optimizer.py
* fix adamw.py
* fix test_multi_tensor_adam.py
* fix CI bug
* fix CI coverage
* fix ci bug
* fix betapow
* fix some bugs
* fix test_adamw_op.py
* fix CI coverage
* fix multi_tensor_adam_kernel.cc
* fix CI bug
* fix multi_tensor_adam_op.cc and test_multi_tensor_adam.py
* fix code style
* update C++ parts
* remove python parts modification temporarily
* add C++ ut
* update betapow copy code logic
* fix ci ut
* fix windows ci
* fix coverage ci
* improve coverage rate
---------
Co-authored-by: sneaxiy <sneaxiy@126.com>
-
Committed by YangZhou
* fix typo
* add sox_io in audio test
* fix
* fix
-
Committed by LiYuRio
-
Committed by yuehuayingxueluo
* fix the processing order of passes in pass_base.py
* fix processing order
* add _PASS_PROCESS_ORDER_LIST
* delete some pass in _PASS_PROCESS_ORDER_LIST
* add assert in pass_base.py
* remove fuse_optimizer
* add _fusion_opt_list_rule
* add test_pass_base_list.py
* fix some bug
* add fused_attention
* add some passes to list
* fix ci bug
* fix ci bug
-
Committed by wangzhen38
-
Committed by zhoutianzi666
* add fmha_flashattention oss plugin
* add fmhca
* add oss fmhca
* code reconstruct and add ut
* code style refine
* fix ut and enforce check
* refine trt version check refine compile fix compile
* fix cross ut
* code refine
* use runtime trt version check
* bug fix and code refine
* compile fix
* merge develop
* add GN QDQ kernel
* support GN int8 fake kernel
* add with_int8
* add GN int8 fake kernel
* add GN int8 fake kernel
* add GN int8 fake kernel
* add GN int8 fake kernel
* add GN int8 fake kernel
* add GN int8 fake kernel
* add GN int8 fake kernel
* add GN int8 UT
* add version > 8000 in GN int8 UT
* add some check in .cu
* add stdlib.h in UT
* little change in .cu
* remove rand_r use rand
* remove use rand
* setAxis(1)
* when int8 is on allow fall back to fp16
---------
Co-authored-by: wwbitejotunn <wang_bojun@outlook.com>
-
Committed by wangxiaoning
-
Committed by kangguangli
* fix judgement about scope validation
* fix ci bug: same address is not enough for data consistency
* remove useless check
-
Committed by pangengzheng
-
Committed by Jiabin Yang
-
Committed by zhangbo9674
* add TypeID
* Specification comment code
* refine code
* add AbstractType
* add TypeStorage
* fix unittest bug
* change dir
* change dir
* refine code
* fix bug
* Refine code by comment
* delete unused code
* normative naming rules
* refine code by comment
* refine doc
* refine codestyle
-
Committed by wanghuancoder
-
Committed by zhangyikun02
-
Committed by Wang Bojun
* trans_layernorm
-
Committed by 傅剑寒
-
- 08 Feb, 2023 16 commits
-
-
Committed by Paulina Gacek
* QuantTranspose pattern is being found by pass
* quant + transpose fuse
* code style changes
* UT written, reorder fixed
* Dequantize + transpose2 fuse added
* pass name changed
* UT added & shift corrected
* got rid of redundancy
* review changes
* AsIntermediate corrected
* compat added
-
Committed by Sławomir Siwek
* add support for bf16 fused_ops
* fused_matmul only
-
Committed by wangxiaoning
* fix codestyle
* fix std
-
Committed by Zhang Jun
* update
* update
* format code
* update
* Update test_trt_convert_nearest_interp_v2.py
-
Committed by Yuang Liu
-
Committed by zmxdream
* hidden unzip
* fix
* fix
-
Committed by weishengying
-
Committed by HongyuJia
-
Committed by tianshuo78520a
-
Committed by tianshuo78520a
-
Committed by zhangkaihuo
* fix bn performance degradation
-
Committed by cyber-pioneer
* add flag
* change flag
* use prim flag
* fix code
* fix softmax prim flag
* set case timeout
-
Committed by LiYuRio
-
Committed by HongyuJia
* support unsigned int and bfloat16
* update unit test
* update DenseTensor datatype
* unsupport more datatype of mutable_data(Place)
* fix unittest
-
Committed by risemeup1
-
Committed by risemeup1
* add_eager_and_jit,test=ljd_test
* test
* test,test=ljd_test
* test,test=ljd_test
* test,test=ljd_test
* test,test=ljd_test
* test,test=ljd_test
* test,test=ljd_test
* add_eager_jit_dir,test=ljd_test
* fix conflict,test=ljd_test
* test,test=ljd_test
* get new precise_map,test=ljd_test
* test
-