- 11 Feb, 2023 1 commit
-
-
Committed by Wang Bojun
* eleadd_trans first version log fix * refine code for linear format, add pass check * linear format refine and ut fix * fix ut * windows ut * windows ut 2 * move tensorMeta and alloc to configure
-
- 10 Feb, 2023 21 commits
-
-
Committed by umiswing
-
Committed by Ruibiao Chen
-
Committed by Leo Guo
d_bias are nullptr. Modify the code style of full_kernel.cc. Add new data type for concat, elementwise_add, gather, scale, scatter ops. test=kunlun
-
Committed by risemeup1
-
Committed by Aurelius84
* Fix inferMefer in transpose2_grad * fix infershape * fix unittest
-
Committed by ykkk2333
-
Committed by yuehuayingxueluo
-
Committed by wangxiaoning
* fluid clean * fix optimizer * fix distributed_transpiler * fix fluid.__init__ * remove from fluid.init
-
Committed by Infinity_lee
-
Committed by RedContritio
* add dim check in scatter * add check in scatter.cu * add unittest * remove unnecessary log and comment --------- Co-authored-by: RedContritio <>
-
Committed by HongyuJia
* fix NLP-Bert model performance loss * fix windows compile error
-
Committed by risemeup1
* fix test_fleet_exe_dist_model_run * test
-
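Committed by Weilong Wu
-
Committed by zhupengyang
-
Committed by LoneRanger
* add a value-range check on the dimension for split * validate the axis value for glu and add a unit test * improve the glu unit tests * fix glu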
-
Committed by Aurelius84
-
Committed by mhy-666
* add test_std * add test_var * fix std/var assertequal * fix std/var assertequal * fix std/var assertequal * -madd api name to reduce_api * fix * fix var * fix * fix * fix stat * fix unitest * fix stat/var * fix stat/var, unittest * fix stat/std, unittest * add unittest of var,std, fix stat/var,std * fix stat/var, unittest * fix * fix unittest * fix * fix * fix * fix unittest
-
Committed by HongyuJia
-
Committed by Huang Jiyi
* remove AllocatorFacade in phi * fix include * fix bugs
-
Committed by Huang Jiyi
* rm gradient_accumulator in phi * update
-
Committed by wangshengxiang
-
- 09 Feb, 2023 18 commits
-
-
Committed by Zhang Jun
* update * support int64 shape tensor as engine input * add inference_predictor ut
-
Committed by Leo Guo
-
Committed by Roc
Co-authored-by: zhangxiaoci <zhangxiaoci@baidu.com>
-
Committed by xiongkun
-
Committed by joanna.wozna.intel
* Adjust mkldnn_placement_pass to check library type and data type * Check if var has inputs * Remove unrelated test * Refactor
-
Committed by zqw_1997
* remove dygraph.parallel.ParallelEnv * logger.py error: AttributeError: module 'paddle' has no attribute 'distributed' * move the implementation to the root folder * logger.py import ParallelEnv from paddle.parallel to avoid circular import * add the comment of why import ParallelEnv from paddle.parallel in logger.py and remove the api interface in the paddle/parallel.py * outdated Env and note removed * decouple the logger.py and ParallelEnv * remove another ref of parallel in init.py
-
Committed by Huang Jiyi
* decouple strided_memcpy * move strided_memcpy * move strided_memcpy to phi * fix namespace * update * fix gpu compile bugs
-
Committed by Huang Jiyi
-
Committed by yuehuayingxueluo
* add multi_tenosr_adam * update multi_tensor_base.py, test_multi_tensor_adam.py, adamw.py * fix adam.py optimizer.py * fix adamw.py * fix test_multi_tensor_adam.py * fix CI bug * fix CI coverage * fix ci bug * fix betapow * fix some bugs * fix test_adamw_op.py * fix CI coverage * fix multi_tensor_adam_kernel.cc * fix CI bug * fix multi_tensor_adam_op.cc and test_multi_tensor_adam.py * fix code style * update C++ parts * remove python parts modification temporarily * add C++ ut * update betapow copy code logic * fix ci ut * fix windows ci * fix coverage ci * improve coverage rate --------- Co-authored-by: sneaxiy <sneaxiy@126.com>
-
Committed by YangZhou
* fix typo * add sox_io in audio test * fix * fix
-
Committed by LiYuRio
-
Committed by yuehuayingxueluo
* fix the processing order of passes in pass_base.py * fix processing order * add _PASS_PROCESS_ORDER_LIST * delete some pass in _PASS_PROCESS_ORDER_LIST * add assert in pass_base.py * remove fuse_optimizer * add _fusion_opt_list_rule * add test_pass_base_list.py * fix some bug * add fused_attention * add some passes to list * fix ci bug * fix ci bug
-
Committed by wangzhen38
-
Committed by zhoutianzi666
* add fmha_flashattention oss plugin * add fmhca * add oss fmhca * code reconstruct and add ut * code style refine * fix ut and enforce check * refine trt version check refine compile fix compile * fix cross ut * code refine * use runtime trt version check * bug fix and code refine * compile fix * merge develop * add GN QDQ kernel * support GN int8 fake kernel * add with_int8 * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 UT * add version > 8000 in GN int8 UT * add some check in .cu * add stdlib.h in UT * little change in .cu * remove rand_r use rand * remove use rand * setAxis(1) * when int8 is on allow fall back to fp16 --------- Co-authored-by: wwbitejotunn <wang_bojun@outlook.com>
-
Committed by wangxiaoning
-
Committed by kangguangli
* fix judgement about scope validation * fix ci bug: same address is not enough for data consistency * remove useless check
-
Committed by pangengzheng
-
Committed by Jiabin Yang
-