- 29 11月, 2022 10 次提交
-
-
由 Sławomir Siwek 提交于
* cleanup unused code * unify is_int8 is_bfloat16 * Simplify matmul_v2 FWD kernel * remove RunKernel methods * remove import namespace * remove headers * clean fluid/phi cross imports * remove fluid axpy_handler * delete fluid methods * activations * OneDNNMemDesc * MKLDNNFormatForSize * MatchShapeToLayout * MKLDNNMemoryFormat * MKLDNNFormat * ReorderMKLDNNHandler * to_void_cast * review suggestions * interpolate * remove fluid depedency * init * ExecuteMatMulV2 * rm fluid kernel * matmul_grad * remove mutable_data * mul_grad * matmul fwd * add extra attr * temp disable passes * re-enable passes * workaround for matmul+act * fix for matmul+eltwise_add * fix typo * merge bugfix #48364 * remove merge conflict
-
由 kangguangli 提交于
* fix:add no support for cuda_arch<700 * replace Executor in while op with InterpreterCore * cache InterpreterCore as the member of WhileOp * fix bug: tensor place changed because of assign op in while loop * refine code * refine code * refine code * hot fix * fix compile * merge develop * follow comments * add log for test * remove LoDTensor * set flag control_flow_use_new_executor false Co-authored-by: Nfengshuai <fengshuai03@baidu.com> Co-authored-by: Nzhiqiu <chenqiuliang@baidu.com>
-
由 haosicheng 提交于
-
由 JZ-LIANG 提交于
* get default calc stream from execution ctx instead of global dev ctx pool.
-
由 gem5 提交于
-
由 LiYuRio 提交于
* remove lod_tensor_to_array, array_to_lod_tensor, DynamicRNN * remove less_equal, greater_than, greater_equal, equal, not_equal
-
由 Sławomir Siwek 提交于
-
由 HappyHeavyRain 提交于
* generate static graph code for lerp by yaml, test=develop * modify the op_compat.yaml of lerp, test=develop * generate static graph code for lerp by yaml, test=develop * modify the op_compat.yaml of lerp, test=develop * remove the 'attrs' of lerp, test=develop Signed-off-by: lizhiyu02 <1528794076@qq.com> Signed-off-by: lizhiyu02 <1528794076@qq.com>
-
由 zhangkaihuo 提交于
-
由 zhangkaihuo 提交于
-
- 28 11月, 2022 22 次提交
-
-
由 Sławomir Siwek 提交于
-
由 jakpiase 提交于
* re-enabled reshape, squeeze and flatten kernels * added formatting
-
由 Wang Bojun 提交于
* add trt support
-
由 zyfncg 提交于
* generate static graph code for some operators * add some ops generate * revert npu gelu
-
由 huangjiyi 提交于
* decouple cudnn_desc.h from fluid * move cudnn_desc.h from fluid to phi * fix bugs * decouple cudnn_helper.h from fluid * fix bugs * move cudnn_helper.h from fluid to phi * add fluid cudnn_helper.h * move miopen_desc.h from fluid to phi * move miopen_helper.h from fluid to phi * fix bugs * move gpu_dnn.h from fluid to phi * fix bugs * update copyright year * simplify gpu_dnn.h in fluid * fix bugs * fix xpu build bug * fix compile bug * fix bug
-
由 张春乔 提交于
-
由 Yiqun Liu 提交于
* Optimize the log of broadcast and decrease the log level. * Remove the redundant brackets. * Change op benchmark ci to test the tests module. * Remove the observe of elementwise and reduce_ops sub-directory.
-
由 YuanRisheng 提交于
* Fix onednn kernel bugs * fix gpu bugs
-
由 Asthestarsfalll 提交于
-
由 PuQing 提交于
-
由 Qi Li 提交于
* [NPU] apply npu_identity to conv bn and copy2cpu, test=develop * update npu identity to share data with x, test=develop * address review comments, test=develop
-
由 zhangbo9674 提交于
* add trace mode for interpretercore * fix bug * add a ctrl flag * add record for memcpyd2h * polish code * polish code
-
由 Ruibiao Chen 提交于
* Remove kSyncRun in StreamAnalyzer * Update code
-
由 huangjiyi 提交于
* rm fluid “xpu_header.h” deps in phi * move part of xpu_op_list.h from fluid to phi * add fluid xpu_op_list deps * add glog deps for xpu_op_list in phi * fix PR-CI-Kunlun
-
由 zyfncg 提交于
* add fluid_op_name_map * rename some kernel name * add comments for op-kernel map * refine map name of op to kernel
-
由 MarDino 提交于
-
由 wenbin 提交于
-
由 xiaoxiaohehe001 提交于
* add_gather_nd_ * add_gather_nd_ * add_gather_nd_
-
由 Thomas Young 提交于
* fix expand as op * fix bug
-
由 haosicheng 提交于
-
由 xiaoguoguo626807 提交于
* remove fluid.reduce_sum * remove fluid.reduce_sum * modify axis and import paddle * modify keepdim and out_name * modift unittest * modift unittest * modify CI_static and loss.py * modify test_mse_loss * modify static ci * modify static ci datatype * add import paddle in test * fix conflict * fix conflict * modify ci * modify ci * fix_conflict * fix bug * code_style
-
由 张春乔 提交于
* Update communicator.cc * Update communicator.cc * remove LoDTensor * remove LoDTensor and Tensor
-
- 26 11月, 2022 2 次提交
- 25 11月, 2022 6 次提交
-
-
由 zhangxin81 提交于
* fix loopup_table plugin deserialize size error
-
由 wanghuancoder 提交于
* for xpu multi thread bug test
-
由 Wangzheee 提交于
* fix
-
由 Wang Bojun 提交于
* group norm fp16 support
-
由 Chitsing KUI 提交于
* attr ready * op ip ready * start dynamic * end2end ok * input shape to map, stat by op * layer wip * first version ready * fix proto depds * fix profiler deps * fix flops typo, rm tuple shape
-
由 Ruibiao Chen 提交于
* Move stream_anayzer to interpreter * Refactor StreamAnalyzer * Refactor RunNextInstructionList * Remove no_data_transform_index * Fix typos * Fix data_transfer OpFuncType error * Add event for depend_op * Update transfer OpFuncType for heter place
-