- 25 5月, 2023 2 次提交
-
-
由 ronnywang 提交于
-
由 zhoutianzi666 提交于
* add tranfer_elim * transfer layout elimination
-
- 24 5月, 2023 1 次提交
-
-
由 wz1qqx 提交于
-
- 23 5月, 2023 3 次提交
-
-
由 co63oc 提交于
-
由 cyberslack_lee 提交于
-
由 zhupengyang 提交于
-
- 22 5月, 2023 3 次提交
-
-
由 zhupengyang 提交于
-
由 zhupengyang 提交于
-
由 Yuanle Liu 提交于
[Inference] add config.enable_low_precision_io api and remove rely on AnalysisConfig::Precison in trt (#52485)
-
- 19 5月, 2023 1 次提交
-
-
由 shentanyue 提交于
-
- 18 5月, 2023 2 次提交
-
-
由 Hulek 提交于
* Fused elementwises kernels and ops * change fuse pass name * adjust .pbtxt files * adjust quantization attributes * add missing arguments and fix others, review fixed * simplify fused kernel registration * fix elementwise unit tests * reuse one fused elementwise op * adjust proto * Add supported datatypes * Change 'Scale' to 'scale' in tests, change some tests to onednn * Revert breaking changes * Fix unit tests * Delete obsolete test cases * Delete commented out code * Fix codestyle * delete temporary condition * fix conflicts and delete duplicate fusing * Fix code after merge * Move tests to new directory * fix tests volatility * Rename test_elementwise_add_onednn_op.py to test_elementwise_add_mkldnn_op.py * Update CMakeLists.txt add mkldnn op test --------- Co-authored-by: NSilv3S <slawomir.siwek@intel.com>
-
由 huangjiyi 提交于
-
- 16 5月, 2023 4 次提交
-
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
由 sprouteer 提交于
-
由 Yuanle Liu 提交于
* remove prelu land ookuip_table plugin, adjust .h include location * clean code and adjust some .h * update
-
由 Mahmoud Ashraf 提交于
* correct 1th to 1st * correct 1th to 1st * fix typo * fix typos
-
- 11 5月, 2023 1 次提交
-
-
由 Yuanle Liu 提交于
* prelu op trt converter support zero dim
-
- 10 5月, 2023 1 次提交
-
-
由 wz1qqx 提交于
* fix as review, add fp16 conv2d_transpose * fix unittest of bn and reduce_mean * fix bn unittest * fix ci * fix ci
-
- 09 5月, 2023 2 次提交
-
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
由 zhoutianzi666 提交于
* delete delete_fill_constant_op_pass and unsqueeze2_eltwise_fuse_pass
-
- 08 5月, 2023 2 次提交
- 05 5月, 2023 2 次提交
-
-
由 shentanyue 提交于
-
由 sprouteer 提交于
-
- 04 5月, 2023 1 次提交
-
-
由 weishengying 提交于
-
- 27 4月, 2023 1 次提交
-
-
由 zhupengyang 提交于
-
- 25 4月, 2023 2 次提交
-
-
由 sprouteer 提交于
-
由 YuanRisheng 提交于
* add flags for phi * fix compile bugs * fix ci bugs * fix inference bugs * fix cinn' bugs * fix cinn bugs * perfect code according comment * fix ci bugs * fix ci bugs
-
- 24 4月, 2023 2 次提交
-
-
由 zhupengyang 提交于
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test ,test=develop
-
- 23 4月, 2023 1 次提交
-
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
- 21 4月, 2023 1 次提交
-
-
由 zhupengyang 提交于
-
- 19 4月, 2023 1 次提交
-
-
由 csy0225 提交于
-
- 17 4月, 2023 1 次提交
-
-
由 zhoutianzi666 提交于
* initial commit for cutlass_teller * second commit for cutlass_teller * add conv2d_depthwise python template * add conv2d_depthwise cutlass template * /zhoukangkang/paddle_cutlass/Paddle/paddle/fluid/framework/ir/cutlass_teller.h * refine code in Conv2dFusionCanSupport * add macro in cutlass_teller.h * add 3x3 5x5 teller * add groups not 1 or conv2d_depthwise teller * 只生成ic是8的倍数的conv2d_depthwise 的kernel * add EXPLICIT in cutlass_teller.h * final commit * add split_k_slices in conv2d_depthwise * make stages == 2 * 重构部分代码 * add CutlassFusionType * solve illegal memory * make stride_h=stride_w && make dilation==1 * must check HasAttr(use_cutlass) before GetAttrIfExists * add CONV2D_DEPTHWISE_BIAS_SILU to OpType2String * modify decl.h and util.cu
-
- 14 4月, 2023 1 次提交
-
-
由 zhupengyang 提交于
-
- 13 4月, 2023 5 次提交
-
-
由 Wangzheee 提交于
* Paddle-Trt: Replace fc mul matmul matmul_v2 with matrix_multiply
-
由 csy0225 提交于
-
由 HongyuJia 提交于
* [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h * Add logging.h for profiler.cc * Add logging.h for gloo_utils.h * Add logging.h for addmm_kernel_impl.h * Add logging.h for addmm_grad_kernel_impl.h * Add logging.h for p_send_kernel.cu * Add logging.h for determinant_grad_kernel_impl.h * Add logging.h for p_recv_kernel.cu * Add logging.h for elementwise_grad_base.h * Add logging.h for transfer_layout_kernel.cc * Add logging.h for eigvals_kernel.cc and index_select_impl.h * Add logging.h for all files in kernel directory * Add logging.h for xpu_info.cc * Add logging.h for xpu
-
由 zhupengyang 提交于
-
由 csy0225 提交于
-