- 18 2月, 2020 1 次提交
-
-
由 yiicy 提交于
-
- 14 2月, 2020 4 次提交
-
-
由 tienfeek 提交于
test=develop
-
由 Yiqun Liu 提交于
* Optimize softmax. When the input tensor is 2-D and axis is 1, there is no need to resize. * Optimize the gru, avoid calling Tensor::Slice. test=develop * Remove a std::vector in softmax. test=develop * Define CalculateSeqWidth to get the width of a sequence. test=develop
-
由 Yuan Shuai 提交于
* comment All opencl buffer kernels. test=develop * refactor conv, depthwise into one routing selection. test=develop
-
由 xiaogang 提交于
fix fpga lite_tensor compile bug add fake quantize_abs_max op test=develop
-
- 13 2月, 2020 2 次提交
-
-
由 GaoWei8 提交于
* replace gru RowWiseAdd Eigen with c implementation test=develop
-
由 Yuan Shuai 提交于
* [LITE][OPENCL] Add opencl image2d conv3x3. test=develop
-
- 12 2月, 2020 7 次提交
-
-
由 Jiaying Zhao 提交于
* [LITE][OPENCL]Add scale kernel. * [LITE][OPENCL]Add scale kernel, format code style.
-
由 HappyAngel 提交于
add concat ut, test=develop * fix axis compute, test=develop * add other axis, test=develop * fix ut. test=develop
-
由 HappyAngel 提交于
* fix con+relu6/leakyRelu fusion in Fp32, test=develop * note m=397 in sgemv_int8 ut, test=develop * fix ios build error. test=develop
-
由 xiaogang 提交于
fix conv_winograd multithreads bug
-
由 yiicy 提交于
* [ARM] add 5x5s2 depthwise conv armv8 impl, test=develop * [ARM] add int8 5x5s2 dw conv armv7 impl, test=develop * [ARM] add int8 5x5s2 dw conv impl, test=develop * [ARM] close int8 conv ut, test=develop
-
由 yiicy 提交于
[OPENCL] add sigmoid image2d kernel and ut
-
由 xiaogang 提交于
* fix: update fpga backend and kernel test=develop * style: style fix test=develop
-
- 11 2月, 2020 1 次提交
-
-
由 Yiqun Liu 提交于
* Optimize the transform from Paddle' Tensor to EigenVector, avoiding defining multiple DDim. * Optimize the compute implementation of several operators. test=develop
-
- 10 2月, 2020 2 次提交
-
-
由 xiaogang 提交于
-
由 Yuan Shuai 提交于
[LITE][OPENCL] Add 3 kernels of ElementwiseAdd/FusionElemenwiseAddAct op with opencl image format (#2844) * [LITE][OPENCL] Add 3 kernels of ElementwiseAdd/FuseElementwiseAdd op. test=develop
-
- 08 2月, 2020 1 次提交
-
-
由 xiebaiyuan 提交于
* [LITE][OPENCL][Image]develop 1x1/5x5/7x7 routing in conv_compute ,test=develop * [LITE][OPENCL][Image]develop 1x1/5x5/7x7 routing in conv_compute ,convert bias filter in prepare for run ,test=develop
-
- 07 2月, 2020 1 次提交
-
-
由 Yuan Shuai 提交于
* [LITE][OPENCL] remove useless code. test=develop * [LITE][OPENCL] finish 4 kernel and unit tests of image2d opencl elementwise_mul kernel. test=develop * [LITE][OPENCL] Fix little bug of ASSERT. test=develop * [LITE][OPENCL] Fix bug of channel_mul_d2 and d4. test=develop
-
- 05 2月, 2020 1 次提交
-
-
由 HappyAngel 提交于
[arm] fix conv_dw leakyRelu compute error
-
- 04 2月, 2020 1 次提交
-
-
由 yiicy 提交于
* refactor 5x5s1 dw conv armv8, test=develop * [ARM] refactor depthwise conv 5x5s1, and support relu6, leakey relu, test=develop * [ARM] sgemv support fuse relu6 and leakey relu,test=develop * [ARM] reduce some conv ut case, test=develop * [ARM] fix 5x5dw conv pick kernel bug, test=develop * fix code style, test=develop * [ARM] fix sgemv fuse relu6 bug, test=develop * [ARM] fix fp32 5x5s1 dw bug, test=develop * [ARM] fix fp32 5x5 dw conv pick kernel bug, test=develop
-
- 03 2月, 2020 2 次提交
-
-
由 Yuan Shuai 提交于
-
由 xiebaiyuan 提交于
* [LITE][OPENCL]develop basic image depthwiseconv,passed loop test,test=develop * [LITE][OPENCL]log to vlog(4),test=develop * [LITE][OPENCL]fix depthwise buffer conv kernel name ,test=develop
-
- 19 1月, 2020 2 次提交
-
-
由 HappyAngel 提交于
-
由 xiebaiyuan 提交于
* [LITE][OPENCL] conv2d_1x1_image, choose simple kernel when in some case. for opencl ,test=develop * [LITE][OPENCL] conv2d_1x1_image, add looptest ,test=develop
-
- 17 1月, 2020 2 次提交
- 16 1月, 2020 1 次提交
-
-
由 HappyAngel 提交于
1. add conv_5x5s2_dw to support any padding 2. add 1x1s2pooling impl 3. fix conv dw 3x3 s1p01 bug
-
- 15 1月, 2020 1 次提交
-
-
由 HappyAngel 提交于
* fix, test=develop * add fc_relu, test=develop
-
- 14 1月, 2020 2 次提交
-
-
由 HappyAngel 提交于
add gemm + relu6/leakyrelu fusion
-
由 myq406450149 提交于
* Support bitman backend
-
- 02 1月, 2020 2 次提交
-
-
由 石晓伟 提交于
-
由 hong19860320 提交于
-
- 31 12月, 2019 1 次提交
-
-
由 hong19860320 提交于
* Fix the compiling error which occurs when specify the ddk_root path and build for huawei NPU. * Refine the registration of op bridges and make it similar to the registration of op and kernel. * Refine the interfaces of the graph and node for op bridges, and support creating constant and data node automatically according to the attribute 'persistable' of the target tensor. * Add the unit test of the scale and softmax op bridge for NPU.
-
- 26 12月, 2019 1 次提交
-
-
由 Wilber 提交于
-fix fluid-lite-subgraph x86 compile error - Replace FLAGS with environment variables
-
- 25 12月, 2019 1 次提交
-
-
由 Yiqun Liu 提交于
* Remove GEMM padding in fc_compute. test=develop * Write a common ParallelFor function to run the for loop in parallel. * Add the codes of padding GEMM back in fc. * Refine the code of fc when padding_weight is false to avoid the definition of temporary Tensor. * Refine the unit test of fc and add testing case of padding and parallel. test=develop * Enable more test cases in common fc unittest, including padding and parallel for x86 target. * Remove the fc test under kernels/x86. test=develop * Disable relu in test of fc for non-x86 target. test=develop * Change the eps of arm. test=develop
-
- 24 12月, 2019 1 次提交
-
-
由 hong19860320 提交于
* Support multiple types for XPU and NPU op bridges * Add lookup_table, gather, slice, stack and scale op bridges for supporting BERT * Fix the definition of lookup_table kernel for X86
-
- 23 12月, 2019 2 次提交
-
-
由 HappyAngel 提交于
-
由 HappyAngel 提交于
* fix conv 2-pad to 4-pad * fix compute conv shape * fix pad, test=develop * change conv_depthwise_3x3s1_fp.cc name to conv3x3s1p01_depthwise_fp32.cc to distinguish between conv3x3s1_depthwise_fp32.cc * delete printf note in conv3x3s1, test=develop * delete printf note, test=develop * delete gem_sdot.h, test=develop it is coped from __gemm_sdot_meta_.h * update compute padding, test=develop * fix padding size, must be 2 or 4. test=develop * fix format in operators/conv_op.cc, test=develop * change #if 0 to #if 1, test=develop * put 2-pad to 4-pad in AttachImpl, test=develop * fix clang-format error inn tests/math/connv_compute_test, test=develop * fix x86 test result error, test=develop * add asymmetric padding test case in liite/tests/math/conv_compute.cc, test=develop * change paddings type to support dynamically modify, test=develop * fix x86 build error in connv_compute_test, test=develop * fix opencl build error, test=develop * fix oopencl build error, test=develop * fix opencl/conv_compute build error, test=develop * fix opencl/conv_compute build error, test=develop * fix format in kernels/opencl/conv_computte_ttest,test=develop * fix build error, test=develop fix build error in kernels/x86/conv_compute.h * fix ccompute shape error in ooperators/conv_op.h, test=develop * add conv_reelu6 and conv leaky_relu in conv_3x3s1_direct * add conv_relu6 in c1, c2, c4,test=develop * fix conflict in conv_bloock_utils.h, test=develop * add relu6 and leankyrelu in conv_3x3s1_dw * add conv_3x3s1px_dw relu6 and leaky_relu fusion, test=develop * fix conflict in tests/math/conv_compute_arm, test=develop * fix build error in winograd arm, test=develop * channge act_param as pointer in conv_block_tuils.h, test=develop * fix winograd in no equal 4-padding compute error, test=develop * add conv relu6 and leaky_relu in conv_dw_3x3s2, test=develop * fix format, test=develop * fix format in conv_block_utils, test=develop * move updatePadding from conv_op.cc to conv_op.h, test=develop * fix format conv_op.h, test=develop * fix buuilde error in conv_oop.h, test=develop * remove flag_relu parameter in conv_3x3_depthwise, test=develop
-
- 20 12月, 2019 1 次提交
-
-
由 zhupengyang 提交于
* [XPU] add transpose bridge and unit test test=develop
-