- 19 1月, 2020 1 次提交
-
-
由 xiebaiyuan 提交于
* [LITE][OPENCL] conv2d_1x1_image, choose simple kernel when in some case. for opencl ,test=develop * [LITE][OPENCL] conv2d_1x1_image, add looptest ,test=develop
-
- 17 1月, 2020 2 次提交
- 16 1月, 2020 1 次提交
-
-
由 HappyAngel 提交于
1. add conv_5x5s2_dw to support any padding 2. add 1x1s2pooling impl 3. fix conv dw 3x3 s1p01 bug
-
- 15 1月, 2020 1 次提交
-
-
由 HappyAngel 提交于
* fix, test=develop * add fc_relu, test=develop
-
- 14 1月, 2020 2 次提交
-
-
由 HappyAngel 提交于
add gemm + relu6/leakyrelu fusion
-
由 myq406450149 提交于
* Support bitman backend
-
- 02 1月, 2020 2 次提交
-
-
由 石晓伟 提交于
-
由 hong19860320 提交于
-
- 31 12月, 2019 1 次提交
-
-
由 hong19860320 提交于
* Fix the compiling error which occurs when specify the ddk_root path and build for huawei NPU. * Refine the registration of op bridges and make it similar to the registration of op and kernel. * Refine the interfaces of the graph and node for op bridges, and support creating constant and data node automatically according to the attribute 'persistable' of the target tensor. * Add the unit test of the scale and softmax op bridge for NPU.
-
- 26 12月, 2019 1 次提交
-
-
由 Wilber 提交于
-fix fluid-lite-subgraph x86 compile error - Replace FLAGS with environment variables
-
- 25 12月, 2019 1 次提交
-
-
由 Yiqun Liu 提交于
* Remove GEMM padding in fc_compute. test=develop * Write a common ParallelFor function to run the for loop in parallel. * Add the codes of padding GEMM back in fc. * Refine the code of fc when padding_weight is false to avoid the definition of temporary Tensor. * Refine the unit test of fc and add testing case of padding and parallel. test=develop * Enable more test cases in common fc unittest, including padding and parallel for x86 target. * Remove the fc test under kernels/x86. test=develop * Disable relu in test of fc for non-x86 target. test=develop * Change the eps of arm. test=develop
-
- 24 12月, 2019 1 次提交
-
-
由 hong19860320 提交于
* Support multiple types for XPU and NPU op bridges * Add lookup_table, gather, slice, stack and scale op bridges for supporting BERT * Fix the definition of lookup_table kernel for X86
-
- 23 12月, 2019 2 次提交
-
-
由 HappyAngel 提交于
-
由 HappyAngel 提交于
* fix conv 2-pad to 4-pad * fix compute conv shape * fix pad, test=develop * change conv_depthwise_3x3s1_fp.cc name to conv3x3s1p01_depthwise_fp32.cc to distinguish between conv3x3s1_depthwise_fp32.cc * delete printf note in conv3x3s1, test=develop * delete printf note, test=develop * delete gem_sdot.h, test=develop it is coped from __gemm_sdot_meta_.h * update compute padding, test=develop * fix padding size, must be 2 or 4. test=develop * fix format in operators/conv_op.cc, test=develop * change #if 0 to #if 1, test=develop * put 2-pad to 4-pad in AttachImpl, test=develop * fix clang-format error inn tests/math/connv_compute_test, test=develop * fix x86 test result error, test=develop * add asymmetric padding test case in liite/tests/math/conv_compute.cc, test=develop * change paddings type to support dynamically modify, test=develop * fix x86 build error in connv_compute_test, test=develop * fix opencl build error, test=develop * fix oopencl build error, test=develop * fix opencl/conv_compute build error, test=develop * fix opencl/conv_compute build error, test=develop * fix format in kernels/opencl/conv_computte_ttest,test=develop * fix build error, test=develop fix build error in kernels/x86/conv_compute.h * fix ccompute shape error in ooperators/conv_op.h, test=develop * add conv_reelu6 and conv leaky_relu in conv_3x3s1_direct * add conv_relu6 in c1, c2, c4,test=develop * fix conflict in conv_bloock_utils.h, test=develop * add relu6 and leankyrelu in conv_3x3s1_dw * add conv_3x3s1px_dw relu6 and leaky_relu fusion, test=develop * fix conflict in tests/math/conv_compute_arm, test=develop * fix build error in winograd arm, test=develop * channge act_param as pointer in conv_block_tuils.h, test=develop * fix winograd in no equal 4-padding compute error, test=develop * add conv relu6 and leaky_relu in conv_dw_3x3s2, test=develop * fix format, test=develop * fix format in conv_block_utils, test=develop * move updatePadding from conv_op.cc to conv_op.h, test=develop * fix format conv_op.h, test=develop * fix buuilde error in conv_oop.h, test=develop * remove flag_relu parameter in conv_3x3_depthwise, test=develop
-
- 20 12月, 2019 1 次提交
-
-
由 zhupengyang 提交于
* [XPU] add transpose bridge and unit test test=develop
-
- 19 12月, 2019 2 次提交
-
-
由 Wilber 提交于
* optimize content-dnn cuda kernel
-
由 TianXiaogang 提交于
-
- 18 12月, 2019 1 次提交
-
-
由 juncaipeng 提交于
* Support Mask RCNN2 (#2588)
-
- 17 12月, 2019 3 次提交
-
-
由 HappyAngel 提交于
-
由 xiebaiyuan 提交于
* add reshape opencl kernel && optimise conv 1x1 ,test=develop * add reshape opencl kernel && optimise conv 1x1 &&code style ,test=develop * add reshape opencl kernel && optimise conv 1x1 &&code style ,test=develop
-
由 Yuan Shuai 提交于
-
- 16 12月, 2019 3 次提交
-
-
由 TianXiaogang 提交于
* fix: update backend fpga patch
-
由 Yuan Shuai 提交于
[LITE][OPENCL] Add relu image2d kernel unit test, Fix conv2d_1x1, relu, layout using new Image2D Layout (#2564) * add 3 layout for opencl image. test=develop * add relu image2d test. test=develop
-
由 Jiaying Zhao 提交于
* [LITE][OPENCL] Add depthwise_conv_3x3 opencl kernel * [LITE][OPENCL] Add depthwise_conv_3x3 opencl kernel. test=develop * [LITE][OPENCL] Add Pool opencl kernel. test=develop
-
- 15 12月, 2019 1 次提交
-
-
由 Wilber 提交于
optimize search_grnn
-
- 13 12月, 2019 1 次提交
-
-
由 hong19860320 提交于
[LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at execution time (#2576)
-
- 12 12月, 2019 1 次提交
-
-
由 xiebaiyuan 提交于
* add opencl conv1x1 image impl and unit test pass with relu & bias, add layout_compute --> buffer2image float32 --> with unit test pass suite checked test for more situation , test=develop * add opencl conv1x1 image impl and unit test pass with relu & bias, add layout_compute --> buffer2image float32 --> with unit test pass suite checked test for more situation , test=develop * fix white space cpp lint , test=develop
-
- 11 12月, 2019 1 次提交
-
-
由 TianXiaogang 提交于
-
- 10 12月, 2019 1 次提交
-
-
由 yiicy 提交于
-
- 09 12月, 2019 2 次提交
-
-
由 yiicy 提交于
-
由 zhupengyang 提交于
test=develop
-
- 07 12月, 2019 1 次提交
-
-
由 juncaipeng 提交于
* add arm split lod tensor, test=develop * add arm merge lod tensor, test=develop * update split merge lod tensor, test=develop * add reduce_prob op, test=develop * support mask_rcnn succeed, test=develop
-
- 04 12月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* init resnet cuda int8 support test=develop * refine cuda unit test test=develop * add the forgeted file. test=develop
-
- 03 12月, 2019 1 次提交
-
-
由 TianXiaogang 提交于
* feat: add sgemm4x4 for armv8 * fix: fix armv7 gemm choose condition
-
- 30 11月, 2019 1 次提交
-
-
由 zhupengyang 提交于
test=develop
-
- 28 11月, 2019 1 次提交
-
-
由 yiicy 提交于
-
- 27 11月, 2019 1 次提交
-
-
由 huzhiqiang 提交于
-
- 26 11月, 2019 1 次提交
-
-
由 TianXiaogang 提交于
fix: fix conv_block prepack_input_nxwc4 bug * fix: optimize sgemm_c4 in armv7 change condition of choose winograd kernel * fix: change conv choose kernel condition
-
- 25 11月, 2019 1 次提交
-
-
由 yiicy 提交于
* unfinish sgemmc4 * finish armv8 sgemmc4 * arm add sgemmc4 with deal with remain * [ARM] add sgemmc4 small kernel, test=develop * [ARM] sgemmc4 small improve armv7 speed by add 4x8 block, test=develop
-