- 31 Dec, 2019 1 commit
-
-
Committed by hong19860320
* Fix the compile error that occurs when specifying the ddk_root path and building for Huawei NPU.
* Refine the registration of op bridges to make it similar to the registration of ops and kernels.
* Refine the graph and node interfaces for op bridges, and support creating constant and data nodes automatically according to the 'persistable' attribute of the target tensor.
* Add unit tests for the scale and softmax op bridges for NPU.
-
- 30 Dec, 2019 2 commits
-
-
Committed by juncaipeng
* fix yolov3 bug when run several times, test=develop
-
Committed by Yiqun Liu
Optimize the execution of RuntimeProgram by saving a bool that records whether the op is a feed/fetch op. (#2703) test=develop
-
- 28 Dec, 2019 1 commit
-
-
Committed by huzhiqiang
-
- 27 Dec, 2019 4 commits
-
-
Committed by 石晓伟
-
Committed by yiicy
-
Committed by hong19860320
-
Committed by huzhiqiang
remove test_models ci projects, because these projects have been removed from the ci test, test=develop (#2669)
-
- 26 Dec, 2019 3 commits
-
-
Committed by Wilber
* fix fluid-lite-subgraph x86 compile error
* Replace FLAGS with environment variables
-
Committed by xiaogang
* feat: add multi_thread ut
-
Committed by zhupengyang
test=develop
-
- 25 Dec, 2019 6 commits
-
-
Committed by juncaipeng
add clear for tensor
-
Committed by juncaipeng
* fix op inputs and outputs type, test=develop
-
Committed by Wilber
optimize softmax cuda kernel
-
Committed by juncaipeng
-
Committed by hong19860320
-
Committed by Yiqun Liu
* Remove GEMM padding in fc_compute. test=develop
* Write a common ParallelFor function to run the for loop in parallel.
* Add the GEMM padding code back in fc.
* Refine the code of fc when padding_weight is false to avoid defining a temporary Tensor.
* Refine the unit test of fc and add test cases for padding and parallel. test=develop
* Enable more test cases in the common fc unittest, including padding and parallel for the x86 target.
* Remove the fc test under kernels/x86. test=develop
* Disable relu in the fc test for non-x86 targets. test=develop
* Change the eps of arm. test=develop
-
- 24 Dec, 2019 8 commits
-
-
Committed by zhupengyang
-
Committed by hong19860320
-
Committed by huzhiqiang
-
Committed by hong19860320
* Support multiple types for XPU and NPU op bridges
* Add lookup_table, gather, slice, stack and scale op bridges for supporting BERT
* Fix the definition of lookup_table kernel for X86
-
Committed by yiicy
-
Committed by zhupengyang
test=develop
-
Committed by zhupengyang
test=develop
-
Committed by zhupengyang
* [XPU] elementwise_add unit test
* [XPU] softmax unit test test=develop
-
- 23 Dec, 2019 6 commits
-
-
Committed by HappyAngel
-
Committed by HappyAngel
* fix conv 2-pad to 4-pad
* fix compute conv shape
* fix pad, test=develop
* rename conv_depthwise_3x3s1_fp.cc to conv3x3s1p01_depthwise_fp32.cc to distinguish it from conv3x3s1_depthwise_fp32.cc
* delete printf note in conv3x3s1, test=develop
* delete printf note, test=develop
* delete gem_sdot.h, test=develop; it is copied from __gemm_sdot_meta_.h
* update compute padding, test=develop
* fix padding size, must be 2 or 4. test=develop
* fix format in operators/conv_op.cc, test=develop
* change #if 0 to #if 1, test=develop
* put 2-pad to 4-pad in AttachImpl, test=develop
* fix clang-format error in tests/math/conv_compute_test, test=develop
* fix x86 test result error, test=develop
* add asymmetric padding test case in lite/tests/math/conv_compute.cc, test=develop
* change paddings type to support dynamic modification, test=develop
* fix x86 build error in conv_compute_test, test=develop
* fix opencl build error, test=develop
* fix opencl build error, test=develop
* fix opencl/conv_compute build error, test=develop
* fix opencl/conv_compute build error, test=develop
* fix format in kernels/opencl/conv_compute_test, test=develop
* fix build error, test=develop; fix build error in kernels/x86/conv_compute.h
* fix compute shape error in operators/conv_op.h, test=develop
* add conv_relu6 and conv leaky_relu in conv_3x3s1_direct
* add conv_relu6 in c1, c2, c4, test=develop
* fix conflict in conv_block_utils.h, test=develop
* add relu6 and leaky_relu in conv_3x3s1_dw
* add conv_3x3s1px_dw relu6 and leaky_relu fusion, test=develop
* fix conflict in tests/math/conv_compute_arm, test=develop
* fix build error in winograd arm, test=develop
* change act_param to a pointer in conv_block_utils.h, test=develop
* fix winograd compute error when the 4 paddings are not equal, test=develop
* add conv relu6 and leaky_relu in conv_dw_3x3s2, test=develop
* fix format, test=develop
* fix format in conv_block_utils, test=develop
* move updatePadding from conv_op.cc to conv_op.h, test=develop
* fix format in conv_op.h, test=develop
* fix build error in conv_op.h, test=develop
* remove flag_relu parameter in conv_3x3_depthwise, test=develop
-
Committed by Wilber
add sequence_pool_concat fuse pass; add fuse kernel
-
Committed by yiicy
-
Committed by yiicy
-
Committed by liu zhengxi
-
- 21 Dec, 2019 1 commit
-
-
Committed by zhupengyang
test=develop
-
- 20 Dec, 2019 3 commits
-
-
Committed by zhupengyang
test=develop
-
Committed by Wilber
add var_conv_2d + relu fuse pass
-
Committed by zhupengyang
* [XPU] add transpose bridge and unit test test=develop
-
- 19 Dec, 2019 5 commits
-
-
Committed by 石晓伟
-
Committed by Wilber
* optimize content-dnn cuda kernel
-
Committed by xiaogang
* feature: update fpga kernel patch
-
Committed by TianXiaogang
-
Committed by yiicy
* [ARM] change global pooling choose kernel policy, test=develop
-