- 31 12月, 2019 1 次提交
-
-
由 hong19860320 提交于
* Fix the compiling error which occurs when specify the ddk_root path and build for huawei NPU. * Refine the registration of op bridges and make it similar to the registration of op and kernel. * Refine the interfaces of the graph and node for op bridges, and support creating constant and data node automatically according to the attribute 'persistable' of the target tensor. * Add the unit test of the scale and softmax op bridge for NPU.
-
- 30 12月, 2019 1 次提交
-
-
由 juncaipeng 提交于
* fix yolov3 bug when run several times, test=develop
-
- 28 12月, 2019 1 次提交
-
-
由 huzhiqiang 提交于
-
- 27 12月, 2019 1 次提交
-
-
由 hong19860320 提交于
-
- 26 12月, 2019 2 次提交
-
-
由 Wilber 提交于
-fix fluid-lite-subgraph x86 compile error - Replace FLAGS with environment variables
-
由 zhupengyang 提交于
test=develop
-
- 25 12月, 2019 5 次提交
-
-
由 juncaipeng 提交于
add clear for tensor
-
由 juncaipeng 提交于
* fix op inputs and outputs type, test=develop
-
由 Wilber 提交于
optimize softmax cuda kernel
-
由 hong19860320 提交于
-
由 Yiqun Liu 提交于
* Remove GEMM padding in fc_compute. test=develop * Write a common ParallelFor function to run the for loop in parallel. * Add the codes of padding GEMM back in fc. * Refine the code of fc when padding_weight is false to avoid the definition of temporary Tensor. * Refine the unit test of fc and add testing case of padding and parallel. test=develop * Enable more test cases in common fc unittest, including padding and parallel for x86 target. * Remove the fc test under kernels/x86. test=develop * Disable relu in test of fc for non-x86 target. test=develop * Change the eps of arm. test=develop
-
- 24 12月, 2019 5 次提交
-
-
由 zhupengyang 提交于
-
由 hong19860320 提交于
-
由 hong19860320 提交于
* Support multiple types for XPU and NPU op bridges * Add lookup_table, gather, slice, stack and scale op bridges for supporting BERT * Fix the definition of lookup_table kernel for X86
-
由 yiicy 提交于
-
由 zhupengyang 提交于
test=develop
-
- 23 12月, 2019 2 次提交
- 21 12月, 2019 1 次提交
-
-
由 zhupengyang 提交于
test=develop
-
- 20 12月, 2019 3 次提交
-
-
由 zhupengyang 提交于
test=develop
-
由 Wilber 提交于
add var_conv_2d + relu fuse pass
-
由 zhupengyang 提交于
* [XPU] add transpose bridge and unit test test=develop
-
- 19 12月, 2019 3 次提交
- 18 12月, 2019 1 次提交
-
-
由 juncaipeng 提交于
* Support Mask RCNN2 (#2588)
-
- 17 12月, 2019 3 次提交
-
-
由 HappyAngel 提交于
-
由 xiebaiyuan 提交于
* add reshape opencl kernel && optimise conv 1x1 ,test=develop * add reshape opencl kernel && optimise conv 1x1 &&code style ,test=develop * add reshape opencl kernel && optimise conv 1x1 &&code style ,test=develop
-
由 Yuan Shuai 提交于
-
- 16 12月, 2019 2 次提交
-
-
由 Yuan Shuai 提交于
[LITE][OPENCL] Add relu image2d kernel unit test, Fix conv2d_1x1, relu, layout using new Image2D Layout (#2564) * add 3 layout for opencl image. test=develop * add relu image2d test. test=develop
-
由 Jiaying Zhao 提交于
* [LITE][OPENCL] Add depthwise_conv_3x3 opencl kernel * [LITE][OPENCL] Add depthwise_conv_3x3 opencl kernel. test=develop * [LITE][OPENCL] Add Pool opencl kernel. test=develop
-
- 15 12月, 2019 1 次提交
-
-
由 Wilber 提交于
optimize search_grnn
-
- 13 12月, 2019 1 次提交
-
-
由 hong19860320 提交于
[LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at execution time (#2576)
-
- 12 12月, 2019 1 次提交
-
-
由 xiebaiyuan 提交于
* add opencl conv1x1 image impl and unit test pass with relu & bias, add layout_compute --> buffer2image float32 --> with unit test pass suite checked test for more situation , test=develop * add opencl conv1x1 image impl and unit test pass with relu & bias, add layout_compute --> buffer2image float32 --> with unit test pass suite checked test for more situation , test=develop * fix white space cpp lint , test=develop
-
- 11 12月, 2019 1 次提交
-
-
由 TianXiaogang 提交于
-
- 10 12月, 2019 3 次提交
-
-
由 huzhiqiang 提交于
-
由 yiicy 提交于
-
由 zhupengyang 提交于
test=develop
-
- 09 12月, 2019 1 次提交
-
-
由 zhupengyang 提交于
test=develop
-
- 08 12月, 2019 1 次提交
-
-
由 liu zhengxi 提交于
-