- 17 2月, 2020 1 次提交
-
-
由 GaoWei8 提交于
* Add reduce sum op test test=develop
-
- 24 1月, 2020 1 次提交
-
-
由 zhupengyang 提交于
-
- 16 1月, 2020 1 次提交
-
-
由 zhupengyang 提交于
-
- 15 1月, 2020 1 次提交
-
-
由 hong19860320 提交于
-
- 14 1月, 2020 3 次提交
-
-
由 HappyAngel 提交于
add gemm + relu6/leakyrelu fusion
-
由 zhupengyang 提交于
- enhance interp InferShape
-
由 myq406450149 提交于
* Support bitman backend
-
- 13 1月, 2020 2 次提交
-
-
由 zhupengyang 提交于
-
由 hong19860320 提交于
[LITE][NPU] Add instance_norm op bridge and unit test, refine the registration of op bridges (#2747)
-
- 10 1月, 2020 1 次提交
-
-
由 zhupengyang 提交于
-
- 09 1月, 2020 1 次提交
-
-
由 zhupengyang 提交于
-
- 08 1月, 2020 1 次提交
-
-
由 zhupengyang 提交于
* [NPU] enhance unittest for shuffle_channel, unsqueeze, pool test=develop
-
- 07 1月, 2020 1 次提交
-
-
由 zhupengyang 提交于
test=develop
-
- 03 1月, 2020 1 次提交
-
-
由 zhupengyang 提交于
test=develop
-
- 31 12月, 2019 3 次提交
-
-
由 Wilber 提交于
X86 and cuda compile simutaneously cmake .. -DCMAKE_BUILD_TYPE=RelWithDebInfo -DWITH_MKL=ON -DLITE_WITH_CUDA=ON -DWITH_MKLDNN=OFF -DLITE_WITH_X86=ON -DLITE_WITH_PROFILE=OFF -DWITH_LITE=OFF -DLITE_WITH_LIGHT_WEIGHT_FRAMEWORK=OFF -DWITH_PYTHON=OFF -DWITH_TESTING=ON -DLITE_WITH_ARM=OFF -DLITE_ON_TINY_PUBLISH=OFF -DCUDNN_ROOT=/usr/local/cudnn/ -DLITE_BUILD_EXTRA=ON (#2708) x86 and cuda compile simutaneously
-
由 zhupengyang 提交于
test=develop
-
由 hong19860320 提交于
* Fix the compiling error which occurs when specify the ddk_root path and build for huawei NPU. * Refine the registration of op bridges and make it similar to the registration of op and kernel. * Refine the interfaces of the graph and node for op bridges, and support creating constant and data node automatically according to the attribute 'persistable' of the target tensor. * Add the unit test of the scale and softmax op bridge for NPU.
-
- 26 12月, 2019 1 次提交
-
-
由 zhupengyang 提交于
test=develop
-
- 24 12月, 2019 3 次提交
-
-
由 hong19860320 提交于
* Support multiple types for XPU and NPU op bridges * Add lookup_table, gather, slice, stack and scale op bridges for supporting BERT * Fix the definition of lookup_table kernel for X86
-
由 zhupengyang 提交于
test=develop
-
由 zhupengyang 提交于
* [XPU] elementwise_add unit test * [XPU] softmax unit test test=develop
-
- 23 12月, 2019 1 次提交
-
-
由 yiicy 提交于
-
- 21 12月, 2019 1 次提交
-
-
由 zhupengyang 提交于
test=develop
-
- 20 12月, 2019 2 次提交
-
-
由 zhupengyang 提交于
test=develop
-
由 zhupengyang 提交于
* [XPU] add transpose bridge and unit test test=develop
-
- 13 12月, 2019 1 次提交
-
-
由 hong19860320 提交于
[LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at execution time (#2576)
-
- 10 12月, 2019 1 次提交
-
-
由 yiicy 提交于
-
- 07 12月, 2019 1 次提交
-
-
由 juncaipeng 提交于
* add arm split lod tensor, test=develop * add arm merge lod tensor, test=develop * update split merge lod tensor, test=develop * add reduce_prob op, test=develop * support mask_rcnn succeed, test=develop
-
- 16 11月, 2019 1 次提交
-
-
由 hong19860320 提交于
-
- 12 11月, 2019 1 次提交
-
-
由 juncaipeng 提交于
* update concat and unsqueeze, test=develop
-
- 07 11月, 2019 1 次提交
-
-
由 huzhiqiang 提交于
We have changed 11 arm_kernels into extra type in #2347 , which has caused test_compiling failure. In this PR , we move their 11 related arm_kernel_test into build_extra=ON
-
- 28 10月, 2019 1 次提交
-
-
由 hong19860320 提交于
* Initial support for XPU * Fix compiling errors of XPU * Move XPU op kernel bridges from backends to kernels to fix deps order * Change the namespace and directory of XPU bridges * Add XPU SDK * Fix header files and namespace of XPU SDK * Add unit tests for relu and conv2d ops * Restore the modification of paddle_api_test * Supports simple model which contains only a relu layer * Add compiling scripts for XPU * Fix compiling errors of XPU * Add comments for XPU LoadModel and BuildModel
-
- 18 9月, 2019 1 次提交
-
-
由 Xiaoyang LI 提交于
* fix gemm_int8, gemv-int8 and conv-int8 math function, add float bias * change conv impl * neon int8 kernel support float bias * arm compute kernel support float bias * add math_test target * add tensor utils for testing, fix sgemm ut error * add gemm_int8 unit test, support float bias * fix build script * add conv compute unit test for arm * fix build script, test=develop * fix fp32 dw conv3x3s1, test=develop * add fp32 dw conv3x3s1, test=develop * add armv7 fp32 dw conv3x3s1, test=develop * add fp32 depthwise conv3x3s2, test=develop * fix fp32 conv3x3 depthwise build error, test=develop * fix gemm_like conv trans weights error, test=develop * fix int8 depthwise conv3x3 error, test=develop * turn on all test for arm fp32 conv, test=develop * fix int8 conv1x1 error * fix int8 direct conv3x3s1 error, test=develop * fix int8 direct conv3x3s2, test=develop * turn on all test for arm int8 conv, test=develop * fix int8 fc error, change mobilenetv1-int8 ground-truth result to fluid, test=develop * remove debug info, strip ut binary, test=develop * fix conv compute error, test=develop * change Init() to ReInitWhenNeeded(), test=develop * fix code style, test=develop * remote engine_test, test=develop * fix building server tests error, test=develop * fix sdot clang build error, test=develop * fix sgemm ut timeout error, test=develop * fix clang build error, test=develop * turn off math basic test due to ci time out, test=develop * fix conv_int8 ut error, test=develop
-
- 12 9月, 2019 1 次提交
-
-
由 Wilber 提交于
* add unsqueeze and range op. modify concat op test=develop * modify exception in range_test_x86
-
- 09 9月, 2019 1 次提交
-
-
由 juncaipeng 提交于
* add assign_value op, arm kernel and test, add fluid_type, test=develop * add hard_sigmoid, test=develop
-
- 04 9月, 2019 1 次提交
-
-
由 Wilber 提交于
* modify slice op and add slice test * modify slice op bug
-
- 02 9月, 2019 1 次提交
-
-
由 juncaipeng 提交于
* add ops for faster rcnn * disable test for generate_proposals and roi_align, test=develop * remove .swp file * remove log in tensor slice * finish the unit test for roi_align, test=develop * add box_clip op and fix tensor slice bug * remove add four op twice * rewrite the implement for box_coder and sequence_expand, add faster_rcnn_test, test=develop * fix test bug of box_clip in x86 server, test=develop
-
- 29 8月, 2019 3 次提交
-
-
由 Wilber 提交于
* add yolo_box_compute cuda * move multiclass_nms(arm) to host * add lod in scale op * add yolo_box_cuda cmake config * modify shuffle_channel_fuse and transpose_softmax_transpose_fuse to support run ssd model. test=develop * reshape and transpose op don't have xshape output. * modify yolo_box_compute_cuda, use tensor to manage cuda memory test=develop * add yolo_box use kernel test=develop
-
由 liu zhengxi 提交于
-
由 juncaipeng 提交于
ad ops for faster rcnn, including affine_channel, anchor_generator, generate_proposals and roi_align (#1895) * add ops for faster rcnn * disable test for generate_proposals and roi_align, test=develop * remove .swp file * remove log in tensor slice * finish the unit test for roi_align, test=develop
-