- 20 12月, 2019 1 次提交
-
-
由 zhupengyang 提交于
* [XPU] add transpose bridge and unit test test=develop
-
- 13 12月, 2019 1 次提交
-
-
由 hong19860320 提交于
[LITE][NPU][XPU] Refine subgraph pass, and support NPU/XPU model generation at execution time (#2576)
-
- 10 12月, 2019 1 次提交
-
-
由 yiicy 提交于
-
- 07 12月, 2019 1 次提交
-
-
由 juncaipeng 提交于
* add arm split lod tensor, test=develop * add arm merge lod tensor, test=develop * update split merge lod tensor, test=develop * add reduce_prob op, test=develop * support mask_rcnn succeed, test=develop
-
- 16 11月, 2019 1 次提交
-
-
由 hong19860320 提交于
-
- 12 11月, 2019 1 次提交
-
-
由 juncaipeng 提交于
* update concat and unsqueeze, test=develop
-
- 07 11月, 2019 1 次提交
-
-
由 huzhiqiang 提交于
We have changed 11 arm_kernels into extra type in #2347 , which has caused test_compiling failure. In this PR , we move their 11 related arm_kernel_test into build_extra=ON
-
- 28 10月, 2019 1 次提交
-
-
由 hong19860320 提交于
* Initial support for XPU * Fix compiling errors of XPU * Move XPU op kernel bridges from backends to kernels to fix deps order * Change the namespace and directory of XPU bridges * Add XPU SDK * Fix header files and namespace of XPU SDK * Add unit tests for relu and conv2d ops * Restore the modification of paddle_api_test * Supports simple model which contains only a relu layer * Add compiling scripts for XPU * Fix compiling errors of XPU * Add comments for XPU LoadModel and BuildModel
-
- 18 9月, 2019 1 次提交
-
-
由 Xiaoyang LI 提交于
* fix gemm_int8, gemv-int8 and conv-int8 math function, add float bias * change conv impl * neon int8 kernel support float bias * arm compute kernel support float bias * add math_test target * add tensor utils for testing, fix sgemm ut error * add gemm_int8 unit test, support float bias * fix build script * add conv compute unit test for arm * fix build script, test=develop * fix fp32 dw conv3x3s1, test=develop * add fp32 dw conv3x3s1, test=develop * add armv7 fp32 dw conv3x3s1, test=develop * add fp32 depthwise conv3x3s2, test=develop * fix fp32 conv3x3 depthwise build error, test=develop * fix gemm_like conv trans weights error, test=develop * fix int8 depthwise conv3x3 error, test=develop * turn on all test for arm fp32 conv, test=develop * fix int8 conv1x1 error * fix int8 direct conv3x3s1 error, test=develop * fix int8 direct conv3x3s2, test=develop * turn on all test for arm int8 conv, test=develop * fix int8 fc error, change mobilenetv1-int8 ground-truth result to fluid, test=develop * remove debug info, strip ut binary, test=develop * fix conv compute error, test=develop * change Init() to ReInitWhenNeeded(), test=develop * fix code style, test=develop * remote engine_test, test=develop * fix building server tests error, test=develop * fix sdot clang build error, test=develop * fix sgemm ut timeout error, test=develop * fix clang build error, test=develop * turn off math basic test due to ci time out, test=develop * fix conv_int8 ut error, test=develop
-
- 12 9月, 2019 1 次提交
-
-
由 Wilber 提交于
* add unsqueeze and range op. modify concat op test=develop * modify exception in range_test_x86
-
- 09 9月, 2019 1 次提交
-
-
由 juncaipeng 提交于
* add assign_value op, arm kernel and test, add fluid_type, test=develop * add hard_sigmoid, test=develop
-
- 04 9月, 2019 1 次提交
-
-
由 Wilber 提交于
* modify slice op and add slice test * modify slice op bug
-
- 02 9月, 2019 1 次提交
-
-
由 juncaipeng 提交于
* add ops for faster rcnn * disable test for generate_proposals and roi_align, test=develop * remove .swp file * remove log in tensor slice * finish the unit test for roi_align, test=develop * add box_clip op and fix tensor slice bug * remove add four op twice * rewrite the implement for box_coder and sequence_expand, add faster_rcnn_test, test=develop * fix test bug of box_clip in x86 server, test=develop
-
- 29 8月, 2019 3 次提交
-
-
由 Wilber 提交于
* add yolo_box_compute cuda * move multiclass_nms(arm) to host * add lod in scale op * add yolo_box_cuda cmake config * modify shuffle_channel_fuse and transpose_softmax_transpose_fuse to support run ssd model. test=develop * reshape and transpose op don't have xshape output. * modify yolo_box_compute_cuda, use tensor to manage cuda memory test=develop * add yolo_box use kernel test=develop
-
由 liu zhengxi 提交于
-
由 juncaipeng 提交于
ad ops for faster rcnn, including affine_channel, anchor_generator, generate_proposals and roi_align (#1895) * add ops for faster rcnn * disable test for generate_proposals and roi_align, test=develop * remove .swp file * remove log in tensor slice * finish the unit test for roi_align, test=develop
-
- 28 8月, 2019 2 次提交
-
-
由 juncaipeng 提交于
* modify cast op, test=develop * modify cast op and remove warning in argmax_test, test=develop
-
由 huzhiqiang 提交于
-
- 24 8月, 2019 1 次提交
-
-
由 Yan Chunwei 提交于
-
- 22 8月, 2019 1 次提交
-
-
由 Yan Chunwei 提交于
-
- 16 8月, 2019 1 次提交
-
-
由 Yan Chunwei 提交于
-