- 29 Aug, 2019 11 commits
-
-
Committed by tensor-tang
-
Committed by tensor-tang
-
Committed by Wilber
* add yolo_box_compute cuda
* move multiclass_nms(arm) to host
* add lod in scale op
* add yolo_box_cuda cmake config
* modify shuffle_channel_fuse and transpose_softmax_transpose_fuse to support running the ssd model. test=develop
* reshape and transpose op don't have xshape output.
* modify yolo_box_compute_cuda, use tensor to manage cuda memory test=develop
* add yolo_box use kernel test=develop
-
Committed by sangoly
-
Committed by Yanzhan Yang
* refine toolchain test=develop
* fix wrap compilation error
* fix yolov3 armv8 compilation test=develop
* revert to armv7 as default test=develop
* fix fpga compilation test=develop
-
Committed by liu zhengxi
-
Committed by Zhaolong Xing
* paddle lite cuda init can run model with leaky_relu
* add the missing file. test=develop
* add the load from memory interface. test=develop
* refine this pr. fix comments fix ci error test=develop
-
Committed by sangoly
-
Committed by tensor-tang
test=develop
-
Committed by juncaipeng
add ops for faster rcnn, including affine_channel, anchor_generator, generate_proposals and roi_align (#1895)
* add ops for faster rcnn
* disable test for generate_proposals and roi_align, test=develop
* remove .swp file
* remove log in tensor slice
* finish the unit test for roi_align, test=develop
-
Committed by tensor-tang
* add npu script and tester
* fix npu armv7 so and refine tests test=develop
* update fix and refine log test=develop
* refine npu generate api
* refine npu subgraph
* refine npu gen and clean code
* fix model load
* refine node2rm in subgraph
* refine the build npu functions test=develop
-
- 28 Aug, 2019 14 commits
-
-
Committed by juncaipeng
* modify cast op, test=develop
* modify cast op and remove warning in argmax_test, test=develop
-
Committed by Yanzhan Yang
-
Committed by Yanzhan Yang
-
Committed by Yan Chunwei
-
Committed by zhupengyang
test=developt branch
-
Committed by Yanzhan Yang
-
Committed by huzhiqiang
-
Committed by zhupengyang
* add transpose-softmax-transpose fuse pass test=develop
* enable supported lite-npu ops test=develop
-
Committed by huzhiqiang
add x86 math: sequence_scale, sequence_padding, sequence2batch, sequence_pooling. test=develop (#1884)
-
Committed by hong19860320
* [NPU] fix conv2d npu bridge, supports bias from input map test=develop
* [NPU] support more dimensions for the bias of conv2d NPU bridge test=develop
-
Committed by sangoly
-
Committed by sangoly
-
Committed by Huie
2. add flatten2 for gpu.
3. add concat 4 inputs size for gpu.
4. fix pool.
5. fix transpose2 test=develop
-
Committed by zp7
-
- 27 Aug, 2019 5 commits
-
-
Committed by Zhaolong Xing
* paddle lite cuda init can run model with leaky_relu
* add the missing file. test=develop
-
Committed by zp7
* [test=develop] 1. fix crash when gpu op scale & elementwise_add input dim size equals 2; 2. add gpu op mul
* fix code style test=develop
-
Committed by tensor-tang
* add npu script and tester
* fix npu armv7 so and refine tests test=develop
* update fix and refine log test=develop
-
Committed by Yanzhan Yang
-
Committed by Yan Chunwei
-
- 26 Aug, 2019 7 commits
-
-
Committed by Jiaying Zhao
* Catch mobile exceptions test=develop
* code style format test=develop
-
Committed by sangoly
-
Committed by juncaipeng
-
Committed by juncaipeng
-
Committed by sangoly
-
Committed by Yan Chunwei
-
- 25 Aug, 2019 2 commits
-
-
Committed by Yan Chunwei
-
Committed by juncaipeng
-
- 24 Aug, 2019 1 commit
-
-
Committed by Yanzhan Yang
* enable fast test compilation && push opencl kernels by run.py
* merge cl kernels into so
* restore pre-commit
-