    Add yolo_box_cuda and multiclass_nms_host kernels. (#1908) · de43e479
    Committed by Wilber
    * add yolo_box_compute cuda
    
    * move multiclass_nms(arm) to host
    
    * add lod in scale op
    
    * add yolo_box_cuda cmake config
    
    * modify shuffle_channel_fuse and transpose_softmax_transpose_fuse to support running the SSD model. test=develop
    
    * reshape and transpose ops don't have an xshape output.
    
    * modify yolo_box_compute_cuda to use a tensor to manage CUDA memory. test=develop
    
    * add yolo_box use kernel test=develop
yolo_box_compute.cu 8.7 KB