1. 22 Jul, 2020 1 commit
    • [Core] Add the graph optimization of subblocks for transformer model (#3947) · 7af1a258
      hong19860320 committed
      * [Core][ARM] Fix beam_search, eltwise_mul supports broadcast and int64_t data type, add print op and kernel, add exception
      test=develop
      
      * Fix the dims of the parent idx in the ARM kernel of the beam_search op
      
      * elementwise_mul supports int64_t data type with broadcasting
      
      * Add print op and kernel for debugging
      
      * Support throwing an exception when an internal error occurs
      
      * Refine while and conditional_block op kernel
      
      * Support the graph optimization on subblocks
      
      * Pass program_desc and block_idx into the kernels of the control flow ops (while/conditional_block/subgraph) and create the RuntimeProgram online, which makes it possible to call the control flow ops recursively
      
      * Add unit test for masked transformer model
  2. 14 Jul, 2020 1 commit
  3. 16 Jun, 2020 1 commit
  4. 15 Jun, 2020 1 commit
  5. 11 Jun, 2020 1 commit
  6. 18 May, 2020 1 commit
  7. 09 May, 2020 1 commit
  8. 20 Apr, 2020 1 commit
  9. 17 Apr, 2020 1 commit
  10. 16 Apr, 2020 2 commits
  11. 13 Apr, 2020 1 commit
  12. 08 Apr, 2020 1 commit
  13. 02 Apr, 2020 1 commit
  14. 16 Mar, 2020 1 commit
  15. 08 Nov, 2019 1 commit
    • Move multi class kernel back to basic (#2396) · 52e0db46
      huzhiqiang committed
      * move multiclass_nms kernel back to host test=develop
      
      * move layer_norm OP and arm_kernel into extra type since they were added after release/v2.0-beta1 and are not related to CV test=develop
      
      * fix code_style test=develop
  16. 04 Nov, 2019 1 commit
  17. 11 Sep, 2019 1 commit
  18. 03 Sep, 2019 1 commit
    • rewrite multiclass_nms according to fluid, test=develop (#1945) · deaddf9d
      juncaipeng committed
      * add ops for faster rcnn
      
      * disable test for generate_proposals and roi_align, test=develop
      
      * remove .swp file
      
      * remove log in tensor slice
      
      * finish the unit test for roi_align, test=develop
      
      * add box_clip op and fix tensor slice bug
      
      * remove the duplicate addition of four ops
      
      * rewrite the implementation of box_coder and sequence_expand, add faster_rcnn_test, test=develop
      
      * fix test bug of box_clip in x86 server, test=develop
      
      * rewrite multiclass_nms according to fluid, test=develop
      
      * fix param load bug in box_coder and multiclass_nms op, test=develop
      
      * fix value transfer error in multiclass_nms, test=develop
  19. 29 Aug, 2019 1 commit
    • Add yolo_box_cuda multiclass_nms_host kernel. (#1908) · de43e479
      Wilber committed
      * add yolo_box_compute cuda
      
      * move multiclass_nms(arm) to host
      
      * add lod in scale op
      
      * add yolo_box_cuda cmake config
      
      * modify shuffle_channel_fuse and transpose_softmax_transpose_fuse to support running the SSD model. test=develop
      
      * reshape and transpose op don't have xshape output.
      
      * modify yolo_box_compute_cuda, use tensor to manage cuda memory test=develop
      
      * add yolo_box use kernel test=develop
  20. 24 Aug, 2019 1 commit
  21. 16 Aug, 2019 1 commit