1. 23 7月, 2020 1 次提交
  2. 22 7月, 2020 2 次提交
    • H
      [Core] Add the graph optimization of subblocks for transformer model (#3947) · 7af1a258
      hong19860320 提交于
      * [Core][ARM] Fix beam_search, eltwise_mul supports broadcast and int64_t data type, add print op and kernel, add exeception
      test=develop
      
      * Fix the dims of parent idx of the arm kernel of beam_search op
      
      * elementwise_mul supports int64_t data type with broadcasting
      
      * Add print op and kernel for debugging
      
      * Support throwing the exception when the internal error occurs
      
      * Refine while and conditional_block op kernel
      
      * Support the graph optimization on subblocks
      
      * Pass program_desc and block_idx into the kernel of the control flow ops(while/conditional_block/subgraph), and create the RuntimeProgram online, it make it possiable to call the control flow ops recursively
      
      *Add unit test for masked transformer model
      7af1a258
    • Y
      [OPENCL] enable use_tune by default (#3968) · c572523f
      ysh329 提交于
      * enable opencl kernel tune by default. test=develop
      c572523f
  3. 21 7月, 2020 1 次提交
    • C
      [LITE][XPU] Support mmdnn3.0-ras (a.k.a. crmm-0608) (#3950) · 89a0ecd1
      Cwndmiao 提交于
      * fix typo
      
      * [LITE][XPU] accomodate crmm(variant 20200608)
      
      * refine lite/tests/api/test_mmdnn_lite_xpu.cc
      
      * more comments, test=develop test=xpu
      
      * bugfix in crmm pattern match
      
      * pr comments, test=develop test=xpu
      
      * add XPU_CALL and retval check, test=develop test=xpu
      89a0ecd1
  4. 20 7月, 2020 1 次提交
  5. 16 7月, 2020 5 次提交
  6. 14 7月, 2020 3 次提交
  7. 13 7月, 2020 2 次提交
  8. 10 7月, 2020 2 次提交
  9. 09 7月, 2020 3 次提交
  10. 08 7月, 2020 3 次提交
  11. 07 7月, 2020 2 次提交
  12. 06 7月, 2020 1 次提交
  13. 03 7月, 2020 2 次提交
  14. 02 7月, 2020 1 次提交
  15. 01 7月, 2020 3 次提交
  16. 30 6月, 2020 3 次提交
  17. 29 6月, 2020 1 次提交
  18. 26 6月, 2020 1 次提交
  19. 23 6月, 2020 1 次提交
  20. 22 6月, 2020 2 次提交
    • S
      optimize register mechanism (#3745) · db7639ca
      Shibo Tao 提交于
      * refactor register mechanism, current so size: 1.20MB. test=develop
      
      * fix KernelRegistry::Global().Create. test=develop
      
      * fix cpplint errors. test=develop
      
      * fix test_subgraph_pass bug. test=develop
      
      * register kernel with target,precision,datalayout combination. test=develop
      
      * fix test_paddle_api no op found bug. test=develop
      
      * enhance comment
      
      * fix lite/kernels/arm/elementwise_compute_test.cc. test=develop
      
      * fix code style
      
      * revert format of unchanged files. test=develop
      
      * fix code format according to cpplint 1.5.1. test=develop
      
      * remove redundant include header. test=develop
      db7639ca
    • D
      732bb91b