    [Cherry-pick][Core] Add the graph optimization of subblocks for transformer model (#3947) (#3979) · c56bf0d8
    hong19860320 committed
    * [Cherry-pick][Core] Add the graph optimization of subblocks for transformer model (#3947)
    test=develop
    * [Core][ARM] Fix beam_search, eltwise_mul supports broadcast and int64_t data type, add print op and kernel, add exception
    test=develop
    
    * Fix the dims of parent idx of the arm kernel of beam_search op
    
    * elementwise_mul supports int64_t data type with broadcasting
    
    * Add print op and kernel for debugging
    
    * Support throwing an exception when an internal error occurs
    
    * Refine while and conditional_block op kernel
    
    * Support the graph optimization on subblocks
    
    * Pass program_desc and block_idx into the kernels of the control flow ops (while/conditional_block/subgraph) and create the RuntimeProgram online, which makes it possible to call the control flow ops recursively
    
    * Add unit test for masked transformer model
android.cmake 4.2 KB