• H
    [Core] Add the graph optimization of subblocks for transformer model (#3947) · 7af1a258
    hong19860320 提交于
    * [Core][ARM] Fix beam_search, eltwise_mul supports broadcast and int64_t data type, add print op and kernel, add exeception
    test=develop
    
    * Fix the dims of parent idx of the arm kernel of beam_search op
    
    * elementwise_mul supports int64_t data type with broadcasting
    
    * Add print op and kernel for debugging
    
    * Support throwing the exception when the internal error occurs
    
    * Refine while and conditional_block op kernel
    
    * Support the graph optimization on subblocks
    
    * Pass program_desc and block_idx into the kernel of the control flow ops(while/conditional_block/subgraph), and create the RuntimeProgram online, it make it possiable to call the control flow ops recursively
    
    *Add unit test for masked transformer model
    7af1a258
configure.cmake 5.1 KB