• F
    add index initialization in the block loop for index_sample kernel when... · c6950ab2
    FlyingQianMM 提交于
    add index initialization in the block loop for index_sample kernel when dealing with a input tensor whose shape is larger than block_dim * grid_dim (#39736)
    
    * add block and grid loop for index_sample kernel to deal with a large-shape tensor
    
    * fix code format
    
    * limit grid dim
    
    * fix the omissive initialization of index_i in the second cycle for index_sample kernel
    
    * fix conflicts
    c6950ab2
index_sample_op.cu 9.7 KB