    Optimize nearest_interp forward (#38528) · 232bbce2
    Lijunhui committed
    * init commit
    
    * remove comments
    
    * remove nchw branch
    
    * optimize code
    
    * apply fast div mod in 1D kernel, rm 3D kernel
    
    * move init of FastDivMod to CPU
    
    * 3D kernel for nchw, FastDiv for 1D kernel
    
    * debug done. process boundary
    
    * 2^n
    
    * optimize
    
    * optimize
    
    * change code & optimize code
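The "fast div mod" mentioned in the commit message refers to replacing integer division by a loop-invariant divisor with a multiply and a shift. The block below is only a minimal host-side sketch of that general technique, not the code from this commit: the name `FastDivModSketch` and its members are hypothetical, and the intermediate sum is widened to 64 bits so the result holds for every 32-bit dividend.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical illustration of division by an invariant divisor via
// multiply-and-shift. Not the FastDivMod type used in the repository.
struct FastDivModSketch {
  uint32_t divisor;
  uint32_t multiplier;
  uint32_t shift;

  // The magic numbers are computed once, on the host.
  explicit FastDivModSketch(uint32_t d) : divisor(d), multiplier(0), shift(0) {
    assert(d != 0);
    // shift = ceil(log2(d)), i.e. the smallest s with 2^s >= d.
    while (shift < 32 && (uint64_t{1} << shift) < d) {
      ++shift;
    }
    const uint64_t one = 1;
    // multiplier = floor(2^32 * (2^shift - d) / d) + 1, which fits in 32 bits.
    multiplier = static_cast<uint32_t>(
        ((one << 32) * ((one << shift) - d)) / d + 1);
  }

  // Quotient n / divisor without a hardware divide.
  uint32_t Div(uint32_t n) const {
    // High 32 bits of n * multiplier (device code could use __umulhi here).
    const uint64_t hi = (static_cast<uint64_t>(n) * multiplier) >> 32;
    // The sum is kept in 64 bits so it cannot overflow for large n.
    return static_cast<uint32_t>((hi + n) >> shift);
  }

  // Remainder n % divisor, derived from the quotient.
  uint32_t Mod(uint32_t n) const { return n - Div(n) * divisor; }
};
```

In a kernel, an object like this would typically be built on the host and passed by value, so that each thread pays only for a multiply, an add, and a shift when decomposing a flat index. That matches the intent of "move init of FastDivMode to CPU", though the actual layout in the commit may differ.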
gpu_launch_config.h 5.8 KB