1. 14 6月, 2018 5 次提交
    • Q
      Fix NCCLBcast hang up bug in Parallel Executor (#11377) · 046bb5c8
      Qiyang Min 提交于
      * 1. Create buddy allocator in each places before NcclBcast the variables
      2. Check the memory usage of ALL gpus rather than the first one
      
      * 1. Make NCCLGroupGuard guards only the ncclBcast part, which avoid ncclGroupEnd blocking the exception throwing
      2. NOTE the usage of NCCLGroupGuard
      
      * Remove the memory usage check of gpus
      
      * Fix code style
      046bb5c8
    • W
      Add mean IOU op. (#10519) · 6fcdb240
      whs 提交于
      * Add mean_iou op.
      
      * Add unitest for mean iou op.
      
      * Add optional collections of confusion matrix and mean_iou.
      
      * Fix cuda kernel.
      
      * Refine code.
      1. Merge computing in GPU to two kernel.
      2. Use wrong array and correct array instead of confusion matrix.
      
      * Add python api and fix cuda kernel.
      
      * Fix comments.
      
      * Small fix.
      
      * Small fix.
      6fcdb240
    • Q
      490a07f5
    • X
      Remove cuptiFinalize. · d2afd210
      Xin Pan 提交于
      In cupti samples, only cuptiFlush is used.
      I can't find any places calling cuptiFinalize and
      this API can error out as not_implemented in some
      cuda installation.
      d2afd210
    • Y
      Dynamic Graph first prototype (#11415) · d827c6e8
      Yang Yang(Tony) 提交于
      d827c6e8
  2. 13 6月, 2018 7 次提交
  3. 12 6月, 2018 8 次提交
  4. 11 6月, 2018 20 次提交