1. 30 3月, 2016 11 次提交
    • D
      Fix arrow marker for thick reference edges. · e3a0d6fb
      Dan Smilkov 提交于
      - Added several fixed-sized arrow markers for reference edges (small, medium, large, xlarge)
      - Added a method to shorten the path enough such that the tip of the start/end marker will
        point to the start/end of the path. It needs an arbitrary marker.
      - Updated the legend to reflect that only ref edges have an arrow.
      - Removed unused css related to this.
      Change: 118530630
      e3a0d6fb
    • V
      TensorFlow: disable a flaky test in gpu_allocator_retry_test · 972d7327
      Vijay Vasudevan 提交于
      Change: 118522346
      972d7327
    • J
      Better bounds checking for the segment reduction ops. · 71ff7c84
      Josh Levenberg 提交于
      Also fix some warnings about unsafe conversions.
      Change: 118519927
      71ff7c84
    • A
      In assertArrayNear() check that the arrays have the same length. · 4acc2dad
      A. Unique TensorFlower 提交于
      Add a test that we actually check for array length (we were not.)
      Change: 118518452
      4acc2dad
    • A
      Optimized DepthwiseConvBackpropInputOp for CPU. · 9d9ad03c
      A. Unique TensorFlower 提交于
      // OLD
      Benchmark                             Time(ns)    CPU(ns) Iterations
      --------------------------------------------------------------------
      BM_ConvFloatDepthwiseBkInCPU1_conv0  207770233  207338129        100  796.0M items/s 32_112_112_3_8_24_3_3_1_2_cpu1
      BM_ConvFloatDepthwiseBkInCPU1_conv1  715403538  713939287        100  616.4M items/s 32_112_112_64_1_64_3_3_1_2_cpu1
      BM_ConvFloatDepthwiseBkInCPU1_conv2  357349749  356594057        100  617.0M items/s 32_56_56_128_1_128_3_3_1_2_cpu1
      BM_ConvFloatDepthwiseBkInCPU1_conv3  274697435  274160117        100  802.7M items/s 32_56_56_128_1_128_3_3_2_2_cpu1
      BM_ConvFloatDepthwiseBkInCPU1_conv4   87072020   86874244        100  633.1M items/s 32_28_28_128_1_128_3_3_1_2_cpu1
      BM_ConvFloatDepthwiseBkInCPU1_conv5   87172482   86948501        100  632.4M items/s 32_14_14_512_1_512_3_3_1_2_cpu1
      BM_ConvFloatDepthwiseBkInCPU1_conv6   46763611   46620163        100  589.4M items/s 32_7_7_1024_1_1024_3_3_1_2_cpu1
      
      // NEW 1-thread
      Benchmark                             Time(ns)    CPU(ns) Iterations
      --------------------------------------------------------------------
      BM_ConvFloatDepthwiseBkInCPU1_conv0   60173061   59839526        100  2.7G items/s 32_112_112_3_8_24_3_3_1_2_cpu1
      BM_ConvFloatDepthwiseBkInCPU1_conv1   99396102   99143542        100  4.3G items/s 32_112_112_64_1_64_3_3_1_2_cpu1
      BM_ConvFloatDepthwiseBkInCPU1_conv2   39376616   39226953        100  5.5G items/s 32_56_56_128_1_128_3_3_1_2_cpu1
      BM_ConvFloatDepthwiseBkInCPU1_conv3   35987577   35843443        100  6.0G items/s 32_56_56_128_1_128_3_3_2_2_cpu1
      BM_ConvFloatDepthwiseBkInCPU1_conv4    9665813    9600518        100  5.6G items/s 32_28_28_128_1_128_3_3_1_2_cpu1
      BM_ConvFloatDepthwiseBkInCPU1_conv5   12498989   12427035        100  4.3G items/s 32_14_14_512_1_512_3_3_1_2_cpu1
      BM_ConvFloatDepthwiseBkInCPU1_conv6    8459759    8397047        100  3.2G items/s 32_7_7_1024_1_1024_3_3_1_2_cpu1
      
      // NEW 4-threads
      Benchmark                             Time(ns)    CPU(ns) Iterations
      --------------------------------------------------------------------
      BM_ConvFloatDepthwiseBkInCPU4_conv0   30696635  101663830        100  5.3G items/s 32_112_112_3_8_24_3_3_1_2_cpu4
      BM_ConvFloatDepthwiseBkInCPU4_conv1   68884630  198616710        100  6.3G items/s 32_112_112_64_1_64_3_3_1_2_cpu4
      BM_ConvFloatDepthwiseBkInCPU4_conv2   16948037   50360587        100  12.7G items/s 32_56_56_128_1_128_3_3_1_2_cpu4
      BM_ConvFloatDepthwiseBkInCPU4_conv3   15834408   46873689        100  13.6G items/s 32_56_56_128_1_128_3_3_2_2_cpu4
      BM_ConvFloatDepthwiseBkInCPU4_conv4    3904734   11659079        167  13.8G items/s 32_28_28_128_1_128_3_3_1_2_cpu4
      BM_ConvFloatDepthwiseBkInCPU4_conv5    3482083   12555105        188  15.5G items/s 32_14_14_512_1_512_3_3_1_2_cpu4
      BM_ConvFloatDepthwiseBkInCPU4_conv6    2330680    8593020        281  11.5G items/s 32_7_7_1024_1_1024_3_3_1_2_cpu4
      Change: 118514706
      9d9ad03c
    • V
      Make TensorFlow transpose_op_test only take 2 seconds to run · 62e0d8e1
      Vijay Vasudevan 提交于
      instead of 48 (on my machine).
      Change: 118512022
      62e0d8e1
    • B
      Leverage index list to further speedup the computation of the convolution · 5a130818
      Benoit Steiner 提交于
      gradients by 1 to 10% depending on the size of the convolution kernel.
      Change: 118505660
      5a130818
    • F
      Add link to the inception model serving tutorial. · 89aab41d
      Fangwei Li 提交于
      Change: 118497433
      89aab41d
    • A
      First pass at enabling non-square strides for convolutions: · 6a073b39
      A. Unique TensorFlower 提交于
      - Change the stride, and in_stride template arguments of Eigen
        SpatialConvolution, SpatialConvolutionBackwardInput, and
        SpatialConvolutionBackwardKernel to row_stride, col_stride, row_in_stride,
        col_in_stride.
      
      - Change tensorflow kernels to pass the additional stride parameters.
      
      - Rationalize the place where we swap row/col: swap just before calling Eigen.
      
      This just enables the plumbing. Non-square strides are still forbidden in the ops.
      Change: 118484322
      6a073b39
    • A
      Parse uploaded pbtxt in chunks to allow for loading of large pbtxt files, this... · 622daf83
      A. Unique TensorFlower 提交于
      Parse uploaded pbtxt in chunks to allow for loading of large pbtxt files, this only affects files uploaded with the file chooser menu not XHR requests.
      Change: 118479897
      622daf83
    • D
      Fixing some implicit int64->32 downcast errors. (In most cases, by · dff1e630
      David G. Andersen 提交于
      having the ops explicitly return a failure that they can't handle
      overly-large inputs).  Most of these should never affect correct
      tf programs until people get a lot more memory in their machines.
      Change: 118476613
      dff1e630
  2. 29 3月, 2016 29 次提交