1. 16 1月, 2019 1 次提交
    • Y
      Optimize while_op for test (#14764) · 568cc2ff
      Yiqun Liu 提交于
      * Simplify the compare op for CPU.
      
      * Use asynchronous tensor copy in reshape_op's kernel.
      
      * Optimize while_op for test, avoiding creating variables every time.
      test=develop
      
      * Enable the cache of kernel type and kernel function.
      test=develop
      
      * Enable profiling with gperftools.
      
      * Remove flags for testing, and fix the linking error.
      test=develop
      
      * Delete the codes of ChooseKernel.
      test=develop
      
      * Fix bug when preparing ExecutorPrepareContext for while_op.
      
      * Fix missing depending on grpc libraries.
      
      * Remove the redundant print.
      test=develop
      
      * Follow comments.
      
      * Remove the codes related to prepare the ExecutorPrepareContext for while_op.
      test=develop
      568cc2ff
  2. 15 1月, 2019 2 次提交
  3. 14 1月, 2019 4 次提交
  4. 13 1月, 2019 6 次提交
  5. 12 1月, 2019 1 次提交
  6. 11 1月, 2019 3 次提交
    • M
      Fix expand op compile time bug · bc3e0d6e
      minqiyang 提交于
      test=develop
      bc3e0d6e
    • C
      Revert "Remove workspace_handle in conv_cudnn (#15186)" · 358e657f
      chengduozh 提交于
      test=develop
      This reverts commit 064512aa.
      358e657f
    • C
      Remove workspace_handle in conv_cudnn (#15186) · 064512aa
      chengduo 提交于
      * remove workspace_handle in conv2d_cudnn
      test=develop
      
      * remove workspace_handle
      test=develop
      
      * fix bug
      test=develop
      
      * make test_conv2d_op SERIAL
      test=develop
      
      * save memory in conv_cudnn
      test=develop
      
      * enhance thread safety
      test=develop
      
      * enhance temporary allocator
      test=develop
      
      * Add excess fraction
      test=develop
      
      * follow comments
      test=develop
      
      * fix bug and code refine
      test=develop
      
      * fix memory size check
      test=develop
      
      * rename reuse_tmp_allocation_excess_fraction
      test=develop
      064512aa
  7. 10 1月, 2019 5 次提交
    • T
      fix typo and refine · c3a9f3c4
      tensor-tang 提交于
      test=develop
      c3a9f3c4
    • X
      Conv int8 residual (#15145) · 8f17c714
      xiaolil1 提交于
      * Enable basic MKL-DNN INT8 Conv OP
      test=develop
      
      * Modify test case
      test=develop
      
      * Clean unittest code
      test=develop
      
      * Fix test
      test=develop
      
      * Modify test
      test=develop
      
      * Enable MKL-DNN INT8 Conv with Relu Fusion OP
      test=develop
      
      * Enable INT8 Conv with residual fusion OP
      test=develop
      
      * Modify code.
      test=develop
      
      * Modify basic INT8 Conv
      test=develop
      
      * Modify Conv.
      test=develop
      
      * fix style
      test=develop
      
      * Fix style
      test=develop
      
      * Fix test
      test=develop
      
      * Modify code.
      test=develop
      
      * Fix test
      test=develop
      8f17c714
    • X
      Enhance key generation for INT8 test. · f34e779f
      xiaoli.liu@intel.com 提交于
      test=develop
      f34e779f
    • W
      [Feature] support mix precision training for resnet (#14899) · fd854183
      Wu Yi 提交于
      * clip softmax for fp16
      
      * updates
      
      * fuse xent support fp16 test=develop
      
      * wip
      
      * wip
      
      * add simple row reduce
      
      * wip fp16 accurate softmax
      
      * add accurate softmax kernel for fp16 test=develop
      
      * update test=develop
      
      * fix cpu build test=develop
      
      * update api.spec test=develop
      
      * follow comments test=develop
      
      * fix build test=develop
      
      * fix trt build test=develop
      
      * fix inference build test=develop
      
      * fix merge test=develop
      
      * update test=develop
      
      * try fix build test=develop
      
      * fix build test=develop
      
      * rename real_exp test=develop
      
      * fortest
      
      * remove hacky kernels test=develop
      
      * clean up test=develop
      fd854183
    • T
      follow comment and fix typo · 8e086a85
      tensor-tang 提交于
      test=develop
      8e086a85
  8. 09 1月, 2019 1 次提交
  9. 08 1月, 2019 7 次提交
  10. 07 1月, 2019 10 次提交