1. 25 2月, 2019 2 次提交
    • J
      [MKL-DNN] MKL-DNN specific Tensor modification (#15429) · dec9cf53
      Jacek Czaja 提交于
      * - Implemented draft of primitive desc keeping in Tensor
      
      test=develop
      
      - TransposeMKLDNNHandler::AcquireSrcMemory was reimplemented
      
      - Added nchw and nc formats setting for sake of compatiblity
      
      Fixed unit tests
      
      - Worakaround to problem with 5D data in conv
      
      - Added 3D and 1D MKL-DNN formats for name handles for tensor
      
      test=develop
      
      - Fix to UTs
      
      test=develop
      
      - Conv fp32 op was updated
      
      Cosmetic fixes
      
      test=develop
      
      - tensor mkldnn cosmetics
      
      test=develop
      
      - Moved most of mkl-dnn specific code from Tensor to mkl-dnn utils
      
      * - Lint fixes
      
      test=develop
      
      * - setting prim dec in Tensor , sets also layout to kMKLDNN
      
      test=develop
      
      * - Moved creation of prim desc totally out of Tensor
      
      test=develop
      
      * - Cosmetic fixes adter review
      
      test=develop
      dec9cf53
    • X
      polish · 5dd281f7
      Xin Pan 提交于
      test=develop
      5dd281f7
  2. 23 2月, 2019 1 次提交
  3. 22 2月, 2019 9 次提交
  4. 21 2月, 2019 6 次提交
    • X
      allow compiler to use graph · 26e32e09
      Xin Pan 提交于
      test=develop
      26e32e09
    • S
      add override to ApplyImpl · 0b926114
      Sylwester Fraczek 提交于
      and #pragma once in edited headers
      
      add #include<string> in edited headers
      
      test=develop
      0b926114
    • S
      fix typo releated->related · 543e53db
      Sylwester Fraczek 提交于
      543e53db
    • X
      add per kernel config and remove const_cast. · 5eb87506
      Xin Pan 提交于
      test=develop
      5eb87506
    • Q
      fix use gpu test=develop · 62f1248f
      Qiao Longfei 提交于
      62f1248f
    • D
      Profiler refine and add CUDA runtime api tracer (#15301) · a83e4704
      Dun 提交于
      * refine profiler && add runtime tracer
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * fix bug && test=develop
      
      * add thread id map && test=develop
      
      * test=develop
      
      * testing
      
      * bug fix
      
      * remove cuda event && refine code && test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * fix windows temp file && test=develop
      
      * test=develop
      
      * fix windows bug && test=develop
      
      * fix start up issue && test=develop
      
      * code polish &&  test=develop
      
      * remove unused code && test=develop
      
      * add some cupti cbid && test=develop
      
      * add FLAGS_multiple_of_cupti_buffer_size && test=develop
      
      * fix compile error && test=develop
      
      * add keyword && test=develop
      
      * fix && test=develop
      
      * code polish && test=develop
      a83e4704
  5. 19 2月, 2019 6 次提交
  6. 18 2月, 2019 10 次提交
  7. 14 2月, 2019 6 次提交