1. 03 12月, 2018 1 次提交
  2. 27 11月, 2018 5 次提交
    • C
      Add activation gelu (#14569) · 6c71c1f8
      Clementine 提交于
      6c71c1f8
    • M
      EltwiseMul: Extract StringToFormat to MKLDNN helper · 9455be0b
      Michal Gallus 提交于
      test=develop
      9455be0b
    • J
      - ASUM MKL integration · 8bfa1fa9
      Jacek Czaja 提交于
      8bfa1fa9
    • P
      minor fix · 38715e6f
      peizhilin 提交于
      38715e6f
    • J
      - conv2d transpose MKL-DNN · fb24690a
      Jacek Czaja 提交于
      test=develop
      
      - Added new header for MKLDNN reuse functionality
      
      - Extended conv2d_transpose GetExpectedKernelType for MKL-DNN supporrt
      
      - Buildable conv transpose mkldnn and conv mkldnn using conv template
      
      - Conv2d transpose roughlt implemented and buildable
      
      - Added modifications conv2d transpose MKLDNN unit tests
      
      - Fix to UT of conv2d transpose mkldnn op
      
      - Wrong type of MKLDNN primitive was chosen for conv2d transpose
      
      - HAcks for conv2d transpose
      
      - UT enalbed
      
      - Replaced copying loop with memcpy
      
      - Draft of passing lambda into AcquireMemory
      
      - Made reorder (IOHW->OIHW) to be called only once
      fb24690a
  3. 26 11月, 2018 2 次提交
  4. 23 11月, 2018 3 次提交
  5. 22 11月, 2018 4 次提交
    • C
      Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929) · 00b9e9a1
      chengduo 提交于
      * refine cublase
      test=develop
      
      * code refine
      
      * refine cublas
      
      * add GEMME_EX
      
      * add enable_cublas_tensor_op_math doc and add cublasCall
      test=develop
      
      * fix CublasCall for cuda version
      test=develop
      
      * fix error
      test=develop
      
      * fix GEMM_EX to be compatible with gcc 4.8
      test=develop
      
      * add GEMM_EX
      test=develop
      
      * to compatiable with gcc4.8
      test=develop
      00b9e9a1
    • P
      code style fix · e280c7a4
      peizhilin 提交于
      test=develop
      e280c7a4
    • P
      fix unit test cases · 7c8c9dc9
      peizhilin 提交于
      7c8c9dc9
    • W
      Windows/online (#14474) · d9a1f3e5
      wopeizl 提交于
      * add recordio support
      
      * disable the openblas multi-thread on windows since no support
      adjust the python script
      
      * code style
      
      * code style
      test=develop
      
      * add create_recordio_file_reader back
      
      * fix code style
      test=develop
      
      * fix the gtest.cmake on windows
      
      * fix cc_test on windows
      
      * fix the win build
      test=develop
      
      * remove fused compile support on windows
      test=develop
      
      * add the jit support
      test=develop
      
      * add the jit support, test=develop
      
      * add the jit support, test=develop
      
      * add the jit back
      fix compile error on windows
      
      * rollback test=develop
      
      * test case fix
      
      * disable DSO by default on windows
      
      * exclude warpctc_op on windows
      
      * exclude the dynload_warpctc out on windows
      test=develop
      
      * fix the scripts error
      test=develop
      
      * disable avx on windows by default
      test=develop
      
      * re-organize the cmake file
      
      * disable mkl on windows by default
      
      * add warp_ctc back
      
      * fix the dependency
      
      * fix the dependency
      
      * fix the build issue on windows
      
      * remove unsupported flag on windows
      
      * code style
      
      * code style
      test=develop
      
      * fix issue
      
      * add profiler, parallel_executor back
      
      * clean up the pre-definitions on windows
      
      * fix build issue
      
      * test=develop
      d9a1f3e5
  6. 21 11月, 2018 2 次提交
  7. 20 11月, 2018 2 次提交
  8. 19 11月, 2018 1 次提交
  9. 18 11月, 2018 2 次提交
  10. 17 11月, 2018 2 次提交
  11. 16 11月, 2018 4 次提交
    • W
      Add cudnn ctc loss (#12366) · b32c13dc
      Wu Yi 提交于
      * add cudnn ctc loss
      
      * wip add test test=develop
      
      * wip
      
      * wip
      
      * done test=develop
      
      * move include cudnn test=develop
      
      * test test=develop
      
      * fix build test=develop
      
      * fix build test=develop
      
      * fix build on cudnn5 test=develop
      
      * fix cudnn5 build test=develop
      
      * fix cudnn5 build test=develop
      
      * merge develop softmax functor change test=develop
      b32c13dc
    • P
      code style · dc80be27
      peizhilin 提交于
      test=develop
      dc80be27
    • P
      code style · d1a1fafc
      peizhilin 提交于
      d1a1fafc
    • P
      disable the openblas multi-thread on windows since no support · 162f2d41
      peizhilin 提交于
      adjust the python script
      162f2d41
  12. 15 11月, 2018 2 次提交
  13. 14 11月, 2018 2 次提交
  14. 13 11月, 2018 1 次提交
  15. 12 11月, 2018 2 次提交
  16. 09 11月, 2018 5 次提交