1. 19 Nov, 2018 3 commits
    • Optimize the layer_norm operator with AVX intrinsic function (#14417) · f4c869d8
      Committed by Yihua Xu
      * Optimize layer_norm operator with AVX intrinsic functions
      
      * Revert the wrong modifications
      
      * Implement the jit kernel for layer_norm operator
      
      * Add math header file to fix the compile issue (test=develop)

      * Add math header file to fix the compile issue (test=develop)

      * Fix the intrinsic header file issue (test=develop)
      
      * Fix the conflicts (test=develop)
      
      * Revert for CUDA compiler (test=develop)
      
      * Fix the CUDA dependency (test=develop)

      * Fix the macro issues (test=develop)
    • Convolution fusion operator. (#14449) · fd7e6431
      Committed by qingqing01
      * Convolution fusion operator.
      * Clean code
      test=develop
    • fix dist deps (#14471) · d7bd0361
      Committed by Wu Yi
      * fix dist deps test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
  2. 16 Nov, 2018 8 commits
    • fix build error on noavx · 1ffce8c0
      Committed by tensor-tang
      test=develop
    • Refine operator cmake (#14413) · a2d9b344
      Committed by Wu Yi
      * wip simplify operator framework
      
      * wip
      
      * wip
      
      * done test=develop
      
      * clean test=develop
      
      * fix test=develop
      
      * fix deps test=develop
      
      * fix cpu build test=develop
      
      * fix tensorrt build test=develop
      
      * fix tests test=develop
      
      * fix test=develop
      
      * fix cpu build test=develop
    • fix space_to_depth_op unicode problem (#14430) · 28bd5b7b
      Committed by Jiabin Yang
      * fix space_to_depth_op unicode problem
      
      * test=develop
    • Make nce support more distribution. (#13549) · 17226782
      Committed by whs
      * Fix truncated normal.
      
      * Fix.
      
      * Make nce support more distribution.
      
      * Fix API.spec.
      
      * Fix python API.
      
      * Fix.
      test=develop
      
      * Fix API.spec
      test=develop
      
      * Fix sampler.
      
      * Fix order of arguments in python API.
      test=develop
    • exp, sigmoid, tanh jitcode support more size · 1f00723f
      Committed by tensor-tang
      test=develop
    • Add cudnn ctc loss (#12366) · b32c13dc
      Committed by Wu Yi
      * add cudnn ctc loss
      
      * wip add test test=develop
      
      * wip
      
      * wip
      
      * done test=develop
      
      * move include cudnn test=develop
      
      * test test=develop
      
      * fix build test=develop
      
      * fix build test=develop
      
      * fix build on cudnn5 test=develop
      
      * fix cudnn5 build test=develop
      
      * fix cudnn5 build test=develop
      
      * merge develop softmax functor change test=develop
    • remove ComputeDeprecated · e2d6eddd
      Committed by tensor-tang
      test=develop
    • fix lrn on mac (#14426) · 64f7516a
      Committed by tensor-tang
      * rename and fix blas vsqr
      
      test=develop
      
      * update
  3. 15 Nov, 2018 11 commits
  4. 14 Nov, 2018 7 commits
  5. 13 Nov, 2018 8 commits
  6. 12 Nov, 2018 3 commits