1. 19 9月, 2019 1 次提交
    • Y
      Add a pass to fuse fc+elementwise_add+layernorm (#19776) · 3cd985a6
      Yiqun Liu 提交于
      * Add fc_elementwise_layernorm_fuse pass and unittest.
      
      * Add fused_fc_elementwise_layernorm op and its GPU kernel.
      test=develop
      
      * Apply fc_elementwise_layernorm_fuse_pass to GPU inference.
      
      * Add the setting of attrs in the definition of binary_op.
      test=develop
      
      * Add comment.
      
      * Implement the unittest.
      test=develop
      
      * Change the unittest name of layer_norm.
      test=develop
      3cd985a6
  2. 03 1月, 2019 1 次提交
  3. 28 12月, 2018 1 次提交
    • Q
      Inception fusion operator. (#14968) · 6f0a1d7b
      qingqing01 提交于
      * Inception fusion operator.
      * Support horizontal layer fusion in conv_fusion_op.
      * Search conv algo strategy for variable-length input.
         search N times and cache the searched algos. For other input, choose the algo of input whose area is closest to this input.
      6f0a1d7b
  4. 26 11月, 2018 1 次提交
  5. 16 11月, 2018 1 次提交
    • W
      Refine operator cmake (#14413) · a2d9b344
      Wu Yi 提交于
      * wip simplify operator framework
      
      * wip
      
      * wip
      
      * done test=develop
      
      * clean test=develop
      
      * fix test=develop
      
      * fix deps test=develop
      
      * fix cpu build test=develop
      
      * fix tensorrt build test=develop
      
      * fix tests test=develop
      
      * fix test=develop
      
      * fix cpu build test=develop
      a2d9b344