1. 07 6月, 2018 9 次提交
    • D
      Big data op_test benchmark, for checking output consistent in different runs. (#10646) · f7c96f07
      dzhwinter 提交于
      * "init benchmark ops"
      
      * "untrack outputs"
      
      * "delete some usused code"
      
      * "benchmark"
      
      * "fix ci"
      
      * "fix op test"
      
      * "fix uint16 missing"
      
      * "fix ci"
      
      * "follow comments"
      
      * "fix ci"
      
      * "follow comments"
      
      * "conficts. merge develop branch"
      
      * repick
      
      * "merge develop branch"
      f7c96f07
    • M
      Mkldnn layout (#11040) · 3ff9ba0e
      mozga-intel 提交于
      * Add MKLDNN layout support in Paddle
      
      Add MKLDNN layout in Paddle so that MKLDNN friendly memory layout
      can be used in MKLDNN enabled OP kernel. Before this commit, NCHW
      is hardcode to be used in all MKLDNN op kernels. As a result,
      non-optimized execution path is selected in MKLDNN primitive which
      bring worse performance.
      Besides framework change, three MKLDNN OP kernels were updated
      for using new MKLDNN layout. They are conv/pool2d/batch_norm.
      Other MKLDNN OP kernels need be also updated in similar way to
      achieve best performance.
      
      * Add MKLDNN layout support in activation OP
      
      * Don't populate layout from input to output when kMKLDNN in
      
      * Refine pool mkldnn op kernel
      
      * MKLDNN layout
      
      * Remove the inferitance from tensor file
      
      * MKLDNN layout: refactoring
      
      * Remove additional #define to register new operator
      
      * Prepare mkldnn tests to work with layout
      3ff9ba0e
    • F
      fix a compile error · 2f5e3101
      fengjiayi 提交于
      2f5e3101
    • L
      add test_mode in trt/activation_op · f6fb51a1
      Luo Tao 提交于
      f6fb51a1
    • L
      install libgtk2.0-dev in latest images · 08220d39
      Luo Tao 提交于
      08220d39
    • T
      refine scope lock · f326b011
      tensor-tang 提交于
      f326b011
    • T
      refine nlp test · 9cf1f351
      tensor-tang 提交于
      9cf1f351
    • G
      Add rpc_client interface. (#11154) · 2028a8ef
      gongweibao 提交于
      2028a8ef
    • Y
      feature/trt engine op test (#11182) · 4f95bc94
      Yan Chunwei 提交于
      4f95bc94
  2. 06 6月, 2018 19 次提交
  3. 05 6月, 2018 9 次提交
  4. 04 6月, 2018 3 次提交