1. 19 6月, 2018 1 次提交
    • Q
      Fix decay bug (#11520) · a29cb4be
      Qiyang Min 提交于
      * Add sub_blocks of lr_decay_op to pserver_prog after distribute_transpiler
      
      * Remove unused logs and logics
      
      * 1. Add ops to new block (considering the nested block condition)
      2. Follow the original hierarchy of blocks
      3. Change the function's name and remove debug lines
      a29cb4be
  2. 16 6月, 2018 2 次提交
  3. 15 6月, 2018 1 次提交
    • K
      Modify Pybind LoDTensor API according to length-based LoD (#11106) · 417fcf4f
      Kexin Zhao 提交于
      * add lod_tensor util and modify pybind
      
      * refind pybind LoDTensor API and modify LoDTensor and DataFeeder test
      
      * fix test error
      
      * fix detection map op test
      
      * fix reorder_lod_tensor test
      
      * fix seq_concat_op
      
      * fix chunk evel op test
      
      * fix target assign op
      
      * fix warp ctc op
      
      * address comments step 1: reverse reset_lod op
      
      * step 2: modify op test
      
      * add warning message
      
      * remove has_valid_lod
      
      * add back has_valid_lod
      
      * address comments
      
      * add exception catching trial
      417fcf4f
  4. 14 6月, 2018 3 次提交
  5. 13 6月, 2018 3 次提交
  6. 12 6月, 2018 2 次提交
  7. 11 6月, 2018 11 次提交
  8. 10 6月, 2018 5 次提交
  9. 08 6月, 2018 6 次提交
  10. 07 6月, 2018 6 次提交
    • T
      make scope thread safe · b8d315fb
      tensor-tang 提交于
      b8d315fb
    • D
      split reduce op into multiple libraries, accelerate the compiling (#11029) · d48172f2
      dzhwinter 提交于
      * "split into multiple .ccl"
      
      * "refine file structure"
      
      * "refine files"
      
      * "remove the cmakelist"
      
      * "fix typo"
      
      * "fix typo"
      
      * fix ci
      d48172f2
    • D
      Big data op_test benchmark, for checking output consistent in different runs. (#10646) · f7c96f07
      dzhwinter 提交于
      * "init benchmark ops"
      
      * "untrack outputs"
      
      * "delete some usused code"
      
      * "benchmark"
      
      * "fix ci"
      
      * "fix op test"
      
      * "fix uint16 missing"
      
      * "fix ci"
      
      * "follow comments"
      
      * "fix ci"
      
      * "follow comments"
      
      * "conficts. merge develop branch"
      
      * repick
      
      * "merge develop branch"
      f7c96f07
    • F
      dc8e0b49
    • M
      Mkldnn layout (#11040) · 3ff9ba0e
      mozga-intel 提交于
      * Add MKLDNN layout support in Paddle
      
      Add MKLDNN layout in Paddle so that MKLDNN friendly memory layout
      can be used in MKLDNN enabled OP kernel. Before this commit, NCHW
      is hardcode to be used in all MKLDNN op kernels. As a result,
      non-optimized execution path is selected in MKLDNN primitive which
      bring worse performance.
      Besides framework change, three MKLDNN OP kernels were updated
      for using new MKLDNN layout. They are conv/pool2d/batch_norm.
      Other MKLDNN OP kernels need be also updated in similar way to
      achieve best performance.
      
      * Add MKLDNN layout support in activation OP
      
      * Don't populate layout from input to output when kMKLDNN in
      
      * Refine pool mkldnn op kernel
      
      * MKLDNN layout
      
      * Remove the inferitance from tensor file
      
      * MKLDNN layout: refactoring
      
      * Remove additional #define to register new operator
      
      * Prepare mkldnn tests to work with layout
      3ff9ba0e
    • C
      8291b916