1. 19 Sep, 2019 1 commit
    • Y
      Add a pass to fuse fc+elementwise_add+layernorm (#19776) · 3cd985a6
      Yiqun Liu committed
      * Add fc_elementwise_layernorm_fuse pass and unittest.
      
      * Add fused_fc_elementwise_layernorm op and its GPU kernel.
      test=develop
      
      * Apply fc_elementwise_layernorm_fuse_pass to GPU inference.
      
      * Add the setting of attrs in the definition of binary_op.
      test=develop
      
      * Add comment.
      
      * Implement the unittest.
      test=develop
      
      * Change the unittest name of layer_norm.
      test=develop
      3cd985a6
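As a rough illustration of the pattern this pass targets, the three operators can be written as one unfused reference computation, `layer_norm(fc(x) + y)`. The function names, argument layout, and `eps` default below are assumptions for illustration only; they are not taken from the Paddle operator definition, and the real fused GPU kernel may differ (for example in activation handling or normalization axis).

```python
import math


def fc(x, w, b):
    """Fully connected layer: x (rows x in_dim) times w (in_dim x out_dim) plus bias b."""
    return [
        [sum(xi * w[k][j] for k, xi in enumerate(row)) + b[j] for j in range(len(b))]
        for row in x
    ]


def layer_norm(rows, scale, shift, eps=1e-5):
    """Normalize each row to zero mean and unit variance, then scale and shift."""
    out = []
    for row in rows:
        mean = sum(row) / len(row)
        var = sum((v - mean) ** 2 for v in row) / len(row)
        inv_std = 1.0 / math.sqrt(var + eps)
        out.append([(v - mean) * inv_std * s + t for v, s, t in zip(row, scale, shift)])
    return out


def fc_elementwise_layernorm(x, w, b, y, scale, shift, eps=1e-5):
    """Unfused reference (hypothetical helper): layer_norm(fc(x) + y)."""
    hidden = fc(x, w, b)
    summed = [[h + r for h, r in zip(hrow, yrow)] for hrow, yrow in zip(hidden, y)]
    return layer_norm(summed, scale, shift, eps)
```

A fused kernel would produce the same values in a single launch, avoiding the intermediate tensors `hidden` and `summed`.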
  2. 17 Sep, 2019 1 commit
  3. 11 Sep, 2019 1 commit
    • Y
      Implement the GPU kernel of fc operator (#19687) · a65c728e
      Yiqun Liu committed
      * Refine the codes related to fc op.
      
      * Add GPU implementation for fc functor.
      
      * Apply fc_fuse_pass in GPU inference.
      test=develop
      
      * Change the cmake for fc op.
      
      * Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ.
      
      * Add an attribute to set the activation type in fc_op.
      
      * Enhance the unittest of fc_op.
      test=develop
      
      * Remove the declaration of FCOpGrad back to the header file.
      test=develop
      
      * Set default value for newly added arguments in test_fc_op.
      test=develop
      a65c728e
  4. 30 May, 2019 1 commit
  5. 28 Mar, 2019 1 commit
  6. 15 Mar, 2019 1 commit
    • Q
      Support sync batch norm. (#16121) · 8ad672a2
      qingqing01 committed
      * Support Sync Batch Norm.
      * Note: do not enable it when running on a single device.
      
      Usage:
      
      build_strategy = fluid.BuildStrategy()
      build_strategy.sync_batch_norm = True
      binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
              loss_name=loss_mean.name,
              build_strategy=build_strategy)
      8ad672a2
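For context, the idea behind synchronized batch normalization is that every device normalizes with statistics computed over the whole global batch rather than its local slice. The sketch below shows only that arithmetic in plain Python; `sync_batch_stats` is a hypothetical helper, not a Paddle API, and the actual implementation exchanges the partial sums between GPUs.

```python
def sync_batch_stats(device_batches):
    """Combine per-device partial sums into one global mean and variance.

    device_batches: list of lists, one list of values per device (assumed layout).
    """
    count = sum(len(batch) for batch in device_batches)
    total = sum(sum(batch) for batch in device_batches)
    total_sq = sum(sum(v * v for v in batch) for batch in device_batches)
    mean = total / count
    var = total_sq / count - mean * mean  # E[x^2] - (E[x])^2
    return mean, var


# Two devices, each holding half of the global batch.
mean, var = sync_batch_stats([[1.0, 2.0], [3.0, 4.0]])
```

With a single device the global and local statistics coincide, so synchronization adds nothing there.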
  7. 04 Mar, 2019 1 commit
    • D
      polish cudnn related code and fix bug. (#15164) · 4449e855
      dzhwinter committed
      * staged.
      
      * polish code
      
      * polish code. test=develop
      
      * polish code. test=develop
      
      * api change. test=develop
      
      * fix default value. test=develop
      
      * fix default value. test=develop
      4449e855
  8. 27 Feb, 2019 1 commit
    • D
      polish cudnn related code and fix bug. (#15164) · 225c11a9
      dzhwinter committed
      * staged.
      
      * polish code
      
      * polish code. test=develop
      
      * polish code. test=develop
      
      * api change. test=develop
      
      * fix default value. test=develop
      
      * fix default value. test=develop
      225c11a9
  9. 25 Feb, 2019 1 commit
  10. 29 Jan, 2019 1 commit
  11. 28 Dec, 2018 1 commit
    • Q
      Inception fusion operator. (#14968) · 6f0a1d7b
      qingqing01 committed
      * Inception fusion operator.
      * Support horizontal layer fusion in conv_fusion_op.
      * Search conv algo strategy for variable-length input.
         Search N times and cache the algos found. For any other input, choose the algo of the cached input whose area is closest to that input's area.
      6f0a1d7b
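The caching strategy described in this commit message can be sketched as follows. This is one possible reading, not the operator's actual code: it assumes "search N times" means running the search for the first N distinct input areas, and `ConvAlgoCache`, `search_fn`, and `max_searches` are hypothetical names.

```python
class ConvAlgoCache:
    """Search for the first `max_searches` distinct input areas, then reuse the nearest."""

    def __init__(self, search_fn, max_searches):
        self.search_fn = search_fn        # assumed callable: (height, width) -> algo
        self.max_searches = max_searches  # the "N" in the commit message
        self.cache = {}                   # input area -> chosen algo

    def get(self, height, width):
        area = height * width
        if area in self.cache:
            return self.cache[area]
        if len(self.cache) < self.max_searches:
            # Still within the search budget: run the search and remember the result.
            algo = self.search_fn(height, width)
            self.cache[area] = algo
            return algo
        # Budget exhausted: reuse the algo of the cached input with the closest area.
        nearest = min(self.cache, key=lambda cached_area: abs(cached_area - area))
        return self.cache[nearest]
```

Bounding the number of searches keeps the search cost fixed while still giving unseen input sizes an algo chosen for a similar workload.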
  12. 18 Dec, 2018 3 commits
  13. 05 Dec, 2018 1 commit
  14. 26 Nov, 2018 1 commit
  15. 22 Nov, 2018 1 commit
    • W
      Windows/online (#14474) · d9a1f3e5
      wopeizl committed
      * add recordio support
      
      * disable the openblas multi-thread on windows since no support
      adjust the python script
      
      * code style
      
      * code style
      test=develop
      
      * add create_recordio_file_reader back
      
      * fix code style
      test=develop
      
      * fix the gtest.cmake on windows
      
      * fix cc_test on windows
      
      * fix the win build
      test=develop
      
      * remove fused compile support on windows
      test=develop
      
      * add the jit support
      test=develop
      
      * add the jit support, test=develop
      
      * add the jit support, test=develop
      
      * add the jit back
      fix compile error on windows
      
      * rollback test=develop
      
      * test case fix
      
      * disable DSO by default on windows
      
      * exclude warpctc_op on windows
      
      * exclude the dynload_warpctc out on windows
      test=develop
      
      * fix the scripts error
      test=develop
      
      * disable avx on windows by default
      test=develop
      
      * re-organize the cmake file
      
      * disable mkl on windows by default
      
      * add warp_ctc back
      
      * fix the dependency
      
      * fix the dependency
      
      * fix the build issue on windows
      
      * remove unsupported flag on windows
      
      * code style
      
      * code style
      test=develop
      
      * fix issue
      
      * add profiler, parallel_executor back
      
      * clean up the pre-definitions on windows
      
      * fix build issue
      
      * test=develop
      d9a1f3e5
  16. 21 Nov, 2018 1 commit
  17. 19 Nov, 2018 2 commits
  18. 18 Nov, 2018 1 commit
  19. 17 Nov, 2018 1 commit
  20. 16 Nov, 2018 3 commits
    • W
      Refine operator cmake (#14413) · a2d9b344
      Wu Yi committed
      * wip simplify operator framework
      
      * wip
      
      * wip
      
      * done test=develop
      
      * clean test=develop
      
      * fix test=develop
      
      * fix deps test=develop
      
      * fix cpu build test=develop
      
      * fix tensorrt build test=develop
      
      * fix tests test=develop
      
      * fix test=develop
      
      * fix cpu build test=develop
      a2d9b344
    • W
      Make nce support more distribution. (#13549) · 17226782
      whs committed
      * Fix truncated normal.
      
      * Fix.
      
      * Make nce support more distribution.
      
      * Fix API.spec.
      
      * Fix python API.
      
      * Fix.
      test=develop
      
      * Fix API.spec
      test=develop
      
      * Fix sampler.
      
      * Fix order of arguments in python API.
      test=develop
      17226782
    • W
      Add cudnn ctc loss (#12366) · b32c13dc
      Wu Yi committed
      * add cudnn ctc loss
      
      * wip add test test=develop
      
      * wip
      
      * wip
      
      * done test=develop
      
      * move include cudnn test=develop
      
      * test test=develop
      
      * fix build test=develop
      
      * fix build test=develop
      
      * fix build on cudnn5 test=develop
      
      * fix cudnn5 build test=develop
      
      * fix cudnn5 build test=develop
      
      * merge develop softmax functor change test=develop
      b32c13dc
  21. 15 Nov, 2018 1 commit
  22. 13 Nov, 2018 1 commit
  23. 12 Nov, 2018 1 commit
  24. 09 Nov, 2018 2 commits
    • P
      simplify the logic · 7638f0af
      peizhilin committed
      7638f0af
    • L
      Add lod tensor array to tensor op (#13990) · 688ed601
      li099 committed
      * add lod tensor array concat
      
      * add lod tensor array concat
      
      * test=develop
      
      * add lod tensor array concat
      test=develop
      
      * Fix API.spec
      test=develop
      
      * add lod tensor array concat
      test=develop
      
      * revise some bug of lod tensor array concat
      test=develop
      
      * add unittest for tensor array concat
      test=develop
      
      * change to tensor array to tensor
      test=develop
      
      * revise bug
      test=develop
      
      * revise a bug
      test=develop
      
      * revise a bug
      test=develop
      
      * revise a bug of python3
      test=develop
      688ed601
  25. 08 Nov, 2018 2 commits
    • C
      Fix input<tensor> (#14208) · c5b6573a
      chengduo committed
      * fix input<tensor>
      test=develop
      
      * fix split_ids
      test=develop
      
      * ElementwiseMul should not support SelectedRows
      
      * fix scale op
      test=develop
      
      * change GetTensorFromVar() method to GetTensorOrSelectedRowsFromVar()
      
      * fix operator
      
      * refine MultiOutput
      
      * fix MultiOutput
      test=develop
      
      * disable test_dist_save_load
      test=develop
      
      * fix elementwise_op
      test=develop
      
      * add get_sparse_as_op
      test=develop
      
      * add info for check
      test=develop
      
      * rename get_sparse_as_op with extract_rows_as_op.
      test=develop
      
      * elementwise doesn't support selected_rows
      
      * fix regularizer
      
      * remove extract_rows_as
      test=develop
      
      * fix ci
      test=develop
      
      * add test for sum_op
      
      * fix regularizer
      test=develop
      
      *  test=develop
      
      * fix pserver weight decay multi inputs test=develop
      c5b6573a
    • Z
      Revert "cherry picked windows patches." · ba8b5619
      Zhaolong Xing committed
      ba8b5619
  26. 05 Nov, 2018 2 commits
  27. 31 Oct, 2018 1 commit
  28. 30 Oct, 2018 1 commit
  29. 26 Oct, 2018 1 commit
  30. 24 Oct, 2018 1 commit
  31. 23 Oct, 2018 1 commit
    • C
      Refine Split op (#13967) · a7497653
      chengduo committed
      * speedup split_op
      test=develop
      
      * speedup split_op
      test=develop
      
      * rename ConcatGrad to Split
      
      * refine concat and split
      test=develop
      
      * fix compile error
      a7497653
  32. 18 Oct, 2018 1 commit