1. 28 Aug, 2019 1 commit
    • Fix the correctness of async mode at distributed training (#18863) · 65c73684
      Committed by tangwei12
      * fix correctness of the communicator
      
      * fix a bug in send thread when sending var context is empty, test=develop
      
      * add lookup_table_prefetch_op and prefetch optimize, test=develop
      
      * remove GPU support for remote prefetch
      
      * force word2vec to run on CPU, test=develop
      
      * force the dist remote lookup table test to run on CPU, test=develop
  2. 27 Aug, 2019 1 commit
  3. 23 Aug, 2019 1 commit
  4. 22 Aug, 2019 2 commits
    • Enhance OpTest to check the consistency of operators when using and not using inplace (#19101) · a9d5fc51
      Committed by Leo Chen
      * add pybind interface to get all inplace ops, test=develop
      
      * enhance OpTest to check the consistency of operators when using and not using inplace, test=develop
      
      * handle corner cases in op_test, test=develop
      
      * support outputs without tensor holder_, like XShape in reshape_op, test=develop
      
      * fix bug: some ops have a GradOpMaker but actually no grad_op in OpInfoMap, test=develop
      
      * use reshape_grad instead of reshape in FlattenGradOp, test=develop
      
      * fix wrong debug dims info for variables like XShape, test=develop
      
      * change computational order in sum_op to reduce the computation difference when using inplace, test=develop
      
      * add inplace_atol to check group_norm, and skip inplace_grad for mkldnn, test=develop
      
      * follow sneaxiy's comments, test=develop
      
      * remove unused DefaultGradOpDescMaker in mkldnn op, test=develop
    • strengthen the error message of tensor's mutable_data (#19303) · e3c68bde
      Committed by Tao Luo
      * strengthen the error message of tensor's mutable_data
      
      test=develop
      
      * update error message
      
      test=develop
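The OpTest change in #19101 above checks that an operator produces the same result with and without inplace execution, within a tolerance (`inplace_atol`). A minimal sketch of that idea in plain Python follows. It does not use Paddle's OpTest; the helper names `scale_out_of_place`, `scale_inplace`, and `check_inplace_consistency` are hypothetical and exist only for illustration.

```python
import math


def scale_out_of_place(xs, factor):
    """Return a new list; the input buffer is left untouched."""
    return [x * factor for x in xs]


def scale_inplace(xs, factor):
    """Overwrite the input buffer and return it (the 'inplace' variant)."""
    for i in range(len(xs)):
        xs[i] *= factor
    return xs


def check_inplace_consistency(op, inplace_op, inputs, factor, inplace_atol=0.0):
    """Run both variants on separate copies and compare element-wise.

    inplace_atol is an absolute tolerance; 0.0 demands exact equality.
    """
    expected = op(list(inputs), factor)
    actual = inplace_op(list(inputs), factor)
    if len(expected) != len(actual):
        return False
    return all(
        math.isclose(e, a, rel_tol=0.0, abs_tol=inplace_atol)
        for e, a in zip(expected, actual)
    )
```

Each variant receives its own copy of the inputs, so the inplace run cannot corrupt the reference run. A nonzero `inplace_atol` is useful when the two code paths accumulate values in a different order.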
  5. 21 Aug, 2019 1 commit
  6. 19 Aug, 2019 5 commits
  7. 16 Aug, 2019 1 commit
  8. 15 Aug, 2019 2 commits
  9. 14 Aug, 2019 2 commits
  10. 13 Aug, 2019 1 commit
  11. 12 Aug, 2019 3 commits
  12. 11 Aug, 2019 1 commit
    • add save cache model api in fleet & add slots shuffle in dataset module & add... · 9150cf50
      Committed by yaoxuefeng
      add save cache model api in fleet & add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871)
      
      * add ctr related metric layer test=develop
      
      * add save cache and slots shuffle test=develop
      
      * add save cache and slots shuffle test=develop
      
      * fix error
      
      * fix error
      
      * fix style for ci
      
      * fix for comments
      
      * change SlotsShuffle input to std::string for generality
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * change non-const reference to pointer
      
      * fix style
      
      * fix style
      
      * fix style test=develop
      
      * fix style  test=develop
      
      * add return ins num in ctr metric op
      
      * change dtype to float in metric_op.py
      
      * fix error test=develop
      
      * fix style test=develop
      
      * fix API spec
      
      * fix API spec
      
      * fix API spec test=develop
      
      * add UT test=develop
  13. 10 Aug, 2019 1 commit
  14. 09 Aug, 2019 2 commits
  15. 08 Aug, 2019 2 commits
    • fix QueueDataset queue size (#19016) · fc038da7
      Committed by jiaqi
      * fix QueueDataset queue size: set queue size = batch size * 100 to avoid too many instances in the channel when training is much slower than reading data.
    • Fix memory overwriting of tensors returned by executor (#19030) · 8f537354
      Committed by Leo Chen
      * fix memory overlapping of fetch var (return of executor.run), test=develop
      
      * fix wrong usage of ParallelExecutor in op_test, test=develop
      
      * remove useless parameter and simplify code
      
      * avoid destructing tensors prematurely, test=develop
      
      * add testcase independent of OpTest, test=develop
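The QueueDataset fix in #19016 above caps the queue at batch size * 100 so that a fast reader cannot pile up instances while training lags behind. The sketch below shows that kind of backpressure with Python's standard `queue.Queue`. It is not Paddle's channel implementation; `make_instance_queue`, `try_enqueue`, and the constant name `QUEUE_SIZE_FACTOR` are assumptions made for illustration.

```python
import queue

# From the commit message: queue size = batch size * 100.
QUEUE_SIZE_FACTOR = 100


def make_instance_queue(batch_size):
    """Create a bounded queue whose capacity scales with the batch size."""
    return queue.Queue(maxsize=batch_size * QUEUE_SIZE_FACTOR)


def try_enqueue(q, instance):
    """Non-blocking enqueue; returns False when the queue is full,
    which signals that the reader has to wait for the trainer."""
    try:
        q.put_nowait(instance)
        return True
    except queue.Full:
        return False
```

A real reader thread would more likely call the blocking `q.put(instance)`, which simply pauses the reader until the trainer consumes an item.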
  16. 06 Aug, 2019 1 commit
  17. 02 Aug, 2019 4 commits
    • Open gc by default (#18836) · 7ac748ad
      Committed by Zeng Jinle
      * open gc by default, test=develop
      
      * fix test_train_recognize_digits and disable gc when ngraph is enabled, test=develop
      
      * fix conditional_block op eager deletion bug, test=develop
      
      * add some comments to reviewers, test=develop
    • support filelist size < trainer num && fix pull dense (#18956) · 02c370c3
      Committed by jiaqi
      * support filelist size < trainer num
      * pull dense params when stopping, so that local dense params match those on the pserver and the saved paddle model contains the same dense model as the pserver
      * enable QueueDataset to train on the same filelist several times
    • Disable fuse optimization option (#18924) · e7da0940
      Committed by chengduo
      * Disable fuse optimization
      test=develop
    • Fusion: seqpool_cvm_concat (#18471) · ee2f296e
      Committed by 石晓伟
      * add fusion_seqpool_cvm_concat test=develop
      
      * simplify pass, test=develop
      
      * fix code style, test=develop
  18. 01 Aug, 2019 1 commit
  19. 30 Jul, 2019 1 commit
  20. 29 Jul, 2019 2 commits
    • Remove legacy C++ memory optimization codes (#18834) · 8008ab4e
      Committed by Zeng Jinle
      * remove legacy memory optimization codes, test=develop
      
      * follow huihuang's comments, test=develop
      
      * follow luotao's comments, test=develop
    • add clear_model interface in fleetwrapper (#18815) · 52c1431e
      Committed by Thunderbrook
      * dump slot
      
      * test
      
      * proto
      
      * dump slot
      
      * test
      
      * proto
      
      * code style
      
      * code style
      
      * code style
      
      * style
      
      * add delete after unseen days
      
      * add unseen days
      
      * code style
      
      * conflict solve
      test=develop
      
      * add clear model
      
      * code style
      test=develop
      
      * code style
      test=develop
  21. 27 Jul, 2019 1 commit
  22. 26 Jul, 2019 1 commit
    • Feature/mem opt pass refactor (#18735) · a802da65
      Committed by Zeng Jinle
      * first version memory optimize pass, test=develop
      
      * remove move_tensor_sharing_pass, test=develop
      
      * refine code comments, add unittests, test=develop
      
      * turn off memory_optimize by default, test=develop
      
      * follow huihuang's comments, test=develop
      
      * follow chengduoZH's comments, test=develop
      
      * fix grammar error, add const qualifier, fix pass_test exception message, test=develop
      
      * follow chengduoZH's comments 2nd, test=develop
  23. 25 Jul, 2019 1 commit
  24. 24 Jul, 2019 2 commits
    • Update trt5 for paddle-trt (#18645) · 26ae6d49
      Committed by Zhaolong Xing
      * update paddle-trt for:
          1. fix bug: core dump in split plugin when batch > 2.
          2. add leaky_relu trt5.0 support (yolov3 from 65ms to 42ms).
          3. add new attr to dropout.
          4. shuffle channel, swish, relu6 support
          test=develop
      
      * 1. fix ci
      test=develop
    • add slot to sparse table (#18686) · d8396281
      Committed by Thunderbrook
      The change includes 2 things:
      
      1. saving the delta model and shrinking the table were previously controlled by the same parameter; delete_after_unseen_days is now added to control table shrinking.
      2. values in the sparse table previously had no slot; a slot is now added to the sparse table, along with DownpureCtrAccessor to support the new meta.
      test=develop
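To illustrate the two points of #18686 above, the sketch below shrinks a toy sparse table by dropping entries that have gone unseen for more than `delete_after_unseen_days` days. Only that parameter name comes from the commit message. The `SparseValue` layout (including its `slot` field), the `unseen_days` counter, the `shrink_table` function, and the choice of a strict "greater than" threshold are all hypothetical and do not reflect the actual accessor.

```python
from dataclasses import dataclass


@dataclass
class SparseValue:
    slot: int          # hypothetical: the slot the feature belongs to
    unseen_days: int   # hypothetical: days since the feature was last seen
    weight: float      # hypothetical: the stored parameter value


def shrink_table(table, delete_after_unseen_days):
    """Return a new table without entries unseen for too long.

    Assumption: an entry is dropped when unseen_days is strictly
    greater than delete_after_unseen_days.
    """
    return {
        key: value
        for key, value in table.items()
        if value.unseen_days <= delete_after_unseen_days
    }
```

Keeping the shrink threshold as its own argument mirrors the commit's intent: table shrinking is tuned independently of whatever controls saving the delta model.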