1. 02 Jul, 2019 2 commits
    • Y
      supports collective training with programs (#18392) · a873fa84
      Yi Liu committed
      1. Since the allreduce op has four reduce types, we split these four reduce types into four ops.
      2. We also refined the collective op code, e.g. we separated the collective op kernel into CPUKernel and CUDAKernel, and removed the device-specific DeviceContext template parameter, since the target DeviceContext is already known.
      3. We removed the newly added Collective op role to reduce the complexity of program and graph analysis.
      a873fa84
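The split by reduce type described in this commit can be pictured with a small single-process simulation. This is only a sketch: the commit message does not name the four reduce types, so sum, max, min and prod are assumed here, and `simulated_allreduce` is a hypothetical helper, not a Paddle op.

```python
from functools import reduce
import operator

# Assumed reduce types (the commit message does not list them).
REDUCE_OPS = {
    "sum": operator.add,
    "max": max,
    "min": min,
    "prod": operator.mul,
}


def simulated_allreduce(per_rank_values, reduce_type):
    """Reduce elementwise across ranks, then give every rank the same result."""
    op = REDUCE_OPS[reduce_type]
    # zip(*...) groups the i-th element of every rank's buffer together.
    reduced = [reduce(op, column) for column in zip(*per_rank_values)]
    # After an allreduce, each rank holds an identical copy of the result.
    return [list(reduced) for _ in per_rank_values]


# Two simulated ranks, each holding a two-element buffer.
ranks = [[1, 2], [3, 4]]
sum_result = simulated_allreduce(ranks, "sum")
prod_result = simulated_allreduce(ranks, "prod")
```

Giving each reduce type its own entry in the table mirrors the idea of one op per reduce type, as opposed to a single op that takes the reduce type as an attribute.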
    • C
      Add find_no_grad_vars in backward.py (#17942) · e0d8c6ac
      chengduo committed
      * add not_been_used_vars to no_grad_set
      test=develop
      e0d8c6ac
  2. 01 Jul, 2019 1 commit
  3. 27 Jun, 2019 2 commits
    • K
      add WITH_COVERAGE option, default OFF (#17872) · 27fb9cad
      kh2se2013 committed
      * add WITH_COVERAGE option, default OFF
      
      test=develop
      
      * add coverage for python sdk
      
      test=develop
      
      * fix code style
      
      * fix COVERAGE_FILE path
      
      test=develop
      
      * remove coverage package
      
      test=develop
      
      * test = develop, run coverage as module
      27fb9cad
    • H
      supports collective communicated training (#18175) · b7128bac
      HaoRen committed
      * fix prepare context redundant code problem, optimize executor by caching create_variables
      test=develop
      
      * supports collective training in executor
      
      * make fetch_list runnable with variables, add more unittest for use_program_cache
      test=develop
      
      * fix comment
      test=develop
      
      * use unique name for nccl_id
      
      * supports output to stream in program_to_code
      
      * insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code
      
      * set op role in collective training
      
      * add collective op role
      
      * remove orig file
      
      * add build optimizer by strategy
      
      * add collective strategy
      
      * refine collective strategy
      
      * add multi-process role maker
      
      * refine strategy building factory so that we can easily plug in more strategies
      
      * scale loss grad in collective sgd transpiler
      
      * add support for distributed fc
      
      * code format
      
      * revert some features for dist fc
      
      * add support for distributed fc training
      
      * test=develop
      add collective op unittest standard
      
      * test=develop
      remove the test_collective directory
      
      * test=develop
      remove the test_collective directory
      
      * remove slicegather test
      
      * code format for reducescatter
      
      * update attr of shard_index_op
      
      * Modify macro nccl_helper
      
      * remove test without distribute
      
      * macro collective_helper
      
      * macro update
      
      * test=develop
      update support python3.5
      
      * test=develop change gpu memory use to 0.1 when test
      
      * test=develop
      update ut equal func
      
      * test=develop
      set flags to 1.5
      
      * test=develop fix pickle dump on py35
      
      * test=develop
      fix divide in slice and add sync_comm_stream
      update atol and rtol to 1e-05
      rm shard_index op and test
      modify read input from file to read from memory
      remove origin_program in framework and add i/o in c_sync_calc_stream
      
      * test=develop update unittest sync operator I/O
      b7128bac
  4. 26 Jun, 2019 4 commits
  5. 25 Jun, 2019 1 commit
    • H
      Sequence mask support tensor (#18249) · df2eee71
      Hongyu Liu committed
      * sequence mask support max length tensor input; test=develop
      
      * add rnn_impl.py; test=develop
      
      * add basic gru lstm unittest; test=develop
      
      * fix api spec; test=develop
      
      * fix sequence_mask op bug;
      test=develop
      test=document_preview
      
      * change +-*x to elementwise_op; test=develop
      
      * add mkl flag; test=develop
      
      * fix rnn impl bug; test=develop
      
      * update api spec; test=develop
      
      * fix doc bug; test=develop
      
      * fix lstm bugs; test=develop
      df2eee71
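For readers unfamiliar with the op touched by this commit, a sequence mask marks which positions of a padded batch are valid. The sketch below is a plain-Python illustration under that general definition. The function name and signature are hypothetical and do not reproduce the Paddle API; `maxlen` is shown as an ordinary runtime value to stand in for the "max length as tensor input" case.

```python
def sequence_mask(lengths, maxlen=None):
    """Return a 0/1 mask with one row per sequence.

    Position j of row i is 1 when j < lengths[i], otherwise 0.
    If maxlen is not given, the longest sequence decides the row width.
    """
    if maxlen is None:
        maxlen = max(lengths)
    return [[1 if j < n else 0 for j in range(maxlen)] for n in lengths]


default_width = sequence_mask([1, 3, 2])
explicit_width = sequence_mask([1, 2], maxlen=4)
```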
  6. 23 Jun, 2019 1 commit
  7. 21 Jun, 2019 2 commits
  8. 20 Jun, 2019 2 commits
  9. 19 Jun, 2019 3 commits
  10. 18 Jun, 2019 1 commit
  11. 17 Jun, 2019 1 commit
  12. 16 Jun, 2019 5 commits
    • Q
      Update backward appending strategy to support double backward and fix some bugs. (#18104) · 80d2e66f
      qingqing01 committed
      * Update backward.py:
           - If there is no input grad var in all outputs of previous ops, do not append this op into graph.
           - Only apply this strategy for double backward.
      * Update some double backward op.
      * Update sum_op to judge whether a tensor is empty by numel or IsInitialized().
      80d2e66f
    • F
      add detection output operator for supporting retinanet (#17896) · ff83655f
      FlyingQianMM committed
      * test=develop
      add detection output for supporting retinanet
      
      * test=develop
      add test_layers.py
      
      * test=develop
      add API.spec
      
      * test=develop
      alter test_retinanet_detection_output.py
      
      * test=develop
      alter round 2
      
      * test=develop
      alter retinanet_detection_output
      
      * test=develop
      alter paddle/fluid/API.spec
      
      * test=devlop
      alter detection.py
      
      * test=develop
      alter retinanet_detection_output
      
      * test=develop
      alter paddle/fluid/API.spec
      
      * test=develop
      alter detection.py
      
      * test=develop
      alter API.spec
      
      * test=develop
      alter retinanet_detection_output
      
      * test=develop
      alter paddle/fluid/API.spec
      
      * test=develop
      alter python/paddle/fluid/tests/unittests/test_retinanet_detection_output.py
      
      * test=develop
      alter python/paddle/fluid/tests/unittests/test_retinanet_detection_output.py
      
      * test=develop
      fix grammar error
      
      * test=develop
      fix grammar error
      
      * test=develop
      fix grammar error
      
      * test=develop
      alter python/paddle/fluid/tests/unittests/test_layers.py
      
      * test=develop
      alter paddle/fluid/API.spec
      ff83655f
    • G
      add class name and timeline for test_dist_base.py (#18122) · 0941e3e0
      guru4elephant committed
      * add class name and timeline for test_dist_base.py
      0941e3e0
    • F
      add sigmoid focal loss operator for supporting retinanet (#17895) · 0aee1f00
      FlyingQianMM committed
      * test=develop
      add sigmoid_focal_loss for supporting retinanet
      
      * test=develop
      add test_layers
      
      * test=develop
      add API.spec
      
      * test=develop
      alter sigmoid_focal_loss_op.cc
      
      * test=develop
      alter detection.py
      
      * test=develop
      alter API.spec
      
      * test=develop
      alter round 1
      
      * test=develop
      alter sigmoid_focal_loss
      
      * test=develop
      alter sigmoid_focal_loss_op.cc
      
      * test=develop
      alter test_layers.py
      
      * test=develop
      alter paddle/fluid/API.spec
      
      * test=develop
      alter sigmoid_focal_loss_op.cu
      
      * test=develop
      alter paddle/fluid/operators/detection/sigmoid_focal_loss_op.cc
      0aee1f00
    • F
      Update generate_proposal_labels_op to support CascadeRCNN. (#17200) · 9e4b9d97
      FDInSky committed
      * Update generate_proposal_labels_op to support CascadeRCNN.
      9e4b9d97
  13. 15 Jun, 2019 2 commits
    • F
      add target assign operator for supporting retinanet (#17893) · 9ed2f936
      FlyingQianMM committed
      * test=develop add target assign for retinanet
      
      * test=develop
      run ci
      
      * test=developp
      add test_layers
      
      * test=develop
      add API.spec
      
      * test=develop
      alter round 1
      
      * test=develop
      alter rpn_target_assign_op.cc
      
      * test=develop
      alter test_rpn_target_assign_op.py
      
      * test=develop
      alter rpn_target_assign_op.cc
      
      * test=develop
      
      alter API.spec
      
      * test=develop
      alter paddle/fluid/operators/detection/rpn_target_assign_op.cc
      
      * test=develop
      alter rpn_target_assign_op.cc
      
      * test=develop
      alter python/paddle/fluid/layers/detection.py
      
      * test=develop
      alter paddle/fluid/API.spec
      9ed2f936
    • C
      Fix bug of scope_buffered_ssa_graph_executor (#18100) · 24e988a4
      chengduo committed
      * fix code bug
      test=develop
      24e988a4
  14. 14 Jun, 2019 3 commits
  15. 13 Jun, 2019 2 commits
  16. 12 Jun, 2019 5 commits
    • T
      fix save/load in fleet (#17675) · 101f74cb
      tangwei12 committed
      * fix save/load in Fleet
      * add UT framework of Fleet
      101f74cb
    • W
      Fix scatter and gather op when has duplicate index (#17952) · 8eb134c3
      wawltor committed
      * test=develop
      The scatter op had a calculation bug when the indices contained duplicates: it used overwrite mode for the repeated index. This is fixed by using accumulate mode for repeated indices. The gather op had the same bug when computing its gradient. The OpenBLAS and Eigen libraries are used to reduce the time cost of accumulate mode.
      
      * test=develop
      Fix some code format problems and add test cases for the gather and scatter ops
      8eb134c3
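The difference between overwrite and accumulate mode for a repeated index can be shown with a minimal pure-Python sketch. The helper names `scatter_overwrite` and `scatter_accumulate` are hypothetical and are not the Paddle kernels; the example starts from an all-zero base so that only the handling of the duplicate index differs.

```python
def scatter_overwrite(base, indices, updates):
    """Write each update to its index; for a repeated index the last write wins."""
    out = list(base)
    for i, u in zip(indices, updates):
        out[i] = u
    return out


def scatter_accumulate(base, indices, updates):
    """Add each update at its index, so a repeated index sums its contributions."""
    out = list(base)
    for i, u in zip(indices, updates):
        out[i] += u
    return out


base = [0.0, 0.0, 0.0]
indices = [1, 1, 2]            # index 1 appears twice
updates = [10.0, 5.0, 7.0]

overwritten = scatter_overwrite(base, indices, updates)    # 10.0 is lost at index 1
accumulated = scatter_accumulate(base, indices, updates)   # 10.0 + 5.0 at index 1
```

The same reasoning applies to the gradient of a gather: when one input row is gathered several times, each copy contributes a gradient, so the contributions have to be summed into that row and not overwritten.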
    • H
      Cherry-pick: fix random CI failure. (#18011) · 0bf25351
      Huihuang Zheng committed
      * Cherry-pick fix random Python3 CI failure.
      
      Some tests used "print('xxx').format('xxx')". That syntax
      works only in Python 2, not Python 3. Because those lines run
      only during data download, CI passes on machines that already have
      the data, which makes the failure appear random.
      
      * Cherry-pick: disable CUDNN case of test_warpctc_op
      
      Also temporarily disable a unit test. The test will be fixed with high priority.
      0bf25351
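The Python 2 versus Python 3 difference behind this fix can be reproduced in a few lines. In Python 2, `print` is a statement, so `print('xxx').format('xxx')` formats the string and then prints it. In Python 3, `print()` is a function that returns `None`, so the chained `.format` fails. The function names below are hypothetical and the message text is only a placeholder.

```python
import io
from contextlib import redirect_stdout


def broken_call(name):
    # Python 3: print() returns None, so .format is looked up on None
    # and raises AttributeError after the unformatted text is printed.
    return print("downloading {}").format(name)


def fixed_call(name):
    # Format the string first, then print the result.
    print("downloading {}".format(name))


buffer = io.StringIO()
with redirect_stdout(buffer):
    try:
        broken_call("dataset")
        broken_raised = False
    except AttributeError:
        broken_raised = True
    fixed_call("dataset")

printed = buffer.getvalue()
```

Because the broken line sits on a download path, it only executes when the data is missing, which matches the intermittent CI behaviour described in the commit message.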
    • K
      fix logging basicConfig not taking effect after import paddle (#17786) · 96ee528e
      Kaipeng Deng committed
      * fix logging unable. test=develop
      
      * unset sys.stdout for stream handler. test=develop
      
      * fix newly added basicConfig. test=develop
      
      * fix import error. test=develop
      96ee528e
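The symptom in the commit title follows from how the standard library behaves: `logging.basicConfig` does nothing when the root logger already has a handler. The sketch below simulates a library that attaches a handler to the root logger at import time. It is an illustration of the general mechanism and of one common remedy (a library-owned named logger); it is not a reconstruction of what this commit changed, and the logger name `my_library` is hypothetical.

```python
import logging

root = logging.getLogger()
# Start from a clean root logger so the example is reproducible.
for handler in list(root.handlers):
    root.removeHandler(handler)
root.setLevel(logging.WARNING)

# Simulate a library that configures the root logger when it is imported.
library_handler = logging.StreamHandler()
root.addHandler(library_handler)

# The user's later call is silently ignored: root already has a handler.
logging.basicConfig(level=logging.DEBUG)
level_after_late_config = root.level

# Remedy: the library keeps its handler on its own named logger.
root.removeHandler(library_handler)
library_logger = logging.getLogger("my_library")  # hypothetical name
library_logger.addHandler(library_handler)
library_logger.propagate = False

# Now the root logger is untouched, so basicConfig takes effect.
logging.basicConfig(level=logging.DEBUG)
level_after_fix = root.level
```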
    • C
      add deformable psroi pooling (#17827) · 871af28d
      cjt222 committed
      * add deformable psroi pooling
      
      * test=develop
      
      * test=develop
      
      * test=develop
      modify format
      
      * fix bug
      
      * test=develop run ci
      
      * test=develop
      add API.spec
      
      * add test_layers.py
      
      * run ci again
      
      * test=develop
      run ci again
      
      * run ci again
      
      * test=develop
      run ci again
      
      * test=develop
      run ci again
      
      * test=develop
      run ci again
      
      * add space between two lines
      
      * test=develop
      add space between two lines
      
      * test=develop
      add space between lines
      
      * test=develop
      modify comment in nn.py
      
      * test=develop
      add space between two lines
      
      * test=develop
      add space between two lines
      
      * update API.spec
      
      * run ci again
      
      * test=develop
      run ci again
      
      * rerun ci
      
      * test=develop
      rerun ci
      
      * change input shape
      
      * run ci
      
      * test=develop
      run ci
      
      * modify format of nn.py
      
      * test=develop
      
      * test=develop
      
      * test=develop
      update API.spec
      
      * test=develop
      fix API doc
      
      * modify API comment
      
      * modify API comment
      
      * test=develop
      update API.spec
      
      * test=develop
      modify comment
      
      * test=develop
      modify comment
      
      * test=develop
      modify comment
      
      * test=develop
      update API.spec
      
      * test=develop
      modify comment
      
      * test=develop
      add inference in nn.py
      
      * test=develop
      update API.spec
      
      * test=develop
      resolve conflict
      
      * test=develop
      update API.spec
      871af28d
  17. 11 Jun, 2019 1 commit
    • S
      add unfold op (new op),test=develop (#17944) · 40885c22
      SunGaofeng committed
      * add unfold op
      test=develop
      
      * fix divide bug in python3 when calculating output width and height
      test=develop
      
      * add name=None in python api, move redundant code into inline function
      
      * try to trigger ci for this code
      test=develop
      40885c22
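The Python 3 division issue mentioned in this commit is the usual `/` versus `//` difference: in Python 3, `/` always returns a float, so a size computed with it is no longer a valid integer dimension. The sketch below uses the standard sliding-window output-size formula as an assumption; the function name and parameters are hypothetical and are not taken from the Paddle source.

```python
def unfold_output_size(in_size, kernel, stride, pad_before, pad_after, dilation):
    """Number of sliding-window positions along one spatial dimension."""
    effective_kernel = dilation * (kernel - 1) + 1
    # Floor division keeps the result an int under both Python 2 and Python 3.
    return (in_size + pad_before + pad_after - effective_kernel) // stride + 1


out_height = unfold_output_size(8, kernel=3, stride=2,
                                pad_before=1, pad_after=1, dilation=1)

# The same expression with true division yields a float in Python 3.
true_division_result = (8 + 1 + 1 - 3) / 2 + 1
```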
  18. 10 Jun, 2019 2 commits