1. 25 6月, 2019 2 次提交
    • H
      Sequence mask support tensor (#18249) · df2eee71
      Hongyu Liu 提交于
      * sequnce mask support max length tensor input; test=develop
      
      * add rnn_impl.py; test=develop
      
      * add basic gru lstm unittest; test=develop
      
      * fix api spec; test=develop
      
      * fix sequence_mask op bug;
      test=develop
      test=document_preview
      
      * change +-*x to elmentwise_op; test=develop
      
      * add mkl flag; test=develop
      
      * fix rnn impl bug; test=develop
      
      * update api spec; test=develop
      
      * fix doc bug; test=develop
      
      * fix lstm bugs; test=develop
      df2eee71
    • Q
      optimize communicator merge sparse gradient test=develop (#18159) · 0e08e91c
      Qiao Longfei 提交于
      * optimize communicator merge sparse gradient test=develop
      
      * revert multithread selected rows merge add test=develop
      
      * follow comment test=develop
      0e08e91c
  2. 24 6月, 2019 2 次提交
  3. 21 6月, 2019 2 次提交
  4. 20 6月, 2019 1 次提交
  5. 19 6月, 2019 2 次提交
  6. 17 6月, 2019 1 次提交
  7. 16 6月, 2019 4 次提交
    • Q
      Update backward appending stragety to support double backward and fix some bug. (#18104) · 80d2e66f
      qingqing01 提交于
      * Update backward.py:
           - If there is no input grad var in all outputs of previous ops, do not append this op into graph.
           - Only apply this stragety when double backward.
      * Update some double backward op.
      * Update sum_op to judge whether a tensor is empty by numel or IsInitialized().
      80d2e66f
    • F
      add detection output operator for supporting retinanet (#17896) · ff83655f
      FlyingQianMM 提交于
      * test=develop
      add detection output for supporting retinanet
      
      * test=develop
      add test_layers.py
      
      * test=develop
      add API.spec
      
      * test=develop
      alter test_retinanet_detection_output.py
      
      * test=develop
      alter round 2
      
      * test=develop
      alter retinanet_detection_output
      
      * test=develop
      alter paddle/fluid/API.spec
      
      * test=devlop
      alter detection.py
      
      * test=develop
      alter retinanet_detection_output
      
      * test=develop
      alter paddle/fluid/API.spec
      
      * test=develop
      alter detection.py
      
      * test=develop
      alter API.spec
      
      * test=develop
      alter retinanet_detection_output
      
      * test=develop
      alter paddle/fluid/API.spec
      
      * test=develop
      alter python/paddle/fluid/tests/unittests/test_retinanet_detection_output.py
      
      * test=develop
      alter python/paddle/fluid/tests/unittests/test_retinanet_detection_output.py
      
      * test=develop
      fix grammer error
      
      * test=develop
      fix grammer error
      
      * test=develop
      fix grammer error
      
      * test=develop
      alter python/paddle/fluid/tests/unittests/test_layers.py
      
      * test=develop
      alter paddle/fluid/API.spec
      ff83655f
    • F
      add sigmoid focal loss operator for supporting retinanet (#17895) · 0aee1f00
      FlyingQianMM 提交于
      * test=develop
      add sigmoid_focal_loss for supporting retinanet
      
      * test=develop
      add test_layers
      
      * test=develop
      add API.spc
      
      * test=develop
      alter sigmoid_focal_loss_op.cc
      
      * test=develop
      alter detection.py
      
      * test=develop
      alter API.spec
      
      * test=develop
      alter round 1
      
      * test=develop
      alter simooid_focal_loss
      
      * test=develop
      alter sigmoid_focal_loss_op.cc
      
      * test=develop
      alter test_layers.py
      
      * test=develop
      alter paddle/fluid/API.spec
      
      * test=develop
      alter sigmoid_focal_loss_op.cu
      
      * test=develop
      alter paddle/fluid/operators/detection/sigmoid_focal_loss_op.cc
      0aee1f00
    • F
      Update generate_proposal_labels_op to support CascadeRCNN. (#17200) · 9e4b9d97
      FDInSky 提交于
      * Update generate_proposal_labels_op to support CascadeRCNN.
      9e4b9d97
  8. 15 6月, 2019 2 次提交
    • F
      add target assign operator for supporting retinanet (#17893) · 9ed2f936
      FlyingQianMM 提交于
      * test=develop add target assign for retinanet
      
      * test=develop
      run ci
      
      * test=developp
      add test_layers
      
      * test=develop
      add APi.spec
      
      * test=develop
      alter round 1
      
      * test=develop
      alter rpn_target_assign_op.cc
      
      * test=develop
      alter test_rpn_target_assign_op.py
      
      * test=develop
      alter rpn_target_assign_op.cc
      
      * test=develop
      
      alter API.spec
      
      * test=develop
      alter paddle/fluid/operators/detection/rpn_target_assign_op.cc
      
      * test=develop
      alter rpn_target_assign_op.cc
      
      * test=develop
      alter python/paddle/fluid/layers/detection.py
      
      * test=develop
      alter paddle/fluid/API.spec
      9ed2f936
    • C
      Fix bug of scope_buffered_ssa_graph_executor (#18100) · 24e988a4
      chengduo 提交于
      * fix code bug
      test=develop
      24e988a4
  9. 14 6月, 2019 2 次提交
  10. 13 6月, 2019 3 次提交
    • L
      refactor the function ConvFwdPrimitiveDesc (#17897) · f8ecc3de
      lidanqing 提交于
      * refractor the function ConvFwdPrimitiveDesc
      test=develop
      
      * change according to review
      test=develop
      
      * use pointer way without boost::optional
      test=develop
      
      * pass vector to function by reference instead of raw vector
      test=develop
      
      * change pointer to shared_ptr
      test=develop
      f8ecc3de
    • W
      Added unit test for QAT FP32 & INT8 comparison (#17814) · 78e93286
      Wojciech Uss 提交于
      * added unit test for QAT FP32 & INT8 comparison
      
      test=develop
      
      * enabled other models and updated filenames
      
      test=develop
      
      * added accuracy check and multiple batch handling
      
      test=develop
      
      * removed quantization_mkldnn_pass.py
      
      test=develop
      
      * cleanup
      
      test=develop
      
      * updated model paths
      
      test=develop
      
      * renamed tests without MKL-DNN
      
      test=develop
      
      * fix reusing mkldnn pool2d primitive
      
      test=develop
      
      * add performance measuring
      
      test=develop
      
      * fix accuracy statistics
      
      test=develop
      
      * removed non-mkldnn tests
      
      test=develop
      
      * added conv2d_depthwise->conv2d mkldnn transformation
      
      test=develop
      
      * format update
      
      test=develop
      
      * fixed creating key for pool2d grad
      
      test=develop
      
      * added pass
      
      * Fix the accuracy issue while using float precision to get the scale.
      
      test=develop
      
      * Fix the format issue when 'X' is not nchw.
      
      test=develop
      
      * removed output comparing and changed number of images
      
      test=develop
      
      * cmake and comment fix
      
      test=develop
      
      * updated acc threshold for QAT comparison tests
      
      test=develop
      
      * added OMP_NUM_THREADS setting
      
      test=develop
      
      * enable all QAT INT8 tests
      
      test=develop
      
      * restored upstream version of a file
      
      test=develop
      
      * modified directory names
      
      test=develop
      78e93286
    • T
      concat op support negative axis (#18045) · 566bf2ec
      tensor-tang 提交于
      test=develop
      566bf2ec
  11. 12 6月, 2019 7 次提交
    • Y
      Optimize the concat and split cuda implementation for cases when the number of... · 7e463c84
      Yiqun Liu 提交于
      Optimize the concat and split cuda implementation for cases when the number of inputs/outputs is less than 5. (#17979)
      
      test=develop
      7e463c84
    • T
      fix save/load in fleet (#17675) · 101f74cb
      tangwei12 提交于
      * fix save/load in Fleet
      * add UT framework of Fleet
      101f74cb
    • G
      Fix GetExpectedKernelType of add_position_encoding_op (#17935) · a06b316b
      Guo Sheng 提交于
      * Fix the GetExpectedKernelType of add_position_encoding_op.
      test=develop
      
      * Fix the doc of lstm_unit outputs in nn.py.
      test=develop
      a06b316b
    • W
      Fix scatter and gather op when has duplicate index (#17952) · 8eb134c3
      wawltor 提交于
      * test=develop
      The scatter op has a calc bug when the indices has same index, the scatter op use overwrite mode to calculate the same index, fix this bug by using the accumulate mode to calculate the same index.At the same time, the gather op has the same bug when the op calc the grad. And we use the lib of open-blas and eigen to optimize the time cost in accumulate mode.
      
      * test=develop
      Fix some code format problem, and the same time add the test case in gather and scatter op
      8eb134c3
    • L
      update load_error_info, test=develop (#18000) · 75fcd292
      lujun 提交于
      Repair error prompt: Users are prompted to check whether the model or parameter files are damaged when loading parameters are wrong.
      75fcd292
    • W
      test=develop (#17984) · 2ae8decc
      wawltor 提交于
      Fix bug in sequence_unpad op, when allocate the output memory do not match actual memory, check memory failed. Fix this bug by allocating the output memeory in correct code position.
      2ae8decc
    • C
      add deformable psroi pooling (#17827) · 871af28d
      cjt222 提交于
      * add deformable psroi pooling
      
      * test=develop
      
      * test=develop
      
      * test=develop
      modify format
      
      * fix bug
      
      * test=develop run ci
      
      * test=develop
      add API.spec
      
      * add test_layers.py
      
      * run ci again
      
      * test=develop
      run ci again
      
      * run ci again
      
      * test=develop
      run ci again
      
      * test=develop
      run ci again
      
      * test=develop
      run ci again
      
      * add space between two lines
      
      * test=develop
      add space between two lines
      
      * test=develop
      add space between lines
      
      * test=develop
      modify comment in nn.py
      
      * test=develop
      add space between two lines
      
      * test=develop
      add space between two lines
      
      * update API.spec
      
      * run ci again
      
      * test=develop
      run ci again
      
      * rerun ci
      
      * test=develop
      rerun ci
      
      * change input shape
      
      * run ci
      
      * test=develop
      run ci
      
      * modify format of nn.py
      
      * test=develop
      
      * test=develop
      
      * test=develop
      update API.spec
      
      * test=develop
      fix API doc
      
      * modify API comment
      
      * modift API comment
      
      * test=develop
      update API.spec
      
      * test=develop
      modify comment
      
      * test=develop
      modift comment
      
      * test=develop
      modift comment
      
      * test=develop
      update API.spec
      
      * test=develop
      modify comment
      
      * test=develop
      add inference in nn.py
      
      * test=develop
      update API.spec
      
      * test=develop
      resolve confict
      
      * test=develop
      update API.spec
      871af28d
  12. 11 6月, 2019 3 次提交
    • S
      add unfold op (new op),test=develop (#17944) · 40885c22
      SunGaofeng 提交于
      * add unfold op
      test=develop
      
      * fix divide bug in python3 when calculating output width and height
      test=develop
      
      * add name=None in python api, move redundant code into inline function
      
      * try to trigger ci for this code
      test=develop
      40885c22
    • J
      [MKL-DNN] Thread-Safety for MKL-DNN reusing Part 1 (#17965) · 84bb45c0
      Jacek Czaja 提交于
      * - removed is_reusing_
      
      * - Added TID to keys for reusing apart from softmax PD
      
      * - compilation fix
      
      * - Yet another compilation fix
      
      * - Batch Norm and Conv adapted
      
      * - Fix to softmax MT
      
      * - Fixes to MT code of MKL-DNN
      
      * - Lint fixes
      
      test=develop
      84bb45c0
    • Update the Anakin interfaces for content-dnn and MLU (#17890) · bce259e5
      石晓伟 提交于
      * update anakin-engine interfaces for content-dnn
      
      test=develop
      
      * support only-gpu mode of Anakin
      
      modify eltwise parse
      
      test=develop
      
      * modification for thread-safe
      
      test=develop
      
      * Integrated template instance
      
      test=develop
      
      * increase template parameters
      
      test=develop
      
      * support MLU predictor
      
      test=develop
      
      * update anakin cmake files
      
      test=develop
      
      * update TargetWrapper::set_device
      
      * update the initialization of anakin subgraph
      
      test=develop
      
      * use the default constructor of base class
      
      test=develop
      bce259e5
  13. 10 6月, 2019 7 次提交
  14. 07 6月, 2019 1 次提交
  15. 06 6月, 2019 1 次提交