1. 08 1月, 2020 1 次提交
    • D
      Support prroi_pool_op with Tensor and LoDTensor rois (#20649) · 6ea38091
      Double_V 提交于
      1. Add a new input named batch_roi_nums for prroi_pool_op. batch_roi_nums includes the number of roi for each image in batch when rois is Tensor. This information is saved in rois's lod when rois is LoDTensor.
      2. add grad check to prroi_pool_op and solve unnormal X grad diff in CPU.
      6ea38091
  2. 07 1月, 2020 4 次提交
  3. 06 1月, 2020 4 次提交
  4. 05 1月, 2020 1 次提交
  5. 04 1月, 2020 1 次提交
  6. 03 1月, 2020 5 次提交
    • S
      register int/int64_t/float16 in pow/square kernel,test=develop (#22023) · 7f4abaf2
      SunAhong1993 提交于
      * register int/int64_t/float16 in  pow/square kernel,test=develop
      
      * add abs/square/exp type,test=develop
      7f4abaf2
    • L
      register NoNeedBufferVarsInference for max_pool_grad_op, test=develop (#22055) · 3f653c83
      Leo Chen 提交于
      * fix test_conv2d_ngraph for grad diff, test=develop
      
      * register NoNeedBufferVarsInference for max_pool_grad_op, test=develop
      
      * refine error message, test=develop
      
      * fix numpy, test=develop
      
      * disable test conv2d_ngraph_op, test=develop
      Co-authored-by: NZhang Ting <709968123@qq.com>
      3f653c83
    • Y
      Add the first implememtation of fusion_group op (#19621) · d4832077
      Yiqun Liu 提交于
      * Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc.
      test=develop
      
      * Call CUDA driver api to launch the kernel compiled by nvrtc.
      test=develop
      
      * Disable for mac and windows.
      test=develop
      
      * Refine the codes to support manually specified num_threads and workload_per_thread.
      test=develop
      
      * Refine the CUDA kernel to support large dims.
      test=develop
      
      * Add DeviceCodePool to manage all device codes.
      
      * Add the first implementation fusion_group op.
      
      * Add unit-test for fusion_group op.
      
      * Add the check of result.
      
      * Add the check of nvrtc in unit-test.
      test=develop
      
      * Add comment to explain the inputs, outputs and features of fusion_group op.
      test=develop
      
      * Disable fusion_group op for mac and windows.
      test=develop
      
      * Make the compiling of device code return status instead of hanging up.
      test=develop
      
      * Add the check of whether there is CUDA driver library, and do not core dump when failing to call the CUDA driver API.
      
      * Unify fusion_group_op's input and output names.
      test=develop
      
      * Add the check of CUDA driver library in unittest.
      test=develop
      
      * Refine the calling of PADDLE_ENFORCE.
      test=develop
      d4832077
    • M
      [DNNL] 3D Fully-Connected (#21746) · 61921084
      Michał Gallus 提交于
      61921084
    • F
      fix generate_proposal_labesl op (#21793) · aa2ed0dc
      FDInSky 提交于
      * test=develop fix generate_proposal_labesl op
      aa2ed0dc
  7. 02 1月, 2020 2 次提交
  8. 31 12月, 2019 1 次提交
  9. 30 12月, 2019 1 次提交
  10. 27 12月, 2019 3 次提交
  11. 26 12月, 2019 2 次提交
  12. 25 12月, 2019 3 次提交
  13. 24 12月, 2019 3 次提交
    • A
      Optimize adam speed (#21777) · 51a86d2b
      Aurelius84 提交于
      * optimize adam speed by removing _finish_update test=develop
      
      * fix SparseAdamFunctor param list test=develop
      
      * Remove scale_op in expect_list of adam_op test=develop
      
      * fix test optimizer loss assert error test=develop
      
      * fix test optimizer loss assert error test=develop
      
      * modify PADDLE_ENFORCE usage test=develop
      
      * fix op_type in lamb_op.cc test=develop
      
      * fix errors ostream format bug test=develop
      
      * add betaPowOut in ngraph op test=develop
      
      * fix ngraph::op api for gcc8 test=develop
      
      * clean code test=develop
      
      * modify struct into class test=develop
      
      * remove code of beta1Tensor in lamb_op test=develop
      51a86d2b
    • F
      Update iou_similarity op to support non-normalized bbox (#21671) · 6b9fbcf3
      FDInSky 提交于
      Update iou_similarity op to support non-normalized bbox
      6b9fbcf3
    • G
      Modify the while_loop API (#21844) · 46f9184a
      guofei 提交于
      46f9184a
  14. 23 12月, 2019 2 次提交
  15. 20 12月, 2019 1 次提交
  16. 19 12月, 2019 4 次提交
  17. 17 12月, 2019 1 次提交
  18. 16 12月, 2019 1 次提交