1. 25 Feb, 2020 2 commits
    • G
      fix compile&runtime lod_equality of lod_reset (#22737) · ba140222
      GaoWei8 committed
      ba140222
    • H
      PaddleBox Framework Part2 (#22466) · 175954d8
      hutuxian committed
      * Add two types of Metric Calculator: MultiTaskCalculator & CmatchRankCalculator.
      * Add a config for the DynamicAdjustChannelNum function to denote whether the remaining instances are discarded when they cannot be distributed evenly.
      * Remove CPU code in Pull/PushSparse; it will be added back once it has been fully tested.
      * Fix some known issues, such as copying persistable vars after running one epoch.
      175954d8
  2. 24 Feb, 2020 4 commits
    • S
      add partial_sum op in contrib (#22292) · 3132681e
      ShenLiang committed
      * add partial_sum_op, test=develop
      
      * modify the Paddle Error Message, test=develop
      
      * modify the Paddle Error Message, test=develop
      
      * modify the bug for python3, test=develop
      
      * modify the ut for ci, test=develop
      
      * mv to contrib, test=develop
      
      * use check_variable_and_dtype, test=develop
      
      * fix ci, test=develop
      
      * fix conflict, test=develop
      
      * add partial concat, test=develop
      
      * fix the conflict, test=develop
      
      * fix the error, test=develop
      
      * rm SSE4, test=develop
      3132681e
    • W
      Fusion group profile support (#22718) · 611411b9
      wangchaochaohu committed
      * add support for the driver API callback and fix the profiler name display bug
      611411b9
    • S
      add partial_concat op in contrib (#22528) · e1366613
      ShenLiang committed
      * add partial_concat, test=develop
      
      * fix the grids and blocks, test=develop
      
      * fix the Paddle_Enforce, test=develop
      
      * fix the doc of op, test=develop
      
      * fix the doc, test=develop
      
      * fix the doc of the op, test=develop
      
      * replace -1 with None, test=develop
      e1366613
    • G
      Add an inference interface to disable FC padding (#22097) · cdf5f6fb
      GaoWei8 committed
      * Add an interface of disabling FC padding
      * fix bert regression
      * polish fc padding interface
      * recover pass function
      * fix argument error
      * fix mkldnn error
      cdf5f6fb
  3. 23 Feb, 2020 1 commit
  4. 22 Feb, 2020 2 commits
  5. 21 Feb, 2020 3 commits
  6. 20 Feb, 2020 1 commit
  7. 19 Feb, 2020 1 commit
  8. 18 Feb, 2020 2 commits
  9. 17 Feb, 2020 5 commits
  10. 15 Feb, 2020 2 commits
    • S
      update ops' unittest data type from float32 to float64 and shape over 100 (#22544) · 90ee3666
      Steffy-zxf committed
      * update ops' unittests of elementwise_pow, elementwise_max, elementwise_min, scale and sqrt
      1. update the unittests of elementwise_pow, elementwise_max and scale to use float64 input data (float32 -> float64)
      2. fix the bug where elementwise_pow does not meet the threshold requirements when handling float64 data
      3. remove sqrt from op_accuracy_white_list.py
      4. update the unittests of the elementwise_pow, elementwise_max and elementwise_min ops so that their input data shapes exceed 100
      5. test=develop
      
      * modify the writing style according to suggestions
      test=develop
      90ee3666
    • F
      remove python inference warning (#22602) · f7eafca8
      flame committed
      f7eafca8
  11. 14 Feb, 2020 4 commits
  12. 13 Feb, 2020 3 commits
  13. 12 Feb, 2020 4 commits
  14. 11 Feb, 2020 6 commits
    • H
      Paddlebox about box_wrapper (#22497) · 1a7962be
      hutuxian committed
      Refine the PaddleBox framework. Main changes:
      * Add MetricMsg util class, which can calculate metrics like AUC, bucket_error, COPC.
      * Replace FeedPass with a new interface: BeginFeedPass & EndFeedPass.
      * Refactor the Pull/Push Sparse functions in box_wrapper.
      * Use a CUDA kernel to copy keys and feasigns between tensors and the boxps struct.
      * Cache the keys copied during pull sparse so they can be reused in the push period.
      1a7962be
    • Y
      multi-loss optimization by adding a DownpourOpt worker (#22025) · 2235ee1a
      yaoxuefeng committed
      * update
      
      * update test=develop
      
      * update compile set test=develop
      
      * update compile set test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update compile setting test=develop
      
      * update compile setting test=develop
      
      * update run demo test=develop
      
      * update test=develop
      
      * update test=develop
      
      * fix test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update format test=develop
      
      * update format test=develop
      
      * update style test=develop
      
      * update style test=develop
      
      * change style test=develop
      
      * change style test=develop
      
      * change style test=develop
      
      * add dataset unittest test=develop
      
      * update test=develop
      
      * update for record test=develop
      
      * update style for record test=develop
      
      * update for record test=develop
      
      * update for record test=develop
      
      * update for record test=develop
      
      * fix format test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      2235ee1a
    • Z
      Improve transpose performance with tile sm copy, test=develop (#22311) · 54970444
      zhaoyuchen2018 committed
      
      * Refine code, fix select tile error,test=develop
      
      * Refine element type and some comments, test=develop
      
      * Refine comments and gpu utils, test=develop
      
      * Remove some useless condition
      
      * Refine floor and ceil, test=develop
      
      * refine for loop. test=develop
      Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
      54970444
    • W
      Compile without nccl deps. [1/2] (#22509) · a90fa540
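      Wilber committed
      Support compiling without depending on NCCL. [1/2]
      
      In a multi-GPU setup, if the build was compiled without the WITH_NCCL switch enabled, the GPUs cannot communicate with each other, so only a single GPU can be selected for use.
      Co-authored-by: 石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>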
      a90fa540
    • G
      Make assign op support LoDTensorArray and modify while_loop API (#22309) · 3a59a7a1
      guofei committed
      This PR makes the assign op support LoDTensorArray and enables the loop_vars in
      while_loop to support tuple or list.
      3a59a7a1