1. 20 Apr, 2020 · 1 commit
  2. 12 Apr, 2020 · 1 commit
  3. 10 Apr, 2020 · 3 commits
  4. 09 Apr, 2020 · 2 commits
  5. 07 Apr, 2020 · 1 commit
  6. 04 Apr, 2020 · 2 commits
  7. 03 Apr, 2020 · 1 commit
    • support Exhaustive search in dygraph (#23415) · dbfbd7ea
      Committed by zhongpu
      * use global conv cache; test=develop
      
      * use singleton cache; test=develop
      
      * fix format error; test=develop
      
      * add cudnn helper header; test=develop
      
      * fix header error; test=develop
      
      * fix mac unittest; test=develop
      
      * fix mac unittest; test=develop
      
      * fix file format; test=develop
      
      * fix include file error, test=develop
      
      * remove kernel_configs_ in class ExecutionContext and kernel_configs_map_ in class OperatorWithKernel, test=develop
      
      * fix test_elementwise_mul_op_dim, test=develop
      
      * fix compile error, test=develop
      Co-authored-by: Nphlrain <phliuhongyu@126.com>
  8. 02 Apr, 2020 · 2 commits
    • Revert "Exhaustive search (#22821)", test=develop (#23401) · bfb07aaf
      Committed by zhongpu
      This reverts commit 48144e40.
    • Exhaustive search (#22821) · 48144e40
      Committed by zhongpu
      * use global conv cache; test=develop
      
      * use singleton cache; test=develop
      
      * fix format error; test=develop
      
      * add cudnn helper header; test=develop
      
      * fix header error; test=develop
      
      * fix mac unittest; test=develop
      
      * fix mac unittest; test=develop
      
      * fix file format; test=develop
      
      * fix include file error, test=develop
      
      * remove kernel_configs_ in class ExecutionContext and kernel_configs_map_ in class OperatorWithKernel, test=develop
      
      * fix test_elementwise_mul_op_dim, test=develop
      Co-authored-by: Nphlrain <phliuhongyu@126.com>
  9. 26 Mar, 2020 · 1 commit
    • [Paddle-TRT]: Ernie Dynamic shape support. (#23138) · 430b0099
      Committed by Zhaolong Xing
      * add dynamic plugin support.
      test=develop
      
      * change emb eltwise layernorm to math function
      test=develop
      
      * add emb eltwise layernorm
      test=develop
      
      * can run dynamic shape ernie
      test=develop
      
      * fix ci
      test=develop
      
      * add ut for trt ernie dynamic
      
      test=develop
      
      * refine dynamic shape c++ interface.
      test=develop
      
      * fix comments
      test=develop
      
      * fix comments
      test=develop
  10. 20 Mar, 2020 · 2 commits
  11. 19 Mar, 2020 · 1 commit
  12. 13 Mar, 2020 · 1 commit
  13. 12 Mar, 2020 · 1 commit
  14. 11 Mar, 2020 · 1 commit
  15. 09 Mar, 2020 · 1 commit
  16. 05 Mar, 2020 · 1 commit
  17. 28 Feb, 2020 · 1 commit
  18. 23 Feb, 2020 · 1 commit
  19. 21 Feb, 2020 · 1 commit
  20. 13 Feb, 2020 · 1 commit
  21. 10 Feb, 2020 · 1 commit
  22. 16 Jan, 2020 · 1 commit
  23. 10 Jan, 2020 · 1 commit
    • Add bn and relu fuse pass (#22048) · 46189b16
      Committed by Zhen Wang
      * add bn and relu fuse pass
      
      * add op attr assert and dtype assert
      
      * fix some inputs&&outputs bugs for the fused op and pattern.
      
      * add the unittest for fuse_bn_act_pass. test=develop
      
      * use normative enforce statements. test=develop
      
      * add the cpu test. test=develop
      
      * add the support of batch_size=1 for the bn with relu op. test=develop
      
      * add the error type for paddle throws. test=develop
      
      * add fused_batch_norm_act and fused_batch_norm_act_grad to op_has_unsed_vars_white_list. test=develop
  24. 07 Jan, 2020 · 2 commits
  25. 03 Jan, 2020 · 1 commit
    • Add the first implementation of fusion_group op (#19621) · d4832077
      Committed by Yiqun Liu
      * Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc.
      test=develop
      
      * Call CUDA driver api to launch the kernel compiled by nvrtc.
      test=develop
      
      * Disable for mac and windows.
      test=develop
      
      * Refine the codes to support manually specified num_threads and workload_per_thread.
      test=develop
      
      * Refine the CUDA kernel to support large dims.
      test=develop
      
      * Add DeviceCodePool to manage all device codes.
      
      * Add the first implementation fusion_group op.
      
      * Add unit-test for fusion_group op.
      
      * Add the check of result.
      
      * Add the check of nvrtc in unit-test.
      test=develop
      
      * Add comment to explain the inputs, outputs and features of fusion_group op.
      test=develop
      
      * Disable fusion_group op for mac and windows.
      test=develop
      
      * Make the compiling of device code return status instead of hanging up.
      test=develop
      
      * Add the check of whether there is CUDA driver library, and do not core dump when failing to call the CUDA driver API.
      
      * Unify fusion_group_op's input and output names.
      test=develop
      
      * Add the check of CUDA driver library in unittest.
      test=develop
      
      * Refine the calling of PADDLE_ENFORCE.
      test=develop
  26. 27 Dec, 2019 · 1 commit
  27. 16 Dec, 2019 · 1 commit
  28. 06 Dec, 2019 · 1 commit
  29. 29 Nov, 2019 · 2 commits
    • Add descending for argsort (#21400) · b1627455
      Committed by zhaoyuchen2018
      * Add ascending for argsort
      
      * Refine api doc description.
      
      * Refine descending description
      
      * Add int32 logic to speedup when data is small size.
      
      * Remove int32 opt as not support in python
    • Add dygraph execution context (#20157) · ac854670
      Committed by hong
      * add_dygraph_execution_context
      
      * add dygraph infershape context and execution context; test=develop
      
      * fix imperative bug; test=develop
      
      * remove the inputs and outputs interface from execution context,
      because it has the same function as inputNames;
      test=develop
      
      * remove tracer_test ctest; test=develop
      
      * fix split op bug; test=develop
      
      * fix unittests bug; test=develop
      
      * fix distribute test bug; test=develop
      
      * fix ngraph compile bug; test=develop
      
      * fix grad maker bug; test=develop
      
      * fix load op bugs; test=develop
      
      * fix operator.cc construct bug; test=develop
      
      * remove useless name find in operator; test=develop
      
      * add tracer_test; test=develop
      
      * fix concat, split bug; test=develop
      
      * remove tracer_test unitest; test=develop
      
      * fix attribute check bug; test=develop
      
      * add test code to fix coverage; test=develop
      
      * remove useless code, change check backward input in engine; test=develop
      
      * unlock var type infer shape; test=develop
      
      * add ShareAllLoD api; test=develop
      
      * add dygraph infershape context unittest; test=develop
      
      * remove increase and decrease lod in dygraph; test=develop
      
      * add override; test=develop
      
      * fix increase decrease lod; test=develop
      
      * fix paddle_enforce; test=develop
      
      * disable lod op dygraph check; test=develop
      
      * fix paddle enforce error; test=develop
      
      * add comment for op_registry and OperatorBase; test=develop
      
      * optimize the comment of op_registry; test=develop
      
      * fix format of comment; test=develop
      
      * fix format of comment; test=develop
      
      * optimize the format of comment; test=develop
      
      * optimize the format of the comment; test=develop
      
      * optimize comment of op_registry; test=develop
  30. 28 Nov, 2019 · 2 commits
  31. 26 Nov, 2019 · 1 commit