1. 27 9月, 2020 1 次提交
  2. 21 9月, 2020 1 次提交
  3. 15 9月, 2020 1 次提交
  4. 27 8月, 2020 1 次提交
  5. 21 8月, 2020 1 次提交
    • Q
      support Baidu Kunlun AI Accelerator (#25959) · 138ecf24
      QingshuChen 提交于
      * support Baidu AI Accelerator
        * test=kunlun
      
      * minor
       * test=kunlun
      
      * support xpu op in separate file
       * test=kunlun
      
      * update XPU error message and remove duplicated code
      
       * test=kunlun
      
      * minor
       * test=kunlun
      
      * minor
       * test=kunlun
      138ecf24
  6. 18 8月, 2020 1 次提交
  7. 16 8月, 2020 2 次提交
  8. 15 8月, 2020 1 次提交
    • Z
      expose and unify the Tensor concepts to the user (#25978) · 6de463d3
      Zhou Wei 提交于
      * expose and unify the Tensor concepts to the user
      
      * expose tensor to user
      
      * add copy place for Tensor
      
      * add copy place for Tensor
      
      * add note
      
      * add macro PADDLE_WITH_CUDA
      
      * remove RUN_TYPE=DIST
      
      * fix some error
      6de463d3
  9. 13 8月, 2020 1 次提交
  10. 06 8月, 2020 1 次提交
    • T
      add heter ps mode (#25682) · 0cb60c70
      Thunderbrook 提交于
      * add heter ps mode
      
      * code style
      test=develop
      
      * add with_pslib
      test=develop
      
      * unitest
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * test monitor
      test=develop
      
      * prepare trainer
      test=develop
      
      * code style
      test=develop
      0cb60c70
  11. 30 7月, 2020 1 次提交
  12. 07 7月, 2020 1 次提交
  13. 16 6月, 2020 1 次提交
    • H
      Monitor Framework (#24079) · 5822862d
      hutuxian 提交于
      * Add a StatValue class in the backend to represent a stat.
      * Add a singleton StatRegistry to maintain the collection of stats.
      * For the sake of code neatness, we only support type of int and float, which can cover most of the scenarios.
      5822862d
  14. 08 6月, 2020 1 次提交
    • L
      Refine error message in pybind folder (#24886) · 6190023a
      Leo Chen 提交于
      * refine err_msg of pybind.cc, test=develop
      
      * refine err_msg in tensor_py.h, test=develop
      
      * refine error msg, test=develop
      
      * fix test_exception, test=develop
      
      * follow comments, test=develop
      6190023a
  15. 03 6月, 2020 1 次提交
    • Y
      Add crypto python (#24836) · aa47356b
      Yanghello 提交于
      * add crypto helper for paddle, test=develop
      
      * cryptopp.cmake bug fixed, test=develop
      
      * remove debug build type, test=develop
      
      * fixed CMakeLists for new target, test=develop
      
      * fix CI bug, test=develop
      
      * add cmake option flag DWITH_CRYPTO, test=develop
      
      * add crypto api for python, test=develop
      
      * Revert "add crypto api for python, test=develop"
      
      This reverts commit 3a1cfa9d.
      
      * Revert "Add crypto api (#24694)"
      
      This reverts commit 5a7a517c.
      
      * Revert "Revert "Add crypto api (#24694)""
      
      This reverts commit f952b19f.
      
      * fixed cryptopp cmake building error, test=develop
      
      * change WITH_CRYPTO building option to OFF, test=develop
      
      * â€fixed cipher test failed, test=develop
      
      * "add crypto api for python, test=develop"
      
      This reverts commit 83fb55c0.
      
      * travis CI bug fixed, test=develop
      
      * fixed test in python3
      
      * test=develop
      
      * fixed unittest, test=develop
      aa47356b
  16. 11 5月, 2020 1 次提交
    • C
      Add macro BOOST_GET to enrich the error information of boost :: get (#24175) · aa0f254f
      Chen Weihang 提交于
      * add new macro BOOST_GET_SAFELY & unittests, test=develop
      
      * add different macro type, test=develop
      
      * fix get macro type in executor, test=develop
      
      * four macro part change backup
      
      * using one macro for all case, test=develop
      
      * revert attribute change, test=develop
      
      * change to three func to solve gcc4.8 bug, test=develop
      
      * polish some details, test=develop
      aa0f254f
  17. 26 4月, 2020 1 次提交
  18. 24 4月, 2020 1 次提交
  19. 19 4月, 2020 1 次提交
    • G
      Support LoDTensorArray in fetch (#23645) · 2b896c1f
      guofei 提交于
      * Support LoDTEnsorArray in fetch op
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      2b896c1f
  20. 10 4月, 2020 1 次提交
  21. 09 4月, 2020 2 次提交
  22. 06 4月, 2020 1 次提交
    • C
      Implement StaticModelRunner to support dygraph fine-tune static graph pre-training model (#23171) · 75bd3507
      Chen Weihang 提交于
      * static model runner basic implement, test=develop
      
      * add run program op to execute loaded program, test=develop
      
      * refactor static model runner & run program op, test=develop
      
      * reset engine.cc to resolve conflict
      
      * adapt the change of dygraph double grad, test=develop
      
      * refactor impl to solve control flow error, test=develop
      
      * clear debug code, test=develop
      
      * fix ci str compatible error & checkout dygraph grad maker & add example, test=develop
      
      * hide api & add op test, test=develop
      
      * fix run program op test places error, test=develop
      
      * fix program by review comment, test=develop
      
      * delete change var desc name, test=develop
      
      * fix other program by review comment, test=develop
      
      * remove _static_graph_guard, test=develop
      
      * add selectedrows test, test=develop
      
      * remove desc parser, test=develop
      
      * fix detail program, test=develop
      
      * change socpe create & add test, test=develop
      75bd3507
  23. 03 4月, 2020 2 次提交
    • G
    • L
      [feature] prune program by feed and fetch_list automatically (#22474) · a62599a8
      Leo Chen 提交于
      * prune train program by fetch_list, test=develop
      
      * add unittest for prune, test=develop
      
      * fix pruned feed, test=develop
      
      * support ParallelExecutor and feed prune, test=develop
      
      * add comments, test=develop
      
      * update unittest, test=develop
      
      * update unittests, test=develop
      
      * remove debug code, test=develop
      
      * support cond in clone, test=develop
      
      * support cond in prune, test=develop
      
      * support multiple minimize, test=develop
      
      * support cache, test=develop
      
      * fix _copy_param_info_from, test=develop
      
      * support python2 str, test=develop
      
      * remove debug code, test=develop
      
      * fix bug of caching CompiledProgram, test=develop
      
      * fix multi_device issue, test=develop
      
      * tmp
      
      * support tuple in fetch_list and overriding use_prune, test=develop
      
      * dont use nonlocal in python2, test=develop
      
      * remove nonlocal, test=develop
      
      * code clean, test=develop
      
      * code clean, test=develop
      
      * feed list, test=develop
      
      * test adam, test=develop
      
      * follow comments, test=develop
      
      * reduce duplicate code, test=develop
      
      * update comments, test=develop
      a62599a8
  24. 20 3月, 2020 1 次提交
    • Z
      Reader sequential and inference partial feed (#22699) · acfc9b8a
      Zeng Jinle 提交于
      * sequential reader stage 1, test=develop
      
      * fix ut, test=develop
      
      * fix iterable=False reset bug, add some logs and polish code, test=develop
      
      * inference feed partial data, test=develop
      
      * Turn on keep_order=True for test, test=develop
      
      * enhance ut to test more cases, test=develop
      
      * test commit for reverting
      
      * Revert "test commit for reverting", test=develop
      
      This reverts commit 80aef42e.
      
      * add ut of merged and unmerged results, test=develop
      
      * add more uts for coverages and add en doc of api, test=develop
      
      * follow comments, test=develop
      
      * change note style, test=develop
      acfc9b8a
  25. 02 3月, 2020 2 次提交
    • Z
      Unmerged fetch list (#22635) · 89cfa491
      Zhen Wang 提交于
      * update ScopeBufferedSSAGraphExecutor&AsyncSSAGraphExecutor&ThreadedSSAGraphExecutor&FastThreadedSSAGraphExecutor&ParallelSSAGraphExecutor&ParallelExecutor for fetching unmerged results.
      
      * add the unit test for fetch_unmerged.
      
      * update ut for multi-card and multi-cpu.
      
      * add the error message and the user suggestion in FetchOpHandle. test=develop
      89cfa491
    • C
      Speed up dygraph DataLoader based on shared memory and LoDTensor serialization (#22541) · 7d8d5734
      Chen Weihang 提交于
      * add lodtensor share memory & serialization, test=develop
      
      * fix windows compile error, test=develop
      
      * deal vartype pickle & fix unittest matching error message, test=develop
      
      * update timeout variable name, test=develop
      
      * refactor memory map implement, test=develop
      
      * clear mmap file discripter when exit unexpectedly, test=develop
      
      * remove the child process fd in advance, test=develop
      
      * remove mmap fds after Queue.put in child process, test=develop
      
      * add hard unittests for register exit func, test=develop
      
      * fix python2 compatibility problem in unittest, test=develop
      
      * fix exception unittest error, test=develop
      
      * polish code based review comment, test=develop
      7d8d5734
  26. 26 2月, 2020 1 次提交
  27. 25 2月, 2020 1 次提交
    • H
      PaddleBox Framework Part2 (#22466) · 175954d8
      hutuxian 提交于
      * Add two types of Metric Calculator: MultiTaskCalculator & CmatchRankCalculator.
      * Add a config for DynamicAdjustChannelNum function to denote whether we will discard the remaining instances when they are not be distributed evenly.
      * Remove CPU code in Pull/PushSparse and we will add it back when testing it fully.
      * Fix some known issues: such as copying persistable vars after one epoch running.
      175954d8
  28. 18 2月, 2020 1 次提交
  29. 12 2月, 2020 1 次提交
  30. 10 2月, 2020 1 次提交
  31. 07 2月, 2020 1 次提交
    • Y
      Enable the detection of subgraph composed of grad ops (#21223) · dcfb6038
      Yiqun Liu 提交于
      * Add the first implememtation of fusion_group op #19621 (#3)
      
      * Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc.
      test=develop
      
      * Call CUDA driver api to launch the kernel compiled by nvrtc.
      test=develop
      
      * Disable for mac and windows.
      test=develop
      
      * Refine the codes to support manually specified num_threads and workload_per_thread.
      test=develop
      
      * Refine the CUDA kernel to support large dims.
      test=develop
      
      * Add DeviceCodePool to manage all device codes.
      
      * Add the first implementation fusion_group op.
      
      * Add unit-test for fusion_group op.
      
      * Add the check of result.
      
      * Add the check of nvrtc in unit-test.
      test=develop
      
      * Add comment to explain the inputs, outputs and features of fusion_group op.
      test=develop
      
      * Disable fusion_group op for mac and windows.
      test=develop
      
      * Make the compiling of device code return status instead of hanging up.
      test=develop
      
      * Add the check of whether there is CUDA driver library, and do not core dump when failing to call the CUDA driver API.
      
      * Unify fusion_group_op's input and output names.
      test=develop
      
      * Add the check of CUDA driver library in unittest.
      test=develop
      
      * Enable generating code for a given subgraph. #21126 (#4)
      
      * Enable generating code for a given subgraph.
      
      * Support sorting the subgraph.
      
      * Remove the rearange of expressions because we use the sorted subgraph directly.
      
      * Enable generating code for a subgraph which is composed of grad ops.
      
      * Use expression information to check the accuracy in unittest.
      
      * Separate load and store from computation expressions.
      test=develop
      
      * Improve the loading statements in generated codes.
      test=develop
      
      * Remove unused arguments from formal list.
      test=develop
      
      * Enable the detection of subgraph of grad ops.
      
      * Generate code for detected subgraph in fusion_group_pass.
      
      * Add an option in BuildStrategy to enable fusion_group_pass and add unittest.
      test=develop
      
      * Fix a bug when checking whether the shape of all inputs are the same.
      
      * Add debug information.
      
      * Remove subgraph_detector from inference/analysis to the common framework/ir directory. (#5)
      
      test=develop
      
      * Call subgraph_detector in fusion_group pass.
      test=develop
      
      * Disable fusion_group when WITH_GPU is OFF.
      test=develop
      
      * Refine all PADDLE_ENFORCE message.
      test=develop
      
      * Fix the case that some inputs are not defined in grad ops, and set op_role for fused op.
      test=develop
      
      * Follow review comments.
      test=develop
      dcfb6038
  32. 05 2月, 2020 1 次提交
  33. 17 1月, 2020 1 次提交
    • Y
      Implement a common python unittest to test the ir passes. (#22209) · b7cac50b
      Yiqun Liu 提交于
      * Implement a common python unittest to test the ir passes.
      test=develop
      
      * Save the results in np.array and support to startup on CPU.
      test=develop
      
      * Fix the unittest.
      test=develop
      
      * Add check_program to check whether the optimized program is different from the origin one.
      test=develop
      
      * Remove the inferface all_ops.
      test=develop
      
      * Add exception test in pass_test.
      test=develop
      b7cac50b
  34. 14 1月, 2020 1 次提交
  35. 10 1月, 2020 1 次提交
    • Z
      Add bn and relu fuse pass (#22048) · 46189b16
      Zhen Wang 提交于
      * add bn and relu fuse pass
      
      * add op attr assert and dtype assert
      
      * fix some inputs&&outputs bugs for the fused op and pattern.
      
      * add the unittest for fuse_bn_act_pass. test=develop
      
      * use normative enforce statements. test=develop
      
      * add the cpu test. test=develop
      
      * add the support of batch_size=1 for the bn with relu op. test=develop
      
      * add the error type for paddle throws. test=develop
      
      * add fused_batch_norm_act and fused_batch_norm_act_grad to op_has_unsed_vars_white_list. test=develop
      46189b16
  36. 06 1月, 2020 1 次提交