1. 03 4月, 2020 1 次提交
    • L
      [feature] prune program by feed and fetch_list automatically (#22474) · a62599a8
      Leo Chen 提交于
      * prune train program by fetch_list, test=develop
      
      * add unittest for prune, test=develop
      
      * fix pruned feed, test=develop
      
      * support ParallelExecutor and feed prune, test=develop
      
      * add comments, test=develop
      
      * update unittest, test=develop
      
      * update unittests, test=develop
      
      * remove debug code, test=develop
      
      * support cond in clone, test=develop
      
      * support cond in prune, test=develop
      
      * support multiple minimize, test=develop
      
      * support cache, test=develop
      
      * fix _copy_param_info_from, test=develop
      
      * support python2 str, test=develop
      
      * remove debug code, test=develop
      
      * fix bug of caching CompiledProgram, test=develop
      
      * fix multi_device issue, test=develop
      
      * tmp
      
      * support tuple in fetch_list and overriding use_prune, test=develop
      
      * dont use nonlocal in python2, test=develop
      
      * remove nonlocal, test=develop
      
      * code clean, test=develop
      
      * code clean, test=develop
      
      * feed list, test=develop
      
      * test adam, test=develop
      
      * follow comments, test=develop
      
      * reduce duplicate code, test=develop
      
      * update comments, test=develop
      a62599a8
  2. 20 3月, 2020 1 次提交
    • Z
      Reader sequential and inference partial feed (#22699) · acfc9b8a
      Zeng Jinle 提交于
      * sequential reader stage 1, test=develop
      
      * fix ut, test=develop
      
      * fix iterable=False reset bug, add some logs and polish code, test=develop
      
      * inference feed partial data, test=develop
      
      * Turn on keep_order=True for test, test=develop
      
      * enhance ut to test more cases, test=develop
      
      * test commit for reverting
      
      * Revert "test commit for reverting", test=develop
      
      This reverts commit 80aef42e.
      
      * add ut of merged and unmerged results, test=develop
      
      * add more uts for coverages and add en doc of api, test=develop
      
      * follow comments, test=develop
      
      * change note style, test=develop
      acfc9b8a
  3. 02 3月, 2020 2 次提交
    • Z
      Unmerged fetch list (#22635) · 89cfa491
      Zhen Wang 提交于
      * update ScopeBufferedSSAGraphExecutor&AsyncSSAGraphExecutor&ThreadedSSAGraphExecutor&FastThreadedSSAGraphExecutor&ParallelSSAGraphExecutor&ParallelExecutor for fetching unmerged results.
      
      * add the unit test for fetch_unmerged.
      
      * update ut for multi-card and multi-cpu.
      
      * add the error message and the user suggestion in FetchOpHandle. test=develop
      89cfa491
    • C
      Speed up dygraph DataLoader based on shared memory and LoDTensor serialization (#22541) · 7d8d5734
      Chen Weihang 提交于
      * add lodtensor share memory & serialization, test=develop
      
      * fix windows compile error, test=develop
      
      * deal vartype pickle & fix unittest matching error message, test=develop
      
      * update timeout variable name, test=develop
      
      * refactor memory map implement, test=develop
      
      * clear mmap file discripter when exit unexpectedly, test=develop
      
      * remove the child process fd in advance, test=develop
      
      * remove mmap fds after Queue.put in child process, test=develop
      
      * add hard unittests for register exit func, test=develop
      
      * fix python2 compatibility problem in unittest, test=develop
      
      * fix exception unittest error, test=develop
      
      * polish code based review comment, test=develop
      7d8d5734
  4. 26 2月, 2020 1 次提交
  5. 25 2月, 2020 1 次提交
    • H
      PaddleBox Framework Part2 (#22466) · 175954d8
      hutuxian 提交于
      * Add two types of Metric Calculator: MultiTaskCalculator & CmatchRankCalculator.
      * Add a config for DynamicAdjustChannelNum function to denote whether we will discard the remaining instances when they are not be distributed evenly.
      * Remove CPU code in Pull/PushSparse and we will add it back when testing it fully.
      * Fix some known issues: such as copying persistable vars after one epoch running.
      175954d8
  6. 18 2月, 2020 1 次提交
  7. 12 2月, 2020 1 次提交
  8. 10 2月, 2020 1 次提交
  9. 07 2月, 2020 1 次提交
    • Y
      Enable the detection of subgraph composed of grad ops (#21223) · dcfb6038
      Yiqun Liu 提交于
      * Add the first implememtation of fusion_group op #19621 (#3)
      
      * Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc.
      test=develop
      
      * Call CUDA driver api to launch the kernel compiled by nvrtc.
      test=develop
      
      * Disable for mac and windows.
      test=develop
      
      * Refine the codes to support manually specified num_threads and workload_per_thread.
      test=develop
      
      * Refine the CUDA kernel to support large dims.
      test=develop
      
      * Add DeviceCodePool to manage all device codes.
      
      * Add the first implementation fusion_group op.
      
      * Add unit-test for fusion_group op.
      
      * Add the check of result.
      
      * Add the check of nvrtc in unit-test.
      test=develop
      
      * Add comment to explain the inputs, outputs and features of fusion_group op.
      test=develop
      
      * Disable fusion_group op for mac and windows.
      test=develop
      
      * Make the compiling of device code return status instead of hanging up.
      test=develop
      
      * Add the check of whether there is CUDA driver library, and do not core dump when failing to call the CUDA driver API.
      
      * Unify fusion_group_op's input and output names.
      test=develop
      
      * Add the check of CUDA driver library in unittest.
      test=develop
      
      * Enable generating code for a given subgraph. #21126 (#4)
      
      * Enable generating code for a given subgraph.
      
      * Support sorting the subgraph.
      
      * Remove the rearange of expressions because we use the sorted subgraph directly.
      
      * Enable generating code for a subgraph which is composed of grad ops.
      
      * Use expression information to check the accuracy in unittest.
      
      * Separate load and store from computation expressions.
      test=develop
      
      * Improve the loading statements in generated codes.
      test=develop
      
      * Remove unused arguments from formal list.
      test=develop
      
      * Enable the detection of subgraph of grad ops.
      
      * Generate code for detected subgraph in fusion_group_pass.
      
      * Add an option in BuildStrategy to enable fusion_group_pass and add unittest.
      test=develop
      
      * Fix a bug when checking whether the shape of all inputs are the same.
      
      * Add debug information.
      
      * Remove subgraph_detector from inference/analysis to the common framework/ir directory. (#5)
      
      test=develop
      
      * Call subgraph_detector in fusion_group pass.
      test=develop
      
      * Disable fusion_group when WITH_GPU is OFF.
      test=develop
      
      * Refine all PADDLE_ENFORCE message.
      test=develop
      
      * Fix the case that some inputs are not defined in grad ops, and set op_role for fused op.
      test=develop
      
      * Follow review comments.
      test=develop
      dcfb6038
  10. 05 2月, 2020 1 次提交
  11. 17 1月, 2020 1 次提交
    • Y
      Implement a common python unittest to test the ir passes. (#22209) · b7cac50b
      Yiqun Liu 提交于
      * Implement a common python unittest to test the ir passes.
      test=develop
      
      * Save the results in np.array and support to startup on CPU.
      test=develop
      
      * Fix the unittest.
      test=develop
      
      * Add check_program to check whether the optimized program is different from the origin one.
      test=develop
      
      * Remove the inferface all_ops.
      test=develop
      
      * Add exception test in pass_test.
      test=develop
      b7cac50b
  12. 14 1月, 2020 1 次提交
  13. 10 1月, 2020 1 次提交
    • Z
      Add bn and relu fuse pass (#22048) · 46189b16
      Zhen Wang 提交于
      * add bn and relu fuse pass
      
      * add op attr assert and dtype assert
      
      * fix some inputs&&outputs bugs for the fused op and pattern.
      
      * add the unittest for fuse_bn_act_pass. test=develop
      
      * use normative enforce statements. test=develop
      
      * add the cpu test. test=develop
      
      * add the support of batch_size=1 for the bn with relu op. test=develop
      
      * add the error type for paddle throws. test=develop
      
      * add fused_batch_norm_act and fused_batch_norm_act_grad to op_has_unsed_vars_white_list. test=develop
      46189b16
  14. 06 1月, 2020 1 次提交
  15. 18 12月, 2019 1 次提交
  16. 11 12月, 2019 1 次提交
  17. 06 12月, 2019 1 次提交
  18. 05 12月, 2019 2 次提交
    • Z
      add grad maker assert, test=develop (#21564) · 3a7caf48
      Zeng Jinle 提交于
      3a7caf48
    • L
      Split VarBase from Python Variable for Dygraph (#21359) · cdd46d7e
      Leo Chen 提交于
      * test=develop, fix docker with paddle nccl problem
      
      * don't expose numerous Tensor.set(), test=develop
      
      * fix condition, test=develop
      
      * fix float16 bug, test=develop
      
      * feed should be Tensor or np.array, not Variable or number, test=develop
      
      * use forcecast to copy numpy slice to new array, test=develop
      
      * remove float16-uint16 hacking, test=develop
      
      * add variable method to varbase and refactor to_variable to support return varbase
      
      * support kwargs in varbase constructor
      
      * add VarBase constructor to support default python args
      
      * refine varbase initial method
      
      * reset branch
      
      * fix ut for change VarBase error info to PaddleEnforce
      
      * cherry is parameter change before
      
      * overload isinstance to replace too many change of is_variable
      
      * rm useless files
      
      * rm useless code merged by git
      
      * test=develop, fix some ut failed error
      
      * test=develop, fix test_graph_wrapper
      
      * add some tests, test=develop
      
      * refine __getitem__, test=develop
      
      * add tests, test=develop
      
      * fix err_msg, test=develop
      cdd46d7e
  19. 04 12月, 2019 1 次提交
  20. 27 11月, 2019 1 次提交
    • Y
      Support numpy bridge (enabled by default in dygraph mode) (#20983) · d5ff79e5
      Youwei Song 提交于
      * add numpy bridge
      
      * fix template compile
      
      * add unittest, add default
      test=develop
      
      * fix unittest
      test=develop
      
      * fix unittest
      test=develop
      
      * zero_copy=True for to_variable,
      test=develop
      
      * bug fix
      test=develop
      
      * disable deprecated NumPy API
      test=develop
      
      * use better design of NumpyAllocator
      test=develop
      
      * fix Py_None check
      test=develop
      
      * reset c++ tracer when jump out dygraph guard
      test=develop
      
      * refine PADDLE_ENFORCE_xx format
      test=develop
      
      * bug fix of tracer switch
      test=develop
      
      * update decref
      test=develop
      d5ff79e5
  21. 25 11月, 2019 1 次提交
  22. 24 11月, 2019 1 次提交
    • D
      Refactor fetch handler (#21264) · 691ced87
      Dong Daxiang 提交于
      * fix fetch handler problem and refactor
      when a user define FetchHandler class, he or she should initialize a handler
      with variable dict. the key of a variable dict is a user defined name,
      the value of a variable dict is a Varaible generated from python API.
      
      For each fetching, a user should implement handler function in which
      fetched_result_dict will be available and the user can access the fetched value
      with user defined keys.
      691ced87
  23. 14 11月, 2019 1 次提交
  24. 01 11月, 2019 1 次提交
    • L
      Update Tensor.set() to support float16 (#19964) · 9974e407
      Leo Chen 提交于
      * don't expose numerous Tensor.set(), test=develop
      
      * fix condition, test=develop
      
      * fix float16 bug, test=develop
      
      * feed should be Tensor or np.array, not Variable or number, test=develop
      
      * use forcecast to copy numpy slice to new array, test=develop
      
      * remove float16-uint16 hacking, test=develop
      9974e407
  25. 31 10月, 2019 1 次提交
    • Y
      Refine the cache of program, context and scope in executor. (#18483) · 16e4d026
      Yiqun Liu 提交于
      * Refine the cache of program, context and scope in executor.
      test=develop
      
      * Refine the unittest test_executor_and_use_program_cache.
      
      * Add the test the PaddingRNN with use_program_cache=True.
      test=develop
      
      * Remove a check.
      test=develop
      
      * Refine the unittest to check whether it is correct when setting use_program_cache=True.
      test=develop
      16e4d026
  26. 29 10月, 2019 1 次提交
  27. 18 10月, 2019 1 次提交
  28. 14 10月, 2019 2 次提交
    • 6
      Dlpack support (#20039) · 12e4be03
      633WHU 提交于
      * support dlpack to tensor and implement python interface test=develop
      
      * add unittest for _to_dlpack and from_dlpack test=develop
      12e4be03
    • Z
      Refine py_reader exit (#20331) · 40effc61
      Zeng Jinle 提交于
      * refine py_reader exit, test=develop
      
      * fix multiprocess_reader exception unittest, test=develop
      
      * increase code coverage for legacy fluid.layers.py_reader, test=develop
      40effc61
  29. 11 10月, 2019 3 次提交
  30. 10 10月, 2019 2 次提交
    • H
      New save load interface (#20148) · fa43e80e
      hong 提交于
      * add new save load interface; test=develop
      
      * add new save interface; test=develop
      
      * add save load interface ;
      
      * fix save load error;
      
      * fix dygraph set dict bug;
      
      * add save load unit test; test=develop
      
      * fix test_imperative_optimizer bug; test=develop
      
      * fix unitest optimizer bug; test=develop
      
      * fix code coverage; test=develop
      
      * fix converage; test=develop
      
      * add document for apis; test=develop
      
      * fix unitest error; test=develop
      
      * fix save load unit test error; test=develop
      
      * fix error message; test=develop
      
      * change set_parameter set_optimizer to save_dygraph; test=develop
      
      * add load_graph check; test=develop
      
      * fix api spec; test=develop
      fa43e80e
    • L
      Polish en doc of LoDTensorArray, test=document_fix (#19972) · f4c56e9f
      Leo Chen 提交于
      * Polish en doc of LoDTensorArray, test=develop, test=document_fix
      
      * follow comments, test=develop, test=document_dix
      f4c56e9f
  31. 09 10月, 2019 1 次提交
    • Y
      refine CUDA CPU places en doc (#20243) · 20f68916
      Youwei Song 提交于
      * fix CUDA CPU places, test=document_fix, test=develop
      
      * fix CUDAPlace param doc, test=document_fix, test=develop
      
      * fix CUDAPlace param doc, test=document_fix, test=develop
      20f68916
  32. 07 10月, 2019 1 次提交
  33. 28 9月, 2019 1 次提交
    • Q
      Enable users to create custom cpp op outside framework. (#19256) · 1a3eef02
      qingqing01 提交于
      * How to write custom op needs to follow framework OP spec.
      * Package fluid_framework.so and headers into whl.
      * Add paddle.sysconfig.get_include() and paddle.sysconfig.get_lib() to get include dir and lib dir.
      * Export some C-APIs to merge OpInfo between core.so and custom_op.so.
      * Add unit testing.
      * Update API.spec.
      1a3eef02
  34. 27 9月, 2019 1 次提交
    • update operator compatible info, test=develop (#19978) · 01b9d079
      石晓伟 提交于
      * update operator compatible info, test=develop
      
      * revert cmake/version.cmake, test=develop
      
      * add unit_tests and fix bugs, test=develop
      
      * update ../paddle/fluid/framework/framework.proto, test=develop
      
      * fix bug of paddle/fluid/inference/api/analysis_predictor.cc, test=develop
      
      * update paddle/fluid/framework/version_test.cc, test=develop
      
      * add comments and rename interfaces, test=develop
      01b9d079