1. 21 Sep, 2020 1 commit
    • L
      Refine error msg in paddle/fluid/framework/details [part 1] (#25631) · bbc84e0f
      Leo Chen committed
      * refine error msg in var_handle.h, test=develop
      
      * refine all_reduce_op_handle
      
      * fix some error msg
      
      * refine variable_visitor
      
      * refine threaded_ssa_graph_executor
      
      * refine inplace related files
      
      * refine executor related files
      
      * refine fetch_op_handle.cc
      
      * fix bug
      
      * follow comments
  2. 03 Sep, 2020 2 commits
  3. 02 Sep, 2020 1 commit
    • W
      Add FetchAsyncOpHandle, and use it in FastThreadedExecutor (#26643) · 2d2c31a6
      wanghuancoder committed
      * optimized transformation from tensor to numpy, test=develop
      
      * Modify fetch op handle, from memcpy Sync to memcpy Async, test=develop
      
      * modify CUDAPinnedPlace to CPUPlace, test=develop
      
      * modify CPUPlace to CUDAPinnedPlace, and set default inplace to false, test=develop
      
      * revert fetch_op_handle, add fetch_async_op_handle, test=develop
      
      * revert fetch_op_handle, add fetch_async_op_handle, test=develop
      
      * fix error msg report, test=develop
      
      * fix bug in cpuplace, test=develop
      
      * fix bug in unmerge and tensorarray mode, test=develop
      
      * fix bug, double copy gpu memory, test=develop
      
      * fix chenweihang's review advice, test=develop
  4. 25 Aug, 2020 1 commit
    • W
      optimized transformation from tensor to numpy (#26447) · c1f5df52
      wanghuancoder committed
      * optimized transformation from tensor to numpy, test=develop
      
      * optimized transformation from tensor to numpy, pass pre-commit, test=develop
      
      * modify fetchophandle zerocopy to deepcopy in PE&CUP, test=develop
      
      * modify py:array construct, test=develop
      
      * fix _fetch_var to use deep copy, test=develop
  5. 07 Aug, 2020 1 commit
  6. 30 Jul, 2020 1 commit
  7. 10 Jul, 2020 1 commit
  8. 07 Jul, 2020 1 commit
    • H
      catch bad alloc exception (#25140) · 70d7d07f
      hong committed
      * catch bad alloc exception; test=develop
      
      * add unit test; test=develop
      
      * move bad alloc catch to the first place; test=develop
      
      * polish error message; test=develop
      
      * polish error message; test=develop
      
      * add mutex header; test=develop
  9. 03 Jun, 2020 1 commit
  10. 11 May, 2020 1 commit
    • C
      Add macro BOOST_GET to enrich the error information of boost::get (#24175) · aa0f254f
      Chen Weihang committed
      * add new macro BOOST_GET_SAFELY & unittests, test=develop
      
      * add different macro type, test=develop
      
      * fix get macro type in executor, test=develop
      
      * four macro part change backup
      
      * using one macro for all case, test=develop
      
      * revert attribute change, test=develop
      
      * change to three func to solve gcc4.8 bug, test=develop
      
      * polish some details, test=develop
  11. 23 Apr, 2020 1 commit
  12. 20 Apr, 2020 1 commit
    • Z
      Optimize the error messages of paddle CUDA API (#23816) · 78170037
      Zhou Wei committed
      * Optimize the error messages of paddle CUDA API, test=develop
      
      * fix the error messages of paddle CUDA API, test=develop
      
      * Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop
      
      * remove build_ex_string,test=develop
      
      * merge conflict,test=develop
  13. 19 Apr, 2020 1 commit
    • G
      Support LoDTensorArray in fetch (#23645) · 2b896c1f
      guofei committed
      * Support LoDTensorArray in fetch op
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
  14. 14 Apr, 2020 1 commit
  15. 10 Apr, 2020 2 commits
  16. 09 Apr, 2020 1 commit
    • M
      Remove: NGraph engine from PDPD repository (#23545) · 3baaee9a
      mozga-intel committed
      * Remove the NGraph engine from PDPD repository
      1. Each operator was removed from the operator's directory
      2. Each test was removed from the unittest directory
      3. The parallel executor support was removed from the PDPD
      4. The CMake file was removed from the PDPD
      5. The NG flags were removed from the repository
      test=develop
      
      * Remove ngraph from:
      1. Cmake file
      2. Python file
      test=develop
  17. 07 Apr, 2020 1 commit
  18. 05 Apr, 2020 1 commit
  19. 04 Apr, 2020 1 commit
  20. 03 Apr, 2020 1 commit
  21. 01 Apr, 2020 1 commit
  22. 25 Mar, 2020 2 commits
  23. 20 Mar, 2020 1 commit
    • Z
      Reader sequential and inference partial feed (#22699) · acfc9b8a
      Zeng Jinle committed
      * sequential reader stage 1, test=develop
      
      * fix ut, test=develop
      
      * fix iterable=False reset bug, add some logs and polish code, test=develop
      
      * inference feed partial data, test=develop
      
      * Turn on keep_order=True for test, test=develop
      
      * enhance ut to test more cases, test=develop
      
      * test commit for reverting
      
      * Revert "test commit for reverting", test=develop
      
      This reverts commit 80aef42e.
      
      * add ut of merged and unmerged results, test=develop
      
      * add more uts for coverages and add en doc of api, test=develop
      
      * follow comments, test=develop
      
      * change note style, test=develop
  24. 09 Mar, 2020 1 commit
  25. 02 Mar, 2020 1 commit
    • Z
      Unmerged fetch list (#22635) · 89cfa491
      Zhen Wang committed
      * update ScopeBufferedSSAGraphExecutor&AsyncSSAGraphExecutor&ThreadedSSAGraphExecutor&FastThreadedSSAGraphExecutor&ParallelSSAGraphExecutor&ParallelExecutor for fetching unmerged results.
      
      * add the unit test for fetch_unmerged.
      
      * update ut for multi-card and multi-cpu.
      
      * add the error message and the user suggestion in FetchOpHandle. test=develop
  26. 23 Feb, 2020 1 commit
  27. 22 Feb, 2020 1 commit
  28. 13 Feb, 2020 1 commit
  29. 12 Feb, 2020 1 commit
  30. 11 Feb, 2020 1 commit
  31. 07 Feb, 2020 1 commit
    • Y
      Enable the detection of subgraph composed of grad ops (#21223) · dcfb6038
      Yiqun Liu committed
      * Add the first implementation of fusion_group op #19621 (#3)
      
      * Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc.
      test=develop
      
      * Call CUDA driver api to launch the kernel compiled by nvrtc.
      test=develop
      
      * Disable for mac and windows.
      test=develop
      
      * Refine the codes to support manually specified num_threads and workload_per_thread.
      test=develop
      
      * Refine the CUDA kernel to support large dims.
      test=develop
      
      * Add DeviceCodePool to manage all device codes.
      
      * Add the first implementation fusion_group op.
      
      * Add unit-test for fusion_group op.
      
      * Add the check of result.
      
      * Add the check of nvrtc in unit-test.
      test=develop
      
      * Add comment to explain the inputs, outputs and features of fusion_group op.
      test=develop
      
      * Disable fusion_group op for mac and windows.
      test=develop
      
      * Make the compiling of device code return status instead of hanging up.
      test=develop
      
      * Add the check of whether there is CUDA driver library, and do not core dump when failing to call the CUDA driver API.
      
      * Unify fusion_group_op's input and output names.
      test=develop
      
      * Add the check of CUDA driver library in unittest.
      test=develop
      
      * Enable generating code for a given subgraph. #21126 (#4)
      
      * Enable generating code for a given subgraph.
      
      * Support sorting the subgraph.
      
      * Remove the rearrangement of expressions because we use the sorted subgraph directly.
      
      * Enable generating code for a subgraph which is composed of grad ops.
      
      * Use expression information to check the accuracy in unittest.
      
      * Separate load and store from computation expressions.
      test=develop
      
      * Improve the loading statements in generated codes.
      test=develop
      
      * Remove unused arguments from formal list.
      test=develop
      
      * Enable the detection of subgraph of grad ops.
      
      * Generate code for detected subgraph in fusion_group_pass.
      
      * Add an option in BuildStrategy to enable fusion_group_pass and add unittest.
      test=develop
      
      * Fix a bug when checking whether the shape of all inputs are the same.
      
      * Add debug information.
      
      * Move subgraph_detector from inference/analysis to the common framework/ir directory. (#5)
      
      test=develop
      
      * Call subgraph_detector in fusion_group pass.
      test=develop
      
      * Disable fusion_group when WITH_GPU is OFF.
      test=develop
      
      * Refine all PADDLE_ENFORCE message.
      test=develop
      
      * Fix the case that some inputs are not defined in grad ops, and set op_role for fused op.
      test=develop
      
      * Follow review comments.
      test=develop
  32. 05 Feb, 2020 1 commit
  33. 17 Jan, 2020 1 commit
  34. 13 Jan, 2020 1 commit
  35. 10 Jan, 2020 1 commit
    • Z
      Add bn and relu fuse pass (#22048) · 46189b16
      Zhen Wang committed
      * add bn and relu fuse pass
      
      * add op attr assert and dtype assert
      
      * fix some inputs&&outputs bugs for the fused op and pattern.
      
      * add the unittest for fuse_bn_act_pass. test=develop
      
      * use normative enforce statements. test=develop
      
      * add the cpu test. test=develop
      
      * add the support of batch_size=1 for the bn with relu op. test=develop
      
      * add the error type for paddle throws. test=develop
      
      * add fused_batch_norm_act and fused_batch_norm_act_grad to op_has_unsed_vars_white_list. test=develop
  36. 19 Dec, 2019 1 commit
  37. 18 Dec, 2019 1 commit