1. 16 9月, 2021 4 次提交
  2. 15 9月, 2021 4 次提交
  3. 14 9月, 2021 2 次提交
  4. 11 9月, 2021 1 次提交
  5. 10 9月, 2021 1 次提交
  6. 09 9月, 2021 1 次提交
    • 0
      Add matrix_rank Op and it's GPU and CPU kernel (#34823) · eb1fbf12
      0x45f 提交于
      * init matrix_rank op, add matrix_rank CPU code and test
      
      * add GPU kernel, remove svd_eigen.h
      
      * add CPU kernel when tol is tensor
      
      * add cpu and gpu code when tol is tensor
      
      * fix CI-ROCM error
      
      * add matrix_rank API describe, fix PR-CI-Py3 error
      
      * fix PR-CI-Windows error, add matrix_rank API test
      
      * delete useless comments
      
      * fix review
      
      * add my code in svd_helper.h
      
      * update doc commets
      
      * remove spaces
      eb1fbf12
  7. 08 9月, 2021 4 次提交
  8. 07 9月, 2021 1 次提交
  9. 06 9月, 2021 1 次提交
  10. 04 9月, 2021 1 次提交
  11. 02 9月, 2021 1 次提交
  12. 01 9月, 2021 1 次提交
    • Z
      Support settiem by Bool index (#35133) · d387820d
      zyfncg 提交于
      * Support getitem by Bool index
      
      * delete some debug info of bool index
      
      * support the case that the shape of bool index is different from indexed tensor
      
      * support setitem by bool index
      
      * add the unittest for throwing exception
      
      * merge conflict
      
      * add check for int tensor when index is bool
      d387820d
  13. 31 8月, 2021 2 次提交
  14. 30 8月, 2021 1 次提交
  15. 27 8月, 2021 2 次提交
  16. 26 8月, 2021 3 次提交
    • S
      Add paddle.utils.dlpack APIs (#35067) · 8dc050d8
      Siming Dai 提交于
      * add dlpack api and fix a from_dlpack 
      8dc050d8
    • W
      support tensor index. (#34824) · e7df47ec
      WeiXin 提交于
      * polish code
      
      * polish code.
      
      * polish code.
      
      * polish code.
      
      * polish code.
      e7df47ec
    • S
      Add copy from tensor (#34406) · ac33c0ca
      Shang Zhizhou 提交于
      * add api
      
      * temp save
      
      * revert
      
      * copytocpu async ok
      
      * fix style
      
      * copy sync ok
      
      * fix compile error
      
      * fix compile error
      
      * api done
      
      * update python async api
      
      * fix compile
      
      * remove async python api; add c++ async unittest
      
      * remove python async api
      
      * update unittest
      
      * update unittest
      
      * add C++ unittest for copytensor
      
      * add unittest
      
      * update namespace utils to class TensorUtils
      
      * add unittest
      
      * update unittest
      
      * update unittest
      
      * update code style
      
      * update code style
      
      * update unittest
      ac33c0ca
  17. 25 8月, 2021 1 次提交
    • L
      fix potential tensor leak in tensor.__setitem__ (#35013) · 763b6d91
      Leo Chen 提交于
      * fix index tensor leak in __setitem__
      
      * fix another usage of PyTuple_Pack
      
      * refine code
      
      * refine code
      
      * handle None index
      
      * add Py_DecRef
      
      * revert ut
      
      * refine code
      
      * merge develop
      
      * use RAII
      
      * follow comments
      763b6d91
  18. 24 8月, 2021 2 次提交
    • W
      add fetch, test=develop (#35019) · a5060b55
      wanghuancoder 提交于
      * add fetch, test=develop
      
      * fix fetch2op, test=develop
      
      * fix fetch2op, test=develop
      
      * refine, test=develop
      
      * fix fetch ctx, test=develop
      
      * add wait, test=develop
      
      * rename fetch2 to fetch_v2, test=develop
      
      * merge, test=develop
      a5060b55
    • Y
      Add auto completion module for auto parallel (#34813) · 93d862b0
      Yulong Ao 提交于
      * add auto_parallel dir
      
      * mv to paddle.distributed
      
      * add shard_xx api
      
      * add distributed attrs for var
      
      * add ut, test=develop
      
      * add dist
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update, test=develop
      
      * update, test=develop
      
      * update, test=develop
      
      * update, test=develop
      
      * update, test=develop
      
      * update, test=develop
      
      * update, test=develop
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update, test=develop
      
      * update, test=develop
      
      * update
      
      * update
      
      * delete unused proto
      
      * resotre op_desc
      
      * restore type_defs
      
      * update var_desc
      
      * remove dimss_mapping for proto_pybind
      
      * update interface.py
      
      * update framework.py
      
      * update
      
      * update
      
      * add auto_parallel dir
      
      * mv to paddle.distributed
      
      * add shard_xx api
      
      * add distributed attrs for var
      
      * add ut, test=develop
      
      * [WIP] Add the auto completion feature and related codes
      
      * [WIP] Improve the auto completion and related codes
      
      * [WIP] Make the auto completion to support data-parallel
      
      * [WIP] Make the completion support mp and dp+mp
      
      * [WIP] Refactor auto completion unit test for MLP
      
      * [WIP] Refactor the implementation of DistributedOperatorImpl
      
      * [WIP] Improve dims_mapping update rule and fix a bug
      
      * [WIP] Support auto completion for one transformer decoder layer
      
      * [WIP] Add a minor change
      
      * [WIP] Fix a bug within the uint test
      
      * Shard XShape tensor, add embedding completion and refactor code
      
      * Add the distributed_operators dir to setup.py.in
      
      * Improve the completion process and add the unittest for gpt
      
      * fix process_mesh ut
      
      * fix process_mesh ut
      
      * update
      
      * update, test=develop
      
      * Add support for automatically completing distributed attrs of special ops
      
      * update
      
      * update
      
      * update
      
      * fix doc sample codes, test=develop
      
      * improve coverage, test=develop
      
      * add static_mode check, test=develop
      
      * Model the cluster for cost model and physical mapping
      
      * update, test=develop
      
      * add set_placement, test=develop
      
      * Add the check to make sure the candidate tensors' size is great than zero
      
      * update doc, test=develop
      
      * update doc, test=develop
      
      * update doc, test=develop
      
      * update doc, test=develop
      
      * update, test=develop
      
      * Auto mark dist attrs annotated by user
      
      * update ndarray to nested list, test=develop
      
      * update, test=develop
      
      * Add auto-completion module for auto-parallel (based on PR#33804)
      
      * Remove unnecessary files
      
      * Remove unrelated files for the auto completion pr
      
      * Update the unit test to improve the coverage
      
      * Modify codes based on reviews
      
      * Minor changes for CI
      
      * Improve some codes based on new comments
      
      * Fix bugs caused by shallow copy in attributes.py
      * Imporve amend_distributed_attr_for_program in context.py
      * Other changes for weihang's comments
      Co-authored-by: Nsandyhouse <lilong12@baidu.com>
      93d862b0
  19. 23 8月, 2021 4 次提交
  20. 19 8月, 2021 1 次提交
  21. 18 8月, 2021 2 次提交
    • W
      code refactoring for new executor (#34970) · 40d4d834
      wanghuancoder 提交于
      * code refactoring, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      40d4d834
    • Z
      Add function to disable paddle signal handler (#34577) · dd533dd3
      Zhanlue Yang 提交于
      * Add function to disable paddle signal handler
      
      Paddle used google::InstallFaultSignalHandler to handle selected system signals,
      mainly for debugging and bug report purposes.
      
      However, this can be conflicted with other python packages whoever captures similar signals.
      Such python package involves tvm and more
      
      To resolve this issue, we support a function to disable signal handler
      
      * Remove signal test from WIN32 platform
      
      * Remove redundant return from disable_signal_handler() function
      
      * Add detailed messages to en_doc
      dd533dd3