1. 16 9月, 2021 1 次提交
  2. 14 9月, 2021 1 次提交
    • S
      windows third party cache optimization: share third party cache among servers (#35368) · e919620a
      Sing_chan 提交于
      * new function: share third party cache among servers to fasten build speed
      
      * modified code according to zhouwei25's comment
      
      * add wget install step, move cd build to the last of if condition
      
      * block note and error of third_party share; change bce upload method
      
      * change third_party sub_dir in bos, since third party in different cuda version cant share
      
      * set sub_dir by get nvcc version
      
      * change third_party local path to be same with bos path
      e919620a
  3. 13 9月, 2021 1 次提交
  4. 09 9月, 2021 1 次提交
    • 0
      Add matrix_rank Op and it's GPU and CPU kernel (#34823) · eb1fbf12
      0x45f 提交于
      * init matrix_rank op, add matrix_rank CPU code and test
      
      * add GPU kernel, remove svd_eigen.h
      
      * add CPU kernel when tol is tensor
      
      * add cpu and gpu code when tol is tensor
      
      * fix CI-ROCM error
      
      * add matrix_rank API describe, fix PR-CI-Py3 error
      
      * fix PR-CI-Windows error, add matrix_rank API test
      
      * delete useless comments
      
      * fix review
      
      * add my code in svd_helper.h
      
      * update doc commets
      
      * remove spaces
      eb1fbf12
  5. 03 9月, 2021 2 次提交
  6. 02 9月, 2021 1 次提交
    • X
      Add SVD Op and it's GPU and CPU kernel (#34953) · 7e5fb462
      xiongkun 提交于
      * Add SVD Op and it's GPU and CPU kernel
      
      * Remove CUDAPlace in test_svd_op, make the test available in CPU package
      
      * modfity the file
      
      * fix windows bug/ fix ROCM / fix test timeout
      
      * for pass the CIs
      
      * improve error report
      
      * for code review
      
      * some modification to test_svd_op
      
      * change python code style
      
      * expose the svd interface for document
      7e5fb462
  7. 01 9月, 2021 1 次提交
  8. 31 8月, 2021 4 次提交
    • S
      Revert "Revert "Add copy from tensor (#34406)" (#35173)" (#35256) · 6116f9af
      Shang Zhizhou 提交于
      * Revert "Revert "Add copy from tensor (#34406)" (#35173)"
      
      This reverts commit 32c1ec42.
      
      * add template instantiation
      6116f9af
    • zhouweiwei2014's avatar
      fix bug that cmake find python (#35304) · 00c9aeb0
      zhouweiwei2014 提交于
      00c9aeb0
    • Z
      New whl release strategy with pruned nv_fatbin (#35239) · 2f3b393d
      Zhanlue Yang 提交于
      [Background]
      Expansion in code size can be irreversible in the long run, leading to huge release packages which
      not only hampers user experience but also exceeds a hard limit of pypi.
      
      In such, NV_FATBIN section takes up 86% of the compiled dylib size, owing to the vast number of GPU
      arches supported.
      
      This PR aims to prune this NV_FATBIN.
      
      [Solution]
      In the new release strategy, two types of whl packages will be involved:
      
      Cubin PIP package:
      PIP package maintains a smaller window for GPU arches support, containing
      sm_60, sm_70, sm_75, sm_80 cubins, covering Pascal - Ampere arches
      
      JIT release package:
      This is a backup for Cubin PIP package, containing compute_35, compute_50, compute_60,
      compute_70, compute_75, compute_80, with best performance and GPU arches coverage.
      
      However, it takes around 10 min to install due to the JIT compilation.
      
      [How to use]
      The new release strategy is disabled by default.
      To compile for Cubin PIP package, add this to cmake: -DCUBIN_RELEASE_PIP
      To compile for JIT release package, add this to cmake: -DJIT_RELEASE_WHL
      2f3b393d
    • W
      fix CI skip cc test error (#35264) · 3d76d003
      wuhuanzhou 提交于
      * fix CI skip cc test error, test=develop
      
      * remove test code, test=develop
      3d76d003
  9. 27 8月, 2021 1 次提交
  10. 26 8月, 2021 1 次提交
    • S
      Add copy from tensor (#34406) · ac33c0ca
      Shang Zhizhou 提交于
      * add api
      
      * temp save
      
      * revert
      
      * copytocpu async ok
      
      * fix style
      
      * copy sync ok
      
      * fix compile error
      
      * fix compile error
      
      * api done
      
      * update python async api
      
      * fix compile
      
      * remove async python api; add c++ async unittest
      
      * remove python async api
      
      * update unittest
      
      * update unittest
      
      * add C++ unittest for copytensor
      
      * add unittest
      
      * update namespace utils to class TensorUtils
      
      * add unittest
      
      * update unittest
      
      * update unittest
      
      * update code style
      
      * update code style
      
      * update unittest
      ac33c0ca
  11. 25 8月, 2021 2 次提交
  12. 23 8月, 2021 1 次提交
  13. 16 8月, 2021 1 次提交
  14. 10 8月, 2021 1 次提交
    • C
      copy boost/any.hpp to utils and replace boost::any with self defined any (#34613) · 12892929
      chentianyu03 提交于
      * add any.hpp to utils and replace boost::any with self defined paddle::any
      
      * add copy any.hpp to custom op depends
      
      * modify any.hpp include path
      
      * remove boost from setup.py.in
      
      * add copy any.hpp to custom op depends
      
      * move any.hpp to paddle/utils/ dirs
      
      * move any.h to extension/include direction
      
      * copy utils to right directions
      12892929
  15. 09 8月, 2021 1 次提交
  16. 06 8月, 2021 1 次提交
  17. 03 8月, 2021 1 次提交
  18. 29 7月, 2021 1 次提交
  19. 21 7月, 2021 1 次提交
  20. 14 7月, 2021 2 次提交
  21. 07 7月, 2021 1 次提交
  22. 06 7月, 2021 1 次提交
    • Z
      Add gpu implementation of shuffle_batch_op (#33938) · c6b6ba1f
      Zeng Jinle 提交于
      * add gpu implementation of shuffle batch
      test=develop
      
      * add thrust cuda patches
      test=develop
      
      * fix macro guard
      
      * fix shuffle batch compile on windows/hip
      
      * fix hip compilation error
      
      * refine CMakeLists.txt
      
      * fix windows compile error
      
      * try to fix windows CI compilation error
      
      * fix windows compilation again
      
      * fix shuffle_batch op test on Windows
      c6b6ba1f
  23. 02 7月, 2021 2 次提交
  24. 29 6月, 2021 2 次提交
  25. 24 6月, 2021 1 次提交
  26. 22 6月, 2021 1 次提交
  27. 21 6月, 2021 1 次提交
  28. 18 6月, 2021 2 次提交
  29. 17 6月, 2021 1 次提交
  30. 16 6月, 2021 1 次提交
  31. 15 6月, 2021 1 次提交