1. 07 Jun, 2022 1 commit
  2. 04 Jun, 2022 1 commit
  3. 30 May, 2022 3 commits
  4. 28 May, 2022 1 commit
  5. 27 May, 2022 1 commit
  6. 23 May, 2022 1 commit
  7. 16 May, 2022 1 commit
  8. 13 May, 2022 1 commit
  9. 10 May, 2022 1 commit
    • [Eager] Refactor several sharding test (#42608) · 668a0a41
      Weilong Wu committed
      * [Eager] fix sharding under eager mode
      
      * [Eager] fix several sharding test under eager mode
      
      * Recover using _test_eager_guard
      
      * Ensured fleet.init under legacy
      
      * Ensured fleet.init under legacy
      
      * Fix CI issue, re-definition strategy and call fleet.init() in stage2_offload
      
      * Modified dygraph_group_sharded_api.py, move fleet.init to a better line
  10. 05 May, 2022 1 commit
  11. 28 Apr, 2022 1 commit
  12. 26 Apr, 2022 2 commits
  13. 22 Apr, 2022 1 commit
    • [WIP] Algorithm Cache of cuBlasLt Epilogue (#41010) · 19650d72
      Ming-Xu Huang committed
      * Fix leading dimension setting error in fused_gemm_epilogue_grad_op.
      
      * Add dyload to cuBlasLt functions.
      
      * Added cublasLtMatmulAlgoGetHeuristic to improve performance.
      
      * Added FLAGS_cublaslt_exhaustive_search_times to cublasLt epilogue
      
      * Added UTs to FLAGS_cublaslt_exhaustive_search_times
      
      * Added warmup runs in algo searching of Gemm epilogue.
      
      * Update copyright and documents.
      
      * Fixed error handling.
  14. 19 Apr, 2022 1 commit
  15. 13 Apr, 2022 2 commits
  16. 12 Apr, 2022 1 commit
  17. 08 Apr, 2022 1 commit
  18. 07 Apr, 2022 2 commits
  19. 06 Apr, 2022 1 commit
    • [Eager] Support test_layers's test cases switch to eager mode (#41216) · 5ae8babb
      Weilong Wu committed
      * [Eager] Support test_layers's test cases switch to eager mode
      
      * Update batch_norm _C_ops action to fix CI
      
      * Use None instead of new EmptyTensor
      
      * Updated var name
      
      * Make sure to switch eager mode, Fix Coverage_CI
      
      * Remove _non_static_mode statement
      
      * Remove batch_norm dispensable input statement
      
      * Polish batch_norm code
      
      * Fix CI issue
  20. 05 Apr, 2022 4 commits
  21. 04 Apr, 2022 3 commits
  22. 02 Apr, 2022 1 commit
  23. 30 Mar, 2022 2 commits
    • Add new APIs for GPU memory monitoring (max_memory_allocated, max_memory_reserved, memory_allocated, memory_reserved) (#38657) · afe02e9d
      From00 committed
      
      * Add new API memory_reserved
      
      * Add memory_allocated, max_memory_reserved and max_memory_allocated
      
      * Fix CI error
      
      * Fix CI error
      
      * Enhance UT
      
      * Add FLAGS_memory_stats_opt
      
      * Add STATS macro functions
      
      * Add StatAllocator
      
      * Fix CI errors
      
      * Add UT
      
      * Fix CI errors
    • support inplace in tensor_method_setitem (#40915) · 7170c687
      pangyoki committed
      * support inplace in tensor_method_setitem
      
      * delete bump_inplace_version
      
      * optimize inplace unittest
      
      * fix
      
      * fix setitem bug
      
      * update eager_generator
      
      * optimize inplace unittest
      
      * little change
  24. 28 Mar, 2022 1 commit
  25. 25 Mar, 2022 1 commit
    • Refactor Dygraph Flags (#40786) · 3085d5e4
      Jiabin Yang committed
      * refactor eager flags
      
      * fix flags error when we switch from eager to dygraph
      
      * fix ci problem
      
      * fix ci
      
      * fix ci
      
      * merge develop and fix code style
      
      * merge develop and fix code style
      
      * fix op test error
      
      * fix op test error
      
      * fix op test error
      
      * fix op test error
      
      * fix op test error
      
      * merge develop
  26. 24 Mar, 2022 1 commit
  27. 21 Mar, 2022 1 commit
  28. 19 Mar, 2022 1 commit
    • support inplace in dygraph eager_fluid state (#40400) · 8e612903
      pangyoki committed
      * [Eager] Support eager grad interface, draft version
      
      * Support eager grad interface with allow_unused and multi startup_op
      
      * Fix code format
      
      * Fix allow_unused case, return PyNone if tensor not initialize
      
      * Support output's stop_gradient related to create_graph
      
      * Support grad exception case in eager mode, fix coverage CI
      
      * Update ToPyObject, return PyNone if not initialize
      
      * AccumulationNode add FLAGS_retain_grad_for_all_tensor
      
      * Fix ci issue
      
      * Fix CI issue
      
      * fix, use core.eager.Tensor
      
      * Add func SetBufferSlotRankZeros for GradTensorHolder
      
      * Support retain_graph by using ClearTensorWrappers
      
      * Support retain_graph by using ClearTensorWrappers
      
      * Update retain_graph and no_grad_vars related test case
      
      * Update code gen logic for ClearTensorWrappers
      
      * Fix by override statement
      
      * fix override func args
      
      * Support retain_graph, update unit tests
      
      * Updated ClearTensorWrappers logic
      
      * fix grad python interface
      
      * Use deep copy and update unit tests
      
      * Polish code
      
      * Polish code
      
      * Fix CI issue, Deep copy only use when user set grad_tensors
      
      * Fix CI, use Backward instead RunBackward
      
      * Fix CI, Declare kernel explicitly in test file
      
      * Polish, remove vector of TensorWrapper
      
      * Refactor the logic of grad/backward, polish codes
      
      * Update code after merge upstream develop
      
      * Polish after merge upstream develop
      
      * Update to adapt new GradNodeBase superclass
      
      * Fix error introduced during conflict resolution
      
      * support inplace strategy in eager_fluid state
      
      * solve conflict
      
      * nothing
      
      * Update purify potential_startup_nodes logic
      
      * Fix errors
      
      * Polish code
      
      * Remove useless args for ToPyObject
      
      * Remove useless TensorWrappersSet
      
      * fix record conflict
      
      * Fix code-format, re-install pre-commit
      
      * fix tensor_wrapper bug
      
      * Fix pre-process logic for potential_startup_ops
      
      * Update unit tests, use eager mode
      
      * Fix conflicts
      
      * fix unittest timeout
      
      * little change
      Co-authored-by: Weilong Wu <veyron_wu@163.com>
  29. 17 Mar, 2022 1 commit