- 10 6月, 2022 1 次提交
-
-
由 Weilong Wu 提交于
* Add option for test executor profiler * Change option for test executor_profiler
-
- 07 6月, 2022 1 次提交
-
-
由 Haohongxiang 提交于
* fix bugs of reducer * update * update
-
- 04 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 30 5月, 2022 3 次提交
-
-
由 Li Min 提交于
* add fused_bias_dropout_residual_ln op and layer.
-
由 crystal 提交于
-
由 zhangchunle 提交于
-
- 28 5月, 2022 1 次提交
-
-
由 ShenLiang 提交于
* fix alltoall * rename utest
-
- 27 5月, 2022 1 次提交
-
-
由 Baibaifan 提交于
-
- 23 5月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Reduce test case for test_tensordot * Fix CI errors
-
- 16 5月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 13 5月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Refactor test_tensordot * Add test_static * Fix CI errors
-
- 10 5月, 2022 1 次提交
-
-
由 Weilong Wu 提交于
* [Eager] fix sharding under eager mode * [Eager] fix several sharding test under eager mode * Recover using _test_eager_guard * Ensured fleet.init under legacy * Ensured fleet.init under legacy * Fix CI issue, re-definition strategy and call fleet.init() in stage2_offload * Modified dygraph_group_sharded_api.py, move fleet.init to a better line
-
- 05 5月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
-
- 28 4月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* add gradient merge for DistributedFusedLamb * use master acc gradient * fix CI ut * polish * remove math_function_impl.h change * fix test_update_loss_scaling_op.py * try to fix XPU/NPU CI * add gm ut
-
- 26 4月, 2022 2 次提交
- 22 4月, 2022 1 次提交
-
-
由 Ming-Xu Huang 提交于
* Fix leading dimension setting error in fused_gemm_epilogue_grad_op. * Add dyload to cuBlasLt functions. * Added cublasLtMatmulAlgoGetHeuristic to improve performance. * Added FLAGS_cublaslt_exhaustive_search_times to cublasLt epilogue * Added UTs to FLAGS_cublaslt_exhaustive_search_times * Added warmup runs in algo searching of Gemm epilogue. * Update copyright and documents. * Fixed error handling.
-
- 19 4月, 2022 1 次提交
-
-
由 ShenLiang 提交于
* fix utest * fix time
-
- 13 4月, 2022 2 次提交
-
-
由 Ruibiao Chen 提交于
* Add yaml for deformable_conv and deformable_conv_v1 OPs * Add UT * Add to skipped_phi_api list for infrt
-
由 Baibaifan 提交于
-
- 12 4月, 2022 1 次提交
-
-
由 Leo Chen 提交于
-
- 08 4月, 2022 1 次提交
-
-
由 lilong12 提交于
-
- 07 4月, 2022 2 次提交
-
-
由 0x45f 提交于
* Sitch some dy2st UT to eager mode * Fix test_lstm and remove test_transformer * Run test_resnet_v2 in old dy mode
-
由 QingshuChen 提交于
* ignore some failed test for KL2 *test=kunlun * minor *test=kunlun * minor *test=kunlun
-
- 06 4月, 2022 1 次提交
-
-
由 Weilong Wu 提交于
* [Eager] Support test_layers's test cases switch to eager mode * Update batch_norm _C_ops action to fix CI * Use None instead of new EmptyTensor * Updated var name * Make sure to switch eager mode, Fix Coverage_CI * Remove _non_static_mode statement * Remove batch_norm dispensable input statement * Polish batch_norm code * Fix CI issue
-
- 05 4月, 2022 4 次提交
-
-
由 Haohongxiang 提交于
* support process group in dp with fleet api * update * fix uts * update
-
由 RichardWooSJTU 提交于
* add nms op and batched_nms api
-
由 Leo Chen 提交于
* enable new executor by default * enable stream safe allocator * test=document_fix;test=coverage * do not use scope in op kernel * fit empty program for new executor * fix communication depend * fix test_sync_batch_norm * skip unsupported place * refine datatransfer * fit for dirtributed program * fix dependencpy * fix some ut
-
由 Chen Weihang 提交于
-
- 04 4月, 2022 3 次提交
-
-
由 Haohongxiang 提交于
* [Dygraph] Support sparse tensor in refactored reducer * add uts * refactor * update * fix bugs
-
由 Chen Weihang 提交于
* add infershape and forward yaml * add final_state call * add base unittests * add backward yaml and test * fix without softmax test error * add cross_entropy test
-
由 From00 提交于
* Add yaml for reduce_sum OP * Fix CI errors * Fix CI errors * Fix CI errors * Fix CI errors
-
- 02 4月, 2022 1 次提交
-
-
由 lilong12 提交于
-
- 30 3月, 2022 2 次提交
-
-
由 From00 提交于
Add new APIs for GPU memory monitoring (max_memory_allocated, max_memory_reserved, memory_allocated, memory_reserved) (#38657) * Add new API memory_reserved * Add memory_allocated, max_memory_reserved and max_memory_allocater * Fix CI error * Fix CI error * Enhance UT * Add FLAGS_memory_stats_opt * Add STATS macro functions * Add StatAllocator * Fix CI errors * Add UT * Fix CI errors
-
由 pangyoki 提交于
* suppor inplace in tensor_method_setitem * delete bump_inplace_version * optimize inplace unittest * fix * fix setitem bug * update eager_generator * optimize inplace unittest * little change
-
- 28 3月, 2022 1 次提交
-
-
由 Haohongxiang 提交于
* add uts for EagerReducer * add more uts * fix bugs * fix bugs * modify * modify uts * fix bugs * update * update * update * solve conflicts and merge * add some other uts * modify time of uts * update * update * update * remove uts of resnet
-
- 25 3月, 2022 1 次提交
-
-
由 Jiabin Yang 提交于
* refactor eager flags * fix flags error when we switch from eager to dygraph * fix ci problem * fix ci * fix ci * merge develop and fix code style * merge develop and fix code style * fix op test error * fix op test error * fix op test error * fix op test error * fix op test error * merge develop
-
- 24 3月, 2022 1 次提交
-
-
由 lilong12 提交于
-
- 21 3月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 19 3月, 2022 1 次提交
-
-
由 pangyoki 提交于
* [Eager] Support eager grad interface, draft version * Support eager grad interface with allow_unused and multi startup_op * Fix code format * Fix allow_unused case, return PyNone if tensor not initialize * Support output's stop_gradient related to create_graph * Support grad exception case in eager mode, fix coverage CI * Update ToPyObject, return PyNone if not initialize * AccumulationNode add FLAGS_retain_grad_for_all_tensor * Fix ci issue * Fix CI issue * fix, use core.eager.Tensor * Add func SetBufferSlotRankZeros for GradTensorHolder * Support retain_graph by using ClearTensorWrappers * Support retain_graph by using ClearTensorWrappers * Update retain_graph and no_grad_vars related test case * Update code gen logic for ClearTensorWrappers * Fix by override statement * fix override func args * Support retain_graph, update unit tests * Updated ClearTensorWrappers logic * fix grad python interface * Use deep copy and update unit tests * Polish code * Polish code * Fix CI issue, Deep copy only use when user set grad_tensors * Fix CI, use Backward instead RunBackward * Fix CI, Declare kernel explicitly in test file * Polish, remove vector of TensorWrapper * Refactor the logic of grad/backward, polish codes * Update code after merge upstream develop * Polish after merge upstream develop * Update to adapt new GradNodeBase superclass * Fix error introduced during conflict resolution * support inplace strategy in eager_fluid state * solve conflict * nothing * Update purify potential_startup_nodes logic * Fix errors * Polish code * Remove useless args for ToPyObject * Remove useless TensorWrappersSet * fix record conflict * Fix code-format, re-install pre-commit * fix tensor_wrapper bug * Fix pre-process logic for potential_startup_ops * Update unit tests, use eager mode * Fix conflicts * fix unittest timeout * little change Co-authored-by: NWeilong Wu <veyron_wu@163.com>
-