- 18 1月, 2022 1 次提交
-
-
由 Zhanlue Yang 提交于
* Merged LoDTensor with Tensor,test=allcases * Patched python level LoDTensor * Patched python level LoDTensor * Merge Tensor into DenseTensor * Fixed namespace issues,test=allcases * Fixed merge issues * Fixed inference issues * Fixed NPU test issues * Fixed merge issues
-
- 17 1月, 2022 1 次提交
-
-
由 Wilber 提交于
* add pten::Place data structure. * update ci problem * fix ci problem * update * using platform::Place=pten::Place * remove BOOST_GET_CONST for CPUPlace and GPUPlace * compile pass 25%. * compile pass 45% * compile pass 60% * remove boost_get for xpu npu mlu and ipu * compile pass on cpu and gpu. * fix compile problem * fix compile error. * update * fix ci problem * update * ci approve * fix ci problem * fix ci eager test problem * remove BOOST_GET_CONST * fix npu compile
-
- 27 12月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* fix bug in pfp16 * fix hip * fix hip
-
- 10 12月, 2021 1 次提交
-
-
由 kuizhiqing 提交于
-
- 06 12月, 2021 1 次提交
-
-
由 kuizhiqing 提交于
-
- 23 11月, 2021 1 次提交
-
-
由 ronnywang 提交于
* Added HCCL backend support in dynamic graph mode * fix segmentation fault * add ut
-
- 14 9月, 2021 1 次提交
-
-
由 Haohongxiang 提交于
* Add solutions to PyLayer which is unsupported in DataParallel * modify note format for parallel.py * modify docs of dataparallel * add docs of dp with pylayer * modify docs format * modify example format * change example of dp with pylayer * add unittest for dp with pylayer * modify ut * merge latest codes * update * modify for CI-Coverage * modify text-indent
-
- 08 9月, 2021 1 次提交
-
-
由 xiongkun 提交于
* can pass the fake test * add files * modify cmake to pass windows-ci * for ci pass * WITH_GLOO=ON * for pass coverage test * add cpuonly testcase * add * disable nccl when compile with cuda * change python version in cpuonly * add backend argument * add required gpu * add required:gpu
-
- 24 8月, 2021 1 次提交
-
-
由 Haohongxiang 提交于
* Add no_sync in data parallel for dynamic graph * modify UT of no_sync * delete test_parallel_dygraph_dataparallel_no_sync.py * add test_parallel_dygraph_no_sync.py * modify run_trainer_with_spawn in UTs * Add UT of complex control flow in no_sync * add specific descriptions and notes for no_sync * check code style * modify UT's TIMEOUT in CMakeLists.txt
-
- 11 5月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* fix find_unused_parameters default value
-
- 01 5月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 29 4月, 2021 1 次提交
-
-
由 liuyuhui 提交于
-
- 14 4月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
* add register backward hook method * add leaf grad accumullated test
-
- 01 4月, 2021 2 次提交
-
-
由 ShenLiang 提交于
* support control flow * supoort sync_parameters_buffers * fix the bug of sparse embedding
-
由 Chen Weihang 提交于
* refactor and simplify hook design * fix reducer add hook error * add Tensor.register_hook basic impl * refine prepare data impl * revert prepare data change * support register_hook for Tensor * add hook test in model * polish tests and doc example * fix double grad test failed * remove reduce hook func * fix set empty error * polish code by comments * change reduce_hook to mutable_hook * remove useless tmp_ins * fix shape code format error * fix shape code format error
-
- 09 3月, 2021 1 次提交
-
-
由 Qi Li 提交于
-
- 05 3月, 2021 1 次提交
-
-
由 liuyuhui 提交于
[Kunlun]Multi xpu dygraph performance optimization , add distributed.spawn support for multi xpu and some bug-fixes (#31130)
-
- 22 2月, 2021 1 次提交
-
-
由 Qi Li 提交于
* [ROCM] update fluid imperative for rocm (part1), test=develop * [ROCM] update reducer.cc after merge, test=develop * update reducer cmake after merge, test=develop
-
- 19 2月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 09 2月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 08 2月, 2021 1 次提交
-
-
由 liuyuhui 提交于
-
- 04 2月, 2021 1 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include second time, test=develop
-
- 03 2月, 2021 1 次提交
-
-
由 WangXi 提交于
-
- 29 1月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 19 1月, 2021 1 次提交
-
-
由 WangXi 提交于
-
- 13 1月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 22 12月, 2020 1 次提交
-
-
由 ShenLiang 提交于
* fix fleet for multi-stream * fix memcpy for ncclid * use sync to solve move operation
-
- 09 12月, 2020 1 次提交
-
-
由 ShenLiang 提交于
* add tensor_indices in AssignGroupBySize * add rebuild group in reducer
-
- 03 12月, 2020 1 次提交
-
-
由 ShenLiang 提交于
-
- 27 11月, 2020 1 次提交
-
-
由 ShenLiang 提交于
* add reducer * refine envent for memorycopy * add concat&split for allreduce * apply concat & split for fuse tensor * fix nccl dep * fix the untest, compile problem and ddp initialize problem * fix untest for mac & add some comments & solve the repeated param in sublayers * fix untest for windows & fix document
-