- 29 Mar, 2023 2 commits
-
-
Committed by QingshuChen
-
Committed by jameszhang
* [kunlun] support min/max in dygraph mode
* update xccl to 1.0.13
-
- 28 Mar, 2023 1 commit
-
-
Committed by ronnywang
-
- 27 Mar, 2023 1 commit
-
-
Committed by HappyHeavyRain
* add fused_yaml fused_backward
* fix eager_funciton bug
* add some comment of fused yaml file
* add 'support_dygraph_mode' configuration in fused yaml
* delete some 'fused_api.h' in include file
* add fused flag in api_gen
-
- 21 Mar, 2023 1 commit
-
-
Committed by iSerendipity
* move DataType from paddle::experimental to phi
* convert namespace
* convert namespace
* convert namespace
* clarify namespace
* convert more datatype
* Revert "convert more datatype"
  This reverts commit 083b462959e6a22d4d8767707b628b95b396642e.
* convert more in auto_code_generator
* fix conflicts for XPU
* fix namespace conflicts
* fix errors
* Revert "fix errors"
  This reverts commit f9d9958b54ee32141112274c8a5c3c381ab0f876.
* fix errors
* fix formatting
-
- 16 Mar, 2023 1 commit
-
-
Committed by Huang Jiyi
* remove contexts in tensor_utils
* update from_blob
* update from_blob
* update from_blob
* fix bug
* fix bug
-
- 15 Mar, 2023 1 commit
-
-
Committed by ronnywang
* [CustomDevice] fix SyncDefaultStream for process_group_custom
* update
-
- 13 Mar, 2023 1 commit
-
-
Committed by ronnywang
* add UpdateWaitChain for process_group_custom
* add UpdateWaitChain for process_group_custom
-
- 08 Mar, 2023 1 commit
-
-
Committed by chenxujun
-
- 07 Mar, 2023 2 commits
-
-
Committed by Chen Weihang
-
Committed by Kai Song
-
- 03 Mar, 2023 1 commit
-
-
Committed by ronnywang
* [CustomDevice] fix process_group_custom api
* update
* update
* update
* update
-
- 27 Feb, 2023 1 commit
-
-
Committed by jameszhang
* [kunlun] support reduce_scatter
* uncomment unittest
* update xccl to 1.0.10
-
- 22 Feb, 2023 1 commit
-
-
Committed by zhouweiwei2014
-
- 08 Feb, 2023 2 commits
-
-
Committed by risemeup1
-
Committed by Huang Jiyi
-
- 20 Jan, 2023 1 commit
-
-
Committed by jameszhang
* update xccl lib & use native Reduce in dygraph
* minor
-
- 18 Jan, 2023 2 commits
-
-
Committed by jameszhang
-
Committed by jameszhang
* revert to use default XPU stream for computing
  XPUContext now has a null stream by default. If you want to use a separate stream (e.g. in async collective communication), you should create a dedicated XPUContext and invoke its XPUContext::CreateStream()
* minor
-
- 15 Jan, 2023 1 commit
-
-
Committed by Roc
1. update xccl lib
2. when using comm_ctx, the allocator should be set manually.
-
- 13 Jan, 2023 1 commit
-
-
Committed by duanyanhui
* clear ProcessGroupCustom manually
* fix bug
* fix bug
* move destroy ProcessGroup to ProcessGroupIdMap
* enable destroy to all device
* remove unused comments
* change to internal api
* Update process_group.cc
* Update process_group.cc
-
- 12 Jan, 2023 2 commits
-
-
Committed by Wen Sun
* refactor: migrate comm checks
* refactor: add check in comm context
* feat: add gloo static check
* refactor: add place param in static check
-
Committed by jameszhang
* Fix reduce func bug in process_group_bkcl
  Also catch up with a recent process_group PR that failed to add XPU branch. Note that reduce is still accomplished by allreduce for xpu. Fix this should xccl lib be updated.
* fix compile issue for non-XPU
-
- 09 Jan, 2023 1 commit
-
-
Committed by LiYuRio
* comm_context and static init
* refactor: move to phi/core/distributed
* refactor: avoid mutable_data usage
* fix: windows sock
* fix: device without nccl

Co-authored-by: Wen Sun <syl1887415157@126.com>
-
- 06 Jan, 2023 1 commit
-
-
Committed by Wen Sun
* fix: fix hidden virtual funcs
* fix: add default impl
-
- 05 Jan, 2023 1 commit
-
-
Committed by Wen Sun
* refactor: use base class
* fix: incorrect deps
* fix: add missing header
* refactor: update class structures
* fix: bkcl typo
* fix: remove redundant def
-
- 19 Dec, 2022 1 commit
-
-
Committed by Wen Sun
-
- 17 Dec, 2022 1 commit
-
-
Committed by Wen Sun
-
- 16 Dec, 2022 1 commit
-
-
Committed by Wen Sun
-
- 15 Dec, 2022 1 commit
-
-
Committed by Wen Sun
-
- 14 Dec, 2022 1 commit
-
-
Committed by james
* nullptr bugfix for XPU pg mode
  Also, a few kernels are added to the xpu whitelist
* increase error msg length
-
- 12 Dec, 2022 1 commit
-
-
Committed by Wen Sun
* chore: unify `SingleTensor`
* feat: dynamic check
-
- 05 Dec, 2022 1 commit
-
-
Committed by ShenLiang
-
- 03 Dec, 2022 1 commit
-
-
Committed by Wen Sun
* refactor: classify static check
* refactor: rename to static_check & use forward decl
* refactor: switch to unary & binary funcs
-
- 24 Nov, 2022 1 commit
-
-
Committed by james
Note: this is a temporary solution; it should be replaced once the reduce kernel is natively supported on KL2
-
- 23 Nov, 2022 1 commit
-
-
Committed by Wen Sun
* feat: static check
-
- 21 Nov, 2022 4 commits