- 07 4月, 2023 1 次提交
-
-
由 shaojie_wang 提交于
* register bf16 for communication ops * fix bfloat16 type finding compile error in c_allreduce_max_op
-
- 31 3月, 2023 1 次提交
-
-
由 Yuang Liu 提交于
-
- 19 12月, 2022 1 次提交
-
-
由 Wen Sun 提交于
-
- 29 11月, 2022 1 次提交
-
-
由 JZ-LIANG 提交于
* get default calc stream from execution ctx instead of global dev ctx pool.
-
- 21 11月, 2022 1 次提交
-
-
由 Wen Sun 提交于
* refactor: replace Collective & PointToPoint with NCCLEnv * refactor: rename to RunFnInNCCLEnv * refactor: pass std::function by value
-
- 14 11月, 2022 1 次提交
-
-
由 Wen Sun 提交于
* refactor: simplify send, recv interfaces * refactor: rm send_partial, recv_partial, all_gather_partial
-
- 11 10月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* remove using lodtensor part1 * polish history code format
-
- 01 8月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* remove cudaDeviceContext * remove more template * fix rocm compile * remove alias name CUDADeviceContext * fix compile * fix tests * revert changes
-
- 22 7月, 2022 1 次提交
-
-
由 Haohongxiang 提交于
-
- 26 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 15 2月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* #1 migrate dist-related type()-> dtype() * move datatype function from pten -> fluid/framework * change type() in imperative into convert(dtype()) * modify xx_tensor->type into xx_tensor->dtype * change the set_type interface and the caller * modify xx_tensor.type into xx_tensor.dtype * fix mutable_data(place, dtype()) * change caller of mutable_data in pten and distributed * change the caller of mutable_data in fluid/framework * change the caller of mutable_data in imperative directory * mutable_data: inference * update the call of mutable_data * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType * pass the compile. the next step is remove VarType in Pten * fix all and remove VarType from pten. success in linux. Next task is other platform * fix conflict with develop * fix compiled error * Fix reset conversion * fix conflict * fix compiled problem * fix typo * Fix << in tensor_utils.cc * fix type->dtype * fix unittest * fix tensor init constructor * fix DataTypeSize for BFloat16 * fix code style * fix npu compiled error * fix npu * compile npu sucessfully * fix conflict * fix conflict Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
-
- 03 12月, 2021 1 次提交
-
-
由 ronnywang 提交于
* refine structure for cuda and rocm * update * update * update * update
-
- 13 7月, 2021 1 次提交
-
-
由 WangXi 提交于
-
- 10 3月, 2021 1 次提交
-
-
由 lilong12 提交于
* remove the send/recv of tensor size, but users have to specify the shape of the received var explicitly.
-
- 24 2月, 2021 1 次提交
-
-
由 Qi Li 提交于
-
- 16 11月, 2020 1 次提交
-
-
由 lilong12 提交于
-
- 13 11月, 2020 1 次提交
-
-
由 lilong12 提交于
* update, test=develop
-