- 14 4月, 2021 1 次提交
-
- 13 4月, 2021 3 次提交
-
- 12 4月, 2021 5 次提交
-
-
add dataloader modes (#32200)
-
由 pangyoki 提交于
* enable async copy and add wait before sync operation * remove unneccessary wait * add FillNpuTensorWithConstant * refine * fix fill_constant * change TensorFromVector to FillNpuTensorWithConstant * fix ignored api * delete extra unittest * fix little error * fix update_loss_scaling_op_npu and check_finite_and_unscale_op_npu * change TensorCopySync to TensorCopy * delete useless Wait and add StreamWait * fix npu_stream error * fix check_finite_and_unscale_op_npu TensorCopy * only save stream wait * fix NPUDeviceContext in all c++ unittest * delete wait Co-authored-by: Nzhiqiu <chenqiuliang@baidu.com>
-
由 Leo Chen 提交于
* fix NPUDeviceContext in all c++ unittest * refine log Co-authored-by: Npangyoki <pangyoki@126.com>
-
- 09 4月, 2021 1 次提交
-
- 07 4月, 2021 2 次提交
-
- 06 4月, 2021 1 次提交
-
- 01 4月, 2021 4 次提交
-
- 31 3月, 2021 1 次提交
-
- 30 3月, 2021 2 次提交
-
- 29 3月, 2021 1 次提交
-
-
Co-authored-by: Nbaiyangfan <baiyangfan@baidu.com>
-
- 26 3月, 2021 1 次提交
-
- 25 3月, 2021 5 次提交
-
-
由 zhang wenhui 提交于
* fix * fix
- 24 3月, 2021 6 次提交
-
-
由 pangyoki 提交于
* support mixed precision input for npu layer norm * fix layer_norm npu kernel Co-authored-by: Nzhiqiu <chenqiuliang@baidu.com>
- 23 3月, 2021 5 次提交
-
-
由 lilong12 提交于
Add 3d Parallelism Co-authored-by: NWangXi <wangxi16@baidu.com> Co-authored-by: NJZ-LIANG <jianzhongliang10@gmail.com> Co-authored-by: Nroot <root@yq01-sys-hic-k8s-v100-box-a225-0562.yq01.baidu.com>
- 22 3月, 2021 2 次提交
-
-
由 zhang wenhui 提交于
* fix * fix
-
由 xiayanming 提交于
* fix gather grad kernel diff * fix gather grad kernel diff * fix gather review bug
-