- 14 4月, 2021 1 次提交
-
-
由 pangyoki 提交于
-
- 13 4月, 2021 3 次提交
- 12 4月, 2021 5 次提交
-
-
add dataloader modes (#32200)
-
由 pangyoki 提交于
-
由 pangyoki 提交于
* enable async copy and add wait before sync operation * remove unneccessary wait * add FillNpuTensorWithConstant * refine * fix fill_constant * change TensorFromVector to FillNpuTensorWithConstant * fix ignored api * delete extra unittest * fix little error * fix update_loss_scaling_op_npu and check_finite_and_unscale_op_npu * change TensorCopySync to TensorCopy * delete useless Wait and add StreamWait * fix npu_stream error * fix check_finite_and_unscale_op_npu TensorCopy * only save stream wait * fix NPUDeviceContext in all c++ unittest * delete wait Co-authored-by: Nzhiqiu <chenqiuliang@baidu.com>
-
由 Leo Chen 提交于
* fix NPUDeviceContext in all c++ unittest * refine log Co-authored-by: Npangyoki <pangyoki@126.com>
-
由 Leo Chen 提交于
* change aclrtSynchronizeDevice to aclrtSynchronizeStream for better performace * refine code
-
- 09 4月, 2021 1 次提交
-
-
由 pangyoki 提交于
* support save load for NPU * add save load npu unittest * support np.array transform in NPU * fix errors * delete dygraph in unittest * add Wait * fix unittest * fix review comment * fix unittest problem * fix little problem
-
- 07 4月, 2021 2 次提交
- 06 4月, 2021 1 次提交
-
-
由 Leo Chen 提交于
-
- 01 4月, 2021 4 次提交
-
-
由 liym27 提交于
-
由 Leo Chen 提交于
* enable async copy and add wait before sync operation * remove unneccessary wait * add FillNpuTensorWithConstant * refine * fix fill_constant * make TensorFromVector/TensorToVector sync
-
由 Leo Chen 提交于
-
由 Leo Chen 提交于
* support npu profiler * add python api * fix bugs * add wrapper for incomplete type * update profile proto * record npu wait * add xpu placeholder
-
- 31 3月, 2021 1 次提交
-
-
由 Leo Chen 提交于
-
- 30 3月, 2021 2 次提交
- 29 3月, 2021 1 次提交
-
-
Co-authored-by: Nbaiyangfan <baiyangfan@baidu.com>
-
- 26 3月, 2021 1 次提交
-
-
由 Leo Chen 提交于
* support GarbageCollector for npu * fix typo * fix gather_grad * disable NPUDefaultStreamGarbageCollector on NPU
-
- 25 3月, 2021 5 次提交
-
-
由 Leo Chen 提交于
* init * add todo * add npu kernel for truncated_gaussian_random * add sync * fix concat_grad * fix typo
-
由 liym27 提交于
-
由 Leo Chen 提交于
* fix some op bugs * fix some bugs * follow comments * fix log level * add ut
-
由 zhang wenhui 提交于
* fix * fix
-
由 Leo Chen 提交于
-
- 24 3月, 2021 6 次提交
- 23 3月, 2021 5 次提交
-
-
由 Leo Chen 提交于
* add npu kernel for concat op * add npu kernel for concat op * refine code * update * refine concat_grad
-
由 Leo Chen 提交于
* support list of tensor as npu input * add comment * fix typo * fix typo
-
由 Leo Chen 提交于
-
由 lw921014 提交于
add c_reduce_sum op
-
由 lilong12 提交于
Add 3d Parallelism Co-authored-by: NWangXi <wangxi16@baidu.com> Co-authored-by: NJZ-LIANG <jianzhongliang10@gmail.com> Co-authored-by: Nroot <root@yq01-sys-hic-k8s-v100-box-a225-0562.yq01.baidu.com>
-
- 22 3月, 2021 2 次提交
-
-
由 zhang wenhui 提交于
* fix * fix
-
由 xiayanming 提交于
* fix gather grad kernel diff * fix gather grad kernel diff * fix gather review bug
-