[NPU] use SparseSoftmaxCrossEntropyWithLogits in npu kernel of softmax_with_cross_entropy (#32858)
* use SparseSoftmaxCrossEntropyWithLogits
* fix
* test_slice
* revert test_slice
* add backprop for npu kernel
* fix typo
* fix ut
* fix ut
* refine comments
* return softmax