- 09 3月, 2021 1 次提交
-
-
由 xiayanming 提交于
* add gather npu op * code review done * update python new line * precommit * fix review * del commit
-
- 08 3月, 2021 2 次提交
-
-
由 lw921014 提交于
* add allreduce and broadcast without test * add c_broadcast_test case * build c_comm_init and c_create_group operators * make the whole thing compile * add broadcast and init op test case but run failed * make unit test compile * fix broadcast test bug and change into hcom for ccl * change c_comm_init and c_create_group ops accordingly * make tests compile * transfer code to 27 * compiled successfully in 28, but run failed * test broadcast in 28, but failed * make hcom primitives work * change hccl data type for base.h * fix broadcast bug * make attributes work * fix group name bug * add allreduce but test failed * allreduce bug for qiuliang * allreduce finished * add allgather and reducescatter * merge all op code * add allgather test * finish run all ccl op test exclude send/recv * all all op and test exclude send/recv * send_v2_npu.cc recv_v2_npiu.cc compiled * fix ccl core dump bug and test allgather, reducescatter, broadcast op * fix allreduce bug just for test * hcom send&recv test pass, without hcom_destroy * for qiuliang test * Ascend Send&Recv Test Pass * all op (ex send/recv) ok * fix bug * merge all ccl op * style merge to PaddlePaddle * merge style * new merge style * merge style 2 * insert an empty at the end * disable ctest for hcom to pass ci Co-authored-by: Nvoid-main <voidmain1313113@gmail.com> Co-authored-by: Nf2hkop <f2huestc@outlook.com>
-
由 Reventon_L 提交于
Co-authored-by: Nroot <xiayanming@baidu.com>
-
- 05 3月, 2021 1 次提交
-
-
由 Leo Chen 提交于
-
- 04 3月, 2021 4 次提交
-
-
由 Leo Chen 提交于
-
由 zhang wenhui 提交于
-
由 zhang wenhui 提交于
* add shape npu * fix * fix
-
由 Leo Chen 提交于
* add npu kernel for equal op * refine code * add more ut * update year
-
- 02 3月, 2021 3 次提交
-
-
由 Leo Chen 提交于
* init commit, add layer_norm npu kernel * fix typo * add unittest * add unittest * fix bug * fix bug * refine ut
-
由 Void Main 提交于
Refactor HCCLCommContext to be compatible with Paddle (#31359)
-
由 Leo Chen 提交于
* fix reading flags from env * fix problem caused by async run * support partial grad * support elementwise_add_grad npu kernel * add unittest * fix bug?
-
- 01 3月, 2021 3 次提交
- 26 2月, 2021 1 次提交
-
-
由 liym27 提交于
* [NPU] Support npu op: (1) pow (2) pow_grad * Support fp16
-
- 25 2月, 2021 2 次提交
- 23 2月, 2021 2 次提交
- 22 2月, 2021 1 次提交
-
-
由 Leo Chen 提交于
* add npu sub op * fix typo * rename test * fix bug * fix bug * add fp16 kernel * fix typo * support sub grad op * support elementwise_sub_grad op Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
-
- 09 2月, 2021 3 次提交
-
-
由 Leo Chen 提交于
* support npu allocator * add npu device context * fix some compile problem * fix some compile problem * add npu info * compile ok * fix include dir * support naive_best_fit_allocator * run ut ok, bug failed to exit * call aclrtResetDevice before exit * fix aclFinilize * add system allocatot test * add selected_gpus in gtest * add tensor_test for npu * support npu op, initial commit * add npu stream * add elementwise_add_op * compile ok * fix typo * fix elementwise_add_op_npu_test * support op run * test can run but failed * change aclopExecuteV2 to aclopCompileAndExecute
-
由 Leo Chen 提交于
[feature] support npu operator
-
由 Leo Chen 提交于
[feature] support npu allocator
-
- 08 2月, 2021 1 次提交
-
-
由 gongweibao 提交于
Destroy session first.
-
- 28 1月, 2021 1 次提交
-
-
由 Leo Chen 提交于
Dev/fix ascend string
-
- 27 1月, 2021 1 次提交
-
-
由 Leo Chen 提交于
fix compilation on ascend-20.1
-
- 21 1月, 2021 2 次提交
-
-
由 gongweibao 提交于
Add Hccl program group
-
由 gongweibao 提交于
Add distribution supported
-
- 15 1月, 2021 5 次提交
-
-
由 gongweibao 提交于
Fix compilcation on CANN20.1 and older
-
由 hutuxian 提交于
-
由 石晓伟 提交于
-
由 Wojciech Uss 提交于
-
由 wawltor 提交于
* fix the rnn mask memory bug for out of read * update the code for the rnn
-
- 14 1月, 2021 5 次提交
-
-
由 taixiurong 提交于
-
由 ShenLiang 提交于
-
由 yaoxuefeng 提交于
-
由 123malin 提交于
* test=develop, add distributed_infer
-
由 QingshuChen 提交于
-
- 13 1月, 2021 2 次提交
-
-
由 cc 提交于
* skip quantizing ops in cpu inference, test=develop
-
由 alncat 提交于
* added support for inference using qunatization aware trained dygraph * added support for inference using qunatization aware trained dygraph correct boost get usage * Delete incorrect warning message (#30196) * fix warning and no grad * clean redundant API alias in 2.0 - part 2 (#30013) * delete paddle.nn.functional.assign * fix dynamic to static error * just add the op error message for the matmul xpu (#30246) add the op error message for the matmul xpu * Add Static Variable Clone (#30208) Add clone method for static Variable so that this interface will be same as dygraph. It fixed some bugs in dy2stat * use wget to replace curl to download the lcov file (#30229) * use wget to replace curl to download the lcov file * add cache for lcov * fix test_pool3d_op timeout issue (#30248) * Fix unittests bugs. (#30250) * modify error message based on comments (#30189) * modify error message based on comments * edit code according to review. * Correct spelling according to review. * Fix bug for 'save mutiple method' (#30218) * Fix bug for 'save mutiple method' * To pass coverage. * edit code to pass coverage. * edit code to pass coverage. * add unittest for coverage. * change for coverage. * edit for coverage. * added support for inference using qunatization aware trained dygraph * Alias from paddle.fluid.layers.auc to paddle.static.auc (#30206) * add alias from fluid.layers.auc to static.auc * Update __init__.py * added support for inference using qunatization aware trained dygraph correct boost get usage * corrected boost get usage * corrected naming issues and enforcing zero check * correct paddle enforce message * added more error checkings * corrected error report message and optimized code * corrected findvar usage * corrected paddle_enforce in scope * correct error messages * correct error reporting format Co-authored-by: NLielinJiang <50691816+LielinJiang@users.noreply.github.com> Co-authored-by: NXiaoguangHu <46782768+XiaoguangHu01@users.noreply.github.com> Co-authored-by: Nwawltor <fangzeyang0904@hotmail.com> Co-authored-by: NHuihuang Zheng <zhhsplendid@gmail.com> Co-authored-by: NYUNSHEN XIE <1084314248@qq.com> Co-authored-by: NBai Yifan <me@ethanbai.com> Co-authored-by: Ngongweibao <weibao.gong@gmail.com> Co-authored-by: NWeiXin <weixin10@baidu.com> Co-authored-by: NJiaqi Liu <liujiaqi06@baidu.com>
-