- 31 12月, 2021 1 次提交
-
-
由 zmxdream 提交于
-
- 29 12月, 2021 2 次提交
-
-
由 yaoxuefeng 提交于
add hashtable dynamic mf support
-
由 yaoxuefeng 提交于
add dynamic mf size api
-
- 23 12月, 2021 1 次提交
-
-
由 yaoxuefeng 提交于
add mem pool
-
- 20 12月, 2021 1 次提交
-
-
由 石晓伟 提交于
-
- 16 12月, 2021 1 次提交
-
-
由 danleifeng 提交于
* trainer_device fix and checknan tool for psgpu;test=develop * disable show_one_table;test=develop
-
- 03 12月, 2021 1 次提交
-
-
由 ronnywang 提交于
* refine structure for cuda and rocm * update * update * update * update
-
- 29 11月, 2021 2 次提交
-
-
由 Thunderbrook 提交于
* auc temp * cuballocator * code format * code format
-
由 xiaoxiao-luomu 提交于
-
- 17 11月, 2021 1 次提交
-
-
由 danleifeng 提交于
-
- 16 11月, 2021 1 次提交
-
-
由 danleifeng 提交于
-
- 08 11月, 2021 1 次提交
-
-
由 Thunderbrook 提交于
* add refrence * format
-
- 26 10月, 2021 1 次提交
-
-
由 xiongkun 提交于
* In cpu parallel using gloo, add various length support for SelectedRows * fix bug * fix bugs * fix by code review * remove timeout
-
- 21 10月, 2021 1 次提交
-
-
由 xiongkun 提交于
-
- 20 10月, 2021 1 次提交
-
-
由 danleifeng 提交于
* split into PreBuildTask and BuildPull; slove endpass bug;test=develop * change buildcpu into prebuild and buildcpu into build;test=develop
-
- 19 10月, 2021 1 次提交
-
-
由 danleifeng 提交于
-
- 11 10月, 2021 1 次提交
-
-
由 yaoxuefeng 提交于
-
- 30 9月, 2021 1 次提交
-
-
由 yaoxuefeng 提交于
-
- 27 9月, 2021 1 次提交
-
-
由 xiaoxiao-luomu 提交于
* gloo hdfs set check & gloo connect retry * add vlog * print gloo connect addr & add vlog * . * modify vlof * modify vlog * modify vlog
-
- 26 9月, 2021 1 次提交
-
-
由 Thunderbrook 提交于
* set file_num in one shard * format
-
- 22 9月, 2021 1 次提交
-
-
由 Tomasz Socha 提交于
* Fix copy elision warning * Remove redundand code
-
- 14 9月, 2021 1 次提交
-
-
由 yaoxuefeng 提交于
-
- 07 9月, 2021 1 次提交
-
-
由 yaoxuefeng 提交于
-
- 01 9月, 2021 1 次提交
-
-
由 Thunderbrook 提交于
* merge dense * log level * tensor copy sync * format
-
- 16 8月, 2021 1 次提交
-
-
由 Fan Zhang 提交于
-
- 05 8月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* replace boost::algorithm::ends_with with self define ends_with function * remove BOOST macro in certain operators * remove boost::lexical_cast * add test for string_helper * add more test case for string_helper * modify join_string func and test case * fix build_strategy_test failed bug * remove string_helper_test from parallel_UT_rule.py
-
- 26 7月, 2021 1 次提交
-
-
由 danleifeng 提交于
* psgpu:edit cuda remote_streams; test=develop
-
- 19 7月, 2021 1 次提交
-
-
由 yaoxuefeng 提交于
-
- 16 7月, 2021 1 次提交
-
-
由 Fan Zhang 提交于
-
- 15 7月, 2021 1 次提交
-
-
由 danleifeng 提交于
-
- 06 7月, 2021 1 次提交
-
-
由 danleifeng 提交于
* pipeline adaptive for heterps;test=develop * fix finalize hang;test=develop * add is_compiled_with_heterps for dataset;test=develop * fix hashtable core when pass ins_num=0;test=develop
-
- 05 7月, 2021 1 次提交
-
-
由 danleifeng 提交于
* fix hdfs and fleet_util for supporting save/load infer;test=develop
-
- 29 6月, 2021 1 次提交
-
-
由 Thunderbrook 提交于
* add retry in pull sparse * retry
-
- 22 6月, 2021 1 次提交
-
-
由 danleifeng 提交于
-
- 16 6月, 2021 1 次提交
-
-
由 Thunderbrook 提交于
-
- 10 5月, 2021 1 次提交
-
-
由 Thunderbrook 提交于
* pslib with cmake * heter util * vlog * heter server test * add dtor * cmake
-
- 28 4月, 2021 1 次提交
-
-
由 Thunderbrook 提交于
* Revert "Revert "[PsCore] optimize performance of large kv (#32535)" (#32599)" This reverts commit 809ac036. * brpc dep
-
- 27 4月, 2021 1 次提交
-
-
由 tianshuo78520a 提交于
This reverts commit 4b7242b0.
-
- 26 4月, 2021 1 次提交
-
-
由 Thunderbrook 提交于
* optimize pull sparse * optimize pull sparse * change macro * format
-
- 21 4月, 2021 1 次提交
-
-
由 zhang wenhui 提交于
* add allreduce and broadcast without test (#31024) add allreduce and broadcast without test * Refactor HCCLCommContext to be compatible with Paddle (#31359) Refactor HCCLCommContext to be compatible with Paddle (#31359) * [NPU] add npu kernel for communication op (#31437) * add allreduce and broadcast without test * add c_broadcast_test case * build c_comm_init and c_create_group operators * make the whole thing compile * add broadcast and init op test case but run failed * make unit test compile * fix broadcast test bug and change into hcom for ccl * change c_comm_init and c_create_group ops accordingly * make tests compile * transfer code to 27 * compiled successfully in 28, but run failed * test broadcast in 28, but failed * make hcom primitives work * change hccl data type for base.h * fix broadcast bug * make attributes work * fix group name bug * add allreduce but test failed * allreduce bug for qiuliang * allreduce finished * add allgather and reducescatter * merge all op code * add allgather test * finish run all ccl op test exclude send/recv * all all op and test exclude send/recv * send_v2_npu.cc recv_v2_npiu.cc compiled * fix ccl core dump bug and test allgather, reducescatter, broadcast op * fix allreduce bug just for test * hcom send&recv test pass, without hcom_destroy * for qiuliang test * Ascend Send&Recv Test Pass * all op (ex send/recv) ok * fix bug * merge all ccl op * style merge to PaddlePaddle * merge style * new merge style * merge style 2 * insert an empty at the end * disable ctest for hcom to pass ci Co-authored-by: Nvoid-main <voidmain1313113@gmail.com> Co-authored-by: Nf2hkop <f2huestc@outlook.com> * Add auto-increasing tag id for Hcom OPs (#31702) * add c_reduce_sum op (#31793) add c_reduce_sum op * update Ascendrc hccl to 20.3 (#32126) update Ascendrc hccl to 20.3 (#32126) * fix merge code * change cmake.txt1 * [NPU] Support npu kernel for c sync stream op (#31386) * sync stream npu op * add with_ascend_acl * update c++ unittest * compile all failed * try to pre commit * after pre commit * merge&compile&test hccl successfully! * fix code style * fix code style * fix bugs about hccl * fix some bugs * fix code style * fix style * fix style * fix * fixed * merge develop Co-authored-by: Nlw921014 <liuwei921014@yeah.net> Co-authored-by: NVoid Main <voidmain1313113@gmail.com> Co-authored-by: Nf2hkop <f2huestc@outlook.com> Co-authored-by: Nxiayanming <41795079@qq.com>
-