- 29 12月, 2020 1 次提交
-
-
由 Thunderbrook 提交于
-
- 25 12月, 2020 1 次提交
-
-
由 YUNSHEN XIE 提交于
-
- 23 12月, 2020 1 次提交
-
-
由 Thunderbrook 提交于
* add heter box * add trainer, worker, wrapper... * format * for ci * format * remove boost get * boost & copyright * rename * rename * format * format * format Co-authored-by: Nyaoxuefeng6 <yaoxuefeng@baidu.com>
-
- 14 12月, 2020 1 次提交
-
-
由 lilong12 提交于
-
- 23 11月, 2020 1 次提交
-
-
由 Thunderbrook 提交于
* ps gpu transpile * ps gpu * remove op * gps trainer * local ps * add macro * HeterBox * def cuda * tab * code style * style Co-authored-by: Thunderbrook <a754913769#163.com>
-
- 14 10月, 2020 1 次提交
-
-
由 wanghuancoder 提交于
-
- 25 9月, 2020 1 次提交
-
-
由 Thunderbrook 提交于
* add xpu in heter mode test=develop * BOOST_CONST_GET; PADDLE_THROW test=develop * code style test=develop * code style test=develop * code style test=develop * refine test=develop * refine test=develop * refine test=develop * refine code test=develop
-
- 24 9月, 2020 1 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
- 18 9月, 2020 1 次提交
-
-
由 tangwei12 提交于
* fix worker endpoints * fix gloo wrapper for hdfs * GPU fleetrun support gloo * parameterserver fleetrun support gloo * fix get server endpoint
-
- 14 9月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
-
- 02 9月, 2020 1 次提交
-
-
由 Thunderbrook 提交于
* fix eigen in push sparse; fix hadoop command test=develop * add log in load_combine_op test=develop
-
- 31 8月, 2020 1 次提交
-
-
由 yaoxuefeng 提交于
-
- 27 8月, 2020 1 次提交
-
-
由 lilong12 提交于
add collective op for cpu using gloo and paddle.distributed.* apis
-
- 18 8月, 2020 1 次提交
-
-
由 Thunderbrook 提交于
test=develop
-
- 07 8月, 2020 2 次提交
-
-
由 Thunderbrook 提交于
test=develop
-
由 123malin 提交于
* test=develop,test=document_fix, remove the out args * fleet_util move to paddle.fleet Co-authored-by: NWuHaobo <wuhaobo1994@gmail.com> Co-authored-by: Ntangwei12 <tangwei12@baidu.com>
-
- 06 8月, 2020 1 次提交
-
-
由 Thunderbrook 提交于
* add heter ps mode * code style test=develop * add with_pslib test=develop * unitest test=develop * code style test=develop * code style test=develop * code style test=develop * code style test=develop * code style test=develop * code style test=develop * code style test=develop * code style test=develop * test monitor test=develop * prepare trainer test=develop * code style test=develop
-
- 11 7月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop * replace old macro & for condition, test=develop * polish details, test=develop
-
- 10 6月, 2020 1 次提交
-
-
由 hutuxian 提交于
Support CMatchAucCalculator based on CMatchRankAucCalculator with a new parameter ignore_rank
-
- 04 6月, 2020 1 次提交
-
-
由 hutuxian 提交于
* Fix the field length in LoD scenario * Fix the missed lod info when copy tensor in dump field * Add some log to make debug easy
-
- 03 6月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* remove REPLACE_ENFORCE_GLOG compile option & add ci rule prohibit LOG(FATAL) using, test=develop * remove ci test case, test=develop * replace all LOG(FATAL) & polish message, test=develop * fix typo, test=develop * polish error info detail, test=develop
-
- 26 5月, 2020 1 次提交
-
-
由 ShenLiang 提交于
-
- 25 5月, 2020 1 次提交
-
-
由 hutuxian 提交于
* Support AucRunner in PaddleBox * update some code style
-
- 11 5月, 2020 2 次提交
-
-
由 hutuxian 提交于
* Add InitializeGPUAndLoadModel to solve random hang when downloading sparse parameters. * Update SaveBase to solve test problem.
-
由 Chen Weihang 提交于
* add new macro BOOST_GET_SAFELY & unittests, test=develop * add different macro type, test=develop * fix get macro type in executor, test=develop * four macro part change backup * using one macro for all case, test=develop * revert attribute change, test=develop * change to three func to solve gcc4.8 bug, test=develop * polish some details, test=develop
-
- 30 4月, 2020 1 次提交
-
-
由 xujiaqi01 提交于
* add timeout and http store in communication, add revert and confirm in fleet * test=develop
-
- 17 4月, 2020 1 次提交
-
-
由 hutuxian 提交于
-
- 11 4月, 2020 1 次提交
-
-
由 xujiaqi01 提交于
* add save with prefix * test=develop
-
- 10 4月, 2020 1 次提交
-
-
由 hutuxian 提交于
* Involves AfsAPI to resolve slow downloading. * Mainly used in PaddleBox
-
- 01 4月, 2020 1 次提交
-
-
由 xujiaqi01 提交于
* add fleet pslib pull and push sparse op and push dense op * test=develop
-
- 26 3月, 2020 2 次提交
-
-
由 xujiaqi01 提交于
* add clear_one_table * test=develop
-
由 danleifeng 提交于
* add maskauc in paddlebox; test=develop
-
- 20 3月, 2020 1 次提交
-
-
由 hutuxian 提交于
-
- 25 2月, 2020 1 次提交
-
-
由 hutuxian 提交于
* Add two types of Metric Calculator: MultiTaskCalculator & CmatchRankCalculator. * Add a config for DynamicAdjustChannelNum function to denote whether we will discard the remaining instances when they are not be distributed evenly. * Remove CPU code in Pull/PushSparse and we will add it back when testing it fully. * Fix some known issues: such as copying persistable vars after one epoch running.
-
- 11 2月, 2020 3 次提交
-
-
由 hutuxian 提交于
Refine PaddleBox Framework, Main functions: * Add MetricMsg util class, which can calculate metrics like AUC, bucket_error, COPC. * Replace FeedPass with new interface: BeginFeedPass & EndFeedPass * Refactor Pull/Push Sparse Function in box_wrapper. * Use CUDA Kernel to copy keys and copy feasign between tensor and boxps struct. * Cache copied keys in pull sparse in order to reuse it in push period.
-
由 yaoxuefeng 提交于
* update * update test=develop * update compile set test=develop * update compile set test=develop * update test=develop * update test=develop * update test=develop * update compile setting test=develop * update compile setting test=develop * update run demo test=develop * update test=develop * update test=develop * fix test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update format test=develop * update format test=develop * update style test=develop * update style test=develop * change style test=develop * change style test=develop * change style test=develop * add dataset unittest test=develop * update test=develop * update for record test=develop * udpate style for record test=develop * update for record test=develop * update for record test=develop * update for record test=develop * fix format test=develop * update test=develop * update test=develop * update test=develop * update test=develop * update test=develop
-
由 Wilber 提交于
支持不依赖nccl进行编译。[1/2] 多卡下,如果没有打开WITH_NCCL开关编译,多卡不能通信,则只能选择一张卡使用。 Co-authored-by: N石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>
-
- 05 2月, 2020 1 次提交
-
-
由 Wilber 提交于
cmake选项中添加了WITH_NCCL,显示指定是否编译NCCL的部分代码,WITH_NCCL默认打开,但如果WITH_GPU为OFF,则关闭WITH_NCCL 添加了PADDLE_WITH_NCCL定义 单机单卡能够关闭NCCL编译,多卡的话需要默认打开NCCL,如果关闭NCCL,则只能使用单卡 Co-authored-by: N石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>
-
- 02 2月, 2020 1 次提交
-
-
由 xujiaqi01 提交于
* add GeneralRoleMaker which is for general usage * test=develop
-
- 14 1月, 2020 1 次提交
-
-
由 xujiaqi01 提交于
* add collective communication library in fleet to replace mpi * test=develop
-