1. 26 11月, 2020 1 次提交
  2. 28 9月, 2020 1 次提交
  3. 16 9月, 2020 1 次提交
  4. 13 8月, 2020 1 次提交
  5. 06 8月, 2020 1 次提交
  6. 25 5月, 2020 1 次提交
  7. 14 5月, 2020 1 次提交
  8. 30 4月, 2020 1 次提交
  9. 06 4月, 2020 1 次提交
  10. 20 3月, 2020 1 次提交
  11. 02 3月, 2020 1 次提交
  12. 28 2月, 2020 1 次提交
  13. 25 2月, 2020 1 次提交
    • H
      PaddleBox Framework Part2 (#22466) · 175954d8
      hutuxian 提交于
      * Add two types of Metric Calculator: MultiTaskCalculator & CmatchRankCalculator.
      * Add a config for DynamicAdjustChannelNum function to denote whether we will discard the remaining instances when they are not be distributed evenly.
      * Remove CPU code in Pull/PushSparse and we will add it back when testing it fully.
      * Fix some known issues: such as copying persistable vars after one epoch running.
      175954d8
  14. 23 2月, 2020 1 次提交
  15. 11 2月, 2020 1 次提交
    • Y
      multi-loss optimization by adding a DownpourOpt worker (#22025) · 2235ee1a
      yaoxuefeng 提交于
      * update
      
      * update test=develop
      
      * update compile set test=develop
      
      * update compile set test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update compile setting test=develop
      
      * update compile setting test=develop
      
      * update run demo test=develop
      
      * update test=develop
      
      * update test=develop
      
      * fix test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update format test=develop
      
      * update format test=develop
      
      * update style test=develop
      
      * update style test=develop
      
      * change style test=develop
      
      * change style test=develop
      
      * change style test=develop
      
      * add dataset unittest test=develop
      
      * update test=develop
      
      * update for record test=develop
      
      * udpate style for record test=develop
      
      * update for record test=develop
      
      * update for record test=develop
      
      * update for record test=develop
      
      * fix format test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      2235ee1a
  16. 02 2月, 2020 1 次提交
  17. 06 12月, 2019 1 次提交
  18. 05 11月, 2019 1 次提交
    • X
      simplify master+patch,remove ins when size != merge_size or has conflict slot (#20913) · 1d1a0793
      xujiaqi01 提交于
      * remove duplicate code and duplicate config of master+patch
      * drop all ins which has conflict slot or size < merge_size
      * user only need to set merge size,if ins num of same id is not equal to merge size, just drop these ins
      * user must make sure master data and patch data has no same slot whose feasigns are both non-zero, otherwise these ins will be dropped. (slot list should still be the same of both master and patch)
      * test=develop
      1d1a0793
  19. 24 9月, 2019 1 次提交
    • X
      support change shuffle and train thread num (#19841) · cedc0477
      xujiaqi01 提交于
      * support change shuffle thread num
      * support change train thread num
      * fix receive shuffle data of each channel
      * data norm stop gradient
      * add check thread_tensor type and root_tensor type when merge metric
      * remove sleep in shuffle, add config
      * add config of pslib client to client communication
      * fix xbox str
      * add data norm op testcase
      * add flush in trainer finalize
      cedc0477
  20. 17 9月, 2019 1 次提交
  21. 31 8月, 2019 1 次提交
    • H
      Paddlebox Framework (#18982) · c756b5d2
      hutuxian 提交于
      * Support looking up embeddings from BoxPS.
      * Add a _pull_box_sparse op, for now this op is not exposed to users.
      * Add a BoxHelper class, providing 'BeginPass', 'EndPass', 'FeedPass' functions and so on.
      * Add 'BoxPSDataset' in python code.
      * Add a compile options WITH_BOX_PS and a MACRO PADDLE_WITH_BOX_PS.
      * Add UT.
      * More concrete information pls refer to: https://github.com/PaddlePaddle/Paddle/pull/18982
      c756b5d2
  22. 29 8月, 2019 1 次提交
    • T
      support debug each output of each ins (#19004) · 1fe468d3
      Thunderbrook 提交于
      * dump slot
      
      * test
      
      * proto
      
      * dump slot
      
      * test
      
      * proto
      
      * code style
      
      * code style
      
      * code style
      
      * style
      
      * add delete after unseen days
      
      * add unseen days
      
      * code style
      
      * conflict solve
      test=develop
      
      * add clear model
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * support debug tensor of each ins
      test=develop
      
      * support debug tensor of each ins
      test=develop
      
      * learning rate
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * unitest
      
      * style
      
      * style
      
      * multi phase
      
      * add channel
      
      * code style
      
      * style
      
      * style
      
      * unitest
      
      * style
      
      * define
      
      * define
      test=develop
      
      * style
      test=develop
      
      * rm define
      test=develop
      
      * linux
      
      * linux
      test=develop
      
      * style
      test=develop
      
      * output format
      test=develop
      
      * windows ci
      test=develop
      1fe468d3
  23. 11 8月, 2019 1 次提交
    • Y
      add save cache model api in fleet& add slots shuffle in dataset module & add... · 9150cf50
      yaoxuefeng 提交于
      add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871)
      
      * add ctr related metric layer test=develop
      
      * add save cache and slots shuffle test=develop
      
      * add save cache and slots shuffle test=develop
      
      * fix error
      
      * fix error
      
      * fix style for ci
      
      * fix for comments
      
      * change SlotsShuffle input to std::strinf for generality
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix stylr
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * fix style
      
      * change non-const reference to pointer
      
      * fix style
      
      * fix style
      
      * fix style test=develop
      
      * fix style  test=develop
      
      * add return ins num in ctr metric op
      
      * change dtype to float in metric_op.py
      
      * fix error test=develop
      
      * fix style test=develop
      
      * fix API spec
      
      * fix API spec
      
      * fix API spec test=develop
      
      * add UT test=develop
      9150cf50
  24. 02 8月, 2019 1 次提交
    • J
      support filelist size < trainer num && fix pull dense (#18956) · 02c370c3
      jiaqi 提交于
      * support filelist size < trainer num
      * pull dense when stop, to make sure local dense params are same as pserver, so save paddle model will save dense model same as pserver
      *  enable QueueDataset train same filelist for serveral times
      02c370c3
  25. 31 7月, 2019 1 次提交
    • J
      set fleet_send_batch_num a default value according to trainer num · 233746d8
      jiaqi 提交于
      (1) set fleet_send_batch_num a default value according to trainer num, the previous 80000 is fixed,if trainer num is much less or larger than 100,global shuffle may have timeout error.
      
      (2) fix load one table bug, add barrier
      233746d8
  26. 23 7月, 2019 1 次提交
    • J
      support patch data, add load_one_table, fix bug (#18509) · d18aabb4
      jiaqi 提交于
      (1)support patch data (merge slots of instances of same line id, modify dense layer which
      changes its size)
      (2)add fleet load_one_table interface, support load from paddle model and load from pslib model
      (3)fix push sparse bug which cause push sparse cost more time(about 10% in my testcase)
      (4)when some slots are not in one of your network (join/update, etc.),data feed、collect label info、push/pull sparse will skip these slots, instead of throw error.
      (5)add more debug info in TrainFilesWithProfiler
      d18aabb4
  27. 21 6月, 2019 1 次提交
    • J
      dataset (#17973) · 3f8031e2
      jiaqi 提交于
      (1) use channel instead of vector/BlockingQueue in Dataset,to keep same with existing implementation, and make code more readable and flexible (dataset single output channel or multi output channel). one previous memory out of limit problem is cause by not release memory after training.
      (2) add Record because MultiSlotType costs too much memory (80B),fix memory out of limit problem.
      (3) add Channel, Archive in paddle/fluid/framework
      (4) change dataset from shared_ptr to unique_ptr in pybind
      (5) move create/destroy readers from trainer to dataset
      (6) move shuffle from datafeed to dataset. dataset holds memory, datafeed is only for load data and feed data to network.
      (7) fix thread num bug of Dataset when filelist size < thread num
      (8) support set_queue_num in InMemoryDataset
      3f8031e2
  28. 11 6月, 2019 1 次提交
    • H
      Pipeline Concurrency (#17402) · 969e6378
      hutuxian 提交于
      Add Pipeline Concurrency Train Mode:
      - Cpp: pipeline_trainer & section_worker
      - Python: PipelineOptimizer
      - Add a new data_feed type: PrivateInstantDataFeed
      - Add a test demo of pipeline trainer and the test model is gnn
      - Do not support win32 now
      969e6378
  29. 18 5月, 2019 1 次提交
  30. 15 5月, 2019 1 次提交
    • J
      add save/load model, shrink table, cvm, config file & fix pull dense bug (#17118) · 66d51206
      jiaqi 提交于
      * add save/load model, shrink table, cvm, config file & fix pull dense bug
      test=develop
      
      * fix global shuffle bug, fix pull dense bug, fix release memeory bug, fix shrink error
      add client flush, add get data size
      test=develop
      
      * fix global shuffle bug
      test=develop
      
      * fix global shuffle bug
      test=develop
      
      * fix code style
      test=develop
      
      * fix code style & modify pslib cmake
      test=develop
      
      * fix error of _role_maker
      test=develop
      
      * fix code style
      test=develop
      
      * fix code style
      test=develop
      
      * fix code style
      test=develop
      
      * fix code style
      test=develop
      
      * fix code style
      test=develop
      
      * fix windows compile error of fleet
      test=develop
      
      * fix global shuffle bug
      
      * add comment
      test=develop
      
      * update pslib.cmake
      test=develop
      
      * fix fill sparse bug
      test=develop
      
      * fix push sparse bug
      test=develop
      66d51206
  31. 25 4月, 2019 1 次提交
  32. 11 4月, 2019 2 次提交
  33. 10 4月, 2019 2 次提交
  34. 04 4月, 2019 1 次提交
  35. 03 4月, 2019 1 次提交
  36. 01 4月, 2019 1 次提交
  37. 29 3月, 2019 2 次提交