1. 09 Aug, 2022 1 commit
    • refine save/load interface for distributed cpups (#44862) · 7b29c89b
      zhaocaibei123 committed
      * save load
      
      * save load
      
      * add unittest
      
      * first commit
      
      * second commit
      
      * third commit
      
      * remove SaveLocalFS in memory sparse table
      
      * save dense param
      
      * update
      
      * push slot
      
      * fix push show clk: int -> float
      
      * add unittest
      
      * fix sample
      
      * unittest
      
      * add AsExtra for op
      
      * unittest
      
      * modify fs.py
      
      * modify fs.py
      
      * fix some bugs
      
      * add dataset hdfs config
      
      * local change
      
      * dataset use different hadoop ugi/fs_name
      
      * add
      
      * fix conflict
      
      * fix
      
      * remove logs
      
      * code style
      
      * fix
      
      * code style
      
      * code style
      
      * fix
      
      * code style
      
      * save_dense_param
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * change momentum in dense optimizer
      
      * fix
      
      * fix
      
      * change fluid => paddle.static
      
      * remove some unuseful code
      Co-authored-by: esythan <esythan@126.com>
  2. 01 Aug, 2022 1 commit
  3. 20 Jul, 2022 1 commit
  4. 19 Jul, 2022 1 commit
  5. 16 Jul, 2022 1 commit
  6. 15 Jul, 2022 1 commit
  7. 11 Jul, 2022 1 commit
  8. 26 Jun, 2022 1 commit
  9. 24 Jun, 2022 1 commit
  10. 21 Jun, 2022 1 commit
  11. 20 Jun, 2022 1 commit
    • add dymf to gpups in python (#43497) · a4cfa5ae
      wangguanqun committed
      * gpups default config and dataset
      
      * codestyle
      
      * add unittest
      
      * code style
      
      * add dymf to gpups
      
      * codestyle
      
      * add static.nn.cvm import
      
      * PSERVER_DEBUG
      
      * add fs config to worker desc
      
      * update unittest
      
      * unittest
      
      * remove gpups unittest
      
      * remove gpups unittest
      
      * static check
  12. 17 Jun, 2022 1 commit
  13. 09 Jun, 2022 1 commit
  14. 05 Jun, 2022 1 commit
  15. 04 Jun, 2022 1 commit
  16. 19 May, 2022 2 commits
    • 【GPUPS】add ctr_dymf_accessor for pscore (#42827) · 148582fe
      danleifeng committed
    • distribute label evenly among partitions in graph engine (#42846) · 68babef1
      seemingwang committed
      * enable graph-engine to return all id
      
      * change vector's dimension
      
      * change vector's dimension
      
      * enlarge returned ids dimensions
      
      * add actual_val
      
      * change vlog
      
      * fix bug
      
      * bug fix
      
      * bug fix
      
      * fix display test
      
      * singleton of gpu_graph_wrapper
      
      * change sample result's structure to fit training
      
      * recover sample code
      
      * fix
      
      * secondary sample
      
      * add graph partition
      
      * fix pybind
      
      * optimize buffer allocation
      
      * fix node transfer problem
      
      * remove log
      
      * support 32G+ graph on single gpu
      
      * remove logs
      
      * fix
      
      * fix
      
      * fix cpu query
      
      * display info
      
      * remove log
      
      * remove empty file
      
      * distribute labeled data evenly in graph engine
      Co-authored-by: DesmonDay <908660116@qq.com>
  17. 18 May, 2022 1 commit
  18. 16 May, 2022 1 commit
    • fix node transfer problem (#42674) · b61a6e71
      seemingwang committed
      * enable graph-engine to return all id
      
      * change vector's dimension
      
      * change vector's dimension
      
      * enlarge returned ids dimensions
      
      * add actual_val
      
      * change vlog
      
      * fix bug
      
      * bug fix
      
      * bug fix
      
      * fix display test
      
      * singleton of gpu_graph_wrapper
      
      * change sample result's structure to fit training
      
      * recover sample code
      
      * fix
      
      * secondary sample
      
      * add graph partition
      
      * fix pybind
      
      * optimize buffer allocation
      
      * fix node transfer problem
      
      * remove log
      
      * support 32G+ graph on single gpu
      
      * remove logs
      
      * fix
      
      * fix
      
      * fix cpu query
      
      * display info
      
      * remove log
      
      * remove empty file
      Co-authored-by: DesmonDay <908660116@qq.com>
  19. 04 May, 2022 1 commit
    • change sample result's structure to fit training (#42426) · 2f99869d
      seemingwang committed
      * enable graph-engine to return all id
      
      * change vector's dimension
      
      * change vector's dimension
      
      * enlarge returned ids dimensions
      
      * add actual_val
      
      * change vlog
      
      * fix bug
      
      * bug fix
      
      * bug fix
      
      * fix display test
      
      * singleton of gpu_graph_wrapper
      
      * change sample result's structure to fit training
      
      * recover sample code
      
      * fix
      
      * secondary sample
      
      * add graph partition
      
      * fix pybind
      Co-authored-by: DesmonDay <908660116@qq.com>
  20. 29 Apr, 2022 1 commit
  21. 27 Apr, 2022 1 commit
  22. 24 Apr, 2022 1 commit
    • combine graph_table and feature_table in graph_engine (#42134) · 0e0f7da6
      seemingwang committed
      * extract sub-graph
      
      * graph-engine merging
      
      * fix
      
      * fix
      
      * fix heter-ps config
      
      * test performance
      
      * test performance
      
      * test performance
      
      * test
      
      * test
      
      * update bfs
      
      * change cmake
      
      * test
      
      * test gpu speed
      
      * gpu_graph_engine optimization
      
      * add dsm sample method
      
      * add graph_neighbor_sample_v2
      
      * Add graph_neighbor_sample_v2
      
      * fix for loop
      
      * add cpu sample interface
      
      * fix kernel judgement
      
      * add ssd layer to graph_engine
      
      * fix allocation
      
      * fix syntax error
      
      * fix syntax error
      
      * fix pscore class
      
      * fix
      
      * change index settings
      
      * recover test
      
      * recover test
      
      * fix spelling
      
      * recover
      
      * fix
      
      * move cudamemcpy after cuda stream sync
      
      * fix linking problem
      
      * remove comment
      
      * add cpu test
      
      * test
      
      * add cpu test
      
      * change comment
      
      * combine feature table and graph table
      
      * test
      
      * test
      
      * pybind
      
      * test
      
      * test
      
      * test
      
      * test
      
      * pybind
      
      * pybind
      
      * fix cmake
      
      * pybind
      
      * fix
      
      * fix
      
      * add pybind
      
      * add pybind
      Co-authored-by: DesmonDay <908660116@qq.com>
  23. 22 Apr, 2022 1 commit
    • Ssd sparse table (#41812) · cca57c4a
      zhaocaibei123 committed
      * [cherry-pick2.3]fix compile bug of windows cuda11.5 (#41464)
      
      cherry-pick
      
      fix compile bug of windows cuda11.5 #41433
      
      * fix bug of missing boost when compile cache.cc (#41449)
      
      【cherry-pick #41430】fix bug of random compile failure, due to incorrect compile order of dependencies
      
      * Fix eager try catch (#41438) (#41477)
      
      [Cherry-Pick]Fix eager try catch (#41438)
      
      * Cherry-pick-PR41407, fix device_id bug for final_state op in multiprocess testcase (#41407) (#41475)
      
      Cherry-pick PR #41407
      
      * [BugFix] Add error hint for one_hot gpu version (#41335) (#41495)
      
      * add one_hot gpu hint
      
      * move allow_out_of_range judgement
      
      * delete useless unittest
      
      * fix bugs of reshape double grad infermeta (#41459) (#41493)
      
      * [cherrypick-2.3] modify infer gpu memory strategy (#41427), remove cudnn_deterministic=True (#41341)  (#41491)
      Co-authored-by: JingZhuangzhuang <75348594+JZZ-NOTE@users.noreply.github.com>
      
      * [Cherry-pick][ROCm] fix dcu error in device event base, test=develop (#41523)
      
      Cherry-pick of #41521
      
      * [Cherry-Pick]Cherry pick PR41200, PR41474, PR41382 (#41509)
      
      * Use `self` as a parameter of _hash_with_id function to avoid error caused by hash_id reuse (#41200)
      
      * Add fill_constant_batch_size YAML and UT (#41474)
      
      * Switch some dy2st UT to eager mode (#41382)
      
      * Switch some dy2st UT to eager mode
      
      * Fix test_lstm and remove test_transformer
      
      * Run test_resnet_v2 in old dy mode
      
      * Unittest recover (#41431)
      
      * update name
      
      * update name
      
      * fix test
      
      * fix fleet bind
      
      * update name
      
      * update name
      
      * fix test
      
      * fix gpups wrapper
      
      * remove Push/Pull/Load/Save with context in client and wrapper base class
      
      * fix
      
      * fix
      
      * remove some interface
      
      * fix
      
      * remove
      
      * code style
      
      * recover
      
      * fix
      
      * remove code unused
      
      * remove some unused table & accessor & CommonDenseTable => MemoryDenseTable
      
      * fix
      
      * fix
      
      * fix
      
      * recover
      
      * remove unused code
      
      * recover unittest
      
      * fix
      
      * remove
      
      * fix
      
      * remove code unuseful
      
      * remove
      
      * fix
      
      * recover
      
      * remove
      Co-authored-by: esythan <esythan@126.com>
      
      * add ssd sparse table
      
      * fix
      
      * add cache shuffle
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * add unit test
      
      * fix
      Co-authored-by: Zhou Wei <1183042833@qq.com>
      Co-authored-by: Sing_chan <51314274+betterpig@users.noreply.github.com>
      Co-authored-by: 0x45f <23097963+0x45f@users.noreply.github.com>
      Co-authored-by: pangyoki <pangyoki@126.com>
      Co-authored-by: Siming Dai <908660116@qq.com>
      Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
      Co-authored-by: Zhang Jun <ewalker@live.cn>
      Co-authored-by: JingZhuangzhuang <75348594+JZZ-NOTE@users.noreply.github.com>
      Co-authored-by: Qi Li <qili93@qq.com>
      Co-authored-by: esythan <esythan@126.com>
  24. 19 Apr, 2022 2 commits
    • double accessor and show_scale (#41943) · 8113c913
      wangguanqun committed
      * double accessor and show_scale
      
      * double accessor and show_scale
      
      * rename
      
      * fix bug in pslib config
      
      * add unittest
    • Cpu gpu graph engine (#41942) · 4f461ab9
      seemingwang committed
      * extract sub-graph
      
      * graph-engine merging
      
      * fix
      
      * fix
      
      * fix heter-ps config
      
      * test performance
      
      * test performance
      
      * test performance
      
      * test
      
      * test
      
      * update bfs
      
      * change cmake
      
      * test
      
      * test gpu speed
      
      * gpu_graph_engine optimization
      
      * add ssd layer to graph_engine
      
      * fix allocation
      
      * fix syntax error
      
      * fix syntax error
      
      * fix pscore class
      
      * fix
      
      * recover test
      
      * recover test
      
      * fix spelling
      
      * recover
      
      * fix
      
      * fix linking problem
      
      * remove comment
  25. 15 Apr, 2022 1 commit
    • gpu_graph engine optimization+ (#41455) · ce72690c
      seemingwang committed
      * extract sub-graph
      
      * graph-engine merging
      
      * fix
      
      * fix
      
      * fix heter-ps config
      
      * test performance
      
      * test performance
      
      * test performance
      
      * test
      
      * test
      
      * update bfs
      
      * change cmake
      
      * test
      
      * test gpu speed
      
      * gpu_graph_engine optimization
      
      * add ssd layer to graph_engine
      
      * fix allocation
      
      * fix syntax error
      
      * fix syntax error
      
      * fix pscore class
      
      * fix
      
      * recover test
      
      * recover test
      
      * fix spelling
      
      * recover
      
      * fix
  26. 09 Apr, 2022 1 commit
    • Unittest recover (#41431) · 7a07c4a5
      zhaocaibei123 committed
      * update name
      
      * update name
      
      * fix test
      
      * fix fleet bind
      
      * update name
      
      * update name
      
      * fix test
      
      * fix gpups wrapper
      
      * remove Push/Pull/Load/Save with context in client and wrapper base class
      
      * fix
      
      * fix
      
      * remove some interface
      
      * fix
      
      * remove
      
      * code style
      
      * recover
      
      * fix
      
      * remove code unused
      
      * remove some unused table & accessor & CommonDenseTable => MemoryDenseTable
      
      * fix
      
      * fix
      
      * fix
      
      * recover
      
      * remove unused code
      
      * recover unittest
      
      * fix
      
      * remove
      
      * fix
      
      * remove code unuseful
      
      * remove
      
      * fix
      
      * recover
      
      * remove
      Co-authored-by: esythan <esythan@126.com>
  27. 05 Apr, 2022 1 commit
    • Table refine: remove table/accessor unuseful (#41400) · a288fcab
      zhaocaibei123 committed
      * update name
      
      * update name
      
      * fix test
      
      * fix fleet bind
      
      * update name
      
      * update name
      
      * fix test
      
      * fix gpups wrapper
      
      * remove Push/Pull/Load/Save with context in client and wrapper base class
      
      * fix
      
      * fix
      
      * remove some interface
      
      * fix
      
      * remove
      
      * code style
      
      * recover
      
      * fix
      
      * remove code unused
      
      * remove some unused table & accessor & CommonDenseTable => MemoryDenseTable
      
      * fix
      
      * fix
      
      * fix
      
      * recover
      
      * remove unused code
      Co-authored-by: esythan <esythan@126.com>
  28. 04 Apr, 2022 1 commit
    • Table refine: Pull/Push(TableContext) (#41320) · 19cb0d18
      zhaocaibei123 committed
      * update name
      
      * update name
      
      * fix test
      
      * fix fleet bind
      
      * update name
      
      * update name
      
      * fix test
      
      * fix gpups wrapper
      
      * remove Push/Pull/Load/Save with context in client and wrapper base class
      
      * fix
      
      * fix
      
      * remove some interface
      
      * fix
      
      * remove
      
      * code style
      
      * recover
      
      * fix
      
      * remove code unused
      
      * fix
      
      * recover
      
      * fix
      Co-authored-by: esythan <esythan@126.com>
  29. 02 Apr, 2022 2 commits
  30. 01 Apr, 2022 1 commit
  31. 31 Mar, 2022 1 commit
    • Using DistConfig in Paddle Inference (#41128) · dc0702fe
      TeslaZhao committed
      * Pass compat of conv_transpose_bias_mkldnn_fuse_pass
      
      * Fix a bug of strided_slice op, about the axes parameter access memory out of bounds
      
      * Fix a bug of strided_slice op, about the axes parameter access memory out of bounds
      
      * Fix a bug of transpose op, about accessing memory out of bounds of the perm param
      
      * op:transpose_op supports bool type
      
      * op:transpose_op supports bool type
      
      * Keep strided_slice op behavior consistent with slice op when starts input is less than -rank
      
      * Using DistConfig in inference
  32. 30 Mar, 2022 1 commit
  33. 28 Mar, 2022 2 commits
  34. 24 Mar, 2022 1 commit
    • test gpu graph engine's performance (#40775) · 83ae1619
      seemingwang committed
      * extract sub-graph
      
      * graph-engine merging
      
      * fix
      
      * fix
      
      * fix heter-ps config
      
      * test performance
      
      * test performance
      
      * test performance
      
      * test
      
      * test
      
      * update bfs
      
      * change cmake
  35. 23 Mar, 2022 1 commit
    • two-phase training for ps (#40762) · b1a4668c
      zhaocaibei123 committed
      * fix benchmark and communicator config
      
      * fix bugs of the_one_ps
      
      * multi program and fix bug in optimizer
      
      * multi program in the_one_ps
      
      * public commcontext
      
      * ps optimizer multi programs
      
      * cvm & datanorm backend
      
      * fix dim
      
      * fix unittest
      
      * fix
      
      * the one ps merge
      
      * remove comm
      
      * add DownpourLiteWorker
      
      * all
      
      * fix
      
      * fix
      
      * device worker downpour lite
      
      * fix
      
      * fix bug in global shuffle
      
      * save inference model
      
      * fix & add log
      
      * fix
      
      * remove log
      
      * fix
      
      * fix save summary
      
      * fix
      
      * fix pscore
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * remove logs
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * add some comments
      
      * fix
      Co-authored-by: esythan <esythan@126.com>
  36. 21 Mar, 2022 1 commit