1. 15 2月, 2023 1 次提交
  2. 13 2月, 2023 1 次提交
    • R
      Upgrade protobuf to 4.21.x (#49168) · 15d93394
      risemeup1 提交于
      * upgrade protobuf to 3.19.0 in cmake
      
      * recover protobuf python version
      
      * fix distribute compile
      
      * fix
      
      * fix framework.data_feed_pb2
      
      * fix macos ifdef
      
      * fix lite
      
      * test
      
      * update protoc from 3.19.0 t0 3.20.0
      
      * test
      
      * debug
      
      * test
      
      * test
      
      * debug
      
      * debug
      
      * debug
      
      * debug
      
      * test
      
      * debug
      
      * update protocol from 3.20.0 to 4.21.12
      
      * modify graph_brpc_client.h
      
      * modify graph_brpc_client.h
      
      * test
      
      * test
      
      * test
      
      * fix third_party cache problem on build ci
      
      * updata proto
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * fix coverage failed test
      
      * try to fix test_exe_fleet_model_run
      
      * fix cinn bug
      
      * fix windows compile problem
      
      * fix python/requirements
      
      ---------
      Co-authored-by: Npangyoki <pangyoki@126.com>
      15d93394
  3. 10 2月, 2023 1 次提交
  4. 09 2月, 2023 2 次提交
  5. 08 2月, 2023 6 次提交
  6. 07 2月, 2023 1 次提交
  7. 06 2月, 2023 1 次提交
  8. 01 2月, 2023 1 次提交
  9. 31 1月, 2023 2 次提交
  10. 30 1月, 2023 2 次提交
    • R
      Support stream priority for standalone executor (#49939) · 172d1de6
      Ruibiao Chen 提交于
      * Support stream priority for standalone executor
      
      * Fix compile error
      
      * Fix compile error
      
      * Fix compile error
      
      * Fix compile error
      
      * Fix compile error
      172d1de6
    • Z
      [Pglbox2.0] merge gpugraph to develop (#49946) · cb525d4e
      zmxdream 提交于
      * add set slot_num for psgpuwraper (#177)
      
      * add set slot_num_for_pull_feature for psgpuwarper
      
      * Add get_epoch_finish python interface (#182)
      
      * add get_epoch_finish interface
      
      * add return
      
      * delete return
      
      * add unzip op (#183)
      
      * fix miss key for error dataset (#186)
      
      * fix miss key for error dataset
      
      * fix miss key for error dataset
      Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>
      
      * add excluded_train_pair and infer_node_type (#187)
      
      * support return of degree (#188)
      
      * fix task stuck in barrier (#189)
      Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>
      
      * check node/feature format when loading (#190)
      
      * check node&feature format when loading
      
      * check node&feature format when loading (2£ (2)
      
      * degrade log (#191)
      
      * [PGLBOX]fix conflict
      
      * [PGLBOX]fix conflict
      
      * [PGLBOX]replace LodTensor with phi::DenseTensor
      
      * [PGLBOX]fix gpu_primitives.h include path
      
      * [PGLBOX]from platform::PADDLE_CUDA_NUM_THREADS to phi::PADDLE_CUDA_NUM_THREADS
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip ut
      
      * [PGLBOX]fix unzip ut
      
      * [PGLBOX]fix code style
      
      * [PGLBOX]fix code style
      
      * [PGLBOX]fix code style
      
      * fix code style
      
      * fix code style
      
      * fix unzip ut
      
      * fix unzip ut
      
      * fix unzip ut
      
      * fix unzip
      
      * fix code stype
      
      * add ut
      
      * add c++ ut & fix train_mode_ set
      
      * fix load into memory
      
      * fix c++ ut
      
      * fix c++ ut
      
      * fix c++ ut
      
      * fix c++ ut
      
      * fix code style
      
      * fix collective
      
      * fix unzip_op.cc
      
      * fix barrier
      
      * fix code style
      
      * fix barrier
      
      * fix barrier
      
      * fix code styple
      
      * fix unzip
      
      * add unzip.py
      
      * add unzip.py
      
      * fix unzip.py
      
      ---------
      Co-authored-by: Nchao9527 <33347532+chao9527@users.noreply.github.com>
      Co-authored-by: NSiming Dai <908660116@qq.com>
      Co-authored-by: Nhuwei02 <53012141+huwei02@users.noreply.github.com>
      Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>
      cb525d4e
  11. 29 1月, 2023 3 次提交
  12. 28 1月, 2023 1 次提交
  13. 20 1月, 2023 1 次提交
  14. 18 1月, 2023 2 次提交
  15. 17 1月, 2023 1 次提交
  16. 16 1月, 2023 2 次提交
  17. 15 1月, 2023 1 次提交
    • R
      support mp on xpu (#49815) · 6a56bce7
      Roc 提交于
      1 update xccl lib
      2 when using comm_ctx, the allocator should be set manually.
      6a56bce7
  18. 13 1月, 2023 3 次提交
  19. 12 1月, 2023 2 次提交
    • W
      Migrate collective communication checks to PHI (#49754) · c24e7fe1
      Wen Sun 提交于
      * refactor: migrate comm checks
      
      * refactor: add check in comm context
      
      * feat: add gloo static check
      
      * refactor: add place param in static check
      c24e7fe1
    • J
      Fix reduce func bug in process_group_bkcl (#49749) · 8e291bf7
      jameszhang 提交于
      * Fix reduce func bug in process_group_bkcl
      
      Also catch up with a recent process_group PR that failed to add XPU branch.
      Note that reduce is still accomplished by allreduce for xpu. Fix this should
      xccl lib be updated.
      
      * fix compile issue for non-XPU
      8e291bf7
  20. 10 1月, 2023 1 次提交
  21. 09 1月, 2023 2 次提交
  22. 07 1月, 2023 1 次提交
    • R
      Enable standalone executor for fleet training (#49293) · 67fc8e93
      Ruibiao Chen 提交于
      * Enable standalone executor for fleet training
      
      * Update code
      
      * Replace use_standalone_executor utils in auto parallel
      
      * Update code
      
      * Diable standalone executor for test_pass_sharding
      
      * Update code
      
      * Set sequential run for auto parallel
      
      * Fix dist_attr bug
      
      * Set sequential run for auto parallel
      67fc8e93
  23. 06 1月, 2023 2 次提交
    • W
      Fix hidden overloaded functions in process group (#49576) · 215c7ae7
      Wen Sun 提交于
      * fix: fix hidden virtual funcs
      
      * fix: add default impl
      215c7ae7
    • Y
      [Auto Parallel] Merge dist attrs from python into c++ (#49214) · c7899074
      Yulong Ao 提交于
      * [Auto Parallel] Rename methods of ProcessMesh
      
      * [Auto Parallel] Impl the python process_mesh by the c++ one
      
      * [Auto Parallel] Add some minor modifications
      
      * [Auto Parallel] Rename some methods
      
      * [Auto Parallel] Remove unnecessary codes
      
      * [Auto Parallel] Add back some removed files
      
      * [Auto Parallel] Fix bugs
      
      * [Auto Parallel] Fix a bug
      
      * Update process_mesh.cc
      
      * [Auto Parallel] Merge dist attrs of Python into C++
      
      * [Auto Parallel] Add back deleted importing
      
      * [Auto Parallel] Add back removed unittest
      
      * [Auto Parallel] Remove type qualifiers of return types
      
      * [Auto Parallel] Fix some bugs
      
      * [Auto Parallel] Fix a bug of the quant pass
      
      * [Auto Parallel] Fix the code style
      c7899074