1. 19 5月, 2022 3 次提交
    • C
      [CompileOpt] Refine enforce code and remove boost/variant include (#41093) · ca359fec
      Chen Weihang 提交于
      * refine enforce code
      
      * refine enforce code
      
      * fix compile failed
      
      * fix infrt failed
      ca359fec
    • S
      distribute label evenly among partitions in graph engine (#42846) · 68babef1
      seemingwang 提交于
      * enable graph-engine to return all id
      
      * change vector's dimension
      
      * change vector's dimension
      
      * enlarge returned ids dimensions
      
      * add actual_val
      
      * change vlog
      
      * fix bug
      
      * bug fix
      
      * bug fix
      
      * fix display test
      
      * singleton of gpu_graph_wrapper
      
      * change sample result's structure to fit training
      
      * recover sample code
      
      * fix
      
      * secondary sample
      
      * add graph partition
      
      * fix pybind
      
      * optimize buffer allocation
      
      * fix node transfer problem
      
      * remove log
      
      * support 32G+ graph on single gpu
      
      * remove logs
      
      * fix
      
      * fix
      
      * fix cpu query
      
      * display info
      
      * remove log
      
      * remove empyt file
      
      * distribute labeled data evenly in graph engine
      Co-authored-by: NDesmonDay <908660116@qq.com>
      68babef1
    • S
      [TensorRT] Support yolov5s (#42688) · a7778930
      shentanyue 提交于
      * support yolov5s static/int8
      
      * fix eltwise_sub and div weight compute
      
      * fix delete_fill_constant_pass
      a7778930
  2. 18 5月, 2022 7 次提交
  3. 17 5月, 2022 8 次提交
  4. 16 5月, 2022 11 次提交
  5. 13 5月, 2022 5 次提交
  6. 12 5月, 2022 6 次提交
    • S
      Fix some typos in paddle/. (#42408) · 2012672c
      Shuangchi He 提交于
      2012672c
    • F
      Xpups dev (#42692) · 272b7f1c
      Fan Zhang 提交于
      * Adapt XPUPS - 1st version - 3.24
      
      * Adapt XPUPS - update XPU PushSparse -  2nd version - 3.24
      
      * Adapt XPUPS - add XPU PullSparseOp - 3nd version - 3.25
      
      * refactor heter comm kernel
      
      * update. test=develop
      
      * Adapt XPUPS - modify by compilation - 4th version - 3.27
      
      * update calc_shard_offset. test=develop
      
      * update xpu kernel. test=develop
      
      * update args of calc_shard_offset
      
      * update. test=develop
      
      * remove customGradMerger
      
      * update. test=develop
      
      * heter_comm update
      
      * heter_comm update
      
      * update calc_shard_offset. test=develop
      
      * heter_comm update
      
      * update args of calc_shard_offset
      
      * update. test=develop
      
      * remove customGradMerger
      
      * update. test=develop
      
      * fix. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update optimizer kernel
      
      * Adapt XPUPS - use WITH_XPU_KP and modify wrapper kernel function - 5th version - 3.30
      
      * update. test=develop
      
      * update pslib.cmake
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * Adapt XPUPS - modify by kp compilation  - 6th version - 3.30
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update optimizer kernel
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * used by minxu
      
      * update heter_comm_inl
      
      * fix. test=develop
      
      * Adapt XPUPS - modify by kp compilation  - 7th version - 3.30
      
      * fix. test=develop
      
      * add optimizer kernel. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 3.31 update
      
      * Adapt XPUPS - update kp compilation path  - 8th version - 3.31
      
      * add optimizer kernel. test=develop
      
      * fix kunlun not support size_t. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix kunlun not support size_t. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update heter_comm_kernel.kps 3.31
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update heter_comm_kernel.kps 3.31
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update heter_comm.h 3.31
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update hashtable. test=develop
      
      * update. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 9th version - 4.1
      
      * update hashtable. test=develop
      
      * fix. test=develop
      
      * update hashtable 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 10th version - 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update. test=develop
      
      * modify by compilation 4.1
      
      * update. test=develop
      
      * update. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.1
      
      * update. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.1 19:30
      
      * fix. test=develop
      
      * update ps_gpu_wrapper.kps 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 11th version - 4.1
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 12nd version - 4.2
      
      * fix. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.2
      
      * 4.2 update
      
      * fix. test=develop
      
      * template init. test=develop
      
      * update 4.6
      
      * fix. test=develop
      
      * template init. test=develop
      
      * 4.6 modify by compilation
      
      * hashtable template init. test=develop
      
      * hashtable template init. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=devlop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=devlop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 13nd version - 4.7
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 4.11 update
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 4.11 update
      
      * update by pre-commit
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 4.12 update
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 14th version - 4.13
      
      * 4.13 update
      
      * 4.14 update
      
      * 4.14 update
      
      * 4.14 update
      
      * 4.14 modify by merged latest compilation
      
      * retry CI 4.14
      
      * 4.15 pass static check
      
      * 4.15 modify by gpups CI
      
      * 3.16 update by gpups CI - modify ps_gpu_wrapper.h
      
      * 4.16 update
      
      * 4.16 pass xpu compile
      
      * 4.16 retry CI
      
      * 4.16 update
      
      * Adapt XPUPS - adapt BKCL comm for XPUPS - 4.24
      
      * update by compilation
      
      * Adapt XPUPS - register PSGPUTrainer for XPUPS - 4.25
      
      * update device_worker_factory
      
      * Adapt XPUPS - split heter_ps into .cu and .cc - 4.27
      
      * Adapt XPUPS - register pull_box_sparse op under XPU_KP - 4.28
      
      * update
      
      * 5.7 modify ps_gpu_wrapper pull_sparse
      
      * 5.11 update ps_gpu_wrapper CopyKeysKernel
      Co-authored-by: Nzmxdream <zhangminxu01@baidu.com>
      272b7f1c
    • F
      [MLU] fix cnnl error when index is 2D (#42669) · 190cf44f
      fwenguang 提交于
      190cf44f
    • S
      Add cinn pass to program (#42623) · 9ac736c2
      sneaxiy 提交于
      * add cinn pass to program
      
      * remove build_cinn_pass ut
      
      * polish ut, add ut
      
      * guard ut with is_compiled_with_cinn
      
      * enable ut test_build_cinn_pass_resnet
      9ac736c2
    • z8hanghuan's avatar
      add xpu buffer_reader, *test=kunlun (#42578) · cc343a41
      z8hanghuan 提交于
      * add xpu buffer_reader, *test=kunlun
      
      * xpu buffer_reader, use XPUDeviceGuard, *test=kunlun
      
      * modify xpu.cmake, *test=kunlun
      
      * modify xpu.cmake, *test=kunlun
      
      * modify xpu.cmake, *test=kunlun
      
      * add xpu buffer_reader, *test=kunlun
      
      * add xpu buffer reader, *test=kunlun
      
      * add xpu buffer reader, *test=kunlun
      cc343a41
    • J
      Speed up sr accumulation (#42658) · 43d70bcc
      Jiabin Yang 提交于
      * Support Gradient Accumulation for sr
      
      * add ut
      
      * change ut to fit small vector
      
      * speed up accumulation
      43d70bcc