1. 16 May, 2022 · 8 commits
  2. 13 May, 2022 · 5 commits
  3. 12 May, 2022 · 10 commits
    • Fix some typos in paddle/. (#42408) · 2012672c
      Committed by Shuangchi He
    • Xpups dev (#42692) · 272b7f1c
      Committed by Fan Zhang
      * Adapt XPUPS - 1st version - 3.24
      
      * Adapt XPUPS - update XPU PushSparse -  2nd version - 3.24
      
      * Adapt XPUPS - add XPU PullSparseOp - 3rd version - 3.25
      
      * refactor heter comm kernel
      
      * update. test=develop
      
      * Adapt XPUPS - modify by compilation - 4th version - 3.27
      
      * update calc_shard_offset. test=develop
      
      * update xpu kernel. test=develop
      
      * update args of calc_shard_offset
      
      * update. test=develop
      
      * remove customGradMerger
      
      * update. test=develop
      
      * heter_comm update
      
      * heter_comm update
      
      * update calc_shard_offset. test=develop
      
      * heter_comm update
      
      * update args of calc_shard_offset
      
      * update. test=develop
      
      * remove customGradMerger
      
      * update. test=develop
      
      * fix. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update optimizer kernel
      
      * Adapt XPUPS - use WITH_XPU_KP and modify wrapper kernel function - 5th version - 3.30
      
      * update. test=develop
      
      * update pslib.cmake
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * Adapt XPUPS - modify by kp compilation  - 6th version - 3.30
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update optimizer kernel
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * used by minxu
      
      * update heter_comm_inl
      
      * fix. test=develop
      
      * Adapt XPUPS - modify by kp compilation  - 7th version - 3.30
      
      * fix. test=develop
      
      * add optimizer kernel. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 3.31 update
      
      * Adapt XPUPS - update kp compilation path  - 8th version - 3.31
      
      * add optimizer kernel. test=develop
      
      * fix kunlun not support size_t. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix kunlun not support size_t. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update heter_comm_kernel.kps 3.31
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update heter_comm_kernel.kps 3.31
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update heter_comm.h 3.31
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update hashtable. test=develop
      
      * update. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 9th version - 4.1
      
      * update hashtable. test=develop
      
      * fix. test=develop
      
      * update hashtable 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 10th version - 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update. test=develop
      
      * modify by compilation 4.1
      
      * update. test=develop
      
      * update. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.1
      
      * update. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.1 19:30
      
      * fix. test=develop
      
      * update ps_gpu_wrapper.kps 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 11th version - 4.1
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 12th version - 4.2
      
      * fix. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.2
      
      * 4.2 update
      
      * fix. test=develop
      
      * template init. test=develop
      
      * update 4.6
      
      * fix. test=develop
      
      * template init. test=develop
      
      * 4.6 modify by compilation
      
      * hashtable template init. test=develop
      
      * hashtable template init. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 13th version - 4.7
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 4.11 update
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 4.11 update
      
      * update by pre-commit
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 4.12 update
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 14th version - 4.13
      
      * 4.13 update
      
      * 4.14 update
      
      * 4.14 update
      
      * 4.14 update
      
      * 4.14 modify by merged latest compilation
      
      * retry CI 4.14
      
      * 4.15 pass static check
      
      * 4.15 modify by gpups CI
      
      * 3.16 update by gpups CI - modify ps_gpu_wrapper.h
      
      * 4.16 update
      
      * 4.16 pass xpu compile
      
      * 4.16 retry CI
      
      * 4.16 update
      
      * Adapt XPUPS - adapt BKCL comm for XPUPS - 4.24
      
      * update by compilation
      
      * Adapt XPUPS - register PSGPUTrainer for XPUPS - 4.25
      
      * update device_worker_factory
      
      * Adapt XPUPS - split heter_ps into .cu and .cc - 4.27
      
      * Adapt XPUPS - register pull_box_sparse op under XPU_KP - 4.28
      
      * update
      
      * 5.7 modify ps_gpu_wrapper pull_sparse
      
      * 5.11 update ps_gpu_wrapper CopyKeysKernel
      Co-authored-by: zmxdream <zhangminxu01@baidu.com>
    • [MLU] fix cnnl error when index is 2D (#42669) · 190cf44f
      Committed by fwenguang
    • Add cinn pass to program (#42623) · 9ac736c2
      Committed by sneaxiy
      * add cinn pass to program
      
      * remove build_cinn_pass ut
      
      * polish ut, add ut
      
      * guard ut with is_compiled_with_cinn
      
      * enable ut test_build_cinn_pass_resnet
    • add xpu buffer_reader, *test=kunlun (#42578) · cc343a41
      Committed by z8hanghuan
      * add xpu buffer_reader, *test=kunlun
      
      * xpu buffer_reader, use XPUDeviceGuard, *test=kunlun
      
      * modify xpu.cmake, *test=kunlun
      
      * modify xpu.cmake, *test=kunlun
      
      * modify xpu.cmake, *test=kunlun
      
      * add xpu buffer_reader, *test=kunlun
      
      * add xpu buffer reader, *test=kunlun
      
      * add xpu buffer reader, *test=kunlun
    • Speed up sr accumulation (#42658) · 43d70bcc
      Committed by Jiabin Yang
      * Support Gradient Accumulation for sr
      
      * add ut
      
      * change ut to fit small vector
      
      * speed up accumulation
    • [Eager] Remove full reserved strategy (#42690) · 35c7c835
      Committed by Jiabin Yang
      * remove full reserved strategy
      
      * fix inplace error
    • add exp,log trt converter (#42655) · 6e90ba1b
      Committed by zhupengyang
    • [Paddle-Inference] support transformer generation: some passes (#42664) · 5914b18a
      Committed by Wangzheee
      * [Paddle-Inference] support transformer generation: some passes
    • [MLU] add slice kernel (#42245) · ddb3868e
      Committed by fwenguang
  4. 11 May, 2022 · 10 commits
  5. 10 May, 2022 · 7 commits
    • [CustomDevice] add inference support (#42036) · 02e5c4be
      Committed by ronnywang
    • Re-add conv_affine_channel fuse pass as oneDNN only pass (#41998) · 3540d33b
      Committed by piotrekobi
      * Readd conv_affine_channel fuse pass as mkldnn pass
      
      * Fix formatting
      
      * Add new test to parallel_UT_rule.py
      
      * Fix Coverage and Windows CI issues
      
      * Revert "Fix Coverage and Windows CI issues"
      
      This reverts commit f33459846385c9fd51c07f9f44e7ff283a652637.
      
      * Fix CI errors
      
      * Remove unnecessary conv_eltwise_add_affine_channel fuse pass
      
      * Remove test from parallel_UT_rule.py
    • add fp16 for reshape op on kunlun2, *test=kunlun (#42605) · 754edf6e
      Committed by TTerror
    • fix switch client multithread bug (#42600) · e2540c17
      Committed by ziyoujiyi
      * back fl
      
      * delete ssl cert
      
      * .
      
      * make warning
      
      * .
      
      * unittest paral degree
      
      * solve unittest
      
      * heter & multi cloud comm ready
      
      * .
      
      * .
      
      * arm_brpc compile
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * only output is ok
      
      * base is ok
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * add switch server bin
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * adapt brpc ssl
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * .
      
      * fix heter_server & heter_client
      
      * .
      
      * .
      
      * int->int64_t
      
      * .
      
      * safe map in multithread
      
      * fix heter unittest
      
      * .
      
      * fix code_style
      
      * .
      
      * fix bug
      
      * .
    • shape mkldnn kernel adapted to NHWC (#42548) · d47690b2
      Committed by Jacek Czaja
      * - shape mkldnn adapted to NHWC
      
      - NHWC shape mkldnn ut
      
      - fixes to UT
      
      - Fix to UT
      
      - Fixes to UT
      
      - Fix of compilation
      
      * - lint candidate fix
    • [Video detection] Added fill_constant FP32 FWD oneDNN kernel (#37216) · 66a10f36
      Committed by jakpiase
      * added fill_constant kernel
      
      * CI fix
      
      * ci fix
      
      * switched from nan to zero memory
      
      * CI FIX
      
      * ci fixes
      
      * CI rerun
      
      * ci fix
      
      * minor change
      
      * CI rerun
    • [PaddlePaddle Hackathon 2] No. 18: Add paddle.heaviside and paddle.Tensor.heaviside APIs to Paddle (#41872) · 4892d592
      Committed by BrilliantYuKaimin
      * Create elementwise_heaviside_op.cc
      
      * add ElementwiseHeavisideFunctor
      
      * Create test_elementwise_heaviside_op.py
      
      * Add the Python interface for heaviside
      
      * add heaviside in white list
      
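      * Add the signature for heaviside
      
      * Add the kernel function for heaviside
      
      * Add the kernel function for the heaviside gradient
      
      * Register the heaviside gradient
      
      * Adjust code formatting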
      
      * Update elementwise_sig.cc
      
      * add heaviside in __all__
      
      * Update heaviside docs
      
      * Update math.py
      
      * Update math.py
      
      * Update math.py