1. 04 7月, 2022 1 次提交
  2. 02 7月, 2022 2 次提交
    • L
      unify cpu context, part2 (#44012) · 755438a7
      Leo Chen 提交于
      * fix init()
      
      * delete test_device_context
      
      * replace CPUDeviceContext with CPUContext
      
      * fix test_scalar
      
      * remove dot_op.cc
      
      * fix compile
      755438a7
    • L
      unify cpu context (#43989) · 09096aeb
      Leo Chen 提交于
      * unify cpu context
      
      * fix init()
      
      * delete test_device_context
      
      * fix test_scalar
      09096aeb
  3. 30 6月, 2022 1 次提交
  4. 29 6月, 2022 1 次提交
  5. 28 6月, 2022 3 次提交
  6. 26 6月, 2022 1 次提交
  7. 24 6月, 2022 3 次提交
  8. 23 6月, 2022 1 次提交
  9. 21 6月, 2022 2 次提交
    • S
      Generalize conv+activation fuse pass (#43382) · 347e4b2e
      Sławomir Siwek 提交于
      * consolidate conv act passes
      
      * generalize conv_activation
      
      * integrate conv+act tests
      
      * code style format
      
      * whitespaces
      
      * remove timeout from old tests
      
      * implement comments from review
      
      * restore ut
      
      * whitespace
      
      * code style
      
      * transpose
      
      * fixes after review
      
      * method for gettin act
      
      * Change Paddle_enforce error type
      
      * code format
      
      * add missing opcompats
      347e4b2e
    • A
      [IPU] add activation ops (#43662) · 2353db3a
      Allen Guo 提交于
      * add argmin and argsort ops (#800)
      
      * add argmin and arsort ops
      
      * Add dot bmm ops (#803)
      
      * add bmm
      
      * add dot op
      
      * clean CreateConst
      
      * clean CreateCast
      
      * add activation ops (#808)
      
      * add activation ops
      
      * fix 1function-redefined error
      2353db3a
  10. 18 6月, 2022 1 次提交
  11. 17 6月, 2022 3 次提交
  12. 16 6月, 2022 1 次提交
  13. 15 6月, 2022 2 次提交
  14. 14 6月, 2022 1 次提交
  15. 13 6月, 2022 2 次提交
  16. 10 6月, 2022 3 次提交
  17. 09 6月, 2022 2 次提交
  18. 08 6月, 2022 1 次提交
  19. 07 6月, 2022 3 次提交
  20. 05 6月, 2022 1 次提交
  21. 04 6月, 2022 1 次提交
  22. 02 6月, 2022 4 次提交
    • L
      Add generate_proposals_v2 op and expend function of gather op for kunlun. *test=kunlun (#43162) · ff22a9c4
      Leo Guo 提交于
      * Add generate_proposals_v2 op and unittest for kunlun. *test=kunlun
      
      * Add the assign op to xpu2_op_list and expand the function of gather op. Add the unit-test of generate_proposals_v2. *test=kunlun
      ff22a9c4
    • S
      Fix bug of CUDAGraph kernel parameter comparation (#43163) · 3fcfcd51
      sneaxiy 提交于
      * fix cuda graph sizeof
      
      * fix tuple type
      3fcfcd51
    • F
      [XPUPS] modify BKCL comm op register (#43028) · 1bfbcfaf
      Fan Zhang 提交于
      * Adapt XPUPS - 1st version - 3.24
      
      * Adapt XPUPS - update XPU PushSparse -  2nd version - 3.24
      
      * Adapt XPUPS - add XPU PullSparseOp - 3nd version - 3.25
      
      * refactor heter comm kernel
      
      * update. test=develop
      
      * Adapt XPUPS - modify by compilation - 4th version - 3.27
      
      * update calc_shard_offset. test=develop
      
      * update xpu kernel. test=develop
      
      * update args of calc_shard_offset
      
      * update. test=develop
      
      * remove customGradMerger
      
      * update. test=develop
      
      * heter_comm update
      
      * heter_comm update
      
      * update calc_shard_offset. test=develop
      
      * heter_comm update
      
      * update args of calc_shard_offset
      
      * update. test=develop
      
      * remove customGradMerger
      
      * update. test=develop
      
      * fix. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update optimizer kernel
      
      * Adapt XPUPS - use WITH_XPU_KP and modify wrapper kernel function - 5th version - 3.30
      
      * update. test=develop
      
      * update pslib.cmake
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * Adapt XPUPS - modify by kp compilation  - 6th version - 3.30
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update optimizer kernel
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * used by minxu
      
      * update heter_comm_inl
      
      * fix. test=develop
      
      * Adapt XPUPS - modify by kp compilation  - 7th version - 3.30
      
      * fix. test=develop
      
      * add optimizer kernel. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 3.31 update
      
      * Adapt XPUPS - update kp compilation path  - 8th version - 3.31
      
      * add optimizer kernel. test=develop
      
      * fix kunlun not support size_t. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix kunlun not support size_t. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update heter_comm_kernel.kps 3.31
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update heter_comm_kernel.kps 3.31
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update heter_comm.h 3.31
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update hashtable. test=develop
      
      * update. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 9th version - 4.1
      
      * update hashtable. test=develop
      
      * fix. test=develop
      
      * update hashtable 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 10th version - 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * update. test=develop
      
      * modify by compilation 4.1
      
      * update. test=develop
      
      * update. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.1
      
      * update. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.1 19:30
      
      * fix. test=develop
      
      * update ps_gpu_wrapper.kps 4.1
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 11th version - 4.1
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 12nd version - 4.2
      
      * fix. test=develop
      
      * fix. test=develop
      
      * modify by compilation 4.2
      
      * 4.2 update
      
      * fix. test=develop
      
      * template init. test=develop
      
      * update 4.6
      
      * fix. test=develop
      
      * template init. test=develop
      
      * 4.6 modify by compilation
      
      * hashtable template init. test=develop
      
      * hashtable template init. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=devlop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=devlop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 13nd version - 4.7
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 4.11 update
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 4.11 update
      
      * update by pre-commit
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * 4.12 update
      
      * fix. test=develop
      
      * Adapt XPUPS - update by kp compilation  - 14th version - 4.13
      
      * 4.13 update
      
      * 4.14 update
      
      * 4.14 update
      
      * 4.14 update
      
      * 4.14 modify by merged latest compilation
      
      * retry CI 4.14
      
      * 4.15 pass static check
      
      * 4.15 modify by gpups CI
      
      * 3.16 update by gpups CI - modify ps_gpu_wrapper.h
      
      * 4.16 update
      
      * 4.16 pass xpu compile
      
      * 4.16 retry CI
      
      * 4.16 update
      
      * Adapt XPUPS - adapt BKCL comm for XPUPS - 4.24
      
      * update by compilation
      
      * Adapt XPUPS - register PSGPUTrainer for XPUPS - 4.25
      
      * update device_worker_factory
      
      * Adapt XPUPS - split heter_ps into .cu and .cc - 4.27
      
      * Adapt XPUPS - register pull_box_sparse op under XPU_KP - 4.28
      
      * update
      
      * 5.7 modify ps_gpu_wrapper pull_sparse
      
      * 5.11 update ps_gpu_wrapper CopyKeysKernel
      
      * 5.13 modify calc_shard_offset_kernel & fill_shard_key_kernel
      
      * modify fill_dvals_kernel & PullCopy & c_sync_calc_stream - 5.18
      
      * modify PushCopy & fill_shard_grads_kernel & register push_box_sparse - 5.19
      
      * Adapt XPUPS - modify BKCL comm op register - 5.26
      
      * Adapt XPUPS - modify BKCL comm op register - 5.27
      
      * Adapt XPUPS - modify BKCL comm op register - 5.27v2
      
      * Adapt XPUPS - modify BKCL comm op register - 5.27v3
      
      * Adapt XPUPS - modify c_comm_init_all_op to adapt BKCL init - 5.30
      
      * Adapt XPUPS - modify c_comm_init_all_op to adapt BKCL init v2 - 5.30
      
      * Adapt XPUPS - modify c_comm_init_all_op to adapt BKCL init v3 - 5.30
      
      * Adapt XPUPS - modify c_comm_init_all_op to adapt BKCL init v4 - 5.31
      Co-authored-by: Nzmxdream <zhangminxu01@baidu.com>
      1bfbcfaf
    • S
      Support CUDA Graph for partial graph in dygraph mode (#42786) · d05b940a
      sneaxiy 提交于
      * support CUDAGraph for partial graph
      
      * add ut
      
      * fix ci
      
      * fix ut again because of eager mode
      
      * fix kunlun ci
      
      * fix win ci
      d05b940a