1. 05 Jul, 2023 1 commit
  2. 19 Jun, 2023 1 commit
  3. 16 Jun, 2023 1 commit
    • Run plan in standalone executor (#54394) · 752670e2
      Committed by Ruibiao Chen
      * Run plan in standalone executor
      
      * Update codes
      
      * Update atol and rtol for py3-CI
      
      * Add scope to cache key
      
      * Fix CI errors
      
      * Fix code style
      
      * Update codes
      
      * Remove fetch_name in standalone executor
      
      * Fix UT
      
      * Update codes
      
      * Fix new IR bug
  4. 15 Jun, 2023 1 commit
    • [IR] [Baby step] New interpreter support new ir (#54570) · ce0c5c27
      Committed by hong
      * add kernel dialect
      
      * change DenseTensorTypeStorage to DenseTensorType
      
      * add test case
      
      * add first pd_op to kernel dialect
      
      * lower pd op to kernel dialect
      
      * update
      
      * update
      
      * remove useless code
      
      * add attribute print test
      
      * fix bug
      
      * update
      
      * update
      
      * update
      
      * update
      
      * polish code
      
      * fix bug
      
      * polish code and add python test
      
      * add test
      
      * fix test error
      
      * add env flag
      
      * fix bug
      
      * revert test env
      
      * change cc_test_old to cc_test
      
      * fix build_static bug
      
      * fix type test error
      
      * update cmake
      
      * disable test in windows
      
      * fix inference compile
  5. 08 Jun, 2023 1 commit
  6. 31 May, 2023 1 commit
  7. 30 May, 2023 1 commit
  8. 10 Apr, 2023 1 commit
  9. 02 Mar, 2023 1 commit
    • New executor static build for fluid kernel (#50670) · bf50784c
      Committed by Ruibiao Chen
      * Check structured kernel for new executor static build
      
      * Update code
      
      * Ready for resnet50
      
      * Move transfer_dtype to phi
      
      * Ready for transformer
      
      * Fix CI errors
      
      * Fix layer_norm InferMeta
      
      * Remove layer_norm infermeta fix
  10. 16 Feb, 2023 1 commit
  11. 30 Jan, 2023 1 commit
  12. 17 Jan, 2023 1 commit
    • support CUDA Graph for new executor (#49708) · 8e5ed04d
      Committed by pangyoki
      * new exe supports CUDA Graph
      
      * fix
      
      * fix
      
      * fix
      
      * fix FLAGS_use_stream_safe_cuda_allocator in unittest
      
      * insert output of coalesce_tensor op to skip_gc_var
      
      * fix
  13. 28 Dec, 2022 1 commit
    • [new-exec] Ahead-Of-Time choosing kernel (#48789) · 63d2d722
      Committed by Leo Chen
      * add skip run
      
      * alloc minimum memory
      
      * skip check_size in Alloc
      
      * skip check_size in Alloc
      
      * skip check_size in Alloc
      
      * fix cases when tensor is initialized or empty
      
      * alloc empty output for place info
      
      * add test
      
      * increase timeout
      
      * format code
      
      * skip cpu
      
      * add cudnn_deterministic
      
      * fit for hostAlloc
      
      * follow comments
      
      * change check_size to fake_alloc
  14. 27 Dec, 2022 2 commits
  15. 28 Nov, 2022 1 commit
  16. 26 Nov, 2022 1 commit
  17. 25 Nov, 2022 1 commit
    • Refactor stream analyzer (#48158) · 889318d8
      Committed by Ruibiao Chen
      * Move stream_analyzer to interpreter
      
      * Refactor StreamAnalyzer
      
      * Refactor RunNextInstructionList
      
      * Remove no_data_transform_index
      
      * Fix typos
      
      * Fix data_transfer OpFuncType error
      
      * Add event for depend_op
      
      * Update transfer OpFuncType for heter place
  18. 02 Nov, 2022 1 commit
  19. 31 Oct, 2022 1 commit
    • [ControlFlow] replace executor in run method of control flow ops with standalone_executor (#45696) · 3b219e5e
      Committed by kangguangli
      * replace executor in conditional_block_op.run with standalone_executor
      
      * add block_id as the argument of standalone executor's method run; add print for program
      
      * fix scope bug about conditional block op
      
      * fix bug: unnecessary return of fetch value
      
      * fix typo
      
      * fix: quantization will set variable persistable, and these variables must exist in global scope
      
      * add interpretercore cache for conditional block op but not activate in default
      
      * fix bug: local scope reuse for conditional block op
      
      * reset scope when conditional block op runs
      
      * fix typo
      
      * fix typo and code style
      
      * add build scope for conditional block op
      
      * add skip for transfer_layout kernel
      
      * refine code
      
      * fix reset_scope
      
      * fix reset_scope
      
      * refine code
      
      * refine code
      
      * refine code
      
      1. remove flag use in conditional_block_op
      2. pass execution_config to BuildOpFuncList instead of individual parameter
      
      * refine code
      
      * remove the use of FLAGS_control_flow_use_new_executor_cache
      
      * change FLAGS_control_flow_use_new_executor to false
  20. 19 Oct, 2022 1 commit
  21. 12 Oct, 2022 1 commit
  22. 11 Oct, 2022 1 commit
  23. 10 Oct, 2022 1 commit
  24. 23 Sep, 2022 1 commit
  25. 20 Sep, 2022 1 commit
  26. 29 Aug, 2022 1 commit
    • [new_exe] Dy2Static support new_executor (#44450) · aba1295b
      Committed by zhangbo9674
      * add interpretercore
      
      * refine backward program id
      
      * add code
      
      * refine program
      
      * refine code
      
      * create forward/backward_program by prog2graph2prog method
      
      * test, do not care
      
      * refine code
      
      * refine code
      
      * refine code
      
      * test, do not care
      
      * add interpretercore
      
      * add scope
      
      * refine scope create method
      
      * add jit for new_exe
      
      * solve conflict
      
      * delete unused code
      
      * polish code
      
      * polish code
      
      * refine scope in inplace
      
      * refine for datatransfer
      
      * refine _rebuild_from_desc
      
      * refine control eager deletion attr
      
      * refine used_for_jit
      
      * refine jit for infer
      
      * op size0 use ori program
      
      * polish code
      
      * refine jit
      
      * refine run_program_op ut
      
      * refine inplace
      
      * refine control
      
      * refine graph helper
      
      * refine control
      
      * refine inplace
      
      * refine buffer_share_inplace_pass
      
      * polish code
      
      * polish code
      
      * refine usage for compilerProgram
      
      * refine control
      
      * test
      
      * test core cache
      
      * refine code
      
      * refine io.py
      
      * increase test_seq2seq timeout
      
      * refine convert program
      
      * refine interpretercore_cache release
      
      * delete buildinplace
      
      * refine partial_program && io
      
      * refine code for io
      
      * test
      
      * test
      
      * test
  27. 04 Aug, 2022 1 commit
  28. 02 Aug, 2022 1 commit
  29. 29 Jun, 2022 1 commit
  30. 23 Jun, 2022 1 commit
  31. 16 Jun, 2022 1 commit
  32. 18 Apr, 2022 1 commit
  33. 22 Mar, 2022 1 commit
  34. 17 Feb, 2022 1 commit
  35. 28 Dec, 2021 2 commits
    • Utilize StreamSafeCUDAAllocator to support fast GC in new executor (#37642) · 0c7153a4
      Committed by From00
      * fix reshape move storage error
      
      * remove needless set type
      
      * alloc tensor by shared storage
      
      * Utilize StreamSafeCUDAAllocator to support fast GC in new executor
      
      * Fix compile error for Windows and ROCm
      
      * Fix compile error for Windows
      
      * Modify UT stream_safe_cuda_alloc_test
      
      * Modify UT stream_safe_cuda_alloc_test
      
      * Rewrite fast GC
      
      * Rewrite fast GC
      
      * Fix compile error for BOOST_GET_CONST
      
      * Fix compile error for BOOST_GET_CONST
      
      * Changes default stream for StreamSafeCUDAAllocator
      
      * Fix a small CI error
      
      * Remove some redundant code
      
      * Fix conflict
      
      * Fix compile error for ROCm
      
      * Fix Windows CI error
      
      * Fix CI error
      
      * Remove some unnecessary code
      
      * Fix CI error
      
      * Add UT for fast GC
      
      * Fix CI error
      
      * add device-agnostic stream class
      
      * add stream.h
      
      * fix ut
      
      * fix cpu compile
      
      * Use RWLock in GetAllocator
      
      * Fix CI error
      Co-authored-by: Chen Weihang <chenweihang@baidu.com>
      Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
    • [new-exec] add completion_nofifier (#38447) · 404a4a6a
      Committed by Leo Chen
      * add completion_nofifier
      
      * fix bug
      
      * unregister event waiter
  36. 23 Dec, 2021 1 commit
    • Upgrade work queue (#38335) · 198d11be
      Committed by liutiexing
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * update EventsWaiter
      
      * fix
      
      * split workqueue files
      
      * add more tests
      
      * fix
      
      * bugfix
      
      * bugfix
      
      * update
      Co-authored-by: liutiexing <liutiexing@google.com>
  37. 26 Nov, 2021 1 commit
  38. 23 Nov, 2021 1 commit