• W
    Refine stream index getter (#4349) · 20029059
    Wang Tuo 提交于
    * XXId structs and IdUtil
    
    * rm useless header
    
    * update id_util by discuss
    
    * update generate common thrd id and independent thrd id by IdUtil api
    
    * minor update
    
    * use IdUtil to generate task id in UpdateTaskId
    
    * Global<IdUtil>
    
    * emplace CommNetThrdId and TickTockThrdId call
    
    * implement IDMgr MemZoneId related api with IdUtil MemZoneId api
    
    * add GenerateChainId api
    
    * replace IDMgr api with IdUtil
    
    * rm useless header
    
    * revert IDMgr mem_zone_id api
    
    * rm redefinition of GetGpuPhyIdFromMemZoneId
    
    * modify by review comment
    
    * safety modification
    
    * def TaskType hash function
    
    * XXId structs and IdUtil
    
    * rm useless header
    
    * update id_util by discuss
    
    * update generate common thrd id and independent thrd id by IdUtil api
    
    * minor update
    
    * use IdUtil to generate task id in UpdateTaskId
    
    * Global<IdUtil>
    
    * emplace CommNetThrdId and TickTockThrdId call
    
    * implement IDMgr MemZoneId related api with IdUtil MemZoneId api
    
    * add GenerateChainId api
    
    * replace IDMgr api with IdUtil
    
    * rm useless header
    
    * revert IDMgr mem_zone_id api
    
    * rm redefinition of GetGpuPhyIdFromMemZoneId
    
    * modify by review comment
    
    * safety modification
    
    * def TaskType hash function
    
    * rm old test
    
    * fix by self review
    
    * change name
    
    * fix typo and enhance error info
    
    * refactor thread manager
    
    * more check
    
    * rm AllocateCpuThrdIdEvenly
    
    * refactor StreamId and rm IdUtil
    
    * stream index generator
    
    * modify by review
    
    * update stream index
    
    * update id util
    
    * update comm net task node
    
    * add TaskIdGenerator
    
    * update task id generation
    
    * replace gen thrd_in in logical node
    
    * replace GetGpuComputeThrdId in boxing sub task graph builder
    
    * replace h2d and d2h thrd_id in CopyHdTaskNode
    
    * replace h2d and d2h thrd_id in SliceBoxingSubTskGphBuilder
    
    * update id_util header
    
    * CHECK NOTNULL stream index generator
    
    * add chain_id_generator
    
    * rm IdUtil Glabol New
    
    * rm stream type in thread manager
    
    * CHECK_NOTNULL stream_index_generator in logical node
    
    * update id manager
    
    * update id_util
    
    * fix compile errors
    
    * tidy code
    
    * tidy code
    
    * revert format
    
    * mv std::hash<TaskType> to task_node.h
    
    * use unique_ptr to manage thread
    
    * fix typo
    
    * format
    
    * modify by review
    
    * start up
    
    * rm chain id generator
    
    * move id serialization to independent implementation
    
    * rm useless friend
    
    * fix compile error under gcc 4.8
    
    * rm IsXxxStreamIndex
    
    * rm deprecated api in IDMgr
    
    * fix bug in CPUStreamIndexGenerator::GenerateComputeStreamIndex
    
    * refine id structs
    
    * refine id struct serialization
    
    * refine task id generator
    
    * refine StreamIndexGeneratorManager
    
    * refine copy task node
    
    * refine collective boxing sub task graph builder
    
    * refine slice boxing sub task graph builder
    
    * refine naive b2p sub task graph builder
    
    * refine logical node
    
    * refine id manager
    
    * refine thread manager
    
    * rm useless comment
    
    * remove magic number
    
    * revise header to be compatible with cpu-only compilation
    
    * more readable
    
    * fix bug
    
    * refine code
    
    * use HashCombine
    
    * replace type of bit shift const value with size_t
    
    * add testcase for fake dev
    
    * refactor mem_zone_id
    
    * reformat
    
    * add fake device allocator/deallocator
    
    * task_node InitProducedRegstMemCase add fakedev
    
    * Add stream_index_getter
    
    * format and fix tick tock task type
    Signed-off-by: Ndaquexian <daquexian566@gmail.com>
    
    * skip fake device test for now
    Signed-off-by: Ndaquexian <daquexian566@gmail.com>
    
    * refine Memcpy for fake dev
    
    * update for debug
    Signed-off-by: Ndaquexian <daquexian566@gmail.com>
    
    * some update for fake device
    Signed-off-by: Ndaquexian <daquexian566@gmail.com>
    
    * remove debug code
    Signed-off-by: Ndaquexian <daquexian566@gmail.com>
    
    * reg for fake device creating thread
    
    * minor fix
    
    * format
    
    * refine stream index getter
    Signed-off-by: Ndaquexian <daquexian566@gmail.com>
    
    * for debug
    
    * refine stream_index_getter
    
    * fix the code
    
    * delete fakedev unit test script
    
    * delete the code which is no relationship with stream_index_getter
    
    * delete test_tmp_dir
    
    * fix format
    
    * move xxx_compute_task_node.h from folder graph_impl to folder graph
    Co-authored-by: Nleaves-zwx <kunta0932@gmail.com>
    Co-authored-by: Nyaochi <later@usopp.net>
    Co-authored-by: NLdpe2G <liangdepeng@gmail.com>
    Co-authored-by: Ndaquexian <daquexian566@gmail.com>
    Co-authored-by: Noneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
    20029059
task_graph.cpp 31.7 KB