1. 27 Jul, 2018 1 commit
  2. 25 Jul, 2018 2 commits
    • C
      add jpeg encoder (#1052) · 098db839
      chengtbf committed
    • S
      Chain act graph (#1019) · 14698b5b
      strickland12 committed
      * DFS ChainActGraph
      
      * provide interface to improver
      
      * Add RegstAct
      
      * rename to chain_act_graph
      
      * add new file
      
      * Add ForEachRegstActDuration
      
      * use RegstActCtx
      
      * use constructor
      
      * use pair instead of node_str
      
      * Construct RegstAct together
      
      * change ParseActEvents to std::list<std::unique_ptr<ActEvent>>
      
      * fake_producer_outs
      
      * use Duration4ActEvent
      
      * add CHECK
      
      * delete act_graph.cpp
      
      * use node->ForEach
      
      * add CHECK
      
      * remove false CHECK
  3. 24 Jul, 2018 1 commit
    • J
      optimize chain merge with bitset (#1051) · 21a46ee9
      Jinhui Yuan committed
      * optimize chain merge with bitset
      
      * fix task_uid_cnt_ initialization
      
      * parallel chain merging
      
      * refine
      
      * extract MergeTaskNodes
      
      * shrink the size of bitset with local task_uid
  4. 23 Jul, 2018 2 commits
  5. 22 Jul, 2018 4 commits
  6. 21 Jul, 2018 2 commits
    • L
      Dev check oom for experi (#1039) · 87ff1688
      Li Xinqi committed
      * check oom for experimental phase
      
      * refine code
    • S
      Dev bw activation (#1015) · 7aa76aaa
      ShawnXuan committed
      * initial attempt of removing backward add
      
      * workable
      
      * refine
      
      * add backward_activation for KernelConf
      
      * add 3 functions to support move backward activation.
      
      * simplify 3 functions to 1
      
      * add backward_activation in operator
      
      * add GetBackwardActivationType, refine GetActivationType.
      
      * add AfterBackwardActivation in KernelIf
      
      * SetEnumValue support in protobuf.h
      set activation func in operator
      
      * bug fix: set cur_node activation after pre_node
      
      * check cur_node op_vec size
      
      * set bw_node_ nullptr for add_fw_node
      
      * pre_node is in backward area or has loss op
      
      * AfterBackwardActivation -> PostBackwardActivation
      
      * remove activation blob in kernel
      
      * keep removing activation blob ...
      
      * post backward activation for loss kernel
      
      * Retrieve BuildAccuracyPrintStruct
      
      * modify post backward activation
      
      * resolve code review issues
      
      * refine
      
      * refine AddOneBackwardClone
      
      * refine RemoveOneBackwardAdd
      
      * add forward_activation for kernel proto
      
      * add ibn blob for bw clone task node
      
      * rm NeedOutWhenBackward
      
      * remove
  7. 19 Jul, 2018 2 commits
  8. 18 Jul, 2018 1 commit
  9. 17 Jul, 2018 3 commits
  10. 16 Jul, 2018 5 commits
    • L
      kCudaMemAllocAlignSize (#1001) · ad3f5d65
      Li Xinqi committed
    • L
      MinVal<T> (#1023) · 4faea69b
      Li Xinqi committed
    • J
      Tune add op (#1022) · 57689ae2
      Jinhui Yuan committed
      * optimize the add op
      
      * remove useless code
    • N
      feat: Add InputWiseActor for ReduceGlobalAdd and ReduceGather (#1012) · 0ffc781c
      Niu Chong committed
      * feat: avoid net contention by adding ctrl edge in ReduceStruct
      
      * refine(task_graph.h/cpp): refine AddCtrlEdgeInReduceStruct()
      
      * fix(graph/task_graph.cpp): fix the bug of machine order
      
      * fix(graph/task_graph.cpp): do not add ctrl edge with reduce scatter
      
      * feat: add ReduceGlobalAddCompActor
      
      * fix: fix the bug of reduce_global_actor/kernel
      
      * chore: remove used vim .swp file
      
      * fix(graph/task_graph.cpp): fix the bug of sorting copycomment when build reduce ctrl edge
      
      * fix(graph/task_graph.h/cpp): add CtrlEdge for ReduceGather
      
      * feat: revert add ctrl edge in reduce struct from this PR
      
      * refactor: rename ReduceGlobalAddCompActor to InputWiseCompActor for scalability
      
      * fix(kernel/reduce_global_add_kernel.cpp): use Memcpy rather than Memset for first blob to be added
      
      * refactor(actor/input_wise_compute_actor.*): use HashMap and counter instead of HashSet for processed regst_desc
      
      * refactor: let ReduceGlobalAddCompActor inherit InputWiseCompActor
      
      * feature: add ReduceGatherCompActor that inherits InputWiseCompActor
      
      * fix(reduce_gather_kernel.cpp): add missing break
      
      * refactor: replace regst_desc_id2bn_in_op_ with regst_desc_id2in_bn_id_ in InputWiseCompActor
      
      * fix(reduce_global_add_kernel): remove useless class member parallel_id_
      
      * refactor: make ReduceLocalAdd kernel support inputwise, rename ReduceGlobalAddActor to ReduceAddActor for scalability
    • J
      9a010dae
  11. 15 Jul, 2018 4 commits
  12. 13 Jul, 2018 3 commits
    • N
      feat: refactor reduce struct to control network order, avoiding network contention (#994) · 6d9663c7
      Niu Chong committed
      * feat: avoid net contention by adding ctrl edge in ReduceStruct
      
      * refine(task_graph.h/cpp): refine AddCtrlEdgeInReduceStruct()
      
      * fix(graph/task_graph.cpp): fix the bug of machine order
      
      * fix(graph/task_graph.cpp): do not add ctrl edge with reduce scatter
      
      * feat: add ReduceGlobalAddCompActor
      
      * fix: fix the bug of reduce_global_actor/kernel
      
      * chore: remove used vim .swp file
      
      * fix(graph/task_graph.cpp): fix the bug of sorting copycomment when build reduce ctrl edge
      
      * fix(graph/task_graph.h/cpp): add CtrlEdge for ReduceGather
      
      * revert: remove the ReduceGlobalAddCompActor from this PR
      
      * feat: add use_ordered_allreduce_in_mdupdt in OtherConf
    • J
      refactor comm_network (#975) · ea965893
      Jinhui Yuan committed
      * refactor comm_network
      
      * refine
      
      * AddWorkToStream
      
      * refine
      
      * add CHECK
      
      * refine
    • S
      alexnet, resnet and inception v3 generator (#993) · a68583b3
      ShawnXuan committed
      * alexnet, resnet and inception v3 generator
      
      * move net gen apps to tools
  13. 12 Jul, 2018 2 commits
  14. 11 Jul, 2018 5 commits
  15. 10 Jul, 2018 2 commits
  16. 08 Jul, 2018 1 commit