1. 24 7月, 2018 3 次提交
  2. 23 7月, 2018 1 次提交
    • X
      lazy · 5e085f69
      Xinqi 提交于
      
      Former-commit-id: b03f9869f1305ba08920866b3231b319e7aff093
      5e085f69
  3. 22 7月, 2018 1 次提交
  4. 21 7月, 2018 6 次提交
  5. 20 7月, 2018 1 次提交
    • X
      lazy_blob · 38ae6eaf
      Xinqi 提交于
      
      Former-commit-id: e6bf6a3d3bcc701c96616b8e6e4d3f60611daaff
      38ae6eaf
  6. 18 7月, 2018 3 次提交
  7. 17 7月, 2018 3 次提交
  8. 16 7月, 2018 5 次提交
    • L
      kCudaMemAllocAlignSize (#1001) · 245b26b5
      Li Xinqi 提交于
      
      
      Former-commit-id: ad3f5d65
      245b26b5
    • L
      MinVal<T> (#1023) · c538bb68
      Li Xinqi 提交于
      
      
      Former-commit-id: 4faea69b
      c538bb68
    • J
      Tune add op (#1022) · 6a3ea699
      Jinhui Yuan 提交于
      * optimize the add op
      
      * remove useless code
      
      
      Former-commit-id: 57689ae2
      6a3ea699
    • N
      feat: Add InputWiseActor for ReduceGlobalAdd and ReduceGather (#1012) · 58ad693d
      Niu Chong 提交于
      * feat: avoid net contention by adding ctrl edge in ReduceStruct
      
      * refine(task_graph.h/cpp): refine AddCtrlEdgeInReduceStruct()
      
      * fix(graph/task_graph.cpp): fix the bug of machine order
      
      * fix(graph/task_graph.cpp): do not add ctrl edge with reduce scatter
      
      * feat: add ReduceGlobalAddCompActor
      
      * fix: fix the bug of reduce_global_actor/kernel
      
      * chore: remove used vim .swp file
      
      * fix(graph/task_graph.cpp): fix the bug of sorting copycomment when build reduce ctrl edge
      
      * fix(graph/task_graph.h/cpp): add CtrlEdge for ReduceGather
      
      * feat: revert add ctrl edge in reduce struct from this PR
      
      * refactor: rename ReduceGlobalAddCompActor to InputWiseCompActor for scalability
      
      * fix(kernel/reduce_global_add_kernel.cpp): use Memcpy other than Memset for first blob to be added
      
      * refactor(actor/input_wise_compute_actor.*): use HashMap and counter instead of HashSet for processed regst_desc
      
      * refactor: let ReduceGlobalAddCompActor inherit InputWiseCompActor
      
      * feature: add ReduceGatherCompActor that inherits InputWiseCompActor
      
      * fix(reduce_gather_kernel.cpp): add missing break
      
      * refactor: replace regst_desc_id2bn_in_op_ with regst_desc_id2in_bn_id_ in InputWiseCompActor
      
      * fix(reduce_global_add_kernel): remove useless class member parallel_id_
      
      * refactor: make ReduceLocalAdd kernel support inputwise, rename ReduceGlobalAddActor to ReduceAddActor for scalibility
      
      
      Former-commit-id: 0ffc781c
      58ad693d
    • J
      let mdupdt use compute stream instead of mix stream (#1021) · f7612714
      Jinhui Yuan 提交于
      
      
      Former-commit-id: 9a010dae
      f7612714
  9. 15 7月, 2018 4 次提交
  10. 13 7月, 2018 3 次提交
    • N
      feat: refactor reduce struct to control network order, avoiding network contention (#994) · 51eadcee
      Niu Chong 提交于
      * feat: avoid net contention by adding ctrl edge in ReduceStruct
      
      * refine(task_graph.h/cpp): refine AddCtrlEdgeInReduceStruct()
      
      * fix(graph/task_graph.cpp): fix the bug of machine order
      
      * fix(graph/task_graph.cpp): do not add ctrl edge with reduce scatter
      
      * feat: add ReduceGlobalAddCompActor
      
      * fix: fix the bug of reduce_global_actor/kernel
      
      * chore: remove used vim .swp file
      
      * fix(graph/task_graph.cpp): fix the bug of sorting copycomment when build reduce ctrl edge
      
      * fix(graph/task_graph.h/cpp): add CtrlEdge for ReduceGather
      
      * revert: remove the ReduceGlobalAddCompActor from this PR
      
      * feat: add use_ordered_allreduce_in_mdupdt in OtherConf
      
      
      Former-commit-id: 6d9663c7
      51eadcee
    • J
      refactor comm_network (#975) · 0f08ddae
      Jinhui Yuan 提交于
      * refactor comm_network
      
      * refine
      
      * AddWorkToStream
      
      * refine
      
      * add CHECK
      
      * refine
      
      
      Former-commit-id: ea965893
      0f08ddae
    • S
      alexnet, resnet and inception v3 generator (#993) · e5a46640
      ShawnXuan 提交于
      * alexnet, resnet and inception v3 generator
      
      * move net gen apps to tools
      
      
      Former-commit-id: a68583b3
      e5a46640
  11. 12 7月, 2018 2 次提交
  12. 11 7月, 2018 5 次提交
  13. 10 7月, 2018 2 次提交
  14. 08 7月, 2018 1 次提交