1. 12 Jan, 2018 4 commits
  2. 10 Jan, 2018 5 commits
    • Y
      Add IBVerbsCommNet to System & Fix Bug of Initializing Model on GPU (#502) · a57877c4
      Committed by Yi Zhu
      * branch init
      
      * init rdma_comm_network interface
      
      * Rdma rebuild (#369)
      
      * rebuild rdma main class
      
      * remove linux dir
      
      * fix bugs
      
      * Rdma rebuild (#370)
      
      * rebuild rdma main class
      
      * remove linux dir
      
      * add connection rpc
      
      * Post API params change to const ptr
      
      * fix compile bugs
      
      * implement read recv send function
      
      * updt connection interface & implement poll queue
      
      * fix bugs & updt interface
      
      * implement RdmaCommNet functions
      
      * Rdma rebuild (#371)
      
      * rebuild rdma main class
      
      * remove linux dir
      
      * add connection rpc
      
      * Post API params change to const ptr
      
      * Add MemDesc rpc and registerdone
      
      * Add clear rpc
      
      * Useless file
      
      * Rdma rebuild (#372)
      
      * rebuild rdma main class
      
      * remove linux dir
      
      * add connection rpc
      
      * Post API params change to const ptr
      
      * Add MemDesc rpc and registerdone
      
      * Add clear rpc
      
      * remove connection_pool and change API for MemDesc
      
      * remove connection pool file
      
      * remove .file
      
      * rebuild connection to endpoint manager
      
      * remove GetConnection
      
      * fix constructor and destructor for rdmacommnet
      
      * fix bugs
      
      * fix destructor
      
      * Rdma rebuild (#373)
      
      * rebuild rdma main class
      
      * remove linux dir
      
      * add connection rpc
      
      * Post API params change to const ptr
      
      * Add MemDesc rpc and registerdone
      
      * Add clear rpc
      
      * remove connection_pool and change API for MemDesc
      
      * remove connection pool file
      
      * remove .file
      
      * rebuild connection to endpoint manager
      
      * remove GetConnection
      
      * Add RDMA to JOBCONF
      
      * Rdma rebuild (#374)
      
      * rebuild rdma main class
      
      * remove linux dir
      
      * add connection rpc
      
      * Post API params change to const ptr
      
      * Add MemDesc rpc and registerdone
      
      * Add clear rpc
      
      * remove connection_pool and change API for MemDesc
      
      * remove connection pool file
      
      * remove .file
      
      * rebuild connection to endpoint manager
      
      * remove GetConnection
      
      * Add RDMA to JOBCONF
      
      * fix RdmaInit bug
      
      * add log
      
      * Rdma rebuild (#375)
      
      * rebuild rdma main class
      
      * remove linux dir
      
      * add connection rpc
      
      * Post API params change to const ptr
      
      * Add MemDesc rpc and registerdone
      
      * Add clear rpc
      
      * remove connection_pool and change API for MemDesc
      
      * remove connection pool file
      
      * remove .file
      
      * rebuild connection to endpoint manager
      
      * remove GetConnection
      
      * Add RDMA to JOBCONF
      
      * fix RdmaInit bug
      
      * add log
      
      * Add Log
      
      * fix bug
      
      * fix bugs
      
      * Rdma rebuild (#376)
      
      * rebuild rdma main class
      
      * remove linux dir
      
      * add connection rpc
      
      * Post API params change to const ptr
      
      * Add MemDesc rpc and registerdone
      
      * Add clear rpc
      
      * remove connection_pool and change API for MemDesc
      
      * remove connection pool file
      
      * remove .file
      
      * rebuild connection to endpoint manager
      
      * remove GetConnection
      
      * Add RDMA to JOBCONF
      
      * fix RdmaInit bug
      
      * add log
      
      * Add Log
      
      * commnet add establish network
      
      * add qpn
      
      * fix bug
      
      * create conn_info and qp for each conn
      
      * Add commit and destruct for endpointmanager
      
      * Add proto use_rdma
      
      * fix bug
      
      * Delete redundant file
      
      * Dev rdma yx (#377)
      
      * rebuild rdma main class
      
      * remove linux dir
      
      * add connection rpc
      
      * Post API params change to const ptr
      
      * Add MemDesc rpc and registerdone
      
      * Add clear rpc
      
      * remove connection_pool and change API for MemDesc
      
      * remove connection pool file
      
      * remove .file
      
      * rebuild connection to endpoint manager
      
      * remove GetConnection
      
      * Add RDMA to JOBCONF
      
      * fix RdmaInit bug
      
      * add log
      
      * Add Log
      
      * commnet add establish network
      
      * add qpn
      
      * fix bug
      
      * create conn_info and qp for each conn
      
      * Add commit and destruct for endpointmanager
      
      * Add proto use_rdma
      
      * fix rpc bug
      
      * fix some problem
      
      * int64 and uint64
      
      * Dev rdma yx (#378)
      
      * rebuild rdma main class
      
      * remove linux dir
      
      * add connection rpc
      
      * Post API params change to const ptr
      
      * Add MemDesc rpc and registerdone
      
      * Add clear rpc
      
      * remove connection_pool and change API for MemDesc
      
      * remove connection pool file
      
      * remove .file
      
      * rebuild connection to endpoint manager
      
      * remove GetConnection
      
      * Add RDMA to JOBCONF
      
      * fix RdmaInit bug
      
      * add log
      
      * Add Log
      
      * commnet add establish network
      
      * add qpn
      
      * fix bug
      
      * create conn_info and qp for each conn
      
      * Add commit and destruct for endpointmanager
      
      * Add proto use_rdma
      
      * fix rpc bug
      
      * fix some problem
      
      * int64 and uint64
      
      * Fix bug
      
      * remove useless head file
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * updt interface
      
      * change conninfo rpc (#379)
      
      * change conninfo rpc
      
      * swap the place of var declare
      
      * fix zlib dependency
      
      * remove useless function (#381)
      
      * updt recv buffer size
      
      * updt proto & sh for multi-machines
      
      * remove useless LOG
      
      * updt cmake
      
      * remove soft link file
      
      * rearrange code
      
      * rdma proto & updt to ctrl
      
      * merge & fix bugs
      
      * updt to KV
      
      * add debug info
      
      * fix bug
      
      * remove redundant code
      
      * remove redundant code
      
      * recover blank line
      
      * updt interfaces & fix bug
      
      * updt proto
      
      * updt proto
      
      * updt proto
      
      * merge master
      
      * add macro guardian
      
      * fix include
      
      * fix rdma_memory
      
      * add RdmaMem
      
      * fix code
      
      * add endpoint_manager & rdma_comm_network
      
      * remove EstablishNetwork
      
      * updt resource port
      
      * refine SendMsg
      
      * refine connection interface
      
      * refine code
      
      * refine code
      
      * remove redundant changes
      
      * fix macro guardian bug
      
      * refine endpoint_manager && ibverbs_comm_net
      
      * merge master
      
      * refine code
      
      * replace ptr with normal var
      
      * fix bugs
      
      * remove useless log
      
      * rollback changes to job.prototxt
      
      * fix proto
      
      * refine code
      
      * updt to LOG(FATAL)
    • N
      fix(register/blob.cpp): fix typo (#503) · 3facafa3
      Committed by Niu Chong
    • Y
      IBVerbsCommNet and EndpointManager (#479) · 9072ef48
      Committed by Yi Zhu
      * add endpoint_manager & rdma_comm_network
      
      * refine code
      
      * refine code
      
      * fix macro guardian bug
      
      * refine endpoint_manager && ibverbs_comm_net
      
      * merge master
      
      * refine code
      
      * replace ptr with normal var
    • Y
      Refine interface in CommNet (#499) · 39cc7788
      Committed by Yi Zhu
      * fix bug
      
      * updt interface
      
      * refine code
    • N
      feat: add col_num(that is sequence length) in Blob and BlobDesc (#494) · 0719c5a8
      Committed by Niu Chong
      * feat: add seq_len in Blob and BlobDesc
      
      * fix: rename PieceStatus to BlobHeader and insert it to mem_ptr of Blob
      
      * fix: fix the comments
      
      * fix: fix typos and remove vim .swp file
      
      * fix: fix the comment
      
      * fix: update max_col_num() to col_num() in Register
      
      * fix: fix due to the comment
  3. 09 Jan, 2018 4 commits
  4. 08 Jan, 2018 1 commit
  5. 05 Jan, 2018 4 commits
    • C
      fix bug of print kernel to print useless content (#495) · 0833ce24
      Committed by chengtbf
    • W
      seq size -> seq len · 65dd29f7
      Committed by willzhang4a58
    • D
      l1 l2 regularization (#493) · afe78490
      Committed by duduscript
      * add l1 l2 regularization
      
      * fix some problem
      
      * Fix bug
      
      * fix op API
      
      * Fix a bug in op
      
      * add l1l2
      
      * remove regularization_diff_blob
      
      * remove sign in kernel_util
      
      * fix naming problem and add l1l2 function
      
      * simplify code
      
      * remove else
      
      * update modelupdate API
      
      * Fix bug
    • Y
      BackwardTaskNode (#492) · b475fc8e
      Committed by Yi Zhu
      * refine produce & consume regst
      
      * add virtual func
      
      * fix bug & add FixRegisterNumRange
      
      * add CanBindInDiffWhenRecurrent func
      
      * refine code
      
      * refine virtual func
      
      * change func name
  6. 04 Jan, 2018 6 commits
  7. 03 Jan, 2018 8 commits
  8. 02 Jan, 2018 3 commits
  9. 29 Dec, 2017 3 commits
  10. 28 Dec, 2017 2 commits