1. 18 Apr, 2008 3 commits
    • ocfs2/net: Add debug interface to o2net · 2309e9e0
      Authored by Sunil Mushran
      This patch exposes o2net information via debugfs. The information includes
      the list of sockets (sock_containers) as well as the list of outstanding
      messages (send_tracking). Useful for o2dlm debugging.
      
      (This patch is derived from an earlier one written by Zach Brown that
      exposed the same information via /proc.)
      
      [Mark: checkpatch fixes]
      Signed-off-by: Sunil Mushran <sunil.mushran@oracle.com>
      Reviewed-by: Joel Becker <joel.becker@oracle.com>
      Signed-off-by: Mark Fasheh <mfasheh@suse.com>
    • ocfs2/cluster: Get rid of arguments to the timeout routines · 409753bf
      Authored by Jeff Mahoney
      We keep seeing bug reports related to NULL pointer derefs in
      o2net_set_nn_state(). When I originally wrote up the configurable timeout
      patch, I had tried to plan for multiple clusters. This was silly.
      
      The timeout routines all use o2nm_single_cluster so there's no point in
      passing an argument at all. This patch removes the arguments and kills those
      bugs dead.
      Signed-off-by: Jeff Mahoney <jeffm@suse.com>
      Signed-off-by: Mark Fasheh <mfasheh@suse.com>
    • ocfs2: Reconnect after idle time out. · 5cc3bf27
      Authored by Tao Ma
      Currently, o2net connects to a node on hb_up and disconnects on
      hb_down and net timeout.
      
      Disconnecting on net timeout is fine, but o2net should then attempt
      to reconnect. This is because nodes sometimes get overloaded enough
      that the network connection breaks but the disk hb does not.
      And if we get into that situation, we either fence (unnecessarily)
      or wait for its disk hb to die (and sometimes hang in the process).
      
      So in this updated scheme, when the network disconnects, we keep
      attempting to reconnect till we succeed or we get a disk hb down
      event.
      
      If the other node is really dead, then we will eventually get a
      node down event. If not, we should be able to connect again and
      continue.
      Signed-off-by: Tao Ma <tao.ma@oracle.com>
      Signed-off-by: Mark Fasheh <mfasheh@suse.com>
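The retry scheme described in this commit can be sketched in user space as follows. This is only an illustration, not the kernel code: `reconnect_until_up_or_dead`, `try_connect`, `hb_down`, and `delay_s` are hypothetical names, and a `threading.Event` stands in for the disk heartbeat down event.

```python
import threading


def reconnect_until_up_or_dead(try_connect, hb_down, delay_s=0.01):
    """Retry try_connect() until it succeeds or hb_down is set.

    Returns True if a connection attempt succeeded, False if the
    (simulated) disk heartbeat down event arrived first.
    All names here are hypothetical stand-ins for illustration.
    """
    while not hb_down.is_set():
        if try_connect():
            return True
        # Sleep between attempts; wait() returns early if the
        # heartbeat down event fires during the delay.
        hb_down.wait(delay_s)
    return False
```

A caller would pass a function that attempts one connection and an event that the heartbeat code sets when the peer is declared dead. The loop has no attempt limit because, per the commit message, the only exits are a successful connect or a disk hb down event.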
  2. 11 Mar, 2008 1 commit
  3. 06 Dec, 2007 1 commit
  4. 13 Nov, 2007 1 commit
  5. 10 Aug, 2007 1 commit
  6. 11 Jul, 2007 2 commits
  7. 03 May, 2007 1 commit
  8. 15 Mar, 2007 1 commit
  9. 08 Feb, 2007 3 commits
    • ocfs2: introduce sc->sc_send_lock to protect outbound outbound messages · 925037bc
      Authored by Zhen Wei
      When there is a lot of multithreaded I/O usage, two threads can collide
      while sending out a message to the other nodes. This is due to the lack of
      locking between threads while sending out the messages.
      
      When a connected TCP send(), sendto(), or sendmsg() arrives in the Linux
      kernel, it eventually comes through tcp_sendmsg(). tcp_sendmsg() protects
      itself by acquiring a lock at invocation by calling lock_sock().
      tcp_sendmsg() then loops over the buffers in the iovec, allocating
      associated sk_buff's and cache pages for use in the actual send. As it does
      so, it pushes the data out to tcp for actual transmission. However, if one
      of those allocations fails (because a large number of large sends is being
      processed, for example), it must wait for memory to become available. It
      does so by jumping to wait_for_sndbuf or wait_for_memory, both of which
      eventually cause a call to sk_stream_wait_memory(). sk_stream_wait_memory()
      contains a code path that calls sk_wait_event(). Finally, sk_wait_event()
      contains the call to release_sock(), which drops the socket lock while the
      sender waits, so another thread can enter tcp_sendmsg() and interleave its
      data with the stalled message.
      
      The following patch adds a lock to the socket container in order to
      properly serialize outbound requests.
      
      From: Zhen Wei <zwei@novell.com>
      Acked-by: Jeff Mahoney <jeffm@suse.com>
      Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>
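The idea of serializing outbound messages with a lock on the socket container can be illustrated with a small user-space sketch. It does not reproduce the kernel patch: `SockContainer`, `send_message`, and the `wire` list are hypothetical, and `send_lock` merely plays the role that `sc->sc_send_lock` plays in the commit.

```python
import threading


class SockContainer:
    """Hypothetical stand-in for an o2net socket container (sc)."""

    def __init__(self):
        self.wire = []                     # chunks in the order they reach the "socket"
        self.send_lock = threading.Lock()  # plays the role of sc->sc_send_lock

    def _send_chunk(self, chunk):
        # Stand-in for one pass through the transport. In the real stack
        # the sender may sleep here and let another sender run.
        self.wire.append(chunk)

    def send_message(self, msg_id, parts):
        # Hold the lock for the whole message so that chunks from two
        # threads cannot interleave on the wire.
        with self.send_lock:
            for i in range(parts):
                self._send_chunk((msg_id, i))


def run_senders(sc, senders=8, parts=50):
    """Send one multi-chunk message from each of several threads."""
    threads = [
        threading.Thread(target=sc.send_message, args=(m, parts))
        for m in range(senders)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()


def is_contiguous(wire, parts):
    """Check that every message occupies one unbroken, ordered run of chunks."""
    for start in range(0, len(wire), parts):
        run = wire[start:start + parts]
        msg_id = run[0][0]
        if run != [(msg_id, i) for i in range(parts)]:
            return False
    return True
```

Because the lock is held per container rather than globally, senders targeting different peers would not block one another in this sketch.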
    • ocfs2: Binds listener to the configured ip address · ab81afd3
      Authored by Sunil Mushran
      This patch binds the o2net listener to the configured ip address
      instead of INADDR_ANY for security. Fixes oss.oracle.com bugzilla#814.
      Signed-off-by: Sunil Mushran <sunil.mushran@oracle.com>
      Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>
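The difference between binding to a configured address and binding to INADDR_ANY can be shown with ordinary user-space sockets. This is a rough analogy to the kernel change, not the change itself; `open_listener` and its parameters are hypothetical.

```python
import socket


def open_listener(configured_ip, port=0):
    """Listen only on configured_ip instead of on every local address.

    Hypothetical helper for illustration. Binding to "0.0.0.0"
    (INADDR_ANY) would accept connections on all interfaces.
    """
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
    sock.bind((configured_ip, port))  # port 0 lets the OS pick a free port
    sock.listen(8)
    return sock
```

With a listener opened this way, connection attempts arriving on any other local address are refused by the stack before they reach the application.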
    • ocfs2: Added post handler callable function in o2net message handler · d74c9803
      Authored by Kurt Hackel
      Currently o2net allows one handler function per message type. This
      patch adds the ability to have another function called after
      the handler has returned the message to the other node.
      
      Handlers are now given the option of returning a context (in the form of a
      void **) which will be passed back into the post message handler function.
      Signed-off-by: Kurt Hackel <kurt.hackel@oracle.com>
      Signed-off-by: Sunil Mushran <sunil.mushran@oracle.com>
      Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>
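The handler plus post handler flow can be sketched as below. The names (`MessageRouter`, `register`, `dispatch`, `send_response`) are hypothetical, and the sketch assumes a handler returns a `(status, context)` pair in place of filling in a `void **` as the C handlers do.

```python
class MessageRouter:
    """Hypothetical dispatcher: one handler and one optional post handler per type."""

    def __init__(self):
        self.handlers = {}  # msg_type -> (handler, post_handler or None)

    def register(self, msg_type, handler, post_handler=None):
        if msg_type in self.handlers:
            raise ValueError("handler already registered for type %r" % (msg_type,))
        self.handlers[msg_type] = (handler, post_handler)

    def dispatch(self, msg_type, payload, send_response):
        handler, post_handler = self.handlers[msg_type]
        # The handler returns a status for the peer plus an opaque context.
        status, context = handler(payload)
        # Reply to the other node first ...
        send_response(status)
        # ... then run the post handler with the context the handler produced.
        if post_handler is not None:
            post_handler(status, payload, context)
        return status
```

Splitting the work this way lets a handler defer cleanup or follow-up work until after the reply has gone out, which keeps the remote node from waiting on it.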
  10. 14 Dec, 2006 1 commit
  11. 12 Dec, 2006 1 commit
    • [patch 3/3] OCFS2 Configurable timeouts - Protocol changes · 828ae6af
      Authored by Andrew Beekhof
      Modify the OCFS2 handshake to ensure essential timeouts are configured
      identically on all nodes.
      
      Only allow changes when there are no connected peers.
      
      Improves the logic in o2net_advance_rx(), which broke now that
      sizeof(struct o2net_handshake) is greater than sizeof(struct o2net_msg).
      
      Included is the field for userspace-heartbeat timeout to avoid the need for
      further protocol changes.
      
      Uses a global spinlock to ensure the decisions to update configfs entries
      are made on the correct value.  The region covered by the spinlock when
      incrementing the counter is much larger as this is the more critical case.
      
      Small cleanup contributed by Adrian Bunk <bunk@stusta.de>
      Signed-off-by: Andrew Beekhof <abeekhof@suse.de>
      Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>
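A handshake that carries timeout values so that peers can refuse mismatched configurations might look like the sketch below. The field names, their order, and the byte layout are assumptions made for illustration and are not taken from `struct o2net_handshake`.

```python
import struct
from collections import namedtuple

# Illustrative layout only (NOT the real wire format): two big-endian
# 64-bit ids followed by four big-endian 32-bit timeouts in milliseconds.
HANDSHAKE_FMT = ">QQIIII"

Handshake = namedtuple(
    "Handshake",
    "protocol_version connector_id hb_timeout_ms idle_timeout_ms "
    "keepalive_delay_ms reconnect_delay_ms",
)

# Hypothetical set of timeouts that must match on every node.
TIMEOUT_FIELDS = (
    "hb_timeout_ms",
    "idle_timeout_ms",
    "keepalive_delay_ms",
    "reconnect_delay_ms",
)


def pack_handshake(hs):
    return struct.pack(HANDSHAKE_FMT, *hs)


def unpack_handshake(data):
    return Handshake(*struct.unpack(HANDSHAKE_FMT, data))


def timeout_mismatches(local, remote):
    """Return the names of the timeout fields that differ between two nodes."""
    return [f for f in TIMEOUT_FIELDS if getattr(local, f) != getattr(remote, f)]


def accept_peer(local, wire_bytes):
    """Accept the connection only if every timeout matches the local value."""
    return not timeout_mismatches(local, unpack_handshake(wire_bytes))
```

Carrying a field for every timeout up front, including ones not yet enforced, mirrors the commit's stated goal of avoiding further protocol changes later.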
  12. 08 Dec, 2006 1 commit
  13. 22 Nov, 2006 1 commit
  14. 30 Jun, 2006 1 commit
  15. 28 Jun, 2006 1 commit
  16. 02 Mar, 2006 1 commit
  17. 04 Feb, 2006 1 commit
  18. 04 Jan, 2006 1 commit