1. 22 3月, 2012 3 次提交
  2. 26 10月, 2011 1 次提交
  3. 15 9月, 2011 1 次提交
    • J
      Remove unneeded version.h includes from include/ · e81b1516
      Jesper Juhl 提交于
      It was pointed out by 'make versioncheck' that some includes of
      linux/version.h are not needed in include/.
      This patch removes them.
      
      When I last posted the patch, the ceph bit was ACK'ed by Sage Weil, so
      I've added that below.
      
      The pwc-ioctl change generated quite a bit of discussion about V4L version
      numbers in general, but as far as I can tell, no concensus was reached on
      what the long term solution should be, so in the mean time I think we
      could start by just removing the unneeded include, which is why I'm
      resending the patch with that hunk still included.
      Signed-off-by: NJesper Juhl <jj@chaosbits.net>
      Acked-by: NSage Weil <sage@newdream.net>
      Signed-off-by: NJiri Kosina <jkosina@suse.cz>
      e81b1516
  4. 27 7月, 2011 1 次提交
  5. 05 3月, 2011 2 次提交
    • S
      libceph: fix msgr keepalive flag · e76661d0
      Sage Weil 提交于
      There was some broken keepalive code using a dead variable.  Shift to using
      the proper bit flag.
      Signed-off-by: NSage Weil <sage@newdream.net>
      e76661d0
    • S
      libceph: fix msgr backoff · 60bf8bf8
      Sage Weil 提交于
      With commit f363e45f we replaced a bunch of hacky workqueue mutual
      exclusion logic with the WQ_NON_REENTRANT flag.  One pieces of fallout is
      that the exponential backoff breaks in certain cases:
      
       * con_work attempts to connect.
       * we get an immediate failure, and the socket state change handler queues
         immediate work.
       * con_work calls con_fault, we decide to back off, but can't queue delayed
         work.
      
      In this case, we add a BACKOFF bit to make con_work reschedule delayed work
      next time it runs (which should be immediately).
      Signed-off-by: NSage Weil <sage@newdream.net>
      60bf8bf8
  6. 13 1月, 2011 1 次提交
    • T
      net/ceph: make ceph_msgr_wq non-reentrant · f363e45f
      Tejun Heo 提交于
      ceph messenger code does a rather complex dancing around multithread
      workqueue to make sure the same work item isn't executed concurrently
      on different CPUs.  This restriction can be provided by workqueue with
      WQ_NON_REENTRANT.
      
      Make ceph_msgr_wq non-reentrant workqueue with the default concurrency
      level and remove the QUEUED/BUSY logic.
      
      * This removes backoff handling in con_work() but it couldn't reliably
        block execution of con_work() to begin with - queue_con() can be
        called after the work started but before BUSY is set.  It seems that
        it was an optimization for a rather cold path and can be safely
        removed.
      
      * The number of concurrent work items is bound by the number of
        connections and connetions are independent from each other.  With
        the default concurrency level, different connections will be
        executed independently.
      Signed-off-by: NTejun Heo <tj@kernel.org>
      Cc: Sage Weil <sage@newdream.net>
      Cc: ceph-devel@vger.kernel.org
      Signed-off-by: NSage Weil <sage@newdream.net>
      f363e45f
  7. 10 11月, 2010 1 次提交
    • S
      ceph: explicitly specify page alignment in network messages · c5c6b19d
      Sage Weil 提交于
      The alignment used for reading data into or out of pages used to be taken
      from the data_off field in the message header.  This only worked as long
      as the page alignment matched the object offset, breaking direct io to
      non-page aligned offsets.
      
      Instead, explicitly specify the page alignment next to the page vector
      in the ceph_msg struct, and use that instead of the message header (which
      probably shouldn't be trusted).  The alloc_msg callback is responsible for
      filling in this field properly when it sets up the page vector.
      Signed-off-by: NSage Weil <sage@newdream.net>
      c5c6b19d
  8. 21 10月, 2010 2 次提交
    • Y
      ceph: factor out libceph from Ceph file system · 3d14c5d2
      Yehuda Sadeh 提交于
      This factors out protocol and low-level storage parts of ceph into a
      separate libceph module living in net/ceph and include/linux/ceph.  This
      is mostly a matter of moving files around.  However, a few key pieces
      of the interface change as well:
      
       - ceph_client becomes ceph_fs_client and ceph_client, where the latter
         captures the mon and osd clients, and the fs_client gets the mds client
         and file system specific pieces.
       - Mount option parsing and debugfs setup is correspondingly broken into
         two pieces.
       - The mon client gets a generic handler callback for otherwise unknown
         messages (mds map, in this case).
       - The basic supported/required feature bits can be expanded (and are by
         ceph_fs_client).
      
      No functional change, aside from some subtle error handling cases that got
      cleaned up in the refactoring process.
      Signed-off-by: NSage Weil <sage@newdream.net>
      3d14c5d2
    • Y
      ceph: messenger and osdc changes for rbd · 68b4476b
      Yehuda Sadeh 提交于
      Allow the messenger to send/receive data in a bio.  This is added
      so that we wouldn't need to copy the data into pages or some other buffer
      when doing IO for an rbd block device.
      
      We can now have trailing variable sized data for osd
      ops.  Also osd ops encoding is more modular.
      Signed-off-by: NYehuda Sadeh <yehuda@hq.newdream.net>
      Signed-off-by: NSage Weil <sage@newdream.net>
      68b4476b
  9. 30 5月, 2010 1 次提交
  10. 18 5月, 2010 5 次提交
  11. 12 5月, 2010 1 次提交
  12. 23 3月, 2010 1 次提交
    • S
      ceph: avoid reopening osd connections when address hasn't changed · 87b315a5
      Sage Weil 提交于
      We get a fault callback on _every_ tcp connection fault.  Normally, we
      want to reopen the connection when that happens.  If the address we have
      is bad, however, and connection attempts always result in a connection
      refused or similar error, explicitly closing and reopening the msgr
      connection just prevents the messenger's backoff logic from kicking in.
      The result can be a console full of
      
      [ 3974.417106] ceph: osd11 10.3.14.138:6800 connection failed
      [ 3974.423295] ceph: osd11 10.3.14.138:6800 connection failed
      [ 3974.429709] ceph: osd11 10.3.14.138:6800 connection failed
      
      Instead, if we get a fault, and have outstanding requests, but the osd
      address hasn't changed and the connection never successfully connected in
      the first place, do nothing to the osd connection.  The messenger layer
      will back off and retry periodically, because we never connected and thus
      the lossy bit is not set.
      
      Instead, touch each request's r_stamp so that handle_timeout can tell the
      request is still alive and kicking.
      Signed-off-by: NSage Weil <sage@newdream.net>
      87b315a5
  13. 02 3月, 2010 1 次提交
  14. 11 2月, 2010 1 次提交
  15. 26 1月, 2010 3 次提交
  16. 24 12月, 2009 3 次提交
    • S
      ceph: support ceph_pagelist for message payload · 58bb3b37
      Sage Weil 提交于
      The ceph_pagelist is a simple list of whole pages, strung together via
      their lru list_head.  It facilitates encoding to a "buffer" of unknown
      size.  Allow its use in place of the ceph_msg page vector.
      
      This will be used to fix the huge buffer preallocation woes of MDS
      reconnection.
      Signed-off-by: NSage Weil <sage@newdream.net>
      58bb3b37
    • S
      ceph: control access to page vector for incoming data · 350b1c32
      Sage Weil 提交于
      When we issue an OSD read, we specify a vector of pages that the data is to
      be read into.  The request may be sent multiple times, to multiple OSDs, if
      the osdmap changes, which means we can get more than one reply.
      
      Only read data into the page vector if the reply is coming from the
      OSD we last sent the request to.  Keep track of which connection is using
      the vector by taking a reference.  If another connection was already
      using the vector before and a new reply comes in on the right connection,
      revoke the pages from the other connection.
      Signed-off-by: NSage Weil <sage@newdream.net>
      350b1c32
    • S
      ceph: use connection mutex to protect read and write stages · ec302645
      Sage Weil 提交于
      Use a single mutex (previously out_mutex) to protect both read and write
      activity from concurrent ceph_con_* calls.  Drop the mutex when doing
      callbacks to avoid nested locking (the callback may need to call something
      like ceph_con_close).
      Signed-off-by: NSage Weil <sage@newdream.net>
      ec302645
  17. 22 12月, 2009 2 次提交
  18. 08 12月, 2009 1 次提交
  19. 19 11月, 2009 2 次提交
    • S
      ceph: negotiate authentication protocol; implement AUTH_NONE protocol · 4e7a5dcd
      Sage Weil 提交于
      When we open a monitor session, we send an initial AUTH message listing
      the auth protocols we support, our entity name, and (possibly) a previously
      assigned global_id.  The monitor chooses a protocol and responds with an
      initial message.
      
      Initially implement AUTH_NONE, a dummy protocol that provides no security,
      but works within the new framework.  It generates 'authorizers' that are
      used when connecting to (mds, osd) services that simply state our entity
      name and global_id.
      
      This is a wire protocol change.
      Signed-off-by: NSage Weil <sage@newdream.net>
      4e7a5dcd
    • S
      ceph: remove bad calls to ceph_con_shutdown · 42ce56e5
      Sage Weil 提交于
      We want to ceph_con_close when we're done with the connection, before
      the ref count reaches 0.  Once it does, do not call ceph_con_shutdown,
      as that takes the con mutex and may sleep, and besides that is
      unnecessary.
      Signed-off-by: NSage Weil <sage@newdream.net>
      42ce56e5
  20. 11 11月, 2009 1 次提交
  21. 04 11月, 2009 1 次提交
  22. 07 10月, 2009 1 次提交
    • S
      ceph: messenger library · 31b8006e
      Sage Weil 提交于
      A generic message passing library is used to communicate with all
      other components in the Ceph file system.  The messenger library
      provides ordered, reliable delivery of messages between two nodes in
      the system.
      
      This implementation is based on TCP.
      Signed-off-by: NSage Weil <sage@newdream.net>
      31b8006e