1. 02 8月, 2014 4 次提交
  2. 26 5月, 2014 2 次提交
    • A
      IB/iser: Fix a possible race in iser connection states transition · 66d4e62d
      Ariel Nahum 提交于
      In some circumstances (multiple targets), RDMA_CM ESTABLISHED event
      and ep_disconnect may race. In this case, the iser connection state
      may transition to UP (after ep_disconnect transitioned it to
      TERMINATING), while the connection is being torn down.
      
      Upon RDMA_CM event ESTABLISHED we allow iser connection state to
      transition to UP only from PENDING. We also make sure to protect this
      state change (done under the connection lock).
      Signed-off-by: NAriel Nahum <arieln@mellanox.com>
      Signed-off-by: NSagi Grimberg <sagig@mellanox.com>
      Reviewed-by: NOr Gerlitz <ogerlitz@mellanox.com>
      Signed-off-by: NRoland Dreier <roland@purestorage.com>
      66d4e62d
    • A
      IB/iser: Simplify connection management · b73c3ada
      Ariel Nahum 提交于
      iSER relies on refcounting to manage iser connections establishment
      and teardown.
      
      Following commit 39ff05db ("IB/iser: Enhance disconnection logic
      for multi-pathing"), iser connection maintain 3 references:
      
       - iscsi_endpoint (at creation stage)
       - cma_id (at connection request stage)
       - iscsi_conn (at bind stage)
      
      We can avoid taking explicit refcounts by correctly serializing iser
      teardown flows (graceful and non-graceful).
      
      Our approach is to trigger a scheduled work to handle ordered teardown
      by gracefully waiting for 2 cleanup stages to complete:
      
       1. Cleanup of live pending tasks indicated by iscsi_conn_stop completion
       2. Flush errors processing
      
      Each completed stage will notify a waiting worker thread when it is
      done to allow teardwon continuation.
      
      Since iSCSI connection establishment may trigger endpoint disconnect
      without a successful endpoint connect, we rely on the iscsi <-> iser
      binding (.conn_bind) to learn about the teardown policy we should take
      wrt cleanup stages.
      
      Since all cleanup worker threads are scheduled (release_wq) in
      .ep_disconnect it is safe to assume that when module_exit is called,
      all cleanup workers are already scheduled. Thus proper module unload
      shall flush all scheduled works before allowing safe exit, to
      guarantee no resources got left behind.
      Signed-off-by: NAriel Nahum <arieln@mellanox.com>
      Signed-off-by: NSagi Grimberg <sagig@mellanox.com>
      Reviewed-by: NRoi Dayan <roid@mellanox.com>
      Reviewed-by: NOr Gerlitz <ogerlitz@mellanox.com>
      Signed-off-by: NRoland Dreier <roland@purestorage.com>
      b73c3ada
  3. 02 4月, 2014 5 次提交
  4. 18 3月, 2014 8 次提交
  5. 15 2月, 2014 1 次提交
  6. 03 9月, 2013 1 次提交
  7. 10 8月, 2013 5 次提交
  8. 05 6月, 2013 2 次提交
  9. 02 5月, 2013 2 次提交
  10. 22 2月, 2013 1 次提交
  11. 04 10月, 2012 1 次提交
  12. 19 5月, 2012 1 次提交
  13. 05 3月, 2012 1 次提交
  14. 05 11月, 2011 1 次提交
  15. 26 5月, 2011 1 次提交
    • S
      RDMA/cma: Pass QP type into rdma_create_id() · b26f9b99
      Sean Hefty 提交于
      The RDMA CM currently infers the QP type from the port space selected
      by the user.  In the future (eg with RDMA_PS_IB or XRC), there may not
      be a 1-1 correspondence between port space and QP type.  For netlink
      export of RDMA CM state, we want to export the QP type to userspace,
      so it is cleaner to explicitly associate a QP type to an ID.
      
      Modify rdma_create_id() to allow the user to specify the QP type, and
      use it to make our selections of datagram versus connected mode.
      Signed-off-by: NSean Hefty <sean.hefty@intel.com>
      Signed-off-by: NRoland Dreier <roland@purestorage.com>
      b26f9b99
  16. 13 5月, 2010 4 次提交
    • D
      IB/iser: Fix error flow in iser_create_ib_conn_res() · 9fda1ac5
      Dan Carpenter 提交于
      We shouldn't free things here because we free them later.
      The call tree looks like this:
      	iser_connect() ==> initiating the connection establishment
      and later
      	iser_cma_handler() => iser_route_handler() => iser_create_ib_conn_res()
      if we fail here, eventually iser_conn_release() is called, resulting
      in a double free.
      Signed-off-by: NDan Carpenter <error27@gmail.com>
      Signed-off-by: NOr Gerlitz <ogerlitz@voltaire.com>
      Signed-off-by: NRoland Dreier <rolandd@cisco.com>
      9fda1ac5
    • O
      IB/iser: Enhance disconnection logic for multi-pathing · 39ff05db
      Or Gerlitz 提交于
      The iser connection teardown flow isn't over until the underlying
      Connection Manager (e.g the IB CM) delivers a disconnected or timeout
      event through the RDMA-CM.  When the remote (target) side isn't
      reachable, e.g when some HW e.g port/hca/switch isn't functioning or
      taken down administratively, the CM timeout flow is used and the event
      may be generated only after relatively long time -- on the order of
      tens of seconds.
      
      The current iser code exposes this possibly long delay to higher
      layers, specifically to the iscsid daemon and iscsi kernel stack. As a
      result, the iscsi stack doesn't respond well: this low-level CM delay
      is added to the fail-over time under HA schemes such as the one
      provided by DM multipath through the multipathd(8) service.
      
      This patch enhances the reference counting scheme on iser's IB
      connections so that the disconnect flow initiated by iscsid from user
      space (ep_disconnect) doesn't wait for the CM to deliver the
      disconnect/timeout event.  (The connection teardown isn't done from
      iser's view point until the event is delivered)
      
      The iser ib (rdma) connection object is destroyed when its reference
      count reaches zero.  When this happens on the RDMA-CM callback
      context, extra care is taken so that the RDMA-CM does the actual
      destroying of the associated ID, since doing it in the callback is
      prohibited.
      
      The reference count of iser ib connection normally reaches three,
      where the <ref, deref> relations are
      
       1. conn <init, terminate>
       2. conn <bind, stop/destroy>
       3. cma id <create, disconnect/error/timeout callbacks>
      
      With this patch, multipath fail-over time is about 30 seconds, while
      without this patch, multipath fail-over time is about 130 seconds.
      Signed-off-by: NOr Gerlitz <ogerlitz@voltaire.com>
      Signed-off-by: NRoland Dreier <rolandd@cisco.com>
      39ff05db
    • O
      IB/iser: Remove buggy back-pointer setting · d265b980
      Or Gerlitz 提交于
      The iscsi connection object life cycle includes binding and unbinding
      (conn_stop) to/from the iscsi transport connection object.  Since
      iscsi connection objects are recycled, at the time the transport
      connection (e.g iser's IB connection) is released, it is not valid to
      touch the iscsi connection tied to the transport back-pointer since it
      may already point to a different transport connection.
      Signed-off-by: NOr Gerlitz <ogerlitz@voltaire.com>
      Signed-off-by: NRoland Dreier <rolandd@cisco.com>
      d265b980
    • O
      IB/iser: Add asynchronous event handler · 2110f9bf
      Or Gerlitz 提交于
      Add handler to handle events such as port up and down.  This is useful
      when testing high-availability schemes such as multi-pathing.
      Signed-off-by: NOr Gerlitz <ogerlitz@voltaire.com>
      Signed-off-by: NRoland Dreier <rolandd@cisco.com>
      2110f9bf