1. 15 9月, 2021 4 次提交
  2. 25 8月, 2021 5 次提交
  3. 27 7月, 2021 1 次提交
  4. 19 7月, 2021 7 次提交
  5. 10 6月, 2021 1 次提交
  6. 22 5月, 2021 5 次提交
  7. 13 4月, 2021 5 次提交
  8. 25 3月, 2021 1 次提交
  9. 05 3月, 2021 9 次提交
  10. 30 1月, 2021 1 次提交
  11. 08 1月, 2021 1 次提交
    • J
      scsi: lpfc: Implement health checking when aborting I/O · a22d73b6
      James Smart 提交于
      Several errors have occurred where the adapter stops or fails but does not
      raise the register values for the driver to detect failure. Thus driver is
      unaware of the failure. The failure typically results in I/O timeouts, the
      I/O timeout handler failing (after several seconds), and the error handler
      escalating recovery policy and resulting in more errors. Eventually, the
      driver is in a position where things have spiraled and it can't do recovery
      because other recovery ops are still outstanding and it becomes unusable.
      
      Resolve the situation by having the I/O timeout handler (actually a els,
      SCSI I/O, NVMe ls, or NVMe I/O timeout), in addition to aborting the I/O,
      perform a mailbox command and look for a response from the hardware.  If
      the mailbox command fails, it will mark the adapter offline and then invoke
      the adapter reset handler to clean up.
      
      The new I/O timeout test will be limited to a test every 5s. If there are
      multiple I/O timeouts concurrently, only the 1st I/O timeout will generate
      the mailbox command. Further testing will only occur once a timeout occurs
      after a 5s delay from the last mailbox command has expired.
      
      Link: https://lore.kernel.org/r/20210104180240.46824-14-jsmart2021@gmail.comCo-developed-by: NDick Kennedy <dick.kennedy@broadcom.com>
      Signed-off-by: NDick Kennedy <dick.kennedy@broadcom.com>
      Signed-off-by: NJames Smart <jsmart2021@gmail.com>
      Signed-off-by: NMartin K. Petersen <martin.petersen@oracle.com>
      a22d73b6