1. 24 2月, 2016 1 次提交
    • S
      megaraid_sas: MFI IO timeout handling · 6d40afbc
      Sumit Saxena 提交于
      This patch will do proper error handling for DCMD timeout failure cases
      for Fusion adapters:
      
      1. For MFI adapters, in case of DCMD timeout (DCMD which must return
      SUCCESS) driver will call kill adapter.
      
      2. What action needs to be taken in case of DCMD timeout is decided by
      function dcmd_timeout_ocr_possible().  DCMD timeout causing OCR is
      applicable to the following commands:
      
      	MR_DCMD_PD_LIST_QUERY
      	MR_DCMD_LD_GET_LIST
      	MR_DCMD_LD_LIST_QUERY
      	MR_DCMD_CTRL_SET_CRASH_DUMP_PARAMS
      	MR_DCMD_SYSTEM_PD_MAP_GET_INFO
      	MR_DCMD_LD_MAP_GET_INFO
      
      3. If DCMD fails from driver init path there are certain DCMDs which
      must return SUCCESS. If those DCMDs fail, driver bails out. For optional
      DCMDs like pd_info etc., driver continues without executing certain
      functionality.
      Signed-off-by: NSumit Saxena <sumit.saxena@avagotech.com>
      Signed-off-by: NKashyap Desai <kashyap.desai@avagotech.com>
      Reviewed-by: NTomas Henzl <thenzl@redhat.com>
      Signed-off-by: NMartin K. Petersen <martin.petersen@oracle.com>
      6d40afbc
  2. 10 11月, 2015 1 次提交
  3. 29 10月, 2015 7 次提交
  4. 26 8月, 2015 1 次提交
  5. 01 6月, 2015 1 次提交
    • S
      megaraid_sas : Modify return value of megasas_issue_blocked_cmd() and... · 2be2a988
      Sumit.Saxena@avagotech.com 提交于
      megaraid_sas : Modify return value of megasas_issue_blocked_cmd() and wait_and_poll() to consider command status returned by firmware
      
      This patch is rebased on top of recently sent 18 patches(submitted by me) for
      megaraid_sas driver.
      
      Change the return value of wait_and_poll() and megsas_issue_blocked_cmd()
      based on MFI_STAT returned by firmware for that command. Earlier driver always
      send return type based on command completion (but never check MFI_STAT_OK for
      that command), so even if command is failed by firmware still driver will
      return SUCCESS status from these functions wait_and_poll() and
      megsas_issue_blocked_cmd() and if caller of these functions does not check
      command status (MFI_STAT), then it may endup using invalid data returned in
      DMA buffers(one of the example is megasas_ld_list_query DCMD). Best thing to
      avoid this type of issue is do error handling and set proper return type from
      caller function wait_and_poll() and megsas_issue_blocked_cmd().
      
      The change proposed in this patch will fix the regression introduced in patch-
      "90dc9d98 megaraid_sas : MFI MPT linked list corruption fix" inside function
      megasas_ld_list_query().  Prior to this MFI MPT linked list corruption fix
      patch, megasas_ld_list_query() function used to check DCMD status(returned by
      firmware) but with this linked list corruption fix patch, DCMD status will not
      be checked inside function megasas_ld_list_query() and introduced this issue
      of wrong data being used by function megasas_ld_list_query().
      
      Cc: <stable@vger.kernel.org>
      Signed-off-by: NKashyap Desai <kashyap.desai@avagotech.com>
      Signed-off-by: NSumit Saxena <sumit.saxena@avagotech.com>
      Reviewed-by: NTomas Henzl <thenzl@redhat.com>
      Signed-off-by: NJames Bottomley <JBottomley@Odin.com>
      2be2a988
  6. 26 5月, 2015 1 次提交
  7. 25 5月, 2015 9 次提交
  8. 17 4月, 2015 1 次提交
  9. 09 1月, 2015 6 次提交
  10. 24 11月, 2014 4 次提交
  11. 17 9月, 2014 8 次提交
    • S
      megaraid_sas : MFI MPT linked list corruption fix · 90dc9d98
      Sumit.Saxena@avagotech.com 提交于
      Resending the patch. Addressed the review comments from Tomas Henzl.
      Added comment for to-do work.
      
      Problem statement:
      MFI link list in megaraid_sas driver is used from mfi-mpt pass-through commands.
      This list can be corrupted due to many possible race conditions in driver and
      eventually we may see kernel panic.
      
      One example -
      MFI frame is freed from calling process as driver send command via polling method and interrupt
      for that command comes after driver free mfi frame (actually even after some other context reuse
      the mfi frame). When driver receive MPT frame in ISR, driver will be using the index of MFI and
      access that MFI frame and finally in-used MFI frame’s list will be corrupted.
      
      High level description of new solution -
      Free MFI and MPT command from same context.
      Free both the command either from process (from where mfi-mpt pass-through was called) or from
      ISR context. Do not split freeing of MFI and MPT, because it creates the race condition which
      will do MFI/MPT list corruption.
      
      Renamed the cmd_pool_lock which is used in instance as well as fusion with below name.
      mfi_pool_lock and mpt_pool_lock to add more code readability.
      Signed-off-by: NSumit Saxena <sumit.saxena@avagotech.com>
      Signed-off-by: NKashyap Desai <kashyap.desai@avagotech.com>
      Reviewed-by: NTomas Henzl <thenzl@redhat.com>
      Signed-off-by: NChristoph Hellwig <hch@lst.de>
      90dc9d98
    • S
      megaraid_sas : N-drive primary raid level 1 load balancing · d2552ebe
      Sumit.Saxena@avagotech.com 提交于
      Resending the patch. Addressed the review comments from Tomas Henzl.
      
      Current driver does fast path read load balancing between arm and mirror disk
      for two Drive Raid-1 configuration only.
      
      Now, Driver support fast path read load balancing for all (any number of disk) Raid-1 configuration.
      Signed-off-by: NSumit Saxena <sumit.saxena@avagotech.com>
      Signed-off-by: NKashyap Desai <kashyap.desai@avagotech.com>
      Reviewed-by: NTomas Henzl <thenzl@redhat.com>
      Signed-off-by: NChristoph Hellwig <hch@lst.de>
      d2552ebe
    • S
      megaraid_sas : Round down max sge supported by controller to power of two · a5fd2858
      Sumit.Saxena@avagotech.com 提交于
      Resending the patch. Addressed the review comments from Tomas Henzl.
      
      Round down the max sge to power of two.
      
      Earlier max sge limit is 70 SGE, which will allow block layer to send 280K IO frame.
      It is optimal to provide max IO size aligned to the smallest possible stripe size.
      E.a
      Consider that we have configured RAID Volumes which does not allow Fast Path across the stripe.
      Raid volume with stripe size = 256K, will have peformance hit if we get io frame of size 280K.
      Driver will not send IO frame large than stripe size to the Fast Path.
      Also, FW will convert 280K frame into 256K + 24K. This is an additional overhead.
      Signed-off-by: NSumit Saxena <sumit.saxena@avagotech.com>
      Signed-off-by: NKashyap Desai <kashyap.desai@avagotech.com>
      Reviewed-by: NTomas Henzl <thenzl@redhat.com>
      Signed-off-by: NChristoph Hellwig <hch@lst.de>
      a5fd2858
    • S
      megaraid_sas : Extended VD support · 51087a86
      Sumit.Saxena@avagotech.com 提交于
      Resending the patch. Addressed the review comments from Tomas Henzl.
      reserved1 field(part of union) of Raid map struct was not required so it is removed.
      
      Current MegaRAID firmware and hence the driver only supported 64VDs.
      E.g: If the user wants to create more than 64VD on a controller,
          it is not possible on current firmware/driver.
      
      New feature and requirement to support upto 256VD, firmware/driver/apps need changes.
      In addition to that there must be a backward compatibility of the new driver with the
      older firmware and vice versa.
      
      RAID map is the interface between Driver and FW to fetch all required
      fields(attributes) for each Virtual Drives.
      In the earlier design driver was using the FW copy of RAID map where as
      in the new design the Driver will keep the RAID map copy of its own; on which
      it will operate for any raid map access in fast path.
      
      Local driver raid map copy will provide ease of access through out the code
      and provide generic interface for future FW raid map changes.
      
      For the backward compatibility driver will notify FW that it supports 256VD
      to the FW in driver capability field.
      Based on the controller properly returned by the FW, the Driver will know
      whether it supports 256VD or not and will copy the RAID map accordingly.
      
      At any given time, driver will always have old or new Raid map.
      So with this changes, driver can also work in host lock less mode. Please
      see next patch which enable host lock less mode for megaraid_sas driver.
      Signed-off-by: NSumit Saxena <sumit.saxena@avagotech.com>
      Signed-off-by: NKashyap Desai <kashyap.desai@avagotech.com>
      Reviewed-by: NTomas Henzl <thenzl@redhat.com>
      Signed-off-by: NChristoph Hellwig <hch@lst.de>
      51087a86
    • S
      megaraid_sas : Firmware crash dump feature support · fc62b3fc
      Sumit.Saxena@avagotech.com 提交于
      Resending the patch. Addressed the review comments from Tomas Henzl.
      Move buff_offset inside spinlock, corrected loop at crash dump buffer free,
      reset_devices check is added to disable fw crash dump feature in kdump kernel.
      
      This feature will provide similar interface as kernel crash dump feature.
      When megaraid firmware encounter any crash, driver will collect the firmware raw image and
      dump it into pre-configured location.
      
      Driver will allocate two different segment of memory.
      #1 Non-DMA able large buffer (will be allocated on demand) to capture actual FW crash dump.
      #2 DMA buffer (persistence allocation) just to do a arbitrator job.
      
      Firmware will keep writing Crash dump data in chucks of DMA buffer size into #2,
      which will be copy back by driver to the host memory as described in #1.
      
      Driver-Firmware interface:
      ==================
      A.) Host driver can allocate maximum 512MB Host memory to store crash dump data.
      
      This memory will be internal to the host and will not be exposed to the Firmware.
      Driver may not be able to allocate 512 MB. In that case, driver will do possible memory
      (available at run time) allocation to store crash dump data.
      
      Let’s call this buffer as Host Crash Buffer.
      
      Host Crash buffer will not be contigious as a whole, but it will have multiple chunk of contigious memory.
      This will be internal to driver and firmware/application are unaware of it.
      Partial allocation of Host Crash buffer may have valid information to debug depending upon
      what was collected in that buffer and depending on nature of failure.
      
      Complete Crash dump is the best case, but we do want to capture partial buffer just to grab something rather than nothing.
      Host Crash buffer will be allocated only when FW Crash dump data is available,
      and will be deallocated once application copy Host Crash buffer to the file.
      Host Crash buffer size can be anything between 1MB to 512MB. (It will be multiple of 1MBs)
      
      B.) Irrespective of underlying Firmware capability of crash dump support,
      driver will allocate DMA buffer at start of the day for each MR controllers.
      Let’s call this buffer as “DMA Crash Buffer”.
      
      For this feature, size of DMA crash buffer will be 1MB.
      (We will not gain much even if DMA buffer size is increased.)
      
      C.) Driver will now read Controller Info sending existing dcmd “MR_DCMD_CTRL_GET_INFO”.
      Driver should extract the information from ctrl info provided by firmware and
      figure out if firmware support crash dump feature or not.
      
      Driver will enable crash dump feature only if
      “Firmware support Crash dump” +
      “Driver was able to create DMA Crash Buffer”.
      
      If either one from above is not set, Crash dump feature should be disable in driver.
      Firmware will enable crash dump feature only if “Driver Send DCMD- MR_DCMD_SET_CRASH_BUF_PARA with MR_CRASH_BUF_TURN_ON”
      
      Helper application/script should use sysfs parameter fw_crash_xxx to actually copy data from
      host memory to the filesystem.
      Signed-off-by: NSumit Saxena <sumit.saxena@avagotech.com>
      Signed-off-by: NKashyap Desai <kashyap.desai@avagotech.com>
      Reviewed-by: NTomas Henzl <thenzl@redhat.com>
      Signed-off-by: NChristoph Hellwig <hch@lst.de>
      fc62b3fc
    • S
      megaraid_sas : Update threshold based reply post host index register · db4fc864
      Sumit.Saxena@avagotech.com 提交于
      Resending the patch. Addressed the review comments from Tomas Henzl.
      
      Current driver updates reply post host index to let firmware know that replies are processed,
      while returning from ISR function, only if there is no oustanding replies in reply queue.
      
      Driver will free the request frame immediately from ISR but reply post host index is not yet updated.
      It means freed request can be used by submission path and there may be a tight loop in request/reply
      path. In such condition, firmware may crash when it tries to post reply and there is no free
      reply post descriptor.
      
      Eventually two things needs to be change to avoid this issue.
      
      Increase reply queue depth (double than request queue) to accommodate worst case scenario.
      Update reply post host index to firmware once it reach to some pre-defined threshold value.
      
      This change will make sure that firmware will always have some buffer of reply descriptor and
      will never find empty reply descriptor in completion path.
      Signed-off-by: NSumit Saxena <sumit.saxena@avagotech.com>
      Signed-off-by: NKashyap Desai <kashyap.desai@avagotech.com>
      Reviewed-by: NTomas Henzl <thenzl@redhat.com>
      Signed-off-by: NChristoph Hellwig <hch@lst.de>
      db4fc864
    • S
      megaraid_sas : Use writeq for 64bit pci write to avoid spinlock overhead · 07560409
      Sumit.Saxena@avagotech.com 提交于
      Resending the patch. Addressed the review comments from Tomas Henzl.
      Reduce the assingment for u64 req_data variable.
      
      Use writeq() for 64bit PCI write instead of writel() to avoid additional lock overhead.
      Signed-off-by: NSumit Saxena <sumit.saxena@avagotech.com>
      Signed-off-by: NKashyap Desai <kashyap.desai@avagotech.com>
      Reviewed-by: NTomas Henzl <thenzl@redhat.com>
      Signed-off-by: NChristoph Hellwig <hch@lst.de>
      07560409
    • A
      megaraid_sas: Fix reset_mutex leak · a2fbcbc3
      Adam Radford 提交于
      The following patch for megaraid_sas fixes a reset_mutex leak in megasas_reset_fusion().
      Signed-off-by: NAdam Radford <aradford@gmail.com>
      Reviewed-by: NMartin K. Petersen <martin.petersen@oracle.com>
      Signed-off-by: NChristoph Hellwig <hch@lst.de>
      a2fbcbc3