drivers/infiniband/hw/hfi1/verbs.h · 3341713c67d5eae5c68bab30add97e9f9ecfafa5 · openanolis / cloud-kernel

IB/hfi1: Fix yield logic in send engine · dd1ed108

Committed by Mike Marciniszyn on May 04, 2017

When there are many RC QPs and an RDMA READ request
is sent, timeouts occur on the requester side because
of fairness problems among RC QPs on their relative SDMA
engine on the responder side. This also affects write and
send, but to a lesser extent.

Complicating the issue, the current code checks whether the
workqueue is congested before scheduling other QPs. However,
this check is based on the number of active entries in the
workqueue, which was found to be too big for
workqueue_congested() to be effective.

Fix by reducing the number of active entries from the default
of num_sdma to HFI1_MAX_ACTIVE_WORKQUEUE_ENTRIES, a value
found by experimentation. Retry counts were monitored
to determine the correct value.
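
The effect of the active-entry limit on the congestion check can be illustrated with a small userspace model. This is only a sketch and not the hfi1 or kernel workqueue code: the `toy_*` names are hypothetical, the limits used in the usage note are made-up numbers, and the model assumes that congestion is reported only once work items are waiting beyond the active limit.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Toy model of a workqueue. This is NOT the kernel's workqueue_struct. */
struct toy_wq {
	size_t max_active;	/* limit on concurrently active work items */
	size_t active;		/* items currently counted as active */
	size_t delayed;		/* items waiting because the limit was hit */
};

static void toy_wq_init(struct toy_wq *wq, size_t max_active)
{
	assert(max_active > 0);
	wq->max_active = max_active;
	wq->active = 0;
	wq->delayed = 0;
}

/* Queue one item: it is active if there is room, otherwise delayed. */
static void toy_queue_work(struct toy_wq *wq)
{
	if (wq->active < wq->max_active)
		wq->active++;
	else
		wq->delayed++;
}

/* Assumption: congestion means at least one item is waiting. */
static bool toy_wq_congested(const struct toy_wq *wq)
{
	return wq->delayed > 0;
}

/* Hypothetical send-engine check: yield to other QPs when congested. */
static bool toy_should_yield(const struct toy_wq *wq)
{
	return toy_wq_congested(wq);
}
```

With eight items queued, a toy queue initialised with a limit of 16 never reports congestion, so `toy_should_yield()` stays false and a busy QP keeps running. The same load on a queue with a limit of 4 leaves four items delayed, and the yield check fires. This mirrors why a smaller limit makes the congestion check useful again.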

Tracing to investigate any future issues is also added.

Reviewed-by: Mike Marciniszyn <mike.marciniszyn@intel.com>
Signed-off-by: Sebastian Sanchez <sebastian.sanchez@intel.com>
Signed-off-by: Dennis Dalessandro <dennis.dalessandro@intel.com>
Signed-off-by: Doug Ledford <dledford@redhat.com>


verbs.h 12.2 KB

openanolis / cloud-kernel synced successfully more than 1 year ago
