Commits · 70a5f3bb5f80d9bc7aa746816b32ab17e3c56029 · openeuler / raspberrypi-kernel

05 Feb, 2015 1 commit

cxgb4: Add low latency socket busy_poll support · 3a336cb1

Hariprasad Shenai authored Feb 04, 2015

cxgb_busy_poll, corresponding to ndo_busy_poll, gets called by the socket
waiting for data.

With busy_poll enabled, latency improves, as measured with netperf TCP_RR.
Below are the latency numbers, with and without busy-poll, in a switched
environment for a particular message size:
netperf command: netperf -4 -H <ip> -l 30 -t TCP_RR -- -r1,1
Latency without busy-poll: ~16.25 us
Latency with busy-poll   : ~08.79 us
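
As a quick check of the figures above, the relative improvement can be worked out directly. The snippet is only arithmetic on the two quoted latencies; the derived transaction rates assume a single request/response in flight at a time, which is an assumption about the test setup rather than something the commit states.

```python
# Latencies quoted in the commit message, in microseconds.
latency_without_us = 16.25
latency_with_us = 8.79

# Fraction of the round-trip latency removed by busy polling.
reduction = (latency_without_us - latency_with_us) / latency_without_us
# How many times faster a single round trip becomes.
speedup = latency_without_us / latency_with_us

# Assumption: one transaction in flight, so rate = 1 / latency.
rate_without = 1e6 / latency_without_us
rate_with = 1e6 / latency_with_us

print(f"latency reduction: {reduction:.1%}")
print(f"speedup: {speedup:.2f}x")
print(f"transactions/s: {rate_without:.0f} -> {rate_with:.0f}")
```

In other words, busy polling removes a little under half of the round-trip latency in this particular measurement.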

Based on original work by Kumar Sanghvi <kumaras@chelsio.com>
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

28 Jan, 2015 1 commit

cxgb4: Move firmware version MACRO to t4fw_version.h · cd6c2f12

Hariprasad Shenai authored Jan 27, 2015

Move firmware version MACRO to a new t4fw_version.h file so that the csiostor
driver can also use it.
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

27 Jan, 2015 4 commits

cxgb4: Added support in debugfs to dump PM module stats · b3bbe36a

Hariprasad Shenai authored Jan 27, 2015

Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

cxgb4: Added support in debugfs to dump CIM outbound queue content · c778af7d

Hariprasad Shenai authored Jan 27, 2015

Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

cxgb4: Added support in debugfs to dump cim ingress bound queue contents · e5f0e43b

Hariprasad Shenai authored Jan 27, 2015

Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

cxgb4: Added support in debugfs to dump sge_qinfo · dc9daab2

Hariprasad Shenai authored Jan 27, 2015

Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

25 Jan, 2015 2 commits

cxgb4: Add debugfs options to dump the rss key, config for PF, VF, etc · 688ea5fe

Hariprasad Shenai authored Jan 20, 2015

Adds support to dump the rss table, rss_config, rss_key, rss_pf_config and
rss_vf_config
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

cxgb4: Add debugfs entry to dump the contents of the flash · 49216c1c

Hariprasad Shenai authored Jan 20, 2015

Adds support to dump the contents of the flash in the adapter
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

16 Jan, 2015 1 commit

cxgb4 : Update ipv6 address handling api · b5a02f50

Anish Bhatt authored Jan 14, 2015

This patch improves on previously added support for ipv6 addresses. The code
is consolidated into a single file, and an api is added for use by dependent
upper level drivers such as cxgb4i/iw_cxgb4 etc.
Signed-off-by: Anish Bhatt <anish@chelsio.com>
Signed-off-by: Deepak Singh <deepak.s@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

09 Jan, 2015 3 commits

cxgb4: Add support for cim_qcfg entry in debugfs · 74b3092c

Hariprasad Shenai authored Jan 07, 2015

Adds debug log to get cim queue config
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

cxgb4: Add support for cim_la entry in debugfs · f1ff24aa

Hariprasad Shenai authored Jan 07, 2015

The CIM LA captures the embedded processor’s internal state. Optionally, it can
also trace the flow of data in and out of the embedded processor. Therefore, the
CIM LA output contains detailed information of what code the embedded processor
executed prior to the CIM LA capture.
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

cxgb4: Add support for devlog · 49aa284f

Hariprasad Shenai authored Jan 07, 2015

Add support for device log entry in debugfs
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

13 Dec, 2014 1 commit

cxgb4: Add support for QSA modules · 40e9de4b

Hariprasad Shenai authored Dec 12, 2014

Firmware 1.12.25.0 added support for QSA modules; this adds the driver code for them.
Also fixes some ethtool get settings for other module types.
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

11 Dec, 2014 1 commit

cxgb4/cxgb4vf: global names must be unique · dd0bcc0b

Stephen Rothwell authored Dec 10, 2014

Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>
Signed-off-by: David S. Miller <davem@davemloft.net>

10 Dec, 2014 3 commits

cxgb4/cxgb4vf: Use new interfaces to calculate BAR2 SGE Queue Register addresses · df64e4d3

Hariprasad Shenai authored Dec 03, 2014

Use BAR2 Going To Sleep (GTS) for T5 and later. Use new BAR2 User Doorbells for
T5 for both cxgb4 and cxgb4vf driver.

Based on original work by Casey Leedom <leedom@chelsio.com>
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

cxgb4/cxgb4vf: Add code to calculate T5 BAR2 Offsets for SGE Queue Registers · e85c9a7a

Hariprasad Shenai authored Dec 03, 2014

Add new Common Code facilities for calculating T5 BAR2 Offsets for SGE Queue
Registers. This new code can handle situations where

    Queues Per Page * SGE BAR2 Queue Register Area Size > Page Size
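
A minimal sketch of how such an offset calculation might look, including the case where the per-queue register areas no longer fit in one page. The function name, parameter names, and the 128-byte register area size are illustrative assumptions and not taken from the commit; this is not the driver's code.

```python
def bar2_queue_location(qid, page_shift, qpp_shift, area_size=128):
    """Return (bar2_offset, bar2_qid) for a queue ID.

    Hypothetical helper: all names and the default area size are assumptions.
    """
    page_size = 1 << page_shift
    qpp_mask = (1 << qpp_shift) - 1

    # Start of the page that holds this queue's registers.
    page_offset = (qid >> qpp_shift) << page_shift
    # Index of the queue within that page.
    qid_in_page = qid & qpp_mask
    qid_offset = qid_in_page * area_size

    if qid_offset < page_size:
        # The queue's own register area lies inside the page.
        return page_offset + qid_offset, 0

    # Queues Per Page * area size > page size: the area would fall
    # outside the page, so report the page start and let the caller
    # identify the queue by its in-page index instead.
    return page_offset, qid_in_page


# With 4 KiB pages and 8 queues per page, every queue has its own area.
print(bar2_queue_location(10, page_shift=12, qpp_shift=3))
# With 64 queues per page, higher in-page indexes no longer fit.
print(bar2_queue_location(40, page_shift=12, qpp_shift=6))
```

The second call illustrates the situation described by the inequality above: the returned in-page queue index is non-zero because a dedicated register area for that queue would lie beyond the page.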

Based on original work by Casey Leedom <leedom@chelsio.com>
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

cxgb4: Update FW version string to match FW binary version 1.12.25.0 · c5ac9704

Hariprasad Shenai authored Dec 03, 2014

Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

23 Nov, 2014 1 commit

RDMA/cxgb4/cxgb4vf/csiostor: Cleanup macros/register defines related to PCIE, RSS and FW · b2e1a3f0

Hariprasad Shenai authored Nov 21, 2014

This patch cleans up all PCIE, RSS & FW related macros/register defines that are
defined in t4fw_api.h and the affected files.
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

11 Nov, 2014 2 commits

cxgb4: Cleanup macros so they follow the same style and look consistent, part 2 · e2ac9628

Hariprasad Shenai authored Nov 07, 2014

Various patches have ended up changing the style of the symbolic macros/register
defines to different styles.

As a result, the current kernel.org files are a mix of different macro styles.
Since these macro/register defines are used by different drivers, a
few patch series have ended up adding duplicate macro/register define entries
with different styles. This makes these register define/macro files a complete
mess and we want to make them clean and consistent. This patch cleans up a part
of it.
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

cxgb4: Add cxgb4_debugfs.c, move all debugfs code to new file · fd88b31a

Hariprasad Shenai authored Nov 07, 2014

Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

15 Oct, 2014 1 commit

cxgb4: Fix FW flash logic using ethtool · 22c0b963

Hariprasad Shenai authored Oct 15, 2014

Use t4_fw_upgrade instead of t4_load_fw to write firmware into FLASH, since
t4_load_fw doesn't co-ordinate with the firmware and the adapter can get hosed
enough to require a power cycle of the system.

Based on original work by Casey Leedom <leedom@chelsio.com>
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

10 Oct, 2014 1 commit

cxgb4: Wait for device to get ready before reading any register · 8203b509

Hariprasad Shenai authored Oct 09, 2014

Call t4_wait_dev_ready() before attempting to read the PL_WHOAMI register
(to determine which function we have been attached to). This prevents us from
failing on that read if it comes right after a RESET.
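
The wait-then-read pattern described above can be sketched as a small polling loop. Everything here is an assumption made for illustration: the `read_whoami` callback, the all-ones value standing in for "device not ready", and the retry count and delay are hypothetical and not taken from the driver.

```python
import time


def wait_dev_ready(read_whoami, retries=2, delay_s=0.5, sleep=time.sleep):
    """Poll a register until it stops reading as all-ones.

    Assumption: a device that is still coming out of reset answers
    register reads with 0xFFFFFFFF.
    """
    for attempt in range(retries):
        if read_whoami() != 0xFFFFFFFF:
            return True
        if attempt + 1 < retries:
            # Give the device time to finish its reset before retrying.
            sleep(delay_s)
    return False


# Usage: a simulated register that reads all-ones once, then a real value.
values = iter([0xFFFFFFFF, 0x00000104])
ready = wait_dev_ready(lambda: next(values), sleep=lambda s: None)
print("device ready:", ready)
```

Only once such a check succeeds would the caller go on to read the register for real and decode which function it is attached to.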
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

29 Sep, 2014 2 commits

cxgb4: Add support for adaptive rx · e553ec3f

Hariprasad Shenai authored Sep 26, 2014

Based on original work by Kumar Sanghvi <kumaras@chelsio.com>
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

cxgb4: Use BAR2 Going To Sleep (GTS) for T5 and later. · d63a6dcf

Hariprasad Shenai authored Sep 26, 2014

Use BAR2 GTS for T5. If we are on T4 use the old doorbell mechanism;
otherwise use the new BAR2 mechanism. Use BAR2 doorbells for refilling FL's.

Based on original work by Casey Leedom <leedom@chelsio.com>
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

22 Aug, 2014 1 commit

cxgb4: Fix race condition in cleanup · 29aaee65

Anish Bhatt authored Aug 20, 2014

There is a possible race condition when we unregister the PCI Driver and then
flush/destroy the global "workq". This could lead to situations where there
are tasks on the Work Queue with references to now deleted adapter data
structures. Instead, have per-adapter Work Queues which are instantiated and
torn down in init_one() and remove_one(), respectively.

v2: Remove unnecessary call to flush_workqueue() before destroy_workqueue()
Signed-off-by: Anish Bhatt <anish@chelsio.com>
Signed-off-by: Casey Leedom <leedom@chelsio.com>
Acked-by: Neil Horman <nhorman@tuxdriver.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

08 Aug, 2014 2 commits

cxgb4: IEEE fixes for DCBx state machine · 10b00466

Anish Bhatt authored Aug 07, 2014

* Changes required due to 16eecd9b ("dcbnl : Fix misleading
  dcb_app->priority explanation")
* Driver was previously not aware of what DCBx version was negotiated by
  firmware; this could lead to the DCB app table in kernel or in firmware being
  populated incorrectly, since IEEE/CEE use different formats, as made clear by
  the above-mentioned commit
* Driver was missing a couple of state transitions that could be caused
  by other drivers that use chelsio hardware, resulting in incorrect behaviour
  (the change that addresses this also flips the state machine to switch on
   state instead of transition, hope this is okay in current window)
* Prio queue info & tsa is no longer thrown away

v2: Print DCBx state transition messages only when debug is enabled
Signed-off-by: Anish Bhatt <anish@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

cxgb4: Update FW version string to match FW binary version · 6c5caae0

Hariprasad Shenai authored Aug 07, 2014

Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

05 Aug, 2014 1 commit

cxgb4: only free allocated fls · 5fa76694

Hariprasad Shenai authored Aug 04, 2014

Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

16 Jul, 2014 1 commit

cxgb4/iw_cxgb4: use firmware ord/ird resource limits · 4c2c5763

Hariprasad Shenai authored Jul 14, 2014

Advertise a larger max read queue depth for qps, and gather the resource limits
from fw and use them to avoid exhausting all the resources.

Design:

cxgb4:

Obtain the max_ordird_qp and max_ird_adapter device params from FW
at init time and pass them up to the ULDs when they attach.  If these
parameters are not available, due to older firmware, then hard-code
the values based on the known values for older firmware.

iw_cxgb4:

Fix the c4iw_query_device() to report these correct values based on
adapter parameters.  ibv_query_device() will always return:

max_qp_rd_atom = max_qp_init_rd_atom = min(module_max, max_ordird_qp)
max_res_rd_atom = max_ird_adapter

Bump up the per qp max module option to 32, allowing it to be increased
by the user up to the device max of max_ordird_qp.  32 seems to be
sufficient to maximize throughput for streaming read benchmarks.

Fail connection setup if the negotiated IRD exhausts the available
adapter ird resources.  So the driver will track the amount of ird
resource in use and not send an RI_WR/INIT to FW that would reduce the
available ird resources below zero.
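
The limits and the accounting described above can be sketched as follows. This is a simplified model under stated assumptions: the class and method names are hypothetical, the per-QP check is an illustrative addition, and locking is ignored entirely.

```python
class IrdBudget:
    """Tracks adapter-wide IRD usage (hypothetical names throughout)."""

    def __init__(self, max_ird_adapter, max_ordird_qp, module_max=32):
        # Adapter-wide pool, reported as max_res_rd_atom.
        self.available = max_ird_adapter
        # Per-QP limit, reported as max_qp_rd_atom / max_qp_init_rd_atom.
        self.per_qp_limit = min(module_max, max_ordird_qp)

    def reserve(self, ird):
        """Reserve IRD for a connection; refuse if it would overdraw the pool."""
        if ird > self.per_qp_limit or ird > self.available:
            return False  # connection setup would fail here
        self.available -= ird
        return True

    def release(self, ird):
        """Return IRD to the pool when a connection goes away."""
        self.available += ird


# Usage: a small pool that two connections exhaust.
budget = IrdBudget(max_ird_adapter=64, max_ordird_qp=128)
print(budget.reserve(32), budget.reserve(32), budget.reserve(1))
```

The point of the check in `reserve` is that the pool never goes negative: a request that does not fit is rejected before anything is sent to the firmware.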
Signed-off-by: Steve Wise <swise@opengridcomputing.com>
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

02 Jul, 2014 2 commits

cxgb4: Replaced the backdoor mechanism to access the HW memory with PCIe Window method · fc5ab020

Hariprasad Shenai authored Jun 27, 2014

Rip out a bunch of redundant PCI-E Memory Window Read/Write routines,
collapse the more general purpose routines into a single routine
thereby eliminating the need for a large stack frame (and extra data
copying) in the outer routine, change everything to use the improved
routine t4_memory_rw.

Based on original work by Casey Leedom <leedom@chelsio.com> and
Steve Wise <swise@opengridcomputing.com>
Signed-off-by: Casey Leedom <leedom@chelsio.com>
Signed-off-by: Steve Wise <swise@opengridcomputing.com>
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

cxgb4: Use FW interface to get BAR0 value · 0abfd152

Hariprasad Shenai authored Jun 27, 2014

Use the firmware interface to get the BAR0 value since we really don't want
to use the PCI-E Configuration Space Backdoor access which is owned by the
firmware.

Set up PCI-E Memory Window registers using the true values programmed into
BAR registers.  When the PF4 "Master Function" is exported to a Virtual
Machine, the values returned by pci_resource_start() will be for the
synthetic PCI-E Configuration Space and not the real addresses. But we need
to program the PCI-E Memory Window address decoders with the real addresses
that we're going to be using in order to have accesses through the Memory
Windows work.

Based on original work by Casey Leedom <leedom@chelsio.com>
Signed-off-by: Casey Leedom <leedom@chelsio.com>
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

23 Jun, 2014 2 commits

cxgb4 : Update copyright year on all cxgb4 files · ce100b8b

Anish Bhatt authored Jun 19, 2014

Signed-off-by: Anish Bhatt <anish@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

cxgb4 : Integrate DCBx support into cxgb4 module. Register dbcnl_ops to give... · 688848b1

Anish Bhatt authored Jun 19, 2014

cxgb4 : Integrate DCBx support into cxgb4 module. Register dbcnl_ops to give access to DCBx functions
Signed-off-by: Anish Bhatt <anish@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

11 Jun, 2014 1 commit

iw_cxgb4: Allocate and use IQs specifically for indirect interrupts · cf38be6d

Hariprasad Shenai authored Jun 06, 2014

Currently indirect interrupts for RDMA CQs funnel through the LLD's RDMA
RXQs, which also handle direct interrupts for offload CPLs during RDMA
connection setup/teardown.  The intended T4 usage model, however, is to
have indirect interrupts flow through dedicated IQs, i.e. not to mix
indirect interrupts with CPL messages in an IQ.  This patch adds the
concept of RDMA concentrator IQs, or CIQs, set up and maintained by the
LLD and exported to iw_cxgb4 for use when creating CQs. RDMA CPLs will
flow through the LLD's RDMA RXQs, and CQ interrupts flow through the
CIQs.

Design:

cxgb4 creates and exports an array of CIQs for the RDMA ULD.  These IQs
are sized according to the maximum number of CQs available at adapter init.
In addition, these IQs don't need FL buffers since they only service
indirect interrupts.  One CIQ is set up per RX channel, similar to the
RDMA RXQs.

iw_cxgb4 will utilize these CIQs based on the vector value passed into
create_cq().  The num_comp_vectors advertised by iw_cxgb4 will be the
number of CIQs configured, and thus the vector value will be the index
into the array of CIQs.
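
The mapping from completion vector to CIQ amounts to an array lookup. The sketch below uses hypothetical helper names and string placeholders for the queues; rejecting an out-of-range vector is an assumption, since the commit message does not say how that case is handled.

```python
def build_ciqs(num_rx_channels):
    """Create one concentrator IQ per RX channel (placeholder objects)."""
    return [f"ciq{channel}" for channel in range(num_rx_channels)]


def ciq_for_vector(ciqs, vector):
    """Pick the CIQ for a CQ from the vector passed to create_cq()."""
    if not 0 <= vector < len(ciqs):
        # Assumption: an out-of-range vector is treated as an error.
        raise ValueError(f"vector {vector} outside 0..{len(ciqs) - 1}")
    return ciqs[vector]


ciqs = build_ciqs(num_rx_channels=4)
num_comp_vectors = len(ciqs)  # what would be advertised to consumers
print(num_comp_vectors, ciq_for_vector(ciqs, 2))
```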

Based on original work by Steve Wise <swise@opengridcomputing.com>
Signed-off-by: Steve Wise <swise@opengridcomputing.com>
Signed-off-by: Hariprasad Shenai <hariprasad@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

15 Mar, 2014 1 commit

cxgb4/iw_cxgb4: Doorbell Drop Avoidance Bug Fixes · 05eb2389

Steve Wise authored Mar 14, 2014

The current logic suffers from a slow response time to disable user DB
usage, and also fails to avoid DB FIFO drops under heavy load. This commit
fixes these deficiencies and makes the avoidance logic more efficient.
It does so by notifying the ULDs of potential DB problems more efficiently
and by implementing a smoother flow control algorithm in iw_cxgb4,
which is the ULD that puts the most load on the DB FIFO.

Design:

cxgb4:

Direct ULD callback from the DB FULL/DROP interrupt handler. This allows
the ULD to stop doing user DB writes as quickly as possible.

While user DB usage is disabled, the LLD will accumulate DB write events
for its queues. Then once DB usage is reenabled, a single DB write is
done for each queue with its accumulated write count. This reduces the
load put on the DB fifo when reenabling.

iw_cxgb4:

Instead of marking each qp to indicate DB writes are disabled, we create
a device-global status page that each user process maps. This allows
iw_cxgb4 to only set this single bit to disable all DB writes for all
user QPs vs traversing the idr of all the active QPs. If the libcxgb4
doesn't support this, then we fall back to the old approach of marking
each QP. Thus we allow the new driver to work with an older libcxgb4.

When the LLD upcalls iw_cxgb4 indicating DB FULL, we disable all DB writes
via the status page and transition the DB state to STOPPED. As user
processes see that DB writes are disabled, they call into iw_cxgb4
to submit their DB write events. Since the DB state is in STOPPED,
the QP trying to write gets enqueued on a new DB "flow control" list.
As subsequent DB writes are submitted for this flow controlled QP, the
number of writes is accumulated for each QP on the flow control list.
So all the user QPs that are actively ringing the DB get put on this
list and the number of writes they request are accumulated.

When the LLD upcalls iw_cxgb4 indicating DB EMPTY, which is in a workq
context, we change the DB state to FLOW_CONTROL, and begin resuming all
the QPs that are on the flow control list. This logic runs on until
the flow control list is empty or we exit FLOW_CONTROL mode (due to
a DB DROP upcall, for example). QPs are removed from this list, and
their accumulated DB write counts written to the DB FIFO. Sets of QPs,
called chunks in the code, are removed at one time. The chunk size is 64.
So 64 QPs are resumed at a time, and before the next chunk is resumed, the
logic waits (blocks) for the DB FIFO to drain. This prevents resuming too
quickly and overflowing the FIFO. Once the flow control list is empty,
the db state transitions back to NORMAL and user QPs are again allowed
to write directly to the user DB register.

The algorithm is designed such that if the DB write load is high enough,
then all the DB writes get submitted by the kernel using this flow
controlled approach to avoid DB drops. As the load lightens though, we
return to normal DB writes directly by user applications.
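
The state transitions and the chunked resume described above can be modelled in a few lines. This is a single-threaded sketch under stated assumptions: the class, method, and callback names are hypothetical, `ring_hw` stands in for a write to the DB FIFO, `wait_fifo_drain` stands in for the blocking wait, and the status page and locking are left out.

```python
from collections import OrderedDict

NORMAL, STOPPED, FLOW_CONTROL = "NORMAL", "STOPPED", "FLOW_CONTROL"
CHUNK_SIZE = 64  # QPs resumed per chunk, as stated in the commit message


class DoorbellFlowControl:
    def __init__(self, ring_hw, wait_fifo_drain):
        self.state = NORMAL
        self.pending = OrderedDict()  # QP id -> accumulated write count
        self.ring_hw = ring_hw
        self.wait_fifo_drain = wait_fifo_drain

    def on_db_full(self):
        # DB FULL (or DROP) upcall: stop direct doorbell writes.
        self.state = STOPPED

    def ring(self, qp, count):
        if self.state == NORMAL:
            self.ring_hw(qp, count)  # direct write, no flow control
        else:
            # Queue the QP and accumulate its requested writes.
            self.pending[qp] = self.pending.get(qp, 0) + count

    def on_db_empty(self):
        # DB EMPTY upcall: drain the flow control list chunk by chunk.
        self.state = FLOW_CONTROL
        while self.pending and self.state == FLOW_CONTROL:
            for _ in range(min(CHUNK_SIZE, len(self.pending))):
                qp, count = self.pending.popitem(last=False)
                self.ring_hw(qp, count)  # one write per QP
            if self.pending:
                self.wait_fifo_drain()  # block before the next chunk
        if self.state == FLOW_CONTROL:
            self.state = NORMAL
```

If `wait_fifo_drain` (or anything else) calls `on_db_full` while the list is being drained, the loop stops and the remaining QPs stay queued, which mirrors the "exit FLOW_CONTROL mode" case mentioned in the text.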
Signed-off-by: Steve Wise <swise@opengridcomputing.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

14 Mar, 2014 2 commits

cxgb4: Rectify emitting messages about SGE Ingress DMA channels being potentially stuck · 0f4d201f

Kumar Sanghvi authored Mar 13, 2014

Based on original work by Casey Leedom <leedom@chelsio.com>
Signed-off-by: Kumar Sanghvi <kumaras@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

cxgb4: Add code to dump SGE registers when hitting idma hangs · 68bce192

Kumar Sanghvi authored Mar 13, 2014

Based on original work by Casey Leedom <leedom@chelsio.com>
Signed-off-by: Kumar Sanghvi <kumaras@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

19 Feb, 2014 3 commits

cxgb4: Query firmware for T5 ULPTX MEMWRITE DSGL capabilities · 1ac0f095

Kumar Sanghvi authored Feb 18, 2014

Query firmware to see whether we're allowed to use T5 ULPTX MEMWRITE DSGL
capabilities.  Also pass that information to Upper Layer Drivers via the
new (struct cxgb4_lld_info).ulptx_memwrite_dsgl boolean.

Based on original work by Casey Leedom <leedom@chelsio.com>
Signed-off-by: Kumar Sanghvi <kumaras@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

cxgb4: Print adapter VPD Part Number instead of Engineering Change field · a94cd705

Kumar Sanghvi authored Feb 18, 2014

When we attach to adapter, print VPD Part Number instead of Engineering Change field.
Based on original work by Casey Leedom <leedom@chelsio.com>
Signed-off-by: Kumar Sanghvi <kumaras@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

cxgb4: Add support to recognize 40G links · 72aca4bf

Kumar Sanghvi authored Feb 18, 2014

Also, create a new Common Code interface to translate Firmware Port Technology
Type values (enum fw_port_type) to string descriptions.  This will allow us
to maintain the description translation table in one place rather than in
every driver.

Based on original work by Scott Bardone and Casey Leedom <leedom@chelsio.com>
Signed-off-by: Kumar Sanghvi <kumaras@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>