1. 10 4月, 2012 19 次提交
  2. 09 4月, 2012 1 次提交
  3. 31 3月, 2012 1 次提交
  4. 27 3月, 2012 7 次提交
  5. 24 3月, 2012 12 次提交
    • H
      poll: add poll_requested_events() and poll_does_not_wait() functions · 626cf236
      Hans Verkuil 提交于
      In some cases the poll() implementation in a driver has to do different
      things depending on the events the caller wants to poll for.  An example
      is when a driver needs to start a DMA engine if the caller polls for
      POLLIN, but doesn't want to do that if POLLIN is not requested but instead
      only POLLOUT or POLLPRI is requested.  This is something that can happen
      in the video4linux subsystem among others.
      
      Unfortunately, the current epoll/poll/select implementation doesn't
      provide that information reliably.  The poll_table_struct does have it: it
      has a key field with the event mask.  But once a poll() call matches one
      or more bits of that mask any following poll() calls are passed a NULL
      poll_table pointer.
      
      Also, the eventpoll implementation always left the key field at ~0 instead
      of using the requested events mask.
      
      This was changed in eventpoll.c so the key field now contains the actual
      events that should be polled for as set by the caller.
      
      The solution to the NULL poll_table pointer is to set the qproc field to
      NULL in poll_table once poll() matches the events, not the poll_table
      pointer itself.  That way drivers can obtain the mask through a new
      poll_requested_events inline.
      
      The poll_table_struct can still be NULL since some kernel code calls it
      internally (netfs_state_poll() in ./drivers/staging/pohmelfs/netfs.h).  In
      that case poll_requested_events() returns ~0 (i.e.  all events).
      
      Very rarely drivers might want to know whether poll_wait will actually
      wait.  If another earlier file descriptor in the set already matched the
      events the caller wanted to wait for, then the kernel will return from the
      select() call without waiting.  This might be useful information in order
      to avoid doing expensive work.
      
      A new helper function poll_does_not_wait() is added that drivers can use
      to detect this situation.  This is now used in sock_poll_wait() in
      include/net/sock.h.  This was the only place in the kernel that needed
      this information.
      
      Drivers should no longer access any of the poll_table internals, but use
      the poll_requested_events() and poll_does_not_wait() access functions
      instead.  In order to enforce that the poll_table fields are now prepended
      with an underscore and a comment was added warning against using them
      directly.
      
      This required a change in unix_dgram_poll() in unix/af_unix.c which used
      the key field to get the requested events.  It's been replaced by a call
      to poll_requested_events().
      
      For qproc it was especially important to change its name since the
      behavior of that field changes with this patch since this function pointer
      can now be NULL when that wasn't possible in the past.
      
      Any driver accessing the qproc or key fields directly will now fail to compile.
      
      Some notes regarding the correctness of this patch: the driver's poll()
      function is called with a 'struct poll_table_struct *wait' argument.  This
      pointer may or may not be NULL, drivers can never rely on it being one or
      the other as that depends on whether or not an earlier file descriptor in
      the select()'s fdset matched the requested events.
      
      There are only three things a driver can do with the wait argument:
      
      1) obtain the key field:
      
      	events = wait ? wait->key : ~0;
      
         This will still work although it should be replaced with the new
         poll_requested_events() function (which does exactly the same).
         This will now even work better, since wait is no longer set to NULL
         unnecessarily.
      
      2) use the qproc callback. This could be deadly since qproc can now be
         NULL. Renaming qproc should prevent this from happening. There are no
         kernel drivers that actually access this callback directly, BTW.
      
      3) test whether wait == NULL to determine whether poll would return without
         waiting. This is no longer sufficient as the correct test is now
         wait == NULL || wait->_qproc == NULL.
      
         However, the worst that can happen here is a slight performance hit in
         the case where wait != NULL and wait->_qproc == NULL. In that case the
         driver will assume that poll_wait() will actually add the fd to the set
         of waiting file descriptors. Of course, poll_wait() will not do that
         since it tests for wait->_qproc. This will not break anything, though.
      
         There is only one place in the whole kernel where this happens
         (sock_poll_wait() in include/net/sock.h) and that code will be replaced
         by a call to poll_does_not_wait() in the next patch.
      
         Note that even if wait->_qproc != NULL drivers cannot rely on poll_wait()
         actually waiting. The next file descriptor from the set might match the
         event mask and thus any possible waits will never happen.
      Signed-off-by: NHans Verkuil <hans.verkuil@cisco.com>
      Reviewed-by: NJonathan Corbet <corbet@lwn.net>
      Reviewed-by: NAl Viro <viro@zeniv.linux.org.uk>
      Cc: Davide Libenzi <davidel@xmailserver.org>
      Signed-off-by: NHans de Goede <hdegoede@redhat.com>
      Cc: Mauro Carvalho Chehab <mchehab@infradead.org>
      Cc: David Miller <davem@davemloft.net>
      Cc: Eric Dumazet <eric.dumazet@gmail.com>
      Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
      Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      626cf236
    • D
      crc32: select an algorithm via Kconfig · 5cde7656
      Darrick J. Wong 提交于
      Allow the kernel builder to choose a crc32* algorithm for the kernel.
      Signed-off-by: NDarrick J. Wong <djwong@us.ibm.com>
      Cc: Bob Pearson <rpearson@systemfabricworks.com>
      Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
      Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      5cde7656
    • D
      crc32: add self-test code for crc32c · 577eba9e
      Darrick J. Wong 提交于
      Add self-test code for crc32c.
      Signed-off-by: NDarrick J. Wong <djwong@us.ibm.com>
      Cc: Bob Pearson <rpearson@systemfabricworks.com>
      Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
      Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      577eba9e
    • D
      crypto: crc32c should use library implementation · 6a0962b2
      Darrick J. Wong 提交于
      Since lib/crc32.c now provides crc32c, remove the software implementation
      here and call the library function instead.
      Signed-off-by: NDarrick J. Wong <djwong@us.ibm.com>
      Cc: Herbert Xu <herbert@gondor.apana.org.au>
      Cc: Bob Pearson <rpearson@systemfabricworks.com>
      Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
      Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      6a0962b2
    • D
      crc32: bolt on crc32c · 46c5801e
      Darrick J. Wong 提交于
      Reuse the existing crc32 code to stamp out a crc32c implementation.
      Signed-off-by: NDarrick J. Wong <djwong@us.ibm.com>
      Cc: Herbert Xu <herbert@gondor.apana.org.au>
      Cc: Bob Pearson <rpearson@systemfabricworks.com>
      Cc: Randy Dunlap <rdunlap@xenotime.net>
      Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
      Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      46c5801e
    • B
      crc32: add note about this patchset to crc32.c · 78dff418
      Bob Pearson 提交于
      Add a comment at the top of crc32.c
      
      [djwong@us.ibm.com: Minor changelog tweaks]
      Signed-off-by: NBob Pearson <rpearson@systemfabricworks.com>
      Signed-off-by: NDarrick J. Wong <djwong@us.ibm.com>
      Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
      Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      78dff418
    • B
      crc32: optimize loop counter for x86 · 0292c497
      Bob Pearson 提交于
      Add two changes that improve the performance of x86 systems
      
      1. replace main loop with incrementing counter this change improves
         the performance of the selftest by about 5-6% on Nehalem CPUs.  The
         apparent reason is that the compiler can use the loop index to perform
         an indexed memory access.  This is reported to make the performance of
         PowerPC CPUs to get worse.
      
      2. replace the rem_len loop with incrementing counter this change
         improves the performance of the selftest, which has more than the usual
         number of occurances, by about 1-2% on x86 CPUs.  In actual work loads
         the length is most often a multiple of 4 bytes and this code does not
         get executed as often if at all.  Again this change is reported to make
         the performance of PowerPC get worse.
      
      [djwong@us.ibm.com: Minor changelog tweaks]
      Signed-off-by: NBob Pearson <rpearson@systemfabricworks.com>
      Signed-off-by: NDarrick J. Wong <djwong@us.ibm.com>
      Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
      Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      0292c497
    • B
      crc32: add slice-by-8 algorithm to existing code · 324eb0f1
      Bob Pearson 提交于
      Add slicing-by-8 algorithm to the existing slicing-by-4 algorithm.  This
      consists of:
      
      - extend largest BITS size from 32 to 64
      - extend tables from tab[4][256] to up to tab[8][256]
      - Add code for inner loop.
      
      [djwong@us.ibm.com: Minor changelog tweaks]
      Signed-off-by: NBob Pearson <rpearson@systemfabricworks.com>
      Signed-off-by: NDarrick J. Wong <djwong@us.ibm.com>
      Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
      Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      324eb0f1
    • B
      crc32: make CRC_*_BITS definition correspond to actual bit counts · 9a1dbf6a
      Bob Pearson 提交于
      crc32.c provides a choice of one of several algorithms for computing the
      LSB and LSB versions of the CRC32 checksum based on the parameters
      CRC_LE_BITS and CRC_BE_BITS.
      
      In the original version the values 1, 2, 4 and 8 respectively selected
      versions of the alrogithm that computed the crc 1, 2, 4 and 32 bits as a
      time.
      
      This patch series adds a new version that computes the CRC 64 bits at a
      time.  To make things easier to understand the parameter has been
      reinterpreted to actually stand for the number of bits processed in each
      step of the algorithm so that the old value 8 has been replaced with the
      value 32.
      
      This also allows us to add in a widely used crc algorithm that computes
      the crc 8 bits at a time called the Sarwate algorithm.
      
      [djwong@us.ibm.com: Minor changelog tweaks]
      Signed-off-by: NBob Pearson <rpearson@systemfabricworks.com>
      Signed-off-by: NDarrick J. Wong <djwong@us.ibm.com>
      Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
      Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      9a1dbf6a
    • B
      crc32: fix mixing of endian-specific types · ce4320dd
      Bob Pearson 提交于
      crc32.c in its original version freely mixed u32, __le32 and __be32 types
      which caused warnings from sparse with __CHECK_ENDIAN__.  This patch fixes
      these by forcing the types to u32.
      
      [djwong@us.ibm.com: Minor changelog tweaks]
      Signed-off-by: NBob Pearson <rpearson@systemfabricworks.com>
      Signed-off-by: NDarrick J. Wong <djwong@us.ibm.com>
      Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
      Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      ce4320dd
    • B
      crc32: miscellaneous cleanups · 60e58d5c
      Bob Pearson 提交于
      Misc cleanup of lib/crc32.c and related files.
      
      - remove unnecessary header files.
      
      - straighten out some convoluted ifdef's
      
      - rewrite some references to 2 dimensional arrays as 1 dimensional
        arrays to make them correct.  I.e.  replace tab[i] with tab[0][i].
      
      - a few trivial whitespace changes
      
      - fix a warning in gen_crc32tables.c caused by a mismatch in the type of
        the pointer passed to output table.  Since the table is only used at
        kernel compile time, it is simpler to make the table big enough to hold
        the largest column size used.  One cannot make the column size smaller
        in output_table because it has to be used by both the le and be tables
        and they can have different column sizes.
      
      [djwong@us.ibm.com: Minor changelog tweaks]
      Signed-off-by: NBob Pearson <rpearson@systemfabricworks.com>
      Signed-off-by: NDarrick J. Wong <djwong@us.ibm.com>
      Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
      Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      60e58d5c
    • B
      crc32: simplify unit test code · 3863ef31
      Bob Pearson 提交于
      Replace the unit test provided in crc32.c, which doesn't have a makefile
      and doesn't compile with current headers, with a simpler self test
      routine that also gives a measure of performance and runs at module init
      time.  The self test option can be enabled through a configuration
      option CONFIG_CRC32_SELFTEST.
      
      The test stresses the pre and post loops and is thus not very realistic
      since actual uses will likely have addresses and lengths that are at
      least 4 byte aligned.  However, the main loop is long enough so that the
      performance is dominated by that loop.
      
      The expected values for crc32_le and crc32_be were generated with the
      original version of crc32.c using CRC_BITS_LE = 8 and CRC_BITS_BE = 8.
      These values were then used to check all the values of the BITS
      parameters in both the original and new versions.
      
      The performance results show some variability from run to run in spite
      of attempts to both warm the cache and reduce the amount of OS noise by
      limiting interrutps during the test.  To get comparable results and to
      analyse options wrt performance the best time reported over a small
      sample of runs has been taken.
      
      [djwong@us.ibm.com: Minor changelog tweaks]
      Signed-off-by: NBob Pearson <rpearson@systemfabricworks.com>
      Signed-off-by: NDarrick J. Wong <djwong@us.ibm.com>
      Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
      Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      3863ef31