1. 28 8月, 2008 1 次提交
    • A
      tcp: Skip empty hash buckets faster in /proc/net/tcp · 6eac5604
      Andi Kleen 提交于
      On most systems most of the TCP established/time-wait hash buckets are empty.
      When walking the hash table for /proc/net/tcp their read locks would
      always be aquired just to find out they're empty. This patch changes the code
      to check first if the buckets have any entries before taking the lock, which
      is much cheaper than taking a lock. Since the hash tables are large
      this makes a measurable difference on processing /proc/net/tcp, 
      especially on architectures with slow read_lock (e.g. PPC) 
      
      On a 2GB Core2 system time cat /proc/net/tcp > /dev/null (with a mostly
      empty hash table) goes from 0.046s to 0.005s.
      
      On systems with slower atomics (like P4 or POWER4) or larger hash tables
      (more RAM) the difference is much higher.
      
      This can be noticeable because there are some daemons around who regularly
      scan /proc/net/tcp.
      
      Original idea for this patch from Marcus Meissner, but redone by me.
      Signed-off-by: NAndi Kleen <ak@suse.de>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      6eac5604
  2. 27 8月, 2008 8 次提交
    • G
      dccp ccid-3: Replace lazy BUG_ON with condition · eff253c4
      Gerrit Renker 提交于
      The BUG_ON(w_tot == 0) only holds if there is no more than 1 loss interval in
      the loss history. If there is only a single loss interval, the calc_i_mean()
      routine need in fact not be called (RFC 3448, 6.3.1). 
      Signed-off-by: NGerrit Renker <gerrit@erg.abdn.ac.uk>
      eff253c4
    • G
      dccp: Toggle debug output without module unloading · 157439fa
      Gerrit Renker 提交于
      This sets the sysfs permissions so that root can toggle the `debug'
      parameter available for nearly every DCCP module. This is useful 
      since there are various module inter-dependencies. The debug flag
      can now be toggled at runtime using
      
        echo 1 > /sys/module/dccp/parameters/dccp_debug
        echo 1 > /sys/module/dccp_ccid2/parameters/ccid2_debug
        echo 1 > /sys/module/dccp_ccid3/parameters/ccid3_debug
        echo 1 > /sys/module/dccp_tfrc_lib/parameters/tfrc_debug
      
      The last is not very useful yet, since no code at the moment calls
      the tfrc_debug() macro.
      Signed-off-by: NGerrit Renker <gerrit@erg.abdn.ac.uk>
      157439fa
    • G
      dccp: Empty the write queue when disconnecting · b569d5a1
      Gerrit Renker 提交于
      dccp_disconnect() can be called due to several reasons:
      
       1. when the connection setup failed (inet_stream_connect());
       2. when shutting down (inet_shutdown(), inet_csk_listen_stop());
       3. when aborting the connection (dccp_close() with 0 linger time).
      
      In case (1) the write queue is empty. This patch empties the write queue,
      if in case (2) or (3) it was not yet empty.
      
      This avoids triggering the write-queue BUG_TRAP in sk_stream_kill_queues()
      later on.
      
      It also seems natural to do: when breaking an association, to delete all
      packets that were originally intended for the soon-disconnected end (compare
      with call to tcp_write_queue_purge in tcp_disconnect()).
      Signed-off-by: NGerrit Renker <gerrit@erg.abdn.ac.uk>
      b569d5a1
    • G
      dccp: Fill in the Data fields for "Option Error" Resets · 5a056417
      Gerrit Renker 提交于
      This updates the use of the `out_invalid_option' label, which produces a 
      Reset (code 5, "Option Error"), to fill in the  Data1...Data3 fields as
      specified in RFC 4340, 5.6.
      Signed-off-by: NGerrit Renker <gerrit@erg.abdn.ac.uk>
      5a056417
    • G
      dccp: Silently ignore options with nonsensical lengths · 1efa6bba
      Gerrit Renker 提交于
      This updates the option-parsing code with regard to RFC 4340, 5.8:
       "[..] options with nonsensical lengths (length byte less than two or more
        than the remaining space in the options portion of the header) MUST be
        ignored, and any option space following an option with nonsensical length
        MUST likewise be ignored."
      
      Hence in the following cases erratic options will be ignored:
       1. The type byte of a multi-byte option is the last byte of the header
          options (i.e. effective option length of 1).
       2. The value of the length byte is less than the minimum 2. This has been 
          changed from previously 3: although no multi-byte option with a length
          less than 3 yet exists (cf. table 3 in 5.8), a length of 2 is valid.
          (The switch-statement in dccp_parse has further per-option length checks.)
       3. The option length exceeds the length of the remaining option space.
      Signed-off-by: NGerrit Renker <gerrit@erg.abdn.ac.uk>
      1efa6bba
    • W
      dccp: Always generate a Reset in response to option errors · 33c44967
      Wei Yongjun 提交于
      RFC4340 states that if a packet is received with an option error (such as a
      Mandatory Option as the last byte of the option list), the endpoint should
      repond with a Reset.
      
      In the LISTEN and RESPOND states, the endpoint correctly reponds with Reset,
      while in the REQUEST/OPEN states, packets with option errors are just ignored.
      
      The packet sequence is as follows:
      
      Case 1:
      
        Endpoint A                           Endpoint B
        (CLOSED)                             (CLOSED)
      
                     <----------------       REQUEST
      
        RESPONSE     ----------------->      (*1)
        (with invalid option)
                     <----------------       RESET
                                             (with Reset Code 5, "Option Error")
      
        (*1) currently just ignored, no Reset is sent
      
      Case 2:
      
        Endpoint A                           Endpoint B
        (OPEN)                               (OPEN)
      
        DATA-ACK     ----------------->      (*2)
        (with invalid option)
                     <----------------       RESET
                                             (with Reset Code 5, "Option Error")
      
        (*2) currently just ignored, no Reset is sent
      
      This patch fixes the problem, by generating a Reset instead of silently
      ignoring option errors.
      Signed-off-by: NWei Yongjun <yjwei@cn.fujitsu.com>
      Acked-by: NArnaldo Carvalho de Melo <acme@redhat.com>
      Acked-by: NGerrit Renker <gerrit@erg.abdn.ac.uk>
      33c44967
    • J
      IPVS: Rename ip_vs_proto_ah.c to ip_vs_proto_ah_esp.c · e3c2ced8
      Julius Volz 提交于
      After integrating ESP into ip_vs_proto_ah, rename it (and the references to
      it) to ip_vs_proto_ah_esp.c and delete the old ip_vs_proto_esp.c.
      Signed-off-by: NJulius Volz <juliusv@google.com>
      Signed-off-by: NSimon Horman <horms@verge.net.au>
      e3c2ced8
    • J
      IPVS: Integrate ESP protocol into ip_vs_proto_ah.c · 409a1966
      Julius Volz 提交于
      Rename all ah_* functions to ah_esp_* (and adjust comments). Move ESP
      protocol definition into ip_vs_proto_ah.c and remove all usage of
      ip_vs_proto_esp.c.
      
      Make the compilation of ip_vs_proto_ah.c dependent on a new config
      variable, IP_VS_PROTO_AH_ESP, which is selected either by
      IP_VS_PROTO_ESP or IP_VS_PROTO_AH. Only compile the selected protocols'
      structures within this file.
      Signed-off-by: NJulius Volz <juliusv@google.com>
      Signed-off-by: NSimon Horman <horms@verge.net.au>
      409a1966
  3. 23 8月, 2008 28 次提交
  4. 21 8月, 2008 2 次提交
  5. 19 8月, 2008 1 次提交
    • D
      pkt_sched: Prevent livelock in TX queue running. · 195648bb
      David S. Miller 提交于
      If dev_deactivate() is trying to quiesce the queue, it
      is theoretically possible for another cpu to livelock
      trying to process that queue.  This happens because
      dev_deactivate() grabs the queue spinlock as it checks
      the queue state, whereas net_tx_action() does a trylock
      and reschedules the qdisc if it hits the lock.
      
      This breaks the livelock by adding a check on
      __QDISC_STATE_DEACTIVATED to net_tx_action() when
      the trylock fails.
      
      Based upon feedback from Herbert Xu and Jarek Poplawski.
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      195648bb