1. 27 10月, 2007 1 次提交
    • L
      Do linear-time/space rename logic for exact renames · 9027f53c
      Linus Torvalds 提交于
      This implements a smarter rename detector for exact renames, which
      rather than doing a pairwise comparison (time O(m*n)) will just hash the
      files into a hash-table (size O(n+m)), and only do pairwise comparisons
      to renames that have the same hash (time O(n+m) except for unrealistic
      hash collissions, which we just cull aggressively).
      
      Admittedly the exact rename case is not nearly as interesting as the
      generic case, but it's an important case none-the-less. A similar general
      approach should work for the generic case too, but even then you do need
      to handle the exact renames/copies separately (to avoid the inevitable
      added cost factor that comes from the _size_ of the file), so this is
      worth doing.
      
      In the expectation that we will indeed do the same hashing trick for the
      general rename case, this code uses a generic hash-table implementation
      that can be used for other things too.  In fact, we might be able to
      consolidate some of our existing hash tables with the new generic code
      in hash.[ch].
      Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      Signed-off-by: NJunio C Hamano <gitster@pobox.com>
      9027f53c