提交 · a4e9d71edb5f87b4dae57b8a474124451e397e6c · 李少辉-开发者 / git

24 3月, 2007 3 次提交

Fix path-limited "rev-list --bisect" termination condition. · a4e9d71e

由 Junio C Hamano 提交于 3月 23, 2007

In a path-limited bisection, when the $bad commit is not
changing the limited path, and the number of suspects is 1, the
code miscounted and returned $bad from find_bisection(), which
is not marked with TREECHANGE.  This is of course filtered by
the output routine, resulting in an empty output, in turn
causing git-bisect driver to say "$bad was both good and bad".

Illustration.  Suppose you have these four commits, and only C
changes path P.  You know D is bad and A is good.

	A---B---C*--D

git-bisect driver runs this to find a bisection point:

	$ git rev-list --bisect A..D -- P

which calls find_bisection() with B, C and D.  The set of
commits that is given to this function is the same set of
commits as rev-list without --bisect option and pathspec
returns.  Among them, only C is marked with TREECHANGE.  Let's
call the set of commits given to find_bisection() that are
marked with TREECHANGE (or all of them if no path limiter is in
effect) "the bisect set".  In the above example, the size of the
bisect set is 1 (contains only "C").

For each commit in its input, find_bisection() computes the
number of commits it can reach in the bisect set.  For a commit
in the bisect set, this number includes itself, so the number is
1 or more.  This number is called "depth", and computed by
count_distance() function.

When you have a bisect set of N commits, and a commit has depth
D, how good is your bisection if you returned that commit?  How
good this bisection is can be measured by how many commits are
effectively tested "together" by testing one commit.

Currently you have (N-1) untested commits (the tip of the bisect
set, although it is included in the bisect set, is already known
to be bad).  If the commit with depth D turns out to be bad,
then your next bisect set will have D commits and you will have
(D-1) untested commits left, which means you tested (N-1)-(D-1)
= (N-D) commits with this bisection.  If it turns out to be good, then
your next bisect set will have (N-D) commits, and you will have
(N-D-1) untested commits left, which means you tested
(N-1)-(N-D-1) = D commits with this bisection.

Therefore, the goodness of this bisection is is min(N-D, D), and
find_bisection() function tries to find a commit that maximizes
this, by initializing "closest" variable to 0 and whenever a
commit with the goodness that is larger than the current
"closest" is found, that commit and its goodness are remembered
by updating "closest" variable.  The "the commit with the best
goodness so far" is kept in "best" variable, and is initialized
to a commit that happens to be at the beginning of the list of
commits given to this function (which may or may not be in the
bisect set when path-limit is in use).

However, when N is 1, then the sole tree-changing commit has
depth of 1, and min(N-D, D) evaluates to 0.  This is not larger
than the initial value of "closest", and the "so far the best
one" commit is never replaced in the loop.

When path-limit is not in use, this is not a problem, as any
commit in the input set is tree-changing.  But when path-limit
is in use, and when the starting "bad" commit does not change
the specified path, it is not correct to return it.
Signed-off-by: NJunio C Hamano <junkio@cox.net>

a4e9d71e

git-bisect.sh: properly dq $GIT_DIR · 12cb8137

由 Junio C Hamano 提交于 3月 23, 2007

Otherwise you would be in trouble if your GIT_DIR has IFS letters in it.
Signed-off-by: NJunio C Hamano <junkio@cox.net>

12cb8137

git-bisect: typofix · 673e5838

由 Junio C Hamano 提交于 3月 23, 2007

The branch you are on while bisecting is always "bisect", and
checking for "refs/heads/bisect*" is wrong.  Only check if it is
exactly "refs/heads/bisect".
Signed-off-by: NJunio C Hamano <junkio@cox.net>

673e5838

23 3月, 2007 5 次提交

checkout: report where the new HEAD is upon detaching HEAD · abba9dbb

由 Junio C Hamano 提交于 3月 23, 2007

After "git reset" moves the HEAD around, it reports which commit
you are on, which gives the user a warm fuzzy feeling of
assurance.  Give the same assurance from git-checkout when
moving the detached HEAD around.
Signed-off-by: NJunio C Hamano <junkio@cox.net>

abba9dbb

Bisect: implement "git bisect run <cmd>..." to automatically bisect. · a17c4101

由 Christian Couder 提交于 3月 23, 2007

This idea was suggested by Bill Lear
(Message-ID: <17920.38942.364466.642979@lisa.zopyra.com>)
and I think it is a very good one.

This patch adds a new test file for "git bisect run", but there
is currently only one basic test.
Signed-off-by: NChristian Couder <chriscool@tuxfamily.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

a17c4101

Bisect: convert revs given to good and bad to commits · cc65343a

由 Uwe Kleine-König 提交于 3月 22, 2007

Without this the rev could be (e.g.) a tag and then the condition to end the
bisect might fail and you have to check the already known to be bad revision
once more.
Signed-off-by: NUwe Kleine-König <ukleinek@informatik.uni-freiburg.de>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

cc65343a

t4118: be nice to non-GNU sed · 3007a780

由 Johannes Schindelin 提交于 3月 22, 2007

Elias Pipping:
> I'm on a mac, hence /usr/bin/sed is not gnu sed, which makes
> t4118 fail.
Signed-off-by: NJohannes Schindelin <Johannes.Schindelin@gmx.de>
Ack'd-by: Elias Pipping <pipping@macports.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

3007a780

git-apply: Do not free the wrong buffer when we convert the data for writeout · cc96fd09

由 Junio C Hamano 提交于 3月 22, 2007

When we write out the result of patch application, we sometimes
need to munge the data (e.g. under core.autocrlf).  After doing
so, what we should free is the temporary buffer that holds the
converted data returned from convert_to_working_tree(), not the
original one.

This patch also moves the call to open() up in the function, as
the caller expects us to fail cheaply if leading directories
need to be created (and then the caller creates them and calls
us again).  For that calling pattern, attempting conversion
before opening the file adds unnecessary overhead.
Acked-by: NLinus Torvalds <torvalds@linux-foundation.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

cc96fd09

22 3月, 2007 13 次提交

J
Merge git://git2.kernel.org/pub/scm/gitk/gitk · 00cec846
由 Junio C Hamano 提交于 3月 22, 2007
```
* git://git2.kernel.org/pub/scm/gitk/gitk:
  [PATCH] prefer "git COMMAND" over "git-COMMAND" in gitk
```
00cec846

Merge branch 'maint' · aa576e6b

由 Junio C Hamano 提交于 3月 22, 2007

* maint:
  Documentation/pack-format.txt: Clear up description of types.
  fix typo in git-am manpage

aa576e6b

P
Documentation/pack-format.txt: Clear up description of types. · 979ea585
由 Peter Eriksen 提交于 3月 21, 2007
```
Signed-off-by: NPeter Eriksen <s022018@student.dtu.dk>
Signed-off-by: NJunio C Hamano <junkio@cox.net>
```
979ea585

update HEAD reflog when branch pointed to by HEAD is directly modified · 605fac8b

由 Nicolas Pitre 提交于 3月 21, 2007

The HEAD reflog is updated as well as the reflog for the branch pointed
to by HEAD whenever it is referenced with "HEAD".

There are some cases where a specific branch may be modified directly.
In those cases, the HEAD reflog should be updated as well if it is a
symref to that branch in order to be consistent.
Signed-off-by: NNicolas Pitre <nico@cam.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

605fac8b

update-hook: abort early if the project description is unset · 0a0d080b

由 Andy Parkins 提交于 3月 20, 2007

It was annoying to always have the first email from a project be from
the "Unnamed repository; edit this file to name it for gitweb project";
just because it's so easy to forget to set it.

This patch checks to see if the description file is still default (or
empty) and aborts if so - allowing you to fix the problem before sending
out silly looking emails to every developer.
Signed-off-by: NAndy Parkins <andyparkins@gmail.com>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

0a0d080b

git-merge: Put FETCH_HEAD data in merge commit message · 85295a52

由 Michael S. Tsirkin 提交于 3月 22, 2007

This makes git-fetch <URL> && git-merge FETCH_HEAD produce the
same merge message as git-pull <URL>.
Signed-off-by: NMichael S. Tsirkin <mst@dev.mellanox.co.il>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

85295a52

git-rebase: make 'rebase HEAD branch' work as expected. · a1bf91e0

由 Junio C Hamano 提交于 3月 22, 2007

When you want to amend the commit message of 3 commits before
the tip of the current branch, say 'master',

	A--B--C--D--E(master)

it is sometimes handy to make your head detached at that commit
with:

	$ git checkout HEAD~3 ;# check out B
	$ git commit --amend ;# without modifying contents...

to create:

          .B'(HEAD)
         /
	A--B--C--D--E(master)

and then rebase 'master' branch onto HEAD with this:

	$ git rebase HEAD master

to result in:

          .B'-C'-D'-E(master=HEAD)
         /
	A--B--C--D--E

However, the current code interprets HEAD after it switches to
the branch 'master', which means the rebase will not do
anything.  You have to say something unwieldly like this
instead:

	$ git rebase $(git rev-parse HEAD) master

This fixes it by expanding the $onto commit name before
switching to the target branch.
Signed-off-by: NJunio C Hamano <junkio@cox.net>

a1bf91e0

tree_entry_interesting(): allow it to say "everything is interesting" · 1d848f64

由 Junio C Hamano 提交于 3月 21, 2007

In addition to optimizing pathspecs that would never match,
which was done earlier, this optimizes pathspecs that would
always match (e.g. "arch/" while the traversal is already in
"arch/i386/" hierarchy).

This patch makes the worst case slightly more palatable, while
improving average case.
Signed-off-by: NJunio C Hamano <junkio@cox.net>

1d848f64

tree-diff: avoid strncmp() · ccc744ab

由 Junio C Hamano 提交于 3月 21, 2007

If we already know that some of the pathspecs can match later
entries in the tree we are looking at, we do not have to do more
expensive strncmp() upfront before comparing the length of the
match pattern and the path, as a path longer than the match
pattern will not match it, and a path shorter than the match
pattern will match only if the path is a directory-component
wise prefix of the match pattern.
Signed-off-by: NJunio C Hamano <junkio@cox.net>

ccc744ab

Teach tree_entry_interesting() that the tree entries are sorted. · 7d2f667b

由 Junio C Hamano 提交于 3月 21, 2007

When we are looking at a tree entry with pathspecs, if all the
pathspecs sort strictly earlier than the entry we are currently
looking at, there is no way later entries in the same tree would
match our pathspecs, because the entries are sorted.
Signed-off-by: NJunio C Hamano <junkio@cox.net>

7d2f667b

Switch over tree descriptors to contain a pre-parsed entry · 4651ece8

由 Linus Torvalds 提交于 3月 21, 2007

This makes the tree descriptor contain a "struct name_entry" as part of
it, and it gets filled in so that it always contains a valid entry. On
some benchmarks, it improves performance by up to 15%.

That makes tree entry "extract" trivial, and means that we only actually
need to decode each tree entry just once: we decode the first one when
we initialize the tree descriptor, and each subsequent one when doing
"update_tree_entry()". In particular, this means that we don't need to
do strlen() both at extract time _and_ at update time.

Finally, it also allows more sharing of code (entry_extract(), that
wanted a "struct name_entry", just got totally trivial, along with the
"tree_entry()" function).
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

4651ece8

Initialize tree descriptors with a helper function rather than by hand. · 6fda5e51

由 Linus Torvalds 提交于 3月 21, 2007

This removes slightly more lines than it adds, but the real reason for
doing this is that future optimizations will require more setup of the
tree descriptor, and so we want to do it in one place.

Also renamed the "desc.buf" field to "desc.buffer" just to trigger
compiler errors for old-style manual initializations, making sure I
didn't miss anything.
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

6fda5e51

Remove "pathlen" from "struct name_entry" · a8c40471

由 Linus Torvalds 提交于 3月 21, 2007

Since we have the "tree_entry_len()" helper function these days, and
don't need to do a full strlen(), there's no point in saving the path
length - it's just redundant information.
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

a8c40471

21 3月, 2007 10 次提交

[PATCH] prefer "git COMMAND" over "git-COMMAND" in gitk · 1ce09dd6

由 Brandon Casey 提交于 3月 19, 2007

Preferring git _space_ COMMAND over git _dash_ COMMAND allows the
user to have only git and gitk in their path. e.g. when git and gitk
are symbolic links in a personal bin directory to the real git and gitk.
Signed-off-by: NPaul Mackerras <paulus@samba.org>

1ce09dd6

fix typo in git-am manpage · a947ab79

由 Michael S. Tsirkin 提交于 3月 21, 2007

Fix typo in git-am manpage
Signed-off-by: NMichael S. Tsirkin <mst@dev.mellanox.co.il>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

a947ab79

blame: cmp_suspect is not "cmp" anymore. · 171dccd5

由 Junio C Hamano 提交于 3月 20, 2007

The earlier round makes the function return "is it different"
and it does not return a value suitable for sorting anymore.  Reverse
the logic to return "are they the same suspect" instead, and rename
it to "same_suspect()".
Signed-off-by: NJunio C Hamano <junkio@cox.net>

171dccd5

minor git-prune optimization · 3254d218

由 Nicolas Pitre 提交于 3月 20, 2007

Don't try to remove the containing directory for every pruned object but
try only once after the directory has been scanned instead.
Signed-off-by: NNicolas Pitre <nico@cam.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

3254d218

improve checkout message when asking for same branch · 57216856

由 Nicolas Pitre 提交于 3月 20, 2007

Change the feedback message if doing 'git checkout foo' when already on
branch "foo".
Signed-off-by: NNicolas Pitre <nico@cam.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

57216856

Be more careful about zlib return values · ac54c277

由 Linus Torvalds 提交于 3月 20, 2007

When creating a new object, we use "deflate(stream, Z_FINISH)" in a loop
until it no longer returns Z_OK, and then we do "deflateEnd()" to finish
up business.

That should all work, but the fact is, it's not how you're _supposed_ to
use the zlib return values properly:

 - deflate() should never return Z_OK in the first place, except if we
   need to increase the output buffer size (which we're not doing, and
   should never need to do, since we pre-allocated a buffer that is
   supposed to be able to hold the output in full). So the "while()" loop
   was incorrect: Z_OK doesn't actually mean "ok, continue", it means "ok,
   allocate more memory for me and continue"!

 - if we got an error return, we would consider it to be end-of-stream,
   but it could be some internal zlib error.  In short, we should check
   for Z_STREAM_END explicitly, since that's the only valid return value
   anyway for the Z_FINISH case.

 - we never checked deflateEnd() return codes at all.

Now, admittedly, none of these issues should ever happen, unless there is
some internal bug in zlib. So this patch should make zero difference, but
it seems to be the right thing to do.

We should probablybe anal and check the return value of "deflateInit()"
too!
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

ac54c277

Don't ever return corrupt objects from "parse_object()" · acdeec62

由 Linus Torvalds 提交于 3月 20, 2007

Looking at the SHA1 validation code due to the corruption that Alexander
Litvinov is seeing under Cygwin, I notice that one of the most central
places where we read objects, we actually do end up verifying the SHA1 of
the result, but then we happily parse it anyway.

And using "printf" to write the error message means that it not only can
get lost, but will actually mess up stdout, and cause other strange and
hard-to-debug failures downstream.
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

acdeec62

index-pack: more validation checks and cleanups · 9096c660

由 Nicolas Pitre 提交于 3月 20, 2007

When appending objects to a pack, make sure the appended data is really
what we expect instead of simply loading potentially corrupted objects
and legitimating them by computing a SHA1 of that corrupt data.

With this the sha1_object() can lose its test_for_collision parameter
which is now redundent.
Signed-off-by: NNicolas Pitre <nico@cam.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

9096c660

index-pack: use hash_sha1_file() · ce9fbf16

由 Nicolas Pitre 提交于 3月 20, 2007

Use hash_sha1_file() instead of duplicating code to compute object SHA1.
While at it make it accept a const pointer.
Signed-off-by: NNicolas Pitre <nico@cam.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

ce9fbf16

don't ever allow SHA1 collisions to exist by fetching a pack · 8685da42

由 Nicolas Pitre 提交于 3月 20, 2007

Waaaaaaay back Git was considered to be secure as it never overwrote
an object it already had.  This was ensured by always unpacking the
packfile received over the network (both in fetch and receive-pack)
and our already existing logic to not create a loose object for an
object we already have.

Lately however we keep "large-ish" packfiles on both fetch and push
by running them through index-pack instead of unpack-objects.  This
would let an attacker perform a birthday attack.

How?  Assume the attacker knows a SHA-1 that has two different
data streams.  He knows the client is likely to have the "good"
one.  So he sends the "evil" variant to the other end as part of
a "large-ish" packfile.  The recipient keeps that packfile, and
indexes it.  Now since this is a birthday attack there is a SHA-1
collision; two objects exist in the repository with the same SHA-1.
They have *very* different data streams.  One of them is "evil".

Currently the poor recipient cannot tell the two objects apart,
short of by examining the timestamp of the packfiles.  But lets
say the recipient repacks before he realizes he's been attacked.
We may wind up packing the "evil" version of the object, and deleting
the "good" one.  This is made *even more likely* by Junio's recent
rearrange_packed_git patch (b867092f).

It is extremely unlikely for a SHA1 collisions to occur, but if it
ever happens with a remote (hence untrusted) object we simply must
not let the fetch succeed.

Normally received packs should not contain objects we already have.
But when they do we must ensure duplicated objects with the same SHA1
actually contain the same data.
Signed-off-by: NNicolas Pitre <nico@cam.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

8685da42

20 3月, 2007 9 次提交

git-fetch: Fix single_force in append_fetch_head · 08727ea8

由 Santi Béjar 提交于 3月 19, 2007

This fixes the single force (+) when fetched with fetch_per_ref.

Also use $LF as separator because IFS is $LF.
Signed-off-by: NSanti Béjar <sbejar@gmail.com>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

08727ea8

J
Merge git://git2.kernel.org/pub/scm/gitk/gitk · bb95e19c
由 Junio C Hamano 提交于 3月 19, 2007
```
* git://git2.kernel.org/pub/scm/gitk/gitk:
  [PATCH] gitk: bind <F5> key to Update (reread commits)
```
bb95e19c

make git clone -q suppress the noise with http fetch · 7e8c8255

由 Chris Wright 提交于 3月 19, 2007

We already have -q in git clone.  So for those who care to suppress
the noise during an http based clone, make -q actually do a quiet
http fetch.
Signed-off-by: NChris Wright <chrisw@sous-sol.org>
Cc: Fernando Herrera <fherrera@onirica.com>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

7e8c8255

Fix loose object uncompression check. · 456cdf6e

由 Linus Torvalds 提交于 3月 19, 2007

The thing is, if the output buffer is empty, we should *still* actually
use the zlib routines to *unpack* that empty output buffer.

But we had a test that said "only unpack if we still expect more output".

So we wouldn't use up all the zlib stream, because we felt that we didn't
need it, because we already had all the bytes we wanted. And it was
"true": we did have all the output data. We just needed to also eat all
the input data!

We've had this bug before - thinking that we don't need to inflate()
anything because we already had it all..
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

456cdf6e

contrib/continuous: a continuous integration build manager · 3e993bb6

由 Shawn O. Pearce 提交于 3月 20, 2007

This is a simple but powerful continuous integration build system
for Git.  It works by receiving push events from repositories
through the post-receive hook, aggregates them on a per-branch
basis into a first-come-first-serve build queue, and lets a
background build daemon perform builds one at a time.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

3e993bb6

Provide some technical documentation for shallow clones · 1b89ef17

由 Johannes Schindelin 提交于 3月 20, 2007

There has not been any work on the shallow stuff lately, so it is hard
to find out what it does, and how. This document describes the ideas
as well as the current problems, and can serve as a starting point for
shallow people.
Signed-off-by: NJohannes Schindelin <Johannes.Schindelin@gmx.de>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

1b89ef17

Add a HOWTO for setting up a standalone git daemon · e29b96d5

由 Johannes Schindelin 提交于 3月 20, 2007

Setting up a git-daemon came up the other day on IRC, and it is slightly
non trivial for the uninitiated.
Signed-off-by: NJohannes Schindelin <johannes.schindelin@gmx.de>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

e29b96d5

xdiff/xutils.c(xdl_hash_record): factor out whitespace handling · 824f782c

由 Johannes Schindelin 提交于 3月 20, 2007

Since in at least one use case, xdl_hash_record() takes over 15% of the
CPU time, it makes sense to even micro-optimize it. For many cases, no
whitespace special handling is needed, and in these cases we should not
even bother to check for whitespace in _every_ iteration of the loop.
Signed-off-by: NJohannes Schindelin <Johannes.Schindelin@gmx.de>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

824f782c

blame: micro-optimize cmp_suspect() · 57584d9e

由 Junio C Hamano 提交于 3月 19, 2007

The commit structures are guaranteed their uniqueness by the object
layer, so we can check their address and see if they are the same
without going down to the object sha1 level.
Signed-off-by: NJunio C Hamano <junkio@cox.net>

57584d9e

李少辉-开发者 / git 与 Fork 源项目一致

李少辉-开发者 / git
与 Fork 源项目一致