提交 · 243f801d1d08753cd4eff2a23e245f7575c37ad5 · 李少辉-开发者 / git

14 1月, 2007 32 次提交

Reuse the same buffer for all commits/tags in fast-import. · 243f801d

由 Shawn O. Pearce 提交于 8月 28, 2006

Since most commits and tag objects are around the same size and we
only generate one at a time we can reuse the same buffer rather than
xmalloc'ing and free'ing the buffer every time we generate a commit.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

243f801d

Recycle data buffers for tree generation in fast-import. · e2eb469d

由 Shawn O. Pearce 提交于 8月 28, 2006

We only ever generate at most two tree streams at a time. Since most
trees are around the same size we can simply recycle the buffers from
one tree generation to the next rather than constantly xmalloc'ing
and free'ing them. This should perform slightly better when handling
a large number of trees as malloc has less work to do.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

e2eb469d

Implemented tree delta compression in fast-import. · 4cabf858

由 Shawn O. Pearce 提交于 8月 28, 2006

We now store for every tree entry two modes and two sha1 values;
the base (aka "version 0") and the current/new (aka "version 1").
When we generate a tree object we also regenerate the prior version
object and use that as our base object for a delta. This strategy
saves a significant amount of memory as we can continue to use the
atom pool for file/directory names and only increases each tree
entry by an additional 24 bytes of memory.

Branches should automatically delta against their ancestor tree,
unless the ancestor tree is already at the delta chain limit.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

4cabf858

S
Converted hash memcpy/memcmp to new hashcpy/hashcmp/hashclr. · 445b8599
由 Shawn O. Pearce 提交于 8月 28, 2006
```
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>
```
445b8599
S
Don't crash fast-import if no branch log was requested. · 08d7e892
由 Shawn O. Pearce 提交于 8月 27, 2006
```
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>
```
08d7e892

Added 'reset' command to clear a branch's tree. · 5fced8dc

由 Shawn O. Pearce 提交于 8月 27, 2006

Sometimes an import frontend may need to work with a temporary branch
which will actually contain many different branches over the life
of the import.  This is especially useful when the frontend needs
to create a tag from a set of file versions which are otherwise
never a commit.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

5fced8dc

Map only part of the generated pack file at any point in time. · 53dbce78

由 Shawn O. Pearce 提交于 8月 27, 2006

When generating a very large pack file (for example close to 1 GB
in size) it may be impossible for the kernel to find a contiguous
free range within a 32 bit address space for the mapping to be
located at.  This is especially problematic on large imports where
there is a lot of malloc activity occuring within the same process
and the malloc'd regions may straddle the previously mapped regions,
thereby creating large holes in the address space.

So instead we map only 128 MB of the pack at any given time.
This will likely increase the number of times the file gets mapped
(with additional system time required to update the page tables
more frequently) but will allow the program to handle packs up to
4 GB in size.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

53dbce78

S
Fixed compile error in fast-import. · 35ef237c
由 Shawn O. Pearce 提交于 8月 26, 2006
```
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>
```
35ef237c

Fixed GPF in fast-import caused by unterminated linked list. · 2eb26d84

由 Shawn O. Pearce 提交于 8月 26, 2006

fast-import was encounting a GPF when it ran out of free tree_entry
objects but didn't know this was the cause because the last
tree_entry wasn't terminated with a NULL pointer. The missing NULL
pointer occurred when we allocated additional entries via xmalloc
but didn't set the last tree_entry's "next" pointer to NULL.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

2eb26d84

Added --branch-log to option to fast-import. · 264244a0

由 Shawn O. Pearce 提交于 8月 25, 2006

This option can be used to have a record of every commit, the mark
(if supplied) and branch name of the commit recorded into a log file
when the commit is generated.  This log can be useful to verify the
results of an import as the commits can be compared to some source
repository matching commits through the mark value.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

264244a0

Added option to export the marks table when fast-import terminates. · a6a1a831

由 Shawn O. Pearce 提交于 8月 25, 2006

The marks table can be used by the frontend to load any commit after
the import and compare it to whatever data the frontend knows about
that commit. If the mark idnums can be easily correlated to some
reference source then its relatively trivial to compare the GIT
tree to the reference to verify the accuracy of the import.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

a6a1a831

S
Account for tree entry memory costs in fast-import. · 8435a9cb
由 Shawn O. Pearce 提交于 8月 25, 2006
```
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>
```
8435a9cb

Moved from command to after data to help cvs2svn. · 02f3389d

由 Shawn O. Pearce 提交于 8月 24, 2006

cvs2svn has three phases: begin_commit, middle_commit, end_commit.
The ancester is computed in the middle_commit phase. So its easier
to generate a stream if the from command appears after the commit
message itself but before the file change commands.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

02f3389d

Remove branch creation command from fast-import. · 00e2b884

由 Shawn O. Pearce 提交于 8月 24, 2006

Jon Smirl was finding it difficult to alter cvs2svn to generate
branch commands prior to the first commit of the same branch.
This change moves the 'from' command to be an optional parameter of
the 'commit' command, thereby allowing a new branch to be defined
at the moment it gets used to create the first commit on that branch.

This change makes it impossible to create a branch with no commits
on it as at least one commit is needed to register the branch.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

00e2b884

Round out memory pool allocations in fast-import to pointer sizes. · 8d8928b0

由 Shawn O. Pearce 提交于 8月 24, 2006

Some architectures (e.g. SPARC) would require that we access pointers
only on pointer-sized alignments.  So ensure the pool allocator
rounds out non-pointer sized allocations to the next pointer so we
don't generate bad memory addresses.  This could have occurred if
we had previously allocated an atom whose string was not a whole
multiple of the pointer size, for example.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

8d8928b0

Implemented tree reloading in fast-import. · 41e5257f

由 Shawn O. Pearce 提交于 8月 24, 2006

Tree reloading allows fast-import to swap out the least-recently used
branch by simply deallocating the data structures from memory that
were associated with that branch. Later if the branch becomes active
again it can lazily recreate those structures on demand by reloading
the necessary trees from the pack file it originally wrote them to.

The reloading process is implemented by mmap'ing the pack into
memory and using a much tighter variant of the pack reading code
contained in sha1_file.c. This was a blatent copy from sha1_file.c
but the unpacking functions were significantly simplified and are
actually now in a form that should make it easier to map only the
necessary regions of a pack rather than the entire file.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

41e5257f

Implemented 'tag' command in fast-import. · 72303d44

由 Shawn O. Pearce 提交于 8月 24, 2006

Tags received from the frontend are generated in memory in a simple
linked list in the order that the tag commands were sent by the
frontend. If multiple different tag objects for the same tag name
get generated the last one sent by the frontend will be the one
that gets written out at termination. Multiple tag objects for
the same name will cause all older tags of the same name to be lost.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

72303d44

Added branch load counter to fast-import. · d6c7eb2c

由 Shawn O. Pearce 提交于 8月 23, 2006

If the branch load count exceeds the number of branches created then
the frontend is causing fast-import to page branches into and out of
memory due to the way its ordering its commits. Performance can
likely be increased if the frontend were to alter its commit
sequence such that it stays on one branch before switching to another
branch, then never returns to the prior branch.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

d6c7eb2c

Added mark store/find to fast-import. · d8397168

由 Shawn O. Pearce 提交于 8月 23, 2006

Marks are now saved when the mark directive gets used by the frontend
and may be used in place of a SHA1 expression to locate a previous
SHA1 which fast-import may have generated.  This is particularly
useful with commits where the frontend does not (easily) have the
ability to compute the SHA1 for an arbitrary commit but needs it
to generate a branch or tag from that commit.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

d8397168

Converted fast-import to accept standard command line parameters. · d5c57b28

由 Shawn O. Pearce 提交于 8月 23, 2006

The following command line options are now accepted before the
pack name:

  --objects=n           # replaces the object count after the pack name
  --depth=n             # delta chain depth to use (default is 10)
  --active-branches=n   # maximum number of branches to keep in memory
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

d5c57b28

Fixed segfault in fast-import after growing a tree. · afde8dd9

由 Shawn O. Pearce 提交于 8月 23, 2006

Growing a tree caused all subtrees to be deallocated and put back
into the free list yet those subtree's contents were still actively
in use.  Consequently they were doled out again and got stomped
on elsewhere.  Releasing a tree is now performed in two parts,
either releasing only the content array or releasing the content
array and recursively releasing the subtree(s).
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

afde8dd9

Allow symlink blobs in trees during fast-import. · ace4a9d1

由 Shawn O. Pearce 提交于 8月 21, 2006

If a frontend is smart enough to import a symlink then we should
let them do so.  We'll assume that they were smart enough to first
generate a blob to hold the link target, as that's how symlinks
get represented in GIT.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

ace4a9d1

S
Changed fast-import's pack header creation to use pack.h · c90be46a
由 Shawn O. Pearce 提交于 8月 16, 2006
```
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>
```
c90be46a

Converted fast-import to a text based protocol. · c44cdc7e

由 Shawn O. Pearce 提交于 8月 14, 2006

Frontend clients can now send a text stream to fast-import rather
than a binary stream.  This should facilitate developing frontend
software as the data stream is easier to view, manipulate and debug
my hand and Mark-I eyeball.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

c44cdc7e

Implement blob ID validation in fast-import. · 7111feed

由 Shawn O. Pearce 提交于 8月 14, 2006

When accepting revision SHA1 IDs from the frontend verify the SHA1
actually refers to a blob and is known to exist. Its an error
to use a SHA1 in a tree if the blob doesn't exist as this would
cause git-fsck-objects to report a missing blob should the pack get
closed without the blob being appended into it or a subsequent pack.
So right now we'll just ask that the frontend "pre-declare" any
blobs it wants to use in a tree before it can use them.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

7111feed

Added tree and commit writing to fast-import. · 463acbe1

由 Shawn O. Pearce 提交于 8月 14, 2006

The tree of the current commit can be altered by file_change commands
before the commit gets written to the pack.  The file changes are
rather primitive as they simply allow removal of a tree entry or
setting/adding a tree entry.

Currently trees and commits aren't being deltafied when written to
the pack and branch reloading from the current pack doesn't work,
so at most 5 branches can be worked with at any one time.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

463acbe1

Implemented branch handling and basic tree support in fast-import. · 6bb5b329

由 Shawn O. Pearce 提交于 8月 08, 2006

This provides the basic data structures needed to store trees in
memory while we are processing them for a branch.  What we are
attempting to do is track one complete tree for each branch that
the frontend has registered with us through the 'newb' (new_branch)
command.  When the frontend edits that tree through 'updf' or 'delf'
commands we'll mark the affected tree(s) as being dirty and recompute
their objects during 'comt' (commit).

Currently the protocol is decidedly _not_ user friendly.  I crashed
fast-import by giving it bad input data from Perl.  I may try to
improve upon it, or at least upon its error handling.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

6bb5b329

Added basic command handler to fast-import. · 6143f064

由 Shawn O. Pearce 提交于 8月 08, 2006

Moved the new_blob logic off into a new subroutine and
invoked it when getting the 'blob' command.

Added statistics dump to STDERR when the program terminates listing
what it did at a high level.  This is somewhat interesting.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

6143f064

Refactored fast-import's internals for future additions. · ac47a738

由 Shawn O. Pearce 提交于 8月 08, 2006

Too many globals variables were being used not not enough
code was resuable to process trees and commits so this is
a simple refactoring of the existing blob processing code
to get into a state that will be easier to handle trees
and commits in.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

ac47a738

Cleaned up memory allocation for object_entry structs. · 27d6d290

由 Shawn O. Pearce 提交于 8月 08, 2006

Although its easy to ask the user to tell us how many objects they
will need, its probably better to dynamically grow the object table
in large units.  But if the user can give us a hint as to roughly
how many objects then we can still use it during startup.

Also stopped printing the SHA1 strings to stdout as no user is
currently making use of that facility.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>

27d6d290

S
Added automatic index generation to fast-import. · 8bcce301
由 Shawn O. Pearce 提交于 8月 06, 2006
```
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>
```
8bcce301
S
Created fast-import, a tool to quickly generating a pack from blobs. · db5e523f
由 Shawn O. Pearce 提交于 8月 05, 2006
```
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>
```
db5e523f

24 8月, 2006 2 次提交

J
Convert memset(hash,0,20) to hashclr(hash). · a8e0d16d
由 Junio C Hamano 提交于 8月 23, 2006
```
In the same spirit as hashcmp() and hashcpy().
Signed-off-by: NJunio C Hamano <junkio@cox.net>
```
a8e0d16d

Convert memcpy(a,b,20) to hashcpy(a,b). · e702496e

由 Shawn Pearce 提交于 8月 23, 2006

This abstracts away the size of the hash values when copying them
from memory location to memory location, much as the introduction
of hashcmp abstracted away hash value comparsion.

A few call sites were using char* rather than unsigned char* so
I added the cast rather than open hashcpy to be void*.  This is a
reasonable tradeoff as most call sites already use unsigned char*
and the existing hashcmp is also declared to be unsigned char*.

[jc: Splitted the patch to "master" part, to be followed by a
 patch for merge-recursive.c which is not in "master" yet.

 Fixed the cast in the latter hunk to combine-diff.c which was
 wrong in the original.

 Also converted ones left-over in combine-diff.c, diff-lib.c and
 upload-pack.c ]
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

e702496e

23 8月, 2006 2 次提交

Fix a comparison bug in diff-delta.c · b05faa2d

由 Pierre Habouzit 提交于 8月 23, 2006

(1 << i) < hspace is compared in the `int` space rather that in the
unsigned one.  the result will be wrong if hspace is between 0x40000000
and 0x80000000.
Signed-off-by: NPierre Habouzit <madcoder@debian.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

b05faa2d

git-send-email: Don't set author_not_sender from Cc: lines · 68d42c41

由 Haavard Skinnemoen 提交于 8月 23, 2006

When an mbox-style patch contains a Cc: line in the header,
git-send-email will check the address against the sender specified
on the command line. If they don't match, sender_not_author will
be set to the address obtained from the Cc line.

When this happens, git-send-email inserts a From: line at the
beginning of the message body with the address obtained from the
Cc line in the header, and the sender might be accused of forging
patch authors.

This patch fixes this by only updating sender_not_author when
processing From: lines, not when processing Cc: lines.
Signed-off-by: NHaavard Skinnemoen <hskinnemoen@atmel.com>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

68d42c41

22 8月, 2006 4 次提交

Remove unnecessary forward declaration of unpack_entry. · 44c10841

由 Shawn Pearce 提交于 8月 21, 2006

This declaration probably used to be necessary but the code has
been refactored since to use unpack_entry_gently instead.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

44c10841

Verify we know how to read a pack before trying to using it. · da756011

由 Shawn Pearce 提交于 8月 21, 2006

If the pack format were to ever change or be extended in the future
there is no assurance that just because the pack file lives in
objects/pack and doesn't end in .idx that we can read and decompress
its contents properly.

If we encounter what we think is a pack file and it isn't or we don't
recognize its version then die and suggest to the user that they
upgrade to a newer version of GIT which can handle that pack file.
Signed-off-by: NShawn O. Pearce <spearce@spearce.org>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

da756011

Add write_or_die(), a helper function · 7230e6d0

由 Rene Scharfe 提交于 8月 21, 2006

The little helper write_or_die() won't come back with bad news about
full disks or broken pipes.  It either succeeds or terminates the
program, making additional error handling unnecessary.

This patch adds the new function and uses it to replace two similar
ones (the one in tar-tree originally has been copied from cat-file
btw.).  I chose to add the fd parameter which both lacked to make
write_or_die() just as flexible as write() and thus suitable for
lib-ification.

There is a regression: error messages emitted by this function don't
show the program name, while the replaced two functions did.  That's
acceptable, I think; a lot of other functions do the same.
Signed-off-by: NRene Scharfe <rene.scharfe@lsrfire.ath.cx>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

7230e6d0

Axe the last ent · 3f0073a2

由 Rene Scharfe 提交于 8月 21, 2006

In the name of Standardization, this cleanses the last usage string of
mystical creatures.  But they still dwell deep within the source and in
some debug messages, it is said.
Signed-off-by: NRene Scharfe <rene.scharfe@lsrfire.ath.cx>
Signed-off-by: NJunio C Hamano <junkio@cox.net>

3f0073a2

李少辉-开发者 / git 与 Fork 源项目一致

李少辉-开发者 / git
与 Fork 源项目一致