Go to file
Derrick Stolee 30696be71f p5313: add size comparison test
As custom options are added to 'git pack-objects' and 'git repack' to
adjust how compression is done, use this new performance test script to
demonstrate their effectiveness in performance and size.

The recently-added --name-hash-version option allows for testing
different name hash functions. Version 2 intends to preserve some of the
locality of version 1 while more often breaking collisions due to long
filenames.

Distinguishing objects by more of the path is critical when there are
many name hash collisions and several versions of the same path in the
full history, giving a significant boost to the full repack case. The
locality of the hash function is critical to compressing something like
a shallow clone or a thin pack representing a push of a single commit.

This can be seen by running pt5313 on the open source fluentui
repository [1]. Most commits will have this kind of output for the thin
and big pack cases, though certain commits (such as [2]) will have
problematic thin pack size for other reasons.

[1] https://github.com/microsoft/fluentui
[2] a637a06df05360ce5ff21420803f64608226a875

Checked out at the parent of [2], I see the following statistics:

Test                                         HEAD
---------------------------------------------------------------
5313.2: thin pack with version 1             0.37(0.44+0.02)
5313.3: thin pack size with version 1                   1.2M
5313.4: big pack with version 1              2.04(7.77+0.23)
5313.5: big pack size with version 1                   20.4M
5313.6: shallow fetch pack with version 1    1.41(2.94+0.11)
5313.7: shallow pack size with version 1               34.4M
5313.8: repack with version 1                95.70(676.41+2.87)
5313.9: repack size with version 1                    439.3M
5313.10: thin pack with version 2            0.12(0.12+0.06)
5313.11: thin pack size with version 2                 22.0K
5313.12: big pack with version 2             2.80(5.43+0.34)
5313.13: big pack size with version 2                  25.9M
5313.14: shallow fetch pack with version 2   1.77(2.80+0.19)
5313.15: shallow pack size with version 2              33.7M
5313.16: repack with version 2               33.68(139.52+2.58)
5313.17: repack size with version 2                   160.5M

To make comparisons easier, I will reformat this output into a different
table style:

| Test         | V1 Time | V2 Time | V1 Size | V2 Size |
|--------------|---------|---------|---------|---------|
| Thin Pack    |  0.37 s |  0.12 s |   1.2 M |  22.0 K |
| Big Pack     |  2.04 s |  2.80 s |  20.4 M |  25.9 M |
| Shallow Pack |  1.41 s |  1.77 s |  34.4 M |  33.7 M |
| Repack       | 95.70 s | 33.68 s | 439.3 M | 160.5 M |

The v2 hash function successfully differentiates the CHANGELOG.md files
from each other, which leads to significant improvements in the thin
pack (simulating a push of this commit) and the full repack. There is
some bloat in the "big pack" scenario and essentially the same results
for the shallow pack.

In the case of the Git repository, these numbers show some of the issues
with this approach:

| Test         | V1 Time | V2 Time | V1 Size | V2 Size |
|--------------|---------|---------|---------|---------|
| Thin Pack    |  0.02 s |  0.02 s |   1.1 K |   1.1 K |
| Big Pack     |  1.69 s |  1.95 s |  13.5 M |  14.5 M |
| Shallow Pack |  1.26 s |  1.29 s |  12.0 M |  12.2 M |
| Repack       | 29.51 s | 29.01 s | 237.7 M | 238.2 M |

Here, the attempts to remove conflicts in the v2 function seem to cause
slight bloat to these sizes. This shows that the Git repository benefits
a lot from cross-path delta pairs.

The results are similar with the nodejs/node repo:

| Test         | V1 Time | V2 Time | V1 Size | V2 Size |
|--------------|---------|---------|---------|---------|
| Thin Pack    |  0.02 s |  0.02 s |   1.6 K |   1.6 K |
| Big Pack     |  4.61 s |  3.26 s |  56.0 M |  52.8 M |
| Shallow Pack |  7.82 s |  7.51 s | 104.6 M | 107.0 M |
| Repack       | 88.90 s | 73.75 s | 740.1 M | 764.5 M |

Here, the v2 name-hash causes some size bloat more often than it reduces
the size, but it also universally improves performance time, which is an
interesting reversal. This must mean that it is helping to short-circuit
some delta computations even if it is not finding the most efficient
ones. The performance improvement cannot be explained only due to the
I/O cost of writing the resulting packfile.

The Linux kernel repository was the initial target of the default name
hash value, and its naming conventions are practically build to take the
most advantage of the default name hash values:

| Test         | V1 Time  | V2 Time  | V1 Size | V2 Size |
|--------------|----------|----------|---------|---------|
| Thin Pack    |   0.17 s |   0.07 s |   4.6 K |   4.6 K |
| Big Pack     |  17.88 s |  12.35 s | 201.1 M | 159.1 M |
| Shallow Pack |  11.05 s |  22.94 s | 269.2 M | 273.8 M |
| Repack       | 727.39 s | 566.95 s |   2.5 G |   2.5 G |

Here, the thin and big packs gain some performance boosts in time, with
a modest gain in the size of the big pack. The shallow pack, however, is
more expensive to compute, likely because similarly-named files across
different directories are farther apart in the name hash ordering in v2.
The repack also gains benefits in computation time but no meaningful
change to the full size.

Finally, an internal Javascript repo of moderate size shows significant
gains when repacking with --name-hash-version=2 due to it having many name
hash collisions. However, it's worth noting that only the full repack
case has significant differences from the v1 name hash:

| Test      | V1 Time   | V2 Time  | V1 Size | V2 Size |
|-----------|-----------|----------|---------|---------|
| Thin Pack |    8.28 s |   7.28 s |  16.8 K |  16.8 K |
| Big Pack  |   12.81 s |  11.66 s |  29.1 M |  29.1 M |
| Shallow   |    4.86 s |   4.06 s |  42.5 M |  44.1 M |
| Repack    | 3126.50 s | 496.33 s |   6.2 G | 855.6 M |

Signed-off-by: Derrick Stolee <stolee@gmail.com>
Signed-off-by: Junio C Hamano <gitster@pobox.com>
2025-01-27 13:21:43 -08:00
.github ci: create script to set up Git for Windows SDK 2024-10-09 11:33:04 -07:00
Documentation repack: add --name-hash-version option 2025-01-27 13:21:43 -08:00
block-sha1 sha1: do not redefine `platform_SHA_CTX` and friends 2024-09-27 11:27:47 -07:00
builtin pack-objects: add GIT_TEST_NAME_HASH_VERSION 2025-01-27 13:21:43 -08:00
ci Merge branch 'ss/duplicate-typos' 2024-11-01 12:53:23 -04:00
compat Merge branch 'sk/msvc-warnings' 2024-10-25 14:02:44 -04:00
contrib Merge branch 'ps/platform-compat-fixes' 2024-11-01 12:53:17 -04:00
ewah
git-gui
gitk-git
gitweb
mergetools Merge branch 'ak/typofix-2.46-maint' 2024-10-04 14:21:40 -07:00
negotiator Merge branch 'ps/leakfixes-part-6' 2024-09-20 11:16:30 -07:00
oss-fuzz
perl Merge branch 'ak/typofix-2.46-maint' 2024-10-04 14:21:40 -07:00
po Merge branch 'l10n-de-2.47' of github.com:ralfth/git 2024-10-06 12:06:21 +08:00
refs global: Fix duplicate word typos 2024-10-21 16:05:04 -04:00
reftable reftable: handle trivial `reftable_buf` errors 2024-10-17 16:59:56 -04:00
sha1 sha1: do not redefine `platform_SHA_CTX` and friends 2024-09-27 11:27:47 -07:00
sha1collisiondetection@855827c583
sha1dc
sha256
t p5313: add size comparison test 2025-01-27 13:21:43 -08:00
templates
trace2 Merge branch 'ak/typofix-2.46-maint' 2024-10-04 14:21:40 -07:00
xdiff
.cirrus.yml ci: update FreeBSD image to 13.4 2024-09-20 14:40:41 -07:00
.clang-format
.editorconfig
.gitattributes
.gitignore
.gitlab-ci.yml gitlab-ci: exercise Git on Windows 2024-10-09 11:33:05 -07:00
.gitmodules
.mailmap
.tsan-suppressions
CODE_OF_CONDUCT.md
COPYING
GIT-VERSION-GEN Start the 2.48 cycle 2024-10-10 14:22:30 -07:00
INSTALL
LGPL-2.1
Makefile Merge branch 'ps/platform-compat-fixes' 2024-11-01 12:53:17 -04:00
README.md
RelNotes Start the 2.48 cycle 2024-10-10 14:22:30 -07:00
SECURITY.md
abspath.c
abspath.h
aclocal.m4
add-interactive.c
add-interactive.h
add-patch.c
advice.c
advice.h
alias.c
alias.h
alloc.c
alloc.h
apply.c Merge branch 'ps/apply-leakfix' 2024-09-25 10:37:10 -07:00
apply.h
archive-tar.c
archive-zip.c
archive.c Merge branch 'rs/archive-with-attr-pathspec-fix' 2024-10-04 14:21:40 -07:00
archive.h
attr.c Merge branch 'ak/typofixes' 2024-09-23 10:35:07 -07:00
attr.h
banned.h
base85.c
base85.h
bisect.c
bisect.h
blame.c
blame.h
blob.c
blob.h
bloom.c diff: improve lifecycle management of diff queues 2024-09-30 11:23:05 -07:00
bloom.h Merge branch 'ak/typofixes' 2024-09-23 10:35:07 -07:00
branch.c submodule: fix leaking submodule entry list 2024-09-30 11:23:03 -07:00
branch.h
builtin.h
bulk-checkin.c
bulk-checkin.h
bundle-uri.c bundle-uri: plug leak in unbundle_from_file() 2024-10-10 11:47:24 -07:00
bundle-uri.h
bundle.c
bundle.h
cache-tree.c Merge branch 'ps/cache-tree-w-broken-index-entry' 2024-10-22 14:40:38 -04:00
cache-tree.h cache-tree: refactor verification to return error codes 2024-10-07 15:08:11 -07:00
cbtree.c
cbtree.h
chdir-notify.c
chdir-notify.h
check-builtins.sh
checkout.c
checkout.h
chunk-format.c
chunk-format.h
color.c
color.h
column.c
column.h
combine-diff.c diff: fix leaking orderfile option 2024-09-27 08:25:35 -07:00
command-list.txt
commit-graph.c commit-graph: remove unnecessary UNLEAK 2024-09-23 10:03:59 -07:00
commit-graph.h
commit-reach.c
commit-reach.h
commit-slab-decl.h
commit-slab-impl.h
commit-slab.h
commit.c commit: avoid leaking already-saved buffer 2024-09-25 10:24:53 -07:00
commit.h Merge branch 'jc/pass-repo-to-builtins' 2024-09-23 10:35:09 -07:00
common-main.c
config.c config: fix evaluating "onbranch" with nonexistent git dir 2024-09-24 09:18:17 -07:00
config.h
config.mak.dev Merge branch 'ak/typofix-2.46-maint' 2024-09-25 10:37:12 -07:00
config.mak.in
config.mak.uname Merge branch 'rj/cygwin-has-dev-tty' into maint-2.46 2024-09-23 10:32:59 -07:00
configure.ac
connect.c connect: clear child process before freeing in diagnostic mode 2024-09-25 10:24:53 -07:00
connect.h
connected.c packfile: use object_id in find_pack_entry_one() 2024-10-25 17:35:46 -04:00
connected.h
convert.c
convert.h
copy.c
copy.h
credential.c Merge branch 'ds/background-maintenance-with-credential' 2024-09-30 16:16:16 -07:00
credential.h
csum-file.c csum-file.c: use unsafe SHA-1 implementation when available 2024-09-27 11:27:47 -07:00
csum-file.h
ctype.c
daemon.c daemon: replace atoi() with strtoul_ui() and strtol_i() 2024-10-24 14:03:43 -04:00
date.c
date.h
decorate.c
decorate.h
delta-islands.c
delta-islands.h
delta.h
detect-compiler
diagnose.c
diagnose.h
diff-delta.c
diff-lib.c Merge branch 'jk/output-prefix-cleanup' 2024-10-10 14:22:30 -07:00
diff-merges.c
diff-merges.h
diff-no-index.c
diff.c Merge branch 'jk/output-prefix-cleanup' 2024-10-10 14:22:30 -07:00
diff.h Merge branch 'jk/output-prefix-cleanup' 2024-10-10 14:22:30 -07:00
diffcore-break.c diff: improve lifecycle management of diff queues 2024-09-30 11:23:05 -07:00
diffcore-delta.c
diffcore-order.c diffcore-order: fix leaking buffer when parsing orderfiles 2024-09-27 08:25:35 -07:00
diffcore-pickaxe.c diff: improve lifecycle management of diff queues 2024-09-30 11:23:05 -07:00
diffcore-rename.c diff: improve lifecycle management of diff queues 2024-09-30 11:23:05 -07:00
diffcore-rotate.c diff: improve lifecycle management of diff queues 2024-09-30 11:23:05 -07:00
diffcore.h diff: improve lifecycle management of diff queues 2024-09-30 11:23:05 -07:00
dir-iterator.c
dir-iterator.h
dir.c Merge branch 'ps/leakfixes-part-7' 2024-10-02 07:46:26 -07:00
dir.h
editor.c
editor.h
entry.c
entry.h
environment.c
environment.h
exec-cmd.c
exec-cmd.h
fetch-negotiator.c
fetch-negotiator.h
fetch-pack.c Merge branch 'ps/environ-wo-the-repository' 2024-09-23 10:35:05 -07:00
fetch-pack.h
fmt-merge-msg.c
fmt-merge-msg.h
fsck.c
fsck.h
fsmonitor--daemon.h
fsmonitor-ipc.c
fsmonitor-ipc.h
fsmonitor-ll.h
fsmonitor-path-utils.h
fsmonitor-settings.c fsmonitor-settings: fix a typo 2024-09-19 13:46:00 -07:00
fsmonitor-settings.h
fsmonitor.c global: Fix duplicate word typos 2024-10-21 16:05:04 -04:00
fsmonitor.h
generate-cmdlist.sh
generate-configlist.sh
generate-hooklist.sh
gettext.c
gettext.h
git-archimport.perl
git-compat-util.h
git-curl-compat.h
git-cvsexportcommit.perl
git-cvsimport.perl
git-cvsserver.perl
git-difftool--helper.sh
git-filter-branch.sh
git-instaweb.sh git-instaweb: fix a typo 2024-09-19 13:46:00 -07:00
git-merge-octopus.sh
git-merge-one-file.sh
git-merge-resolve.sh
git-mergetool--lib.sh
git-mergetool.sh
git-p4.py git-p4: fix typos 2024-09-19 13:46:00 -07:00
git-quiltimport.sh
git-request-pull.sh
git-send-email.perl
git-sh-i18n.sh
git-sh-setup.sh
git-submodule.sh
git-svn.perl
git-web--browse.sh
git-zlib.c
git-zlib.h
git.c Merge branch 'jc/a-commands-without-the-repo' 2024-10-25 14:02:36 -04:00
git.rc
gpg-interface.c Merge branch 'ak/typofix-2.46-maint' 2024-09-25 10:37:12 -07:00
gpg-interface.h
graph.c Merge branch 'jk/output-prefix-cleanup' 2024-10-10 14:22:30 -07:00
graph.h
grep.c grep: disable lookahead on error 2024-10-22 12:45:49 -04:00
grep.h
hash-lookup.c
hash-lookup.h
hash.h hash.h: set NEEDS_CLONE_HELPER_UNSAFE in fallback mode 2024-10-03 11:18:36 -07:00
hashmap.c
hashmap.h
help.c Merge branch 'jc/pass-repo-to-builtins' 2024-09-23 10:35:09 -07:00
help.h
hex-ll.c
hex-ll.h
hex.c
hex.h
hook.c
hook.h
http-backend.c
http-fetch.c http-fetch: clear leaking git-index-pack(1) arguments 2024-09-25 10:24:52 -07:00
http-push.c packfile: convert find_sha1_pack() to use object_id 2024-10-25 17:35:46 -04:00
http-walker.c packfile: convert find_sha1_pack() to use object_id 2024-10-25 17:35:46 -04:00
http.c Merge branch 'jk/dumb-http-finalize' 2024-11-01 12:53:32 -04:00
http.h http: fix leak of http_object_request struct 2024-09-25 10:24:55 -07:00
ident.c
ident.h
imap-send.c imap: replace atoi() with strtol_i() for UIDVALIDITY and UIDNEXT parsing 2024-10-24 14:03:44 -04:00
iterator.h
json-writer.c
json-writer.h
khash.h
kwset.c
kwset.h
levenshtein.c
levenshtein.h
line-log.c Merge branch 'jk/output-prefix-cleanup' 2024-10-10 14:22:30 -07:00
line-log.h
line-range.c
line-range.h
linear-assignment.c
linear-assignment.h
list-objects-filter-options.c
list-objects-filter-options.h
list-objects-filter.c
list-objects-filter.h
list-objects.c
list-objects.h
list.h
lockfile.c
lockfile.h
log-tree.c Merge branch 'ng/rebase-merges-branch-name-as-label' 2024-10-18 13:56:22 -04:00
log-tree.h rebase-update-refs: extract load_branch_decorations 2024-10-09 10:52:45 -07:00
loose.c loose: don't rely on repository global state 2024-10-09 11:51:31 -07:00
loose.h
ls-refs.c
ls-refs.h
mailinfo.c
mailinfo.h
mailmap.c
mailmap.h
match-trees.c
match-trees.h
mem-pool.c
mem-pool.h
merge-blobs.c
merge-blobs.h
merge-ll.c merge: replace atoi() with strtol_i() for marker size validation 2024-10-24 14:03:44 -04:00
merge-ll.h
merge-ort-wrappers.c
merge-ort-wrappers.h
merge-ort.c diff: improve lifecycle management of diff queues 2024-09-30 11:23:05 -07:00
merge-ort.h
merge-recursive.c
merge-recursive.h
merge.c
merge.h
mergesort.h
midx-write.c pack-write: fix return parameter of `write_rev_file_order()` 2024-09-30 11:23:08 -07:00
midx.c packfile: use object_id in find_pack_entry_one() 2024-10-25 17:35:46 -04:00
midx.h
name-hash.c
name-hash.h
notes-cache.c
notes-cache.h
notes-merge.c
notes-merge.h
notes-utils.c
notes-utils.h
notes.c
notes.h
object-file-convert.c
object-file-convert.h
object-file.c Merge branch 'tb/weak-sha1-for-tail-sum' 2024-10-02 07:46:27 -07:00
object-file.h finalize_object_file(): implement collision check 2024-09-27 11:27:47 -07:00
object-name.c
object-name.h
object-store-ll.h
object-store.h
object.c
object.h
oid-array.c
oid-array.h
oidmap.c
oidmap.h
oidset.c
oidset.h
oidtree.c global: Fix duplicate word typos 2024-10-21 16:05:04 -04:00
oidtree.h
pack-bitmap-write.c pack-bitmap-write: fix leaking OID array 2024-09-30 11:23:07 -07:00
pack-bitmap.c packfile: use object_id in find_pack_entry_one() 2024-10-25 17:35:46 -04:00
pack-bitmap.h
pack-check.c
pack-mtimes.c
pack-mtimes.h
pack-objects.c
pack-objects.h pack-objects: create new name-hash function version 2025-01-27 13:21:05 -08:00
pack-revindex.c
pack-revindex.h
pack-write.c Merge branch 'ps/leakfixes-part-8' 2024-10-10 14:22:29 -07:00
pack.h pack-write: fix return parameter of `write_rev_file_order()` 2024-09-30 11:23:08 -07:00
packfile.c packfile: use oidread() instead of hashcpy() to fill object_id 2024-10-25 17:35:46 -04:00
packfile.h packfile: use object_id in find_pack_entry_one() 2024-10-25 17:35:46 -04:00
pager.c
pager.h
parallel-checkout.c
parallel-checkout.h
parse-options-cb.c
parse-options.c parse-options: free previous value of `OPTION_FILENAME` 2024-09-27 08:25:35 -07:00
parse-options.h
parse.c
parse.h
patch-delta.c
patch-ids.c
patch-ids.h
path.c
path.h
pathspec.c
pathspec.h
pkt-line.c
pkt-line.h
preload-index.c
preload-index.h
pretty.c Merge branch 'ak/typofix-2.46-maint' 2024-09-25 10:37:12 -07:00
pretty.h
prio-queue.c
prio-queue.h
progress.c
progress.h
promisor-remote.c promisor-remote: fix leaking partial clone filter 2024-09-27 08:25:36 -07:00
promisor-remote.h
prompt.c
prompt.h
protocol-caps.c
protocol-caps.h
protocol.c
protocol.h
prune-packed.c
prune-packed.h
pseudo-merge.c pseudo-merge: fix leaking strmap keys 2024-09-30 11:23:06 -07:00
pseudo-merge.h pseudo-merge: fix various memory leaks 2024-09-30 11:23:06 -07:00
quote.c
quote.h
range-diff.c Merge branch 'jk/output-prefix-cleanup' 2024-10-10 14:22:30 -07:00
range-diff.h
reachable.c
reachable.h
read-cache-ll.h global: Fix duplicate word typos 2024-10-21 16:05:04 -04:00
read-cache.c Merge branch 'ps/cache-tree-w-broken-index-entry' 2024-10-22 14:40:38 -04:00
read-cache.h
rebase-interactive.c
rebase-interactive.h
rebase.c rebase: fix a typo 2024-09-19 13:46:01 -07:00
rebase.h
ref-filter.c ref-filter: format iteratively with lexicographic refname sorting 2024-10-21 16:46:03 -04:00
ref-filter.h
reflog-walk.c
reflog-walk.h
reflog.c
reflog.h
refs.c Merge branch 'ps/reftable-exclude' 2024-09-25 10:37:11 -07:00
refs.h Merge branch 'ak/typofix-2.46-maint' 2024-09-25 10:37:12 -07:00
refspec.c fetch: free "raw" string when shrinking refspec 2024-09-25 10:24:54 -07:00
refspec.h fetch: free "raw" string when shrinking refspec 2024-09-25 10:24:54 -07:00
remote-curl.c remote-curl: free HEAD ref with free_one_ref() 2024-09-25 10:24:56 -07:00
remote.c Merge branch 'xx/remote-server-option-config' 2024-10-15 16:56:43 -04:00
remote.h Merge branch 'xx/remote-server-option-config' 2024-10-15 16:56:43 -04:00
replace-object.c
replace-object.h
repo-settings.c
repo-settings.h
repository.c Merge branch 'ps/environ-wo-the-repository' 2024-09-23 10:35:05 -07:00
repository.h
rerere.c
rerere.h
reset.c
reset.h
resolve-undo.c
resolve-undo.h
revision.c revision: fix leaking saved parents 2024-09-30 11:23:07 -07:00
revision.h revision: fix a typo 2024-09-19 13:46:12 -07:00
run-command.c
run-command.h run-command: fix a typo 2024-09-19 13:46:12 -07:00
sane-ctype.h
scalar.c Merge branch 'ps/leakfixes-part-8' 2024-10-10 14:22:29 -07:00
send-pack.c Merge branch 'ps/leakfixes-part-6' 2024-09-20 11:16:30 -07:00
send-pack.h
sequencer.c rebase-merges: try and use branch names as labels 2024-10-09 10:52:46 -07:00
sequencer.h
serve.c
serve.h
server-info.c
server-info.h
setup.c worktree: link worktrees with relative paths 2024-10-08 11:49:22 -07:00
setup.h
sh-i18n--envsubst.c
sha1dc_git.c
sha1dc_git.h sha1: do not redefine `platform_SHA_CTX` and friends 2024-09-27 11:27:47 -07:00
shallow.c shallow: fix leak when unregistering last shallow root 2024-09-25 10:24:52 -07:00
shallow.h
shared.mak
shell.c shell: fix leaking strings 2024-09-30 11:23:03 -07:00
shortlog.h
sideband.c Merge branch 'ak/typofix-2.46-maint' 2024-09-25 10:37:12 -07:00
sideband.h
sigchain.c
sigchain.h
simple-ipc.h simple-ipc: split async server initialization and running 2024-10-08 12:03:56 -07:00
sparse-index.c Merge branch 'ds/sparse-checkout-expansion-advice' 2024-10-02 07:46:25 -07:00
sparse-index.h sparse-checkout: disable advice in 'disable' 2024-09-23 13:19:01 -07:00
split-index.c
split-index.h
stable-qsort.c
statinfo.c
statinfo.h
strbuf.c
strbuf.h
streaming.c
streaming.h
string-list.c
string-list.h
strmap.c
strmap.h
strvec.c
strvec.h
sub-process.c
sub-process.h
submodule-config.c submodule: fix leaking submodule entry list 2024-09-30 11:23:03 -07:00
submodule-config.h submodule: fix leaking submodule entry list 2024-09-30 11:23:03 -07:00
submodule.c Merge branch 'ps/leakfixes-part-7' 2024-10-02 07:46:26 -07:00
submodule.h submodule: fix leaking update strategy 2024-09-27 08:25:34 -07:00
symlinks.c
symlinks.h
tag.c
tag.h
tar.h
tempfile.c
tempfile.h
thread-utils.c
thread-utils.h
tmp-objdir.c Merge branch 'tb/weak-sha1-for-tail-sum' 2024-10-02 07:46:27 -07:00
tmp-objdir.h
trace.c
trace.h
trace2.c
trace2.h refs/reftable: wire up support for exclude patterns 2024-09-16 13:57:19 -07:00
trailer.c
trailer.h
transport-helper.c transport-helper: fix leak of dummy refs_list 2024-09-25 10:24:54 -07:00
transport-internal.h
transport.c transport.c:🤝 make use of server options from remote 2024-10-08 10:22:08 -07:00
transport.h transport: introduce parse_transport_option() method 2024-10-08 10:22:06 -07:00
tree-diff.c
tree-walk.c
tree-walk.h
tree.c
tree.h
unicode-width.h
unimplemented.sh
unix-socket.c
unix-socket.h
unix-stream-server.c
unix-stream-server.h
unpack-trees.c unpack-trees: detect mismatching number of cache-tree/index entries 2024-10-07 15:08:11 -07:00
unpack-trees.h
upload-pack.c Merge branch 'ak/typofix-2.46-maint' 2024-09-25 10:37:12 -07:00
upload-pack.h
url.c
url.h
urlmatch.c
urlmatch.h
usage.c
userdiff.c
userdiff.h
utf8.c
utf8.h utf8.h: squelch unused-parameter warnings with NO_ICONV 2024-10-02 15:52:48 -07:00
varint.c
varint.h
version.c
version.h
versioncmp.c
versioncmp.h
walker.c http-walker: use object_id instead of bare hash 2024-10-25 17:35:46 -04:00
walker.h http-walker: use object_id instead of bare hash 2024-10-25 17:35:46 -04:00
wildmatch.c
wildmatch.h
worktree.c worktree: link worktrees with relative paths 2024-10-08 11:49:22 -07:00
worktree.h worktree: link worktrees with relative paths 2024-10-08 11:49:22 -07:00
wrap-for-bin.sh
wrapper.c
wrapper.h
write-or-die.c
write-or-die.h
ws.c
ws.h
wt-status.c wt-status: fix leaking buffer with sparse directories 2024-09-30 11:23:03 -07:00
wt-status.h
xdiff-interface.c
xdiff-interface.h

README.md

Build status

Git - fast, scalable, distributed revision control system

Git is a fast, scalable, distributed revision control system with an unusually rich command set that provides both high-level operations and full access to internals.

Git is an Open Source project covered by the GNU General Public License version 2 (some parts of it are under different licenses, compatible with the GPLv2). It was originally written by Linus Torvalds with help of a group of hackers around the net.

Please read the file INSTALL for installation instructions.

Many Git online resources are accessible from https://git-scm.com/ including full documentation and Git related tools.

See Documentation/gittutorial.txt to get started, then see Documentation/giteveryday.txt for a useful minimum set of commands, and Documentation/git-<commandname>.txt for documentation of each command. If git has been correctly installed, then the tutorial can also be read with man gittutorial or git help tutorial, and the documentation of each command with man git-<commandname> or git help <commandname>.

CVS users may also want to read Documentation/gitcvs-migration.txt (man gitcvs-migration or git help cvs-migration if git is installed).

The user discussion and development of Git take place on the Git mailing list -- everyone is welcome to post bug reports, feature requests, comments and patches to git@vger.kernel.org (read Documentation/SubmittingPatches for instructions on patch submission and Documentation/CodingGuidelines).

Those wishing to help with error message, usage and informational message string translations (localization l10) should see po/README.md (a po file is a Portable Object file that holds the translations).

To subscribe to the list, send an email to git+subscribe@vger.kernel.org (see https://subspace.kernel.org/subscribing.html for details). The mailing list archives are available at https://lore.kernel.org/git/, https://marc.info/?l=git and other archival sites.

Issues which are security relevant should be disclosed privately to the Git Security mailing list git-security@googlegroups.com.

The maintainer frequently sends the "What's cooking" reports that list the current status of various development topics to the mailing list. The discussion following them give a good reference for project status, development direction and remaining tasks.

The name "git" was given by Linus Torvalds when he wrote the very first version. He described the tool as "the stupid content tracker" and the name as (depending on your mood):

  • random three-letter combination that is pronounceable, and not actually used by any common UNIX command. The fact that it is a mispronunciation of "get" may or may not be relevant.
  • stupid. contemptible and despicable. simple. Take your pick from the dictionary of slang.
  • "global information tracker": you're in a good mood, and it actually works for you. Angels sing, and a light suddenly fills the room.
  • "goddamn idiotic truckload of sh*t": when it breaks