Toon Claes 2a04e8c293 last-modified: implement faster algorithm
The current implementation of git-last-modified(1) works by doing a
revision walk, and inspecting the diff at each level of that walk to
annotate entries remaining in the hashmap of paths. In other words, if
the diff at some level touches a path which has not yet been associated
with a commit, then that commit becomes associated with the path.
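
As a rough illustration of the approach described above (not the actual C
code), the walk can be modelled in Python. The function name
`last_modified_walk` and the `(commit_id, changed_paths)` input shape are
assumptions made for this sketch; the real command derives the changed paths
from tree diffs during the revision walk.

```python
def last_modified_walk(walk, paths):
    """Annotate each path with the first commit in the walk that touches it.

    walk:  iterable of (commit_id, changed_paths), newest commit first
           (a stand-in for a revision walk plus a diff at each commit).
    paths: the paths of interest.
    """
    remaining = set(paths)   # paths not yet associated with a commit
    result = {}
    for commit_id, changed in walk:
        touched = remaining & set(changed)
        for path in touched:
            result[path] = commit_id
        remaining -= touched
        if not remaining:    # every path is annotated, stop walking
            break
    return result


# Hypothetical three-commit history, newest first.
WALK = [
    ("C", {"c.txt"}),
    ("B", {"a.txt"}),
    ("A", {"a.txt", "b.txt"}),
]
print(last_modified_walk(WALK, ["a.txt", "b.txt", "c.txt"]))
```

Note that 'b.txt' forces the loop to visit every commit down to 'A', which is
the second of the two slow scenarios listed below.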

While a perfectly reasonable implementation, it can perform poorly in
either one of two scenarios:

  1. There are many entries of interest, in which case there is simply
     a lot of work to do.

  2. Or, there are (even a few) entries which have not been updated in a
     long time, and so we must walk through a lot of history in order to
     find a commit that touches that path.

This patch rewrites the last-modified implementation to address the
second point. The idea behind the algorithm is to propagate a set of
'active' paths (a path is 'active' if it has not yet been associated
with a commit) up to parents and do a truncated revision walk.

The walk is truncated because it does not produce a revision for every
change in the original pathspec, but rather only for active paths.

More specifically, consider a priority queue of commits sorted by
generation number. First, enqueue the set of boundary commits with all
paths in the original spec marked as interesting.

Then, while the queue is not empty, do the following:

  1. Pop an element, say, 'c', off of the queue, making sure that 'c'
     isn't reachable by anything in the '--not' set.

  2. For each parent 'p' (with index 'parent_i') of 'c', do the
     following:

     a. Compute the diff between 'c' and 'p'.
     b. Pass any active paths that are TREESAME from 'c' to 'p'.
     c. If 'p' has any active paths, push it onto the queue.

  3. Any path that remains active on 'c' is associated to that commit.
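
The steps above can be sketched in Python on a toy history. This is a
minimal model under stated assumptions, not the C implementation: the
`HISTORY` layout (generation number, parents, and a path-to-content map per
commit) and the function name `last_modified` are hypothetical, there is a
single starting commit, the '--not' reachability check from step 1 is
omitted, and a path is handed to the first parent it is TREESAME to.

```python
import heapq


def last_modified(commits, tip, paths):
    """Map each path to the commit that last changed it, starting at tip."""
    active = {tip: set(paths)}                   # commit -> its active paths
    queue = [(-commits[tip]["gen"], tip)]        # highest generation first
    result = {}
    while queue:
        _, c = heapq.heappop(queue)
        remaining = active.pop(c)
        tree_c = commits[c]["tree"]
        for p in commits[c]["parents"]:
            tree_p = commits[p]["tree"]
            # Paths whose content is identical in 'c' and 'p' are TREESAME.
            same = {x for x in remaining if tree_c.get(x) == tree_p.get(x)}
            if same:
                if p not in active:              # enqueue each parent once
                    active[p] = set()
                    heapq.heappush(queue, (-commits[p]["gen"], p))
                active[p] |= same
                remaining -= same
        for path in remaining:                   # still active: 'c' changed it
            result[path] = c
    return result


# Hypothetical history: A <- B <- C, A <- S, and M merges C and S.
HISTORY = {
    "A": {"gen": 1, "parents": [], "tree": {"a.txt": 1, "b.txt": 1}},
    "B": {"gen": 2, "parents": ["A"], "tree": {"a.txt": 2, "b.txt": 1}},
    "S": {"gen": 2, "parents": ["A"], "tree": {"a.txt": 1, "b.txt": 9}},
    "C": {"gen": 3, "parents": ["B"],
          "tree": {"a.txt": 2, "b.txt": 1, "c.txt": 1}},
    "M": {"gen": 4, "parents": ["C", "S"],
          "tree": {"a.txt": 2, "b.txt": 9, "c.txt": 1}},
}
print(last_modified(HISTORY, "M", ["a.txt", "b.txt", "c.txt"]))
```

In this example 'A' is never enqueued: once 'B' and 'S' have claimed 'a.txt'
and 'b.txt', no active paths are left to pass further up, so that part of
history is never diffed.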

This ends up being equivalent to doing something like 'git log -1 --
$path' for each path simultaneously. But, it allows us to go much faster
than the original implementation by limiting the number of diffs we
compute, since we can avoid parts of history that would have been
considered by the revision walk in the original implementation, but are
known to be uninteresting to us because we have already marked all paths
in that area to be inactive.

To avoid computing many first-parent diffs, add another trick on top of
this and check if all paths active in 'c' are DEFINITELY NOT in c's
Bloom filter. Since the commit-graph only stores first-parent diffs in
the Bloom filters, we can only apply this trick to first-parent diffs.
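
To illustrate why a "definitely not" answer is enough to skip a diff, here is
a small self-contained Bloom filter in Python. It is only a sketch: the class
`PathBloom`, its sizes, the SHA-256 based hashing and the helper
`can_skip_first_parent_diff` are assumptions for this example, and the
filters stored in the commit-graph use a different hashing scheme and layout.

```python
import hashlib


class PathBloom:
    """Tiny Bloom filter over paths changed relative to the first parent."""

    def __init__(self, num_bits=64, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = 0

    def _positions(self, path):
        # Derive num_hashes bit positions from seeded SHA-256 digests.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{path}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, path):
        for pos in self._positions(path):
            self.bits |= 1 << pos

    def maybe_contains(self, path):
        # False means "definitely not changed"; True means "maybe changed".
        return all((self.bits >> pos) & 1 for pos in self._positions(path))


def can_skip_first_parent_diff(bloom, active_paths):
    """True only if no active path can have changed in this commit."""
    return not any(bloom.maybe_contains(p) for p in active_paths)


touched = PathBloom()
touched.add("a.txt")                 # the commit changed a.txt
untouched = PathBloom()              # a commit that changed nothing

print(can_skip_first_parent_diff(touched, {"a.txt", "b.txt"}))
print(can_skip_first_parent_diff(untouched, {"a.txt", "b.txt"}))
```

A Bloom filter never reports a false negative, so skipping the diff when all
active paths are reported absent is safe; a false positive only costs one
diff that turns out to be unnecessary.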

Comparing the performance of the new algorithm against 'master' shows
about a 3x improvement on git.git (3.67x without Bloom filters and 3.09x
with them, going by the mean times below):

    Benchmark 1: master   no bloom
      Time (mean ± σ):      2.868 s ±  0.023 s    [User: 2.811 s, System: 0.051 s]
      Range (min … max):    2.847 s …  2.926 s    10 runs

    Benchmark 2: master with bloom
      Time (mean ± σ):     949.9 ms ±  15.2 ms    [User: 907.6 ms, System: 39.5 ms]
      Range (min … max):   933.3 ms … 971.2 ms    10 runs

    Benchmark 3: HEAD     no bloom
      Time (mean ± σ):     782.0 ms ±   6.3 ms    [User: 740.7 ms, System: 39.2 ms]
      Range (min … max):   776.4 ms … 798.2 ms    10 runs

    Benchmark 4: HEAD   with bloom
      Time (mean ± σ):     307.1 ms ±   1.7 ms    [User: 276.4 ms, System: 29.9 ms]
      Range (min … max):   303.7 ms … 309.5 ms    10 runs

    Summary
      HEAD   with bloom ran
        2.55 ± 0.02 times faster than HEAD     no bloom
        3.09 ± 0.05 times faster than master with bloom
        9.34 ± 0.09 times faster than master   no bloom

In short, the existing implementation *with* Bloom filters is about as
fast as the new implementation *without* them. So, most repositories
should get a dramatic speed-up by just deploying this (even without
computing Bloom filters), and all repositories should get faster still
when computing Bloom filters.

For a more extreme example, `git last-modified -- COPYING t`, the new
implementation is more than 5 times faster:

    Benchmark 1: master
      Time (mean ± σ):      4.372 s ±  0.057 s    [User: 4.286 s, System: 0.062 s]
      Range (min … max):    4.308 s …  4.509 s    10 runs

    Benchmark 2: HEAD
      Time (mean ± σ):     826.3 ms ±  22.3 ms    [User: 784.1 ms, System: 39.2 ms]
      Range (min … max):   810.6 ms … 881.2 ms    10 runs

    Summary
      HEAD ran
        5.29 ± 0.16 times faster than master

As an added benefit, results are more consistent now. For example, the
implementation in 'master' gives:

    $ git log --max-count=1 --format=%H -- pkt-line.h
    15df15fe07

    $ git last-modified -- pkt-line.h
    15df15fe07	pkt-line.h

    $ git last-modified | grep pkt-line.h
    5b49c1af03	pkt-line.h

With the changes in this patch the results of git-last-modified(1)
always match those of `git log --max-count=1`.

One thing to note, though: the results might be output in a different
order than before. This is not considered to be an issue because the
order is not documented as guaranteed anywhere.

Based-on-patches-by: Derrick Stolee <stolee@gmail.com>
Based-on-patches-by: Taylor Blau <me@ttaylorr.com>
Signed-off-by: Taylor Blau <me@ttaylorr.com>
Signed-off-by: Toon Claes <toon@iotcl.com>
Acked-by: Taylor Blau <me@ttaylorr.com>
[jc: tweaked use of xcalloc() to unbreak coccicheck]
Signed-off-by: Junio C Hamano <gitster@pobox.com>
2025-11-03 07:25:41 -08:00
.github Merge branch 'ps/ci-rust' 2025-10-28 10:29:09 -07:00
Documentation The 27th batch 2025-10-30 08:00:20 -07:00
bin-wrappers
block-sha1
builtin last-modified: implement faster algorithm 2025-11-03 07:25:41 -08:00
ci Merge branch 'ps/ci-rust' 2025-10-28 10:29:09 -07:00
compat mingw: order `#include`s alphabetically 2025-10-09 13:21:28 -07:00
compiler-tricks
contrib Merge branch 'kf/log-shortlog-completion-fix' 2025-10-30 08:00:20 -07:00
ewah
git-gui Merge branch 'master' of https://github.com/j6t/git-gui 2025-09-10 14:28:23 -07:00
gitk-git Merge branch 'master' of https://github.com/j6t/gitk 2025-10-05 13:32:47 -07:00
gitweb
mergetools
negotiator
oss-fuzz commit-graph: refactor `parse_commit_graph()` to take a repository 2025-08-15 09:34:47 -07:00
perl
po git-gui: sync Makefiles with git.git 2025-09-06 11:59:48 +02:00
refs Merge branch 'ps/symlink-symref-deprecation' 2025-10-30 08:00:19 -07:00
reftable Merge branch 'kn/reftable-consistency-checks' 2025-10-13 22:00:35 -07:00
sha1
sha1collisiondetection@855827c583
sha1dc
sha256
src rust: support for Windows 2025-10-15 08:10:17 -07:00
subprojects meson: update subproject wrappers 2025-07-11 09:56:34 -07:00
t last-modified: implement faster algorithm 2025-11-03 07:25:41 -08:00
templates
trace2 trace2: do not use strbuf_split*() 2025-08-02 22:44:58 -07:00
xdiff Merge branch 'en/xdiff-cleanup' 2025-10-14 12:56:09 -07:00
.cirrus.yml
.clang-format clang-format: exclude control macros from SpaceBeforeParens 2025-09-28 08:37:23 -07:00
.editorconfig
.gitattributes
.gitignore Merge branch 'ps/rust-balloon' 2025-10-08 12:17:55 -07:00
.gitlab-ci.yml Merge branch 'ps/gitlab-ci-disable-windows-monitoring' into maint-2.51 2025-10-26 19:48:19 -07:00
.gitmodules
.mailmap mailmap: change primary address for Jonathan Tan 2025-10-07 10:38:21 -07:00
.tsan-suppressions
CODE_OF_CONDUCT.md
COPYING
Cargo.toml ci: verify minimum supported Rust version 2025-10-15 08:10:17 -07:00
GIT-BUILD-OPTIONS.in
GIT-VERSION-FILE.in
GIT-VERSION-GEN Git 2.51.2 2025-10-26 19:48:21 -07:00
INSTALL
LGPL-2.1
Makefile Merge branch 'tb/incremental-midx-part-3.1' 2025-10-29 12:38:24 -07:00
README.md gitk: add README with usage, build, and contribution details 2025-08-28 19:51:31 +02:00
RelNotes Git 2.51.2 2025-10-26 19:48:21 -07:00
SECURITY.md
abspath.c
abspath.h
aclocal.m4
add-interactive.c Merge branch 'sj/string-list' 2025-10-14 12:56:08 -07:00
add-interactive.h add-interactive: retain colorbool values longer 2025-09-16 18:00:25 -07:00
add-patch.c Merge branch 'rs/add-patch-document-p-for-pager' 2025-10-24 13:48:05 -07:00
advice.c initial branch: give hints after switching the default name 2025-09-18 11:44:47 -07:00
advice.h initial branch: give hints after switching the default name 2025-09-18 11:44:47 -07:00
alias.c
alias.h
alloc.c alloc: fix dangling pointer in alloc_state cleanup 2025-09-04 15:24:16 -07:00
alloc.h alloc: fix dangling pointer in alloc_state cleanup 2025-09-04 15:24:16 -07:00
apply.c Merge branch 'ps/object-file-wo-the-repository' 2025-08-05 11:53:55 -07:00
apply.h
archive-tar.c config: drop `git_config()` wrapper 2025-07-23 08:15:18 -07:00
archive-zip.c archive: flush deflate stream until Z_STREAM_END 2025-08-04 13:36:35 -07:00
archive.c config: drop `git_config_get_bool()` wrapper 2025-07-23 08:15:20 -07:00
archive.h
attr.c
attr.h
banned.h
base85.c
base85.h
bisect.c revision: add wrapper to setup_revisions() from a strvec 2025-09-22 14:27:03 -07:00
bisect.h rev-list: make "struct rev_list_info" static to the only user 2025-07-21 15:40:46 -07:00
blame.c blame: drop explicit check for commit graph 2025-09-04 16:16:21 -07:00
blame.h
blob.c
blob.h
bloom.c commit-graph: return commit graph from `repo_find_commit_pos_in_graph()` 2025-09-04 16:16:22 -07:00
bloom.h bloom: replace struct bloom_key * with struct bloom_keyvec 2025-07-14 10:03:03 -07:00
branch.c config: drop `git_config_get_multivar_gently()` wrapper 2025-07-23 08:15:21 -07:00
branch.h
builtin.h Merge branch 'tc/last-modified' 2025-09-08 14:54:35 -07:00
bundle-uri.c Merge branch 'ps/object-store' 2025-07-15 15:18:18 -07:00
bundle-uri.h
bundle.c Merge branch 'bc/use-sha256-by-default-in-3.0' 2025-07-21 09:14:25 -07:00
bundle.h
cache-tree.c odb: add transaction interface 2025-09-16 11:37:06 -07:00
cache-tree.h
cbtree.c
cbtree.h
chdir-notify.c
chdir-notify.h
check-builtins.sh
checkout.c config: drop `git_config_get_string()` wrapper 2025-07-23 08:15:19 -07:00
checkout.h
chunk-format.c
chunk-format.h
color.c color: return bool from want_color() 2025-09-16 18:00:25 -07:00
color.h color: return bool from want_color() 2025-09-16 18:00:25 -07:00
column.c
column.h
combine-diff.c Merge branch 'tc/last-modified-recursive-fix' 2025-09-29 11:40:35 -07:00
command-list.txt Merge branch 'tc/last-modified' 2025-09-08 14:54:35 -07:00
commit-graph.c Merge branch 'ps/commit-graph-per-object-source' 2025-10-13 22:00:35 -07:00
commit-graph.h commit-graph: return commit graph from `repo_find_commit_pos_in_graph()` 2025-09-04 16:16:22 -07:00
commit-reach.c
commit-reach.h
commit-slab-decl.h
commit-slab-impl.h
commit-slab.h
commit.c Merge branch 'ps/remote-rename-fix' 2025-08-21 13:46:58 -07:00
commit.h Merge branch 'lm/add-p-context' 2025-08-04 08:10:33 -07:00
common-exit.c
common-init.c
common-init.h
common-main.c
config.c Merge branch 'jc/optional-path' 2025-10-14 12:56:09 -07:00
config.h Merge branch 'ps/config-wo-the-repository' 2025-08-04 08:10:33 -07:00
config.mak.dev
config.mak.in
config.mak.uname Merge branch 'bs/config-mak-freebsd' 2025-07-14 11:19:23 -07:00
configure.ac Merge branch 'rj/freebsd-sysinfo-build-fix' 2025-07-14 11:19:28 -07:00
connect.c Merge branch 'jc/string-list-split' 2025-08-21 13:46:59 -07:00
connect.h
connected.c packfile: introduce macro to iterate through packs 2025-10-16 14:42:39 -07:00
connected.h
convert.c config: drop `git_config()` wrapper 2025-07-23 08:15:18 -07:00
convert.h
copy.c
copy.h
credential.c
credential.h
csum-file.c
csum-file.h
ctype.c
daemon.c Merge branch 'ps/config-wo-the-repository' 2025-08-04 08:10:33 -07:00
date.c
date.h
decorate.c
decorate.h
delta-islands.c
delta-islands.h
delta.h
detect-compiler
diagnose.c
diagnose.h
diff-delta.c
diff-lib.c diff: teach tree-diff a max-depth parameter 2025-08-07 15:29:35 -07:00
diff-merges.c
diff-merges.h
diff-no-index.c diff --no-index: fix logic for paths ending in '/' 2025-09-25 11:35:20 -07:00
diff.c Merge branch 'jc/diff-from-contents-fix' 2025-10-24 09:10:37 -07:00
diff.h Merge branch 'tc/last-modified-recursive-fix' 2025-09-29 11:40:35 -07:00
diffcore-break.c
diffcore-delta.c
diffcore-order.c
diffcore-pickaxe.c
diffcore-rename.c
diffcore-rotate.c
diffcore.h
dir-iterator.c
dir-iterator.h
dir.c Merge branch 'ds/sparse-checkout-clean' 2025-10-28 10:29:09 -07:00
dir.h dir: add generic "walk all files" helper 2025-09-12 08:59:52 -07:00
editor.c config: drop `git_config_get_string()` wrapper 2025-07-23 08:15:19 -07:00
editor.h
entry.c
entry.h
environment.c Merge branch 'pw/3.0-commentchar-auto-deprecation' 2025-09-18 10:07:00 -07:00
environment.h config: warn on core.commentString=auto 2025-08-26 08:52:44 -07:00
exec-cmd.c
exec-cmd.h
fetch-negotiator.c
fetch-negotiator.h
fetch-pack.c packfile: split up responsibilities of `reprepare_packed_git()` 2025-09-24 11:53:50 -07:00
fetch-pack.h
fmt-merge-msg.c Merge branch 'ac/deglobal-fmt-merge-log-config' 2025-08-22 13:13:21 -07:00
fmt-merge-msg.h environment: remove the global variable 'merge_log_config' 2025-08-11 09:16:55 -07:00
for-each-ref.h Merge branch 'ja/doc-lint-sections-and-synopsis' 2025-08-25 14:22:02 -07:00
fsck.c fsck: consider gpgsig headers expected in tags 2025-10-09 17:46:14 -07:00
fsck.h Merge branch 'bc/sha1-256-interop-01' 2025-10-22 11:38:58 -07:00
fsmonitor--daemon.h
fsmonitor-ipc.c
fsmonitor-ipc.h
fsmonitor-ll.h
fsmonitor-path-utils.h
fsmonitor-settings.c
fsmonitor-settings.h
fsmonitor.c config: drop `git_config_get_int()` wrapper 2025-07-23 08:15:20 -07:00
fsmonitor.h
generate-cmdlist.sh
generate-configlist.sh
generate-hooklist.sh
generate-perl.sh
generate-python.sh
generate-script.sh git-gui: honor TCLTK_PATH in git-gui--askpass 2025-07-31 18:42:54 +02:00
gettext.c
gettext.h
git-archimport.perl
git-compat-util.h whatchanged: hint about git-log(1) and aliasing 2025-09-17 13:47:24 -07:00
git-curl-compat.h curl: add support for curl_global_trace() components 2025-08-27 09:49:43 -07:00
git-cvsexportcommit.perl
git-cvsimport.perl
git-cvsserver.perl
git-difftool--helper.sh
git-filter-branch.sh
git-instaweb.sh
git-merge-octopus.sh
git-merge-one-file.sh
git-merge-resolve.sh
git-mergetool--lib.sh
git-mergetool.sh
git-p4.py
git-quiltimport.sh
git-request-pull.sh
git-send-email.perl Merge branch 'nb/send-email-no-dup-reply-to' 2025-09-29 11:40:33 -07:00
git-sh-i18n.sh
git-sh-setup.sh
git-submodule.sh
git-svn.perl
git-web--browse.sh
git-zlib.c
git-zlib.h
git.c Merge branch 'kh/you-still-use-whatchanged-fix' 2025-10-02 12:26:12 -07:00
git.rc.in
gpg-interface.c Merge branch 'ob/gpg-interface-cleanup' 2025-10-30 08:00:19 -07:00
gpg-interface.h gpg-interface: refactor 'enum sign_mode' parsing 2025-09-17 11:18:28 -07:00
graph.c
graph.h
grep.c grep: don't treat grep_opt.color as a strict bool 2025-09-16 13:37:05 -07:00
grep.h color: use git_colorbool enum type to store colorbools 2025-09-16 17:59:53 -07:00
hash-lookup.c
hash-lookup.h
hash.c
hash.h Merge branch 'bc/use-sha256-by-default-in-3.0' 2025-07-21 09:14:25 -07:00
hashmap.c
hashmap.h
help.c help: report on whether or not Rust is enabled 2025-10-02 09:32:31 -07:00
help.h
hex-ll.c
hex-ll.h
hex.c
hex.h
hook.c
hook.h
http-backend.c packfile: introduce macro to iterate through packs 2025-10-16 14:42:39 -07:00
http-fetch.c config: move Git config parsing into "environment.c" 2025-07-23 08:15:22 -07:00
http-push.c Merge branch 'js/curl-off-t-fixes' into maint-2.51 2025-10-14 13:40:53 -07:00
http-walker.c
http.c packfile: introduce macro to iterate through packs 2025-10-16 14:42:39 -07:00
http.h Merge branch 'js/curl-off-t-fixes' 2025-10-07 12:25:27 -07:00
ident.c Merge branch 'ps/reflog-migrate-fixes' into maint-2.51 2025-10-15 10:29:28 -07:00
ident.h ident: fix type of string length parameter 2025-08-06 07:36:30 -07:00
imap-send.c Merge branch 'js/curl-off-t-fixes' into maint-2.51 2025-10-14 13:40:53 -07:00
iterator.h
json-writer.c
json-writer.h
khash.h
kwset.c
kwset.h
levenshtein.c
levenshtein.h
line-log.c Merge branch 'sg/line-log-boundary-fixes' into maint-2.51 2025-10-15 10:29:30 -07:00
line-log.h
line-range.c
line-range.h
linear-assignment.c
linear-assignment.h
list-objects-filter-options.c config: drop `git_config_set()` wrapper 2025-07-23 08:15:21 -07:00
list-objects-filter-options.h
list-objects-filter.c use repo_get_oid_with_flags() 2025-09-10 14:29:49 -07:00
list-objects-filter.h
list-objects.c
list-objects.h
list.h
lockfile.c
lockfile.h
log-tree.c Merge branch 'kh/format-patch-range-diff-notes' 2025-10-14 12:56:09 -07:00
log-tree.h color: use git_colorbool enum type to store colorbools 2025-09-16 17:59:53 -07:00
loose.c loose: write loose objects map via their source 2025-07-16 22:16:15 -07:00
loose.h loose: write loose objects map via their source 2025-07-16 22:16:15 -07:00
ls-refs.c config: drop `git_config()` wrapper 2025-07-23 08:15:18 -07:00
ls-refs.h
mailinfo.c config: move Git config parsing into "environment.c" 2025-07-23 08:15:22 -07:00
mailinfo.h
mailmap.c string-list: change "string_list_find_insert_index" return type to "size_t" 2025-10-06 09:11:07 -07:00
mailmap.h
match-trees.c odb: introduce `odb_write_object()` 2025-07-16 22:16:15 -07:00
match-trees.h
mem-pool.c
mem-pool.h
merge-blobs.c
merge-blobs.h
merge-ll.c config: drop `git_config()` wrapper 2025-07-23 08:15:18 -07:00
merge-ll.h
merge-ort-wrappers.c
merge-ort-wrappers.h
merge-ort.c Merge branch 'en/ort-rename-fixes' into maint-2.51 2025-10-15 10:29:28 -07:00
merge-ort.h
merge.c
merge.h
mergesort.h
meson.build Merge branch 'tb/incremental-midx-part-3.1' 2025-10-29 12:38:24 -07:00
meson_options.txt meson: add infrastructure to build internal Rust library 2025-10-02 09:32:31 -07:00
midx-write.c Merge branch 'ds/midx-write-fixes' into maint-2.51 2025-10-15 10:29:30 -07:00
midx.c packfile: move `get_multi_pack_index()` into "midx.c" 2025-09-24 11:53:50 -07:00
midx.h packfile: move `get_multi_pack_index()` into "midx.c" 2025-09-24 11:53:50 -07:00
name-hash.c
name-hash.h
notes-cache.c odb: introduce `odb_write_object()` 2025-07-16 22:16:15 -07:00
notes-cache.h
notes-merge.c
notes-merge.h
notes-utils.c config: drop `git_config()` wrapper 2025-07-23 08:15:18 -07:00
notes-utils.h
notes.c Merge branch 'jc/string-list-split' 2025-08-21 13:46:59 -07:00
notes.h
object-file-convert.c
object-file-convert.h
object-file.c Merge branch 'ps/packfile-store' 2025-10-07 12:25:27 -07:00
object-file.h odb: add transaction interface 2025-09-16 11:37:06 -07:00
object-name.c packfile: introduce macro to iterate through packs 2025-10-16 14:42:39 -07:00
object-name.h
object.c alloc: fix dangling pointer in alloc_state cleanup 2025-09-04 15:24:16 -07:00
object.h last-modified: implement faster algorithm 2025-11-03 07:25:41 -08:00
odb.c Merge branch 'ps/packfile-store' 2025-10-07 12:25:27 -07:00
odb.h Merge branch 'ps/odb-clean-stale-wrappers' 2025-10-07 12:25:28 -07:00
oid-array.c
oid-array.h
oidmap.c
oidmap.h
oidset.c
oidset.h
oidtree.c
oidtree.h
pack-bitmap-write.c Merge branch 'ps/object-store' 2025-07-15 15:18:18 -07:00
pack-bitmap.c packfile: introduce macro to iterate through packs 2025-10-16 14:42:39 -07:00
pack-bitmap.h
pack-check.c
pack-mtimes.c
pack-mtimes.h
pack-objects.c Merge branch 'ps/remove-packfile-store-get-packs' 2025-10-30 08:00:19 -07:00
pack-objects.h Merge branch 'ps/object-store' 2025-07-15 15:18:18 -07:00
pack-refs.c builtin/pack-refs: factor out core logic into a shared library 2025-09-19 10:02:55 -07:00
pack-refs.h builtin/pack-refs: factor out core logic into a shared library 2025-09-19 10:02:55 -07:00
pack-revindex.c midx: compute paths via their source 2025-08-11 09:22:23 -07:00
pack-revindex.h
pack-write.c object-file: get rid of `the_repository` in `finalize_object_file()` 2025-07-16 22:16:14 -07:00
pack.h object-file: get rid of `the_repository` in `finalize_object_file()` 2025-07-16 22:16:14 -07:00
packfile.c packfile: rename `packfile_store_get_all_packs()` 2025-10-16 14:42:40 -07:00
packfile.h packfile: rename `packfile_store_get_all_packs()` 2025-10-16 14:42:40 -07:00
pager.c
pager.h
parallel-checkout.c config: drop `git_config_get_int()` wrapper 2025-07-23 08:15:20 -07:00
parallel-checkout.h
parse-options-cb.c color: use git_colorbool enum type to store colorbools 2025-09-16 17:59:53 -07:00
parse-options.c Merge branch 'jc/optional-path' 2025-10-14 12:56:09 -07:00
parse-options.h Merge branch 'lm/add-p-context' 2025-08-04 08:10:33 -07:00
parse.c
parse.h
patch-delta.c
patch-ids.c
patch-ids.h
path-walk.c path-walk: create initializer for path lists 2025-08-25 09:01:17 -07:00
path-walk.h
path.c
path.h
pathspec.c string-list: split-then-remove-empty can be done while splitting 2025-08-02 22:34:45 -07:00
pathspec.h
pkt-line.c
pkt-line.h
preload-index.c
preload-index.h
pretty.c color: use git_colorbool enum type to store colorbools 2025-09-16 17:59:53 -07:00
pretty.h color: use git_colorbool enum type to store colorbools 2025-09-16 17:59:53 -07:00
prio-queue.c prio-queue: add prio_queue_replace() 2025-07-22 07:28:35 -07:00
prio-queue.h prio-queue: add prio_queue_replace() 2025-07-22 07:28:35 -07:00
progress.c progress: pay attention to (customized) delay time 2025-08-25 15:50:17 -07:00
progress.h
promisor-remote.c promisor-remote: use string_list_split() in mark_remotes_as_accepted() 2025-09-08 10:30:56 -07:00
promisor-remote.h
prompt.c interactive: do strip trailing CRLF from input 2025-07-31 14:17:54 -07:00
prompt.h
protocol-caps.c
protocol-caps.h
protocol.c Merge branch 'jc/string-list-split' 2025-08-21 13:46:59 -07:00
protocol.h
prune-packed.c object-file: get rid of `the_repository` in loose object iterators 2025-07-16 22:16:17 -07:00
prune-packed.h
pseudo-merge.c
pseudo-merge.h
quote.c
quote.h
range-diff.c range-diff: rename other_arg to log_arg 2025-09-25 11:34:11 -07:00
range-diff.h range-diff: rename other_arg to log_arg 2025-09-25 11:34:11 -07:00
reachable.c Merge branch 'ps/object-file-wo-the-repository' 2025-08-05 11:53:55 -07:00
reachable.h
read-cache-ll.h
read-cache.c Merge branch 'ps/rust-balloon' 2025-10-08 12:17:55 -07:00
read-cache.h
rebase-interactive.c config: drop `git_config_get_value()` wrapper 2025-07-23 08:15:18 -07:00
rebase-interactive.h
rebase.c
rebase.h
ref-filter.c Merge branch 'jc/string-list-split' 2025-08-21 13:46:59 -07:00
ref-filter.h color: use git_colorbool enum type to store colorbools 2025-09-16 17:59:53 -07:00
reflog-walk.c refs: pass refname when invoking reflog entry callback 2025-08-06 14:19:30 -07:00
reflog-walk.h
reflog.c Merge branch 'ps/remote-rename-fix' 2025-08-21 13:46:58 -07:00
reflog.h Merge branch 'ps/remote-rename-fix' 2025-08-21 13:46:58 -07:00
refs.c Merge branch 'kn/refs-files-case-insensitive' into maint-2.51 2025-10-15 10:29:31 -07:00
refs.h Merge branch 'kn/refs-files-case-insensitive' into maint-2.51 2025-10-15 10:29:31 -07:00
refspec.c
refspec.h
remote-curl.c Merge branch 'js/curl-off-t-fixes' into maint-2.51 2025-10-14 13:40:53 -07:00
remote.c Merge branch 'dl/push-missing-object-error' into maint-2.51 2025-10-15 10:29:28 -07:00
remote.h
repack-cruft.c packfile: introduce macro to iterate through packs 2025-10-16 14:42:39 -07:00
repack-filtered.c repack: move `write_filtered_pack()` out of the builtin 2025-10-16 10:08:57 -07:00
repack-geometry.c packfile: introduce macro to iterate through packs 2025-10-16 14:42:39 -07:00
repack-midx.c repack: 'write_midx_included_packs' API from the builtin 2025-10-16 10:08:56 -07:00
repack-promisor.c builtin/repack.c: remove "repack_promisor_objects()" from the builtin 2025-10-16 10:08:55 -07:00
repack.c packfile: introduce macro to iterate through packs 2025-10-16 14:42:39 -07:00
repack.h repack: move `write_cruft_pack()` out of the builtin 2025-10-16 10:08:57 -07:00
replace-object.c
replace-object.h
repo-settings.c
repo-settings.h
repository.c Merge branch 'pw/3.0-commentchar-auto-deprecation' 2025-09-18 10:07:00 -07:00
repository.h config: warn on core.commentString=auto 2025-08-26 08:52:44 -07:00
rerere.c config: move Git config parsing into "environment.c" 2025-07-23 08:15:22 -07:00
rerere.h
reset.c
reset.h
resolve-undo.c
resolve-undo.h
revision.c Merge branch 'ps/commit-graph-per-object-source' 2025-10-13 22:00:35 -07:00
revision.h Merge branch 'kh/format-patch-range-diff-notes' 2025-10-14 12:56:09 -07:00
run-command.c config: drop `git_config_get_bool()` wrapper 2025-07-23 08:15:20 -07:00
run-command.h
sane-ctype.h sane-ctype: fix compiler error on Amazon Linux 2 2025-07-10 11:18:37 -07:00
scalar.c commit-graph: add new config for changed-paths & recommend it in scalar 2025-10-22 10:40:11 -07:00
send-pack.c Merge branch 'ps/object-store' 2025-07-15 15:18:18 -07:00
send-pack.h
sequencer.c Merge branch 'pw/rebase-i-cleanup-fix' into maint-2.51 2025-10-15 10:29:31 -07:00
sequencer.h treewide: pass strvecs around for setup_revisions_from_strvec() 2025-09-22 14:27:03 -07:00
serve.c
serve.h
server-info.c packfile: introduce macro to iterate through packs 2025-10-16 14:42:39 -07:00
server-info.h
setup.c Merge branch 'jc/string-list-split' 2025-08-21 13:46:59 -07:00
setup.h
sh-i18n--envsubst.c
sha1dc_git.c
sha1dc_git.h
shallow.c treewide: pass strvecs around for setup_revisions_from_strvec() 2025-09-22 14:27:03 -07:00
shallow.h treewide: pass strvecs around for setup_revisions_from_strvec() 2025-09-22 14:27:03 -07:00
shared.mak Makefile: introduce infrastructure to build internal Rust library 2025-10-02 09:32:31 -07:00
shell.c
shortlog.h
sideband.c color: use git_colorbool enum type to store colorbools 2025-09-16 17:59:53 -07:00
sideband.h
sigchain.c
sigchain.h
simple-ipc.h
sparse-index.c sparse-index: improve advice message instructions 2025-10-20 09:20:50 -07:00
sparse-index.h
split-index.c
split-index.h
stable-qsort.c
statinfo.c
statinfo.h
strbuf.c strbuf: convert predicates to return bool 2025-07-16 08:18:06 -07:00
strbuf.h strbuf: convert predicates to return bool 2025-07-16 08:18:06 -07:00
streaming.c
streaming.h
string-list.c string-list: change "string_list_find_insert_index" return type to "size_t" 2025-10-06 09:11:07 -07:00
string-list.h string-list: change "string_list_find_insert_index" return type to "size_t" 2025-10-06 09:11:07 -07:00
strmap.c
strmap.h
strvec.c
strvec.h
sub-process.c sub-process: do not use strbuf_split*() 2025-08-02 22:44:58 -07:00
sub-process.h
submodule-config.c config: drop `git_config_set_in_file_gently()` wrapper 2025-07-23 08:15:21 -07:00
submodule-config.h
submodule.c treewide: use setup_revisions_from_strvec() when we have a strvec 2025-09-22 14:27:03 -07:00
submodule.h
symlinks.c
symlinks.h
tag.c
tag.h
tar.h
tempfile.c
tempfile.h
thread-utils.c
thread-utils.h
tmp-objdir.c object-file: get rid of `the_repository` in `finalize_object_file()` 2025-07-16 22:16:14 -07:00
tmp-objdir.h
trace.c
trace.h
trace2.c
trace2.h
trailer.c config: drop `git_config()` wrapper 2025-07-23 08:15:18 -07:00
trailer.h
transport-helper.c packfile: split up responsibilities of `reprepare_packed_git()` 2025-09-24 11:53:50 -07:00
transport-internal.h
transport.c Merge branch 'jk/color-variable-fixes' 2025-09-29 11:40:35 -07:00
transport.h
tree-diff.c diff: teach tree-diff a max-depth parameter 2025-08-07 15:29:35 -07:00
tree-walk.c
tree-walk.h
tree.c
tree.h
unicode-width.h unicode: update the width tables to Unicode 17 2025-10-21 10:03:00 -07:00
unimplemented.sh
unix-socket.c
unix-socket.h
unix-stream-server.c
unix-stream-server.h
unpack-trees.c
unpack-trees.h
upload-pack.c Merge branch 'ps/upload-pack-oom-protection' into maint-2.51 2025-10-15 10:29:30 -07:00
upload-pack.h
url.c
url.h
urlmatch.c
urlmatch.h
usage.c Merge branch 'kh/you-still-use-whatchanged-fix' 2025-10-02 12:26:12 -07:00
userdiff.c
userdiff.h
utf8.c
utf8.h
varint.c varint: use explicit width for integers 2025-10-02 09:32:32 -07:00
varint.h varint: use explicit width for integers 2025-10-02 09:32:32 -07:00
version-def.h.in
version.c
version.h
versioncmp.c config: drop `git_config_get_string_multi()` wrapper 2025-07-23 08:15:19 -07:00
versioncmp.h
walker.c Merge branch 'rs/pop-recent-commit-with-prio-queue' 2025-07-28 12:02:34 -07:00
walker.h
wildmatch.c
wildmatch.h
worktree.c config: drop `git_config_set_in_file_gently()` wrapper 2025-07-23 08:15:21 -07:00
worktree.h
wrapper.c config: values of pathname type can be prefixed with :(optional) 2025-10-07 10:05:48 -07:00
wrapper.h config: values of pathname type can be prefixed with :(optional) 2025-10-07 10:05:48 -07:00
write-or-die.c
write-or-die.h
ws.c
ws.h
wt-status.c Merge branch 'jk/status-z-short-fix' 2025-10-24 13:48:04 -07:00
wt-status.h color: use git_colorbool enum type to store colorbools 2025-09-16 17:59:53 -07:00
xdiff-interface.c config: move Git config parsing into "environment.c" 2025-07-23 08:15:22 -07:00
xdiff-interface.h diff: ensure consistent diff behavior with ignore options 2025-08-08 07:54:44 -07:00

README.md

Git - fast, scalable, distributed revision control system

Git is a fast, scalable, distributed revision control system with an unusually rich command set that provides both high-level operations and full access to internals.

Git is an Open Source project covered by the GNU General Public License version 2 (some parts of it are under different licenses, compatible with the GPLv2). It was originally written by Linus Torvalds with the help of a group of hackers around the net.

Please read the file INSTALL for installation instructions.

Many Git online resources are accessible from https://git-scm.com/ including full documentation and Git related tools.

See Documentation/gittutorial.adoc to get started, then see Documentation/giteveryday.adoc for a useful minimum set of commands, and Documentation/git-<commandname>.adoc for documentation of each command. If git has been correctly installed, then the tutorial can also be read with man gittutorial or git help tutorial, and the documentation of each command with man git-<commandname> or git help <commandname>.

CVS users may also want to read Documentation/gitcvs-migration.adoc (man gitcvs-migration or git help cvs-migration if git is installed).

The user discussion and development of Git take place on the Git mailing list -- everyone is welcome to post bug reports, feature requests, comments and patches to git@vger.kernel.org (read Documentation/SubmittingPatches for instructions on patch submission and Documentation/CodingGuidelines).

Those wishing to help with error message, usage and informational message string translations (localization, l10n) should see po/README.md (a po file is a Portable Object file that holds the translations).

To subscribe to the list, send an email to git+subscribe@vger.kernel.org (see https://subspace.kernel.org/subscribing.html for details). The mailing list archives are available at https://lore.kernel.org/git/, https://marc.info/?l=git and other archival sites.

Issues which are security relevant should be disclosed privately to the Git Security mailing list git-security@googlegroups.com.

The maintainer frequently sends the "What's cooking" reports that list the current status of various development topics to the mailing list. The discussion following them gives a good reference for project status, development direction and remaining tasks.

The name "git" was given by Linus Torvalds when he wrote the very first version. He described the tool as "the stupid content tracker" and the name as (depending on your mood):

  • random three-letter combination that is pronounceable, and not actually used by any common UNIX command. The fact that it is a mispronunciation of "get" may or may not be relevant.
  • stupid. contemptible and despicable. simple. Take your pick from the dictionary of slang.
  • "global information tracker": you're in a good mood, and it actually works for you. Angels sing, and a light suddenly fills the room.
  • "goddamn idiotic truckload of sh*t": when it breaks