Browse Source

coccinelle: use <...> for function exclusion

Sometimes we want to suppress a coccinelle transformation
inside a particular function. For example, in finding
conversions of hashcmp() to oidcmp(), we should not convert
the call in oidcmp() itself, since that would cause infinite
recursion. We write that like this:

  @@
  identifier f != oidcmp;
  expression E1, E2;
  @@
    f(...) {...
  - hashcmp(E1->hash, E2->hash)
  + oidcmp(E1, E2)
    ...}

to match the interior of any function _except_ oidcmp().

Unfortunately, this doesn't catch all cases (e.g., the one
in sequencer.c that this patch fixes). The problem, as
explained by one of the Coccinelle developers in [1], is:

  For transformation, A ... B requires that B occur on every
  execution path starting with A, unless that execution path
  ends up in error handling code.  (eg, if (...) { ...
  return; }).  Here your A is the start of the function.  So
  you need a call to hashcmp on every path through the
  function, which fails when you add ifs.

  [...]

  Another issue with A ... B is that by default A and B
  should not appear in the matched region.  So your original
  rule matches only the case where every execution path
  contains exactly one call to hashcmp, not more than one.

One way to solve this is to put the pattern inside an
angle-bracket pattern like "<... P ...>", which allows zero
or more matches of P. That works (and is what this patch
does), but it has one drawback: it matches more than we care
about, and Coccinelle uses extra CPU. Here are timings for
"make coccicheck" before and after this patch:

  [before]
  real	1m27.122s
  user	7m34.451s
  sys	0m37.330s

  [after]
  real	2m18.040s
  user	10m58.310s
  sys	0m41.549s

That's not ideal, but it's more important for this to be
correct than to be fast. And coccicheck is already fairly
slow (and people don't run it for every single patch). So
it's an acceptable tradeoff.

There _is_ a better way to do it, which is to record the
position at which we find hashcmp(), and then check it
against the forbidden function list. Like:

  @@
  position p : script:python() { p[0].current_element != "oidcmp" };
  expression E1,E2;
  @@
  - hashcmp@p(E1->hash, E2->hash)
  + oidcmp(E1, E2)

This is only a little slower than the current code, and does
the right thing in all cases. Unfortunately, not all builds
of Coccinelle include python support (including the ones in
Debian). Requiring it may mean that fewer people can easily
run the tool, which is worse than it simply being a little
slower.

We may want to revisit this decision in the future if:

  - builds with python become more common

  - we find more uses for python support that tip the
    cost-benefit analysis

But for now this patch sticks with the angle-bracket
solution, and converts all existing cocci patches. This
fixes only one missed case in the current code, though it
makes a much better difference for some new rules I'm adding
(converting "!hashcmp()" to "hasheq()" misses over half the
possible conversions using the old form).

[1] https://public-inbox.org/git/alpine.DEB.2.21.1808240652370.2344@hadrien/

Helped-by: Julia Lawall <julia.lawall@lip6.fr>
Signed-off-by: Jeff King <peff@peff.net>
Signed-off-by: Junio C Hamano <gitster@pobox.com>
maint
Jeff King 6 years ago committed by Junio C Hamano
parent
commit
4d168e742a
  1. 4
      contrib/coccinelle/commit.cocci
  2. 20
      contrib/coccinelle/object_id.cocci
  3. 2
      sequencer.c

4
contrib/coccinelle/commit.cocci

@ -15,10 +15,10 @@ expression c;
identifier f !~ "^(get_commit_tree|get_commit_tree_in_graph_one|load_tree_for_commit)$"; identifier f !~ "^(get_commit_tree|get_commit_tree_in_graph_one|load_tree_for_commit)$";
expression c; expression c;
@@ @@
f(...) {... f(...) {<...
- c->maybe_tree - c->maybe_tree
+ get_commit_tree(c) + get_commit_tree(c)
...} ...>}


@@ @@
expression c; expression c;

20
contrib/coccinelle/object_id.cocci

@ -20,10 +20,10 @@ expression E1;
identifier f != oid_to_hex; identifier f != oid_to_hex;
expression E1; expression E1;
@@ @@
f(...) {... f(...) {<...
- sha1_to_hex(E1->hash) - sha1_to_hex(E1->hash)
+ oid_to_hex(E1) + oid_to_hex(E1)
...} ...>}


@@ @@
expression E1, E2; expression E1, E2;
@ -35,10 +35,10 @@ expression E1, E2;
identifier f != oid_to_hex_r; identifier f != oid_to_hex_r;
expression E1, E2; expression E1, E2;
@@ @@
f(...) {... f(...) {<...
- sha1_to_hex_r(E1, E2->hash) - sha1_to_hex_r(E1, E2->hash)
+ oid_to_hex_r(E1, E2) + oid_to_hex_r(E1, E2)
...} ...>}


@@ @@
expression E1; expression E1;
@ -50,10 +50,10 @@ expression E1;
identifier f != oidclr; identifier f != oidclr;
expression E1; expression E1;
@@ @@
f(...) {... f(...) {<...
- hashclr(E1->hash) - hashclr(E1->hash)
+ oidclr(E1) + oidclr(E1)
...} ...>}


@@ @@
expression E1, E2; expression E1, E2;
@ -65,10 +65,10 @@ expression E1, E2;
identifier f != oidcmp; identifier f != oidcmp;
expression E1, E2; expression E1, E2;
@@ @@
f(...) {... f(...) {<...
- hashcmp(E1->hash, E2->hash) - hashcmp(E1->hash, E2->hash)
+ oidcmp(E1, E2) + oidcmp(E1, E2)
...} ...>}


@@ @@
expression E1, E2; expression E1, E2;
@ -92,10 +92,10 @@ expression E1, E2;
identifier f != oidcpy; identifier f != oidcpy;
expression E1, E2; expression E1, E2;
@@ @@
f(...) {... f(...) {<...
- hashcpy(E1->hash, E2->hash) - hashcpy(E1->hash, E2->hash)
+ oidcpy(E1, E2) + oidcpy(E1, E2)
...} ...>}


@@ @@
expression E1, E2; expression E1, E2;

2
sequencer.c

@ -4545,7 +4545,7 @@ int skip_unnecessary_picks(void)
if (item->commit->parents->next) if (item->commit->parents->next)
break; /* merge commit */ break; /* merge commit */
parent_oid = &item->commit->parents->item->object.oid; parent_oid = &item->commit->parents->item->object.oid;
if (hashcmp(parent_oid->hash, oid->hash)) if (oidcmp(parent_oid, oid))
break; break;
oid = &item->commit->object.oid; oid = &item->commit->object.oid;
} }

Loading…
Cancel
Save