git-hooks

mirror of https://github.com/AdaCore/git-hooks.git synced 2026-02-12 12:43:11 -08:00

Author	SHA1	Message	Date
Joel Brobecker	ef6c560214	exclude ignored refs from git-notes commit emails information A recent change enhanced the emails being sent for git notes to also provide the list of branches (technically, the references) which contain the annotated commit (S731-057). For instance: \| Subject: [notes][repo/branch] Annotated commit subject ^^^^^^^^^^^ And also this section in the "Diff:" \| For the record, the references containing the annotated \| commit above are: \| \| refs/heads/branch However, there is a small hole in the implementation. I forgot to take into account the "hooks.ignore-refs" config. As a result, for repositories hosted on Gerrit, the emails mention references which are internal to Gerrit. \| Subject: [notes][repo/master,(refs/changes/67/108467/1)] subject ^^^^^^^^^^^^^^^^^^^^^^^^^^ ... and ... in the email: \| For the record, the references containing the annotated commit \| above are: \| \| refs/changes/67/108467/1 \| refs/heads/master This commit fixes this oversight. Change-Id: I4e21d5c906e94c01b650615282258cc7b4fb81d9 TN: S731-057 (ticket introducing this feature) TN: V105-012 (ticket opened to fix the oversight)	2022-01-06 08:09:05 +04:00
Joel Brobecker	2ee1b2c47e	Git Notes commit emails: include references containing annotated commit This commit enhances the commit emails we sent for Git Notes updates. The goal is to include the names of the branches that contain the annotated commit, similar to how we include the name of the branch for regular commit emails, implemented in a way that this feature covers all references, rather than just all branches (some projects have branches under a non-standard namespace). In summary, notes commit emails have their subject changed from... [notes][repo] annotated commit subject ... to... [notes][repo/branch1,branch2] annotated commit subject Note that, if only one branch contains the annotated commit, and that branch name is "master", then the subject remains unchanged. This follows a practice we already have for regular commit emails. This commit also introduces a new config option called max-ref-names-in-subject-prefix, which controls how many such branch/reference names we include in the subject, to avoid making that subject too long. The full list of branches/references containing the annotated commit is also included, unabridged, at the start of the "Diff:" section. Change-Id: I61ef0c497862f1243d3435a429120d63a27e4b3b TN: S731-057	2022-01-04 07:54:02 +00:00
Joel Brobecker	f778d395eb	split_ref_name: New function, factorized out of AbstractUpdate class This commit factorizes some code out of the AbstractUpdate class into its own function, so that future code can use it. This is preparation work for a later change where we want to enhance commit emails sent for notes updates, where we want to include in the information provided in the email the list of references that contain the commit being annotated by each notes commit. We want to provide that information in a way that's as readable as possible, and this starts by spliting the reference name. Hence this factorization. One thing this refactorization made us realize is that Git currently does not allow a push on references whose name does not have what we call a namespace! This became apparent with this commit because it changes slightly the way the code is written, exposing the fact that this scenario wasn't covered by our testsuite. It was while trying to add coverage for that scenario that I realized that Git rejects such pushes. In the spirit of being flexible in what we accept, the code is left as is, rather than replaced by an error or an assertion. It is then validated via unit testing. Change-Id: I3108d540ac82c7353fd5e73a12cd958f0877245a TN: S731-057	2022-01-04 07:54:02 +00:00
Joel Brobecker	0e7b6ff7d5	Add support for new force-precommit-checks config option This commit add support for a new force-precommit-checks config option, which allows projects to request that, on branches of their choice, the precommit checks always be performed, even when the commits were already present in the repository via other references (pre-existing). Change-Id: Ie24099ee7804d3eeec7d8d333a09811f48e5bf32 TN: V103-002	2022-01-04 07:54:02 +00:00
Joel Brobecker	e5b28c6890	Add support for adding a warning banner when sending emails This warning banner is triggered by the presence of an environment variable to be set by the user prior to manually calling the post_receive hook. This banner is then inserted at the beginning of the email body in order to warn all readers of that email that the email was not automatically sent at the time of push, but rather at a later time. TN: UC02-038 Change-Id: I42dece88dd0619df33adcc44d7adfc3ccd63a162	2022-01-04 07:54:02 +00:00
Joel Brobecker	8821aec0fd	Force the use of quoted-printable as the Content-Transfer-Encoding This commit changes the hooks to always use the quoted-printable encoding when sending emails. This replaces the use of 7bit, 8bit and base64 encodings. A comment is added explaining the reasons for choosing this encoding. For the record, the expected output in our testsuite has been adjusted automatically using the following Python script... \| #! /usr/bin/env python3 \| \| import argparse \| \| def fixup(filename): \| with open(filename) as f: \| contents = f.read() \| \| new_contents = contents.replace( \| """\ \| remote: DEBUG: MIME-Version: 1.0 \| remote: Content-Transfer-Encoding: 7bit \| remote: Content-Type: text/plain; charset="utf-8" \| """, \| """\ \| remote: DEBUG: Content-Type: text/plain; charset="utf-8" \| remote: MIME-Version: 1.0 \| remote: Content-Transfer-Encoding: quoted-printable \| """, \| ) \| \| new_contents = new_contents.replace( \| "remote: Content-Transfer-Encoding: base64", \| "remote: Content-Transfer-Encoding: quoted-printable", \| ) \| \| if new_contents == contents: \| return \| \| with open(filename, "w") as f: \| f.write(new_contents) \| \| \| parser = argparse.ArgumentParser() \| parser.add_argument("files_list", nargs="+") \| args = parser.parse_args() \| \| for filename in args.files_list: \| fixup(filename) ... calling it as follow: $ cd testsuite/tests $ /path/to/myscript.py */run_test.py I think a perl oneliner was also possible, but I couldn't get it to work (multiline issue, I think). This still left one testcase that needed adjustement (post-receive_from_email); I just manually adjusted it. Change-Id: I8455d24f8fe0b9a732c68090a21091db060e03f2 TN: TB22-001	2022-01-04 07:54:02 +00:00
Joel Brobecker	2d7ef76b2a	Remove import of __future__.print_function in hooks/* Now that Python 3.x is required, this import is no longer useful. Note that this commit deliberately excludes the imports done in the testsuite, so as to allow these changes to be reviewed independently of the changes to be made in the testsuite. Change-Id: I28e1857df2cf0b2f9e7ddeab00b456d6ef513755 TN: U530-006	2021-11-30 17:58:55 +04:00
Joel Brobecker	0060fca7ea	guess_encoding: Remove Python-2.x-only code Change-Id: I97d56aa2f2638d6fd3b4d88a91afcb0840d5cebd TN: U530-006	2021-11-30 17:58:55 +04:00
Joel Brobecker	d554918d1c	class Email: Remove Python-2.x-only code Change-Id: Id8a37e459d4304a47f290111eb08c33d31c8f543 TN: U530-006	2021-11-30 17:58:55 +04:00
Joel Brobecker	5b164c12e0	Move search_config_option_list method to utils.py Other than pure convenience (and the fact that it allows callers to not have to pass the ref_name each time it is called), there was no real reason why this code needs to be defined as a method of the AbstractUpdate class. This commit moves the code to a function in the utils module, so as to allow it to be called from outside the AbstractUpdate class hierachry, something we'll want to do in an upcoming commit. We still keep the search_config_option_list method for the convenience it brings to the existing callers, but its implementation is simplified to call the new function in utils instead. Change-Id: Ie2e1f192da8522471c42910283a9c1f482daf6b0 TN: UA21-052	2021-11-30 17:41:15 +04:00
Joel Brobecker	23c0aea671	updates/sendmail.py: encode input and decode output when calling sendmail This is another preparation patch for the transition to Python 3.x. With Python 3.x, we need to make sure that the input used when calling sendmail is converted to a byte string. We also then need to make sure that the script's output is decoded into a string when printing it. Change-Id: I1b792638fb77c8d1b4ee2197b29b63922e0fe211 TN: U530-006	2021-10-06 11:27:20 -07:00
Joel Brobecker	f457d10a92	updates/emails.py: encode input and decode output when calling filer This is another preparation patch for the transition to Python 3.x. With Python 3.x, we need to make sure that the input used when calling the filer cmd is converted to a byte string. We also then need to make sure that the script's output is decoded into a string. Change-Id: I324410dd5c9b1e811252803b854d0f06ca65435d TN: U530-006	2021-10-06 11:27:20 -07:00
Joel Brobecker	010c33e913	encode input and decode output when calling hooks.mailinglist script This is another preparation patch for the transition to Python 3.x. The script's input needs to be encoded when called, and its output needs to be decoded into a string for us to process it. For input encoding, the same approach as for decoding is taken: In order to make progress towards Python 3.x support while at the same time preserving support for Python 2.x, we introduce a new function "encode_utf8" which only performs the encoding on Python 3.x. With Python 2.x, the function just returns the string unmodified. Change-Id: Ieb47d32c756405cdd0d300254e8cd7c8c3db50b5 TN: U530-006	2021-10-06 11:27:20 -07:00
Joel Brobecker	9c82498e7b	Introduce (the concept of) git command output decoding This commit is preparation work for the transition to Python 3.x, where the output obtained by running Git commands will become bytes as opposed to a string. In the vast majority of cases, we'll want to decode that output into a string. Ideally, we would want to do this in a way that is both compatible with Python 2.x and Python 3.x, but we have found that this requires a lot of work with many changes spread all over the code. So, instead, what this commit does is introduce the concept of decoding the output, but with the decoding only occurring when running under Python 3.x. That way, we can make progress towards Python 3.x while preserving the behavior under Python 2.x intact. Change-Id: I189577798ee96cba1fa55c7356babf102575642f TN: U530-006	2021-10-06 11:27:20 -07:00
Joel Brobecker	9a3a3e8b47	guess_encoding (Python 3.x): drop iso-8859-15 guess (in favor of UTF-8) This commit changes the guess_encoding function, in the Python 3.x case, to return UTF-8 instead of iso-8859-15 for strings with content which is compatible with iso-8859-15. The reason for this change is to standardize a little more towards UTF-8 as our encoding of choice when generating textual data. Using iso-8859-15 was not wrong as far as I can tell, but I believe the majority of users and applications have switched to UTF-8 now, so this commit simply follows that trend. This commit won't have any effect until we switch the testsuite over to Python 3.x, where some email header fields will end up being encoded using UTF-8 instead of iso-8859-15. For the moment, no visible change within the current testing, as it only supports being run with Python 2.x. TN: U530-006 Change-Id: I6b008cd5c2e12a4dbb97a567fb35a76f40e9782a	2021-10-04 08:16:01 -07:00
Joel Brobecker	f18c10b78c	stop bypassing the updates.sendmail module during testsuite runs The goal of this commit is to include the updates.sendmail module in our testing strategy, in order to make sure that the hooks are passing email data down to the sendmail program without issues. This will become particularly important when we switch over to using Python 3.x, because of the strong distinction between bytes and strings with newer versions of Python which can cause a lot problems. Hence the need to use this code during our testing. The main strategy introduced by this commit to achieve this is fairly simple: The testsuite framework introduces a new minimal script to be called in place of the standard sendmail. A new environment variable called GIT_HOOKS_SENDMAIL is introduced allowing the testsuite to tell the hooks to use its own (fake) sendmail instead of the system one. With that in place, the old code bypassing the use of updates.sendmail can be removed, thus allowing the testsuite to include it as part of the testing. The testsuite's (fake) sendmail script was written in a way to mimick the old bypassing code, so there is no change in output. Parallel to that, the hooks are enhanced to check that we can indeed find sendmail, and otherwise return immediately with an error if not. This way, we avoid emails silently being dropped due to the missing sendmail. A couple of testcases are also added to double-check some specific error situations. Note that I tried to think of ways to split this patch into smaller individual parts, but couldn't really find a way to do so in a meaningful way, while at the same time producing a commit where the coverage report stays clean (0 lines missed). TN: U530-006 (transition to Python 3.x) TN: U924-032 (test for sendmail not found) TN: U924-034 (test for sendmail override when in testsuite mode) Change-Id: I74b993592ec6d701347bbca5283a42e037411f1c	2021-09-24 17:41:10 -07:00
Joel Brobecker	26ee444039	sendmail.py: Remove fallback on smtplib The implementation of this module was originally inherited from gnatpython, where it was trying first to call sendmail, and if not available, then fallback on using Python's smtplib instead. This commit removes support for using smtplib, and instead assumes that sendmail is always available. The reasons for this change are two-fold: - For all the users of these scripts I know of, sendmail is always available, so we haven't really used the smtplib fallback. - While this code is currently excluded during testing (to avoid sending emails while running the testsuite), I'd like to enhance our testing strategy to start including this code as part of the testing. In particular, one thing we can do is for the testuite to eventually provide its own version of a sendmail program that would dump the traces to stdout rather than actually send an email. On the other hand, if we were to keep smtplib support as a fallback, I do not see how we could test that part without actually having it send email, something we absolutely do not want. This is related to the effort of moving to Python 3.x, where Python now makes a strong distinction between bytes and strings when passing data between processes. With Python 3.x, it's much more important to always test that data is passed correctly. TN: U530-006 Change-Id: Ic2153be62a80906dce709fb3d622e1194ca7c869	2021-09-24 09:09:37 -07:00
Joel Brobecker	cbbf70fd11	Add coverage pragma for Python2-only block of code in updates/emails.py This pragma allows us to exclude this block when doing coverage analysis when testing the git-hooks using a Python 3.x interpreter. Change-Id: Id2f61c2a1cbf965c93693771b6dcb9d55d6a2708 TN: U530-006	2021-08-22 07:26:54 -07:00
Joel Brobecker	fd889438a9	Add support for unicode strings to the "guess_encoding" function This is another commit to prepare for the transition to Python 3.x, where text will be converted early to unicode strings, instead of being kept as byte strings. When passed to "guess_encoding" in Python 3.x, unicode strings don't have a "decode" method, as the strings are already decoded. As a result, the current implementation always returns None (no encoding found), because we get an exception calling the non-existent method, promptly trapped and wrongly interpreted as being a decoding error. To prepare the transition to Python 3.x, this commit adds a check to see if we have a byte-string. If we do, then do the same as before. Otherwise, we must have a unicode string, and so check the encodings by trying to encoding rather than decode the string. TN: U530-006 Change-Id: I50cf689fec8c205a6e48b42fac3a95a6bb9886b4	2021-08-20 15:32:22 -07:00
Joel Brobecker	61a8e83e52	emails.py: Mark a couple of code blocks as being Python-2.x only This commit adds a couple of "# pragma: py2-only" comments to a couple of code blocks which are only expected to be run when the hooks are tested with Python 2.x (these two code blocks are conditioned on the version of Python being less than 3). This will help us manage the transition to Python 3.x until we are able to drop support for Python 2.x. That way, we can run coverage analysis with both Python 2.x and Python 3.x, and get the Python 3.x coverage analyzer to ignore those blocks we know we cannot cover with Python 3.x. Once the transition to Python 3.x is over, we will remove those code blocks. TN: U530-006 Change-Id: I44f1cd883c3fdf4e487e1e553158517e721416df	2021-07-17 17:03:22 -07:00
Joel Brobecker	84a3fd671e	Remove support for $HOME/.no_cvs_check file Support for this specific file is purely AdaCore-specific and historical. Since then, we have introduced (much!) better ways to support users who want to suppress checks for a given commit. We know this feature hasn't been used for many years, so it is time to remove support for it. Note that it wasn't even documented. TN: U627-004 Change-Id: Ie67532158abc1d302a3d98e57835f15dadfe0817	2021-06-30 07:23:07 -07:00
Joel Brobecker	8de30044a4	.pre-commit-config.yaml: Update to black version 21.5b1 This commit updates the pre-commit hook to black version 21.5b1. The hooks where then re-run on all files to update their formatting to this new version of black. Change-Id: Ib0866745ef8432cf93380a4d83fa23a479eb4a49	2021-06-15 05:52:32 -07:00
Joel Brobecker	eaef13cfb5	simplify "diff" section computation in commit emails This commit is inspired by the fact that I couldn't understand why I was skipping the first character from the output of a Git command whe computing a commit's "diff", like so): diff = git.show(commit.rev, [...], pretty="format:\|")[1:] (emphasis on the "[1:]" at the end). To understand, I remove the subscripting and reran the testsuite without it to see what failures I would get. This gave me the answer, which is we were intentionally starting the "format:" string with a "\|", and so we needed to strip that extra character. That's when I found a comment I wrote; I didn't see it at first because it was placed further up: # For the diff, there is one subtlelty: # Git commands calls strip on the output, which is usually # a good thing, but not in the case of the diff output. # Prevent this from happening by putting an artificial # character at the start of the format string, and then # by stripping it from the output. This may have made sense back when I wrote that comment, but we no longer strip the start of the output anymore (see commit `af06d5ea54`). So I decided to simplify the code by removing the extraneous character in the "format:" string. As it happens, this revealed that git behaves slightly differently when given an empty "format:" string. Before: \| $ git show -p -M --stat --pretty="format:\|" HEAD \| \|--- \| hooks/git.py \| 4 +--- \| 1 file changed, 1 insertion(+), 3 deletions(-) \| \| diff --git a/hooks/git.py b/hooks/git.py \| index fe2b36b..0669111 100644 \| [snip] After (removing the "\|" in the "format:" string): \| $ git show -p -M --stat --pretty="format:" HEAD \| hooks/git.py \| 4 +--- \| 1 file changed, 1 insertion(+), 3 deletions(-) \| \| diff --git a/hooks/git.py b/hooks/git.py \| [snip] What we can see is that the "---" separate line is no longer shown in the second command. Rather than forcing Git to print it, or rather than staying with the existing code, this commit simply hardcodes the separator line. One minor bonus of doing it this way is that, if Git decides to change that separator, this won't affect us, and thus we won't have to change hundreds of tests accordingly. And by doing so, this revealed that there was actually an inconsistency in the formatting produced by Git: In some cases (e.g. merge commits), it became apparent that Git was omitting this "---" separator line, even when the "format:" string was empty. The corresponding testcases where the inconsistency showed up were adjusted to match the new behavior, which is consisdered (slightly) better, because more consistent. Found while working on U530-006 (transition to Python 3.x). Change-Id: Ifc473fa471ba618e11c3c4bcc6d83cc6f82fc6bf	2021-06-05 17:32:18 -07:00
Joel Brobecker	d50cc11868	Use iso-8859-15 instead of iso-8859-1 when guessing the encoding When looking at the 8 code points that are different between iso-8859-1 and iso-8859-15, it seems like the iso-8859-15 has some characters might be more widely used than the ones in iso-8859-1 (some standalone accents, some fractions, a generic "currency sign"). This commit therefore changes the hooks to use iso-8859-15 instead of iso-8859-1. Note that this doesn't change the fact that UTF-8 is still the first-choice encoding we try. Change-Id: I36092552dc647935269b1f0f6b401d198e1a7bd6 TN: U528-040	2021-05-28 14:48:49 -07:00
Joel Brobecker	fe24b59970	always send all emails using a UTF-8 charset This commit simplifies the choice of the charset being used to send our emails to just using UTF-8. This makes all emails consistently using that charset, and in particular follows something we were already doing when the message body was found to be in unicode format (this happens when the message body comes from calling one of the project's hooks). The expectation is that this preliminary change will facilitate the transition to Python 3, where strings are unicode. Change-Id: I0e44baf460dd99a2505d94671ac6042304addfd2 TN: TB22-002	2021-05-03 05:14:07 -07:00

1 2 3 4 5 ...

223 Commits