The Use Case
Howdy! So, I ran into an interesting problem this week. In order to test something, I had to make a change to code that should NEVER be commit... then, I commit it. D'oh!
If you want to protect yourself from something similar, or prevent data leakage, read on.
The Setup
When you clone or initialize a git project, git creates a "./.git" folder for you. This is where it tracks information about the repository, and stores some git metadata and logs.
Another thing this folder contains are "hooks", which are run at some point during the git workflow. For instance, if we had a "pre-push" hook, it would run before the push goes up to the remote.
So, what can we do with these hooks? Anything that BASH lets us. The most common use case for hooks is stopping a commit if something isn't right with the project, which is what we'll be doing in our case. However, you could do things like:
- Print out the total number of lines changed
- Check for a string and just give warnings (i,e mispellings, swear words)
- Queue a local or remote backup
- Package the project
Really, anything.
Protecting Ourselves
We would like to not commit if a string is present in the file. Imagine if we wanted to search for //[DANGER]. The presence of [DANGER] in any tracked file would stop the commit, so you could remember to go back and change the reason for it being there.
To do this, open the pre-commit.sample file and save it as pre-commit (with no extension). If there isn't a sample file, simply make the pre-commit file.
Before we write any code, let's look at what we want:
- If a string is present ANYWHERE in the directory, fail
Alternatively, in large code bases, we want:
- Search the diff for newly added instances of the string
Since searching through every file is going to take too long to do on every commit.
The Steps
Make the File
You should now have a file called ./.git/hooks/pre-commit. You make need to mark the pre-commit file as executable with chmod + x pre-commit.
Add this code to the file
printf "[Check]: Searching for commit mollyguard: '%s'..." "$breakingString"
sleep 1
# VVVVVVV YOUR REGEX RIGHT THERE
if [ $(git diff --word-diff --cached --diff-filter=ACUM -U0 | grep -E "+*\[DANGER\]" | wc -l) -gt 0 ]
then
printf "\n"
cat <<\EOF
Error: Trying to commit a tree that contains uncommitable changes
Search the projects for $breakingString and figure out why the comment is there
EOF
exit 1;
fi
printf "Good!\n"
What this code is doing:
- Look at the last diff
- Search the diff for "+{string}" (because + means it was ADDED, not removed in the last commit)
- Count the lines
If the lines are greater than 0, and instance was found, so we shouldn't commit this branch.
Of course, this is searching just the latest commit, because searching a large software project is going to take a while. If you're working on a smaller project (or on a very fast machine), you can replace the if statement with simply
if [ $(grep -R "\[DANGER\]"../../ | wc- l) > 0 ]