I have a project versioned with Git that I'd like to make open source, but it has some private information in it that is specific to the environment in which it was originally used. I'm going to change the information in question to load from a config file which is not included in the repository. I realize I should have done this in the first place, but since the private information still exists in previous commits, how can I go about removing it from my history? Do I just have to start a new repository based on the latest commit and lose all my history or is there a way to salvage the current repository while removing any record of the private information?

Edit: To clarify, I don't want to completely remove the files that contain this private information, because they are still used. Rather, I want to remove/blank out/change the occurrence of certain strings within them.

+2  A: 

I wrote a script for this a little while ago. You can find it here: http://dound.com/2009/04/git-forever-remove-files-or-folders-from-history/

The script builds on the git-filter-branch tool, which ships with Git. If you're curious, you can read more about removing files from a Git repo here, but the script from the link above should be easy to use and all you really need to remove that private information.

David Underhill
This looks like a great tool but I'm not sure it will work in my case. I should've been more clear in my question, but what I want to remove is the occurrence of certain strings - I don't want to delete entire files altogether because the files are still used.
Jimmy Cuadra
Oh, I see. That is a bit trickier. If you entered these private strings in commits which don't contain anything else that you want to keep, then you can use git-filter-branch to remove just those commits (without deleting the files). My script can't do that for you, but if you check out the man page for git-filter-branch I think you'll see how you can use it to remove individual commits too.
David Underhill
Also, you should be able to use git-filter-branch to apply a custom filter (a script) over your files in every commit. This sounds like it might be a bit harder than simply removing a file or commit, but it should do what you want (and sounds better than restarting your repository and losing all your history when you release it to the public).
David Underhill
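For what it's worth, here is a rough sketch of what such a custom filter script might look like. It is not the script linked above, just an illustration: the file name scrub.py, the function scrub_tree, and the example strings are all made up, and it assumes the private strings can be replaced byte-for-byte wherever they appear.

```python
import os
import sys


def scrub_tree(root, replacements):
    """Replace each key of `replacements` with its value in every file under root.

    Keys and values are bytes, so files are rewritten without any decoding.
    Returns the sorted list of paths that were actually changed.
    """
    changed = []
    for dirpath, dirnames, filenames in os.walk(root):
        # Never descend into Git's own metadata directory.
        dirnames[:] = [d for d in dirnames if d != ".git"]
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.islink(path):
                continue  # leave symlinks alone
            with open(path, "rb") as fh:
                original = fh.read()
            scrubbed = original
            for old, new in replacements.items():
                scrubbed = scrubbed.replace(old, new)
            if scrubbed != original:
                with open(path, "wb") as fh:
                    fh.write(scrubbed)
                changed.append(path)
    return sorted(changed)


if __name__ == "__main__" and len(sys.argv) > 1:
    # Hypothetical command line: scrub.py OLD NEW [OLD NEW ...]
    args = [a.encode() for a in sys.argv[1:]]
    if len(args) % 2:
        sys.exit("usage: scrub.py OLD NEW [OLD NEW ...]")
    scrub_tree(".", dict(zip(args[0::2], args[1::2])))
```

You would then run it over every commit with something like git filter-branch --tree-filter 'python3 /absolute/path/to/scrub.py s3cr3t REDACTED' -- --all. The path should be absolute because the filter runs inside a temporary checkout, not your working directory. Try it on a fresh clone first, since it rewrites every commit, and keep in mind that the old objects stay in the original repository until the reflog expires and they are garbage collected.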