We've got a large amount of static that HTML has links like e.g.
<a href="link.html#glossary">Link</a>
However some of them contain spaces in the anchor e.g.
<a href="link.html#this is the glossary">Link</a>
Any ideas on what kind of regular expression I'd need to use to find the Spaces after the # and replace them with a - or _
Update: Just need to find them using TextMate, hence no need for a HTML parsing lib.