ansaurus

Question

Regex to match url end-of-line or "/" character

Answer 1

+9 A:

/(.+)/(\d{4}-\d{2}-\d{2})-(\d+)(/.*)?$

Adam Tegen 2008-10-06 16:49:57

Answer 2

+2 A:

To match either / or end of content, use (/|\z)

This only applies if you are not using multi-line matching (i.e. you're matching a single URL, not a newline-delimited list of URLs).

To put that with an updated version of what you had:

/(\S+?)/(\d{4}-\d{2}-\d{2})-(\d+)(/|\z)

Note that I've changed the start to be a non-greedy match for non-whitespace ( \S+? ) rather than matching anything and everything ( .* )

Peter Boughton 2008-10-06 17:05:11

Answer 3

+1 A:

I can't really comment on the regex itself, but I wanted to let you know about a very useful tool, Expresso, which will help you construct and test your regexes before you try and code for them.

Ian Jacobs 2008-10-06 18:07:48

Answer 4

+7 A:

You've got a couple regexes now which will do what you want, so that's adequately covered. What hasn't been mentioned is why your attempt won't work: Inside a character class, $ (as well as ^, ., and /) has no special meaning, so [/$] matches either a literal / or a literal $ rather than terminating the regex (/) or matching end-of-line ($).

Dave Sherohman 2008-10-06 20:31:30

This is something frequently forgotten and not mentioned eneough in the regex docs.

Steve Dunn 2009-04-03 09:21:43

ansaurus

tags:

views:

answers:

Regex to match url end-of-line or "/" character

related questions