RegExp: Extract URLs (without HTML links)

tags:

regex
url

views:

1137

answers:

RegExp: Extract URLs (without HTML links)

Hello!

I want to extract links of a text using RegExp. There is no HTML code in the text so I can't search for the tags "<a...". How can I find the links, though?

Example:

"Please go to http://www.example.org/page1.html and click on ..."

I want to extract the text:

"http://www.example.org/page1.html"

As far as I know, a URL can contain the following characters:

a-z A-Z 0-9 /.#?=&+,@-_~

I hope you can help me. Thanks in advance!

The URL must start with "http" and before that, the must be a space or the beginning of the text.

+3 A:

Already answered:

http://stackoverflow.com/questions/6173/regular-expression-for-parsing-links-from-a-webpage

Codebrain 2009-04-12 12:26:19

Thanks! I used the search but I didn't find that. The accepted answer doesn't help me. But Jeff Atwood's answer does:"\b(https?|ftp|file)://[-A-Z0-9+

2009-04-12 12:46:35

Do you mean it returns the dot as part of the matched text? I doesn't do that when I try it.

Alan Moore 2009-04-12 15:30:44

Yes, it does. Try "Go to http://www.example.org." and you'll see.

2009-04-12 16:41:29

related questions

Passing a commented, multi-line (freespace) regex to preg_match

My regex is matching too much. How do I make it stop?

Using Regex to generate Strings rather than match them

Complexity of Regex substitution

What is the most brilliant regex you've ever used?

RFC calculation in Java need help with algorithm

What did I do wrong here? [Javascript Regex]

How do you use back-references to PCREs in PHP?

Need help writing a regex statement. [PHP]

Regex and unicode

Python Regular Expressions

Question about specific regular expression

Pre-built regular expression patterns or Regex Libraries?

Parsing attributes with regex in Perl

Regex Rejecting matches because of Instr

How do I bind a regular expression to a key combination in emacs?

How do you retrieve selected text using Regex in C#?

Remove Quotes and Commas from a String in MySQL

Regular expression for parsing links from a webpage?

What are good regular expressions?

Why is this regular expression faster?

Learning Regular Expressions

How far should one take e-mail address validation?

How can I get at the matches when using preg_replace in PHP?

Regex: To pull out a sub-string between two tags in a string

ansaurus

tags:

views:

answers:

RegExp: Extract URLs (without HTML links)

related questions