ansaurus

Question

Python and "re"

Answer 1

+1 A:

Are you using re.match() or re.search()? My understanding is that re.match() behaves as if there were a "^" at the beginning of your expression, so it only tries to match at the beginning of the text. re.search() acts more like Perl regular expressions: it scans the whole text and is restricted to the beginning only if you include a "^" at the start of your expression. Hope that helps.
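A minimal sketch of that difference, using only the standard re module. The sample strings and variable names are made up for illustration. It also shows one place where the "implicit ^" description stops being exact: with re.MULTILINE, a "^" in re.search() can match after a newline, while re.match() still only tries the very start of the string.

```python
import re

text = "   url"

# re.match only tries the pattern at position 0, as if it began with "^".
at_start = re.match("url", text)

# re.search scans forward and reports the first position that matches.
anywhere = re.search("url", text)

# Adding "^" to a search pattern restricts it to the start, like re.match.
anchored = re.search("^url", text)

# With re.MULTILINE, "^" in re.search also matches right after a newline,
# while re.match still only tries the very start of the string.
lines = "first\nurl"
multiline_search = re.search("^url", lines, re.MULTILINE)
multiline_match = re.match("url", lines, re.MULTILINE)

print(at_start, anywhere.span(), anchored)
print(multiline_search.span(), multiline_match)
```

If the pattern has to be found anywhere in the text, re.search() is probably the call you want.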

2008-09-16 13:52:02

Answer 2

+16 A:

In Python, there's a distinction between "match" and "search"; match only looks for the pattern at the start of the string, and search looks for the pattern starting at any location within the string.
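To see the distinction side by side, here is a small sketch. The helper name describe and the sample strings are hypothetical, chosen only for this illustration; the re calls themselves are from the standard library.

```python
import re

def describe(pattern, text):
    """Report where `pattern` is found in `text` by each method (hypothetical helper)."""
    m = re.match(pattern, text)    # only tries the start of the string
    s = re.search(pattern, text)   # tries every starting position in turn
    return {
        "match": m.span() if m else None,
        "search": s.span() if s else None,
    }

# Pattern at the start of the string: both methods find it.
print(describe("url", "url at the start"))

# Pattern in the middle of the string: only search finds it.
print(describe("url", "see url here"))
```

Both calls return None when nothing is found, so test the result before calling span() or group() on it.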

Python regex docs
Matching vs searching

zweiterlinde 2008-09-16 13:53:03

Answer 3

+3 A:

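>>> import re
>>> pattern = re.compile("url")
>>> string = "   url"
>>> pattern.match(string)   # prints nothing: match() returns None because the string starts with spaces
>>> pattern.search(string)  # search() finds "url" at offset 3
<_sre.SRE_Match object at 0xb7f7a6e8>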

Aaron Maenpaa 2008-09-16 13:54:56

Answer 4

+1 A:

You are probably being tripped up by the different methods re.search and re.match.

mmaibaum 2008-09-16 13:56:13

Answer 5

+4 A:

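# BeautifulSoup 4 import; the BeautifulSoup 3 spelling was `from BeautifulSoup import BeautifulSoup`
from bs4 import BeautifulSoup

# your_html is a placeholder for the HTML text you want to parse
soup = BeautifulSoup(your_html, "html.parser")
for a in soup.find_all('a', href=True):
    # do something with `a` w/ href attribute
    print(a['href'])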

J.F. Sebastian 2008-09-16 22:44:42
