ansaurus

Question

Python fetching <title>

Answer 1

+1 A:

Here You will find some libs for html/xml parsing. Choice depends on what You need and what fits Your needs.

http://blog.ianbicking.org/2008/03/30/python-html-parser-performance/

Rafal Ziolkowski 2009-11-02 09:53:58

Answer 2

A:

Use Beautiful Soup.

html = urllib2.urlopen("...").read()
from BeautifulSoup import BeautifulSoup
soup = BeautifulSoup(html)
print soup.title.string

orip 2009-11-02 09:54:09

Answer 3

+1 A:

Try Beautiful Soup:

url = 'http://www.example.com'
response = urllib2.urlopen(url)
html = response.read()

soup = BeautifulSoup(html)
title = soup.html.head.title
print title.contents

Dominic Rodger 2009-11-02 09:55:06

Answer 4

+5 A:

Yes I would recommend BeautifulSoup

If you're getting the title it's simply:

soup = BeautifulSoup(html)
myTitle = soup.html.head.title

or

myTitle = soup('title')

Taken from the documentation

It's very robust and will parse the html no matter how messy it is.

RobbR 2009-11-02 09:55:11

ansaurus

tags:

views:

answers:

Python fetching <title>

related questions