ansaurus

Question

Answer 1

+1 A:

you forgot to resolve the hostname:

addr = socket.gethostbyname(url[1])
...
sock.connect((addr,80))

Piotr Lesnicki 2008-11-19 23:08:52

Answer 2

+3 A:

sock.connect((url[0] + '://' + url[1],80))

Do not do that, instead do this:

sock.connect((url[1], 80))

connect expects a hostname, not a URL.

Actually, you should probably use something higher-level than sockets to do HTTP. Maybe httplib.

ddaa 2008-11-19 23:09:00

I've tried that too. It gives me Access Denied errors everywhere.

The.Anti.9 2008-11-19 23:15:57

Answer 3

+3 A:

Please please please please please please please don't do this.

urllib and urllib2 are your friends.

Read the "missing" urllib2 manual if you are having trouble with it.

Ali A 2008-11-20 00:02:29

Answer 4

+2 A:

Have you ever altered your Hosts file? If it has an entry for Reddit but not much else, that might explain that site's unique result.

2009-03-06 16:32:59

Answer 5

A:

Use urllib2. Or BeautifulSoup.

2009-03-06 16:44:05

(Python) socket.gaierror on every addres...except http://www.reddit.com?