According to this answer: http://stackoverflow.com/questions/1020892/python-urllib2-read-to-unicode
I have to read the charset from the Content-Type header in order to decode the response to unicode. However, some websites don't include a "charset" parameter.
For example, the ['content-type'] header for this page is just "text/html": http://bit.ly/6IcCtf/ so I can't convert it to unicode. Here is what I tried:
encoding=urlResponse.headers['content-type'].split('charset=')[-1]
htmlSource = unicode(htmlSource, encoding)
It fails with:
TypeError: 'int' object is not callable
Is there a default encoding (for English pages, of course) that I can fall back to when no charset is found?
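
To make the question concrete, the fallback I have in mind could be sketched roughly as below. This is only a sketch under assumptions: the helper names charset_from_content_type and decode_body are hypothetical, and ISO-8859-1 as the default is an assumption taken from RFC 2616, which names it as the default for text/* responses that carry no charset. A page may still declare a different encoding in a meta tag, which this sketch does not look at.

```python
def charset_from_content_type(content_type, default='iso-8859-1'):
    """Return the charset named in a Content-Type value, or `default`.

    Hypothetical helper; the default is an assumption (RFC 2616, text/*).
    """
    # Parameters follow the media type, separated by semicolons.
    for param in content_type.split(';')[1:]:
        name, sep, value = param.strip().partition('=')
        if sep and name.strip().lower() == 'charset':
            # Drop optional quotes around the value.
            value = value.strip().strip('"\'')
            if value:
                return value
    return default


def decode_body(raw_bytes, content_type):
    """Decode a response body, falling back to ISO-8859-1 on failure."""
    encoding = charset_from_content_type(content_type)
    try:
        return raw_bytes.decode(encoding)
    except (LookupError, UnicodeDecodeError):
        # Unknown or wrong charset: ISO-8859-1 maps every byte to a
        # character, so this decode cannot fail (it may be inaccurate).
        return raw_bytes.decode('iso-8859-1')
```

With urllib2 this would be called as decode_body(urlResponse.read(), urlResponse.headers['content-type']), assuming the header is present.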