
Question

Answer 1

+1 A:

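Access the `start` and `end` attributes of the caught exception object. `start` is the index of the first offending byte, and `end` is the index just past the last one.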

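u = u'áiuê©'
try:
  l = u.encode('latin-1')  # Latin-1 bytes, not valid UTF-8
  print(repr(l))
  l.decode('utf-8')  # raises UnicodeDecodeError
except UnicodeDecodeError as e:
  print(e)
  print("%d %d" % (e.start, e.end))  # byte offsets of the invalid input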
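
Besides `start` and `end`, the exception also carries the original input in `object` and a short explanation in `reason`, so the offending bytes can be sliced out directly. A minimal sketch, assuming Python 3; the helper name `describe_decode_error` and the sample input are made up for illustration:

```python
def describe_decode_error(data, encoding="utf-8"):
    """Return None if data decodes cleanly, else (start, end, bad_bytes, reason)."""
    try:
        data.decode(encoding)
    except UnicodeDecodeError as e:
        # e.object is the bytes being decoded; e.end is exclusive,
        # so this slice contains exactly the bytes the codec rejected.
        return e.start, e.end, e.object[e.start:e.end], e.reason
    return None


# Hypothetical input: 0xff can never appear in valid UTF-8.
result = describe_decode_error(b"abc\xffdef")
print(result)
```

Returning the values instead of printing them keeps the `try` clause down to the single `decode()` call.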

Ignacio Vazquez-Abrams 2010-10-20 18:22:37

Beat me by 9 seconds. :-)

Omnifarious 2010-10-20 18:23:19

Why so much code in the try clause? Why not have only the `decode()` call there?

EOL 2010-10-20 19:49:18

Answer 2

+2 A:

try:
    line = line.decode('gb18030')
except UnicodeDecodeError as e:
    # e.end is exclusive, so the last offending byte is at e.end - 1
    print("Error in bytes %d through %d" % (e.start, e.end - 1))
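
When the input has many lines, the same idea can be applied line by line to record where each failure occurs. A rough sketch, assuming Python 3 and a `bytes` buffer; the function name `find_undecodable_lines` and the sample data are hypothetical, and the sample uses UTF-8 rather than GB18030 only to keep it easy to check by hand:

```python
def find_undecodable_lines(raw, encoding="gb18030"):
    """Return (line_number, first_bad_index, last_bad_index) for each bad line."""
    problems = []
    for lineno, line in enumerate(raw.splitlines(), 1):
        try:
            line.decode(encoding)
        except UnicodeDecodeError as e:
            # Indices are relative to the start of the line;
            # e.end is exclusive, so subtract 1 for an inclusive range.
            problems.append((lineno, e.start, e.end - 1))
    return problems


# Hypothetical sample: the second line contains a byte that is invalid in UTF-8.
sample = b"ok\nbad\xffline\nfine"
print(find_undecodable_lines(sample, encoding="utf-8"))
```

Only the first error on each line is reported, because decoding stops at the first failure.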

Omnifarious 2010-10-20 18:22:46

Reference encoding error byte in Python