ansaurus

Question

Answer 1

+2 A:

Byte strings like "±" (in Python 2.x) are encoded in the source file's encoding, which might not be what you want. If col2 is really a Unicode object, you should use u"±" instead like you already tried. You might know that somestring.index raises an exception if it doesn't find an occurrence whereas somestring.find returns -1. Therefore, this

    if col2.index('±'):
        col2=col2[:col2.index('±')] # this is not indented correctly in the question BTW
        print(col2.encode("utf-8"))

should be

    if u'±' in col2:
        col2=col2[:col2.index(u'±')]
        print(col2.encode("utf-8"))

so that the if statement doesn't lead to an exception.

AndiDog 2010-10-07 20:38:28

you can also useif u'±' in col2: col2 = ...

2010-10-08 03:01:01

@user237182: Don't know why I didn't see that. Changing it in my answer.

AndiDog 2010-10-08 11:05:26

Thankyou so much. This did the trick. Furthermore i have corrected some of all the awfull faults (non correct indentati etc.). This was really nice.

Daniel 2010-10-08 12:38:52

ansaurus

tags:

views:

answers:

Special character use in Python 2.6

related questions