ansaurus

Question

Writing unicode strings via sys.stdout in Python

Answer 1

+2 A:

It's not clear to my why you wouldn't be able to do print; but assuming so, yes, the approach looks right to me.

Martin v. Löwis 2009-09-24 19:40:07

One reason I cannot use `print` is to avoid that extra space `print` prints. Look at the use of `sys.stdout` here: http://stackoverflow.com/questions/1396820/apt-like-column-output-python-library/1397382#1397382

Sridhar Ratnakumar 2009-09-24 19:52:44

You could build up complete lines, and then print them.

Martin v. Löwis 2009-09-24 20:04:01

Bravo! Yes, in that case I can use `print`

Sridhar Ratnakumar 2009-09-24 20:13:57

adding a comma to the end makes print suppress the newline: print "Some Text",

Georg 2009-09-24 20:17:44

adding a comma will not print a newline, but it will print an extra space. try running: python -c "print 2,; print 3,"

Sridhar Ratnakumar 2009-09-24 20:20:15

Martin, even using `print` did not help when piping the output to `less`. logging.StreamHandler works fine though.

Sridhar Ratnakumar 2009-09-24 20:25:47

If the output is to a pipe, it can't possibly know what encoding to use (as it can't know that less(1) is at the other end of the pipe). So your application will have to determine/decide the encoding for itself.

Martin v. Löwis 2009-09-24 20:44:45

In Python 3 you can do `print(stuff, sep='', end='')` to avoid extra spaces. And I suspect the encoding problem isn't present there either.

ilya n. 2009-09-26 15:26:27

Answer 2

+2 A:

Best idea is to check if you are directly connected to a terminal. If you are, use the terminal's encoding. Otherwise, use system preferred encoding.

if sys.stdout.isatty():
    default_encoding = sys.stdout.encoding
else:
    default_encoding = locale.getpreferredencoding()

It's also very important to always allow the user specify whichever encoding she wants. Usually I make it a command-line option (like -e ENCODING), and parse it with the optparse module.

Another good thing is to not overwrite sys.stdout with an automatic encoder. Create your encoder and use it, but leave sys.stdout alone. You could import 3rd party libraries that write encoded bytestrings directly to sys.stdout.

nosklo 2009-09-25 02:55:36

Answer 3

+1 A:

There is an optional environment variable "PYTHONIOENCODING" which may be set to a desired default encoding. It would be one way of grabbing the user-desired encoding in a way consistent with all of Python. It is buried in the Python manual here.

daveagp 2010-10-26 20:50:59

ansaurus

tags:

views:

answers:

Writing unicode strings via sys.stdout in Python

related questions