ansaurus

Question

Problem when using python logging in django and unicode

Answer 1

A:

I don't understand what it is you don't understand, if you see what I mean. Your middle paragraph:

So, my understanding is that the list starts to generate itself and does repr() on all its elements and they return their values - in this case it should be 's2 | ÅÄÖÖ', then the list presents itself as (ascii, the-stuff-in-the-list) and then when trying to Decode the ascii into unicode this will of course not work -- since one of the elements in the list has returened a u'...' of itself when repr was done on it.

explains exactly what is going on - outputting a list isn't the same as printing all its elements, because under the hood all it does is call repr() on each element in the list. Rather than outputting the raw list, you could log a list comprehension which calls unicode on each element, which would fix it.

Daniel Roseman 2010-01-21 20:10:10

yep - but I am sort of lazy and was hoping do be able to just output anything; lists, fields, .. and I have lots of code with warning/info/debug/error do go over and need to look at each statement and adapt if this is the case. Still if that is what it take then ...

jenlu 2010-01-21 22:29:01

Yes, there is another thing, first I simply tried to do:logging.debug('new groups %s' % list_of_groups) # WORKS SUDDENLYi.e. remove the u' and then lists/sets/dictionary all start to log great but then all my other statements where I log a field directly starts to fail with same error. So then e.g. would not work anymore:logging.debug('new group with name %s' % group.name) # FAILS

jenlu 2010-01-21 22:42:40

Answer 2

A:

I can't reproduce your problem with a simple test:

Python 2.6.4 (r264:75706, Dec  7 2009, 18:45:15) 
[GCC 4.4.1] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import logging
>>> group = u'Luleå'
>>> logging.warning('Group: %s', group)
WARNING:root:Group: Luleå
>>> logging.warning(u'Group: %s', group)
WARNING:root:Group: Luleå
>>>

So, as Daniel says, there is probably something which is not proper Unicode in what you're passing to logging.

Also, I don't know what handlers you're using, but make sure if there are file handlers that you explicitly specify the output encoding to use, and if there are stream handlers you also wrap any output stream which needs it with an encoding wrapper such as is provided by the codecs module (and pass the wrapped stream to logging).

Vinay Sajip 2010-01-21 20:18:49

yes, that works as you state. But my data is stored in django models and I work on those instances and when printing a group (i.e. instance of model class Group) with name, ... and lots of other field I just do print model -- this then calls the __unicode__ method of group to print or log itself. And this always works on instances. But when I have a list/set/dict of such instances and output them in debug.logging things crash if one of the instances returns unicode. Still, if there is not other way around this I will go over all my logging statements and adapt them :-(

jenlu 2010-01-21 22:34:38

The key to me here is that when I do debug.loggin('%s' % list_of_groups), then the list when 'gathering itself' states that it is ascii, and then adds the individual elements given from repr(). Have tried to call repr() on individual elements of the list and then I get u'Luleå' and so on. If only the list could when 'gathering itself' figure out that it is not in ascii then it would not try and Decode itself into Unicode, since it already is, and I would not be having this problem...

jenlu 2010-01-21 22:38:42

BTW - great with input from all of you !!!

jenlu 2010-01-21 22:39:16

Answer 3

A:

I ended following advice as answered and going over all code and doing list comprehension or similar when trying to log a set/list/dict/django queryset. So adapting and adding things like this solved it for me:

logging.debug(u"new groups: %s" % [unicode(g) for g in list_of_groups])

So now all I have to do is remember never ever to forget to do this ;-)

jenlu 2010-01-22 09:36:30

Answer 4

+1 A:

Try to use this code in the top of your views.py

#-*- coding: utf-8 -*-
...

Malcom.Z 2010-05-06 07:30:10

Answer 5

A:

have you tried manually making any result unicode?

logging.debug(u'new groups %s' % unicode(list_of_groups("UTF-8"))

Thomas 2010-08-11 06:42:54

Answer 6

A:

Here's my test code: #-- coding: utf-8 --
class Wrap:
def init(self, s): self.s = s def repr(self): return repr(self.s)
def unicode(self): return unicode(self.s) def str(self): return str(self.s)

s = 'hello'  # a plaintext string
u = 'ÅÄÖÖ'.decode('utf-8') 
l = [s,u]
test0 = unicode(repr(l))
test1 = 'string %s' % l
test2 = u'unicode %s' % l

The above works fine when you run it. However, if you change the declaration of repr to: def repr(self): return unicode(self.s)

Then it aborts with:

Traceback (most recent call last):
  File "mytest.py", line 13, in <module> unicode(l)
UnicodeEncodeError: 'ascii' codec can't encode characters in position 0-3:
   ordinal not in range(128)

So it looks like someone in the object hierarchy has a repr() implementation which is incorrectly returning a unicode string instead of a normal string. As someone else mentioned, when you do a format string like

'format %s' % mylist

and mylist is a sequence, python automatically calls repr() on it rather than unicode() (since there is no "correct" way to represent a list as a unicode string).

It may be django that's at fault here, or maybe you've implemented __repr__ incorrectly in one of your models.

apenwarr 2010-09-17 00:11:55

ansaurus

tags:

views:

answers:

Problem when using python logging in django and unicode

related questions