ansaurus

Question

Py3k memory conservation by returning iterators rather than lists

Answer 1

+6 A:

Many of them are not exactly iterators, but special view objects. For instance range() now returns something similar to the old xrange object - it can still be indexed, but lazily constructs the integers as needed.

Similarly dict.keys() gives a dict_keys object implementing a view on the dict, rather than creating a new list with a copy of the keys.

How this affects memory footprints probably depends on the program. Certainly there's more of an emphasis towards using iterators unless you really need lists, whereas using lists was generally the default case in python2. That will cause the average program to probably be more memory efficient. Cases where there are really big savings are probably going to already be implemented as iterators in python2 programs however, as really large memory usage will stand out, and is more likely to be already addressed. (eg. the file iterator is already much more memory efficient than the older file.readlines() method)

Converting is done by the 2to3 tool, and will generally convert things like range() to iterators where it can safely determine a real list isn't needed, so code like:

for x in range(10): print x

will switch to the new range() object, no longer creating a list, and so will obtain the reduced memory benefit, but code like:

x = range(20)

will be converted as:

x = list(range(20))

as the converter can't know if the code expects a real list object in x.

Brian 2009-03-31 16:27:59

Answer 2

+1 A:

Are iterators also generator expressions? Lazy evaluation?

An iterator is just an object with a next method. What the documentation means most of the time when saying that a function returns an iterator is that its result is lazily loaded.

Thus, with this the memory footprint of python is going to reduce drastically. Isn't it?

It depends. I'd guess that the average program wouldn't notice a huge difference though. The performance advantages of iterators over lists is really only significant if you have a large dataset. You may want to see this question.

Jason Baker 2009-03-31 16:39:11

ansaurus

tags:

views:

answers:

Py3k memory conservation by returning iterators rather than lists

related questions