ansaurus

Question

What should itertools.product() yield when supplied an empty list?

Answer 1

+4 A:

sth 2010-07-01 01:49:03

In mathematics, there are many product and multiplication functions where there is no "neutral element". For a broad class with several examples, see http://en.wikipedia.org/wiki/Direct_product

Daniel Stutzbach 2010-07-01 03:37:42

+1. Beautiful answer. You can make your product equality work with a slight modification: define a `flatten` function by `flatten = lambda tups: sum(tups, ())`. Then `list(product(*(xs+ys)))` is equivalent to `map(flatten, product(product(*xs), product(*ys)))`. Moreover, the result of `itertools.product()` (with no args) is exactly the one needed for this equivalence to keep holding when either `xs` or `ys` (or both) is empty.

Mark Dickinson 2010-07-01 09:10:16

Ah; now I see that that's pretty much exactly what you did, with your `tproduct` function. Sorry for the noise. :)

Mark Dickinson 2010-07-01 09:17:02
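For anyone who wants to check the `flatten` equivalence from the comment above, here is a minimal sketch. The helper name `product_via_parts` and the sample inputs are made up for illustration, and it uses a list comprehension instead of `map` so that it behaves the same on Python 2 and Python 3.

```python
from itertools import product

def flatten(tups):
    """Concatenate a tuple of tuples into one flat tuple."""
    return sum(tups, ())

def product_via_parts(xs, ys):
    """Rebuild product(*(xs + ys)) from the separate products over xs and ys."""
    return [flatten(pair) for pair in product(product(*xs), product(*ys))]

# Hypothetical sample inputs: each is a list of iterables.
xs = [[1, 2], 'ab']
ys = [[True, False]]

# The equivalence holds for every combination, including empty xs and/or ys.
for left, right in [(xs, ys), (xs, []), ([], ys), ([], [])]:
    assert list(product(*(left + right))) == product_via_parts(left, right)
```

The empty cases only pass because `product()` yields one empty tuple. If it yielded nothing, the outer `product` would have an empty factor and yield nothing as well.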

@sth Thanks, I learned a lot from this.

FM 2010-07-03 14:04:57

Answer 2

+2 A:

As @sth already indicated, this behaviour is correct from a mathematical viewpoint. All you really need to convince yourself of is that list(itertools.product()) should have exactly one element, since once you know that it's clear what that element should be: it's got to be (for consistency) a tuple of length 0, and there's only one of those.

But the number of elements of itertools.product(l1, l2, l3, ...) should just be the product of the lengths of l1, l2, l3, ... . So the number of elements of itertools.product() should be the size of the empty product, and there's no shortage of internet sources that should persuade you that the empty product is 1.
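The counting argument is easy to check directly. The following is only a sketch; `expected_count` and the sample cases are names and values chosen for illustration, not anything from itertools.

```python
from functools import reduce
from itertools import product
from operator import mul

def expected_count(iterables):
    """Product of the lengths; reduce() starts from 1, the empty product."""
    return reduce(mul, (len(it) for it in iterables), 1)

cases = [
    ['ab', 'xyz', [0, 1]],  # 2 * 3 * 2 = 12 tuples
    ['ab', []],             # one empty factor: 2 * 0 = 0 tuples
    [],                     # no factors at all: the empty product, 1 tuple
]

# Pair the actual number of tuples with the predicted one for each case.
counts = [(len(list(product(*case))), expected_count(case)) for case in cases]

# With no arguments, the single element is the empty tuple.
only_element = list(product())
```

Note the difference between the last two cases: an empty list as one of the arguments gives no tuples at all, while no arguments at all gives exactly one (empty) tuple.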

I just wanted to point out that this is the correct practical definition as well as the correct mathematical one; that is, it's the definition that's most likely to 'just work' in boundary cases. For an example, suppose that you want to generate all strings of length n consisting of decimal digits, with the first digit nonzero. You might do something like:

import itertools

def decimal_strings(n):
    """Generate all digit strings of length n that don't start with 0."""
    for lead_digit in '123456789':
        for tail in itertools.product('0123456789', repeat=n-1):
            yield lead_digit + ''.join(tail)

What should this produce when n = 1? Well, in that case, you end up asking itertools.product for an empty product (repeat=0). If it yielded nothing, then the body of the inner for loop above would never be executed, so decimal_strings(1) would be an empty iterator; almost certainly not what you want. But since itertools.product('0123456789', repeat=0) yields a single empty tuple, the tail is the empty string and you get the expected result:

>>> list(decimal_strings(1))
['1', '2', '3', '4', '5', '6', '7', '8', '9']

(When n = 0, of course, this function correctly raises a ValueError as soon as you start iterating over it, because repeat would be -1.)
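To see both boundary cases concretely, here is a small sketch that repeats decimal_strings from above so it can run on its own. The variable names `tails`, `gen` and `raised` are just for illustration.

```python
import itertools

def decimal_strings(n):
    """Generate all digit strings of length n that don't start with 0."""
    for lead_digit in '123456789':
        for tail in itertools.product('0123456789', repeat=n-1):
            yield lead_digit + ''.join(tail)

# repeat=0 gives exactly one tail, the empty tuple, which joins to ''.
tails = list(itertools.product('0123456789', repeat=0))

# n = 0 means repeat=-1. Because decimal_strings is a generator, the
# ValueError only appears when the first item is requested.
gen = decimal_strings(0)
try:
    next(gen)
    raised = False
except ValueError:
    raised = True
```

Here `tails` is a list holding one empty tuple, which is why the inner loop runs once per leading digit when n = 1.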

So in short, the definition is mathematically sound, and more often than not it's also what you want. It's definitely not a Python bug!

Mark Dickinson 2010-07-01 08:58:40

+1 for the link to the Empty Product. It specifically mentions that for the Cartesian Product, the empty product is the singleton set containing the empty set.

Daniel Stutzbach 2010-07-01 12:51:40

Great answer, both for the Empty Product reference and for the practical illustration.

FM 2010-07-03 14:07:15
