views: 139
answers: 5

I'm working with an app that is CPU-bound rather than memory-bound, and I'm trying to merge two collections, which could be either lists or dicts.

Now the thing is, I can choose either one, but I'm wondering whether merging dicts would be faster since it's all in memory. Or is it always going to be O(n), with n being the size of the smaller of the two?

The reason I asked about dicts rather than sets is that I can't convert a set to JSON: that results in {key1, key2, key3}, and JSON needs key/value pairs, so I am using a dict so that json.dumps returns {key1: 1, key2: 1, key3: 1}. Yes, this is wasteful, but if it proves to be faster then I'm okay with it.

Edit: My question is about the difference between using a dict and a list for merging; I originally and mistakenly listed dict and set.

dict1 = {"the" : {"1":1, "3":1, "10":1}

dict2 = {"the" : {"11":1, "13":1}}

after merging

dict3 = {"the" : {"1":1, "3":1, "10":1, "11":1, "13":1}

+1  A: 

You can use the timeit module to measure the speed of your code, but I'm going to guess that they'll be practically the same (since a set is probably implemented using a dictionary).

Michael Aaron Safyan
+1  A: 

Dicts and sets will be just as fast (and O(N), as you surmise). Lists, which you only mention in your question's title and never in its text, might be slower, depending on what you mean by "merging".

Given the downstream JSON requirements, dicts with values all set to 1 will be fastest overall -- not for the merging, but for the JSON serialization.
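
For illustration, a rough sketch of that point, assuming the {key: 1} layout from the question; a dict can go straight into json.dumps, while a set would need an extra conversion first:

import json

d1 = {"1": 1, "3": 1, "10": 1}
d2 = {"11": 1, "13": 1}

merged = dict(d1)        # copy, then one linear pass over the smaller dict
merged.update(d2)
json.dumps(merged)       # e.g. '{"1": 1, "3": 1, "10": 1, "11": 1, "13": 1}'

# with a set, an extra conversion step is needed before serializing:
s = set(d1) | set(d2)
json.dumps(list(s))      # produces a JSON array, not an object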

Alex Martelli
+1  A: 

If you are looking for duplicate elimination, sets are very, very fast.

>>> x = set(range(1000000,2000000))
>>> y = set(range(1900000,2900000))

The following ran in ~0.020 s:
>>> z = set.intersection(x,y)
>>> len(z)
100000
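
If the goal is merging (a union) rather than finding common elements, the same kind of one-liner applies; I'd expect comparable speed, but measure to be sure:

>>> z = x | y          # set union, i.e. the "merge" case from the question
>>> len(z)
1900000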

Regarding output to JSON, just convert to a list:

json.dumps(list(z))
gahooa
@gahooa, but what about the time to convert to a list?
Michael Aaron Safyan
A: 

I'd be more worried about correctness. If you have duplicate keys, a list will keep the duplicated keys and values, while a dictionary will only keep one of the values. Also, a list will keep the order consistent. Which do you prefer?

My gut reaction is that if you are searching for keys, the dictionary will be faster. But how will you deal with duplication?
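
As a made-up illustration of that difference:

pairs1 = [("the", 1), ("a", 1)]
pairs2 = [("the", 2)]

as_list = pairs1 + pairs2    # [("the", 1), ("a", 1), ("the", 2)] -- "the" appears twice
as_dict = dict(pairs1)
as_dict.update(pairs2)       # {"the": 2, "a": 1} -- only the later value for "the" survives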

wisty
A: 

As Michael said, it's probably easiest to use the timeit module and see for yourself. It's very easy to do:

import timeit
def test():
    # do your thing here
    # including conversion to json
    pass

result = timeit.repeat(test, repeat=10, number=10000)
print '{0:.2}s per 10000 test runs.'.format(min(result))
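
For example, a hypothetical filled-in version that compares the two approaches from the question (the sample data, sizes, and run counts are made up):

import json
import timeit

dict1 = {str(i): 1 for i in range(1000)}
dict2 = {str(i): 1 for i in range(900, 1900)}
list1 = list(dict1)
list2 = list(dict2)

def merge_dicts():
    merged = dict(dict1)      # copy, then one linear update
    merged.update(dict2)
    return json.dumps(merged)

def merge_lists():
    # deduplicate by going through a dict, then serialize as a JSON array
    merged = dict.fromkeys(list1 + list2, 1)
    return json.dumps(list(merged))

for fn in (merge_dicts, merge_lists):
    best = min(timeit.repeat(fn, repeat=5, number=1000))
    print('{0}: {1:.3f}s per 1000 runs'.format(fn.__name__, best))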

Hope that helps.

Tripzilch