ansaurus

Question

Is there a better way of replacing duplicates in a list (Python)

Answer 1

+5 A:

I would do something like this:

a1 = ['a', 'b', 'c', 'a', 'a', 'b']
a2 = []

d = {}

for i in a1:

    d.setdefault(i, -1)
    d[i] += 1

    if d[i] >= 1:
        a2.append('%s_%d' % (i, d[i]))
    else:
        a2.append(i)

print a2

pzr 2010-05-14 20:42:19

Answer 2

+1 A:

I think the output you're asking for is messy itself, and so there is no clean way of creating it.

How do you intend to use this new list? Would a dictionary of counts like the following work instead?

{'a':3, 'b':2, 'c':1}

If so, I would recommend:

from collections import defaultdict
d = defaultdict(int) # values default to 0
for key in l1:
    d[key] += 1

mathmike 2010-05-14 20:42:41

Hey! Thanks for all the replies. =) The reason I want to keep duplicates is because the list is actually going to be a list of keys that will be used by a dictionary and I didnt want to overwrite the same key unintentionally. =)

myeu2 2010-05-14 21:39:10

Answer 3

+15 A:

In Python, generating a new list is usually much easier than changing an existing list. We have generators to do this efficiently. A dict can keep count of occurrences.

l = ['a', 'b', 'c', 'a', 'a', 'b']

def rename_duplicates( old ):
    seen = {}
    for x in old:
        if x in seen:
            seen[x] += 1
            yield "%s_%d" % (x, seen[x])
        else:
            seen[x] = 0
            yield x

print list(rename_duplicates(l))

THC4k 2010-05-14 20:53:20

+1 I wrote same solution as a class, but this is better, generators are more pythonic than a class for this kind of work, as I see it anyhow.

daramarak 2010-05-14 22:39:25

Answer 4

A:

Based on your comment to @mathmike, if your ultimate goal is to create a dictionary from a list with duplicate keys, I would use a defaultdict from the `collections Lib.

>>> from collections import defaultdict
>>> multidict = defaultdict(list)
>>> multidict['a'].append(1)
>>> multidict['b'].append(2)
>>> multidict['a'].append(11)
>>> multidict
defaultdict(<type 'list'>, {'a': [1, 11], 'b': [2]})

Don O'Donnell 2010-05-15 07:06:12

ansaurus

tags:

views:

answers:

Is there a better way of replacing duplicates in a list (Python)

related questions