ansaurus

Question

Python - How to calculate equal parts of two dictionaries?

Answer 1

A:

in pseudocode:

Dictionary d3 = new Dictionary()
for (i = 0 to min(d1.size(), d2.size()))
{
  element shared = getSharedElements(d1[i], d2[i]);
  d3.store(i, shared);
}

function getsharedElements(array e1, array e2)
{
  element e3 = new element();
  for (int i = 0 to e1.length)
  {
    if (e2.contains(e1[i]))
    {
      e3.add[e1[i]];
    }
  }
  return e3;
}

tehvan 2009-03-12 12:26:47

Highly non-pythonic!

Joe Koberg 2009-03-12 14:17:12

Heh, the Python is actually more concise than the pseudocode! “Python is executable pseudocode”, indeed!

bobince 2009-03-12 15:10:58

This was posted before the original post mentioned python.

tehvan 2009-03-13 06:14:24

Answer 2

+7 A:

Assuming this is Python, you want:

dict((x, set(y) & set(d1.get(x, ()))) for (x, y) in d2.iteritems())

to generate the resulting dictionary "d3".

Python 3.0+ version

>>> d3 = {k: list(set(d1.get(k,[])).intersection(v)) for k, v in d2.items()}
{0: ['11', '25', '38'], 1: ['38'], 2: ['11', '18'], 3: ['11', '25']}

The above version (as well as Python 2.x version) allows empty intersections therefore additional filtering is required in general case:

>>> d3 = {k: v for k, v in d3.items() if v}

Combining the above in one pass:

d3 = {}
for k, v in d2.items():
    # find common elements for d1 & d2
    v3 = set(d1.get(k,[])).intersection(v)
    if v3: # whether there are common elements
       d3[k] = list(v3)

[Edit: I made this post community wiki so that people can improve it if desired. I concede it might be a little hard to read if you're not used to reading this sort of thing in Python.]

John Feminella 2009-03-12 12:30:54

I used your solution and got an error:TypeError: 'int' object is not iterableCan you help me with this one yet?

2009-03-12 13:20:35

Sorry, there was a typo. You want to use "d2.iteritems()" to get the list of items there.

John Feminella 2009-03-12 13:27:09

This returns: {0: set(['11', '25', '38']), 1: set(['38']), 2: set(['11', '18']), 3: set(['11', '25'])} so it's pretty close.

Adrian Archer 2009-03-12 13:30:06

Thank you very much John!!! Your solution is what I need ;-)

2009-03-12 13:33:42

Adrian Archer 2009-03-12 13:39:49

Impressive, but I don't think clever one-liners are particularly pythonic.

dangph 2009-03-12 13:40:12

@dangph: You're right; I was more or less shooting from the hip. So post a better solution and we'll upvote it! :)

John Feminella 2009-03-12 13:43:36

@john, good idea. Done :)

dangph 2009-03-12 14:09:03

@dangph: clever one-liners are extremely pythonic and quite often fast.

SilentGhost 2009-03-12 14:14:36

Pythonic is in the eye of the beholder, but I find this hard to grok — “Scheme-ic”, if that's a word! (It isn't). Might be clearer with some line breaks/indentation to resolve which brackets belong to which.

bobince 2009-03-12 15:07:05

I made this post community wiki if you guys want to improve on my solution. Have at it!

John Feminella 2009-03-12 15:51:17

I've added Python 3.0+ version.

J.F. Sebastian 2009-03-13 02:54:29

Answer 3

+1 A:

The problem boils down to determining the common elements between the two entries. (To obtain the result for all entries, just enclose the code in a loop over all of them.) Furthermore, it looks like each entry is a set (i.e. it has not duplicate elements). Therefore, all you need to do is find the set intersection between these elements. Many languages offer a method or function for doing this; for instance in C++ use the set container and the set_intersection function. This is a lot more efficient than comparing each element in one set against the other, as others have proposed.

Diomidis Spinellis 2009-03-12 12:39:48

Answer 4

+1 A:

If we can assume d1 and d2 have the same keys:

d3 = {}
for k in d1.keys():
    intersection = set(d1[k]) & set(d2[k])
    d3[k] = [x for x in intersection]

Otherwise, if we can't assume that, then it is a little messier:

d3 = {}
for k in set(d1.keys() + d2.keys()):
    intersection = set(d1.get(k, [])) & set(d2.get(k, []))
    d3[k] = [x for x in intersection]

Edit: New version taking the comments into account. This one only checks for keys that d1 and d2 have in common, which is what the poster seems to be asking.

d3 = {}
for k in set(d1.keys()) & set(d2.keys()):
    intersection = set(d1[k]) & set(d2[k])
    d3[k] = list(intersection)

dangph 2009-03-12 14:08:18

that's not more pythonic

SilentGhost 2009-03-12 14:13:34

@SilentGhost, why is that? Care to elaborate? Or you code post a more pythonic one.

dangph 2009-03-12 14:15:49

SilentGhost 2009-03-12 14:20:01

@SilentGhost, why is that better?

dangph 2009-03-12 14:28:26

because it's idiomatic?

SilentGhost 2009-03-12 14:30:02

@SilentGhost, that's not really an argument; it's just a subjective assertion. I think your version is bad because it takes effort to pull it apart and understand it. I would be annoyed if I found it in production code.

dangph 2009-03-12 14:34:30

Any reason not to use “d3[k]= list(intersection)”?

bobince 2009-03-12 15:08:13

btw, your second example wouldn't work in py3k: dict views are not summable.

SilentGhost 2009-03-12 15:20:07

@bobince, that didn't work in python 2.6. It works in python 3 however.

dangph 2009-03-12 15:45:34

@SilentGhost, thanks, I did not know that. It is easily fixed. Though the fix does make my code a bit more verbose.

dangph 2009-03-12 15:50:32

You version allows empty intersections i.e., `all(d3.values())` might be `False` in some cases. But the OP asks to include only those keys when there *are* some common numbers.

J.F. Sebastian 2009-03-13 03:03:31

bwt, `list(some_set)` works just fine. I've tested it in Python 2.5+

J.F. Sebastian 2009-03-13 03:06:55

@J.F., you are right about the list(set()). I don't know why I thought that didn't work. I must have misread some error message.

dangph 2009-03-13 08:58:47

Answer 5

+3 A:

Offering a more readable solution:

d3= {}
for common_key in set(d1) & set(d2):
    common_values= set(d1[common_key]) & set(d2[common_key])
    d3[common_key]= list(common_values)

EDIT after suggestion:

If you want only keys having at least one common value item:

d3= {}
for common_key in set(d1) & set(d2):
    common_values= set(d1[common_key]) & set(d2[common_key])
    if common_values:
        d3[common_key]= list(common_values)

You could keep the d1 and d2 values as sets instead of lists, if order and duplicates are not important.

ΤΖΩΤΖΙΟΥ 2009-03-13 01:15:00

J.F. Sebastian 2009-03-13 03:12:28

btw, `common_values` might be an empty set in general case. The OP asks "there are some" i.e. `all(d3.values())` must be True.

J.F. Sebastian 2009-03-13 03:19:32

ansaurus

tags:

views:

answers:

Python - How to calculate equal parts of two dictionaries?

Python 3.0+ version

EDIT after suggestion:

related questions