Original Problem:
I have 3 boxes, each containing 200 coins. Only one person has made calls from all three boxes, so each box contains exactly one coin bearing that person's fingerprints, while all the other coins carry different fingerprints. You have to find, in each box, the coin whose fingerprint is the same across all 3 boxes, so that we can identify the fingerprint of the person who made calls from all 3 boxes.

Converted problem:
You have 3 arrays of 200 integers each. It is given that there is one and only one element common to all 3 arrays. Find that common element.
Please consider solutions other than the trivial one with O(1) space and O(n^3) time.

+4  A: 

If you sort all the arrays first (O(n log n)), then it is pretty easy to find the common element in less than O(n^3) time. For example, you can use binary search after sorting them.
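
For concreteness, here is a minimal Python sketch of that idea (added for illustration, not part of the original answer); only the arrays being searched actually need to be sorted:

from bisect import bisect_left

def binary_contains(sorted_list, x):
  # Standard binary-search membership test on a sorted list.
  i = bisect_left(sorted_list, x)
  return i < len(sorted_list) and sorted_list[i] == x

def find_common_sorted(a, b, c):
  b_sorted, c_sorted = sorted(b), sorted(c)  # O(n log n)
  for x in a:                                # n binary searches: O(n log n)
    if binary_contains(b_sorted, x) and binary_contains(c_sorted, x):
      return x

Overall this is O(n log n) time and O(n) extra space for the sorted copies.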

Tuomas Pelkonen
Or just merge sort them into one big array and check for a triplicate entry.
alxp
What if some array already has duplicate elements?
Tuomas Pelkonen
Answering my own comment: the original problem does not allow duplicates, but the converted problem does not say anything about the uniqueness of the integers.
Tuomas Pelkonen
You don't actually merge them; you just use a merge-sort-like algorithm and have it stop when all 3 inputs have the same value.
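
A sketch of that stop-early merge in Python (added for illustration, not part of the original comment), assuming each array has already been sorted:

def common_of_sorted(a, b, c):
  # Walk three sorted arrays in parallel and stop at the first value
  # present in all three (guaranteed to exist by the problem statement).
  i = j = k = 0
  while i < len(a) and j < len(b) and k < len(c):
    if a[i] == b[j] == c[k]:
      return a[i]
    top = max(a[i], b[j], c[k])
    # Advance every pointer that is still behind the largest head value.
    if a[i] < top: i += 1
    if b[j] < top: j += 1
    if c[k] < top: k += 1

After the O(n log n) sorts, the walk itself is O(n), and duplicates within a single array do not affect correctness.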
jmucchiello
+2  A: 

Use a hash table keyed by the integers and encode each entry so that you know which array it came from - then check for the slot which has entries from all 3 arrays. O(n)
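
One possible Python reading of this (added for illustration; the bit-flag encoding of the source array is my assumption, since the answer does not spell out the encoding):

def find_common_tagged(arrays):
  seen = {}  # value -> bitmask of the arrays it has appeared in
  full_mask = (1 << len(arrays)) - 1
  for idx, arr in enumerate(arrays):
    for x in arr:
      seen[x] = seen.get(x, 0) | (1 << idx)
      if seen[x] == full_mask:  # present in every array
        return x

find_common_tagged([a, b, c]) runs in expected O(n) time with O(n) extra space.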

Jacob
+5  A: 

An improvement on Pelkonen's answer:
From the converted problem in the OP:

"Given that there is one and only one common element in these 3 arrays."

We need to sort only 2 of the arrays to find the common element.

WannaBeGeek
+1: you need to iterate over all of one array anyway, so you don't need to sort that one
jk
It is possible that arrays A and B share common elements (a, b), and arrays B and C share common elements (b, c). Put together, arrays A, B and C share the single common element b. Therefore, sorting only 2 arrays is not sufficient.
SiLent SoNG
A: 

O(N) solution: use a hash table. Let H[i] be the list of all integers from the three arrays that hash to i.

For every i where H[i] has more than one entry, check whether three of its values are the same. If yes, you have your solution. You can do this check even with the naive approach and it should still be very fast, or you can sort each such H[i] and then the check becomes trivial.

If your numbers are relatively small, you can instead use a plain array with H[i] = k if the value i appears k times across the three arrays; the solution is then the i for which H[i] = 3. If your numbers are huge, use a hash table though.

You can extend this to work even if some elements are common to only two of the arrays, and also if elements can repeat within one of the arrays. It just becomes a bit more complicated, but you should be able to figure it out on your own.
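
For illustration, a minimal Python sketch of the "relatively small numbers" variant (added here, not part of the original answer; it assumes non-negative values bounded by a known max_value and no repeats within an array):

def find_common_small(a, b, c, max_value):
  # Direct-indexed counts; valid when every value lies in [0, max_value]
  # and appears at most once per array.
  counts = [0] * (max_value + 1)
  for arr in (a, b, c):
    for x in arr:
      counts[x] += 1
  for value, k in enumerate(counts):
    if k == 3:
      return value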

IVlad
+2  A: 

Use a hashtable mapping objects to frequency counts. Iterate through all three lists, incrementing occurrence counts in the hashtable, until you encounter one with an occurrence count of 3. This is O(n), since no sorting is required. Example in Python:

def find_duplicates(*lists):
  num_lists = len(lists)
  counts = {}
  for l in lists:
    for i in l:
      counts[i] = counts.get(i, 0) + 1
      if counts[i] == num_lists:
        return i

Or an equivalent, using sets:

def find_duplicates(*lists):
  intersection = set(lists[0])
  for l in lists[1:]:
    intersection = intersection.intersection(set(l))
  return intersection.pop()
Nick Johnson
+1: I was going to suggest something similar. You beat me to it!
KarstenF
+1  A: 

If you want the fastest* answer:

  • Sort one array--time is N log N.
  • For each element in the second array, binary search the sorted first array. If you find it, record a 1 in the corresponding slot of a companion array (otherwise leave it 0)--time is N log N, using N space.
  • For each non-zero slot, copy the corresponding entry of the first array into a temporary array, compacting it so it's still sorted--time is N.
  • For each element in the third array, search the temporary array; when you find a hit, stop. Time is less than N log N.

Here's code in Scala that illustrates this:

import java.util.Arrays

val a = Array(1, 5, 2, 3, 14, 1, 7)
val b = Array(3, 9, 14, 4, 2, 2, 4)
val c = Array(1, 9, 11, 6, 8, 3, 1)

// Sort the first array so it can be binary searched.
Arrays.sort(a)

// Mark which entries of a also occur in b.
val count = new Array[Int](a.length)
for (i <- 0 until b.length) {
  val j = Arrays.binarySearch(a, b(i))
  if (j >= 0) count(j) += 1
}

// Compact the elements common to a and b into the front of count
// (reusing it as the sorted temporary array); n is its length.
var n = 0
for (i <- 0 until count.length) if (count(i) > 0) { count(n) = a(i); n += 1 }

// Any element of c found in the temporary array is common to all three.
for (i <- 0 until c.length) {
  if (Arrays.binarySearch(count, 0, n, c(i)) >= 0) println(c(i))
}

With slightly more complexity, you can either use no extra space at the cost of being even more destructive of your original arrays, or you can avoid touching your original arrays at all at the cost of another N space.


Edit: * as the comments have pointed out, hash tables are faster for non-perverse inputs. This is "fastest worst case". The worst case may not be so unlikely unless you use a really good hashing algorithm, which may well eat up more time than your sort. For example, if you multiply all your values by 2^16, the trivial hashing (i.e. just use the bitmasked integer as an index) will collide every time on lists shorter than 64k....

Rex Kerr
How is this faster than a hashtable solution?
IVlad
@IVlad: You cannot guarantee that the hashtable lookup is O(1). Depending on the hash function, how the implementation handles collisions, and the actual input, your performance might deteriorate to O(n).
Mads Ravn
@Mads Ravn: true, you cannot guarantee, but you can get very very close to a guarantee considering what this problem actually asks for. Read the comments to the OP's post. Under those restrictions, forcing a hash function to go into worst-case mode would be pretty difficult I think.
IVlad
@IVlad: You are right. If the integers are either distinct (except for the one we're searching for) or chosen from a large set of potential numbers with uniform probability, then it would indeed be very hard to imagine a "realistic" hash table implementation deteriorating. I just have the nasty habit of thinking of the theoretical worst case whenever I see a big O - damn you, CLRS :) As an aside, a CS professor once told me, tongue in cheek: "If you want your programs to perform well in practice, then you use a hash table. If you want your programs to perform well in theory, then you use a balanced tree".
Mads Ravn
Fastest worst-case solution, I meant (assuming the sort is done with mergesort or the like). I agree that a hashtable can be faster, especially when memory is no issue.
Rex Kerr
+2  A: 

Let N = 200, k = 3,

  1. Create a hash table H with capacity ≥ Nk.

  2. For each element X in array 1, set H[X] to 1.

  3. For each element Y in array 2, if Y is in H and H[Y] == 1, set H[Y] = 2.

  4. For each element Z in array 3, if Z is in H and H[Z] == 2, return Z.

  5. throw new InvalidDataGivenByInterviewerException();

O(Nk) time, O(Nk) space complexity.
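
A direct Python transcription of these steps (added for illustration; a dict stands in for the hash table, and the exception in step 5 is the joke from the answer):

def find_common_staged(arr1, arr2, arr3):
  h = {}
  for x in arr1:
    h[x] = 1                # step 2: seen in array 1
  for y in arr2:
    if h.get(y) == 1:
      h[y] = 2              # step 3: seen in arrays 1 and 2
  for z in arr3:
    if h.get(z) == 2:
      return z              # step 4: common to all three
  raise ValueError("invalid data given by interviewer")  # step 5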

KennyTM