ansaurus

Question

Algorithm to find matching pairs in a list

Answer 1

+1 A:

I hope I got your problem.

Well, if IsMatch(i, j) and IsMatch(j, l) then IsMatch(i, l). More generally, the IsMatch relation is transitive, commutative and reflexive, ie. its an equivalence relation. The algorithm translates to which element appears the most times in the list (use IsMatch instead of =).

Alexandru 2009-11-27 00:24:08

Actually, that is not true. IsMatch is reflexive but not transitive. This is true: `IsMatch(i,j)` and `IsMatch(j,l)` then `!IsMatch(i,l)`, but if `IsMatch(l,m)` then `IsMatch(i,m)`. It's transitive for a chain of 3, not 2.

Victor Liu 2009-11-27 01:17:17

how is it possible that `IsMatch(i,j)` and `IsMatch(j,l)` and `IsMatch(l,m)` and `IsMatch(i,m)` __but not__ `IsMatch(i,l)` ? that makes no sense to me.

quack quixote 2009-11-27 02:09:25

Because `IsMatch(a,b) := a==1/b`, so it's obvious why you require a chain of 3, and not a chain of 2.

Victor Liu 2009-11-27 02:17:51

ok, that makes sense, thx.

quack quixote 2009-11-27 02:30:58

Answer 2

A:

(If I understand the problem...) Here is one way to match each pair of products in the two lists.

Multiply each pair N and save it to a structure with the product, and the subscripts of the elements making up the product.
Multiply each pair D and save it to a second instance of the structure with the product, and the subscripts of the elements making up the product.
Sort both structions on the product.
Make a merge-type pass through both sorted structure arrays. Each time you find a product from one array that is close enough to the other, you can record the two subscripts from each sorted list for a match.
You can also use one sorted list for an ismatch function, doing a binary search on the product.

xpda 2009-11-27 01:45:17

Answer 3

A:

well。。Multiply each pair D and save it to a second instance of the structure with the product, and the subscripts of the elements making up the product.

christian louboutin 2009-11-27 01:53:29

Answer 4

A:

I just asked my CS friend, and he came up with the algorithm below. He doesn't have an account here (and apparently unwilling to create one), but I think his answer is worth sharing.

// We will find the best match in the minimax sense; we will minimize
// the maximum matching error among all pairs. Alpha maintains a
// lower bound on the maximum matching error. We will raise Alpha until
// we find a solution. We assume MatchError returns an L_1 error.

// This first part finds the set of all possible alphas (which are
// the pairwise errors between all elements larger than maxi-min
// error.
Alpha = 0
For all i:
    min = Infinity
    For all j > i:
     AlphaSet.Insert(MatchError(i,j))
     if MatchError(i,j) < min
      min = MatchError(i,j)
    If min > Alpha
     Alpha = min

Remove all elements of AlphaSet smaller than Alpha

// This next part increases Alpha until we find a solution
While !AlphaSet.Empty()
    Alpha = AlphaSet.RemoveSmallest()
    sol = GetBoundedErrorSolution(Alpha)
    If sol != nil
     Return sol

// This is the definition of the helper function. It returns
// a solution with maximum matching error <= Alpha or nil if
// no such solution exists.
GetBoundedErrorSolution(Alpha) :=
    MaxAssignments = 0
    For all i:
     ValidAssignments[i] = empty set;
     For all j > i:
      if MatchError <= Alpha
       ValidAssignments[i].Insert(j)
       ValidAssignments[j].Insert(i)

     // ValidAssignments[i].Size() > 0 due to our choice of Alpha
     // in the outer loop

     If ValidAssignments[i].Size() > MaxAssignments
      MaxAssignments = ValidAssignments[i].Size()
    If MaxAssignments = 1
     return ValidAssignments
    Else
     G = graph(ValidAssignments)
     // G is an undirected graph whose vertices are all values of i
     // and edges between vertices if they have match error less
     // than or equal to Alpha
     If G has a perfect matching
      // Note that this part is NP-complete.
      Return the matching
     Else
      Return nil

It relies on being able to compute a perfect matching of a graph, which is NP-complete, but at least it is reduced to a known problem. It is expected that the solution be NP-complete, but this is OK since in practice the size of the given lists are quite small. I'll wait around for a better answer for a few days, or for someone to expand on how to find the perfect matching in a reasonable way.

Victor Liu 2009-11-27 02:25:04

Eh? Perfect matching is NPC? How about http://en.wikipedia.org/wiki/Edmonds%27s_matching_algorithm

Robert Obryk 2009-11-27 17:43:14

Yes, you're right. Finding a perfect matching is actually a reasonably polynomial time operation. Saw that after posting, but didn't bother fixing it. It's rather surprising, since it feels like one of those things which should be difficult. Either way, this answer is superseded by my other one now.

Victor Liu 2009-11-27 20:00:39

Answer 5

A:

You want to find j such that D(i)D(j) = N(i)N(j) {I assumed * is ordinary real multiplication}

assuming all N(i) are nonzero, let

Z(i) = D(i)/N(i).

Problem: find j, such that Z(i) = 1/Z(j).

Split set into positives and negatives and process separately.

take logs for clarity. z(i) = log Z(i).

Sort indirectly. Then in the sorted view you should have something like -5 -3 -1 +1 +3 +5, for example. Read off +/- pairs and that should give you the original indices.

Am I missing something, or is the problem easy?

Matt Kennel 2009-11-27 09:17:51

Answer 6

A:

Okay, I ended up using this ported Fortran code, where I simply specify the dense upper triangular distance matrix using:

complex_t num = N[i]*N[j] - D[i]*D[j];
complex_t den1 = N[j]*D[i];
complex_t den2 = N[i]*D[j];
if(std::abs(den1) < std::abs(den2)){
 costs[j*(j-1)/2+i] = std::abs(-num/den2);
}else if(std::abs(den1) == 0){
 costs[j*(j-1)/2+i] = std::sqrt(std::numeric_limits<double>::max());
}else{
 costs[j*(j-1)/2+i] = std::abs(num/den1);
}

This works great and is fast enough for my purposes.

Victor Liu 2009-11-27 09:52:10

Answer 7

A:

You should be able to sort the (D[i],N[i]) pairs. You don't need to divide by zero -- you can just multiply out, as follows:

bool order(i,j) {
  float ni= N[i]; float di= D[i];
  if(di<0) { di*=-1; ni*=-1; }

  float nj= N[j]; float dj= D[j];
  if(dj<0) { dj*=-1; nj*=-1; }

  return ni*dj < nj*di;
}

Then, scan the sorted list to find two separation points: (N == D) and (N == -D); you can start matching reciprocal pairs from there, using:

abs(D[i]*D[j]-N[i]*N[j])<epsilon

as a validity check. Leave the (N == 0) and (D == 0) points for last; it doesn't matter whether you consider them negative or positive, as they will all match with each other.

edit: alternately, you could just handle (N==0) and (D==0) cases separately, removing them from the list. Then, you can use (N[i]/D[i]) to sort the rest of the indices. You still might want to start at 1.0 and -1.0, to make sure you can match near-zero cases with exactly-zero cases.

comingstorm 2009-11-27 11:39:22

Answer 8

A:

hey this is some thing new which i haven't come across and things i know about database are very few...but its very informative site which is helpful

PHP programming india 2009-11-27 12:12:11

ansaurus

tags:

views:

answers:

Algorithm to find matching pairs in a list

related questions