Given 2N points in a 2D plane, you have to group them into N pairs such that the overall sum of the distances between the two points of each pair is the minimum possible value. The desired output is only that sum.

In other words, if a1, a2, ..., aN are the distances between the points of the first, second, ..., and Nth pair respectively, then (a1 + a2 + ... + aN) should be minimal.

Let us consider this test case: the 2*5 points are {20, 20}, {40, 20}, {10, 10}, {2, 2}, {240, 6}, {12, 12}, {100, 120}, {6, 48}, {12, 18}, {0, 0}.

The desired output is 237.

This is not my homework; I am curious about approaches other than brute force.
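For reference, here is the kind of exact baseline I want to beat: a dynamic program over bitmasks of unpaired points (my own sketch, still exponential, but it avoids enumerating the same pairing in different orders and pins down the expected output):

    import math
    from functools import lru_cache

    def min_matching_cost(points):
        # Exact minimum-cost pairing via DP over bitmasks of unpaired points.
        # Roughly O(2^(2N) * N); feasible up to about 2N = 20 points.
        n = len(points)
        dist = [[math.hypot(p[0] - q[0], p[1] - q[1]) for q in points]
                for p in points]

        @lru_cache(maxsize=None)
        def best(mask):
            if mask == 0:
                return 0.0
            i = (mask & -mask).bit_length() - 1    # lowest-index unpaired point
            rest = mask & ~(1 << i)
            return min(dist[i][j] + best(rest & ~(1 << j))
                       for j in range(n) if rest >> j & 1)

        return best((1 << n) - 1)

    pts = [(20, 20), (40, 20), (10, 10), (2, 2), (240, 6),
           (12, 12), (100, 120), (6, 48), (12, 18), (0, 0)]
    print(round(min_matching_cost(pts)))           # prints 237 for this sample

Always fixing the lowest-index unpaired point as one endpoint of the next pair is what keeps the recursion from revisiting permutations of the same pairing.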

+5  A: 
Moron
Maybe not quite ... the task here is geometric, not graph-based. You have more freedom without the restriction that everything must be packed into a graph.
Hamish Grubijan
Could you please explain with some test cases?
Pie-Guy
Me? Well, 2N points in a 2D plane are the same as a complete graph on 2N vertices, and that is going to eat up a lot of memory!
Hamish Grubijan
@Hamish, why not?
Moron
@Hamish Grubijan: No, my question was for Moron. Thanks :)
Pie-Guy
@Moron: that article is hard stuff. The OP is looking for a solution for the special case where G is a complete planar graph. Do you know of any simpler algorithms for this special case?
Doc Brown
Perhaps the so called duality approach can help identify closest pairs sooner. Note that this is computational geometry stuff, not pure algorithms.
Hamish Grubijan
@Moron why not? Not sure which part you refer to, but basically an entire field of computational geometry emerged precisely because there are some things you can do better when you do not restrict yourself to a graph data structure. There is still a very intimate relationship, but graph-only approaches (where every point is a vertex and every line segment is an edge) sometimes aren't as good.
Hamish Grubijan
@Doc: Not off the top of my head. I would presume the triangle-inequality restriction is more important than planarity (and the graph is not planar, btw). I will update this later when I find something.
Moron
@Doc: I have edited my post to add a link to a (hopefully) faster algorithm. That paper does not seem to have enough details, though.
Moron
@moron Very interesting, but could you please outline the presented algorithm? -- that link seems more of an abstract than an answer to the question. Otherwise I'd no longer wonder why jet physics people use greedy algorithms (maybe with kernels) if CS can only be done in such opaque lingo.
honk
@honk: Perfect matching in graphs is a well-known problem; you should find plenty of literature on it. As to the (second) link I gave, I would have to read it myself and try to summarize it. It might be hard to do that, though, given the prerequisites needed to even understand that paper. I might make an attempt, but please don't count on it happening soon.
Moron
@Moron: I am not very familiar with minimum-weight perfect matching, and what I found from googling doesn't give me the basic knowledge to implement it for my problem. Could you help me in this regard?
Pie-Guy
@Moron: when I wrote 'simpler', I meant 'simpler to understand', not 'faster' :-)
Doc Brown
thanks for caring about accessibility ;)
honk
@Moron: wow, that summary is excellent and IMHO much better to understand than the publication.
Doc Brown
A: 

Your final sum will mostly be dominated by the largest addend. The simplest algorithm to exploit this observation could go as follows (I cannot prove it works):

  1. sort points descending by their nearest-neighbor distance
  2. form pair of first entry and its nearest neighbor
  3. remove pair from list
  4. if list not empty goto 1.

This should work very often. A rough sketch in code follows.
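Hypothetically, in Python (untested; the re-sorting of step 1 happens implicitly because the current maximum is recomputed each round):

    import math

    def greedy_pairing(points):
        # Repeatedly take the point whose nearest neighbour is farthest away
        # and pair it with that neighbour; remove both and repeat.
        def dist(a, b):
            return math.hypot(a[0] - b[0], a[1] - b[1])

        remaining = list(points)
        total = 0.0
        while remaining:
            # nearest neighbour (index and distance) of remaining[i]
            def nearest(i):
                j = min((k for k in range(len(remaining)) if k != i),
                        key=lambda k: dist(remaining[i], remaining[k]))
                return j, dist(remaining[i], remaining[j])

            i = max(range(len(remaining)), key=lambda m: nearest(m)[1])  # steps 1+2
            j, d = nearest(i)
            total += d
            for k in sorted((i, j), reverse=True):    # step 3: remove the pair
                remaining.pop(k)
        return total

On the sample from the question this happens to find the same ~237 pairing, but as said, I cannot prove it works in general.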

Since you are essentially looking for a clustering algorithm with clusters of size 2, this link or a search for clustering algorithms for jet reconstruction might be interesting. People in the experimental particle-physics community work on heuristic algorithms for problems like this.

honk
-1. As I wrote to Hamish, a greedy algorithm like yours underestimates the difficulty of the problem. And having had a short look at your links, they seem to point in a completely wrong direction.
Doc Brown
@Doc I didn't say this would always work, and btw Hamish's algorithm started with the closest pair. Maybe you have something in mind that always works (I would be interested).
honk
@honk: I think (besides the obvious 'brute force' solution) the links given by Moron contain valid algorithms. They are just hard to read for non-mathematicians and probably not easy to implement.
Doc Brown
@Doc Agree. You or Moron should really write up a schematic implementation of the algorithm for this 'simple' case, though, so we can all understand it. Looking at the Google results, you could easily become the number 1 hit for this topic ;)
honk
How do you "sort points by their nearest neighbour distance"? The pairwise nearest neighbour distance doesn't provide a total order for points.
Nick Johnson
@nick Agreed, for gridded points this might get it totally wrong. If the points are not on a grid, though, the difference between a partial and a total order shouldn't matter too much, *I'd guess*. I assume that there is a "loner point" dominating the sum (which need not hold at all). If the points are too evenly distributed, the sum I outlined might easily be dominated not by the first addend but by the *last*. But for a not-too-small number of not-too-evenly distributed points this should still give a not-too-bad guess. Should I remove this answer since it depends on too many *not too's*?
honk
No, my point is that you simply can't sort a list by a property that pairs of elements have unless that property provides a total order - and the Euclidean distance between elements isn't one. Perhaps you meant something else by "sort points descending by their nearest-neighbor distance"?
Nick Johnson
@nick sure you can: for every point find its NN, measure the distance; that's the number *this* point gets; then sort.
honk
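In code, that key is one number per point, so sorting is perfectly well-defined (a tiny sketch, pts being any list of (x, y) tuples):

    import math

    # one number per point: the distance to its nearest neighbour
    nn_key = [min(math.hypot(pts[i][0] - pts[j][0], pts[i][1] - pts[j][1])
                  for j in range(len(pts)) if j != i)
              for i in range(len(pts))]
    order = sorted(range(len(pts)), key=nn_key.__getitem__, reverse=True)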
A: 

After googling for a while, I found some other references to minimum-weight perfect-matching algorithms which might be easier to understand (at least, easier to a certain degree).

EDIT

Here I found a Python implementation of one of those algorithms. It has 837 lines of code (plus some additional unit tests), and I have not tried it out myself. But perhaps you can use it for your case.

Here is a link to an approximation algorithm for the problem. Of course, the style of the paper is mathematical, too, but IMHO it is much easier to understand than the paper of Cook and Rohe. And it states in its preface that it aims exactly at being easier to implement than Edmonds' original algorithm, since you don't need a linear-programming solver.

EDIT2:

After thinking about the problem for a while, IMHO it must be possible to set up an A* search to solve it. The search space here is the set of 'partial matchings' (partially paired point sets). As Moron already wrote in his comments, one can restrict the search to states in which no pairs with crossing connecting lines exist. The path-cost function g(x) (to use the terms from Wikipedia) is the sum of the distances of the already-paired points. The heuristic function h(x) can be any under-estimate of the remaining distances: for example, when 2M points are not yet paired, take the sum of the M smallest distances between those remaining points. A sketch follows below.
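Here is a rough sketch of how that could look (my naming; the 'no crossing pairs' pruning is left out for brevity):

    import heapq
    import math
    from itertools import combinations, count

    def astar_matching(points):
        # State: frozenset of indices of the still-unpaired points.
        # g = distance sum of the pairs formed so far.
        # h = sum of the M smallest pairwise distances among the remaining
        #     2M points; this never over-estimates any completion (admissible).
        d = [[math.hypot(p[0] - q[0], p[1] - q[1]) for q in points]
             for p in points]

        def h(state):
            if len(state) < 2:
                return 0.0
            ds = sorted(d[i][j] for i, j in combinations(state, 2))
            return sum(ds[:len(state) // 2])

        start = frozenset(range(len(points)))
        tie = count()              # tie-breaker, so the heap never compares sets
        heap = [(h(start), 0.0, next(tie), start)]
        best_g = {start: 0.0}
        while heap:
            f, g, _, state = heapq.heappop(heap)
            if not state:
                return g           # all points paired
            if g > best_g.get(state, float('inf')):
                continue           # stale heap entry
            i = min(state)         # always pair the lowest index next,
            for j in state - {i}:  # so symmetric states are never revisited
                nxt = state - {i, j}
                ng = g + d[i][j]
                if ng < best_g.get(nxt, float('inf')):
                    best_g[nxt] = ng
                    heapq.heappush(heap, (ng + h(nxt), ng, next(tie), nxt))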

That will probably not be as efficient as the algorithm Moron pointed to, but I suspect it will be much better than 'brute force', and much easier to implement.

Doc Brown
The first link you gave has approximation algorithms.
Moron
Oops, you are right, I overlooked that at first glance. I will edit my answer appropriately.
Doc Brown