ansaurus

Question

Categorizing input data into sets based on attribute.

Answer 1

+1 A:

Start at the point nearest the midpoint
select all values within half the range of it
Use your existing solution going out in both directions.
Repeat from step 2 using every other point in the original selection.
select the best result.

Worst case this is an O(n²) problem as you can replace step 4 with "repeat with all points".

BCS 2009-05-27 21:52:18

I don't understand how starting the greedy algorithm in the middle (arbitrarilly) changes the results materially. It would be different if the answer was: "for every entry you are going to make a pass through your algorithm where that entry is the starting position, and where you are then randomly jumping out in both directions." But I don't think that's what you meant.

earino 2009-05-27 21:55:07

Starting in the middle is just to give an edge free span to start with. The important thing is to run the greedy solution starting from each point in some span so that every possible result is found (I think this will work). You might get the same result by starting from the low end and stepping through the points until you find another point that give you the first result again. And worst comes to worst, run every start point.

BCS 2009-05-27 22:46:13

Answer 2

A:

If I understand the problem right, then the greedy algorithm is in fact optimal. Let G be the greedy solution and let S be any solution. Consider the first group where G differs from S. Since G is a greedy solution, G's group is larger than S's. For example,

G : [1, 2, 3][4, 5, 6, 7][8, 9, 10][11, 12]...
S : [1, 2, 3][4, 5, 6][7, 8, 9][10, 11, 12]...

We can derive a new solution S' from S that agrees with G on a longer prefix without introducing a new group. Just steal some items from the next group. The resulting groups are valid because one is already in G and the other is a subgroup of a valid group.

S': [1, 2, 3][4, 5, 6, 7][8, 9][10, 11, 12]...

By induction, it follows that G is in fact optimal.

Dave 2009-05-28 23:50:29

ansaurus

tags:

views:

answers:

Categorizing input data into sets based on attribute.

related questions