ansaurus

Question

subset sum problem

Answer 1

+2 A:

This does not prove P = NP.

You have failed to consider the possibility where the positive numbers are: 1, 2, 4, 8, 16, etc... and so there will be no duplicates when you sum subsets, so it will run in O(2^N) time in this case.

You can treat this as a special case but still the algorithm is still not polynomial for other similar cases. This assumption that you made is where you go away from the NP-complete version of subset sum to solving only easy (polynomial time) problems:

[assume the sum of the positive numbers grows] at a linear rate when we add extra elements.

Here you are effectively assuming that P (i.e. number of bits required to state the problem) is smaller than N. Quote from Wikipedia:

Thus, the problem is most difficult if N and P are of the same order.

If you assume that N and P are of the same order then you can't assume that the sum grows linearly indefinitely as you add more elements. As you add more elements to your set those elements also need to get larger to ensure that problem remains hard to solve.

If P (the number of place values) is a small fixed number, then there are dynamic programming algorithms that can solve it exactly.

You have rediscovered one of these algorithms. It's a nice piece of work but it isn't something new and it doesn't prove P = NP. Sorry!

Mark Byers 2010-06-26 23:07:43

It's true that there are worst cases. But then, the fact that you CAN make quicksort run in O(n^2) doesn't mean that it's not normally an O(n log n) algorithm. Infact, a speedup only doesn't occur with a data set where every value increases exponentially- something that I think can be detected beforehand.

DeadMG 2010-06-27 07:33:14

@DeadMG: I've expanded on my answer to address your comment.

Mark Byers 2010-06-27 10:03:45

@Mark Byers: No. I haven't done that at all. The space into which the sums fit isn't necessarily small (although it is fixed). It's only smaller than 2^N, which it is in any condition except where the values in N grow exponentially.

DeadMG 2010-06-27 10:12:29

@DeadMG: If your point is that you can solve subset sum in polynomial time for all cases except for the cases where you can't, then I have to agree with you.

Mark Byers 2010-06-27 10:28:49

@Mark Byers: I'm saying that the cases where you can't solve it like this in P time can almost certainly have special cases written for them. There's a difference between, solve in P time for a very small range of problems, and solve in P time for the vast majority of problems, where the problems that are still NP are clearly defined. I'm not going to argue that there's no room for improvement to eliminate these conditions, but that they *can* be removed and this algorithm *can* run in P time for a huge majority of problems.

DeadMG 2010-06-27 10:38:44

@DeadMG: That's exactly why I wrote 'You can treat this as a special case but still the algorithm is still not polynomial for other similar cases.' You can't solve them with special cases. That particular example I gave can be solved with a special case, but there are an infinite number of other examples where the elements grow exponentially. For each specific example I give you, you can of course think up a special case that handles that one specific example but you can't write an infinite number of special cases.

Mark Byers 2010-06-27 10:46:59

@Mark Byers: Subset sum isn't NP-hard, it's NP-complete. You can solve special cases with an appropriate definition of "special"- that is, design a special case for exponentially increasing values, and done. The values either increase exponentially or they don't- you don't have to write more special cases for different possibilities.

DeadMG 2010-06-27 10:56:43

Answer 2

A:

What this means is that as N increases, the number of duplicate subsets increases exponentially, and the number of unique, useful subsets increases only linearly.

Not necessarily - the number of duplicate subset sums is also determined by the value of the number closest to zero in the set (that the greater the minimum distance to zero - the fewer the duplicate subset sums for the set).

In general, we now move to enumerate all subsets in the set.

Unfortunately, enumerating all the sums of the subsets of the set requires performing an exponential number of addition operations (2^7 or 128 in your example). Otherwise, how would the algorithm determine what the unique sums happen to be? So, although the steps that follow the first step could very well have a polynomial running time, the algorithm as a whole has exponential complexity (because an algorithm is only as fast as its slowest part).

Incidentally, best known algorithm for solving the subset sum problem (Horowitz and Sahni, 1974) has O(2^N/2) complexity - which makes it about twice as fast as this algorithm.

Greg 2010-07-20 22:30:57

ansaurus

tags:

views:

answers:

subset sum problem

related questions