ansaurus

Question

How does a sorting network beat generic sorting algorithms?

Answer 1

A:

I think that loop unwinding is what causing the faster results on the sort network algorithm

Shay Erlichmen 2010-10-10 16:32:21

Answer 2

A:

Theoretically the code could be about the same if the compiler could fully unroll the loops in the Insertion Sort. The first loop can be easily unrolled, while the second can't be unrolled that easy.

It may also be the case that, because the code is not that simple as the network sorting code, the compiler can make less optimizations. I think there are more dependencies in the insertion sort than in the network sort, which may make a big difference when the compiler tries to optimize the code (correct me if I'm wrong).

George B. 2010-10-10 16:33:11

Answer 3

+10 A:

But here we are not using the parallelization.

Modern CPUs can figure out when instructions are independent and will execute them in parallel. Hence, even though there's only one thread, the sorting network's parallelism can be exploited.

Where exactly does insertion sort make unnecessary comparisons?

The easiest way to see the extra comparisons is to do an example by hand.

Insertion sort:
6 5 4 3 2 1
5 6 4 3 2 1
5 4 6 3 2 1
4 5 6 3 2 1
4 5 3 6 2 1
4 3 5 6 2 1
3 4 5 6 2 1
3 4 5 2 6 1
3 4 2 5 6 1
3 2 4 5 6 1
2 3 4 5 6 1
2 3 4 5 1 6
2 3 4 1 5 6
2 3 1 4 5 6
2 1 3 4 5 6
1 2 3 4 5 6

Sorting network:
6 5 4 3 2 1
6 4 5 3 2 1
5 4 6 3 2 1
4 5 6 3 2 1 # These three can execute in parallel with the first three
4 5 6 3 1 2 #
4 5 6 2 1 3 #
4 5 6 1 2 3
1 5 6 4 2 3
1 2 6 4 5 3
1 2 3 4 5 6
1 2 3 4 5 6

Daniel Stutzbach 2010-10-10 16:34:37

@Daniel: Okay, since these paths are completely different, we cannot compare them directly. Certainly, Sorting network allows us to sort in lesser number of comparisons. To state my question in a different way, **what stops us from optimizing insertion sort to use this sequence of swaps for any number of inputs?**

Lazer 2010-10-10 18:02:13

Lazer: I'm afraid I don't understand. Which sequence are you referring to when you say "this sequence of swaps"? Also, did you mean to say "optimizing insertion sort" or did you intend to refer to sorting networks?

Daniel Stutzbach 2010-10-10 19:54:47

@Daniel: Sorry for lack of clarity. In yet other terms, why do we use insertion sort at all if sorting networks are more *efficient*?

Lazer 2010-10-10 20:10:43

@Lazer: Ah, that makes more sense. :-) Thanks for the clarification! The trouble with sorting networks is that they only work for a fixed n. Furthermore, they're only practical when n is small since you have to write out all of the compare-and-swaps by hand and there will be O(n log n) of them. They're fast in part because the code is written out and there are no loops, so the speed is part and parcel with the limitation.

Daniel Stutzbach 2010-10-10 20:18:15

@Daniel: So, you mean to say there is no good way to write a program to generate the set of swaps to be performed (for network sort) for *any n*? Why do sorting networks work with fixed `n`? Why can't they be generalized?

Lazer 2010-10-10 20:33:27

@Lazer: Yes, that's what I mean. :-) If an algorithm works with variable n, it needs to have some kind of loop in it somewhere. A sorting network has no loops. You could write a program to generate the swaps and then execute them, but generating the swaps will use up more time than you will save by using a sorting network. The closest you can get is to use a recursive algorithm like MergeSort or QuickSort and use a sorting network as the base case.

Daniel Stutzbach 2010-10-11 14:09:45

@Daniel: thanks! I get it now.

Lazer 2010-10-11 15:17:05

Answer 4

A:

I think all of you questions are answered in Daniel Stutzbach answer to the original post:

The algorithm you posted is similar to an insertion sort, but it looks like you've minimized the number of swaps at the cost of more comparisons. Comparisons are far more expensive than swaps, though, because branches can cause the instruction pipeline to stall.

drewk 2010-10-10 16:35:45

You can't make that generalization. If your data objects are large but extracting and comparing the key is fast, comparisons are a lot cheaper than swaps. I would guess the only time swaps are cheaper is when your data elements are a simple type.

R.. 2010-10-11 01:37:56

Answer 5

A:

I believe the amount of 'work' done in a parallel algorithm and a serial algorithm is always almost same. Only that since work gets distributed you would get outputs faster. I think you would get output convincingly faster in case when the size of input is sufficient enough to justify using parallel algorithm.

In case of insertion sort division of array amongst processors is such that it forms a pipeline, and it would take some time to fill the pipeline and then it would produce benefits of parallel algorithm.

RIPUNJAY TRIPATHI 2010-10-10 17:00:24

Answer 6

+1 A:

The better question is why the sorting network only outperforms insertion sort (generally a very slow sort) by ~50%. The answer is that big-O is not so important when n is tiny. As for OP's question, Daniel has the best answer.

R.. 2010-10-10 17:48:45

ansaurus

tags:

views:

answers:

How does a sorting network beat generic sorting algorithms?

related questions