ansaurus

Question

Parallel processing via multithreading in Java

Answer 1

+1 A:

The overhead cost of synchronization may be comparatively large and prevent many optimizations.

Furthermore you are creating way too many threads.

The other is in the 'run' function of a class that extends thread, and recursively creates two new threads each time it is called.

You would be better off with a fixed number of threads, suggestively 4 on a quad core. This could be realized with a thread pool (tutorial) and the pattern would be "bag of tasks". But perhaps it would be better yet, to initially divide the task into four equally large tasks and do "single-threaded" sorting on those tasks. This would then utilize the caches a lot better.

Instead of having a "busy-loop" waiting for the threads to finish (stealing cpu-cycles) you should have a look at Thread.join().

aioobe 2010-05-21 06:24:27

While this is generally the case with these problem, there is no synchronization in this example.

Peter Lawrey 2010-05-21 06:31:26

Woops. In that case there should be some nasty race-conditions?

aioobe 2010-05-21 06:32:32

Oh, I see the busy-waiting loop now.

aioobe 2010-05-21 06:33:17

Yes this code _could_ suffer from memory consistency effects too, since `finished` isn't volatile (the busy wait loop might never complete). I think also the array elements need a joint to form a happens-before relationship.

Justin 2010-05-21 07:28:37

Thanks for your suggestions and the tutorial, aioobe. I found that Thread.join() is actually much less affective than use of a CountDownLatch.

Robz 2010-08-04 21:10:15

Answer 2

A:

How many elements in the array you have to do sort? If there are too few elements, the time of sync and CPU switching will over the time you save for dividing the job for paralleling

vodkhang 2010-05-21 06:27:39

N elements are in the array, and N is a very large number (greater than 1 million).

Robz 2010-05-21 06:33:47

@Robz, I'm sure you'd be shocked to discover that the Sun implementation of Arrays.sort has a minimum threshold value for which it uses insertion sort. Size matters, period.

Tim Bender 2010-05-25 06:05:12

Yeah, I know that:)

vodkhang 2010-05-25 06:18:06

Actually I'm not sure that it would matter how arbitrarily large the array is for this potential "size matters" problem to occur. Merge sort, as well as quick sort, uses the divide-and-conquer method. Thus, if X is some cutoff and N is larger than X then eventually during the course of sorting the entire array a sub-array would be sent to the merge/quick sort function that is of length less than X.Interestingly, in my personal experience a simple sequential quick sort that does not use a cutoff nor insertion sort calls goes just as fast as Arrays.sort with N being 1 million or more.

Robz 2010-08-04 21:08:06

Answer 3

+5 A:

The problem is not multi-threading: I've written a correctly multi-threaded QuickSort in Java and it owns the default Java sort. I did this after witnessing a gigantic dataset being process and had only one core of a 16-cores machine working.

One of your issue (a huge one) is that you're busy looping:

 // Wait for the two other threads to finish 
 while(!ma.finished || !mb.finished) ;

This is a HUGE no-no: it is called busy looping and you're destroying the perfs.

(Another issue is that your code is not spawning any new threads, as it has already been pointed out to you)

You need to use other way to synchronize: an example would be to use a CountDownLatch.

Another thing: there's no need to spawn two new threads when you divide the workload: spawn only one new thread, and do the other half in the current thread.

Also, you probably don't want to create more threads than there are cores availables.

See my question here (asking for a good Open Source multithreaded mergesort/quicksort/whatever). The one I'm using is proprietary, I can't paste it.

http://stackoverflow.com/questions/2210185/correctly-multithreaded-quicksort-or-mergesort-algo-in-java

I haven't implemented Mergesort but QuickSort and I can tell you that there's no array copying going on.

What I do is this:

pick a pivot
exchange values as needed
have we reached the thread limit? (depending on the number of cores)
- yes: sort first part in this thread
- no: spawn a new thread
sort second part in current thread
wait for first part to finish if it's not done yet (using a CountDownLatch).

The code spawning a new thread and creating the CountDownLatch may look like this:

            final CountDownLatch cdl = new CountDownLatch( 1 );
            final Thread t = new Thread( new Runnable() {
                public void run() {
                    quicksort(a, i+1, r );
                    cdl.countDown();
                }
            } };

The advantage of using synchronization facilities like the CountDownLatch is that it is very efficient and that your not wasting time dealing with low-level Java synchronization idiosynchrasies.

In your case, the "split" may look like this (untested, it is just to give an idea):

if ( threads.getAndIncrement() < 4 ) {
    final CountDownLatch innerLatch = new CountDownLatch( 1 );
    final Thread t = new Merger( innerLatch, b );
    t.start();
    mergeSort( a );
    while ( innerLatch.getCount() > 0 ) {
        try {
            innerLatch.await( 1000, TimeUnit.SECONDS );
        } catch ( InterruptedException e ) {
            // Up to you to decide what to do here
        }
    }
} else {
    mergeSort( a );
    mergeSort( b );
}

(don't forget to "countdown" the latch when each merge is done)

Where you'd replace the number of threads (up to 4 here) by the number of available cores. You may use the following (once, say to initialize some static variable at the beginning of your program: the number of cores is unlikely to change [unless you're on a machine allowing CPU hotswapping like some Sun systems allows]):

Runtime.getRuntime().availableProcessors()

Webinator 2010-05-21 07:21:02

+1 for busy-looping concept.

Bragboy 2010-05-21 07:25:21

oops silly me, I should have rewritten it instead of using your code: in the case where you do not spawn a new thread, it is pointless so split into 'a' and 'b' and then do a mergeSort(a) and mergeSort(b)... Simply mergeSort directly the whole array, before splitting.

Webinator 2010-05-21 08:01:42

Why on Earth would you put the call to CDL.await() in a while loop?Also, you're conditional (threads.getAndIncrement() < 4) would cause the 'count' of threads created to go up regardless of whether or not you spawn one. Likewise, you never really indicate when you reduce that count (though it could be assumed).

Tim Bender 2010-05-25 04:20:08

@Tim Bender: because a broken 3rd party API may be wrongly interrupt()ing me awaiting. In that case you *may* or *may not* want to keep waiting, which is why I put a huge comment saying *"//Up to you to decide what to do here"* (you could decide to decrement the latch and exit the loop, if you think the interrupt is legit). The number of threads is fine when it is incremented: it's only purpose is to not spawn more than 'x' threads. Seen that I'll never "roll over" 2**32-1 threads this won't overflow and the code will work just fine. Nitpick or help the OP: choose one. ;)

Webinator 2010-05-25 18:27:42

CountDownLatch is quite magical, and actually much better than thread.join(). I decided to go into quick sort as well. Being able to dynamically determine the number of processors is also a fantastic addition. I wouldn't have thought to start only one thread and then complete the other 'half' sequentially if you hadn't suggested it. So thanks, and have a belated bravo, webinator!

Robz 2010-08-04 21:20:33

Answer 4

+3 A:

As others said; This code isn't going to work because it starts no new threads. You need to call the start() method instead of the run() method to create new threads. It also has concurrency errors: the checks on the finished variable are not thread safe.

Concurrent programming can be pretty difficult if you do not understand the basics. You might read the book Java Concurrency in Practice by Brian Goetz. It explains the basics and explains constructs (such as Latch, etc) to ease building concurrent programs.

Julien Rentrop 2010-05-21 07:56:51

Answer 5

A:

Alright many months after the fact: I believe I have successfully completed this project, thanks to everyone here who contributed advice and suggestions!

I ended up conducting my work on a duel-core Macbook Pro. I achieved a decent percentage of speed increase (not quite the ideal 50%, but close to it) with both merge and quick sort algorithms when switching from their respective sequential functions to the threaded version. I also blew Arrays.sort(int[] a) 'out of the water' with a threaded quick sort method. I will post the code in another question (as this one is fairly old) and the link to it here shortly, asking if there is any other optimizations or rookie mistakes still there.

Thanks to you guys, I now really believe in parallel programming.

Robz 2010-08-04 21:28:46

http://stackoverflow.com/questions/3425126/java-parallelizing-quick-sort-via-multi-threading

Robz 2010-08-06 15:13:32

ansaurus

tags:

views:

answers:

Parallel processing via multithreading in Java

related questions