views:

633

answers:

4

i'm confused about the notion of "spark" Is it a thread in haskell ? or is the action of spawning a new thread ?

Thanks everybody: So to summarize, sparks are not thread but more of unit of computation (tasks to put it in C#/Java terms). So it's the haskell way of implementing the task parallelism

+10  A: 

Yes, the equivalent of spawning a thread in Haskell is called "sparking". A spark is not so much an actual thread as a computation that may be run in a thread. See A Gentle Introduction to Glasgow Parallel Haskell.

Parallelism is introduced in GPH by the par combinator, which takes two arguments that are to be evaluated in parallel. The expression p `par` e (here we use Haskell's infix operator notation) has the same value as e, and is not strict in its first argument, i.e. bottom `par` e has the value of e. (bottom denotes a non-terminating or failing computation.) Its dynamic behaviour is to indicate that p could be evaluated by a new parallel thread, with the parent thread continuing evaluation of e. We say that p has been sparked, and a thread may subsequently be created to evaluate it if a processor becomes idle. Since the thread is not necessarily created, p is similar to a lazy future.

[Emphasis in original]

Apocalisp
As I read it, a "spark" is not so much a thread as it is a computation which may be run in a thread. The runtime might have a fixed ceiling limit on threads which rotate through and evaluate sparks.
ephemient
That's how I understand it too. Let me reword a bit.
Apocalisp
Betterer, thanks.
Apocalisp
+17  A: 

Sparks are not threads. forkIO introduces Haskell threads (which map down onto fewer real OS threads). Sparks create entries in the work queues for each thread, from which they'll take tasks to execute if the thread becomes idle.

As a result sparks are very cheap (you might have billions of them in a program, while you probably won't have more than a million Haskell threads, and less than a dozen OS threads on half a dozen cores).

Think of it like this:

    K sparks
     | | ...  | |
 M Haskell threads
       | .. |
N OS threads
      | | | |
   4 cores
Don Stewart
+2  A: 

If I understand it correctly, a spark is an entry in a queue of jobs requiring work. A pool of threads take entries from this queue and runs them. Typically there is one thread per physical processor, so this scheme maximises throughput and minimizes thread context switching.

Paul Johnson
A: 

It looks like it is similar to a "task" in Intel Threading Building Blocks.

Aftershock