views: 398

answers: 6
I have a PHP script that generates a report with PHPExcel from data queried from a MySQL DB. Currently the processing is linear: it fetches the data from MySQL, reads in the Excel template, writes the data to the template, then outputs it. I have optimized the code to the point that the data is iterated over only once, and very little processing is done on the PHP side. The query returns hundreds of rows in less than 0.001 seconds, so it is running fast enough. After some timing I have found my bottlenecks to be (surprise, surprise) reading the template and writing the output. I would like to do this:

Spawn a thread/process to read the template
Spawn a thread/process to fetch the data
Return to the parent thread, which will wait until both are complete
Proceed on as normal

My main questions: is this possible, and is it worth it? If yes to both, how would you tackle it? Also, this is PHP 5 on CentOS.

+1  A: 

I would try to figure out whether you can cache the template, or store it in some faster-to-read format. I don't know if that's possible, but the PHPExcel forum is pretty good and is watched by the developers.
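One generic way to apply the caching idea, assuming it is acceptable to serve a recently generated copy of the report instead of rebuilding it on every request, is a simple mtime-based file cache. This is an illustrative sketch, not a PHPExcel feature; cached_report() and its parameters are made-up names:

    // Sketch: regenerate the report only when the cached copy is older
    // than $ttl seconds; otherwise reuse the bytes already on disk.
    // cached_report() is an illustrative helper, not a PHPExcel API.
    function cached_report($cacheFile, $ttl, $generate)
    {
        if (is_file($cacheFile) && (time() - filemtime($cacheFile)) < $ttl) {
            return file_get_contents($cacheFile);   // fresh enough: reuse it
        }
        $data = $generate();                        // slow path: rebuild report
        file_put_contents($cacheFile, $data);
        return $data;
    }

In your scenario $generate would load the template, fill it from MySQL, and return the spreadsheet bytes; both slow steps are then skipped entirely while the cache is fresh.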

Scott Saunders
A: 

You can definitely spawn processes on CentOS with PHP (http://php.net/manual/en/function.pcntl-fork.php). Before doing that, though, I'd consider at least one thing: if the bottleneck is reading the template and writing the output, it may be a purely I/O-bound issue, in which case multiple processes won't help much. Personally, I'd try to see whether some caching is possible instead.

maraspin
+1  A: 

You can't multithread, but you can fork (pcntl_fork, pcntl_wait). As I'm sure you know, you'll want to test the process spawn times carefully to make sure this is even worth it in your situation.

$pid = pcntl_fork();

if ($pid == -1) {
  // fork failed

} elseif ($pid > 0) {
  // we're the parent! Wait for child to finish
  pcntl_waitpid($pid, $status);

} else {
  // we're the child
}
webbiedave
+2  A: 

It is generally not a good idea to fork an Apache process; that can cause undetermined results. Instead, some kind of queuing mechanism is preferable. Gearman is an open source queuing mechanism you can use. I also have a blog post on the Zend Server Job Queue that talks about running tasks asynchronously: Do you queue? Introduction to the Zend Server Job Queue.

You could also use something like the Zend Framework Queuing classes to implement some of the asynchronous work. Zend_Queue
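To make the queue suggestion concrete, here is a minimal sketch of handing report generation to Gearman. It assumes the pecl gearman extension is installed, a gearmand server is reachable on localhost, and a worker is registered under the hypothetical function name 'build_report' (that name is an assumption, not part of Gearman):

    // Sketch: push report generation onto a Gearman queue instead of
    // forking the Apache process. Assumes the pecl gearman extension,
    // a local gearmand, and a worker named 'build_report' (hypothetical).
    function queue_report($reportParams)
    {
        $client = new GearmanClient();
        $client->addServer('127.0.0.1', 4730);   // default gearmand port
        // Fire-and-forget: returns a job handle immediately while the
        // worker builds the spreadsheet in the background.
        return $client->doBackground('build_report', json_encode($reportParams));
    }

The web request returns right away with the job handle; the worker process does the slow template read and write without tying up Apache.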

@Swisstack, I will also disagree with your assertion that PHP is not created for high performance. Language features are very seldom the cause of slow performance. Perhaps a raw language test comparing $a++ among different languages would show that, but that type of testing is irrelevant. I've done consulting on PHP for several years and I have never seen a performance problem that was due to the language itself.

Kevin Schroeder
+1 Good to mention. I'm assuming he's not running this via web server.
webbiedave
Well, one can't assume too much. But even so, a job queue of some kind is usually a better road to go. It's more manageable and such.
Kevin Schroeder
+1  A: 

If both reading the template AND the db query were slow, then I'd say there's a decent chance that worthwhile performance could be gained by running the tasks in parallel. But, as you said yourself, reading the template is slow and the db query is fast. So, even ignoring the additional overhead introduced by running the tasks in parallel, in the best case you stand to save 0.001 seconds (the time needed for the db query).

Running tasks in parallel still takes at least the time of the slowest task; running them in series takes the sum of all tasks. In your case that's max(templateTime, 0.001) versus templateTime + 0.001: practically the same.

Not worth it imo.

Usually the database is the turtle in the equation, and you can do that part async without too much effort. See the newly added mysqli_poll() and related functions.

chris
A: 

Read the template once, then do a clone for each workbook that you need to create from the data.
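As a sketch of that approach, where $templatePath and $reports are assumed names for illustration, the expensive disk read happens once and only the cheap in-memory copy is repeated:

    // Sketch: load the template from disk once, then clone the in-memory
    // workbook for each report. $templatePath and $reports are assumptions.
    function build_workbooks($templatePath, array $reports)
    {
        $template  = PHPExcel_IOFactory::load($templatePath);   // slow: once
        $workbooks = array();
        foreach ($reports as $reportData) {
            $workbook = clone $template;    // cheap copy of the loaded object
            // ... write $reportData into $workbook, then output/save it ...
            $workbooks[] = $workbook;
        }
        return $workbooks;
    }

PHP's clone keyword gives each iteration its own workbook object, so writes to one report don't leak into the next.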

Mark Baker