Few years ago, under Windows environment, I did some testing, by letting multiple instance of CPU computation intensive + Memory access intensive + I/O access intensive application run. I develop 2 versions. One is running under multi-process, another is running under multi-thread.
I found the performance is much better for multi-process. I read somewhere else (I cann't remember the site).
Which states that the reasons is due to under multi-thread, they are "fighting" for a single memory pipeline and I/O pipeline, which makes the performance is worst than multi-process
However, I cann't find that article anymore. I was wondering, till today, whether the below still hold true?
Under Windows, having the algorithm code to run under multi-process, high chance that, the performance will be better than multi-thread