ansaurus

Question

Multi threaded FFTW 3.1.2 on a shared memory computer

Answer 1

+2 A:

This looks like it could be a synchronisation problem. You can get this type of behaviour if all threads except one are locked out, e.g. by a semaphore guarding a library call.

How are you calling the planner? Are all your function calls correctly synchronised? Are you creating the plans in a single thread or on all threads? I assume you've read the notes on thread safety in the FFTW docs... ;)

ire_and_curses 2009-09-16 21:23:16

Answer 2

A:

Thank you for your answer. I am not sure I understand everything. I am a student in Atmospheric Sciences, and this is the first time I have heard of a semaphore. I have tried to read up on it, but I still don't understand what I could be doing wrong. Here is the piece of code where I create the plans. I follow this page:

    call sfftw_init_threads(err)
    print*, err

    call sfftw_plan_with_nthreads(nproc)

    call sfftw_plan_many_dft_r2c(plan_RK,2,size,9,DYN_R,
 .  inembed,1,idist,DYN_K,onembed,1,odist,FFTW_PATIENT)

    call sfftw_plan_many_dft_c2r(plan_KR,2,size,9,DYN_K,
 .  onembed,1,odist,DYN_R,inembed,1,idist,FFTW_PATIENT)

I then use those plans as often as needed, i.e. at every time step (finite-difference pseudo-spectral model), with an sfftw_execute call. DYN_K and DYN_R are in common blocks. Is this the problem?

Would it be a solution to create one plan for each of the 9 transforms?

Thank you for helping a newbie!

Fridooo 2009-10-08 00:36:45

Answer 3

A:

Unless your FFTs are pretty large, the automatic multithreading in FFTW is unlikely to be a win speed-wise. The synchronization overhead inside the library can dominate the computation being done. You should profile different sizes and see where the break-even point is.
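
One way to run such a measurement is sketched below. It is written in C, the language of FFTW's native API, and deliberately calls no FFTW function. The names wall_seconds, best_time, break_even_index and spin_work are hypothetical helpers, and spin_work is only a stand-in for a wrapper around your own execute call. It assumes a POSIX system, since clock_gettime is not part of standard C. The point it tries to illustrate is that threaded code should be timed with a wall clock: on most Unix systems a CPU-time counter such as clock() adds up the time spent in every thread, which can make a faster threaded run look slower.

```c
#define _POSIX_C_SOURCE 199309L
#include <assert.h>
#include <time.h>

/* Elapsed (wall-clock) seconds from a monotonic clock. */
double wall_seconds(void)
{
    struct timespec ts;
    clock_gettime(CLOCK_MONOTONIC, &ts);
    return (double)ts.tv_sec + 1e-9 * (double)ts.tv_nsec;
}

/* Call work(arg) `reps` times and return the fastest single run in
   seconds. Taking the minimum reduces noise from other processes. */
double best_time(void (*work)(void *), void *arg, int reps)
{
    assert(reps > 0);
    double best = -1.0;
    for (int i = 0; i < reps; ++i) {
        double t0 = wall_seconds();
        work(arg);
        double dt = wall_seconds() - t0;
        if (best < 0.0 || dt < best)
            best = dt;
    }
    return best;
}

/* Index of the first problem size at which the threaded timing is
   lower than the serial one, or -1 if that never happens. Both arrays
   are assumed to be ordered by increasing problem size. */
int break_even_index(const double *t_serial, const double *t_threaded,
                     int count)
{
    for (int i = 0; i < count; ++i)
        if (t_threaded[i] < t_serial[i])
            return i;
    return -1;
}

/* Hypothetical stand-in workload: replace it with a function that
   executes your already-created plan. *arg is the loop count. */
void spin_work(void *arg)
{
    long n = *(const long *)arg;
    volatile double acc = 0.0;
    for (long i = 0; i < n; ++i)
        acc += (double)i;
}
```

To use it, create the plan once for each size and thread count, time only the execute call with best_time, and pass the two arrays of timings to break_even_index. Planning time should be kept out of the measurement, since a FFTW_PATIENT plan can take far longer to create than to execute. The same caveat should apply in Fortran: system_clock measures elapsed time, whereas cpu_time reports processor time.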

xscott 2010-09-16 07:18:36
