ansaurus

Question

Unable to run OpenMPI across more than two machines

Answer 1

+1 A:

My first recommendation would be to simplify:

can you build the standard MPI 'hello, world' example?
can you run it several times on localhost?
can you execute on the other host via ssh
is the path identical

and if so, then

mpirun -H host1,host2,host3 -n 12 ./helloworld

should travel across. Once you have these basics sorted out, try the Boost tutorial ... and make sure you have Boost and MPI libraries on all hosts you plan to run on.

Dirk Eddelbuettel 2010-03-22 19:57:09

@Dirk: thanks for the suggestions. I've updated my question with the results of these observations.

rcollyer 2010-03-23 14:35:48

Answer 2

+2 A:

The answer turned out to be simple: open mpi authenticated via ssh and then opened up tcp/ip sockets between the nodes. The firewalls on the compute nodes were set up to only accept ssh connections from each other, not arbitrary connections. So, after updating iptables, hello world runs like a champ across all of the nodes.

Edit: It should be pointed out that the fileserver's firewall allowed arbitrary connections, so that was why an mpi program run on it would behave differently than just running on the compute nodes.

rcollyer 2010-04-06 17:21:49

ansaurus

tags:

views:

answers:

Unable to run OpenMPI across more than two machines

related questions