views: 37

answers: 3

Dear Overflowers:

At work we perform demanding numerical computations.

We have a network of several Linux boxes with different processing capabilities. At any given time, there can be anywhere from zero to dozens of people connected to a given box.

I created a script to measure MFLOPS (Millions of Floating-Point Operations per Second) using the Linpack benchmark; it also reports the number of cores and the amount of memory.

I would like to use this information, together with the load average (obtained from the uptime command), to suggest the best computer for performing a demanding computation. In other words: it's 3:00 pm; I have a meeting in two hours; I need to run a demanding process: which node will get me the answer fastest?

I envision a script which will output a suggestion along the lines of:

SUGGESTED HOSTS (IN ORDER OF PREFERENCE)
HOST1.MYNETWORK
HOST2.MYNETWORK
HOST3.MYNETWORK

Such a suggestion should favor fast computers (high MFLOPS) when the load average is low and, as the load average increases for a given node, it should favor available nodes instead (i.e., I'd rather run on a slower computer with no users than on an eight-core with forty dudes logged in).

How should I prioritize? What algorithm (rationale) would you use? Again, what I have is (a rough sketch of how I collect these follows the list):

  1. Load average (1 min, 5 min, 15 min)
  2. MFLOPS measurement
  3. Number of users logged in
  4. RAM (installed and available)
  5. Number of cores (important for normalizing the load average)
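
For reference, here is a rough sketch (Python) of how I gather these numbers on each node. The parsing assumes typical GNU/Linux output from nproc, free, who, and hostname, and the MFLOPS value is whatever my Linpack script reported, hard-coded here for illustration:

    #!/usr/bin/env python3
    # Sketch only: collect the per-node numbers listed above.
    # Assumes typical GNU/Linux output from nproc, free, who, hostname;
    # older versions of `free` lack the "available" column, so adjust as needed.
    import os
    import subprocess

    def run(cmd):
        return subprocess.check_output(cmd, shell=True).decode().strip()

    def node_stats(mflops):
        load1, load5, load15 = os.getloadavg()        # 1-, 5-, 15-minute load averages
        cores = int(run("nproc"))                     # logical processors
        users = len({line.split()[0] for line in run("who").splitlines()})
        mem = run("free -m").splitlines()[1].split()  # "Mem:" row, in MiB
        return {
            "host": run("hostname"),
            "mflops": mflops,                         # from the Linpack script
            "load": (load1, load5, load15),
            "cores": cores,
            "users": users,
            "ram_mb": (int(mem[1]), int(mem[-1])),    # installed, available
        }

    if __name__ == "__main__":
        print(node_stats(mflops=4200.0))              # hypothetical Linpack result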

Any thoughts? Thanks!

A: 

Have you considered a distributed approach to the computation? Not all computations can be broken up so that more than one CPU can work on them, but perhaps your problem space can benefit from some parallelization. Have a look at Hadoop.

Asaph
Unfortunately, at this point our software is not a candidate for distributed computing. However, I have looked into Hadoop, and I am pretty sure that Map/Reduce will become a hot topic in our area in the near future - I may even use it for some file processing we've been working on. Thanks!
Arrieta
+1  A: 

You don't have enough data to make a well-informed decision. It sounds as though the scheduling is very volatile: "At any given time, there can be anywhere from zero to dozens of people connected to a given box." So the current load does not necessarily reflect the future load of the machines.

To properly assess which hosts someone should use to minimize computation time, you would need to know when the current jobs will terminate. If a powerful machine is about to finish most of its jobs, it would be a good candidate even though it currently has a high load.

If you want to guess purely from the current situation, you can do a weighted calculation to find out which hosts have the most MFLOPS available.

available MFLOPS = host's MFLOPS * (number of logical processors - load average) / number of logical processors

Sort the hosts by available MFLOPS and suggest them in descending order.

This formula assumes that the MFLOPS of a host is linearly related to its load average. This might not be exactly true, but it's probably fairly close.

I would favor the most recent load average, since it's closer to the current/future situation, whereas jobs from 15 minutes ago might have completed by now.
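
A minimal sketch of that ranking (Python), assuming the per-host numbers have already been collected into dicts; the field names and figures are illustrative:

    # Rank hosts by estimated available MFLOPS (linear-scaling assumption).
    def available_mflops(h):
        free_cores = max(0.0, h["cores"] - h["load1"])  # clamp if overloaded
        return h["mflops"] * free_cores / h["cores"]

    def suggest(hosts):
        print("SUGGESTED HOSTS (IN ORDER OF PREFERENCE)")
        for h in sorted(hosts, key=available_mflops, reverse=True):
            print(h["host"])

    # Made-up numbers: the idle four-core beats the busy eight-core.
    suggest([
        {"host": "HOST1.MYNETWORK", "mflops": 9000.0, "cores": 8, "load1": 7.5},
        {"host": "HOST2.MYNETWORK", "mflops": 2500.0, "cores": 4, "load1": 0.2},
    ])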

Ben S
I like your point, and I hope that by starting with this simple script I will reduce the volatility of the node scheduling - if more people use the script, it will help balance the load. As for the task duration, you are completely right, and I will find a way to factor that in (we do have an estimate for some tasks). I like your approach on "available MFLOPS". Thanks!
Arrieta
You could calculate the approximate MFLO (millions of floating-point operations) of a calculation by running it by itself on a host with a known MFLOPS rating and multiplying the number of seconds the calculation ran by the MFLOPS of the host. Once you have that, you can estimate how long a calculation will take, given the available MFLOPS of a host. For bonus points, make a simple web app to manage the scheduling so that users can schedule calculations of varying priority.
Ben S
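
(To make that estimate concrete, a small worked example; both MFLOPS figures here are purely hypothetical:)

    # Calibrate the job once on a known host, then predict its runtime elsewhere.
    calibration_mflops = 2000.0     # known MFLOPS of the calibration host
    observed_seconds   = 600.0      # the job ran 10 minutes there by itself
    job_mflo = calibration_mflops * observed_seconds   # ~1.2e6 MFLO of work

    target_available_mflops = 5000.0                    # from the formula above
    estimated_seconds = job_mflo / target_available_mflops   # ~240 s
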
A: 

You don't need to know the FLOPS. Beowulf handles this; the parallel computing center I go to has the script, for sure.

PDC operates leading-edge, high-performance computers on a national level. PDC offers easily accessible computational resources that primarily cater to the ...

LarsOn
I'm sorry, I don't understand your answer.
Arrieta
Link is beowulf.org
LarsOn