ansaurus

Question

Parallelize or vectorize all-against-all operation on a large number of matrices?

Answer 1

A:

If I understand correctly, you have to perform 5000^2 matrix comparisons? Rather than trying to parallelise the compare function, perhaps you should think of your problem as being composed of 5000^2 tasks. The MATLAB Parallel Computing Toolbox (PCT) supports task-based parallelism. Unfortunately, my experience with PCT is with parallelising large linear-algebra problems, so I can't really tell you much more than that. The documentation will undoubtedly help you more.
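A minimal sketch of the "one task per comparison" idea, under stated assumptions: `data` is a small made-up cell array and `compare` is a hypothetical stand-in for the real comparison function. Every (row, column) pair becomes one iteration of a flat parfor loop, so the loop variable is the only index into the output.

```matlab
%# Assumed toy inputs; replace with the real data and compare function.
k = 4;
data = arrayfun(@(n) magic(3) + n, 1:k, 'UniformOutput', false);
compare = @(a, b) a - b;               %# hypothetical stand-in

[rowIdx, colIdx] = ndgrid(1:k, 1:k);   %# all k^2 (row, column) pairs
nTasks = numel(rowIdx);
results = cell(nTasks, 1);

parfor t = 1:nTasks
    %# feval avoids parfor mistaking the function handle for an indexed variable
    results{t} = feval(compare, data{rowIdx(t)}, data{colIdx(t)});
end

H = reshape(results, k, k);            %# H{r,c} holds compare(data{r}, data{c})
```

Without an open worker pool the loop simply runs serially, so the same code can be checked on a single machine first.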

High Performance Mark 2010-05-20 09:44:00

Answer 2

+1 A:

The second example can easily be sliced for use with the Parallel Computing Toolbox. This toolbox distributes the iterations of your loop among up to 8 local workers. If you want to run the code on a cluster, you also need the MATLAB Distributed Computing Server.

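%# who(), load() and struct2cell() calls place k data matrices in a 1D cell array called data.
data = data(:); %# force a column so both cellfun inputs have the same shape
H = cell(k,k);  %# preallocate the output

parfor i=1:k-1 %# this will run the loop in parallel with the Parallel Computing Toolbox
    %# only make the necessary comparisons (below the diagonal)
    %# first way, kept for reference: parfor rejects it because H is not indexed by i alone
    %# H(i+1:k,i) = cellfun(@compare,data(i+1:k),repmat(data(i),k-i,1),'UniformOutput',false);

    %# second way: build the whole column locally, then assign it in one sliced step
    hSlice = cell(k,1);
    hSlice(i+1:k) = cellfun(@compare,data(i+1:k),repmat(data(i),k-i,1),'UniformOutput',false);
    H(:,i) = hSlice;
end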

Jonas 2010-05-20 11:21:35

"PARFOR loop cannot run due to the way nextData is used." Also it doesn't want to slice data (which is a 1D cell array).

reve_etrange 2010-05-20 11:31:22

It won't slice `data` because you use the entire array `data` in your call to cellfun. Also, I fixed the problem with `nextData`.

Jonas 2010-05-20 12:33:17

Thank you... now M-Lint is happy. But can repmat return a cell array that repeats an element of data? It is concatenating them all together and creating one large matrix.

reve_etrange 2010-05-20 20:25:52

@reve_etrange: D'oh! You need to do repmat with data(i) instead of data{i}. Fixed.

Jonas 2010-05-20 21:10:57

@reve_etrange: I've also updated the solution so that you only make the necessary comparisons

Jonas 2010-05-20 21:16:39

Thanks again, Jonas. The first way still has an issue with the indexing of H; I think that in a parfor the loop variable must be used as only one index. Also, data (first occurrence) is still not sliced. I guess that because each iteration is sent two non-overlapping parts of data, it wants to send the entire thing?

reve_etrange 2010-05-21 07:33:30

@reve_etrange: `data` would be sliced only if it were indexed exclusively with `i`. If more than that is needed, all of `data` is passed to each worker and the variable is not sliced. A variable not being sliced is not necessarily a problem. Also, I thought the first way might be problematic, so I suggested the second one. I'm glad it works. Note that it only makes the comparisons below the diagonal.
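To illustrate the slicing rule described here, a small sketch with made-up toy data (the variable names `others` and `colSum` are hypothetical). Because `data` is indexed with more than just `i`, parfor sends the whole array to every worker, while the output `colSum` is indexed by `i` alone and is sliced.

```matlab
k = 5;
data = num2cell(1:k);          %# assumed toy data: the scalars 1..k in a cell array
colSum = zeros(1, k);

parfor i = 1:k
    %# data{i} alone would allow slicing; data{i+1:k} needs other elements,
    %# so all of data is broadcast to each worker
    others = [data{i+1:k}];
    colSum(i) = data{i} + sum(others);   %# colSum is a sliced output variable
end
```

The result is the same either way; the broadcast only costs memory and transfer time, which matters when `data` is large.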

Jonas 2010-05-21 11:00:32

I just calculated the memory usage for duplicate copies of the data; it's no problem. Now to combine with the out-of-memory version...in which case the triangular indexing remains essential.

reve_etrange 2010-05-21 11:59:57

Answer 3

+3 A:

Do both of the following hold?

compare(a,b) == compare(b,a)

and

compare(a,a) == 1

If so, change your loop

for i=1:numel(list)
    for j=1:numel(list)
        ...
    end
end

to

for i=1:numel(list)
    for j=i+1:numel(list)
        ...
    end
end

and handle the symmetric and identity cases separately. For n matrices this reduces the number of comparisons from n^2 to n(n-1)/2, which cuts your calculation time roughly in half.
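As a rough sketch of how the symmetric and identity cases could be filled in, assuming a scalar-valued comparison: the `compare` handle and the `list` contents below are hypothetical and chosen only so that both conditions above hold.

```matlab
%# Assumed toy inputs; compare is symmetric and compare(a,a) == 1 by construction.
list = {magic(3), eye(3), ones(3), zeros(3)};
compare = @(a, b) 1 / (1 + norm(a - b, 'fro'));

n = numel(list);
S = eye(n);                 %# identity case: diagonal set to 1 without calling compare
nCalls = 0;
for i = 1:n
    for j = i+1:n
        S(i,j) = compare(list{i}, list{j});
        S(j,i) = S(i,j);    %# symmetric case: mirror instead of recomputing
        nCalls = nCalls + 1;
    end
end
```

If the comparison returns a matrix and the mirrored result is its transpose rather than an identical copy, the mirrored entry would be stored as the transpose in a cell array instead.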

MatlabDoug 2010-05-20 13:25:33

I should have seen that. +1

Jonas 2010-05-20 13:47:14

The symmetric cases are transposes of each other, and the identity case is not 1 but is not useful anyway. Thanks, I think this will help with my memory troubles as well.

reve_etrange 2010-05-20 20:31:00
