ansaurus

Question

Dimension Reduction

Answer 1

+3 A:

SVD is a fine approach (probably). LSA (Latent Semantic Analysis) is based around it, and has basically the same dimensionality approach. I've talked about that (at length) at: lsa-latent-semantic-analysis-how-to-code-it-in-php or check out the LSA tag here on SO.

I realize it's an incomplete answer. Holler if you want more help!

Gregg Lind 2009-10-08 15:53:11

Thanks, that was helpful. In order to turn U into U', do I simply truncate everything after the second column, or is it fancier than that?

PlexLuthor 2009-10-08 15:57:53

I'm pretty sure it's exactly that simple (assuming matlab orders the columns such that the cols and eigenvals correspond)

Gregg Lind 2009-10-08 16:07:43

Ok. I just played around with it in the way I thought you said it would work, but I still can't take new 3-d data and get the 2-d projection without recalculating the whole UxSxV set. Did I miss something in LSA? That is, I have X (15x3), U, S, V, U', S', V', and now I get A (1x3). What should I do to get a 1x2 version of A?

PlexLuthor 2009-10-08 16:40:17

Duh, divide by V* is what I was looking for. I don't know why I missed that earlier.

PlexLuthor 2009-10-08 17:00:27

It sounds like you have it quite well in hand :) I can never remember the exact formulae, so I just noodle around until I get the right size end matrix, just as you are!

Gregg Lind 2009-10-08 18:03:40

Answer 2

+1 A:

% generate some random data (each row is a d-dimensional datapoint)
%data = rand(200, 4);
load fisheriris
data = meas;        % 150 instances of 4-dim

% center data
X = bsxfun(@minus, data, mean(data));

% SVD
[U S V] = svd(X, 'econ');       % X = U*S*V''

% lets keep k-components so that 95% of the data variance is explained
variances = diag(S).^2 / (size(X,1)-1);
varExplained = 100 * variances./sum(variances);
index = 1+sum(~(cumsum(varExplained)>95));

% projected data = X*V = U*S
newX = X * V(:,1:index);
biplot(V(:,1:index), 'scores',newX, 'varlabels',{'d1' 'd2' 'd3' 'd4'});

% mapping function (x is a row vector, or a matrix with multiple rows vectors)
mapFunc = @(x) x * V(:,1:index);
mapFunc([1 2 3 4])

Amro 2009-10-09 01:08:57

Answer 3

A:

I don't think there's a built-in way to update an existing SVD within Matlab. I google'd for "SVD update" and found this paper among the many results.

Victor Liu 2009-10-09 01:23:51

ansaurus

tags:

views:

answers:

Dimension Reduction

related questions