ansaurus

Question

Incremental PCA

Answer 1

+2 A:

There's the NIPALS algorithm (Non-linear Iterative Partial Least Squares) for computing the first few PCs without a slow full SVD: see PyMVPA and MDP.
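For readers who have not seen NIPALS, here is a minimal batch sketch in NumPy. It is not the PyMVPA or MDP implementation; the function name `nipals_pca` and its parameters are made up for illustration, and it assumes the whole matrix fits in memory and has at least `n_components` directions of non-zero variance.

```python
import numpy as np

def nipals_pca(X, n_components=2, max_iter=500, tol=1e-10):
    """Hypothetical helper: leading PCs of X (rows = observations) via NIPALS.

    Returns (T, P): scores of shape (n, k) and unit-norm loadings of shape (d, k).
    """
    X = np.asarray(X, dtype=float)
    R = X - X.mean(axis=0)               # centered residual matrix
    n, d = R.shape
    T = np.zeros((n, n_components))
    P = np.zeros((d, n_components))
    for k in range(n_components):
        # Start the score vector from the highest-variance column.
        t = R[:, np.argmax(R.var(axis=0))].copy()
        for _ in range(max_iter):
            p = R.T @ t                  # project columns onto the scores
            p /= np.linalg.norm(p)       # unit-norm loading vector
            t_new = R @ p                # updated scores
            converged = np.linalg.norm(t_new - t) <= tol * np.linalg.norm(t_new)
            t = t_new
            if converged:
                break
        T[:, k] = t
        P[:, k] = p
        R = R - np.outer(t, p)           # deflate: remove this component
    return T, P
```

Each inner loop is a power iteration on the residual, so the cost per component is a handful of matrix-vector products rather than a full decomposition.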

@jetxee: agree; but I think there'd be good market for SO-like forums in many other areas --
SO is relatively fun to use, and attracts some really good people.

Added 10 May, on incremental NIPALS: say you have chunks G1 .. G100 of 1 Gbyte each, and want the PC (or any function at all) of the lot.
Two obvious approaches:

compute PC1 .. PC100 chunk by chunk, then average them (or keep a moving average)
model the distribution of G1 with, say, 1000 bins (a kd-tree?), then bin the remaining chunks the same way -- google "dynamic quantiles".
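A rough sketch of the first approach, assuming each chunk fits in memory on its own and that `chunks` is any iterable yielding 2-D arrays (rows = observations). The names `first_pc` and `averaged_first_pc` are hypothetical, each chunk is centered by its own mean, and a per-chunk SVD stands in for whatever PC routine you prefer. The result is only an approximation to the PC of the pooled data.

```python
import numpy as np

def first_pc(chunk):
    """Leading principal direction of one chunk."""
    centered = chunk - chunk.mean(axis=0)
    # Rows of vt are principal directions, ordered by singular value.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return vt[0]

def averaged_first_pc(chunks):
    """Running average of the per-chunk first PCs, returned as a unit vector."""
    mean_pc = None
    for i, chunk in enumerate(chunks, start=1):
        pc = first_pc(np.asarray(chunk, dtype=float))
        if mean_pc is None:
            mean_pc = pc.copy()
            continue
        if pc @ mean_pc < 0:             # a PC is only defined up to sign
            pc = -pc
        mean_pc += (pc - mean_pc) / i    # incremental mean, one chunk at a time
    return mean_pc / np.linalg.norm(mean_pc)
```

Since `chunks` can be a generator that loads one file at a time, only one chunk needs to be in memory at once.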

I'd be wary though -- "we have more data with less validation than ever before."

Denis 2010-05-04 12:37:08

Denis, the question is whether it is incremental or not. In the references you provided, is there a way to feed in the observations one at a time? Most implementations of the NIPALS algorithm I have seen require the whole data matrix, which may be too big to fit in memory.

smichak 2010-05-10 06:33:48

Answer 2

A:

@Micha, I am looking to implement incremental PCA in Python too. I have gone through another paper, "Candid Covariance-free Incremental Principal Component Analysis" by Juyang Weng et al.

I will check out your python code too. Kindly let me know if you have any updates.
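For what it's worth, here is my rough reading of the update rule from that paper as a NumPy sketch that takes one observation at a time. The class name `CCIPCA` and the method `partial_fit` are my own naming, not from the paper or from any library. This version centers with a running mean and leaves out the paper's amnesic parameter (equivalent to setting it to zero), so treat it as a starting point rather than a faithful implementation.

```python
import numpy as np

class CCIPCA:
    """Sketch of candid covariance-free incremental PCA (amnesic parameter omitted)."""

    def __init__(self, n_components):
        self.k = n_components
        self.n = 0
        self.mean = None
        self.V = None    # row i: unnormalized estimate of eigenvector i

    def partial_fit(self, x):
        """Update the estimates with a single 1-D observation x."""
        x = np.asarray(x, dtype=float)
        if self.mean is None:
            self.mean = np.zeros_like(x)
            self.V = np.zeros((self.k, x.size))
        self.n += 1
        n = self.n
        self.mean += (x - self.mean) / n     # running mean
        u = x - self.mean                    # centered sample / current residual
        for i in range(self.k):
            v = self.V[i]
            norm = np.linalg.norm(v)
            if norm == 0.0:
                self.V[i] = u                # not started yet: seed with the residual
                break
            # Weighted average of the old estimate and u (u . v / |v|).
            v = (n - 1) / n * v + (1.0 / n) * (u @ v) / norm * u
            self.V[i] = v
            unit = v / np.linalg.norm(v)
            u = u - (u @ unit) * unit        # remove this direction before the next one
        return self

    @property
    def components_(self):
        """Unit-norm eigenvector estimates, one per row (started rows only)."""
        norms = np.linalg.norm(self.V, axis=1)
        started = norms > 0
        return self.V[started] / norms[started, None]
```

Usage is just a loop: `model = CCIPCA(2)`, then `model.partial_fit(x)` for each incoming row. Nothing larger than a `k` by `d` array is ever stored, and the norm of each row of `V` serves as a running estimate of the corresponding eigenvalue.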

Varun

Raj 2010-09-23 09:52:19
