What is the BigO of linear regression? | ansaurus

+3 Q:

What is the BigO of linear regression?

How large a system is it reasonable to attempt to do a linear regression on?

Specifically: I have a system with ~300K sample points and ~1200 linear terms. Is this computationally feasible?

+2 A:

You can express this as a matrix equation:

A x = b

where the matrix A has 300K rows and 1200 columns, the coefficient vector x is 1200x1, and the RHS vector b is 300Kx1.

If you multiply both sides by the transpose of the matrix A, you get the normal equations (A^T A) x = A^T b, a 1200x1200 system of equations for the unknowns. You can use LU decomposition or any other algorithm you like to solve for the coefficients. (This is what least squares is doing.)
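As a rough illustration of the steps just described, here is a minimal pure-Python sketch that forms the square system from the transpose and solves it. The function name `normal_equations_fit`, the toy data, and the singularity threshold are assumptions made for this example. It uses Gaussian elimination with partial pivoting in place of a separate LU factorization. For a real problem you would more likely call a library routine such as `numpy.linalg.lstsq`.

```python
def normal_equations_fit(A, b):
    """Least-squares fit by forming and solving (A^T A) x = A^T b.

    A is a list of m rows, each a list of n floats; b is a list of m floats.
    (Hypothetical helper written for this example.)
    """
    n = len(A[0])

    # Form the n x n matrix A^T A and the n-vector A^T b.
    # Every one of the m rows touches all n*n entries: O(m * n^2) work.
    ata = [[0.0] * n for _ in range(n)]
    atb = [0.0] * n
    for row, rhs in zip(A, b):
        for i in range(n):
            atb[i] += row[i] * rhs
            for j in range(n):
                ata[i][j] += row[i] * row[j]

    # Gaussian elimination with partial pivoting on the augmented
    # system [A^T A | A^T b]: O(n^3) work.
    aug = [ata[i] + [atb[i]] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:  # assumed tolerance for this sketch
            raise ValueError("A^T A is (numerically) singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]

    # Back substitution: O(n^2) work.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - s) / aug[i][i]
    return x


# Toy data: y = 1 + 2*t sampled at t = 0, 1, 2, 3, with columns [1, t].
A = [[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, 3.0]]
b = [1.0, 3.0, 5.0, 7.0]
coeffs = normal_equations_fit(A, b)
print(coeffs)  # approximately [1.0, 2.0]
```

The nested loops make the costs visible: the first loop nest runs m times over an n-by-n update, and the elimination runs three nested loops over n.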

So the Big-O behavior is something like O(m*n*n), where m = 300K and n = 1200. Multiplying the transpose by the matrix costs O(m*n*n), the LU decomposition costs O(n^3), and the forward-back substitution costs O(n^2). Because m is much larger than n here, the matrix multiplication dominates.
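To put numbers on the question's sizes, the sketch below counts multiply-adds per step. The product of the transpose and the matrix has n*n entries, each a dot product of length m. The counts are order-of-magnitude estimates that ignore constant factors, and `assumed_rate` is a purely hypothetical throughput chosen for illustration, not a measurement of any machine.

```python
m = 300_000  # sample points (rows of the matrix), from the question
n = 1_200    # linear terms (columns of the matrix), from the question

form_product = m * n * n  # n*n entries, each a length-m dot product
form_rhs = m * n          # transpose times the RHS vector
factorize = n ** 3        # order of magnitude for an n x n factorization
substitute = n * n        # order of magnitude for forward-back substitution

total = form_product + form_rhs + factorize + substitute

# Memory for dense storage as 8-byte doubles.
matrix_bytes = m * n * 8
square_bytes = n * n * 8

# Hypothetical throughput, assumed only to turn a count into a time.
assumed_rate = 1e9  # multiply-adds per second
seconds = total / assumed_rate

print(f"form product : {form_product:.3e} multiply-adds")
print(f"factorize    : {factorize:.3e} multiply-adds")
print(f"total        : {total:.3e} multiply-adds")
print(f"dense matrix : {matrix_bytes / 1e9:.2f} GB")
print(f"square system: {square_bytes / 1e6:.2f} MB")
print(f"time at assumed rate: {seconds:.0f} s")
```

With these sizes the matrix product is about 4.3e11 multiply-adds and the factorization about 1.7e9, so the product accounts for nearly all of the work. The dense 300K x 1200 matrix needs about 2.88 GB as doubles, while the 1200x1200 system needs only about 11.5 MB.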

duffymo 2009-12-23 20:49:16

So, if I'm reading that correctly (and IIRC), generating A will be O(n*m) ~= O(m^2) (in my case `n/m=C`), the multiplication will be O(n*n*m) ~= O(n^3), and the inversion will be O(n^3). Now just to figure out the constant factor.

BCS 2009-12-23 21:05:25
