ansaurus

Question

Return value from column indicated in same row.

Answer 1

+2 A:

t(data[,1:30])[30*(0:399999)+data[,31]]

This works because you can reference matricies both in array format, and vector format (a 400000*31 long vector in this case) counting column-wise first. To count row-wise, you use the transpose.

James 2010-07-07 10:20:38

Answer 2

A:

Singe-index notation for the matrix may use less memory. This would involve doing something like:

i <- nrow(data)*(data[,31]-1) + 1:nrow(data)
a <- data[i]

Below is an example of single-index notation for matrices in R. In this example, the index of the per-row maximum is appended as the last column of a random matrix. This last column is then used to select the per-row maxima via single-index notation.

## create a random (10 x 5) matrix                                                                                                                           
M <- matrix(rpois(50,50),10,5)
## use the last column to index the maximum value of the first 5                                                                                             
## columns                                                                                                                                                   
MM <- cbind(M,apply(M,1,which.max))
##             column ID          row ID                                                                                                                     
i <- nrow(MM)*(MM[,ncol(MM)]-1) + 1:nrow(MM)
all(MM[i] == apply(M,1,max))

Using an index matrix is an alternative that will probably use more memory but is slightly clearer:

ii <- cbind(1:nrow(MM),MM[,ncol(MM)])
all(MM[ii] == apply(M,1,max))

nullglob 2010-07-07 11:55:11

Answer 3

A:

Try to change the code to work a column at a time:

M <- matrix(rpois(30*400000,50),400000,30)
MM <- cbind(M,apply(M,1,which.max))
a <- rep(0,nrow(MM))
for (i in 1:(ncol(MM)-1)) {
    a[MM[, ncol(MM)] == i] <- MM[MM[, ncol(MM)] == i, i]
}

This sets all elements in a with the values from column i if the last column has value i. It took longer to build the matrix than to calculate vector a.

Henrico 2010-07-07 17:59:12

ansaurus

tags:

views:

answers:

Return value from column indicated in same row.

related questions