ansaurus

Question

improve my code for collapsing a list of data.frames

Answer 1

+4 A:

I'm not claiming this to be the most elegant approach, but I think it is working

library(plyr)

ldply(sapply(1:length(walk.sample), function(i) 
           if (length(walk.sample[[i]]) > 1)
           cbind(walk.sample[[i]],session=rep(i,nrow(walk.sample[[i]])))
      ),rbind)

EDIT

After applying Marek's apt remarks

do.call(rbind,lapply(1:length(walk.sample), function(i)
           if (length(walk.sample[[i]]) > 1)
           cbind(walk.sample[[i]],session=i)  ))

gd047 2010-04-30 12:47:58

`cbind` don't need replication, you could just write `session=i`. And without plyr one could use `do.call(rbind, sapply(......))`.

Marek 2010-04-30 13:59:38

Hi gd047, I would just like to mention your solution wouldn't work when the data.frame have different number of rows. Also, when the number of rows is the same, the results are not correct (there is a mix with rows and columns. and also there are no column names)

Tal Galili 2010-04-30 14:30:05

I think that replace `sapply` with `lapply` may help.

Marek 2010-04-30 15:06:44

Good job, chaps! Exactly what the doctor ordered.

Roman Luštrik 2010-05-03 05:30:14

Answer 2

+6 A:

I think this will work...

lengths <- sapply(walk.sample, function(x) if (is.null(nrow(x))) 0 else nrow(x))
cbind(do.call(rbind, walk.sample[lengths > 1]),
      session = rep(1:length(lengths), ifelse(lengths > 1, lengths, 0)))

Jonathan Chang 2010-04-30 15:09:16

You should use `NROW` instead of `nrow`. For data from question your solution won't work.

Marek 2010-04-30 15:57:25

Good catch, NROW is one possible fix, but I dunno what the expected behavior is when you have a 1-row dataframe. I will change it by doing a NULL check instead...

Jonathan Chang 2010-04-30 16:04:46

Good solution Jonathan!

Tal Galili 2010-04-30 19:55:09

ansaurus

tags:

views:

answers:

improve my code for collapsing a list of data.frames

related questions