I am trying to reshape a data frame in R and am running into problems with the recommended ways of doing so. The data frame has the following structure:
    ID         DATE1       DATE2       VALTYPE  VALUE
    'abcd1233' 2009-11-12  2009-12-23  'TYPE1'  123.45
    ...
VALTYPE is a string stored as a factor with only two levels (say TYPE1 and TYPE2). I need to transform it into the following data frame (a "wide" transpose) based on the common ID and date columns:
    ID         DATE1       DATE2       VALUE.TYPE1  VALUE.TYPE2
    'abcd1233' 2009-11-12  2009-12-23  123.45       NA
    ...
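For reference, here is a minimal toy version of the input that I believe captures the structure (the column names and types are as described above; the second ID is made up):

    ## Toy input with the assumed column names/types; the real table is far larger.
    tbl <- data.frame(ID      = c("abcd1233", "efgh5678", "efgh5678"),  # 'efgh5678' is a made-up ID
                      DATE1   = as.Date(c("2009-11-12", "2009-10-01", "2009-10-01")),
                      DATE2   = as.Date(c("2009-12-23", "2009-10-15", "2009-10-15")),
                      VALTYPE = factor(c("TYPE1", "TYPE1", "TYPE2")),
                      VALUE   = c(123.45, 10.00, 20.00))
    ## Desired result: one row per (ID, DATE1, DATE2) with VALUE.TYPE1 and VALUE.TYPE2
    ## columns, and NA wherever a type is missing for that key.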
The data frame has more than 4,500,000 observations (although about 70% of the VALUEs are NA). The machine is an Intel-based Linux workstation with 4 GB of RAM. Loading the data (from a compressed Rdata file) into a fresh R process makes the process grow to about 250 MB, which clearly leaves plenty of room for reshaping.
These are my experiences so far:
1. Using the vanilla reshape() method:

       tbl2 <- reshape(tbl, direction = "wide",
                       idvar = c("ID", "DATE1", "DATE2"), timevar = "VALTYPE")

   RESULT: Error: cannot allocate vector of size 4.8 Gb
2. Using the cast() method from the reshape package:

       tbl2 <- cast(tbl, ID + DATE1 + DATE2 ~ VALTYPE)

   RESULT: the R process consumes all RAM with no end in sight. I eventually had to kill the process.
3. Using by() and merge():

       sp <- by(tbl[c(1, 2, 3, 5)], tbl$VALTYPE, function(x) x)
       tbl <- merge(sp[["TYPE1"]], sp[["TYPE2"]],
                    by = c("ID", "DATE1", "DATE2"), all = TRUE, sort = TRUE)

   RESULT: works fine, although it is neither elegant nor foolproof (i.e. it will break if more types are added); a rough generalization is sketched after this list.
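For completeness, here is a rough sketch of how I imagine the by()/merge() approach could be made type-agnostic with split() and Reduce(); I have not tried this on the full table, and the column names are the ones assumed above:

    ## Sketch: same idea as approach 3, but for any number of VALTYPE levels (untested at full scale).
    pieces <- split(tbl[c("ID", "DATE1", "DATE2", "VALUE")], tbl$VALTYPE)
    ## Rename VALUE to VALUE.<type> in each piece so the merged columns do not clash.
    pieces <- lapply(names(pieces), function(type) {
      p <- pieces[[type]]
      names(p)[names(p) == "VALUE"] <- paste("VALUE", type, sep = ".")
      p
    })
    ## Full outer join of all the pieces on the key columns.
    wide <- Reduce(function(x, y)
      merge(x, y, by = c("ID", "DATE1", "DATE2"), all = TRUE, sort = TRUE),
      pieces)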
To add insult to injury, the operation in question can be trivially achieved in about 3 lines of AWK or Perl (and with hardly any RAM used). So the question is: what is a better way to do this operation in R using recommended methods without consuming all available RAM?
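And just to be explicit about what the AWK/Perl version would do: it is essentially a single pass of key lookups. My untested guess at a base-R analogue (column names as above) looks roughly like this, although I doubt it counts as a "recommended" way either:

    ## Rough base-R analogue of the single-pass key-lookup idea (untested at full scale).
    keys <- unique(tbl[c("ID", "DATE1", "DATE2")])
    key.str <- function(d) paste(d$ID, d$DATE1, d$DATE2, sep = "\r")  # composite key as a string
    wide <- keys
    for (type in levels(tbl$VALTYPE)) {
      sub <- tbl[tbl$VALTYPE == type, ]
      ## match() gives NA where a key has no row of this type, which yields NA in the result.
      wide[[paste("VALUE", type, sep = ".")]] <-
        sub$VALUE[match(key.str(keys), key.str(sub))]
    }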