views: 222
answers: 3
In R, I'm looking for a memory-efficient way to create a summary of tabular data as follows.

Take, for example, the data.frame foo, which I've summarized with table() and then passed to as.data.frame() to obtain the frequency counts.

foo <- data.frame(x = c('a', 'a', 'a', 'b', 'b', 'b'), y = c('ab', 'ac', 'ad', 'ae', 'fx', 'fy'))
bar <- as.data.frame(table(foo), stringsAsFactors = FALSE)

This results in the following frequency counts for bar:

   x  y Freq
1  a ab    1
2  b ab    0
3  a ac    1
4  b ac    0
5  a ad    1
6  b ad    0
7  a ae    0
8  b ae    1
9  a fx    0
10 b fx    1
11 a fy    0
12 b fy    1

The problem I'm running into is that when x and y have many levels, this approach starts using significant amounts of memory (more than 64 GB): the dense result has one row for every combination of levels, so it grows as nlevels(x) * nlevels(y) even when most combinations never occur. I was wondering whether there is an alternative way of doing this kind of frequency count. As a first step I set stringsAsFactors = FALSE, but that doesn't completely solve the problem.

+1  A: 

Look at xtabs with sparse = TRUE, which uses the Matrix package to do sparse cross-tabulation.
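
A minimal sketch on the example data (assuming foo as defined in the question; sparse = TRUE is an argument of stats::xtabs and requires the Matrix package to be installed):

library(Matrix)
xt <- xtabs(~ x + y, data = foo, sparse = TRUE)
summary(xt)  # triplet form: one row per non-zero cell, zero cells never materialized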

Jonathan Chang
Thanks for the suggestion. However, I think `xtabs` only works with two-ways tables. It turns out that in my specific case, I actually have a table that's three-ways.
andrewj
+1  A: 
library(plyr)
ddply(foo, ~ x + y, nrow, .drop = FALSE)
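
With .drop = FALSE this reproduces the dense table from the question, zero rows included. If the zero cells aren't needed, leaving .drop at its default of TRUE returns only the combinations that actually occur, which is the memory-saving variant:

ddply(foo, ~ x + y, nrow)  # observed (x, y) combinations only, no zero rows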
Ian Fellows
+1  A: 

I have this method for fast (sparse) cross tabulation. I think there are possibilities for further optimisation, but it's been good enough for me for large data sets. The key is the use of ninteraction from the plyr package to quickly generate a numeric id for each row.

tab <- function(df, drop = TRUE) {
  # Assign each row a single numeric id encoding its combination of values
  id <- plyr::ninteraction(df)
  ord <- order(id)

  # Sort the rows by id so that identical combinations are adjacent
  df <- df[ord, , drop = FALSE]
  id <- id[ord]

  # The run lengths of the sorted ids are the frequency counts
  freq <- rle(id)$lengths
  # Take one representative row per run to label each count
  labels <- plyr::unrowname(df[cumsum(freq), , drop = FALSE])

  data.frame(labels, freq)
}
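
A quick check on the example data (one caveat, flagged as an assumption: later plyr releases appear to have renamed ninteraction to id, so plyr::id(df) may be needed instead of plyr::ninteraction(df)):

tab(foo)  # six rows, one per observed (x, y) combination, each with freq 1;
          # unobserved combinations are simply absent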
hadley