ansaurus

Question

Creating multiple subsets all in one data.frame (possibly with ddply)

Answer 1

+3 A:

You could try:

ddply(df, .(x), subset, rnorm.100. > quantile(rnorm.100., 0.8))

And off topic: you could use df <- data.frame(x,y=rnorm(100)) to name a column on-the-fly.

Marek 2010-07-30 11:15:42

Thanks Marek, for the answer and the tip about specifying a colname on the fly - although not requested, it was something I was wondering how to do!

Brandon Bertelsen 2010-07-30 23:36:33

Answer 2

+2 A:

Here's a different approach with the little used ave() command. (very fast to calculate this way)

Make a new column that contains the quantile calculation across each level of x

df$quantByX <-  ave(df$rnorm.100., df$x, FUN = function (x) quantile(x,0.8))

Select the items of the new column and the x column.

df2 <- unique(df[,c(1,3)])

The result is one data frame with the unique items in the x column and the calculated quantile for each level of x.

John 2010-07-30 13:52:45

`ave` is one of most powerful R functions. But in this case I think you should use it in this way: `subset(df, rnorm.100. > ave(rnorm.100., x, FUN=function(v) quantile(v, 0.8)))`

Marek 2010-07-30 14:40:24

that clarifies the question for me... :)

John 2010-07-30 15:16:46

I've not had the opportunity to try this function before. Marek's solution above works well for my purposes. But thank you for this as well, I'll look into "ave".

Brandon Bertelsen 2010-07-30 22:07:26

ansaurus

tags:

views:

answers:

Creating multiple subsets all in one data.frame (possibly with ddply)

related questions