ansaurus

Question

Answer 1

+5 A:

Use an anonymous function:

> ddply(iris,"Species",function(X) data.frame(wmn=weighted.mean(X$Sepal.Length,
+                                                               X$Petal.Length),
+                                             mn=mean(X$Sepal.Length)))
     Species      wmn    mn
1     setosa 5.016963 5.006
2 versicolor 5.978075 5.936
3  virginica 6.641535 6.588
>

This computes a weighted mean of Sepal.Length (weighted by Petal.Length) as well as unweighted mean and returns both.

Dirk Eddelbuettel 2010-07-18 21:44:00

This is nice. Haven´t had much to do with anonymous functions so far. seems really worth a look. I don´t get the syntax / idea fully yet, but I will look into it, thx for your help! Do you need to print everything in one line because of no "{}" in there ? Where can I learn something about anonymous functions?

ran2 2010-07-18 21:50:44

Well, *all* these these `*apply`, `by`, ... functions use anonymous functions so you should find plenty of examples. Curly braces are needed once you group more than one command. Lastly, you do not have use an anonymous function -- you can also define your own -- but using them saves on typing :)

Dirk Eddelbuettel 2010-07-18 22:03:00

what about `lapply(split(iris, species), weighted.mean)` or smth like that?

aL3xa 2010-07-18 23:27:32

Answer 2

+3 A:

Use summarise (or summarize):

ddply(iris, "Species", summarise, 
  wmn = weighted.mean(Sepal.Length, Petal.Length),
  mn = mean(Sepal.Length))

hadley 2010-07-19 02:01:39

ansaurus

tags:

views:

answers:

group by in R, ddply with weighted.mean

related questions