ansaurus

Question

R Grouping functions: sapply vs. lapply vs. apply. vs. tapply vs. by vs. aggregate vs. ...

Answer 1

+7 A:

As a side note, here is how the various plyr functions correspond to the base *apply functions (from the "Intro to plyr" document on the plyr webpage, http://had.co.nz/plyr/):

Base function   Input   Output   plyr function
------------------------------------------------
aggregate       d       d        ddply + colwise
apply           a       a/l      aaply / alply
by              d       l        dlply
lapply          l       l        llply
mapply          a       a/l      maply / mlply
replicate       r       a/l      raply / rlply
sapply          l       a        laply

Key: a = array, d = data frame, l = list, r = number of replications.
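To make a few rows of the table concrete, here is a minimal sketch that runs the base call and then its plyr counterpart. The built-in `mtcars` data and the variable names are illustrative choices, not part of the original answer, and the plyr half only runs if the package happens to be installed.

```r
# Base R versions of three rows of the table, using the built-in mtcars data.
by_cyl_base  <- aggregate(mpg ~ cyl, data = mtcars, FUN = mean)  # data frame in, data frame out
squares_list <- lapply(1:3, function(x) x^2)                     # list in, list out
squares_vec  <- sapply(1:3, function(x) x^2)                     # list in, vector out

# plyr counterparts; guarded because plyr being installed is an assumption.
if (requireNamespace("plyr", quietly = TRUE)) {
  by_cyl_plyr   <- plyr::ddply(mtcars, "cyl", plyr::summarise, mpg = mean(mpg))  # d -> d
  squares_llply <- plyr::llply(1:3, function(x) x^2)                             # l -> l
  squares_laply <- plyr::laply(1:3, function(x) x^2)                             # l -> a
}
```

In each plyr name the first letter is the input type and the second is the output type, which is the pattern the table encodes.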

One of the goals of plyr is to provide consistent naming conventions for each of the functions, encoding the input and output data types in the function name. It also provides consistency in output: for example, the output of dlply() can be passed directly to ldply() to produce a useful data frame.
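The dlply()-to-ldply() hand-off could look roughly like the sketch below. The per-group linear model on `mtcars` is an illustrative choice of mine, not something from the original answer, and the plyr part is guarded in case the package is not installed. The base R lines show approximately the same result assembled by hand for comparison.

```r
# Base R: by() returns a list-like object, which has to be bound together manually.
models_base <- by(mtcars, mtcars$cyl, function(df) lm(mpg ~ wt, data = df))
coefs_base  <- do.call(rbind, lapply(models_base, coef))  # matrix, one row per group

# plyr: guarded because plyr being installed is an assumption.
if (requireNamespace("plyr", quietly = TRUE)) {
  # data frame -> list: one fitted model per cylinder group
  models <- plyr::dlply(mtcars, "cyl", function(df) lm(mpg ~ wt, data = df))
  # list -> data frame: one row of coefficients per model
  coefs <- plyr::ldply(models, coef)
}
```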

Conceptually, learning plyr is no more difficult than understanding the base *apply functions.

plyr and reshape functions have replaced almost all of these functions in my everyday use. But, also from the Intro to plyr document:

"Related functions tapply and sweep have no corresponding function in plyr, and remain useful. merge is useful for combining summaries with the original data."
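For reference, a small base R sketch of those three functions, again on the built-in `mtcars` data (the data set and variable names are my own illustrative choices):

```r
# tapply: group means of mpg by cyl, returned as a named vector
mean_mpg <- tapply(mtcars$mpg, mtcars$cyl, mean)

# sweep: subtract each column's mean from that column (the default FUN is "-")
m <- as.matrix(mtcars[, c("mpg", "wt")])
centered <- sweep(m, 2, colMeans(m))

# merge: attach the per-group summary back onto the original rows
summary_df <- data.frame(cyl = as.numeric(names(mean_mpg)),
                         mean_mpg = as.vector(mean_mpg))
with_means <- merge(mtcars, summary_df, by = "cyl")
```

Note that merge() does not preserve row names, so the car names in `mtcars` are dropped unless they are first copied into a column.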

JoFrhwld 2010-08-17 19:20:09

When I started learning R from scratch I found plyr MUCH easier to learn than the `*apply()` family of functions. For me, `ddply()` was very intuitive as I was familiar with SQL aggregation functions. `ddply()` became my hammer for solving many problems, some of which could have been better solved with other commands.

JD Long 2010-08-17 19:23:29

I guess I figured that the concept behind `plyr` functions is similar to `*apply` functions, so if you can do one, you can do the other, but `plyr` functions are easier to remember. But I totally agree on the `ddply()` hammer!

JoFrhwld 2010-08-17 19:36:22

Got it, I'll have to finally pick up plyr soon! Its prefix naming alone is gold...

grautur 2010-08-17 22:28:44

Couldn't have said it better myself. Thanks!

hadley 2010-08-18 02:07:54

Answer 2

A:

The plyr documentation is very clear and easy to follow, and I recommend reading it from start to finish, because you'll learn lots of extra things that will help you later. In my view, this is one of those cases where taking the time to read the details pays off far more than skimming a help page for the names of arguments.

dan 2010-08-17 23:56:28
