tags:

views:

281

answers:

1

Is aov appropriate for unbalanced datasets. According to help ...provides a wrapper to lm for fitting linear models to balanced or unbalanced experimental designs. But later on it says aov is designed for balanced designs, and the results can be hard to interpret without balance.

How should I perform a 2-way anova on an unbalanced dataset in R?

I would like to reproduce the different results for type I and type III sum of squares of SAS output (when using proc glm). I remember we were using type III sum of squares for unbalanced datasets.

Thank you in advance.

+8  A: 

Function anova (or summary.aov) will give you the so called type I (or sequential) sum of squares. To get type III sum of squares, you can use the Anova function from library car with parameter type="III". The difference between these two approaches in unbalanced datasets (and also sample R code to produce both tables) is presented in detail here.

gd047
Thank you very much. This paper seems to be very helpful and informative.
Brani
Thanks for the link to the Zahn paper. I hadn't seen it before.
Ian Fellows
@gd047: Could you clarify where is the sample R code?
unutbu
@~unutbu: In the appendix :-)
gd047
@gd047: Doh! :-) Thank you...
unutbu