ansaurus

Question

Answer 1

+5 A:

There are a lot of questions and issues raised here. I'll try to respond to each of them.

Arima() is just a wrapper for arima(), so it will give the same model.

arima() handles a model with differencing by using a diffuse prior. That is not the same as just differencing the data before fitting the model. Consequently, you will get slightly different results from arima(x,order=c(p,1,q)) and arima(diff(x),order=c(p,0,q)).

auto.arima() handles differencing directly and does not use a diffuse prior when fitting. So you will get the same results from auto.arima(x,d=1,...) and auto.arima(diff(x),d=0,...)

auto.arima() has an argument max.order which specifies the maximum of p+q. By default, max.order=5, so your arima(5,1,4) would not be considered. Increase max.order if you want to consider such large models (although I wouldn't recommend it).

You can't vectorize a loop involving nonlinear optimization at each iteration.

If you want to sort your output, you'll need to save it to a data.frame and then sort on the relevant column. The code currently just spits out the results as it goes and nothing is saved except for the most recent model fitted.

Rob Hyndman 2009-11-03 04:31:42

Thanks, Rob. With max.order=9 added, the AIC for ARIMA(5,1,4)/ARMA(5,4) is 1e+20, whatever it means, so it still selects ARIMA(3,1,0)/ARMA(3,0) as the best.

knot 2009-11-03 06:23:03

auto.arima() returns an AIC of 1e20 (i.e., 10^20) when there are problems with the fit. It may be a convergence problem, or the parameters may be near the boundaries of stationarity and invertibility. These signal that the model is likely to have problems and is better not to be used.

Rob Hyndman 2009-11-03 07:13:11

As always, thanks for your contributions to stackoverflow, Rob!

griffin 2009-11-03 16:44:37

Ok. The AICs differ by almost 1. The plots for ARMA(3,0) and ARMA(5,4) look alike. Anyone with brain would choose the former, and anything based solely on the AIC would suggest the latter. Perhaps a similar or even more interesting example prompted Leo Breiman to say that "automatic methods of model selection are to be shunned or, if use is absolutely unavoidable, are to be examined carefully...". The AIC can mislead. Arima(diff(data),c(5,0,4))plot(arima.sim(n = 630, list(ar = c( 0.3999, -0.4881, 0.0388, -0.2539, 0.5874 ), ma = c(0.7173, 0.7831, 0.7173, 0.9999)),sd = sqrt(7.436)))

knot 2009-11-03 22:55:11

@Knot; only if you use AIC in the sense of "the lower the better". Models within 2 AIC units difference are essentially equivalent in their fits relative to complexity. I would read nothing special into models that differed by "almost 1" in AIC. If I wanted "one" model then I'd go with the simpler model. If I was into model averaging, I might keep both models and consider them as two different models that have similar levels of parsimony. All models are wrong after all...

Gavin Simpson 2010-09-22 15:32:55

ansaurus

tags:

views:

answers:

R: ARIMA, ARMA and AICs?

related questions