A time series model can only be built on a stationary time series. Stationary series fulfill the following criteria:
1. The mean should be constant, wihtout increasing or decreasing with time.
2. The variance should also be constant along the time line (homoscedasticity).
3. The covariance between terms across a lag (t v. t - m) should not be a function of time.
White noise is stationary.
yt=ϵt
Sequence of identically distributed variables ϵi with ϵi∼N(0,σ2)
Here is an example with plots of the autocorrelation function
acf
telling us how the last value is correlated to the
immediate prior values, as well as the partial autocorrelation function
pacf
, showing whether there is any residual correlation of
yt to the prior values of y after eliminating yt−1.
library(cowplot)
library(fpp2)
set.seed(0)
a = rnorm(100)
plot.ts(a, col="gray", ylab="white noise")
abline(h=0, col="red3", lty=2)
# ACF plot
r1 <- ggAcf(a, lag.max = 20) + labs(title = "ACF PLot residuals Model 2")
# PACF plot
r2 <- ggPacf(a, lag.max = 20) + labs(title = "PACF PLot residuals Model 2")
# Plot both
plot_grid(r1, r2, ncol = 1)
A random walk is not stationary because the variance is time-dependent: var[yt]=var[yo]+∑t1var[ϵi]=tvar[ϵ].
yt=yt−1+ϵt=ϵt+ϵt−1+⋯+ϵ1
b = cumsum(rnorm(100))
plot.ts(b, col="chartreuse3", ylab="random walk")
abline(h=0, col="red3", lty=2)
To simulate the stock market (and aside from more sophisticted autoregressive and moving average effects), the random walk will not be symmetric (equal probabilities). Here is a random walk during an extremely bullish market:
n=100
x = cumsum(sample(c(1,-1), n, prob = c(0.75, 0.25), replace=T))
plot.ts(x, col="red", ylab="random walk")
abline(h=0, col="red3", lty=2)
yt=c+ρyt−1+ϵt
So notice that yt is regressed on itself (this is not the case in MA processes). We can recur the equation:
yt−1=c+ρyt−2+ϵt−1
Substituting 2 into 1:
yt=c+ρ(c+ρyt−2+ϵt−1)+ϵt=c+ρc+ρ2yt−2+ρϵt−1+ϵt
This “compound interest” expression is not possible in MA.
When ρ<1 the series is
“covariance stationary”. Notice how the influence of the immediate prior
position will be, to a certain extent, countered by the effect of ρ: the larger ρ is the closer AR
will
be to a random walk; and the smaller it is, the more similar to white
noise. ϵt can pull in any
direction, but a ρ<1 pulls
back yt−1 back towards 0.
After turning a time series into an AR series, we’ll be dealing with a stationary dataset, and we can fit a model by selecting optimal parameters, such as ρ.
Leaving the constant c=0, and set ρ=0.6, here is an example:
c = arima.sim(n=100,list(ar=0.6),innov=rnorm(100))
plot.ts(c, col="chocolate", ylab="AR(1)")
abline(h=0, col="red3", lty=2)
Notice how the lag 1 (yt−1)
value is significant. But if we suppress this dependency, there is no
further correlation upstream (except for some statistical random
outliers). Also, the acf
starts off comparing t to t, whereas the first line on the
pacf
plot already compares t to t−1.
For any AR(n):
yt=ρ1yt−1+ρ2yt−2+⋯+ρnyt−n+c+ϵt
In AR(2)
we have two lags:
d = arima.sim(n=100,list(ar=c(0.6,-.5)),innov=rnorm(100))
plot.ts(d, col="chocolate", ylab="AR(2)")
abline(h=0, col="red3", lty=2)
Notice the negative second lag related to the −0.5 AR value. Notice how the acf just gives us a significant correlation without the sign.
MA is also stationary, and by affecting noise, it’s effects are less long-lived than AR.
An AR(1) process is defined as:
yt=k+θϵt−1+ϵt
which can clearly be extended to MA(n).
Here is an MA(2)
process:
e = arima.sim(n=100,list(ma=c(.88,-.48)),innov=rnorm(100))
plot.ts(e, col="cyan", ylab="MA(2)")
abline(h=0, col="red3", lty=2)
yt=C+ρ1yt−1+ρ2yt−2+⋯+ρnyt−n+ϵt+θ1ϵt−1+θ2ϵt−2+⋯+θmyt−m
Example of ARIMA(2,2):
f = arima.sim(n=100,list(ar=c(0.6,-.5), ma=c(.88,-.48)),innov=rnorm(100))
plot.ts(f, col="blue3", ylab="MA(2)")
abline(h=0, col="red3", lty=2)
In one single shot:
So, after having a stationary dataset, the two questions to answer before fitting a model are:
1. Is it an AR or MA process?
2. What order of AR or MA process do we need to use?
In a moving average (MA) series of lag n, we will not get any correlation between x(t) and x(t – n -1). Hence, the total correlation chart (ACF) cuts off at n-th lag. So it becomes simple to find the lag for a MA series. For an AR series this correlation will gradually go down without any cut off value.
In an autoregression series (AR), the PACF will cut off after the degree of AR series: if we have a AR(1) series, if we exclude the effect of 1st lag (y_{t-1}) ), our second lag (y_{t-2}) ) is independent of y_t. Hence, the partial correlation function (PACF) will drop sharply after the 1st lag.
The series can be made stationary with one of the following techniques:
1. Detrending
2. Differencing
3. Incorporating seasonality
y_t = (\text{mean} + \text{trend} \times t) + \text{error}
gets rid of the parenthesis part.
We model the differences between time points:
y_t – y_{t-1} = \text{ARMA (p,q)}
Differencing is called as the Integration part in AR(I)MA. Now, we
have three parameters p : AR; d : I; q : MA
.
Seasonality can easily be incorporated in the ARIMA model directly.
In the case of AirPassengers
:
plot.ts(AirPassengers)
abline(lm(AirPassengers~time(AirPassengers)))
we can find a trend (the upward slope in the time series - solution: taking differences),
lm(AirPassengers~time(AirPassengers))
##
## Call:
## lm(formula = AirPassengers ~ time(AirPassengers))
##
## Coefficients:
## (Intercept) time(AirPassengers)
## -62055.91 31.89
seasonality:
x <- ts(AirPassengers, frequency=12) # 12 measurements per year
fit <- tbats(x)
!is.null(fit$seasonal)
## [1] TRUE
and increasing variance (heteroskedasticity of the time series) (solution: \log).
Testing for stationarity now:
```r
adf.test(diff(log(AirPassengers)), alternative="stationary", k=0)
##
## Augmented Dickey-Fuller Test
##
## data: diff(log(AirPassengers))
## Dickey-Fuller = -9.6003, Lag order = 0, p-value = 0.01
## alternative hypothesis: stationary
Now the data is stationary. But we still need to find the parameters:
acf(log(AirPassengers))
Clearly, the decay of ACF chart on the left is very slow, which means that the population is not stationary.
par(mfrow = c(1,2))
acf(diff(log(AirPassengers)))
pacf(diff(log(AirPassengers)))
The PACF shows many significant values, and there is one significant step in the ACF. Hence we can try fitting a model with a AR parameter of 0 and a MA parameter of 1, i.e. with parameters p, d and q, (p,d,q) = (0,1,1):
fit = arima(log(AirPassengers), c(0, 1, 1), seasonal = list(order = c(0, 1, 1), period = 12))
Now we can predict ten years ahead and plot:
pred <- predict(fit, n.ahead = 10*12)
ts.plot(AirPassengers,exp(pred$pred), log = "y", lty = c(1,3))
NOTE: These are tentative notes on different topics for personal use - expect mistakes and misunderstandings.