STAT 804
Lecture 16 Notes
Distribution theory for sample autocovariances
The simplest statistic to consider is
$$\hat{C}(k) = \frac{1}{T} \sum_t X_t X_{t+k}$$
where the sum extends over those $t$ for which the data are available, namely $t = 1, \ldots, T-k$.
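As a concrete illustration (not part of the original notes), here is a minimal numpy sketch of this estimator; the function name and the 0-based indexing are my own choices:

\begin{verbatim}
import numpy as np

def sample_autocov(x, k):
    # Sample autocovariance C-hat(k) for a series with known mean 0.
    # Uses the divisor T (the full length), not T - k, matching the
    # estimator defined above; the sum has T - k available terms.
    T = len(x)
    return np.sum(x[:T - k] * x[k:]) / T
\end{verbatim}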
If the series has mean 0 then the expected value of this statistic is simply
$$E(\hat{C}(k)) = \frac{T-k}{T}\, C(k),$$
which differs negligibly, for $T$ large compared to $k$, from $C(k)$.
To compute the variance we begin with the second moment, which is
$$E(\hat{C}(k)^2) = \frac{1}{T^2} \sum_s \sum_t E(X_s X_{s+k} X_t X_{t+k}).$$
The expectations in question involve the fourth order product moments of $X$ and depend on the distribution of the $X_t$'s and not just on the autocovariance $C$. However, for the interesting case of white noise, we can compute the expected value. For $k > 0$ you may assume that $s = t$ or $s < t$, since the case $s > t$ can be figured out by swapping $s$ and $t$ in the case $s < t$. For $s < t$ the variable $X_{t+k}$ is independent of all 3 of $X_s$, $X_{s+k}$ and $X_t$. Thus the expectation factors into something containing the factor $E(X_{t+k}) = 0$. For $s = t$, we get
$$E(X_t^2 X_{t+k}^2) = \sigma^4,$$
and so the second moment is
$$E(\hat{C}(k)^2) = \frac{(T-k)\sigma^4}{T^2}.$$
This is also the variance since, for $k > 0$ and for white noise, $E(\hat{C}(k)) = 0$.
For $k = 0$ and $s \neq t$ the expectation is simply $\sigma^4$, while for $s = t$ we get
$$E(X_t^4) \equiv \mu_4.$$
Thus the variance of the sample variance (when the mean is known to be 0) is
$$\mathrm{Var}(\hat{C}(0)) = \frac{T\mu_4 + T(T-1)\sigma^4}{T^2} - \sigma^4 = \frac{\mu_4 - \sigma^4}{T}.$$
For the normal distribution the fourth moment is given simply by $\mu_4 = 3\sigma^4$, so that this variance becomes $2\sigma^4/T$.
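A quick Monte Carlo check of these two variance formulas (my own illustration, not from the notes) for Gaussian white noise, where $\mu_4 = 3\sigma^4$; the sample size and replication count are arbitrary:

\begin{verbatim}
import numpy as np

rng = np.random.default_rng(0)
T, reps, sigma = 200, 20000, 1.0

x = rng.normal(0.0, sigma, size=(reps, T))
C0 = np.sum(x * x, axis=1) / T                 # lag-0 sample autocovariance
C1 = np.sum(x[:, :-1] * x[:, 1:], axis=1) / T  # lag-1 sample autocovariance

# Theory: Var(C0) = (mu4 - sigma^4)/T = 2 sigma^4/T for normal data,
#         Var(C1) = (T - 1) sigma^4 / T^2 for white noise.
print(np.var(C0), 2 * sigma**4 / T)
print(np.var(C1), (T - 1) * sigma**4 / T**2)
\end{verbatim}

Both pairs of numbers should agree to within Monte Carlo error.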
Having computed the variance, it is usual to look at the large sample distribution theory. For $k = 0$ the usual central limit theorem applies to
$$\hat{C}(0) = \frac{1}{T} \sum_t X_t^2$$
(in the case of white noise) to prove that
$$\sqrt{T}\,(\hat{C}(0) - \sigma^2) \Rightarrow N(0, \mu_4 - \sigma^4).$$
The presence of $\mu_4$ in the formula shows that the approximation is quite sensitive to the assumption of normality. For $k > 0$ the theorem needed is called the $m$-dependent central limit theorem; it shows that
$$\sqrt{T}\,\hat{C}(k) \Rightarrow N(0, \sigma^4).$$
In each of these cases the assertion is simply that the statistic
in question divided by its standard deviation has an approximate
normal distribution.
The sample autocorrelation at lag $k$ is
$$\hat\rho_k = \frac{\hat{C}(k)}{\hat{C}(0)}.$$
For $k > 0$ we can apply Slutsky's theorem to conclude that, for white noise,
$$\sqrt{T}\,\hat\rho_k \Rightarrow N(0, 1).$$
This justifies drawing lines at $\pm 2/\sqrt{T}$ to carry out a test, at roughly the 5% level, of the hypothesis that the series is white noise based on the $k$th sample autocorrelation.
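A sketch of the corresponding check in numpy (the function is hypothetical, for illustration only); it flags lags whose sample autocorrelation falls outside the bands:

\begin{verbatim}
import numpy as np

def white_noise_bands(x, max_lag=20):
    # Flag lags whose sample autocorrelation lies outside +/- 2/sqrt(T).
    T = len(x)
    x = x - x.mean()   # mean correction; harmless asymptotically (see below)
    C0 = np.sum(x * x) / T
    band = 2.0 / np.sqrt(T)
    for k in range(1, max_lag + 1):
        rho_k = np.sum(x[:T - k] * x[k:]) / (T * C0)
        if abs(rho_k) > band:
            print("lag", k, ": rho =", round(rho_k, 3), "outside +/-", round(band, 3))
\end{verbatim}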
It is possible to verify that subtraction of $\bar{X}$ from the observations before computing the sample covariances does not change the large sample approximations, although it does affect the exact formulas for moments.
When the series is actually not white noise the situation is more complicated. Consider as an example the AR(1) model
$$X_t = a X_{t-1} + \epsilon_t$$
with $\epsilon$ being white noise. Taking
$$X_t = \sum_{u=0}^{\infty} a^u \epsilon_{t-u}$$
we find that
$$E(X_s X_{s+k} X_t X_{t+k}) = \sum_{u,v,w,x \ge 0} a^{u+v+w+x}\, E(\epsilon_{s-u}\, \epsilon_{s+k-v}\, \epsilon_{t-w}\, \epsilon_{t+k-x}).$$
The expectation is 0 unless either all 4 indices on the $\epsilon$'s are the same or the indices come in two pairs of equal values. The first case requires $v = u + k$ and $w = u + (t - s)$, $x = w + k$, and then all four indices equal $s - u$. The second case requires one of three pairs of equalities:
$$s - u = s + k - v \quad\mbox{and}\quad t - w = t + k - x,$$
or
$$s - u = t - w \quad\mbox{and}\quad s + k - v = t + k - x,$$
or
$$s - u = t + k - x \quad\mbox{and}\quad s + k - v = t - w,$$
along with the restriction that the four indices not all be equal. The actual moment is then $E(\epsilon_t^4)$ when all four indices are equal and $\sigma_\epsilon^4$ when there are two pairs. It is now possible to do the sum using geometric series identities and compute the variance of $\hat{C}(k)$.
It is not particularly enlightening to finish the calculation in
detail.
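Instead of finishing the algebra, one can check by simulation that the white-noise limit no longer applies; for an AR(1), Bartlett's formula (not derived in these notes) gives $\lim T\,\mathrm{Var}(\hat\rho_1) = 1 - a^2$ rather than 1. A rough sketch, with the coefficient $a = 0.6$ an arbitrary choice:

\begin{verbatim}
import numpy as np

rng = np.random.default_rng(1)
T, burn, reps, a = 500, 100, 2000, 0.6

eps = rng.normal(size=(reps, T + burn))
x = np.zeros((reps, T + burn))
for t in range(1, T + burn):
    x[:, t] = a * x[:, t - 1] + eps[:, t]  # AR(1) recursion, vectorized over replications
x = x[:, burn:]                            # drop burn-in so the series is nearly stationary
x = x - x.mean(axis=1, keepdims=True)
rho1 = np.sum(x[:, :-1] * x[:, 1:], axis=1) / np.sum(x * x, axis=1)

print(T * np.var(rho1))   # close to 1 - a^2 = 0.64, not 1
\end{verbatim}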
There are versions of the central limit theorem, called mixing central limit theorems, which can be used for ARMA($p,q$) processes in order to conclude that
$$\frac{\hat{C}(k) - C(k)}{\sqrt{\mathrm{Var}(\hat{C}(k))}}$$
has asymptotically a standard normal distribution, and that the same is true when the standard deviation in the denominator is replaced by an estimate. Getting from this to distribution theory for the sample autocorrelation is easiest when the true autocorrelation is 0.
The general tactic is the $\delta$ method, or Taylor expansion. In this case for each sample size you have two estimates, say $A_T$ and $B_T$, of two parameters, say $a$ and $b$. You want distribution theory for the ratio $A_T/B_T$. The idea is to write
$$\frac{A_T}{B_T} = f(A_T, B_T)$$
where
$$f(x, y) = \frac{x}{y},$$
and then make use of the fact that $A_T$ and $B_T$ are close to the parameters they are estimates of. In our case $A_T$ is the sample autocovariance at lag $k$, which is close to the true autocovariance $C(k)$, while the denominator $B_T$ is the sample autocovariance at lag 0, a consistent estimator of $C(0)$. Write
$$f(A_T, B_T) = f(a, b) + (A_T - a) f_1(a, b) + (B_T - b) f_2(a, b) + R_T,$$
where $R_T$ is a remainder term.
If we can use a central limit theorem to conclude that
$$\sqrt{T}\,(A_T - a,\; B_T - b)$$
has an approximately bivariate normal distribution, and if we can neglect the remainder term $R_T$, then
$$\sqrt{T}\,(f(A_T, B_T) - f(a, b)) \approx \sqrt{T}\,[(A_T - a) f_1(a, b) + (B_T - b) f_2(a, b)]$$
has approximately a normal distribution. The notation here is that $f_i$ denotes differentiation with respect to the $i$th argument of $f$. For
$$f(x, y) = \frac{x}{y}$$
we have
$$f_1(x, y) = \frac{1}{y}$$
and
$$f_2(x, y) = -\frac{x}{y^2}.$$
When $C(k) = 0$ the term involving $f_2$ vanishes and we simply get the assertion that
$$\sqrt{T}\,\hat\rho_k$$
has the same asymptotic normal distribution as
$$\frac{\sqrt{T}\,\hat{C}(k)}{C(0)}.$$
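The recipe is easy to code once the asymptotic covariance of $(A_T, B_T)$ is available; the following helper (hypothetical, my own) computes the limiting variance of $\sqrt{T}(A_T/B_T - a/b)$ as the quadratic form in the gradient $(f_1, f_2)$:

\begin{verbatim}
import numpy as np

def delta_method_ratio_var(a, b, cov):
    # Asymptotic variance of sqrt(T) * (A_T/B_T - a/b) by the delta method.
    # cov is the 2x2 asymptotic covariance of sqrt(T) * (A_T - a, B_T - b).
    grad = np.array([1.0 / b, -a / b**2])  # (f_1, f_2) for f(x, y) = x/y
    return grad @ np.asarray(cov) @ grad

# White-noise special case: a = C(k) = 0, so only the f_1 term survives:
# delta_method_ratio_var(0.0, 1.0, [[1.0, 0.3], [0.3, 2.0]]) returns 1.0.
\end{verbatim}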
Similar ideas can be used for the estimated sample partial ACF.
Portmanteau tests
In order to test the hypothesis that a series is white noise using the distribution theory just given, you have to produce a single statistic to base your test on. Rather than pick a single value of $k$, the suggestion has been made to consider a sum of squares, or a weighted sum of squares, of the $\hat\rho_k$. A typical statistic is
$$Q = T \sum_{k=1}^{K} \hat\rho_k^2,$$
which, for white noise, has approximately a $\chi^2_K$ distribution.
(This fact relies on an extension of the previous computations to conclude that
$$\sqrt{T}\,(\hat\rho_1, \ldots, \hat\rho_K)$$
has approximately a standard multivariate normal distribution. This, in turn, relies on computation of the covariance between $\hat{C}(j)$ and $\hat{C}(k)$.)
When the parameters in an ARMA($p,q$) model have been estimated by maximum likelihood, the degrees of freedom must be adjusted to $K - p - q$. The resulting test is the Box-Pierce test; a refined version which takes better account of finite sample properties is the Box-Pierce-Ljung test. S-Plus plots the $P$-values from these tests for 1 through 10 degrees of freedom as part of the output of arima.diag.
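For concreteness, a sketch of the two portmanteau statistics and their $\chi^2$ $P$-values (my own illustration; in S-Plus or R one would instead use the built-in diagnostics). The argument fitdf plays the role of $p + q$:

\begin{verbatim}
import numpy as np
from scipy.stats import chi2

def portmanteau(x, K, fitdf=0):
    # Box-Pierce and Ljung-Box statistics with chi-squared(K - fitdf) P-values.
    T = len(x)
    x = x - x.mean()
    C0 = np.sum(x * x) / T
    rho = np.array([np.sum(x[:T - k] * x[k:]) / (T * C0)
                    for k in range(1, K + 1)])
    bp = T * np.sum(rho**2)                                        # Box-Pierce Q
    lb = T * (T + 2) * np.sum(rho**2 / (T - np.arange(1, K + 1)))  # Ljung-Box refinement
    df = K - fitdf
    return bp, chi2.sf(bp, df), lb, chi2.sf(lb, df)
\end{verbatim}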
Richard Lockhart
2001-09-30