next up previous

STAT 330 Lecture 26

Reading for Today's Lecture: 11.1, 11.2.

Goals of Today's Lecture:

Today's notes

Two way layout

Data

displaymath201

Model equation

displaymath203

where the tex2html_wrap_inline205 are independent random variables with mean 0 and variance tex2html_wrap_inline207 (usually with tex2html_wrap_inline209 distributions.

We decompose tex2html_wrap_inline211 as

displaymath213

which has many possible solutions for tex2html_wrap_inline215 , tex2html_wrap_inline217 , tex2html_wrap_inline219 and tex2html_wrap_inline221 . We choose

eqnarray25

Remarks:

  1. tex2html_wrap_inline221 's are called interactions.
  2. If K=1 we drop the subscript k and assume that the tex2html_wrap_inline221 are all 0.

Two numerical examples:

Table of hypothetical means:

Treatment Control
Men 120 130 1
Women 105 115 2
1 2

tex2html_wrap_inline231 , tex2html_wrap_inline233 and so on.

According to our definitions:

displaymath235

displaymath237

displaymath239

and

displaymath241

Similarly we find tex2html_wrap_inline243 , tex2html_wrap_inline245 and tex2html_wrap_inline247 .

Now consider instead the table

Treatment Control
Men 120 130 1
Women 115 105 2
1 2

for which tex2html_wrap_inline249 , tex2html_wrap_inline251 , tex2html_wrap_inline253 and tex2html_wrap_inline255 . Note that the main effect of treatment is 0; averaged over men and women the drug does nothing. However, the drug actually works for men, lowering BP by 10, while it is a total failure for women, raising BP by 10. In the presence of interactions the main effect of a factor is not a very meaningful quantity.

Analysis of the data when K > 1

  1. Estimation:

    eqnarray105

  2. Hypothesis testing: we have the following hypotheses to test:

    1. No interactions: tex2html_wrap_inline259 for all i,j.
    2. No effect of factor 1: tex2html_wrap_inline263 for all i.
    3. No effect of factor 2: tex2html_wrap_inline267 for all j.

Note: Hypotheses (b) and (c) are tested only if tex2html_wrap_inline271 in (a) is accepted. The logic of this is that if a drug affects men and women differently (that is sex and treatment have an interaction) then the drug certainly has some effect and the average effect tex2html_wrap_inline273 is of no real interest.

Tests are based on an ANOVA table

Sum of Mean
Source df Squares Square F P
tex2html_wrap353 I-1 tex2html_wrap_inline281 SS/df
tex2html_wrap355 J-1 tex2html_wrap_inline285 SS/df
tex2html_wrap357 (I-1)(J-1) tex2html_wrap_inline289 SS/df
tex2html_wrap359 IJ(K-1) tex2html_wrap_inline293 SS/df
Total n-1 tex2html_wrap_inline297

Remark: many of the formulas simplify. For example

displaymath299

Theory:

displaymath301

If all tex2html_wrap_inline303 then

displaymath305

If tex2html_wrap_inline307 then

displaymath309

and so on.

All of the sums of squares above the line in the table are independent.

There are 3 F-statistics for each of which P values come from F tables with degrees of freedom which are recorded in the degrees of freedom column:

Example: the variable X is plaster hardness. The factors are SAND content (with levels 0, 15 and 30%) and FIBRE content (with levels 0, 25 and 50%). We have 2 replicates so I=3, J=3 and K=2. SAS output shows that the ANOVA table is as follows:
Sum of Mean
Source df Squares Square F P
tex2html_wrap361 2 106.8 53.4 6.54 0.018
tex2html_wrap363 2 87.1 43.6 5.33 0.030
tex2html_wrap365 4 8.89 2.22 0.27 0.89
tex2html_wrap367 9 73.5 8.17
tex2html_wrap369 17 276.28

Conclusions:

  1. tex2html_wrap_inline271 : no interactions is accepted.
  2. There are significant SAND effects.
  3. There are significant FIBRE effects.

NEXT: multiple comparisons and Tukey confidence intervals.

SAND: 95% CI
0 15 30

In other words the level 0 differs from the level 30 but we cannot distinguish clearly 15 from either 30 or 0% SAND.


next up previous



Richard Lockhart
Fri Mar 6 10:14:40 PST 1998