Soci709 Module 8 - GENERAL LINEAR TESTS

1.  THE NEED FOR MORE GENERAL TESTS

Given a linear model (omitting the i subscript)
Y = b0 + b1X1 + b2X2 + b3X3 + e
we can estimate the coefficients bk and easily test hypotheses of the form
H0: bk = 0
H1: bk <> 0
for each coefficient by looking at the p-value of the t-ratio tk* = bk/s{bk} on the regression printout; the t-ratio is distributed as Student's t with df = n-p, where p is the total number of independent variables, counting the constant term.
We can also test whether all bk are jointly equal to zero using the F-test for the regression model as a whole that is usually provided in the regression output.
There are categories of situations where we need more general tests of hypotheses involving the bk .

1.  Testing the Joint Significance of Several bk

There are many situations in which we want to test whether several regression coefficients are simultaneously equal to zero.
For example, X2 and X3 might represent the linear and quadratic terms of a polynomial, or indicators for the categories of a qualitative variable.  In all these situations we want to test hypotheses of the form
H0: b2 = b3 = 0
H1: not both b2 and b3 = 0

2.  Testing That 2 or More bk Are Equal

The hypothesis to be tested is of the form
H0: b1 = b2
H1: b1 <> b2

3.  Testing That Some bk Have Specific Values Other Than Zero

The hypothesis to be tested is of the form
H0: b1 = 3,  b3 = 5
H1: not both equalities in H0 hold

2.  FULL & REDUCED MODELS

1.  Setting Full & Reduced Models

The general approach to testing simultaneous hypotheses on the coefficients is to use an F-test based on contrasting a full model with a reduced (aka restricted, aka constrained) model.  We explain the logic of the test in the case of testing the joint significance of several bk .  The hypothesis is
H0: b2 = b3 = 0
H1: not both b2 and b3 = 0
To test whether b2 and b3 are simultaneously zero we contrast
Y = b0 + b1X1 + b2X2 + b3X3  (full model, F)
Y = b0 + b1X1  (reduced model, R)
Q - Why would the reduced model R also be called the constrained model?

The test is based on a comparison of the SSE of the full and reduced models, denoted SSEF and SSER, respectively.
It is always true that SSEF <= SSER, because a model with more parameters always fits the data at least as well, so that SSER - SSEF >= 0.
The test is based on this difference between SSER and SSEF.
The test statistic is
F* = [(SSER - SSEF)/(dfR - dfF)] / [SSEF/dfF]  (8.1)
In words, F* is the ratio of the difference in SSE between reduced and full models divided by the difference in degrees of freedom between R and F, to the SSE of the full model divided by the degrees of freedom of F.
There is an equivalent formula for F* in terms of R2F and R2R, the coefficients of determination of the full and reduced models, respectively.  We know that R2 = 1 - (SSE/SSTO), and that SSTO is the same in the full and reduced models.  From this one can derive from (8.1) the equivalent formula for F*
F* = [(R2F - R2R)/(dfR - dfF)] / [(1 - R2F)/dfF]   (8.2)
This formula is particularly useful for testing hypotheses from published regression results.  (It is also the reason why one should not present the adjusted Ra2 alone in published reports: doing so makes it more difficult for readers to recover F* if they wish.)
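Both formulas are easy to compute directly.  Here is a minimal sketch in Python using scipy (the function names are illustrative, not part of SYSTAT or NKNW):

from scipy.stats import f

def f_star_from_sse(sse_r, sse_f, df_r, df_f):
    # Formula (8.1): drop in SSE per constrained df, relative to the
    # error mean square of the full model
    return ((sse_r - sse_f) / (df_r - df_f)) / (sse_f / df_f)

def f_star_from_r2(r2_f, r2_r, df_r, df_f):
    # Formula (8.2): the same statistic from the coefficients of determination
    return ((r2_f - r2_r) / (df_r - df_f)) / ((1.0 - r2_f) / df_f)

def p_value(f_star, df_r, df_f):
    # P{F(dfR - dfF, dfF) > F*}, the right-tail probability
    return f.sf(f_star, df_r - df_f, df_f)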

Q - Why is it that SSTO is the same in the full and reduced model?

From the ANOVA table we know that the df of SSE are n-p where p is the total number of variables including the constant.
For the example above

dfF = n-4
dfR = n-2
so
F* = [(SSER - SSEF)/((n-2) - (n-4))] / [SSEF/(n-4)]
F* = [(SSER - SSEF)/2] / [SSEF/(n-4)]
Note that the df of the difference between SSER and SSEF is equal to the number of parameters set to zero by the hypothesis.

2.  Carrying Out the Test

From the earlier discussion, if H0 is true F* is distributed as F(dfR - dfF, dfF).
As usual one can test the hypothesis using two equivalent approaches: the p-value (Fisher inductive inference) approach and the critical-value (Neyman-Pearson decision theory) approach.
a.  P-value Approach
The p-value associated with F* is the probability of finding a value of F greater than F* if H0 is true, or
P-value(F*) = P{F(dfR - dfF, dfF)>F*}
The decision rule is
if p-value(F*) >= a conclude H0     (8.3a)
if p-value(F*) < a conclude H1      (8.3b)
where a is the level of significance chosen.
The p-value approach is easiest when using the computer.
b.  Critical-value Approach
The critical value associated with a chosen level of significance a is the value of F(dfR - dfF, dfF) such that, if H0 is true, the probability that F is less than the critical value is 1-a.  The critical value is denoted
F(1-a; dfR - dfF, dfF)
The decision rule is
if F* <= F(1-a; dfR - dfF, dfF) conclude H0      (8.4a)
if F* > F(1-a; dfR - dfF, dfF) conclude H1        (8.4b)
The critical-value approach is easiest when using printed statistical tables of the F distribution.

Q - Why is that?

The strategy for general linear tests is therefore

  1. fit full model and obtain SSEF or R2F
  2. fit reduced model under H0 and obtain SSER or R2R
  3. calculate F* using (8.1) or (8.2)
  4. use decision rule (8.3) or (8.4)
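As an illustration of the strategy, here is a minimal sketch in Python using statsmodels on simulated data (the variable names and the simulation are illustrative only):

import numpy as np
import statsmodels.api as sm
from scipy.stats import f

rng = np.random.default_rng(0)
n = 100
x1, x2, x3 = rng.normal(size=(3, n))
y = 1 + 2 * x1 + rng.normal(size=n)   # simulated data in which b2 = b3 = 0

# 1. fit the full model; statsmodels calls SSE .ssr
full = sm.OLS(y, sm.add_constant(np.column_stack([x1, x2, x3]))).fit()

# 2. fit the reduced model implied by H0: b2 = b3 = 0
red = sm.OLS(y, sm.add_constant(x1)).fit()

# 3. calculate F* using formula (8.1)
f_star = ((red.ssr - full.ssr) / (red.df_resid - full.df_resid)) / (full.ssr / full.df_resid)

# 4. decision rule (8.3): p-value approach
p = f.sf(f_star, red.df_resid - full.df_resid, full.df_resid)

statsmodels also packages steps 3 and 4 as full.compare_f_test(red), which returns the same F*, its p-value, and the numerator df.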

3.  An Alternative Presentation - Extra Sums of Squares

In comparing a full model including X1, X2, and X3 with a reduced model including X1 only, the extra sum of squares of the full model, compared to the reduced model, is denoted SSR(X2, X3 | X1) and defined as
SSR(X2, X3 | X1) = SSE(X1) - SSE(X1, X2, X3)
The extra sum of squares SSR(X2, X3 | X1) is thus the reduction in SSE achieved by including X2 and X3 in a model that already contains X1.
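In the notation of the Python sketch above, the extra sum of squares is simply the difference of the two residual sums of squares:

# SSR(X2, X3 | X1) = SSE(X1) - SSE(X1, X2, X3)
extra_ss = red.ssr - full.ssr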
One can derive the F-test in terms of SSR, but I am not using this approach in the course.  See NKNW Sections 7.1 to 7.3 (pp. 260-274) for an exposition emphasizing extra sums of squares.

3.  EXAMPLES OF TESTING JOINT HYPOTHESES

1.  Body Fat Example  (NKNW pp. 260-263)

In the model for body fat (Y) containing X1, X2, and X3 one wants to test the joint significance of X2 & X3.  The hypothesis setup is:
H0: b2 = b3 = 0
H1: not both b2 and b3 = 0
From the full and reduced models one obtains
 
                     Full         Reduced
SSE                  98.404888    143.119703
R2                   0.801359     0.711097
df of SSE = (n-p)    16           18

Using formula (8.1) with SSE one gets

F* = [(143.119703 - 98.404888)/(18 - 16)] / [98.404888/16] = 3.635
Equivalently, using formula (8.2) with R2 one gets
F* = [(0.801359 - 0.711097)/(18 - 16)] / [(1 - 0.801359)/16] = 3.635
a.  P-value Approach
We find the p-value of F* = 3.635 as
>calc 1 - fcf(3.635,2,16)
        0.049956
The p-value is just less than a = .05 so we conclude H1 using decision rule (8.3).  We say "The coefficients of X2 and X3 are jointly significant at the a = .05 level".
b.  Critical-value Approach
We choose a = .05.  To apply the decision rule we need the critical value F(0.95; 2, 16).  Using SYSTAT's calculator we find
>calc fif(0.95,2,16)
        3.633723
Since F* = 3.635 > 3.633723 (again, just barely!) we conclude H1 using decision rule (8.4).  We conclude that b2 and b3 are not both zero at the a = .05 level.

Note that while in the full model each coefficient b2 and b3 is individually non-significant, they are jointly significant.
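The two SYSTAT calculator calls can be reproduced in other software; for example, a sketch with Python's scipy:

from scipy.stats import f

f_star = ((143.119703 - 98.404888) / 2) / (98.404888 / 16)   # 3.635, by (8.1)
print(f.sf(f_star, 2, 16))    # about 0.0500, matches 1 - fcf(3.635,2,16)
print(f.ppf(0.95, 2, 16))     # about 3.6337, matches fif(0.95,2,16)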

2.  Testing a Polynomial Function

Table 3 from Nielsen's (1994) article on income inequality is reproduced in the next exhibit. We want to test the joint significance of the second-degree polynomial of energy consumption per capita.  We do this by comparing Model 8 (F) with Model 6 (R).  From the table we have R2F = .818; dfF = 56-9 = 47; R2R = .807; dfR = 49.  Thus
F* = [(.818 - .807)/2] / [(1 - .818)/47] = 1.4203
We find the p-value of F* as
>calc 1 - fcf(1.4203,2,47)
        0.251822
We conclude that the polynomial coefficients are jointly non-significant at the .05 level.
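With the f_star_from_r2 sketch from Section 2 one can run the test directly from the published figures:

f_star = f_star_from_r2(r2_f=0.818, r2_r=0.807, df_r=49, df_f=47)   # 1.4203
p = f.sf(f_star, 49 - 47, 47)                                       # 0.2518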

3.  Testing Effect of Qualitative Variable with Multiple Indicators

This example uses the Afifi and Clark data set.  The dependent variable is a depression score calculated as the square root of (total CESD score + 1) (to normalize the distribution of the variable).  A multiple regression of the depression score on education, logarithm of income, sex (female) and religion is estimated as
depscore = 3.872 - .08 educatn - .896 l10inc + .376 female + .252 cath + .785 jewi + .588 none     n = 256, R2 = .124
(t-ratios: constant 10.22; educatn -1.26; l10inc -3.34; female 2.35; cath 1.19; jewi 2.79; none 2.88)
Religion is represented by three (0,1) indicators for Catholic, Jewish and None (with Protestant as the omitted category).
1.  From the regression results one can tell that, at the .05 level, there is no significant difference in depression score between Catholic and Protestant, but there are significant differences between Jewish and Protestant and between None and Protestant.  (Q -- How can one tell?)
2.  One also wants to test whether religion (jointly represented by the three indicators) is a significant predictor of depression.  To do this the hypothesis setup is
H0: b4 = b5 = b6 = 0
H1: not all three b's = 0
To do this one could estimate the reduced model corresponding to the null hypothesis and use the formula above; one can also use STATA (test cath jewi none) to test the joint significance of the three indicators.  The program calculates the F-test as F* = 4.40, P{F(3, 249) > F*} = 0.0049.  Thus one concludes that religion is a significant predictor of the depression score (at the .05 and even the .01 level).
3.  Given that their coefficients are both large, one may also want to test whether Jewish and None are equally predisposed to depression.  The hypothesis setup is
H0: b5 = b6
H1: b5 <> b6
Using STATA (test none=jewi) yields F* = .4, P{F(1, 249)>F*} = .526.  Thus one concludes that there is no significant difference between categories Jewish and None with respect to depression score.
4.  Suppose one has theological or other reasons to believe that Jewish is more vulnerable to depression than Catholic.  The test for equality of the coefficients for these two categories yields F* = 2.81,  P{F(1, 249)>2.81} = .0949, so the hypothesis of equality cannot be rejected at the .05 level.  Can the hypothesis that Jewish is more vulnerable to depression than Catholic be rejected at the .05 level? 
Exhibit: Stata commands for testing effects of religion on depression score
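For readers working in Python rather than Stata, the same three tests can be written with the f_test method of statsmodels.  A sketch, assuming the Afifi and Clark variables sit in a data frame df (an assumption for illustration):

import statsmodels.formula.api as smf

results = smf.ols("depscore ~ educatn + l10inc + female + cath + jewi + none",
                  data=df).fit()

print(results.f_test("cath = 0, jewi = 0, none = 0"))   # religion jointly; cf. F* = 4.40
print(results.f_test("jewi = none"))                    # Jewish vs. None; cf. F* = 0.40
print(results.f_test("jewi = cath"))                    # Jewish vs. Catholic; cf. F* = 2.81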

4.  TESTS ON REGRESSION COEFFICIENTS USING FULL VS. REDUCED MODEL

General linear tests can be cast as comparisons of a full and a reduced model.

1. Test Whether a Single bk = 0

H0: bk = 0
H1: bk <> 0
This is the usual test reported as the p-value of tk* = bk/s{bk} on the regression printout.  One can show that the corresponding F* from the full vs. reduced model comparison is equal to the square of tk*, i.e., F* = (tk*)2.  Thus the t-test and F-test for a single coefficient are equivalent.
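Continuing the simulated sketch from Section 2, the equivalence is easy to check numerically (x1 is statsmodels' default name for the first regressor):

t1 = full.tvalues[1]                 # t* for b1 on the printout
f1 = full.f_test("x1 = 0").fvalue    # F* from the full vs. reduced comparison
assert np.isclose(t1 ** 2, f1)       # F* = (t*)^2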

2.  Test Whether All bk = 0

H0: b1 = b2 = ... = bp-1 = 0
H1: not all bk (k = 1, ..., p-1) = 0
This is the usual test reported as the p-value of F* = MSR/MSE on the regression printout.  It follows as a special case of the general formula in which the full model has SSE(X1, X2, ..., Xp-1) with df=n-p and the reduced model has SSE = SSTO with df=n-1.
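In the statsmodels sketch this overall F* and its p-value are available directly (they are also printed by summary()):

print(full.fvalue, full.f_pvalue)   # F* = MSR/MSE with df = (p-1, n-p)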

3.  Test Whether Some bk = 0

H0: bq = bq+1 = ... = bp-1 = 0
H1: not all of the bk in H0 = 0
(The notation assumes that the variables are arranged so that the tested variables have subscripts q to p-1.)  This is the situation discussed earlier.

Other tests can be carried out as a comparison of full & reduced model, using "tricks".

4.  Test Equality of 2 Coefficients

H0: b1 = b2
H1: b1 <> b2
The full model is (omitting the i subscript)
Y = b0 + b1X1 + b2X2 + b3X3 + e
The trick is to define the reduced model as
Y = b0 + bc (X1 + X2) + b3X3 + e
where bc is the "common" regression coefficient of X1 and X2 under H0.  One estimates the reduced model as the regression of Y on a new variable calculated as the sum of X1 and X2.  Then one calculates F* using formula (8.1) or (8.2).  The full model has df=n-4 and the reduced model has df=n-3, so F* has df=(1, n-4).
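A sketch of the trick, continuing the simulated example from Section 2:

# reduced model under H0: b1 = b2 -- regress Y on the sum X1 + X2 (and X3)
red_eq = sm.OLS(y, sm.add_constant(np.column_stack([x1 + x2, x3]))).fit()

f_star = (red_eq.ssr - full.ssr) / (full.ssr / full.df_resid)   # dfR - dfF = 1
p = f.sf(f_star, 1, full.df_resid)                              # df = (1, n-4)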

5.  Test Whether Some bk Have Specific Values Other than 0

H0: b1 = 3,  b3 = 5
H1: not both equalities in H0 hold
With the full model as above, one derives the reduced model by replacing b1 and b3 by their assumed values under H0 and removing their effects from the dependent variable, as
W = Y - 3X1 - 5X3 = b0 + b2X2 + e
where W is the new dependent variable.  The reduced model is estimated as the regression of W on X2.  Then one calculates F* which has df=(2, n-4).
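A sketch of this trick as well, with the same simulated variables:

# reduced model under H0: b1 = 3, b3 = 5 -- move the assumed effects into W
w = y - 3 * x1 - 5 * x3
red_w = sm.OLS(w, sm.add_constant(x2)).fit()

f_star = ((red_w.ssr - full.ssr) / 2) / (full.ssr / full.df_resid)
p = f.sf(f_star, 2, full.df_resid)                              # df = (2, n-4)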

5.  USING THE HYPOTHESIS COMMAND IN SYSTAT

The hypothesis command in SYSTAT implements the general linear test.  To use it one only needs to estimate the full model.  The hypothesis command applies to the last model estimated.  A test begins with the hypothesis command and ends with the test command, with the specifics of the test in between.  For the following examples assume that the full model has 5 independent variables X1 to X5.  First estimate the full model as

model y = constant + x1 + x2 + x3 + x4 + x5
estimate

One way to test the hypothesis that the coefficient of X1 is zero is to use the effect command
hypothesis
effect = x1
test
An alternative method is to use the specify command
hypothesis
specify x1 = 0
test
Note that this test only repeats the test already on the regression printout.

To test the hypothesis that the coefficients of X1, X3, and X4 are simultaneously equal to zero using the effect command
hypothesis
effect = x1&x3&x4
test
Or, using the specify command:
hypothesis
specify x1 = 0; x3 = 0; x4 = 0
test
Note that equalities are listed on the same line separated by semicolons.

Testing more complicated hypotheses involving equality of coefficients or whether a coefficient has a specific nonzero value is done using the specify command.

To test that the coefficients of X2 and X3 are equal, i.e. that b2 - b3 = 0
hypothesis
specify x2 - x3 = 0
test

To test whether the coefficient of X3 is 3.5 times as large as the coefficient of X5, i.e. that b3 - 3.5b5 = 0
hypothesis
specify x3 - 3.5*x5 = 0
test

To test that coefficients have specific values, for example that b1=4 and b3=17, use the commands
hypothesis
specify x1 = 4 ; x3 = 17
test

To test that the difference between coefficient of X2 and X3 is equal to the specific value 20, use the commands
hypothesis
specify x2 - x3 = 20
test

Examples of actual tests are shown in the next exhibits.
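For comparison only (this is not SYSTAT syntax), each of the hypotheses above can also be written as a constraint string for statsmodels' f_test, assuming a fitted model results with regressors named x1 to x5:

results.f_test("x1 = 0")                   # single coefficient
results.f_test("x1 = 0, x3 = 0, x4 = 0")   # joint significance of several
results.f_test("x2 - x3 = 0")              # equality of two coefficients
results.f_test("x3 - 3.5*x5 = 0")          # proportionality constraint
results.f_test("x1 = 4, x3 = 17")          # specific nonzero values
results.f_test("x2 - x3 = 20")             # specific nonzero difference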

6.  MATRIX FORMULATION OF GENERAL LINEAR TEST (OPTIONAL)

General linear hypothesis tests in SYSTAT use an underlying approach based on matrices.  You can see these matrices in the output from the HYPOTHESIS command.  The null hypothesis H0 for any linear hypothesis can be represented by specifying a matrix A and a vector d.  Then the null hypothesis H0 is represented as
H0: Ab = d
where A is s×p, b is p×1, and d is s×1; s is the number of constraints on the coefficients.
(In SYSTAT A and d are specified by the commands AMATRIX and DMATRIX, respectively.   See SYSTAT V6/V7 - Statistics,  pp. 284-289.)

Various specifications of A and d are shown in the following examples, based on a full model with a constant term and variables X1, X2, and X3.

EX:  H0: b1 = 0
A = [0 1 0 0]  d = [0]

EX:  H0: b1 = b2 = 0
A = [0 1 0 0
     0 0 1 0]
d' = [0 0]

EX: H0: b1 = b2
A = [0 1 -1 0]  d = [0]
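The statistic implied by this formulation is the standard result F* = (Ab - d)'[A(X'X)^-1 A']^-1 (Ab - d) / (s MSE), with df = (s, n-p).  A minimal numpy sketch (illustrative, not SYSTAT's internal code):

import numpy as np

def general_linear_test(X, y, A, d):
    # F* for H0: A b = d, where X (n x p) includes the constant column
    n, p = X.shape
    s = A.shape[0]
    xtx_inv = np.linalg.inv(X.T @ X)
    b = xtx_inv @ X.T @ y                      # OLS estimates
    mse = np.sum((y - X @ b) ** 2) / (n - p)   # SSEF / dfF
    r = A @ b - d                              # discrepancies Ab - d
    f_star = r @ np.linalg.solve(A @ xtx_inv @ A.T, r) / (s * mse)
    return f_star, (s, n - p)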

The current edition of NKNW no longer presents this material.  The following 3 pages from an older edition (Neter, Wasserman, and Kutner 1990, pp. 306-308) derive the general linear test in matrix notation.  (NWK use the notation C for A and h for d.)




Last modified 20 Mar 2006