Malhotra18-Discriminant Analysis-With SPSS Output Inserts-2003 Format

Chapter Eighteen
Discriminant Analysis
18-2
Chapter Outline
1) Overview 2) Basic Concept 3) Relation to Regression and ANOVA 4) Discriminant Analysis Model 5) Statistics Associated with Discriminant Analysis 6) Conducting Discriminant Analysis i. Formulation ii. Estimation iii. Determination of Significance
iv. Interpretation
v. Validation
18-3
Chapter Outline
7) i. ii. Multiple Discriminant Analysis Formulation Estimation
iii. Determination of Significance iv. Interpretation v. 8) 9) Validation Stepwise Discriminant Analysis Internet and Computer Applications
10) Focus on Burke 11) Summary 12) Key Terms and Concepts
Similarities and Differences between ANOVA, Regression, and Discriminant Analysis

Table 18.1
ANOVA REGRESSION One DISCRIMINANT ANALYSIS One
18-4
Similarities Number of dependent variables Number of independent variables

Differences Nature of the dependent variables Nature of the independent variables
One
Multiple
Multiple
Multiple
Metric Categorical
Metric Metric
Categorical Metric
18-5
Discriminant analysis is a technique for analyzing data when the criterion or dependent variable is categorical and the predictor or independent variables are interval in nature. The objectives of discriminant analysis are as follows: Development of discriminant functions, or linear combinations of the predictor or independent variables, which will best discriminate between the categories of the criterion or dependent variable (groups). Examination of whether significant differences exist among the groups, in terms of the predictor variables. Determination of which predictor variables contribute to most of the intergroup differences. Classification of cases to one of the groups based on the values of the predictor variables. Evaluation of the accuracy of classification.
18-6
When the criterion variable has two categories, the technique is known as two-group discriminant analysis. When three or more categories are involved, the technique is referred to as multiple discriminant analysis. The main distinction is that, in the two-group case, it is possible to derive only one discriminant function. In multiple discriminant analysis, more than one function may be computed. In general, with G groups and k predictors, it is possible to estimate up to the smaller of G - 1, or k, discriminant functions. The first function has the highest ratio of between-groups to within-groups sum of squares. The second function, uncorrelated with the first, has the second highest ratio, and so on. However, not all the functions may be statistically significant.
18-7
Discriminant Analysis Model

The discriminant analysis model involves linear combinations of the following form:
D = b0 + b1X1 + b2X2 + b3X3 + . . . + bkXk

where
D b 's X 's
= = =
discriminant score discriminant coefficient or weight predictor or independent variable
The coefficients, or weights (b), are estimated so that the groups differ as much as possible on the values of the discriminant function. This occurs when the ratio of between-group sum of squares to within-group sum of squares for the discriminant scores is at a maximum.
18-8
Statistics Associated with Discriminant Analysis
Canonical correlation. Canonical correlation measures the extent of association between the discriminant scores and the groups. It is a measure of association between the single discriminant function and the set of dummy variables that define the group membership. Centroid. The centroid is the mean values for the discriminant scores for a particular group. There are as many centroids as there are groups, as there is one for each group. The means for a group on all the functions are the group centroids. Classification matrix. Sometimes also called confusion or prediction matrix, the classification matrix contains the number of correctly classified and misclassified cases.
18-9
Discriminant function coefficients. The discriminant function coefficients (unstandardized) are the multipliers of variables, when the variables are in the original units of measurement. Discriminant scores. The unstandardized coefficients are multiplied by the values of the variables. These products are summed and added to the constant term to obtain the discriminant scores. Eigenvalue. For each discriminant function, the Eigenvalue is the ratio of between-group to withingroup sums of squares. Large Eigenvalues imply superior functions.
18-10
F values and their significance. These are
calculated from a one-way ANOVA, with the grouping variable serving as the categorical independent variable. Each predictor, in turn, serves as the metric dependent variable in the ANOVA. Group means and group standard deviations. These are computed for each predictor for each group. Pooled within-group correlation matrix. The pooled within-group correlation matrix is computed by averaging the separate covariance matrices for all the groups.
18-11
Standardized discriminant function coefficients. The standardized discriminant function coefficients are the discriminant function coefficients and are used as the multipliers when the variables have been standardized to a mean of 0 and a variance of 1. Structure correlations. Also referred to as discriminant loadings, the structure correlations represent the simple correlations between the predictors and the discriminant function. Total correlation matrix. If the cases are treated as if they were from a single sample and the correlations computed, a total correlation matrix is obtained. Wilks' . Sometimes also called the U statistic, Wilks' for each predictor is the ratio of the within-group sum of squares to the total sum of squares. Its value varies between 0 and 1. Large values of (near 1) indicate that group means do not seem to be different. Small values of (near 0) indicate that the group means seem to be different.
18-12
Conducting Discriminant Analysis

Fig. 18.1
Formulate the Problem
Estimate the Discriminant Function Coefficients
Determine the Significance of the Discriminant Function
Interpret the Results
Assess Validity of Discriminant Analysis

Formulate the Problem
18-13
Identify the objectives, the criterion variable, and the independent variables. The criterion variable must consist of two or more mutually exclusive and collectively exhaustive categories. The predictor variables should be selected based on a theoretical model or previous research, or the experience of the researcher. One part of the sample, called the estimation or analysis sample, is used for estimation of the discriminant function. The other part, called the holdout or validation sample, is reserved for validating the discriminant function. Often the distribution of the number of cases in the analysis and validation samples follows the distribution in the total sample.
18-14
A Sample Problem
Suppose we want to determine the salient characteristics of families that have already visited a vacation resort during the last two years. Data were obtained from a pretest sample of 42 households. Of these, 30 households were included in the analysis sample and the remaining 12 were part of the validation sample.
18-15
Variables in Data Set
DEPENDENT or CRITERION variable
INDEPENDENT or PREDICTOR variables

The households that visited a resort during the last two years are coded as 1; those that did not, as 2 (VISIT variable as criterion with two categories).
Annual family income (INCOME) Attitude toward travel (TRAVEL) Importance attached to family vacation (VACATION) Household size (HSIZE) Age of the head of household (AGE)
18-16
Information on Resort Visits: Analysis Sample

Table 18.2
Resort Visit Annual Family Income ($000) Attitude Toward Travel Importance Household Age of Attached Size Head of to Family Household Vacation Amount Spent on Family Vacation
No.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
50.2 70.3 62.9 48.5 52.7 75.0 46.2 57.0 64.1 68.1 73.4 71.9 56.2 49.3 62.0
5 6 7 7 6 8 5 2 7 7 6 5 1 4 5
8 7 5 5 6 7 3 4 5 6 7 8 8 2 6
3 4 6 5 4 5 3 6 4 5 5 4 6 3 2
43 61 52 36 55 68 62 51 57 45 44 64 54 56 58
M (2) H (3) H (3) L (1) H (3) H (3) M (2) M (2) H (3) H (3) H (3) H (3) M (2) H (3) H (3)
18-17
Information on Resort Visits: Analysis Sample

Table 18.2 cont.
Resort Visit 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 Annual Family Income ($000) 32.1 36.2 43.2 50.4 44.1 38.3 55.0 46.1 35.0 37.3 41.8 57.0 33.4 37.5 41.3 Attitude Toward Travel 5 4 2 5 6 6 1 3 6 2 5 8 6 3 3 Importance Household Age of Attached Size Head of to Family Household Vacation 4 3 5 2 6 6 2 5 4 7 1 3 8 2 3 3 2 2 4 3 2 2 3 5 4 3 2 2 3 2 58 55 57 37 42 45 57 51 64 54 56 36 50 48 42 Amount Spent on Family Vacation L L M M M L M L L L M M L L L (1) (1) (2) (2) (2) (1) (2) (1) (1) (1) (2) (2) (1) (1) (1)
No.
16 17 18 19 20 21 22 23 24 25 26 27 28 29 30
Information on Resort Visits: Holdout Sample

Table 18.3
Resort Visit Annual Family Income ($000) Attitude Toward Travel Importance Household Age of Attached Size Head of to Family Household Vacation Amount Spent on Family Vacation
18-18
No.
1 2 3 4 5 6 7 8 9 10 11 12
1 1 1 1 1 1 2 2 2 2 2 2
50.8 63.6 54.0 45.0 68.0 62.1 35.0 49.6 39.4 37.0 54.5 38.2
4 7 6 5 6 5 4 5 6 2 7 2
7 4 7 4 6 6 3 3 5 6 3 2
3 7 4 3 6 3 4 5 3 5 3 3
45 55 58 60 46 56 54 39 44 51 37 49
M(2) H (3) M(2) M(2) H (3) H (3) L (1) L (1) H (3) L (1) M(2) L (1)
18-19
Estimate the Discriminant Function Coefficients

The direct method involves estimating the discriminant function so that all the predictors are included simultaneously. In stepwise discriminant analysis, the predictor variables are entered sequentially, based on their ability to discriminate among groups.
Results of Two-Group Discriminant Analysis (Table 18.4)

Group Statis tics visit 1 Mean Std. Deviation V alid N (lis tw is e) Mean Std. Deviation V alid N (lis tw is e) Mean Std. Deviation V alid N (lis tw is e) income 60.520 9.8307 15 15.000 41.913 7.5511 15 15.000 51.217 12.7952 30 30.000 travel 5.400 1.9198 15 15.000 4.333 1.9518 15 15.000 4.867 1.9780 30 30.000 vacation 5.800 1.8205 15 15.000 4.067 2.0517 15 15.000 4.933 2.0998 30 30.000 hs iz e 4.333 1.2344 15 15.000 2.800 .9411 15 15.000 3.567 1.3309 30 30.000 age 53.733 8.7706 15 15.000 50.133 8.2710 15 15.000 51.933 8.5740 30 30.000
18-20
Unw eighted Weighted
Total
Pooled Within-Groups M atrice s Correlation income travel vacation hs iz e age income 1.000 .197 .091 .089 -.014 travel .197 1.000 .084 -.017 -.197 vacation .091 .084 1.000 .070 .017 hs iz e .089 -.017 .070 1.000 -.043 age -.014 -.197 .017 -.043 1.000
Tes ts of Equality of Group M eans Wilks ' Lambda .453 .925 .824 .657 .954 F 33.796 2.277 5.990 14.636 1.338 df 1 1 1 1 1 1 df 2 28 28 28 28 28 Sig. .000 .143 .021 .001 .257
income travel vacation hs iz e age
Contd.
Results of Two-Group Discriminant Analysis (Table 18.4 contd.)

Eigenvalues Canonical Function Eigenvalue % of V arianc e Cumulativ e % Correlation 1 1.786a 100.0 100.0 .801 a. First 1 canonical discriminant f unctions w ere us ed in the analysis.
18-21
Wilks ' Lam bda Test of Function(s) 1 Wilks ' Lambda .359 Chi-s quare 26.130 df 5 Sig. .000
Standar dize d Canonical Dis crim inant Function Coe fficie nts Function 1 .743 .096 .233 .469 .209
Structure Matrix Func tion 1 .822 .541 .346 .213 .164
income travel vacation hs iz e age
income hs iz e vacation travel age
Pooled w ithin-groups c orrelations betw een discriminating variables and standardized canonic al disc riminant f unctions Variables ordered by absolute s iz e of correlation w ithin func tion.
Contd.
Results of Two-Group Discriminant Analysis (Table 18.4 contd.)

Canonical Dis crim inant Function Coe fficie nts Func tion 1 .085 .050 .120 .427 .025 -7.975
18-22
income travel vacation hs iz e age (Cons tant)
Unstandardiz ed coef f icients
Functions at Group Ce ntroids Func tion 1 1.291 -1.291
visit 1 2
Unstandardized c anonical disc riminant f unctions evaluated at group means
Contd.
Results of Two-Group Discriminant Analysis (Table 18.4)

Prior Pr obabilitie s for Groups Cases Us ed in Analysis Unw eighted Weighted 15 15.000 15 15.000 30 30.000
18-23
visit 1 2 Total
Prior .500 .500 1.000
a Clas sification Res ults
Original
Clas sification Function Coe fficie nts
Count %
visit income travel vacation hs iz e age (Cons tant) 1 .678 1.509 .938 3.322 .832 -57.532 2 .459 1.381 .628 2.218 .768 -36.936
visit 1 2 1 2
Predicted Group Membership 1 2 12 3 0 15 80.0 20.0 .0 100.0
Total 15 15 100.0 100.0
a. 90.0% of original grouped cas es c orrec tly clas sified.
Fisher's linear dis criminant f unctions
18-24
Determine the Significance of Discriminant Function

The null hypothesis that, in the population, the means of all discriminant functions in all groups are equal can be statistically tested. In SPSS this test is based on Wilks' . If several functions are tested simultaneously (as in the case of multiple discriminant analysis), the Wilks' statistic is the product of the univariate for each function. The significance level is estimated based on a chisquare transformation of the statistic. If the null hypothesis is rejected, indicating significant discrimination, one can proceed to interpret the results.

Interpret the Results
18-25
The interpretation of the discriminant weights, or coefficients, is similar to that in multiple regression analysis. Given the multicollinearity in the predictor variables, there is no unambiguous measure of the relative importance of the predictors in discriminating between the groups. With this caveat in mind, we can obtain some idea of the relative importance of the variables by examining the absolute magnitude of the standardized discriminant function coefficients. Some idea of the relative importance of the predictors can also be obtained by examining the structure correlations, also called canonical loadings or discriminant loadings. These simple correlations between each predictor and the discriminant function represent the variance that the predictor shares with the function. Another aid to interpreting discriminant analysis results is to develop a characteristic profile for each group by describing each group in terms of the group means for the predictor variables.
18-26
Access Validity of Discriminant Analysis

Many computer programs, such as SPSS, offer a leaveone-out cross-validation option. The discriminant weights, estimated by using the analysis sample, are multiplied by the values of the predictor variables in the holdout sample to generate discriminant scores for the cases in the holdout sample. The cases are then assigned to groups based on their discriminant scores and an appropriate decision rule. The hit ratio, or the percentage of cases correctly classified, can then be determined by summing the diagonal elements and dividing by the total number of cases. It is helpful to compare the percentage of cases correctly classified by discriminant analysis to the percentage that would be obtained by chance. Classification accuracy achieved by discriminant analysis should be at least 25% greater than that obtained by chance.
18-27
Results of Three-Group Discriminant Analysis

Table 18.5
Group Means AMOUNT INCOME TRAVEL VACATION HSIZE AGE
1 2 3 Total
38.57000 50.11000 64.97000 51.21667
4.50000 4.00000 6.10000 4.86667
4.70000 4.20000 5.90000 4.93333
3.10000 3.40000 4.20000 3.56667
50.30000 49.50000 56.00000 51.93333
Group Standard Deviations
1 2 3 Total
5.29718 6.00231 8.61434 12.79523
1.71594 2.35702 1.19722 1.97804
1.88856 2.48551 1.66333 2.09981
1.19722 1.50555 1.13529 1.33089

HSIZE
8.09732 9.25263 7.60117 8.57395

AGE
Pooled Within-Groups Correlation Matrix INCOME TRAVEL VACATION INCOME TRAVEL VACATION HSIZE AGE 1.00000 0.05120 0.30681 0.38050 -0.20939 1.00000 0.03588 0.00474 -0.34022
1.00000 0.22080 -0.01326
1.00000 -0.02512
1.00000
Contd.
18-28

Table 18.5 cont.
Wilks' (U-statistic) and univariate F ratio with 2 and 27 degrees of freedom. Variable INCOME TRAVEL VACATION HSIZE AGE Wilks' Lambda 0.26215 0.78790 0.88060 0.87411 0.88214
F
38.00 3.634 1.830 1.944 1.804
Significance 0.0000 0.0400 0.1797 0.1626 0.1840
CANONICAL DISCRIMINANT FUNCTIONS
Function
1* 2*
Eigenvalue
3.8190 0.2469
% of Variance 93.93 6.07
Cum Canonical After Wilks' % Correlation Function Chi-square df Significance : 0 0.1664 44.831 10 0.00 93.93 0.8902 : 1 0.8020 5.517 4 0.24 100.00 0.4450 :
* marks the two canonical discriminant functions remaining in the analysis. Standardized Canonical Discriminant Function Coefficients INCOME TRAVEL VACATION HSIZE AGE FUNC 1 1.04740 0.33991 -0.14198 -0.16317 0.49474 FUNC 2 -0.42076 0.76851 0.53354 0.12932 0.52447
Contd.
18-29

Table 18.5 cont.
Structure Matrix: Pooled within-groups correlations between discriminating variables and canonical discriminant functions (variables ordered by size of correlation within function) INCOME HSIZE VACATION TRAVEL AGE FUNC 1 0.85556* 0.19319* 0.21935 0.14899 0.16576 FUNC 2 -0.27833 0.07749 0.58829* 0.45362* 0.34079*
Unstandardized canonical discriminant function coefficients FUNC 1 FUNC 2 INCOME 0.1542658 -0.6197148E-01 TRAVEL 0.1867977 0.4223430 VACATION -0.6952264E-01 0.2612652 HSIZE -0.1265334 0.1002796 AGE 0.5928055E-01 0.6284206E-01 (constant) -11.09442 -3.791600
Canonical discriminant functions evaluated at group means (group centroids) Group FUNC 1 FUNC 2 1 -2.04100 0.41847 2 -0.40479 -0.65867 3 2.44578 0.24020
Contd.
18-30

Table 18.5 cont.
Classification Results: Actual Group Group Group Group 1 2 3 No. of Cases 10 10 10 Predicted Group Membership 1 2 3 9 90.0% 1 10.0% 1 10.0% 9 90.0% 0 0.0% 0 0.0% 8 80.0%
0 2 0.0% 20.0% Percent of grouped cases correctly classified: 86.67%
Classification results for cases not selected for use in the analysis Predicted Group Membership Actual Group No. of Cases 1 2 3 Group 1 4 3 75.0% 1 25.0% 0 0.0%
Group
Group
2
3
4
4
0 0.0%
3 75.0%
1 25.0%
3 75.0%
1 0 25.0% 0.0% Percent of grouped cases correctly classified: 75.00%
18-31
All-Groups Scattergram
Fig. 18.2
Across: Function 1 Down: Function 2 4.0
1
1
1 1 *1
1 1 1
0.0
23 12 * 2 2 1 2 2 2
3 3* 3 3 3
3
-4.0
* indicates a group centroid -6.0 -4.0 -2.0 0.0 2.0

4.0 6.0
18-32
Territorial Map
Fig. 18.3
13 13 Across: Function 1 13 Down: Function 2 13 13 * Indicates a 13 group centroid 13 113 112 3 112233 *1 1 1 2 2 2 2 3 3 * 1 1 2 2* 223 233 1122 2233 1122 223 11122 233 11222 223 1122 233 11122 2233 11122 223 1122 11122 233
8.0
4.0
0.0 -4.0
-8.0 -8.0
-6.0
-4.0
-2.0
0.0
2.0
4.0
6.0
8.0
18-33
Stepwise Discriminant Analysis
Stepwise discriminant analysis is analogous to stepwise multiple regression (see Chapter 17) in that the predictors are entered sequentially based on their ability to discriminate between the groups. An F ratio is calculated for each predictor by conducting a univariate analysis of variance in which the groups are treated as the categorical variable and the predictor as the criterion variable. The predictor with the highest F ratio is the first to be selected for inclusion in the discriminant function, if it meets certain significance and tolerance criteria. A second predictor is added based on the highest adjusted or partial F ratio, taking into account the predictor already selected.
18-34
Stepwise Discriminant Analysis
Each predictor selected is tested for retention based on its association with other predictors selected. The process of selection and retention is continued until all predictors meeting the significance criteria for inclusion and retention have been entered in the discriminant function. The selection of the stepwise procedure is based on the optimizing criterion adopted. The Mahalanobis procedure is based on maximizing a generalized measure of the distance between the two closest groups. The order in which the variables were selected also indicates their importance in discriminating between the groups.
18-35
SPSS Windows
The DISCRIMINANT program performs both twogroup and multiple discriminant analysis. To select this procedure using SPSS for Windows click: Analyze>Classify>Discriminant

Malhotra18-Discriminant Analysis-With SPSS Output Inserts-2003 Format

Transféré par

Informations du document

Description originale:

Copyright

Formats disponibles

Partager ce document

Partager ou intégrer le document

Options de partage

Avez-vous trouvé ce document utile ?

Ce contenu est-il inapproprié ?

Droits d'auteur :

Formats disponibles

Malhotra18-Discriminant Analysis-With SPSS Output Inserts-2003 Format

Transféré par

Droits d'auteur :

Formats disponibles

Chapter Eighteen

Similarities and Differences between ANOVA, Regression, and Discriminant Analysis

Similarities Number of dependent variables Number of independent variables

Discriminant Analysis Model

D = b0 + b1X1 + b2X2 + b3X3 + . . . + bkXk

discriminant score discriminant coefficient or weight predictor or independent variable

Statistics Associated with Discriminant Analysis

Statistics Associated with Discriminant Analysis

Statistics Associated with Discriminant Analysis

F values and their significance. These are

Statistics Associated with Discriminant Analysis

Conducting Discriminant Analysis

Estimate the Discriminant Function Coefficients

Determine the Significance of the Discriminant Function

Interpret the Results

Assess Validity of Discriminant Analysis

Conducting Discriminant Analysis

Variables in Data Set

DEPENDENT or CRITERION variable

INDEPENDENT or PREDICTOR variables

Information on Resort Visits: Analysis Sample

Information on Resort Visits: Analysis Sample

Information on Resort Visits: Holdout Sample

Conducting Discriminant Analysis

Estimate the Discriminant Function Coefficients

Results of Two-Group Discriminant Analysis (Table 18.4)

Unw eighted Weighted

Unw eighted Weighted

Unw eighted Weighted

income travel vacation hs iz e age

Results of Two-Group Discriminant Analysis (Table 18.4 contd.)

Structure Matrix Func tion 1 .822 .541 .346 .213 .164

income travel vacation hs iz e age

income hs iz e vacation travel age

Results of Two-Group Discriminant Analysis (Table 18.4 contd.)

income travel vacation hs iz e age (Cons tant)

Unstandardiz ed coef f icients

Functions at Group Ce ntroids Func tion 1 1.291 -1.291

Unstandardized c anonical disc riminant f unctions evaluated at group means

Results of Two-Group Discriminant Analysis (Table 18.4)

Prior .500 .500 1.000

a Clas sification Res ults

Predicted Group Membership 1 2 12 3 0 15 80.0 20.0 .0 100.0

Total 15 15 100.0 100.0

a. 90.0% of original grouped cas es c orrec tly clas sified.

Fisher's linear dis criminant f unctions

Conducting Discriminant Analysis

Determine the Significance of Discriminant Function

Conducting Discriminant Analysis

Conducting Discriminant Analysis

Access Validity of Discriminant Analysis

Results of Three-Group Discriminant Analysis

38.57000 50.11000 64.97000 51.21667

4.50000 4.00000 6.10000 4.86667

4.70000 4.20000 5.90000 4.93333

3.10000 3.40000 4.20000 3.56667

50.30000 49.50000 56.00000 51.93333

Group Standard Deviations

5.29718 6.00231 8.61434 12.79523

1.71594 2.35702 1.19722 1.97804

1.88856 2.48551 1.66333 2.09981

1.19722 1.50555 1.13529 1.33089

8.09732 9.25263 7.60117 8.57395