Académique Documents
Professionnel Documents
Culture Documents
Correlation
coefficient
Pearson
scale by scale
Sperman Rho
scale by ordinal
Kendall's tau_a
ordinal by ordinal
Kendall's tau_b
ordinal by ordinal
22 tables
Kendall's tau_c
ordinal by ordinal
33 tables
Assumptions
1.linear relationship between x and y
2.continuous random variables
3. both variables must be normally
distributed
4. x and y must be independent of each
other
1. Variables are measured on an
ordinal, interval or ratio scale.
2. Variables need NOT be normally
distributed.
3. There is a monotonic
relationship between the two variables,
i.e. either the variables increase in
value together or as one variable value
increases the other variable value
decreases.
4. This type of correlation is NOT very
sensitive to outliers.
Tau-a statistic tests the strength of
association of the cross tabulations.
Both variables have to be ordinal. Taua will not make any adjustment for ties.
example
comments
1. A nonparametric version of
the Pearson product-moment
correlation.
2. Spearmans is the more
widely used than Kendall
Contingency
coefficient
Nominal by nominal
RxC tables
Phi ( )
Nominal by nominal
22 tables
Biserial
Nominal by scale
Point Biserial
Nominal by scale
Rank Biserial
ordinal by scale
Other coefficients:
Lambda
This is a measure of association for cross tabulations of nominal-level variables. It measures
the percentage improvement in predictability of the dependent variable (row variable or
column variable), given the value of the other variable (column variable or row variable).
Thus, Gamma is the surplus of concordant pairs over discordant pairs, as a percentage of all
pairs, ignoring ties. Gamma defines perfect association as weak monotonicity. Under
statistical independence, Gammawill be 0, but it can be 0 at other times as well (whenever
concordant minus discordant pairs are 0).
Gamma is a symmetric measure and computes the same coefficient value, regardless of which is the
independent (column) variable. Its value ranges between +1 to 1.
In terms of the underlying assumptions, Gamma is equivalent to Spearmans Rho or Kendalls Tau; but in
terms of its interpretation and computation, it is more similar to Kendalls Tau than
Spearmans Rho.Gamma statistic is, however, preferable to Spearmans Rho and Kandalls Tau, when
the data contain many tied observations
to determine how certain one can be in making predictions from a certain model/graph. The coefficient
of determination is the ratio of the explained variation to the total variation. The coefficient of
determination is such that 0 < r 2 < 1, and denotes the strength of the linear association between x
and y.
A more meaningful approach, in determining the importance of a correlation coefficient, is the
coefficient of determination (r). By squaring the correlation coefficient, one obtains a measure of the
common variance between two variables, the proportion of variance accounted for in one of the
variables, or explained by, the other. If the correlation between marital satisfaction and number
of months married is 0.40, then 16% of the variance (-.40 x -.40 = .16) of one variable is accounted
for by the variance of the other. We could say that 16% of the variability in marital satisfaction and
number-of-months-married overlaps. It follows that 84% of the variability is unaccounted for.
In an education study, the results shows that the r2 value of r=0.07 is 0.0049, or 0.49%: one-half of
one percent of variance accounted for. Ninety-nine and three-fourths percent (99.51%) of variance was
unaccounted for. This was a meaningless significant finding to be sure.
Ordinal
Nominal
Dichotomous
Interval/Ratio
Pearson
Spearman*
Point Biserial
Point Biserial
Ordinal
Spearman*
spearman
Rank Biserial
Rank Biserial
Nominal
Point Biserial
Rank Biserial
Dichotomous
Point Biserial
Rank Biserial
Contingency coefficient
Phi