Académique Documents
Professionnel Documents
Culture Documents
Chapter 11:
Bivariate Relationships:
t-test for Comparing the
Means of Two Groups
2008 McGraw-Hill
Bivariate Analysis
Bivariate or two variable analysis
involves searching for statistical
relationships between two variables
A statistical relationship between two
variables asserts that the measurements of
one variable tend to consistently change
with the measurements of the other,
making one variable a good predicator of
the other
2008 McGraw-Hill
Independent and
Dependent Variables
The predictor variable is the
independent variable
The predicted variable is the
dependent variable
2008 McGraw-Hill
Measuring Correlation
Chapter 14-15. Correlation to
determine a relationship between two
interval/ratio variables
Web site Extensions to Chapter 15:
Rank-order correlation test to
determine a relationship between two
numbered ordinal level variables
2008 McGraw-Hill
2008 McGraw-Hill
When to Use an
Independent Samples t-test
Two variables from one population and sample,
one interval/ratio and one dichotomous
nominal/ordinal
Or: There are two populations and samples and
one interval/ratio variable; the samples are
representative of their population
The interval/ratio variable is typically the
dependent variable
The groups do not consist of same subjects
Population variances are assumed equal
2008 McGraw-Hill
Features of an
Independent Samples t-test
The t-test focuses on the computed
difference between two sample
means and addresses the question of
whether the observed difference
between the sample means reflects a
real difference in the population
means or is simply due to sampling
error
2008 McGraw-Hill
Features of an Independent
Samples t-test (cont.)
Step 1. Stating the H0:
The mean of population 1 equals the
mean of population 2
That is, there is no difference in the
means of the interval/ratio variable, X,
for the two populations
2008 McGraw-Hill
Features of an Independent
Samples t-test (cont.)
Step 2. The sampling distribution is the
approximately normal t-distribution
The pooled variance formula for the standard
error is used when we can assume that
population variances are equal
The separate variance formula for the
standard error is used when we cannot
assume that population variances are equal
2008 McGraw-Hill
Features of an Independent
Samples t-test (cont.)
Step 4. The effect is the difference
between the sample means
The test statistic is the effect divided
by the standard error
The p-value is estimated using the
t-distribution table
2008 McGraw-Hill
Assumption of Equality
of Population Variances
When one sample variance is not larger than twice
the size of the other, this suggests that the two
population variances are equal and we assume
equality of variances
We may use the pooled variance estimate of the
standard error
Equality of variances is also termed homogeneity
of variances or homoscedasticity
2008 McGraw-Hill
Assumption of Equality
of Population Variances (cont.)
Heterogeneity of variances, or
heteroscedasticity, is when variances
of the two populations appear unequal
Here we use the separate variance
estimate of the standard error and
calculate degrees of freedom
differently
2008 McGraw-Hill
When to Use a
Nonindependent Samples t-test
There is one population with a representative
sample from it
There are two interval/ratio variables with the
same score design
Or: There is a single variable measured
twice for the same sample subjects
There is a target value of the variable
(usually zero) to which we may compare the
mean of the differences between the two
sets of scores
2008 McGraw-Hill
Features of a Nonindependent
Samples or Matched-Pair t-test
Step 1. Stating the H0:
The mean of differences between
the scores in a population is
equal to zero
2008 McGraw-Hill
Nonindependent Samples or
Matched-Pair t-test (cont.)
Step 2. The sampling distribution is
the approximately normal
t-distribution
The standard error is calculated as
the standard deviation of differences
between scores divided by the square
root of n - 1
2008 McGraw-Hill
Nonindependent Samples or
Matched-Pair t-test (cont.)
Step 4. The effect is the mean of
differences between scores
The test statistic is the effect
divided by the standard error
The p-value is estimated using
the t-distribution table
2008 McGraw-Hill
2008 McGraw-Hill
Four Aspects of
Statistical Relationships
When examining a relationship between
two variables, we can address four things:
existence, direction, strength, and practical
applications
These four aspects provide a checklist for
what to say in writing up the results of a
hypothesis test
2008 McGraw-Hill
Existence of a Relationship
Existence: On the basis of statistical
analysis of a sample, can we conclude that
a relationship exists between two variables
among all subjects in the population?
Established by rejection of the H0
Testing for the existence of a relationship is
the first step in any analysis. If a
relationship is found not to exist, the other
three aspects of a relationship are irrelevant
2008 McGraw-Hill
Direction of a Relationship
Direction: Can the dependent variable
be expected to increase or decrease
as the independent variable
increases?
Direction is stated in the alternative
hypothesis (HA) of step 1 of the six
steps of statistical inference
2008 McGraw-Hill
Strength of a Relationship
Strength: To what extent are
errors reduced in predicting the
scores of a dependent variable
when an independent variable is
used as a predictor?
2008 McGraw-Hill
Practical Applications
of a Relationship
Practical Applications: In practical,
everyday terms, how does
knowledge of a relationship
between two variables help us
understand and predict outcomes
of the dependent variable?
2008 McGraw-Hill
2008 McGraw-Hill
2008 McGraw-Hill
Statistical Follies
Avoid a common tendency:
Difference in means testing is so
widely used that researchers
often focus too heavily on mean
differences while ignoring the
differences in variances (or
standard deviations)
2008 McGraw-Hill