When σ is unknown
The t distributions
The t test
Confidence intervals
Matched pairs t procedures
Robustness
WHEN σ IS UNKNOWN
[Figure: population distribution N(μ, σ); density curves of the standard Normal distribution and of t distributions with df = 100 and df = 20]
When n is large, s is a good estimate of σ and the t distribution with df = n − 1 is close to the standard Normal distribution.
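As a rough numerical check of this claim, the sketch below compares the t density with the standard Normal density for df = 20 and df = 100. The helper names `t_pdf` and `max_gap_from_normal`, and the grid from −4 to 4, are illustrative choices and not part of the original slides.

```python
import math
from statistics import NormalDist


def t_pdf(x, df):
    """Density of the t distribution with df degrees of freedom."""
    log_coef = math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
    coef = math.exp(log_coef) / math.sqrt(df * math.pi)
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)


def max_gap_from_normal(df, lo=-4.0, hi=4.0, points=801):
    """Largest absolute difference between the t and standard Normal densities on a grid."""
    z = NormalDist()
    step = (hi - lo) / (points - 1)
    return max(
        abs(t_pdf(lo + i * step, df) - z.pdf(lo + i * step))
        for i in range(points)
    )


gap_20 = max_gap_from_normal(20)
gap_100 = max_gap_from_normal(100)
print(f"max density gap, df = 20:  {gap_20:.4f}")
print(f"max density gap, df = 100: {gap_100:.4f}")
```

The gap shrinks as df grows, which is the sense in which the t distribution approaches the standard Normal for large n.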
STANDARD DEVIATION VERSUS STANDARD ERROR
For a sample of size n, the sample standard deviation s is:
$s = \sqrt{\dfrac{1}{n-1} \sum_i (x_i - \bar{x})^2}$
The value $s/\sqrt{n}$ is called the standard error of the mean (SEM).
Scientists often present their sample results as the mean ± SEM.
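A minimal sketch of these two quantities, using Python's standard library. The function name `standard_error` and the five data values are hypothetical and chosen only so the arithmetic is easy to follow.

```python
import math
import statistics


def standard_error(values):
    """Standard error of the mean: s / sqrt(n), with s computed using n - 1."""
    s = statistics.stdev(values)  # sample standard deviation (divides by n - 1)
    return s / math.sqrt(len(values))


# Hypothetical sample, not taken from the slides.
sample = [4.0, 6.0, 8.0, 10.0, 12.0]

mean = statistics.mean(sample)
s = statistics.stdev(sample)
sem = standard_error(sample)
print(f"mean = {mean:.2f}, s = {s:.3f}, SEM = {sem:.3f}")
print(f"reported as mean ± SEM: {mean:.2f} ± {sem:.2f}")
```

Note that s describes the spread of individual observations, while the SEM describes the uncertainty in the sample mean and shrinks as n grows.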
When σ is known, we use the Normal distribution.
THE ONE-SAMPLE t TEST
$H_0: \mu = \mu_0$
$t = \dfrac{\bar{x} - \mu_0}{s/\sqrt{n}}$, with df = n − 1
This resulting t test is robust to deviations from
Normality as long as the sample size is large enough.
The P-value is the probability, if H0 were true, of
randomly drawing a sample like the one obtained or
more extreme in the direction of Ha.
[Figure: P-value areas under the t curve for one-sided (one-tailed) and two-sided (two-tailed) alternatives, with $t = (\bar{x} - \mu_0)/(s/\sqrt{n})$]
USING TABLE C
Example: for Ha: μ > μ0 with n = 10 (df = n − 1 = 9) and t = 2.70.
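A printed t table only brackets the P-value between two tabulated tail probabilities. As a hedged alternative, the sketch below approximates the one-sided P-value for t = 2.70 with df = 9 by integrating the t density numerically with Simpson's rule. The helpers `t_pdf` and `t_upper_tail` are illustrative, standard-library-only stand-ins for a statistics package and are not part of the slides.

```python
import math


def t_pdf(x, df):
    """Density of the t distribution with df degrees of freedom."""
    log_coef = math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
    coef = math.exp(log_coef) / math.sqrt(df * math.pi)
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)


def t_upper_tail(t, df, steps=2000):
    """P(T > t), using Simpson's rule on [0, |t|] and the symmetry of the t curve."""
    a = abs(t)
    h = a / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(a, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area_0_to_a = total * h / 3
    upper = 0.5 - area_0_to_a
    return upper if t >= 0 else 1 - upper


n = 10
t_stat = 2.70
p_one_sided = t_upper_tail(t_stat, df=n - 1)
print(f"one-sided P-value for t = {t_stat}, df = {n - 1}: {p_one_sided:.4f}")
```

The numerical value falls between 0.01 and 0.02, which is the kind of bracket a table lookup would give for this df.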
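The confidence interval is $\bar{x} \pm m$, with margin of error $m = t^* \, s/\sqrt{n}$, where C is the area under the t curve between $-t^*$ and $t^*$.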
Data on the blood cholesterol levels (mg/dl) of 24 lab rats give a sample mean
of 85 and a standard deviation of 12. We want a 95% confidence interval for the
mean blood cholesterol of all lab rats.
df = n − 1 = 23
$m = t^* \dfrac{s}{\sqrt{n}} = (2.069)\left(\dfrac{12}{\sqrt{24}}\right) \approx 5.07$
$\bar{x} \pm m = 85 \pm 5.07$, that is, 79.9 to 90.1 mg/dl
We are 95% confident that the true mean blood cholesterol
of all lab rats is between 79.9 and 90.1 mg/dl.
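The arithmetic for this interval can be sketched as follows. The function name `t_confidence_interval` is illustrative; the critical value t* = 2.069 for df = 23 at 95% confidence is taken from the worked example rather than computed.

```python
import math


def t_confidence_interval(mean, sd, n, t_star):
    """Return (low, high, margin) for the interval mean ± t* · s / sqrt(n)."""
    margin = t_star * sd / math.sqrt(n)
    return mean - margin, mean + margin, margin


# Summary statistics from the lab rat example; t* = 2.069 is the table value for df = 23.
low, high, margin = t_confidence_interval(mean=85, sd=12, n=24, t_star=2.069)
print(f"margin of error: {margin:.2f}")
print(f"95% CI: {low:.1f} to {high:.1f} mg/dl")
```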
Data on the blood cholesterol levels (mg/dl) of 24 lab rats give a sample mean
of 85 and a standard deviation of 12. At a significance level of 0.05, is there
sufficient evidence to support the claim that average blood cholesterol in rats is
different from that in mice, μ = 90?
Data on the blood cholesterol levels (mg/dl) of 24 lab rats give a sample mean
of 85 and a standard deviation of 12. At a significance level of 0.10, is there
sufficient evidence to support the claim that average blood cholesterol in rats is
different from that in mice, μ = 90?
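$H_0: \mu = 90$
$H_a: \mu \neq 90$
$t = \dfrac{\bar{x} - \mu_0}{s/\sqrt{n}} = \dfrac{85 - 90}{12/\sqrt{24}} \approx -2.04$
df = 23
$0.05 < p < 0.1$ (two-sided)
H0 is therefore not rejected at the 0.05 level, but it is rejected at the 0.10 level.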
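To see where the two-sided P-value for this test lands, the sketch below recomputes the t statistic from the summary statistics and approximates the P-value by numerical integration of the t density. The helpers `t_pdf`, `t_upper_tail`, and `one_sample_t` are illustrative, standard-library-only constructions, not the method used in the slides, which rely on a table lookup.

```python
import math


def t_pdf(x, df):
    """Density of the t distribution with df degrees of freedom."""
    log_coef = math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
    coef = math.exp(log_coef) / math.sqrt(df * math.pi)
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)


def t_upper_tail(t, df, steps=2000):
    """P(T > |t|), using Simpson's rule on [0, |t|] (steps must be even)."""
    a = abs(t)
    h = a / steps
    total = t_pdf(0.0, df) + t_pdf(a, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 - total * h / 3


def one_sample_t(mean, sd, n, mu0):
    """Return (t statistic, df, two-sided P-value) from summary statistics."""
    t = (mean - mu0) / (sd / math.sqrt(n))
    df = n - 1
    return t, df, 2 * t_upper_tail(t, df)


t_stat, df, p_two_sided = one_sample_t(mean=85, sd=12, n=24, mu0=90)
print(f"t = {t_stat:.2f}, df = {df}, two-sided P = {p_two_sided:.3f}")
for alpha in (0.05, 0.10):
    decision = "reject H0" if p_two_sided < alpha else "fail to reject H0"
    print(f"alpha = {alpha:.2f}: {decision}")
```

The same data lead to different decisions at the two significance levels because the P-value sits between 0.05 and 0.10.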
MATCHED PAIRS t PROCEDURES
Sometimes we want to compare treatments or conditions at
the individual level. The data sets produced this way are not
independent.
Twin studies often try to sort out the influence of genetic factors by
comparing a variable between sets of twins.
Using people matched for age, sex, and education in social studies
allows us to cancel out the effect of these potential lurking variables.
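In a matched pairs design, we compute the difference within each pair and apply one-sample t procedures to these differences, so the inference is about a single population mean: the mean difference.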
Study Participants: 53 obese children ages 9 to 12 with a BMI above
the 95th percentile for age and gender
Intervention: family counseling sessions on the stoplight diet
(green/yellow/red approach to eating food) - after 8 weekly sessions and
3 follow-up sessions
Assessment: Weight change at 15 weeks of intervention
Was the intervention effective in helping obese children lose
weight?
df 52, p 0.005
As a guideline:
When n < 15, the data must be close to Normal and without
outliers.
When 15 ≤ n ≤ 40, mild skewness is acceptable, but not
outliers.
When n > 40, the t statistic will be valid even with strong
skewness.
Does oligofructose consumption stimulate calcium absorption?
Healthy adolescent males took a pill for nine days and had their calcium
absorption tested on the ninth day. The experiment was repeated three
weeks later. Subjects received either an oligofructose pill first or a
control sucrose pill first. The order was randomized and the experiment
was double-blind.
$\bar{x} = 5.5$; s = 2.517; df = n − 1 = 8
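A sketch of the matched pairs t statistic for these summary values. It assumes that the mean and standard deviation describe the within-subject differences (oligofructose minus control), that n = 9 (from df = 8), and that the null hypothesis is a mean difference of 0; none of these assumptions is stated explicitly in the surviving text, and the function name `paired_t_from_summary` is illustrative.

```python
import math


def paired_t_from_summary(mean_diff, sd_diff, n, mu0=0.0):
    """t statistic for matched pairs: one-sample t applied to the differences."""
    standard_error = sd_diff / math.sqrt(n)
    return (mean_diff - mu0) / standard_error


# Assumed: values are for the paired differences, with n = df + 1 = 9 subjects.
t_stat = paired_t_from_summary(mean_diff=5.5, sd_diff=2.517, n=9)
print(f"t = {t_stat:.2f} with df = {9 - 1}")
```

Under these assumptions the statistic is about 6.56, which would then be compared with a t distribution with 8 degrees of freedom.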