Académique Documents
Professionnel Documents
Culture Documents
1
Lecture Notes for Introductory Statistics
1
These lecture notes are intended to be used with the open source textbook “Introductory
Statistics” by Barbara Illowsky and Susan Dean (OpenStax College, 2013).
1
Chapter 9 Notes Hypothesis Testing D. Skipper, p 2
1.2. Conclusions. Once you have the null and alternative hypothesis nailed down,
there are only two possible decisions we can make, based on whether or not the
experimental outcome contradicts our assumption (null hypothesis).
(1) Reject the Null Hypothesis (and therefore, Support the Alternative Hy-
pothesis).
(2) Do Not Reject the Null Hypothesis (and therefore, Do Not Support the
Alternative Hypothesis).
Since we are basing our conclusion on incomplete information (“sample” data), we
must qualify our statements:
(1) The sample data leads us to reject the null hypothesis. (The sample data
supports the alternative hypothesis.)
(2) The sample data does not lead us to reject the null hypothesis. (The sample
data does not support the alternative hypothesis; it is possible that the null
hypothesis is true.)
Note that we can only reject or fail to reject the null hypothesis; we can never
“support” the null hypothesis. For this reason, the claim is often represented by
the alternative hypothesis, which CAN be supported by a hypothesis test.
Example 3. Stating Conclusions. We wish to determine if the mean time-to-
connect in a phone network is less than 3 seconds. Write the two possible conclu-
sions we could draw about this claim using a hypothesis test. (Use the null and
alternative hypotheses you found in Example 1.)
(1) Which hypothesis represents the claim?
(2) What is the conclusion about the claim if we decide to reject the null
hypothesis?
(3) What is the conclusion about the claim if we decide to fail to reject the null
hypothesis?
Example 4. Stating Conclusions. We wish to test the claim that the at most
14% of students leave campus for lunch. Write the two possible conclusions we
could draw about this claim using a hypothesis test. (Use the null and alternative
hypotheses you found in Example 2.)
(1) Which hypothesis represents the claim?
(2) What is the conclusion about the claim if we decide to reject the null
hypothesis?
(3) What is the conclusion about the claim if we decide to fail to reject the null
hypothesis?
Chapter 9 Notes Hypothesis Testing D. Skipper, p 3
When testing a claim about population mean µ, ONE of the following two
requirements must be met, so that the Central Limit Theorem applies and we can
assume the random variable X̄ is normally distributed:
• The sample size must be relatively large (many books recommend at least
30 samples), OR
• the sample appears to come from a normally distributed population.
It is very important to verify these requirements in real life. In the problems we
are usually told to assume the second condition holds if the sample size is small.
ZTest: The sample statistic is the sample mean of the data, x̄. If population
standard deviation
is known
(unlikely in real life), the distribution of the sample
means is X̄ ∼ N µ0 , √σn , where µ0 is the population mean assumed in the null
x̄−µ0
hypothesis. The test statistic is a z-score: z = √σ . The p-value is the tail area
n
under the X̄ normal curve beyond x̄ in the direction of the alternative hypothesis,
which is the same as the tail area under Z ∼ N (0, 1) beyond z.
TTest: Again the sample statistic is the sample mean of the data, x̄. If the pop-
ulation standard deviation is NOT known, the distribution of X̄ is normally
distributed as above, but we use a t-distribution, tdf = tn−1 , since we must use
s to approximate σ. The test statistic is a t-score: t = x̄−µ
√s
0
. The p-value is the
n
area under the T-distribution density curve, tdf , beyond t in the direction of the
alternative hypothesis.
1-PropZTest: The sample statistic is the sample proportion from the sample
data, p0 = nx . The distribution of the sample proportions is approximated by a
normal distribution with the same p mean and standard deviation as the associated
binomial distribution: P 0 ∼ N (p0 , p0nq0 ), where p0 is the population proportion
p0 −p0
assumed in the null hypothesis. The test statistic is a z-score: z = √ p0 q0 . The
n
p-value is the tail area under the density curve for P 0 beyond the sample statistic
p0 in the direction of the null hypothesis, which is the same as the tail area under
the standard normal density curve Z ∼ N (0, 1) beyond the test statistic z.
• If the p-value is very small, our sample outcome was very unlikely (a “rare
event”) based on our assumption, so we reject our assumption.
• Recall that our assumption is based on the null hypothesis, so rejecting the
assumption is the same as rejecting the null hypothesis.
• The smaller the p-value, the larger (in absolute value) the z/t-score, so an
“unusual” z/t-score gives us a hint about the p-value.
4.2. Decision. How small must the p-value be to reject the null hypothesis? This
can change and is set by the significance level, α, which will be provided in each
1 5
problem. Common significance levels are 0.01, or 100 , and 0.05, or 100 .
This silly little rhyme is from the Triola textbook “Elementary Statistics” and has
been helpful to many students: