Académique Documents
Professionnel Documents
Culture Documents
Hypothesis Testing
n Developing Null and Alternative Hypotheses
n Type I and Type II Errors
n Population Mean: s Known
n Hypothesis testing can be used to determine whether • It is not always obvious how the null and alternative
a statement about the value of a population parameter hypotheses should be formulated.
should or should not be rejected.
• Care must be taken to structure the hypotheses
n The null hypothesis, denoted by H 0 , is a tentative appropriately so that the test conclusion provides
assumption about a population parameter. the information the researcher wants.
n The alternative hypothesis, denoted by H a, is the • The context of the situation is very important in
opposite of what is stated in the null hypothesis. determining how the hypotheses should be stated.
n The hypothesis testing procedure uses data from a • In some cases it is easier to identify the alternative
sample to test the two competing statements hypothesis first. In other cases the null is easier.
indicated by H 0 and H a.
• Correct hypothesis formulation will take practice.
1
Developing Null and Alternative Hypotheses Developing Null and Alternative Hypotheses
Developing Null and Alternative Hypotheses Developing Null and Alternative Hypotheses
Developing Null and Alternative Hypotheses Developing Null and Alternative Hypotheses
2
Summary of Forms for Null and Alternative Null and Alternative Hypotheses
Hypotheses about a Population Mean
■ The equality part of the hypotheses always appears ■ Example: Metro EMS
in the null hypothesis. A major west coast city provides one of the most
n In general, a hypothesis test about the value of a comprehensive emergency medical services in the
population mean µ must take one of the following world. Operating in a multiple hospital system
three forms (where µ0 is the hypothesized value of with approximately 20 mobile medical units, the
the population mean). service goal is to respond to medical emergencies
with a mean time of 12 minutes or less.
H 0 : µ ³ µ0 H 0 : µ £ µ0 H 0 : µ = µ0 The director of medical services wants to
H a : µ < µ0 H a : µ > µ0 H a : µ ¹ µ0 formulate a hypothesis test that could use a sample
of emergency response times to determine whether
One-tailed One-tailed Two-tailed or not the service goal of 12 minutes or less is being
(lower-tail) (upper-tail) achieved.
The emergency service is meeting n Because hypothesis tests are based on sample data,
H 0: µ < 12 we must allow for the possibility of errors.
the response goal; no follow-up
action is necessary. ■ A Type I error is rejecting H0 when it is true.
■ The probability of making a Type I error when the
The emergency service is not
H a: µ > 12 null hypothesis is true as an equality is called the
meeting the response goal;
level of significance.
appropriate follow-up action is
necessary. ■ Applications of hypothesis testing that only control
the Type I error are often called significance tests.
where: µ = mean response time for the population
of medical emergency requests
Reject H0 Correct
Type I Error Decision
(Conclude µ > 12)
3
Critical Value Approach to
One-Tailed Hypothesis Testing
n The test statistic z has a standard normal probability
distribution.
Inference n We can use the standard normal probability
Population Parameter n The value of the test statistic that established the
boundary of the rejection region is called the
critical value for the test.
■ The rejection rule is:
• Lower tail: Reject H0 if z < -za
• Upper tail: Reject H 0 if z > za
Lower-Tailed Test About a Population Mean: Upper-Tailed Test About a Population Mean:
s Known s Known
■ Critical Value Approach ■ Critical Value Approach
Sampling Sampling
distribution distribution
of z =
x - µ0 x - µ0
s/ n of z =
Reject H 0 s/ n Reject H0
a = .10 a = .05
Do Not Reject H 0 Do Not Reject H0
z z
-za = -1.28 0 0 za = 1.645
4
Lower-Tailed Test About a Population Mean: Upper-Tailed Test About a Population Mean:
s Known s Known
p-Value Approach p-Value < a , p-Value < a ,
■ ■ p-Value Approach
so reject H0 . so reject H0 .
Sampling
a = .10 Sampling distribution a = .04
x - µ0
distribution of z =
s/ n
x - µ0
of z =
s/ n
p-value p-Value
= .072 = .011
z z
z = -za = 0 0 za = z=
-1.46 -1.28 1.75 2.29
Step 1. Develop the null and alternative hypotheses. Critical Value Approach
Step 2. Specify the level of significance a. Step 4. Use the level of significance to determine the
Step 3. Collect the sample data and compute the critical value and the rejection rule.
value of the test statistic. Step 5. Use the value of the test statistic and the rejection
rule to determine whether to reject H0.
p-Value Approach
Step 4. Use the value of the test statistic to compute the
p-value.
Step 5. Reject H 0 if p-value < a.
One-Tailed Tests About a Population Mean: One-Tailed Tests About a Population Mean:
s Known s Known
■ Example: Metro EMS n p -Value and Critical Value Approaches
The response times for a random sample of 40
medical emergencies were tabulated. The sample 1. Develop the hypotheses. H0: µ < 12
mean is 13.25 minutes. The population standard Ha: µ > 12
deviation is believed to be 3.2 minutes.
The EMS director wants to perform a hypothesis 2. Specify the level of significance. a = .05
test, with a .05 level of significance, to determine
whether the service goal of 12 minutes or less is 3. Compute the value of the test statistic.
being achieved. x - µ 13.25 - 12
z= = = 2.47
s / n 3.2 / 40
5
One-Tailed Tests About a Population Mean: One-Tailed Tests About a Population Mean:
s Known s Known
n p –Value Approach ■ p –Value Approach
n Critical Value Approach The manager of TSM Café is thinking about establishing a
d 11/22/10 6:31 PM Page 383
new billing system for the store’s credit customers. After
4. Determine the critical value and rejection rule. a thorough financial analysis, he determines that the new
system will be cost-effective only if the mean monthly
For a = .05, z.05 = 1.645
account is more than Rs. 170. A random sample of 400
Reject H 0 if z > 1.645 monthly accounts is drawn, for which the sample mean is
Rs. 178. The manager knows that the accounts are
5. Determine whether to reject H 0. approximately normally distributed with a standard
deviation of Rs 65. Can the manager conclude from this
Because 2.47 > 1.645, we reject H 0.
There is sufficient statistical evidence
thatI Nthe
T Rnew
O D Usystem
C T I O will
N TO be H YPOTHESIS TESTING
cost-effective?
383
to infer that Metro EMS is not meeting
Data
the response goal of 12 minutes.
asked how many minutes of sports he watched on shown here. Assuming that the standard deviation is
television daily. The responses are listed here. It is .05 inch, can we conclude at the 5% significance
known that ! ! 10. Test to determine at the 5% sig- level that the mean diameter is not .50 inch?
nificance level whether there is enough statistical
.48 .50 .49 .52 .53 .48 .49 .47 .46 .51
evidence to infer that the mean amount of television
watched daily by all young adult men is greater than 11.34 Xr11-34 Spam e-mail has become a serious and costly
50 minutes. nuisance. An office manager believes that the aver-
Illustration 2 3
age amount of time spent by office workers reading
50 48 65 74 66 37 45 68 64
andSpam
deleting spam
e-mail has becomeexceeds 25 costly
a serious and minutes per day.
nuisance.
65Federal
58 Express (FedEx) sends invoices to customers requesting
55 52 63 59 57 74 65
payment within 30 days. Each bill lists an address, and customers
are expected to use their own envelopes to return their payments.
ToAn office manager believes that the average amount of
test this belief, he takes a random
time spent by office workers reading and deleting spam sample of
11.30 Xr11-30 Thetheclub
Currently, professional
mean and at ofathedifficult
standard deviation public
amount of time 18 exceeds
workers and measures
25 minutes per day. Tothetest amount
this belief, of time each
he takes
taken to pay bills are 24 days and 6 days, respectively. The chief
course boasts
financial officerthat
(CFO)his course
believes is so tough
that including a stampedthatself- the spends reading
a random sampleand of deleting
18 workersspam. The results
and measures the are
amount of time each spends reading and deleting spam.
average golfer loses a dozen or more golf balls
addressed (SSA) envelope would decrease the amount of time. She
dur- listed here. If the population of times is normal
The results are listed here. If the population of times is with
calculates that the improved cash flow from a 2-day decrease in the
ing apayment
round of would
period golf.payA for
dubious
the costs ofgolfer sets out
the envelopes and to a standard
normal withdeviation
a standardof 12 minutes,
deviation can the
of 12 minutes, can manager
the
manager infer at the 1% significance level that he is
stamps. Any further decrease in the payment period would
showgenerate
that the proToistest
a profit. fibbing. He
her belief, sheasks a random
randomly selects 220sam- infer at the 1% significance level that he is correct?
correct?
ple ofcustomers
15 golfers who just completed their rounds
and includes a stamped self-addressed envelope with
their invoices. The numbers of days until payment is received were
to 35 48 29 44 17 21 32 28 34
report the number
recorded. Can the CFO of golfthat
conclude balls each
the plan lost.
will be Assuming
profitable?
23 13 9 11 30 42 37 43 48
that Data
the number of golf balls lost is normally distrib-
uted with a standard deviation of 3, can we infer at The following exercises require the use of a computer and soft-
the 10% significance level that the average number ware. The answers may be calculated manually. See Appendix A
of golf balls lost is less than 12? for the sample statistics.
1 14 8 15 17 10 12 6 11.35 Xr11-35 A manufacturer of lightbulbs advertises that,
14 21 15 9 11 4 8 on average, its long-life bulb will last more than
5,000 hours. To test the claim, a statistician took a
11.31 Xr11-31 A random sample of 12 second-year university random sample of 100 bulbs and measured the
6
students enrolled in a business statistics course was amount of time until each bulb burned out. If we
drawn. At the course’s completion, each student was assume that the lifetime of this type of bulb has a
asked how many hours he or she spent doing home- standard deviation of 400 hours, can we conclude at
work in statistics. The data are listed here. It is known the 5% significance level that the claim is true?
that the population standard deviation is ! ! 8.0. The
p-Value Approach to Critical Value Approach to
Two-Tailed Hypothesis Testing Two-Tailed Hypothesis Testing
n Compute the p-value using the following three steps: n The critical values will occur in both the lower and
1. Compute the value of the test statistic z. upper tails of the standard normal curve.
2. If z is in the upper tail (z > 0), compute the n Use the standard normal probability distribution
probability that z is greater than or equal to the table to find za/2 (the z-value with an area of a/2 in
the upper tail of the distribution).
value of the test statistic. If z is in the lower tail
(z < 0), compute the probability that z is less than or ■ The rejection rule is:
equal to the value of the test statistic. Reject H0 if z < -za/2 or z > za/2.
3. Double the tail area obtained in step 2 to obtain
the p –value.
n The rejection rule:
Reject H 0 if the p-value < a .
Two-Tailed Tests About a Population Mean: Two-Tailed Tests About a Population Mean:
s Known s Known
■ Example: Glow Toothpaste ■ Example: Glow Toothpaste
The production line for Glow toothpaste is Assume that a sample of 30 toothpaste tubes
designed to fill tubes with a mean weight of 6 oz. provides a sample mean of 6.1 oz. The population
Periodically, a sample of 30 tubes will be selected in standard deviation is believed to be 0.2 oz.
order to check the filling process.
Perform a hypothesis test, at the .03 level of
Quality assurance procedures call for the
significance, to help determine whether the filling
continuation of the filling process if the sample
process should continue operating or be stopped and
results are consistent with the assumption that the
mean filling weight for the population of toothpaste corrected.
tubes is 6 oz.; otherwise the process will be adjusted.
Two-Tailed Tests About a Population Mean: Two-Tailed Tests About a Population Mean:
s Known s Known
n p –Value and Critical Value Approaches n p –Value Approach
3. Compute the value of the test statistic. 5. Determine whether to reject H0.
7
Two-Tailed Tests About a Population Mean: Two-Tailed Tests About a Population Mean:
s Known s Known
n p-Value Approach n Critical Value Approach
Data
8
Tests About a Population Mean: Tests About a Population Mean:
s Unknown s Unknown
■ Test Statistic ■ Rejection Rule: p -Value Approach
Reject H0 if p –value < a
x - µ0
t=
s/ n ■ Rejection Rule: Critical Value Approach
H0: µ > µ0 Reject H0 if t < -ta
This test statistic has a t distribution
with n - 1 degrees of freedom. H0: µ < µ0 Reject H0 if t > ta
n The format of the t distribution table provided in most ■ One-Tailed Test About a Population Mean: s Unknown
statistics textbooks does not have sufficient detail A State Highway Patrol periodically samples
to determine the exact p-value for a hypothesis test. vehicle speeds at various locations on a particular
n However, we can still use the t distribution table to roadway. The sample of vehicle speeds is used to
identify a range for the p-value. test the hypothesis H0: µ < 65.
n An advantage of computer software packages is that The locations where H0 is rejected are deemed the
the computer output will provide the p-value for the best locations for radar traps. At Location F, a
t distribution. sample of 64 vehicles shows a mean speed of 66.2
mph with a standard deviation of 4.2 mph. Use a
= .05 to test the hypothesis.
One-Tailed Test About a Population Mean: One-Tailed Test About a Population Mean:
s Unknown s Unknown
n p –Value and Critical Value Approaches n p –Value Approach
9
One-Tailed Test About a Population Mean: One-Tailed Test About a Population Mean:
s Unknown s Unknown
n Critical Value Approach
10
Tests About a Population Proportion Two-Tailed Test About a
Population Proportion
■ Rejection Rule: p –Value Approach ■ Example: National Safety Council (NSC)
Reject H 0 if p –value < a For a Christmas and New Year’s week, the
National Safety Council estimated that 500 people
■ Rejection Rule: Critical Value Approach
would be killed and 25,000 injured on the nation’s
H 0: p < p0 Reject H 0 if z > za roads. The NSC claimed that 50% of the accidents
H 0: p > p0 Reject H 0 if z < -za would be caused by drunk driving.
A sample of 120 accidents showed that 67 were
H 0: p = p0 Reject H 0 if z < -za/2 or z > za/2
caused by drunk driving. Use these data to test the
NSC’s claim with a = .05.
3. Compute the value of the test statistic. 5. Determine whether to reject H0.
Because p–value = .2006 > a = .05, we cannot reject H0.
p0 (1 - p0 ) .5(1 - .5)
a common
sp = = = .045644
n 120
error is using
p in this p - p0 (67 /120) - .5
formula z= = = 1.28
sp .045644
11