Académique Documents
Professionnel Documents
Culture Documents
Refer to Chart. It shows the population distribution based on the data in Table 1 and the distribution
of the sample mean based on the data in Table 2. These observations can be made:
a. The mean of the distribution of the sample mean is equal to the mean of the population: 𝜇 = 𝜇𝑋�
b. The spread in the distribution of the sample mean is less than the spread in the population values.
The sample means range from $7.0 to $8.5 while the population values vary from $7 up to $9. If we
continue to increase the sample size, the spread of the distribution of the sample mean becomes
smaller.
c. The shape of the sampling distribution of the sample mean and the shape of the frequency
distribution of the population values are different. The distribution of the sample mean tends to be
more bell-shaped and to approximate the normal probability distribution.
Chapter 9: Estimation and Confidence Intervals
Reasons for Sampling
1. To contact the entire population is too time consuming.
2. The cost of studying all the items in the population is often too expensive.
3. The sample results are usually adequate.
4. Certain tests are destructive.
5. Checking all the items is physically impossible.
Point and Interval Estimates
→ A point estimate is a single value (point) derived from a sample and used to estimate a
population value.
→ A confidence interval estimate is a range of values constructed from sample data so that the
population parameter is likely to occur within that range at a specified probability. The
specified probability is called the level of confidence.
Factors Affecting Confidence Interval Estimates
The factors that determine the width of a confidence interval are:
1. The sample size, n.
2. The variability in the population, usually σ estimated by s.
3. The desired level of confidence.
Interval Estimates – Interpretation
For a 95% confidence interval about 95% of the similarly constructed intervals will contain the
parameter being estimated. Also 95% of the sample means for a specified sample size will lie within
1.96 standard deviations of the hypothesized population.
How to obtain z value for a Given Confidence Level
The 95 percent confidence refers to the middle 95 percent of the observations. Therefore, the
remaining 5 percent are equally divided between the two tails.
Decision Rule
Reject the null hypothesis and accept the alternate hypothesis if
Computed − z < Critical − z
Computed z > Critical z
Using the p-Value in Hypothesis Testing
P-Value
The probability, assuming that the null hypothesis is true, of finding a value of the test statistic at least
as extreme as the computed value for the test
Decision Rule
If the p-Value is larger than or equal to the significance level, α, 𝐻0 is not rejected.
If the p-Value is smaller than the significance level, α, 𝐻0 is rejected.
Calculated from the probability distribution function or by computer
Interpreting p-values
. 05 > 𝑝 > .10 = 𝑆𝑜𝑚𝑒 𝑒𝑣𝑖𝑑𝑒𝑛𝑐𝑒 𝐻0 𝑖𝑠 𝑛𝑜𝑡 𝑡𝑟𝑢𝑒
. 01 > 𝑝 > .05 = 𝑆𝑡𝑟𝑜𝑛𝑔 𝑒𝑣𝑖𝑑𝑒𝑛𝑐𝑒 𝐻0 𝑖𝑠 𝑛𝑜𝑡 𝑡𝑟𝑢𝑒
. 001 > 𝑝 > .01 = 𝑉𝑒𝑟𝑦 𝑆𝑡𝑟𝑜𝑛𝑔 𝑒𝑣𝑖𝑑𝑒𝑛𝑐𝑒 𝐻0 𝑖𝑠 𝑛𝑜𝑡 𝑡𝑟𝑢𝑒
Step Five: Make a decision.
Accept or Reject 𝐻0
One-Tailed Tests of Significance
The alternate hypothesis, 𝐻1 states a direction
Examples:
1. 𝐻1 : The mean yearly commissions earned by fulltime realtors are more than 35,000. (µ > 35,000)
2. 𝐻1 : The mean speed of trucks traveling on I-95 in Georgia is less than 60 miles per hour. (µ < 60)
3. 𝐻1 : Less than 20 percent of the customers pay cash for their gasoline purchase. (𝑝 < .20)
Sampling Distribution of the Statistic z, a Right-Tailed Test, .05 Level of Significance
Test for the population mean from a large sample with population standard deviation known
𝑋� − 𝜇
𝑧= 𝜎
√𝑛
Example 1: The processors of Fries’ Catsup indicate on the label that the bottle contains 16 ounces of
catsup. The standard deviation of the process is 0.5 ounces. A sample of 36 bottles from last hour’s
production revealed a mean weight of 16.12 ounces per bottle. At the .05 significance level is the
process out of control? That is, can we conclude that the mean amount per bottle is different from 16
ounces?
Step 1: State the null and the alternative hypotheses
𝐻0 : 𝜇 = 16
𝐻1 : 𝜇 ≠ 16
Step 2: Select the significance level
The significance level is .05.
Step 3: Identify the test statistic.
Because we know the population standard deviation, the test statistic is z.
Step 4: State the decision rule.
Reject 𝐻0 if 𝑧 > 1.96 or − 𝑧 < −1.96 or if 𝑝 < .05
Step 5: Make a decision and interpret the results.
𝑋� − 𝜇 16.12 − 16 0.12
𝑧= 𝜎 = = = 1.44
0.5 0.5
√𝑛 √36 6
The p(z > 1.44) is .1499 for a two − tailed test.
o Computed z of 1.44 < Critical z of 1.96
o p of .1499 > α of .05
Do not reject the null hypothesis.
We cannot conclude the mean is different from 16 ounces.
Testing for the Population Mean: Large Sample, Population Standard Deviation Unknown
As long as the sample size 𝑛 ≥ 30, z can be approximated using
𝑋� − 𝜇
𝑧= 𝑠
√𝑛
Here, s is unknown, so we estimate it with the sample standard deviation s.
Example 2: Roger’s Discount Store chain issues its own credit card. Lisa, the credit manager, wants to
find out if the mean monthly unpaid balance is more than $400. The level of significance is set at .05.
A random check of 172 unpaid balances revealed the sample mean to be $407 and the sample
standard deviation to be $38.
Should Lisa conclude that the population mean is greater than $400, or is it reasonable to assume that
the difference of $7 ($407-$400) is due to chance?
Step 1: State the null and the alternative hypotheses
𝐻0 : 𝜇 ≤ 400
𝐻1 : 𝜇 > 400
Step 2: Select the significance level
The significance level is .05.
Step 3: Identify the test statistic.
Because the sample is large we can use the z distribution as the test statistic.
Step 4: State the decision rule.
Reject 𝐻0 if 𝑧 > 1.65 or if 𝑝 < .05
Step 5: Make a decision and interpret the results.
𝑋� − 𝜇 407 − 400
𝑧= 𝑠 = = 2.42
38
√𝑛 √172
The p(z > 2.42) is .0078 for a one − tailed test.
o Computed z of 2.42 > Critical z of 1.65
o p of .0078 < α of .05
Reject the null hypothesis.
Lisa can conclude that the mean unpaid balance is greater than $400.
Testing for a Population Mean: Small Sample, Population Standard Deviation Unknown
The test statistic is the t distribution.
𝑋� − 𝜇
𝑡= 𝑠
√𝑛
The critical value of t is determined by its degrees of freedom equal to n – 1.
Example 3: The current rate for producing 5 amp fuses at Neary Electric Co. is 250 per hour. A new
machine has been purchased and installed that, according to the supplier, will increase the production
rate. The production hours are normally distributed. A sample of 10 randomly selected hours from
last month revealed that the mean hourly production on the new machine was 256 units, with a
sample standard deviation of 6 per hour.
At the .05 significance level can Neary conclude that the new machine is faster?
Step 1: State the null and the alternative hypotheses
𝐻0 : 𝜇 ≤ 250
𝐻1 : 𝜇 > 250
Step 2: Select the significance level. The significance level is .05.
Step 3: Find a test statistic. Use the t distribution since s is not known and n < 30.
Step 4: State the decision rule. There are 10 – 1 = 9 degrees of freedom.
Step 5: Make a decision and interpret the results.
𝑋� − 𝜇 256 − 250
𝑡= 𝑠 = = 3.162
6
√𝑛 √10
Computed t of 3.162 > Critical t of 1.833
Reject the null hypothesis. The mean number of amps produced is more than 250 per hour.
Proportion: The fraction or percentage that indicates the part of the population or sample having a
particular trait of interest.
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑠𝑢𝑐𝑐𝑒𝑠𝑠 𝑖𝑛 𝑡ℎ𝑒 𝑠𝑎𝑚𝑝𝑙𝑒
𝑝=
𝑁𝑢𝑚𝑏𝑒𝑟 𝑆𝑎𝑚𝑝𝑙𝑒𝑑
Test Statistic for Testing a Single Population Proportion
𝑝−𝜋
𝑧=
�𝜋(1 − 𝜋)
𝑛
The sample proportion is p and π is the population proportion.
Example: In the past, 15% of the mail order solicitations for a certain charity resulted in a financial
contribution. A new solicitation letter that has been drafted is sent to a sample of 200 people and 45
responded with a contribution. At the .05 significance level can it be concluded that the new letter
is more effective?
Solution:
Step 1: State the null and the alternate hypothesis.
𝐻𝑜 : 𝑝 ≤ .15
𝐻1 : 𝑝 > .15
Step 2: Select the level of significance. It is .05.
Step 3: Find a test statistic. The z distribution is the test statistic.
Step 4: State the decision rule.
The null hypothesis is rejected if z is greater than 1.65.
Step 5: Make a decision and interpret the results.
45
𝑝−𝜋 − 0.15
𝑧= = 200 = 2.97
� 𝜋 ( 1 − 𝜋 ) � 0.15 ( 1 − 0.15)
𝑛 200
Because the computed z of 2.97 > critical z of 1.65, the null hypothesis is rejected. More than 15
percent responding with a pledge. The new letter is more effective.
Chapter 13: Linear Regression and Correlation
Correlation Analysis and Scatter Diagram
Correlation Analysis is the study of the relationship between variables. It is also defined as group of
techniques to measure the association between two variables.
A Scatter Diagram is a chart that portrays the relationship between the two variables. It is the usual
first step in correlations analysis.
Dependent vs. Independent Variable
Dependent Variable: The variable that is being predicted or estimated. It is scaled on the Y-axis.
Independent Variable: The variable that provides the basis for estimation. It is the predictor
variable. It is scaled on the X-axis.
Regression Example
The sales manager of Copier Sales of America, which has a large sales force throughout the United
States and Canada, wants to determine whether there is a relationship between the number of sales
calls made in a month and the number of copiers sold that month. The manager selects a random
sample of 10 representatives and determines the number of sales calls each representative made last
month and the number of copiers sold.
Sales Representative Number of Sales Calls Number of Copiers Sold
Tom Keller 20 30
Jeff Hall 40 60
Brian Virost 20 40
Greg Fish 30 60
Susan Welch 10 30
Carlos Ramirez 10 40
Rich Niles 20 40
Mike Kiel 20 50
Mark Reynolds 20 30
Soni Jones 30 70
Scatter Diagram:
Math
1. A social scientist sampled 140 people and classified them according to income level and
whether or not they played a state lottery in the last month. The sample information is
reported below. Is it reasonable to conclude that playing lottery is related to income level?
Use .05 significance level.
Low Middle High Total
Played 46 28 21 95
Not Played 14 12 19 45
Total 60 40 40 140
a) What is this table called?
b) State the null hypothesis and the alternate hypothesis.
c) What is the decision rule?
d) Make a decision on the null hypothesis and interpret the result.
2. The American Accounting Association classifies accounts receivables as “current”, “late” and
“not collectible”. The industry figures show that 60% of accounts receivable are current, 30%
are late and 10% are not collectible. Massa and Barr, a law firm has 500 accounts receivable:
320 are current, 120 are late and 60 are not collectible. Are these numbers in agreement
with the industry distribution? Use the .05 significance level.