Vous êtes sur la page 1sur 21

Module 3 helpful links

http://www3.nd.edu/~rwilliam/stats1/x21.pdf

http://onlinestatbook.com/2/normal_distribution/intro.html

http://sph.bu.edu/otlt/MPH-
Modules/BS/BS704_Confidence_Intervals/BS704_Confidence_Intervals_print.html

http://academic.udayton.edu/gregelvers/psy216/tables/area.htm

http://www3.nd.edu/~rwilliam/stats1/x21.pdf

http://www.psychassessment.com.au/pdf/mod2.pdf

http://www.camdencc.edu/html/departments/math/pubs/stat_final_review_suppl
ementalsp10.pdf

http://www.intmath.com/counting-probability/14-normal-probability-
distribution.php

http://www.terry.uga.edu/~pholmes/STAT6310/STAT6310_ClassnotesSummer04
_3.pdf

http://onlinestatbook.com/2/calculators/normal.html

http://easycalculation.com/statistics/confidence-interval-variance.php



Name: ___________________________________ Date: ______________
1. A study is designed to determine whether grades in a statistics course could be
improved by offering review material. The 250 students in an introductory statistics
class are also enrolled in one of 20 lab sections. The 20 lab sections are randomly
divided into 2 groups of 10 lab sections each. The students in a set of 10 lab sections
are given extra review material during the last 15 minutes of each lab session. The
students in the remaining 10 lab sections receive the regular lesson material, without
the review material. The grades of the students who reviewed were higher, on
average, than the students who did not review. What type of study is this?
A) An observational study. B) An experiment, but not a double-blind experiment.
C) A double-blind experiment. D) A matched-pairs experiment.

2. A sample of employees of a large company has been obtained. The length of time (in
months) they have worked for the company was recorded for each employee. A
stemplot of these data is shown below. In the stemplot 6|2 represents: 62 months.

What would be a better way to represent this data set?
A) Display the data in a time plot. B) Display the data in a boxplot.
C) Split the stems. D) Use a histogram with class width equal to 10.

Use this to answer question 3. Volunteers for a study were divided into 3 groups. Group
1 listened to Western religious music, group 2 listened to Western rock music, and group
3 listened to Chinese religious music. The blood pressure of each volunteer was
measured before and after listening to the music, and the change in blood pressure (blood
pressure before listening minus blood pressure after listening) was recorded.
3. What could we do to graphically explore the relationship between type of music and
change in blood pressure?
A) See if blood pressure decreases as type of music increases by examining a
scatterplot.
B) Make a histogram of the change in blood pressure for all of the volunteers.
C) Make side-by-side boxplots of the change in blood pressure, with a separate
boxplot for each group.
D) Do all of the above.

Use the following to answer 4-5. Many residents own more than one car but consider one
of their cars to be the main family vehicle. The age of these family vehicles can be
modeled by a normal distribution with mean 2 years and standard deviation 6 months.
4. What percentage of family vehicles are between 1 and 3 years old?
A) Cannot be determined based on the information given.
B) 68% C) 95% D) 99.7%

5. What is the standardized value for a family vehicle that is 3 years and 3 months old?
A) 0.22 B) 2.5 C) 2.6 D) 2.92
6. A free screening program is set up in the downtown area. People can walk in and
have their cholesterol measured. 173 people use the service, and their average
cholesterol is 217.8. The value 217.8 is an example of _________
A) a sample proportion. B) a statistic.
C) a parameter. D) an unbiased parameter.

7. As cheese matures, a variety of chemical processes take place. The taste of cheese is
related to the concentration of several chemicals. In a study of cheese in a certain
region, samples of cheese were analyzed for lactic acid concentration and were
subjected to taste tests. The numerical taste scores were obtained by combining the
scores from several tasters. A scatterplot of the observed data is shown below.

What is a plausible value for the correlation between lactic acid concentration and
taste rating? A) 0.999 B) 0.7 C) 0.07 D) 0.7

Use this to answer # 8. We are studying the relationship between sugar consumption and
weight. 12 people are randomly assigned to one of two groups. The first group of 5
people is put on a low sugar diet and the second group of 7 participants received 10% of
their calories from sugar. After 8 weeks, weight gain was recorded from each participant.
8. Which of the following basic principles of statistical design was not used in this
experiment? A) control B) randomization
C) repetition D) blinding

9. Which of the following sets of four numbers has the largest standard deviation?
A) 7, 8, 9, 10 B) 5, 5, 5, 5 C) 0, 0, 10, 10 D) 0, 1, 2, 3

Use the following to answer question 10. The Michigan Department of Transportation
(M-DOT) is working on a major project: 80% of the highways in Michigan need to be
repaved. To speed completion of this project, many contractors will be working for M-
DOT. Contractors are currently bidding on the next part of the project. To help make a
decision about which contractor to hire, M-DOT collects many variables besides just the
estimated cost. One of those variables is the contractor's estimate of the number of work
days required to finish the job. Twenty contractors have bid on the next job. The boxplot
below represents their estimates of the number of work days required.

10. What is (approximately) the interquartile range, based on the boxplot?
A) 140 days B) 270 days
C) 360 days D) 760 days

Use the following to answer # 11-12.
Twelve people, who suffer from chronic fatigue syndrome, volunteer to take part in an
experiment to see if shark fin extract will increase one's energy level. Eight of the
volunteers are men and four are women. Half of the volunteers are to be given shark fin
extract twice a day and the other half a placebo twice a day. We wish to make sure that
four men and two women are assigned to each of the treatments, so we decide to use a
block design with the men forming one block and the women the other.
11. For which reason is a block design appropriate in this experiment?
A) Gender equity is an important legal consideration in this study.
B) We believe men and women may respond differently to treatments.
C) We want the conclusions to apply equally to men and women.
D) All of the above.

12. Suppose one of the researchers is responsible for measuring if a subject displays an
increase in energy level. What design principle should be added to this experiment?
A) Two placebos.
B) Use a stratified sampling design to assign subjects to treatments.
C) Use fewer subjects but observe them more frequently.
D) Conduct the study as a double-blind experiment.

Use the following to answer # 13. An opinion poll in Ohio is used to determine whether
registered voters in the state approve of a measure to ban smoking in public areas. They
select a simple random sample of 50 registered voters from each county in the state and
ask whether they approve or disapprove of the measure.
13. What type of sampling method was used to collect the data?
A) A systematic county sample. B) A stratified sample.
C) A multistage sample. D) A simple random sample.

Use this to answer # 14-15. 100 volunteers who suffer from depression are available for
a study. 50 are selected at random and are given a drug that is thought to be particularly
effective in treating depression. The other 50 are given an existing drug for treating
depression. A psychiatrist evaluates the symptoms of all volunteers after 4 weeks to
determine if there has been substantial improvement in the severity of the depression.
14. What is the explanatory variable or factor in this study?
A) Which drug the volunteers receive.
B) The use of randomization and the fact that this was a comparative study.
C) The extent to which the depression was reduced.
D) The use of a psychiatrist to evaluate the severity of depression.

15. In which situation would this study be double-blind?
A) Neither drug had any identifying marks on it.
B) Volunteers were not allowed to see the psychiatrist nor was the psychiatrist
allowed to see the volunteers during the session when the psychiatrist evaluated
the severity of the depression.
C) Neither the volunteers nor the psychiatrist knew which treatment any person had
received.
D) All of the above.

16. A distributor is doing a customer satisfaction survey for a manufacturer of DVD
players. A sample of 68 clients is asked to rate a DVD player on ease of use, on a 1
to 10 scale, where 1 corresponds to the worst rating and 10 to the best possible
rating. A bar graph of the ease of use ratings classified by gender is given below.

What percentage of the sampled female clients rated the DVD player as not so easy
to use (a rating of 4 or lower)?
A) 28% B) 29%
C) 38% D) 62%

Use the following to answer # 17.. The number of undergraduates at Johns Hopkins
University is about 2000, while the number at Ohio State is approximately 40,000.
17. Say the actual proportion of undergraduates at Johns Hopkins University who feel
drinking is a problem among college students is 67%. A simple random sample of
50 undergraduates at Johns Hopkins University found that 60% of those sampled felt
that drinking was a problem among college students. Which value(s) can be labeled
as parameter(s)? A) 60% B) 67% C) 60% and 67% D) 50

Use this to answer # 18. The Consumers Union measured the gas mileage of 38 cars from
the 197879 model-year on a test track. The pie chart below provides information about
the number of cars from each country in this study by the Consumers Union.

18. What conclusion can we draw based on this pie chart?
A) Italian cars get significantly lower gas mileage than cars of other countries.
B) More than half of the cars in the study were from the United States.
C) Swedish cars get gas mileages that are between those of Japanese and U.S. cars.
D) Audi, Porsche, and BMW represent approximately a quarter of the cars tested.

Use the following to answer question 19.
19. Data on the mileage of 20 randomly selected cars is listed below. The values are
ordered for convenience.
12 13 15 16 16 17 18 18 19 19
20 20 22 23 24 26 26 27 27 29
What is the median mileage for these 20 cars?
A) 17.5 miles/gal B) 19 miles/gal C) 19.5 miles/gal D) 20 miles/gal

20. The average salary of all female workers is $35,000. The average salary of all male
workers is $41,000. What must be true about the average salary of all workers?
A) It must be $38,000. B) It must be larger than the median salary.
C) It could be anything from $35,000 to $41,000. D) It must be > $38,000.

21. A paper interviews an expert about a proposed system for rating the teaching ability
of faculty members. He says, The evidence indicates that the correlation between a
faculty member's research productivity and teaching rating is close to zero. What
would be a correct interpretation of this statement?
A) Good researchers tend to be poor teachers and vice versa.
B) Good teachers tend to be poor researchers and vice versa.
C) There isnt much relationship between teaching and research accomplishment.
D) Good research and good teaching go hand in hand.

22. In a study of 1991 model cars, a researcher computed the regression line of price
(in dollars) on horsepower. He obtained the following equation for this line.
price = 6677 + 175 * horsepower. Based on the least-squares regression line, what
would we predict the cost of a 1991 model car with horsepower equal to 200 to be?
A) $41,677 B) $35,000 C) $28,323 D) $13,354

23. A sample of 16 people is taken and their weights are measured. The standard
deviation of these 16 measurements is computed to be 5.4. What is the variance of
these measurements?
A) 2.24 lbs. B) 2.24 lbs
2
. C) 29.16 lbs. D) 29.16 lbs
2
.

24. To see if smoking causes cancer, researchers surveyed a large sample of adults. For
each they recorded whether the person had smoked regularly at any period in their
life and whether the person had cancer. They compared the proportion of cancer
cases in those who had smoked at some time in their lives with the proportion of
cases in those who had never smoked at any point in their lives. The researchers
found that there was a higher proportion of cancer among those who had smoked
than among those who had never smoked. What type of study is this?
A) An observational study. B) An experiment, but not a double-blind experiment.
C) A double-blind experiment. D) A block design.

The correlation between length and width is calculated to be r = 0.827.

25. In the plot, notice that length is treated as the response variable and width as the
explanatory variable. Suppose we had taken width to be the response variable and
length to be the explanatory variable. What would be the correlation between width
and length in this case? A) 0.827 B) 0.827 C) 0.000
D) Something between 0.827 and 0.827, but we cannot determine the exact value.


Answer Key - Exam 1

1. B
2. C
3. C
4. C
5. B
6. B
7. B
8. D
9. C
10. B
11. B
12. D
13. B
14. A
15. C
16. C
17. B
18. B
19. C
20. C
21. C
22. C
23. D
24. A
25. A



Practice Test 1 Bus 2023

Directions: For each question find the answer that is the best solution provided. There is
only one correct answer.

1. Facts and figures that are collected, analyzed and summarized for presentation and
interpretation are
a. data
b. variables
c. elements
d. Both variables and elements are correct.
ANSWER: a

2. All the data collected in a particular study are referred to as the
a. census
b. inference
c. variable
d. data set
ANSWER: d

3. Quantitative data
a. are always nonnumeric
b. may be either numeric or nonnumeric
c. are always numeric
d. are always labels
ANSWER: c

4. Qualitative data
a. are always nonnumeric
b. may be either numeric or nonnumeric
c. are always numeric
d. indicate either how much or how many
ANSWER: b

5. Arithmetic operations are inappropriate for
a. qualitative data
b. quantitative data
c. both qualitative and quantitative data
d. large data sets
ANSWER: a

6. In a questionnaire, respondents are asked to record their age in years. Age is an example
of a
a. qualitative variable
b. quantitative variable
c. qualitative or quantitative variable, depending on how the respondents answered
the question
d. ratio variable
ANSWER: b


7. In an application for a credit card, potential customers are asked for their social security
numbers. A social security number is an example of a
a. qualitative variable
b. quantitative variable
c. qualitative or quantitative variable, depending on how the respondents answered
the question
d. ratio variable
ANSWER: a

8. Data collected at the same, or approximately the same, point in time are
a. time series data
b. approximate time series data
c. cross-sectional data
d. approximate data
ANSWER: c

9. Data collected over several time periods are
a. time series data
b. time controlled data
c. cross-sectional data
d. time cross-sectional data
ANSWER: a

10. Statistical studies in which researchers do not control variables of interest are
a. experimental studies
b. uncontrolled experimental studies
c. not of any value
d. observational studies
ANSWER: d

11. Statistical studies in which researchers control variables of interest are
a. experimental studies
b. control observational studies
c. non experimental studies
d. observational studies
ANSWER: a

12. A frequency distribution is
a. a tabular summary of a set of data showing the fraction of items in each of
several nonoverlapping classes
b. a graphical form of representing data
c. a tabular summary of a set of data showing the number of items in each of
several nonoverlapping classes
d. a graphical device for presenting qualitative data
ANSWER: c






13. The sum of frequencies for all classes will always equal
a. 1
b. the number of elements in a data set
c. the number of classes
d. a value between 0 and 1
ANSWER: b
Exhibit 2-1
The numbers of hours worked (per week) by 400 statistics students are shown below.

Number of hours Frequency

0 - 9 20
10 - 19 80
20 - 29 200
30 - 39 100

14. Refer to Exhibit 2-1. The class width for this distribution
a. is 9
b. is 10
c. is 39, which is: the largest value minus the smallest value or 39 - 0 = 39
d. varies from class to class
ANSWER: b

15. Refer to Exhibit 2-1. The midpoint of the last class is
a. 50
b. 34
c. 35
d. 34.5
ANSWER: d

16. Refer to Exhibit 2-1. The number of students working 19 hours or less
a. is 80
b. is 100
c. is 180
d. is 300
ANSWER: b

17. Refer to Exhibit 2-1. The relative frequency of students working 9 hours or less
a. is 20
b. is 100
c. is 0.95
d. 0.05
ANSWER: d


18. Refer to Exhibit 2-1. The cumulative relative frequency for the class of 20 - 29
a. is 300
b. is 0.25
c. is 0.75
d. is 0.5
ANSWER: c

19. A graphical device for presenting qualitative data summaries based on subdivision of a
circle into sectors that correspond to the relative frequency for each class is a
a. histogram
b. stem-and-leaf display
c. pie chart
d. bar graph
ANSWER: c

20. Qualitative data can be graphically represented by using a(n)
a. histogram
b. frequency polygon
c. ogive
d. bar graph
ANSWER: d

21. The mean of a sample is
a. always equal to the mean of the population
b. always smaller than the mean of the population
c. computed by summing the data values and dividing the sum by (n - 1)
d. computed by summing all the data values and dividing the sum by the number of
items
ANSWER: d
22. After the data has been arranged from smallest value to largest value, the value in the
middle is called the
a. range
b. median
c. mean
d. None of the other answers are correct.
ANSWER: b

23. The 75th percentile is also the
a. first quartile
b. second quartile
c. third quartile
d. fourth quartile
ANSWER: c




24. Which of the following is NOT a measure of location?
a. mean
b. median
c. variance
d. mode
ANSWER: c

25. The measure of location that is the most likely to be influenced by extreme values in the
data set is the
a. range
b. median
c. mode
d. mean
ANSWER: d

26. The difference between the largest and the smallest data values is the
a. variance
b. interquartile range
c. range
d. coefficient of variation
ANSWER: c

27. The variance of the sample
a. can never be negative
b. can be negative
c. cannot be zero
d. cannot be less than one
ANSWER: a

28. The variance of a sample of 81 observations equals 64. The standard deviation of the
sample equals
a. 0
b. 4096
c. 8
d. 6,561
ANSWER: c

29. Which of the following symbols represents the size of the sample
a.
2

b.
c. N
d. n
ANSWER: d

30. Which of the following symbols represents the mean of the sample?
a.
2

b.
c.
d.
x

ANSWER: d

Exhibit 3-2
A researcher has collected the following sample data. The mean of the sample is 5.

3 5 12 3 2

31. Refer to Exhibit 3-2. The variance is
a. 80
b. 4.062
c. 13.2
d. 16.5
ANSWER: d

32. Refer to Exhibit 3-2. The standard deviation is
a. 8.944
b. 4.062
c. 13.2
d. 16.5
ANSWER: b

33. For any continuous random variable, the probability that the random variable takes on
exactly a specific value is
a. 1.00
b. 0.50
c. any value between 0 to 1
d. zero
ANSWER: d

34. The highest point of a normal curve occurs at
a. one standard deviation to the right of the mean
b. two standard deviations to the right of the mean
c. approximately three standard deviations to the right of the mean
d. the mean
ANSWER: d

35. A standard normal distribution is a normal distribution with
a. a mean of 1 and a standard deviation of 0
b. a mean of 0 and a standard deviation of 1
c. any mean and a standard deviation of 1
d. any mean and any standard deviation
ANSWER: b

36. Z is a standard normal random variable. The P(1.20 z 1.85) equals
a. 0.4678
b. 0.3849
c. 0.8527
d. 0.0829
ANSWER: d

37. Z is a standard normal random variable. The P(1.05 < z < 2.13) equals
a. 0.8365
b. 0.1303
c. 0.4834
d. None of the alternative answers is correct.
ANSWER: b

38. A numerical measure of linear association between two variables is the
a. variance
b. covariance
c. standard deviation
d. coefficient of variation
ANSWER: b

39. Positive values of covariance indicate
a. a positive variance of the x values
b. a positive variance of the y values
c. the standard deviation is positive
d. a positive relation between the x and the y variables
ANSWER: d

40. Positive values of covariance indicate
a. a positive variance of the x values
b. a positive variance of the y values
c. the standard deviation is positive
d. a positive relation between the x and the y variables
ANSWER: d

41. In regression analysis, the variable that is being predicted is the
a. dependent variable
b. independent variable
c. intervening variable
d. None of these answers is correct.
ANSWER: a

42. Regression analysis was applied between sales (in $1,000) and advertising (in $100), and the
following regression function was obtained.

Y
= 80 + 6.2 x

Based on the above estimated regression line, if advertising is $10,000, then the point
estimate for sales (in dollars) is
a. $62,080
b. $142,000
c. $700
d. $700,000
ANSWER: d

43.If the coefficient of correlation is 0.8, the percentage of variation in the dependent variable
explained by the estimated regression equation is
a. 0.80%
b. 80%
c. 0.64%
d. 64%
ANSWER: d

44. If the coefficient of determination is equal to 1, then the coefficient of correlation
a. must also be equal to 1
b. can be either -1 or +1
c. can be any value between -1 to +1
d. must be -1
ANSWER: b































Short Answer: Answer all of the following questions. Make sure to show all work.
Solutions with no work will receive no credit.

1.You are given the following data on the price/earnings (P/E) ratios for twelve
companies.
Construct a stem-and-leaf display. Specify the leaf unit for the display.

23 25 39 47 22 37
8 36 48 28 37 26

ANSWER: Leaf Unit = 10s
So place the values in ascending order and then put them under the appropriate stem.

Ordered: 8, 22, 23, 25, 26, 28, 36, 37, 37, 39, 47, 48
0| 8
1|
2| 2 3 5 6 8
3| 6 7 7 9
4| 7 8


Sample 4

Use the following to answer questions 1 and 2:

The histogram below represents the height (in inches) of the gold medalwinning
high jumps for the Olympic Games up to Sydney 2000:



1. What is approximately the mean height?
A) 75 inches C) 82 inches
B) 77.5 inches D) 90 inches
Answer: C


2. What is approximately the percentage of these winning jumps that were at least
71 high (85 inches)?
A) 9% C) 23%
B) 14% D) 35%
Answer: D



3. The variable Z has a standard Normal distribution. Find the value z such that 85%
of the observations fall below z.
A) z = 1.04
B) z = 0.80
C) z = 0.85
D) z = 1.04
Answer: D


Use the following scenario to answer questions 4 to 6:

A simple random sample of eight drivers was selected. All eight drivers are insured
with the same insurance company, and all have similar auto insurance policies. The
following table lists their driving experiences (in years) and monthly auto insurance
premiums:

Driving experience (years) 5 2 1
2
9 1
5
6 2
5
1
6
Monthly auto insurance premium ($) 6
4
7
7
5
0
7
1
4
4
5
6
4
2
6
0

4. Which of the two variables is the explanatory variable?
Answer: Driving experience (in years)


5. Complete the following sentence that explains what the value of the slope tells
us:
When driving experience (choose one) increases / decreases by __________
we expect the average monthly auto insurance premium to (choose one)
increase / decrease by __________
Answer: When driving experience increases by 1 year, we expect the average monthly
auto insurance premium to decrease by $1.30.


6. The correlation coefficient between driving experience and monthly auto
insurance premium is r = 0.775. If we would switch the role of explanatory
and response variables, what would happen to the correlation coefficient?
Answer: The correlation coefficient would stay the same.


Use the following to answer questions 7 and 8:

In a recent study, a random sample of children in grades 2 through 4 showed a
significant negative relationship between the amount of homework assigned and
student attitudes.

7. What type of study is this?
A) An experiment.
B) An observational study.
C) The establishing of a causal relationship through correlation.
D) A block design, with grades as blocks.
Answer: B


8. What type of variable is the amount of homework assigned?
A) An explanatory variable.
B) A response variable.
C) A confounding variable.
D) A lurking variable.
Answer: A


Use the following to answer questions 912:

If you draw an M&M candy at random from a bag of the candies, the candy you draw
will have one of six colors. The probability of drawing each color depends on the
proportion of each color among all candies made. Assume the table below gives the
probabilities for the color of a randomly chosen M&M:

Color Brown Red Yellow Green Orange Blue
Probability 0.3 0.3 ? 0.1 0.1 0.1

9. What is the probability of drawing a yellow candy?
A) 0.1
B) 0.2
C) 0.3
D) Impossible to determine from the information given.
Answer: A


10. What is the probability of not drawing a red candy?
A) 0.3
B) 0.6
C) 0.7
D) 0.9
Answer: C


11. What is the probability that you draw neither a brown nor a green candy?
A) 0.3
B) 0.6
C) 0.7
D) 0.9
Answer: B


12. If you select two M&Ms and the colors are independent, then what is the
probability that both are the same color?
A) 0.01
B) 0.09
C) 0.22
D) 0.25
Answer: C


Use the following to answer questions 1316:

Suppose that a college determines the following distribution for X = number of
courses taken by a full-time student this semester:

Value of X 3 4 5 6
Probabilit
y
0.07 0.25 0.28


13. The probability for X = 4 is missing. What is it?
A) 0.07 C) 0.40
B) 0.25 D) 0.50
Answer: C


14. What is the average number of courses full-time students at this college take this
semester?
A) 4 classes C) 4.74 classes
B) 4.26 classes D) 5 classes
Answer: C


15. What is the standard deviation of the number of courses full-time students at this
college take this semester?
A) 0.89 classes
B) 0.94 classes
C) 1 class
D) 23.36 classes
Answer: B


16. What is P(X > 4.74)?
A) 0.25
B) 0.28
C) 0.53
D) Impossible to calculate, because X cannot be 4.74.
Answer: C

Vous aimerez peut-être aussi