Vous êtes sur la page 1sur 13

# BA578Assignment1-Solutions: due by Midnight (11:59pm) Sunday, September 9th , 2012 (Chapters 1, 2 and 3) True/False questions carry 1 point each,

Multiple Choices carry two points each and the Essay type questions carry 6 points each. The total score is 70 points. True/False (one point each) Chapter 1 1. An individuals credit score is an example of a ratio scale variable. FALSE There is no intrinsic Zero. An arbitrary minimum is established. 2. An example of a qualitative variable is the miles per gallon rate of a car. FALSE 3. If we examine some of the population measurements, we are conducting a sample of the population. TRUE 4. Census always gives more accurate values than a sample. FALSE Sometimes Census involves larger non-sampling (or Measurement) errors, especially when the population is large and measurement requires higher skills/expertise. 5. The telephone number of an individual is an example of an Ordinal scale variable. FALSE It is a nominal scale variable used only for identification. Chapter 2 6. A bar chart is a graphic that can be used to depict qualitative data. TRUE 7. The sample cumulative distribution function is initially increasing, and then decreasing. FALSE It is never decreasing 8. When we wish to summarize the proportion (or fraction) of items in a class we use the relative frequency for the class. TRUE Chapter 3 9. The income distribution is skewed to the right; therefore the Median Income must be less than the Mean Income. TRUE 10. The range of the measurement is the largest measurement plus the smallest measurement. FALSE My Instructions on Ch 3 11. The median is said to be more resistant to extreme values. TRUE My Instructions on Ch 3 12. If there are 7 classes in a frequency distribution then the fourth class must contain the Median. FALSE (depends on how the class frequencies are distributed among the classes)

Multiple Choices (Two points each) Chapter 1 1. Ratio variables have the following unique characteristic: (select only one) A. Meaningful order B. Predictable C. Categorical in nature D. An inherently defined zero value 2. Which of the following is a categorical (qualitative) variable? A. Air Temperature B. Bank Account Balance C. Daily Sales in a Store D. Whether a Person Has a Traffic Violation E. Both A and D 3. Which of the following is a quantitative variable? A. The make of a TV B. A person's gender C. The age of a house D. Whether a person is a college graduate E. Whether a person has a charge account 4. The two types of qualitative variables are: A. Ordinal and ratio B. Interval and ordinal C. Nominative and ordinal D. Interval and ratio E. Nominative and interval 5. Letter Grade is an example of a(n) ________ variable. A. Nominative B. Ordinal C. Interval D. Ratio Chapter 2 6. A company collected the ages from a random sample of its middle managers with the resulting frequency distribution shown below: Class Interval 20 - 25 25 - 30 30 - 35 Frequency 8 16 11 2

35 - 40 40 45 45 50

5 7 6

What would be the approximate shape of the relative frequency histogram? (You can draw the histogram to give you a visual impression). A. Symmetrical B. Uniform C. Multiple peak D. Skewed to the left E. Skewed to the right

## class 20-25 25-30 30-35 35-40 40-45 45-50

Freq 8 16 11 5 7 6

Freq
20 15 10 5 0 20-25 25-30 30-35 35-40 40-45 45-50

Freq

It is clearly skewed to the right (long right-tail) and one highest peak.

7. A(n) ______ is a graph of a cumulative relative frequency distribution. A. Frequency Polygon B. Scatter plot C. Histogram D. Ogive 3

8. If there are 50 values in a data set, how many classes should be created for a frequency histogram? A. 4 B. 5 C. 6 D. 7 E. 8 Chapter 3 9. According to a survey of the top 10 employers in a major city in the Midwest, a worker spends an average of 413 minutes a day on the job. Suppose the standard deviation is 25 minutes and the time spent is approximately a normal distribution. What are the times that approximately 99.73% of all workers will fall? A. [388 438] B. [338 488] C. [363 463] D. [338 438] E. [388 488] 10. In a statistic class, 10 scores were randomly selected with the following results were obtained: 74, 73, 77, 77, 71, 68, 65, 77, 67, 66 What is the median? A. 71.5 B. 71.0 C. 72.0 D. 71.0 E. 73.0 11. In a statistic class, 10 scores were randomly selected with the following results were obtained (mean=71.5): 74, 73, 77, 77, 71, 68, 65, 77, 67, and 66. What is the Variance? A.12.00 B. 22.72 C. 4.77 D. 516.20 E. 144.00 12. When using the Chebyshevs theorem to obtain the bounds for a 99.73 percent of the values in a population, the interval generally will be ___________ the interval obtained for the same percentage if normal distribution is assumed (empirical rule). A. Wider than B. Narrower than C. The same as D. greater than or less than (depending on the size of the standard deviation) My Instructions on Ch 3 page 13.

13. Find the z-score for a IQ test score of 90 when the mean is 100 and the standard deviation is 12. A. 7.5 B. -0.7.5 C. -0.83 D. 0.83 E. -10 14. Time to degree has become a hot topic with federal legislators. At one state university it was necessary to do a quick calculation when one of the local congressmen called the president. Twenty students were randomly selected from the most recent graduating class and the number of semesters they were enrolled was calculated (mean=9.6) 7, 8, 10, 11, 8, 6, 10, 9, 9, 8, 13, 12, 8, 11, 11, 14, 8, 7, 10, 12 What is the variance? A. 8 B. 2.16 C. 9.5 D. 4.67 E. 21.846 You dont need to show the following table (I obtained using MegaStat) but I am giving it to show how quickly you can get answer with MegaStat Descriptive statistics count mean sample variance sample standard deviation minimum maximum range

## 20 9.60 4.67 2.16 6 14 8

Or you can use calculator and the formula given in the book and my instructions for sample variance if you find it difficult to use MegaStat. Essay Type Questions (6 points each) Chapter 2 1. Consider the following data on distances traveled by 40 people to visit the local amusement park. distance freq 1-8 15 9-16 12 17-24 7 25-32 5 33-40 1 total 40 5

Expand and construct the table adding columns for relative frequency and cumulative relative frequency and construct the histogram of frequencies, plot the frequency polygon and the Ogive curve using Excel.

distance freq rel.fr cum.rel.fr 1-8 15 0.375 0.375 9-16 12 0.3 0.675 17-24 7 0.175 0.85 25-32 5 0.125 0.975 33-40 1 0.025 1 total 40 1 na

freq
20 15 10 5 0 1-8 9-16 17-24 25-32 33-40 freq

16 14 12 10 8 6 4 2 0 1-8 miles 9-16 miles 17-24 miles 25-32 miles 33-40 miles Series1

2. Math test anxiety can be found throughout the general population. A study of 120 seniors at a local high school was conducted. The following table was produced from the data. Complete the missing parts. (Work step by step to solve this puzzle. Round the frequencies to the nearest whole number.)
Score Range Very anxious 37-50 Anxious/Tents 33-36 Mild Anxiety 27-32 Relaxed 20-26 Very Relaxed 10-19 Total Frequency 12 24 0.333 Rel frequency 0.20 Cumulative Rel. freq.

We have to work step by step using our knowledge of Frequency tables to solve this puzzle. For the first class, Relative Frequency and Cumulative Relative Frequency will be the same. So we write 0.20 in the first row last column. Moreover, we find the frequency for this class by multiplying Relative frequency 0.20 by total frequency 120 to get 24. Thus, first row is completely filled. In the second row we convert the given frequency 12 into relative frequency after dividing by 120 which gives 0.10. Therefore, the cumulative relative frequency in the second row will be 0.30. Thus, second row is filled too. Next we convert the given relative frequency in the fifth row into frequency after multiplying 0.333 by 120 and rounding to get 40. Since the total frequency is given as 120, we can find the remaining frequency for the third row once we have the frequencies for the other four rows. It is calculated as 20. The rest of the story should be clear to you. Just remember that the total of all frequencies must be the given number 120 and the total of all relative frequencies must always be 1.

Score Range Very anxious 37-50 Anxious/Tents 33-36 Mild Anxiety 27-32 Relaxed 20-26 Very Relaxed 10-19 Total

Frequency 24 12 20 24 40 120

## Cumulative Rel. freq. 0.20 0.30 0.467 0.667 1.000 NA

3. The number of weekly sales calls by a sample of 25 pharmaceutical salespersons is below: 24, 56, 43, 35, 37, 27, 29, 44, 34, 28, 33, 28, 46, 31, 38, 41, 48, 38, 27, 29, 37, 33, 31, 40, and 50. Construct a histogram and plot the frequency polygon. (Use Excel: Instructions given in Instructions for Chapter 2 and also in Textbook Appendix of Chapter 2) Using Excel, Data, then Data analysis, then Histogram (living bin blank) and checking appropriate boxes you can get Cumulative % 4.00% 28.00% 52.00% 80.00% 92.00% 100.00%

## Bin Frequency 24 1 30.4 6 36.8 6 43.2 7 49.6 3 More 2

Histogram
8 7 6 Frequency 5 4 3 2 1 0 24 30.4 36.8 Bin 43.2 49.6 More 20.00% 0.00% 80.00% 60.00% 40.00% Frequency Cumulative % 120.00% 100.00%

frequency polygon
8 7 6 5 4 3 2 1 0 0 10 20 30 40 50 60 1 3 2 freq 6 6 7

Some of you have determined five classes on your own and specified the bin ranges, which is also good. The answers will look different but both are acceptable, because the rule is only a recommendation or guide. There can be a little variation from the rule depending on the nature of the data or the problem under study. However in this course if you have to decide the number of classes on your own then follow the rule. If you let the computer determine the number of classes leaving the bin rane blank, that is good too. Finally, MegaStat and excel will give a little different answers depending on how they interpret the class boundaries (whether to include the upper limit of a class boundary or the lower limit). But both are acceptable. A researcher makes the final judgement . Suppose a researcher decided after looking at the data that the lower boundary should be 24 and number of classes only 5, and wanted to use MegaStat, then: Go to Excel, type the data as it is given (no need to order), go to Add-Ins, then MegaStat, then Frequency DtistributionQuantitative, then select the range of input, put 21 as lower boundary and 6 as class interval (6 as class intervals gives 6 classes in this case which the researcher finds too many for 25 observations), then check for histogram, ogives etc, and click ok. The result is as follows which different from above but is equally acceptable. Frequency Distribution - Quantitative Data lower 24 31 38 45 52 upper midpoint 31 28 38 35 45 42 52 49 59 55 width frequency 7 7 7 8 7 6 7 3 7 1 25 9 percent 28.0 32.0 24.0 12.0 4.0 100.0

Histogram

Percent

Data

## 100.0 Cumulative Percent 75.0 50.0 25.0 0.0 17 24 31

Ogive

38 Data

45

52

10

Chapter 3 4. The following frequency table summarizes the ages of 60 shoppers at the local grocery store.

First calculate the sample mean for the above frequency table using the method for grouped data. Then, calculate the sample variance and sample standard deviation for this data set. Age of the shopper Frequency(f) 15-23 10 24-32 21 33-41 10 42-50 8 51-59 5 60-68 6 TOTAL 60 = N = (f * M) / N = 2175/60 = 36.25 Mi - -17.25 -8.25 0.75 9.75 18.75 27.75 Midpoint(Mi) 19 28 37 46 55 64 --------------f* Mi 190 588 370 368 275 384 2175

(Mi - )2 297.5625 68.0625 0.5625 95.0625 351.5625 770.0625 Total 2 ) 2) / (N-1) = 11549.25 / 59 = 195.75 s = (f * (Mi f 10 21 10 8 5 6 Mi 19 28 37 46 55 64 s = square root of sample variance = 13.99

## f * (Mi - ) 2 2975.625 1429.3125 5.625 760.5 1757.8125 4620.375 11549.25

5. For the data in Essay question number 3 above, calculate the sample Mean, Variance and Standard deviation without grouping the data.
Using Excel (Data- Descriptive Statistics-Summary Statistics)

## Column1 Mean Standard Error 36.28 1.636582

11

Median Mode Standard Deviation Sample Variance Kurtosis Skewness Range Minimum Maximum Sum Count
Using MegaStat

## 35 37 8.182909 66.96 -0.13027 0.659149 32 24 56 907 25

Descriptive statistics
X 25 36.28 66.96 8.18 24 56 32 907.00 34,513.00 1,607.04

count mean sample variance sample standard deviation minimum maximum range sum sum of squares deviation sum of squares (SSX)

## You get the same results using calculator as follows:

No. of calls (X) Deviation from mean (X-36.28) (X-36.28) 150.80 388.88 45.70 1.64 0.52 86.12 53.00
2

## 24 -12.28 56 19.72 43 6.76 35 -1.28 37 0.72 27 -9.28 29 -7.28

12

44 7.72 34 -2.28 28 -8.28 33 -3.28 28 -8.28 46 12 31 -5.28 38 1.72 41 4.72 48 11.72 38 1.72 27 -9.28 29 -7.28 37 0.72 33 3.28 31 --5.28 40 3.72
Total = 907 and

59.60 5.20 68.56 10.76 81 68.56 27.88 2.96 22.28 137.35 2.96 86.11 53.00 0.52 10.75 27.88 13.84 188.23 Total = 1607.04

= 907/25 = 36.28

50 13.72
Total = 0

= 8.18

13