Académique Documents
Professionnel Documents
Culture Documents
Year : 2014
Session 2-3
DESCRIPTIVE STATISTICS
Topics
1. Introduction : Data and Statistics
2. Descriptive Statistics
3. Introduction to Probability
4. Discrete Probability Distributions
5. Continuous Probability Distributions
6. Sampling and Sampling Distributions
7. Interval Estimation
8. Hypothesis Tests
9. Analysis of Variance
10. Simple Linear Regression
4. Measures of Location
5. Measures of Variability
Bina Nusantara University 8
Frequency Distribution
Relative Frequency
Bar Graph
Pie Chart
9
8
Frequency
7
6
5
4
3
2
1
Rating
Poor Below AverageAbove Excellent
Average Average
Bina Nusantara University 17
(e) Pie Chart
The pie chart is a commonly used graphical device for
presenting relative frequency distributions for qualitative data.
First draw a circle; then use the relative frequencies to subdivide
the circle into sectors that correspond to the relative frequency
for each class.
Since there are 360 degrees in a circle, a class with a relative
frequency of .25 would consume .25(360) =
90 degrees of the circle.
Exc.
Poor
5%
10%
Below
Average
Above
15%
Average
45%
Average
25%
Quality Ratings
Bina Nusantara University 19
Insights Gained from the Preceding Pie Chart
Dot Plot
Histogram
Cumulative Distributions
Ogive
91 78 93 57 75 52 99 80 97 62
71 69 72 89 66 75 79 75 72 76
104 74 62 68 97 105 77 65 80 109
85 97 88 68 83 68 71 69 67 74
62 82 98 101 79 105 79 69 62 73
Then each data value is represented by a dot placed above the axis.
.. .. . . .
. . .. .....
.. ..........
.. .. .. .. . .. . . ...
. . ... .
50 60 70 80 90 100 110
Cost ($)
10
8
6
4
2 Parts
Cost ($)
50 60 70 80 90 100 110
Bina Nusantara University 31
(d) Cumulative Distribution
The cumulative frequency distribution shows the number of
items with values less than or equal to the upper limit of each
class.
The cumulative relative frequency distribution shows the
proportion of items with values less than or equal to the upper
limit of each class.
The cumulative percent frequency distribution shows the
percentage of items with values less than or equal to the upper
limit of each class.
cumulative frequencies, or
Ogive
Because the class limits for the parts-cost data are 50-59, 60-69,
and so on, there appear to be one-unit gaps from 59 to 60, 69 to
70, and so on.
These gaps are eliminated by plotting points halfway between
the class limits.
Thus, 59.5 is used for the 50-59 class, 69.5 is used for the 60-69
class, and so on.
Leaf Units
The left and top margin labels define the classes for the two
variables.
Crosstabulation
The number of Finger Lakes homes sold for each style and
price for the past two years is shown below.
Only three homes in the sample are an A-Frame style and priced
at more than $99,000.
Row Percentages
x
Bina Nusantara University 52
No Apparent Relationship
The median of a data set is the value in the middle when the data
items are arranged in ascending order.
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
The mode of a data set is the value that occurs with greatest
frequency.
If the data have exactly two modes, the data are bimodal.
If the data have more than two modes, the data are multimodal.
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
Range
Interquartile Range
Variance
Standard Deviation
Coefficient of Variation
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
s s2
If the data set is a population, the standard deviation is denoted
(sigma). 2
Bina Nusantara University 73
Exercises
1. The five top-selling vehicles during 2003 were the Chevrolet
Silverado/C/K pickup, Dodge Ram pickup, Honda Accord, and
Toyota Camry (Motor Trend, 2003). Data from a sample of 50
vehicle purchases are presented in table.