Académique Documents
Professionnel Documents
Culture Documents
5/13/17
Sampling, Classifying
and Graphing Data to
get information.
Week 2, Part 3:
Graphing
Quantitative
histogram
frequency polygon
ogive
Histograms
Minutes Commuting to work in New York
42
39
34 34
22
20
18
15
11
10
7
3
2 2
1
0
2.5 7.5 12.5 17.5 22.5 27.5 32.5 37.5 42.5 47.5 52.5 57.5 62.5 67.5 72.5 77.5
Minutes
A line version of the histogram
A plot of class midpoints and frequencies
A closed chart
Classes are mutually exclusive and
collectively exhaustive
Classes are always in numerical order
No classes are skipped, even if they are
empty
Strengths
Best way to represent several variables on
the same axes.
Weak Points
Based on a histogram, so same problems.
40
35
F 30
r
e 25
q
u 20
e
n 15
c
y
10
Minutes
Minutes Commuting to work in LA and New York
100
90
80
70
F
r 60
e
q 50
u
e 40
n
30
c
y 20
10
Minutes
LA NY Combined
Minutes Commuting to Work in LA, NY and Combined
100
90
80
F 70
r
e 60
q
50
u
e
40
n
c 30
y
20
10
0
2.5 7.5 12.5 17.5 22.5 27.5 32.5 37.5 42.5 47.5 52.5 57.5 62.5 67.5 72.5 77.5
Minutes
LA NY Combined
Agraph of a cumulative
frequency
Can be absolute or relative
cumulative
Non-decreasing graph
Plot of upper class limits and
frequencies
Always begins at (0,0)
The Ogive
Minutes Commuting to Work in New York
250
C
u
m
u 200
l
a
t
i
v
150
e
F
r 100
e
q
u
e
n 50
c
y
0
0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80
Minutes
Auseful alternative to the
histogram for appropriate data
sets.
Each observation is divided into a
stem and a leaf.
A stem is a subset of the values based on
leading digits.
A leaf is a list of trailing digits that have the
stem in common.
For example, consider observations
19, 27, 28, 32
The stems would be the leading digit in each
value, 1, 2 & 3
Stem-n-leaf plot
The leaves would be the trailing digit in
each value, 9, 7, 8 & 2.
The stems are listed vertically from smallest
to largest.
The leaves are listed horizontally, in numerical
order, to the right of the appropriate stem,
one value for every observation, even if values
repeat.
Observations Stem Leaves
242 24 2 4 5 5
244 25
245 26
245 27
Stem-n-leaf plot
33
34
0
9
1 3
35 0 5
Qualitative
bar or column chart
Frequencies are represented as
bars or columns, one for each
class
When differences between
classes is small, bars enable one
to better see the differences
Not a histogramcolumns do
NOT touch
Pie Charts
WorldOil Reserves
1.9%
1.4%
1.9% 1.7% Saudia Arabia
2.5% Canada
2.7%
Iran
3.0% 19.5%
Other
Iraq
5.1% United Arab Emirates
Kuwait
5.6% Venezuela
13.3% Russia
7.2% Libya
Nigeria
Mexico
7.2%
9.9% Kazakhstan
Angola
8.3%
9.0% United States
China
Pareto Diagram or chart
A column chart for categorical data.
Columns (categories) are ordered
from highest frequency to lowest
Sorts the important few from the
trivial many.
Only useful on non-numerical data.
If there are lots of categories, it
rapidly pinpoints the important
groups.
Pareto Diagram
WorldOil Reserves
300
B
250
i
l b
l a 200
i r
o r 150
n e
s l 100
s
o 50
f
Country