Académique Documents
Professionnel Documents
Culture Documents
mean
mode
median
C
h
a
p
t
e
r
x
i 1
x1 x2 x3 ... xn
C
h
a
p
t
e
r
Summation Notation
so if
x1 = 1, x2 = 2, x3 = 3 and x4 = 4,
then
4
x
i 1
1 2 3 4 10
C
h
a
p
t
e
r
Notation (continued)
Sometimes we will
have to square the
values before we
add them:
2
2
2
2
2
x
...
x
i 1 2 3
n
i 1
2
2
2
2
2
x
4
i
i 1
1 4 9 16 30
n
2
xi 1 2 3 4 10 2 100
i 1
C
h
a
p
t
e
r
Variability
shows how strongly the data
cluster around value(s)
2
spread
5
C
h
a
p
t
e
r
Mean
Median
Mode
center
2
6
C
h
a
p
t
e
r
x1 x2 ... xn
x
i 1
x x ... x N i 1
1 2
N
N
C
h
a
p
t
e
r
x
i 1
2,950 ,000
10
295,000
312,000
285,000
317,000
294,000
297,000
10
315,000
287,000
=2,950,000
C
h
a
p
t
e
r
10
97,000
93,000
110,000
i 1
2,950 ,000
10
295,000
121,000
113,000
95,000
100,000
122,000
99,000
2,000,000
=2,950,000
outlier
C
h
a
p
t
e
r
Lowtown
Fancytown
500000
outlier
1000000
1500000
$295,000
295000
2000000
C
h
a
p
t
e
r
Lowest
value
50%
Median
50%
11
C
h
a
p
t
e
r
The median is
between the
two middle
values
313,000
315,000
317,000
Median, M
C
h
a
p
t
e
r
The median is
between the
two middle
values
121,000
122,000
2,000,000
Median, M
100,000 110,000
$105,000
2
C
h
a
p
t
e
r
Mode
14
C
h
a
p
t
e
r
2
15
C
h
a
p
t
e
r
2
16
C
h
a
p
t
e
r
Range
Variance
Standard Deviation
spread
2
17
C
h
a
p
t
e
r
Range
Equal to the largest measurement minus the smallest
measurement.
Easy to compute, but not very informative
Considers only two observations (smallest / largest)
Monthly Salaries
3,000
2,000
5,000
8,000
5,000
4,000
9,000
18
C
h
a
p
t
e
r
x
)
(
x
x
)
...
(
x
x
)
2
n
s2 1
n 1
C
h
a
p
t
e
r
s s2
2
(
x
x
)
i
i 1
n 1
C
h
a
p
t
e
r
x2
1
2
Sample variance,
( x1 x ) 2 ( x 2 x ) 2 ( x 3 x ) 2
s
n 1
(1 2) 2 (2 2) 2 (3 2) 2
3 1
1
2
Sample mean,
s2
1 1
C
h
a
p
t
e
r
x 12, x
13
2
22
C
h
a
p
t
e
r
2
x 12, x
13
Sample variance:
2
2
2
(
12
)
x n
13
17 .283
s2
n 1
17 1
s2
.283 .532
23
C
h
a
p
t
e
r
Percentile
For any set of n measurements (arranged in ascending
or descending order), the pth percentile is a number
such that p% of the measurements fall below that
number and 100(1-p)% fall above it.
Upper Quartile (QU or Q3) = 75th percentile
Median (Q2) = 50th percentile
Lower quartile (QL or Q1) = 25th percentile
2
24
C
h
a
p
t
e
r
Sample
Dataset
3
5
Step 2:
The 4th location value of the ordered list is
the 50th percentile = 3
0
1
9
2
7
2
25
C
h
a
p
t
e
r
Sample
Dataset
3
5
0
1
Step 2:
The mean of the 4th location value and the
5th location value of the ordered list is the
50th percentile = (3+5)/2=4
9
2
7
10
26
C
h
a
p
t
e
r
Sample z-score
Tells the distance between a measurement
x and the mean (x ), expressed in terms of
standard deviations.
The sample z-score for a measurement x is
xx
s
2
27
C
h
a
p
t
e
r
1
s
75
28
C
h
a
p
t
e
r
Outlier
A measurement that is unusually large or
small relative to the other values.
Possible causes:
1. Observation, recording or data entry error
2. Item is from a different population
3. A rare, chance event
2
29
C
h
a
p
t
e
r
detect outliers
2
30
C
h
a
p
t
e
r
Lower Quartile
(QL)
Median
Upper Quartile
(QU)
Minimum Value
30
35
Maximum Value
40
45
50
55
BoxPlot
2
31
C
h
a
p
t
e
r
2
30
35
40
45
50
55
BoxPlot
32
C
h
a
p
t
e
r
Outliers
An observation is an outlier if it is
An outlier is extreme if it is
2
33
C
h
a
p
t
e
r
Outliers Example
Student Ages
17
19
19
20
21
22
22
25
18
19
19
20
21
22
23
26
18
19
19
20
21
22
23
28
18
19
19
20
21
22
23
28
18
19
19
20
21
22
23
30
18
19
19
20
21
22
23
37
19
19
20
21
21
22
23
38
19
19
20
21
21
22
24
44
19
19
20
21
21
22
24
47
19
19
20
21
21
22
24
Lower
Quartile
Median
Upper
Quartile
IQR = 22 19 = 3
2
34
C
h
a
p
t
e
r
Outliers Example
Student Ages
17
19
19
20
21
22
22
25
18
19
19
20
21
22
23
26
18
19
19
20
21
22
23
28
18
19
19
20
21
22
23
28
18
19
19
20
21
22
23
30
18
19
19
20
21
22
23
37
19
19
20
21
21
22
23
38
19
19
20
21
21
22
24
44
19
19
20
21
21
22
24
47
19
19
20
21
21
22
24
Lower
Quartile
Median
Upper
Quartile
Outliers
35
C
h
a
p
t
e
r
Outliers Example
Student Ages
17
19
19
20
21
22
22
25
18
19
19
20
21
22
23
26
18
19
19
20
21
22
23
28
18
19
19
20
21
22
23
28
Moderate
Outliers
18
19
19
20
21
22
23
30
18
19
19
20
21
22
23
37
19
19
20
21
21
22
23
38
19
19
20
21
21
22
24
44
19
19
20
21
21
22
24
47
19
19
20
21
21
22
24
Lower
Quartile
Median
Upper
Quartile
Extreme
Outliers
36
C
h
a
p
t
e
r
Outliers Example
Mild
Outliers
Smallest data
value not
an outlier
Extreme
Outliers
Largest data
value not
an outlier
37
C
h
a
p
t
e
r
2
38