Académique Documents
Professionnel Documents
Culture Documents
Statistical Description of
Data
to accompany
Chapter 3 - Learning
Describe
data using measures of
Objectives
Measures of
Central
Tendency,
The
Center
Mean
, population;x
, sample
Weighted Mean
Median
Mode
(Note comparison of mean,
median, and mode)
Measures
of
Dispersion,
Range
Mean absolute deviation
Variance
(Note the computational
difference between 2 and s2.)
The
Spread
Standard deviation
Interquartile range
Interquartile deviation
Coefficient of variation
Measures
of Relative
Position
Quantiles
Quartiles
Deciles
Percentiles
Residuals
Standardized values
Measures
of
Associatio
n
Coefficient of correlation, r
Direction of the relationship:
direct (r > 0) or inverse (r < 0)
Strength of the relationship:
When r is close to 1 or 1, the linear
relationship between x and y is
strong. When r is close to 0, the
linear relationship between x and y
is weak. When r = 0, there is no
linear relationship between x and y.
Coefficient of determination,
r2
The percent of total variation in y
that is explained by variation in x.
Mean
= (xi)/n
Comparing Measures of
Central Tendency
n 1
n1
for a sample,
s s2
Be sure you know how to get the values
easily from your calculator and computer
softwares.
Coefficient of Variation
100%
quartile.
Quartiles divide the values of a data set into four
subsets of equal size, each comprising 25% of
the observations.
To find the first, second, and third quartiles:
1. Arrange the N data values into an array.
2. First quartile, Q1 = data value at position (N + 1)/4
3. Second quartile, Q2 = data value at position 2(N +
1)/4
4. Third quartile, Q3 = data value at position 3(N + 1)/4
What is a Standardized
Value?
How
far above or below the individual
value is compared to the population mean
in units of standard deviation
How far above or below= (data value
mean)
which is the residual...
In units of standard deviation
x = divided by
Why is a Standardized
Value Important?
Why is a Standardized
Value Important?
0.08%
0.00 %
0.21 %
0.09 %
0.00 %
0.15%
0.03 %
0.01 %
0.05 %
0.16 %
0.18%
0.02%
0.11 % 0.17%
0.10 % 0.19 %
0.03 % 0.00 %
0.04 %
0.10 %
2008 Thomson South-Weste
Mean = 0.0736%
Standard Deviation =
0.0684%
1
1
)100%
Ans: (1 k2 )100%(1
2
1.50
(10.4444)100%55.55%
CV 100%
100%92.9%
0.0736%