Académique Documents
Professionnel Documents
Culture Documents
4) Cross-Tabulations
i. Two Variable Case
ii. Three Variable Case
Frequency Distribution
In a frequency distribution, one variable is
considered at a time.
A frequency distribution for a variable
produces a table of frequency counts,
percentages, and cumulative percentages for
all the values associated with that variable.
Frequency Distribution of 15-6
Familiarity
with the Internet
Cumulative
Value label Value Frequency (N) Percentage percentage
TOTAL 30 100.0
15-7
Frequency Histogram
8
7
6
Frequency
5
4
3
2
1
0
2 3 4 5 6 7
Familiarity
Statistics Associated with 15-8
Frequency Distribution
Measures of Location
X
n
The mean, or average value, is the most commonly used
measure of central tendency. The mean, ,is given by
Where, X= Σ X i /n
i=1
Xi = Observed values of the variable X
n = Number of observations (sample size)
Frequency Distribution
Measures of Location
Frequency Distribution
Measures of Variability
The range measures the spread of the data.
It is simply the difference between the largest
and smallest values in the sample. Range =
Xlargest – Xsmallest.
The interquartile range is the difference
between the 75th and 25th percentile. For a
set of data points arranged in order of
magnitude, the pth percentile is the value that
has p% of the data points below it and (100 -
p)% above it.
Statistics Associated with 15-11
Frequency Distribution
Measures of Variability
CV = sx/X
Statistics Associated with 15-12
Frequency Distribution
Measures of Shape
Skewness. The tendency of the deviations from
the mean to be larger in one direction than in the
other. It can be thought of as the tendency for one
tail of the distribution to be heavier than the other.
Skewness
Figure 15.2 of a Distribution
Symmetric
Distribution
Skewed Distribution
Mean
Media
n
Mode
(a)
Mean Median
Mode (b)
15-14
Cross-Tabulation
While a frequency distribution describes one
variable at a time, a cross-tabulation
describes two or more variables
simultaneously.
Cross-tabulation results in tables that reflect
the joint distribution of two or more variables
with a limited number of categories or distinct
values.
15-15
I nternet Us
15-17
Gender Ligh
15-18
Purchase of
Fashion
Clothing
15-19
Ownership of Expensive
Automobiles by Education Level
Own Expensive
Automobile
Ownership of Expensive Automobiles 15-21
Eat Frequently in F
Food Restaurants
Eating Frequently in Fast Food-
15-25
Restaurants
by Family Size & Income
Income
Eat Frequently in Fast- Low High
Food Restaurants
Family size Family size
Small Large Small Large
Yes 65% 65% 65% 65%
No 35% 35% 35% 35%
Column totals 100% 100% 100% 100%
Number of respondents 250 250 250 250
15-26
Steps Involved in
Hypothesis Testing
Formulate H and H 0 1
Parametric Non-parametric
Tests (Metric Tests (Nonmetric
Tests) Tests)