Académique Documents
Professionnel Documents
Culture Documents
Data Preparation
14-2
Chapter Outline
1) Overview
2) The Data Preparation Process
3) Questionnaire Checking
4) Editing
i.
5) Coding
i.
Coding Questions
ii.
Code-book
iii.
Coding Questionnaires
14-3
Chapter Outline
6) Transcribing
7) Data Cleaning
i.
Consistency Checks
ii.
Weighting
ii.
Variable Respecification
iii.
Scale Transformation
Adjusting
the
Data
14-4
14-5
Questionnaire Checking
A questionnaire returned from the field may be
unacceptable for several reasons.
Parts of the questionnaire may be incomplete.
The pattern of responses may indicate that
the respondent did not understand or follow
the instructions.
The responses show little variance.
One or more pages are missing.
The questionnaire is received after the
preestablished cutoff date.
The questionnaire is answered by someone
who does not qualify for participation.
14-6
Editing
Treatment of Unsatisfactory Results
Returning to the Field The questionnaires
with unsatisfactory responses may be returned
to the field, where the interviewers recontact
the respondents.
Assigning Missing Values If returning the
questionnaires to the field is not feasible, the
editor may assign missing values to
unsatisfactory responses.
Discarding Unsatisfactory Respondents
In this approach, the respondents with
unsatisfactory responses are simply discarded.
14-7
Data Cleaning
Treatment of Missing Responses
14-8
14-9
Years of
Sample Population
Education
Percentage
Percentage
Elementary School
0 to 7 years
2.49
4.23
1.70
8 years
1.26
2.19
1.74
High School
1 to 3 years
4 years
6.39
25.39
8.65
29.24
1.35
1.15
College
1 to 3 years
4 years
5 to 6 years
7 years or more
22.33
15.02
14.94
12.18
29.42
12.01
7.36
6.90
1.32
0.80
0.49
0.57
Totals 100.00
100.00
Weight
14-10
14-11
X
Zi = (Xi - )/sx
14-12
14-13
A Classification of Univariate
Techniques
Fig. 14.6
Univariate Techniques
Non-numeric Data
Metric Data
One Sample
* t test
* Z test
Two or More
Samples
Independe
nt
* TwoGroup
test
* Z test
* One-Way
ANOVA
Related
* Paired
t test
One Sample
* Frequency
* ChiSquare
* K-S
* Runs
* Binomial
Two or More
Samples
Independe
nt
* Chi-Square
* Mann-Whitney
* Median
* K-S
* K-W ANOVA
Related
*
*
*
*
Sign
Wilcoxon
McNemar
Chi-
14-14
A Classification of Multivariate
Techniques
Fig. 14.7
Multivariate Techniques
Dependence
Technique
One Dependent More Than One
Variable
Dependent
Variable
* CrossTabulation
* Analysis of
Variance and
Covariance
* Multiple
Regression
* Conjoint
Analysis
* Multivariate
Analysis of
Variance
and
Covariance
* Canonical
Correlation
* Multiple
Discriminant
Analysis
Interdependen
ce Technique
Variable
Interdependenc
e
* Factor
Analysis
Interobject
Similarity
* Cluster
Analysis
*
Multidimension
al Scaling