Académique Documents
Professionnel Documents
Culture Documents
General Considerations
Valid data
analysis needs:
solid planning
valid data
correct
analysis
correct
interpretation
Data Types
and Analysis
Two Examples that did not take into account the context
behind the study
Example One: A study on children's' mathematics ability concluded
that the larger the foot size, the smarter the child is, because these are
highly correlated.
Example Two: In a small city, the population size of the city is
predicted by the number of storks in the city, since the regression
model suggests that number of storks is accurate in predicting the
population size along the years.
These two examples ignore the context behind of the study. As the result,
the conclusion goes against common logic.
The problem with Example One is that the root cause of mathematics
ability for young children is associated with their age. It happens that foot
size is larger when they are older.
Example Two misuses the response and the predictor. The situation of the
Data Analysis
A valid data analysis starts with a solid
planning of the study.
For a survey study or an observational study where a controlled experiment can not be
conducted, one should begin by considering the adequate measurement, the target
population, sampling techniques, sample size, factors associated with the intended
characteristics, designing questionnaires, and ways of distributing and collecting the survey.
For a controlled experimental study, one should begin by considering the measurement, the
potential confounding factors associated with the measurement, the intended factors for the
experiment, the design of the experiment, experimental units, sample size, and possible
statistical techniques for analysis based on the experimental design.
In many situations, a controlled experiment may not be possible. However, a semiexperimental design may be possible. For these situations, the background and environmental
factors are extremely important. For example, in studying the effect of different teaching
pedagogical approaches, one may not be able to perform a randomization scheme to select
subjects for each teaching pedagogy. It may happen that one class has much better students
than the other class. Hence, the effectiveness of the teaching method is confounded with
students' initial ability. If we collect the information of potential confounding factors such as
their GPA, gender, and age, and conduct a pre-test, then a proper data analysis such as
Analysis of Covariance or Repeated Measure Analysis can be performed to make a proper
comparison.
It is often the case that appropriate statistical procedures are associated with the types of data.
Categorical data needs to be analyzed using procedures that are developed for analyzing
categorical data. We do not perform frequency analysis or crosstabulation procedures to
analyze continuous data. More detailed discussion is given in the Data Type and Possible
Analysis Section.
It happens often in data analysis that one needs to conduct several analyses before an
appropriate one is selected. One should expect that the analysis is never only a one step
process. It involves many back and forth analyses and decisions for a proper analysis.
Bottom line is
If you are not familiar with any step described above, seek
statistical consulting help.