Académique Documents
Professionnel Documents
Culture Documents
CS 5014
5014
Research
Research Methods
Methods inin CS
CS
Prof.
Prof. Frakes
Frakes
Research
Research Designs
Designs
© W. Frakes 2003 1
Experimentaion
Experiment - A procedure for
determining the effect of one set of
variables on another.
1
Scale of Experiments
Small Scale Experiments - involve a
few subjects, usually working alone on
a relatively simple task that can be
completed in a few hours or less.
Threats to Experiment
© W. Frakes 2003 4
2
Research Design
• Any study needs a structure or plan which defines
- Number and type of variables to be studying
- The relationships among the variables
Such a plan is called a design.
• Experimental vs. non-experimental designs
- The distinction is based on the degree of control the
experimenter has over subjects and conditions - The most
important difference being whether or not we can randomly
assign subjects to levels of the independent variables.
- many of the same principle apply to both experimental and
non-experimental design.
© W. Frakes 2003 5
Research Design -
the plan, structure, and strategy of investigation
• Purposes
1. To provide answers to research problems.
2. To control variability.
• Research Problems can be stated as hypothesis.
• Research Design sets up the framework for adequate
tests of the relationships among variables.
© W. Frakes 2003 6
3
Research Design (Continued)
• Basic Concepts: - Should already know these
- Variables
- Measurement - Measurement Error
- Reliability - Validity
- Control
- Randomization - random assignment of subjects to
treatment conditions
- Confounding (Third) Variable
- Generalizability of Results
© W. Frakes 2003 7
O = observation or measurement
© W. Frakes 2003 8
4
Threats to Internal Validity
1.History - Specific events ocurring between the
first and second measurement in addition to the
experimental variable.
O1 X O2
___________
History
If X is use of a new tool and O is a measure of productivity,
History might involve a strike, education, new mgmt
practices, etc.
© W. Frakes 2003 9
© W. Frakes 2003 10
5
3.Testing
- Hawthorne Effect
© W. Frakes 2003 11
4. Instrumentation
Metrics Metrics
Tool Tool’
Tool
Modified
© W. Frakes 2003 12
6
5.Statistical Regression
© W. Frakes 2003 13
© W. Frakes 2003 14
7
7. Experimental Mortality
© W. Frakes 2003 15
© W. Frakes 2003 16
8
12. Multiple treatment interference
© W. Frakes 2003 17
Pre-Experimental Designs
Quasi-experiments
(Campbell and Stanley)
© W. Frakes 2003 18
9
One shot case study
X O
© W. Frakes 2003 19
X O
e.g.
X = use Ada
O = person-months of effort
© W. Frakes 2003 20
10
One group pretest - Posttest design
O1 X O2
O1, O2: Productivity Measure X: Use of C++
© W. Frakes 2003 21
X = learn Ada
11
Static Group Comparison
X O1
-------
O2
O1 O2 O3 O4 X O5 O6 O7 O8
- used a lot in 19th century biological and physical
experimentation
e.g. if I have a bar of iron (unchanged in weight for many
months, then dip it in nitric acid - the loss in weight of the
iron bar would follow this experimental logic.
The logic of this is that a discontinuity in the measurement
series will be caused by the treatment X
© W. Frakes 2003 24
12
The time series experiment - example
Faults/
KNCSL
Time
© W. Frakes 2003 25
O X O
------------
O O
* Similar in structure to design 4, but without random
assignment
Experimental and Control Group do not have pre-
experimental sampling equivalence. Rather than groups
represent
e.g. A company has two programming sites (e.g. Pala Alto
and Fairfax). A new tool is introduced in Pala Alto. Both
sites are measured on same DV, before and after the
treatment.
© W. Frakes 2003 26
13
Correlational Design
• Correlational Design - Purely Observational - The
investigator does not intervene in any way, or
expose subjects to a manipulation.
- Rather measures are taken on something and relationships
are determined among the measures.
- These measures can be taken by
- Direct observation
- Questionaires
- Existing records
© W. Frakes 2003 27
© W. Frakes 2003 28
14
Cross Sectional Design (Cont’d)
• One way to strengthen the claim of cross sectional
data to causality would be to retake the same
measure at a later point in times.
Such a design would be represented by
O O’
where O is Causes Measure and O’ is Effects Measure.
© W. Frakes 2003 29
Quasi-experiment Exercise
• Break into groups and design a quasi-experiment to test the
effect of the WWW on teaching
• Which biases does your design handle?
• Which biases does your design not handle?
© W. Frakes 2003 30
15
Random assignment
© W. Frakes 2003 31
A True Experiment
Posttest - Only Design
R 1 X 1 O
R 2 X 2 O
R 3 X 3 O
. . .
. . .
. . .
R n X n O
© W. Frakes 2003 32
16
True Experimental Designs
Pretest-Posttest Control Group Design
R O1 X O2
R O3 O4
8 Programmers are randomly assigned to one of the two
groups. One group uses a coverage analyzer - The control
does not. The DV is number of faults discoverd.
How might your statistically analyze this data?
Take gain scores for each group
O2 - O1
O4 - O4
and do a t-test (or non-parametric equivalent) on them.
© W. Frakes 2003 33
Experiment Example
Treatment Measure
R Ada LOC
R PL-1 LOC
R C LOC
© W. Frakes 2003 34
17
Solomon 4 Group Design
R O1 X O2
R O3 O4
R X O5
R O6
• Allows us to estimate with estimate external validity factors.
Design 4 (O1 -O4) is paralleled with experimental and control
groups lacking the pretest.
• This allows the effect of testing and the interaction of testing
and X to be determined.
• The effect of X is replicated in 4 ways.
O2 > O1, O2 > O4, O5 > O6, O5 > O3
© W. Frakes 2003 35
N0 X X Main
Pretested O4 O2 Ef f ects
o f Pre-
No Pretest O6 O5 testing
Main Ef f ects o f X
© W. Frakes 2003 36
18
Posttest-Only Control Group Design
R X O1
R O2
• We allow the randomization to take care of the
equivalence of the groups before the treatment.
• Controls for testing as main effect, but does not
measure it.
Statistical tests
T- test (or non-parametric equivalent)
ANOVA - 2 group
© W. Frakes 2003 37
Factorial Design
• Most real experiments involve several IV’s and are
meant to determine their combined effect on the DV.
e.g. 2x2 Factorial Design
2 IV’s with 2 levels each DV= Faults/NCSL
Real MIS
Time
C
C++
with this design you examine 2 main effects: C vs C++, Real
Time vs MIS, and also on interaction of the 2 IV’s.
This can be analyzed with ANOVA.
© W. Frakes 2003 38
19
True Experiment Exercise
• Break into groups and design a true experiment to test the
effect of the WWW on teaching
• Which biases does your design handle?
• Which biases does your design not handle?
© W. Frakes 2003 39
20