Académique Documents
Professionnel Documents
Culture Documents
CHAPTER FOUR
In this chapter, we will provide an overview of the research process. This will en-
tail a description of the two overarching stages of research. We will identify
general purposes of health promotion research that fall within these two stages,
and delineate appropriate research designs that are used to achieve each purpose.
Examples from the health promotion research literature will be interjected to
illustrate the respective purposes and designs.
75
76 Research Methods in Health Promotion
nced investigat
ratory investigations Adva e populatio ions,
E x p lo sam ns
R e p li c s,
R e fi n e d
investigation
s ated investigation
w it h
priority populations different populations
As these examples indicate, the research question involves investigations that will
describe the issue. Depending on the outcomes, this type of research usually is
viewed as a springboard for further inquiry.
Choosing a Research Design 77
• How many new cases of HIV infection occurred among men who have sex
with men (MSM)?
• How many new cases of AIDS were diagnosed among injection drug users in
the Northeastern United States?
• What are the risk factors for HIV infection among African American women?
• What are the psychological and psychosocial effects of HIV diagnosis among
adolescents?
• Does social support protect against negative effects of HIV diagnosis among
homeless women?
• How well does a sexual risk-reduction program reduce unsafe HIV-related
behaviors as compared with an abstinence program?
• What are the long-term side effects of Highly Active Anti-Retroviral Therapy?
• What are the side effects of an HIV vaccine among black men?
78 Research Methods in Health Promotion
As these examples illustrate, conclusive research has clearly defined research ques-
tions. In comparison with exploratory research, the process is more formal and
structured. Findings inform hypothesis testing and decision making.
Both exploratory and conclusive research can be further subdivided into two
major categories of research purpose: descriptive and causal research. Descriptive research
provides data describing the “who, what, when, where, and how” of a health issue,
not what caused it. Therefore, studies that reveal who either is at risk or has a
particular health issue or condition; that document incidence, prevalence, or both;
that examine risk factors; that look at the effects of having the health issue;
and that assess scale properties designed to measure a construct related to a
health issue would be categorized as descriptive research. The main limitation
with descriptive research is that it cannot assess what caused the health issue. This
is where causal research steps in. Determining a cause-and-effect relationship is
imperative in situations in which the investigator must reveal the true cause(s) of
a particular disease or when evaluating whether or not a program caused the
observed changes.
◆ ◆ ◆
In a Nutshell
Determining a cause-and-effect relationship is imperative in situations in which the investigator
must reveal the true cause(s) of a particular disease or when evaluating whether or not a program
caused the observed changes.
◆ ◆ ◆
Descriptive Research
Descriptive research, whether exploratory or conclusive, uses observational designs
to achieve its objectives. The term observational research designs refers to research in
which variables are observed as they exist in nature—no manipulation of vari-
ables occurs. Furthermore, observational research can involve both qualitative meth-
ods and quantitative methods. Observational research that uses qualitative methods
does not entail the use of specific research designs per se; rather, depending on
the research question, different strategies are used (Morse, 1994). A few qualitative
strategies are phenomenology, ethnography, and grounded theory. Observational
research that uses quantitative methods would entail the use of cross-sectional designs,
successive independent samples design, and longitudinal designs (as noted in Chapter One,
these also include prospective cohort designs and panel studies).
Qualitative research methods and strategies are described in greater depth in
Chapter Seven; however, in simple terms, qualitative data consists of observations
Choosing a Research Design 79
that do not take the form of numbers. For example, conducting interviews with
people is one type of qualitative method used. The data collected consist of verbal
responses to either structured or semistructured questions. These responses may be
audiotaped and then transcribed. The investigator would conduct an analysis by
reviewing the transcriptions and identifying themes that emerged. All results are
conveyed through description. An example of a qualitative study that used
grounded theory as the strategy and face-to-face interviews as the method was con-
ducted by Melville and others (2003). They conducted interviews with twenty-four
participants who had a positive herpes simplex virus type 2 (HSV-2) serology by
western blot to assess the emotional and psychosocial responses to receiving a pos-
itive diagnosis. They identified three categories of themes: short-term emotional
responses (for example, denial, surprise), short-term psychosocial responses (for
example, fear of telling sex partner, anger at sex partner), and perceived ongoing
responses (for example, feeling socially stigmatized, feeling sexually undesirable).
The results were used to conceptualize a complex model involving these themes as
interrelated constructs that could be tested quantitatively in the future.
Conversely, the nature of quantitative data involves numerical values. Data
can be frequency of responses or occurrences, or the data can entail a process
whereby participants’ verbal or written responses are quantified by transforming
them into numerical values. Analyses of quantitative methods require the use of
statistical procedures. One example of a quantitative approach would be a sur-
veillance study. For example, the Centers for Disease Control and Prevention
(CDC) is interested in knowing how many adolescents engage in certain health
risk behaviors. The Youth Risk Behavior Surveillance System (YRBSS) monitors
six categories of priority health-risk behaviors among youths and young adults—
behaviors that contribute to unintentional injuries and violence; tobacco use;
alcohol and other drug use; sexual behaviors that contribute to unintended
pregnancy and sexually transmitted diseases (STDs), including human immun-
odeficiency virus (HIV) infection; unhealthy dietary behaviors; and physical in-
activity plus being overweight. Some results from the 2003 national Youth Risk
Behavior Survey (YRBS) demonstrated that among the high school students
surveyed, 30.2 percent had ridden with a driver who had been drinking alcohol;
17.1 percent had carried a weapon; 44.9 percent had drunk alcohol; and 22.4 per-
cent had used marijuana (Centers for Disease Control and Prevention, 2004).
24 percent tested positive for one of four STDs: Chlamydia trachomatis, Neisseria
gonorrhoeae, Trichomonas vaginalis, or Treponema pallidum. In another exam-
ple, Varma and colleagues (2004) also used a cross-sectional design to estimate the
prevalence of ocular hypertension and open-angle glaucoma (OAG) in adult
Latinos. They found that among 6,142 Latinos, forty years of age or older who
lived in the Los Angeles area, and who underwent a complete ophthalmologic
examination, approximately 5 percent were diagnosed with OAG and almost
4 percent had ocular hypertension. Moreover, Varma and colleagues made com-
parisons by age and gender. They found no gender differences, but significantly
higher prevalence rates for both conditions were found in older Latinos than in
younger Latinos.
In addition to assessing prevalence of disease, cross-sectional designs can be used
to estimate levels of knowledge about any given health threat or health protective
behavior, and health-related attitudes, beliefs, opinions, and behaviors. For exam-
ple, Lapidus and colleagues (2002) conducted a mail survey among pediatric pri-
mary care physicians to determine rates of screening practices for domestic violence
(DV)—the behavioral outcome in this study. Twelve percent of the respondents
reported routinely screening for DV at all well-child care visits, 61 percent reported
screening only certain patients, and 30 percent said they did not screen for DV at all.
◆ ◆ ◆
In a Nutshell
It is the manner in which you collect the data and not the statistical technique that allows one to
make causal inferences.
◆ ◆ ◆
◆ ◆ ◆
In a Nutshell
Although cross-sectional designs are used frequently in correlational research, the design is limited
in inferring causation. The design is also unable to establish directionality.
◆ ◆ ◆
cigarette smoking, being sexually active, and being physically inactive. Clearly,
the authors of this study were not attempting to imply a cause-and-effect rela-
tionship, but rather, they were trying to reveal a cluster of health risk behaviors to
address comprehensively in a health promotion program.
A successive independent samples design improves upon the time limitation of cross-
sectional designs by incorporating a series of cross-sectional studies conducted
over a period of time. Each cross-sectional survey is conducted with an indepen-
dent sample, which means that a new sample is drawn for each of the successive
cross-sectional surveys. This design is used to assess change in a population char-
acteristic (for example, tobacco use, condom use, pregnancy rates) over time and
is also referred to as a trend study.
As health promotion researchers for example, we may want to document the
change in certain health risk behaviors or the prevalence of disease for a given
population over a period of time. This would be very important information to
have and could inform programs and policy. In fact, this type of design is used
quite often in epidemiology—the discipline studying the incidence, distribution, and
control of disease in a population.
The CDC conducts and sponsors many studies using the successive inde-
pendent samples design. One example is the CDC’s serosurveillance system, which
is used to monitor the prevalence of human immunodeficiency virus type 1
(HIV-1) in the United States. In collaboration with state and local health depart-
ments, surveys were conducted in selected sites from 1988 through 1999. Popu-
lations that were included ranged from low risk (for instance, military, blood
donors) to high risk (such as injection drug users and men who have sex with men
(MSM)). The objectives of the serosurveillance system are (1) to provide stan-
dardized estimates of HIV prevalence among the selected populations, (2) to
describe the magnitude and changes over time of HIV infection in these popu-
lations, (3) to recognize new or emerging patterns of HIV infection among specific
subgroups of the U.S. population, and (4) to assist in directing resources and in
targeting programs for HIV prevention and care. The results showed that in
general, prevalence was higher among survey participants who were in the older
age categories and, with the exception of Job Corps entrants, among those who
were male. For most of the surveillance populations included in the report, preva-
lence by region, race or ethnicity, and age group either decreased or remained
stable from 1993–1997. For the MSM population, these results are depicted in
graph form and show trends in HIV prevalence by region (Figure 4.2) and race
or ethnicity (Figure 4.3).
The CDC also uses this type of design to track changes in behaviors over time.
For example, Serdula and others (2004) examined trends in fruit and vegetable
consumption among adults living in the United States from 1994 to 2000. Using
Choosing a Research Design 85
45
HIV Prevalence (% positive) 40
35
30
25
20
15
Northeast (n = 1,914)
10 South (n = 3,410)
5 West (n = 7,155)
0
1993 1994 1995 1996 1997
45
40
HIV Prevalence (% positive)
35
30
25
20
15
Black (n = 3,122)
10
Hispanic (n = 2,559)
5 White (n = 6,912)
0
1993 1994 1995 1996 1997
Note: Standardized to 1993 STD clinic population by region and age group.
were found, then you could reasonably attribute them to the event. Of course, one
major problem with longitudinal studies is attrition. Also referred to as mortality,
attrition is the loss of participants from the research study and can bias the results.
Severe attrition can be dealt with statistically by comparing those who returned
for follow-up interviews (or completed follow-up questionnaires) with those who
did not and determining critical differences between the two groups. For exam-
ple, in sex research, if those lost to attrition had baseline indicators of more fre-
quent risky sexual activity than those who remained in the study, then the loss to
follow-up could bias the study conclusions.
◆ ◆ ◆
In a Nutshell
Longitudinal designs improve vastly on some of the limitations of cross-sectional designs and of
the successive independent samples design.
◆ ◆ ◆
time, another cohort or cohorts are then added at varying intervals within that
time period. Changes over time for the first cohort can be assessed while also being
able to make cross-sectional comparisons among the different cohorts at specific
time points.
An example of cohort-sequential design was conducted by Chassin, Presson,
Rose, and Sherman (2001). They reported the results from a study designed to as-
sess age-related changes in health-relevant beliefs from the middle school years
through age thirty-seven. Participants were initially recruited between 1980 and
1983, and the sample comprised sixth through twelfth graders in a county school
system. Participants completed a survey each year. The final sample included
the participation of ten cohorts (that is, the graduating classes of 1980–1989). Fol-
low-up surveys were conducted in 1987, 1993, and 1999 with retention rates of
73 percent, 73 percent, and 71 percent respectively. The results showed system-
atic age-related changes in the perceived risks of cigarette smoking.
Causal Research
As compared with descriptive research, causal research transcends the natural
order of things through manipulation and provides an ability to make cause-effect
statements. Thus, causal research employs experimental designs so that causal infer-
ence can be made with a high degree of certainty, and in some instances quasi-
experimental designs. Experimental designs entail the manipulation of a variable to
test the effects of the manipulation on some outcome. Quasi-experimental designs
approximate experimental designs but do not use randomization. Why should we
care about making cause-effect statements? Making cause-effect statements pro-
vides both theoretical and practical benefits. For example, by isolating the cause
of a condition or disease, we may begin work on devising a cure. In addition,
causal research allows us to find out if a particular program is helpful in solving a
problem. This type of research is usually conducted in controlled settings with
small groups of people. If the treatment or program is efficacious, meaning it had
the ability to produce the desired effect, then it can be applied to a larger group
in real-life settings. Within the context of health promotion research, causal
research allows us not only to predict, understand, and control behavior, but
also to change behavior.
The two main defining characteristics of experimental research designs are
the manipulation of an independent variable and control over extraneous variables. The
independent variable is the variable chosen by the investigator to determine its
effect on the dependent variable and is manipulated or controlled by the investigator.
If the investigator is manipulating only one factor, then the design is a single-factor
design; if more than one factor, it is called a factorial design. The number of
Choosing a Research Design 89
independent variables equates with the number of factors in the design. The
dependent variable is the outcome variable of interest and is measured by the in-
vestigator. Extraneous variables are not directly related to the hypothesis being
tested, but may nevertheless have an effect on the dependent variable. You may
control the effects of extraneous variables either by holding them constant or by
randomizing their effects across treatment levels. To the extent that extraneous
variables provide alternative explanations or rival hypotheses for the findings, the
internal validity of the study will be threatened.
The internal validity of the study is the ability of the design to test the
hypothesis it was designed to test. Internal validity can be thought of as the ap-
proximate truth about inferences regarding cause-effect or causal relationships.
Seven common threats to internal validity have been identified and deserve con-
sideration when designing your study: history, maturation, testing, instrumenta-
tion, statistical regression, biased selection of subjects, and experimental mortality
(Campbell and Stanley, 1966). These threats to internal validity are defined in
Box 4.2. Common threats that may seriously threaten internal validity must be
addressed prior to conducting your study so that alternative hypotheses can
be ruled out.
Before the different types of experimental designs are discussed, an impor-
tant and related factor to understand when planning health promotion research
is that conducting experiments in a controlled laboratory with pathogens is
markedly different from conducting experiments with human subjects in the
real world. Although controlling for extraneous variables is critical to the integrity
of an experiment, the level of control may vary widely when the experiment takes
control variables that could be introduced into the selection process and could
cause not only low generalizability but also selection bias. Generalizability is the degree
to which the research findings can be applied to the population under study (for
example, black men). Using random selection helps to increase the representa-
tiveness of the sample thereby enhancing generalizability. As indicated in Box 4.2,
selection bias results when groups differ from each other. Differences could be on
either measured or unmeasured characteristics. Selection bias also occurs when
the sample is not representative of the population. If random selection is used,
then selection bias should not be an issue unless the response rate or the rate in
which participants agreed to be in the study was extremely low. Furthermore, if
random selection is not possible, then a nonrandom sampling technique such as
convenience sampling (using individuals who are convenient to access) or snowball sam-
pling (using recruited participants to identify others) would have to be used and
would not reduce selection bias in the sample. Thus, if participants differ in ways
that affect the dependent variable (side effects of HIV vaccine), then the results
might be attributed to these differences rather than to the vaccine.
Once the sample is recruited, the next factor to consider is how to assign the
subjects into the two groups. Other sources of error variance involving the subjects
such as socioeconomic status, age, and education level, to name a few, need to be con-
trolled. One method that would control (in this instance, by “control” we mean hold
the effects of these variables constant) these sources of error variance would be to use
random assignment (a.k.a. randomization) as the method to assign participants into
respective groups. Randomization entails assigning participants by chance to one of
two or more groups. There are different methods, such as using a random numbers
table, using a software program, or even flipping a coin. Randomization minimizes
any differences between or among groups by equally distributing subjects with par-
ticular characteristics between or among all the trial arms. Randomization rules
out many threats to internal validity, but not all of them.
◆ ◆ ◆
In a Nutshell
Randomization entails assigning participants by chance to one of two or more groups. There
are different methods, such as using a random numbers table, using a software program, or even
flipping a coin. Randomization minimizes any differences between or among groups by equally
distributing subjects with particular characteristics between or among all the trial arms.
◆ ◆ ◆
92 Research Methods in Health Promotion
The second and probably the most important issue of control is the spe-
cific experimental design chosen. There are different ways in which this specific
research question could be answered. The various designs that are used to conduct
causal research vary in quality of evidence they provide for making causal infer-
ence and in the level of control. Moreover, there are distinctions between exper-
imental designs, “true” experimental designs, and quasi-experimental designs that
further influence the level of control and the ability to attribute changes in the
dependent variable to the manipulation of the independent variable.
In the remaining section of this chapter, we will describe the various
experimental and quasi-experimental research designs that make up causal
research along with examples drawn from the scientific literature. We will pro-
vide an overview of the related concepts and terminology. Because the focus of
this book is on the research methods used in health promotion and practice,
we describe designs and methods related to health promotion. For example,
research concepts or methods used in the study of pathogen isolation (such as
Koch’s Postulates), albeit important, are outside the scope of this text and will
not be covered.
efficacy of a program or treatment because it allows for causal inference and has
the highest level of control possible in a real-world setting.
The number of levels of the independent variable dictates the number of
arms to be employed in an RCT. For example, if you were examining the effi-
cacy of an HIV-preventive intervention relative to a general health intervention,
then you would require two groups; if you were examining the efficacy of an
HIV-preventive intervention and an HIV-preventive intervention with booster
sessions relative to the general health intervention, then you would need three
groups. An example of a two-arm RCT was a study conducted by Shrier and
others (2001), in which they evaluated a theory-driven safer-sex intervention
for high-risk girls with cervicitis or pelvic inflammatory disease (N ⫽ 123). The
clinical trial took place in an urban children’s hospital adolescent clinic and
inpatient service. The intervention was administered by trained female health
educators during the time when the adolescents received STD treatment.
The adolescent participants were not randomly selected to participate, but they
were randomly assigned to the intervention group or to the control group. The
intervention included topics such as STD transmission, secondary abstinence,
and the female condom. Difference between groups on the behavioral outcomes
was not statistically significant at the P ⬍ .05 level (likely due in part to small
sample size).
Another type of between-subjects design, which may be useful in health pro-
motion research, is the matched groups design. This design is employed when the re-
searcher is aware of a particular subject characteristic that is strongly associated with
the dependent variable. Using randomization, the characteristic may not get equally
distributed between groups. Thus, to ensure that the characteristic is distributed
equally, subjects are matched on the characteristic prior to random assignment.
Depending on the number of groups in the experiment, subjects are matched with
other subjects as close as possible to form pairs (two groups), triplets (three groups),
or even quadruplets (four groups). Then, taking each set, subjects are assigned at
random into groups. The main advantage of this design is significant reduction of
the effect of the matched characteristic on the dependent variable. One example
would be matching adolescents enrolled in a two-arm sexual-risk-reduction inter-
vention on alcohol and drug use prior to randomization into groups. In this instance,
prior to randomization, subjects would be measured on their alcohol and drug
use. Subjects who reported similar behavior would be paired together. Each subject
would be randomly assigned to one of the two conditions.
Within health promotion research, often the use of this design involves the
site versus individual subjects as the unit of assignment (for instance, school, com-
munity, workplace, and so on). For example, Emmons, Linnan, Shadel, Marcus,
94 Research Methods in Health Promotion
are assessed at multiple time points (in other words, an indicator of a within-
subjects design). In this instance, the within-subjects factor is “time” with the
number of levels corresponding to the number of time points. This is slightly dif-
ferent from the within-subjects design described previously, in which the within-
subjects factor was the administration of all possible levels of the treatment to
the same group of subjects. Thus, in a mixed design, comparisons can be made
not only between groups and within groups, but also for the interaction of the
two factors.
The RCT is considered a mixed design when subjects are assessed at multi-
ple time periods. For example, in simplest terms, implementation of an RCT
would involve (1) assessing subjects at baseline, (2) randomizing subjects into
groups, (3) exposing subjects to the respective treatment, (4) assessing subjects
after completion of the treatment (could be immediately following or at a follow-
up period), and (5) assessing them again at subsequent follow-up period(s).
Analysis goals for this design could include determining overall differences
between groups (main effect for group), overall differences within groups (main
effect for time), or whether or not subjects in the treatment group changed
differentially over time as compared with subjects in the control group (an
interaction of group and time).
St. Lawrence, Wilson, Eldridge, Brasfield, and O’Bannon (2001) applied a
mixed design to test the efficacy of an HIV-preventive intervention based on the
theory of gender and power, an HIV-preventive intervention based on social
learning theory, and an HIV-preventive intervention based on cognitive behav-
ioral principles relative to a wait-list control condition. The population for the
evaluation study was disadvantaged African American women, and the sam-
ple of 445 was drawn from several community venues. Women were given base-
line assessments and then randomly assigned into one of four groups: one of
the three theoretically derived programs or the wait list control. Thus, this study
had four levels of the first factor. Assessments were also made at postinterven-
tion and at a one-year follow-up period indicating three levels of the second
factor. The results showed that at one-year follow-up, although there was no dif-
ference in condom use among the three treatment groups, in all of the three
treatment conditions, condom use increased relative to the wait list control
group.
◆ ◆ ◆
In a Nutshell
Because of this nonequivalence of groups at the start, the NCGD is especially susceptible to the
selection threat to internal validity.
◆ ◆ ◆
98 Research Methods in Health Promotion
Two main types of designs fall under the general category of NCGD: the
nonequivalent group, posttest only and the nonequivalent group, pretest-posttest.
The nonequivalent, posttest-only design consists of administering an outcome
measure to two groups such as a program-treatment group and a comparison
group or possibly two program groups, each receiving a different version of the
program. For example, one group of students at a high school might receive an
abstinence-only program while students at another school receive a comprehensive
sexual-risk-reduction program. After twelve weeks, a test measuring sexual debut
and risky sexual behaviors can be administered to see which program was more
effective. A major problem with this design is that as stated previously the two
groups might not be necessarily the same before the program takes place and
may differ in important ways that may influence the outcomes. For instance,
if it is found that the students in the abstinence-only program reported a delay
in sexual debut and less engagement in sexual risk behavior, there is no way of
determining if they were less likely to engage in those behaviors even before
the program or whether other factors (for example, level of parental monitor-
ing or parental communication about sex) may have influenced their sexual
behavior.
The nonequivalent group, pretest-posttest design improves on a major
limitation of the nonequivalent group, posttest-only design by allowing the re-
searcher to empirically assess differences in the two groups prior to implementa-
tion of the program. If the researcher finds that one group performs better than
the other on the posttest, then initial differences (if the groups were in fact simi-
lar on the pretest) can be ruled out as explanations for the differences. Further-
more, if in fact the groups did differ on the pretest measure, then the researcher
could potentially control for these differences statistically.
Bastani and others (2002) used a longitudinal nonequivalent control group
design to evaluate the effects of a systems-level intervention that sought to
increase cervical cancer screening among underserved women attending one of
several clinics composing the second largest County Health Department in the
United States. The intervention consisted of multiple strategies that involved
physician education, physician feedback, patient education, policy interventions,
and expanding the capacity to serve women better. The authors stated that
“logistical constraints, limited resources, and other practical consideration in the
County health care system largely dictated the choice of this design. Interven-
tion and control sites were selected in consultation with our partners in the
County Health Department. The system did not include sufficient sites to allow
for randomization to intervention and control conditions. Likewise, the structure
of the health care system made it impossible to randomize the intervention
Choosing a Research Design 99
to only a portion of the women receiving care at any one site” (Bastani and
others, 2002, p. 893). Intervention sites were a hospital, a comprehensive
health center (CHC), and three public health centers (PHCs) and were matched
on size, patient characteristics, and a range of services provided to another hos-
pital, a CHC, and three PHCs. Intervention effects were observed at the hospi-
tal and the CHC, where there was a significant increase in cervical cancer
screening.
The interrupted time series design (ITSD) is the strongest quasi-experimental
approach for evaluating the longitudinal effects of interventions. It involves
collecting data at multiple instances over time before and after a treatment of some
kind. The treatment can be a health promotion program, implementation of a
new health policy, or a naturally occurring event. Data used in a time series design
can range from individual behavior to cumulative incidence of sexually trans-
mitted diseases. It does require knowledge about the specific time in which the
program or intervention occurred in the time series. An advantage of collecting
a series of observations both before and after the treatment is that a reliable pic-
ture regarding the outcome of interest can be gleaned. Thus, the time series design
is sensitive to trends in performance. The aim is to determine whether or not
the intervention had an effect over and above any trend present in the data. For
example, the data collected prior to the treatment may reveal several trends in the
data such as a maturational trend (in which observations involve human subjects
who are changing their behavior as they age) or a seasonal trend (in which
observations are influenced by the season in which data are collected). These trends
can be identified and assessed, and treatment effects ascertained. A treatment effect
is demonstrated only if the pattern of posttreatment responses differs significantly
from the pattern of pretreatment responses. That is, the treatment effect is demon-
strated by a discontinuity in the pattern of pretreatment and posttreatment
responses. For example, an effect is demonstrated when there is a change in
the level or slope of the posttreatment responses as, or both, compared with the
pretreatment responses.
The most basic of the ITSDs is the simple interrupted time series, in which
there is only one experimental group. This design can be diagrammed as follows:
O1 O2 O3 O4 Treatment O5 O6 O7 O8
The greatest threat to the internal validity of this design is history, although other
threats such as instrumentation and selection can occur.
Adding a nonequivalent control group to the simple interrupted time series will
improve on some of the limitations mentioned. This design, called an interrupted
100 Research Methods in Health Promotion
O1 O2 O3 O4 Treatment O5 O6 O7 O8
O1 O2 O3 O4 O5 O6 O7 O8
As shown, a series of observations are collected both prior to and following the
administration of the treatment for the experimental group; during this same time
period, a series of observations are also collected for a comparison group. This
design allows the researcher to control for history effects, as a historical event would
most likely affect both treatment and control groups equally. What may be prob-
lematic, however, is when one group experiences a unique set of events. This
type of threat is deemed an interaction between selection and history and is called
a selection-history effect. It is important to note that for a historical event to
pose a threat to internal validity, the event must occur at the same time as the
intervention or treatment. Other threats to the internal validity are similar to
the basic NCGD, with selection being the greatest threat and the interaction of
selection with threats other than history such as selection-maturation, selection-
instrumentation, and selection-regression.
Wong, Chan, and Koh (2002) used a simple ITSD to examine the impact of
a sexual-risk-reduction program that encouraged condom use for oral sex among
female brothel-based sex workers in Singapore. The study observed two outcomes:
condom use trends and the pharyngeal gonorrhea trends. Comparisons on the
two study outcomes were made across four time periods that corresponded to
different phases of the intervention: a preintervention period of two years; an
intervention phase of two years; a postintervention phase of one year when
activities were withdrawn; and a follow-up period of one year when activities were
resumed and incorporated into the brothel’s health education policy for its sex
workers. The results indicated that for the preintervention phase, consistent con-
dom use for oral sex was stable at below 50 percent. Following the intervention,
condom use increased significantly. During the postintervention phase of one year,
when activities were withheld from brothels, condom use leveled off and decreased
to 79.7 percent. In the last year, when activities were institutionalized, condom use
increased significantly to 89.9 percent. The significant changing trend in condom
use was mirrored by changes in pharyngeal gonorrhea incidence rates. Preinter-
vention, pharyngeal gonorrhea incidence rates were stable and remained high
(12.4 to 16.6 per 1000 person-months). During the two-year intervention phase,
the pharyngeal gonorrhea incidence rate significantly decreased to 3.3 per 1000
person-months, but was followed by an increase when program activities were
Choosing a Research Design 101
withdrawn. This increase was in tandem with the decrease in condom use during
this same phase. During the final phase, when condom use increased significantly,
there was also a major decline in the pharyngeal gonorrhea incidence rate, from
14.4 to 4.0 per 1000 person-months.
issues may greatly facilitate the implementation of the research (step 7), and, con-
sequently, assist in the testing of the research hypotheses and the dissemination
of the results (step 9).
Summary
All of the designs described in this chapter have associated strengths, limitations,
and varying levels of rigor. The important thing to know is the strengths and lim-
itations of each research design, and how these may affect the study results and
the interpretation of those results. Thus, perhaps the most important decision to
make is the selection of a research design. This decision should be examined care-
fully by every member of the research team, and the pros and cons of each option
should be weighed with great attention to rigor. Many of these pros and cons were
outlined in this chapter. It may be useful to begin by defining whether the research
is exploratory or conclusive, and then asking whether it is descriptive or causal.
From that point, it is critical to consider what resources are available to the proj-
ect so that the selected study design is not only functional but also feasible. In
addition, although implicit in this chapter, it is important to remain vigilant
regarding your obligation to design research studies that are ethical in nature and
that do not violate the rights or well-being of the participants. Finally, it is
imperative that your research design optimizes the ability to provide a rigorous
and valid test of the research question(s) because your work may have great
potential to shape health promotion practice and policy and contribute another
link in the chain of research.
Issues to Consider
could benefit them greatly and prevent HIV infection in their baby. Was there
an alternative design that could have been used in this instance? What are the
ethical considerations of using an experimental design with a placebo-control
group?
2. The value of the cross-sectional study is not always recognized among
researchers. At one extreme, researchers might say that “cross-sectional stud-
ies are only good for hypothesis generation.” At the other extreme, researchers
might say that “cross-sectional studies could be very useful, even to the point
of supporting causal relationships.” Because such a large volume of health
promotion research is based on data collected using cross-sectional research
designs, this issue of “value” is indeed an important one. Thus, perhaps a crit-
ical question to ask is, Under what circumstances can findings from a cross-
sectional study be used to test a hypothesis?
3. Earlier in this chapter it was noted that within-subjects designs (also known as
the repeated-measures design) actually has less within-subject error variance
than a between-groups design. Alternatively, it was noted that the lack of a
control group implies that the within-subjects design is not a true experimental
design. Given that the use of a control group may not always be feasible (due
to limited research funds) or ethical (due to the potentially highly effective
nature of the HPP and the lack of any acceptable attention-control condition),
could a within-subjects design ever be preferable? If so, under what conditions
would this be so? Also, what types of research questions would be the most
appropriate for this design? Conversely, what types of research questions would
be the least appropriate for this design?
2. Please repeat the exercise outlined in number 1, but this time assume that you
have sufficient funds to conduct an experimental study.
3. Working with at least one colleague, please consider (and list) the advantages
and disadvantages of random assignment. Please rank each advantage on a
scale from one to five (with five being the most extreme level of concern). Sim-
ilarly, please rank each disadvantage on a scale from one to five (with five being
the most extreme level of “gain” resulting from randomization). Next, please
find another pair of colleagues who have completed this same exercise and
compare the two sets of lists and rankings.
4. For each research goal listed below, please select the best research design that
could be applied to most effectively address the goal. Parsimony is an impor-
tant criteria. Be prepared to defend your answer.
A. To determine the efficacy of a three-session intervention program designed
to promote the consumption of a diet low in saturated fats among cardiac
rehabilitation patients
B. To test the hypothesis that childhood activity levels predict obesity in early
adulthood
C. To determine the relationship of sleep to depression in college students
D. To explore the role of self-efficacy in the decision to enroll in a smoking
cessation course
E. To test the hypothesis that fewer high school students are engaging in sex
in 2004, compared with high school students in the 1980s
F. To assess whether a media program can promote the use of infant car seats
G. To identify the effectiveness of an alcohol use awareness program (note:
you have a strong suspicion that the assessment instrument used for alcohol
awareness is also likely to foster awareness—you would like to test this
suspicion as well)
References
Bastani, R., and others. (2002). Increasing cervical cancer screening among underserved
women in a large urban county health system: Can it be done? What does it take? Medical
Care, 40(10), 891–907.
Bhandari, N., and others. (2003). Effect of community-based promotion of exclusive
breastfeeding on diarrhoeal illness and growth: A cluster randomized controlled trial.
Lancet, 361(9367), 1418–1423.
Campbell, D. T., and Stanley, J. C. (1966). Experimental and quasi-experimental designs for research.
Chicago: Rand McNally.
Centers for Disease Control and Prevention. (2004, May 21). Surveillance Summaries.
Morbidity and Mortality Weekly Report, 53(SS-2).
Choosing a Research Design 105
Chassin, L., Presson, C. C., Rose, J. S., and Sherman, S. J. (2001). From adolescence to
adulthood: Age-related changes in beliefs about cigarette smoking in a Midwestern
community sample. Health Psychology, 20(5), 377–386.
DiClemente, R. J., and others. (2004). A descriptive analysis of STD prevalence among
urban pregnant African-American teens: Data from a pilot study. Journal of Adolescent
Health, 34(5), 376–383.
Emmons, K. M., Linnan, L. A., Shadel, W. G., Marcus, B., and Abrams, D. B. (1999). The
Working Healthy Project: A worksite health-promotion trial targeting physical activity,
diet, and smoking. Journal of Occupational & Environmental Medicine, 41(7), 545–555.
Finn, J. (2004). A survey of online harassment at a university campus. Journal of Interpersonal
Violence, 19(4), 468–483.
Glasgow, R. E., Vogt T. M., and Boles, S. M. (1999). Evaluating the public health impact of
health promotion interventions: The RE-AIM framework. American Journal of Public Health,
89, 1322–1327.
Hall, H. I., Jones, S. E., and Saraiya, M. (2001). Prevalence and correlates of sunscreen use
among U.S. high school students. Journal of School Health, 71(9), 453–457.
Huurre, T. M., Aro, H. M., and Jaakkola, J. J. (2004). Incidence and prevalence of asthma
and allergic rhinitis: A cohort study of Finnish adolescents. Journal of Asthma, 41(3),
311–317.
Klein, E., Campbell, J., Soler, E., and Ghez, M. (1997). Ending domestic violence: Changing public
perceptions/Halting the epidemic. Thousand Oaks, CA: Sage.
Kulczycki, A., Kim, D. J., Duerr, A., Jamieson, D. J., and Macaluso, M. (2004). The
acceptability of the female and male condom: A randomized crossover trial. Perspectives
on Sexual & Reproductive Health, 36(3), 114–119.
Lapidus, G., and others. (2002). A statewide survey of domestic violence screening behaviors
among pediatricians and family physicians. Archives of Pediatrics 7 Adolescent Medicine, 156(4),
332–336.
Melville, J., and others. (2003). Psychosocial impact of serological diagnosis of
herpes simplex virus 2: A qualitative assessment. Sexually Transmitted Infections, 79,
280–285.
Morse, J. (1994). Designing funded qualitative research. In N. K. Denzin and J. S. Lincoln
(Eds.), Handbook of qualitative research (pp. 220–235). Thousand Oaks, CA: Sage.
Repetto, P. B., Zimmerman, M. A., and Caldwell, C. H. (2004). A longitudinal study of the
relationship between depressive symptoms and alcohol use in a sample of inner-city black
youth. Journal of Studies on Alcohol, 65(2), 169–178.
Salazar, L. F., and Cook, S. L. (2002). Violence against women: Is psychology part of the
problem or the solution? A content analysis of psychological research from 1990 through
1999. Journal of Community and Applied Social Psychology, 12, 410–421.
Salazar, L. F., and others. (2004). Self-concept and adolescents’ refusal of unsafe sex:
A test of mediating mechanisms among African American girls. Prevention Science,
5(3), 137–149.
Serdula, M. K., and others. (2004). Trends in fruit and vegetable consumption among adults
in the United States: Behavioral risk factor surveillance system, 1994–2000. American
Journal of Public Health, 94(6), 1014–1018.
Shrier, L., and others. (2001). Randomized controlled trial of a safer sex intervention
for high-risk adolescent girls. Archives of Pediatrics and Adolescent Medicine, 155(1),
73–79.
106 Research Methods in Health Promotion
St. Lawrence, J. S., Wilson, T. E., Eldridge, G. D., Brasfield, T. L., and O’Bannon, R. E. III.
(2001). Community-based interventions to reduce low-income, African American
women’s risk of sexually transmitted diseases: A randomized controlled trial of three
theoretical models. American Journal of Community Psychology, 29(6), 937–964.
Varma, R., and others. (2004). Prevalence of open-angle glaucoma and ocular hypertension
in Latinos: The Los Angeles Latino Eye Study. Ophthalmology, 111(8), 1439–1448.
Wong, M. L., Chan, R. K., and Koh, D. (2002). Promoting condoms for oral sex: Impact
on pharyngeal gonorrhea among female brothel-based sex workers. Sexually Transmitted
Diseases, 29(6), 311–318.