Académique Documents
Professionnel Documents
Culture Documents
The National Reading Panel wishes to express its gratitude to the following individuals for their contributions
to its effort.
Acknowledgments .................................................................................................................. ii
Chapter 2: Alphabetics
Part I: Phonemic Awareness Instruction
Executive Summary ....................................................................................................... 2-1
Report ........................................................................................................................... 2-9
Appendices ................................................................................................................... 2-53
Part II: Phonics Instruction
Executive Summary ....................................................................................................... 2-89
Report ........................................................................................................................... 2-99
Appendices ................................................................................................................... 2-145
Chapter 3: Fluency
Executive Summary ............................................................................................................. 3-1
Report .................................................................................................................................. 3-5
Appendices .......................................................................................................................... 3-35
Chapter 4: Comprehension
Executive Summary ............................................................................................................. 4-1
Introduction .......................................................................................................................... 4-11
Part I. Vocabulary Instruction
Report ........................................................................................................................... 4-15
Appendices ................................................................................................................... 4-33
Part II: Text Comprehension Instruction
Report ........................................................................................................................... 4-39
Appendices ................................................................................................................... 4-69
Part III. Teacher Preparation and Comprehension Strategies Instruction
Report ........................................................................................................................... 4-119
Appendices ................................................................................................................... 4-133
Minority View
understanding of reading issues through a thorough • The importance of phonemic awareness, phonics,
analysis of the findings of the NRC report, Preventing and good literature in reading instruction, and the
Reading Difficulties in Young Children (Snow, Burns, & need to develop a clear understanding of how best
Griffin, 1998). Early in its deliberations the Panel made to integrate different reading approaches to
a tentative decision to establish subgroups of its enhance the effectiveness of instruction for all
members and to assign to each subgroup one of the students;
major topic areas designated by the NRC Committee as • The need for clear, objective, and scientifically
central to learning to read—Alphabetics, Fluency, and based information on the effectiveness of different
Comprehension. types of reading instruction and the need to have
Regional Public Hearings such research inform policy and practice;
• The importance of applying the highest standards of
As part of its information gathering, the Panel publicly scientific evidence to the research review process so
announced, planned, and held regional hearings in that conclusions and determinations are based on
Chicago, IL (May 29,1998), Portland, OR (June 5, findings obtained from experimental studies
1998), Houston, TX (June 8, 1998), New York, NY characterized by methodological rigor with
(June 23, 1998), and Jackson, MS (July 9, 1998). The demonstrated reliability, validity, replicability, and
Panel believed that it would not have been possible to applicability;
accomplish the mandate of Congress without first
hearing directly from consumers of this information— • The importance of the role of teachers, their
teachers, parents, students, and policymakers—about professional development, and their interactions and
their needs and their understanding of the research. collaborations with researchers, which should be
Although the regional hearings were not intended as a recognized and encouraged; and
substitute for scientific research, the hearings gave the • The importance of widely disseminating the
Panel an opportunity to listen to the voices of those who information that is developed by the Panel.
will need to consider implementation of the Panel’s
findings and determinations. The regional hearings gave Adoption of Topics To Be Studied
members a clearer understanding of the issues important Following the regional hearings, the Panel considered,
to the public. discussed, and debated several dozen possible topic
As a result of these hearings, the Panel received oral and areas and then settled on the following topics for
written testimony from approximately 125 individuals or intensive study:
organizations representing citizens—teachers, parents, • Alphabetics
students, university faculty, educational policy experts,
and scientists—who would be the ultimate users and - Phonemic Awareness Instruction
beneficiaries of the research-derived findings and - Phonics Instruction
determinations of the Panel.
• Fluency
At the regional hearings, several key themes were • Comprehension
expressed repeatedly:
- Vocabulary Instruction
• The importance of the role of parents and other
- Text Comprehension Instruction
concerned individuals, especially in providing
children with early language and literacy experiences - Teacher Preparation and Comprehension
that foster reading development; Strategies Instruction
• The importance of early identification and • Teacher Education and Reading Instruction
intervention for all children at risk for reading • Computer Technology and Reading Instruction
failure;
In addition, because of the concern voiced by the public 7. Does teacher education influence how effective
at the regional hearings that the highest standards of teachers are at teaching children to read? If so, how
scientific evidence be applied in the research review is this instruction best provided?
process, the methodology subgroup was tasked to Each subgroup also generated several subordinate
develop a research review process including specific questions to address within each of the major questions.
review criteria. It should be made clear that the Panel did not consider
Each topic and subtopic became the subject of the work these questions and the instructional issues that they
of a subgroup composed of one or more Panel represent to be the only topics of importance in learning
members. Some Panel members served on more than to read. The Panel’s silence on other topics should not
one subgroup. (The full report of each subgroup is be interpreted as indicating that other topics have no
included in this volume.) The subgroups formulated importance or that improvement in those areas would
seven broad questions to guide their efforts in meeting not lead to greater reading achievement. It was simply
the Congressional charge of identifying effective the sheer number of studies identified by Panel staff
instructional reading approaches and determining their relevant to reading (more than 100,000 published since
readiness for application in the classroom: 1966 and more than 15,000 prior to 1966) that
precluded an exhaustive analysis of the research in all
1. Does instruction in phonemic awareness improve areas of potential interest.
reading? If so, how is this instruction best provided?
The Panel also did not address issues relevant to second
2. Does phonics instruction improve reading language learning, as this topic was being addressed in
achievement? If so, how is this instruction best detail in a new, comprehensive NICHD/OERI (Office of
provided? Educational Research and Improvement) research
3. Does guided repeated oral reading instruction initiative. The questions presented above bear on
improve fluency and reading comprehension? If so, instructional topics of widespread interest in the field of
how is this instruction best provided? reading education that have been articulated in a wide
range of theories, research studies, instructional
4. Does vocabulary instruction improve reading programs, curricula, assessments, and educational
achievement? If so, how is this instruction best policies. The Panel elected to examine these and
provided? subordinate questions because they currently reflect the
5. Does comprehension strategy instruction improve central issues in reading instruction and reading
reading? If so, how is this instruction best provided? achievement.
In an important action critical to its Congressional screen, and evaluate the relevant research to complete
charge, the NRP elected to develop and adopt a set of their respective reports. Moreover, the Panel also had to
rigorous research methodological standards. These arrive at findings that all or nearly all of the members of
standards, which are defined in this section, guided the the NRP could endorse. Common procedures, grounded
screening of the research literature relevant to each topic in scientific principles, helped the Panel to reach final
area addressed by the Panel. This screening process agreements.
identified a final set of experimental or quasi-
experimental research studies that were then subjected to Search Procedures
detailed analysis. The evidence-based methodological Each subgroup conducted a search of the literature using
standards adopted by the Panel are essentially those common procedures, describing in detail the basis and
normally used in research studies of the efficacy of rationale for its topical term selections, the strategies
interventions in psychological and medical research. employed for combining terms or delimiting searches,
These include behaviorally based interventions, and the search procedures used for each topical area.
medications or medical procedures proposed for use in
the fostering of robust health and psychological Each subgroup limited the period of time covered by its
development and the prevention or treatment of searches on the basis of relative recentness and how
disease. much literature the search generated. For example, in
some cases it was decided to limit the years searched to
It is the view of the Panel that the efficacy of materials the number of most recent years that would identify
and methodologies used in the teaching of reading and in between 300 to 400 potential sources. This scope could
the prevention or treatment of reading disabilities should be expanded in later iterations if it appeared that the
be tested no less rigorously. However, such standards nature of the research had changed qualitatively over
have not been universally accepted or used in reading time, if the proportion of useable research identified was
education research. Unfortunately, only a small fraction small (e.g., less than 25%), or if the search simply
of the total reading research literature met the Panel’s represented too limited a proportion of the total set of
standards for use in the topic analyses. identifiable studies. Although the number of years
With this as background, the Panel understood that searched varied among subgroup topics, decisions
criteria had to be developed as it considered which regarding the number of years to be searched were made
research studies would be eligible for assessment. There in accord with shared criteria.
were two reasons for determining such guidelines or The initial criteria were established to focus the efforts
rules a priori. First, the use of common search, of the Panel. First, any study selected had to focus
selection, analysis, and reporting procedures would directly on children’s reading development from
ensure that the Panel’s efforts could proceed, not as a preschool through grade 12. Second, the study had to be
diverse collection of independent—and possibly published in English in a refereed journal. At a
uneven—synthesis papers, but as parts of a greater minimum, each subgroup searched both PsycINFO and
whole. The use of common procedures permitted a ERIC databases for studies meeting these initial
more unified presentation of the combined methods and criteria. Subgroups could, and did, use additional
findings. Second, the amount of research synthesis that databases when appropriate. Although the use of a
had to be accomplished was substantial. Consequently,
the Panel had to work in diverse subgroups to identify,
minimum of two databases identified duplicate working set of studies. The core working sets of studies
literature, it also afforded the opportunity to expand gathered by the subgroups were then coded as described
perspective and locate articles that would not be below and then analyzed to address the questions posed
identifiable through a single database. in the introduction and in the charge to the Panel.
Identification of each study selected was documented If a core set of studies identified by the subgroup was
for the record and each was assigned to one or more insufficient to answer critical instructional questions,
members of the subgroup who examined the title and less recent studies were screened for eligibility for, and
abstract. Based on this examination, the subgroup inclusion in, the core working sets of studies. This
member(s) determined, if possible at this stage, whether second search used the reference lists of all core studies
the study addressed issues within the purview of the and known literature reviews. This process identified
research questions being investigated. If it did not, the cited studies that could meet the Panel’s methodological
study was excluded and the reason(s) for the exclusion criteria for inclusion in the subgroups’ core working sets
were detailed and documented for the record. If, of studies. Any second search was described in detail
however, it did address reading instructional issues and applied precisely the same search, selection,
relevant to the Panel’s selected topic areas, the study exclusion, and inclusion criteria and documentation
underwent further examination. requirements as were applied in the subgroups’ initial
searches.
Following initial examination, if the study had not been
excluded in accord with the preceding criteria, the full Manual searches, again applying precisely the same
study report was located and examined in detail to search, selection, exclusion and inclusion criteria, and
determine whether the following criteria were met: documentation requirements as were applied in the
subgroups’ electronic searches, were also conducted to
• Study participants must be carefully described (age;
supplement the electronic database searches. Manual
demographics; cognitive, academic, and behavioral
searching of recent journals that publish research on
characteristics).
specific NRP subgroup topics was performed to
• Study interventions must be described in sufficient compensate for the delay in appearance of these journal
detail to allow for replicability, including how long articles in the electronic databases. Other manual
the interventions lasted and how long the effects searching was carried out in relevant journals to include
lasted. eligible articles that should have been selected, but were
• Study methods must allow judgments about how missed in electronic searches.
instruction fidelity was ensured.
Source of Publications: The Issue of
• Studies must include a full description of outcome Refereed and Non-Refereed Articles
measures.
The subgroup searches focused exclusively on research
These criteria for evaluating research literature are
that had been published or had been scheduled for
widely accepted by scientists in disciplines involved in
publication in refereed (peer reviewed) journals. The
medical, behavioral, and social research. The application
Panel reached consensus that determinations and
of these criteria increased the probability that objective,
findings for claims and assumptions guiding instructional
rigorous standards were used and that therefore the
practice depended on such studies. Any search or review
information obtained from the studies would contribute
of studies that had not been published through the peer
to the validity of any conclusions drawn.
review process but was consulted in any subgroups
If a study did not meet these criteria or could not be review was treated as separate and distinct from
located, it was excluded from subgroup analysis and the evidence drawn from peer-reviewed sources (i.e., in an
reason(s) for its exclusion was detailed and documented appendix) and is not referenced in the Panel’s report.
for the record. If the study was located and met the These non-peer-reviewed data were treated as
criteria, the study became one of the subgroup’s core preliminary/pilot data that might illuminate potential
• Whether participants were drawn from urban, • Pullout program (e.g., Reading Recovery©)
suburban, or rural setting • Tutorial
• List any pretests that were administered prior to
5. Design of Study
treatment
• List any special characteristics of participants • Random assignment of participants to treatments
including the following if relevant: (randomized experiment)
• Was trainers’ fidelity in delivering treatment • Were steps taken in statistical analyses to adjust for
checked? (yes/no) any lack of equivalence? (yes/no)
• Properties of teachers/trainers 9. Result (for each measure)
• Number of trainers who administered treatments • Record the name of the measure
• Teacher/student ratio: Number of participants to • Record whether the difference—treatment mean
number of trainers minus control mean—is positive or negative
• Type of trainer (classroom teacher, student teacher, • Record the value of the effect size including its sign
researcher, clinician, special education teacher, (+ or -)
parent, peer, other)
• Record the type of summary statistics from which
• List any special qualifications of trainers the effect size was derived
• Length of training given to trainers • Record number of people providing the effect size
• Source of training information
• Assignment of trainers to groups: 10. Coding Information
- Random • Record length of time to code study
- Choice/preference of trainer • Record name of coder
- All trainers taught all conditions If text was a variable, the coding indicated what is
• Cost factors: List any features of the training such as known about the difficulty level and nature of the texts
special materials or staff development or outside being used. Any use of special personnel to deliver an
consultants that represent potential costs intervention, use of special materials, staff development,
or other features of the intervention that represent
b. Moderator Variables potential cost were noted. Finally, various threats to
List and describe other nontreatment independent reliability and internal or external validity (group
variables included in the analyses of effects (e.g., assignment, teacher assignment, fidelity of treatment,
attributes of participants, properties or types of text). and confounding variables including equivalency of
subjects prior to treatment and differential attrition) were
7. Dependent (Outcome) Variables coded. Each subgroup also coded additional items
• List processes that were taught during training and deemed appropriate or valuable to the specific question
measured during and at the end of training being studied by the subgroup members.
• List names of reading outcomes measured A study could be excluded at the coding stage only if it
was found to have so serious a fundamental flaw that its
- Code each as standardized or investigator-
use would be misleading. The reason(s) for exclusion of
constructed measure
any such study was detailed and documented for the
- Code each as quantitative or qualitative measure record. When quasi-experimental studies were selected,
- For each, is there any reason to suspect low it was essential that each study included both pre-
reliability? (yes/no) treatment and post-treatment evaluations of performance
and that there was a comparison group or condition.
• List time points when dependent measures were
assessed Each subgroup conducted an independent re-analysis of
a randomly designated 10% sample of studies. Absolute
8. Nonequivalence of Groups
rating agreement was calculated for each category (not
• Any reason to believe that treatment/control group for forms). If absolute agreement fell below 0.90 for any
might not have been equivalent prior to treatments?
(yes/no)
category for occurrence or non-occurrence agreement, The subgroups weighted effect sizes by numbers of
the subgroup took some action to improve agreement subjects in the study or comparison to prevent small
(e.g., multiple readings with resolution, improvements in studies from overwhelming the effects evident in large
coding sheet). studies. Each subgroup used median and/or average
effect sizes when a study had multiple comparisons, and
Upon completion of the coding for recently published
only employed the comparisons that were specifically
studies, a letter was sent to the first author of the study
relevant to the questions under review by the subgroup.
requesting any missing information. Any information
that was provided by authors was added to the Expected Outcomes
database.
Analyses of effect sizes were undertaken with several
After its search, screening, and coding, a subgroup goals in mind. First, overall effect sizes of related studies
determined whether for a particular question or issue a were calculated across subgroups to determine the best
meaningful meta-analysis could be completed, or estimate of a treatment’s impact on reading. These
whether it was more appropriate to conduct a literature overall effects were examined with regard to their
analysis of that issue or question without meta-analysis, difference from zero (i.e., does the treatment have an
incorporating all of the information gained. The full effect on reading?), strength (i.e., if the treatment has an
Panel reviewed and approved or modified each decision. effect, how large is that effect?), and consistency (i.e.,
did the effect of the treatment vary significantly from
Data Analysis
study to study?). Second, the Panel compared the
When appropriate and feasible, effect sizes were magnitude of a treatment’s effect under different
calculated for each intervention or condition in methodological conditions, program contexts, program
experimental and quasi-experimental studies. The features, outcome measures, and for students with
subgroups used the standardized mean difference different characteristics. The appropriate moderators of
formula as the measure of treatment effect. The formula a treatment’s impact were drawn from the distinctions in
was: studies recorded on the coding sheets. In each case, a
statistical comparison was made to examine the impact
(Mt - Mc) / 0.5(sdt + sdc) of each moderator variable on average effect sizes for
where: each relevant outcome variable. These analyses enabled
the Panel to determine the conditions that alter a
Mt is the mean of the treated group, program’s effects and the types of individuals for whom
Mc is the mean of the control group, the program is most and least effective. Within-group
sdt is the standard deviation of the treated group, average effect sizes were examined, as were overall
and effect sizes, for differences from zero and for strength.
The analytic procedures were carried out using the
sdc is the standard deviation of the control group. techniques described in Cooper and Hedges (1994).
When means and standard deviations were not
available, the subgroups followed the guidelines for the
calculation of effect sizes as specified in Cooper and
Hedges (1994).
References
Cooper, H., & Hedges, L.V. (1994). The handbook Snow, C. E., Burns, S. M., & Griffin, P. (Eds.).
of research synthesis. New York: Russell Sage (1998). Preventing reading difficulties in young children.
Foundation. Washington, DC: National Academy Press.
1. Phoneme isolation, which requires recognizing PA is thought to contribute to helping children learn to
individual sounds in words, for example, “Tell me read because the structure of the English writing
the first sound in paste.” (/p/) system is alphabetic. Moreover, it is not easy to figure
out the system. Although most English words have
2. Phoneme identity, which requires recognizing the
prescribed spellings that consist of graphemes,
common sound in different words. For example,
symbolizing phonemes in predictable ways, being able to
“Tell me the sound that is the same in bike, boy, and
distinguish the separate phonemes in pronunciations of
bell.” (/b/)
words so that they can be matched to graphemes is
3. Phoneme categorization, which requires recognizing difficult. This is because spoken language is seamless;
the word with the odd sound in a sequence of three that is, there are no breaks in speech signaling where
or four words, for example, “Which word does not one phoneme ends and the next one begins. Rather,
belong? bus, bun, rug.” (rug) phonemes are folded into each other and are
coarticulated. Discovering phonemic units requires
4. Phoneme blending, which requires listening to a instruction to learn how the system works.
sequence of separately spoken sounds and
combining them to form a recognizable word. For Methodology
example, “What word is /s/ /k/ /u/ /1/?” (school)
How was the analysis of the research
5. Phoneme segmentation, which requires breaking a literature conducted?
word into its sounds by tapping out or counting the
sounds or by pronouncing and positioning a marker Before conducting a meta-analysis, the NRP
for each sound. For example, “How many systematically searched the research literature relevant
phonemes are there in ship? ” (three: /š/ /I/ /p/) to PA instruction. After a methodology established by
the Panel was followed, appropriate key words were
6. Phoneme deletion, which requires recognizing what entered to identify relevant studies in ERIC and
word remains when a specified phoneme is PsycINFO. The search was limited to articles
removed. For example, “What is smile without the / appearing in journals written in English, but no limit was
s/?” (mile) placed on the year of publication. This yielded a total of
In the studies reviewed by the NRP, researchers used 1,962 potentially relevant articles. Abstracts were
one or several of these tasks to assess how much PA printed and screened. In addition, references listed in
children possessed before training and how much they these articles and in several review papers were hand-
had learned at the end of training. Also, these tasks searched and screened. To qualify for analysis, studies
were the basis for activities that children practiced had to meet the following criteria:
during training. In some of the studies, children were 1. Studies had to adopt an experimental or quasi-
taught to perform these tasks with letters, for example, experimental design with a control group or a
segmenting words into phonemes and representing each multiple baseline method.
with a grapheme. In other studies, phoneme
manipulation was limited to speech. 2. Studies had to appear in a refereed journal.
To be clear, PA instruction is not synonymous with 3. Studies had to test the hypothesis that instruction in
phonics instruction that entails teaching students how to phonemic awareness improves reading
use grapheme-phoneme correspondences to decode or performance over alternative forms of instruction or
spell words. PA instruction does not qualify as phonics no instruction.
instruction when it teaches children to manipulate
4. Studies had to provide training in phonemic
phonemes in speech, but it does qualify when it teaches
awareness that was not confounded with other
children to segment or blend phonemes with letters.
instructional methods or activities.
5. Studies had to report statistics permitting the
calculation or estimation of effect sizes.
Applying these procedures, the NRP found 52 articles • Reading level of students: The children receiving
from which 96 instructional comparisons were drawn. instruction were at risk for developing reading
In each comparison, one group of children was taught problems, or were reading disabled, or were
PA while a control group received either another type normally developing readers.
of instruction or regular classroom instruction. Following • Grade level: The children were preschoolers,
training, the two groups were compared in their ability kindergartners, 1st graders, or 2nd through 6th
to read. graders.
The primary statistic used in the NRP analysis was • Socioeconomic status (SES): The children were low
“effect size,” the extent to which performance of the SES or middle-to-high SES.
treatment group exceeded performance of the control
group. An effect size of 1.0 indicates that the In addition, the NRP examined various features of the
treatment group mean was one standard deviation experiments to determine whether those showing strong
higher than the control group mean, revealing a strong effects were well designed or weakly designed. Among
effect of PA instruction. An effect size of 0 indicates the design features examined were whether children
that treatment and control group means were identical, were randomly assigned to treatment and control
revealing that training had no effect. To judge the groups, whether the size of the sample was small or
strength of an effect size, a value of 0.20 is considered large, and whether the study met criteria of rigor
small, 0.50 is moderate, and 0.80 is large. For each specified in a critique by Troia (1999).
comparison, three effect sizes were calculated to Results and Discussion
determine whether PA instruction improved children’s
phonemic awareness, reading, and spelling. What do results of the meta-analysis of PA
The studies in the NRP database varied in many
instruction studies show?
respects. These variations showed whether effect sizes The NRP examined whether PA instruction was
were bigger under some conditions than others. The significantly better than alternative forms of training in
NRP compared effect sizes associated with the helping children acquire phonemic awareness and
following variations: enabling them to apply this skill in their reading and
• Type of test: a standardized test was used or a test spelling. Results were positive. The overall effect size
devised by experimenters. on PA outcomes was large, 0.86. The overall effect
size on reading outcomes was moderate, 0.53. The
• Time of test: Outcomes were measured right after overall effect on spelling was also moderate, 0.59.
instruction or after a delay. Effects were significant on followup tests given several
• Type of PA training: Children received instruction months after training ended. Effects were significant on
that focused on one type of PA or two types of PA, measures of children’s ability to read words and
or they were taught three or more types of PA pseudowords as well as their reading comprehension.
skills. Effects were significant on standardized tests as well as
experimenter-devised tests. These findings show that
• Use of letters: Children were taught to manipulate
teaching children to manipulate phonemes in words was
phonemes using letters, or they were taught to
highly effective across all the literacy domains and
manipulate phonemes in speech only.
outcomes. Effects of training did not generalize to
• Size of groups: Children were taught individually or performance on math tests, indicating that halo/
in small groups or in larger classroom groups. Hawthorne effects did not account for the findings.
• Trainer: The source of the instruction was the
children’s classroom teacher or a researcher or a
computer.
• Length of instruction: Instruction varied from 1 hour
to 75 hours.
What were the effects of moderators on instruction. Small-group instruction produced larger
learning phonemic awareness? effect sizes on reading than individual instruction or
classroom instruction, albeit in an unanticipated fashion.
The NRP examined whether PA training was effective Specifically, the longer the training program, the smaller
under more specific conditions. Children acquired PA the effect size. Significant improvement in reading skills
successfully under all conditions, but some conditions following PA instruction was observed both in studies
produced larger effects than others. Effect sizes were involving classroom teachers and in computer formats,
larger when children received focused and explicit but the degree of transfer was less than that achieved
instruction on one or two PA skills than when they were in experimentally controlled studies. Large effect sizes
taught a combination of three or more PA skills. were obtained in studies of at-risk readers, with
Instruction that taught phoneme manipulation with moderate effect sizes obtained for disabled and
letters helped normally developing readers and at-risk normally developing readers.
readers acquire PA better than PA instruction without
letters. When children were taught PA in small groups, Moreover, preschoolers exhibited a much larger effect
their learning was greater than when they were taught size on reading than did students in the other grade
individually or in classrooms. The length of time spent levels. Children learning to read in English also showed
teaching children was influential, with treatments lasting larger transfer effects to reading than children learning
from 5 to 18 hours producing larger effect sizes than in other languages. The effects of PA training on
shorter or longer treatments. Classroom teachers were reading outcomes were also influenced by SES, with
very effective in teaching PA to children. Also, mid-to-high SES associated with larger effect sizes than
computers were effective. Although all levels of low SES.
readers acquired PA successfully, effect sizes were
What were the effects of moderators on
greater for children who were beginning readers at risk
learning to spell?
for reading failure and normally progressing readers
than for older disabled readers. Students in the lower The NRP also examined how different conditions
grades, preschool, and kindergarten, showed larger influenced the impact and transfer of PA training to
effect sizes in acquiring PA than children in 1st grade spelling. The effects of PA training on spelling for
and above. Children learning to read in English showed disabled readers was minimal, as indicated by effect
larger effects than children learning to read in other sizes that did not differ significantly from zero. This is
alphabetic languages. However, SES level exerted no consistent with other findings indicating that learning to
impact on effect size, indicating that low and mid-to- spell is especially difficult for disabled readers. Because
high SES children benefited similarly from PA training disabled readers were unevenly distributed across the
in acquiring phonemic awareness. conditions that were examined in relation to the effects
of PA training on spelling, along with the finding of a
What were the effects of moderators on
nonsignificant effect size, data obtained from studies of
learning to read? disabled readers were eliminated from the database.
The impact of these specific conditions on the amount The effects of conditions on spelling outcomes were
of transfer from PA training to other reading skills was analyzed for at-risk and normal readers. For these
also examined. For example, transfer was greater when groups, effect sizes involving spelling outcomes did not
experimenter-devised tests were used to measure differ across levels of the following properties of PA
reading skills than when standardized tests were used. training: whether one or two or multiple PA skills were
This was not surprising, given that standardized tests taught, whether training was conducted with individuals
tend to be less sensitive. Teaching that focused on one or small groups or classroom-size groups, how long
or two types of PA manipulations yielded larger effect training lasted, or whether the trainer was a classroom
sizes than teaching three or more PA skills. Teaching teacher or a researcher. However, effect sizes did
children to manipulate phonemes using letters produced differ across other conditions. Teaching children to
bigger effects than teaching without letters. Blending manipulate phonemes with letters exerted a much larger
and segmenting instruction exerted a significantly larger impact on spelling than teaching children without letters.
effect on reading development than did multiple-skill
beginning reading and spelling instruction, there is the NRP cannot infer that every teacher of every
obviously much more that needs to be taught to child in the studies was successful in promoting the
children to enable them to acquire reading and acquisition of PA or its transfer to reading and
writing competence. PA instruction is intended only writing. There was considerable variation within
as a critical foundational piece. It helps children and across individual studies. Likewise, the NRP
grasp how the alphabetic system works in their findings should not be used to dictate any
language and helps children read and spell words in oversimplified prescriptions regarding effective PA
various ways. However, literacy acquisition is a instruction, for example, how long PA training
complex process for which there is no single key to should last (e.g., 5 to 18 hours) to be most
success. Teaching phonemic awareness does not effective. There are many factors that govern the
ensure that children will learn to read and write. effectiveness of instruction.
Many other competencies must be taught for this to • More is not necessarily better. The NRP findings
happen. indicated that PA training was effective regardless
• A number of PA instructional programs were found of its length. However, effect sizes were largest
to be effective. The studies assessing these when training lasted less than 20 hours. This
programs are useful in identifying several factors suggests that teachers should make reasoned
that are important and should be considered in decisions and remain flexible about the amount of
planning classroom instruction or in evaluating time to devote to this component of their
published programs that purport to teach PA. In instructional programs. Children will differ in the
implementing PA instruction in their classrooms, time they need to acquire PA. The best solution is
teachers need to evaluate the methods they use to pretest for PA skills and adjust the amount of
against measured success in their own students. instruction to suit individual and class needs.
• One factor that is obviously important in any • Early PA instruction cannot guarantee later literacy
effective classroom program but has not been success. The most reasonable conclusion from the
specifically addressed in the research literature on findings of the NRP analysis is that adding well-
PA instruction is motivation of the students and of designed PA instruction to a beginning reading
the teachers. It seems self-evident that techniques program or a remedial reading program is very
to develop children’s PA in classrooms should be as likely to yield significant dividends in the acquisition
relevant and exciting as possible so that the of reading and writing skills. Whether the benefits
instruction engages children’s interest and attention are lasting will likely depend on the
in a way that promotes optimal learning. However, comprehensiveness and effectiveness of the entire
research has not specifically focused on this factor. literacy program that is taught. Additional factors
Neither has the research examined the specific that play a significant role in children’s literacy
techniques that are most engaging for teachers. For acquisition are detailed in other sections of the NRP
example, none of the studies inquired whether report.
teachers liked the programs they were given to
teach. It seems self-evident that teachers will be
Directions for Further Research
most effective when they are enthusiastic in their Many experiments have been conducted to test
teaching and enjoy what they are doing in the whether phonemic awareness instruction helps children
classroom. In selecting ways to teach PA in their learn to read. Results have been sufficiently positive to
classrooms, teachers need to take account of sustain confidence that this treatment is indeed
motivational aspects of programs for themselves as effective across a variety of child and training
well as their students. conditions. However, there are still some questions
• Results of the meta-analysis should not be needing further attention from researchers.
overinterpreted. Although most comparisons in the • Research is needed to identify what teachers need
analysis demonstrated significant mean effect sizes, to know and be able to do to teach PA effectively
and to integrate this instruction with other elements
In a few of the studies analyzed by the NRP, instruction 0.66 with reading achievement scores in kindergarten
was focused on teaching children to manipulate onsets and 0.62 with scores in 1st grade. Of interest in our
and rimes in words (Fox & Routh, 1984; Lovett, analysis was whether PA could be shown to play a
Barron, Forbes, Cuksts, & Steinbach, 1994; Treiman & causal role in learning to read.
Baron, 1983; Wilson & Frederickson, 1995). The onset
PA is thought to contribute in helping children learn to
is the single consonant or consonant blend that precedes
read because the structure of the English writing
the vowel, and the rime is the vowel and following
system is alphabetic. Moreover, it is not easy to figure
consonants, for example, j-ump, st-op, str-ong. Dividing
out the system. Words have prescribed spellings that
single-syllable words into these units is easier than
consist of graphemes symbolizing phonemes in
dividing the words in other places, for example, after
predictable ways. Being able to distinguish the separate
the vowel (Treiman, 1985). The NRP included these
phonemes in pronunciations of words so that they can
studies in the set because students were essentially
be linked to graphemes is difficult. This is because
manipulating phonemes when the onset was a single
spoken language is seamless and there are no breaks in
phoneme.
speech signaling where one phoneme ends and the next
Some forms of PA training in the data set qualified as one begins. Rather phonemes are folded into each other
phonics instruction, which involves teaching students and are coarticulated. Discovering phonemic units is
how to use grapheme-phoneme correspondences to helped greatly by explicit instruction in how the system
decode or spell words. For example, Williams’ (1980) works. This is underscored by research revealing that
ABD program taught students to use graphemes and people who have not learned to read and write have
phonemes to blend words—which is decoding. Ehri and great trouble performing phonemic awareness tasks
Wilce (1987b) taught students to use graphemes and (Morais, Bertelson, Cary, & Alegria, 1987). Likewise
phonemes to segment words—which is spelling. Also, people who have learned to read in a script that is not
Wise, King, and Olson (in press) taught both segmenting graphophonemic, such as Chinese, have difficulty
and blending with letters. What distinguished the NRP segmenting speech into phonemes (Mann, 1987; Read,
studies from the general pool of phonics training studies, Zhang, Nie, & Ding, 1987). For these reasons, it was
however, is that instruction given to treatment students expected that the impact of PA training on literacy
but withheld from controls was limited to grapheme- would be strongest in tasks assessing children’s ability
phoneme manipulation and did not go beyond this to to read and spell words.
include other activities such as reading decodable text
Research on word reading processes has distinguished
or writing stories.
several ways to read words (Ehri, 1991, 1994). The
Contribution of PA in Learning to Read process of decoding words never read before involves
transforming graphemes into phonemes and then
As mentioned above, PA measured at the beginning of blending the phonemes to form words with recognizable
kindergarten is one of the two best predictors of how meanings. The PA skill centrally involved in decoding is
well children will learn to read. In a study by Share et blending. To assess decoding skill, researchers often
al. (1984), kindergartners were assessed on many test children’s ability to read pseudowords such as blig
measures when they entered school, including phonemic or nef.
segmentation, letter name knowledge, memory for
sentences, vocabulary, father’s occupational status, A second way to read unfamiliar words is by analogy to
parental reports of reading to children, TV watching, known words (Gaskins, Downer, Anderson,
and many more. These researchers examined which of Cunningham, Gaskins, Schommer, & the Teachers of
these measures best predicted how well the children Benchmark School, 1988; Glushko, 1979; Goswami,
would be reading at the end of kindergarten and at the 1986; Marsh, Freidman, Welch, & Desberg, 1981). A
end of 1st grade. Results showed that PA was the top common basis for analogizing is recognizing that the
predictor along with letter knowledge. PA correlated rime segment of an unfamiliar word is identical to that
of a familiar word, and then blending the known rime
with the new onset, for example, reading brick by Some of the studies in the NRP database measured
recognizing that -ick is contained in the known word reading comprehension as well as word reading. In
kick. Reading by analogy is thought to require the PA order to comprehend a text, readers must be able to
skills of onset-rime segmentation and blending. read most of the words. However, other capabilities
influence reading comprehension as well, such as
Another way to read words is from memory, sometimes
readers’ vocabulary, their world knowledge, and their
called sight word reading. This requires prior
memory for text. It was expected that PA training
experience reading the words and retaining information
would benefit children’s reading comprehension
about them in memory. In order for individual words to
because of its dependence on effective word reading.
be represented in memory, beginning readers are
However, the degree of influence was expected to be
thought to form connections between graphemes and
less than that observed with word reading because the
phonemes in the word. These connections bond
influence is indirect.
spellings to their pronunciations in memory (Ehri, 1992;
Ehri & Wilce, 1987a; Rack, Hulme, Snowberg, & Design Features of Phonemic Awareness
Wightman, 1994; Reitsma, 1983). The PA skill thought Training Studies
to be important for developing word memory is being
able to segment pronunciations into phonemes that link Many correlational studies have reported strong
to graphemes. Formulation of this concept led to the relationships between phonemic awareness and learning
expectation PA training would benefit children’s word to read (for reviews, see Blachman, in press; Ehri,
reading, particularly when they received practice 1979; Stahl & Murray, 1994; Wagner & Torgesen,
learning to read the words. 1987). In correlational studies, researchers measure
children’s ability to manipulate phonemes and also their
The processes involved in writing words, either by reading ability. Typical findings show that students who
generating approximate spellings of the words or by have superior phonemic awareness are better readers
retrieving correct spellings from memory, require than students with low PA. However, such findings are
phonemic segmentation skill (Griffith, 1991). Phonemic insufficient to show that PA was the underlying cause
segmentation is required for spellers to select letters to enabling some students to read better than others. This
represent the phonemes. Phonemic segmentation is is because the finding does not rule out other causal
required to help children retain correct spellings in explanations for the relationship. Perhaps the
memory by connecting graphemes to phonemes. In the correlation was observed because cause operated in the
analysis it was expected that PA training would benefit reverse direction; that is, learning to read improved
children’s ability to spell. students’ PA. Or perhaps a third factor operated as an
Various kinds of word reading outcomes were assessed underlying cause boosting both PA and reading, for
across the studies the Panel reviewed. The simplest example, vocabulary size, memory, or general
task given to preschoolers required them to look at a intelligence.
word (sat) and decide whether it says sat or mat In order to show that PA operates as a direct cause in
(Byrne & Fielding-Barnsley, 1991). Studies with older helping children learn to read, the NRP needed to
children gave them lists of words to read either from assess evidence from experimental studies with
standardized tests or experimenter-devised tests. Also, treatment and control groups. A well-designed
word learning tasks were used. For example, experiment that provides strong evidence for cause
kindergartners first reviewed four letter-sound relations should include the following steps:
and then practiced learning to read five words over
several trials, am, at, mat, sat, Sam (O’Connor, Jenkins, 1. Pretesting should be given to students before they
& Slocum, 1995). Also, pseudoword reading tasks were receive any training. Pretests verify that children
used in which children read nonwords such as feem, have not already acquired PA and hence can profit
hote, cliss. Spelling tasks were included as well. from training. Pretest performance can be
Younger children were given credit for inventing compared to posttest performance on PA, reading,
phonetically plausible spellings of words while older and spelling tasks to evaluate gains resulting from
children were scored for producing correct spellings. PA training. Also, pretests indicate whether
treatment and control groups were equivalent prior Although these features characterize a well-designed
to training. If not, pretests can be use to equate the experiment, there were studies in the NRP database
groups statistically when effects of training are that lacked some of these features. Because of this, the
evaluated on outcome measures. relationship between design features and outcomes was
assessed. Studies varied in whether they compared
2. The group receiving PA training should be
performance of the PA-trained groups to performance
compared to a control group that is equivalent in all
of treated control groups or untreated control groups. If
respects except for receiving the PA training.
Hawthorne effects have influenced comparisons, one
Control groups may receive another type of training
would expect bigger effects when PA treatment groups
involving equal time but no PA instruction, or
are compared to untreated control groups than when
control groups may receive no special training
compared to treated control groups. However, Bus and
beyond that provided in the students’ classrooms at
van Ijzendoorn (1999) in their meta-analysis reported
school. The use of an alternative-treatment control
the reverse, finding bigger effects in comparisons
group is considered preferable to a no-treatment
between PA treatment groups and control groups
control group because the former rules out the
receiving an alternative treatment. The Panel attempted
Hawthorne effect as the explanation for any
replication of their findings with the NRP data set.
outcome differences favoring the experimental
group. The Hawthorne effect occurs when a The Panel also assessed whether PA training affected
treatment group outperforms a no-treatment control outcomes in three types of designs: (1) in true
group because the treated group received special experiments where students were randomly assigned to
attention and as a result was more motivated to treatment and control groups; (2) in quasi-experiments
perform. where students were members of pre-existing groups
which were not randomly assigned to treatment and
3. Random assignment should be used to place
control conditions; and (3) in studies where students
students in treatment and control groups. Random
from treatment and control groups were matched.
assignment makes it likely that treatment and
Although random assignment is preferable, researchers
control groups do not differ systematically in any
may be limited to a quasi-experimental design when
way that would explain outcome differences
they evaluate PA programs in schools where
following training. In other words, this step helps to
classrooms already exist or when they employ as
establish that the treatment, rather than some other
trainers teachers who are already familiar with a
factor, was the cause of any improvement in
program and teach it to their students. The procedure of
reading outcomes.
matching children on the basis of pretest scores is done
4. Posttests should be given to students following to minimize any pretreatment differences between the
training. Posttests to assess PA verify that training groups being compared. In the NRP analysis, the
worked, that the PA-trained group made greater effects of PA training separately for the three types of
gains than the control group. Posttests to assess studies were examined.
reading and spelling show that PA training
In a recent critique of PA training studies, Troia (1999)
transferred and improved students’ reading and
identified several design flaws and applied these criteria
spelling performance.
to rate PA training studies for their lack of
5. Followup posttests should assess the long-term methodological rigor. To evaluate the impact of these
effects of PA training on students’ progress in flaws on outcomes, the Panel examined the relationship
reading and spelling. Between the end of training between Troia’s assessments of the PA studies and the
and the followup tests, both experimental and effects reported in these studies. The purpose of this
control students receive regular instruction at school analysis was to rule out the possibility that claims about
but no further specialized training in PA. PA training effects are supported mainly by poorly
designed studies.
Other Features of PA Training Studies personnel, then PA instruction should not be imposed on
classroom teachers. In the NRP analyses, the effects
Studies in our data set varied in the types of students of PA training were examined separately for teachers,
who received PA training. The NRP wanted to know for computers, and for researchers.
whether certain types of students benefited more than
other types. Studies varied in the grade level of their There is substantial evidence that one-to-one tutoring is
participants and ranged from preschool to 6th grade. the most effective form of instruction (Bloom, 1984;
Studies varied in whether their students showed any Cohen, Kulik, J., & Kulik, C., 1982; Glass, Cahen,
signs of having reading problems. Three types of Smith, & Filby, 1982; Pinnel, Lyons, DeFord, Byrk, &
readers were distinguished across the studies. Some Seltzer, 1994; Wasik & Slavin, 1993). However, Bus
focused on children at risk for developing reading and van Ijzendoorn (1999), in their meta-analysis of PA
difficulties in the future. These were children below 2nd training studies, found that teaching PA to small groups
grade. Being at risk was defined as having low PA or of children produced a bigger impact on outcomes than
low reading in 83% of the cases. Low socioeconomic teaching students individually or in classrooms. The aim
status (SES) characterized only 27% of the cases. was to attempt replication of this finding with the NRP
Some studies focused on children who had already data set that included more studies than those in the
fallen behind classmates in their reading, referred to as previous meta-analysis.
disabled readers. These were children in 1st grade and
It is common wisdom that greater time spent training
above. The remaining studies sampled children who
students yields superior learning. However, instructional
were judged to be making normal progress in learning to
time in schools is very limited because of the many
read. This judgment was based on the fact that the
subjects and skills that must be taught. The studies in
children were not identified as having any reading
the NRP data set varied in the length of time spent
problems.
teaching PA to students. To address the question of
One common finding reported in many correlational how much time might be sufficient for teaching PA, the
studies is that children who are or will become disabled relationship between training time and effects on
readers have poor phonemic awareness, substantially learning was examined.
below that expected of students at their reading levels
The NRP database included PA training studies
(Bradley & Bryant, 1983; Bruck, 1992; Fawcett &
conducted not only in English but also in other
Nicholson, 1995). Researchers have suggested that this
languages, such as Norwegian, Finnish, Swedish,
deficiency underlies and explains their difficulty in
Danish, Spanish, Hebrew, Dutch, and German. In most
learning to read. In the NRP analysis, the Panel
of these languages, the grapheme-phoneme connections
examined whether PA training was effective in
are more transparent than in English. Of interest was
teaching PA to at-risk and disabled readers and
whether PA training might exert a larger impact in
whether this improved their reading and spelling
English because it is harder for beginning readers to
performance, thus providing evidence for a causal
discover the graphophonemic system in English than in
connection.
other languages.
Studies varied in how the PA training was delivered. In
some studies, researchers or their specially trained Methodology
assistants taught children to manipulate phonemes. In Database
other studies, classroom teachers were the trainers. In
a few studies, training was presented primarily by An electronic search of two databases, ERIC and
computers. Because classroom teachers are the PsycINFO, was conducted. Six terms involving
purveyors of reading instruction for most children, it is phonemic awareness were crossed with 15 terms
important to determine whether they can teach PA related to reading performance. The PA terms were:
effectively. If training requires specially trained phonemic awareness, phonological awareness, spelling,
blending, learning to spell, and invented spelling. The
reading terms were: reading, reading ability, reading
achievement, reading comprehension, reading
development, reading disabilities, reading skills, remedial effect sizes for each treatment-control comparison
reading, beginning reading, beginning reading instruction, consisted of the mean of the treatment group minus the
reading acquisition, word identification, word reading, mean of the control group divided by a pooled standard
oral reading, and miscues. The search was limited to deviation.
articles appearing in journals written in English, but no
From the 52 studies, 96 cases comparing individual
limit was placed on the year of publication. Using this
treatment and control groups were derived. Because
procedure, the Panel located 637 articles through ERIC,
some of the studies included more than one treatment
and 1,325 articles through PsycINFO. Abstracts were
or control group, the cases included comparisons
printed and screened. In addition, the Panel hand-
utilizing the same group more than once. There were
searched and screened references cited in the studies
seven treatment groups appearing twice because they
located by the electronic search and in several review
were compared to two different control groups. There
papers (Apthorp, 1998; Blachman, in press; Bus & van
were 16 control groups appearing twice because they
Ijzendoorn, 1999; Stahl & Murray, 1994; Troia, 1999;
were compared to 2 different treatment groups. There
Wagner, 1988).
was one control group appearing three times because it
To qualify for the analysis, studies had to meet the was compared to three treatment groups. In sum, there
following criteria: were 47 independent comparisons and 49 comparisons
having a group that overlapped with one or at most two
1. Studies had to adopt an experimental or quasi-
other comparisons. Although this meant that effect
experimental design with a control group or a
sizes were not completely independent across cases,
multiple baseline method.
the Panel preferred this alternative to combining
2. Studies had to appear in a refereed journal. treatment and control groups within studies because it
was important not to obscure important moderator
3. Studies had to test the hypothesis that training in variables of interest. For example, Davidson and
phonemic awareness improves reading Jenkins (1994) studied three treatment groups, one
performance over alternative forms of training or taught to blend, one taught to segment, and one taught
no training. to both to segment and blend. They compared the
4. Studies had to provide training in phonemic performance of each treatment to the same control
awareness that was not confounded with other group. The Panel wanted to retain these as separate
instructional methods or activities. comparisons in our analysis, so the same control group
was allowed to recur in three comparisons.
5. Studies had to report statistics permitting the
calculation or estimation of effect sizes. A few studies in the NRP database included treatment
or control groups that were not deemed appropriate for
From the various lists of references, the Panel identified analysis. One reason was that the treatment groups
and located 78 articles that appeared to meet our provided not only phonemic awareness training but also
criteria. Upon closer inspection, 26 articles did not reading or writing training that was not provided to
match all criteria: 5 lacked sufficient information to control groups, thus confounding PA training with
determine effect size; 5 lacked an adequate control reading and writing training. The following describes
group; 12 did not assess reading as an outcome; and 4 which treatment or control groups were eliminated from
lacked appropriate phonemic awareness training. The the analysis and why: a treatment group given decoding
final set of studies meeting our criteria numbered 52 training and word reading (Barker & Torgesen, 1995); a
(see Appendix A). treatment group given a reading and writing program
The primary statistic used in the Panel’s analysis of (Brennan & Ireson, 1997); a treatment group taught to
performance on outcome measures was effect size, manipulate syllables rather than phonemes (Sanchez &
indicating the extent to which performance of the Rueda, 1991); a treatment group taught semantic
treatment group exceeded performance of the control categorization with written words (Defior & Tudela,
group, with the difference expressed in standard 1994); treatment groups in which the teacher-trainers
deviation units. The formula used to calculate raw failed to spend the time prescribed for training
(Olofsson & Lundberg, 1983); treatment groups in
which children not only analyzed phonemes but also Characteristics of children receiving the training were
read words in sentences and stories, unlike children in coded. Children were grouped into four categories to
the control groups who only listened to stories or reflect their grade levels: preschool, kindergarten, 1st
remained in their classrooms (Solity, 1996; Weiner, grade, and 2nd through 6th grades. Also children were
1994); a control group lacking not only PA training but grouped by reading ability. At-risk children were those
also the Reading Recovery© instruction given the judged by authors of the studies to be at risk for
treatment group (Iversen & Tunmer, 1993): and a developing reading problems. In the majority of cases
control group that did not control for all of the non-PA (77%), this was indicated by poor performance on PA
elements of training (Lovett et al., 1994; Vellutino & tasks. Other indicators used in a few studies were low
Scanlon, 1987). These treatment or control groups were reading, low SES, developmental or language delays, or
not included in the database. cognitive disabilities. Only 27% of the cases were low
SES, while 37% were middle-to-high SES. These
The studies in the NRP database were coded for many
children were all below 2nd grade.
characteristics that the Panel felt were important to
include as moderator variables in the meta-analyses. Children who had already developed reading problems
These characteristics are listed in Table 1 (Appendix were coded as disabled readers. All but three cases
B). Various properties of phonemic awareness training involved children between 2nd and 6th grade levels.
were coded. Training programs varied in whether they The three cases involved 1st graders who qualified for
focused on specific PA manipulations. Single-focus Reading Recovery© programs (Hatcher, Hulme, & Ellis,
studies taught blending, categorization, identity, 1994; Iversen & Tunmer, 1993). Being reading disabled
segmention, or onset-rime only. Double-focus studies meant reading below grade level despite at least
involved combinations of blending, segmenting, deletion, average cognitive ability in most studies. In one study,
or categorization. Global treatments taught three or the school’s definition of learning disabled was used
more PA skills. Programs that only taught onset-rime (Williams, 1980). In one study, students were not only
manipulation were coded as onset-rime training, even reading disabled but also had neurological impairment
though the training might have involved blending and and language learning problems (Lovett et al., 1994).
segmenting (e.g., Fox & Routh, 1976). Training varied
Samples of children not reported as being at risk or
in whether children were taught to manipulate
reading disabled were coded as normally progressing
phonemes using letters or whether attention was limited
readers. These studies included children selected not to
to phonemes in speech. Training that had children
have reading problems as well as children selected
manipulate blank markers was coded as a nonletter
without regard to reading ability. The socioeconomic
treatment.
level of children was coded into two categories, low
The training unit varied across studies. Students were SES or middle-to-high SES, based on assertions by
tutored individually in some studies and in either small authors. The language spoken by children and used to
groups or whole classrooms in other studies. The size of teach PA was coded as English or non-English. Non-
the small groups varied from two to seven students. English languages included Dutch, Finnish, German,
The identity of trainers varied across studies. The Panel Hebrew, Norwegian, Spanish, and Swedish.
compared classroom teachers to others who were
Some features of the methodology used in the
mostly researchers or trained assistants. Credentialed
experiments were coded. Children were assigned to
teachers who conducted the training but were not the
treatment and control groups in one of three ways.
students’ classroom teacher were coded as others. In a
They were randomly assigned. Or they were members
few studies, PA training was provided mainly by
of intact groups that were not randomly assigned to
computers. The Panel compared this training to training
conditions, referred to by researchers as nonequivalent
provided by noncomputers (all others). The length of
groups. In some studies two classrooms were assigned
training varied from 1 to 75 hours. Comparisons were
randomly, one to the treatment and one to the control
conducted by dividing training time into four blocks.
condition. These cases were categorized as
nonequivalent groups. In other studies, several
classrooms were assigned randomly to treatment and
control conditions. These cases were categorized as One final characteristic of the NRP studies was coded
random assignment. The third way of assigning children and analyzed, the year of publication. Years were cast
to conditions involved matching children on the basis of into four blocks. Other characteristics of the studies
similar test scores. Typically, members of a match are were coded as well but were not analyzed either
randomly assigned, one to the treatment group and one because there was little interest or because there was
to the control group. However, in some studies, this step an insufficient number of cases to support a meaningful
was not stated explicitly; so, it is impossible to be sure analysis.
that random assignment was always used.
Four individuals coded the studies and entered values
The Panel coded studies to reflect whether fidelity to into the SPSS database. The reliability of moderator-
treatment was checked, that is, whether researchers variable codes was checked by comparing codes in the
observed trainers to make sure they adhered to database to codes generated by one of the coders who
treatment procedures. In addition, comparisons were re-coded 14 of the articles (15% of the cases). The
coded for the type of control group, that is, whether or percentage of agreement of the codes was 94%. All of
not control students received a special alternative the means, standard deviations, and sample sizes that
treatment or remained untreated. The number of were entered into the database were verified at least
students participating in the comparison was coded to twice for accuracy.
reflect sample size. The numbers were grouped into
There were three outcomes of primary interest:
four blocks to distinguish sample sizes ranging from
phonemic awareness, reading, and spelling
small to large.
performance. Some studies included multiple tasks
To evaluate the relationship between the methodological measuring these outcomes. These measures were
quality of studies and the effect sizes found, the Panel combined by calculating raw effect sizes (g) for
adopted the five methodological criteria applied by Troia individual tasks and then averaging the effect sizes
(1999) in his critique of the internal and external validity across tasks. The composite measure for reading
of PA training studies. Internal validity refers to the included many different types and measures of reading.
authenticity of cause-and-effect relationships in a study, For example, word reading, pseudoword reading,
that is, whether the treatment caused the outcome reading comprehension, reading speed, time to reach a
observed, or whether other variables could have criterion of learning, and miscues were included. The
impacted the outcome. External validity refers to the phonemic awareness composite included only those
generalizability of the findings, that is, whether or not measures that required manipulating phoneme-size
the results of a study can be applied to other persons in units, not larger syllabic units. The types of
other settings at other times. To evaluate the internal manipulations in the composite included segmentation,
and external validity of studies, Troia used four blending, reversing, deletion, identity, and categorization.
summary measures: percentage of internal validity The spelling composite included measures of the quality
criteria met by the studies, number of critical flaws of invented spellings as well as correct spellings of
challenging a study’s internal validity (e.g., no random words and pseudowords.
assignment, no alternative treatment given to the control
The Panel also examined more specific outcome
group, no assessment of trainer fidelity to treatment),
measures that included various types of phonemic
percentage of external validity criteria met, and number
awareness, reading, spelling, and math. The specific
of critical flaws challenging a study’s external validity
measures are listed in Table 1. Also of interest was a
(e.g., insufficient information about the sample of
comparison of effect sizes on outcomes measured
participants or about how disability was defined and
immediately after training to outcomes assessing long-
assessed). Troia evaluated 28 of the studies included in
term learning. Delayed posttests were administered
the NRP database. The Panel applied his ratings and
from 2 to 36 months following training.
rankings to the 56 cases derived from these studies.
The Panel did this without checking Troia’s evaluations
for accuracy; so, any incorrect codings of the studies
arise from Troia’s procedures, not from the Panel’s.
Inspection of spelling outcomes in Table 4 reveals that and spelling in both English and non-English languages,
all but three effect sizes were significantly greater than and among low SES as well as middle-to-high SES
zero. This indicates that, across most of the conditions children. Many types of PA training programs are
and characteristics considered, phonemic awareness effective for improving reading and spelling, including
training transferred and improved spelling skills more those that teach one or multiple types of phonemic
than alternative forms of training or no training. Effect awareness, those that incorporate letters into training,
sizes for spelling outcomes were insignificant when and those that limit phoneme manipulation to speech.
computers were used in the training, and when the Not only researchers but also classroom teachers and
students trained were disabled readers or children in computers can deliver PA instruction effectively.
2nd grade and above. As documented below, the Instruction can be conducted successfully with
absence of significant effects on spelling outcomes in individuals as well as small groups and whole
the latter cases arose primarily because disabled classrooms. Training does not have to be lengthy to be
readers’ spelling benefited little from PA training, and effective.
these readers were overrepresented in these categories
(i.e., 2nd through 6th graders, receiving PA instruction Were Effect Sizes Homogeneous?
on computers). In addition to determining whether mean effect sizes
Some of the studies evaluated the effects of PA training were significant, the Panel also tested whether the set
on an outcome not expected to be affected (e.g., of effect sizes was sufficiently homogeneous to render
mathematics). Tests to assess math were administered the mean effect size representative of that set. A
following training in 12 comparisons and following some homogeneity analysis calculates how probable it is that
delay in three comparisons. Results in Table 3 show the variance exhibited among the effect sizes would be
that the effect size was nonsignificant and close to zero observed if only sampling error was making them
(d = 0.03). This indicates that the effects of PA training different (Cooper, 1998). The 95% confidence intervals
did not influence all outcomes but rather were limited to for effect sizes presented in Tables 2 to 4 reveal how
outcomes related to literacy. These findings argue variable they were. When the pool of effect sizes is not
against the operation of any halo/Hawthorne effect homogeneous, the next step is to examine whether
explaining the positive effect sizes. moderator variables reduce the variability among effect
sizes to create homogeneity, indicating their power to
In sum, these findings led the Panel to conclude with explain the variance.
much confidence that phonemic awareness training is
more effective than alternative forms of training or no At the top of Tables 2, 3, and 4 in Appendix C, it is
training in helping children acquire phonemic awareness apparent that on the immediate outcome measures of
and in facilitating transfer of PA skills to reading and PA, reading, and spelling, effect sizes were not
spelling. PA training improves children’s reading homogeneous, as indicated by “No” in the homogeneity
performance in various types of tasks, including word column. Effect sizes involving followup measures of PA
reading, pseudoword reading, and reading and spelling outcomes were homogeneous, but followup
comprehension. Benefits are evident on standardized reading effect sizes were not. Thus, there is reason to
tests as well as experimenter-designed tests of reading examine moderator variables that may explain effects
and spelling. Improvement in reading and spelling is not on immediate outcomes and on followup tests involving
short-lived but lasts beyond the immediate training reading outcomes.
period. Did Moderator Variables Influence Effect
PA training improves reading performance in Sizes?
preschoolers and elementary students, and in normally Studies varied in many respects as indicated in Table 1
progressing children, as well as in older disabled readers (Appendix B). The Panel examined whether these
and younger children at risk for reading difficulties. PA moderator variables enhanced or limited the
training improves spelling performance in effectiveness of PA training for teaching PA and for
kindergartners, 1st graders, and at-risk students, but not facilitating transfer to reading and spelling. It is
in older disabled readers. PA training boosts reading important to recognize the limitations of this type of
analysis and the tentative nature of any conclusions that PA training benefited children’s reading and spelling
are drawn. Findings involving the impact of moderator performance not only on experimenter-devised (E) tests
variables on effect sizes cannot support strong claims but also on standardized (S) tests, although the effect
about causality. Moderator findings are no more than size was significantly larger with experimenter tests (d
correlational. The biggest source of uncertainty is = 0.61 E vs. 0.33 S for reading; d = 0.75 E vs. 0.41 S
whether there is a hidden variable that is confounded for spelling). This is perhaps not surprising.
with the variable in focus and is the true cause of the Standardized tests are designed to assess reading and
difference; thus, the conclusions drawn should be spelling across a wide range of ability levels and hence
regarded as tentative and suggestive rather than the are less sensitive to differences at any one level in the
final word. range. Also, experimenter tests may be more sensitive
because often they are tailored to detect the phonemes
Another caution to keep in mind in interpreting findings
and graphemes that were taught.
involving moderator variables is that the same 96 cases
in the database do not contribute to the calculation of all Some studies assessed reading performance with
effect sizes. Rather the set of cases changes across pseudowords in order to measure children’s ability to
moderator variables, either because some of the studies decode unfamiliar words. From Table 3, it is apparent
lacked the information to be coded, they did not assess that PA training benefited decoding skill. Effects were
the outcome in interest, or they did not include a moderate and equivalent on both experimenter-devised
measure of the outcome at that test point. Any tests (d = 0.56) and standardized tests (d = 0.49).
instability in the pattern of findings may arise from this
The effect of PA training on reading comprehension
source, particularly when only a few cases contribute.
was assessed in 18 cases. From Table 3, it is apparent
Outcome Measures that training boosted reading comprehension
The immediate goal of phonemic awareness training significantly (d = 0.32), although the effect size was
across these studies was to improve children’s smaller than for word reading. This is not surprising. PA
phonemic awareness. From Table 2, it is apparent that training would be expected to influence comprehension
the effect size after training was large (d = 0.86), and it primarily through its impact on word reading. The task
did not decline significantly at the followup test (d = of reading, understanding, and remembering information
0.73). Thus, PA training taught phonemic awareness in the text involves multiple processes. Not only must
very effectively, and students retained their skill after students read the words, but also they must do so
training ended. Comparison of specific PA skills rapidly and accurately and must construct meaning
acquired during training indicated that effects were across the words and sentences. These other
larger for segmentation and deletion outcomes than for processing demands could be expected to dilute the
blending. Perhaps blending was harder to teach, or influence of PA training.
perhaps it was easier for controls to pick up without
Pr oper
Proper ties of P
operties A Training
PA
instruction.
Studies varied in whether one skill, two skills, or multiple
The strong gains in PA were observed to transfer to skills were taught. These skills consisted mainly of
reading and spelling, and effects persisted through the teaching children to identify or categorize phonemes, or
second followup test. As evident in Table 3, reading- to blend, segment, or delete phonemes, or to manipulate
outcome effect sizes were moderate, and the effect onset-rime units. From PA outcomes in Table 2, it is
size after training (d = 0.53) was equivalent to that at apparent that focusing instruction on one or two skills
the first followup test (d = 0.45). A significant effect was significantly more effective for teaching phonemic
size was still present but significantly smaller at the awareness than focusing on multiple skills (d = 1.16 for
second followup test (d = 0.23). Table 4 shows that one vs. d = 1.03 for two vs. d = 0.70 for multiple). One
spelling outcomes were boosted by PA training. The explanation for lower effect sizes is that children who
effect size following training (d = 0.59) was moderate were taught many different ways to manipulate
and significantly greater than the effect sizes at the two phonemes may have become confused about which
delayed posttests (d = 0.37 and 0.20) that did not differ. manipulation to apply when the various kinds of PA
were assessed after training. Another possibility is that
insufficient time was spent on any one type of PA to on spelling performance (d = 0.79) than did the multiple
teach it well in the multiple condition. A third possibility skill treatment (d = 0.23), but very likely this resulted
is that multiple skills instruction involved teaching higher from disabled readers’ dominating the multiple
level PA skills mainly to older children having difficulty treatment condition (see below). From these findings,
acquiring PA. the Panel concludes that blend-and-segment training
benefited children’s reading more than multiple skills
The Panel examined whether focused training in PA
training did.
produced greater transfer to reading than multiple-skill
training. From reading outcomes in Table 3, it is Also of interest was whether some types of single
apparent that transfer was twice as great when PA phoneme manipulation activities, for example, blending,
training focused on one (d = 0.71) or two (d = 0.79) PA segmenting, or categorizing, were more effective than
skills than when a multitude of skills were taught (d = other types. However, in examining the database, there
0.27). The advantage of focused over multiple-skill were too few instances of each type to permit
training for reading persisted at the followup test, comparison; so, this question was not addressed in the
especially for the two-skill focus that produced Panel’s analysis.
significantly larger effects than the one-skill focus. This
Studies in the database differed in whether or not
indicates that teaching two PA skills to children has
children were taught to manipulate phonemes using
greater long-term benefit for reading than teaching only
letters during training. For example, some children
one PA skill or teaching a global array of skills.
learned to segment words into phonemes by selecting
As evident in Table 4, spelling effect sizes for focused plastic letters for the sounds they spoke, whereas other
and multiple skills instruction showed the same pattern. children only spoke the sounds or they represented the
In fact, effects for the one-skill condition (d = 0.74) and sounds with unmarked tokens. Of interest was whether
the two-skill condition (d = 0.87) were over three times letters might improve children’s learning because they
as large as the effect size for the multiple condition (d = provide concrete, lasting symbols for sounds that are
0.23). These findings suggest that focused PA short-lived and hard to grasp. From PA outcomes in
instruction may benefit spelling more than multiple skill Table 2, it is apparent that children trained with letters
instruction does. However, it is likely that the lower did not acquire stronger PA (d = 0.89) than children
effect size in the multiple condition arose because trained without letters (d = 0.82). The absence of a
disabled readers dominated this category and PA difference may have occurred, however, because
instruction did not improve their spelling (see below). almost all comparisons involving disabled readers fell in
the letter use category, and disabled readers exhibited
Various types of phoneme manipulations might be
smaller effect sizes than nondisabled readers on PA
taught. However, two types, blending and segmenting,
outcomes (see Table 2). As described below, when
are thought to be directly involved in reading and
effects of letter use were examined after disabled
spelling processes. Blending phonemes helps children to
readers were removed from the database, a significant
decode unfamiliar words. Segmenting words into
advantage of letter use was detected. From these
phonemes helps children to spell unfamiliar words and
findings, the Panel concludes that teaching PA with
also to retain spellings in memory. A number of studies
letters is more effective in helping nondisabled readers
examined PA training that taught children to blend and
acquire phonemic awareness than teaching PA without
segment phonemes. To assess its value, the Panel
letters.
compared the effect size for this treatment to the effect
size for the multiple (3 or more skills) treatment. As It was expected that teaching PA with letters would
evident in Table 2 reporting PA outcomes, neither form facilitate greater transfer to reading and spelling than
was more effective than the other for teaching PA. teaching PA without letters. This is because reading
However, as evident in Table 3 for reading outcomes, and spelling processes require knowing how phonemes
teaching students to blend and segment benefited their are linked to letters. From reading outcomes in Table 3,
reading much more (d = 0.67) than did a multiple-skills it can be seen that teaching children to manipulate
approach (d = 27). As shown in Table 4, the blending phonemes with letters created effect sizes almost twice
and segmenting treatment also produced a larger effect as large as teaching children without letters (d = 0.67
vs. 0.38). The same pattern persisted at the followup seven cases (15%) to the total of 45 cases in which
test as well (d = 0.59 vs.0.36). Likewise, letters children were trained in small groups. The small number
benefited spelling more than no letters, with the effect of instances serves to rule out this explanation for the
size almost twice as great (d = 0.61 vs. 0.34). These larger effect sizes associated with small group training.
findings reveal that PA training makes a stronger
The length of time allocated for PA training varied from
contribution to reading and spelling performance when
1 hour to 75 hours across studies. Cases were grouped
the training includes teaching children to manipulate
into four time blocks to determine whether there was an
phonemes with letters than when training is limited to
optimum length of time for teaching PA. From
speech.
phonemic awareness outcomes in Table 2, it is evident
Studies varied in whether PA training was provided to that effect sizes were significantly larger for the two
individual students or small groups or classrooms of middle time periods lasting from 5 to 9.3 hours (d =
students. From PA outcomes in Table 2, it is evident 1.37) and from 10 to 18 hours (d = 1.14). Periods that
that the most effective way to teach PA was in small were either shorter or longer than this were less
groups. The effect size produced by small groups was effective for teaching PA, in fact, only half as effective
very large (d = 1.38), over twice the size of effects for (d = 0.61 and 0.65).
individuals (d = 0.60) and classrooms (d = 0.67). This
On reading outcomes, training programs that were long-
was surprising given that it is easier to tailor instruction
lasting yielded a significantly smaller effect size than
and corrective feedback when students are taught
shorter training programs as shown in Table 3. Effect
individually, and it was expected that this advantage
sizes for the three shorter time blocks did not differ.
would make individual instruction more effective.
The same pattern was evident on spelling outcomes.
Explanations for the effectiveness of training in groups
promoting the acquisition of PA may involve enhanced These findings run counter to the expectation that more
attention, social motivation to achieve, or observational extensive training in PA should enable children to
learning opportunities. acquire superior phonemic awareness with stronger
benefits for reading and spelling. These findings suggest
The superior PA skills acquired by children taught in
that PA training does not need to be lengthy to exert its
small groups transferred and boosted their reading and
strongest effect on reading and spelling. However,
spelling performance as well. Effect sizes on reading
caution is needed in drawing conclusions. There are
outcomes for small groups were d = 0.81 on the
various reasons why effect sizes might have been
immediate posttest and d = 0.83 on the followup
smaller when training was extensive. Perhaps the goals
posttest. In contrast, effect sizes for children taught
of instruction were more complex and harder to
individually or in classrooms ranged from d = 0.30 to
achieve. Or perhaps the students who received
0.45 on the immediate and delayed posttests. On
extended training were harder to teach. Alternatively,
spelling outcomes, small group instruction produced a
perhaps shorter instruction is better. The value of PA
larger effect size than individual instruction did, but the
instruction may be to initiate insight into the alphabetic
small group effect size did not differ from the classroom
system. Adding further nuances or complexities may
effect size (see Table 4).
erode learning by producing confusion or boredom. In
The possibility that small group effect sizes might be sum, the optimum length of PA training remains an
inflated for statistical reasons was considered. Studies issue needing further research.
that treated groups as the unit of analysis in statistical
Classroom teachers are the primary purveyors of
comparisons may have exhibited larger effect sizes than
reading instruction so, it is important to verify that they
studies using individuals as the unit of analysis because
can teach PA effectively. Results of the analysis of
the standard deviations of group means are smaller than
phonemic awareness outcomes (see Table 2) showed
the standard deviations of individual scores. However,
that the effect size produced by classroom teachers
there were only five studies that used groups as the
was large (d = 0.78) although not as large statistically
statistical unit of analysis, and these contributed only
findings show that PA training is not effective for and 4). It might be noted that most studies of disabled
improving disabled readers’ spelling skills, perhaps readers did not report the students’ SES; so, disabled
because their spelling skills are much harder to reader effect sizes did not contribute to SES effect size
remediate than their reading skills. In contrast, PA calculations.
training was found to transfer to spelling in at-risk and
The NRP database included many studies conducted in
normally progressing readers, indicating that PA training
English-speaking countries as well as a smaller number
does benefit spelling in nondisabled readers.
of studies conducted in countries speaking languages
The Panel also examined the effects of PA training at other than English. A comparison of effect sizes
various grade levels: preschool, kindergarten, 1st grade, revealed that PA training exerted a larger impact on the
and 2nd through 6th grades. From PA outcomes in acquisition of PA by English-speaking students (d =
Table 2, it is evident that preschoolers showed a very 0.99) than by the non-English students (d = 0.65).
large effect size in acquiring PA (d = 2.37). However, Transfer to reading outcomes was also greater for
only two cases contributed to this value, making it less English students (d = 0.63) than for others (d = 0.36) on
reliable. The effect on PA outcomes in kindergarten (d the immediate test but not the followup test. However,
= 0.95) was significantly larger than the effect in 1st there were no differences in effects sizes on spelling
grade (d = 0.48) and in 2nd through 6th grades (d = outcomes.
0.70). The latter two effect sizes did not differ. These
A possible reason for the absence of effects on spelling
findings indicate that younger students gained the most
is that most of the studies involving disabled readers
PA, not surprisingly since they started out with the least
were in the pool of English studies. This may have
PA.
suppressed the English effect size in spelling. To check
Effect sizes for reading outcomes in Table 3 reveal that on this, effect sizes were recalculated with the reading-
PA training transferred to reading to a similar extent for disabled (RD) comparisons removed (see below).
kindergartners, 1st graders, and 2nd through 6th graders Results confirmed suspicion; they changed from no
(ds from 0.48 to 0.49). The effect size for preschoolers effect on spelling to a significant effect favoring English
was much larger (d = 1.25). The same pattern was not (d = 0.95) over non-English (d = 0.51).
apparent on spelling outcomes, as evident in Table 4.
One intriguing reason for the larger effect sizes in
Transfer of PA training to spelling was greater among
English may be that the English writing system is not as
kindergartners (d = 0.97) than among 1st graders (d =
transparent in representing phonemes as it is in the
0.52). There was no transfer to spelling among the 2nd
other languages; so, explicit training may make a bigger
through 6th graders for whom the effect size did not
contribution to clarifying phoneme units and how they
differ from zero (d = 0.14). (Spelling was not measured
link to graphemes in words for English-speaking
in the preschool studies.) The absence of an effect on
students.
spelling among the older children arose primarily
because the majority of the cases in 2nd through 6th Analysis of Moderator Effects With Disabled
grades (78%) consisted of disabled readers who failed Readers Removed From the Database
to show transfer effects from PA training to spelling
In the analysis of effects associated with the three
(see below).
types of readers, effect sizes were significantly smaller
The Panel examined the relationship between the for disabled readers than for at-risk and normal readers
socioeconomic status of students across studies and the on two outcomes, phonemic awareness and spelling. In
size of effects produced by PA training. As evident for fact, on the spelling outcome, no significant effect of
PA outcomes in Table 2, low and mid-to-high SES PA training was detected for disabled readers.
levels did not differ, and both levels showed large effect Moreover, the pool of spelling effect sizes for disabled
sizes in acquiring PA. However, transfer to reading and readers was homogeneous, indicating that no further
spelling was significantly greater among among mid-to- analysis of moderator variables was needed to locate
high SES than among low SES students (see Tables 3 cause and allowing us to conclude that PA training does
not improve spelling in disabled readers.
In the NRP database, there were 17 comparisons reveals that in the category of letters manipulated, the
involving disabled readers (18% of the total number dropped from 39 to 25 cases. Declines in the
comparisons). The Panel worried that conclusions about other categories listed in Table 5 were minimal. This
how moderator variables regulate the impact of PA verifies that disabled readers were unevenly distributed
training on phonemic awareness and spelling outcomes across levels of these moderators. The SES variable
might be different if cases involving disabled readers was not affected and hence not re-analyzed because
were removed from the database. As discussed above, most studies involving disabled readers did not report
in our analysis of English and non-English studies, the SES level of the readers.
findings changed for spelling outcomes with reading
In all but one analysis of spelling outcomes, the pattern
disabled cases eliminated. This was because the
of effect sizes changed when disabled readers were
distribution of disabled reader cases was uneven, with
removed from the database. PA teaching that focused
most cases falling in the English pool of effect sizes.
on one or two skills was no longer superior to multiple
There were other moderator variables with an uneven
PA skill teaching. (However, note in Table 5 that there
distribution of disabled readers across levels as well.
were only three cases left in the multiple skills category,
Disabled readers were older (mostly in grades 2
raising doubt about the reliability of this effect size.)
through 6), they tended to receive PA instruction
Small group instruction no longer produced better
involving multiple skills taught with letters, the
transfer to spelling than individual instruction. Training
instruction was individualized, it tended to be lengthy
periods lasting 20 or more hours were no longer less
(over 19 hours), and researchers or computers rather
effective than shorter training periods. Classroom
than teachers were most often the trainers.
teachers no longer differed from researchers in
To examine whether findings involving these facilitating transfer to spelling. In the analysis of spelling
moderators would be different without disabled readers, outcomes across grades, the 2nd through 6th grade
effect sizes were re-analyzed after removing disabled category had no comparisons to contribute effect sizes.
reader comparisons from the database. The following The loss of cases in the upper grades shows that
specific moderator variables were re-analyzed: PA disabled readers clearly dominated effect sizes in this
skills taught, use of letters, grade, language, training unit, category. The greater effect of PA training on spelling
teachers vs. others as trainers, and length of training. among kindergartners than 1st graders remained the
Computer effects were not re-analyzed because there same.
were too few cases.
There were two moderators that did not differentially
Findings involving spelling outcomes were altered for influence spelling or PA outcomes when the whole
several moderators when disabled readers were database was analyzed; but when disabled reader
removed. Findings involving PA outcomes were altered effects were removed, significant differences appeared.
for one moderator. However, findings were not altered As evident in Table 5, language now impacted spelling
at all in the analyses of reading outcomes. Results are effect sizes, with English-speaking students benefiting
given in Table 5 (Appendix D). more from PA training than non-English-speaking
students. Also, letter use now impacted phonemic
Comparison of the number of cases contributing effect
awareness effect sizes such that children who
sizes to spelling outcomes with and without disabled
manipulated letters acquired more PA than children
readers (Tables 4 vs. Table 5) reveals that the numbers
who did not. Removal of disabled readers rendered
dropped substantially in the following categories: three
findings for these moderators consistent across all three
or more PA skills taught (drop from ten to three cases),
outcomes. That is, language exerted the same impact
letters manipulated (from 27 to 17 cases), individual
on PA, reading, and spelling outcomes, with English
instruction (from 14 to 8 cases), small group instruction
producing larger effects than non-English. Also letter
(from 20 to 15 cases), training lasting 20 to 75 hours
use exerted the same impact on PA, reading and
(from 18 to 9 cases), researcher as trainer (from 30 to
spelling, with letter manipulation producing larger
20 cases), 2nd through 6th graders (from 8 to 0 cases),
effects than no letters.
English language (from 32 to 22 cases). The same
comparison for PA outcomes (Table 2 vs. Table 5)
In sum, these findings support the following conclusions. significantly greater than the effect size for
PA training does not improve spelling in disabled nonequivalent groups (d = 0.40). However, the opposite
readers, but it does improve spelling in normally was found on spelling outcomes, with nonequivalent
developing readers below 2nd grade and children at risk groups showing a significantly larger effect size (d =
for future reading problems. Among nondisabled 0.86) than random groups (d = 0.37). These findings
readers, the benefit to spelling is positive and does not show that larger effect sizes in our database did not
depend on whether one or two or multiple PA skills are consistently arise from weaker designs involving
taught, whether instruction is delivered to individuals or nonequivalent groups. Moreover, average effect sizes
to small groups, how long training lasts, or whether for the most rigorous assignment procedure, random
teachers or researchers are the trainers. However, the assignment, ranged from low-moderate to large.
benefit to spelling among nondisabled readers does
Some researchers in the database administered fidelity
depend upon the language, with PA training in English
checks to ensure that trainers adhered to prescribed
exerting a bigger impact on spelling than PA training in
training procedures, whereas other researchers did not,
other languages.
or at least did not report, doing this. A comparison
Regarding the acquisition of phonemic awareness by revealed that significantly larger effect sizes arose in
nondisabled readers, our findings support the conclusion studies not checking for fidelity than in studies checking
that PA training is more effective when it is taught by for fidelity. This was true across all three outcome
having children manipulate letters than when measures (see Tables 2, 3, and 4 in Appendix C).
manipulation is limited to speech. Although weaker studies involving lack of fidelity
checking were associated with larger effects, fidelity
It is important to note that the pattern of effect sizes on
studies nevertheless yielded significant effects that
reading outcomes remained unchanged when
were moderate in size. This verifies that lack of rigor in
comparisons involving reading disabled students were
fidelity checking does not explain effect sizes in the
removed. Specifically, teaching one or two PA skills still
NRP database.
resulted in larger effect sizes on reading than teaching a
multitude of PA skills. Small groups still produced Bus and van Ijzendoorn (1999) reported an unexpected
superior transfer to reading than individual instruction. finding in their PA meta-analysis, that studies using
Lengthy training periods still yielded smaller effects on treated control groups yielded larger effect sizes than
reading than shorter training periods. These findings studies using untreated control groups. This finding was
serve to sustain our conclusions about the influence of examined in the present meta-analysis. Results were
moderators on reading outcomes. mixed. On PA outcomes, the two types of control
groups did not yield significantly different effect sizes.
Design Features On reading outcomes, they did, with studies using
Studies in the database varied in methodological rigor. treated controls showing larger effects than those using
The Panel examined some of these properties to see untreated controls, consistent with Bus and van
whether design weaknesses inflated effect sizes. Ijzendoorn’s finding. On spelling outcomes, studies with
untreated controls showed larger effects than studies
Studies varied in whether or not subjects were
with treated controls, the reverse pattern.
randomly assigned to treatment and control groups. In
some cases, nonrandom, nonequivalent groups were The foregoing results emerged from an analysis of all
assigned to treatment and control conditions. In some the studies. However, these studies varied in many
cases, group assignment involved matching individual respects besides the type of control group they used. In
children on the basis of similar test scores. Effect sizes the NRP database, there were eight studies that
for the three assignment types were determined (see compared PA training to both a treated control group
Tables 2, 3, and 4 in Appendix C). Comparison of PA and an untreated control group. In limiting the analysis
outcomes revealed very similar effect sizes that did not to these studies, the Panel found that, out of 20
differ statistically and ranged from 0.83 to 0.92. comparisons, ten showed bigger effects in cases using
Comparison of reading outcomes revealed that the
effect size for randomly assigned groups (d = 0.63) was
treated controls and ten showed bigger effects in cases primarily to studies that were the least rigorous.
using untreated controls across the three outcome Comparisons were grouped into blocks of three or four
measures. Thus, the picture arising from this analysis in order to reveal effect sizes at the various levels of
was mixed. rigor.
Although the findings reveal no clear pattern favoring The findings are reported in Appendix E—Table 6 for
treated or untreated control groups, the fact that studies PA outcomes and Table 7 for reading outcomes. Both
using untreated controls did not uniformly yield larger tables reveal that effect sizes were significantly greater
effect sizes serves to challenge the commonly held than zero across all blocks on all five measures. This
belief that untreated control groups always yield larger shows that significant effect sizes were not limited to
effects. It is not the case that Hawthorne effects the weakest studies.
always prevail. Other factors appear to influence
In Table 6, reporting effects of PA training on PA
outcomes as well. Perhaps Hawthorne effects are
outcomes, it is apparent that across all five measures
more characteristic of older participants with better
the largest effect sizes occurred for the blocks
developed metacognitive sensitivities.
reflecting the most rigor. This shows that the best
Among studies in the NRP database, samples included designed studies produced the largest effect sizes on
as few as nine students or as many as 383 students. To the acquisition of PA.
examine whether effects differed as a function of
In Table 7, reporting effect sizes for reading outcomes,
sample size, the studies were divided into blocks of
the same pattern is evident but is not quite as strong.
approximately equal numbers of cases. Outcomes
The effect size associated with the most rigorous level
reported in Tables 2 to 4 reveal that larger effect sizes
is close to the strongest, if not the strongest, effect size
tended to occur in the smaller samples, whereas the
on four of the five measures: the two internal validity
smallest effect sizes occurred in the largest samples.
measures, the external validity critical flaws measure,
This is consistent with meta-analytic findings in general
and the overall rigor ranking. On the remaining
(Johnson & Eagley, in press). The fact that effect sizes
measure, percent of external validity criteria met, the
were significantly greater than zero even in the largest
effect size is moderately strong though less so than the
samples shows that the PA training effects observed
largest effect size. This evidence indicates that the
did not arise primarily from the weaker studies with
better designed studies tended to produce stronger
small samples.
transfer effects in reading than the weaker studies.
Recently Troia (1999) published a critique of phonemic
In sum, although Troia (1999) finds fault with PA
awareness training studies. He identified several criteria
training studies, his findings do not undermine claims
to assess methodological rigor and applied these criteria
about the effectiveness of PA training for helping
to 39 PA training studies of which 29 were in the NRP
children learn to read. Troia’s concluding plea, that
database. (The remaining studies did not assess reading
researchers maintain high standards in designing their
as an outcome so were not among the studies
studies, is supported by Panel findings that show that
considered.) The Panel incorporated his summary
researchers stand a better chance of obtaining sizeable
ratings into the NRP database and examined the
effects when they design strong studies than when they
relationship between these evaluations and effect sizes.
design weak studies threatened by violations to internal
Troia devised two measures and applied them to
and external validity.
evaluate the internal validity separately from the
external validity of studies: the percentage of criteria One final characteristic of studies examined was the
met and the number of critical flaws. Also he ranked year of publication. From Tables 2 and 3, it is apparent
the studies to indicate their overall methodological rigor. that there was one period in which a spate of PA
The Panel’s purpose was to consider and rule out the training studies was published, from 1991 to 1994. Over
possibility that effects of PA training were limited twice as many studies were published during this period
as during the other periods. The 1991 to 1994 studies PA skills yielded larger effects on PA learning than
also tended to yield larger effect sizes on PA and programs teaching three or more of these
reading outcomes than studies in time periods before or manipulations. Instruction that taught phoneme
after this. Why this occurred is not clear. manipulation with letters helped children acquire PA
skills better than instruction without letters. Facilitation
Discussion from letters was observed among at-risk readers and
normally developing readers below 2nd grade. It was
Summary of Findings
not possible to assess the contribution of letters among
To summarize results of the meta-analyses, the Panel disabled readers because most studies used letters to
examined 96 cases, each comparing a treatment group teach PA to disabled readers.
that received PA training, to a control group that
Teaching children in small groups produced larger
received an alternative form of instruction or no special
effect sizes on PA acquisition than teaching children
instruction; they examined effects on three main
individually or in classroom-size groups. Classroom
outcome variables, PA, reading, and spelling.
teachers produced large effect sizes, indicating that
PA training was found to be very effective in teaching they were very successful in teaching PA to students,
phonemic awareness to students. Effect sizes were although researchers produced somewhat larger
large immediately after training (d = 0.86), and they effects. Computers also taught PA effectively. The
remained strong over the long term (d = 0.73). PA length of training influenced PA acquisition. Effect sizes
training succeeded in teaching children various ways to were larger when PA instruction lasted from 5 to 18
manipulate phonemes, including segmentation, blending, hours than when either less or more time than this was
and deletion. PA training was effective in teaching PA spent.
skills across all levels of the moderator variables
Characteristics of students influenced how much
examined.
phonemic awareness they acquired from training.
PA training improved children’s ability to read and spell Disabled readers showed smaller effect sizes than at-
in both the short and the long term. The effect size was risk students or normally progressing readers, indicating
moderate following training on reading (d = 0.53) and that PA was harder for disabled readers to learn. Also
on spelling (d = 0.59). Tests of word reading, students in the lower grades, namely preschool and
pseudoword reading, and reading comprehension all kindergarten, showed larger effect sizes in acquiring PA
yielded statistically significant effect sizes on both than children in 1st grade and above. SES exerted no
experimenter-devised tests as well as standardized differential impact on learning PA. However, the
tests. Few instances occurred in which moderator language spoken by the children did. English-speaking
variables reduced effect sizes to chance levels, and children showed larger effects of training on PA
these were limited to spelling outcomes. Whereas PA acquisition than children learning in other languages.
training exerted strong effects on reading and spelling, it
These moderator variables also influenced how much
did not impact children’s performance on math tests.
transfer to reading and spelling resulted from PA
This indicates that halo/Hawthorne effects did not
training. The type of test used to measure reading and
explain findings and that training effects were limited to
spelling influenced effect sizes that were larger on
the targeted domain.
experimenter-devised tests than on standardized tests
Several moderator variables were found to influence measuring real word reading and spelling. Effect sizes
children’s acquisition of phonemic awareness. PA did not differ on experimenter-devised and standardized
training programs varied in whether children were pseudoword reading tests.
taught to manipulate phonemes in one, two, or multiple
Properties of training procedures influenced the extent
ways, and in the type of phoneme manipulations taught,
of transfer to reading. Teaching that focused on one or
segmenting, blending, deleting, identifying, or
two types of PA manipulations yielded larger effect
categorizing phonemes, or manipulating onsets and
sizes than teaching three or more PA skills. Teaching
rimes. Properties of the training procedures exerted an
children to manipulate phonemes using letters produced
impact. Programs that focused on teaching one or two
bigger effects than teaching without letters. Blending difference. Kindergartners benefited more in their
and segmenting instruction showed a much larger effect spelling than did 1st graders. Students classed as mid-
size on reading than multiple-skill instruction did. Small to-high SES showed a larger effect size in spelling than
group instruction produced larger effect sizes on low SES students. PA training in English produced a
reading than individualized instruction or classroom larger effect on spelling than PA training in other
instruction. Length of training exerted an influence as languages.
well, with the lengthiest training associated with the
Features of the design of experiments were related to
smallest effect size. Classroom teachers provided PA
effect sizes. Findings indicated that rigorous designs
training that was effective in promoting transfer to
yielded strong effects. The majority of the studies used
reading although the effect size of teachers was smaller
random assignment, and their effect sizes on PA and
than the effect size of other trainers. PA training on
reading outcomes ranged from moderate to large.
computers transferred to reading as well.
About one-third of the studies checked on whether
Characteristics of learners influenced the extent that trainers remained faithful to treatment procedures.
PA training transferred to reading. Effect sizes on Effect sizes in these studies were significant and
reading were large for at-risk readers while they were moderate in size. Some studies compared PA treatment
moderate for disabled and normally developing readers. groups to control groups that were given some other
Preschoolers exhibited a much larger effect size on treatment while other studies used untreated control
reading than did the other grade levels whose effect groups. Neither type of control group consistently
sizes did not differ. SES made a difference, with mid-to- produced larger effect sizes. Failure to find larger
high SES associated with larger effects than low SES. effects for untreated than for treated control groups
Also larger effect sizes were evident in reading for indicates that Hawthorne effects did not inflate effect
English-speaking children than for children speaking sizes. Studies using smaller samples of children tended
other languages. to have larger effect sizes than studies using larger
samples, a finding consistent with other meta-analyses.
Analysis of moderator variables as they affected
However, even in the largest samples, effect sizes were
spelling outcomes was complicated by the fact that PA
positive and significant.
training did not help disabled readers improve in spelling
and the pool of spelling effect sizes for disabled readers The Panel also assessed the relationship between
was homogeneous, indicating that further analyses using methodological rigor and effect size by applying Troia’s
moderators was not necessary to explain the result. The (1999) criteria to the NRP studies. On PA outcomes,
effects of moderators were re-analyzed with disabled the best designed studies produced the largest effect
readers removed from the database. Conclusions sizes on all five measures of rigor. On reading
regarding the effects of moderator variables on spelling outcomes, effect sizes associated with the most
outcomes thus centered on the nondisabled readers. rigorous level were close to the largest, if not the
largest, effect sizes on four out of five measures: two
The only characteristic of PA training that influenced
internal validity measures, one external validity
spelling outcomes for nondisabled readers was the use
measure, and the overall ranking of rigor. This indicates
of letters. Children who were taught to manipulate
that the better designed studies produced larger transfer
phonemes with letters benefited more in their spelling
effects in reading than the weaker studies. In sum,
than children whose manipulations were limited to
findings show that larger effect sizes did not arise
speech. Whether instruction focused on one or two
mainly from weaker studies that were flawed by threats
skills or on multiple skills did not influence spelling in
to internal and external validity.
nondisabled readers. Instruction delivered to individuals
was as effective as instruction delivered to small Interpretations and Issues
groups, and both were more effective than classroom-
size groups. The length of training exerted no Results of the experimental studies allow the Panel to
differential impact on spelling outcomes. Whether the infer that PA training was the cause of improvement in
trainer was a teacher or a researcher made no students’ phonemic awareness, reading, and spelling
difference. Characteristics of learners did make a performance following training. These findings were
replicated multiple times across experiments and thus It is noteworthy that low SES children were found to
provide solid support for causal claims. However, benefit as much from PA training as middle-to-high SES
results of the analysis of moderator variables rest on children in acquiring phonemic awareness. This runs
more tentative ground. Assessing features of the counter to Dressman (1999) who argues that low SES
studies that were associated with stronger or weaker children will exhibit low PA in research studies because
effect sizes is at root a correlational endeavor and thus their phonological systems differ from that of testers
precludes strong inferences about cause. The primary and because they suffer from inhibition when tested by
difficulty is that a third unknown factor may lie in the sociolinguistically foreign researchers. Dressman bases
background explaining the relationships observed. his expectations on studies showing that low SES
Although findings are suggestive, any conclusions must children perform more poorly on PA tests than middle-
remain tentative because multiple explanations are class children. He ignores evidence examining how
possible. In this section, potential misinterpretations of much low SES children gain in PA when they receive
the findings and issues needing further attention from training. According to the NRP findings, low SES
researchers are considered. children can benefit as much from training as middle-to-
high SES children, despite being phonologically or
The studies in the NRP database included investigations
culturally different from the trainers.
of children at risk for future reading problems as well as
children low in SES. However, contrary to the common One very striking finding was that in contrast to at-risk
view that the criteria for identifying at-risk readers and normally developing readers, disabled readers’
includes being economically disadvantaged, authors of spelling did not benefit at all from PA training. Various
the studies investigating at-risk readers did not reasons for this can be entertained. Other studies have
uniformly require them to be low in SES. In fact, of the found that disabled readers have special difficulty
cases investigating at-risk readers, only 27% were low learning to spell (Bruck, 1993). Perhaps processing
in SES while 37% were middle-to-high SES, and the difficulties associated with being reading disabled make
SES of the remainder was not specified. At risk was spelling especially hard to learn. Alternatively, perhaps
defined by low phonemic awareness in 77% of the PA training fails to help older disabled readers with their
cases. In defense of these studies, research findings spelling because the types of words that are spelled in
show that one of the two best predictors of reading higher grades require knowledge of spelling patterns
success is phonemic awareness (Share et al., 1984), so rather than phonemic segmentation and knowledge of
selecting at-risk readers by measuring their PA makes individual letter-sound correspondences. Effects of PA
sense. However, because the training targeted this skill, training on spelling may be limited to less complex
large effect sizes may be less surprising. words that are more phonemically transparent, those
taught to beginning readers.
The fact that studies in the NRP database departed
from the common conception of what it means to be at According to NRP findings, children who received
risk serves to reconcile discrepancies between results training that focused on one or two PA skills exhibited
for at-risk readers and results for low SES readers. The stronger PA and stronger transfer to reading than
Panel found that at-risk children showed large effect children who were taught three or more PA skills.
sizes in acquiring PA (d = 0.95) and in transferring Various explanations might account for the difference.
these skills to reading (d = 0.86) and spelling (d = 0.76). Perhaps focused instruction resulted in more students
Low SES children also showed large effect sizes in mastering the skills that were taught. Perhaps teaching
phonemic awareness (d = 1.07) and spelling (d = 0.76), multiple skills created some confusion about which
but only a moderate effect size in reading (d = 0.45). manipulations to apply in the reading transfer tasks, or
Smaller effect sizes in reading among low SES children perhaps it obscured children’s grasp of the alphabetic
than among at-risk children is explained by the fact that principle. Clarifying why multiple skills instruction might
the majority of the at-risk children were not low in SES. limit children’s gains in PA and reading needs further
Based on these findings, one would expect at-risk study. However, the findings suggest that when multiple
children who are both low PA and low SES to exhibit
large gains in PA and spelling as a result of PA training
but to exhibit moderate gains in reading.
PA skills are the objective, it is prudent to teach one at One surprising finding in the analysis involved the
a time until each is mastered before moving on to the relationship between training time and outcomes. Effect
next, and to teach students how each skill applies in sizes were larger when PA instruction lasted between 5
reading or spelling tasks. and 18 hours than when either less or more time was
spent training students. However, caution is needed in
More important than the number of PA skills to teach is
interpreting this finding because multiple explanations
the question of which skills should be taught to children.
are possible. Perhaps the goals of instruction were
In all of the studies, children were given PA instruction
more complex in longer programs. Perhaps the students
that was considered appropriate for their level of
receiving instruction were harder to teach. Perhaps
literacy development. The manipulations taught to
spending many hours in PA training deprived students
preschoolers were quite different from the
of the reading instruction benefiting control groups.
manipulations taught to older students. Easier PA tasks
Perhaps PA instruction is valuable mainly in helping
were taught to younger children or to less mature
children achieve basic alphabetic insight. Going beyond
readers while harder PA tasks were taught to older
this by adding further nuances or complexities may
readers. Factors making PA tasks easy or difficult
erode learning by producing confusion or boredom.
include the type of manipulation applied to phonemes,
These are only some of the possible reasons why longer
the number and phonological properties of phonemes in
training sessions might have produced smaller effect
the words manipulated, whether the words are real or
sizes. Questions regarding the optimum length of PA
nonwords, and whether letters are included. To
training and factors determining optimum length invite
illustrate, the following tasks are ordered from easy (1)
further research. However, two conclusions seem self-
to difficult (6) based on findings of Schatschneider,
evident: that length of training should be regulated by
Francis, Foorman, Fletcher, and Mehta (1999):
how long it takes students to acquire the PA skills that
1. First-sound comparison—identifying the names of are taught and that the NRP findings should not be
pictures beginning with the same sound translated into any prescriptions regarding how long
teachers should spend teaching PA.
2. Blending onset-rime units into real words
One important moderator variable that was not
3. Blending phonemes into real words considered in the analysis is dialect because none of the
4. Deleting a phoneme and saying the word that studies paid attention to this variable. However, regional
remains differences at the phonemic level of language are likely
to be important. For example, vowel phoneme
5. Segmenting words into phonemes categories are not the same across the United States.
6. Blending phonemes into nonwords. Some dialects make more phonemic distinctions among
vowels than other dialects. Vowels in the three words,
In the illustrative studies described below, tasks that are marry, Mary, and merry are pronounced identically in
appropriate to teach at different grade and reader levels some areas of the West but differently in some areas of
can be seen. The final decision about which PA the East. As a result, no generalizations about these
manipulations to teach should take account of several vowel phonemes will suit everyone receiving PA
factors, not only task difficulty, but also whether or not instruction. Another dialectal difference involves
students can already perform the manipulations being preserving or deleting the final consonants in words, for
taught as determined by pretests, and the use that example, past-tense markers such as the /t/ in looked.
students are expected to make of the PA skill being More research on the impact of dialectal variations on
taught. The reason to teach first-sound comparisons is PA learning is needed. The fact that regional phonemic
to draw preschoolers’ or kindergartners’ attention to the variations exist means that teachers implementing PA
fact that words have sounds as well as meanings. A training programs need to be aware of their students’
reason to teach phoneme segmentation is to help dialects and whether they deviate from the phonological
kindergartners or 1st graders generate more complete systems that are assumed in the programs. Ignoring
spellings of words. The reason to teach phoneme deviations is likely to undermine the credibility of the
blending is to help 1st graders decode words. instruction.
words are read in text, students may apply decoding segmenting and blending with letters. The best
skills, or they may combine grapheme-phoneme cues approach is for teachers to assess students’ PA prior to
with meaning cues to derive the word (Tunmer & beginning PA instruction. This will indicate which
Chapman, 1998). children need the instruction and which do not; which
children need to be taught rudimentary levels of PA, for
It is important to note that acquiring phonemic
example, segmenting initial sounds in words; and which
awareness is a means rather than an end. PA is not
need more advanced levels involving segmenting or
acquired for its own sake but rather for its value in
blending with letters.
helping children understand and use the alphabetic
system to read and write. This is why including letters In the rush to teach phonemic awareness, it is important
in the process of teaching children to manipulate not to overlook the need to teach letters as well. The
phonemes is important. PA training with letters helps NRP analysis showed that PA instruction was more
learners determine how phonemes match up to effective when it was taught with letters. Using letters
graphemes within words and thus facilitates transfer to to manipulate phonemes helps children make the
reading and spelling. transfer to reading and writing. However, teaching
children all the letters of the alphabet is not easy,
It is important to recognize that children will acquire
particularly when they come to school knowing few of
some phonemic awareness in the course of learning to
them. There are 52 capital and lower-case letter
read and spell even though they are not taught PA
shapes, names, and sounds to learn. The shapes of
explicitly. The process of learning letter-sound relations
many letters are similar, and, therefore, easily confused
and how to use them to read and spell enhances
with one another. Letter learning requires retaining
children’s ability to manipulate phonemes. This is
shapes, names, and sounds in memory and, in fact,
indicated by evidence that people who do not learn to
overlearning them so that letters can be processed
read in an alphabetic system do not develop PA (Mann,
automatically in reading and writing words (Adams,
1987; Morais et al., 1987; Read et al., 1987). It is also
1990). Thus, to ensure that instruction in phonemic
indicated by the fact that, in many of the studies
awareness is effective, it needs to include instruction in
reviewed, control groups showed improvement in
graphemes as well as instruction in the connections
phonemic awareness from pretests to posttests, very
between graphemes and phonemes to read and spell
likely because of the reading and writing instruction
words.
they received in their regular classrooms. However, the
extent of PA needed to contribute maximally to In addition to teaching PA skills with letters, it is
children’s reading development does not arise from important for teachers to help children make the
incidental learning or instruction that is not focused on connection between the PA skills taught and their
this objective. This is indicated by the finding that application to reading and writing tasks. In most of the
children receiving explicit training in PA gained much studies reviewed, researchers did not do this when they
more PA and reading skill than children in the control presented the transfer tasks to students following
groups. training. Despite this, significant and sizable transfer
effects were observed. In a study by Cunningham
It is important to recognize that children will differ in
(1990), who did examine application effects, students in
their phonemic awareness and that some will need
one group not only were taught to segment and blend
more instruction than others. In kindergarten, most
but also were shown how to apply these skills in reading
children will be nonreaders and will have little phonemic
words. Another group received the same PA training
awareness; therefore, PA instruction should benefit
but not the application training. Effect sizes on reading
everyone. In 1st grade, some children will be reading
outcomes were much larger when 1st graders received
and spelling while others may know only a few letters
the application instruction than when they did not. This
and have no reading skill. The nonreaders will need
suggests that results of the NRP meta-analysis actually
much more PA and letter instruction than those already
underestimate the magnitude of effects that would
reading. Among readers in 1st and 2nd grades, there
result if children received explicit instruction and
may be variation in how well children can perform more
practice in applying PA skills in their reading and
advanced forms of PA, that is, manipulations involving
writing.
It is important to note that when PA is taught with spelling activities by making it more systematic,
letters, it qualifies as phonics instruction. When PA thorough, and explicit, is likely to improve these
training involves teaching students to pronounce the programs’ success in helping children learn to read and
sounds associated with letters and to blend the sounds spell.
to form words, it qualifies as synthetic phonics. When
PA training involves teaching students to segment Classroom Instruction in PA: Some
words into phonemes and to select letters for those Illustrations
phonemes, it is the equivalent of teaching students to NRP findings show that PA training programs
spell words phonemically, which is another form of implemented by teachers in classrooms are effective in
phonics instruction. These methods of teaching phonics teaching phonemic awareness to students, and this
existed long before they became identified as forms of training boosts children’s reading and spelling
phonemic awareness training (Balmuth, 1982; Chall, performance. To identify characteristics of programs
1967). Although teaching children to manipulate sounds that were used successfully by classroom teachers, the
in spoken words may be new, phonemic awareness Panel examined a few illustrative studies selected from
training that involves segmenting and blending with a total of 15 (Blachman, Ball, Black, & Tangel, 1994;
letters is not. Only the label is new. Explicit instruction Brady, Fowler, Stone, & Winbury, 1994; Brennan &
in the alphabetic principle necessarily includes attention Ireson, 1997; Bus, 1986; Haddock, 1976; Kennedy &
to phonemes because these are the phonological units Backman, 1993; Kozminsky & Kozminsky, 1995; Lie,
that match up to letters. According to NRP findings, it 1991; Lundberg, Frost, & Petersen, 1988; McGuinness,
is likely that the inclusion of phonemic awareness McGuinness, & Donohue, 1995; O’Connor, Notari-
training in phonics instruction is a key component Syverson, & Vadasy, 1996; Olofsson & Lundberg,
contributing to its effectiveness in teaching children to 1983; Schneider, Kuspert, Roth, Vise, & Marx, 1997;
read. Tangel & Blachman, 1992; Williams, 1980).
It is important to note that various approaches to One 8-month-long, carefully structured program for
beginning reading instruction may provide at least some kindergartners was developed and tested by Lundberg,
phonemic awareness training although it may not be Frost, and Peterson (1988). Twelve classroom teachers
presented systematically or thoroughly enough to in Denmark taught children daily to attend to sounds in
maximize its contribution to reading and writing. Whole speech and to manipulate sounds through games and
language instruction that teaches students to invent exercises that increased in difficulty as the year
spellings by detecting phonemes in words and progressed. The program began with easy listening
representing them with letters offers a form of PA activities followed by rhyming exercises. Then
training. In Reading Recovery© (RR), students may kindergartners learned to segment sentences into words
acquire phonemic awareness through the spelling and to focus on the length of words in speech. Then
instruction they receive (Clay, 1985). Three studies in words were analyzed into syllables. For example,
the database compared outcomes of standard whole children listened to a troll who spoke peculiarly, syllable
language instruction, or RR instruction, to outcomes of by syllable, and they figured out what he said. Phoneme
the same instruction with PA training added (Castle et analysis was introduced in the 3rd month by having
al., 1994; Hatcher et al., 1994; Iversen & Tunmer, children identify phonemes in initial positions of words,
1993). Overall effect sizes were variable ranging from mainly continuants and vowel sounds which are easy to
negative to large positive (see Appendix and illustrative stretch out and hold. The teacher helped children find
studies below). One factor possibly limiting outcome the sounds by stretching them, for example,
differences between treatment and control groups is the “Mmmmmark” or by repeating the stop consonants that
extent to which control students acquired PA from the cannot be held, for example, “T-T-T-Tom.” Children
instruction they received. Although whole language also practiced adding and deleting phonemes from
programs and RR programs include some phonemic words. In the 5th month of the program, phoneme
awareness training, findings of the NRP meta-analysis segmentation and blending were introduced, first with
indicate that strengthening the training offered in
two-phoneme words and then longer words. Many of Foorman, Lundberg, & Beeler, 1998). Whether
the activities were designed for children’s enjoyment teachers need further help beyond the manual to
and consisted of dancing, singing, and other implement the program effectively with their students
noncompetitive social games. needs to be studied.
Teachers were trained in an inservice course that The Danish program did not include letter manipulation.
provided theoretical background as well as videotaped However, the meta-analysis showed that when PA is
examples of training sessions. They practiced and taught with letters, it is more effective. A program for
refined the skills necessary to teach the program during kindergartners that included letters was developed and
the year prior to implementing it. Teachers of the tested by Blachman and her colleagues (Ball &
control group followed the regular preschool program, Blachman, 1991; Blachman et al., 1994; Tangel &
which emphasized social and aesthetic aspects of Blachman, 1992). Blachman et al. (1994) taught 10
development rather than cognitive and linguistic teachers and their teaching assistants to deliver PA
aspects. Treatment and control schools were located in training to low-income, inner-city kindergartners.
geographically distant parts of Denmark. Children were taught in groups of four or five for 15 to
20 minutes per day, 4 times each week. The program
The Danish program was adapted and tested by other
lasted 11 weeks. The teachers were trained in seven 2-
researchers including Schneider et al. (1997) who
hour inservice workshops, during which they were
taught PA to German kindergartners. His study included
taught a theoretical framework; they practiced
two experiments and a total of 22 teachers who taught
instructional activities; and they asked questions about
PA in the treatment conditions. Control groups received
ways of implementing the program.
the regular kindergarten curriculum. The second
experiment was conducted to improve on the first. A key activity in Blachman et al.’s (1994) program was
Teacher training was less extensive in the German the “say it and move it” procedure. Children learned to
study than in the Danish study. It lasted 2 months and move a blank tile down a page as they pronounced each
included theoretical background and tutoring sessions in phoneme in a word. After children practiced
which teachers practiced the games and exercises and segmenting two- and three-phoneme words in this way,
received feedback. letter-sound correspondences were taught and they
practiced segmenting the words with blank markers and
In both the Danish and German studies, training
letters. Additional segmentation activities were included
produced large effect sizes on the acquisition of
such as moving markers into Elkonin boxes to represent
phonemic awareness, ranging from 0.70 to 0.82. Effect
phonemes in three-phoneme words. A variety of games
sizes on reading outcomes were small to moderate
was used to reinforce grapheme-phoneme
when measured the following year in 1st grade: d =
correspondences. The control group in this study
0.19 (Denmark), d = 0.26 and 0.45 (Germany).
followed a traditional kindergarten curriculum that
An adaptation of the Danish program was tested with included instruction in letter names and sounds. Results
English-speaking kindergartners by Brennan and Ireson of the study were very positive. Children receiving PA
(1997). However, only one teacher and her class of 12 training outperformed controls on PA tasks, with an
students formed the PA treatment group, which was effect size of d = 1.83, and training transferred to
compared to one no-treatment control class. Although reading, d = 0.65, and to spelling, d = 0.94.
this is a weaker design yielding less reliable findings, the
Another program in the NRP set of studies was
effect size was impressive. The impact of training on
administered by teachers to small groups of older
word reading was large, with an effect size of d = 1.17.
disabled readers. Williams (1980) developed and tested
This provides some evidence that the Danish program
the ABDs program, which taught students ages 7 to 12
can be used effectively in American classrooms. A
to segment and blend phonemes first in speech and then
translation of the program has been published (Adams,
using letters. Children worked with a limited set of
seven consonants and two vowels. Lessons progressed
from segmenting words into syllables to segmenting
words into phonemes, at first two phonemes and then
three phonemes. Then blending was applied to the same the extra time in metacognitive activities that included
words. Children performed manipulations with wooden learning about the goals and purposes of each PA
markers at first and letters later on. Their work blending manipulation, reviewing how that lesson related to
letters was the equivalent of learning to decode, and previous lessons, and observing and practicing how to
their work segmenting with letters was equivalent to use the skill for reading. The control group spent equal
learning to spell the sounds in words. More letters were time engaged in a story listening treament.
added to the set later in the program. Words with
Results showed that at the end of PA training, the two
consonant clusters were introduced. Finally two-syllable
treatment groups outperformed the control group on
words were added. The program included various
measures of PA and reading in both grades. In addition,
games, worksheets, and activities to teach these skills.
1st graders who had received both PA and
Teachers attended a half-day session to learn about the metacognitive training achieved higher reading scores
program, which was fully presented and described in a than 1st graders receiving only PA training. One
manual. The 17 teachers were asked to use the possible reason why the advantage was limited to 1st
program 20 minutes daily. Their instruction was closely grade is that 1st graders, but not kindergartners, were
monitored. Although there were 12 units, only a few receiving formal reading instruction concurrently in their
teachers got through the entire program in the 26-week classrooms, so they had a chance to apply their PA
period. knowledge on a daily basis. In fact, some 1st graders
told the experimenter that they used what they had
Williams evaluated the ABDs program again the
been taught to decode words in their classroom reading
following year, this time not with volunteer teachers but
groups. These findings indicate that a metacognitive
with 20 teachers who were mandated to use the
component may be valuable in providing a bridge
program. They completed on average 6.6 units, about
between PA skills and reading processes. This may be
half the program. The treatment groups were compared
particularly true in PA programs that do not teach
to untreated control groups. The influence of PA
phoneme manipulation with letters.
instruction on students’ ability to decode words and
nonwords was measured at the end of training. Effect The ADD program (Auditory Discrimination in Depth)
sizes were large, d = 1.05 for the 1st year, and d = 0.97 was developed by Lindamood and Lindamood (1975) to
for the 2nd year. This indicates that the ABDs program teach PA. The unique feature of this program is that it
was highly effective at teaching decoding skill to teaches children to identify and monitor articulatory
disabled readers. gestures associated with phonemes. As already
discussed, phoneme segmentation is difficult because
Other Programs to Teach PA there are no boundaries in speech telling us where one
Various programs were used to teach PA across phoneme ends and the next begins. Rather phonemes
studies. Presenting descriptions of these programs are coarticulated to produce speech without any seams.
serves to clarify how studies in the database were One very helpful way to identify separate phonemes is
structured and the variety of ways that PA was taught. to monitor the changes that occur in the mouth as one
Some programs had special features that enhanced pronounces words. This involves directing attention to
their effectiveness. In the study by Cunningham (1990), the position and shape of the lips and tongue. For
one treatment group was taught metacognitive skills example, there are three phonemes in meat and these
along with PA. Cunningham worked with normally are reflected in three successive mouth movements:
progressing readers in kindergarten and 1st grade. A your lips closing for /m/, your lips opening into a smile
puppet was utilized to interact with children. PA training shape for the vowel, then your tongue tapping the roof
was limited to the oral mode, with no letter-sound of your mouth for /t/. Pictures of mouth positions can be
instruction. Training was conducted in small groups for used to help children distinguish phonemes in
10 weeks. Three treatments were compared. One pronunciations of words. Also, mirrors help children
treatment group received PA training in segmenting and explore what their own mouths are doing when they
blending phonemes. Another group received a pronounce words.
somewhat abbreviated version of this training and spent
Four studies in the NRP database implemented the children were shown a word and identified it from two
ADD program to teach PA (Kennedy & Backman, spoken choices (e.g., “Does this [sat] say sat or
1993; McGuinness et al., 1995; (Wise, King, & Olson, mat?”). Trained students read more words than control
1999; Wise, King, & Olson, in press). Children received students, indicating that PA training improved
extensive training discovering and categorizing the preschoolers’ rudimentary word recognition skill.
various phonemes in English by analyzing their own
These researchers also investigated the long-term
mouth movements, often using mirrors. They learned to
impact of PA training (Byrne & Fielding-Barnsley,
label these sounds, for example, lip poppers, tip tappers,
1993, 1995). Children were tested during the next 3
and scrapers. They learned to track movements in
years in school. At the end of kindergarten, trained
spoken words in order to identify the separate
children were only slightly superior to controls in PA,
phonemes and then to represent the phonemes with
indicating that learning to read had narrowed the gap in
graphemes. Effect sizes on reading outcomes were
PA between the two groups. At the end of each
variable, ranging from 1.22 for 1st graders
successive grade, the PA-trained group read
(McGuinness et al., 1995) to 0.15 for older disabled
significantly more pseudowords than controls, indicating
readers (Wise, King, & Olson, 1999).
that PA training benefited children’s decoding skill. At
An example of a program focused on teaching only one the end of 2nd grade, there was a marginal difference
type of phoneme manipulation was that studied by in reading comprehension favoring the PA-trained
Byrne and Fielding-Barnsley (1991) for preschoolers, students. However, the 2nd graders did not differ in
called Sound Foundations. This program taught reading real words or in spelling words.
phoneme identity. Children learned to recognize
One possible reason why long-term training effects
instances of the same sound in both initial and final
were not stronger in this study is that the formal reading
positions across different words. The following sounds
and spelling instruction that children received in school
received primary attention: /s/, /š/ as in ship, /l/, /m/, /p/,
was sufficiently effective to compensate for the
/t/, /g/, /ae/ as in bat, /ε/ as in bet. Children were shown advantage provided by preschool training in PA. Also,
several large posters covered with pictures of objects. the PA training that students received was focused
Their job was to pick out from a larger set the objects rather than comprehensive and amounted to only 6
having a specified beginning or ending sound, for hours total. It may take a more comprehensive and
example, sea, seal, sailor, sand. Also, children were extensive training program to exert stronger long-term
shown an array of pictures on worksheets or cards, and effects.
they selected those having targeted sounds. In each
session, one phoneme in one position was taught. The The effectiveness of different ways to teach PA was
letter representing that phoneme was introduced as examined by O’Connor et al. (1995), who inquired
well. whether PA training has to be broad rather than
focused to be most effective. They selected at-risk
In this study, preschoolers averaging 4.5 years of age kindergartners with low PA and randomly assigned
received either the PA training described above or them to one of three training conditions. In the
control training that focused on story reading and comprehensive treatment, children performed a variety
semantic activities with the same posters and of sound manipulation activities that included isolating,
worksheets. Children were trained in groups of 4 to 6 segmenting, blending, and deleting phonemes;
children, one 30-minute lesson per week for 12 weeks. segmenting and blending syllables and onset-rime units;
At the end of training, children in the PA-trained group and working with rhyming words. In the focused
were able to identify substantially more initial and final treatment, children practiced segmenting and blending
phonemes in words than control students. They onsets, rimes, and phonemes only. Training extended for
demonstrated superior skill identifying not only sounds 10 weeks, two 15-minute sessions per week, totaling 5
they had practiced but also unpracticed sounds, hours. Beginning in the 5th week, letter-sound
indicating that phoneme identity skill transferred to associations were taught for the sounds being practiced
untaught phonemes. These researchers also gave
students a simplified word reading task in which
orally in both groups. However, children were not practiced and received feedback in learning to read and
taught how to use letters to manipulate phonemes in the learning to spell two-phoneme words. These words
PA activities. The third treatment, a control condition, were formed from the same letter-sounds but they had
received only the letter-sound instruction. not been taught during training.
Comparison of phonemic awareness following training Results showed that the groups learned the PA skill that
showed that the treated groups performed equally well they were taught but performed poorly on the untaught
and both outperformed controls, indicating that both skill. This indicates that teaching students either
types of training were equally effective in teaching PA. segmentation or blending does not improve their
To measure transfer to reading, a simplified word performance in the other skill. On the measures of
learning task was devised. After children learned to reading and spelling, both the segmentation and
associate four letters and sounds, they were given combination groups performed similarly and
practice learning to read five words composed of the outperformed the control group. However, the blending
letters and sounds: am, at, mat, sat, sam. Each word group did not do better than the control group. This
was taught by saying, “This is aaaaat, at.” Results indicates that teaching beginners to segment is as
revealed that only the focused group learned to read the effective for learning to read words as teaching
words in fewer trials than the control group, not the beginners to segment and blend. In contrast, teaching
comprehensive group. This suggests that concentrating beginners only to blend is not effective. These findings
instructional time on segmenting and blending may were replicated in a similar study by Torgesen et al.
contribute more to reading skill than diverting attention (1992).
to many PA activities. These findings are consistent
Although blending made a poor showing in these
with those in the NRP meta-analysis indicating the
studies, Reitsma and Wesseling (1998) reported more
greater impact of segmenting and blending than
success in a study with kindergartners in the
multiskill instruction on reading outcomes. One might
Netherlands. They used a computer to teach
question the use of a simplified word reading task to
kindergartners how to blend three-phoneme Dutch
draw inferences about general reading acquisition.
words (e.g., lief, geit, met). No limits were placed on
However, these kindergartners were beginning readers
the variety of phonemes in the words. All phoneme
so that more advanced reading tests would have been
manipulations were conducted in speech without any
too difficult.
letters. First, children were taught a set of vocabulary
The separate and combined contributions of instruction words, and then these were used in various blending
in segmentation and blending were examined by exercises. Children listened to a sequence of segmented
Davidson and Jenkins (1994), who gave kindergartners sounds, and then clicked on the picture corresponding to
with low PA one or another of four types of training. In that word. Children listened to two successively
the segmentation treatment, each word was segmented words and clicked “same” or “different.”
pronounced, and children were taught to say its Children listened to words, either pronounced as wholes
separate sounds. In the blending treatment, children or segmented, and then had to find which of several
listened to the separate sounds and learned to blend boxes on the screen contained the other form of the
them into words. In the segmentation-and-blending word. If a whole word was heard, they had to find its
treatment, children learned first to segment, then to segmented form. If a segmented word was heard, they
blend the words. In the control condition, children had to find its whole form. In all these exercises, the
listened to stories. Children were taught to a criterion of incorrect word choices differed by several phonemes
mastery. The words and nonwords analyzed during from the correct choice for some items but only by one
training had two phonemes formed out of continuant phoneme for other items, making processing more
consonants and long vowels (e.g., my, vo, low, way). At difficult. In the control group, children completed
the end of training, all students were taught eight letters vocabulary exercises on the computer.
for the sounds that treatment groups had practiced.
Then two literacy tests were given in which children
At the end of kindergarten, PA tests of children’s ability mastered shorter words before advancing to longer
to blend and to segment words revealed superior words. Children in the control group practiced matching
blending performance by the trained group over the the ten letters and sounds in isolation. Articulatory
control group, but no difference in segmentation gestures and letter names were used to correct their
performance. Thus, training effects were limited to errors as well. On posttests after training, effect sizes
blending which was the skill taught, and blending skill were large on measures of segmentation and spelling.
did not transfer to segmentation. The following year, in The measure of reading involved giving children
1st grade, children’s ability to read words was practice learning to read 12 similarly spelled words for
examined. Long-term effects of the blending exercises several trials. The words were spelled phonemically
were evident because trained children read more words with the letter-sounds taught, for example, SEL (seal),
than control children. However, no effects on spelling SNAK (snake), SLIS (slice). The effect size was large,
were detected. These results suggest that extensive d = 0.97. These findings indicate that teaching children
training to develop blending skills does benefit reading to segment and spell helps them learn to read as well as
acquisition. Blending is thought to contribute to reading spell words.
by enabling children to decode new words they have
In many PA training studies, the instructional context
not yet learned to read. Also, findings indicate the
was not considered. However, there were some
effectiveness of using computers to teach PA to
exceptions. Iversen and Tunmer (1993) incorporated
kindergartners.
PA training into Clay’s (1985) Reading Recovery©
One instructional activity that is maximally effective for program to examine whether systematic instruction in
teaching PA in a way that builds a bridge to reading and PA would make the program more effective. At-risk
spelling is that of teaching children to invent readers in 1st grade were assigned to one of three
phonemically more complete spellings of words. groups, a group receiving standard Reading Recovery©
Typically, kindergartners who know letter names or instruction, a group receiving modified RR instruction,
sounds can represent the more salient sounds in words and an alternative, non-RR intervention group. In the
such as beginning and ending sounds, for example, modified RR treatment, after children had learned most
writing B to spell beaver or R to spell arm. Sometimes letters, they manipulated magnetic letter forms to make,
their spellings are not conventional, for example, writing break, and build new words having similar spellings and
Y to spell wife. However, the important achievement is pronunciations, for example, reading and and then
that they can distinguish sounds in words. Once they changing it to hand, sand, band. Training progressed
can do this, then teachers can help them detect from initial sounds to final sounds and then to medial
additional sounds in words and learn conventional sounds. Children added, deleted, and substituted letters
spellings for those sounds. in their manipulations and also read the changed words.
Later, the task becomes a writing rather than a
In a study by Ehri and Wilce (1987b), kindergartners
manipulation task.
were taught individually how to generate phonemic
spellings of words and nonwords by segmenting words Findings showed that both forms of RR enabled
into phonemes and selecting letters representing those children to reach prescribed reading levels that qualified
phonemes. Children who qualified for the study could them to exit the remedial program. However, children
already name the six consonant and four vowel letters who received modified RR attained prescribed levels
that were used in training. All names contained the more quickly than children receiving the standard
relevant sounds in their names (T, S, N, L, K, P, A, E, program (i.e., a mean of 41.75 lessons for modified RR
I, O). vs. 57.31 lessons for standard RR). This indicates that
adding PA training improved RR by increasing its
Instruction began with two-phoneme words and
efficiency. At the end of training, however, both groups
nonwords and progressed to three-phoneme words and
performed at very similar levels on PA outcomes and
words with consonant clusters. Children were helped to
reading outcomes, indicating that both forms of the RR
break words into phonemes by directing their attention
to articulatory gestures. They were helped to select
letters by focusing on sounds in letter names. They
program enabled children to attain similar levels of PA pronounced whatever the child assembled, and the child
and reading. On followup tests given at the end of the continued to manipulate letters until achieving a match.
school year, performance of the groups remained very Then the computer asked the child to change the word
similar. to feem. Lessons began with two-phoneme words and
progressed to longer words. There were several other
Hatcher et al. (1994) also examined whether adding PA
PA activities besides this one.
training to a Reading Recovery© program would
improve its success. The participants were 7-year-old On the posttests, PA-trained children outperformed
poor readers. The PA training that was added to RR controls on tests of phonemic awareness and
involved teaching children to perform different types of pseudoword reading tests. Also, they read more words
PA, including segmentation, blending, deletion, when there were no time constraints. However,
substitution, and transposition of phonemes. Children controls displayed superior time-limited word reading.
also practiced linking letters to phonemes in various Both groups made similar gains in spelling and reading
spelling and writing tasks. Effect sizes, though small, comprehension. Interestingly, when the analysis of word
favored the PA-trained group (d = 0.24 for PA, d = 0.31 reading took account of grade level, 2nd graders gained
for reading and spelling). more than older children and they showed a much
greater advantage for PA training over the control
Castle et al. (1994) examined the contribution of PA
training than did older children. These findings suggest
training to reading acquisition in a whole language
that PA training may be more beneficial to younger than
program. Kindergartners with low PA were assigned to
to older disabled readers.
treatment and control groups. PA training included
segmentation, blending, substitution, and deletion. In sum, these illustrative studies enrich the
Letters were incorporated into the PA activities later in understanding of the data contributing to the NRP
the program. Two control groups were included, one meta-analysis. They show that various types of
receiving an alternative, unrelated treatment and instruction were utilized to teach PA at various grade
another receiving no treatment other than the whole levels. They show how different studies were designed
language instruction provided to all participants in their and the nature of their findings. Also, they draw
classrooms. Results showed that the PA-trained group attention to other potentially important features that
spelled more words and decoded many more were not addressed in the meta-analysis because of an
pseudowords than the two control groups. However, the inadequate number of cases.
groups did not differ in reading real words or in reading
connected text. These findings indicate that adding PA Implications for Reading Instruction
instruction to a whole language program enhances
1. Can phonemic awareness be taught,
students’ decoding and spelling skills but not their other
and does it help children learn to read
reading skills.
and spell?
Wise et al. (in press) evaluated the effects of PA
training against training that taught children reading Results of the meta-analysis showed that teaching
comprehension strategies and gave them extensive text phonemic awareness to children is clearly effective. It
reading practice on computers. The children were 200 improves their ability to manipulate phonemes in
disabled readers in grades 2 to 5. Both treatment and speech. This skill transfers and helps them learn to read
control groups spent time reading stories on the and spell. PA training benefits not only word reading but
computer. They could touch any unknown word with a also reading comprehension. PA training contributes to
cursor and have it identified. Comprehension questions children’s ability to read and spell for months, if not
were answered periodically. Controls spent extra time years, after the training has ended. Effects of PA
reading on the computer while the PA-trained group training are enhanced when children are taught how to
completed various types of PA activities administered apply PA skills to reading and writing tasks.
by the computer. For example, the computer asked the
child to show feef. The child selected and ordered
letter-sound symbols with a mouse. Synthetic speech
2. Which students benefit in their reading? 5. Which methods of teaching PA have the
greatest impact on learning to read?
Teaching phonemic awareness helps many different
students learn to read, including preschoolers, Although all of the approaches exert a significant effect
kindergartners, and 1st graders who are just starting to on reading, instruction that focuses on one or two skills
learn to read. This includes beginners who are low in produces greater transfer than a multiskilled approach.
PA and are thus at risk for developing reading problems Teaching students to segment and blend benefits
in the future. This includes older disabled readers who reading more than a multiskilled approach. Teaching
have already developed reading problems. This includes students to manipulate phonemes with letters yields
children from various SES levels. This includes students larger effects than teaching students without letters, not
who are taught to read in English, as well as students surprisingly because letters help children make the
taught to read in other alphabetic languages. connection between PA and its application to reading.
Teaching children to blend the phonemes represented
3. Which students benefit in their spelling?
by letters is the equivalent of decoding instruction.
Teaching phonemic awareness helps preschoolers, Being explicit about the connection between PA skills
kindergartners, and 1st graders learn to spell. It helps and reading also strengthens training effects.
children at risk for future reading problems also. It helps
6. Which methods of teaching PA have the
low as well as middle-to-high SES children. It helps
students learning to spell in English as well as students greatest impact on learning to spell?
learning in other languages. However, PA training is Teaching PA helps nondisabled readers below 2nd
ineffective for improving spelling in reading-disabled grade learn to spell. Methods that teach children to
students. This is consistent with other research manipulate phonemes with letters are more effective
indicating that disabled readers have a hard time than methods limiting manipulation to spoken units.
learning to spell. Teaching children to segment phonemes in words and
represent them with letters is the equivalent of invented
4. Which methods of teaching PA work
spelling instruction.
best in helping children acquire
phonemic awareness? 7. How important is it to teach letters as
well as phonemic awareness?
Various forms of phoneme manipulation might be
taught, including identifying or categorizing the It is essential to teach letters as well as phonemic
phonemes in words, segmenting words into phonemes, awareness to beginners. PA training is more effective
blending phonemes to form words, deleting phonemes when children are taught to use letters to manipulate
from words, or manipulating onsets and rimes in words. phonemes. This is because knowledge of letters is
In some programs, only one PA skill is taught, while in essential for transfer to reading and spelling. Learning
other programs, two or more skills are combined. Some all the letters of the alphabet is not easy, particularly for
programs teach children to use letters to manipulate children who come to school knowing few of them.
phonemes and others limit training to speech. All of Shapes, names, and sounds need to be overlearned so
these approaches appear to be effective for helping that children can work with them automatically to read
children learn to manipulate phonemes. Focusing on one and spell words. Thus, if children do not know letters,
or two skills produces larger effects than a multiskilled this needs to be taught along with PA.
approach. Teaching PA with letters helps students
acquire PA more effectively than teaching PA without 8. How much time is required for PA
letters. instruction to be effective?
In the NRP analysis, studies that spent between 5 and
18 hours teaching PA yielded very large effects on the
acquisition of phonemic awareness. Studies that spent
longer or less time than this also yielded significant
effect sizes, but effects were moderate and only half as
large. Transfer to reading was greatest for studies was not possible to specify the amount of training
lasting less than 20 hours. In fact, effect sizes were required to enable trainers to be effective. This
more than twice as large for shorter programs than for relationship was not examined in the studies. Only 15
the longest-lasting programs. studies reported the length of training provided to
trainers. It ranged from 2 to 90 hours, with a mean of
Caution is needed in drawing conclusions from this
21 hours. This suggests that the amount of training
finding. Although it suggests that less instructional time
required may be quite modest and reasonable for
is better, it ignores reasons why training that lasted
inservice instruction.
longer might have been less effective. Perhaps the PA
skills being taught were more complex, or perhaps the 10. Is instruction most effectively delivered
learners were harder to teach, or perhaps, as a result of to individual students, to small groups, or
time spent in training, PA-trained students received less to full classrooms of students?
instruction in reading than students in the control
groups. Although individual tutoring is commonly regarded as
the most effective unit of instruction, NRP findings
The Panel concludes that it is wrong to make any indicate that small groups are the best way to teach
declarations about how long effective instruction in PA phonemic awareness to children. Also, small groups
needs to last based on the NRP findings. Rather, facilitate greater transfer to reading than the other two
decisions should be influenced by reason, moderation, teaching units. This may hold true for several reasons.
and situational factors. The answer depends on the Children may benefit from observing their peers
goals of instruction, how many different PA skills are to respond and receive feedback or from listening to their
be taught, whether letters are included, how much or peers’ comments and explanations. Or children may be
how little the learners already know about PA when more attentive and motivated to learn so that they do
they begin, whether they are disabled readers, whether well in the eyes of their peers.
provision is made for facilitating transfer to reading and
spelling, and so forth. Individual children will differ in 11. Is evidence for the effectiveness of PA
the amount of training time they need to acquire PA. training on reading outcomes derived
What is probably most important is to tailor training time from strongly designed or weakly
to student learning by assessing who has and who has designed studies?
not acquired the skills being taught as training proceeds.
Children who are still having trouble should continue PA The NRP analyses show that the evidence rests solidly
training while those who have learned the skills should on well-designed studies. Significant effect sizes were
move on to other reading and writing instruction. apparent on standardized tests as well as experimenter-
designed tests. Random assignment of children to
Not only the total training time but also the length of groups yielded significant effects. In fact, this effect
single training sessions must be considered. In the NRP size was larger than that for the nonequivalent group
database, the average length of sessions was 25 design. Studies in which treatment fidelity was checked
minutes. Few sessions lasted more than 30 minutes, and yielded a moderate effect size. Significant effects
these tended to occur with older disabled readers, not occurred not only when PA-trained groups were
with younger children. From this, the Panel concludes compared to untreated control groups but also when
that sessions should probably not exceed 30 minutes in they were compared to treated controls. Significant
length. effects were detected with larger as well as smaller
samples of children. When Troia’s (1999) criteria for
9. Can classroom teachers teach PA
methodological rigor were applied to studies, the most
effectively to their students?
rigorous studies yielded the largest effect sizes. The
Classroom teachers are definitely able to teach PA Panel concludes that evidence for the effectiveness of
effectively. In the NRP analysis, their effect size on the PA training on reading outcomes comes from well-
acquisition of PA was large. The training they provided
transferred and improved students’ reading and spelling,
and the effect on reading continued beyond training. It
designed experiments. In fact, researchers are advised the research examined which techniques are most
that they have the best chance of observing strong engaging for teachers. It seems self-evident that
effects if they apply the most rigor in designing their PA teachers are most effective when they are
studies. enthusiastic and enjoy what they are teaching. In
selecting ways to teach PA, teachers need to take
12. Are the results ready for account of motivational aspects of programs for
implementation in the classroom? themselves as well as their students.
This section of the NRP report includes many ideas that • Teachers should recognize that acquiring phonemic
provide guidance to teachers in designing PA instruction awareness is a means rather than an end. PA is not
and in evaluating and selecting programs with the best acquired for its own sake but rather for its value in
chance for success. However, in implementing PA helping learners understand and use the alphabetic
instruction in the classroom, teachers should bear in system to read and write. This is why it is important
mind several serious cautions: to include letters when teaching children to
manipulate phonemes and why it is important to be
• PA training does not constitute a complete reading
explicit about how children are to use the PA skills
program. Although the present meta-analysis
in reading and writing tasks.
confirms that PA is a key component that
contributes significantly to the effectiveness of • It is important to recognize that children will acquire
beginning reading and spelling instruction, there is some phonemic awareness in the course of learning
obviously much more that children need to be to read and spell even though they are not taught
taught to acquire reading and writing competence. PA explicitly. The process of learning letter-sound
PA instruction is intended only as a foundational relations and how to use them to read and spell
piece. It helps children grasp how the alphabetic enhances children’s ability to manipulate phonemes.
system works. It helps children read and spell However, incidental instruction that does not focus
words in various ways. However, literacy on teaching PA falls short in its contribution to
acquisition is a complex process for which there is children’s reading and spelling development.
no single key to success. Teaching phonemic • It is important to recognize that children will differ
awareness does not ensure that children will learn in their phonemic awareness and that some will
to read and write. Many competencies must be need more instruction than others. In kindergarten,
acquired for this to happen. most children will be nonreaders and will have little
• Exactly how PA instruction should be taught by phonemic awareness; so, PA instruction should
teachers in their classrooms is not clearly specified benefit everyone. In 1st grade, some children will
by the research. A variety of programs was found be reading and spelling already while others may
to be effective. The studies are useful in identifying know only a few letters and have no reading skill.
features that are important and should be The nonreaders will need much more PA and letter
considered in selecting programs and planning instruction than those already reading. Among
classroom instruction. Ultimately, though, teachers readers in 1st and 2nd grades, there may be
need to evaluate the methods they use against variation in how well children can perform more
measured success in their own students. advanced forms of PA, that is, manipulations
involving segmenting and blending with letters. The
• One factor that is very important to effective
best approach is for teachers to assess students’
classroom instruction but has not been addressed in
PA prior to beginning PA instruction. This will
the PA training research is the extent to which
indicate which children need the instruction and
these programs motivate both students and
which do not; which children need to be taught
teachers. It seems self-evident that instructional
rudimentary levels of PA, for example, segmenting
techniques for developing PA need to be relevant,
initial sounds in words; and which need more
engaging, interesting, and motivating in order to
advanced levels involving segmenting or blending
promote optimal learning in children. However, the
with letters.
research has not focused on this factor. Neither has
5. Programs to Help Parents Teach PA were largest for studies that were methodologically
rigorous. It is important for future researchers to
Many parents of preschoolers are anxious to help their maintain the quality of the designs adopted. This is not
children acquire the knowledge and skills they need to to say that all studies must use random assignment
become successful when they enter school and begin rather than nonequivalent groups. Sometimes
reading instruction. However, none of the studies experimenters have no choice if they want to conduct
reviewed utilized parents as trainers. Research is studies in school classrooms. However, researchers
needed to address this gap in our knowledge. In addition must take steps to maximize the rigor of their studies by
to informal activities that parents might use to draw addressing as many threats to internal and external
children’s attention to sounds in words, the validity as possible. Not only does this enhance
effectiveness of activities that help parents teach letters confidence in the findings but also, as the NRP meta-
to preschoolers might be explored and assessed. analysis shows, it gives researchers a better chance of
6. High-Quality Research detecting treatment effects when they exist.
experimental training studies. Journal of Educational * Davidson, M., & Jenkins, J. (1994). Effects of
Psychology, 91, 403-414. phonemic processes on word reading and spelling.
Journal of Educational Research, 87, 148-157.
* Byrne, B., & Fielding-Barnsley, R. (1991).
Evaluation of a program to teach phonemic awareness * Defior, S., & Tudela, P. (1994). Effect of
to young children. Journal of Educational Psychology, phonological training on reading and writing acquisition.
83, 451-455. Reading and Writing: An Interdisciplinary Journal, 6,
299-320.
* Byrne, B., & Fielding-Barnsley, R. (1993).
Evaluation of a program to teach phonemic awareness Dressman, M. (1999). On the use and misuse of
to young children: A 1-year follow-up. Journal of research evidence: Decoding two states’ reading
Educational Psychology, 85, 104-111. (First followup to initiatives. Reading Research Quarterly, 34, 258-285.
Byrne & Fielding-Barnsley, 1991)
Ehri, L. (1979). Linguistic insight: Threshold of
* Byrne, B., & Fielding-Barnsley, R. (1995). reading acquisition. In T. G. Waller & G. E. MacKinnon
Evaluation of a program to teach phonemic awareness (Eds.), Reading research: Advances in theory and
to young children: A 2- and 3-year follow-up and a new practice (Vol. 1, pp. 63-114). New York: Academic
preschool trial. Journal of Educational Psychology, 87, Press.
488-503. (second followup to Byrne & Fielding-
Barnsley, 1991) Ehri, L. (1984). How orthography alters spoken
language competencies in children learning to read and
* Castle, J. M., Riach, J., & Nicholson, T. (1994). spell. In J. Downing & R. Valtin (Eds.), Language
Getting off to a better start in reading and spelling: The awareness and learning to read (pp. 119-147). New
effects of phonemic awareness instruction within a York: Springer Verlag.
whole language program. Journal of Educational
Psychology, 86, 350-359. Ehri, L. (1991). Development of the ability to read
words. In R. Barr, M. L. Kamil, P. Mosenthal, & P. D.
Chall, J. (1967). Learning to read: the great debate. Pearson (Eds.), Handbook of reading research, (Vol. 2,
New York: McGraw Hill. pp. 383-417). New York: Longman.
Clay, M. (1985). The early detection of reading Ehri, L. (1992). Reconceptualizing the development
difficulties (3rd ed.). Auckland, New Zealand: of sight word reading and its relationship to recoding. In
Heinemann. P. Gough, L. Ehri, & R. Treiman (Eds.), Reading
acquisition (pp. 107-143). Hillsdale, NJ: Erlbaum.
Cohen, J. (1988). Statistical power analysis for the
behavior sciences (2nd ed.). Hillsdale, NJ: Erlbaum. Ehri, L. (1994). Development of the ability to read
words: Update. In R. Ruddell, M. Ruddell, & H. Singer
Cohen, P., Kulik, J., & Kulik, C. (1982). Educational (Eds.), Theoretical models and processes of reading
outcomes of tutoring: A meta-analysis of findings. (4th ed., pp. 323-358). Newark, DE: International
American Educational Research Journal, 19, 237-248. Reading Association.
Cooper, H. (1998). Synthesizing research. Ehri, L., & Wilce, L. (1987a). Cipher versus cue
Thousand Oaks, CA: Sage. reading: An experiment in decoding acquisition. Journal
of Educational Psychology, 79, 3-13.
* Cunningham, A. (1990). Explicit versus implicit
instruction in phonemic awareness. Journal of * Ehri, L., & Wilce, L. (1987b). Does learning to
Experimental Child Psychology, 50, 429-444. spell help beginners learn to read words? Reading
Research Quarterly, 22, 48-65.
* Farmer, A., Nixon, M., & White, R. (1976). * Hatcher, P., Hulme, C., & Ellis, A. (1994).
Sound blending and learning to read: An experimental Ameliorating early reading failure by integrating the
investigation. British Journal of Educational Psychology, teaching of reading and phonological skills: The
46, 155-163. phonological linkage hypothesis. Child Development, 65,
41-57.
Fawcett, A., & Nicolson, R. (1995). Persistence of
phonological awareness deficits in older children with * Hohn, W., & Ehri, L. (1983). Do alphabet
dyslexia. Reading and Writing: An Interdisciplinary letters help prereaders acquire phonemic segmentation
Journal, 7, 361-376. skill? Journal of Educational Psychology, 75, 752-762.
* Fox, B., & Routh, D. (1976). Phonemic * Hurford, D., Johnston, M., Nepote, P.,
analysis and synthesis as word-attack skills. Journal of Hampton, S., Moore, S., Neal, J., Mueller, A.,
Educational Psychology, 68, 70-74. McGeorge, K., Huff, L., Awad, A., Tatro, C., Juliano,
C., & Huffman, D. (1994). Early identification and
* Fox, B., & Routh, D. (1984). Phonemic remediation of phonological-processing deficits in
analysis and synthesis as word-attack skills: Revisited. first-grade children at risk for reading disabilities.
Journal of Educational Psychology, 76, 1059-1064. Journal of Learning Disabilities, 27, 647-659.
Gaskins, I., Downer, M., Anderson, R., * Iversen, S., & Tunmer, W. (1993). Phonological
Cunningham, P., Gaskins, R., Schommer, M., & The processing skills and the Reading Recovery Program.
Teachers of Benchmark School. (1988). A Journal of Educational Psychology, 85, 112-126.
metacognitive approach to phonics: Using what you
know to decode what you don’t know. Remedial and Johnson, B. (1989). DSTAT: Software for the meta-
Special Education, 9, 36-41. analytic review of research literatures. Hillsdale, NJ:
Erlbaum.
Glass, G., Cahen, L., Smith, M., & Filby, N. (1982).
School class size. Beverly Hills, CA: Sage. Johnson, B., & Eagly, A. (in press). Quantitative
synthesis of social psychological research. In H. Reis &
Glushko, R. J. (1979). The organization and C. Judd (Eds.), Handbook of research methods in social
activation of orthographic knowledge in reading aloud. psychology (pp. 496-528). London: Cambridge
Journal of Experimental Psychology: Human Perception University Press.
and Performance, 5, 674-691.
Juel, C. (1988). Learning to read and write: A
Goswami, U. (1986). Children’s use of analogy in longitudinal study of fifty-four children from first
learning to read: A developmental study. Journal of through fourth grade. Journal of Educational
Experimental Child Psychology, 42, 73-83. Psychology, 80, 437-447.
Griffith, P. (1991). Phonemic awareness helps first Juel, C., Griffith, P., & Gough, P. (1986).
graders invent spellings and third graders remember Acquisition of literacy: A longitudinal study of children
correct spellings. Journal of Reading Behavior, 23, 215- in first and second grade. Journal of Educational
233. Psychology, 78, 243-255.
* Gross, J., & Garnett, J. (1994). Preventing * Kennedy, K., & Backman, J. (1993).
reading difficulties: Rhyme and alliteration in the real Effectiveness of the Lindamood Auditory
world. Educational Psychology in Practice, 9, 235-240. Discrimination in depth program with students with
learning disabilities. Learning Disabilities Research and
* Haddock, M. (1976). Effects of an auditory and Practice, 8, 253-259.
an auditory-visual method of blending instruction on the
ability of prereaders to decode synthetic words. Journal
of Educational Psychology, 68, 825-831.
* Korkman, M., & Peltomaa, A. (1993). Moats, L. (1994). Knowledge of language: The
Preventive treatment of dyslexia by a preschool training missing foundation for teacher education. Annals of
program for children with language impairments. Dyslexia, 44, 81-102.
Journal of Clinical Child Psychology, 22, 277-287.
Morais, J., Bertelson, P., Cary, L., & Alegria, J.
* Kozminsky, L., & Kozminsky, E. (1995). The (1987). Literacy training and speech segmentation. In P.
effects of early phonological awareness training on Bertelson (Ed.), The onset of literacy: cognitive
reading success. Learning and Instruction, 5, 187-201. processes in reading acquisition (pp. 45-64). Cambridge,
MA: The MIT Press.
Liberman, I., Shankweiler, D., Fischer, F., & Carter,
B. (1974). Explicit syllable and phoneme segmentation * Murray, B. (1998). Gaining alphabetic insight: Is
in the young child. Journal of Experimental Child phoneme manipulation skill or identity knowledge
Psychology, 18, 201-212. causal? Journal of Educational Psychology, 90, 461-475.
* Lie, A. (1991). Effects of a training program * O’Connor, R., & Jenkins, J. (1995). Improving
for stimulating skills in word analysis in first-grade the generalization of sound/symbol knowledge: Teaching
children. Reading Research Quarterly, 26, 234-250. spelling to kindergarten children with disabilities. The
Journal of Special Education, 29, 255-275.
Lindamood, C. & Lindamood, P. (1975). The ADD
program: Auditory discrimination in depth. Austin, TX: * O’Connor, R., Jenkins, J., & Slocum, T. (1995).
DLM. Transfer among phonological tasks in kindergarten:
Essential instructional content. Journal of Educational
* Lovett, M., Barron, R., Forbes, J., Cuksts, B., Psychology, 87, 202-217.
& Steinbach, K. (1994). Computer speech-based
training of literacy skills in neurologically impaired * O’Connor, R., Notari-Syverson, A., & Vadasy,
children: A controlled evaluation. Brain and Language, P. (1996). Ladders to literacy: The effects of
47, 117-154. teacher-led phonological activities for kindergarten
children with and without disabilities. Exceptional
* Lundberg, I., Frost, J., & Petersen, O. (1988). Children, 63, 117-130.
Effects of an extensive program for stimulating
phonological awareness in preschool children. Reading * O’Connor, R., Notari-Syverson, A., & Vadasy,
Research Quarterly, 23, 263-284. P. (1998). First-grade effects of teacher-led
phonological activities in kindergarten for children with
Mann, V. (1987). Phonological awareness: The role mild disabilities: A follow-up study. Learning Disabilities
of reading experience. In P. Bertelson (Ed.), The onset Research and Practice, 13, 43-52. (followup to
of literacy: Cognitive processes in reading acquisition O’Connor et al., 1996)
(pp. 65-92). Cambridge, MA: The MIT Press.
* Olofsson, A., & Lundberg, I. (1983). Can
Marsh, G., Freidman, M., Welch, V., & Desberg, P. phonemic awareness be trained in kindergarten?
(1981). A cognitive-developmental theory of reading Scandinavian Journal of Psychology, 24, 35-44.
acquisition. In G. Mackinnon & T. G. Waller (Eds.),
Reading research: Advances in theory and practice * Olofsson, A., & Lundberg, I. (1985). Evaluation
(Vol. 3, pp. 199-221). New York: Academic Press. of long-term effects of phonemic awareness training in
kindergarten: Illustrations of some methodological
* McGuiness, D., McGuiness, C., & Donohue, J. problems in evaluation research. Scandinavian Journal
(1995). Phonological training and the alphabet principle: of Psychology, 26, 21-34. (followup to Olofsson &
Evidence for reciprocal causality. Reading Research Lundberg, 1983)
Quarterly, 30, 830-852.
Pinnel, G., Lyons, C., DeFord, D., Bryk, A., & Share, D., Jorm, A., Maclean, R., & Matthews, R.
Seltzer M. (1994). Comparing instructional models for (1984). Sources of individual differences in reading
the literacy education of high-risk 1st graders. Reading acquisition. Journal of Educational Psychology, 76,
Research Quarterly, 29, 9-39. 1309-1324.
Rack, J., Hulme, C., Snowling, M., & Wightman, J. Snow, C., Burns, M., & Griffin, P. (Eds.), (1998).
(1994). The role of phonology in young children learning Preventing reading difficulties in young children.
to read words: The direct-mapping hypothesis. Journal Washington DC: National Academy Press.
of Experimental Child Psychology, 57, 42-71.
* Solity, J. (1996). Phonological awareness:
Read, C., Zhang, Y., Nie, H., & Ding, B. (1987). Learning disabilities revisited? Educational and Child
The ability to manipulate speech sounds depends on Psychology, 13, 103-113.
knowing alphabetic writing. In P. Bertelson (Ed.), The
Onset of Literacy: Cognitive Processes in Reading Stahl, S., & Murray, B. (1994). Defining
Acquisition (pp. 31-44). Cambridge, MA: The MIT phonological awareness and its relationship to early
Press. reading. Journal of Educational Psychology, 86, 221-
234.
Reitsma, P. (1983). Printed word learning in
beginning readers. Journal of Experimental Child * Tangel, D., & Blachman, B. (1992). Effect of
Psychology, 75, 321-339. phoneme awareness instruction on kindergarten
children’s invented spelling. Journal of Reading
* Reitsma, P., & Wesseling, R. (1998). Effects of Behavior, 24, 233-261.
computer-assisted training of blending skills in
kindergartners. Scientific Studies of Reading, 2, * Torgesen, J., Morgan, S., & Davis, C. (1992).
301-320. Effects of two types of phonological awareness training
on word learning in kindergarten children. Journal of
* Sanchez, E., & Rueda, M. (1991). Segmental Educational Psychology, 84, 364-370.
awareness and dyslexia: Is it possible to learn to
segment well and yet continue to read and write poorly? Treiman, R. (1985). Onsets and rimes as units of
Reading and Writing: An Interdisciplinary Journal, 3, spoken syllables: Evidence from children. Journal of
11-18. Experimental Child Psychology, 39, 161-181.
Scarborough, H., Ehri, L., Olson, R., & Fowler, A. * Treiman, R., & Baron, J. (1983).
(1998). The fate of phonemic awareness beyond the Phonemic-analysis training helps children benefit from
elementary school years. Scientific Studies of Reading, spelling sound rules. Memory and Cognition, 11,
2, 115-142. 382-389.
Schatschneider, C., Francis, D., Foorman, B., Troia, G. (1999). Phonological awareness
Fletcher, J., & Mehta, P. (1999). The dimensionality of intervention research: A critical review of the
phonological awareness: An application of item experimental methodology. Reading Research
response theory. Journal of Educational Psychology, 91, Quarterly, 34, 28-52.
439-449.
Tunmer, W., & Chapman, J. (1998). Language
* Schneider, W., Kuspert, P., Roth, E., Vise, M., prediction skill, phonological recoding ability, and
& Marx, H. (1997). Short- and long-term effects of beginning reading. In C. Hulme and M. Joshi (Eds.),
training phonological awareness in kindergarten: Reading and spelling: Development and disorders (pp.
Evidence from two German studies. Journal of 33-68). Mahwah, NJ: Erlbaum.
Experimental Child Psychology, 66, 311-340.
* Uhry, J., & Shepherd, M. (1993).
Segmentation/spelling instruction as part of a first-grade
reading program: Effects on several measures of * Warrick, N., Rubin, H., & Rowe-Walsh, S.
reading. Reading Research Quarterly, 28, 218-233. (1993). Phoneme awareness in language-delayed
children: Comparative studies and intervention. Annals
* Vadasy, P., Jenkins, J., Antil, L., Wayne, S., & of Dyslexia, 43, 153-173.
O’Connor, R. (1997). Community-based early reading
intervention for at-risk first graders. Learning Wasik, B., & Slavin, R. (1993). Preventing early
Disabilities Research and Practice, 12, 29-39. reading failure with one-to-one tutoring: A review of
five programs. Reading Research Quarterly, 28, 179-
* Vadasy, P., Jenkins, J., Antil, L., Wayne, S., & 200.
O’Connor, R. (1997). The effectiveness of one-to-one
tutoring by community tutors for at-risk beginning * Weiner, S. (1994). Effects of phonemic
readers. Learning Disability Quarterly, 20, 126-139. awareness training on low- and middle-achieving first
graders’ phonemic awareness and reading ability.
* Vellutino, F., & Scanlon, D. (1987). Journal of Reading Behavior, 26, 277-300.
Phonological coding, phonological awareness, and
reading ability: Evidence from a longitudinal and * Williams, J. (1980). Teaching decoding with an
experimental study. Merrill-Palmer Quarterly, 33, emphasis on phoneme analysis and phoneme blending.
321-363. Journal of Educational Psychology, 72, 1-15.
Venezky, R. (1970). The structure of English * Wilson, J., & Frederickson, N. (1995).
orthography. The Hague: Mouton. Phonological awareness training: An evaluation.
Educational & Child Psychology, 12, 68-79.
Venezky, R. (1999). The American way of spelling.
New York: Guilford Press. * Wise, B., King, J., & Olson, R. (1999). Training
phonological awareness with and without explicit
Wagner, R. (1988). Causal relations between the attention to articulation. Journal of Experimental Child
development of phonological processing abilities and the Psychology, 72, 271-304.
acquisition of reading skills: A meta-analysis. Merrill-
Palmer Quarterly, 32, 261-279. * Wise, B., King, J., & Olson, R. (in press).
Individual differences in benefits from computer-
Wagner, R., & Torgesen, J. (1987). The nature of assisted remedial reading. Journal of Experimental Child
phonological processing and its causal role in the Psychology.
acquisition of reading skills. Psychological Bulletin, 101,
192-212.
Appendix A
Studies Included in the Meta-Analysis
* Ball, E., & Blachman, B. (1991). Does * Brennan, F., & Ireson, J. (1997). Training
phoneme awareness training in kindergarten make a phonological awareness: A study to evaluate the effects
difference in early word recognition and developmental of a program of metalinguistic games in kindergarten.
spelling? Reading Research Quarterly, 26, 49-66. Reading and Writing: An Interdisciplinary Journal, 9,
241-263.
* Barker, T., & Torgesen, J. (1995). An
evaluation of computer-assisted instruction in * Bus, A. (1986). Preparatory reading instruction
phonological awareness with below average readers. in kindergarten: Some comparative research into
Journal of Educational Computing Research, 13, methods of auditory and auditory-visual training of
89-103. phonemic analysis and blending. Perceptual and Motor
Skills, 62, 11-24.
* Bentin, S., & Leshem, H. (1993). On the
interaction between phonological awareness and * Byrne, B., & Fielding-Barnsley, R. (1991).
reading acquisition: It’s a two-way street. Annals of Evaluation of a program to teach phonemic awareness
Dyslexia, 43, 125-148. to young children. Journal of Educational Psychology,
83, 451-455.
* Blachman, B., Ball, E., Black, R., & Tangel, D.
(1994). Kindergarten teachers develop phoneme * Byrne, B., & Fielding-Barnsley, R. (1993).
awareness in low-income, inner-city classrooms: Does Evaluation of a program to teach phonemic awareness
it make a difference? Reading and Writing: An to young children: A 1-year follow-up. Journal of
Interdisciplinary Journal, 6, 1-18. Educational Psychology, 85, 104-111. (First followup to
Byrne & Fielding-Barnsley, 1991)
* Bradley, L., & Bryant, P. (1983). Categorizing
sounds and learning to read: A causal connection. * Byrne, B., & Fielding-Barnsley, R. (1995).
Nature, 301, 419-421. Evaluation of a program to teach phonemic awareness
to young children: A 2- and 3-year follow-up and a new
* Bradley, L., & Bryant, P. (1985). Rhyme and preschool trial. Journal of Educational Psychology, 87,
reason in reading and spelling. International Academy 488-503. (Second followup to Byrne & Fielding-
for Research in Learning Disabilities, Monograph Barnsley, 1991)
Series, 1, 75-95. Ann Arbor, MI: The University of
Michigan Press. (a more complete report of Bradley & * Castle, J. M., Riach, J., & Nicholson, T. (1994).
Bryant, 1983) Getting off to a better start in reading and spelling: The
effects of phonemic awareness instruction within a
* Brady, S., Fowler, A., Stone, B., & Winbury, N. whole language program. Journal of Educational
(1994). Training phonological awareness: A study with Psychology, 86, 350-359.
inner-city kindergarten children. Annals of Dyslexia, 44,
26-59. * Cunningham, A. (1990). Explicit versus implicit
instruction in phonemic awareness. Journal of
Experimental Child Psychology, 50, 429-444.
* Davidson, M., & Jenkins, J. (1994). Effects of first-grade children at risk for reading disabilities.
phonemic processes on word reading and spelling. Journal of Learning Disabilities, 27, 647-659.
Journal of Educational Research, 87, 148-157.
* Iversen, S., & Tunmer, W. (1993). Phonological
* Defior, S., & Tudela, P. (1994). Effect of processing skills and the Reading Recovery Program.
phonological training on reading and writing acquisition. Journal of Educational Psychology, 85, 112-126.
Reading and Writing: An Interdisciplinary Journal, 6,
299-320. * Kennedy, K., & Backman, J. (1993).
Effectiveness of the Lindamood Auditory
* Ehri, L., & Wilce, L. (1987b). Does learning to Discrimination in depth program with students with
spell help beginners learn to read words? Reading learning disabilities. Learning Disabilities Research and
Research Quarterly, 22, 48-65. Practice, 8, 253-259.
* Farmer, A., Nixon, M., & White, R. (1976). * Korkman, M., & Peltomaa, A. (1993).
Sound blending and learning to read: An experimental Preventive treatment of dyslexia by a preschool training
investigation. British Journal of Educational Psychology, program for children with language impairments.
46, 155-163. Journal of Clinical Child Psychology, 22, 277-287.
* Fox, B., & Routh, D. (1976). Phonemic * Kozminsky, L., & Kozminsky, E. (1995). The
analysis and synthesis as word-attack skills. Journal of effects of early phonological awareness training on
Educational Psychology, 68, 70-74. reading success. Learning and Instruction, 5, 187-201.
* Fox, B., & Routh, D. (1984). Phonemic * Lie, A. (1991). Effects of a training program
analysis and synthesis as word-attack skills: Revisited. for stimulating skills in word analysis in first-grade
Journal of Educational Psychology, 76, 1059-1064. children. Reading Research Quarterly, 26, 234-250.
* Gross, J., & Garnett, J. (1994). Preventing * Lovett, M., Barron, R., Forbes, J., Cuksts, B.,
reading difficulties: Rhyme and alliteration in the real & Steinbach, K. (1994). Computer speech-based
world. Educational Psychology in Practice, 9, 235-240. training of literacy skills in neurologically impaired
children: A controlled evaluation. Brain and Language,
* Haddock, M. (1976). Effects of an auditory and 47, 117-154.
an auditory-visual method of blending instruction on the
ability of prereaders to decode synthetic words. Journal * Lundberg, I., Frost, J., & Petersen, O. (1988).
of Educational Psychology, 68, 825-831. Effects of an extensive program for stimulating
phonological awareness in preschool children. Reading
* Hatcher, P., Hulme, C., & Ellis, A. (1994). Research Quarterly, 23, 263-284.
Ameliorating early reading failure by integrating the
teaching of reading and phonological skills: The * McGuiness, D., McGuiness, C., & Donohue, J.
phonological linkage hypothesis. Child Development, 65, (1995). Phonological training and the alphabet principle:
41-57. Evidence for reciprocal causality. Reading Research
Quarterly, 30, 830-852.
* Hohn, W., & Ehri, L. (1983). Do alphabet
letters help prereaders acquire phonemic segmentation * Murray, B. (1998). Gaining alphabetic insight: Is
skill? Journal of Educational Psychology, 75, 752-762. phoneme manipulation skill or identity knowledge
causal? Journal of Educational Psychology, 90, 461-475.
* Hurford, D., Johnston, M., Nepote, P.,
Hampton, S., Moore, S., Neal, J., Mueller, A., * O’Connor, R., & Jenkins, J. (1995). Improving
McGeorge, K., Huff, L., Awad, A., Tatro, C., Juliano, the generalization of sound/symbol knowledge: Teaching
C., & Huffman, D. (1994). Early identification and spelling to kindergarten children with disabilities. The
remediation of phonological-processing deficits in Journal of Special Education, 29, 255-275.
* O’Connor, R., Jenkins, J., & Slocum, T. (1995). * Tangel, D., & Blachman, B. (1992). Effect of
Transfer among phonological tasks in kindergarten: phoneme awareness instruction on kindergarten
Essential instructional content. Journal of Educational children’s invented spelling. Journal of Reading
Psychology, 87, 202-217. Behavior, 24, 233-261.
* O’Connor, R., Notari-Syverson, A., & Vadasy, * Torgesen, J., Morgan, S., & Davis, C. (1992).
P. (1996). Ladders to literacy: The effects of Effects of two types of phonological awareness training
teacher-led phonological activities for kindergarten on word learning in kindergarten children. Journal of
children with and without disabilities. Exceptional Educational Psychology, 84, 364-370.
Children, 63, 117-130.
* Treiman, R., & Baron, J. (1983).
* O’Connor, R., Notari-Syverson, A., & Vadasy, Phonemic-analysis training helps children benefit from
P. (1998). First-grade effects of teacher-led spelling sound rules. Memory and Cognition, 11,
phonological activities in kindergarten for children with 382-389.
mild disabilities: A follow-up study. Learning Disabilities
Research and Practice, 13, 43-52. (followup to * Uhry, J., & Shepherd, M. (1993).
O’Connor et al., 1996) Segmentation/spelling instruction as part of a first-grade
reading program: Effects on several measures of
* Olofsson, A., & Lundberg, I. (1983). Can reading. Reading Research Quarterly, 28, 218-233.
phonemic awareness be trained in kindergarten?
Scandinavian Journal of Psychology, 24, 35-44. * Vadasy, P., Jenkins, J., Antil, L., Wayne, S., &
O’Connor, R. (1997). Community-based early reading
* Olofsson, A., & Lundberg, I. (1985). Evaluation intervention for at-risk first graders. Learning
of long-term effects of phonemic awareness training in Disabilities Research and Practice, 12, 29-39.
kindergarten: Illustrations of some methodological
problems in evaluation research. Scandinavian Journal * Vadasy, P., Jenkins, J., Antil, L., et al. (1997).
of Psychology, 26, 21-34. (Followup to Olofsson & The effectiveness of one-to-one tutoring by community
Lundberg, 1983) tutors for at-risk beginning readers. Learning Disability
Quarterly, 20, 126-139.
* Reitsma, P., & Wesseling, R. (1998). Effects of
computer-assisted training of blending skills in * Vellutino, F., & Scanlon, D. (1987).
kindergartners. Scientific Studies of Reading, 2, Phonological coding, phonological awareness, and
301-320. reading ability: Evidence from a longitudinal and
experimental study. Merrill-Palmer Quarterly, 33,
* Sanchez, E., & Rueda, M. (1991). Segmental 321-363.
awareness and dyslexia: Is it possible to learn to
segment well and yet continue to read and write poorly? * Warrick, N., Rubin, H., & Rowe-Walsh, S.
Reading and Writing: An Interdisciplinary Journal, 3, (1993). Phoneme awareness in language-delayed
11-18. children: Comparative studies and intervention. Annals
of Dyslexia, 43, 153-173.
* Schneider, W., Kuspert, P., Roth, E., Vise, M.,
& Marx, H. (1997). Short- and long-term effects of * Weiner, S. (1994). Effects of phonemic
training phonological awareness in kindergarten: awareness training on low- and middle-achieving first
Evidence from two German studies. Journal of graders’ phonemic awareness and reading ability.
Experimental Child Psychology, 66, 311-340. Journal of Reading Behavior, 26, 277-300.
* Wilson, J., & Frederickson, N. (1995). * Wise, B., King, J., & Olson, R. (in press).
Phonological awareness training: An evaluation. Individual differences in benefits from computer-
Educational & Child Psychology, 12, 68-79. assisted remedial reading. Journal of Experimental Child
Psychology.
* Wise, B., King, J., & Olson, R. (1999). Training
phonological awareness with and without explicit
attention to articulation. Journal of Experimental Child
Psychology, 72, 271-304.
Appendix B
Ta b l e 1 : D e p e n d e n t a n d M o d e r a t o r Va r i a b l e s
Included in the Meta-Analyses
OUTCOME MEASURES
1. Composite measures
Phonemic awareness
Reading
Spelling
2. Measures of phonemic awareness
Segmentation
Blending
Deletion
Other
3. Measures of reading
Standardized vs. experimenter-devised tests of word reading
Standardized vs. experimenter-devised tests of nonword reading
Reading comprehension
4. Measures of spelling
Standardized vs. experimenter-devised tests of spelling
5. Measure of math achievement
6. Test points
Immediately after training
First followup test (delay of 2 to 15 months)
Second followup test (delay of 7 to 36 months)
CHARACTERISTICS OF PARTICIPANTS
1. Reader level: at-risk readers; disabled readers; normally progressing readers
2. Grade level: preschool; kindergarten; 1st grade; 2nd through 6th grades
3. Language: English; other (Dutch, Finnish, German, Hebrew, Norwegian, Spanish, Swedish)
4. Socioeconomic status: low SES; middle-to-high SES
Table 1 (continued)
Appendix C
Ta b l e 2 : P h o n e m i c A w a r e n e s s O u t c o m e s
Phonemic Awareness Outcomes: Mean Effect Sizes (d) as a Function of Moderator Variables and
Tests to Determine Whether Effect Sizes Were Significantly Greater Than Zero at p < 0.05, Were
Homogeneous at p < 0.05, and Differed From Each Other at p < 0.05. Effect Sizes Are
Immediately After Training Unless Labeled as Followup.
Moderator Variables and Levels No. Cases Mean d Homogen. 95% C I Contrasts
Time of Posttest
Outcome Measures of PA
Characteristics of PA Training
Table 2 (continued)
Moderator Variables and Levels No. Cases Mean d Homogen. 95% C I Contrasts
Length of training
Characteristics of Trainers
Characteristics of Participants
Reading level
Table 2 (continued)
Moderator Variables and Levels No. Cases Mean d Homogen. 95% C I Contrasts
Grade
Socioeconomic status
Language
Characteristics of Design
Table 2 (continued)
Moderator Variables and Levels No. Cases Mean d Homogen. 95% C I Contrasts
Size of sample
Characteristics of Study
Year of publication
Reading Outcomes: Mean Effect Sizes (d) as a Function of Moderator Variables and Tests to Determine Whether
Effect Sizes Were Significantly Greater Than Zero at p < 0.05, Were Homogeneous at p < 0.05, and Differed
From Each Other at p < 0.05. Effect Sizes Are Immediately After Training Unless Labeled as Followup.
Moderator Variables and No. Cases Mean d Homogen. 95% C I Contrasts
Levels
Characteristics of Outcome
Measures
Time of posttest
Characteristics of PA Training
Immediate posttest
Table 3 (continue d)
Followup posttest
Immediate posttest
Followup posttest
Immediate posttest
Followup posttest
Table 3 (continue d)
Length of training
Characteristics of Trainers
Immediate posttest
Classroom teachers
22 0.41* No 0.33 to 0.49 RO > CT
(CT)
Followup posttest
Classroom teachers
12 0.32* Yes 0.20 to 0.43 RO > CT
(CT)
Table 3 (continue d)
Characteristics of Participants
Grade
Socioeconomic status
Language
Immediate posttest
Table 3 (continue d)
Followup posttest
Characteristics of Design
Immediate posttest
Followup posttest
Size of sample
Table 3 (continue d)
Characteristics of Study
Year of publication
* indicates that effect size was significantly greater than zero at p < 0.05.
Characteristics of PA Training
Blend & segment only (BS) 7 0.79* Yes 0.49 to 1.09 BS > 3 +
Moderator Variables and Levels No. Cases Mean d Homogen. 95% C I Contrasts
Length of training
Characteristics of Trainers
Yes
Computers (Com) 6 0.09ns - 0.10 to O > C om
0.28
Others (O) 33 0.74* No 0.63 to 0.85
Characteristics of Participants
Reading level
At risk (A) 13 0.76* No 0.54 to 0.98 A=N>D
- 0.00 to
Disabled (D) 11 0.15ns Yes
0.31
Normal progress (N) 15 0.88* No 0.74 to 1.02
Moderator Variables and Levels No. Cases Mean d Homogen. 95% C I Contrasts
Grade
Preschool (P) 0 Ñ Ñ Ñ
Kindergarten (K) 15 0.97* No 0.82 to 1.13 K>1>2
Socioeconomic status
Language
Characteristics of Design
Moderator Variables and Levels No. Cases Mean d Homogen. 95% C I Contrasts
Size of sample
* indicates that effect size was significantly greater than zero at p < 0.05.
ns indicates not significantly different from zero
Appendix D
Table 5: Re s ults
Mean Effect Sizes (d) With Reading Disabled Comparisons Removed from the Data Base and Tests to
Determine Whether Effect Sizes Were Significantly Greater Than Zero at p < 0.05, Were
Homogeneous at p < 0.05, and Differed From Each Other at p < 0.05.
Moderator Variables and Levels No. Cases Mean d Homogen. 95% C I Contrasts
SPELLING OUTCOMES
PA Skills Taught
Training Unit
Length of training
1 to 4.5 hrs 0 0 Ñ Ñ
Moderator Variables and Levels No. Cases Mean d Homogen. 95% C I Contrasts
Trainer
Classroom teachers 8 0.74* No 0.58 to 0.91 ns
Grade
Preschool (Pre) 0
Language
PHONEMIC AWARENESS
OUTCOMES
Appendix E
Table 6
Phonemic Awareness Outcomes: Mean Effect Sizes (d) Asssociated With Troia's
Indicators of Methodological Rigor and Tests to Determine Whether Effect Sizes
Were Significantly Greater than Zero at p < 0.05, Were Homogeneous at p < 0.05,
and Differed From Each Other at p < 0.05.
Internal Validity
% of criteria met
Critical Flaws
4- 5 (3) 11 0.97* No
External Validity
% of criteria met
Critical flaws
1 8 0.96* No 1=2=3
2 13 0 . 6 1* Yes
3 9 0.97* No
Table 6 (continue d)
Ranking
* indicates that effect size was significantly greater than zero at p < 0.05.
ns indicates not significantly different from zero.
Table 7
Reading Outcomes: Mean Effect Sizes (d) Asssociated With Troia's Indicators of
Methodological Rigor and Tests to Determine Whether Effect Sizes Were
Significantly Greater than Zero at p < 0.05, Were Homogeneous at p < 0.05, and
Differed From Each Other at p < 0.05.
Internal Validity
% of criteria met
Critical Flaws
4- 5 (3) 16 0.56* No
External Validity
% of criteria met
Table 7 (continue d)
Critical Flaws
1 11 0.51* No 2 = all
2 17 0.57* Yes
3 11 0.92* No
Ranking
* indicates that effect size was significantly greater than zero at p < 0.05.
ns indicates not significantly different from zero.
08 - Se g me nt & cate g . + le t vs. No tre at. 2 Ye s S mG Te ach 12.3 No r K Eng l Lo NE No 159 . 1.83 0.65 0.94
National Reading Panel
Appendices
09 - Pho n. cate g . vs. Se mantic cate g . 1 No Ind Othe r 11.67 AR 1st Eng l . M/ R No 39 . 0.5 0.39
10 - Pho n. cate g . vs. No tre atme nt 1 No Ind Othe r 11.67 AR 1st Eng l . M/ R No 26 . . 0.86 1
11 - Pho n. cate g . + le t vs. Se mantic cate g . 1 Ye s Ind Othe r 11.67 AR 1st Eng l . M/ R No 39 . . 1.17 1.59
12 - Pho n. cate g . + le t vs. No tre atme nt 1 Ye s Ind Othe r 11.67 AR 1st Eng l . M/ R No 26 . . 1.53 2.18
Appendix F (continued)
Reports of the Subgroups
13 - Mult. PA vs. No tre atme nt 3+ No Clas Te ach 18 AR K Eng l Lo NE Ye s 42 0.46 0.47 0.23
14 - Se g me nt & b le nd vs. No tre atme nt 2 No Clas Te ach 48 No r K Eng l M-H NE Ye s 24 . 3.92 1.17 2.17
17 - Pho n. cate g . + le t vs. Se mantic cate g . 1 Ye s S mG Othe r 6 No r Pre Eng l M-H R No 126 . 3.14 1.61 .34*
18 - Mult. PA + le t vs. Lang uag e 3+ Ye s S mG Othe r 5 No r K Eng l M-H M/ R No 34 . 3.81 1.06 1.27
19 - Mult. PA + le t vs. No tre atme nt 3+ Ye s S mG Othe r 5 No r K Eng l M-H M/ R No 34 . 2.62 1.09 1.73
Cunningham, 1990 . . . . . . . . . . . . 84 . . .
21 - Se g me nt, b le nd , me ta. vs. Sto rie s 2 No S mG Othe r 6 No r K Eng l M-H M/ R No 28 . 2.3 0.56 .
22 - Se g me nt & b le nd vs. Sto rie s 2 No S mG Othe r 6 No r 1st Eng l M-H M/ R No 28 . 0.99 0.08 .
23 - Se g me nt, b le nd , me ta. vs. Sto rie s 2 No S mG Othe r 6 No r 1st Eng l M-H M/ R No 28 . 1.27 0.51 .
25 - Ble nd , LS vs. Sto rie s, LS 1 No S mG Othe r 8.33 No r K Eng l . NE No 20 . 3.11 0.71 0.49
26 - Se g me nt & b le nd , LS vs. Sto rie s, LS 2 No S mG Othe r 8.33 No r K Eng l . NE No 20 . 3.93 1.56 1.13
28 - Cate g . + le t vs. Hand manip ulatio n 1 Ye s S mG Othe r 30 AR 1st Sp an M-H R No 22 . . 0.73 1.03
30 - Cate g . vs. Hand manip ulatio n 1 No S mG Othe r 30 AR 1st Sp an M-H R No 21 . . 0.14 0.02
32 - Ble nd + le t vs. Lab e l p icture s 1 Ye s Ind Othe r . No r 1st Eng l . R No 20 . 0.78 0.96 .
Appendices
36 - Onse t-rime , LS vs. No tre at., LS 1 No S mG Othe r 5 AR K Eng l . R No 21 . 0.75 -0.19 .
Haddock, 1976 . . . . . . . . . . . . 80 . . .
41 - Mult. PA vs. No tre atme nt 3+ No Ind Othe r 20 RD 1st Eng l . M/ R Ye s 61 . 0.64 0.13 0.25
42 - Mult. PA + le t in Re ad Re c. vs. Re ad
3+ Ye s Ind Othe r 20 RD 1st Eng l . M/ R Ye s 63 . 0.24 0.31 0.31
Re c.
2-82
45 - Ble nd & d e le tio n + le t vs. No tre at. 2 Ye s Ind Co mp 12 AR 1st Eng l M-H M/ R No 99 . 0.61 0.49 .
46 - Mult. PA + le t in Re ad Re c. vs. Re ad
3+ Ye s Ind Othe r 20.88 RD 1st Eng l . M/ R Ye s 64 -0.33 0.42 -0.02
Re c.
47 - Mult. PA + le t vs. No tre atme nt 3+ Ye s S mG Te ach 75 RD 2nd + Eng l . M/ R Ye s 20 . 1.43 0.39 0.53
49 - Mult. PA vs. Visual mo to r inte g ratio n 3+ No Clas Te ach 21.33 No r K He b r Lo NE No 61 . 0.24 .57* .
54 - Mult. PA vs. No tre atme nt 3+ No Clas Te ach 48 No r K Dan Lo NE No 383 . 0.74 0.19 .60*
Murray, 1998 . . . . . . . . . . . . 48 . . .
Appendices
LS
62 - Se g me nt & b le nd + le t vs. No tre at. 2 Ye s Clas Te ach 20 No r K Eng l . NE Ye s 66 . 0.62 0.11 0.73
65 - Mult. PA sche d ule d vs. No tre atme nt 3+ No Clas Te ach 12.25 No r K Swe d . NE Ye s 26 . 0.27 -0.37 0.16*
67 - Ble nd o n co mp ute rs vs. No tre atme nt 1 No Ind Co mp 4 No r K Dutch . R No 56 . 0.74 .27* .28*
69 - Mult. PA vs. No tre atme nt 3+ No Clas Te ach 43.75 No r K Ge rm . NE No 371 . 0.7 0.22 .27*
70 - Mult. PA vs. No tre atme nt 3+ No Clas Te ach 20 No r K Ge rm . NE Ye s 331 . 0.82 0.05 .38*
Solity, 1996 . . . . . . . . . . . . 24 . . .
72 - Se g me nt & cate g . + le t vs. No tre at. 2 Ye s S mG Te ach 13.2 No r K Eng l Lo NE No 149 . 1.81 0.67 0.94
Appendix F (continued)
75 - Onse t-rime vs. Re p e at syllab le s 1 No Ind Othe r . No r Pre Eng l M-H Ye s 8 . . 0.62 .
78 - Se g me nt & cate g . + le t vs. No tre at. 2 Ye s Ind Othe r 54 AR 1st Eng l Lo R Ye s 35 . 0.74 0.44 0.67
79 - Se g me nt & b le nd + le t vs. No tre at. 2 Ye s Ind Othe r 50 AR 1st Eng l Lo R Ye s 40 . 0.42 0.27 0.4
80 - Mult. PA + le t vs. No tre atme nt 3+ Ye s Ind Othe r 2.5 RD 2nd + Eng l . R No 30 . 1.15 0.72 .
81 - Mult. PA, le t, wo rd vs. Wo rd 3+ Ye s Ind Othe r 2.5 RD 2nd + Eng l . R No 30 . 0.74 0.3 .
National Reading Panel
82 - Mult. PA + le t vs. No tre atme nt 3+ Ye s Ind Othe r 2.5 No r 2nd + Eng l . R No 30 . 0.33 0.47 .
Appendices
83 - Mult. PA, le t, wo rd vs. Wo rd 3+ Ye s Ind Othe r 2.5 No r 2nd + Eng l . R No 30 . 1.1 0.71 .
84 - Mult. PA + le t vs. No tre atme nt 3+ Ye s Ind Othe r 2.5 RD 2nd + Eng l . R No 30 . 0.89 0.49 .
85 - Mult. PA, le t, wo rd vs. Wo rd 3+ Ye s Ind Othe r 2.5 RD 2nd + Eng l . R No 30 . 1.01 0.48 .
86 - Mult. PA + le t vs. No tre atme nt 3+ Ye s Ind Othe r 2.5 No r 2nd + Eng l . R No 30 . -0.07 0.33 .
87 - Mult. PA, le t, wo rd vs. Wo rd 3+ Ye s Ind Othe r 2.5 No r 2nd + Eng l . R No 30 . 0.66 0.52 .
Appendix F (continued)
Reports of the Subgroups
Weiner, 1994 . . . . . . . . . . . . 36 . . .
89 - Mult. PA vs. No tre atme nt 3+ No S mG Othe r 5 AR 1st Eng l M-H R No 10 . 0.81 0.17 .
90 - Mult. PA vs. No tre atme nt 3+ No S mG Othe r 5 No r 1st Eng l M-H R No 26 . 0.17 -0.06 .
91 - Se g me nt & b le nd + le t vs. No tre at. 2 Ye s Clas Te ach 62.83 RD 2nd + Eng l . NE Ye s 102 . 0.35 1.05 .
92 - Se g me nt & b le nd + le t vs. No tre at. 2 Ye s Clas Te ach 28.13 RD 2nd + Eng l . R Ye s 102 . 1.11 0.97 .
93 - Onse t-rime + le t vs. No tre atme nt 1 Ye s S mG Othe r 26.67 RD 2nd + Eng l Lo NE Ye s 48 . 0.12 0.47 0.49
96 - Mult. PA + le t vs. Re cip . Te ach 3+ Ye s Ind Co mp 24.98 RD 2nd + Eng l . R No 200 . 0.77 0.23 -0.05
Abreviations:
LS = Le tte r-so und training p ro vid e d se p arate ly Mult. = Multip le PA in 3 o r mo re skills
Categ. = Cate g o rizatio n o r id e ntity training Recip. Teach = Re cip ro cal te aching strate g ie s le arne d and ap p lie d to re ad ing
Meta. = Me taco g nitive activitie s to und e rstand p urp o se s, use o f PA * = Effe ct size s we re d rawn fro m fo llo wup te st p o ints.
Read Rec. = Re ad ing Re co ve ry® p ro g ram
Report
difficulties among children who have not made disabled readers; older children who were progressing
normal progress in learning to read? poorly in reading and who varied in intelligence with at
• Does phonics instruction improve children’s ability least some of them achieving poorly in other academic
to read and comprehend text as well as their areas, referred to as low-achieving readers.
decoding and word-reading skills? For children to learn to read, several capabilities must
• Does phonics instruction have an impact on be developed. The focus of systematic phonics
children’s growth in spelling? instruction is on helping children acquire knowledge of
the alphabetic system and its use to decode new words,
• Is phonics instruction effective with children at and to recognize familiar words accurately and
different socioeconomic (SES) levels? automatically. Knowing how letters correspond to
• Does the type of instruction given to control groups phonemes and larger subunits of words is essential for
as part of a study to evaluate phonics make a enabling beginning readers to sound out word segments
difference? and blend these parts to form recognizable words.
• If phonics instruction is found to be more effective Alphabetic knowledge is needed to figure out new
than less-phonics or no-phonics instruction, were words by analogy and to help beginners remember
the experiments that showed these effects well words they have read before. Knowing letter-sound
designed or poorly designed? relations also helps children to be more accurate in
predicting words from context. In short, knowledge of
Beginning reading programs that do not teach phonics the alphabetic system contributes greatly to children’s
explicitly and systematically may be of several types. In ability to read words in isolation or connected text.
whole-language programs, the emphasis is upon
meaning-based reading and writing activities. Phonics To study whether systematic phonics instruction
instruction is integrated into these activities but taught improves children’s ability to read words in various
incidentally as teachers decide it is needed. Basal ways, different measures have been used. Decoding
programs consist of a teacher’s manual and a complete was tested by having children read regularly spelled
set of books and materials that guide the teaching of words. To test whether children could read novel
beginning reading. Some basal programs focus on words, pseudowords (e.g., gan, bloff, trusk) were used.
whole-word or meaning-based activities with limited Sight vocabulary was examined through sets of leveled,
attention to letter-sound constituents of words and little miscellaneous words, not all of which were spelled
or no instruction in how to blend letters to pronounce regularly. In addition to word-reading, children’s
words. In sight word programs, children begin by performance on measures of oral reading, text
building a reading vocabulary of 50 to 100 words, and comprehension, and spelling was measured.
then later they learn about the alphabetic system. These To provide solid evidence, experiments to test the
types of non-phonics programs were among those contribution of systematic phonics instruction to reading
taught to children in the control groups of experiments acquisition must be well designed. Random assignment
examined by the NRP. Distinctions among the various of students to treatment and control groups is a
types of non-phonics programs are not absolute. procedure that controls for other factors and allows
However, their defining characteristic is that they do not researchers to conclude that the treatment itself was
provide explicit, systematic phonics instruction. the cause of any growth in reading. However,
Phonics programs have been used to teach young sometimes the realities of schools and teachers make it
children to read as they progress through the primary impossible to randomly assign students, so researchers
grades and to remediate the reading difficulties of poor have to use quasi-experimental designs, assigning
readers. The Panel analyzed studies that examined the treatment and control conditions to already existing
effectiveness of phonics programs with three types of groups. Although researchers should administer pretests
problem readers: children in kindergarten or 1st grade to determine whether the treatment and control groups
who were at risk for developing reading problems; older differed prior to treatment and then remove any
children of average or better intelligence who were not differences statistically when outcomes are analyzed,
making normal progress in reading, referred to as this is not always done. Also, larger sample sizes
provide more reliable findings, but access to many The primary statistic used in the analysis of
students is not always possible. In evaluating the performance on outcome measures was effect size,
evidence, the Panel attempted to rule out weak designs indicating whether and by how much performance of
as the explanation for any positive effects that were the treatment group exceeded performance of the
produced by systematic phonics instruction. control group, with the difference expressed in standard
deviation units. From the 38 studies entered into the
Methodology database, 66 treatment-control group comparisons were
To evaluate the evidence, the NRP conducted a meta- derived.
analysis. The literature was searched electronically to Studies were coded for several characteristics that
locate potential studies. To qualify for the analysis, were included as moderators in the meta-analysis:
studies had to meet the following criteria:
• Type of phonics program (synthetic programs
1. Studies had to adopt an experimental or quasi- emphasizing instruction in the sounding out and
experimental design with a control group. blending of words vs. programs teaching students to
2. Studies had to appear in a refereed journal after decode using larger subunits of words such as
1970. phonograms, as well as letters and sounds vs.
miscellaneous programs),
3. Studies had to provide data testing the hypothesis
• Specific phonics programs that were evaluated in at
that systematic phonics instruction improves reading
least three different studies (Direct Instruction;
performance more than instruction providing
Lippincott; Orton Gillingham; Sing Spell Read and
unsystematic phonics or no phonics instruction. To
Write; Benchmark Word ID; New Primary Grades
be considered an instance of phonics instruction, the
Reading System)
treatment had to teach children to identify or use
symbol-sound correspondences systematically. • Type of program taught to the control group (basal
program, regular curriculum, whole language
4. Studies had to measure reading as an outcome. approach, whole word program, miscellaneous
5. Studies had to report statistics permitting the programs)
calculation or estimation of effect sizes. • Group assignment procedure (random assignment
6. Studies were not those already included in the or nonequivalent groups)
NRP’s meta-analysis of phonemic awareness • Number of participants (blocked into quartiles)
training studies. • Grade level (kindergarten, 1st grade, 2nd through
From the potentially relevant list of references, 75 6th grades)
studies that appeared to meet the criteria were • Reading ability (normally developing, at risk, low
identified and located. These were carefully reviewed achiever, reading disabled)
to determine their suitability for the meta-analysis.
Studies of instructional interventions that might be found • Socioeconomic status (low, middle, varied, not
in schools were sought. Short-term laboratory studies given)
and studies that taught only a limited set of processes • Instructional delivery unit (class, small groups,
were eliminated. Also eliminated were studies that 1:1 tutoring).
simply compared different forms of phonics instruction Children identified as being low achieving or at risk for
but did not include a control group receiving reduced reading failure were those tested and shown to have
phonics or no phonics. Of the 75 studies screened, 38 poor letter knowledge, poor phonemic awareness, or
were retained and 37 were eliminated from the final set poor reading skills, or those in schools with low
used to calculate effect sizes. achievement, or those identified by teachers as needing
special help in reading, or those who qualified for
remedial programs in schools but the criteria for
selection were not specified. Children classified as
reading disabled were those identified according to IQ- limiting instructional attention to children with reading
reading discrepancy criteria in standard use by problems accounted for 65% of the comparisons, 38%
researchers or those given tests to determine that the involving poor readers considered at risk or low
disability was reading-specific. In some cases, achieving, and 27% diagnosed as reading disabled
exclusionary criteria were applied as well (e.g., no (RD). Studies involving first graders were
neurological, behavioral, or emotional disorders). overrepresented in the database, accounting for 38% of
the comparisons. Fewer kindergartners (12%) and
Across the studies, the effects of phonics instruction on
children in 2nd through 6th grades (23%) were
reading were most commonly assessed at the end of
represented. Children in the RD group spanned several
training. For programs lasting longer than one year,
ages and grades, ranging from ages 6 to 13 and grades
outcomes were measured at the end of each year in
2 through 6. Most of the studies (72%) were recent,
most cases. The primary outcome used in the meta-
conducted in the last 10 years.
analysis was that assessed at the end of training or at
the end of one year, whichever came first. Effect sizes Systematic phonics instruction typically involves
were calculated on six types of outcome measures: explicitly teaching students a prespecified set of letter-
sound relations and having students read text that
• Decoding regularly spelled real words
provides practice using these relations to decode words.
• Reading novel words in the form of pseudowords Instruction lacking an emphasis on phonics instruction
• Reading miscellaneous words some of which were does not teach letter-sound relations systematically and
irregularly spelled selects text for children according to other principles.
The latter form of instruction includes whole word
• Spelling words programs, whole language programs, and some basal
• Comprehending text read silently or orally reader programs.
• Reading text accurately aloud. The meta-analyses were conducted to answer several
The mean effect size across these measures was questions about the impact of systematic phonics
calculated to yield a general literacy measure for each instruction on growth in reading when compared to
comparison. A statistical program was employed to instruction that does not emphasize phonics. Findings
calculate effect sizes and to test the influence of provided strong evidence substantiating the impact of
moderator variables on effect sizes. An effect size of systematic phonics instruction on learning to read.
d = 0.20 is considered small; a moderate effect size is 1. Does systematic phonics instruction
d = 0.50; an effect size of d = 0.80 or above is large. help children learn to read more
Results and Conclusions effectively than nonsystematic phonics
instruction or instruction teaching no
There were 38 studies from which 66 treatment-control phonics?
group comparisons were derived. Although each
comparison could contribute up to six effect sizes, one Children’s reading was measured at the end of training
per outcome measure, few studies did. The majority if it lasted less than a year or at the end of the first
(76%) of the effect sizes involved reading or spelling school year of instruction. The mean overall effect size
single words while 24% involved text reading. The produced by phonics instruction was moderate in size
imbalance favoring single words is not surprising given and statistically greater than zero, d = 0.44. Findings
that the focus of phonics instruction is on improving provided solid support for the conclusion that systematic
children’s ability to read and spell words. Moroever, phonics instruction makes a bigger contribution to
many of the studies were conducted with beginning children’s growth in reading than alternative programs
readers whose reading development at the time of the providing unsystematic or no phonics instruction.
study was too limited to assess textual reading. Studies
2. Are some types of phonics instruction 3. Is phonics taught more effectively when
more effective than others? Are some students are tutored individually or when
specific phonics programs more effective they are taught in small groups or when
than others? they are taught as classes?
Three types of phonics programs were compared in the All three delivery systems proved to be effective ways
analysis: (1) synthetic phonics programs which of teaching phonics, with effect sizes of d = 0.57
emphasized teaching students to convert letters (tutoring), d = 0.43 (small group), and d = 0.39 (whole
(graphemes) into sounds (phonemes) and then to blend class). All effect sizes were statistically greater than
the sounds to form recognizable words; (2) larger-unit zero, and no one differed significantly from the others.
phonics programs which emphasized the analysis and This supports the conclusion that systematic phonics
blending of larger subparts of words (i.e., onsets, rimes, instruction is effective when delivered through tutoring,
phonograms, spelling patterns) as well as phonemes; (3) through small groups, and through teaching classes of
miscellaneous phonics programs that taught phonics students.
systematically but did this in other ways not covered by
the synthetic or larger-unit categories or were unclear 4. Is phonics instruction more effective
about the nature of the approach. The analysis showed when it is introduced to students not yet
that effect sizes for the three categories of programs reading, in kindergarten or 1st grade,
were all significantly greater than zero and did not differ than when it is introduced in grades
statistically from each other. The effect size for above 1st after students have already
synthetic programs was d = 0.45, for larger-unit begun to read?
programs, d = 0.34, and for miscellaneous programs, d
= 0.27. The conclusion supported by these findings is Phonics instruction taught early proved much more
that various types of systematic phonics approaches are effective than phonics instruction introduced after first
significantly more effective than non-phonics grade. Mean effect sizes were kindergarten d = 0.56;
approaches in promoting substantial growth in reading. first grade d = 0.54; 2nd through 6th grades d = 0.27.
The conclusion drawn is that phonics instruction
There were seven programs that were examined in produces the biggest impact on growth in reading when
three or more treatment-control group comparisons in it begins in kindergarten or 1st grade before children
the database. Analysis of the effect sizes produced by have learned to read independently. These results
these programs revealed that all were statistically indicate clearly that systematic phonics instruction in
greater than zero and none differed statistically from kindergarten and 1st grade is highly beneficial and that
the others in magnitude. Effect sizes ranged from d = children at these developmental levels are quite capable
0.23 to 0.68. In most cases there were only three or of learning phonemic and phonics concepts. To be
four comparisons contributing effect sizes, so results effective, systematic phonics instruction introduced in
may be unreliable. The conclusion drawn is that specific kindergarten must be appropriately designed for
systematic phonics programs are all significantly more learners and must begin with foundational knowledge
effective than non-phonics programs; however, they do involving letters and phonemic awareness.
not appear to differ significantly from each other in their
effectiveness although more evidence is needed to
verify the reliability of effect sizes for each program.
of d = 0.09 did not differ significantly from zero. One The conclusion supported by these findings is that the
factor contributing to the difference is that younger effectiveness of systematic phonics instruction found in
children were given credit for using phonics-based the present meta-analysis did not depend on the type of
knowledge to produce letter-sound spellings of words as instruction that students in the control groups received.
well correct spellings whereas older children were not. Students taught phonics systematically outperformed
Another factor may be that as children move up in the students who were taught a variety of nonsystematic or
grades, remembering how to spell words requires non-phonics programs, including basal programs, whole
knowledge of higher level regularities not covered in language approaches, and whole-word programs.
phonics programs. A third reason for the poor showing
among older students may be that the majority were 10. Were studies reporting the largest
poor readers, known to have difficulty learning to spell. effects of phonics instruction well
designed or poorly designed
The conclusion drawn is that systematic phonics experiments? That is, was random
instruction contributed more than non-phonics
assignment used? Were the sample sizes
instruction in helping kindergartners and 1st graders
sufficiently large? Might results be
apply their knowledge of the alphabetic system to spell
words. However, it did not improve spelling in students explained by differences between
above 1st grade. treatment and control groups that existed
prior to the experiment rather than by
8. Is phonics instruction effective with differences produced by the experimental
children at different SES levels? intervention?
Systematic phonics instruction helped children at all The effects of systematic phonics instruction were not
SES levels make significantly greater gains in reading diminished when only the best designed experiments
than did non-phonics instruction. The effect size for low were singled out. The mean effect size for studies using
SES students was d = 0.66 and for middle-class random assignment to place students in treatment and
students was d = 0.44. Both were statistically greater control groups, d = 0.45, was essentially the same as
than zero and did not differ from each other. The that for studies employing quasi-experimental designs, d
conclusion drawn is that systematic phonics instruction = 0.43, which used existing groups to compare phonics
is beneficial to students regardless of their SES. instruction and non-phonics instruction. The mean
9. Does the type of control group used to effect size for studies administering systematic phonics
and non-phonics instruction to large samples of students
evaluate the effectiveness of phonics
did not differ from studies using the fewest students.
instruction make a difference?
For studies using between 80 and 320 students, d =
The type of nonsystematic or non-phonics instruction 0.49; for studies using between 20 and 31students, d =
given to control groups to evaluate the effectiveness of 0.48. There were some studies that did not use random
systematic phonics instruction varied across studies and assignment and either failed to address the issue of pre-
included the following types: basal programs, regular existing differences between treatment and control
curriculum, whole language approaches, whole word groups or mentioned that a difference existed but did
programs, and miscellaneous programs. The question of not adjust for differences in their analysis of results.
whether systematic phonics instruction produced better The effect sizes changed very little when these
reading growth than each type of control group was comparisons were removed from the database, from d
answered affirmatively in each case. The effect sizes = 0.44 to d = 0.46.
were all positive favoring systematic phonics, were all
The conclusion drawn is that the significant effects
statistically greater than zero, and ranged from d = 0.31
produced by systematic phonics instruction on children’s
to 0.51. No single effect size differed from any of the
growth in reading were evident in the most rigorously
others.
designed experiments. Significant effects did not arise
primarily from the weakest studies.
11. Is enough known about systematic In addition to this general caution, several particular
phonics instruction to make concerns should be taken into consideration to avoid
recommendations for classroom misapplication of the findings. One concern relates to
implementation? If so, what cautions the commonly heard call for “intensive, systematic”
phonics instruction. Usually the term “intensive” is not
should be kept in mind by teachers
defined, so it is not clear how much teaching is required
implementing phonics instruction?
to be considered “intensive.” Questions needing further
Findings of the Panel regarding the effectiveness of answers are: How many months or years should a
systematic phonics instruction were derived from phonics program continue? If phonics has been taught
studies conducted in many classrooms with typical systematically in kindergarten and 1st grade, should it
classroom teachers and typical American or English- continue to be emphasized in 2nd grade and beyond?
speaking students from a variety of backgrounds and How long should single instructional sessions last? How
SES levels. Thus, the results of the analysis are much ground should be covered in a program? That is,
indicative of what can be accomplished when how many letter-sound relations should be taught and
systematic phonics programs are implemented in how many different ways of using these relations to
today’s classrooms. Systematic phonics instruction has read and write words should be practiced for the
been used widely over a long period of time with benefits of phonics to be maximum? These are among
positive results. A variety of phonics programs have the many questions that remain for future research.
proven effective with children of different ages,
Secondly, the role of the teacher needs to be better
abilities, and socioeconomic backgrounds. These facts
understood. Some of the phonics programs showing
should persuade educators and the public that
large effect sizes are scripted in such a way that
systematic phonics instruction is a valuable part of a
teacher judgment is largely eliminated. Although scripts
successful classroom reading program. The Panel’s
may standardize instruction, they may reduce teachers’
findings summarized above serve to illuminate the
interest in the teaching process or their motivation to
conditions that make phonics instruction especially
teach phonics. Thus, one concern is how to maintain
effective. However, caution is needed in giving a
consistency of instruction and at the same time
blanket endorsement to all kinds of phonics instruction.
encourage unique contributions from teachers. Another
It is important to recognize that the goals of phonics concern involves what teachers need to know. Some
instruction are to provide children with some key phonics programs require a sophisticated understanding
knowledge and skills and to insure that they know how of spelling, structural linguistics, and word etymology.
to apply this knowledge in their reading and writing. Teachers who are handed the programs but are not
Phonics teaching is a means to an end. To be able to provided with sufficient inservice training to use these
make use of letter-sound information, children need programs effectively may become frustrated. In view
phonemic awareness. That is, they need to be able to of the evidence showing the effectiveness of systematic
blend sounds together to decode words, and they need phonics instruction, it is important to ensure that the
to break spoken words into their constituent sounds to issue of how best to prepare teachers to carry out this
write words. Programs that focus too much on the teaching effectively and creatively is given high priority.
teaching of letter-sounds relations and not enough on Knowing that all phonics programs are not the same
putting them to use are unlikely to be very effective. In brings with it the implication that teachers must
implementing systematic phonics instruction, educators themselves be educated about how to evaluate different
must keep the end in mind and insure that children programs, to determine which are based on strong
understand the purpose of learning letter-sounds and evidence and how they can most effectively use these
are able to apply their skills in their daily reading and programs in their own classrooms.
writing activities.
As with any instructional program, there is always the
question: “Does one size fit all?” Teachers may be
expected to use a particular phonics program with their
class, yet it quickly becomes apparent that the program
suits some students better than others. In the early
grades, children are known to vary greatly in the skills motivational factors are taken into account—not only
they bring to school. There will be some children who learners’ but also teachers’ motivation to teach? How
already know most letter-sound correspondences, some does the use of decodable text as early reading material
children who can even decode words, and others who contribute to the effectiveness of phonics programs?
have little or no letter knowledge. Should teachers
proceed through the program and ignore these 1. Active Ingredients
students? Or should they assess their students’ needs Systematic phonics programs vary in many respects. It
and select the types and amounts of phonics suited to is important to determine whether some properties are
those needs? Although the latter is clearly preferable, essential and others are not. Because instructional time
this requires phonics programs that provide guidance in during the school day is limited, teachers and publishers
how to place students into flexible instructional groups of beginning reading programs need to know which
and how to pace instruction. However, it is common for ingredients of phonics programs yield the most benefit.
many phonics programs to present a fixed sequence of
lessons scheduled from the beginning to the end of the 2. Motivation
school year. Phonics instruction has often been portrayed as
Finally, it is important to emphasize that systematic involving “dull drill” and “meaningless worksheets.”
phonics instruction should be integrated with other Few if any studies have investigated the contribution of
reading instruction to create a balanced reading motivation to the effectiveness of phonics programs, not
program. Phonics instruction is never a total reading only the learner’s motivation to learn but also the
program. In 1st grade, teachers can provide controlled teacher’s motivation to teach. The lack of attention to
vocabulary texts that allow students to practice motivational factors by researchers in the design of
decoding, and they can also read quality literature to phonics programs is potentially very serious because
students to build a sense of story and to develop debates about reading instruction often boil down to
vocabulary and comprehension. Phonics should not concerns about the “relevance” and “interest value” of
become the dominant component in a reading program, how something is being taught, rather than the specific
neither in the amount of time devoted to it nor in the content of what is being taught. Future research on
significance attached. It is important to evaluate phonics instruction should investigate how best to
children’s reading competence in many ways, not only motivate children in classrooms to learn the letter-sound
by their phonics skills but also by their interest in books associations and to apply that knowledge to reading and
and their ability to understand information that is read to writing. It should also be designed to determine which
them. By emphasizing all of the processes that approaches teachers prefer to use and are most likely
contribute to growth in reading, teachers will have the to use effectively in their classroom instruction.
best chance of making every child a reader.
3. Decodable Text
Directions for Further Research Some systematic phonics programs are designed so that
Although phonics instruction has been the subject of a children are taught letter-sound correspondences and
great deal of study, there are important topics that have then provided with little books written carefully to
received little or no research attention, and there are contain the letter-sound relations that were taught.
other topics that, although previously studied, require Some programs begin with a very limited set and
further research to refine our understanding. expand these gradually. The intent of providing books
that match children’s letter-sound knowledge is to
Three important but neglected questions are prime enable them to experience success in decoding words
candidates for research: What are the that follow the patterns they know. The stories in such
“active ingredients” in effective systematic phonics books often involve pigs doing jigs and cats in hats.
programs? Is phonics instruction improved when Systematic phonics programs vary in the percentage of
decodable words in 1st-grade stories and in the
percentage of sight words introduced holistically to • Are there ways to improve the effectiveness of
make a good story. Surprisingly, very little research has systematic phonics instruction for poor readers
attempted to determine the contribution of decodable above 1st grade? Does this instruction need to take
books to the effectiveness of phonics programs. account of any maladaptive reading habits the
students have acquired or any sources impeding the
There are other important topics to be addressed in
incorporation of alphabetic knowledge and decoding
future research as well. These include the following:
strategies into their reading? Does this instruction
• Should systematic phonics instruction continue need to take account of the type of reading
beyond 2nd grade? If so, what are the goals of instruction they experienced in earlier years? Does
more advanced forms of phonics instruction and decoding instruction need to be combined with
does this instruction contribute to growth in comprehension instruction?
reading?
The purpose of literacy instruction in schools is to help • Are some types of phonics instruction more
children master the many challenges of written effective than others? Are some specific phonics
language. While teachers use a variety of activities to programs more effective than others?
accomplish this purpose, one central approach is to • Is phonics instruction more effective when it is
teach the alphabetic code that represents oral language introduced to students not yet reading, in
in writing. Children need to understand how letters, kindergarten or 1st grade, than when it is introduced
called graphemes, stand for the smallest sounds, called in grades above 1st after students have already
phonemes, in spoken words. Systematic phonics begun to read?
instruction teaches beginning readers the alphabetic
code consisting of a large set of correspondences • Is phonics instruction beneficial for children who
between graphemes and phonemes and perhaps larger are having difficulty learning to read? Is it effective
sub-units of words and how to use this knowledge to in preventing reading failure among children who
read words. In some phonics programs, beginners are are at risk for developing reading problems in the
taught a routine for transforming spellings into blends of future? Is it effective in remediating reading
phonemes that are recognized as words. Learning about difficulties among children who have not made
letter-sound associations helps beginners break the code normal progress in learning to read?
in learning to read. However, the English writing system • Is phonics taught more effectively when students
has other higher level, word-based regularities as well, are tutored individually, or when they are taught in
so, although phonics instruction contributes, it is not the small groups, or when they are taught as classes?
complete solution to word identification that it is in other
• Does phonics instruction improve children’s ability
written languages that are more fully phonemic (e.g.,
to read connected text as well as their decoding and
Spanish).
word reading skills?
Over the years educators have disagreed about how • Does phonics instruction have an impact on
beginning reading should be taught. Some have children’s growth in spelling?
advocated starting with a systematic phonics approach
while others have argued for a whole word approach or • Is phonics instruction effective with children at
a whole language approach. Disagreement has different socioeconomic levels?
centered on whether teaching should begin with • Does the type of instruction given to control groups
systematic explicit instruction in symbol-sound and used to evaluate the effectiveness of phonics
correspondences, whether it should begin with whole instruction make a difference? That is, is systematic
words, or whether initial instruction should be phonics more effective than forms of instruction
meaning-centered with correspondences taught that do not emphasize phonics, such as the whole
incidentally in context as needed. Most recently the word approach or meaning-centered approaches?
pendulum has swung toward providing children with
• If phonics instruction is found to be more effective
more explicit phonics instruction. Educators advocating
than less-phonics or no-phonics instruction, were
this shift have claimed that there is substantial research
the experiments showing these effects well
showing that approaches with an emphasis on phonics
designed or poorly designed?
instruction are more effective than approaches that do
not emphasize the teaching of phonics. To evaluate the evidence, a meta-analysis was
conducted. The Panel searched the literature to locate
The purpose of this report was to examine the research experimental studies published after 1970 that
evidence concerning phonics instruction. The Panel administered systematic phonics instruction to one
sought answers to the following questions: group of children and administered another type of
• Does systematic phonics instruction help children instruction that involved unsystematic phonics or no
learn to read more effectively than unsystematic phonics to a control group. Also the studies had to
phonics instruction or instruction teaching no examine phonics programs of the sort used in schools
phonics? rather than single-process-focused laboratory
procedures. The studies had to measure reading as an
outcome of instruction. In addition, studies were which he argued that children were being abused by the
excluded if they were in the Panel’s other database then-current whole word methodology. Flesh asserted
used to conduct a meta-analysis examining effects of that if children were taught only the 44 letter-sound
phonemic awareness instruction on reading. A total of correspondences, they would be able to read any word
38 studies meeting the NRP research criteria was they encountered, and there would be no reading
found. The studies were coded for various problems. Spurred on partially by Flesch and partially by
characteristics of students, instruction, and experimental advances in linguistics, new phonics programs were
design. A meta-analysis was conducted to examine the developed and began achieving wider usage in reading
size of effects that resulted when the performance of instruction (Aukerman, 1981; Popp, 1975).
students receiving systematic phonics instruction was
Chall’s (1967) review examined both the underlying
compared to that of students receiving another form of
theory and the classroom realities of these new phonics
instruction that did not focus on phonics. The outcomes
programs. But the core of her study was a
measured following instruction included children’s ability
comprehensive analysis of the research up to the
to read words and pseudowords, to read and
mid-1960s, including the then-unpublished First Grade
comprehend text, and also to spell words.
Studies. Chall’s basic conclusion continues to be cited to
Background and Rationale for this day, her finding that early and systematic instruction
the Meta-Analysis in phonics seems to lead to better achievement in
reading than later and less systematic phonics
Historical Overview instruction.
The question of whether instruction that includes an
It is important to note that Chall, in the 1967 edition of
initial emphasis on systematic phonics is more effective
her review, did not recommend any particular type of
than other forms of instruction in teaching children to
phonics instruction. Common forms of phonics
read has been addressed many times in the literature.
instruction in the 1960s included synthetic instruction,
The particular issues underlying interest in this question
analytic instruction, and linguistic readers (Aukerman,
have shifted over the years, but the topic has remained
1981). All of these challenged the sight word approach
controversial, and this has spawned a number of
of the day. However, in the 1983 edition of her review,
reviews of research.
Chall did suggest that synthetic phonics instruction held
In the 1960s, the Office of Education funded the a slight edge over analytic phonics instruction. Even in
Cooperative Research Program in First Grade Reading this, her recommendation was temperate.
(Bond & Dykstra, 1967, 1998) and Project Literacy
Chall’s (1967) basic finding has been reaffirmed in
(Levin & Williams, 1970). The First Grade studies
nearly every research review conducted since then
involved a wide-ranging research project, consisting of
(e.g., Adams, 1990; Anderson et al., 1985; Balmuth,
29 separate studies in different sites, all aimed at
1982). Also, one of the coordinators of the First Grade
determining the “best” approach to teaching beginning
Studies (Dykstra, 1968) published an analysis in which
reading. In contrast, Project Literacy attempted to
he concluded that the results of that project supported
identify the basic psychological and linguistic processes
Chall’s basic finding (Adams, 1990). Nevertheless, the
involved in learning to read and did not focus directly on
controversy has persisted over this issue (Grundin,
the pedagogy of reading. At the same time, the
1994; Taylor, 1998; Weaver, 1998). Part of the reason
Carnegie Foundation funded Jeanne Chall’s (1967)
that the debate has continued is that phonics instruction
comprehensive review of beginning reading instruction,
has become entangled with politics and ideology
Learning to Read: The Great Debate. That review, like
(Goodman, 1993; McKenna, Stahl, & Reinking, 1994;
the present report, was intended to analyze the results
Stahl, 1999). Another reason has been philosophical
of previous research.
disagreements about how children learn to read and
Concern about beginning reading instruction was not confusions about the implications of these varied points
confined just to the educational community but was of view.
very much in public discourse. Flesch (1955) had
authored a best selling book Why Johnny Can’t Read in
prespecify the relations to be taught. It is presumed that The issue of the control group is crucial. A meta-
exposing children to letter-sound relations as they read analysis compares a treatment to what is supposedly a
text will foster incidental learning of the relations they constant. However, in reality, the size of the effect is a
need to develop as readers. result of what goes on in both the treatment and the
control groups. A treatment can be very effective but
The meta-analysis was conducted to compare the
yield only a small effect size if instruction in the control
effectiveness of systematic phonics instruction to other
group is also effective. On the other hand, if the control
forms of instruction lacking an emphasis on phonics.
group’s instruction is particularly ineffective, by design
Included in the database were several studies that
or by accident, then the effect size is inflated. One must
provided whole language instruction to control groups
consider the nature of the control group in order to
and studies teaching whole word programs to control
interpret an effect size. The question addressed in the
groups. In fact, two studies in the database were
meta-analysis was whether phonics instruction
conducted for the purpose of evaluating the effects of
produced greater growth in reading than each of the
whole language programs, not phonics programs. In
various types of instruction given to control groups.
these studies, phonics was the form of instruction given
to control groups (Klesius et al., 1991; Freppon, 1991). Types of Phonics Instruction
Not only whole language and whole word instruction The hallmarks of systematic phonics programs are that
but also other forms of control-group instruction were children receive explicit, systematic instruction in a set
present in the database. Several control groups received of prespecified associations between letters and sounds,
some type of basal instruction, usually a program and they are taught how to use them to read, typically in
prescribed by the school or district. Basal programs texts containing controlled vocabulary. However,
consist of a whole package of books and supplementary phonics programs vary considerably in exactly what
materials that are used to teach reading. Teachers work children are taught and how they are taught (Adams,
from a thick manual that details daily lesson plans based 1990; Aukerman, 1981). Approaches to phonics
on a scope and sequence of the reading skills to be instruction may differ in several important ways
taught. Students are given workbooks to practice on including the following:
skills. Tests are used to place students in the proper 1. How many letter-sound relations are taught, how
levels of the program and to assess mastery of skills they are sequenced, whether phonics
(Aukerman, 1981). Basal reading programs do vary, but generalizations are taught as well (e.g., “When
one can assume that basal readers of the same era are there are two vowels side by side, the long sound of
roughly similar in their characteristics. The basal the first one is heard and the second is usually
programs given to control groups provided only limited silent.”), whether special marks are added to letters
or no systematic phonics instruction. to indicate their sounds, for example, curved or
A few studies utilized as their baseline control the straight lines above vowels to mark them as short
performance of comparable classes of students enrolled or long
in the same schools the year prior to the treatment 2. The size of the unit taught (i.e., graphemes and
(Snider, 1990; Vickery et al., 1987). In one case, a basal phonemes, or larger word segments called
program was used. In the other case, the type of phonograms, for example, -ing, or -ack which
program was not specified. Campbell and Stanley represent the rimes in many single-syllable words)
(1966) suggest that this design contains certain threats
to external validity, especially the differential history of 3. Whether the sounds associated with letters are
the two groups. pronounced in isolation (synthetic phonics) or only
in the context of words (analytic phonics)
Some studies in the database included more than one
control group. The Panel selected for the meta-analysis 4. The amount and type of phonemic awareness that
the group receiving the least phonics instruction. is taught, for example, blending or segmenting
sounds orally in words
5. Whether instruction is sequenced according to a A majority of the programs in the database used a
hierarchical view of learning with the steps synthetic approach to teach phonics. This instruction
regarded as a series of prerequisites (i.e., letters, typically begins by teaching children relations between
then letter-sound relations, then words, then individual letters and pairs of letters called digraphs
sentences) or whether multiple skills are learned (e.g., TH, AI, CH, OI) and all 44 sounds or phonemes
together of the language. These correspondences are introduced
systematically and sequentially. Children are taught to
6. The pace of instruction
decode unfamiliar words by sounding out the letters and
7. The word reading operations that children are blending them to pronounce a recognizable word.
taught, for example, sounding out and blending
However, the synthetic strategy presents two
letters, or using larger letter subunits to read words
difficulties for children. One is that blending words
by analogy to known words
containing stop consonants requires deleting “extra”
8. The involvement of spelling instruction (schwa vowel) sounds produced when letters are
pronounced separately, for example, blending “tuh-a-
9. Whether learning activities include extensive oral puh” requires deleting the “uh” sounds to produce the
drill-and-practice, reciting phonics rules, or filling blend “tap.” The second problem is that when the
out worksheets sounds to be blended exceed two or three, it becomes
10. The type of vocabulary control provided in text harder to remember and manage the ordering of all
(e.g., is the vocabulary limited mainly to words those sounds, for example, blending “s-tuh-r-ea-m” to
containing familiar letter-sound associations or are say “stream.”
sight words introduced to help create a meaningful Phonics programs have been developed to address
story?) these difficulties. One approach used has been to teach
11. Whether phonics instruction is embedded in or students to read larger subunits of words as well as
segregated from the literacy curriculum phonemes. For example, children learn to recognize ST,
AP, EAM, as blends so that there are not so many
12. The teaching approach, whether it involves direct separate parts of words to sound out and remember in
instruction in which the teacher takes an active role blending them. The larger units taught might include
and students passively respond, or whether a onsets (i.e., the consonants that precede the vowel such
“constructivist” approach is used in which the as “st” in stop) and rimes (i.e., the vowel and following
children learn how the letter-sound system works consonants such as “op” in stop), also called
through problemsolving phonograms, and spelling patterns characterizing the
13. How interesting and motivating the instructional common parts of word families (e.g., -ack as in pack
activities are for teachers and for students. and stack, -oat as in goat and float). Teaching children
to analyze and pronounce parts of words provides the
Systematic phonics programs included in the Panel’s basis for teaching them the strategy of reading new
database varied in many of these ways; so, it should not words by analogy to known words (e.g., reading stump
be assumed that the programs taught phonics uniformly. by analogy to jump). In the database, these studies are
One purpose of the meta-analysis was to examine distinguished and classified as teaching children to
whether different properties of phonics programs analyze and blend words by using larger phonological
influenced how effective they were in teaching children units.
to read. However, this purpose was thwarted by the
fact that most studies did not describe the phonics The database included 43 treatment-control
instruction in sufficient detail to permit coding the comparisons that taught synthetic phonics to the
properties listed above. As a result, the Panel selected treatment groups, 11 studies that used phonics
only one property for coding: whether programs treatments emphasizing larger subunits for blending
emphasized a synthetic approach in teaching children to words, two comparisons that combined both types of
read words or whether the emphasis was on larger
subunits of words.
programs, and ten comparisons that fit neither category, concepts about print, phonological awareness, and letter
referred to as miscellaneous. In the meta-analysis, names prior to formal reading instruction. Studies
effect sizes of the three larger sets of phonics types indicate that knowing letters and having phonemic
were compared. awareness are essential for learning to use the
alphabetic system to read and spell words (see the
In the database were seven phonics programs whose
NRP review of phonemic awareness instruction). Thus,
effectiveness was assessed in at least three different
formal, systematic phonics instruction that expects
treatment-control group comparisons. All but one of the
students to learn to decode words in kindergarten may
programs, Lovett’s analogy program, taught synthetic
be too much.
phonics. These programs together with the dates of
publication are listed below: On the other hand, in countries such as New Zealand
and the United Kingdom, the practice of introducing
• Direct Instruction, also referred to as DISTAR and
children to reading and writing at the age of 5 in full-day
Reading Mastery (1969, 1978, 1979, 1980, 1987,
programs has existed for many years. The Reading
1988)
Recovery© program (Clay, 1993) is designed to pick up
• Lovett’s adaptation of Direct Instruction (1994) the stragglers having difficulty at the age of 6, when
• Lovett’s adaptation of the Benchmark Word North American children are typically just beginning
Identification program (1994) reading instruction. Thus, the notion that kindergartners
are not ready for formal reading instruction at age 5 is
• The Lippincott Basic Reading program (1963, 1981) questionable.
• Beck and Mitroff’s New Primary Grades Reading
In some studies in the database, a middle road was
System (1972)
taken. Children were introduced to simplified reading
• Orton Gillingham programs (1940, 1956, 1969, 1979, and spelling activities using a basic set of letters and
1984) sounds that they were taught. Instruction began by
• Sing, Spell, Read, and Write (1972). providing a foundation for students and then building on
this to ease students into reading when they became
For each program, there were at least three treatment- ready for it. (See Blachman et al., 1999; Vandervelden
control group comparisons testing effects of that form & Siegel, 1997). In the meta-analysis, the contribution
of phonics instruction; so, effect sizes were examined of phonics instruction at the kindergarten level was
separately in a meta-analysis. Most of these programs examined across studies that varied in how much
were developed over 20 years ago, providing phonics material was covered.
researchers with more time to study them than recently
developed programs. The question addressed in the The most important grade for teaching phonics is
meta-analysis was whether these programs were thought to be 1st grade when formal instruction in
effective in promoting growth in reading and whether reading typically begins in the United States. Children
they differed in effectiveness. There was no apriori have foundational knowledge and are ready to put it to
reason to expect any differences. Likewise there was use in learning to read and write. In contrast,
no reason to expect these programs to be more introducing phonics instruction in grades above 1st
effective than programs not in the set being compared. means that children who were taught to read in some
other way may be required to switch gears in order to
Grade and Reading Ability incorporate phonics procedures into their reading and
A question of particular interest to the Panel was when writing. The database included studies that introduced
should phonics instruction begin. Should it be introduced phonics to students at various grade levels. The
in kindergarten when children may know very little question addressed in the meta-analysis was whether
about letters, phonemic awareness, or should it be the grade level in which phonics instruction was
started in 1st grade after children have received introduced made any difference in the outcomes
prereading or emergent reading experiences in observed. Another related question is whether phonics
kindergarten? According to Chall (1996a, b), beginners
need to develop foundational knowledge such as
instruction that was started in kindergarten is more of eight studies taught phonics through tutoring. The
effective than phonics instruction begun in 1st or 2nd remainder of the studies utilized small groups or whole
grade. Data were probed for an answer to this question classes to deliver instruction. Of interest was whether
as well. one type of delivery system produced greater gains in
reading than the other types. In the Panel’s analysis of
Phonics instruction has also been widely regarded as
phonemic awareness training effects, comparison of
particularly beneficial to children with reading problems
instructional units revealed that small groups produced
(e.g., Foorman et al., 1998). Many studies have shown
superior learning. However, it was expected that
that reading disabled children have exceptional difficulty
tutoring would be the most effective way to teach
decoding words (Rack, Snowling, & Olson, 1992). In
phonics.
fact, their level of performance falls below that of
younger non-disabled readers who read at the same Word Reading Pr ocesses: Assessing Gr
Processes: owth
Growth
grade-equivalent level, indicating a serious deficit in It is important to distinguish between the methods of
decoding skill. Phonics instruction that teaches disabled teaching reading and the processes that learners
readers to decode words should remediate this deficit acquire as they receive instruction and learn to read.
and should enable these students to make better Sometimes the two may be confused. For example, the
progress in learning to read. The meta-analysis term “sight word” has a “methods” meaning and a
evaluated the contribution made by phonics instruction “process” meaning. As a method, sight words are the
to growth in reading among children having difficulty high-frequency, irregularly spelled words students are
learning to read. taught to read as unanalyzed wholes, often on flash
Two types of children with reading problems have been cards, for example, said, once, their, come. In contrast,
distinguished by researchers, children who are the “process meaning” of sight words refers to words
unexpectedly poor readers because their intelligence that are stored in readers’ heads and that enable them
(an index of learning aptitude for some academic skills) to read those words immediately upon seeing them. Not
is higher than their reading ability, and children whose just high-frequency words but all words that readers
below-average reading is not surprising given that their practice reading become retained as sight words in
intelligence is also below average. Various labels such memory.
as dyslexic or learning disabled or reading disabled have Methods of teaching reading are aimed at helping
been applied to children whose higher IQs are learners acquire the processes they need to develop
discrepant with their poor reading skill. Children whose skill as readers. In considering how phonics instruction
lower reading scores are consistent with their lower promotes growth in reading, it is important to describe
IQs have been referred to as low achievers or garden the reading processes that learners are expected to
variety poor readers (Stanovich, 1986). The question of acquire.
interest was whether phonics instruction helps to
remediate reading difficulties for both types of poor Learning to read can be analyzed as involving two basic
readers. Studies in the database were brought to bear processes (Gough & Tunmer, 1986; Hoover & Gough,
on this question. 1990). One process involves learning to convert the
letters into recognizable words. The other involves
Deliver
Delivery y Systems for Teaching Phonics comprehending the meaning of the print. When children
There are various delivery systems that might be used attain reading skill, they learn to perform both of these
to teach phonics. Tutoring one-on-one is regarded as processes so that their attention and thought are
the ideal form of instruction for students who are having focused on the meaning of the text while word reading
difficulties because it allows teachers to tailor lessons to processes operate unobtrusively and out of awareness
address individual students’ needs. One of the best for the most part. Children acquire comprehension skill
known tutoring programs is Reading Recovery© (Clay, in the course of learning to speak. Comprehension
1993). The database included three studies that processes that children use to understand spoken
modified Reading Recovery© lessons to include language are thought to be the same ones that they use
systematic phonics instruction (Greaney et al., 1997;
Santa & Hoien, 1999; Tunmer & Hoover, 1993). A total
to read and understand text. In contrast, children do not Processing letter-sound relations in the words through
acquire word reading skill in the course of learning to decoding or analogizing creates alphabetic connections
speak. This achievement requires special experiences that establish the words in memory as sight words (Ehri,
and instruction. 1992; Share, 1995).
Many mental processes are active when readers read Systematic phonics instruction is thought to contribute to
and understand text. Readers draw on their knowledge the process of learning to read words in these various
of language to create sentences out of word sequences. ways by teaching readers use of the alphabetic system.
They access their background knowledge to construct Alphabetic knowledge is needed to decode words, to
meaning from the text. They retain this information in retain sight words in memory, and to call on sight word
memory and update it as they interpret more text. memory to read words by analogy. In addition, the
Readers monitor their comprehension to verify that the process of predicting words from context benefits from
information makes sense. alphabetic knowledge. Word prediction is made more
accurate when readers can combine context cues with
A central part of text processing involves reading the
letter-sound cues in guessing unfamiliar words in text
words. Four different ways can be distinguished (Ehri,
(Tunmer & Chapman, 1998).
1991, 1994):
One purpose of the meta-analysis was to examine
1. Decoding: Readers convert letters into sounds and
whether phonics instruction improves readers’ ability to
blend them to form recognizable words; the letters
decode words and to read words by sight. To study the
might be individual letters, or digraphs such as TH,
impact of phonics instruction on the various ways to
SH, OI, or phonograms such as ER, IGH, OW, or
read words, different measures have been used. The
spellings of common rimes such as -AP, -OT, -ICK.
ability to decode words is tested by giving children
Ability to convert letter subunits into sounds comes
regularly spelled words to read. The ability to decode
from readers’ knowledge of the alphabetic system.
novel words never read before is tested by having
2. Sight: Readers retrieve words they have already children read pseudowords. Children’s sight vocabulary
learned to read from memory. is examined by giving them miscellaneous words
including irregularly spelled words that are ordered by
3. Analogy: Readers access in memory words they grade level from preprimer to the highest grades.
have already learned and use parts of the spellings
to read new words having the same spellings (e.g., Methodology
using -ottle in bottle to read throttle).
Database
4. Prediction: Readers use context cues, their linguistic
and background knowledge, and memory for the An electronic search was conducted in two databases,
text to anticipate or guess the identities of unknown ERIC and PsycINFO. Three sets of terms were used
words. in the search. These terms were derived by the Panel
on the basis of analyses of various reference guides
Text reading is easiest when readers have learned to including the Literacy Dictionary (Harris & Hodges,
read most of the words in the text automatically by sight 1995), the Handbook of Research on Teaching the
because little attention or effort is required to process English Language Arts (Flood, Jensen, Lapp, & Squire,
the words. When written words are unfamiliar, readers 1991), the Encyclopedia of English Studies and the
may decode them or read them by analogy or predict Language Arts (Purves, 1994), and the Handbook of
the words, but these steps take added time and shift Reading Research (Barr, Kamil, Mosenthal, & Pearson,
attention at least momentarily from the meaning of text 1991; Pearson, Barr, Kamil, & Mosenthal, 1984).
to figuring out the words.
• Set 1: Alphabetic code, analogy approach, code
Readers need to learn how to read words in the various emphasis, compare-contrast, decodable text,
ways to develop reading skill. The primary way to build decoding, phonemic decoding, phonetic decoding,
a sight vocabulary is to apply decoding or analogizing phonological decoding, direct code, direct
strategies to read unfamiliar words. These ways of instruction, Reading Mastery, explicit instruction,
reading words help the words to become familiar.
explicit phonological processes, grapheme-phoneme 2. Studies had to appear in a refereed journal after
correspondences, graphophonic, Initial Teaching 1970.
Alphabet, letter training, letter-sound
3. Studies had to provide data testing the hypothesis
correspondences, linguistic method, McCracken,
that systematic phonics instruction improves reading
Orton-Gillingham, phoneme analysis, phoneme
performance more than instruction providing
blending, phoneme-grapheme correspondences,
unsystematic phonics or no phonics instruction. To
phonics, Alphabetic phonics, analytic phonics,
be considered an instance of phonics instruction, the
embedded phonics, structured phonics, synthetic
treatment had to teach children to identify or use
phonics, systematic phonics, phonological
symbol-sound correspondences systematically.
processing, Recipe for Reading, recoding,
phonological recoding, Slingerland approach, 4. Studies had to measure reading as an outcome.
Spaulding approach, word study, word sort, words
by analogy. These were combined using “or” 5. Studies had to report statistics permitting the
statements, meaning that all articles indexed by any calculation or estimation of effect sizes.
of these terms would be located. 6. Studies were not those already included in the
• Set 2: Beginning reading, beginning reading National Reading Panel’s meta-analysis of
instruction, instruction, intervention, learning to phonemic awareness training studies.
decode, reading improvement, reading instruction, From the various lists of references, 75 studies that
remedial training, remedial reading, remediation, appeared to meet the criteria were identified and
teaching, training, disabled readers, dyslexia, located. The goal was to analyze studies that resembled
reading difficulties, reading disability, reading failure, each other so that the corpus would be more
reading problems. These were combined in the homogeneous. Studies of instructional interventions that
search using “or” statements. might be found in schools were sought. Short-term
• Set 3: Miscues, oral reading, reading ability, reading laboratory studies and studies that provided instruction
achievement, reading acquisition, reading aloud, on only a limited set of processes were eliminated. Also
reading comprehension, reading development, eliminated were studies that simply compared different
reading processes, reading skills, silent reading, forms of phonics instruction but did not include a control
story reading, word attack, word identification, group receiving reduced phonics or no phonics. Of the
word recognition, word reading, nonword reading. 75 studies screened, 38 were retained and 37 were
These, too, were combined with “or” statements. eliminated from the final set used to calculate effect
sizes. The reasons for eliminating studies and the
The three sets of terms were used to locate potentially numbers of studies eliminated are listed in Table 1 on
relevant studies in the two databases. Articles selected the next page.
were those that included at least one term from each
set. Because the term spelling had not been included in Some minor deviations from the above procedures
Set 1, the search was run a second time with spelling occurred. More recent studies that would not yet have
crossed with Set 2 and Set 3 terms. The first search appeared in electronic searches were obtained from
uncovered 391 articles in PsycINFO and 520 articles in current issues of journals and preprints of in press
ERIC. The second search uncovered 252 articles in papers sent to members of the Panel. Also, Blachman
PsycINFO and 210 articles in ERIC. Abstracts were et al. (1999) conducted a 3-year longitudinal study to
printed and screened. evaluate the effects of phonemic awareness and
phonics instruction on children as they progressed from
To qualify for the analysis, studies had to meet the kindergarten through 2nd grade. Results of the first
following criteria: year were published as a separate study and included in
the Panel’s phonemic awareness meta-analysis. Results
1. Studies had to adopt an experimental or quasi-
of the more extensive 3-year study were included in the
experimental design with a control group.
phonics instruction database. This was the only study
analyzed in both reports.
Table 1
Reasons for Excluding Studies From the Database
Total: 37 studies
The primary statistic used in the analysis of • Type of control group (basal, regular instruction,
performance on outcome measures was effect size, whole language, whole word, miscellaneous)
indicating whether and by how much performance of • Group assignment procedure (random assignment
the treatment group exceeded performance of the or nonequivalent groups)
control group, with the difference expressed in standard
deviation units. The formula used to calculate raw • Number of participants (blocked into quartiles)
effect sizes for each treatment-control comparison • Grade level or age
consisted of the mean of the treatment group minus the
• Reading ability (normally developing, at risk/low
mean of the control group divided by a pooled standard
achiever, reading disabled)
deviation.
• Socioeconomic status (low, middle, varied, not
From the 38 studies entered into the database, 66 given)
treatment-control group comparisons were derived.
There were six cases in which the same control group • Instructional delivery unit (class, small groups,
was compared to two different phonics treatment 1:1 tutoring).
groups. There was one study in which the same control The studies, their properties, and effect sizes are listed
group was compared to four different treatments in Appendix G.
(Lovett et al., in press). Each comparison was treated
as a separate case with separate effect sizes in the Although the length of treatment was coded, it was not
database. used as a moderator variable. Many of the studies were
vague about the amount of time devoted to phonics
Studies were coded for several characteristics that instruction; so, it was not possible to calculate precise
were included as moderators in the meta-analysis: amounts of time spent, particularly in classroom studies
• Type of phonics program (synthetic vs. larger which provided instruction regularly throughout the
subunits vs. a combination of synthetic and larger school year. Also, treatment length was confounded
subunits vs. miscellaneous) with other variables considered to be more important,
such as whether students were tutored or taught in Six types of outcomes assessing growth in reading or
classes, whether students were poor or normally spelling were distinguished:
developing readers, whether students were beginners or
• Decoding of real words chosen to contain regular
older readers when they began instruction.
spelling-to-sound relationships
Some studies in the database selected normally • Reading nonsense words or pseudowords chosen to
developing readers to include in their experiments represent regular spelling-to-sound relationships.
whereas other studies singled out poor readers. These
students were grouped into four types of readers for • Word identification (in some cases, words were
analysis: chosen to represent irregular spelling-to-sound
relationships)
1. Normally developing readers: this category included
studies in which poor readers were excluded and • Spelling, assessed using either developmental stages
studies where no attempt was made to distinguish for younger children (Bear et al., 2000) or number
children by reading ability. of words correct
• Comprehension of material read silently or orally
2. Disabled readers: this category included children
who were identified as reading disabled according • Oral reading of connected text (accuracy).
to IQ-reading discrepancy criteria in standard use Measures reported in studies were classified into these
by researchers, or were given tests to determine types, and effect sizes were computed for each type of
that the disability was reading-specific; in some outcome. Some studies included several measures of an
cases, exclusionary criteria were applied as well outcome type and reported means on each measure. In
(e.g., no neurological, behavioral, economic, or these cases, effect sizes were calculated on each
emotional disorders); most of these children were measure and then averaged. This step insured that no
above 1st grade. single treatment-control comparison contributed more
3. Children at risk for developing reading difficulties in than one effect size to any single outcome category.
the future (kindergartners and 1st graders). Some studies included tests to assess whether students
were able to read or spell words that were taught
4. Children who were below average in their reading directly during phonics instruction. These results were
referred to as low achievers (children above 1st not included as outcomes in the database.
grade).
For each comparison, the mean effect size was
The latter two groups included children who exhibited calculated across whichever of the six measures had
poor letter knowledge, poor phonemic awareness, or been assessed in that study. This yielded an overall
poor reading skills, or those in schools with low outcome measure for each comparison. When studies
achievement, or those identified by teachers as needing reported performance on a general reading test but no
special help in reading, or those who qualified for more specific tests, the overall effect size was based on
remedial programs in schools but the criteria for the general measure. Outcomes that did not fit into the
selection were not specified. The at-risk label was above categories were not entered into the database.
applied to children in kindergarten and 1st grade
because they were still at a beginning level in their Performance of students was measured at various
learning. Children labeled low achievers in reading were points before, during, and after instruction. Entered into
those in 2nd grade and above whose identity as poor the database were outcomes of posttests measured at
readers was considered to be better established. Both three points in time: at the end of training, at the end of
groups included children who also had lower than the first school year if the program was taught for more
average IQs qualifying them as garden variety poor than one year, after a delay following training to assess
readers with generally low academic achievement, but long-term effects. The type of posttest most commonly
the groups were not limited to children with low IQs given was that occurring at the end of the program or at
because researchers either did not measure IQ or did the end of the school year when the program continued;
not use it to limit the readers selected for study. so, this was the outcome used in most of the analyses
of moderator variables.
Many of the studies limited instructional attention to An overall effect size was calculated for each of the 66
children with reading problems. These studies treatment-control group comparisons. This was the
accounted for 65% of the comparisons, with 38% average of the six specific outcome effect sizes (i.e.,
involving poor readers considered “at risk” or low decoding, word reading, comprehension, etc.) or the
achieving, and 27% involving children diagnosed as effect size from a general reading measure if no
reading disabled (RD). Studies involving 1st graders specific outcomes were measured. In the analyses, this
were overrepresented in the database compared to overall effect size is interpreted as assessing the impact
other grades and accounted for 38% of the of phonics instruction on growth in reading. Although
comparisons. Fewer studies involved kindergartners and one of the six was a spelling measure, spelling effect
children in 2nd through 6th grades, with these groups sizes contributed only 16% of the effect sizes that were
contributing 12% and 23% of the comparisons, averaged and reading measures contributed the rest
respectively. Children in the RD group spanned several (84%). Mean effect sizes obtained on various outcomes
ages and grades, ranging from ages 6 to 13 and grades associated with levels of the moderator variables are
2 to 6. Several properties of the studies in our database reported in Table 3 (Appendix E). Effect sizes were
were examined. Of interest was whether the studies tested statistically to determine whether each was
were older or more recent. A tally revealed the significantly greater than zero, indicating that superior
following distribution: performance of phonics-trained groups over control
groups was not a result of chance at p < 0.05.
1970 to 1979: 1 study
1980 to 1989: 9 studies Inspection across the effect sizes listed in Table 3
reveals that the vast majority were significantly greater
1990 to 2000: 28 studies
than zero (those marked with an asterisk). This
Thus, the majority of the studies were conducted over indicates that systematic phonics instruction was
the last 10 years. Most (66%) were carried out in the effective across a variety of conditions and
United States, but 24% were done in Canada, and the characteristics. The overall mean effect size of phonics
remainder in the United Kingdom, Australia, and New instruction on reading was d = 0.41 when effects of
Zealand. Thus, the evidence came from a variety of programs were tested at their conclusion. A few
locales. Other properties of comparisons in the programs lasted longer than 1 school year. To obtain
database are listed in Table 2 in Appendix D. another index of effects, outcomes measured either at
the end of the program or the end of the first school
Effects of Phonics Instruction on Outcome
year, whichever came first, were calculated. Results
Measures
revealed an effect size of d = 0.44. These findings
The statistic used to assess the effectiveness of phonics indicate that the effect produced by phonics instruction
instruction on children’s growth in reading was effect on reading was moderate in size. Unless otherwise
size which measures how much the mean of the stated, the test point used to assess effects of
phonics group exceeded the mean of the control group moderator variables in the meta-analyses was that
in standard deviation units. An effect size of 1.0 occurring at the end of training or at the end of the first
indicates that the treatment group mean was one school year, whichever came first.
standard deviation higher than the control group mean,
Phonics instruction in most of the studies lasted 1 school
suggesting a strong effect of training. An effect size of
year or less. However, there were four treatment-
0 indicates that treatment and control group means
control comparisons in which longer training was
were identical, suggesting that training had no effect. To
provided. In these studies, children at risk for reading
judge the strength of an effect size, values suggested by
problems began phonics instruction in kindergarten or
Cohen (1988) are commonly used. An effect size of
1st grade and continued for 2 or 3 years. Outcomes
0.20 is considered small; a moderate effect size is 0.50;
were measured at the end of each school year
an effect size of 0.80 or above is large.
(Blachman et al., 1999; Brown & Felton, 1990;
Torgesen et al., 1999). Characteristics and results of the
four comparisons drawn from these studies are
presented in Table 4. Mean effect sizes across the four
comparisons were sizeable and their strength was outcome measures examined, not only word reading
maintained across the grades: kindergarten d = 0.46; 1st and spelling but also text processing. Inspection of the
grade d = 0.54; 2nd grade d = 0.43. This indicates the size of the effects provided support for the various
value of starting phonics early and continuing to teach it hypotheses. The strongest effects occurred on
for 2 to 3 years. (See results below for additional measures of decoding regularly spelled words (d =
evidence regarding the value of teaching phonics early.) 0.67) and pseudowords (d = 0.60). These effects were
In the Blachman et al. (1999) study, instruction was not statistically larger than effects observed on the other
given to all 2nd graders but only to those who had not measures which did not differ from each other. This
attained the goals of the program after 2 years of indicates that phonics instruction was especially
instruction. These findings point to the importance of effective in teaching children to decode novel words,
programs providing tests for teachers to use to one of the main goals of phonics.
determine which children need additional systematic
Effect sizes on comprehension measures (d = 0.27) and
phonics instruction and which have mastered the
oral reading measures (d = 0.25) were statistically
processes taught.
greater than zero, indicating that phonics instruction
A few studies examined effects of phonics instruction significantly improved children’s text processing skills as
several months after the treatment had ended. The well as their word reading skills. The fact that effects
specific comparisons together with their properties are of phonics instruction on reading comprehension were
listed in Table 4 (Appendix E). Followup tests were positive serves to dispel any belief that teaching phonics
administered from 4 months to 1 year after training. As to children interferes with their ability to read and
shown in Table 3, the effect size remained significantly comprehend text. Quite the opposite is the case.
greater than zero, indicating that the impact of phonics
Several reasons explain why effects were somewhat
instruction lasted well beyond the end of training
smaller on text processing measures than on word
although its size was somewhat diminished (from d =
reading measures. The tests of comprehension were
0.51 to d = 0.27).
predominantly standardized tests which are less
The aim of phonics instruction is to help children sensitive when the range of performance is limited. The
acquire knowledge and use of the alphabetic system to target of phonics instruction is teaching children how to
read and spell words. Phonics was expected to exert its read words. Although word recognition skill influences
greatest impact on the ability to decode regularly spelled how well children can read and comprehend text, there
words and nonwords. Phonics instruction was also are other processes that are important as well.
expected to exert a large effect when spelling was Moreover, readers can still get meaning from text even
measured using a developmental spelling scale, which when they cannot read some of the words.
gives credit for letter-sound spellings as well as correct
spellings (e.g., Bear et al., 2000; Blachman et al., 1999). Analysis of Moderator Variables
These capabilities all benefit directly from alphabetic Studies in the database varied in several respects that
knowledge. Phonics instruction was expected to exert a were coded and analyzed as moderator variables. Of
significant but smaller impact on the ability to read interest was whether these moderator variables
miscellaneous words that included irregularly spelled enhanced or limited the effectiveness of systematic
words. Although alphabetic knowledge is not helpful for phonics instruction on growth in reading. It is important
decoding irregularly spelled words, it does help children to recognize the limitations of this type of analysis and
remember how to read these words (Ehri, 1998). the tentative nature of any conclusions that are drawn.
Phonics instruction was expected to impact text reading Findings involving the impact of moderator variables on
processes. The effect was expected to be significant effect sizes cannot support strong claims about
but smaller because its influence is indirect. moderators being the cause of the difference.
From Table 3 (Appendix E), it is apparent that effect Moderator findings are no more than correlational. The
sizes for all six types of measures were statistically biggest source of uncertainty is whether there is a
greater than zero, indicating that phonics instruction hidden variable that is confounded with the moderator
significantly improved performance on all of the and is the true cause of the difference.
comparisons that measured reading as an outcome. In (Appendix E) show that, among kindergartners and 1st
the PA analysis, effects were moderate in size and graders, phonics instruction produced significant growth
statistically significant. The effect size in the PA on all six outcome measures whose effect sizes were
analysis (d = 0.48) was close to the effect size statistically greater than zero. Because a central goal in
produced by phonics instruction (d = 0.58). Combined, phonics programs is to teach students to decode novel
these findings clearly support the importance of words, one would expect the strongest effects to be
teaching phonemic awareness and grade-appropriate evident in decoding tasks. This is what was found. The
phonics in kindergarten. Indeed, some of the phonemic largest effect size was produced on the measure of
awareness training studies that taught children to decoding regularly spelled words (d = 0.98). Moderately
analyze phonemes using letters would have qualified as large effects were also produced on measures of
phonics studies. If these PA studies had not been decoding pseudowords (d = 0.67) and spelling words (d
excluded from the phonics database, there would have = 0.67). The effect size was somewhat reduced on the
been more kindergarten comparisons. word identification outcome (d = 0.45). This is not
surprising since tests of word identification often
The above findings suggest that when phonics
included irregularly spelled words not amenable to
instruction is introduced and taught in kindergarten or
decoding.
1st grade to readers who have little reading ability, it
produces a larger effect than when phonics is Phonics instruction with its emphasis on teaching letter-
introduced in grades above 1st grade with readers who sound relations would be expected to improve beginning
have already acquired some reading skills. However, readers’ ability to spell words by writing the sounds
before concluding that phonics is truly less effective they hear. Studies with younger children commonly
with older children, it is important to consider several employed developmental spelling scoring systems that
mitigating factors. The majority of the comparisons in gave credit for phonetically plausible spellings, for
the older group, 78%, involved either low achieving or example, spelling feet as FET or car as KR (Tangel &
disabled readers. Remediating their reading problems Blachman, 1995; Morris & Perney, 1984). This may
may be especially difficult. In addition, there were only explain the sizeable effect observed on the spelling
seven comparisons involving older, normally developing outcome (d = 0.67).
readers, and four of these came from one study using
Among beginning readers, phonics instruction exerted a
the Orton-Gillingham method, a program developed for
significant impact on reading comprehension. The
disabled readers, not for non-disabled upper elementary
effect size, based on ten 1st grade and one kindergarten
level readers. Perhaps other types of phonics programs
comparisons, was moderate (d = 0.51). However, the
designed expressly to improve reading in older non-
effect size on another measure of text reading, oral
disabled children might prove more effective. This
reading, was smaller but also significantly greater than
question awaits more research.
zero (d = 0.23 based on two kindergarten and four 1st
The set of effect sizes for the older students proved to grade comparisons). Why phonics skills facilitated
be homogeneous, indicating that chance, rather than reading comprehension more than oral reading is not
other moderator variables, explains the variation in clear. It may have to do with the nature of the tests.
effect sizes. The two types of poor readers, low Standardized comprehension tests at this level generally
achievers and RDs, contributed the majority of the use extremely short (usually one sentence) “passages.”
effect sizes to this pool. These findings indicate that low On these short passages, the effects of decoding should
achieving readers and disabled readers do not differ in be strong. Some tests, such as the Gates-MacGinitie,
their response to phonics instruction. favor phonetically regular words in these passages.
Oral reading measures, on the other hand, use longer
Specific Outcomes in Younger Readers passages, sometimes containing pictures which would
Because the younger and older children differed in their enhance the utility of context.
response to phonics instruction, the question of whether
phonics instruction impacted children’s ability to decode
and spell words and to read text was answered
separately for the two groups. Results in Table 3
One would expect effect sizes on text reading and word This effect size was not statistically different from zero.
reading to be similar because 1st graders’ ability to read Likewise, phonics programs did not produce significant
and understand text is heavily influenced by their ability growth in reading comprehension (d = 0.12) although a
to read the words in the text, perhaps somewhat more small, statistically significant effect was observed on
so than in later grades. This is supported by Juel (1994) oral reading (d = 0.24).
who found a very high correlation between word
Because the comparisons involving older children
recognition and reading comprehension in 1st grade
included a large number focusing on disabled readers,
(r = 0.87) and found that the correlation was somewhat
the 17 RD comparisons were analyzed separately.
lower in 2nd grade (r = 0.73).
Effect sizes proved almost identical to those for the
In sum, these findings show that systematic phonics larger group reported in Table 3 (Appendix E) with one
instruction helped beginning readers acquire and use the important exception. The effect size on the measure of
alphabetic system to read and spell words in and out of reading comprehension, though small, was statistically
text. Children who were taught phonics systematically greater than zero (d = 0.27, based on eight comparisons
benefited significantly more than beginners who did not that were homogeneous). This indicates that, contrary
receive phonics instruction in their ability to decode to the general finding of no effect, systematic phonics
regularly spelled words and nonwords, in their ability to instruction did help reading disabled students
remember how to read irregularly spelled words, and in comprehend text more successfully than nonsystematic/
their ability to invent phonetically plausible spellings of no-phonics programs.
words. In addition, phonics instruction contributed
Because most of the comparisons above 1st grade
substantially to students’ growth in reading
involved poor readers (78%), the conclusions drawn
comprehension and somewhat less to their oral text
about the effects of phonics instruction on specific
reading skill.
reading outcomes pertain mainly to them. Findings
Specific Outcomes in Older Readers indicate that phonics instruction helps poor readers in
Students above the 1st grade were introduced to 2nd through 6th grades improve their word reading
phonics instruction in their classes or in pull-out skills. However, phonics instruction appears to
programs for periods lasting up to a school year. These contribute only weakly, if at all, in helping poor readers
students included children who were low achieving apply these skills to read text and to spell words. There
readers as well as children diagnosed as reading were insufficient data to draw any conclusions about
disabled. Effects of phonics instruction on six outcome the effects of phonics instruction with normally
measures were compared. Results in Table 3 developing readers above 1st grade.
(Appendix E) show that substantial growth occurred in The absence of effects on spelling is noteworthy since
learning to decode regularly spelled words (d = 0.49) the same finding was detected in the Panel’s meta-
and pseudowords (d = 0.52), with effect sizes analysis of phonemic awareness instruction. In the PA
statistically greater than zero in the moderate range. review, the Panel found that younger readers
This shows that phonics programs were significantly experienced growth in spelling as a result of phonemic
more effective than control programs in improving these awareness training, but the older disabled readers did
students’ knowledge and use of the alphabetic system not show improvement over controls. One possible
which is the focus of phonics programs. Growth in the explanation is that poor readers experience special
reading of miscellaneous words with irregularities was difficulty learning to spell (Bruck, 1993). Remediation of
somewhat smaller but significant (d = 0.33), indicating this difficulty may require special instruction targeted at
that phonics improved students’ ability to read spelling. Another explanation may be that as readers
irregularly spelled words, presumably by improving their move up in the grades, remembering the spellings of
memory for these words. words is less a matter of applying letter-sound
In contrast to strong positive effects of phonics correspondences and more a matter of knowing more
instruction on measures of word reading, these advanced spelling patterns and morphologically based
programs were not more effective than other forms of regularities which is not typically addressed in phonics
instruction in producing growth in spelling (d = 0.09). instruction.
Further research is needed to explore the value of that phonics instruction contributed to growth in reading
phonics instruction in grades beyond 1st grade. Perhaps in all groups but the 2nd through 6th grade low achiever
phonics instruction could be made stronger by group. Among the at-risk and normal readers in
combining it with instruction that helps children learn to kindergarten and 1st grades, effect sizes were
read words in other ways, specifically, reading words moderate to high, ranging from d = 0.48 to d = 0.74.
from memory, reading words by analogy to known Effect sizes were smaller for 2nd though 6th grade
words, and reading words using spelling patterns and normal readers (d = 0.27) and disabled readers (d =
multisyllabic decoding strategies. Some phonics 0.32). These findings extend the analysis above by
programs in the database did teach children about revealing effect sizes for specific reader ability groups
spelling patterns and the use of an analogy strategy to at each grade level. Findings indicate that the strong
read words (see results presented below). Also it may impact of phonics instruction was evident in normally
be important for phonics programs to include systematic developing 1st graders as well as at-risk kindergartners
instruction in reading fluency and automaticity when and 1st graders.
phonics is taught to older students. A few of the
There was one group for whom phonics instruction
programs in the database included exercises to promote
failed to exert a statistically significant impact on the
fluency. Very likely, phonics programs that emphasize
students’ growth in reading. This occurred in the eight
decoding exclusively and ignore the other processes
comparisons involving low achievers in 2nd through 6th
involved in learning to read will not succeed in making
grades (d = 0.15). Although smaller, the effect size for
every child a skilled reader.
low achievers did not differ significantly from the effect
Separation of Reader Ability Groups size of disabled readers (d = 0.32).
at Each Grade Level Alternative explanations for the ineffectiveness of
To clarify whether and how readers with different phonics instruction with older poor readers in 2nd
reading abilities across the different grades responded through 6th grades can be offered. Their reading
to phonics instruction, treatment-control group difficulties may have arisen from sources other than
comparisons were grouped by grade and reading ability. decoding, such as lack of fluency or poor reading
There were 62 comparisons with posttests administered comprehension skills (see other sections of the NRP
when the program was completed or at the end of the report for elaboration of these reading processes). The
first year of the program, whichever came first. Table 5 fact that the IQs of some of the children in these
(Appendix E) shows how these comparisons were studies were below normal points to comprehension
distributed across the grade-by-reader-ability cells. difficulties as a possibility. Another explanation may be
Six groups were formed for the meta-analysis: that these children were not given sufficiently intensive
phonics instruction to remediate their difficulties. In
• 1st grade normally achieving readers Table 4 are listed properties of the treatment-control
• 2nd through 6th grade normally achieving readers group comparisons involving low achievers. Inspection
of the characteristics of these studies reveals that only
• kindergarten children at risk for reading problems one provided tutoring, thought to be the most effective
• 1st grade children at risk way to teach phonics (but see below), whereas seven
involved class instruction. However, there may be too
• 2nd through 6th grade low achievers
few studies of low achieving readers in the database
• disabled readers. (only eight) to draw firm conclusions. Further research
More precise grade and age information is given in is needed to explore how best to remediate their reading
Table 2 (Appendix D), which lists characteristics of difficulties.
each treatment-control group comparison.
Ef fects of Phonics Instruction Lasting 2 to 3 Years
Effects
The outcome measure was the overall effect size The evidence on older readers above 1st grade
averaged across the six specific measures. Effect sizes reviewed so far provides no information about the
significantly greater than zero were evident for five of effects of phonics instruction on older students who
the six groups of readers. From Table 3, it is apparent began phonics instruction in kindergarten or 1st grade.
complemented reading in most programs. The programs than control programs in helping children learn to read.
in this category included Edmark, Hiebert’s embedded The 39 synthetic phonics programs produced a
code program, three Reading Recovery© programs moderate impact on growth in reading (d = 0.45). The
modified to include systematic phonics, and a program 11 programs that emphasized larger units created a
derived from the Benchmark Word Identification somewhat smaller impact (d = 0.34) and likewise the
program. ten miscellaneous programs’ effect was smaller (d =
0.27). However, the three effect sizes did not differ
One of the 11 studies in the Larger Unit category, that
statistically from each other (p > 0.05). There were
by Tunmer and Hoover (1993), produced an atypical
relatively few comparisons in the larger unit group.
effect size, d = 3.71, which was much larger than the
Additional research would be useful for determining
other effects. It should be noted that this study was
whether the small difference between the synthetic and
atypical in that it was more intensive than most others.
larger unit approaches is a reliable one.
It involved one-on-one tutoring by highly trained
teachers, and it combined phonemic awareness, Specific Phonics Programs
phonics, and Reading Recovery© instructional There were seven phonics programs that were studied
strategies. To reduce the influence of this comparison in three or more treatment-control comparisons. The
on the overall mean, its effect size was reduced to identities of programs and properties of the comparisons
equal the next largest effect size in the set, d = 1.41. testing their effectiveness are listed in Table 6
(This method of adjusting effect sizes to deal with (Appendix F). Descriptions of the programs are
outliers was only applied in analyses that involved a provided in Table 7 (Appendix E). Effect sizes of these
small number of comparisons.) comparisons were subjected to a meta-analysis. Results
The third category, referred to as miscellaneous, in Table 3 (Appendix F) reveal that all effect sizes were
consisted of phonics programs that did not fit into the statistically greater than zero, indicating that all the
synthetic or larger unit categories. In some studies, the phonics programs produced significantly greater growth
descriptions of programs did not state that a synthetic in reading than control group programs. The sets of
strategy was taught. If the program was not known to effect sizes for all but one of the programs proved to be
teach this decoding strategy, then it was placed in the homogeneous. Effect sizes ranged from a high of d =
miscellaneous category. Also, if the scope of instruction 0.68 for the Lippincott program to a low of d = 0.23 for
was limited and did not constitute a full phonics program the Orton-Gillingham-based programs. Possible reasons
(i.e., Haskell et al., 1992; Lovett et al., 1990), it was for lower effect sizes in the case of Orton Gillingham
considered to be miscellaneous. This set included a comparisons are evident in Table 6 (Appendix F).
spelling program, traditional phonics basal programs, Class-based instruction predominated, and this
and some researcher-devised instruction that focused instruction was tested exclusively with older students
on word analysis procedures. (2nd through 6th graders) many of whom were poor
readers. These conditions may have made it harder to
The fourth category, referred to as combination produce substantial growth in reading.
programs, included only two comparisons. However,
these could not be fit into the other categories because Although there appear to be sizeable differences in
they examined the effects of teaching two of the other effect sizes distinguishing the programs, the statistical
categories, a synthetic phonics program and a larger- test was not significant. However, drawing the
units word analogy program (Lovett et al., in press). conclusion that these programs are equally effective is
The comparisons differed in the order that the two premature because there were too few comparisons
programs were taught. The mean effect size for the assessing each program to yield reliable results. Rather,
combined programs was d = 0.42. findings should be considered suggestive in need of
more studies for verification.
Effect sizes reported in Table 3 show that programs in
all three categories produced effect sizes that were
significantly greater than zero. This verifies that the
three types of phonics programs were more effective
Evaluation of these separate programs was undertaken phonemic awareness training studies, findings indicated
in the meta-analysis solely because of their prevalence that effect sizes were significantly greater with small
in the database. The programs are older and hence groups than with classrooms or tutoring. Because
more frequently studied than newer programs. But this classrooms involve a much higher ratio of students to
does not mean that they are considered to be any better teachers, phonics instruction delivered in this setting
than newer programs that were not analyzed. was expected to be less effective than in the other two
settings.
Impact of Synthetic Phonics Programs on
Different Groups of Readers In categorizing studies, it was easiest to determine
Because there were so many comparisons (39) when tutoring was used because this was clearly stated
assessing the effects of synthetic phonics programs, it and described. Identifying whether studies used small
was possible to examine whether this type of program groups was also straightforward because training
was more beneficial for some grade and reader ability procedures included this descriptive although it was not
groups than for others. Two groups, at-risk always clear that this was the only way that instruction
kindergartners and at-risk 1st graders, had the same was delivered. However, in the case of whole class
effect size so they were combined into one group instructional, sometimes this category was attributed to
comprising nine comparisons. As evident in Table 3, all studies by default. In many reports, descriptions made
groups but one showed effect sizes significantly greater clear that the phonics program was taught by teachers
than zero, and all but one group had homogeneous sets to their classrooms of students, but the unit of
of effects. This indicates that synthetic phonics instruction they used to teach the phonics part of
programs produced stronger growth in reading than programs was not explicitly stated; so, it was inferred to
control programs in most of the different reader groups. be the class.
Possible reasons why low-achieving readers in 2nd Before the meta-analysis was conducted, an adjustment
through 6th grades did not benefit were suggested was made to one effect size in the tutoring
earlier. comparisons. This was considered important because
Effect sizes varied across the groups. A test to there were only eight comparisons in this set. One of
determine whether some groups benefited more from the tutoring studies (Tunmer & Hoover, 1993) produced
synthetic phonics than other groups showed that effects an atypical effect size, d = 3.71, which was much larger
were significantly greater for at-risk kindergartners and than the other effects. To limit the influence of this
first graders (d = 0.65) than for the two groups of older comparison on the overall mean, its value was reduced
2nd through 6th grade readers. These findings indicates to equal the next largest effect size in the set, d = 1.99.
that synthetic phonics programs were especially Results of the analysis of effect sizes for the three
effective for younger, at-risk readers. types of instructional units revealed that all produced
positive effects that were statistically greater than zero,
Instructional Delivery Unit indicating that tutoring, small groups and classes were
Another property of systematic phonics instruction all effective ways to deliver phonics instruction to
expected to influence growth in reading was the students (see Table 3). In addition, the set of effect
delivery unit. Three types were distinguished. There sizes for comparisons involving small groups was
were eight treatments in which students received one- homogeneous, indicating that small group effects are
to-one tutoring. This was expected to be the most not explained by additional moderator variables and that
effective form of phonics instruction, particularly for the mean is a good estimate of the actual effect size, d
low achieving and disabled readers, because it was = 0.43.
tailored to individual students. Small group instruction
was also expected to be especially effective because Tutoring produced an effect size of d = 0.57 which was
attention to individual students was still possible, and in greater than the effect size for small groups, d = 0.43,
addition, the social setting was expected to enhance and for classrooms, d = 0.39. However, none of these
motivation to perform and opportunities for effects differed statistically from each other. This
observational learning. In the Panel’s review of evidence falls short in supporting the expectation that
tutoring would prove especially effective for teaching years was used as a baseline without additional
phonics. However, perhaps there were too few description of the actual program taught. In
comparisons assessing the effects of tutoring (only comparisons involving students identified as at risk by
eight) to yield reliable findings. On the other hand, it schools, control groups received the standard
might be noted that the instructional delivery given to intervention offered by the schools to treat reading
the control groups against which tutoring was compared problems.
did not involve tutoring in the majority (62%) of the
Whole language was the label used by authors to
cases. This inequality should have given tutoring an
characterize programs. In two studies (Freppon, 1991;
extra advantage. However, it did not.
Klesius et al., 1991), the purpose was to examine the
Inspection of effect sizes for individual studies in Table effectiveness of whole language programs, not phonics
2 reveals that some whole class programs produced programs that were taught to control groups. In both
effect sizes as large, and sometimes larger, than those cases, phonics was taught with a “skill and drill” basal
produced by small groups or tutoring. Given the program that was not well described. Control groups
enormous expense and impracticality of delivering that were taught with a Big Books program and with
instruction in small groups or individually—except for language experience were labeled as whole language.
children who have serious reading difficulties—
There were a few programs given to control groups
research is needed to determine what makes whole
that taught whole words or sight words without much
class phonics instruction effective.
attention to letter-sound relations. These were classed
It is interesting to note that the same comparison of as whole word programs.
instructional units was conducted in the meta-analysis
Control group programs that did not fit into one of these
of phonemic awareness training effects. Results
categories were placed in a miscellaneous category.
showed that small groups were significantly more
These included programs teaching traditional spelling,
effective than tutoring or classrooms. Why small groups
academic study skills, and tutoring in academic subjects.
were more effective for teaching phonemic awareness
In one case, as a control for parents teaching their own
but not phonics is not clear and awaits further research.
children systematic phonics, the children spent time
Type of Contr ol Gr
Control oup
Group reading books to their parents (Leach & Siddall, 1990).
To test whether systematic phonics programs produced Of interest was whether phonics instruction would
superior growth in reading, researchers utilized control produce superior growth in reading regardless of the
groups that received unsystematic phonics or no- type of control group, and whether phonics instruction
phonics instruction. The types of control groups chosen would appear more effective when compared to some
by researchers varied across the studies. As mentioned types of control groups than to others. There were no a
earlier, some studies included more than one type of priori reasons for expecting effect sizes to be influenced
control group. Selected for analysis were the control by the type of control group, particularly since the
groups that were taught the least amount of phonics. criteria of standard-classroom instruction with minimal
These were categorized into five types based on phonics had been applied consistently across studies in
descriptions and labels provided in the studies: basal, selecting control groups.
regular curriculum, whole language, whole word, and
miscellaneous. Results in Table 3 (Appendix E) reveal that all of the
control groups yielded effect sizes that were statistically
Usually basal programs were those already in use at greater than zero and all favored the phonics treatment.
schools. “Regular curriculum” was the label covering Effects sizes ranged from d = 0.31 for whole language
cases in which controls received the traditional controls to d = 0.51 for whole word controls. Effect
curriculum or the regular class curriculum in use at sizes for basal and miscellaneous control groups were
schools with no further specification of its contents homogeneous. Additional tests revealed that none of the
other that asserting it did not teach phonics
systematically. This category covered cases where
performance in that grade at that school during previous
effect sizes differed significantly from the others. These Pre-Experimental Differences
findings indicate that systematic phonics instruction Studies were also coded for the presence of possible or
proved effective regardless of the type of control group actual pretest differences between treatment and
that was used. control groups. Effect sizes for questionable studies
were calculated separately from studies that were not
Design of Studies
questionable in this regard. There were 15 comparisons
Studies in the database varied in methodological rigor. It for which no information about pretests was provided
is important to rule out the possibility that the positive and the groups were not randomly assigned. The mean
effects of phonics instruction detected in the meta- effect size was d = 0.49. There were ten studies that
analysis arose from poorly designed studies. Three reported pretest differences and did not use random
features of the studies were coded and analyzed to assignment. The mean effect size in this case was d =
determine whether more rigorous designs yielded larger 0.37. When studies containing potential or actual pretest
or smaller effect sizes: assignment of participants to differences were removed from the dataset, effect
treatment and control groups, potential presence of pre- sizes changed very little and in fact increased slightly,
experimental differences between groups, and sample from d = 0.44 to d = 0.46. These findings indicate that
size. pretreatment differences between experimental and
control groups did not explain why phonics-trained
Random Assignment groups outperformed control groups on outcome
Experimental designs that randomly assign students to measures across studies. It was the phonics instruction
treatment and control groups have stronger internal itself that very likely produced the greater gains in
validity than designs that assign already existing groups reading.
to the treatment and control conditions. The latter
procedure is referred to as nonequivalent group Sample Size
assignment. The goal of experiments is to provide solid Another factor indexing the rigor of studies and the
evidence that the treatment or lack of it, rather than reliability of outcomes is sample size, with results of
anything else, explains gains observed in performance larger studies producing stronger results than smaller
following the treatment. Random assignment serves to studies. The number of students participating in
reduce the likelihood that pre-experimental differences, comparisons included in the database varied from 20 to
rather than treatment effects, explain differences 320. Sample sizes were used to group the comparisons
between treatment and control groups on outcome into quartiles, and effect sizes were calculated for each
measures. When nonequivalent groups are used, quartile. From Table 3, it is apparent that effect sizes
statistical techniques can be applied to eliminate pretest were very similar across quartiles and were all
differences between groups when outcome measures statistically greater than zero. The largest effect size, d
are analyzed. However, this is not as satisfactory a = 0.49, emerged in studies having the largest samples.
solution as random assignment. These findings show that the positive effects of
systematic phonics instruction were not limited to
Most studies in the database provided information studies that produced effects with relatively few
regarding how students were assigned to treatment and students.
control groups. If this was not mentioned, then the study
was considered to have used nonequivalent groups. Discussion
Table 3 (Appendix E) shows that studies using random
assignment and studies using nonequivalent groups Findings of the meta-analysis allow us to conclude that
yielded very similar effect sizes, both of which were systematic phonics instruction produces gains in reading
statistically greater than zero. These findings confirm and spelling not only in the early grades (kindergarten
that the positive effects of systematic phonics and 1st grades) but also in the later grades (2nd through
instruction did not arise primarily from studies with 6th grades) and among children having difficulty
weaker nonequivalent group designs. learning to read. Effect sizes in the early grades were
significantly larger (d = 0.55) than effect sizes above comprehending and thinking about the meaning of what
1st grade (d = 0.27). These results support Chall’s is being read. This may be another reason why effect
(1967) assertion that early instruction in systematic sizes on text comprehension were smaller than effect
phonics is especially beneficial to growth in reading. sizes on word reading.
Although there was some thought that kindergartners In the present analysis, systematic phonics instruction
might not be ready for phonics instruction because they exerted a lower than expected impact on reading
first need to acquire extensive knowledge about how growth in low achieving readers (d = 0.15) and disabled
print works (e.g., Stahl & Miller, 1989; Chall, 1996a, b), readers (d = 0.32). The Panel’s meta-analysis of
findings did not support this possibility. Phonics phonemic awareness training studies included
instruction produced similar effect sizes in kindergarten comparisons involving poor readers. Most of these
(d = 0.58) and 1st grade (d = 0.54). studies would qualify as phonics studies because letter-
sound manipulations were part of the phonemic
Phonics instruction can be described in terms of the
awareness training. The studies were not included in
method used to teach children about letter-sound
the phonics database in order to avoid duplication of
relations and how to use letter-sounds to read or spell.
studies across meta-analyses. The effect size on
There are synthetic, analytic, analogy, spelling-based,
reading outcomes in the PA meta-analysis involving
and embedded approaches to teaching phonics. Phonics
poor readers was d = 0.45, a value quite a bit higher
instruction can also be described in terms of the content
than the effect sizes produced by phonics instruction. It
covered, for example, short vowels, long vowels,
may be that including more phonemic awareness
digraphs, phonics generalizations, onsets and rimes,
training with letters might improve the quality of phonics
phonograms, and so forth. In the present meta-analysis,
instruction given to poor readers. However, there may
only the types of methods were compared in terms of
be other factors that explain the difference as well.
the effect sizes produced, and no significant differences
Closer scrutiny of the two sets of studies is needed to
among methods were detected.
identify possible reasons. For example, RD students in
Stahl et al. (1998) suggest that the benefits of phonics the phonics analysis may have been older than students
instruction and differences among phonics approaches in the PA analysis.
may arise from the amount of content covered and
The overall effect size of systematic phonics instruction
learned by students rather than from properties
in 1st grade was d = 0.54. Although moderate in size,
distinguishing the various methods. Synthetic methods
this value is somewhat low when compared to effect
tend to be efficient in covering content and tend to
sizes found in other similar reviews. Stahl and Miller
cover an ambitious number of sound-symbol
(1989) conducted a meta-analysis of phonics instruction
correspondences in the 1st grade year. Other
and drew their comparisons from the Cooperative First
approaches vary considerably in the amount that they
Grade Studies (Bond & Dykstra, 1967, 1998) whose
cover. To understand phonics instruction and its effects
participants should be similar to 1st graders in the
on student learning, research is needed to study
present database. Stahl and Miller found effect sizes of
separately the effects of teaching methods from the
0.91 on the Stanford Word Reading subtest and 0.36 on
effects of content coverage. Systematic phonics
the Paragraph Meaning subtest for children who
instruction is focused on teaching children the
received phonics instruction similar to that studied here.
alphabetic system and explicitly how to apply it to read
Overall, these are higher effect sizes than those
and spell words. Phonics skills would be expected to
detected in the present meta-analysis.
show effects on text comprehension to the extent that
phonics skills help children read the words in texts. This The discrepancy may arise from differences in the way
is one reason why phonics instruction may have exerted the Panel created its database. Whereas the Panel’s
less impact on text comprehension outcomes than on review was limited to studies published in peer-
word reading outcomes, because the impact is indirect. reviewed journals, authors of the previous meta-
In addition, although phonics programs do give children analyses made a great effort to find “fugitive” or
practice reading connected text, the purpose of this unpublished studies to include. One reason to search
practice is centered on word recognition rather than on widely for studies is that the publishing process tends to
screen out studies reporting null effects, and this runs assess outcomes of instructional treatments but also to
the risk of biasing the data set towards positive effects. document the nature of the instruction received by
However, such a bias would be expected to favor a treatment and control groups to verify whether and how
larger effect size using National Reading Panel they actually differed.
procedures, and this did not happen. Another possible
reason for the discrepancy is that the previous analyses Studies to Illustrate Systematic Phonics
included unpublished studies, thus running the risk of Instruction and Its Contribution to Growth
admitting studies of poor quality with inflated effect in Reading
sizes. Limiting studies to those passing the test of peer Some of the studies in the database are described to
review minimizes this risk. provide a glimpse of the experiments contributing effect
Another possible explanation for the Panel’s smaller sizes and to portray various types of phonics instruction
effect size is that the database involved more recent that were examined.
studies. There may have been more of a tendency for
later studies to focus on at-risk, low-achieving, and Phonics Instruction in Kindergarten
disabled readers for whom growth in reading may be Systematic phonics instruction in kindergarten was
harder to achieve. Perhaps the reading instruction studied in six articles. The main goals included teaching
experienced by students in control groups included more children the shapes of letters and their sounds, how to
phonics than the reading instruction received by control analyze sounds in words (phonemic awareness), and
groups in earlier years. In the 1960s, basal readers used how to use letter sounds to perform various reading or
a whole word methodology whereas the control writing tasks appropriate for children just starting out. In
conditions in more recent studies are presumably more the study by Stuart (1999), three kindergarten teachers
eclectic. Table 2 identifies the control groups used by utilized the Jolly Phonics program (Lloyd, 1993), and
studies in the corpus. Whereas some groups were true three teachers centered their instruction around
“no-phonics” controls, other groups received some Holdaway’s (1979) Big Book approach. Teachers
phonics instruction. It may be that, instead of examining taught these programs 1 hour per day for 12 weeks
the difference between phonics instruction and no during the latter half of kindergarten.
phonics instruction, a substantial number of studies Big Book instruction included work with letters.
actually compared more systematic phonics instruction Teachers drew children’s attention to written words in
to less phonics instruction. This would produce smaller the books and they talked about letters in words. Also,
differences between treatment and control groups and teachers employed various “imaginative and fun
hence smaller effect sizes. activities” to help children learn letters and their sounds.
In one of the studies in the database, Evans and Carr However, the instruction was not systematic; the
(1985) conducted extensive observations of the sequence of teaching letters was not prescribed, and no
instruction received by treatment and control groups special system for remembering letter-sound relations
and reported their observations numerically. They found was taught.
that the phonics classes spent 13.38% of the group time The Jolly Phonics program was more systematic and
and 11.94% of independent work time on word analysis, prescribed in its teaching of letters. This program was
whereas the control group spent 5.37% of the group developed by Lloyd (1993), a teacher, for 4- and 5-
time and 1.84% of the independent time on word year-olds in their first year of schooling in the United
analysis. Although there is a difference favoring the Kingdom. Central to the program is the use of
phonics group, the finding shows that control classes did meaningful stories, pictures, and actions to reinforce
spend some time on word analysis as well. Chall and recognition and recall of letter-sound relationships, and
Feldmann (1966) found that there was considerable precise articulation of phonemes. There are five key
variation in instruction, even in classes professing to be elements to the program: (1) learning the letter sounds,
using the same methods. This underscores the (2) learning letter formation, (3) blending for reading,
importance of researchers taking steps not only to (4) identifying the sounds in words for writing, and (5)
tricky words that are high frequency and irregularly
spelled. The program includes activities and instruction letters and have names prompting the relevant sound,
specifically designed to address those skills most for example, Sammy Snake, Hairy Hat Man, Fireman
needed in the development of early literacy. Unlike Fred, Annie Apple. The task of learning the shapes and
many older phonics approaches, however, Jolly Phonics sounds of all the alphabet letters is difficult and time-
promotes playful, creative, flexible teaching that fits consuming, particularly for children who come to school
well with whole language practice and leads directly to knowing none. The relations are arbitrary and
authentic reading and writing. meaningless. Techniques to speed up the learning
process are valuable in helping kindergartners prepare
At the end of training in either Jolly Phonics or Big
for formal reading instruction.
Books, children were given various tests to compare
effects of the programs. Results showed that Jolly The motivational value of associating letters with
Phonics at-risk kindergartners were able to read interesting characters or hand motions and incorporating
significantly more words and pseudowords and to write this into activities and games that are fun is important
more words than the Big Book group. The overall for promoting young children’s learning. If the task of
effect size was d = 0.73. A year later, the children were teaching letters is stripped bare to one of memorizing
retested. The Jolly Phonics group outperformed the letter shapes and sounds, children will become bored
control group in reading and spelling words but not in and easily distracted and will take much longer to learn
reading comprehension. These results show that the associations.
phonics instruction in kindergarten is effective in
boosting children’s progress in learning to read and A Developmental Approach to Phonics
write words. Instruction in Kindergarten
Another phonics program for kindergartners was
One interesting feature of the Jolly Phonics program is studied by Vandervelden and Siegel (1997). The
that children are taught hand gestures to help them interesting feature of their approach was to tailor the
remember the letter-sound associations. For example, intervention to individual children’s level of knowledge.
they make their fingers crawl up their arm portraying an This is important because kindergartners vary greatly in
ant as they chant the initial sound of “ant” associated how much they already know about letters when they
with the letter a. The value of mnemonics for teaching enter school. The instruction lasted 12 weeks, with
letter-sound relations to kindergartners is supported by children receiving two sessions per week. There were
evidence. In a study by Ehri, Deffner, and Wilce (1984), 15 children that received phonics instruction and 15 that
children were shown letters drawn to assume the shape received the same instructional format but focused on
of a familiar object, for example, s drawn as a snake, h classroom activities and materials. Children were
drawn as a house (with a chimney). Memory for the pretested. The three children who showed the least
letter-sound relations was mediated by the name of the knowledge received one-on-one tutoring, the next eight
object. Children were taught to look at the letter, be lowest scoring children were instructed in pairs, and the
reminded of the object, say its name, and isolate the four highest scoring children worked in a small group.
first sound of the name to identify the sound (i.e., s -
snake - /s/). With practice they were able to look at the The skills taught to phonics-treated children who lacked
letters and promptly say their sounds. Children who them included the following: learning sounds for
were taught letters in this way learned them better than consonant letters; use of initial letter-sound matches to
children who were taught letters by rehearsing the recognize, spell, and read words; segmenting words into
relations with pictures unrelated to the letter shapes sounds and spelling the sounds; orally reading text
(e.g., house drawn with a flat roof and no chimney) and containing the words learned in this way; learning
also better than children who simply rehearsed the correct spellings of words by analyzing letter-sound
associations without any pictures. constituents; and use of rime analogy in reading and
spelling words. Easier skills were taught before harder
Application of this principle can be found in Letterland skills. Instruction began at levels appropriate for
(Wendon, 1992), a program that teaches kindergartners individual learners.
letter-sound associations. In this program, all the letters
are animate characters that assume the shape of the
In the control group, children engaged in activities used knew on average only two letter sounds and could not
in their classrooms. This included letter learning and yet write their names. Thus, the participants were
phonemic awareness. However, children were not starting from zero in their alphabetic learning. By the
explicitly guided in the use of these skills to read and end of kindergarten, children knew on average 19 letter
write. names and 13 letter-sounds, indicating that substantial
learning had occurred.
Results showed that the phonics groups outperformed
the control group on tests of phonemic awareness and At the beginning of 1st grade, there was still wide
letter-sound relations but not letter names. Also, the variation in children’s letter knowledge and phonemic
phonics group did better on tests of speech-print awareness. This underscores the fact that even though
matching of words and pseudowords (e.g., which children receive the same instruction, they still differ in
written word, milk, monk, or mask says “mask”), on how quickly they learn what they are taught. To
tests of writing the sounds in words, and on some but address the variation, children were assigned to ability
not all measures of word reading. The overall effect groups. The core of the reading program involved daily,
size was d = 0.47. It is important to recognize, however, 30-minute lessons consisting of five steps that
that these kindergartners were still at a rudimentary emphasized the alphabetic code:
level in their development as readers. For example, at
1. Teaching new sound-symbol correspondences with
the end of the treatment, they were able to match 43%
vowels highlighted in red
of the written and spoken words correctly; they read
only a mean of 10 out of 60 high frequency words such 2. Teaching phoneme analysis and blending
as up, yes, and book, and they spelled only 46% of the
sounds in words. This suggests that teaching students to 3. Reading regularly spelled, irregularly spelled, and
use phonics skills to read and spell words at the high-frequency words on flash cards to develop
kindergarten level may yield only limited success. automaticity
However, perhaps this program was not optimally 4. Reading text containing phonetically controlled
designed or did not last long enough. words
A 2.5-Y ear Phonics Pr
2.5-Year ogram Beginning With
Program 5. Writing four to six words and a sentence to
Phonemic Awar eness
wareness dictation.
A lengthier, more comprehensive program lasting more By the end of the program, children had been
than 2 years was studied by Blachman et al. (1999). introduced to all six syllable types: closed (fat), final E
Classroom teachers used the program with low SES, (cake), open (me), vowel team (pain), vowel + r (burn),
inner-city children. Instruction began in kindergarten and consonant le (table). Vocabulary development and
with a focus on phonemic awareness training lasting 11 work on reading comprehension was incorporated as
weeks. In 1st grade, explicit, systematic instruction in well, with more time spent reading text as the year
the alphabetic code was taught. This instruction progressed and children’s reading vocabulary grew.
continued in 2nd grade for children who did not
complete the program in 1st grade. Control children Inservice workshops held once a month were used to
participated in the school’s regular basal reading instruct teachers how to implement the program. The
program that included a phonics workbook that children instruction presented information about how children
used independently. acquire literacy skills and the role of phonological
processes in learning to read. Teachers learned how to
The phonemic awareness instruction taught children to provide explicit instruction in the alphabetic code. The
perform a “say it and move it” procedure in which they issue of pacing was stressed. Developing students’
moved a disk down a page as they pronounced each phonemic awareness, letter-sound knowledge, and word
phoneme in a word. They practiced segmenting two- recognition skills was identified as being more important
and three-phoneme words in this way. Then a limited than “covering the material.”
set of eight letter-sound relations was taught, and
children moved the letters rather than the disks. It is
noteworthy that when children began this program, they
To assess how far children had progressed in their An Intensive 3-Y ear Tutoring Pr
3-Year ogram: Synthetic
Program:
reading and writing, various tests were given at the end vs. Embedded Phonics Instruction
of kindergarten, 1st grade, and 2nd grade. Results Another study in the database, by Torgesen et al.
showed that kindergartners receiving PA training (1999), also provided phonics instruction throughout the
outperformed control students, with d = 0.72. At the primary grades. In this study, two different forms of
end of 1st grade, children who received explicit phonics phonics instruction were compared, one which provided
training achieved significantly higher scores than very explicit and intensive instruction in PA and
controls, with d = 0.64. During 2nd grade, children in phonemic decoding called PASP (phonological
the phonics group who had not met the program’s goals awareness plus synthetic phonics), while the other
received additional instruction while the rest received provided systematic but less explicit instruction in
regular classroom instruction. On posttests at the end of phonemic decoding in the context of more instruction
the year, the phonics-trained group continued to and practice in text comprehension, called EP
outperform the control group, with d = 0.36. (embedded phonics). Instruction was provided by tutors
rather than classroom teachers. Kindergarten children
These findings show that the explicit systematic
with poor PA and letter knowledge received 88 hours of
instruction in phonics provided by the Blachman
tutoring over 2.5 years, with sessions lasting 20 minutes
program improved low SES children’s ability to read
and scheduled four times per week. Instruction was
words more than a basal program less focused on
individually paced according to the progress that
teaching children alphabetic knowledge and word
children made. This instruction was added to the
reading skills. Several features of this program are
reading instruction they received in the classroom.
noteworthy and may underlie its effectiveness. The
There were two control groups, one that received
same program continued over three grades, thus
tutoring that supported regular classroom instruction,
insuring consistency and continuity in children’s learning
and one in which children received only regular
the alphabetic system and how to use it to read and
classroom instruction. Instruction in the tutoring control
spell. The program began in kindergarten with
condition included some phonics oriented activities.
alphabetic code instruction that was appropriate for
There were 180 children from 13 schools. Children
children’s level of knowledge. They were taught
were randomly assigned to one of the four conditions.
phonemic awareness and a limited set of letter-sound
relations which they used to make and break words. The PASP children received the Auditory
Both PA and letter knowledge are known to be the Discrimination in Depth program (Lindamood &
strongest predictors of how well children will succeed in Lindamood, 1984). This program began by teaching
learning to read. Delivery of instruction was tailored to children phonemic awareness in a unique way. Children
enable all students to complete the program. Tests were were led to discover and label the articulatory gestures
given to assess children’s progress and to distinguish associated with each phoneme by analyzing their own
those children who needed further instruction from mouth movements as they produced speech. For
those who did not. Instruction in the alphabetic code example, children learned that the word beat consists of
included various kinds of reading and writing skills, not a lip popper, a smile sound, and a tongue tapper.
only sounding out and blending words but also building Children learned to track the sounds in words with
memory for words, spelling words, and reading words in mouth pictures as well as colored blocks and letters.
text. An extensive set of letter-sound relations including Most of the time in this program was spent building
vowels was taught and applied to various types of children’s PA and their decoding skills although some
words organized by syllable structure. Teachers were attention was given to the recognition of high frequency
provided with inservice workshops during the school words, text reading, and comprehension.
year to help them not only provide instruction correctly
but also to understand the reading processes and their The EP program began by teaching children to
course of acquisition in students. These properties of recognize whole words. Instruction in letter-sounds
the Blachman phonics program may account for its occurred in the context of learning to read words from
effectiveness. Further research to examine the memory (by sight). Also, children wrote sentences and
contribution of such properties is needed. read what they wrote. In this context, phonemic
awareness was taught by having children segment the compared to the no treatment control group, while the
sounds in words before writing them. When children explicit PASP group did. Based on comparisons to the
had sufficient reading vocabulary, they began reading classroom control group, effect sizes for the two
short stories to build their reading vocabulary further. phonics groups were
The emphasis was on acquiring word level reading
• PASP: d = 0.33 (kindergarten), 0.75 (1st grade),
skills, including sight words and phonemic decoding
0.67 (2nd grade)
skills. Also, attention was given to constructing the
meanings of stories that were read. • EP: d = 0.32 (kindergarten), 0.28 (1st grade), 0.17
(2nd grade).
One step taken in the Torgesen et al. study was to
videotape 25% of the PASP and EP tutorial sessions Clearly, effects of synthetic phonics instruction
and analyze the interaction to verify how phonics persisted more strongly over the grades than effects of
instruction differed in the two programs. The embedded phonics instruction. Left unclear is whether
percentages of time spent on the following types of PASP’s effectiveness resulted from the greater time
activity were spent teaching alphabetic and phonological processes,
or the specific content of the instruction, or a
• PA, letter-sounds, phonemic reading/writing of combination of both factors.
words: 74% (PASP) vs. 26% (EP)
Although the comparisons between individual groups
• Sight word instruction: 6% (PASP) vs. 17% (EP) were not significant for the comprehension measures,
• Reading/writing connected text: 20% (PASP) when the outcomes for the PASP group were
vs. 57% (EP). compared to those of the EP and RCS groups
In comparing the groups’ performance on outcomes combined, the effect size for the passage
measures across the grades, Torgesen et al. found that comprehension test of the Woodcock Reading Mastery
the PASP group read significantly more real words and Test-Revised was 0.43. The corresponding effect size
nonwords and spelled more words than one or both of for the comprehension measure for the Gray Oral
the control groups. However, the EP group did not Reading Test–3 was 0.21. While reading
outperform the control groups on any of the measures. comprehension depends upon other processes besides
There was a significant overall effect of interventions word reading, one would expect to see transfer,
on the comprehension measures, but individual contrasts particularly in the primary grades where text reading is
between groups were not statistically significant. heavily influenced by word recognition skills. One
Comparison of the PASP and EP groups revealed possible explanation is that the tests of comprehension
superior performance by PASP on measures of were standardized and hence were not sufficiently
phonological awareness, phonemic decoding accuracy sensitive to detect small within-grade differences. This
and efficiency, and word reading accuracy. However, is because standardized tests are designed to detect
the groups did not differ in word reading efficiency differences across the whole range of grades; so, there
(taking account of speed as well as accuracy) or in the are only a small number of items at each grade level.
individual contrasts for reading comprehension. Thus, Another possibility is that compensatory processes are
findings revealed that intensive training in phonics sufficiently strong to dilute the contribution that superior
produced superior word reading skills compared to word recognition skill makes to text reading. That is,
embedded phonics training or training given to control children read and comprehend text by utilizing their
groups. Interestingly, neither of the two instructional linguistic and background knowledge combined with
control groups, embedded phonics or supported their word reading skill. When word reading skill is
classroom instruction, produced significant effects somewhat weaker, children can rely more heavily on
their knowledge about the subject and memory for what
they have read to still make sense of the text.
Tunmer and Hoover (1993) performed a similar study in requires one-on-one tutoring delivered in schools by a
which the letter segment of the RR lesson was replaced few highly trained RR teachers, it is expensive; so, a
by more systematic phonics instruction. Children were savings in time can mean either that more students are
taught to make, break, and build new words that had helped or that fewer teachers are required.
similar letters and sounds. Instruction began by focusing
A third study in the database also modified the RR
on phonograms or rime spellings in words (e.g., make,
format to include more systematic phonics instruction.
bake, cake, take). A metacognitive strategy training
In the study by Santa and Hoien (1999), at-risk 1st
approach rather than a skill and drill approach was used
graders received tutoring that involved story reading,
to make children aware of how letters and sounds work
writing, and phonological skills based on a program
in words and how to use their alphabetic knowledge to
developed by Morris (1992). The unique part of this
read and spell.
phonics program was that it used word study activities
Two control groups were included in the study. One to develop phonological awareness and decoding skill.
group received unmodified RR lessons. The other group Word study consumed 5 to 6 minutes of the 30-minute
received the standard treatment given to poor readers lesson. Children were given cards to sort into
by the school district. This was a pull-out program in categories. They might sort picture cards that shared
which teachers worked with children in small groups. the same initial sounds, or word cards sharing the same
Some word analysis activities were included. The vowel sounds. The typical sort involved three patterns
children were all 1st graders in their 2nd year of reading with four words in each pattern. Initially, children
instruction. They were the poorest readers in their worked with phonograms (e.g., -at in hat, cat, sat, rat)
class. Posttests were given when RR children achieved and then advanced to shared phonemes as the basis for
the goals of the program. Results showed that the sorting words. Children also were taught to spell by
modified RR group outperformed the group receiving writing letters for the sounds heard in words.
the standard small-group instruction on all measures. Metacognitive strategies were taught including an
The overall effect size was d = 3.71, indicating that the analogy strategy in which children were urged to use
modified RR phonics program produced an enormous words they know to read words they don’t know.
advantage over the treatment received by the standard
The control group received small group, guided reading
control group.
instruction. They practiced reading and rereading books
In contrast, the modified RR group performed very in 30-minute lessons but did not receive any word study
similarly to the unmodified RR group on the reading activities. Results showed that the word study program
measures following training. The only difference was produced much greater growth in reading than the
that it took significantly fewer sessions for the modified guided reading program, d = 0.76. Gains were greater in
RR group to achieve the goals of RR than the reading comprehension as well as word reading. These
unmodified RR group. The effect size showing the findings provide evidence for the effectiveness of
advantage in reduced time was d = 1.40. The same teaching children phonics through the use of larger units
advantage in time, but not in reading outcomes, was along with phonemes.
uncovered by Iversen and Tunmer (1993) who
conducted a very similar study. (The Iversen and Systematic Phonics to Remediate the Reading
Tunmer data were included in the Panel’s meta-analysis Difficulties of Disabled Readers
of phonemic awareness instruction.) Findings of both Children who have been diagnosed as reading disabled
studies show that Clay’s Reading Recovery© program have severe reading difficulties that are not explained
produced the same growth in reading even though it by low intelligence. Systematic phonics programs have
provided less systematic phonics instruction than the been developed to remediate their reading difficulties.
modified program and provided it mainly through writing RD children have special problems in acquiring word
exercises rather than decoding activities. Although reading skills. Not only do they struggle to read
reading outcomes were the same, the fact that one pseudowords, but they also have trouble remembering
program took less time makes the more intensive how to read words they have read before.
phonics approach preferable. Because the RR program
Maureen Lovett and her associates (Lovett et al., 1994; The children participating in the studies were referred
Lovett & Steinbach, 1997; Lovett et al., in press) have to Lovett’s clinic because they had severe reading
conducted several studies to examine how to improve problems. Children were randomly assigned to receive
the word reading skills of severely disabled readers. the PHAB program, the WIST program, or a non-
They have explored the effectiveness of two types of reading control program that involved teaching students
phonics programs, a synthetic program they call PHAB academic survival skills such as organization and
and a larger-unit program, which teaches children to problem solving relevant to the classroom. The students
use subparts of words they know to read new words, ranged in age from 6 to 13 years or grades 2nd through
referred to as WIST. 6th. The three programs took the same amount of time.
In one study, it was 35 hours; in another study, 70 hours.
The PHAB synthetic phonics program adopted the
Direct Instruction model developed by Engelmann and To evaluate the effectiveness of the programs,
his colleagues (see Appendix) to remediate the performance of students receiving either PHAB or
decoding and phonemic awareness difficulties of the WIST were compared to performance of the control
disabled readers. Children were taught to segment and group. There were four comparisons assessing effects
blend words orally. They were taught letter-sound of PHAB and four assessing WIST in the database.
associations in the context of word recognition and Although the effect sizes were somewhat variable, the
decoding instruction. The program taught a left-to-right average effect size across the comparisons indicated
decoding strategy to sound out and blend letters into that both programs produced about the same growth in
words. Special marks on letters and words provided reading, d = 0.41 for PHAB and d = 0.48 for WIST. In
visual cues to aid in decoding, such as symbols over two of the comparisons, both reading comprehension
long vowels, letter size variations, and connected letters and word reading were measured. Substantial gains
to identify digraphs. Cumulative, systematic review and were evident on both measures. These findings indicate
many opportunities for overlearning were used. New that the two approaches to teaching systematic phonics,
material was not introduced until the child had fully one teaching synthetic phonics, and one teaching the
mastered previously instructed material. Children were use of larger subunits of words to read by analogy,
taught in small groups. were quite effective in helping disabled readers improve
their reading skills.
The larger-unit, word analogy program called WIST
was adapted from the Benchmark Word Identification/ Conclusions
Vocabulary Development program developed by
Gaskins et al. (1986). This program had a strongly There were 38 studies from which 66 treatment-control
metacognitive focus. It taught children how to use four group comparisons were derived. Although each
metacognitive strategies to decode words: reading comparison could contribute up to six effect sizes, one
words by analogy, detecting parts of words that are per outcome measure, few studies did. The majority
known, varying the pronunciations of vowels to maintain (76%) of the effect sizes involved reading or spelling
flexibility in decoding attempts, and “peeling off” single words, whereas 24% involved text reading. The
prefixes and suffixes in words. Children learned a set of imbalance favoring single words is not surprising given
120 key words exemplifying high-frequency spelling that the focus of phonics instruction is on improving
patterns, five words per day. They learned to segment children’s ability to read and spell words. Studies
the words into subunits so that they could use known limiting instructional attention to children with reading
words and their parts to read other similarly spelled problems accounted for 65% of the comparisons, 38%
words. They learned letter-sound associations for involving poor readers considered “at risk” or low
vowels and affixes. Various types of texts provided achieving and 27% diagnosed as reading disabled (RD).
children with practice applying the strategies that were Studies involving 1st graders were overrepresented in
taught. the database, accounting for 38% of the comparisons.
Fewer kindergartners (12%) and children in 2nd
through 6th grades (23%) were represented. Children in
the RD group spanned several ages and grades, ranging blending of larger subparts of words (i.e., onsets, rimes,
from ages 6 to 13 and grades 2nd through 6th. Most of phonograms, spelling patterns) as well as phonemes;
the studies (72%) were recently conducted, in the past and (3) miscellaneous phonics programs that taught
10 years. phonics systematically but did this in other ways not
covered by the synthetic or larger-unit categories or
Systematic phonics instruction typically involves
were unclear about the nature of the approach. The
explicitly teaching students a prespecified set of letter-
analysis showed that effect sizes for the three
sound relations and having students read text that
categories of programs were all significantly greater
provides practice using these relations to decode words.
than zero and did not differ statistically from each other.
Instruction lacking an emphasis on phonics instruction
The effect size for synthetic programs was d = 0.45;
does not teach letter-sound relations systematically and
for larger-unit programs, d = 0.34; and for
selects text for children according to other principles.
miscellaneous programs, d = 0.27. The conclusion
The latter form of instruction includes whole-word
supported by these findings is that various types of
programs, whole language programs, and some basal
systematic phonics approaches are more effective than
reader programs.
non-phonics approaches in promoting substantial growth
The meta-analyses were conducted to answer several in reading.
questions about the impact of systematic phonics
There were seven programs that were examined in
instruction on growth in reading when compared with
three or more treatment-control group comparisons in
instruction that does not emphasize phonics. Findings
the database. Analysis of the effect sizes produced by
provided strong evidence substantiating the impact of
these programs revealed that all were statistically
systematic phonics instruction on learning to read.
greater than zero and none differed statistically from
1. Does systematic phonics instruction the others in magnitude. Effect sizes ranged from d =
help children learn to read more 0.23 to 0.68. In most cases there were only three or
effectively than unsystematic phonics four comparisons contributing effect sizes, so results
may be unreliable. The conclusion drawn is that specific
instruction or instruction teaching no
systematic phonics programs are all more effective than
phonics?
non-phonics programs and they do not appear to differ
Children’s reading was measured at the end of training significantly from each other in their effectiveness
if it lasted less than a year or at the end of the first although more evidence is needed to verify the
school year of instruction. The mean overall effect size reliability of effect sizes for each program.
produced by phonics instruction was significant and
3. Is phonics taught more effectively when
moderate in size (d = 0.44). Findings provided solid
support for the conclusion that systematic phonics students are tutored individually, when
instruction makes a more significant contribution to they are taught in small groups, or when
children’s growth in reading than do alternative they are taught as classes?
programs providing unsystematic or no phonics All three delivery systems proved to be effective ways
instruction. of teaching phonics, with effect sizes of d = 0.57
2. Are some types of phonics instruction (tutoring), d = 0.43 (small group), and d = 0.39 (whole
more effective than others? Are some class). All effect sizes were statistically greater than
zero, and no one differed significantly from the others.
specific phonics programs more effective
This supports the conclusion that systematic phonics
than others?
instruction is effective when delivered through tutoring,
Three types of phonics programs were compared in the through small groups, and through teaching classes of
analysis: (1) synthetic phonics programs that students.
emphasized teaching students to convert letters
(graphemes) into sounds (phonemes) and then to blend
the sounds to form recognizable words; (2) larger-unit
phonics programs that emphasized the analysis and
4. Is phonics instruction more effective instruction such as poor comprehension, or that there
when it is introduced to students not yet were too few cases (i.e., only eight treatment-control
reading, in kindergarten or 1st grade, comparisons pulled from three studies) to yield reliable
than when it is introduced in grades findings.
above 1st after students have already The conclusion drawn from these findings is that
begun to read? systematic phonics instruction is significantly more
effective than non-phonics instruction in helping to
Phonics instruction taught early proved much more
prevent reading difficulties among at-risk students and
effective than phonics instruction introduced after 1st
in helping to remediate reading difficulties in disabled
grade. Mean effect sizes were kindergarten d = 0.56;
readers. No conclusion is drawn in the case of low-
1st grade d = 0.54; and 2nd through 6th grades d =
achieving readers because it is unclear why systematic
0.27. The conclusion drawn is that systematic phonics
phonics instruction produced little growth in their
instruction produces the biggest impact on growth in
reading and whether the finding is even reliable. Further
reading when it begins in kindergarten or 1st grade
research is needed to determine what constitutes
before children have learned to read independently. To
adequate remedial instruction for low-achieving
be effective, phonics instruction introduced in
readers.
kindergarten must be appropriately designed for
learners and must begin with foundational knowledge 6. Does systematic phonics instruction
involving letters and phonemic awareness. improve children’s reading
5. Is phonics instruction beneficial for comprehension ability as well as their
children who are having difficulty learning decoding and word-reading skills?
to read? Is it effective in preventing Systematic phonics instruction was most effective in
reading failure among children who are improving children’s ability to decode regularly spelled
at risk for developing reading problems in words (d = 0.67) and pseudowords (d = 0.60). This was
the future? Is it effective in remediating expected because the central focus of phonics
reading difficulties in children who have programs is upon teaching children to apply the
been diagnosed as reading disabled and alphabetic system to read novel words. Phonics
programs also produced growth in the ability to read
children who are low-achieving readers?
irregularly spelled words although the effect size was
Phonics instruction produced substantial reading growth significantly lower, d = 0.40. This is not surprising
among younger children at risk of developing future because a decoding strategy is less helpful for reading
reading problems. Effect sizes were d = 0.58 for these words. However, alphabetic knowledge is useful
kindergartners at risk and d = 0.74 for 1st graders at for establishing connections in memory that help
risk. Phonics instruction also improved the reading children read irregular words they have read before.
performance of disabled readers (i.e., children with This may explain the contribution of phonics.
average IQs but poor reading) for whom the effect size
Systematic phonics instruction produced significantly
was d = 0.32. These effect sizes were all statistically
greater growth than non-phonics instruction in younger
greater than zero. However, phonics instruction failed to
children’s reading comprehension ability (d = 0.51).
exert a significant impact on the reading performance
However, the effects of systematic phonics instruction
of low-achieving readers in 2nd through 6th grades (i.e.,
on text comprehension in readers above 1st grade were
children with reading difficulties and possibly other
mixed. Although gains were significant for the subgroup
cognitive difficulties explaining their low achievement).
of disabled readers (d = 0.32), they were not significant
The effect size was d = 0.15, which was not statistically
for the older group in general (d = 0.12).
greater than chance. Possible reasons might be that the
phonics instruction provided to low-achieving readers The conclusion drawn is that growth in word-reading
was not sufficiently intense, that their reading skills is strongly enhanced by systematic phonics
difficulties arose from sources not treated by phonics instruction when compared to non-phonics instruction
for kindergartners and 1st graders as well as for older
struggling readers. Growth in reading comprehension is 9. Does the type of control group used to
also boosted by systematic phonics instruction for evaluate the effectiveness of systematic
younger students and reading disabled students. phonics instruction make a difference?
Whether growth in reading comprehension is produced
generally in students above 1st grade is less clear. The type of nonsystematic or non-phonics instruction
given to control groups to evaluate the effectiveness of
7. Does systematic phonics instruction systematic phonics instruction varied across studies and
have an impact on children’s growth in included the following types: basal programs, regular
spelling? curriculum, whole language approaches, whole word
programs, and miscellaneous programs. The question of
Systematic phonics instruction produced much growth
whether phonics produced better reading growth than
in spelling among the younger students, that is,
each type of control group was answered affirmatively
kindergartners and 1st graders, d = 0.67, but not among
in each case. The effect sizes were all positive favoring
the older students above 1st grade, whose effect size of
systematic phonics, were all statistically greater than
d = 0.09 did not differ from zero. One factor
zero, and ranged from d = 0.31 to 0.51. No single effect
contributing to the difference is that younger children
size differed from any of the others.
were given credit for using phonics-based knowledge to
produce letter-sound spellings of words as well as The conclusion supported by these findings is that the
correct spellings whereas older children were not. effectiveness of systematic phonics instruction found in
Another factor may be that as children move up in the the present meta-analysis did not depend on the type of
grades, remembering how to spell words requires instruction that students in the control groups received.
knowledge of higher level regularities not covered in Students taught systematic phonics outperformed
systematic phonics programs. A third reason for the students who were taught a variety of nonsystematic or
poor showing among older students may be that the non-phonics programs, including basal programs, whole
majority were poor readers who are known to have language approaches, and whole word programs.
difficulty learning to spell.
10. Were studies reporting the largest
The conclusion drawn is that systematic phonics effects of systematic phonics instruction
instruction contributed more than non-phonics well designed or poorly designed
instruction in helping kindergartners and 1st graders experiments? That is, was random
apply their knowledge of the alphabetic system to spell
assignment used? Were the sample sizes
words. However, it did not improve spelling in students
sufficiently large? Might results be
above 1st grade.
explained by differences between
8. Is systematic phonics instruction treatment and control groups that existed
effective with children at different prior to the experiment rather than by
socioeconomic levels? differences produced by the experimental
Systematic phonics instruction helped children at all intervention?
SES levels make greater gains in reading than did non- The effects of systematic phonics instruction were not
phonics instruction. The effect size for low-SES diminished when only the best designed experiments
students was d = 0.66, and for middle-class students it were singled out. The mean effect size for studies using
was d = 0.44. Both were statistically greater than zero random assignment to place students in treatment and
and did not differ from each other. The conclusion control groups, d = 0.45, was essentially the same as
drawn is that systematic phonics instruction is beneficial that for studies employing quasi-experimental designs,
to students regardless of their socioeconomic status. d = 0.43, which utilized existing groups to compare
phonics instruction and non-phonics instruction. The
mean effect size for studies administering systematic
phonics and non-phonics instruction to large samples of
students did not differ from studies using the fewest
students: for studies using between 80 and 320 students, make use of letter-sound information, children need
d = 0.49; for studies using between 20 and 31 students, phonemic awareness. That is, they need to be able to
d = 0.48. There were some studies that did not use blend sounds together to decode words, and they need
random assignment and either failed to address the to break spoken words into their constituent sounds to
issue of pre-existing differences between treatment and write words. Programs that focus too much on the
control groups or mentioned that a difference existed teaching of letter-sounds relations and not enough on
but did not adjust for differences in their analysis of putting them to use are unlikely to be very effective. In
results. The effect sizes changed very little when these implementing systematic phonics instruction, educators
comparisons were removed from the database, from must keep the end in mind and ensure that children
d = 0.44 to d = 0.46. understand the purpose of learning letter-sounds and
are able to apply their skills in their daily reading and
The conclusion drawn is that the significant effects
writing activities.
produced by systematic phonics instruction on children’s
growth in reading were evident in the most rigorously In addition to this general caution, several particular
designed experiments. Significant effects did not arise concerns should be taken into consideration to avoid
primarily from the weakest studies. misapplication of the findings. One concern relates to
the commonly heard call for “intensive, systematic”
11. Is enough known about systematic phonics instruction. Usually the term “intensive” is not
phonics instruction to make defined, so it is not clear how much teaching is required
recommendations for classroom to be considered intensive. Questions needing further
implementation? If so, what cautions answers are: How many months or years should a
should be kept in mind by teachers phonics program continue? If phonics has been taught
implementing phonics instruction? systematically in kindergarten and 1st grade, should it
continue to be emphasized in 2nd grade and beyond?
Findings of the panel regarding the effectiveness of How long should single instructional sessions last? How
systematic phonics instruction were derived from much ground should be covered in a program? That is,
studies conducted in many classrooms with typical how many letter-sound relations should be taught and
classroom teachers and typical American or English- how many different ways of using these relations to
speaking students from a variety of backgrounds and read and write words should be practiced for the
SES levels. Thus, the results of the analysis are benefits of phonics to be maximum? These are among
indicative of what can be accomplished when the many questions that remain for future research.
systematic phonics programs are implemented in
today’s classrooms. Systematic phonics instruction has Second, the role of the teacher needs to be better
been used widely over a long period with positive understood. Some of the phonics programs showing
results. A variety of phonics programs have proven large effect sizes are scripted so that teacher judgment
effective with children of different ages, abilities, and is largely eliminated. Although scripts may standardize
SES backgrounds. These facts should persuade instruction, they may reduce teachers’ interest in the
educators and the public that systematic phonics teaching process or their motivation to teach phonics.
instruction is a valuable part of a successful classroom Thus, one concern is how to maintain consistency of
reading program. The Panel’s findings summarized instruction and at the same time encourage unique
above serve to illuminate the conditions that make contributions from teachers. Another concern involves
systematic phonics instruction especially effective. what teachers need to know. Some systematic phonics
However, caution is needed in giving a blanket programs require a sophisticated understanding of
endorsement to all kinds of phonics instruction. spelling, structural linguistics, and word etymology.
Teachers who are handed the programs but are not
It is important to recognize that the goals of phonics provided with sufficient inservice training to use these
instruction are to provide children with some key programs effectively may become frustrated. In view
knowledge and skills and to ensure that they know how of the evidence showing the effectiveness of systematic
to apply this knowledge in their reading and writing. phonics instruction, it is important to ensure that the
Phonics teaching is a means to an end. To be able to issue of how best to prepare teachers to carry out this
teaching effectively and creatively is given high priority. Directions for Further Research
Knowing that all phonics programs are not the same
brings with it the implication that teachers must Although phonics instruction has been the subject of a
themselves be educated about how to evaluate different great deal of study, there are certain extremely
programs and to determine which are based on strong important topics that have received little or no research
evidence and how they can most effectively use these attention, and there are other topics that, although
programs in their own classrooms. previously studied, require further research to refine our
understanding.
As with any instructional program, there is always the
question: “Does one size fit all?” Teachers may be Neglected Topics
expected to use a particular phonics program with their
Three important but neglected questions are prime
class, yet it quickly becomes apparent that the program
candidates for research:
suits some students more than others. In the early
grades, children are known to vary greatly in the skills (1) What are the “active ingredients” in effective
they bring to school. There will be some children who systematic phonics programs? (2) Is phonics instruction
already know most letter-sound correspondences, some improved when motivational factors are taken into
children who can even decode words, and others who account—not only learners’ motivation to learn but also
have little or no letter knowledge. Should teachers teachers’ motivation to teach? (3) How does the use of
proceed through the program and ignore these decodable text as early reading material contribute to
students? Or should they assess their students’ needs the effectiveness of phonics programs?
and select the types and amounts of phonics suited to
those needs? Although the latter is clearly preferable, 1. Active Ingredients
this requires phonics programs that provide guidance in Systematic phonics programs—even those of the same
how to place students into flexible instructional groups type, such as synthetic phonics programs—vary in
and how to pace instruction. However, it is common for many respects, as indicated in the Panel’s report above.
many phonics programs to present a fixed sequence of It is important to determine whether some properties
lessons scheduled from the beginning to the end of the are essential and others are not. Because instructional
school year. time during the school day is limited, teachers and
publishers of beginning reading programs need to know
Finally, it is important to emphasize that systematic which ingredients of phonics programs yield the most
phonics instruction should be integrated with other benefit. One example of this line of questions involves
reading instruction to create a balanced reading the content covered. It is clear that the major letter-
program. Phonics instruction is never a total reading sound correspondences, including short and long vowels
program. In 1st grade, teachers can provide controlled and digraphs, need to be taught. However, there are
vocabulary texts that allow students to practice other regularities of English as well. How far should
decoding, and they can also read quality literature to instruction extend in teaching all of these potential
students to build a sense of story and to develop regularities explicitly? Should children be taught to state
vocabulary and comprehension. Phonics should not regularities, or should emphasis be placed on application
become the dominant component in a reading program, in reading and writing activities? To what extent do
neither in the amount of time devoted to it nor in the mnemonic devices such as those used in Jolly Phonics
significance attached. It is important to evaluate (Lloyd, 1993) and Letterland (Wendon, 1992) speed up
children’s reading competence in many ways, not only the process of learning letter shapes, sounds, and names
by their phonics skills but also by their interest in books and facilitate their application in reading and writing?
and their ability to understand information that is read to What contribution is made by the inclusion of special
them. By emphasizing all of the processes that markings added to written words to clarify how they
contribute to growth in reading, teachers will have the should be decoded? Research investigating not only
best chance of making every child a reader. these ingredients of phonics programs but other
ingredients as well is needed. These studies should use of decodable books and select the beginning reading
include systematic observation in classrooms to record material on some other basis. Some educators reject
and analyze the activities of teachers and children using decodable books outright as too stilted and boring.
the programs. Surprisingly, very little research has attempted to
determine whether the use of decodable books in
2. Motivation systematic phonics programs has any influence on the
Phonics instruction has often been portrayed as progress that some or all children make in learning to
involving “dull drill” and “meaningless worksheets.” read.
Such characterizations may accurately describe aspects
of some phonics programs, even “effective” ones. Few Other Important Topics
if any studies have investigated the contribution of The findings of the Panel indicated that systematic
motivation to the effectiveness of phonics programs, not phonics instruction provides beginning readers, at-risk
only the learner’s motivation to learn but also the readers, disabled readers, and low-achieving readers
teacher’s motivation to teach. It seems self-evident that with a substantial edge in learning to read over
the specific techniques and activities used to develop alternative forms of instruction not focusing at all or
children’s letter-sound knowledge and its use in reading only incidentally on the alphabetic system. However,
and writing should be as relevant and motivating as studies in the database were insufficient in number or in
possible to engage children’s interest and attention to design to address several important satellite questions
promote optimal learning. Moreover, it seems obvious about the effects of phonics instruction.
that when the teaching techniques presented to
teachers in a phonics program are not only effective but Some programs teach many letter-sound relations
also engaging and enjoyable, teachers will be more before children begin using them while other programs
successful in their ability to deliver phonics instruction introduce a few and then provide reading and writing
effectively. The lack of attention to motivational factors activities that allow children to apply the
by researchers in the design of phonics programs is correspondences they have learned right away. The
potentially very serious because debates about reading latter approach would appear to be preferable, but is it?
instruction often boil down to concerns about the In what ways does earlier application facilitate growth
“relevance” and “interest value” of how something is in reading and writing?
being taught, rather than the specific content of what is Programs differ in how much time is consumed
being taught. Future research on phonics instruction teaching alphabetic knowledge and word-reading skills.
should investigate how best to motivate children in It is unclear how long phonics instruction should
classrooms to learn the letter-sound associations and to continue through the grades. A few studies in the
apply that knowledge to reading and writing. It should Panel’s database indicated that large effect sizes were
also be designed to determine which approaches produced and maintained in the 2nd and 3rd years of
teachers prefer to use and are most likely to use instruction for children who were at risk for future
effectively in their classroom instruction. reading problems and who began receiving systematic
phonics instruction in kindergarten or 1st grade
3. Decodable Text
(Blachman et al., (1999); Brown & Felton, 1990;
Some systematic phonics programs are designed so that
Torgesen et al., 1999). See Table 4 (Appendix E). This
children are taught letter-sound correspondences and
suggests that systematic phonics instruction should
then provided with little books written carefully to
extend from kindergarten to 2nd grade, but the question
contain the letter-sound relations that were taught.
remains whether additional instruction will produce
Some programs begin with a very limited set and
further benefits.
expand these gradually. The intent of providing books
that match children’s letter-sound knowledge is to It will also be critical to objectively determine the ways
enable them to experience success in decoding words in which systematic phonics instruction can be optimally
that follow the patterns they know. The stories in such incorporated and integrated in complete and balanced
books often involve pigs doing jigs and cats in hats. programs of reading instruction. Part of this effort
Other systematic phonics programs make little or no
should be directed at preservice and inservice education When systematic phonics instruction is introduced to
to provide teachers with decisionmaking frameworks to children who have already acquired some reading skill
guide their selection, integration, and implementation of as a result of another program that does not emphasize
phonics instruction within a complete reading program. phonics, one wonders about the impact of attempting to
teach students new strategies when old tricks have
Another line of questions for research centers around
already been learned. Findings of the Panel indicated
older children above 1st grade who have acquired some
that the impact of systematic phonics instruction was
reading ability but are reading substantially below grade
much reduced among children who were introduced to
level. When systematic phonics instruction is introduced
it presumably for the first time in 2nd grade and above.
to these children, do they have difficulty acquiring
(This presumption may not be accurate, however,
alphabetic knowledge and decoding strategies because
because most studies did not state what kind of
they have already learned other ways to process print
instruction children had already experienced.)
that undermine the acquisition and incorporation of
Additional research is needed to study how systematic
these new processes into their reading? If so, perhaps
phonics instruction is received by children who are
special steps are required to address this problem. A
already reading; whether there are sources of conflict;
related question is how can systematic phonics
and, if so how to address them instructionally. A related
instruction be made more effective for low-achieving
question is whether the sequence of instruction makes a
readers who have below-average intelligence as well as
difference. It may be that children do better when a
reading problems. Perhaps instruction in decoding needs
year of systematic phonics instruction precedes a year
to be combined with instruction in reading
of whole language instruction than when the reverse is
comprehension strategies to remediate their reading
the case.
problems.
Anderson, R. C., Hiebert, E. F., Wilkinson, I. A. G., Bruck, M. (1993). Component spelling skills of
& Scott, J. (1985). Becoming a nation of readers. college students with childhood diagnoses of dyslexia.
Champaign, IL: Center for the Study of Reading. Learning Disability Quarterly, 16, 171-184.
Aukerman, R. C. (1971). Approaches to beginning Campbell, D., & Stanley, J. (1966). Experimental
reading. New York: John Wiley & Sons. and quasi-experimental designs for research. Chicago:
Rand McNally.
Aukerman, R. C. (1981). The basal reader
approach to reading. New York: John Wiley & Sons. Carnine, D., & Silbert, J. (1979). Direct instruction
reading. Columbus, OH: Merrill.
Aukerman, R. C. (1984). Approaches to beginning
reading (2nd ed.). New York: John Wiley & Sons. Chall, J. S. (1967). Learning to read: The great
debate (1st ed.). New York, NY: McGraw-Hill.
Balmuth, M. (1982). The roots of phonics: A
historical introduction. New York: McGraw-Hill. Chall, J. S. (1983). Learning to read: The great
debate (2nd ed.). New York, NY: McGraw-Hill.
Barr, R., Kamil, M., Mosenthal, P., & Pearson, P.
(1991). Handbook of reading research (Vol. 2). New Chall, J. (1992). The new reading debates:
York: Longman. Evidence from science, art, and ideology. Teachers
College Record, 94, 315-328.
Baumann, J., Hoffman, J., Moon, J., & Duffy-
Hester, A. (1998). Where are teachers’ voices in the Chall, J. (1996a). Learning to read: The great
phonics/whole language debate? Results from a survey debate (revised, with a new foreword). New York:
of U.S. elementary classroom teachers. The Reading McGraw-Hill.
Teacher, 51, 636-650.
Chall, J. (1996b). Stages of reading development
Bear, D. R., Invernizz, M., Templeton, S., & (2nd ed.). Fort Worth, TX: Harcourt-Brace.
Johnson, F. (2000). Words their way: Word study for
phonics, vocabulary and spelling instruction. Upper Chall, J., & Feldmann., S. (1966). First grade
Saddle River, NJ: Merrill. reading: An analysis of the interactions of professed
methods, teacher implementation, and child background.
Beck, I., & Mitroff, D. (1972). The rationale and The Reading Teacher, 19, 569-575.
design of a primary grades reading system for an
individualized classroom. Pittsburgh: University of Clay, M. M. (1993). Reading recovery: A
Pittsburgh Learning Research and Development guidebook for teachers in training. Portsmouth, NH:
Center. Heinemann.
Blachman, B. (in press). Phonological awareness. Cohen, J. (1988). Statistical power analysis for the
In M. Kamil, P. Mosenthal, P. Pearson, & R. Barr behavior sciences (2nd ed.). Hillsdale, NJ: Erlbaum.
(Eds.), Handbook of reading research (Vol. 3).
Mahwah, NJ: Erlbaum. Cox, A. R. (1991). Structures and techniques:
Multisensory teaching of basic language skills.
Bond, G., & Dykstra, R. (1967/1998). The Cambridge, MA: Educators Publishing Service.
cooperative research program in first grade reading.
Reading Research Quarterly, 2, 5-142.
Dahl, K. L., Sharer, P. L., Lawson, L. L., & Engelmann, S., & Osborn, J. (1987). DISTAR
Grogran, P. R. (1999). Phonics instruction and student Language I. Chicago: Science Research Associates.
achievement in whole language first-grade classrooms.
Reading Research Quarterly, 34, 312-341. Ehri, L. C. (1998). Grapheme-phoneme knowledge
is essential for learning to read words in English. In J.
Dickson, S. (1972a). Sing, spell, read, & write. L. Metsala & L. C. Ehri (Eds.), Word recognition in
Chesapeake, VA: International Learning Systems. beginning literacy (pp. 3-40). Mahwah, NJ: Erlbaum.
Dykstra, R. (1968). The effectiveness of code- and Ehri, L., Deffner, N., & Wilce, L. (1984). Pictorial
meaning-emphasis in beginning reading programs. The mnemonics for phonics. Journal of Educational
Reading Teacher, 22, 17-23. Psychology, 76, 880-893.
Ehri, L. (1991). Development of the ability to read Engelmann, S. (1980). Direct instruction.
words. In R. Barr, M. Kamil, P. Mosenthal, & P. Englewood Cliffs, NJ: Prentice-Hall.
Pearson (Eds.), Handbook of reading research (Vol. 2)
(pp. 383-417). New York: Longman. Engelmann, S., & Bruner, E. (1969). Distar reading
program. Chicago: SRA.
Ehri, L. (1992). Reconceptualizing the development
of sight word reading and its relationship to recoding. In Engelmann, S., & Bruner, E. (1978). DISTAR
P. Gough, L. Ehri, & R. Treiman (Eds.), Reading reading: An instructional system. Chicago: Science
acquisition (pp. 107-143). Hillsdale, NJ: Erlbaum. Research Associates.
Ehri, L. (1994). Development of the ability to read Engelmann, S., & Bruner, E. (1988). Reading
words: Update. In R. Ruddell, M. Ruddell, & H. Singer mastery I/II fast cycle: Teacher’s guide. Chicago:
(Eds.), Theoretical models and processes of reading Science Research Associates, Inc.
(4th ed. pp. 323-358). Newark, DE: International
Reading Association. Engelmann, S., & Osborn, J. (1987). DISTAR
Language I. Chicago: Science Research Associates.
Ehri, L. C. (1998). Grapheme-phoneme knowledge
is essential for learning to read words in English. In J. Ehri, L. C. (1998). Grapheme-phoneme knowledge
L. Metsala & L. C. Ehri (Eds.), Word recognition in is essential for learning to read words in English. In J.
beginning literacy (pp. 3-40). Mahwah, NJ: Erlbaum. L. Metsala & L. C. Ehri (Eds.), Word recognition in
beginning literacy (pp. 3-40). Mahwah, NJ: Erlbaum.
Ehri, L., Deffner, N., & Wilce, L. (1984). Pictorial
mnemonics for phonics. Journal of Educational Ehri, L., Deffner, N., & Wilce, L. (1984). Pictorial
Psychology, 76, 880-893. mnemonics for phonics. Journal of Educational
Psychology, 76, 880-893.
Engelmann, S. (1980). Direct instruction.
Englewood Cliffs, NJ: Prentice-Hall. Engelmann, S. (1980). Direct instruction.
Englewood Cliffs, NJ: Prentice-Hall.
Engelmann, S., & Bruner, E. (1969). Distar reading
program. Chicago: SRA. Engelmann, S., & Bruner, E. (1969). Distar reading
program. Chicago: SRA.
Engelmann, S., & Bruner, E. (1978). DISTAR
reading: An instructional system. Chicago: Science Engelmann, S., & Bruner, E. (1978). DISTAR
Research Associates. reading: An instructional system. Chicago: Science
Research Associates.
Engelmann, S., & Bruner, E. (1988). Reading
mastery I/II fast cycle: Teacher’s guide. Chicago:
Science Research Associates, Inc.
Engelmann, S., & Bruner, E. (1988). Reading disability. Journal of Educational Psychology, 89, 645-
mastery I/II fast cycle: Teacher’s guide. Chicago: 651.
Science Research Associates, Inc.
Grundin, H. U. (1994). If it ain’t whole, it ain’t
Engelmann, S., & Osborn, J. (1987). DISTAR language—or back to the basics of freedom and dignity.
Language I. Chicago: Science Research Associates. In F. Lehr & J. Osborn (Eds.), Reading, language, and
literacy (pp. 77-88). Mahwah, NJ: Erlbaum.
Foorman, B., Francis, D., Fletcher, J.,
Schatschneider, C., & Mehta, P. (1998). The role of Harris, T., & Hodges, R. (Eds.). (1995). The
instruction in learning to read: Preventing reading failure literacy dictionary. Newark, DE: International Reading
in at-risk children. Journal of Educational Psychology, Association.
90, 37-55.
Haskell, D., Foorman, B., & Swank, P. (1992).
Freppon, P. A., & Dahl, K. L. (1991). Learning Effects of three orthographic/phonological units on first-
about phonics in a whole language classroom. grade reading. Remedial and Special Education, 13, 40-
Language Arts, 68, 190-197. 49.
Freppon, P. A., & Headings, L. (1996). Keeping it Holdaway, D. (1979). The foundations of literacy.
whole in whole language: A first grade teacher’s Sydney: Ashton-Scholastic.
instruction in an urban whole language classroom. In E.
McIntyre & M. Pressley (Eds.), Balanced instruction: Hoover, W., & Gough, P. (1990). The simple view
Strategies and skills in whole language (pp. 65-82). of reading. Reading and Writing, 2, 127-160.
Norwood, MA: Christopher-Gordon.
Iversen, S., & Tunmer, W. (1993). Phonological
Gaskins, I., Downer, M., & Gaskins, R. (1986). processing skills and the Reading Recovery Program.
Introduction to the Benchmark school word Journal of Educational Psychology, 85, 112-126.
identification/vocabulary development program. Media,
PA: Benchmark School. Johnson, B. (1989). DSTAT: Software for the meta-
analytic review of research literatures. Hillsdale, NJ:
Gilllingham, A., & Stillman, B. (1940). Remedial Erlbaum.
training for children with specific disability in reading,
spelling, and penmanship. New York, NY: Sackett & Juel, C. (1994). Learning to read and write in one
Williams Lithographing Co. elementary school. New York: Springer-Verlag.
Gillingham, A., & Stillman, B. W. (1979). Remedial Kameenui, E. J., Simmons, D. C., Chard, D., &
training for children with specific disability in reading, Dickson, S. (1997). Direct instruction reading. In S. A.
spelling, and penmanship. Cambridge, MA: Educators Stahl & D. A. Hayes (Eds.), Instructional models in
Publishing Service. reading (pp. 59-84). Mahwah, NJ: Erlbaum.
Goodman, K. S. (1993). Phonics phacts. Klesius, J., Griffith, P., & Zielonka, P. (1991). A
Portsmouth, NH: Heinemann. whole language and traditional instruction comparison:
Overall effectiveness and development of the
Gough, P., & Tunmer, W. (1986). Decoding, alphabetic principle. Reading Research and Instruction,
reading, and reading disability. Remedial and Special 30, 47-61.
Education, 7, 6-10.
Leach, D., & Siddall, S. (1990). Parental
Greaney, K., Tunmer, W., & Chapman, J. (1997). involvement in the teaching of reading: A comparison of
Effects of rime-based orthographic analogy training on hearing reading, paired reading, pause, prompt, praise,
the word recognition skills of children with reading and direct instruction methods. British Journal of
Educational Psychology, 60, 349-355.
Lindamood, C. H., & Lindamood, P. C. (1984). McCracken, G., & Walcutt, C. (1963, 1965, 1966,
Auditory discrimination in depth. Austin, TX: PRO-ED, 1975). Basic reading. Philadelphia, PA: Lippincott.
Inc.
McKenna, M., Stahl, S., & Reinking, D. (1994). A
Lloyd, S. (1993). The phonics handbook. UK: Jolly critical commentary on research, politics, and whole
Learning. language. Journal of Reading Behavior, 26, 211-233.
Levin, H., & Williams, J. (Eds.). (1970). Basic Meyer, L. A. (1983). Increased student
studies on reading. New York: Basic Books. achievement in reading: One district’s strategies.
Research in Rural Education, 1, 47-51.
Lovett, M., Borden, S., DeLuca, T., Lacerenza, L.,
Benson, N., & Brackstone, D. (1994). Treating the core Mills, H., O’Keefe, T., & Stephens, D. (1992).
deficits of developmental dyslexia: Evidence of transfer Looking closely: Exploring the role of phonics in one
of learning after phonologically- and strategy-based whole language classroom. Urbana, IL: National
reading training programs. Developmental Psychology, Council of Teachers of English.
30, 805-822.
Morris, D. (1992). Case studies in teaching
Lovett, M., Lacerenza, L., Borden, S., Frijters, J., beginning readers: The Howard Street tutoring manual.
Steinbach, K., & DePalma, M. (in press). Components Boone, NC: Fieldstream.
of effective remediation for developmental reading
disabilities: Combining phonological and strategy-based Morris, D., & Perney, J. (1984). Developmental
instruction to improve outcomes. Journal of Educational spelling as a predictor of first grade reading
Psychology. achievement. Elementary School Journal, 84, 441-457.
Lovett, M., & Steinbach, K. (1997). The Ogden, S., Hindman, S., & Turner, S. (1989).
effectiveness of remedial programs for reading disabled Multisensory programs in the public schools: A brighter
children of different ages: Does the benefit decrease future for LD children. Annals of Dyslexia, 39, 247-
for older children? Learning Disability Quarterly, 20, 267.
189-210.
Pearson, P., Barr, R., Kamil, M., & Mosenthal, P.
Lovett, M., Warren-Chaplin, P., Ransby, M., & (1984). Handbook of reading research. New York:
Borden, S. (1990). Training the word recognition skills Longman.
of reading disabled children: Treatment and transfer
effects. Journal of Educational Psychology, 82, 769-780. Popp, H. M. (1975). Current practices in the
teaching of beginning reading. In J. B. Carroll & J. S.
Lovitt, T. C., & DeMier, D. M. (1984). An Chall (Eds.), Toward a literate society (pp. 101-146).
evaluation of the Slingerland method with LD New York: McGraw-Hill.
youngsters. Journal of Learning Disabilities, 17, 267-
272. Purves, A. (Ed.). (1994). Encyclopedia of English
studies and language arts. New York: Scholastic.
McCracken, G., & Walcutt, C. (1963, 1965, 1966,
1975). Basic reading. Philadelphia, PA: Lippincott. Rack, J., Snowling, M., & Olson, R. (1992). The
nonword reading deficit in developmental dyslexia: A
McKenna, M., Stahl, S., & Reinking, D. (1994). A review. Reading Research Quarterly, 27, 29-53.
critical commentary on research, politics, and whole
language. Journal of Reading Behavior, 26, 211-233. Routman, R. (1996). Literacy at the crossroads.
Lovitt, T. C., & DeMier, D. M. (1984). An evaluation Portsmouth, NH: Heinemann.
of the Slingerland method with LD youngsters. Journal
of Learning Disabilities, 17, 267-272.
Santa, C., & Hoien, T. (1999). An assessment of of first-grade children: A one-year follow-up. Journal of
early steps: A program for early intervention of reading Reading Behavior, 27, 153-185.
problems. Reading Research Quarterly, 34, 54-79.
Taylor, D. (1998). Beginning to read & the spin
Shankweiler, D., & Liberman, I. Y. (1972). doctors of science: The political campaign to change
Misreading: A search for causes. In J. F. Kavanaugh & America’s mind about how children learn to read.
I. G. Mattingly (Eds.), Language by eye and by ear (pp. Urbana, IL: National Council of Teachers of English.
293-317). Cambridge, MA: MIT Press.
Torgesen, J., Wagner, R., Rashotte, C., Rose, E.,
Share, D. (1995). Phonological recoding and self- Lindamood, P., Conway, T., & Garvan, C. (1999).
teaching: Sine qua non of reading acquisition. Cognition, Preventing reading failure in young children with
55, 151-218. phonological processing disabilities: Group and individual
responses to instruction. Journal of Educational
Share, D., Jorm, A., Maclean, R., & Matthews, R. Psychology, 91, 579-593.
(1984). Sources of individual differences in reading
achievement. Journal of Educational Psychology, 76, Tunmer, W., & Chapman, J. (1998). Language
1309-1324. prediction skill, phonological recoding ability, and
beginning reading. In C. Hulme & R. Joshi (Eds.),
Snider, V. (1990). Direct instruction reading with Reading and spelling: Development and disorders (pp.
average first-graders. Reading Improvement, 27, 143- 33-67). Mahwah, NJ: Erlbaum.
148.
Tunmer, W., & Hoover, W. (1993). Phonological
Stahl, S. A. (1999). Why innovations come and go: recoding skill and beginning reading. Reading and
The case of whole language. Educational Researcher. Writing: An Interdisciplinary Journal, 5, 161-179.
Stahl, S. A., Duffy-Hester, A. M., & Stahl, K. A. Vandervelden, M., & Siegel, L. (1997). Teaching
D. (1998). Everything you wanted to know about phonological processing skills in early literacy: A
phonics (but were afraid to ask). Reading Research developmental approach. Learning Disability Quarterly,
Quarterly, 35, 338-355. 20, 63-81.
Stahl, S. A., & Miller, P. D. (1989). Whole language Vickery, K., Reynolds, V., & Cochran, S. (1987).
and language experience approaches for beginning Multisensory teaching approach for reading, spelling,
reading: A quantitative research synthesis. Review of and handwriting, Orton-Gillingham based curriculum, in
Educational Research, 59 (1), 87-116. a public school setting. Annals of Dyslexia, 37, 189-200.
Appendix A
Studies Included in the Meta-Analysis
Note: Studies were assigned numbers during the 08 Eldredge, L. (1991). An experiment with a
screening process. Numbers missing in the list were modified whole language approach in first-grade
assigned to studies that were rejected for the analysis. classrooms. Reading Research and Instruction, 30, 21-
These are listed separately. 38.
62 Barr, R. (1972). The influence of instructional 09 Evans, M., & Carr, T. (1985). Cognitive
conditions on word recognition errors. Reading abilities, conditions of learning, and the early
Research Quarterly, 7 (3), 509-529. development of reading skill. Reading Research
Quarterly, 20, 327-350.
63 Barr, R. (1974). The effect of instruction on
pupil reading strategies. Reading Research Quarterly, 11 Foorman, B., Francis, D., Fletcher, J.,
10, (4), 555-582. Schatschneider, C., & Mehta, P. (1998). The role of
instruction in learning to read: Preventing reading failure
03 Blachman, B., Tangel, D., Ball, E., Black, R. &
in at-risk children. Journal of Educational Psychology,
McGraw, D. (1999). Developing phonological
90, 37-55.
awareness and word recognition skills: A two-year
intervention with low-income, inner-city children. 12 Foorman, B., Francis, D., Novy, D., &
Reading and Writing: An Interdisciplinary Journal, 11, Liberman, D. (1991). How letter-sound instruction
293-273. mediates progress in first-grade reading and spelling.
Journal of Educational Psychology, 83, 456-469.
61 Blachowicz, C., McCarthy, L., & Ogle, D.
(1979). Testing phonics: A look at children’s response 13 Foorman, B., Francis, D., Winikates, D., Mehta,
biases. Illinois School Research and Development, 16 P., Schatschneider, C., & Fletcher, J. (1997). Early
(1), 1-6. interventions for children with reading disabilities.
Scientific Studies of Reading, 1, 255-276.
04 Bond, C., Ross, S., Smith, L., & Nunnery, J.
(1995-96). The effects of the sing, spell, read, and write 59 Freppon, P. (1991). Children’s concepts of the
program on reading achievement of beginning readers. nature and purpose of reading in different instructional
Reading Research and Instruction, 35, 122-141. settings. Journal of Reading Behavior, 23, 139-163.
05 Brown, I., & Felton, R. (1990). Effects of 15 Fulwiler, G., & Groff, P. (1980). The
instruction on beginning reading skills in children at risk effectiveness of intensive phonics. Reading Horizons,
for reading disability. Reading and Writing: An 21, 50-54.
Interdisciplinary Journal, 2, 223-241.
72 Gersten, R., Darch, C., & Gleason, M. (1988).
64 Carnine, D. (1977). Phonics versus look-say: Effectiveness of a direct instruction academic
Transfer to new words. The Reading Teacher, 30, 636- kindergarten for low-income students. The Elementary
640. School Journal, 89, 227-240.
71 Carnine, D. (1980). Phonic versus whole-word 66 Gersten, R., Keating, T., & Becker, W. (1988).
correction procedures following phonic instruction. The continued impact of the direct instruction model:
Education and Treatment of Children, 3(4), 323-329.
Longitudinal studies of follow through students. 29 Leinhardt, G., & Engel, M. (1981). An iterative
Education and Treatment of Children, 11 (4), 318-327. evaluation of NRS: Ripples in a pond. Evaluation
Review, 5, 579-601.
68 Gillon, G., & Dodd, B. (1997). Enhancing the
phonological processing skills of children with specific 75 Lovett, M., Lacerenza, L., Borden, S., Frijters,
reading disability. European Journal of Disorders of J., Steinbach, K., & DePalma, M. (in press).
Communication, 32, 67-90. Components of effective remediation for developmental
reading disabilities: Combining phonological and
17 Gittelman, R., & Feingold, I. (1983). Children strategy-based instruction to improve outcomes. Journal
with reading disorders—I. Efficacy of reading of Educational Psychology.
remediation. Journal of Child Psychology and
Psychiatry and Allied Disciplines, 24, 167-191. 32 Lovett, R., Ransby, M., Hardwick, N., Johns,
M., & Donaldson, S. (1989). Can dyslexia be treated?
18 Greaney, K., Tunmer, W., & Chapman, J. Treatment-specific and generalized treatment effects in
(1997). Effects of rime-based orthographic analogy dyslexic children’s reponse to remediation. Brain and
training on the word recognition skills of children with Language, 37, 90-121.
reading disability. Journal of Educational Psychology, 89,
645-651. 33 Lovett, M., & Steinbach, K. (1997). The
effectiveness of remedial programs for reading disabled
60 Griffith, P., Klesius, J., & Kromey, J. (1992). children of different ages: Does the benefit decrease
The effect of phonemic awareness on the literacy for older children? Learning Disability Quarterly, 20,
development of first grade children in a traditional or a 189-210.
whole language classroom. Journal of Research in
Childhood Education, 6, 85-92. 34 Lovett, M., Warren-Chaplin, P., Ransby, M., &
Borden, S. (1990). Training the word recognition skills
22 Haskell, D., Foorman, B., & Swank, P. (1992). of reading disabled children: Treatment and transfer
Effects of three orthographic/phonological units on first- effects. Journal of Educational Psychology, 82, 769-780.
grade reading. Remedial and Special Education, 13, 40-
49. 35 Lum, T., & Morton, L. (1984). Direct
instruction in spelling increases gain in spelling and
73 Kameenui, E., Stein, M., Carnine, D., & reading skills. Special Education in Canada, 58, 41-45.
Maggs, A. (1981). Primary level word attack skills
based on isolated word, discrimination list and rule 36 Mantzicopoulos, P., Morrison, D., Stone, E., &
application training. Reading Education, 6 (2), 46-55. Setrakian, W. (1992). Use of the SEARCH/TEACH
tutoring approach with middle-class students at risk for
26 Klesius, J., Griffith, P., & Zielonka, P. (1991). A reading failure. Elementary School Journal, 92, 573-586.
whole language and traditional instruction comparison:
Overall effectiveness and development of the 37 Marston, D., Deno, S., Kim, D., Diment, K., &
alphabetic principle. Reading Research and Instruction, Rogers, D. (1995). Comparison of reading intervention
30, 47-61. approaches for students with mild disabilities.
Exceptional Children, 62, 20-37.
28 Leach, D., & Siddall, S. (1990). Parental
involvement in the teaching of reading: A comparison of 38 Martinussen, R., & Kirby, J. (1998). Instruction
hearing reading, paired reading, pause, prompt, praise, in successive and phonological processing to improve
and direct instruction methods. British Journal of the reading acquisition of at-risk kindergarten children.
Educational Psychology, 60, 349-355. Developmental Disabilities Bulletin, 26, 19-39.
41 Oakland, T., Black, J., Stanford, G., Nussbaum, 52 Traweek, K., & Berninger, V. (1997).
N., & Balise, R. (1998). An evaluation of the dyslexia Comparisons of beginning literacy programs:
training program: A multisensory method for promoting Alternative paths to the same learning outcome.
reading in students with reading disabilities. Journal of Learning Disability Quarterly, 20, 160-168.
Learning Disabilities, 31, 140-147.
53 Tunmer, W., & Hoover, W. (1993).
65 Peterson, M., & Haines, L. (1992). Phonological recoding skill and beginning reading.
Orthographic analogy training with kindergarten Reading and Writing: An Interdisciplinary Journal, 5,
children: Effects on analogy use, phonemic 161-179.
segmentation, and letter-sound knowledge. Journal of
Reading Behavior, 24 (1), 109-127. 69 Umbach, B., Darch, C., & Halpin, G. (1989).
Teaching reading to low performing first graders in rural
44 Santa, C., & Hoien, T. (1999). An assessment schools: A comparison of two instructional approaches.
of early steps: A program for early intervention of Journal of Instructional Psychology, 16, 23-30.
reading problems. Reading Research Quarterly, 34, 54-
79. 54 Vandervelden, M., & Siegel, L. (1997).
Teaching phonological processing skills in early literacy:
47 Silberberg, N., Iversen, I., & Goins, J. (1973). A developmental approach. Learning Disability
Which remedial reading method works best? Journal of Quarterly, 20, 63-81.
Learning Disabilities, 6, 18-27.
55 Vickery, K., Reynolds, V., & Cochran, S.
48 Snider, V. (1990). Direct instruction reading (1987). Multisensory teaching approach for reading,
with average first-graders. Reading Improvement, 27, spelling, and handwriting, Orton-Gillingham based
143-148. curriculum, in a public school setting. Annals of
Dyslexia, 37, 189-200.
74 Stuart, M. (1999). Getting ready for reading:
Early phoneme awareness and phonics teaching 70 Wise, B., Olson, R., Anstett, M., Andrews, L.,
improves rading and spelling in inner-city second Terjak, M., Schneider, V., Kostuch, J., & Driho, L.
language learners. British Journal of Educational (1989). Implementing a long-term computerized
Psychology, 69, 587-605. remedial reading program with synthetic speech
feedback: Hardware, software, and real-world issues.
67 Torgesen, J., Wagner, R., & Rashotte, C. Behavior Research Methods, Instruments and
(1997). Prevention and remediation of severe reading Computers, 21 (2), 173-180.
disabilities: Keeping the end in mind. Scientific Studies
of Reading, 1 (3), 217-234. 57 Wilson, K., & Norman, C. (1998). Differences
in word recognition based on approach to reading
51 Torgesen, J., Wagner, R., Rashotte, C., Rose, instruction. Alberta Journal of Educational Research,
E., Lindamood, P., Conway, T., & Garvan, C. (1999). 44, 221-230.
Preventing reading failure in young children with
phonological processing disabilities: Group and individual
responses to instruction. Journal of Educational
Psychology, 91, 579-593.
Appendix B
L i s t o f S t u d i e s i n t h e O r i g i n a l D a t a b a s e T h a t We r e E x c l u d e d
01 Becker, W., & Gersten, R. (1982). A follow-up 21 Hall, D., & Cunningham, P. (1988). Context as
of follow through: The later effects of the direct a polysyllabic decoding strategy. Reading Improvement,
instruction model on children in fifth and sixth grades. 25, 261-264.
American Educational Research Journal, 19(1), 75-92.
23 Jeffrey, W., & Samuels, S. (1967). Effect of
02 Berninger, V., Vaughan, K., Abbott, R., Brooks, method of reading training on initial learning and
A., Abbott, S., Rogan, L., Reed, E., & Graham, S. transfer. Journal of Verbal Learning and Verbal
(1998). Early intervention for spelling problems: Behavior, 6, 354-358.
Teaching functional spelling units of varying size with a
multiple-connections framework. Journal of Educational 24 Kaiser, S., Palumbo, K., Bialozor, R., &
Psychology, 90, 587-605. McLaughlin, T. (1989). The effects of direct instruction
with rural remedial students: A brief report. Reading
06 Eddowes, E. (1990). Teaching reading in Improvement, 26, 88-93.
kindergarten: Two contrasting approaches. Reading
Improvement, 27, 220-223. 25 Kibby, M. (1979). The effects of certain
instructional conditions and response modes on initial
07 Ehri, L., & Wilce, L. (1987). Cipher versus cue word learning. Reading Research Quarterly, 15, 147-
reading: An experiment in decoding acquisition. Journal 171.
of Educational Psychology, 79 (1), 3-13.
27 Kuder, S. (1990). Effectiveness of the DISTAR
10 Fielding-Barnsley, R (1997). Explicit instruction Reading Program for children with learning disabilities.
in decoding benefits children high in phonemic Journal of Learning Disabilities, 23, 69-71.
awareness and alphabet knowledge. Scientific Studies
of Reading, 1, 85-98. 30 Lovett, M., Borden, S., DeLuca, T., Lacerenza,
L., Benson, N., & Brackstone, D. (1994). Treating the
14 Foorman, B., & Liverman, D. (1989). Visual core deficits of developmental dyslexia: Evidence of
and phonological processing of words: A comparison of transfer of learning after phonologically- and strategy-
good and poor readers. Journal of Learning Disabilities, based reading training programs. Developmental
22, 349-355. Psychology, 30, 805-822.
16 Gettinger, M. (1987). Prereading skills and 31 Lovett, M., Ransby, M., & Barron, R. (1988).
achievement under three approaches to teaching word Treatment, subtype, and word type effects in dyslexic
recognition. Journal of Research and Development in children’s response to remediation. Brain and
Education, 19, 1-9. Language, 34, 328-349.
19 Guyer, B., Banks, S., & Guyer, K. (1993). 39 McNeil, J., & Donant, L. (1980). Transfer
Spelling improvement for college students who are effect of word recognition strategies. Journal of
dyslexic. Annals of Dyslexia, 43, 186-193. Reading Behavior, 12(2), 97-103.
20 Guyer, B., & Sabatino, D. (1989). The 40 Meyer, L. (1982). The relative effects of word-
effectiveness of a multisensory alphabetic phonetic analysis and word-supply correction procedures with
approach with college students who are learning poor readers during word-attack training. Reading
disabled. Journal of Learning Disabilities, 22, 430-434. Research Quarterly, 17(4), 544-555.
42 Olson, R., & Wise, B. (1992). Reading on the 49 Spiegel, D., Fitzgerald, J., & Heck, M. (1985).
computer with orthographic and speech feedback. Teaching children to use a context-plus-phonics
Reading and Writing: An Interdisciplinary Journal, 4, strategy. Reading Horizons, 25, 176-185.
107-144.
50 Sullivan, H., Okada, M., & Niedermeyer, F.
43 Olson, R., Wise, B., Ring, J., & Johnson, M. (1971). Learning and transfer under two methods of
(1997). Computer-based remedial training in phoneme word-attack instruction. American Educational
awareness and phonological decoding: Effects on the Research Journal, 8(2), 227-239.
post-training development of word recognition.
Scientific Studies of Reading, 1(3), 235-253. 56 Walton, P. (1995). Rhyming ability, phoneme
identify, letter-sound knowledge, and the use of
45 Scherzer, C. & Goldstein, D. (1982). Children’s orthographic analogy by pre-readers. Journal of
first reading lesson: Variables influencing within-lesson Educational Psychology, 87(4), 587-597.
emotional behavior and post-lesson achievement.
Journal of Educational Psychology, 74(3), 382-392. 58 Wise, B., & Olson, R. (1992). How poor
readers and spellers use interactive speech in a
46 Schwartz, S. (1988). A comparison of computerized spelling program. Reading and Writing:
componential and traditional approaches to training An Interdisciplinary Journal, 4, 145-163.
reading skills. Applied Cognitive Psychology, 2, 189-
201.
Appendix C
05 Brown & Felton, 1990 Pollack, C., & Atkins, R. (1971). The intersensory
reading program: My own reading book, part II.
Lippincott basic reading. (1981). Riverside, NJ: Brooklyn, NY: Book-Lab, Inc.
Macmillan Publishing Company.
Pollack, C., & Lane, P. (1969). The lipreader, part
11 Foorman et al., 1998 I. Brooklyn, NY: Book-Lab, Inc.
Open Court Reading. (1995). Collections for young Pollack, C., & Lane, P. (1971) The lipreader, part
scholars. Chicago and Peru, IL: SRA/McGraw-Hill. II. Brooklyn, NY: Book-Lab, Inc.
Engelmann, S., Johnson, G., Carnine, L., Meyer, L., Engelmann, S., & Bruner, E. C. (1978). DISTAR
Becker, W., & Eisele, J. (1988). Corrective reading: reading: An instructional system. Chicago: Science
Decoding strategies, decoding B1. Chicago: Science Research Associates.
Research Associates, Inc.
55 Vickery et al., 1987
37 Marston et al., 1995
Gillingham, A., & Stillman, B. W. (1956). Remedial
Engelmann, S. (1980). Direct instruction. training for children with specific disability in reading,
Englewood Cliffs, NJ: Educational Technology spelling, and penmanship (5th ed). Cambridge, MA:
Publications. Educators Publishing Service.
Science Research Associates (1988). Reading Cox, A. (1984). Structures and techniques:
mastery. New York: SRA, Macmillan/McGraw-Hill. Multisensory teaching of basic language skills.
Cambridge, MA: Educators Publishing Service.
Waites, L., & Cox, A. (1969). Developmental Phonics Programs Emphasizing Larger
language disability . . . Basic training . . . Remedial Phonological Subunits
language training. Cambridge, MA: Educators
Publishing Service. 11 Foorman et al., 1998
69 Umbach et al., 1992 Hiebert, E., Colt, J., Catto, S., & Gary, E. (1992).
Reading and writing of first-grade students in a
Reading mastery series (1986) restructured chapter 1 program. American Educational
Research Journal, 29, 545-572.
Carnine, D., & Silbert, J. (1979). Direct instruction
reading. Columbus, OH: Merrill. 13 Foorman et al., 1997
72 Gersten et al., 1988 Traub, N., & Bloom, F. (1992). Recipe for reading
(3rd ed.). Cambridge, MA: Educators Publishing
Becker, W., Engelmann, S., Carnine, D., & Rhine, Service.
R. (1981). Direct instruction models. In R. Rhine (Ed.),
Encouraging change in American schools: A decade of 33 Lovett & Steinbach, 1997
experimentation (pp. 95-154). New York: Academic
Press. Gaskins, I., Downer, M., & Gaskins, R. (1986).
Introduction to the Benchmark school word
Carnine, D., Carnine, L., Karp, J., & Weisbert, P. identification/vocabulary development program. Media,
(1988) Kindergarten for economically disadvantaged PA: Benchmark School.
students: The direct instruction component. In C.
Wargen (Ed.), A resource guide to public school early Gaskins, I., Downer, M., Anderson, R., et al.
childhood programs (pp. 73-98). Alexandria, VA: (1988). A metacognitive approach to phonics: Using
ASCD what you know to decode what you don’t know.
Remedial and Special Education, 9, 36-41, 66.
Engelmann, S., & Osborn, J. (1987). DISTAR
language I. Chicago: Science Research Associates. 44 Santa & Hoien, 1999
74 Stuart, 1999 Morris, D., Shaw, B., & Perney, J. (1990). Helping
low readers in grades 2 and 3: An after school
Lloyd, S. (1992). The phonics handbook. UK: Jolly volunteer program. Elementary School Journal, 91, 133-
Learning. 150.
Dixon, R., Engelmann, S., & Meier, M. (1980). Bechthold, C., & Bechthold, D. (1976). Signs for
Spelling mastery, a direction instruction series - Level sounds. Hillsborough, CA: Educational Support
A. Chicago: Science Research Asssociates. Systems.
APPENDIX D
Table 2
Treatment-Control Group Comparisons in the Database Grouped by Type of Phonics Program and Coded for
Instructional Unit, Grade, Reading Ability of Participants, Length of Treatment, Type of Control Group, and Overall
Effect size on Literacy Outcome Measures
Identity/Typea Inst. Grade/ Lengthc Controld de
of Program Unit Abil.b
EMPHASIS ON SYNTHETIC PHONICS (S)
74 Jolly Phonics (S) Class K at risk 12 wks. Big Books (WL) 0.73
38 Successive phonics (S) Sm gp K at risk 8 wks. Reg. curr. 0.62
03 Blachman PA (S) Sm gp K at risk 2-3 yrs. Basal 0.72/0.36
04 SingSpellReadWrite (S) Class K 1 yr. Basal 0.51
51 Lindamood PA (S) Tutor K at risk 3 yrs Reg. curr. 0.33/0.67
72 Direct Instruction (S) Class K at risk 4 yrs. Reg. curr. -- /0.24
29 NRS-2 (Beck) (S) Sm gp 1st 1 yr. Basal 0.45
29 NRS-3 (Beck) (S) Sm gp 1st 1 yr. Basal 0.44
29 NRS-4 (Beck) (S) Sm gp 1st 1 yr. Basal 0.33
29 NRS-6 (Beck) (S) Sm gp 1st 1 yr. Basal 0.70
12 Synthetic basal (S) Class 1st 1 yr Whole word 2.27
04 SingSpellReadWrite (S) Class 1st 1 yr. Basal 0.25
15 Lippincott (S) Class 1st 1 yr. Whole word 0.84
48 Direct Instruction (S) Sm gp 1st 1 yr. Basal/Prev. yr. 0.38f
28 Direct Instruction (S) Tutor 1st 10 wks. Misc. (child reads) 1.99
08 Modif. Whole Lang (S) Class 1st at risk 1 yr. Basal 0.63
11 Open Court (S) Class 1st at risk 1 yr. Whole language 0.91
52 Direct Instruction (S) Class 1st at risk 1 yr. Whole language 0.07
69 Direct Instruction (S) Sm gp 1st at risk 1 yr. Basal 1.19
05 Lippincott (S) Sm gp 1st at risk 2 yrs. Whole word 0.48/.52
72 Direct Instruction (S) Class 1st at risk 3 yrs. Reg. curr. --/0.00
04 SingSpellReadWrite (S) Class 2nd 1 yr. Basal 0.38
57 Sequential phonics (S) Class 2nd 1 yr. Whole language -0.47
11 Open Court (S) Class 2nd lo ach. 1 yr. Whole language 0.12
37 Direct Instruction (S) Class gr 1-6 lo ach 10 wks. Reg. curr. 0.01
55 Orton-Gillingham (S) Class 3rd 1 yr. Previous prog. (RC) 0.04
55 Orton-Gillingham (S) Class 4th 1 yr. Previous prog. (RC) 0.04
55 Orton-Gillingham (S) Class 5th 1 yr. Previous prog. (RC) 0.61
55 Orton-Gillingham (S) Class 6th 1 yr. Previous prog. (RC) 0.43
33 Lovett Dir. Inst. (S) Sm gp gr 2-3 RD 9 wks(35 s) Misc. (Study skills) 0.24
33 Lovett Dir. Inst. (S) Sm gp gr 4 RD 9 wks(35 s) Misc. (Study skills) 1.42
33 Lovett Dir. Inst. (S) Sm gp gr 5-6 RD 9 wks(35 s) Misc. (Study skills) 0.09
17 Intersensory method (S) Tutor age 7-13 RD 18 wks. Misc. (Subj. tutor) 0.53
Table 2 (Continued)
Table 2 (Continued)
a
The programs listed as Direct Instruction include Reading Mastery and DISTAR.
b
Information about grade/reading ability refers to the point in time when instruction began. RD refers to children
classified as reading disabled. Lo ach refers to children above first grade who were identified as low achievers in
their ability to read. At risk refers to kindergartners or first graders who performed poorly either on reading tests
or on tests predictive of poor reading. If not marked, the sample consisted of normally developing readers.
c
s refers to the number of sessions.
d
RC means regular curriculum. WL means whole language. Misc. means miscellaneous category.
e
Effect sizes listed singly are those observed at the end of training that lasted one year or less. When training
lasted longer than one year, the first effect size reports the outcome at the end of the first year and the second
effect size reports the outcome at the end of training.
f
This effect size was not measured immediately after training but following a delay of six months.
g
RRD refers to a program derived from Reading Recovery that was modified to include systematic phonics
instruction in which phonemes were taught along with larger phonological units such as onsets, rimes and spelling
patterns.
APPENDIX E
Table 3
Mean Effect Sizes (d) as a Function of Moderator Variables and Tests to Determine Whether Effect Sizes Were
Significantly Greater Than Zero at p < 0.05, Whether Effect Sizes Were Homogeneous at p < 0.05, and Whether
Effect Sizes Differed From Each Other at p < 0.05. Effect Sizes Refer to Outcomes Immediately After Training
or At the End of One School Year, Whichever Came First, Unless Labeled as Followup or End of Training.
Moderator Variables No. Mean Homogen. 95% Contrasts
and Levels Cases d CI
Time of Posttest
End of Training 65 0.41* No 0.36 to 0.47 n.s.
End of Training or One Yeara 62 0.44* No 0.38 to 0.50
Followup 7 0.28* Yes 0.10 to 0.46
End of Trainingb 6 0.51* Yes 0.32 to 0.70 n.s.
Followup 6 0.27* Yes 0.07 to 0.46
Outcome Measures
Decoding regular words 30 0.67* No 0.57 to 0.77 DecR = DecP;
Decoding pseudowords 40 0.60* No 0.52 to 0.67 Both >
Reading misc. words 59 0.40* No 0.34 to 0.46 RW, Spel,
Spelling words 37 0.35* No 0.28 to 0.43 Oral,
Reading text orally 16 0.25* No 0.15 to 0.36 Comp.
Comprehending text 35 0.27* No 0.19 to 0.36
Characteristics of Participants
Grade
Kind. & First 30 0.55* No 0.47 to 0.62 K-1st >
2nd-6th, RD 32 0.27* Yes 0.18 to 0.36 2nd-6th/RD
Younger Grades
Kindergarten 7 0.56* Yes 0.40 to 0.73
First Grade 23 0.54* No 0.46 to 0.63
Kindergarten and First
Graders on Outcome Measures
Decoding regular words 8 0.98* No 0.81 to 1.16 DecR >
Decoding pseudowords 14 0.67* No 0.56 to 0.78 RMW, Co, Or;
Reading misc. words 23 0.45* No 0.37 to 0.53 Spel > Or;
Spelling words 13 0.67* No 0.54 to 0.79 DecP > Or.
Reading text orally 6 0.23* No 0.05 to 0.41
Comprehending text 11 0.51* No 0.36 to 0.65
Table 3 (Continued)
Table 3 (Continued)
______________________________________________________________________________________________
Moderator Variables No. Mean Homogen 95% Contrasts
and Levels Cases d CI
______________________________________________________________________________________________
Sample Size
20 to 31 14 0.48* No 0.26 to 0.70 n.s.
32 to 52 16 0.31* Yes 0.15 to 0.47
53 to 79 16 0.36* No 0.23 to 0.49
80 to 320 16 0.49* No 0.41 to 0.57
________________________________________________________________________________
* indicates that effect size was significantly greater than zero at p < 0.05.
ns indicates not significantly different from zero.
a
Effect sizes indicate literacy outcomes at the end of training for studies lasting 1 year or less, and at the end of
the first school year for studies that continued training beyond 1 year.
b
The six studies in both comparisons were the same studies.
c
The kindergarten and 1st grade at-risk groups had identical ds and were combined.
d
This effect size was adjusted to reduce the impact of one atypically large outlier.
Table 4
Characteristics of Sets of Studies of Special Interest
Table 5
Number of Comparisons by Grade and Reading Ability
Kindergarten 1 6 (K-AR) -- 7
Total 22 23 17 62
________________________________________________________________________________
Note. The symbols in parentheses refer to the groups that were created for the meta-analysis.
Appendix F
Table 6
Characteristics of the Treatment-Control Group Comparisons Utilizing Specific Phonics Programs That Were
Included in the Meta-Analysis
28 Direct Instruction (S) Tutor 1st 10 wks. Misc. (child reads) 1.99
52 Direct Instruction (S) Class 1st at risk 1 yr. Whole language 0.07
69 Direct Instruction (S) Sm gp 1st at risk 1 yr. Basal 1.19
37 Direct Instruction (S) Class gr 1-6 lo ach 10 wks. Reg. curr. 0.01
Appendix F (continued)
Table 7
1. Direct Instruction. The Direct Instruction program is based on a behavioral analysis of the steps involved in
learning to decode (Carnine & Silbert, 1979; Engelmann, 1980; Engelmann & Bruner, 1969, 1978, 1988;
Engelmann & Osborn, 1987; Kameenui et al., 1997). At the beginning of the program, students are not taught
letter names but only letter-sound relations through highly structured instruction that uses cueing and reinforcement
procedures derived from a behavioral analyses of instruction. The task of decoding is broken down into its
component parts, and each of these parts is taught separately, from letter sounds to blending to reading words in
context. Instruction is scripted and the lessons are fast paced, with high student participation. The text for the
first-year program is written in a script that, although it preserves English spelling, contains printed marks that cue
the reader about silent letters and different vowel sounds. Children practice in specially constructed books
containing taught sounds, although children may be encouraged to read widely in children’s literature as well (e.g.,
Meyer, 1983).
2. Lovett Direct Instruction. The synthetic phonics program used by Lovett and Steinbach (1997) and Lovett et
al. (in press) adopts the Direction Instruction model to remediate the decoding and phonemic awareness difficulties
of severely disabled readers. Children are taught phonological analysis and blending (phonemic awareness) orally
and also letter sound associations in the context of word recognition and decoding instruction. The program
focuses on training sound blending and acquisition of a left-to-right phonological decoding strategy. The special
orthography highlights salient features of many letters and provides visual cues such as symbols over long vowels,
letter size variations, and connected letters to facilitate learning. Cumulative, systematic review and many
opportunities for overlearning are hallmarks of this approach. New material is not introduced until the child fully
masters previously instructed material.
3. Lovett Analogy. A second program also used with severely disabled readers by Lovett and Steinbach (1997)
and Lovett et al. (in press) was adapted from the Benchmark Word Identification/Vocabulary Development
program developed by Gaskins et al. (1986). This program is strongly metacognitive in its focus. It teaches
children how to use four metacognitive strategies to decode words: reading words by analogy, detecting parts of
words that are known, varying the pronunciations of vowels to maintain flexibility in decoding attempts, and
“peeling off” prefixes and suffixes in words. Children learn a set of 120 key words exemplifying high-frequency
spelling patterns, 5 words per day. They learn to segment the words into subunits so that they can use these
known words and their parts to read other similarly spelled words. They learn letter-sound associations for vowels
and affixes. Various types of texts provide children with practice applying the strategies taught.
4. Lippincott. The Lippincott Basic Reading Series (McCracken & Walcutt, 1963, 1975) is a direct code method
which, from the outset, approaches reading from a phonic/linguistic perspective. Beginning with children’s spoken
language, the Lippincott program teaches in a systematic manner how to use the alphabetic code to move from
printed words to oral language. Instruction begins with short-a and builds knowledge of regular sound/symbol
relationships. Children are first taught to decode phonetically regular words, with blending of phonic elements
directly taught. Once they are proficient, long vowels and irregular spellings are introduced. Although the primary
instructional focus is on decoding, another goal of this method is the instant recognition of words. However, rather
than relying on a “context clue” approach to word recognition, children are taught how and why the letters come to
Appendix F (continued)
Table 7 (continued)
represent these words, and they learn to “break the code” to decipher new words independently. Review and
reinforcement are an integral part of the program. Spelling is sometimes taught as one component of the reading
lesson with spelling lists developed from the words introduced in each unit of reading instruction (Brown & Felton,
1990).
5. NRS by Beck and Mitroff. The New Primary Grades Reading System for an Individualized Classroom (NRS)
was developed by Beck and Mitroff (1972). It is a code-breaking approach. The program begins by teaching
self-management skills, letter-sound correspondences, and chain blending to decode words. Children are taught to
pronounce the first letter of a word followed by the second letter and then to blend the two sounds; then they
pronounce the third letter and add it to the blend. In the first lesson, children are taught five isolated letter-sound
relations, and once they are known, children are immediately taught to blend them to form real words. Subsequent
letter-sounds are taught one at a time and blended with the earlier letters. Not only synthetic phonics but also
analytic phonics is taught as children explore words and their parts. The method is linguistic as well because the
major spelling patterns of words are displayed in texts to draw attention to similarities and contrasts, and because
there is minimum teaching of explicit pronunciation rules. Instruction is individualized. After the first two levels,
children work through the curriculum at different rates.
6. Orton Gillingham. The Orton-Gillingham approach (Cox, 1991; Gillingham & Stillman, 1979) begins with the
direct teaching of individual letters paired with their sounds using a Visual-Auditory-Kinesthetic-Tactile (VAKT)
procedure that involves tracing the letter while saying its name and sound, blending letters together to read words
and sentences, and finally reading short stories constructed to contain only taught sounds. Spelling words from
dictation is also part of an Orton-Gillingham lesson. Each letter-sound is learned to mastery through repetition.
More advanced lessons involve teaching learners to blend syllables together and read more complex texts.
Among those approaches based on Orton and Gillingham’s work are the Slingerland approach (Lovitt & DeMier,
1984), the Spaulding Approach, Recipe for Reading, and Alphabetic Phonics (Ogden, Hindman, & Turner, 1989).
There are differences among these approaches, largely in the sequencing of materials, but they all have the
general characteristics discussed.
7. Sing, Spell, Read & Write. The Sing, Spell, Read and Write (SSRW) program (Dickson, 1972) also teaches
synthetic phonics. It consists of several charts, books (both readers and workbooks), letter and word cards, tests,
and audio tapes. The tapes contain songs about several phonics generalizations. Through the tapes, the students
learn the sounds of letters and letter combinations. Also songs combined with charts help students learn the
spellings of words. The lessons begin by teaching letter-sounds in isolation for each letter of the alphabet. When
students have mastered certain sounds, they begin reading phonetic storybooks. The first five books each focus on
a different vowel sound. The remaining books expand the vocabulary in a way that is consistent with the letter-
sounds taught. Students are taught to spell the words they learn to read, with the words presented in sentences.
Most of the writing students do involves filling in blanks or answering questions related to words being learned.
The program has a “racetrack” which is posted in classrooms and notes students’ progress by placement of a race
car on the chart (Bond et al., 1995-96).
03 - Blachman et .
. . . . . . . . . . . . . . . .
al., (1999)
2-3yrs(41s- .
Blachman PA Sy n Basal S mG K AR Lo w NE No 159 Imm. 0.72 -0.17 1.08 0.94 . 1.04 .
,20m/d )
(1st g r=30 .
Blachman PA 128 2nd yr tr. 0.64 0.35 0.81 0.53 . 0.86 .
m/ d )
(2nd g r=30 .
Blachman PA 106 3rd yr tr. 0.36 0.42 0.55 0 . 0.45 .
m/ d )
04 - Bond et al.,
1995
2-169
.
Lip p inco tt 2nd yr tr. 0.52 0.51 0.63 0.38 . 0.55 .
National Reading Panel
08 - Eldredge,
Appendices
1991
Mo d ifie d Who le 1 yr. .
Sy n Basal Class 1st AR Lo w NE No 105 Imm. 0.63 . . . 0.83 0.43 .
Lang uag e (15m/d )
09 - Evans, 1985
20*(N=- .
Trad itio nal Basal Mis c Wh.L. Class 1 yr. 1st N Var NE NG Imm. 0.6 . . . 0.6 . .
247)
Appendix G (continued)
Reports of the Subgroups
11 - Foorman et
al., 1998
1 yr. .
Op e n Co urt Sy n Wh.L. Class 1st AR Var NE NG 68 Imm. 0.91 1.63 1.14 0.56 0.32 . .
(30m/d )
.
E mb e d d e d LU Wh.L. Class 1 yr. 1st AR Var NE NG 70 Imm. 0.36 0.56 0.51 0.26 0.1 . .
.
Op e n Co urt Sy n Wh.L. Class 1 yr. 2nd LA Var NE NG 35 Imm. 0.12 0.52 0.32 -0.19 -0.19 . .
.
E mb e d d e d LU Wh.L. Class 1 yr. 2nd LA Var NE NG 57 Imm. 0.03 0.37 0.22 -0.25 -0.24 . .
12 - Foorman et
al., 1991
1 yr. (45 6*(N=8- .
Synthe tic b asal Sy n Wh.W. Class 1st N Mid NE No Imm. 2.27 1.92 2.67 2.21 . . .
m/ d ) 0)
2-170
13 - Foorman et
al., 1997
1 yr. (60 .
Orto n-Gilling ham Sy n Wh.W. S mG g r 2-3 RD Mid NG Ye s 67 Imm. 0.27 0.17 0.58 0.05 . . .
m/ d )
.
Onse t-rime LU Wh.W. S mG 1 yr. g r 2-3 RD Mid NG Ye s 85 Imm. -0.11 -0.19 0.09 -0.23 . . .
15 - Fulwiler &
Groff, 1980
.
Lip p inco tt Sy n Wh.W. Class 1 yr. 1st N NG NE NG 147 Imm. 0.84 . 0.91 . 0.76 . .
17 - Gittelman &
Feingold, 1983
Inte rse nso ry 18 .
Sy n Mis c . Tuto r 7-13yr RD Mid R No 56 Imm. 0.53 0.76 0.67 0.12 0.57 . .
Me tho d wks.(54s)
18 - Greaney et
al., 1997
11
RRD-Rime .
LU Wh.W. Tuto r wks(31s,3- g r 2-5 LA NG R No 36 Imm. 0.37 0.39 . . . 0.51 0.2
analo g y
0m)
RRD-Rime .
34 fo llo w up 0.56 0.47 . . . 0.76 0.44
analo g y
Appendix G (continued)
Characteristics of Training Characteristics of Part. Features of Design Effect Sizes on Post-tests
Gen.
Author and Year, Type of Control Length of Grade/ Reading Group Sig Pre- Time of Oral
Tr.unit SES Total N Mean Word ID Dec Spell Comp Nonw Read
Treatment Phonics Group Training Age Ability Assign. test Diff Post-test Read
22 - Haskell et
al., 1992
Analyze Onse t- 6 wks(15s, .
Mis c Wh.W. S mG 1st N Mid R No 24 Imm. 0.14 0.2 0.09 . . . .
Rime s 20m)
Analyze 6 wks(15s, .
Mis c Wh.W. S mG 1st N Mid R No 24 Imm. -0.07 -0.08 -0.06 . . . .
Pho ne me s 20m)
26 - Klesius et
al., 1991
6*(N=1- .
Trad itio nal Basal Mis c Wh.L. Class 1 yr. 1st N Var NE Ye s Imm. 0.2 . . 0.36 0.18 0.07 .
12)
28 - Leach &
Siddall, 1990
10 wks .
Dire ct Instructio n Sy n Mis c . Tuto r 1st N NG R No 20 Imm. 1.99 . . . 1.8 . 2.18
(15m/d )
2-171
29 - Leinhardt &
Engel, 1981
NRS-stud y 2 .
Sy n Basal S mG 1 yr. 1st N NG NE Ye s 187 Imm. 0.45 0.45 . . . . .
(Be ck)
NRS-stud y 3 .
Sy n Basal S mG 1 yr. 1st N NG NE Ye s 263 Imm. 0.44 0.44 . . . . .
(Be ck)
NRS-stud y 4 .
Sy n Basal S mG 1 yr. 1st N NG NE Ye s 256 Imm. 0.33 0.33 . . . . .
(Be ck)
NRS-stud y 6 .
Sy n Basal S mG 1 yr. 1st N NG NE Ye s 241 Imm. 0.7 0.7 . . . . .
(Be ck)
32 - Lovett et al.,
National Reading Panel
1989
40 se s .
De co d ing Skills Sy n Mis c . S mG 8-13yr RD Mid R No 118 Imm. 0.39 0.78 0.7 0.42 0.07 0.1 0.27
Appendices
(33-40h)
33 - Lovett &
Steinbach, 1997
.
Lo ve tt Analo g y LU Mis c . S mG 9wks (35h) g r 2/3 RD NG R No 28 Imm. 0.49 -0.12 0.85 . . 0.75 .
.
Lo ve tt Analo g y LU Mis c . S mG 9wks (35h) g r 4 RD NG R No 22 Imm. 1.41 0.84 2.06 . . 1.33 .
Appendix G (continued)
Reports of the Subgroups
.
Lo ve tt Analo g y LU Mis c . S mG 9wks (35h) g r 5/6 RD NG R No 24 Imm. -0.25 -0.49 -0.15 . . -0.1 .
Lo ve tt Dire ct .
Sy n Mis c . S mG 9wks (35h) g r 2/3 RD NG R No 32 Imm. 0.24 0.02 0.24 . . 0.46 .
Instructio n
Lo ve tt Dire ct .
Sy n Mis c . S mG 9wks (35h) g r 4 RD NG R No 25 Imm. 1.42 1.03 1.53 . . 1.7 .
Instructio n
Lo ve tt Dire ct .
Sy n Mis c . S mG 9wks (35h) g r 5/6 RD NG R No 27 Imm. 0.09 -0.24 0.25 . . 0.25 .
Instructio n
34 - Lovett et al.,
1990
.
Analytic Mis c Mis c . S mG 9wks (35h) 7-13yr RD Mid R NG 36 Imm. 0.16 0.13 0.11 0.23 . . .
35 - Lum &
Morton, 1984
2-172
1 yr.(20-30 .
Sp e lling Maste ry Mis c Rg .cls. Class 2nd N NG NE No 36 Imm. 0.38 0.31 . 0.45 . . .
m/ d )
36 -
Mantzicopoulos
et al., 1992
Pho ne tic 50s .
Mis c Rg .cls. Tuto r 1st AR Mid R No 112 Imm. 0.53 . . . . 0.53 .
re ad /sp e ll (1h/wk)
Pho ne tic .
112 fo llo w up 0.32 . 0.33 0.3 0.08 0.56 .
re ad /sp e ll
37 - Marston et
al., 1995
10 wks .
Dire ct Instructio n Sy n Rg .cls. Class g r 1-6 LA NG NE Y/ Ad j 53 Imm. 0.01 . . . . . 0.01
(45m/d )
38 - Martinussen
& Kirby, 1998
Succe ssive 8 wks(40- .
Sy n Rg .cls. S mG K AR NG R No 26 Imm. 0.62 0.53 0.63 0.68 . 0.62 .
p ho nics 60m/wk)
41 - Oakland et
al., 1998
2 .
Orto n-Gilling ham Sy n Rg .cls. S mG M=11y RD NG NE Ye s 48 2nd yr tr. 0.54 0.71 . 0.23 0.62 0.61 .
yrs.(350h)
Appendix G (continued)
Characteristics of Training Characteristics of Part. Features of Design Effect Sizes on Post-tests
Gen.
Author and Year, Type of Control Length of Grade/ Reading Group Sig Pre- Time of Oral
Tr.unit SES Total N Mean Word ID Dec Spell Comp Nonw Read
Treatment Phonics Group Training Age Ability Assign. test Diff Post-test Read
44 - Santa &
Hoien, 1999
1 .
RRD-Early Ste p s LU Wh.L. Tuto r 1st AR Var NE No 49 Imm. 0.76 0.93 . 0.63 0.73 . .
yr.(30m/d )
.
RRD-Early Ste p s 41 fo llo w up 0.86 0.57 . . 0.87 1.15 .
47 - Silberberg et
al., 1973
.
Lip p inco tt Sy n Wh.W. S mG 1 yr. gr 3 RD NG NE Ye s 69 Imm. 0.5 0.7 . . 0.36 . 0.45
.
Orto n-Gilling ham Sy n Wh.W. S mG 1 yr. gr 3 RD NG NE Ye s 65 Imm. 0.04 0.31 . . 0.09 . -0.29
.
Lip p inco tt 62 fo llo w up 0.33 0.37 . . -0.04 . 0.66
.
2-173
48 - Snider, 1990
.
Dire ct Instructio n Sy n Basal S mG 1yr.(60m/d ) 1st N Mid NE No 66 fo llo w up 0.38 . 0.6 0.44 0.1 . .
51 - Torgesen et .
. . . . .
al., 1999
2.5
.
Lind amo o d PA Sy n Rg .cls. Tuto r yrs.(80m/- K AR NG R No 65 Imm. 0.33 0.08 . . . 0.58 .
wk)
2.5
.
National Reading Panel
Appendices
.
Lind amo o d PA 65 2nd yr tr. 0.75 0.64 . . 0.49 1.13 .
.
E mb e d d e d 68 2nd yr tr. 0.28 0.24 . . 0.29 0.31 .
.
Lind amo o d PA 65 3rd yr tr. 0.67 0.67 . 0.64 0.36 1.01 .
.
E mb e d d e d 68 3rd yr tr. 0.17 0.25 . 0.1 0.17 0.16 .
Appendix G (continued)
Reports of the Subgroups
52 - Traweek &
Berninger, 1997
.
Dire ct Instructio n Sy n Wh.L. Class 1yr. 1st AR Lo w NE Y/ Ad j 38 Imm. 0.07 0.07 . . . . .
53 - Tunmer &
Hoover, 1993
42 s .
RRD-Pho no g rams LU Rg .cls. Tuto r 1st AR NG NG NG 64 Imm. 3.71 2.94 . 1.63 . 1.49 8.79
(30m/d )
54 - Vandervelden
& Siegel, 1997
12wks(30-- .
De ve lo p me ntal Mis c Rg .cls. S mG K AR Lo w NE No 29 Imm. 0.47 0.04 . 1.11 . 0.57 0.15
45m/wk)
55 - Vickery et
al., 1987
2-174
1 yr.(55 0.04
Orto n-Gilling ham Sy n Rg .cls. Class 3rd N NG NE NG 63 Imm. 0.04 . . . . . .
m/ d )
1 yr.(55 0.04
Orto n-Gilling ham Sy n Rg .cls. Class 4th N NG NE NG 71 Imm. 0.04 . . . . . .
m/ d )
1 yr.(55 0.61
Orto n-Gilling ham Sy n Rg .cls. Class 5th N NG NE NG 74 Imm. 0.61 . . . . . .
m/ d )
1 yr.(55 0.43
Orto n-Gilling ham Sy n Rg .cls. Class 6th N NG NE NG 79 Imm. 0.43 . . . . . .
m/ d )
1 yr.(55 0.63
Orto n-Gilling ham Sy n Rg .cls. Class 3rd LA NG NE NG 46 Imm. 0.63 . . . . . .
m/ d )
1 yr.(55 0.19
Orto n-Gilling ham Sy n Rg .cls. Class 4th LA NG NE NG 47 Imm. 0.19 . . . . . .
m/ d )
1 yr.(55 -0.2
Orto n-Gilling ham Sy n Rg .cls. Class 5th LA NG NE NG 45 Imm. -0.2 . . . . . .
m/ d )
1 yr.(55 0.13
Orto n-Gilling ham Sy n Rg .cls. Class 6th LA NG NE NG 41 Imm. 0.13 . . . . . .
m/ d )
57 - Wilson &
Norman, 1998
Se q ue ntial .
Sy n Wh.L. Class 1 yr. 2nd N NG NE No 54 Imm. -0.47 -0.33 . . -0.61 . .
p ho nics
Appendix G (continued)
Characteristics of Training Characteristics of Part. Features of Design Effect Sizes on Post-tests
Gen.
Author and Year, Type of Control Length of Grade/ Reading Group Sig Pre- Time of Oral
Tr.unit SES Total N Mean Word ID Dec Spell Comp Nonw Read
Treatment Phonics Group Training Age Ability Assign. test Diff Post-test Read
59 - Freppon,
1991
Se q ue ntial .
Mis c Wh.L. Class 1 yr. 1st N Mid NE Ye s 24 Imm. 0 . . . . . 0
p ho nics
60 - Griffith et al.,
1992
.
Trad itio nal b asal Mis c Wh.L. Class 1 yr. 1st N NG NE No 24 Imm. -0.33 -1.11 . -0.54 -0.43 0.78 .
69 - Umbach et
al., 1989
1 yr.(50 .
Dire ct Instructio n Sy n Basal S mG 1st AR Lo w R No 31 Imm. 1.19 1.3 . . 1.08 . .
m/ d )
72 - Gersten et
al., 1988
2-175
0.27
Dire ct Instructio n Sy n Rg .cls. Class 4 yrs. K AR Lo w NE No 101 4th yr tr 0.24 . . 0.16 0.28 . .
0.02
Dire ct Instructio n Sy n Rg .cls. Class 3 yrs. 1st AR Lo w NE No 141 3rd yr tr. 0 . . -0.12 0.11 . .
74 - Stuart, 1999
12
.
Jo lly Pho nics Sy n Wh.L. Class wks(60m/- K AR Lo w NE Y/ Ad j 112 Imm. 0.73 0.56 . 1.11 0.36 0.9 .
d)
.
Jo lly Pho nics 112 fo llo w up 0.28 0.11 . 0.5 0.31 -0.03 0.49
75 - Lovett et al.,
National Reading Panel
(in press)
Dir. Instructio n + .
Appendices
Co m Mis c . S mG 70h 6-13yr RD Var R NG 37 Imm. 0.6 0.36 1 0.15 0.27 1.22 .
Analo g y
Analo g y + Dire ct .
Co m Mis c . S mG 70h 6-13yr RD Var R NG 32 Imm. 0.21 0.04 0.55 -0.2 0.12 0.52 .
Instructio n
Lo ve tt Dire ct .
Sy n Mis c . S mG 70h 6-13yr RD Var R NG 40 Imm. 0.24 0.21 0.36 -0.19 0.42 0.42 .
Instructio n
.
Lo ve tt Analo g y LU Mis c . S mG 70h 6-13yr RD Var R NG 42 Imm. 0.5 0.47 0.75 0.01 0.6 0.66 .
Chapter 2, Part II: Phonics Instruction
Appendix G (continued)
Abbreviations Key
FLUENCY
Executive Summary
in unsupervised independent reading at school or home. A similar search process was carried out to identify and
This report examined the evidence concerning the locate articles on the effectiveness of encouraging
effectiveness of both guided oral reading procedures independent silent reading practice. Search of the
and approaches that encourage students to read more. PsycINFO database identified 478 articles, while the
ERIC database identified 325 articles. Removing
Methodology redundant articles resulted in 603 unique articles on
instruction in the various approaches to encouraging
How Was the Analysis of the Research
independent reading practice. Review of each of the
Literature Conducted?
article’s adherence to the NRP criteria resulted in the
The NRP conducted an extensive and systemic identification of 92 articles. Further careful analysis of
literature review on these two approaches to the these articles according to their adherence to the
development of fluency. Using the methodology and methodology of the NRP selection procedures resulted
criteria developed for this purpose by the NRP, to reach in further reduction, with a resulting 14 of which could
its conclusions on the effectiveness of each approach, be used in the meta-analysis to address the Panel’s
the Panel included only: question of whether this instructional approach has
proven to be effective in improving reading fluency.
1. Studies that were experimental tests of the Additionally, this analysis was bolstered through a
procedures under examination. qualitative analysis of 37 other studies that also met
2. Studies that were conducted with students in these criteria but that could not be used in the meta-
kindergarten through grade 12. analysis for various reasons. These studies were
checked for their consistency of findings with those
3. Studies that had appeared in a refereed journal. analyzed in the meta-analysis.
4. Studies that had been carried out with English As a result of the limitations of the number and quality
language reading. of studies examining the effectiveness of encouraging
Each study which met these criteria was summarized independent reading, a meta-analysis was appropriate
and coded. Where appropriate, the studies were only in examining the effectiveness of repeated oral
analyzed for their effect sizes, as this allowed the Panel reading instructional approaches. In the meta-analysis,
to determine quantitatively the amount of difference the primary statistic used was “effect size,” indicating
such procedures made in children’s reading the extent to which performance of the treatment group
development. Studies that could not be analyzed is greater than performance of the control group. For
quantitatively were also examined in order to evaluate example, an effect size of 1.0 indicates that the
the consistency of their findings with those obtained treatment group mean was one standard deviation
from the quantitative studies. higher than the control group mean, revealing a strong
effect of guided oral reading instruction. In contrast, an
In its work, the Panel searched two separate databases: effect size of 0 indicates that treatment and control
PsycINFO and ERIC. The search using PsycINFO group means were identical and that the treatment had
identified 1,260 potential articles on instructional no measurable effect on measured reading
PsycINFO approaches to teaching repeated oral performance. In practice, the strength of an effect size
reading. This number was deemed too large to search can be gauged: a value of 0.20 is considered small; 0.50
efficiently, so the Panel limited its search to articles that is moderate, and 0.80 is large. When available, effect
had been published since, and including, 1990. This sizes were calculated to determine whether repeated
reduced the number of articles for this topic to 346. A oral reading improved children’s accuracy, fluency, and
parallel search using ERIC identified 410 potential comprehension.
articles. Removing redundant articles between the two
databases resulted in 364 unique articles. Review of
each of these article’s adherence to the NRP criteria
resulted in a total of 77 articles that were coded for
possible use in the final analysis.
In contrast, the NRP did not find evidence supporting fluency? Research is needed to attempt to disentangle
the effectiveness of encouraging independent silent the particular contributions of components of guided
reading as a means of improving reading achievement. reading, such as oral reading, guidance, repetition, and
text factors. And it is important to know for which
The results of this study indicate that teachers should
children, at what level of reading ability and in what
assess fluency regularly. Both informal as well as
setting and by whom (teachers, classroom aides, peers,
standardized assessments of oral reading accuracy, rate
parents) and for how long do different approaches to
and comprehension are available and referenced in the
guided oral reading work best?
full report.
Research is needed over longer time spans to provide
The demonstrated effectiveness of guided oral reading
information about the emergence of fluency and its
compared to the lack of demonstrated effectiveness of
relationship to specific instructional practices. And
strategies encouraging independent silent reading
where along the development of reading are what
suggests the importance of explicit compared to more
specific approaches to encouraging fluency most
implicit instructional approaches for improving reading
effective?
fluency.
Research is needed to study in more analytic and
Directions for Further Research rigorous ways, the impact of independent reading on a
The National Reading Panel’s extensive review range of reading outcomes. Since encouraging
demonstrated good reason to provide instruction independent reading is so intuitively appealing and so
encouraging the development of fluency and overall frequently recommended, it is critical to clarify in a
reading proficiency, and indicated which specific more definitive way the relationship between programs
approaches the evidence supports as being most that encourage independent reading and reading
effective in increasing fluency. However, this review development. There is a clear need for rigorous
reveals important gaps in our knowledge. Future experimental research on the impact of programs that
research is necessary to address some of these encourage reading on different populations of students
questions. at varying ages and reading levels using several
different reading outcomes, including amount of reading
Research is needed to address the question of the and specific components of reading achievement, and
relationship between guided oral reading instruction and where the amount of independent reading is carefully
the development of fluency. What elements of monitored.
instructional practice are most responsible for improved
FLUENCY
Report
The purpose of this report of the NRP is to review the theory have reconceptualized this aspect of reading
changing concepts of fluency as an essential aspect of performance. Research has increasingly turned towards
reading and to consider the effectiveness of two major considerations of how instruction and reading
instructional approaches to fluency development and the experience contribute to fluency development.
readiness of these approaches for wide use by the
The purpose of this report is to review the changing
schools: first, procedures that emphasize repeated oral
concepts of fluency as an essential aspect of reading
reading practice or guided repeated oral reading
and to consider the effectiveness of two major
practice; and second, all formal efforts to increase the
instructional approaches to fluency development and the
amounts of independent or recreational reading that
readiness of these approaches for wide use by the
children engage in, including sustained silent reading
schools. The first major approach that will be analyzed
programs. Because of the fundamental differences in
here includes procedures that emphasize repeated oral
these two approaches, and because of the differing
reading practice or guided repeated oral reading
amounts and nature of the articles in these two areas,
practice. These procedures include repeated reading
the Panel was able to perform meta-analysis only on
(Samuels, 1979), neurological impress (Heckelman,
studies relevant to the first topic, repeated oral or
1969), radio reading (Greene, 1979), paired reading
guided reading. There were too few experimental
(Topping, 1987), and a variety of similar techniques
studies of the variety of approaches to silent reading for
aimed at developing fluent reading habits. The second
such an analysis; therefore, the Panel performed a
major approach considered here includes all formal
more informal analysis of these studies, but felt that
efforts to increase the amounts of independent or
some discussion of the studies was nonetheless
recreational reading that children engage in, including
important.
sustained silent reading programs (Hunt, 1970), the
As a result of these different types of analyses, this Accelerated Reader (Advantage Learning Systems,
report is organized in a slightly different way from the 1986), and various incentive programs (i.e., Shanahan,
other subreports by the Panel. First, an overall Wojciehowski, & Rubik, 1998).
introduction addresses the importance of the
Why is fluency important and how well are students
development of fluency in reading and provides
doing in achieving fluency? The National Assessment
background for two subsections. From that point, the
of Educational Progress conducted a large study of the
report is organized in two major sections, with individual
status of fluency achievement in American education
methods, results and discussion, implications for reading
(Pinnell et al., 1995). That study examined the reading
instruction and directions for future research. Finally,
fluency of a nationally representative sample of 4th
the Panel offers overall conclusions on extant research
graders and found 44% of students to be disfluent even
addressing reading fluency.
with grade-level stories that the students had read under
Introduction supportive testing conditions. Moreover, that study
found a close relationship between fluency and reading
Fluency, the ability to read a text quickly, accurately, comprehension. Students who are low in fluency may
and with proper expression, has been described as the have difficulty getting the meaning of what they read.
“most neglected” reading skill (Allington, 1983), and Given this, it is not surprising that the National Research
with good reason. For much of the 20th century, Council report, Preventing Reading Difficulties in Young
researchers and practitioners alike assumed that Children (Snow, Burns, & Griffin, 1998), states
fluency was the immediate result of word recognition “Adequate progress in learning to read English (or any
proficiency, so efforts were directed towards the alphabetic language) beyond the initial level depends on
development of word recognition, whereas fluency itself sufficient practice in reading to achieve fluency with
was largely ignored. That neglect has started to give different texts” (p. 223), and that it recommends,
way during the past three decades as research and
“Because the ability to obtain meaning from print perform complex acts with ease, and the Bryan and
depends so strongly on the development of word Harter (1899) studies described how telegraph
recognition accuracy and reading fluency, both should operators learned to send and receive Morse code
be regularly assessed in the classroom, permitting timely accurately in larger and larger units.
and effective instructional response when difficulty or
Not all research was carried out during this early period
delay is apparent” (p. 7).
addressed psychomotor behavior, however. Huey’s
Changing Concepts of Fluency (1905) book on the reading process became a classic in
the field in part because it summarized the research
Over the past three decades, our understanding of what findings of the 1800s on word recognition and eye
is involved in reading fluency has been altered and movements during reading and in part because it was
enlarged. One finds, for example, in the 1974 LaBerge the harbinger for what would later develop into the
and Samuels’ article on automatic information cognitive psychology paradigm. In that work, Huey
processing in reading, an emphasis on word recognition. made the following perceptive observation about the
This same focus persists in the The Literacy Dictionary development of fluency:
definition (Harris & Hodges, 1995) that states that
fluency is “freedom from word identification problems.” Perceiving being an act, it is, like all other things
More recent conceptualizations of fluency, however, that we do, performed more easily with each
have been extended beyond word recognition and may repetition of the act. To perceive an entirely new
embrace comprehension processes as well (Thurlow & word or other combination of strokes requires
van den Broek, 1997). considerable time, close attention, and is likely to be
imperfectly done, just as when we attempt some
In its early conception, it was recognized that fluency new combination of movements, some new trick in
requires high-speed word recognition that frees a the gymnasium or new “serve” at tennis. In either
reader’s cognitive resources so that the meaning of a case, repetition progressively frees the mind from
text can be the focus of attention. However, it is now attention to details, makes facile the total act,
clear that fluency may also include the ability to group shortens the time, and reduces the extent to which
words appropriately into meaningful grammatical units consciousness must concern itself with the process
for interpretation (Schreiber, 1980, 1987). Fluency (p. 104).
requires the rapid use of punctuation and the
determination of where to place emphasis or where to From about 1910 until the middle of the 1950s, during
pause to make sense of a text. Readers must carry out what we now designate as the period of “Behaviorism,”
these aspects of interpretation rapidly—and usually little research was done on automaticity or reading
without conscious attention. Thus, fluency helps enable fluency. Researchers who worked within psychology’s
reading comprehension by freeing cognitive resources behavioral paradigm tended to shy away from research
for interpretation, but it is also implicated in the process on reading as a psychological process. But, by the
of comprehension as it necessarily includes preliminary 1970s, the pendulum had moved away from
interpretive steps. behaviorism and back to studies of “inside-the-head”
phenomena such as problemsolving and reading. As a
Early Research on Expertise and Fluency result, cognitive psychologists of the period again
Recognition of the importance of automatic processes considered issues such as letter recognition (Posner &
and reading fluency is not new to psychology or Snyder, 1975) and lexical access (Neely, 1977).
education. During the last century, and certainly in the It was during this period that linguists attempted to
last 30 years, there has been interest in skills acquisition describe the reading process. Fries (1962), for example,
and expertise. Many early investigations of expertise discussed the importance of mapping spoken language
focused on perceptual-motor skills. For example, the onto print within reading. According to Fries, to be
Principles of Psychology (James, 1890) explained the considered a fluent reader, a person has to do this
importance of practice and repetition in the language mapping rapidly and easily. Soon after,
development of the skills that enabled someone to LaBerge and Samuels (1974) published their general
theory of automatic information processing in reading in would include various reading behaviors or processes
which they explained why automaticity in word because it is clear that it takes a considerable period of
recognition was an important prerequisite to skilled time and substantial practice before even the fastest
reading comprehension. This insight was echoed and learners can be considered to be fluent readers.
expanded in later work.
Furthermore, researchers have generated property lists
By this point, theoreticians began to wonder about how that can be used to distinguish automatic from non-
fluency skills develop. Stanovich (1990), for example, automatic processes. According to Logan (1997), “The
was critical of assumptions regarding cognitive resource general strategy was to find a list of properties that
limitations, and Logan’s (1997) instance theory could be used to define and diagnose automaticity, so
explained how a single exposure to a word could leave that processes, tasks, or performances that possessed
a sufficient memory trace to allow it to be recognized those properties could be designated ‘automatic,’ and
automatically in the future. processes, tasks, and performances that did not possess
them could be designated ‘non-automatic’ ” (p. 124).
Defining Automaticity and Fluency
One such list described three general properties
There has been a high degree of overlap in the use of essential to automaticity (Posner & Snyder, 1975),
terms such as “automaticity” and “fluency.” Most indicating that the behavior be carried out without
scholars treat automaticity as the more general term immediate intention, without conscious awareness, and
that embraces a wide variety of behaviors, ranging from without interfering with other process that are occurring
motor skills such as driving and typing to cognitive skills at the same time. Shiffrin and Schneider (1977)
such as reading. Some would prefer to reserve the term augmented this list to include two additional properties.
“fluency” for reading or other language phenomena. They claim that automatic processes are acquired
This distinction, however, is not universally recognized. gradually as the result of extended practice and that
For example, The Literacy Dictionary (Harris & once activated these processes continue to completion
Hodges, 1995) defines “fluency” as “freedom from because they are difficult to suppress. The importance
word identification problems that might hinder of practice in the development of automaticity is also
comprehension . . .” whereas, in the same source, evident in Ackerman’s (1987) description:
“automaticity” is defined as “fluent processing of
information that requires little effort or attention.” In Automatic processes are characterized as fast,
other words, automaticity and fluency are often used effortless (from a standpoint of allocation of
synonymously. cognitive resources), and unitized (or
proceduralized) such that they may not be easily
Actually, the fundamental idea of automaticity requires altered by a subject’s conscious control, and they
much more than that information be processed with may allow for parallel operation with other
little effort or attention. This definition has the information processing within and between tasks. . .
advantage of simplicity, but it suffers from the fact that These processes may be developed only through
it includes within its scope acts that result from innate extensive practice under consistent conditions,
forces. For example, many behaviors would fall within which are typical of many skill acquisition
this definition of automaticity—such as the avoidance of situations [p. 4, emphasis added].
a steep dropoff by newborn mountain goats or the eye
blinking and avoidance behaviors exhibited by 3-week- Logan (1997) applies the automaticity construct to
old infants at the rapid approach of a looming object— reading directly by highlighting the role of speed,
even though these are not highly skilled expert effortlessness, autonomy (i.e., ability to be completed
behaviors. A proper definition of automaticity would without intention or deliberation), and lack of
rule out behaviors that can be carried out without much consciousness or awareness, although he fails to
previous experience. Automaticity involves the emphasize the importance of practice or repetition
processing of complex information that ordinarily within his description. However, Logan emphasizes one
requires long periods of training before the behavior can more essential dimension of automaticity in reading that
be executed with little effort or attention. This definition makes his contribution essential to this discussion.
The property list approach defines automaticity well. Continued reading practice helps make the word
in terms of a list of binary-opposite properties. recognition process increasingly automatic. In some
. . . This view has suggested to some that situations, however, teachers may persist in trying to
automatic processes should share all of the develop a high degree of word recognition accuracy
properties associated with automaticity (i.e., without commensurate attention to other essential
they should be fast, effortless, autonomous, dimensions of fluency (i.e., speed, expression) or may
and unconsciousness) (Logan, 1997). accept recognition accuracy as a sufficient outcome of
instruction without any emphasis on true fluency.
However, according to Logan, automaticity should be
Although accuracy in word recognition is, indeed, an
viewed as a continuum rather than a dichotomy. This
important reading milestone, accuracy is not enough to
distinction has important implications for reading.
ensure fluency—and without fluency, comprehension
To show the importance of thinking of fluency as a might be impeded.
continuum, consider reading speed as one example.
Why do problems with reading accuracy, speed, and
Reading speed at the early stages of instruction tends to
expression interfere with comprehension? To answer
be slow and even labored. However, if we examine a
this question, we need to examine the reading process
student after years of practice, we will typically find
in terms of two basic cognitive tasks. The reader must
that a rapid rate of reading speed has been attained.
recognize the printed words (decoding) and construct
Was the shift from slow to fast an abrupt one in which
meaning from the recognized words (comprehension).
the reader was transformed from a nonfluent to a fluent
Both decoding and comprehension require cognitive
reader, or was this a more gradual change? This
resources. At any given moment, the amount of
question can be answered using data gathered as
cognitive resources available for these two tasks is
children practice reading over time. Such data reveal a
restricted by the limits of memory. If the word
gradual, continuous improvement in reading speed in
recognition task is difficult, all available cognitive
which only the beginning and end points could be
resources may be consumed by the decoding task,
justifiably characterized as “slow” or “fast.” Reading
leaving little or nothing for use in interpretation.
speed, like other aspects of fluency or other automatic
Consequently, for the nonfluent reader, difficulty with
behaviors, shows gradual or incremental improvement
word recognition slows down the process and takes up
through practice (Samuels, 1979).
valuable resources that are necessary for
Beyond Accuracy to Automaticity: comprehension. Reading becomes a slow, labor-
Why Automatic Decoding Matters intensive process that only fitfully results in
understanding.
One of the key reasons for the abiding interest in the
word recognition process is the consistent finding that The reading task for the fluent reader is easier than the
development of efficient word recognition skills is one facing the nonfluent reader. After considerable
associated with improved comprehension (Calfee & practice, the fluent reader has learned how to recognize
Piontkowski, 1981; Herman, 1985; Stanovich, 1985). To the printed words with ease and speed, and few
understand how efficient word recognition skills can cognitive resources are consumed in the process. In
influence other reading processes such as essence, the reader has become automatic at the word
comprehension, word recognition must be fractionated recognition task. Because the cognitive demands for
into its component elements such as accuracy of word word recognition are so small while the word
recognition and the automaticity of word recognition. In recognition process is occurring, there are sufficient
the early stage of reading instruction, the beginning cognitive resources available for grouping the words
reader may be accurate in word recognition but the into syntactic units and for understanding or interpreting
process is likely to be slow and effortful. With increased the text. The fluent reader is one who can perform
practice and repeated exposure to the words in the multiple tasks—such as word recognition and
texts that the student reads, word recognition continues comprehension—at the same time. The nonfluent
to be accurate but there would be improvements reader, on the other hand, can perform only one task at
evident in the speed and ease of word recognition as a time. The “multitask functioning” of the fluent reader
is made possible by the reduced cognitive demands overlap of these fixations improve in efficiency as well,
needed for word recognition and other reading allowing fluent readers to integrate the information from
processes, thus freeing cognitive resources for other each fixation more effectively (McConkie & Zola,
functions, such as drawing inferences. 1979; Rayner, McConkie, & Zola, 1980).
Being an “automatic” or “fluent” reader should not be Rayner (1998) has summed up the differences in eye
thought of as a stage of development in which all words movements between good and poor readers:
can be processed quickly and easily. Even highly skilled
There are well-known individual differences in eye
readers may encounter uncommon, low-frequency
movement measures as a function of reading skill:
words such as oenology, epistrophe, anfractuous,
Fast readers make shorter fixations, longer
faience, casuistically, and contralesional—words that
saccades [the jump of the eye from one fixation to
they cannot recognize automatically but that require
another], and fewer regressions than slow readers
some reliance on decoding strategies. Skilled readers
(Everatt, Bradshaw, & Hibbard, 1998; Everatt &
usually have several options available for word
Underwood, 1994; Rayner, 1978b; Underwood,
recognition. They can recognize words automatically or,
Hubbard, & Wilkinson, 1990) . . . . In characterizing
in cases like these, they can use controlled effortful
the eye movement patterns of dyslexic readers,
strategies to decode the word. Unskilled readers, on the
Olson, Kliegl, Davidson, & Foltz (1985) categorized
other hand, are limited to controlled effortful word
such readers as plodders and explorers; plodders
recognition.
made relatively short forward saccades, and more
Research on the eye in the past 2 decades has provided regressions, whereas explorers showed more
a perspective from which to observe the fluent reading frequent word skipping, longer forward saccades,
process. These studies take a picture of how the eye and more regressions (p. 392).
moves and what it fixates on during reading. For the
most part, readers—no matter how fluent—have to Indicators of Fluent Reading
fixate on or look at each word in a text. However, more A number of informal procedures can be used in the
skilled readers come to fixate on function words (words classroom to assess fluency. Informal reading
such as of, the, to, etc.) less often than on content inventories (Johnson, Kress, & Pikulski, 1987), miscue
words. It is not so much that fluent readers skip analysis (Goodman & Burke, 1972), pausing indices
function words as that their facility with such words (Pinnell et al., 1995), running records (Clay, 1972), and
allows them to see them adequately at the edge of their reading speed calculations (Hasboruck & Tindal, 1992).
visual field—while fixating on other words—without All these assessment procedures require oral reading of
having to stop to look at them specifically (Carpenter & text, and all can be used to provide an adequate index
Just, 1983; Rayner & Duffy, 1988; Radach & Kempe, of fluency.
1993). Skilled readers also get better at seeing a word
in a single fixation; therefore, they evidence fewer For example, informal reading inventories (IRI) require
refixations on the same words and fewer short students to read grade-level passages aloud and silently.
regressions in which they have to come back to look at The teacher determines a reading level by calculating
a word again after they have read other words (Frazier the proportion of words read accurately in the passage.
& Rayner, 1982; Kennedy, 1983; Kennedy & Murray, To ensure that students do not focus solely on
1987a, 1987b; Murray & Kennedy, 1988). Skilled fluency—at the expense of comprehension—the
readers learn to develop a broader perceptual span or student is expected to summarize or answer questions
word identification span during reading that allows them about the text.
to take in more information about words in a single
fixation (Ikeda & Saida, 1978; McConkie & Rayner,
1975; McConkie & Zola, 1987; Rayner, 1986;
Underwood & McConkie, 1985). The placement and
The Gray Oral Reading Test–3 (GORT–3) (Wiederholt There is ample evidence that one of the major
& Bryant, 1992) is a standardized measure requiring differences between poor and good readers is the
oral reading and providing scoring for reading accuracy, difference in the quantity of total time they spend
rate, and passage comprehension. In addition, Wagner, reading. Allington (1977) in his article “If they don’t
Torgesen, and Rashotte (1999) have recently published read much, how they ever gonna get good?” found that
a standardized measure of word reading efficiency that the students who needed the most practice in reading
tests the speeded reading of single words. spent the least amount of time in actual reading.
Biemiller (1977-1978) similarly reported substantial
The National Assessment of Educational Progress
ability group differences related to how much reading
fluency study noted earlier (Pinnell et al., 1995)
was done, and Allington (1984) in a sample of first
calculated speed and accuracy but performed most
grade students found that as little as 16 words were
analyses on the basis of a four-point pausing scale. This
read in a week by one child in a low-reading group
scale provided a description of four levels of pausing
compared to a high of 1,933 words for a child in a high-
efficiency with one point assigned to readings that were
reading group. Nagy and Anderson (1984) claimed that
primarily word by word with no attention to the author’s
good readers may read ten times as many words as the
meaning, to four points for readings that attended to
poor readers in a given school year. Stanovich (1986), in
comprehension and that paused only at the boundaries
his article “Matthew effects in reading,” suggested that
of meaningful phrases and clauses.
students who start out as poor readers often remain that
Fluency and Practice way. In the Bible chapter on Matthew (Matthew,
25:29), there is the phrase “The rich get richer and the
How does one become so fluent in reading that words poor get poorer.” Stanovich applied this Biblical phrase
are recognized accurately, quickly, and with ease and so as a metaphor to reading, claiming poor readers read
that a text sounds like spoken language when read less than good readers, and he speculated that because
aloud? The conventional wisdom is that it is only of this difference, year after year the gap between the
through extended practice in which large quantities of two groups increases. More recent empirical evidence
material are read that the student develops fluency skills indicates that while poor readers remain poor readers,
that go beyond accuracy of recognition to automaticity the gap between the two groups does not increase
of recognition (Allington, 1977, 1984; Snow, Burns, & (Shaywitz et al., 1995).
Griffin, 1998). But how accurate is conventional
wisdom? One might assume that with all the research Although correlational findings may be useful, they also
that has been done on factors that produce superior can be deceptive because correlations tell nothing about
readers, that there would be solid experimental the direction or sequence of a relationship. That good
evidence showing a causal connection between input readers read more could be because reading practice
variables such as time spent reading or the amount read contributes to reading attainment, but it could also be
and reading outcomes such as fluency. simply that better readers choose to read more because
they are good at it. If this is true, then it is reading
What is surprising is that most of the evidence linking achievement that stimulates reading practice, not the
up input variables such as amount read and output reverse. Although there is an extensive amount of
variables such as reading ability is correlational. For correlational data linking amount of reading and reading
example, in a longitudinal study of 54 children, Juel achievement (Cunningham & Stanovich, 1998;
(1988) estimated that 1st grade children with good word Krashen, 1993), such studies do not permit a clear
recognition skills were exposed to about twice as many delineation of what is antecedent and what is
words in basal text as children with poor word consequent.
recognition skills. Biemiller (1977-1978) also reported
similar differences in print exposure among readers What kinds of practice develop fluency? If fluency
with different levels of reading ability, and Taylor and were just a word recognition phenomenon, then having
her colleagues (Taylor et al., 1999) found that high- students reviewing and rehearsing word lists might
achieving primary classes allotted more time for make sense. Although there is some benefit to isolated
independent reading. word recognition study of this type, the evidence is that
such training is insufficient as it may fail to transfer
when the practiced words are presented in a Newer guided repeated oral reading techniques share
meaningful context (Fleischer, Jenkins, & Pany, 1979). several key features. First, most of these procedures
Competent reading requires skills that extend beyond require students to read and reread a text over and
the single-word level to contextual reading, and this skill over. This repeated reading usually is done some
can best be acquired by practicing reading in which the number of times or until a prespecified level of
words are in a meaningful context. proficiency has been reached. Second, many of these
procedures increase the amount of oral reading practice
In the sections below, the Panel examines the evidence
that is available through the use of one-to-one
supporting two major approaches to teaching fluency—
instruction, tutors, audiotapes, peer guidance, or other
first, repeated oral reading and then, silent reading
means. In round-robin reading, time was severely
practice.
limited because the teacher was the only one allowed to
Repeated Reading and Guided provide expert guidance; that is not true of the newer
procedures. Third, some of the procedures have
Repeated Oral Reading carefully designed feedback routines for guiding the
Although theories of fluency have emphasized the reader’s performance.
primacy of practice effects in reading development, The purpose of this section of the review is to provide a
most of the evidence has been correlational or research synthesis of empirical studies that have tested
ambiguous. Fortunately, several procedures for the efficacy of repeated reading and other guided oral
developing fluency directly through instructional reading procedures. The Panel’s purpose is to
practice have been proposed and evaluated during the determine whether the use of such procedures
past two decades. These procedures typically improves student fluency and whether such
emphasize repeated reading or guided oral reading improvements are evident in better reading
practice, including techniques such as repeated reading, comprehension, how appropriate such procedures
neurological impress, radio reading, paired reading, and would be for regular classroom application, and what
a variety of other similar procedures. The purpose of additional research is needed.
each of these procedures is to help students through
oral reading practice and guidance to develop fluent Repeated and Guided Repeated
reading habits that would allow them to read text more Oral Reading: Methodology
quickly, accurately, and with appropriate expression and
understanding. Database
Historically, most of the instructional attention accorded The Panel determined that the literature search for a
to oral fluency was developed through round-robin research synthesis must be conducted in a systematic,
reading, a still widely used approach in which teachers replicable way and that these procedures be described
have students take turns reading parts of a text aloud thoroughly. This methodology will allow others to weigh
(Opitz & Rasinski, 1998). These procedures have been the appropriateness of the procedures for answering the
criticized as boring, anxiety provoking, disruptive of research questions and to check for bias and error.
fluency, and wasteful of instructional time, and their use
has been found to have little or no relationship to gains Consideration of Extant Literature Searches.
in reading achievement (Stallings, 1980). It is evident This search started with the location of two published
that with round-robin procedures students receive little literature reviews on the impact of repeated reading
actual practice in reading because no child is allowed to procedures (Strecker, Roser, & Martinez, 1998: Toward
read for very long. Such procedures do provide students understanding oral reading fluency. Forty-seventh
with some guidance or feedback—although studies Yearbook of the National Reading Conference (pp. 295-
suggest that teachers vary greatly in their ability to 310); Dowhower, 1994: Repeated reading revisited:
provide this effectively (Pflaum & Pascarella, 1980). Research into practice. Reading and Writing Quarterly,
But even when this guidance is of high quality, students 10, 343-358). These literature searches were used in
rarely have the opportunity to perfect their performance two ways. First, they were examined carefully to
of a passage, as most texts tend to be read only once. identify appropriate terminology that could be used to
4. The study had to have been carried out with For this search, the target pool included 6,730 potential
English-language reading. items. This was reduced to 2,053 items on combination
with the focus pool of 39,694 items. This set was
If an article was clearly inappropriate in terms of these
further reduced to 840 potential articles by omitting non-
criteria, it was rejected without search. Rejected
English language reports and nonjournal articles. For the
articles were designated as (1) nonrefereed, (2)
sake of consistency, 1990 inclusive was again the cut-
nonresearch, (3) off topic/off sample, or (4) non-English
off year for the electronic search. This reduced the
language instruction. Although an abstract might
ERIC search to 410 potential items.
indicate several violations, only one needed to be noted
for an article to be rejected. A conservative application Of these 410 items, a review of the abstracts indicated
of these criteria was used to ensure the inclusion of any that only 50 of these had potential value for our
article that might be tangentially appropriate to our purposes. Many of these, however, had already been
search goals because this would allow us to make sense identified in the PsycINFO search and did not need to
of articles that could reveal important information about be double counted. Thus, the ERIC search resulted in
fluency learning. Because of this, analyses of the the identification of only 18 additional potential studies
relationships among various fluency measures, studies or articles.
of the correlation of fluency and comprehension, or
literature searches on related topics were all retained in Location of Articles
the pool at this stage. Such articles would not be used As a result of these two searches, the Panel set out to
for the final analysis of whether guided repeated oral find 99 articles on guided repeated oral reading. Of
reading procedures are effective, but they were used to these, the Panel was able to locate 76 articles, or 77%
help identify relevant studies outside the boundaries of of the total. Of the articles that could not be located,
these search procedures. As a result of this screening, only 11 met or appeared to meet all of the selection
the Panel attempted to locate 81 articles for further criteria; it was recognized that the other 12 papers did
consideration. not actually meet the criteria although these papers had
some apparent relevance to the topic. Of the 11 papers
The same basic terminology and search procedures the abstracts of which suggest that they might have met
were used in the ERIC system. The search for target the criteria, nine abstracts claimed positive and
pool items was identical to that carried out in substantial improvements in reading due to the
PsycINFO. Because ERIC uses a larger collection of procedures used, one reported no significant difference,
reading-relevant terminology, the focus pool was and one reported mixed results. It is possible that
expanded to ensure the widest possible inclusion of locating these missing studies could alter the findings of
reading articles. The focus pool included basal reading, this report. Any alteration, however, would likely
beginning reading, content area reading, critical reading, strengthen the support for guided oral reading
decoding, directed reading activity, early reading, procedures given that the vast majority of these appear
independent reading, individualized reading, oral reading, to provide evidence on that side of the equation.
reading, reading ability, reading achievement, reading
aloud to others, reading comprehension, reading Each of the 77 articles that were located was reviewed
difficulties, reading failure, reading habits, reading to determine its relevance to the topic and its adherence
improvement, reading instruction, reading material to the various selection criteria. Any study that
selection, reading materials, reading motivation, reading appeared to meet the criteria was then coded for
processes, reading programs, reading rate, reading possible use in the final analysis.
research, reading skills, reading strategies, recreational
reading, remedial reading, silent reading, speed reading, Further Identification of Articles
story reading, supplementary reading materials, OR The Panel’s search procedures were biased against
sustained silent reading. older studies of these instructional procedures. Only
studies that had been published since 1990 were
included in the selection procedures up to this point. To
expand on that set of studies in an effective manner, the
Panel analyzed the reference lists of all studies that
There were 16 articles in this set. These studies could Nine of these studies considered the impact of repeated
be directly evaluated through meta-analysis to test the reading (Faulkner & Levy, 1999; Levy, Nicholls, &
claim that guided repeated oral reading procedures Kohen, 1993; Neill, 1979; O’Shea, Sindelar, & O’Shea,
improve reading ability. 1985; Rasinski, 1990; Sindlar, Monda, & O’Shea, 1990;
Stoddard, Valcante, Sindlar, O’Shea, & Algozzine, 1993;
The third set of articles, Single Subject Studies, used
Turpie & Parratore, 1995; VanWagenen, Williams, &
multiple baseline single-subject designs to examine the
McLaughlin, 1994), although in other studies, repeated
impact of repeated reading and other guided oral
reading was combined with other procedures such as a
reading procedures on the reading abilities of students in
particular type of oral reading feedback (Reitsma, 1988)
grades K through 12 (see Appendix C). These studies
or phrasing support for the reader (Taylor, Wade &
had to have some measure of reading transfer. These
Yekovitch, 1985). Repeated reading studies either
studies could be used to directly evaluate the claim that
required a set number of repetitions (as few as one and
guided oral reading procedures improve reading ability,
as many as seven) or required students to practice
but they were not used in the meta-analysis. Data from
repetition for some amount of time or until some fluency
these studies were used to confirm or contradict the
criteria were reached. Other studies had students
meta-analysis results.
practicing oral reading while listening to the text being
The fourth set of studies, Methods Comparisons, read simultaneously (Bon, Boksebeld, Freide, & van
compared different methods for doing repeated reading den Hurk, 1991; Rasinski, 1990; Smith, 1979),
or guided repeated oral reading but did not have a true previewing a text through listening (Reitsma, 1988;
control group (see Appendix D). These studies were Rose & Beatty, 1986), or receiving particular types of
based on the assumption that guided repeated oral feedback during oral reading (Anderson, Wilkinson, &
reading procedures improve reading ability, and they Mason, 1991; Pany & McCoy, 1988).
were usually attempting to discern which methods work
All these interventions saw clear improvement, although
best. The lack of control group meant that these studies
some conditions were better than others. For example,
could not be used to evaluate the claim of whether
repeated reading with phrasing support seemed to be no
guided repeated oral reading improves reading ability,
better than repeated reading alone in a study of 45
but these studies could help guide any further analysis
good- and poor-reading 5th graders (Taylor, Wade, &
or help determine the applicability of such methods to
Yekovich, 1985), whereas repeated reading with
regular classrooms. There were eight of these studies.
feedback or guidance (Pany & McCoy, 1988) was
Repeated and Guided Repeated superior to repeated reading alone with 3rd graders.
Oral Reading: Results and These studies in their totality examined the reading of
Discussion 752 subjects ranging from 1st grade through college.
Four of these studies used normal populations, two
Immediate Effects Articles compared the performances of good and poor readers,
and the rest dealt with students who were somewhat
There were 14 studies found that dealt with the
below grade level, substantially behind grade level, or
immediate impact of different programs of repetition
designated as learning disabled. The studies found clear
and feedback during oral reading on the reading
improvements across multiple readings regardless of
performance of a specific passage or article. It is
students’ reading levels or age levels although greater
important to note that these studies did not fail to find
gains were sometimes attributed to poor readers. Given
transfer effects for these procedures, only that these
the lack of transfer measures in this study, the greater
studies did not attempt to measure such transfer. These
gains for low readers could be an artifact of the design
studies typically measured some aspects of fluency or
because these readers’ initial performances would be
comprehension with a particular passage and then
relatively more deficient and would therefore be most
monitored changes in this performance from one
amenable to improvement.
reading to another. Not surprisingly, all 14 studies
reported demonstrable improvements from a first
passage reading to a final passage reading with
whatever measures were used.
What inferences can be made from this set of studies? were calculated for each guided oral reading group
It certainly cannot infer that repeated reading or other compared with a control group, so if a study had two
guided repeated oral reading procedures would be experimental groups and one control group, there would
effective in raising reading achievement on the basis of be two effect sizes for each measure for that study.
these studies alone. However, the clear improvements However, if one of these experimental interventions
in reading rate, accuracy, and comprehension found for was not a form of guided repeated oral reading, no
a wide range of readers under a wide range of effect size would be calculated for that comparison, and
conditions suggest the possibility that such procedures those subjects would be dropped from the analysis.
could have transfer effects worth examining. Even with these omissions, because most studies
included multiple outcomes, 99 effect sizes were
Group Experimental Studies: calculated for direct comparisons of experimental and
Meta-Analysis control group performance. When multiple-effect-size
Sixteen studies met the criteria for inclusion in the statistics were calculated for a single study, the mean of
meta-analysis; these studies met the NRP review effect sizes for that study was calculated to determine a
methodology. Each of these studies had pre- and post- study effect size.
tests that allowed for an analysis of the improvement or Were Effect Sizes Greater Than Zero?
lack of improvement in reading and treatment and
control groups that would allow the changes in In all but two of the studies, comparisons resulted in
outcomes to be attributed to the instructional procedures significant differences for the guided repeated oral
of interest. Of the 16 studies, 2 did not provide reading groups over the control groups. Lorenz and
sufficient information to allow inclusion in the meta- Vockell (1979) found no benefit of these procedures for
analysis (Labbo & Teale, 1990; Lorenz & Vockell, LD students after 13 weeks of neurological impress
1979) although the findings of these studies will be training with either reading comprehension or
considered in this section and their data will be included vocabulary. The other study that did not result in a
in calculations wherever relevant and possible. The positive outcome (Mathes & Fuchs, 1993) compared
Lorenz and Vockell study found no differences because peer-mediated repeated reading with both peer-
of the treatments; however, the Labbo and Teale study mediated silent reading and a control group. There were
found clear improvement as a result of repeated no significant differences between these treatments
reading. with LD students in a special education setting. All
other comparisons significantly favored the guided
Although these studies were meta-analyzed, this repeated oral reading groups.
analysis does not go very far. That is, the NRP did not
attempt to evaluate all possible comparisons. Such Great variance was evident in these study effect sizes;
thorough analysis can be informative for future they ranged from as low as 0.05 (almost no effect) to
research, but given the national scope of this effort and as high as 1.48 (a substantial effect). The average of
the potential significance of these determinations, the these study effect sizes was 0.48. However, these
NRP decided to consider only questions that could be studies reported data on as few as 12 subjects and as
answered with a high degree of certainty (i.e., those many as 78. This means that the small studies would
that could be answered using all or most of these data). have as large an impact on this average as the largest
The studies in this set were conducted from 1970 to studies. A weighted average is probably more accurate
1996, and most were carried out in the 1990s. in this case, and it results in a study effect size average
of 0.41. The largest effect sizes were obtained with
Calculation of Effect Sizes some of the smaller samples, but this is probably an
Effect sizes were calculated for each relevant effect of the treatment features of these studies rather
comparison. These effect sizes used either the d index than an artifact of sample size. The smaller studies
(Cooper, 1998, p. 128) or the d index calculated from were less likely to use peer tutors; that is the students in
the F tests (Cooper, 1998, p. 129). When there were the small studies received guidance and feedback from
multiple experimental groups in a study, effect sizes adults (teachers or researchers) rather than from other
kids. These effect sizes, weighted or not, suggest that was not as great as this suggests. These 16 studies
guided oral reading procedures have a moderate impact included 398 students who were selected as poor
on the reading achievement of the types of students readers (although data on only 324 of them were used
who participated in these studies. in the meta-analysis) and 281 good readers.
Characteristics of Students The average effect sizes for these two groups of
studies (those examining low-level readers and those
These 16 studies included data from 752 elementary that considered average readers) were highly similar
and secondary education students. The data were and close to the overall average (0.49 for the nine low-
drawn from students from six U.S. states and two other level reader studies and 0.47 for the five average-
countries. The students attended 47 different schools reader studies). When weighted by sample sizes, the
(one study did not report the number of schools so this average effect sizes diverged more but, surprisingly, the
is an underestimate) and 98 classrooms (again, an nonimpaired reader studies showed the superior
underestimate because five studies, including some with outcomes (0.50 versus 0.33). This is probably
relatively large sample sizes, did not provide this attributable, at least in part, to the longer time evident in
information). Not all were included in the analyses, the nonimpaired reader studies (an average of 24 to 25
however. As has been noted, two studies provided clear hours in nonimpaired reader studies but only about 18 to
experimental evidence concerning the efficacy of the 19 hours in the poor-reader studies).
procedures but failed to include sufficient information
for effect size calculation. These studies reported data Although some of the studies speculated that poor
on 74 subjects, and they were not included in effect size students might benefit more from these procedures,
calculations. Also, given that not all comparisons within fluency is developmental and students must continue to
each study were relevant to our research questions, the meet the challenge of increasingly more difficult text as
Panel dropped from its analysis the data from an they develop as readers. It is possible, as Faulkner and
additional 73 subjects. Thus, the meta-analysis is based Levy (1999) have shown, that good and poor students
on data from 605 students. benefit from different aspects of this treatment, with
poor readers learning more about the words and good
The students in these studies ranged from grade 2 readers developing a stronger command of the prosody
through grade 9. The studies that focused on average of the passages. All of these studies tried to assign
reading level samples or normal classroom populations students to materials considered to be of appropriate
focused on students in grades 2 through 4, while studies levels of difficulty for the particular students, and this
of poor readers included students from grades 2 through masks or complicates the true meaning of the
9, with most of these drawn from the upper elementary performance disparity for good and poor readers.
grades. These studies as a collection have not provided
sufficient data to allow for a sound analysis of the Properties of Instructional Approach
relative impact of repeated reading procedures on
Many different instructional procedures were examined
students at different grade levels. It is evident from the
in these studies, so many that it is impossible to
studies included in this set that repeated reading
determine the best of the few studies. No method was
procedures have a clear impact on the reading ability of
used so often that a reliable estimate of effect size
nonimpaired readers at least through grade 4, as well as
would be possible. Also, variations across studies are
on students with various kinds of reading problems
subtle in terms of material selection and amount and
throughout high school. Future research needs to
type of repetition and feedback. Some treatments were
determine at what point such instruction is no longer
delivered by teachers or researchers, some by parents,
beneficial to normal readers.
some by other students, and some by the students
Eleven of these studies (including the two not used in themselves with computers or tape recorders. The
the meta-analysis) focused on poor readers, whereas treatments went under names such as neurological
only five studied average classrooms. The sample sizes impress, repeated reading, peer tutoring, shared reading,
of these studies differed so much, however, that the assisted reading, and oral recitation method. All were
disparity between numbers of average and poor readers associated with positive effect sizes. Some might be
better, or better in particular circumstances, but the Finally, four of the comparisons considered aggregate or
sample sizes associated with any of these associated full-scale reading scores (these tended to be
treatments were too small to allow for a meaningful combinations of the other measures noted above) and
partialing of variance. Given what is known, all of these included both full-scale scores from standardized tests
procedures seem to have a reasonably high likelihood of of reading and reading-level scores derived from
success. informal reading inventories. The average effect size
for these four aggregate comparisons from four
Outcome Measures different studies was 0.50.
These studies used a range of outcome measures,
Implications for Reading Instruction
including tests of word knowledge, comprehension, and
fluency, as well as combinations of these as overall As expected, the biggest effect of these procedures
scores derived from standardized reading measures. was on word recognition and fluency measures, with
Some studies had multiple comprehension or fluency the smallest effects evident in reading comprehension.
measures as well. The Panel attempted to determine It appears that oral reading practice and feedback or
whether these guided procedures had a greater impact guidance is most likely to influence measures that
on some aspects of reading than on others. These assess word knowledge, reading speed, and oral
studies made 99 different comparisons that were accuracy. Nevertheless, the impact of these procedures
relevant to the analyses. Only one pooled effect size on comprehension (and on total reading scores) is not
per study per category (word recognition, fluency, inconsiderable, and in several comparisons it was
comprehension, total score) was drawn from each actually quite high. These changes in comprehension
study, and each of these was weighted by the numbers might take place simultaneously, with the improvements
of subjects whose data were represented in each. in word recognition and fluency mediating the
improvements in comprehension, or there could be a
Across these studies, considering all sample
hierarchical order to this, as Faulkner and Levy (1999)
comparisons and all measures, there were 49 different
have speculated, with the lowest level readers
comparisons that used some form of comprehension
improving in word recognition and the highest ones in
test as an outcome measure. They included
comprehension.
standardized tests of reading comprehension in which
students read passages and answered multiple choice Studies Using Single-Subject Designs
questions, as well as informal measures such as
questions and passages, retellings, and maze tests. The Twelve additional studies reported experiments that
mean weight effect size for these 49 comparisons used single-subject designs. See Appendix C for a list
drawn from 12 separate studies was 0.35. of these studies. The single-subject studies, because of
their designs, were not combined in the meta-analysis,
There were 35 comparisons that used some fluency although the data were examined to evaluate the
measure as an outcome. They included standardized conclusions drawn from the meta-analysis. These
tests of reading rate and accuracy, as well as informal studies focused on the reading of small groups of
measures of these using instruments such as informal students, as few as 2 and as many as 13 (an average of
reading inventories. The mean weighted effect size for 4 to 5). All these studies addressed the learning needs
these 35 comparisons drawn from 10 different studies of elementary grade students with learning problems
was 0.44. (i.e., special education, learning disabilities, autism,
disfluent readers, readers substantially below grade
There were 11 comparisons that used some measure of
level). All these studies provided some kind of one-to-
word recognition. They included standardized tests of
one tutoring to students (sometimes parent or peer
word knowledge as well as informal measures that
tutoring) or repeated reading work with tape recorders,
examined students’ ability to read particular words or
for varying lengths of time (as little as 4 weeks and as
word lists. The mean effect size for these 11
long as 1.5 years, with most treatments lasting fewer
comparisons drawn from eight different studies was
than 10 weeks).
0.55.
With one exception (Law & Kratochwill, 1993), all time studied. These studies provided comparisons of the
these studies found clear and substantial improvements efficacy of various oral reading procedures or were
in reading accuracy, speed, or comprehension. The best meant as feasibility studies to evaluate the classroom
of these studies calculated a clear reading performance readiness of the procedures.
baseline over several days. Then they intervened with
There were not enough comparisons of guided repeated
repeated reading, oral reading feedback, or reading-
oral reading procedures to allow for a systematic
while-listening treatments and monitored student growth
determination of best procedures. For the most part, the
with new materials during the treatment and with
comparisons that were done resulted in no differences.
standardized tests at the conclusion. For example, Blum
In other words, each of the procedures examined did
and colleagues (1995) found that the introduction of
about as well as the others. Some of the comparisons
repeated reading with tape recorders led to marked
that were made included repeated reading with and
improvements in student reading performance; that
without feedback (Dowhower, 1987), guided repeated
when the training ended, the students maintained their
reading and assisted nonrepetitive reading (Homan,
gains; but when the intervention ended, the accelerating
Lesius, & Hite, 1993), and various peer or parent
improvement ceased. Another example of a well-
tutoring procedures in which students read aloud
designed, single-subject study was reported by Kamps
together or read to their parents (Lindsay, Evans, &
and her colleagues (Kamps, Barbetta, Leonard, &
Jones, 1985; Winter, 1986, 1988). The lack of clear
Delquadri, 1994).
differences among procedures is consistent with the
The one study that found no effects resulting from findings of the meta-analysis and again suggests the
paired reading of students with parents also found no robustness of these procedures for stimulating reading
improvements in word accuracy or reading speed after improvement.
6 weeks of treatment. This study had an especially
One exception to the no-differences finding, which
weak design (failed to calculate a stable baseline in
should be noted, was reported by Rashotte and
student reading performance and did not check on
Torgeson (1985). They did not vary the procedures, but
fidelity of treatment). In any event, no gains were found
tried out passages that either shared or did not share
in this study of lst through 3rd grade students.
lots of words with the outcome measures. They found
The pattern of findings for these studies is almost clear gains after 3 weeks for the passages with shared
identical to what was reported in the meta-analysis. words but not for those without. This suggests that, at
Most, but not all, of the studies reported clear least for very poor readers, the first thing that is
improvements. The changes described here were a bit probably learned from repeated reading is the words
larger in magnitude, but all but one of these studies (Faulkner & Levy, 1999) and that this growth might be
were conducted with a one-to-one teacher-student ratio facilitated by using passages that share lots of
and all were carried out with low-level—sometimes vocabulary.
very low-level—readers, and either of these factors
Only one study was found that directly evaluated the
could magnify the effect. Again, the conclusion is that
feasibility of these procedures for use in regular school
repeated reading and other related oral reading
settings, though several of the studies already noted
procedures have clear value for improving reading
have done just that. Dixon-Krauss (1995) conducted a
ability.
feasibility study of partner reading with 24 1st and 2nd
Methods Comparisons graders in regular classrooms. The program proved to
be manageable for the regular classroom teachers, and
Nine additional experiments were located that dealt the students were positive about the activity. What was
with repeated reading and other guided repeated oral so notable about this study was that it focused on the
reading procedures. None of these studies used a true teacher’s abilities to use these procedures on a targeted
control group, however, so it is not clear whether these basis with struggling readers, rather than with whole
gains were greater than expected in the amounts of classes. The findings from this study are consistent with
the findings of the other studies that considered
classroom effects, including Rasinski’s (1990), which
had regular classroom teachers applying such Repeated and Guided Repeated
procedures on a classwide basis for almost an entire
Oral Reading: Directions for Further
school year. Several other studies showed that regular
teachers, with little or no extra training, could Research
successfully use these procedures (for instance, Conte There is a need for more research on these issues.
& Humphrey, 1989; Labbo & Teale, 1990; Reutzel & Clearly there is a need for longitudinal research that
Hollingsworth, 1993; and Shany & Biemiller, 1995). examines the impact of these procedures on the reading
There were also several special education studies in development of normal readers at different points along
which students provided peer tutoring to their the continuum. The methods used should be
classmates under the direction of their teachers characterized not by labels such as repeated reading,
(Mathes & Fuchs, 1993; Simmons et al., 1994; but by treatment descriptions that are explicit with
Simmons et al., 1995). Teachers, parents, or peer tutors regard to how much rereading there is, the nature and
at most were provided 1 to 4 hours of training, and timing of the feedback, and the level of difficulty of the
usually the procedures did not require special materials materials. Some effort should be made to document the
(though some interventions used tape recorders or changes that take place in student reading and
elaborate computerized tutoring). knowledge during the intervention rather than just at the
Implications for Reading Instruction end.
Longitudinal studies of the impact of these procedures
Increasingly, teacher educators and educational
on nonimpaired readers could clarify how long the
researchers and theorists have called for more attention
benefits can be maintained. It would be especially
to direct instruction in fluency. Various procedures have
useful if these were examined under various conditions
been proposed for teaching students to read quickly,
in terms of passage difficulties and feedback
accurately, and with proper expression, though it is
procedures. However, given the clear and substantial
evident that this remains a serious weakness among
improvements produced by a wide range of reading
many schoolchildren.
procedures, the Panel thinks it advisable that teachers
A very thorough search for studies that evaluated the include such activities in their regular instructional
efficacy of various guided repeated oral reading routines at least during the elementary grades, and
procedures was made. Those studies provide a certainly with struggling readers.
persuasive case that repeated reading and other
One word of caution can be drawn from a short-term
procedures that have students reading passages orally
study (Anderson, Wilkinson, & Mason, 1991) that found
multiple times while receiving guidance or feedback
that too much attention to fluency issues within a
from peers, parents, or teachers are effective in
reading lesson could detract from reading
improving a variety of reading skills. It is also clear that
comprehension. It should be noted that in all of these
these procedures are not particularly difficult to use; nor
studies, the fluency work was only part of the
do they require lots of special equipment or materials,
instruction that students received. In most cases, the
although it is uncertain how widely used they are at this
fluency work was relatively brief (15 to 30 minutes per
time. These procedures help improve students’ reading
lesson), and students who received these lessons were
ability, at least through grade 5, and they help improve
still engaged in other reading activities including
the reading of students with learning problems much
comprehension instruction. Guided repeated oral
later than this.
reading and repeated reading provide students with
practice that substantially improves word recognition,
fluency, and—to a lesser extent—reading
comprehension. They appear to do so, however, in the
context of an overall reading program, not as stand-
alone interventions.
As with the search on repeated reading and guided oral free reading recreational reading
reading, it is important to proceed in a systematic, voluntary reading independent reading
replicable way and to describe these procedures SSR sustained silent reading
thoroughly so that others can examine this work USSR uninterrupted sustained
critically. SQUIRT silent reading
DEAR super quiet reading time
Consideration of Extant Literature Searches reading volume Matthew effects
This search started with the location of a published summer reading volume of reading
literature review on the impact of reading [Cunningham reading amount reading time
& Stanovich (1998). What reading does for the mind. book flood amount of reading
American Educator, 22(1-2), 8-15.] This paper was community literacy leisure reading
examined carefully to identify appropriate terminology Accelerated Reader self selection
that could be used to conduct a thorough electronic leisure time choice behavior
search of the literature, and the reference list from that Magazine Recognition Author Recognition Test
study was examined for additional, potentially relevant Test free voluntary reading
studies on this topic. Input hypothesis Stephen Krashen
articles were deleted, 909 articles remained. Because 3. The study itself had to have appeared in a refereed
this was judged to be too many to search for, the Panel journal.
limited the search to 1991 (inclusive) and identified 478
4. The study had to be have been carried out with
potential articles in the intersection of the target and
English language reading.
focus pools for those years.
If an article was clearly inappropriate in terms of these
Next the Panel completed a similar search of the ERIC
criteria, it was rejected without search. Rejected
system. The Panel used all the terms listed in Table 2 to
articles were designated as (1) nonrefereed, (2)
develop a target pool. This resulted in the identification
nonresearch, (3) off topic/off sample, or (4) non-English
of 5,645 possible articles published since 1984. The
language instruction. Although an abstract might have
Panel then developed a focus pool using the terms:
had several violations, only one needed to be noted for
basal reading, beginning reading, content area reading,
an article to be rejected. As a result of this screening,
corrective reading, critical reading, decoding, directed
the Panel attempted to locate 92 articles for further
reading activity, early reading, functional reading,
consideration.
independent reading, individualized reading, informal
reading inventories, reading, reading ability, reading Location of Articles
achievement, reading assignments, reading attitudes,
Of the 92 articles on encouraging students to read
reading comprehension, reading difficulties, reading
more, the Panel was able to locate 82, or 89% of the
failure, reading habits, reading improvement, reading
total. Each of the 79 articles that was located was
instruction, reading interests, reading material selection,
reviewed to determine its relevance to the topic and its
reading materials, reading motivation, reading
adherence to the various selection criteria. Any study
processes, reading programs, reading rate, reading
that appeared to meet the criteria was then coded for
research, reading skills, reading strategies, recreational
possible use in the final analysis. Only nine papers
reading, remedial reading, silent reading, story reading,
survived this review because most of these turned out
supplementary reading materials, OR sustained silent
to be correlational studies that just attempted to test
reading. There were 38,799 potential articles in the
whether better readers read more, something that the
focus pool that included 1984. These were then crossed
Panel accepts as already proven.
with the target pool, and this led to the identification of
1,669 potential articles, which were then limited to Additional Identification of Articles
journal articles written in the English language (655 The Panel’s search procedures neglected older studies
articles), with 325 of these published since 1991. of these instructional procedures. Only studies published
Analysis since 1991 had been included in the selection
procedures up to this point. To expand on this set of
The NRP combined the two searches to eliminate studies in an effective and efficient manner, the Panel
duplication and found 603 unique articles on these topics analyzed the reference lists of all studies that were
as a result of the two searches. Each abstract was read located through the previously described procedures.
and coded to determine whether to include it in this Even studies that were determined to be in violation of
analysis. The criteria for inclusion were that: the final selection criteria were analyzed in this way.
1. The study had to be a research study that appeared This led to the consideration of 46 additional papers, and
to consider the effect of encouraging students to of these, the Panel was able to locate 42 or 91%. For
read more on reading achievement. the most part, these second-generation papers had been
published before 1990. Of the 42 papers, 10 appeared
2. The study had to focus on English reading to meet all of the selection criteria. These 10 studies
education, conducted with children (K-12). were added to the 9 previously identified, and these
were designated for further examination and coding.
On closer examination, the Panel discovered that five of Sustained Silent Reading (SSR)
these studies were actually correlational studies and not One study of SSR (Evans & Towner, 1975) compared
experimental studies. This left only 14 studies with the effect of SSR on reading achievement with that of
potential for answering this question. having students complete various reading skills
exercises with commercial materials (i.e., worksheets).
Consistency With NRP Methods
Reading gains were identical for both groups of 2nd
The methods of the NRP were followed in the conduct graders at the end of 10 weeks.
of the literature searches and the examination of the
In a similar, though larger study, Reutzel and
articles obtained. However, in the case of these 14
Hollingsworth (1991) compared skills practice and SSR
studies, the Panel quickly realized that there were very
with 61 4th graders and 53 6th graders. These
few papers. Furthermore, the Panel evaluated a variety
procedures were used for 1 month, and there were,
of procedures and found that many of the papers
again, no reading differences for the two approaches.
suffered from especially weak research design. Several
As with the previous study, the skills work was
of these 14 studies, although they met the selection
assembled by the researchers specifically to serve as a
criteria, could not be analyzed because of serious
control activity, and was not part of the regular
methodological or reporting flaws that undermined their
instructional program that these students received from
results. Because of these concerns, the Panel did not
their teachers.
think it appropriate to carry out a meta-analysis of the
data. The Panel’s concern was that the meta-analysis Collins (1980) conducted an analysis of the impact of
would be potentially misleading given the very limited SSR on the reading achievement of 220 students from
data set that would be used for the analysis. Thus, this ten classrooms in grades 2 through 6. Students were
set of studies prohibited the of the NRP criteria for randomly assigned to the experimental and control
multiple studies. groups. This daily program was evaluated after 15
weeks (different grade levels allotted different amounts
Encouraging Students to Read of time to SSR—2nd graders had 10 to 30 minutes per
More: Results and Discussion day; 3rd graders received 15 minutes daily; 4th graders,
30 minutes; and 5th and 6th graders, 15 to 25 minutes
Description of the Studies each day). The control group worked on spelling during
Given that only 14 studies fit the selection criteria, it these time periods. The SSR procedures led to no
seems reasonable to summarize each one. The studies significant differences in vocabulary or comprehension
are listed in Appendix E. Most of the 14 studies as measured by various standardized tests, although the
examined the impact of sustained silent reading (SSR), SSR groups appeared to move slightly faster through
but some other approaches were also studied. SSR their basal readers during this period.
goes under a variety of labels including USSR Langford and Allen (1983) examined the impact of SSR
(uninterrupted sustained silent reading), DEAR (drop on the reading attitudes and achievement of 11 5th and
everything and read), and SQUIRT (super quiet reading 6th grade classes. These classes were randomly
time). In most cases, these procedures require the assigned to SSR or control conditions, resulting in 131
provision of approximately 20 minutes per day in which students in the SSR group (60 5th graders and 71 6th
students are allowed to read material silently on their graders) and 119 students in the control group. Students
own with no monitoring. In most cases, the students in the control group learned about health and grooming
select their own material, and there is no discussion or while the SSR activities took place with the
written assignment tied to this reading. Teachers and experimental subjects. The study failed to report the
other adults in the school setting are to read during this length of the instructional period or the duration of the
time as well. Such programs are described in nearly all intervention. Although there was significantly better
teacher preparation textbooks and have become widely improvement in word reading for the SSR group, these
popular in American classrooms in both elementary and differences appear to be small in terms of educational
secondary schools.
importance. In any event, it is difficult to evaluate the In one of the best-designed studies on SSR, Holt and
value of these gains without more information about the O’Tuel (1989) randomly assigned teachers and 211 7th
length of the program. There were no differences in and 8th grade students to an SSR condition and a
reading attitude that resulted from the intervention. regular reading instruction condition. Students in the
SSR condition read self-selected materials for 20
In still another evaluation of SSR, this one conducted in
minutes per day for 3 days each week, and they carried
a junior high school, Cline and Kretke (1980) examined
out sustained silent writing for two additional 20-minute
the effectiveness of the procedure over a 3-year period.
periods each week. During the time these activities
This study compared the reading achievement of 111
were carried out, the control group subjects worked on
students who had been enrolled for 3 years at a junior
their regular reading instruction. At the end of 10
high school that was using SSR with that of control
weeks, the students in the SSR groups had evidenced
group students drawn from two other schools that did
greater growth in vocabulary knowledge than was true
not have this program. This study found no differences
for the control subjects. Reading comprehension did not
between the two groups. However, it was poorly
improve for either group, however.
designed, and it would be impossible to be certain
whether there were gains. The study apparently Burley (1980) randomly assigned 85 high school
compared gains between different achievement tests students enrolled in an Upward Bound summer program
used at different grade levels (something that is not at a local college to one of four groups: SSR,
statistically sound), and it failed to provide any programmed textbooks, programmed cassette tapes,
information about the length of the SSR time or how and programmed skill development kits. The students in
this time was used at the control school. all groups received 75 minutes of reading instruction per
day for 30 days, but part of this time was devoted to the
Davis (1988) considered the effect of SSR on reading
SSR or other practice activities. In all, students
comprehension with 8th graders. Fifty-six students
practiced reading for about 14 hours in addition to the
were randomly assigned to one of two English classes.
summer reading instruction during this 6-week period.
These classes met daily for 50 minutes. Approximately
This study found a small, positive, statistically significant
half the time was devoted to either SSR or, alternatively,
difference favoring SSR over the other procedures on
to directed reading activities with the teacher. This
reading comprehension but no differences on a
effort continued for an entire school year. Although the
vocabulary measure.
researcher intended to analyze these data for high-,
medium-, and low-ability students separately, attrition in Summers and McClelland (1982) examined the effect
the low-ability groups rendered this impossible. Two of a 5-month program of SSR with 65 intact treatment
comparisons were made for the high- and medium- and control classes from nine elementary schools. They
ability groups, and it was found that the medium-ability found no significant differences in covariance-adjusted
students made much greater gains with SSR than with mean scores from standardized and informal reading
directed reading (n = 19), but there were no significant achievement and attitude measures and no significant
differences among the two high-ability groups (15 interaction effects for reading achievement, attitude,
students in these two groups). The gains attributed to grade level, and sex. This study included approximately
SSR for the medium-ability group were substantial and 1,400 children. This study was unique not only in terms
educationally meaningful (about 1 year of difference on of its extensive sample, but also in that it carefully
a standardized test). Unfortunately, the study is monitored the delivery of the treatments.
somewhat sketchy in terms of the statistical analysis: it
In yet another study of SSR (Manning & Manning,
provided no means or standard deviations and told little
1984), three variations of SSR were tested with 4th
about the analysis of covariance that was used (i.e.,
graders. These variations were compared across an
How big were the initial differences across the groups?
entire school year with a poorly described control
Was heterogeneity tested?).
group. Students (n = 415) from 24 classrooms were
assigned to the four groups (intact classes were
randomly assigned). The treatment lasted for an entire
school year. This study found that two of the SSR
variations led to higher reading achievement and that valid to subtract these test scores from each other.
one did not. The pure SSR variation (i.e., the one that Given this serious problem and the limited data reporting
matched the recommended procedures), in which that was evident, it is unclear whether any real
students read for an extra 35 minutes per day, led to no difference in achievement can be attributed to this
greater reading growth than was evident for the control program on the basis of this study.
group. However, when SSR was coupled with teacher
In another study of the Accelerated Reader (Vollands,
conferences or peer discussion, then slight improvement
Topping, & Evans, 1999), two small experiments were
in reading was evident for the SSR groups. This
carried out. In one experiment, there was a small
suggests that reading alone might provide no clear
advantage due to participation in the program; in the
benefit but that additional reading in combination with
other, there was not. Neither study had well-matched
other activities could be effective.
samples of students, and in the study that demonstrated
Not all the studies in this category focused on SSR, an advantage, students also used a form of assisted
however. Morrow and Weinstein (1986), for instance, reading similar to those examined earlier in this paper.
worked with six 2nd-grade reading classes to determine
Carver and Liebert (1995) provided one of the clearest
the efficacy of being involved in either a home- or
tests of the effect of reading by studying students
school-based voluntary reading program in terms of
during the summer. This study did not have a control
amount of reading and reading achievement. This
group but simply examined the reading scores at the
program, which provided students with enriched library
beginning of the program and 6 weeks later after the
materials and extended reading time, lasted for 9
students had completed approximately 60 hours of self-
weeks. Students did more school reading as a result of
selected reading. These students, in 3rd through 5th
being in this program, and they continued to do so when
grades, made no gains in reading achievement at all,
the program ended, but achievement levels in reading
even though the books were at an appropriate level.
were unrelated to program participation, and the
program did not alter reading attitudes or the amount of Encouraging Students to Read
home reading.
More: Implications for Reading
Accelerated Reader (AR) Instruction
AR is a commercial program designed to increase the
None of these studies attempted to measure the effect
amount of reading that students do with appropriate
of increased reading on fluency. Instead, most of these
materials. Peak and Dewalt (1994) compared reading
studies considered the impact of encouraging more
gains for two schools, one that used this program and
reading on overall reading achievement as measured by
one that did not. To make this comparison, they
standardized and informal tests. It would be difficult to
randomly selected 50 9th graders from each school. To
interpret this collection of studies as representing clear
be selected, a student had to have attended these
evidence that encouraging students to read more
schools since grade 3. Because standardized reading
actually improves reading achievement. Only three
test scores (California Achievement Test) were
studies (Burley, 1980; Davis, 1988; Langford & Allen,
available for each school at 3rd, 6th, and 8th grades,
1983) reported any clear reading gains from
comparisons were made between these two groups at
encouraging students to read, and in the third of these
each point. They found a slight reading advantage in 3rd
studies the gains were so small as to be of questionable
grade scores for the school that did not use AR and a
educational value. Most of the studies, including the
slight advantage for the AR group at the end of the year.
best designed and largest ones (Collins, 1980; Holt &
Students in the AR group had taken part in 5 to 6 hours
O’Tuel, 1989; Summers & McClelland, 1982), reported
per week of in-class reading during the 5 years of this
no appreciable benefit to reading from such procedures
study, but there is no information on what the other
(Holt & O’Tuel found improvement in vocabulary
students were doing during this time. More problematic
scores, but these did not translate into better reading
is the calculation of gain scores across forms of a
comprehension). The most direct test of the effect of
standardized test. The scores of each of these normative
reading on learning was provided by Carver and Liebert
grade level tests are independent scales, and it is not
(1995), and they found no clear benefit resulting from programs to encourage more reading if the intended
60 hours of additional reading. Perhaps 60 hours of goal is to improve reading achievement. It is not that
reading is insufficient for improving achievement in a studies have proven that this cannot work, only that it is
measurable way. yet unproven.
Only two of the studies compared SSR with nonreading There are few beliefs more widely held than that
instruction (Collins, 1980; Langford & Allen, 1983). teachers should encourage students to engage in
One of these found no benefit, and the other found a voluntary reading and that if they did this successfully,
very small benefit from SSR. More of the studies better reading achievement would result. Unfortunately,
compared additional reading time with reading research has not clearly demonstrated this relationship.
instruction itself. Often these studies interpreted the In fact, the handful of experimental studies in which this
lack of difference between SSR and the control idea has been tried raise serious questions about the
condition as meaning that SSR was as good as some, efficacy of some of these procedures.
usually unspecified, form of reading instruction.
Comparing SSR with instructional routines that have no Encouraging Students to Read
evidence of success—or whose success has been More: Directions for Further
found to be unrelated to achievement gains (Leinhardt, Research
Zigmond, & Cooley, 1981)—is meaningless. Although
several reviews of the literature have concluded that There is a need for rigorous evaluations of the
procedures like SSR work simply because reading effectiveness of encouraging wide reading on reading
achievement does not decline once they are instituted, achievement, particularly with popular programs such
that is not a sound basis on which to recommend such as SSR, DEAR, and AR. These studies need to monitor
procedures as effective. SSR may or may not work, but the amounts of reading—in and out of school—by both
it is unreasonable to conclude that it does on the basis the experimental and control group students. To really
of such flawed reasoning. For the most part, these understand the implications of such reading, it is
studies found no gains in reading due to encouraging important to compare these routines against procedures
students to read more. It is unclear whether this was in which students actually read less. Without such
the result of deficiencies in the instructional procedures information, one might only be comparing the effects of
themselves or to the weaknesses and limitations evident different forms of reading practice rather than
in the study designs. comparing differences in amount of reading practice.
Finally, none of these studies could even demonstrate
It is impossible to sustain a negative conclusion with that they clearly increased the amount of student
research. That is, the NRP cannot ultimately prove that reading because none of them measured an adequate
a procedure or approach does not work under any baseline of current or previous reading engagement.
conditions. No matter how many studies show a lack of That, too, should be addressed in future studies.
effect due to an instructional routine, it is always
possible that under some yet-unstudied condition the That encouraging more reading does as well as certain
procedure could be made to work. Given the paucity of instructional activities in stimulating learning does not
studies on increasing the amount of student reading— speak well of those instructional activities. Voluntary
and the uneven quality of much of this work—there is a reading within the school day should be compared
need to be especially cautious. Few of the studies against nonreading activities or activities in which the
reviewed here provided much monitoring of the amount amount of reading can be closely measured. (In fact,
of reading that students actually did in the programs, the field should consider adopting a new research
and only one kept track of the control student reading; convention for methodological studies with students in
therefore, in most cases, it is unclear whether the the 2nd grade or higher. The amount of gain attributable
interventions actually led to more reading or just to reading alone should be the baseline comparison
displaced other reading that students might have done against which the efficacy of instructional procedures is
otherwise. Nevertheless, given the evidence that exists, tested. If an instructional method does better than
the Panel cannot conclude that schools should adopt reading alone, it would be safe to conclude that method
works.) Studies should consider the effect of increasing
student reading on both fluency and overall reading There is clear and substantial research evidence that
achievement. However, until such evidence is shows that such procedures work under a wide variety
forthcoming, the National Reading Panel cannot of conditions and with minimal special training or
indicate that research has proven that such procedures materials. Even with this evidence, there is a need for
actually work. more research on this topic, including longitudinal
studies that examine the impact of these procedures on
Overall Conclusions different levels of students over longer periods. It would
Fluency is an essential part of reading, and the NRP has also be worthwhile to determine the amount of such
reviewed its theoretical and practical implications for instruction that would be needed with most students and
reading development. In addition, the Panel has the types of materials that lead to the biggest gains
conducted two research syntheses, one on guided oral when these procedures are used.
reading procedures such as repeated reading and the The results of the analysis of programs that encourage
other on the effect of procedures that encourage students to read more were much less encouraging.
students to read more. These two procedures have Despite widespread acceptance of the idea that schools
been widely recommended as appropriate and valuable can successfully encourage students to read more and
avenues for increasing fluency and overall reading that these increases in reading practice will be
achievement. translated into better fluency and higher reading
The NRP found a better, and more extensive, body of achievement, there is not adequate evidence to sustain
research on guided oral reading procedures. Generally, this claim. Few studies have attempted to increase the
the Panel found that these procedures tended to amount of student reading. Those that have investigated
improve word recognition, fluency (speed and accuracy such issues have tended to find no gains in reading as a
of oral reading), and comprehension with most groups. result of the programs. This does not mean that
Although there has been some speculation that fluency procedures that encourage students to read more could
development is complete for most students by grade 3 not be made to work—future studies should explore this
or 4, the Panel’s analysis found that these procedures possibility—but at this time, it would be unreasonable to
continue to be useful far beyond that—at least for some conclude that research shows that encouraging reading
readers. Repeated reading and other guided oral has a beneficial effect on reading achievement.
reading procedures have clearly been shown to improve
fluency and overall reading achievement.
References
Advantage Learning Systems. (1986). Accelerated Calfee, R. C., & Piaotkowski, D. C. (1981). The
Reader. Wisconsin Rapids, WI: Advantage Learning reading diary: Acquisition of decoding. Reading
Systems. Research Quarterly, 16, 346-373.
Ackerman, P. L. (1987). Individual differences in Carpenter, P. A., & Just, M. A. (1983). What your
skill reading: An integration of psychometric and eyes do while your mind is reading. In K. Rayner (Ed.),
information processing perspectives. Psychological Eye movements in reading: Perceptual and language
Bulletin, 102, 3-27. processes (pp. 275-307). New York: Academic Press.
Allington, R. (1984). Content coverage and Cattell, J. M. (1885). Uber die zeit der erkennung
contextual reading in reading groups. Journal of und bennenung von schriftzeichen. Bildern und Farben
Reading Behavior, 16, 85-96. Studien, 2, 635-650.
Allington, R. L. (1983). Fluency: The neglected Clay, M. M. (1972). The early detection of reading
reading goal in reading instruction. The Reading difficulties. Auckland, NZ: Heinemann.
Teacher, 36, 556-561.
Cooper, H. (1998). Synthesizing research (3rd ed.).
Allington, R. L. (1977). If they don’t read much, Thousand Oaks, CA: Sage.
how they ever gonna get good? Journal of Reading, 21,
57-61. Cunningham, A. E., & Stanovich, K. E. (1998).
What reading does for the mind. American Educator,
Anderson, R. C., Wilkinson, I. A. G., & Mason, J. 22(1-2), 8-15.
M. (1991). A microanalysis of the small-group, guided
reading lesson: Effects of an emphasis on global story Donahue, P. L., Voelkl, K. E., Campbell, J. R., &
meaning. Reading Research Quarterly, 26, 417-441. Mazzeo, J. (1999). NAEP 1998 reading report card for
the nation and states. Washington, DC: U.S.
Barr, R., Kamil, M. L., Mosenthal, P., & Pearson, Department of Education, Office of Educational
P. D. (1991). Handbook of reading research (Vol. 2). Research and Improvement, National Center for
New York: Longman. Education Statistics.
Bryan, W. L., & Harter, N. (1899). Studies of the Dowhower, (1994). Repeated reading revisited:
telegraphic language. The acquisition of a hierarchy of Research into practice. Reading and Writing Quarterly,
habits. Psychological Review, 6, 345-375. 10, 343-358.
Buswell, G. T. (1922). Fundamental reading habits: Everatt, J., Bradshaw, M. F., & Hibbard, P. B.
A study of their development. Supplementary (1998). Individual differences in reading and eye
Educational Monographs, No. 21. Chicago: University movement control. In G. Underwood (Ed.), Eye
of Chicago Press. guidance in reading and scene perception (pp. 223-242).
Oxford, England: Elsevier.
Buswell, G. T. (1937). How adults read.
(Supplementary Educational Monographs). Chicago:
University of Chicago Press.
Fleischer, L. S., Jenkins, J., & Pany, D. (1979). Homan, S. P., Klesius, J. P., & Hite, C. (1993).
Effects on poor readers’ comprehension of training in Effects of repeated readings and nonrepetitive
rapid decoding. Reading Research Quarterly. 14, 30-48. strategies on students’ fluency and comprehension.
Journal of Educational Research, 87, 94-99.
Flood, J., Jensen, J. M., Lapp, D., & Squire, J. R.
(1991). Handbook of research on teaching the English Huey, E. B. (1905). The psychology and pedagogy
language arts. New York: Macmillan. of reading. Cambridge, MA: MIT Press.
Frazier, L., & Rayner, K. (1982). Making & Hunt, L. C., Jr. (1970). The effect of self-selection,
correcting errors during sentence comprehension: Eye interest, and motivation on independent, instructional,
movements in the analysis of structurally ambiguous and frustrational levels. The Reading Teacher, 24, 146-
sentences. Cognitive Psychology, 14, 178-210. 151, 158.
Fries, C. C. (1962). Linguistics and reading. New Ikeda, M., & Saida, S. (1978). Span of recognition
York: Holt. in reading. Vision Research, 18, 83-88.
Goodman, Y. M., & Burke, C. L. (1972). Reading James, W. (1890). The principles of psychology.
miscue inventory: Procedure for diagnosis and New York: Holt.
correction. New York: Macmillan.
Johnson, M. S., Kress, R. A., & Pikulski, J. J.
Gough, P. B. (1972). One second of reading. In J. (1987). Informal reading inventories (2nd ed.). Newark,
F. Kavanagh, & I. G. Mattingly (Eds.), Language by IL: International Reading Association.
ear and by eye. (pp. 331-358). Cambridge, MA: MIT
Press. Juel, C. (1988). Learning to read and write: A
longitudinal study of fifty-four children from first
Greene, F. P. (1979). Radio reading. In C. Pennock through fourth grades. Journal of Educational
(Ed.), Reading comprehension at four linguistic levels Psychology, 80, 437-447.
(pp. 104-107). Newark, DE: International Reading
Association. Kennedy, A. (1983). On looking into space. In K.
Rayner (Ed.), Eye movements in reading: Perceptual
Harris, T. L., & Hodges, R. E. (1995). The literacy and language processes (pp. 237-251). New York:
dictionary. Newark, DE: International Reading Academic Press.
Association.
Kennedy, A., & Murray, W. S. (1987a). The
Hasbrouck, J. E., & Tindal, G. (1992). Curriculum- components of reading time: Eye movement patterns of
based oral reading fluency norms for students in grades good and poor readers. In J. K. O’Reagan & A. Levy-
2 through 5. Teaching Exceptional Children, 24(3), 41- Schoen (Eds.), Eye movements: From physiology to
44. cognition (pp. 509-520). Amsterdam: North Holland.
Kennedy, A., & Murray, W. S. (1987b). Spatial Neely, J. H. (1977). Semantic priming and retrieval
coordinates and reading: Comments on Monk. from lexical memory: Roles of inhibitionless spreading
Quarterly Journal of Experimental Psychology, 39A, activation and limited capacity attention. Journal of
649-656. Experimental Psychology: General, 106, 226-254.
Krashen, S. D. (1993). The power of reading: Neill, K. (1979). Turn kids on with repeated
Insights from the research. Englewood, CO: Libraries reading. Teaching Exceptional Children, 12, 63-64.
Unlimited.
Olson, R. K., Kliegl, R., Davidson, B. J., & Foltz,
LaBerge, D., & Samuels, S. J. (1974). Toward a G. (1985). Individual differences and developmental
theory of automatic information processing in reading. differences in reading disability. In G. MacKinnon & T.
Cognitive Psychology, 6, 293-323. G. Waller (Eds.), Reading research: Advances in theory
and practice (pp. 1-64). New York: Academic Press.
Leinhardt, G., Zigmond, N., & Cooley, W. W.
(1981). Reading instruction and its effects. American Opitz, M. F., & Rasinski, T. V. (1998). Good-bye
Educational Research Journal, 18, 343-361. round robin. Portsmouth, NH: Heinemann.
Levy, B. A., Nicholls, A., & Kohen, D. (1993). O’Shea, L. J., Sindelar, P. T., & O’Shea, D. J.
Repeated readings: Process benefits for good and poor (1985). The effects of repeated readings and attentional
readers. Journal of Experimental Child Psychology, 56, cues on reading fluency and comprehension. Journal of
303-327. Reading Behavior, 17, 129-142.
Logan, G. D. (1997). Automaticity and reading: Pearson, P. D., Barr, R., Kamil, M. L., &
Perspectives from the instance theory of Mosenthal, P. (1984). Handbook of reading research.
automatization. Reading and Writing Quarterly, 13, 123- New York: Longman.
146.
Pflaum, S. W., & Pascarella, E. T. (1980).
McConkie, G. W., & Rayner, K. (1975). The span Interactive effects of prior reading achievement and
of the effective stimulus during a fixation in reading. training in context on the reading of learning-disabled
Perception and Psychophysics, 17, 578-586. children. Reading Research Quarterly, 16, 138-158.
McConkie, G. W., & Zola, D. (1979). Is visual Pinnell, G. S., Pikulski, J. J., Wixson, K. K.,
information integrated across successive fixations in Campbell, J. R., Gough, P. B., & Beatty, A. S. (1995).
reading? Perception and Psychophysics, 25, 221-224. Listening to children read aloud. Washington, DC:
Office of Educational Research and Improvement, U.
McConkie, G. W., & Zola, D. (1987). Visual S. Department of Education.
attention during eye fixations while reading. In M.
Coltheart (Ed.), Attention and performance (Vol. 12, pp. Posner, M. I., & Snyder, R. R. (1975). Attention
385-401). London: Erlbaum. and cognitive control. In R. L. Solso (Ed.), Information
processing and cognition: The Loyola Symposium (pp.
Murray, W. S., & Kennedy, A. (1988). Spatial 55-85). Hillsdale, NJ: Erlbaum.
coding in the processing of anaphor by good and poor
readers: Evidence from eye movement analyses. Purves, A. C. (1994). Encyclopedia of English
Quarterly Journal of Experimental Psychology, 40A, studies and language arts. New York: Scholastic.
693-718.
Radach, R., & Kempe, V., (1993). An individual
Nagy, W., & Anderson, R. C. (1984). How many analysis of initial fixation positions in reading. In G.
words are there in printed school English? Reading d’Ydewalle & J. Van Rensbergen (Eds.), Perception
Research Quarterly, 19, 304-330. and cognition: Advances in eye movement research
(pp. 213-226). Amsterdam: North Holland.
Rasinski, T. V. (1990). Effects of repeated reading Shiffrin, R. M., & Schneider, W. (1977). Controlled
and listening-while-reading on reading fluency. Journal and automatic information processing: Perceptual
of Educational Research, 83, 147-150. learning, automatic attending, and general theory.
Psychological Review, 96, 84, 127-190.
Rayner, K. (1978b). Eye movements in reading and
information processing. Psychological Bulletin, 85, 618- Shanahan, S., Wojciehowski, J., & Rubik, G.
660. (1998). A celebration of reading: How our school read
for one million minutes. The Reading Teacher, 52, 93-
Rayner, K. (1986). Eye movements and the 96.
perceptual span in beginning and skilled readers. Journal
of Experimental Child Psychology, 41, 211-236. Shaywitz, B., Holford, T.R., Holahan, J.M.,
Fletcher, J.M., Steubing, K.K., Francis, D.J., &
Rayner, K. (1998). Eye movements in reading and Shaywitz, S.E. (1995). A Matthew effect for IQ but not
information processing: 20 years of research. for reading: Results from a longitudinal study of reading.
Psychological Bulletin, 124, 372-422. Reading Research Quarterly, 30, 894-906.
Rayner, K., & Duffy, S. A. (1988). On-line Sindelar, P. T., Monda, L. E., & O’Shea, L. J.
comprehension processes and eye movements in (1990). Effects of repeated readings on instructional-
reading. In M. Daneman, G. E. MacKinnon, & T. G. and mastery-level readers. Journal of Educational
Waller (Eds.), Reading research: Advances in theory Research, 83, 220-226.
and practice (pp. 13-66). San Diego, CA: Academic
Press. Snow, C. E., Burns, M. S., & Griffin, P. (1998).
Preventing reading difficulties in young children.
Rayner, K., McConkie, G. W., & Zola, D. (1980). Washington, DC: National Academy Press.
Integrating information across eye movements.
Cognitive Psychology, 12, 206-226. Stallings, J. (1980). Allocated academic learning
time revisited, or beyond time on task. Educational
Rayner, K., & Pollatsek, A. (1994). The psychology Researcher, 8(11), 11-16.
of reading. Mahwah, NJ: Erlbaum.
Stanovich, K. (1990). Concepts in developmental
Samuels, S. J. (1979). The method of repeated theories of reading skill: Cognitive resources,
readings. The Reading Teacher, 32, 403-408. automaticity, and modularity. Developmental Review,
10, 72-100.
Samuels, S. J., LaBerge, D., & Bremer, C. (1978).
Units of word recognition: Evidence for developmental Stanovich, K. (1986). Mathew effects in reading:
changes. Journal of Verbal Learning and Verbal Some consequences of individual differences in the
Behavior, 17, 715-720. acquisition of literacy. Reading Research Quarterly, 22,
360-406.
Samuels, S. J., Miller, N., & Eisenberg, P. (1979).
Practice effects on the unit of word recognition. Journal Strecker, S., Roser, N., & Martinez, M. (1998).
of Educational Psychology, 71, 514-520. Toward understanding oral reading fluency. In T.
Shanahan & F. Rodriguez-Brown (Eds.) Forty seventh
Schreiber, P. A. (1980). On the acquisition of Yearbook of the National Reading Conference, (pp.
reading fluency. Journal of Reading Behavior, 12, 177- 295-310). Chicago, IL: The National Reading
186. Conference.
Taylor, B. M., Pearson, P. D., Clark, K. F., & Underwood, N. R., & McConkie, G. W. (1985).
Walpole, S. (1999). Beating the odds in teaching all Perceptual span for letter distinctions during reading.
children to read. (CIERA Rep. No. 2-006). Ann Arbor, Reading Research Quarterly, 20, 153-162.
MI: Center for the Improvement of Early Reading
Achievement, University of Michigan. Van Bon, W. H. J., Boksebeld, L. M., Font Freide,
T. A. M., & van den Hurk, A. J. M. (1991). A
Taylor, S., Frackenpohl, H., & Pettee, J. (1960). comparison of three methods of reading-while-listening,
Grade level norms for the components of the Journal of Learning Disabilities, 24, 471-476.
fundamental reading skill. (Bulletin No. 3). Huntington,
NY: Educational Developmental Laboratories. VanWagenen, M. A., Williams, R. L., &
McLaughlin, T. F. (1994). Use of assisted reading to
Terry, P., Samuels, S. J., & LaBerge, D. (1976). improve reading rate, word accuracy, and
The effects of letter degradation and letter spacing on comprehension with ESL Spanish-speaking students.
word recognition. Journal of Verbal Learning and Perceptual and Motor Skills, 79, 227-230.
Verbal Behavior, 15, 577-585.
Wagner, R., Torgesen, J. & Rashotte, C. (1999).
Thurlow, R., & van den Broek, P. (1997). Comprehensive test of phonological processes. Austin,
Automaticity and inference generation. Reading and TX: Pro-Ed.
Writing Quarterly, 13, 165-184.
Wiederholt, J. L. & Bryant, B. R. (1992). Gray
Topping, K. (1987). Paired reading: A powerful Oral Reading Tests. (3rd ed.). Austin, TX: Pro-Ed.
technique for parent use. The Reading Teacher, 40,
608-614. Winter, S. (1986). Peers as paired reading tutors.
British Journal of Special Education, 13, 103-106.
Underwood, G., Hubbard, A., & Wilkinson, H.
(1990). Eye fixations predict reading comprehension: Winter, S. (1988). Paired reading: A study of
The relationship between reading skill, reading speed process and outcome. Educational Psychology, 8, 135-
and visual inspection. Language and Speech, 33, 69-81. 151.
FLUENCY
Appendices
Appendix A
Studies of Repeated Reading and Guided Oral Reading
T h a t Te s t e d I m m e d i a t e I m p a c t o f t h e Pr o c e d u r e s
o n R e a d i n g Pe r f o r m a n c e ( n o t r a n s f e r )
Faulkner, H. J., & Levy, B. A. (1999). Fluent and Sindelar, P. T., Monda, L. E., & O’Shea, L. J.
nonfluent forms of transfer in reading: Words and their (1990). Effects of repeated readings on instructional-
message. Psychonomic Bulletin and Review, 6, 111-116. and mastery-level readers. Journal of Educational
Research, 83, 220-226.
Levy, B. A., Nicholls, A., & Kohen, D. (1993).
Repeated readings: Process benefits for good and poor Smith, D. D. (1979). The improvement of children’s
readers. Journal of Experimental Child Psychology, 56, oral reading through the use of teacher modeling.
303-327. Journal of Learning Disabilities, 12 (3), 39-42.
Neill, K. (1979). Turn kids on with repeated Stoddard, K., Valcante, G., Sindelar, P., O’Shea, L.,
reading. Teaching Exceptional Children, 12, 63-64. & Algozzine, B. (1993). Increasing reading rate and
comprehension: The effects of repeated readings,
O’Shea, L. J., Sindelar, P. T., & O’Shea, D. J. sentence segmentation, and intonation training. Reading
(1985). The effects of repeated readings and attentional Research and Instruction, 32, 53-65.
cues on reading fluency and comprehension. Journal of
Reading Behavior, 17, 129-142. Taylor, N. E., Wade, M. R., & Yekovich, F. R.
(1985). The effects of text manipulation and multiple
Pany, D., & McCoy, K. M. (1988). Effects of reading strategies on the reading performance of good
corrective feedback on word accuracy and reading and poor readers. Reading Research Quarterly, 20, 566-
comprehension of readers with learning disabilities. 574.
Journal of Learning Disabilities, 21, 546-550.
Turpie, J. J., & Paratore, J. R. (1995). Using
Rasinski, T. V. (1990). Effects of repeated reading repeated reading to promote success in a
and listening-while-reading on reading fluency. Journal heterogeneously grouped first grade. In K. A.
of Educational Research, 83, 147-150. Hinchman, D.J. Leu, & C.K. Kinzer (Eds.)
Perspectives on literacy research and practice: Forty-
Reitsma, P. (1988). Reading practice for beginners: fourth Yearbook of the National Reading Conference
Effects of guided reading, reading-while-listening, and (pp. 255-263). Chicago: The National Reading
independent reading with computer-based speech Conference.
feedback. Reading Research Quarterly, 23, 219-235.
VanWagenen, M. A., Williams, R. L., &
Rose, T. L., & Beattie, J. R. (1986). Relative McLaughlin, T. F. (1994). Use of assisted reading to
effects of teacher-directed and taped previewing on improve reading rate, word accuracy, and
oral reading. Learning Disability Quarterly, 9, 193-199. comprehension with ESL Spanish-speaking students.
Perceptual and Motor Skills, 79, 227-230.
Appendix B
Articles Included in Meta-Analysis on Guided
Oral Reading Procedures
Conte, R., & Humphreys, R. (1989). Repeated Rasinski, T., Padak, N., Linek, W., & Sturtevant, E.
readings using audiotaped material enhances oral (1994). Effects of fluency development on urban
reading in children with reading difficulties. Journal of second-grade readers. Journal of Educational Research,
Communications Disorders, 22, 65-79. 87, 158-165.
Eldredge, J. L. (1990). Increasing the performance Reutzel, D. R., & Hollingsworth, P. M. (1993).
of poor readers in the third grade with a group-assisted Effects of fluency training on second graders’ reading
strategy. Journal of Educational Research, 84, 69-77. comprehension. Journal of Educational Research, 86,
325-331.
Eldredge, J. L., Reutzel, D. R., & Hollingsworth, P.
M. (1996). Comparing the effectiveness of two oral Shany, M. T., & Biemiller, A. (1995). Assisted
reading practices: Round-robin reading and the shared reading practice: Effects on performance for poor
book experience. Journal of Literacy Research, 28, readers in grade 3 and 4. Reading Research Quarterly,
201-225. 30, 382-395.
Hollingsworth, P. M. (1978). An experimental Simmons, D., Fuchs, D., Fuchs, L. S., Hodge, J. P.,
approach to the impress method of teaching reading. & Mathes, P. G. (1994). Importance of instructional
Reading Teacher, 31, 624-627. complexity and role reciprocity to classwide peer
tutoring. Learning Disabilities Research and Practice, 9,
Hollingsworth, P. M. (1970). An experiment with 203-212.
the impress method of teaching reading. Reading
Teacher, 24, 112-114, 187. Simmons, D.C., Fuchs, L. S., Fuchs, D., Mathes, P.,
& Hodge, J. P. (1995). Effects of explicit teaching and
Labbo, L. D., & Teale, W. (1990). Cross-age peer tutoring on the reading achievement of learning-
reading: A strategy for helping poor readers. Reading disabled and low-performing students in regular
Teacher, 43, 362-369. classrooms. Elementary School Journal, 95, 387-408.
Lorenz, L., & Vockell, E. (1979). Using the Thomas, A., & Clapp, T. (1989). A comparison of
neurological impress method with learning disabled computer-assisted component reading skills training and
readers. Journal of Learning Disabilities, 12, 67-69. repeated reading for adolescent poor readers. Canadian
Journal of Special Education, 5, 135-144.
Mathes, P. G., & Fuchs, L. S. (1993). Peer
mediated reading instruction in special education Young, A. R., Bowers, P. G., & MacKinnon, G. E.
resource rooms. Learning Disabilities Research and (1996). Effects of prosodic modeling and repeated
Practice, 8, 233-243. reading on poor readers’ fluency and comprehension.
Applied Psycholinguistics, 17, 59-84.
Miller, A., Robson, D., & Bushell, R. (1986).
Parental participation in paired reading: A controlled
study. Educational Psychology, 6, 277-284.
Appendix C
Blum, I. H., Koskinen, P. S., Tennant, N., Parker, E. Mefferd, P. E., & Pettegrew, B. S. (1997).
M., Straub, M., & Curry, C. (1995). Using audiotaped Fostering literacy acquisition of students with
books to extend classroom literacy instruction into the developmental disabilities: Assisted reading with
homes of second-language learners. Journal of Reading predictable trade books. Reading Research and
Behavior, 27, 535-563. Instruction, 36, 177-190.
Gilbert, L. M., Williams, R. L., & McLaughlin, T. F. Morgan, R. T. (1976). “Paired reading” tuition: A
(1996). Use of assisted reading to increase correct preliminary report on a technique for cases of reading
reading rates and decrease error rates of students with deficit. Child: Care, Health and Development, 2, 13-28.
learning disabilities. Journal of Applied Behavior
Analysis, 29, 255-257. Morgan, R., & Lyon, E. (1979). “Paired reading”–
A preliminary report on a technique for parental tuition
Herman, P. A. (1985). The effect of repeated of reading-retarded children. Journal of Child
readings on reading rate, speech pauses, and word Psychiatry, 20, 151-160.
recognition accuracy. Reading Research Quarterly, 20,
553-565. Rose, T. L. (1984). The effects of two prepractice
procedures on oral reading. Journal of Learning
Kamps, D. M., Barbetta, P. M., Leonard, B. R., & Disabilities, 17, 544-548.
Delquadri, J. (1994). Classwide peer tutoring: An
integration strategy to improve reading skills and Tingstrom, D. H., Edwards, R. P., & Olmi, D. J.
promote peer interactions among students with autism (1995). Listening previewing in reading to read: Relative
and general education peers. Journal of Applied effects on oral reading fluency. Psychology in the
Behavior Analysis, 27, 49-61. Schools, 32, 318-327.
Langford, K., Slade, K., & Barnett, A. (1974). An Weinstein, G., & Cooke, N. L. (1992). The effects
examination of impress techniques in remedial reading. of two repeated reading interventions on generalization
Academic Therapy, 9, 309-319. of fluency. Learning Disability Quarterly, 15, 21-28.
Appendix D
Carver, R. P., & Hoffman, J. V. (1981). The effect Lindsay, G., Evans, A., & Jones, B. (1985). Paired
of practice through repeated reading on gain in reading reading versus relaxed reading: A comparison. British
ability using a computer-based instructional system. Journal of Educational Pscyhology, 55, 304-309.
Reading Research Quarterly, 3, 374-390.
Rashotte, C., & Torgesen, J. K. (1985). Repeated
Dixon-Krauss, L. A. (1995). Partner reading and reading and reading fluency in learning disabled
writing: Peer social dialogue and the zone of proximal children. Reading Research Quarterly, 20, 180-188.
development. Journal of Reading Behavior, 27, 45-63.
Van Bon, W. H. J., Boksebeld, L. M., Font Freide,
Dowhower, S. L. (1987). Effects of repeated T. A. M., & van den Hurk, A. J. M. (1991). A
reading on second-grade transitional readers’ fluency comparison of three methods of reading-while-listening,
and comprehension. Reading Research Quarterly, 22, Journal of Learning Disabilities, 24, 471-476.
389-406.
Winter, S. (1986). Peers as paired reading tutors.
Homan, S. P., Klesius, J. P., & Hite, C. (1993). British Journal of Special Education, 13, 103-106.
Effects of repeated readings and nonrepetitive
strategies on students’ fluency and comprehension. Winter, S. (1988). Paired reading: A study of
Journal of Educational Research, 87, 94-99. process and outcome. Educational Psychology, 8, 135-
151.
Appendix E
Burley, J. E. (1980). Short-term, high intensity Langford, J. C., & Allen, E. G. (1983). The effects
reading practice methods for Upward Bound Students: of U.S.S.R. on students’ attitudes and achievement.
An appraisal. Negro Educational Review, 31, 156-161. Reading Horizons, 23, 194-200.
Carver, R. P., & Liebert, R. E., (1995). The effect Manning, G. L., & Manning, M. (1984). What
of reading library books in different levels of difficulty models of recreational reading make a difference.
on gain in reading ability. Reading Research Quarterly, Reading World, 23, 375-380.
30, 26-48.
Morrow, L. M., & Weinstein, C. S. (1986).
Cline, R. K. J., & Kretke, G. L. (1980). An Encouraging voluntary reading: The impact of a
evaluation of long-term SSR in the junior high school. literature program on children’s use of library centers.
Journal of Reading, 23, 503-506. Reading Research Quarterly, 21, 330-346.
Collins, C. (1980). Sustained silent reading periods: Peak, J., & Dewalt, M. W. (1994). Reading
Effects on teachers’ behaviors and students’ achievement: Effects of computerized reading
achievement. Elementary School Journal, 81, 108-114. management and enrichment. ERS Spectrum, 12(1), 31-
34.
Davis, Z. T. (1988). A comparison of the
effectiveness of sustained silent reading and directed Reutzel, D. R., & Hollingsworth, P. M. (1991).
reading activity on students’ reading achievement. The Reading comprehension skills: Testing the
High School Journal, 72(1), 46-48. distinctiveness hypothesis. Reading Research and
Instruction, 30, 32-46.
Evans, H. M., & Towner, J. C. (1975). Sustained
silent reading: Does it increase skills? Reading Teacher, Summers, E. G., McClelland, J. V. (1982). A field-
29, 155-156. based evaluation of sustained silent reading (SSR) in
intermediate grades. Alberta Journal of Educational
Holt, S. B., & O’Tuel, F. S. (1989). The effect of Research, 28, 100-112.
sustained silent reading and writing on achievement and
attitudes of seventh and eighth grade students reading Vollands, S. R., Topping, K. J., & Evans, R. M.
two years below grade level. Reading Improvement, 26, (1999). Computerized self-assessment of reading
290-297. comprehension with the Accelerated Reader: Action
Research. Reading and Writing Quarterly, 15, 197-211.
COMPREHENSION
Executive Summary
analyses do not meet the formal criteria for inclusion in Teacher Pr eparation and Compr
Preparation ehension
Comprehension
the analysis, they are relevant to the issues at hand. Strategies Instruction
Consequently, they are included in the discussions of A study had to meet the following criteria to be included
findings. in the review:
Vocabular
ocabulary y Instruction Vocabulary Instruction: Findings
A formal meta-analysis was not possible. Inspection of
the research studies that were included in the database The importance of vocabulary knowledge has long been
revealed a heterogeneous set of methodologies, recognized. In 1925, the National Society for Studies in
implementations, and conceptions of vocabulary Education (NSSE) Yearbook (Whipple, 1925) noted:
instruction. The Panel found no research on vocabulary “Growth in reading power means, therefore, continuous
measurement that met the NRP criteria; therefore, a enriching and enlarging of the reading vocabulary and
detailed review of implicit evidence is presented. increasing clarity of discrimination in appreciation of
word values.” (Davis, 1942, p. 76) presented evidence
For the analysis of research on vocabulary instruction, that comprehension comprises two “skills”: Word
there were recent meta-analyses that dealt with only knowledge or vocabulary and reasoning. Vocabulary
the variables amenable to a meta-analysis. For the most occupies an important position in learning to read. As a
part, the experimental research in vocabulary instruction learner begins to read, reading vocabulary encountered
involves many different variables and methodologies. It in texts is mapped onto the oral vocabulary the learner
was deemed inappropriate to analyze such disparate brings to the task. The reader learns to translate the
studies as a group. For comprehension instruction, there (relatively) unfamiliar words in print into speech, with
were simply too many studies involving too many the expectation that the speech forms will be easier to
variables to allow for a simple meta-analysis. The comprehend. Benefits in understanding text by applying
decision was made to do the preliminary work letter-sound correspondences to printed material come
necessary to organize comprehension instruction about only if the target word is in the learner’s oral
research for possible future analyses. For research on vocabulary. When the word is not in the learner’s oral
preparation of teachers to teach comprehension, there vocabulary, it will not be understood when it occurs in
were only four studies, and a meta-analysis was not print. Vocabulary occupies an important middle ground
possible. The general analysis of teacher education and in learning to read. Oral vocabulary is a key to learning
professional development found too few studies on too to make the transition from oral to written forms.
many variables to conduct a formal meta-analysis. Reading vocabulary is crucial to the comprehension
Similarly, for computer technology and reading processes of a skilled reader.
instruction, there were relatively few studies, most of
which used variables that differed from each other. Vocabulary Instruction Methods
Five main methods of teaching vocabulary were
Text Compr ehension Instruction
Comprehension
identified:
A formal meta-analysis was not possible because even
the studies identified in the same instructional category 1. Explicit Instruction: Students are given definitions or
used widely varying sets of methodologies and other attributes of words to be learned.
implementations. Therefore, the Panel found few
2. Implicit Instruction: Students are exposed to words
research studies that met all the NRP criteria; however,
or given opportunities to do a great deal of reading.
to the extent possible, NRP criteria were employed in
the analyses. NRP criteria for evaluating existing 3. Multimedia Methods: Vocabulary is taught by going
reviews of research were used in the analyses of the beyond text to include other media such as graphic
two Rosenshine and colleagues meta-analyses. representations, hypertext, or American Sign
Language that uses a haptic medium.
Teacher Pr eparation and Compr
Preparation ehension
Comprehension
Strategies Instruction 4. Capacity Methods: Practice is emphasized to
A formal meta-analysis was not possible because of the increase capacity through making reading
small number of studies identified. However, automatic.
comprehensive summaries according to NRP guidelines 5. Association Methods: Learners are encouraged to
for each of the four studies are included in the report. draw connections between what they do know and
words they encounter that they do not know.
3. What are the more general effects of vocabulary The bulk of instruction of text comprehension research
instruction across the grades? during the past 2 decades has been guided by the
cognitive conceptualization of reading described above.
4. Empirical support has been found for the facilitation
In the cognitive research of the reading process,
of vocabulary learning with computers as ancillary
reading is purposeful and active. A reader reads a text
aids and replacements of other technologies
to understand what is read and to put this understanding
(Reinking & Rickman, 1990; Heise, 1991; Davidson
to use. A reader can read a text to learn, to find out
et al., 1996). What is the optimal use of computer
information, or to be entertained. These various
(and other) technologies in vocabulary instruction?
purposes of understanding require that the reader use
What is the precise role of multimedia learning in
knowledge of the world, including language and print.
vocabulary acquisition?
This knowledge enables the reader to make meanings
5. What is the precise role of multimedia learning in of the text, to form memory representations of these
vocabulary instruction across the grades? meanings, and to use them to communicate information
with others about what was read.
6. How should vocabulary be integrated in
comprehension instruction for optimal benefit to the Although instruction on text comprehension has been a
student? major research topic for more than 20 years, the explicit
teaching of text comprehension before 1980 was done
7. What are the optimal combinations of the various largely in content areas and not in the context of formal
methods of vocabulary instruction, including direct reading instruction. The idea behind explicit instruction
and indirect instruction, as well as different methods of text comprehension was that comprehension could
within these categories? be improved by teaching students to use specific
8. What sort of professional development is needed cognitive strategies or to reason strategically when they
for teachers to become proficient in vocabulary encountered barriers to comprehension in reading. The
instruction? goal of such training was the achievement of competent
and self-regulated reading.
Text Comprehension Instruction:
Readers normally acquire strategies for active
Findings comprehension informally. Comprehension strategies
Comprehension is a complex process. There exist as are specific procedures that guide students to become
many interpretations of comprehension as there are of aware of how well they are comprehending as they
reading. This may be so because comprehension is attempt to read and write. Explicit or formal instruction
often viewed as “the essence of reading” (Durkin, of these strategies is believed to lead to improvement in
1993). Reading comprehension is further defined as text understanding and information use. Instruction in
“intentional thinking during which meaning is comprehension strategies is carried out by a classroom
constructed through interactions between text and teacher who demonstrates, models, or guides the reader
reader” (Durkin, 1993). According to this view, in their acquisition and use. When these procedures are
meaning resides in the intentional, problem-solving, acquired, the reader becomes independent of the
thinking processes of the reader that occur during an teacher. Using them, the reader can effectively interact
interchange with a text. The content of meaning is with the text without assistance.
influenced by the text and by the reader’s prior Comprehension Instruction Methods
knowledge and experience that are brought to bear on
it. Reading comprehension is the construction of the Analyses of the 203 studies on instruction of text
meaning of a written text through a reciprocal comprehension led to the identification of 16 different
interchange of ideas between the reader and the kinds of effective procedures. Of the 16 types of
message in a particular text (Harris & Hodges, 1995, instruction, 8 offered a firm scientific basis for
definition #2, p. 39). concluding that they improve comprehension. The eight
kinds of instruction that appear to be effective and most
promising for classroom instruction are
1. Comprehension monitoring in which the reader comprehension tests. Teachers can learn to teach
learns how to be aware or conscious of his or her students to use comprehension strategies in natural
understanding during reading and learns procedures learning situations. Furthermore, when teachers teach
to deal with problems in understanding as they these strategies, their students learn them and improve
arise. their reading comprehension.
2. Cooperative learning in which readers work A common aspect of individual and multiple-strategy
together to learn strategies in the context of instruction is the active involvement of motivated
reading. readers who read more text as a result of the
instruction. These motivational and reading practice
3. Graphic and semantic organizers that allow the
effects may be important to the success of multiple-
reader to represent graphically (write or draw) the
strategy instruction.
meanings and relationships of the ideas that underlie
the words in the text. Multiple-strategy instruction that is flexible as to which
strategies are used and when they are taught over the
4. Story structure from which the reader learns to ask
course of a reading session provides a natural basis for
and answer who, what, where, when, and why
teachers and readers to interact over texts. The
questions about the plot and, in some cases, maps
research literature developed from the study of isolated
out the time line, characters, and events in stories.
strategies to their use in combination to the preparation
5. Question answering in which the reader answers of teachers to teach them in interaction over texts with
questions posed by the teacher and is given readers in naturalistic settings. The Panel regards this
feedback on the correctness. development as the most important finding of the
Panel’s review because it moves from the laboratory to
6. Question generation in which the reader asks the classroom and prepares teachers to teach strategies
himself or herself what, when, where, why, what in ways that are effective and natural.
will happen, how, and who questions.
Implications for Reading Instruction
7. Summarization in which the reader attempts to
identify and write the main or most important ideas The empirical evidence reviewed favors the conclusion
that integrate or unite the other ideas or meanings that teaching of a variety of reading comprehension
of the text into a coherent whole. strategies leads to increased learning of the strategies,
to specific transfer of learning, to increased retention
8. Multiple-strategy teaching in which the reader uses
and understanding of new passages, and, in some cases,
several of the procedures in interaction with the
to general improvements in comprehension.
teacher over the text. Multiple-strategy teaching is
effective when the procedures are used flexibly and The important development of instruction of
appropriately by the reader or the teacher in comprehension research is the study of teacher
naturalistic contexts. preparation for instruction of multiple, flexible strategies
with readers in natural settings and content areas and
Results of Comprehension Instruction the assessment of the effectiveness of this instruction
With respect to the scientific basis of the instruction of by trained teachers on comprehension.
text comprehension, the Panel concludes that
Directions for Further Research
comprehension instruction can effectively motivate and
teach readers to learn and to use comprehension The Panel’s analysis of the research on instruction of
strategies that benefit the reader. text comprehension left a number of questions
unanswered.
These comprehension strategies yield increases in
measures of near transfer such as recall, question
answering and generation, and summarization of texts.
These comprehension strategies, when used in
combination, show general gains on standardized
1. More information is needed on the effective ways analyzing teacher and reader performance during
to teach teachers how to use proven strategies for instruction; use of appropriate units (individual,
instruction in text comprehension. This information group, classroom) in analyses; and assessment of
is crucial to situations in which teachers and either long-term effects or generalization of the
readers interact over texts in real classroom strategies to other tasks and materials.
contexts.
Teacher Preparation and
2. Some evidence was reviewed that indicated that
Comprehension Strategies
instruction in comprehension in content areas
benefits readers in terms of achievement in social Instruction: Findings
studies. However, it is not clear whether instruction The preparation of teachers to deliver comprehension
of comprehension strategies leads to learning skills strategy instruction is important to the success of
that improve performance in content areas of teaching reading comprehension. As indicated by the
instruction. If so, then it might be efficient to teach Panel’s review of text comprehension, reading
reading comprehension as a learning skill in content comprehension can be improved by teaching students to
areas. use specific cognitive strategies or to reason
3. Instruction of comprehension has been successful strategically when they encounter barriers to
over the 3rd to 6th grade range. A next step will be comprehension when reading. The goal of such training
to determine whether certain strategies are more is the achievement of competent and self-regulated
appropriate for certain ages and abilities, what reading. The research on comprehension strategies has
reader characteristics influence successful evolved dramatically over the course of the last 2
instruction of reading comprehension, and which decades. At first, investigators focused on teaching
strategies, in combination, are best for younger students one strategy at a time. A wide variety of
readers, poor or below-average readers, or for strategies was studied, including imagery, question
learning-disabled and dyslexic readers. generating, prediction, and a host of others. In later
studies, several strategies were taught in combination.
4. It will be important to know whether successful However, implementation in the context of the actual
instruction generalizes across different text genres classroom of this promising approach to comprehension
(e.g., narrative and expository) and texts from has been problematic. Acquiring and practicing
different subject content areas. The NRP review of strategies in isolation and then attempting to provide
the research indicated that little or no attention was transfer opportunities during the reading of text is not
given to the kinds of text that were used and that the kind of instruction that is required in naturalistic
there was little available information on the contexts. Proficient reading involves a constant, ongoing
difficulty level of texts. adaptation of many cognitive processes.
5. It will also be important to determine what teacher Thus, teachers must be skillful in their instruction and
characteristics influence successful instruction of must respond flexibly and opportunistically to students’
reading comprehension and what the most effective needs for instructive feedback as they read. To be able
ways are to train teachers, both preservice and to do this, teachers must themselves have a firm grasp
inservice. not only of the strategies that they are teaching the
6. Criteria of internal and external validity should be children but also of instructional strategies that they can
considered in the design of future research, to employ to achieve their goal. Many teachers find this
address problems that were noted in prior studies. type of teaching a challenge, most likely because they
Specifically, these issues were random assignment have not been trained to do such teaching. The focus of
of students to treatments and control conditions; the review was on four recent and promising studies
exposure of experimental and control participants to that addressed the need for specific teacher preparation
the same training materials; provision of information in the implementation of strategy instruction in
about the amount of time spent on dependent naturalistic classroom contexts. In these four studies,
variable tasks; the study of fidelity of treatment and teachers were trained to teach strategies, and the focus
was on the effectiveness of that training on students’ In both approaches, teachers explain specific strategies
reading. It is not surprising that only a few relevant to students and model the reasoning associated with
studies have been done on this topic. Interest in the their use. Both approaches include the use of
topic is rather new, and preparing teachers to deliver systematic practice of new skills, as well as scaffold
effective strategy instruction is a lengthy and complex support, in which teachers gradually withdraw the
process. amount of assistance they offer to students. The
different emphases of the two approaches (explanation
Methods of Teacher Preparation vs. discussion) result in differences in the level of
There have been two major approaches to collaboration among students.
comprehension strategy instruction in the classroom:
Results of Teacher Preparation
direct explanation (DE) and transactional.
All four studies showed that teachers can be taught to
The DE approach was designed to improve on the
be effective in teaching comprehension to their students
approach in which students are taught to use one or
in naturalistic reading contexts. These studies indicate
several strategies as described and reviewed in the
that teaching teachers to use comprehension instruction
previous section on text comprehension instruction. This
methods leads to students’ awareness of strategies and
kind of instruction did not attempt to provide students
use of strategies, which can in turn lead to improved
with an understanding of the reasoning and mental
reading.
processes involved in reading strategically. Therefore,
Gerald Duffy and Laura Roehler developed the DE Implications for Reading Instruction
approach. In this approach, teachers do not teach
individual strategies but focus instead on helping Teachers need training to become effective in
students to (1) view reading as a problem-solving task explaining fully what it is that they are teaching (what to
that necessitates the use of strategic thinking and (2) do, why, how, and when), modeling their own thinking
learn to think strategically about solving reading processes for their students, encouraging students to
comprehension problems. The focus is on developing ask questions and discuss possible answers and problem
teachers’ ability to explain the reasoning and mental solutions among themselves, and keeping students
processes involved in successful reading comprehension engaged in their reading by providing tasks that demand
in an explicit manner, hence the use of the term “direct active involvement.
explanation.” The implementation of DE requires There should be greater emphasis in teacher education
specific and intensive teacher training on how to teach on the teaching of reading comprehension. Such
the traditional reading comprehension skills found in instruction should begin during preservice training, and it
basal readers as strategies, for example, to teach should be extensive, especially with respect to preparing
students the skill of how to find the main idea by casting teachers to teach comprehension strategies.
it as a problem-solving task and reasoning about it
strategically. The transactional strategy instruction Directions for Further Research
(TSI) approach includes the same key elements as the It will be important to determine how well teachers
direct explanation approach, but it takes a somewhat maintain their effectiveness in the classroom after they
different view of the role of the teacher in strategy have had preparation in teaching reading
instruction. The TSI approach focuses on the ability of comprehension. Thus, several research questions
teachers to facilitate discussions in which students remain to be investigated.
(1) collaborate to form joint interpretations of text and
(2) explicitly discuss the mental processes and cognitive 1. Which components of a successful teacher
strategies that are involved in comprehension. In other preparation program are the effective ones? These
words, the emphasis is on the interactive exchange could include characteristics of the teacher
among learners in the classroom, hence use of the term preparation program itself, such as its focus or
“transactional.”
intensity, as well as characteristics of the instruction 6. Should the teaching of reading comprehension begin
delivered to the students, such as the amount of during preservice or inservice training or both, and
instruction provided, the particular strategies taught, how extensive should it be?
and the amount of collaborative discussion involved.
Conclusions
2. Can instruction in reading comprehension strategies
be successfully implemented and incorporated into Vocabulary is one of the most important areas within
content area instruction? comprehension and should not be neglected. The NRP
found a variety of methods by which readers acquire
3. How should the effectiveness of strategy vocabulary through explicit instruction and improve their
instruction be assessed, especially with respect to comprehension of what they read. The Panel also found
reading achievement and subject matter proficiency, that although there has been considerable success in
but also with respect to student interest and teacher teaching a variety of effective text comprehension
satisfaction? strategies that lead to improved text comprehension, the
4. Comprehension instruction research has been most promising lines of research within the reading
limited to grades 3 through 8. It will be important to comprehension strategies area focused on teacher
examine comprehension instruction in the primary preparation to teach comprehension. Teachers can be
grades when children are mastering phonics and helped by intensive preparation in strategy instruction,
word recognition and developing reading fluency. and this preparation leads to improvement in the
performance of their students.
5. How should the effectiveness of strategy
instruction be assessed, for example, by reading
achievement or subject matter achievement?
COMPREHENSION
Introduction
Learning to read is one of the most important given the importance of comprehension strategies in
things children accomplish in elementary fully understanding text, it is critical to know how
school because it is the foundation for most of teachers can best be prepared to teach their students
their future academic endeavors. From the specific strategies that will facilitate this understanding.
middle elementary years through the rest of
With this as background, this section of the NRP report
their lives as students, children spend much of
is organized in three subsections. The first section
their time reading and learning information
examines specific evidence on the effects of
presented in text. The activity of reading to
vocabulary instruction on reading achievement,
learn requires students to comprehend and
particularly text comprehension. In the second section,
recall the main ideas or themes presented in
data relevant to the effects of different types of
. . . text. (Stevens et al., 1991, p. 8).
instruction to facilitate the comprehension of text are
Inherent in the view expressed by Stevens et al. (1991) analyzed and interpreted. In the third section, available
in the preceding quotation is the critical importance of data on the preparation of teachers to teach reading
reading comprehension and the developmental view that comprehension strategies are reviewed and interpreted.
children must first learn how to recognize and relate In addition, a comprehensive description of the review
print to oral language knowledge and to make this methodology employed in the evaluation of studies
recognition automatic through practice. Indeed, relevant to vocabulary instruction, text comprehension
comprehension has come to be viewed as the “essence instruction, and teacher preparation is provided in
of reading” (Durkin, 1993), essential not only to Appendix A of this report.
academic learning but to lifelong learning as well.
In contrast to the sections in the NRP report that
Despite the critical and fundamental importance of
address phonemic awareness, phonics, and fluency,
reading comprehension, comprehension as a cognitive
studies undertaken with young children at risk for
process began to receive scientific attention only in the
reading failure and older disabled readers were not
past 30 years.
reviewed and analyzed as part of the Comprehension
As the National Reading Panel initiated its analysis of report. Time and resource limitations prevented the
the extant research data on reading comprehension, Panel from examining this subtopic in an optimal
three major themes emerged. First, reading manner. Thus, it was decided to examine only literature
comprehension as a process and an amalgam of that pertained to the normal reading process.
complex skills cannot be understood without examining
The reader will also note that the organization of this
the critical role and importance of vocabulary and
section of the report differs from the structure
vocabulary instruction. Second, in contrast to earlier
employed in adjacent sections in that each subsection
speculation that reading comprehension was a passive
(Vocabulary Instruction, Text Comprehension
process, more recent data clearly indicate that robust
Instruction, Teacher Preparation and Comprehension
comprehension is dependent on active and thoughtful
Strategies Instruction) provides a topic-specific
interaction between the text and the reader. Therefore,
introduction, an overview of methodology, results and
the development and investigation of reading
discussion of the findings, implications for reading
comprehension strategies now occupy a central role in
instruction, and directions for future research. This
teaching readers how to maximize their understanding
organization was adopted to accommodate the
of what is read and in explaining the different strategies
complexity of each of these subtopics and differences
that enable readers to optimally understand text. Third,
in the review and data analytic procedures employed
for each of the topics.
COMPREHENSION I
Vocabulary Instruction
that we would rarely use. Vocabulary is also Persons who can correctly identify unfamiliar words
subcategorized as oral vs. reading vocabulary, where are assumed to have larger vocabularies. The more
oral refers to words that are recognized in speaking or unfamiliar words that can be identified, the larger the
listening while reading vocabulary refers to words that vocabulary. However, these are estimates, rather than
are used or recognized in print. Sight vocabulary is a precise measurements. Furthermore, the definition of
subset of reading vocabulary that does not require “familiar” or “known” words is difficult to pin down
explicit word recognition processing. Conclusions about outside of a specific context. What does it mean if a
some of these different types of vocabularies often do learner “almost” knows a word? The assessment of
not apply to all; what may be true for one may or may such a circumstance has no objective answer.
not be true for another.
Finally, evaluation of vocabulary knowledge is measured
At a conceptual level, vocabulary can be measured in either by standardized tests or by informal,
many ways. One major distinction in the measurement experimenter- or teacher-generated tests on one
of vocabulary parallels the receptive/productive dimension and by receptive vs. productive techniques
distinction. Vocabulary that is recognized by an on another dimension.
individual is often different from vocabulary that is
produced. Another distinction is made between reading Methodology
vocabulary and writing vocabulary—the vocabularies
Database
that are available to the reader or writer—and between
speaking and listening vocabularies. Still another type of A search using Endnote 3.0 connected to the ERIC
vocabulary is often referred to as sight vocabulary— online database with a Z39.50 connection was initiated.
those words that can be identified without explicit Using the term “vocabulary” alone (in any field) yielded
decoding during reading. 18,819 citations. A search using “vocabulary” and
“instruction” and “reading” and “research” and
Because there are so many definitions of vocabulary,
“method” yielded 141 citations. A similar search
the format for assessing or evaluating vocabulary is an
undertaken using the PsycINFO database yielded a
important variable in both practice and research. One
total of 56 nonoverlapping citations. The 197 citations
way of assessing recognition vocabulary is to have the
were downloaded into an Endnote library for further
learner select a definition for a word from a list of
analysis. From this set, citations were removed if they
alternatives. Conversely, the task could be to select a
were not reports of research, did not report
word for the definition. In many cases, such as
experimental or quasi-experimental studies, dealt with
standardized tests, this method is used as a means of
foreign languages or non-English-speaking groups, or
obtaining efficiency in testing. A second method of
dealt exclusively with learning disabled or other special
assessing vocabulary is by having the learner generate
populations, including second-language learners.
a definition for a word. Because this method requires a
judgment about the response, it is often deemed less There are many studies that describe aspects of
efficient than a recognition method. Most often, vocabulary without specifically addressing the questions
recognition vocabulary is measurably larger than of how vocabulary instruction is conducted. The Panel
productive vocabulary. does recognize the importance of many of these studies
in designing vocabulary instruction, but the Panel did not
Another difficulty with the measurement of vocabulary
analyze these studies unless they contained at least
is that we can only ask a learner for a relatively small
some experimental work on instructional methods.
number of words. Those words must be representative
of a larger pool of vocabulary items. In short, we can Additional bibliographic searching was conducted,
never know exactly how large a vocabulary an guided by three meta-analyses (Stahl & Fairbanks,
individual has. Instead, we often measure only specific 1986; Klesius & Searls, 1990; Fukkink & de Glopper,
vocabulary items that we want the individual to know, 1998) and two reviews of the literature on vocabulary
for example, in the context of a reading or a science instruction research (Nagy & Scott, in press;
lesson. Standardized tests attempt to deal with this by Blachowicz & Fisher, in press). These procedures
selecting words that differ widely in their familiarity.
• Meyerson, Ford, and Jones (1991) found that 5th Computer Use for Vocabular
ocabulary y Instruction
graders were more likely than 3rd graders to assign A small but clear trend in recent years shows computer
science vocabulary into conceptual groupings. technology making inroads in literacy and literacy
• Tomesen and Aarnoutse (1998) studied reciprocal instruction. Four studies that employ computers for
teaching and direct instruction in deriving word vocabulary instruction appear in the database. These
meanings from context as provided to 4th graders; studies show learning gains with computer use as
the instruction was more helpful for poor readers compared to traditional methods or when computers are
rather than average readers. used as an ancillary aid.
• Robbins and Ehri (1994) found that storybook • Heller, Sturner, Funk, and Feezor (1993) examined
readings helped teach children meanings of the issue of cognitive demands of technology for
unfamiliar words; those with larger entering preschool learners, by studying the effect of
vocabularies learned more words. different input devices (touch screen vs. keyboard)
on vocabulary identification. They concluded that
• Nicholson and Whyte (1992) explored how 8- to
the greater cognitive demands of keyboard use
10-year-old students learned vocabulary from
disrupted the children’s ability to process the limited
incidental exposure (listening to stories). The largest
acoustic information available in speech.
effects were for high-ability students. They propose
that low-ability and average students should do • Reinking and Rickman (1990) found that 6th grade
more independent reading with a dictionary than students receiving computer instruction of difficult
listening to stories. text words with electronic text scored higher on
vocabulary measures than students reading printed
• McGivern and Levin (1983) reported positive
pages with dictionaries or glossaries.
effects for the keyword method, with greater
effects for low- than for high-ability students; the • Heise, Papelweis, and Tanner (1991) compared 3rd
low-ability students had more difficulty in and 6th through 8th grade students in conditions
operationalizing dual components of the task. with computer-assisted and conventional direct
instruction; the trend was for improved
performance with computer assistance, although • Medo and Ryder (1993) found that text-specific
the difference was not statistically significant. vocabulary instruction prior to reading expository
• Davidson, Elcock, and Noyes (1996) used a texts helped 8th grade students to make causal
computer that gave speech prompts when the connections and that this method benefited both
learner requested them; 5- to 7-year-old students average and high-ability students.
improved on three measures of vocabulary with However, one study found that vocabulary instruction
these prompts. did not transfer to general reading comprehension.
Tomesen and Aarnoutse (1998) conducted vocabulary
Vocabular
ocabulary y Instruction Ef fects on Compr
Effects ehension
Comprehension instruction in the context of reciprocal teaching for 4th
In this category are studies that attempt to map the grade students. They used direct instruction in deriving
causal relationships between vocabulary and word meanings from context and found it to be more
comprehension. The following studies underscore the helpful for poor readers than for average readers, but
notion that comprehension gains and improvement on they reported a lack of transfer to general reading
semantic tasks are results of vocabulary learning. comprehension.
Although all of these studies focus on vocabulary, they
also typify the heterogeneity among definitions and Keyword Method
implementations of vocabulary instruction. In the database, some positive findings with the
• Beck, Perfetti, and McKeown (1982) demonstrated keyword method research indicate that this method may
that 4th graders receiving vocabulary instruction significantly augment recall, and may be more helpful
performed better on semantic tasks than those who than many other vocabulary instruction methods. One
did not receive instruction. study found that the keyword method interacts with
student ability levels, and that low-ability students had
• McKeown, Beck, Omanson, and Perfetti (1983) considerably more difficulty with certain keyword
also found that vocabulary instruction had a strong methods than high-ability students. However, another
relation to text comprehension for 4th grade study reported that the initial keyword gains were
students. temporary, fading out within a week.
• Wixson (1986) examined teaching the concept vs. • Levin and colleagues (1984) noted gains for 4th and
dictionary definitions and showed that pre-teaching 5th grade students with the keyword method as
vocabulary words for understanding was effective, compared to semantic and contextual analysis
although the precise effects were unclear because methods in the short term. However, the advantage
of interaction with story. had faded in the 1-week-delayed test.
• Carney, Anderson, Blackburn, and Blessings (1984) • Levin, McCormick, Miller, and Berry (1982) found
found that for 5th grade students, pre-teaching that 4th grade students outperformed controls in
vocabulary words had a significant effect on vocabulary acquisition with the keyword method as
retention and acquisition of social studies content. compared to the picture context, control and
• Kameenui, Carnine, and Freschi (1982) found that experiential context conditions.
substitution of easy for hard vocabulary words, • Levin, Levin, Glasman, and Nordwall (1992) found
inclusion of redundant information, and instruction strong effects for 3rd, 4th, 7th, and 8th grade
on difficult words facilitated comprehension. students when comparing the keyword method to
• Stahl and Fairbanks (1986) conducted a meta- free study and science context vocabulary methods.
analysis and concluded that vocabulary instruction • McGivern and Levin (1983) found that 5th grade
was an important component for comprehension. students showed positive effects of the keyword
The best instructional techniques were mixes of method. However, there was more of a difference
definitional and contextual programs; the keyword for low-ability students than for high-ability
method produced some significant gains in recall. students, although low-ability students had more
Repeated exposures to words were also found to difficulty in operationalizing the components of the
be effective. task.
Indirect Learning Effects • Stahl, Richek, and Vandevier (1991) evaluated the
Because of the rapid rate at which vocabulary is indirect learning of vocabulary words among 6th
acquired, it has always been assumed that much grade students designated as less able readers and
vocabulary was learned incidentally. One instantiation found that the students were able to learn a
of this method is found in vocabulary learning in the significant number of vocabulary words from
context of storybook reading. Recent research studies listening to orally presented passages.
in the area suggest that indirect learning can definitely Two studies revealed great detail about the actual
occur, and that vocabulary can be acquired through process of vocabulary learning by examining the
incidental exposure. In addition, one particular study characteristics of words that were most conducive to
(Schwanenflugel, Stahl, & McFall, 1997) is important vocabulary acquisition. Schwanenflugel, Stahl, and
because it looks beyond the issue of whether word McFalls (1997) found that among their 4th grade
acquisition occurs from reading, examining the sample, certain word characteristics had a significant
characteristics of words and texts that were most impact on vocabulary learned from reading stories. In
amenable to vocabulary acquisition from stories. In this particular, non-noun words (verbs, adverbs, and
study of 4th grade students, researchers found that non- adjectives) were learned better than nouns, and
noun words (adverbs, verbs, and adjectives) were concrete words (high in imageability) were learned
easier to learn than nouns and that words with high more readily than less easily imageable words. The
imageability were easier to learn from the stories. authors conclude that the characteristics of vocabulary
• Robbins and Ehri (1994) demonstrated that words are more important variables in the learning of
storybook readings helped teach children meanings vocabulary words from stories than are text features
of unfamiliar words. However, those with larger (word repetitions, contextual support, etc.). Another
entering vocabulary learn more words. study, McFalls, Schwanenflugel, and Stahl (1996),
examined the impact of semantic variables related to
• Leung (1992) studied kindergartners and 1st grade concreteness on the development of reading vocabulary
students, finding that the frequency of a target word among a predominantly African American and low SES
in stories influenced the occurrence of the word in 2nd grade sample. They found that the children read
the child’s retellings and that read-aloud events abstract words with less accuracy than concrete words
seemed to help children to learn new words by on tasks of recognition and reading accuracy and that
incidental learning. the concreteness of the words determined whether
• Senechal and Cornell (1993) found that for 4- to 5- children were able to remember them and to learn to
year-old children, one single book reading was read them more easily.
enough to significantly improve new expressive
The nature of the interaction (emphasizing active
vocabulary of ten target words in the stories. In a
participation) during storybook readings may also have
delayed transfer test after 1 week, 5-year-old
an impact on learning. Three studies found that student-
children remembered more than 4-year-olds.
initiated talk or active participation was important.
• Nicholson and Whyte (1992) explored student
vocabulary learning through incidental exposure by • Dickinson and Smith (1994) examined storybook
having children 8 to 10 years old listen to stories; readings for preschoolers and the effects of teacher
the largest effects were for high-ability students. talk on vocabulary acquisition and concluded that
They proposed that low-ability and average-ability the amount of child-initiated analytic talk was
students do more independent reading with a important for vocabulary gains.
dictionary than listening to stories. • Senechal (1997) found that for pre-kindergarten
• Stewart, Gonzalez et al. (1997) examined children, repeated readings of a story created
acquisition of sight-reading vocabulary learned greater performance gains in vocabulary. Students
incidentally during articulation training and found learned more from answering questions during
that this learning generalized beyond printed words readings than they did when simply listening to the
on cards to words on a list. narrative.
• Drevno, Kimball, Possi, Heward, Gardner, and • Kameenui, Carnine, and Freschi (1982) found that
Barbetta (1994) examined the effects of active providing redundant information facilitated
student response (ASR) error correction on the comprehension and that instruction on difficult
learning of science vocabulary for a small group of vocabulary words also helped vocabulary learning
elementary students. In the ASR condition, when a in grades 4 through 6.
student made an error, the teacher modeled the • Dole, Sloan, and Trathen (1995) worked with 10th
correct definition and the student repeated it, but in grade students on an “alternative” vocabulary
the no response (NR) condition, students would not treatment condition: teach students how to select
repeat the definition. ASR was found to be superior relevant words, learn the words on a deep level,
to the NR error-correction condition on all the and discuss them. These students outscored
dependent variables. students taught with the traditional conditions in
Vocabular
ocabularyy Gains Fr om Repeated, Multiple
From which students did not learn this criterion or discuss
Exposures the words in context.
One trend that was strongly reflected in the database Pr e-instruction of Vocabular
Pre-instruction ocabulary y Words
was that high frequency and multiple, repeated It has been a given for reading instruction in almost
exposures to vocabulary material are important for every formal lesson format that vocabulary instruction
learning gains. In accordance with this finding, a trend will occupy a central part of the lesson, typically prior to
was also noted that extended and rich instruction of reading. This pre-instruction has often been justified on
vocabulary (applying words to multiple contexts, etc.) the basis of making the passage easier to comprehend
was superior to less comprehensive methods. The by reducing the cognitive load during subsequent
following studies share this finding: reading. In fact, a few studies suggest that pre-
• Senechal (1997) found that for pre-kindergarten instruction of vocabulary words facilitates both
children, repeated readings of a story were vocabulary acquisition and comprehension.
associated with greater performance gains in • Brett, Rothlein, and Hurley (1996) found that 4th
vocabulary. grade students who were given pre-instruction of
• Leung (1992) studied kindergarten and 1st grade target words in the story had greater vocabulary
students, finding that the frequency of a target word gains than the children in the non-instructional
in stories influenced occurrence of the word in a control group.
child’s retellings. • Wixson (1986) pre-taught vocabulary words to
• Daniels, M. (1994) showed that pre-K students grade students. Although there were some gains in
who learned American Sign Language (ASL) did understanding, the instructional treatment (concept
significantly better than controls on the Peabody vs. dictionary) effects were unclear because of
Picture Vocabulary Test (PPVT). In a 1996 study, interaction with story.
Daniels also found that kindergarten students who • Carney, Anderson, Blackburn, and Blessing (1984)
learned ASL did significantly better on language also pre-taught vocabulary to 5th grade students;
development and vocabulary growth measures of the treatment had a significant effect on retention
the PPVT than those who had not learned ASL. and acquisition of social studies content.
Effect of Rich Contexts on Vocabular
Effect ocabularyy Gr owth
Growth Restructuring the Task
• McKeown, Beck, Omanson, and Pople (1985) One emergent trend in the database is the restructuring
found that 4th graders performed well with of the task (materials or procedures) in various ways to
instruction that extended beyond single class facilitate vocabulary acquisition and comprehension. A
periods and involved multiple exposures in authentic way of doing this is to alter the passage, such as
contexts. The instruction added activities to extend substituting easy for hard words. Another is clarifying
use of learned words beyond the classroom and the task of learning vocabulary definitions for students,
high-frequency encounters with words. such as teaching what components make a good
definition, and selecting relevant words. Group-assisted
reading in student dyads also yielded significant vocabulary program. The 7th and 8th grade
vocabulary gains over the comparison, unassisted students in the reciprocal peer-tutoring group had
group. Although the diversity among these studies is a significantly higher scores on weekly vocabulary
salient feature, the following studies did found positive quizzes.
results with a wide range of task alterations:
Context Method
• Kameenui, Carnine, and Freschi (1982) found that The research dealing with contextual approaches to
providing redundant information facilitated vocabulary acquisition yielded some interesting findings
comprehension and that instruction on difficult on the role of context and definitional approaches. In
vocabulary words also helped vocabulary learning accordance with the research findings on rich, extended
in grades 4 through 6. instruction and multiple exposures to words, one
• Gordon, Schumm, Coffland, and Doucette (1992) emerging trend was the possibility that the mix of
revised text versions to help define vocabulary definitional and contextual approaches worked better
words for 5th grade students. Using these revised than either method used alone. Two studies reflect this
texts helped students understand passages better. finding. Kolich (1991) provided computer-assisted
practice for 11th grade students; those receiving mixed
• Schwartz and Raphael (1985) clarified the task of
instruction (context optional word choices and
defining a word for 4th and 5th grade students,
definitional) scored highest. Similarly, Stahl (1983) found
giving them the components of a definition; this
those 5th grade students receiving a mixed treatment
increased students’ independent vocabulary
(definitional and contextual) outscored both students
acquisition.
receiving the definitional alone and the students in the
• Scott and Nagy (1997) evaluated the effect of control conditions.
altering presentation of vocabulary definitions
(traditional dictionary definition with or without a However, some studies found specific gains using a
sample sentence and definitions that were single approach. Margosein, Pascarella, and Pflaum
specifically written to be easier to understand) on (1982) worked with junior high school students and
the learning of novel vocabulary words. In general, found significant effects for semantic mapping over
regardless of the type of definition given, both the context-rich or target-word treatment; their work
4th and 6th grade students scored poorly on the suggests that students should focus on word with
task of assessing whether vocabulary usage was similarities to other known words. Gipe and Arnold
consistent with the definition in sentence fragments. (1979) compared several vocabulary methods for 3rd
However, small but significant gains were found and 5th grade students: instruction from context,
when students were given sample sentences along association, dictionary, and category. They found the
with the definitions. highest gains for the context method.
• Wu and Solman (1993) investigated the effects of Several studies demonstrated that direct instruction in
extrapictorial prompts on the learning of words by learning word meanings was helpful for vocabulary
kindergartners. They found that the best learning acquisition.
occurred equally in two circumstances: in the • Tomesen and Aarnoutse (1998) included
absence of the pictorial prompts where words were vocabulary instruction for 4th grade students in a
presented alone, and in a feedback cueing program of reciprocal teaching. Students were
condition. given direct instruction in deriving word meanings
• Eldredge (1990) devised a group-assisted reading from context. This was found to be more helpful for
method for 3rd grade students. The vocabulary poor than for average readers, but there was no
gains for students reading in dyads were greater transfer to general reading comprehension
than for the comparison group of unassisted • White, Graves, and Slater (1990) explored the need
students who did independent reading. for assisting minority or disadvantaged children in
• Malone and McLaughlin (1997) compared grades 1 through 4 and found that direct instruction
reciprocal peer tutoring with a traditional
in meaning and decoding may help them to an It was possible to determine what types of assessments
extent. (standardized of experimenter-generated) were used in
• Dole, Sloan, and Trathen (1995) worked with 10th 37 of the studies as dependent variables. Figure 2 on
grade students on an “alternative” vocabulary the next page shows the distribution of studies in the
treatment condition: teaching students how to select database as a function of the type of assessment used.
relevant words, learn them on a deep level, and There were six studies that used standardized
discuss them. These students outscored students assessments as the only dependent variable. One of
taught with the traditional conditions in which these studies used two measures. There was almost no
students did not learn to this criterion or discuss the overlap in the type of standardized measures used, with
words in context. six different instruments represented.
• Rinaldi, Sells, and McLaughlin (1997) worked with One other feature in the data was that of the 50 studies
10.8- to 11.5-year-old students and 3rd graders with coded, 32 administered pretests. Of these 32 studies, 17
reading difficulties to examine effectiveness of a used standardized tests. There were 11 different
drill and practice intervention on sight word instruments represented in the total.
acquisition. During the intervention, all the students
more than doubled their correct rates in oral reading These analyses seem to suggest two implications that
and reduced their numbers of errors. might be drawn for practice. First, the standardized
tests did not seem to be sufficiently sensitive to
• Dana and Rodriguez (1992) studied the effects of
vocabulary changes to be used as dependent measures.
the TOAST (test, organize, anchor, say, test)
For practice, this would suggest that assessing
method of vocabulary learning as compared to
vocabulary growth would be best done with teacher-
various student-selected methods of vocabulary
generated instruments as at least one component of
instruction among 6th grade students. They found
evaluation. It also suggests that there may be a need for
that students using the TOAST method scored
the development of standardized measures that are
higher than those using student-selected methods on
much more sensitive to the nuances and complexities
measures of both immediate and delayed retention
involved in vocabulary acquisition. A further implication
of words.
is that standardized instruments appear to be useful for
• Stump, Lovitt, Fister, Kemp, Moore, and Schroeder general screening pretests. Again, the implication for
(1992) assessed the effects of a precision teaching practice might be that standardized tests could be used
intervention for general and special education. to identify students who need vocabulary instruction.
Assessments of timed vocabulary quizzes supported However, a note of caution is critical here. These
the finding that the majority of students in the study implications are tentative and need to be researched
scored higher on measures of accuracy and before being implemented.
fluency.
Despite the relatively small body of data available, the
Results and Discussion collective body of research clearly indicates that
vocabulary increases with instruction of many different
Measurement of Vocabulary sorts.
What is available on the issue of measuring vocabulary,
despite the noted research gap, is some implicit Direct and Indirect Instruction
evidence, which the Panel provided in a breakdown of It is clear that vocabulary should be taught both directly
the types of measures that have been used by and indirectly. Vocabulary instruction should be
researchers studying vocabulary. To obtain this incorporated into reading instruction. There is a need
information the Panel tallied, for each study, whether for direct instruction of vocabulary items that are
the vocabulary assessment instrument was standardized required for a specific text to be read as part of the
or experimenter-generated. In some of the studies, lesson. Direct instruction was found to be highly
vocabulary was assessed with a pretest as well as a effective for vocabulary learning (Tomeson &
posttest. Aarnoutse, 1998; White, Graves, & Slater, 1990; Dole,
Sloan, & Trathen, 1995; Rinalid, Sells, & McLaughlin, Repetition and Multiple Exposures
1997). In addition, the more connections that can be It also seems clear from the Panel’s data set that
made to a specific word, the better it seems to be having students encounter vocabulary words often and
learned. For example, there is empirical evidence in various ways can have a significant effect (Senechal,
indicating that making connections with other reading 1997; Leung, 1992; Daniels, 1994, 1996; Dole, Sloan, &
material or oral language in other contexts seems to Trathen, 1995). Although not a surprising finding, this
have large effects. does have direct implications for instruction. Students
should not only repeat vocabulary items in learning; they
Pre-instruction of vocabulary in reading lessons can
should be given items that will be likely to appear in
have significant effects on learning outcomes (Brett,
many other contexts.
Rothlein, & Hurley, 1996; Wixson, 1986; Carney,
Anderson, Blackburn, et al., 1984). At least, it Context
guarantees that there will be fewer unfamiliar concepts
In much the same way that multiple exposures are
in the material to be read. It also helps in making the
important, the context in which a word is learned is
translation of print to speech meaningful by trying to
critical (McKeown, Beck, Omanson, and Pople, 1985;
guarantee that the vocabulary items are in the oral
Kameenui, Carnine, & Freschi, 1982; Dole, Sloan, &
language of the reader. Because almost all early
Trathen, 1995). Vocabulary words should be words that
reading is based on oral language, this is a critically
the learner will find useful in many contexts. To that
important implication.
end, a large portion of vocabulary items should be
derived from content learning materials. This would
serve at least two functions: first, it would assist the
learner in dealing with the specific reading matter in modalities to the teaching of vocabulary and,
content area materials; second, it would provide the consequently, helping ensure more effective vocabulary
learner with vocabulary that would be encountered learning. The availability of online access to vocabulary
sufficiently often to make the learning effort definitions combines both of these possibilities.
worthwhile.
Implicit Learning
Task Restructuring It is both a theoretical and an empirical fact that not all
Direct vocabulary instruction often assumes that the vocabulary can or must be learned through formal
learner is fully aware of what the task is and how to instruction and that vocabulary words can also be
complete it. However, restructuring tasks can ensure learned through incidental and indirect ways (Robbins
this. Some empirical research has demonstrated the & Ehri, 1994; Leung, 1992; Senechal & Cornell, 1993;
efficacy of being certain that students fully understand Nicholson & Whyte, 1992; Stewart et al., 1997).
the task and the components of vocabulary learning, Estimates of vocabulary size seem to suggest that there
rather than creating a focus only on the words to be would never be sufficient classroom time to instruct
learned (Schwartz & Raphael, 1985). Restructuring the students to the level of their acquired vocabulary. This
task, such as group learning or revising learning implies that much of a student’s vocabulary will have to
materials, can also lead to increased vocabulary be learned in the course of doing things other than
learning (Kameenui, Carnine, & Freschi, 1982; Gordon, explicit vocabulary learning. Students may well pick up
Schumm, Coffland, and Doucette, 1992; Wu & Solman, vocabulary in contexts different from the formal
1993; Eldredge, 1990; Malone & McLaughlin, 1997). learning of a classroom reading group. It may even be
This seems to be most effective for low-achieving or that the vocabulary acquired in this way is more
at-risk students. memorable, given the role of motivation in its acquisition
because the vocabulary acquired in this way may be far
Active Engagement more useful. Repetition, richness of context, and
The few studies that addressed active engagement in motivation may also add to the efficacy of incidental
learning all reported results consistent with conventional learning.
wisdom about learning: Active learning is best. When
students were engaged in the tasks in which they were Assessment and Evaluation of Vocabular ocabularyy
learning vocabulary, they had larger gains (Dickinson & Although there is no research in the NRP database that
Smith, 1994; Senechal, 1997; Drevno et al., 1994; bears directly on the issue of how vocabulary is
Daniels, 1994, 1996). This suggests that vocabulary assessed, the Panel believes that the way vocabulary is
learning tasks that advance other knowledge would be measured can have differential effects on instruction.
more effective. The Panel bases this belief on several things. First, the
plethora of ways in which vocabulary was measured
Computer Technology and evaluated in the studies in our database clearly
While the use of computer technology in reading is still indicates that there is no single standard. Consequently,
in its infancy, the few studies reported in the literature the Panel suggests that using more than a single
suggest that this may be a powerful way of increasing measure of vocabulary is critical for sound evaluation.
vocabulary (Reinking & Rickman, 1990; Heise et al., Second, each way of measuring vocabulary produces
1991; Davidson, Elcock, & Noyes, 1996; Heller, different results. Furthermore, the category of
Sturner, Funk & Feezor, 1993). Two possibilities arise vocabulary being measured varies. Receptive
here. The first is that the computer might be used as an vocabulary is clearly different from productive
adjunct to direct vocabulary instruction. In this way, vocabulary, and sight vocabulary is yet another concept.
students could obtain more practice in learning Finally, the fact that the Panel found most of the
vocabulary. A second possibility is that computer researchers using their own instruments to evaluate
technology could bring to bear many different media. vocabulary suggests the need for this to be adopted in
This is one way of adding a number of different pedagogical practice. That is, the more closely the
assessment matches the instructional context, the more
appropriate the conclusions about the instruction will be.
Standardized tests provide a global measure of 6. Computer technology can be used to help teach
vocabulary and may be used to provide a baseline. Few vocabulary.
researchers depended on standardized instruments to
7. Vocabulary can be acquired through incidental
assess the efficacy of the instruction they studied. The
learning.
implication for practice is the same: instruments that
match the instruction will provide better information 8. How vocabulary is assessed and evaluated can
about the specific learning of the students related have differential effects on instruction.
directly to that instruction. The implications for the use
of standardized instruments need to be viewed as 9. Dependence on a single vocabulary instruction
tentative until the findings can be confirmed by method will not result in optimal learning.
instructional research. Directions for Further Research
Single vs. Multiple Methods of Instruction The following questions do not seem to have clear
The Panel is reluctant to suggest a single method of answers in the research reviewed for this report. They
learning vocabulary because there were rarely more are questions at a relatively high level of generality and
than a few studies on each individual method. The are not, in the present form, researchable. That is, they
categories represented in the earlier discussion and the need to be translated into the appropriate variables,
summary of specific methods in Table 1 (Appendix A) operations, and data collection techniques before
reinforce this point. A comprehensive analysis of the research can be conducted.
collective research studies suggests that a variety of
direct and indirect methods of vocabulary instruction The need in vocabulary instruction research is great.
can be effective. Effective instructional methods Our knowledge of vocabulary acquisition exceeds our
emphasized multimedia aspects of learning, richness of knowledge of pedagogy. That is, the Panel knows a
context in which words are to be learned, active student great deal about the ways in which vocabulary
participation, and the number of exposures to words increases under highly controlled conditions, but the
that learners will receive. Panel knows much less about the ways in which such
growth can be fostered in instructional contexts. There
Moreover, the age and ability effects discussed above is a great need for the conduct of research on these
suggest that different methods may be differentially topics in authentic school contexts, with real teachers,
effective. In light of this, dependence on a single under real conditions.
method would be a risky course of action.
1. What are the best ways to evaluate vocabulary
Implications for Reading Instruction size, use, acquisition, and retention? What is the role
of standardized tests, what other measures should
Based on these trends in the data, the Panel offers the be used, and under what circumstances?
following implications for practice:
2. Given the preliminary findings that age and ability
1. Vocabulary should be taught both directly and levels can affect the efficacy of various vocabulary
indirectly. instruction methods (Tomesen & Aarnoutse, 1998;
2. Repetition and multiple exposures to vocabulary Robbins & Ehri, 1994; Nicholson & Whyte, 1992;
items are important. McGivern & Levin, 1983), what are the specific
vocabulary instruction needs of students at different
3. Learning in rich contexts is valuable for vocabulary grade and ability levels?
learning.
3. What are the more general effects of vocabulary
4. Vocabulary tasks should be restructured when instruction across the grades?
necessary.
4. Empirical support has been found for the facilitation
5. Vocabulary learning should entail active of vocabulary learning with computers as ancillary
engagement in learning tasks. aids and replacements of other technologies
(Reinking & Rickman, 1990; Heise, 1991;
Davidson, Elcock, & Noyes, 1996). What is the 7. What are the optimal combinations of the various
optimal use of computer and other technologies in methods of vocabulary instruction, including direct
vocabulary instruction? What is the precise role of and indirect instruction, and of different methods
multimedia learning in vocabulary acquisition? within these categories?
5. What is the precise role of multimedia learning in 8. What sort of professional development is needed
vocabulary instruction across the grades? for teachers to become proficient in vocabulary
instruction?
6. How should vocabulary be integrated into
comprehension instruction for optimal benefit to the
student?
Brett, A., Rothlein, L., & Hurley, M. (1996). Durkin, D. (1993). Teaching them to read. (6th ed.)
Vocabulary acquisition from listening to stories and Boston, MA: Allyn and Bacon.
explanations of target words. Elementary School
Journal, 96(4), 415-422. Eldredge, J. L. (1990). Increasing the performance
of poor readers in the third grade with a group-assisted
Carney, J. J., Anderson, D., Blackburn, C., & strategy. Journal of Educational Research, 84(2), 69-77.
Blessing, D. (1984). Preteaching vocabulary and the
comprehension of social studies materials by Fukkink, R. G., & de Glopper, K. (1998). Effects of
elementary school children. Social Education, 48(3), instruction in deriving word meaning from context: A
195-196. meta-analysis. Review of Educational Research, 68(4),
450-469.
Dana, C., & Rodriguez, M. (1992). TOAST: A
system to study vocabulary. Reading Research and Gipe, J. P., & Arnold, R. D. (1979). Teaching
Instruction, 31(4), 78-84. vocabulary through familiar associations and contexts.
Journal of Reading Behavior, 11(3), 281-285.
Daniels, M. (1994). The effect of sign language on
hearing children’s language development. Gordon, J., Schumm, J. S., Coffland, C., &
Communication Education, 43(4), 291-298. Doucette, M. (1992). Effects of inconsiderate vs.
considerate text on elementary students’ vocabulary
Daniels, M. (1996). Bilingual, bimodal education for learning. Reading Psychology, 13(2), 157-169.
hearing kindergarten students. Sign Language Studies,
90, 25-37. Heise, B. L., Papalewis, R., & Tanner, D. E.
(1991). Building base vocabulary with computer-
Davidson, J., Elcock, J., & Noyes, P. (1996). A assisted instruction. Teacher Education Quarterly, 18(1),
preliminary study of the effect of computer-assisted 55-63.
practice on reading attainment. Journal of Research in
Reading, 19(2), 102-110. Heller, J. H., Sturner, R. A., Funk, S. G., & Feezor,
M. D. (1993). The effect of input mode on vocabulary
Davis, F. B. (1942). Two new measures of reading identification performance at low intensity. Journal of
ability. Journal of Educational Psychology, 33, 365-372. Educational Computing Research, 9(4), 509-518.
Dickinson, D. K., & Smith, M. W. (1994). Long- Jenkins, J. R., Matlock, B., & Slocum, T. A. (1989).
term effects of preschool teachers’ book readings on Two approaches to vocabulary instruction: The teaching
low-income children’s vocabulary and story of individual word meanings and practice in deriving
comprehension. Reading Research Quarterly, 29, 104- word meaning from context. Reading Research
122. Quarterly, 24(2), 215-235.
Kameenui, E., Carnine, D., & Freschi, R. (1982). Margosein, C. M., Pascarella, E. T., & Pflaum, S.
Effects of text construction and instructional procedures W. (1982). The effects of instruction using semantic
for teaching word meanings on comprehension and mapping on vocabulary and comprehension. Journal of
recall. Reading Research Quarterly, 17(3), 367-388. Early Adolescence, 2(2), 185-194.
Klesius, J. P., & Searls, E. F. (1990). A meta- McFalls, E. L., Schwanenflugel, P. J., & Stahl, S.
analysis of recent research in meaning vocabulary A. (1996). Influence of word meaning on the acquisition
instruction. Journal of Research and Development in of a reading vocabulary in second-grade children.
Education, 23(4), 226. Reading and Writing, 8(3), 235-250.
Kolich, E. M. (1991). Effects of computer-assisted McGivern, J. E., & Levin, J. R. (1983). The
vocabulary training on word knowledge. Journal of keyword method and children’s vocabulary learning: An
Educational Research, 84(3), 177-182. interaction with vocabulary knowledge. Contemporary
Educational Psychology, 8(1), 46-54.
Koury, K. A. (1996). The impact of preteaching
science content vocabulary using integrated media for McKeown, M. G., Beck, I. L., Omanson, R. C., &
knowledge acquisition in a collaborative classroom. Perfetti, C. A. (1983). The effects of long-term
Journal of Computing in Childhood Education, 7(3-4), vocabulary instruction on reading comprehension: A
179-197. replication. Journal of Reading Behavior, 15(1), 3-18.
Leung, C. B. (1992). Effects of word-related McKeown, M. G., Beck, I. L., Omanson, R. C., &
variables on vocabulary growth repeated read-aloud Pople, M. T. (1985). Some effects of the nature and
events. In C. K. Kinzer & D. J. Leu (Eds.), Literacy frequency of vocabulary instruction on the knowledge
research, theory, and practice: Views from many and use of words. Reading Research Quarterly, 20(5),
perspectives: Forty-first Yearbook of the National 522-535.
Reading Conference (pp. 491-498). Chicago, IL: The
National Reading Conference. Medo, M. A., & Ryder, R. J. (1993). The effects of
vocabulary instruction on readers’ ability to make
Levin, J. R., Levin, M. E., Glasman, L. D., & causal connections. Reading Research and Instruction,
Nordwall, M. B. (1992). Mnemonic vocabulary 33(2), 119-134.
instruction: Additional effectiveness evidence.
Contemporary Educational Psychology, 17(2), 156-174. Meyerson, J. J., Ford, M. S., & Jones, W. P.
(1991). Science vocabulary knowledge of third and fifth
Levin, J., Johnson, D., Pittelman, S., Levin, K., grade students. Science Education, 75(4), 419-428.
Shriberg, L., Toms-Bronowski, S., & Hayes, B. (1984).
A comparison of semantic- and mnemonic-based Nagy, W., & Scott, J. (in press). Vocabulary
vocabulary-learning strategies. Reading Psychology, processes. In M. Kamil, P. Mosenthal, P. D. Pearson,
5(1-2), 1-15. & R. Barr, (Eds.). Handbook of reading research
(Vol. 3). Mahwah, NJ: Lawrence Erlbaum Associates.
Levin, J., McCormick, C., Miller, G., & Berry, J.
(1982). Mnemonic versus nonmnemonic vocabulary- Nicholson, T., & Whyte, B. (1992). Matthew
learning strategies for children. American Educational effects in learning new words while listening to stories.
Research Journal, 19(1), 121-136. In C. K. Kinzer & D. J. Leu (Eds.), Literacy research,
theory, and practice: Views from many perspectives:
Malone, R. A., & McLaughlin, T. F. (1997). The Forty-first Yearbook of the National Reading
effects of reciprocal peer tutoring with a group Conference (pp. 499-503). Chicago, IL: The National
contingency on quiz performance in vocabulary with Reading Conference.
seventh- and eighth-grade students. Behavioral
Interventions, 12(1), 27-40.
Reinking, D., & Rickman, S. S. (1990). The effects Stahl, S. (1983). Differential word knowledge and
of computer-mediated texts on the vocabulary learning reading comprehension. Journal of Reading Behavior,
and comprehension of intermediate-grade readers. 15(4), 33-50.
Journal of Reading Behavior, 22(4), 395-411.
Stahl, S. A., & Fairbanks, M. M. (1986). The
Rinaldi, L., Sells, D., & McLaughlin, T. F. (1997). effects of vocabulary instruction: A model-based meta-
The effects of reading racetracks on the sight word analysis. Review of Educational Research, 56(1), 72-
acquisition and fluency of elementary students. Journal 110.
of Behavioral Education, 7(2), 219-233.
Stahl, S. A., Richek, M. A., & Vandevier, R. J.
Robbins, C., & Ehri, L. C. (1994). Reading (1991). Learning meaning vocabulary through listening:
storybooks to kindergartners helps them learn new A sixth-grade replication. In J. Zutell & S. McCormick
vocabulary words. Journal of Educational Psychology, (Eds.), Learner factors/teacher factors: Issues in
86(1), 54-64. literacy research and instruction: Fortieth Yearbook of
the National Reading Conference (pp. 185-192)
Ryder, R. J., & Graves, M. F. (1994). Vocabulary Chicago, IL: The National Reading Conference.
instruction presented prior to reading in two basal
readers. Elementary School Journal, 95(2), 139-153. Stevens, R. J., Slavin, R. E., & Farnish, A. M.
(1991). The effects of cooperative learning and direct
Schwanenflugel, P. J., Stahl, S. A., & McFalls, E. instruction in reading comprehension strategies on main
L. (1997) Partial word knowledge and vocabulary idea identification. Journal of Educational Psychology,
growth during reading comprehension. Journal of 83(1), 8-16.
Literacy Research, 20, 531-553.
Stewart, S. R., Gonzalez, L. S., & Page, J. L.
Schwartz, R. M., & Raphael, T. E. (1985). (1997). Incidental learning of sight words during
Instruction in the concept of definition as a basis for articulation training. Language, Speech, and Hearing
vocabulary acquisition. In J. A. Niles & R.V. Lalik Services in Schools, 28(2), 115-125.
(Eds.), Issues in literacy: A research perspective:
Thirty-fourth Yearbook of the National Reading Stump, C. S., Lovitt, T. C., Fister, S., Kemp, K.,
Conference (pp. 116-124). Rochester, NY: The Moore, R., & Shroeder, B. (1992). Vocabulary
National Reading Conference. intervention for secondary-level youth. Learning
Disability Quarterly, 15(3): 207-222.
Scott, J. A., & Ehri, L. C. (1990). Sight word
reading in prereaders: Use of logographic vs. alphabetic Tomesen, M., & Aarnoutse, C. (1998). Effects of
access routes. Journal of Reading Behavior, 22(2), 149- an instructional programme for deriving word meanings.
166. Educational Studies, 24(1), 107-128.
Scott, J., & Nagy, W. (1997). Understanding the Whipple, G. (Ed.). (1925). The Twenty-fourth
definitions of unfamiliar verbs. Reading Research Yearbook of the National Society for the Study of
Quarterly, 32, 184-200. Education: Report of the National Committee on
Reading. Bloomington, IL: Public School Publishing
Senechal, M. (1997). The differential effect of Company.
storybook reading on preschoolers’ acquisition of
expressive and receptive vocabulary. Journal of Child White, T. G., Graves, M. F., & Slater, W. H.
Language, 24(1), 123-138. (1990). Growth of reading vocabulary in diverse
elementary schools: Decoding and word meaning.
Senechal, M., & Cornell, E. H. (1993). Vocabulary Journal of Educational Psychology, 82(2), 281-290.
acquisition through shared reading experiences.
Reading Research Quarterly, 28(4), 360-374.
Wixson, K. K. (1986). Vocabulary instruction and Wu, H.-M., & Solman, R. T. (1993). Effective use
children’s comprehension of basal stories. Reading of pictures as extra stimulus prompts. British Journal of
Research Quarterly, 21(3), 317-329. Educational Psychology, 63(1), 144-160.
Keyword Method Stud e nts are instructe d to le arn the me aning s o f ne w wo rd s b y le arning a ke ywo rd Le vin, Le vin, Glassman, & No rd wall, 1992;
"wo rd clue " fo r e ach vo cab ulary wo rd . The ke ywo rd s are usually wo rd s aco ustically McCarville , 1993; Le vin, Le vin, Glassman, &
similar to a salie nt p art o f the vo cab ulary wo rd . So me time s, re latio nal illustratio ns are No rd wall, 1992; Le vin, Le vin, Co tto n, Bartho lo me w,
sho wn to stud e nts, o r stud e nts are aske d to g e ne rate the ir o wn imag e s linking the Hasty, Hug he s, & To wnse nd , 1990; Pre ssle y, Le vin,
two wo rd s. Kuip e r, Bryant, & Mitche ne r, 1982; Atkinso n, 1975.
Semantic Mapping Stud e nts are taug ht the me aning s o f ne w vo cab ulary wo rd s b y cate g o rizing the m into Le vin, Le vin, Glassman, & No rd wall, 1992;
familiar to p ics with o the r kno wn wo rd s. Ne w wo rd s are le arne d b y id e ntifying McCarville , 1993; Le vin, Le vin, Glassman &
similaritie s and d iffe re nce s with re late d , kno wn wo rd s. Targ e t wo rd s are o fte n No rd wall, 1992; Le vin, Le vin, Co tto n, Bartho lo me w,
intro d uce d in cate g o rie s, and se mantic map s are d e ve lo p e d fo r e ach se t o f ite ms. Hasty, Hug he s, & To wnse nd , 1990; Pre ssle y, Le vin,
Kuip e r, Bryant, & Mitche ne r, 1982; Atkinso n, 1975.
4-33
Contextual Analysis Stud e nts use co nte xt clue s e mb e d d e d in p arag rap hs to he lp the m le arn me aning s o f Buike ma & Grave s, 1993; Frie d l&, 1992; Giffo rd ,
the targ e t vo cab ulary wo rd s. Usually, the wo rd s and d e finitio ns are the n re vie we d . 1993.
Sign Language Enrichme nt o f curriculum with sig n lang uag e fo r p re -kind e rg arte n he aring child re n to Danie ls, 1994, 1998.
imp ro ve the ir re ce p tive Eng lish vo cab ulary.
Wide Reading Liste ning /re ad ing sto rie s (with o r witho ut p re -e xp lanatio n o f targ e t wo rd s). So me And e rso n & Nag y, 1992; Rid d e ll, 1988; Elle y, 1988;
salie nt variab le s to co nsid e r includ e the numb e r o f e xp o sure s to the wo rd s, Krashe n, 1989.
fre q ue ncy o f b o o k re ad ing s, nature o f instructio n (using q ue stio ning , e tc.), wo rd
re d und ancy, and time b e twe e n re ad ing s.
Deriving Word Meanings Stud e nts are taug ht strate g ie s fo r d e riving me aning o f an unfamiliar wo rd . One To me so n, 1998; Je nkins, Matlo ck, & Slo cum, 1989.
e xamp le o f a strate g y is the SCANR me tho d (sub stitute a wo rd fo r unkno wn wo rd ;
che ck the co nte xt fo r clue s; ask if sub stitutio n fits co nte xt clue s; ne e d a ne w id e a?;
National Reading Panel
Appendices
Elaborate/Rich Instruction Stud e nts le arn to id e ntify the re latio nship b e twe e n wo rd s, re sp o nd to wo rd s b o th McKe o wn, Be ck, Omanso n, & Po p le , 1985; Stanle y
affe ctive ly and co g nitive ly, and ap p ly wo rd s to vario us co nte xts. Pro mo te s a & Ginthe r, 1991; Stahl, 1983.
stud e nt' s use o f wo rd s o utsid e o f vo cab ulary class and e licits p rio r kno wle d g e .
Roots/Affix Analysis Stud e nts use wo rd o rig in clue s and le arn the me aning s o f co mmo n ro o ts, p re fixe s, Irwin, 1991; Ryd e r & Grave s, 1994; Le vin, Carne y, &
and affixe s to d e te rmine vo cab ulary d e finitio ns. Pre ssle y, 1988.
Reports of the Subgroups
Dictionary/Glossary Stud e nts are g ive n d ictio narie s o r g lo ssarie s to find the d e finitio ns o f unkno wn Knig ht, 1994; Wixso n, 1986; Gip e & Arno ld , 1979.
wo rd s. Variatio ns o f this me tho d includ e g iving stud e nts p assag e s to re ad alo ng with
a d ictio nary o r g lo ssary to find the d e finitio ns o f unkno wn wo rd s, writing ne w
se nte nce s with the wo rd s, and co mp le ting wo rkshe e ts and cro sswo rd p uzzle s.
Frayer Model Fraye r mo d e l (and Grave ' s mo d ificatio n). A me tho d to te ach sp e cific ne w wo rd s Fraye r, Fre d e rick, & Klausme ie r, 1969; Grave s,
using a se ve n-ste p mo d e l. Basic te ne ts o f the Fraye r mo d e l includ e : g ive 1984, 1985; Ryd e r & Grave s, 1994.
wo rd /name and its re le vant attrib ute s, e liminate irre le vant attrib ute s, g ive e xamp le s,
g ive no ne xamp le s, and list sub o rd inate , sup e ro rd inate , and co o rd inate te rms.
Task Clarification With the p re mise that stud e nts have o nly a vag ue no tio n o f what co nstitute s a Guzze tti, Snyd e r, Glass, & Gamas, 1993; Hag g ard ,
d e finitio n, vo cab ulary instructio n is d e sig ne d to clarify the stud e nt' s kno wle d g e o f the 1982, 1985; Fishe r, Blacho wicz, & Smith, 1991;
task. Stud e nts are instructe d o n ways to g athe r info rmatio n fro m re le vant so urce s to Fishe r & Danie lse n, 1998; Palinscar & Bro wn, 1984.
unco ve r the co mp o ne nts o f a d e finitio n.
Computer/Multimedia Vario us me tho d s inco rp o rate co mp ute r and multime d ia te chno lo g y to aid in the Te rre ll & Danilo ff, 1996; Re inking & Rickman, 1990.
4-34
Text Revision Stud e nts are g ive n re vise d ve rsio ns o f te xt p assag e s. Variatio ns includ e sub stituting Britto n, Wo o d ward , & Binkle y, 1993; Me ye r, 1975.
e asy fo r d ifficult vo cab ulary wo rd s, ad d ing re d und ant info rmatio n to facilitate wo rd
le arning and co mp re he nsio n, and writing vo cab ulary wo rd s with co nte xt info rmatio n to
co nstrain vo cab ulary wo rd le arning .
Interactive Vocabulary Vario us te chniq ue s that allo w stud e nts to g e t active ly invo lve d in wo rd le arning . Duffe lme ye r, 1980; Re krut, 1993; Pre ssle y & Le vin,
Techniques Examp le s includ e stud e nts acting o ut wo rd me aning s, se lf-se le ctio n o f vo cab ulary 1988.
wo rd s to le arn, and allo wing stud e nts to co mp are strate g ie s and me tho d s.
Passage Integration Te ache rs sto p and p ro mp t the stud e nts to g e ne rate the me aning s o f the d ifficult Kame e nui, Carnine , & Fre schi, 1982.
Training vo cab ulary wo rd s imme d iate ly afte r the y e nco unte r the m d uring the p assag e re ad ing .
Concept Method Assists stud e nts in le arning wo rd s as co nce p ts rathe r than as d ictio nary d e finitio ns. Fraye r e t al., 1969; Klausme ie r, 1976,1979; Me rrill &
Base d o n a co nce p t-attainme nt mo d e l, this me tho d re lie s mo re he avily o n Te nnyso n, 1977.
d iscussio n than o n ind e p e nd e nt activitie s. Stud e nts stud y e xamp le s and
no ne xamp le s to id e ntify the critical attrib ute s o f e ach wo rd o r co nce p t.
Pre-Instruction of Stud e nts are taug ht o r e xp o se d to the d e finitio ns o f re le vant vo cab ulary wo rd s b e fo re Ko ury, 1996; Ryd e r & Grave s, 1994; Wixso n, 1986.
Vocabulary Words re ad ing the m in co nte xt. In ad d itio n to asse ssing e ffe cts o n vo cab ulary acq uisitio n,
this is o fte n re se arche d as a way to e nhance re ad ing co mp re he nsio n.
TABLE 1: A SUMMARY OF VOCABULARY INSTRUCTION METHODS (CONTINUED)
Association Methods Pairs unkno wn wo rd with familiar syno nym. Stud e nts must me mo rize the p airing s to Gip e & Arno ld 1979; McKe o wn, Be ck, Omanso n, &
re write the o rig inal p airs. Po p le , 1985.
TOAST Program Stud e nts are taug ht a me tho d o f vo cab ulary instructio n b y the acro nym o f TOAST that Dana & Ro d rig ue z, 1992.
p ro mp ts stud e nts to : te st, o rg anize wo rd s, ancho r wo rd s, and te st targ e t wo rd s.
Basic Mnemonic Trad itio nal me mo ry te chniq ue s, includ ing vo cab ulary d rills, flash card s, vo cab ulary McLaug hlin, 1997; Rinald i, Se lls, & McLaug hlin,
Techniques g ame s, no te b o o ks, re p e titio ns, and re call te sts. An e xamp le p ro g ram is the Re ad ing 1997.
Race track, which use s e rro r co rre ctio n, timing , and d rill and p ractice p ro ce d ure s to
he lp b uild sig ht wo rd acq uisitio n and re ad ing flue ncy.
Decoding Instruction To e nhance re ad ing fluid ity with the inte ntio n o f facilitating vo cab ulary co mp re he nsio n, Eld re d g e , Quinn, & Butte rfie ld , 1990.
instructio n is g ive n in me tho d s such as p ho no lo g ical training , p ho ne mic aware ne ss,
4-35
Appendices
Report
COMPREHENSION II
Text Comprehension Instruction
however, beginning readers were seldom taught on strategy instruction by Duffy and Roehler (1989);
cognitive strategies that could assist them in reading. Lysynchuk, Pressley, d’Ailly, Smith, and Cake (1989);
Durkin’s (1979) highly cited observational studies of Pressley, Johnson, Symons, McGoldrick, and Kurita
reading instruction in grade 4 showed that teachers, in (1989); Pressley (1998); Rosenshine and Meister
fact, spent little time on comprehension instruction. Only (1994); and Rosenshine, Meister, and Chapman (1996)
20 minutes of comprehension instruction was observed proved to be very helpful. As a result, an additional 28
in 4,469 minutes of reading instruction. This lack was studies not found initially in the electronic search were
echoed by Duffy, Lanier, and Roehler (1980). They added to the Panel’s review.
described teachers as spending time in assigning
activities, supervising and monitoring students as to Analysis
being on task, directing recitation sessions as a way of In order to be included in the NRP’s scientific review of
assessing what the students were doing, and providing the research literature on instruction of text
corrective feedback when the students erred. The comprehension, a study had to be:
teachers did not teach or show the students skills,
strategies, or processes that they could use in reading to 1. Relevant to instruction of reading or comprehension
comprehend what they read and to be successful in among normal readers. This criterion, in particular,
learning information in the text. excluded studies on comprehension instruction in
reasoning and mathematics problem solving
Research on instruction of comprehension strategies (Schoenfeld, 1985), physics (Larkin & Reif, 1976),
that could help students improve their reading and writing (Englert & Raphael, 1989; Scardamalia
comprehension began in the late 1970s and has thrived & Bereiter, 1985).
since. According to Rosenshine, Meister, and Chapman
(1996), the earliest uses of the term “comprehension 2. Published in a scientific journal. A few exceptions
monitoring” is found in Markman (1978, 1979), Gagne are dissertations and conference proceedings that
(1977), and Weinstein (1978). Researchers and were reviewed in two meta-analyses by
educators have long been interested in what we think Rosenshine and his colleagues (Rosenshine &
about thinking, in how our knowledge develops, and in Meister, 1994; Rosenshine, Meister, & Chapman,
how what we know about how our own thought 1996).
processes affect reading comprehension. The focus on 3. Have an experiment that involved at least one
what we know about cognition has led to the treatment and an appropriate control group or have
development of practical strategies for improving one or more quasi-experimental variables with
students’ comprehension. The cumulative result of variations that served as comparisons between
nearly 3 decades of research is that “there is ample treatments. The latter was rare.
extant research supporting the efficacy of cognitive
strategy training during reading as a means to enhance 4. In so far as could be determined, have the
students’ comprehension” (Baumann, 1992, p. 162). participants or classrooms randomly assigned to the
treatment and control groups or matched on initial
Methodology measures of reading comprehension. This criterion
was relaxed in a number of studies where random
Database assignment of classrooms was not carried out.
In order to conduct a scientific review of the research The application of these criteria reduced the number of
on comprehension instruction during the past 2 decades, studies to be reviewed from 481 to 205. The Panel then
the Panel located studies since 1980 by searching the coded and entered the coded contents of these studies
PsycINFO and ERIC databases electronically. The into a database to identify the types of comprehension
Panel used the terms comprehension, strategy, and instruction that were reported as effective. Because the
instruction. From this search, the Panel identified 453 studies numbered 205, the Panel first analyzed the
studies on comprehension. In addition, the Panel added abstracts of the studies, coding the kind of instruction,
other studies that were from the 1970s or otherwise not experimental treatments and controls (independent
revealed in the search. In this regard, reviews or studies
variables), grade and reading level of readers, instructor the same instructional category used widely varying
(teacher or experimenter), assessments (dependent sets of methodologies and implementations. Therefore,
variables), and kind of text. The Panel then classified the Panel found few research studies that met all the
and grouped studies based upon the kinds of instruction NRP criteria; however, to the extent possible, NRP
used. The Panel identified 16 distinct categories of criteria were employed in the analyses. An examination
instruction. Table 1, on the following page, summarizes of the quality of the research studies appears in the
the 16 categories of a total of 203 studies that met the Discussion section of this report. NRP criteria for
NRP criteria for inclusion as scientific studies on Evaluating Existing Reviews of Research were used in
comprehension instruction. It shows the type of the analyses of the two Rosenshine and colleagues
instruction used, the number of studies using that kind of meta-analyses.
instruction, a brief rationale as to why instruction was
used, and generally whether and how it was effective. Results
Each category of studies is summarized in Appendix A. Of the 16 categories of instruction, 7 appear to have a
The summaries define and describe the rationale for firm scientific basis for concluding that they improve
each kind of instructional strategy, the procedures used, comprehension in normal readers. The seven individual
and how the instruction is assessed by the researchers. strategies that appear to be effective and most
The Panel then evaluated the category of instruction, promising for classroom instruction are (in alphabetical
based on reported results. order) comprehension monitoring, cooperative learning,
graphic and semantic organizers including story maps,
In Appendix B, a table summarizes the 16 categories of question answering, question generation, and
instruction, describing the effects claimed by the summarization. In addition, many of these strategies
researchers, the grade levels that were studied, and have also been effectively used in the category
how the method might be taught in a classroom setting. “multiple strategy,” where readers and teachers interact
In order to draw scientific conclusions about a finding, over texts.
one needs evidence that an experimental effect is Mental Imagery and Mnemonic (Keyword)
reliable, robust, replicable, and general. Reliability of an Strategies have reliable effects on improving memory
effect is decided by differences that statistically favor a for text. These procedures may be useful when
treatment. Robustness of an effect is determined by the teachers wish to use an alternative way of having the
magnitude of effects over replications. Replication is reader try to understand and represent text. These
determined by independent validation of significant procedures are useful for recall of individual sentences
treatment effects. Generality is determined by the or paragraphs.
transfer measures. In this review, experimenter tasks
reflect near transfer and standardized tests reflect far Curriculum-Plus-Strategies, Psycholinguistic, and
transfer. The NRP evaluated how well each strategy Listening Actively studies were so few that an
met these criteria. The main criteria that the NRP used assessment of the scientific merit of a particular
are reliability, replication, and generality. Robustness treatment could not be made. The use of instructional
was not determined in most cases because effect sizes procedures that activate prior knowledge was found to
could not be calculated for almost all of the studies. be quite varied. The activation of prior knowledge may
Effect size data, however, were available from two be obtained through other means such as question
meta-analyses by Rosenshine and his colleagues elaboration, question generation, or question answering
(Rosenshine & Meister, 1994; Rosenshine et al., 1996). as well as other forms of content area exposure such as
teacher lectures, films, and discussion before reading.
Consistency With the Methodology
of the National Reading Panel Two categories on which there were few studies have,
in the view of the NRP, considerable promise for future
The methods of the NRP were followed in the conduct study. Only four studies were found on the Preparation
of the literature searches and the examination and of Teachers on comprehension instruction strategies.
coding of the articles obtained. A formal meta-analysis These studies are important because they represent a
was not possible because even the studies identified in culmination in the evolution of text comprehension
TABLE 1
CATEGORIES OF COMPREHENSION INSTRUCTION
Prior Knowledge 14 Readers may not have Readers improve memory and
relevant knowledge during comprehension for text.
reading.
TABLE 1
CATEGORIES OF COMPREHENSION INSTRUCTION (continued)
instruction during the past 2 decades. These studies also Comprehension Monitoring meets criteria of
represent essential investigations because in most of the reliability and replication for the specific learning of the
text comprehension strategy instruction reviewed, strategy (100% effectiveness in 14 studies across
strategies were taught by experimenters rather than grades 2 through 6). Although comprehension
classroom teachers. It is important to know whether monitoring is believed to be important as a part of a
strategies can be learned and used faithfully and multiple strategy method, the evidence for it alone
effectively by teachers in classroom contexts. These having a general effect is less compelling. Reliable
four studies are intensively reviewed as a part of the effects are reported on only three experimenter tasks
Comprehension report section on teacher preparation. (error detection, recall, question answering) with two
reported failures on 2nd graders. The number of studies
Success in instruction on the relation of vocabulary to
reporting the use of transfer tests is small (four on
comprehension has been found in only two studies with
reliable experimenter effects and five on reliable
8th graders. This is an important kind of instruction that
standardized tests). The method does not seem to
needs to be investigated on a wider range of grade
generalize for 2nd graders. Nevertheless, it may be a
levels. The Panel would like to know what the
useful addition to a program of instruction that employs
relationship is between word learning and
flexibility and the teaching of multiple comprehension
comprehension. The review on vocabulary in
strategies.
Comprehension I (Vocabulary Instruction) shows that
vocabulary can be successfully taught over a wide
range of grades.
Summarization has a large number of studies (18) that There were 11 other multiple strategy studies on
replicate treatment effects, mainly at grades 5 and 6. readers in grades 2 through 11, with grade 4 as the
Summarization presupposes writing as well as reading modal grade. The strategies taught varied across these
skill, hence its late study. The effects are largely studies. In 6 of the 12 studies, students were taught
specific to improving the writing of summaries, but summarizing or identification of main ideas. Three
there are 11 studies that show transfer effects on recall studies used question answering or generation, two used
of what was summarized and on question answering. monitoring, and others used cooperative reading, recall,
Standardized tests as general transfer were used rarely retelling, hypothesis testing, story structure, and
(only two studies). Instruction of summarization psycholinguistic training (word, phrase, and sentence
succeeds in that readers improve on the quality of their classification, morphological analysis). There was
summaries of text, mainly identifying the main idea but evidence for specific learning and near transfer. No
also in leaving out detail, including ideas related to the studies reported the use of standardized tests.
main idea, generalizing, and removing redundancy. This
Taken together, the evidence supports the use of
indicates that summarizing is a good method of
combinations of reading strategies in natural learning
integrating ideas and generalizing from the text
situations. These findings build on the empirical
information. Furthermore, the instruction of
validation of strategies alone and attest to their use in
summarization improves memory for what is read, both
the classroom context. A common aspect of individual
in terms of free recall and answering questions. This
and multiple strategy instruction is the active
strategy instruction is used as a part of treatments that
involvement of motivated readers who read more text
teach multiple strategies.
as a result of the instruction. These motivational and
Multiple Strategy Instruction represents an reading practice effects may be important to the
evolution in the field from the study of individual success of multiple strategy instruction. Furthermore,
strategies to their flexible and multiple use. This method multiple strategy instruction that is flexible as to which
finds considerable scientific support for its effectiveness strategies are used and when they are taught over the
as a treatment, and it is the most promising for use in course of a reading session provides a natural basis on
classroom instruction where teachers and readers which teachers and readers can interact over texts.
interact over texts. The NRP reviewed 11 studies not
covered by the meta-analysis of Rosenshine and Discussion
Meister (1994), who reviewed 16 reciprocal teaching In the preceding section, the Panel summarized the
studies on readers in grades 3 through 7. research claims and implications for instruction of
One of the main methods is to have the teacher model comprehension. In this section, the kinds of claims being
an approach by showing how she or he would try to made are illustrated by three quotations:
understand the text, using two or more combinations of “The best way to pursue meaning is through
four strategies: question generation, summarization, conscious, controlled use of strategies” (Duffy,
clarification, and prediction of what might occur. 1993, p. 223).
Rosenshine and Meister found strong evidence that the
reciprocal teaching treatment showed near transfer. “Becoming an effective transactional strategies
Experiment tests in ten studies had an average effect instruction teacher takes several years” (Brown et
size of 0.88. There was also support for general al., 1996, p. 20).
transfer in nine studies where the average effect size
“The data suggests that students at all skill levels
was 0.32. All readers show more near transfer benefit
would benefit from being taught these strategies”
in these treatments, whereas only the better readers
(Rosenshine, Meister, & Chapman, 1996, p. 201).
show significant effect sizes in the 0.32 range. These
data suggest that good readers benefit and generalize The past 2 decades of research appear to support the
what they learn as strategies more than poor or below- enthusiastic advocacy of instruction of reading
average readers. Furthermore, the significant effect strategies expressed in the above quotations. The
sizes do not occur for grade 3, are mixed for grades 4 Panel’s review of the literature indicates that there has
through 6, and do occur for grades 7 and 8. been an extensive effort to identify reading
In some classrooms . . . we observed explicit summarize what they read, identify confusing
comprehension instruction only rarely, despite a points in a text, construct questions pertaining
great deal of research in the past two decades to a text, or predict what might be next in a
on how to promote children’s comprehension of text. That is, they were asked to respond to
what they read . . . Indeed, the situation seemed questions constructed around the cognitive
to be much as Durkin (1979) described it two processes involved in skilled comprehension
decades ago, with a great deal of testing of (i.e., summarizing, monitoring confusion, self-
comprehension but very little teaching of it questioning, predicting based on prior
(Pressley, 1998, p. 198). knowledge). However, there was little
evidence that students were being taught to
Durkin (1981) observed that when comprehension skill
self-regulate comprehension processes as they
instruction is present, in many classrooms teachers
read, and in some classrooms, there was no
appear to be “mentioning” a skill to students and
evidence that they were being taught the
“assigning” it to them rather than employing the
active comprehension process validated in the
effective instruction modeling and transactional
last two decades. In general, students were
practices that research supports (Durkin, 1981; Reutzel
provided with opportunities to practice
& Cotter, 1988). In the United States, reading from
comprehension strategies, but were not
basal reading series accounts for 75% to 90% of
actually taught the strategies themselves nor
classroom reading instruction time (Franklin et al.,
the utility value of applying them. (Pressley,
1992). Although some basal teachers’ manuals do
1998, p. 198).
provide more evaluative comprehension skill lessons,
these lessons are usually not instructional and offer little Deshler and Schumaker (1988) have taught learning
structure and rationale for helping teachers give disabled students how to comprehend, write, and
effective skill instruction (Reutzel & Cotter, 1988). remember in a learning disabilities curriculum. They
emphasize the role of controllable factors, such as the
In a 5-year study of how teachers help low-achieving
use of strategies. One problem they encountered is that
students become strategic readers, using monthly
learning disabled students make attributions that render
inservice strategy preparation sessions, biweekly
them dysfunctional (e.g., “I am stupid.”). These kinds
individual teacher coaching with a strategy expert staff
of attributions can defeat what might otherwise be
developer, and collaborative discussion of principals’
effective comprehension instruction. Alternatively,
and teachers’ experiences in individual schools, Duffy
effective comprehension instruction might lead learning
(1993) suggests that effective reading instruction is
disabled students to make more positive, functional
associated more with independent teacher action than
attributions.
with implementation of basal text prescriptions. He
argues that developing metacognitive readers who When conscientious, diligent, and highly professional
understand their reasoning requires teachers who teachers apply their strategy instruction in the
themselves understand their reasoning, as well as a classroom, even when applied imperfectly, their
supportive environment in the schools for strategy students do improve in reading comprehension
learning. Pressley’s (1998) recent observations suggest (Bramlett, 1994; Duffy, 1993; Pressley, Johnson,
that too little has changed in the classroom since Symons, McGoldrick, & Kurita, 1989). However, close
Durkin’s 1978–1979 school year observations: observation of inservice trained strategy teachers
suggests that:
A twist on this [1995–1996 school year]
situation, however, was that the Progress was not easily accomplished. It was
comprehension tasks now being given to a struggle. For much of the academic year, the
students did seem to be informed by the four [strategic] teachers [in the study] required
comprehension process research of the past from their students counterproductive
two decades. It was not uncommon, for ‘answers’ and ‘routes’—that is, answers and
example, for students to be asked to respond thinking that led students to construct
to short-answer questions requiring them to inaccurate conceptions [of strategies].
Although by May it appeared that [their grade What is the scientific basis for claims
2 poor reading] students were developing an made about instruction of
integrated concept of what it means to be comprehension?
strategic, students’ responses to interview
probes during fall and winter suggested The Panel now begins a more critical analysis of the
incomplete conceptions or misconceptions literature on instruction of comprehension. First, the
about what it means to be strategic (Duffy, quality of the studies is discussed. Second, scientific
1993, p. 237). criteria are applied and the Panel’s prior evaluations to
arrive at an overall set of conclusions are discussed.
In spite of heavy emphasis on modeling and
metacognitive instruction, even very good teachers may Quality of Studies: An Overlooked Issue
have trouble implementing, and may even omit, crucial
In half the studies reviewed by Rosenshine and Meister
aspects of strategic reasoning. The research suggests
(1994), experimenters failed to address the quality of
that, when partially implemented, students of strategy
instruction in the intervention study. There are several
teachers will still improve. But it is not easy for
papers, however, that have raised questions about the
teachers or readers to develop readers’ conceptions
quality issues of reading research: Almasi, Palmer,
about what it means to be strategic. It takes time and
Gambrell, and Pressley (1994); Lysynchuk, Pressley,
ongoing monitoring of success to evolve readers into
d’Ailly, Smith, and Cake (1989); Pressley et al. (1989);
becoming good strategy users.
Rosenshine et al. (1996); Rosenshine and Meister
Helping teachers [become good strategy (1994); and Troia (1999). Of these, Lysynchuk et al.
teachers] will require a significant change in (1989) evaluated the methodological adequacy of 37
how teacher educators and staff developers studies of reading comprehension instruction. Several
work with teachers and what they count as problems were identified. Of particular importance
important about learning to be a teacher. were (1) failure to randomly assign students to
Current practices that require teachers to treatments and control conditions, (2) failure to expose
successfully complete university course work, experimental and control participants to the same
to attend mandated half-day in-service training materials, (3) failure to provide information
programs, or to be ‘trained’ in the ‘right way’ about the amount of time spent on dependent variable
to teach and then [be] held accountable for tasks, (4) failure to study fidelity of treatment by not
that encourage teachers, like the children . . . including analysis of teacher and reader performance
to learn only the labels of professional during instruction, (5) use of inappropriate units
knowledge without learning how to be (individual, group, classroom) in analyses, and (6) failure
strategic themselves. Such practices must be to assess either long-term effects or generalization of
replaced by teacher education/staff the strategies to other tasks and materials.
development experiences that account for (1)
Lysynchuk et al. (1989) applied 24 criteria of internal
the complexity involved in teaching [students]
validity (classified in four categories as to general
to be strategic and for (2) the creative
design, possible confounds, measurement, and statistics)
adaptations teachers must make as they deal
and five criteria of external validity (theory, sample,
with that complexity (Duffy, 1993, p. 244-245).
reading ability, text properties, measures of transfer).
Strategic reading requires strategic teaching, which The range of percentages of studies that met internal
involves putting teachers in positions where their minds validity criteria was from 17 to 100, median = 78%. For
are the most valued educational resource (Duffy, 1993). external validity, the range was from 8% to 100%,
Skilled reading is constructive reading, and the activities median = 82.5 percent. Although most studies specified
of the reader matter (Pressley, Harris, & Marks, 1992; the experimental and control groups and the
Pressley & Afflerbach, 1995). independent and dependent variables in their general
design presentations, only 64% randomly assigned
participants or classes to the experimental and control
conditions, compromising cause-and-effect conclusions.
With respect to confounds, in 75% of the studies, can be reduced by motivating controls to believe that
control subjects were lead to believe that they were in they are receiving the same benefits and treatment as
an experimental condition; therefore, 25% were not, experimental participants. Often the tasks themselves
allowing for possible Hawthorne effects. In nearly one- motivate experimental and controls differently,
third of the studies, there were possible confounds of confounding motivation with the variable of study.
differences in training materials between the Similarly, Hawthorne effects on teachers can occur if
experimental and control groups with the experimental they believe that the experimental group will benefit
groups given more materials to read. However, in these more than controls. One way to deal with this problem
studies they were, with one exception, exposed to is to assign the teacher to both groups but with the
materials for the same amount of time. belief that either treatment would benefit the
participants.
In other studies, time on task was confounded with
condition. Experimental groups may have been allowed Future studies should include fidelity to treatment
more time to read than control groups. Only 10 of 37 measures of the preparation of teachers, of the
studies reported the amount of time, and 8 of 10 of teachers’ teaching the strategies as intended, and of the
these were the same. However, these studies did not students’ performance during training. There is a need
analyze what students did during the time assigned; to observe, document, and analyze all components of
therefore, it is unknown whether they used the time to the experiment, from training to implementation to
read. In addition, there were possible experimenter-by- learning to assessment. The amount of time on each
condition or teacher-by-condition confounds in some task should be recorded and reported as well as
studies because neither the experimenters nor the examined in relation to outcome measures. Floor and
teachers were randomly assigned to groups. ceiling effects on measures should be avoided. The unit
Measurement problems involved not measuring of analysis should be the same as the unit of treatment.
reliability (37% of the studies), floor and ceiling effects All these steps would improve the design and internal
(33% of the studies), and failure to assess fidelity of validity of studies on reading strategy instruction.
treatment through checks on manipulation (only 37% External validity could be improved by the inclusion and
did so for teachers, and 27% measured ongoing measurement of training and transfer of training to
processes). On statistical practices, the most serious other measures, particularly performance in content
flaw was in the use of appropriate units—if one assigns areas. Text, as a variable, has been sorely neglected.
groups to conditions and then conducts analyses on The external validity of a study could also be improved
individuals, the unit of analysis differs from the unit of by the kind of texts used (both expository and narrative
treatment. Errors then cannot be assumed to be and sampled from content areas), an analysis of text
independent. With respect to external validity, most difficulty, the content and structure of the text, the
studies met theory and reporting of sample criteria. vocabulary and sentence complexity of the text,
Other problems involved omission of data on reading appropriateness of the level of text difficulty to the
level (16%), failures to measure transfer or delayed ability of readers, and possible interactions between
effects (76%), and failures to measure transfer to difficulty of the text and ability of reader. Long-term
school subjects (92%). benefits could be assessed through followup studies
later so that the effects are not just short term.
Future studies would benefit from attention to quality
criteria for internal and external validity. In particular, In the section of this Text Comprehension report on
researchers should conduct reliability assessments of quality of studies, the Panel describes a set of criteria
their scoring of data when raters are used; should use for internal and external validity that should be used to
random assignment of experimenters, teachers, plan, conduct, and report research in individual studies
classrooms, or students where possible; or should at but also that can be applied in evaluation of single and
least collect data on comparability of instructors and on multiple studies and reviews of studies. That section
participant characteristics in the treatment and control includes several criteria for internal and external
conditions. Researchers should try to meet quasi- validity. These criteria incorporate, elaborate, extend,
experimental criteria if random assignment is not and adapt to the reading situation the 24 categories of
possible (Cook & Cambell, 1979). Hawthorne effects the Lysynchuk et al. (1989) review.
Scientific Evaluation of the Claims In Figure 1, grades 3 through 6 constitute 76% of the
Made in the Literature grade levels studied. The modal grade is 4 with the next
highest percentages occurring with grades 3 and 5.
The empirical evidence reviewed favors the conclusion Thus, instruction of comprehension begins mainly at the
that teaching of a variety of reading comprehension 3rd grade and continues through the 6th grade. In
strategies leads to increased learning of the strategies, examining the studies, the Panel found that the lower
to specific transfer of learning, to increased memory three grades (K through 2) were studied primarily as a
and understanding of new passages, and, in some cases, part of an experimental curriculum. The higher grades
to general improvements in comprehension. In (above grade level 6) tend to focus on less able readers.
particular, individual strategies that can be used in The increase in percentage at grade level 3 suggests
natural reading or content area instruction and through that researchers taught readers who had achieved
interaction with the teacher over a text appear to have decoding and other basic reading skills before they
a strong scientific support for their effectiveness and were taught strategies.
for their inclusion in classroom programs on
comprehension instruction. To determine the effectiveness of instruction and
whether it was related to grade level, the Panel found
The NRP now integrates its evaluations of the the percentage of reported significant findings where
instruction strategies that have the best scientific basis the experimental treatment was favored over the
for effectiveness and use by teachers in the classroom. control group. The overall average percentages of
The Panel first considers the grade level success, as measured by experimenter tasks or by
appropriateness and general effectiveness, then the standardized tests, were 97 and 93%, respectively. The
evidence of reliability, robustness, replication, and high overall rates of success are not surprising because
transfer for a set of particular strategies in support of these data are based upon published studies. For grades
the general conclusion above. K through 1 and 7 through 11, the reported percentage
On what grade levels has text comprehension of success was 100 on experimenter tasks and
instruction been effectively studied? Figure 1 shows the standardized tests; for grades 2 through 6, the average
distribution of the grade levels at which investigations of was 92%. For standardized tests, the average success
instruction in comprehension have been successfully was 89% for grades 2 through 6. There was no
carried out. relationship between grade level and the respective
percentages of success in treatment.
Figure 1. Distribution of Grade Levels Across
These data indicate that instruction is likely to be more
All Studies of Direction
successful when measured on experimenter designed
tasks than on standardized tests of comprehension. The
instruction of comprehension appears to be effective on
grades 3 through 6.
With respect to the scientific basis of the instruction of
text comprehension, the NRP concludes that
comprehension instruction can effectively motivate and
teach normal readers to learn and to use comprehension
strategies that benefit them.
These comprehension strategies yield increases in
measures of near transfer such as recall, question
answering and generation, and summarization of texts.
Furthermore, when used in combination, these
6. Prior studies suffer when the quality of the studies (c) Failure to study fidelity of treatment, by failing
is assessed (Lysynchuk et al., 1989) according to to analyze teacher and reader performance
criteria of internal and external validity. These during instruction
issues need to be considered when designing future (d) Use of inappropriate units (individual, group,
research. The main problems were: classroom) in analyses
(a) Failure to randomly assign students to (e) Failure to assess either long-term effects or
treatments and control conditions and failure to generalization of the strategies to other tasks
expose experimental and control participants to and materials.
the same training materials
(b) Failure to provide information about the amount
of time spent on dependent variable tasks
Durkin, D. (1981). Reading comprehension Lysynchuk, L. M., Pressley, M., D’Ailly, H., Smith,
instruction in five basal reading series. Reading M., & Cake, H. (1989). A methodological analysis of
Research Quarterly, 14, 481-533. experimental studies of comprehension strategy
instruction. Reading Research Quarterly, 24(4), 458-
Durkin, D. (1993). Teaching them to read (6th ed.). 470.
Boston, MA: Allyn & Bacon.
Lysynchuk, L. M., Pressley, M., & Vye, N. J.
Englert, C. S., & Raphael, T. (1989). Constructing (1990). Reciprocal teaching improves standardized
well-formed prose: Process, structure, and reading-comprehension performance in poor
metacognitive knowledge. Exceptional Children, 54, comprehenders. Elementary School Journal, 90(5), 469-
513-520. 484.
Franklin, M. R., Roach, P. B., & Clary, E., Jr. Markman, E. M. (1978). Realizing that you don’t
(1992). Overcoming the reading comprehension barriers understand: A preliminary investigation. Child
of expository texts. Educational Research Quarterly, 16 Development, 48, 986-992.
(1), 5-14.
Markman, E. M. (1979). Realizing that you don’t
Gagne, R. M. (1977). The conditions of learning understand: Elementary school children’s awareness of
(3rd ed). New York: CBS College Publishing. inconsistencies. Child Development, 50(3), 643-655.
Grant, J., Elias, G., & Broerse, J. (1989). An Markman, E. M. (1981). Comprehension
application of Palinscar and Brown’s comprehension monitoring. In W. P. Dickson (Ed.), Children’s oral
instruction paradigm to listening. Contemporary communication skills (pp. 61-84). New York: Academic
Educational Psychology, 14(2), 164-172. Press.
Harris, T. L., & Hodges, R. E. (Eds.) (1995). The Mier, M. (1984). Comprehension monitoring in the
literacy dictionary: The vocabulary of reading and elementary classroom. Reading Teacher, 37(8), 770-
writing. Newark, DE: International Reading 774.
Association.
Palinscar, A. S., & Brown, A.L. (1984). Reciprocal
Kintsch, W., & van Dijk, T. A. (1978). Toward a teaching of comprehension-fostering and
model of discourse comprehension and production. comprehension-monitoring activities. Cognition and
Psychological Review, 83, 363-394. Instruction, 2, 117-175.
Klingner, J. K., Vaughn, S., & Schumm, J. S. Paris, S. G., Saarnio, D. A., & Cross, D. R. (1986).
(1998). Collaborative strategic reading during social A metacognitive curriculum to promote children’s
studies in heterogeneous fourth-grade classrooms. reading and learning. Australian Journal of Psychology,
Elementary School Journal, 99(1), 3-22. 38(2), 107-123.
Larkin, J. H., & Reif, F. (1976). Analysis and Peters, E. E., & Levin, J. R. (1986). Effects of a
teaching of a general skill for studying scientific text. mnemonic imagery strategy on good and poor readers’
Journal of Educational Psychology, 72, 348-350. prose recall. Reading Research Quarterly, 21, 179-192.
Levin, J. R., & Divine-Hawkins, P. (1974). Visual Pressley, M. (1998). Reading instruction that works:
imagery as a prose-learning process. Journal of The case for balanced teaching. NY: The Guilford
Reading Behavior, 6, 23-30. Press.
Pressley, M., Almasi, J., Schuder, T., Bergman, J., Scardamalia. M., & Beretier, C. (1985). Fostering
& Kurita, J. A. (1994). Transactional instruction of the development of self-regulation in children’s
comprehension strategies: The Montgomery County, knowledge processing. In S. F. Chipman, J.W. Segal, &
Maryland, SAIL program. Reading & Writing R. Glaser (Eds.). Thinking and learning skills: Vol. 2.
Quarterly: Overcoming Learning Difficulties, 10(1), 5- Research and open questions (pp. 563-577). Hillsdale,
19. NJ: Erlbaum.
Pressley, M., Gaskins, I. W., Wile, D., Cunicelli, E. Scardamalia, M., & Bereiter, C. (1986). Research
A., & Sheridan, J. (1991). Teaching literacy strategies on written composition. In M. C. Wittrock (Ed.),
across the curriculum: A case study at Benchmark Handbook of research on teaching, 3rd ed. (pp. 778-
School. In J. Zutell & S. McCormick (Eds.). Learner 803). NY: MacMillan.
factors/teacher factors: Issues in liberacy research and
instruction: Fortieth Yearbook of the National Reading Schoenfeld, A. (1985). Mathematical problem
Conference (pp. 219-228). Chicago, IL: The National solving. NY: Academic Press.
Reading Conference.
Spires, H. A., Gallini, J., & Riggsbee, J. (1992).
Pressley, M., Harris, K. R., & Marks, M. B. Effects of schema-based and text structure-based cues
(1992). But good strategy instructors are on expository prose comprehension in fourth graders.
constructivists! Educational Psychology Review, 4(1), Journal of Experimental Education, 60(4), 307-320.
3-31.
Stevens, R. J., Slavin, R. E., & Farnish, A. M.
Pressley, M., Johnson, C. J., Symons, S., (1991). The effects of cooperative learning and
McGoldrick, J. A., & Kurita, J. A.. (1989). Strategies instruction in reading comprehension strategies on main
that improve children’s memory and comprehension of idea identification. Journal of Educational Psychology,
text. Elementary School Journal, 90(1), 3-32. 83(1), 8-16.
Pressley, M., & McCormick, C. B. (1995). Taylor, B. M. (1982). Text structure and children’s
Cognition, teaching, and assessment. NY: Harper comprehension and memory for expository material.
Collins College Publishers. Journal of Educational Psychology, 74(3), 323-340.
Rinehart, S. D., Stahl, S. A., & Erickson, L. G. Troia, G. A. (1999) Phonological awareness
(1986). Some effects of summarization training on intervention research: A critical review of the
reading and studying. Reading Research Quarterly, experimental methodology. Reading Research
21(4), 422-438. Quarterly, 34, 28-52.
Rosenshine, B., & Meister, C. (1994). Reciprocal Vygotsky, L. S. (1978). Mind in society: The
teaching; A review of the research. Review of development of higher psychological processes In M.
Educational Research, 64(4), 479-530. Cole, V. John-Steiner, S. Scriber, and E. Souberman
(Eds. & Trans.). Cambridge, MA: Harvard University
Rosenshine, B., Meister, C., & Chapman, S. (1996). Press.
Teaching students to generate questions: A review of
the intervention studies. Review of Educational Weinstein, C. E. (1978). Teaching cognitive
Research, 66( 2), 181-221. elaboration learning strategies. In H. F. O’Neal (Ed.).
Learning strategies. NY: Academic Press.
Reutzel, D. R., & Cooter, R. B., Jr. (1988).
Research implications for improving basal skill Whipple, G. (Ed.). (1925). The Twenty-fourth
instruction. Reading Horizons, 28(3), 208-215. Yearbook of the National Society for the Study of
Education: Report of the National Committee on
Reading. Bloomington, IL: Public School Publishing
Company.
Wright, B., & Stenner, A. J. (1998). Readability and Markman, E. M. (1977). Realizing that you don’t
reading ability. Paper presented at the Australian understand: A preliminary investigation. Child
Council on Education Research (ACER), Melbourne, Development, 46, 986-992.
Australia.
Miller, G. E. (1985). The effects of general and
Category References specific self-instruction training on children’s
comprehension monitoring performances during
Comprehension Monitoring reading. Reading Research Quarterly, 20(5), 616-628.
Babbs, P. J. (1984). Monitoring cards help improve
Miller, G. E. (1987). The influence of self-
comprehension. Reading Teacher, 38(2), 200-204.
instruction on the comprehension monitoring
performance of average and above average readers.
Baker, L., & Zimlin, L. (1989). Instructional effects
Journal of Reading Behavior, 19(3), 303-317.
on children’s use of two levels of standards for
evaluating their comprehension. Journal of Educational
Miller, G. E., Giovenco, A., & Rentiers, K. A.
Psychology, 81(3), 340-346.
(1987). Fostering comprehension monitoring in below
average readers through self-instruction training.
Baumann, J. F., Seifert-Kessell, N., & Jones, L. A.
Journal of Reading Behavior, 19(4), 379-394.
(1992). Effect of think-aloud instruction on elementary
students’ comprehension monitoring abilities. Journal of
Nelson, C. S., et al. (1996). The effect of teacher
Reading Behavior, 24(2), 143-172.
scaffolding and student comprehension monitoring on a
multimedia/interactive videodisc science lesson for
Block, C. C. (1993). Strategy instruction in a
second graders. Journal of Educational Multimedia and
literature-based reading program. Special issue:
Hypermedia, 5(3-4), 317-348.
Strategies instruction. Elementary School Journal, 94(2),
139-151.
Paris, S. G., Cross, D. R., & Lipson, M. Y. (1984).
Informed strategies for learning: A program to improve
Carr, E. M., Dewitz, P., & Patberg, J. P. (1983).
children’s reading awareness and comprehension.
The effect of inference training on children’s
Journal of Educational Psychology, 76(6), 1239-1252.
comprehension of expository text. Journal of Reading
Behavior, 15(3), 1-18.
Paris, S. G., & Jacobs, J. E. (1984). The benefits of
informed instruction for children’s reading awareness
Cross, D. R., & Paris, S. G. (1988). Developmental
and comprehension skills. Child Development, 55(6),
and instructional analyses of children’s metacognition
2083-2093.
and reading comprehension. Journal of Educational
Psychology, 80(2), 131-142.
Paris, S. G., Saarnio, D. A., & Cross, D. R. (1986).
A metacognitive curriculum to promote children’s
Elliot-Faust, D. J., & Pressley, M. (1986). How to
reading and learning. Australian Journal of Psychology,
teach comparison processing to increase children’s
38(2), 107-123.
short- and long-term listening comprehension
monitoring. Journal of Educational Psychology, 78, 27-
Payne, B. D., & Manning, B. H. (1992). Basal
33.
reader instruction: Effects of comprehension monitoring
training on reading comprehension, strategy use and
Hasselhorn, M., & Koerkel, J. (1986).
attitude. Reading Research and Instruction, 32(1), 29-
Metacognitive versus traditional reading instructions:
38.
The mediating role of domain-specific knowledge on
children’s text-processing. Human Learning: Journal of
Schmitt, M. C. (1988). The effects of an elaborated
Practical Research and Applications, 5(2), 75-90.
directed reading activity on the metacomprehension
skills of third graders. National Reading Conference
Yearbook, 37, 167-181.
Schunk, D. H., & Rice, J. M. (1984). Strategy self- Soriano, M., Vidal-Abarca, E., & Miranda, A.
verbalization during remedial listening comprehension (1996). Comparación de dos procedimentos de
instruction. Journal of Experimental Education, 53(1), instrucción en comprensión y aprendizaje de textos:
49-54. Instrucción directa y enseñanza recíproca. [Comparison
of two procedures for instruction in comprehension and
Schunk, D. H., & Rice, J. M. (1985). Verbalization text learning: Direct instruction and reciprocal
of comprehension strategies: Effects on children’s teaching.] Infancia y Aprendizaje, 74, 57-65.
achievement outcomes. Human Learning: Journal of
Practical Research and Applications, 4(1), 1-10. Stevens, R. J., Madden, N. A., Slavin, R. E., &
Farnish, A. M. (1987). Cooperative integrated reading
Silven, M. (1992). The role of metacognition in and composition: Two field experiments. Reading
reading instruction. Scandinavian Journal of Educational Research Quarterly, 22(4), 433-454.
Research, 36(3), 211-221.
Stevens, R. J., Slavin, R. E., & Farnish, A. M.
Tregaskes, M. R., & Daines, D. (1989). Effects of (1991). The effects of cooperative learning and
metacognitive strategies on reading comprehension. instruction in reading comprehension strategies on main
Reading Research and Instruction, 29(1), 52-60. idea identification. Journal of Educational Psychology,
83(1), 8-16.
Cooperative Learning References
Uttero, D. A. (1988). Activating comprehension
Bramlett, R. K. (1994). Implementing cooperative
through cooperative learning. Reading Teacher, 41(4),
learning: A field study evaluating issues for school-
390-395.
based consultants. Journal of School Psychology, 32(1),
67-84.
Curriculum Plus Strategies References
Guthrie, J. T., et al. (1996). Growth of literacy Au, K. H., & Scheu, J. A. (1989). Guiding students
engagement: Changes in motivations and strategies to interpret a novel. Reading Teacher, 43(2), 104-110.
during concept-oriented reading instruction. Reading
Research Quarterly, 31(3), 306-332. Bramlett, R. K. (1994). Implementing cooperative
learning: A field study evaluating issues for school-
Judy, J. E., Alexander, P. A., Kulikowich, J. M., & based consultants. Journal of School Psychology, 32(1),
Wilson, V. L. (1988). Effects of two instructional 67-84.
approaches and peer tutoring on gifted and non gifted
sixth-grade students’ analogy performance. Reading Feathers, K. M., & Smith, F. R. (1987). Meeting
Research Quarterly, 23(2), 236-256. the reading demands of the real world: Literacy based
content instruction. Journal of Reading, 30(6), 506-511.
Klingner, J. K., Vaughn, S., & Schumm, J. S.
(1998). Collaborative strategic reading during social Freeman, R. H., & Freeman, G. G. (1987). Reading
studies in heterogeneous fourth-grade classrooms. acquisition: A comparison of four approaches to reading
Elementary School Journal, 99(1), 3-22. instruction. Reading Psychology, 8(4), 257-272.
Mathes, P. G., et al. (1994). Increasing strategic Noyce, R. M., & Christie, J. F. (1983). Effects of
reading practice with Peabody classwide peer tutoring. an integrated approach to grammar instruction on third
Learning Disabilities Research and Practice, 9(1), 44- graders’ reading and writing. Elementary School
48. Journal, 84(1), 63-69.
Romance, N. R., & Vitale, M. R. (1992). A curriculum Gordon, C. J., & Rennie, B. J. (1987).
strategy that expands time for in-depth elementary Restructuring content schemata: An intervention study.
science instruction by using science-based reading Reading Research and Instruction, 26(3), 162-188.
strategies: Effects of a year-long study in grade four.
Journal of Research in Science Teaching, 29(6), 545- Simmons, D. C., et al. (1988). Effects of teacher-
554. constructed pre- and post-graphic organizaer instruction
on sixth-grade science students’ comprehension and
Tharp, R. G. (1982). The effective instruction of recall. Journal of Educational Research, 82(1), 15-21.
comprehension: Results and description of the
Kamehameha early education program. Reading Sinatra, R. C., Stahl-Gemake, J., & Berg, D. N.
Research Quarterly, 17(4), 503-527. (1984). Improving reading comprehension of disabled
Webster, R. E., & Braswell, L. A. (1991). Curriculum readers through semantic mapping. Reading Teacher,
bias and reading achievement test performance. 38(1), 22-29.
Psychology in the Schools, 28(3), 193-199.
Vidal-Abarca, E., & Gilabert, R. (1995). Teaching
Graphic Organizer References strategies to create visual representations of key ideas
in content area text materials: A long-term intervention
Alvermann, D. E., & Boothby, P. R. (1983). A
inserted in school curriculum. Special Issue: Process-
preliminary investigation of the differences in children’s
oriented instruction: Improving student learning.
retention of “inconsiderate” text. Reading Psychology,
European Journal of Psychology of Education, 10(4),
4(3-4), 237-246.
433-447.
Alvermann, D. E., & Boothby, P. R. (1986).
Children’s transfer of graphic organizer instruction. Listening References
Reading Psychology, 7(2), 87-100.
Boodt, G. M. (1984). Critical listeners become
critical readers in remedial reading class. Reading
Armbruster, B. B., Anderson, T. H., & Meyer, J. L.
Teacher, 37(4), 390-394.
(1991). Improving content-area reading using
instructional graphics. Reading Research Quarterly,
Shany, M. T., & Biemiller, A. (1995). Assisted
26(4), 393-416.
reading practice: Effects on performance for poor
readers in grades 3 and 4. Reading Research Quarterly,
Armbruster, B. B., Anderson, T. H., & Meyer, J. L.
30(3), 382-395.
(1992). “Improving content-area reading using
instructional graphics”: Erratum. Reading Research
Shepherd, T. R., & Svasti, S. (1987). Improved
Quarterly, 27(3), 282.
sixth grade social studies test scores via instruction in
listening. Journal of Social Studies Research, 11(2), 20-
Baumann, J. F. (1984). The effectiveness of an
23.
instruction paradigm for teaching main idea
comprehension. Reading Research Quarterly, 20(1), 93-
Sippola, A. E. (1988). The effects of three reading
115.
instruction techniques on the comprehension and word
recognition of first graders grouped by ability. Reading
Berkowitz, S. J. (1986). Effects of instruction in
Psychology, 9(1), 17-32.
text organization on sixth-grade students’ memory for
expository reading. Reading Research Quarterly, 21(2),
Mental Imagery References
161-178.
Borduin, B. J., Borduin, C. M., & Manley, C. M.
Darch, C. B., Carnine, D. W., & Kameenui, E. J. (1994). The use of imagery training to improve reading
(1986). The role of graphic organizers and social comprehension of second graders. Journal of Genetic
structure in content area instruction. Journal of Reading Psychology, 155(1), 115-118.
Behavior, 18(4), 275-295.
Gambrell, L. B., & Bales, R. J. (1986). Mental Dermody, M. (1988). Metacognitive strategies for
imagery and the comprehension-monitoring development of reading comprehension for younger
performance of fourth and fifth-grade poor readers. children. Paper presented at the American Association
Reading Research Quarterly, 21, 454-464. of Colleges for Teacher Education, New Orleans, LA.
Peters, E. E., & Levin, J. R. (1986). Effects of a Fischer Galbert, J. L. (1989). An experimental
mnemonic imagery strategy on good and poor readers’ study of reciprocal teaching of expository test with
prose recall. Reading Research Quarterly, 21, 179-192. third, fourth, and fifth grade students enrolled in chapter
1 reading. Unpublished doctoral dissertation, Ball State
Pressley, G. M. (1976). Mental imagery helps eight- University, Muncie, IN.
year olds remember what they read. Journal of
Educational Psychology, 68, 355-359. Jones, M. P. (1987). Effects of reciprocal teaching
method on third graders’ decoding and comprehension
Shriberg, L. K., Levin, J. R., McCormick, C. B., & abilities. Unpublished doctoral dissertation, Texas A&M
Pressley, M. (1982). Learning about “famous” people University.
via the keyword method. Journal of Educational
Psychology, 74, 238-247. Labercane, G., & Battle, J. (1987). Cognitive
processing strategies, self-esteem, and reading
Mnemonics References comprehension of learning disabled students. Journal of
Special Education, 11, 167-185.
Levin, J. R., Levin, M. E., Glasman, L. D., &
Nordwall, M. B. (1992). Mnemonic vocabulary Lysynchuk, L. M., Pressley, M., & Vye, N. J.
instruction: Additional effectiveness evidence. (1990). Reciprocal teaching improves standardized
Contemporary Educational Psychology, 17(2), 156-174. reading-comprehension performance in poor
comprehenders. Elementary School Journal, 90(5), 469-
Levin, J. R., Shriberg, L. D., & Berry, J. K. (1983).
484.
A concrete strategy for remembering abstract prose.
American Educational Research Journal, 20, 277-290. Padron, Y. N. (1985). Utilizing cognitive reading
strategies to improve English reading comprehension of
McCormick, C. B., & Levin, J. R. (1984). A
Spanish-speaking bilingual students. Unpublished
comparison of different prose-learning variations of the
doctoral dissertation, University of Houston.
mnemonic keyword method. American Educational
Research Journal, 21, 379-398. Palinscar, A. S. (1987). Collaborating for
collaborative learning of text comprehension. Paper
McCormick, C. B., Levin, J. R., Cykowski, F., &
presented at the Annual Meeting of the American
Danilovics, P. (1984). Mnemonic-strategy reduction of
Educational Research Association, Washington, DC.
prose-learning interference. Educational
Communication and Technology Journal, 32, 145-152. Palinscar, A. S., & Brown, A.L. (1984). Reciprocal
teaching of comprehension-fostering and
Multiple Strategies References comprehension-monitoring activities. Cognition and
Instruction, 2, 117-175.
Recipr ocal Teaching Studies (Reviewed by
Reciprocal
Rosenshine & Meister
Meister,, 1994)
Rich, R. Z. (1989). The effects of training adult
Brady, P. I. (1990). Improving the reading poor readers to use text comprehension strategies.
comprehension of middle school students through Unpublished doctoral dissertation, Columbia University,
reciprocal teaching and semantic mapping strategies. New York.
Unpublished doctoral dissertation, University of Alaska.
Rush, R. T., & Milburn, J.L. (1988). The effects of Kelly, M., Moore, D. W., & Tuck, B. F. (1994).
reciprocal teaching on self-regulation of reading Reciprocal teaching in a regular primary school
comprehension in a post-secondary technical school classroom. Journal of Educational Research, 88(1), 53-
program. Paper presented at the National Reading 61. Klingner, 1998
Conference, Tucson, AZ.
Klingner, J. K., Vaughn, S., & Schumm, J. S.
Shortland-Jones, B. (1986). The development and (1998). Collaborative strategic reading during social
testing of an instructional strategy for improving reading studies in heterogeneous fourth-grade classrooms.
comprehension based on schema and metacognitive Elementary School Journal, 99(1), 3-22.
theories. Unpublished doctoral dissertation, University
of Oregon. Loranger, A. L. (1997). Comprehension strategies
instruction: Does it make a difference? Reading
Taylor, B. M., & Frye, B. J. (1992). Comprehension Psychology, 18(1), 31-68.
strategy instruction in the intermediate grades. Reading
Research and Instruction, 32(1), 39-48. Lysynchuk, L. M., Pressley, M., & Vye, N. J.
(1990). Reciprocal teaching improves standardized
Williamson, R. A. (1989). The effect of reciprocal reading-comprehension performance in poor
teaching on student performance gains in third grade comprehenders. Elementary School Journal, 90(5), 469-
basal reading instruction. Unpublished doctoral 484.
dissertation, Texas A&M University.
Palinscar, A. S., David, Y. M., Winn, J. A., &
Other Recipr ocal Teaching Studies (not rreviewed
Reciprocal eviewed Stevens, D. D. (1991). Examining the context of
by Rosenshine & Meister
Meister,, 1994) strategy instruction. Special issue: Cognitive instruction
Carnine, D., & Kinder, D. (1985). Teaching low- and problem learners. RASE: Remedial and Special
performing students to apply generative and schema Education, 12(3), 43-53.
strategies to narrative and expository material.
Soriano, M., Vidal-Abarca, E., & Miranda, A.
Remedial and Special Education, 6, 20-30.
(1996). Comparación de dos procedimentos de
Chan, L. D. S., & Cole, P. G. (1986). Effects of instrucción en comprensión y aprendizaje de textos:
inference training on children’s comprehension of Instrucción directa y enseñanza recíproca. [Comparison
expository text. Remedial and Special Education, 7, 33- of two procedures for instruction in comprehension and
40. text learning: Direct instruction and reciprocal
teaching.] Infancia y Aprendizaje, 74, 57-65.
Gilroy, A., & Moore, D. W. (1988). Reciprocal
teaching of comprehension-fostering and Taylor, B. M., & Frye, B. J. (1992). Comprehension
comprehension-monitoring activities with ten primary strategy instruction in the intermediate grades. Reading
school girls. Special Issue: Changing academic Research and Instruction, 32(1), 39-48.
behavior. Educational Psychology, 8(1-2), 41-49.
Other Multiple Strategy Treatments
Grant, J., Elias, G., & Broerse, J. (1989). An Adams, A., Carnine, D., & Gersten, R. (1982).
application of Palinscar and Brown’s comprehension Instructional strategies for studying content area texts in
instruction paradigm to listening. Contemporary the intermediate grades. Reading Research Quarterly,
Educational Psychology, 14(2), 164-172. 18(1), 27-55.
Jacobs, J. E., & Paris, S. G. (1987). Children’s Anderson, V., & Roit, M. (1993). Planning and
metacognition about reading: Issues in definition, implementing collaborative strategy instruction for
measurement, and instruction. Educational Psychologist, delayed readers in grades 6-20. Special Issue:
22, 255-278. Strategies instruction. Elementary School Journal, 94(2),
121-137.
Blanchard, J. S. (1980). Preliminary investigation of Stevens, R. J., Madden, N. A., Slavin, R. E., &
transfer between single-word decoding ability and Farnish, A. M. (1987). Cooperative integrated reading
contextual reading comprehension by poor readers in and composition: Two field experiments. Reading
grade six. Perceptual and Motor Skills, 51(3), t2. Research Quarterly, 22(4), 433-454.
Brown, R., Pressley, M., Van Meter, P., & Schuder, Stevens, R. J., Slavin, R. E., & Farnish, A. M.
T. (1996). A quasi-experimental validation of (1991). The effects of cooperative learning and
transactional strategies instruction with low-achieving instruction in reading comprehension strategies on main
second-grade readers. Journal of Educational idea identification. Journal of Educational Psychology,
Psychology, 88(1), 18-37. 83(1), 8-16.
Carr, E., Bigler, M., & Morningstar, C. (1991). The Prior Knowledge
effects of the CVS strategy on children’s learning.
Au, K. (1980). Participation structures in a reading
Pelow, R. A., & Colvin, H. M. (1983). PQ4R as it lesson with Hawaiian children. Anthropology and
affects comprehension of social studies reading Education Quarterly, 11, 91-115.
material. Social Studies Journal, 12, 14-22 (Spring).
Brown, A. L., Smiley, S. S., Day, J. D., Townsend,
Reutzel, D. R., & Hollingsworth, P. M. (1991). M. A. R., & Lawton, S. C. (1977). Intrusion of a
Reading comprehension skills: Testing the thematic idea in children’s comprehension and retention
distinctiveness hypothesis. Reading Research and of stories. Child Development, 48, 1454-1466.
Instruction, 30(2), 32-46.
Dewitz, P., Carr, E. M., & Patberg, J. P. (1986).
Reutzel, D. R., & Hollingsworth, P. M. (1991). Effects of inference training on comprehension and
Reading time in school: Effect on fourth graders’ comprehension monitoring. Reading Research
performance on a criterion-referenced comprehension Quarterly, 22, 109-119.
test. Journal of Educational Research, 84(3), 170-176.
Dole, J. A., Valencia, S. W., Greer, E. A., &
Ritchie, P. (1985). Graduate research: reviews and Wardrop, J. L. (1991). Effects of two types of
commentary: The effects of instruction in main idea and prereading instruction on the comprehension of
question generation. Reading-Canada-Lecture, 3(2), narrative and expository text. Reading Research
139-146. Quarterly, 26(2), 142-159.
Sindelar, P. T. (1982). The effects of cross-aged Hansen, J., & Pearson, P. D. (1983). An
tutoring on the comprehension skills of remedial reading instructional study: Improving the inferential
students. Journal of Special Education, 16(2), 199-206. comprehension of good and poor fourth-grade readers.
Journal of Educational Psychology, 75(6), 821-829.
Smith, K., Johnson, D. W., & Johnson, R. T.
(1981). Can conflict be constructive? Controversy Manzo, A. V. (1979). The reQuest procedure.
versus concurrence seeking in learning groups. Journal Journal of Reading, 2, 123-126.
of Educational Psychology, 73(5), 651-663.
Linden, M., & Wittrock, M. C. (1981). The
Stevens, R. J. (1988). Effects of strategy training teaching of reading comprehension according to the
on the identification of the main idea of expository model of generative learning. Reading Research
passages. Journal of Educational Psychology, 80(1), 21- Quarterly, 17(1), 44-57.
26.
Neuman, S. B. (1988). Enhancing children’s
comprehension through previewing. National Reading
Conference Yearbook, 37, 219-224.
Palinscar, A. S., & Brown, A. L. (1984). Davey, B., & McBride, M. (1986). Effects of
Reciprocal teaching of comprehension-fostering and question-generation on reading comprehension. Journal
comprehension-monitoring activities. Cognition and of Educational Psychology, 22, 2-7.
Instruction, 2, 117-175.
Lysynchuk, L. M., Pressley, M., & Vye, N. J.
Prince, A. T., & Mancus, D. S. (1987). Enriching (1990). Reciprocal teaching improves standardized
comprehension: A scheme altered basal reading lesson. reading-comprehension performance in poor
Reading Research and Instruction, 27(1), 45-54. comprehenders. Elementary School Journal, 90(5), 469-
484.
Roberts, T. A. (1988). Development of pre-
instruction versus previous experience: Effects on MacGregor, S. K. (1988). Use of self-questioning
factual and inferential comprehension. Reading with a computer-mediated text system and measures of
Psychology, 9(2), 141-157. reading performance. Journal of Reading Behavior,
20(2), 131-148.
Spires, H. A., Gallini, J., & Riggsbee, J. (1992).
Effects of schema-based and text structure-based cues Palinscar, A. S. (1987). Collaborating for
on expository prose comprehension in fourth graders. collaborative learning of text comprehension. Paper
Journal of Experimental Education, 60(4), 307-320. presented at the Annual Meeting of the American
Educational Research Association, Washington, DC.
Tharp, R. G. (1982). The effective instruction of
comprehension: Results and description of the Palinscar, A. S., & Brown, A. L. (1984).
Kamehameha Early Education Program. Reading Reciprocal teaching of comprehension-fostering and
Research Quarterly, 17(4), 503-527. comprehension-monitoring activities. Cognition and
Instruction, 2, 117-175.
Wood, E. G., Winne, P., & Pressley, M. (1988).
Elaborative interrogation, imagery, and provided precise Taylor, B. M., & Frye, B. J. (1992). Comprehension
elaborations as facilitators of children’s learning of strategy instruction in the intermediate grades. Reading
arbitrary prose. Paper presented at the American Research and Instruction, 32(1), 39-48.
Educational Research Association, New Orleans.
Williamson, R. A. (1989). The effect of reciprocal
Psycholinguistic teaching on student performance gains in third grade
basal reading instruction. Unpublished doctoral
Baumann, J. F. (1986). Teaching third-grade dissertation, Texas A&M University.
students to comprehend anaphoric relationships: The
application of a direct instruction model. Reading
Generic Questions or Question Stems Prompts
Research Quarterly, 21(1), 70-90.
King, A. (1989). Effects of self-questioning training
Question Generation (Reviewed by on college students’ comprehension of lectures.
Rosenshine, Meister, & Chapman, 1996) Contemporary Educational Psychology, 14, 366-381.
Garner, R., Hare, V. C., Alexander, P. A., Haynes, Rowls, M. D. (1976). The facilitative and
J., & Winograd, P. (1984). Inducing use of a text interactive effects of adjunct questions on retention of
lookback strategy among unsuccessful readers. eight graders across three prose passages: Dissertation
American Educational Research Journal, 21, 789-798. in prose learning. Journal of Educational Psychology, 68,
205-209.
Garner, R., Macready, G. B., & Wagoner, S.
(1984). Readers’ acquisition of the components of the Serenty, M. L., & Dean, R. S. (1986). Interspersed
text-lookback strategy. Journal of Educational post passage questions and reading comprehension
Psychology, 76, 300-309. achievement. Journal of Educational Psychology, 78(3),
228-229.
Griffey, Q. L., Jr., et al. (1988). The effects of self-
questioning and story structure training on the reading Sheldon, S. A. (1984). Comparison of two teaching
comprehension of poor readers. Learning Disabilities methods for reading comprehension. Journal of
Research, 4(1), 45-51. Research in Reading, 7(1), 41-52.
Levin, J. R., & Pressley, M. (1981). Improving Watts, G. H. (1973). The “arousal” effect of
children’s prose comprehension: Selected strategies that adjunct questions on recall from prose materials.
seem to succeed. In C. M. Santa & B. L. Hayes Australian Journal of Psychology, 25, 81-87.
(Eds.), Children’s prose comprehension: Research and
practice (pp. 44-71). Newark, DE: International Wixson, K. K. (1983). Questions about a text: What
Reading Association. you ask about is what children learn. Reading Teacher,
37(3), 287-293.
Pressley, M., & Forrest-Pressley, D. (1985).
Questions and children’s cognitive processing. In A. C. Story Structure
G. B. Black (Ed.), The psychology of questions (pp.
Baumann, J. F., & Bergeron, B. S. (1993). Story
277-296). Hillsdale, NJ: Erlbaum.
map instruction using children’s literature: Effects on
Raphael, T. E., & McKinney, J. (1983). An first graders’ comprehension of central narrative
examination of fifth- and eighth-grade children’s elements. Journal of Reading Behavior, 25(4), 407-437.
question-answering behavior: An instructional study in
Buss, R. R., Ratliff, J. L., & Irion, J. C. (1985).
metacognition. Journal of Reading Behavior, 15(3), 67-
Effects of instruction on the use of story structure in
86.
comprehension of narrative discourse. National Reading
Raphael, T. E., & Pearson, P. D. (1985). Increasing Conference Yearbook, 34, 55-58.
students’ awareness of sources of information for
Fitzgerald, J., & Spiegel, D. L. (1983). Enhancing
answering questions. American Educational Research
children’s reading comprehension through instruction in
Journal, 22, 217-235.
narrative structure. Journal of Reading Behavior, 15(2),
Raphael, T. E., & Wonnacott, C. A. (1985). 1-17.
Heightening fourth-grade students’ sensitivity to
Gordon, C. J., & Rennie, B. J. (1987).
sources of information for answering comprehension
Restructuring content schemata: An intervention study.
questions. Reading Research Quarterly, 20(3), 282-296.
Reading Research and Instruction, 26(3), 162-188.
Richmond, M. G. (1976). The relationship of the
Greenewald, M. J., & Rossing, R. L. (1986). Short-
uniqueness of prose passages to the effect of question
term and long-term effects of story grammar and self-
placement and question relevance on the acquisition and
monitoring training on children’s story comprehension.
retention of information. In G. H. Mc Ninch (Ed.).
National Reading Conference Yearbook, 35, 210-213.
Reflections and investigations on reading. Twenty-fifth
Yearbook of the National Reading Conference (pp.
268-278). Clemson, SC: National Reading Conference.
Griffey, Q. L., Jr., et al. (1988). The effects of self- Varnhagen, C. K., & Goldman, S. R. (1986).
questioning and story structure training on the reading Improving comprehension: Causal relations instruction
comprehension of poor readers. Learning Disabilities for learning handicapped learners. Reading Teacher,
Research, 4(1), 45-51. 39(9), 896-904.
Singer, H., & Donlan, D. (1982). Active Brown, A. L., Day, J. D., & Jones, R. S. (1983).
comprehension: Problem-solving schema with question The development of plans for summarizing texts. Child
generation for comprehension of complex short stories. Development, 48, 968-979.
Reading Research Quarterly, 17(2), 166-186.
Carnine, D. W., Kameenui, E. J., & Woolfson, N.
Spiegel, D. L., & Fitzgerald, J. (1986). Improving (1982). Training of textual dimensions related to text-
reading comprehension through instruction about story based inferences. Journal of Reading Behavior, 14(3),
parts. Reading Teacher, 39(7), 676-682. 335-340.
Reutzel, D. R., & Hollingsworth, P. M. (1988). Duffy, G. G., Roehler, L. R., Sivan, E., Rackliff, G.,
Highlighting key vocabulary: A generative-reciprocal Book, C., Meloth, M. Vavrus, L., Wesselman, R.,
procedure for teaching selected inference types. Rutnma, J., & Bassiri, D. (1987). Effects of explaining
Reading Research Quarterly, 23(3), 358-378. the reasoning associated with using reading strategies.
Reading Research Quarterly, 22(3), 347-368.
Rinehart, S. D., Stahl, S. A., & Erickson, L. G.
(1986). Some effects of summarization training on Franklin, M. R., et al. (1993). Overcoming the
reading and studying. Reading Research Quarterly, reading comprehension barriers of expository texts.
21(4), 422-438. Educational Research Quarterly, 16(1), 5-14.
COMPREHENSION II
Appendices
Appendix A
A total of 203 studies met the Panel’s criteria for noticing the comprehension blocks. Comprehension
inclusion as scientific studies on comprehension monitoring training is intended to provide readers with
instruction. These studies were grouped into 16 steps that they can take to resolve reading problems as
different categories, each representing a particular they arise. Steps may include formulating what the
instructional strategy or collection of strategies. In the difficulty is, restating what was read, looking back
following pages, each category of studies is through the text, and looking forward in the text for
summarized. The Panel defines and describes the information that might help to resolve a problem
rationale for each kind of instructional strategy, the (Bereiter & Bird, 1985).
procedures used, and how the instruction was assessed
The Panel found 20 studies on comprehension
by the researchers. The Panel then evaluates the
monitoring. Table 2, on the following page, summarizes
category of instruction, based on reported results.
the rationale, procedures, and assessment of research
Comprehension Monitoring (Also Known studies on the instruction of comprehension monitoring
as Metacognitive Awareness) strategies.
TABLE 2
COMPREHENSION M ONITORING INSTRUCTION
TABLE 3
COOPERATIVE LEARNING INSTRUCTION
The aim of cooperative learning is to Students are taught and allowed Experimenter tests
teach children to read together with to participate in partner reading,
a partner. Readers learn to read summarization of paragraphs, and Analyses of peer talk
aloud with a partner and to listen to turn- taking in making predictions. during cooperative learning
the partner's reading. Readers are Oral reading and listening is done Summarization
given activities that teach them by reader and peers. Prediction
strategies for effective reading
comprehension. Training is given, and children Standardized tests
learn to carry out activities that
The readers become independent of follow the self or partner reading,
the teacher and learn to tutor each including word recognition
other. This reduces the amount of (decoding), story structure,
time that the teacher spends with a prediction, and story summary
student. activities related to texts.
TABLE 4
CURRICULUM PLUS STRATEGY INSTRUCTION
The goal of a curriculum strategy The focus of these studies is Experimenter tests
is to provide students with the interaction between - Comprehension
multiple opportunities, in an teachers and students. The - Monitoring
ongoing school context, to idea behind adding strategic
become aware of and develop teaching is to attain Standardized tests
their cognitive processes across consistency in this
school subjects and throughout interaction despite variation Achievement grades
the school year. in content.
TABLE 5
GRAPHIC ORGANIZER INSTRUCTION
Readers are instructed to make Teachers show readers how Experimenter tests
graphic representations of text to create graphic - Summaries
material. organization of ideas. - Text recall
Teachers may provide
Graphic organizers include graphic metaphors such as Standardized tests
semantic maps, expository maps, making an umbrella for main - Comprehension subtest of
story maps, story schema, and ideas and putting details - Gates- MacGinitie
graphic metaphors. below the topic. Reading Test
or
Experimenter Tests The Panel found four studies on listening instruction and
Seven studies used recall of the text content to evaluate comprehension of text. Table 6, on the following page,
the effect of training on the use of a graphic organizer. summarizes the rationale, procedures, and assessment
Six of the seven report significant benefits to the of research studies on listening instruction.
experimental groups; one reported a null finding. Four
studies (three other than those using recall) report Evaluation
significant achievement gains in the content area. Thus, The Panel found four studies that investigated how
the main effect of graphic organizers is on improving listening during reading affects comprehension
the reader’s memory for the content that is read.
Grade Level
Standardized Tests Listening studies were carried out on students in grade
Two studies reported positive findings on grades 6 level 1, n = 1; level 4, n = 1; level 5, n = 1; and level 6,
through 8 for standardized tests to evaluate transfer n = 1.
from learning to organize content graphically.
Experimenter Tests
Summary Evaluation of Graphic Questions answering showing improvement in two
Organizer Instruction studies.
Teaching students to use a systematic, visual graph to Standardized Tests
organize the ideas that they are reading about develops Improvement is reported in two studies on standardized
the ability of the students to remember what they read tests.
and may transfer in general to better comprehension
and achievement in social studies and science content Summary Evaluation of Listening Instruction
areas. Direct instruction on learning to listen to others
Listening Actively (teachers or peers) who read while following in the text
what is read may benefit students’ comprehension in
Listening is the “act of understanding speech.” A specific and in more general ways.
child’s “listening comprehension level” is the “highest
grade level of material that can be comprehended well Mental Imagery
when it is read aloud to the student,” also known as A mental image is “a perceptual representation or
“auding, the processes of perceiving, recognizing, ideational picture of a perceptual experience,
interpreting, and responding to oral language” (Harris & remembered or imagined” (Harris & Hodges, 1995,
Hodges, 1995, p. 140 and p. 14, respectively). p. 152).
Listening to another person read and following what is In imagery training, students are instructed to construct
being read by reading the text is a method used to teach visual images to represent a text as they read it. The
students how to listen while reading. In the 1970s, text is often a short passage or a sentence. Imagery
efforts were made to train listening skills in general. training improves students’ memory (Levin & Divine-
Dickson (1981) summarizes the relevant work on this Hawkins, 1974) and inferential reasoning about written
kind of training. text (Borduin, 1994).
Active listening by the student can promote reading The Panel found seven studies on mental imagery
comprehension. Students have been taught more instruction. Table 7, on the following page, summarizes
effective listening by applying Palinscar’s and Brown’s the rationale, procedures, and assessment of research
(1984) reciprocal teaching (see below) strategies to studies on mental imagery instruction.
listening (Grant, 1989). For students in a remedial
reading class, listening lessons improved their critical
listening, critical reading, and general reading
comprehension.
TABLE 6
LISTENING ACTIVELY INSTRUCTION
TABLE 7
M ENTAL IMAGERY STRATEGY INSTRUCTION
TABLE 8
M NEMONIC INSTRUCTION
One variant of multiple-strategy instruction is called The effect sizes (Rosenshine & Meister, 1994, Table 5,
“reciprocal teaching.” The teacher first models page 194) for experimenter tests (10) studies averaged
(demonstrates through personal use) and then explains 0.88; for standardized tests (9 studies), the average
what a strategy is and when to use it (Palinscar & effect size was 0.32. These values were about the
Brown, 1984; Lysynchuk et al., 1990). At first, the same for high- and low-quality studies (0.88 and 0.86,
teacher guides the reader in applying and practicing respectively, for experimenter tests; 0.31 and 0.36,
strategies while reading a passage. Modeling includes respectively, for standardized tests). The low-quality
not only examples but the teacher “thinking aloud” to studies showed the same effect (0.87) for experimenter
demonstrate the coordinated use of strategies. tests but a small negative effect (-0.12) for standardized
Gradually, the student begins to practice and implement tests. Excluding the low-quality studies, the effect size
each strategy independently. In explicit transactional for standardized tests was raised to 0.36 (seven
approaches that use multiple strategies, the teacher will studies).
explain a strategy before modeling it in a passage
Effect size varied as a function of reader ability. Table
(Rosenshine & Meister, 1994).
11 summarizes these data.
The Panel found 38 studies on multiple-strategies
In Table 10, it can be seen that the magnitude of the
instruction. Of these, 27 studies were on “reciprocal
effect size for experimenter tests was larger for below-
teaching.” The definitions, rationales, procedures, and
average or poor readers. Despite greater efficacy of
assessments for “reciprocal teaching” are described in
specific training, scores of standardized tests declined
Table 9, on the following page. The 11 studies on other
as did the ability of the reader. These data suggest that
treatments of multiple strategies are summarized in
good readers benefit and generalize what they learn as
Table 12.
strategies more than do poor or below-average readers.
Evaluation of Reciprocal Teaching
Rosenshine and Meister (1994) tested for the
Meta-analysis significance of effect sizes and examined their results
as a function of grade level, excluding below-average
In “reciprocal teaching,” the teacher models by showing
readers. These data are summarized in Table 12. Their
how she or he would try to understand the text, using
results show that reciprocal teaching of strategies is not
two or more combinations of four strategies: question
significant for grade 3, is mixed for grades 4, 5, and 6,
generation, summarization, clarification, and prediction
and is significant for grades 7 and 8. Thus, as measured
of what might occur. Rosenshine and Meister (1994)
by significant effect sizes, the older readers benefit
conducted a meta-analysis on 16 reciprocal training
most from reciprocal teaching.
studies. Rosenshine and Meister used the criteria of
selection that was adopted by us: a study had to be an Reciprocal Teaching Studies Not
experimental study with controls and use random Reviewed by Rosenshine & Meister, 1994
assignment or matching of conditions. The grade levels
studied were 1 through 8, distributed as level 1, n = 1; The Panel located 11 studies on reciprocal teaching that
level 2, n = 1; level 3, n = 4; level 4, n = 6; level 5, n = 3; were not covered in the meta-analysis of Rosenshine
level 6, n = 4; level 7, n = 4; and level 8, n = 1. The and Meister (1994). These studies covered grade levels
modal grade for reciprocal teaching was grade 4, but from 1 to 6 (level 1, n = 1; level 2, n = 1; level 3, n = 3;
high numbers occur for grades 3 through 7 in these level 4, n = 3; level 5, n = 3; and level 6, n = 1). These
studies (4 on average). Reciprocal teaching using studies tended to use more strategies (seven had
multiple strategies presumes basic reading (decoding) combinations of summarization, question generation,
skills, even on those two or more grades below level. clarification, and prediction) and added, in one case
each, either monitoring or collaborative learning. Four
The kinds of strategies included varied from one to four studies reported improvement on experimenter tests,
components of summarization, question generation, and three reported significant improvement on
clarifying, and predicting. Question generation was most standardized tests. These data are consistent with those
frequent (nine studies), followed by summarizing (six of Rosenshine and Meister (1994).
studies).
TABLE 9
MUL TIPLE STRA
ULTIPLE TEGIES: RECIPROCAL TEACHING
TRATEGIES
TABLE 9
M ULTIPLE STRATEGIES: RECIPROCAL TEACHING
Multiple- strategies instruction The teacher guides the reader in Experimenter tests
is designed to take place in applying and practicing strategies - Learning and use of strategies
the context of a dialog while reading a passage. is assessed by analyses of:
between the teacher and the · Recall
students- - each of whom The teacher models each strategy · Generating
reads text passages. In some in the context of reading a · Answering questions
cases, the teacher also passage. The student then applies
· Summarizing (main idea)
explains a strategy. the strategy to his or her own
· Predicting (what will happen in
reading of a passage.
new passage)
There are four main strategies
Content area achievement
(varies from two to four):
Standardized tests
1. Generation of questions during
reading
TABLE 10
EFFECT SIZE AS FUNCTION OF READER ABILITY
Standard Experimenter
TABLE 11
EFFECT SIZE SIGNIFICANCE AND GRADE LEVEL
Good- poor/All
3 X
3 X
4 X
4&6 X
4&7 X
5&6 X
6, 7, 8 X
7 X
TABLE 12
M ULTIPLE STRATEGIES: OTHER TREATMENT COMBINATIONS
The instruction takes There are several skills that are Same as reciprocal
place primarily through practiced here. teaching
the student practicing a Packages of skills vary in
given strategy, with number from 2 to 5:
feedback from the
teacher. The teacher may Self study of the passage.
initially model the Oral reading
strategy. Rereading
Retelling
Review
Summarization of main ideas
Generation of questions
Testing hypotheses
Deriving word meaning from
morphemes
Word recognition training
Vocabulary instruction
Drawing conclusions
Filling in blanks in the passage
(Cloze procedure)
Monitoring of comprehension
Story structure
Collaborative learning with
partner, including listening to
partner reading. Debating or
arguing with the author of the
text or with the teacher or
partner
Classification of words,
phrases, and sentences
TABLE 13
PRIOR KNOWLEDGE ELICITATION
procedure (Martin & Pressley, 1991). This suggests teacher, and readers then read text that relates to what
that question elaboration, generation (see below), and has been learned. Prior knowledge activation occurs
answering (see below) are related in that they all with several strategies, notably question elaboration,
necessarily activate and use prior knowledge. generation, and answering.
The Panel found 14 studies on prior knowledge Psycholinguistic Instruction
instruction. Table 13, on the previous page, summarizes
the rationale, procedures, and assessment of research Psycholinguistics is “the interdisciplinary field of
studies on prior knowledge instruction. psychology and linguistics in which language behavior is
examined. Psycholinguistics includes such areas of
Evaluation inquiry as language acquisition, conversational analysis,
and the sequencing of themes and topics in discourse”
Grade Level (Harris & Hodges, 1995, p. 197).
The activation and use of what the reader knows that is
relevant to what is being read has been studied The Panel found only one study that trained readers on
experimentally for students in grades 1 through 9. The a psycholinguistic skill, for example, understanding the
distribution of these grade levels is level 1, n = 1; level referents of pronouns. This kind of instruction helps
2, n = 2; level 3, n = 1; level 4, n = 6; level 5, n = 2; level young and developing readers recognize “words that
6, n = 2; and level 9, n = 1. stand for other words” in “anaphoric” relationships, that
is, personal pronouns or repeated nouns such as when
Methods the word “it” refers to a preceding noun, noun phrase,
Most of the studies activated knowledge prior to or clause (Baumann, 1986). Baumann’s study on
reading by asking the students to think about topics teaching 3rd graders anaphoric reference found that the
relevant to the passage to be read (five studies). The experimental treatment group increased in accuracy in
remaining studies varied in how prior knowledge was identifying referents. No transfer or standardized tests
made available: teaching the relevant knowledge (two were used.
studies), pre-reading (one study), predicting based on Table 14, on the following page, summarizes the
one’s own experience (one study), making associations rationale, procedures, and assessment of research
during reading (one study), and previewing the story or studies on psycholinguistic instruction.
text (two studies). Two studies did not specify their
methods in the abstracts. Evaluation
Readers need to learn that words Teachers model or show readers how to Experimenter Tests
that stand for or refer to other identify the antecedents of pronouns and Students answer pronoun-
words, e.g., "she" stands for a to answer questions based on identified specific questions after
female referent introduced earlier antecedents. reading expository or
in the text. narrative texts.
Readers learn to identify noun substitutes,
This strategy is used to verb substitutes, and clause substitutes. Students write down the
communicate the use of a word or antecedents for underlined
phrase that stands for a preceding anaphoric terms in
word or phase, like a pronoun. expository text.
TABLE 15
QUESTION ANSWERING
Question- answering strategy Teachers ask students questions during or Experimenter Tests
instruction assists students learning after reading passages of text. - Recall
from a text. A question focuses - Short answer questions
the student on particular content Teachers ask students to look back to - Look back in text to
and can facilitate reasoning (e.g., find answers to questions that they answer question
answering why or how). cannot answer after one reading.
training, young readers are not likely to question of whether the text is being understood. When the
themselves. Nor are they likely to use questions teacher is present, the reader’s creation of questions
spontaneously to make inferences. The assumption of may signal success or failure in comprehension and
question generation instruction is that readers will learn prompt the teacher or the reader to attempt to
to engage text by making queries that lead to the compensate for comprehension failure. Finally, question
construction of better memory representations. The generation has been studied in isolation or as a multiple-
goal is to teach students to make these self-questions strategy instruction program such as reciprocal
while reading. If one asks why, how, when, where, teaching.
which, and who kinds of questions, it is possible to
In the Panel’s search, it located a recent literature
integrate segments of text, to thereby improve reading
review on question generation by Rosenshine, Meister,
comprehension and memory for what is read, and to
and Chapman (1996). Rosenshine and his colleagues
gain a deeper understanding of the text. Question
conducted a meta-analysis of 30 studies that instructed
generation should also increase the reader’s awareness
students how to generate questions during reading, statistically significant. Thus the effects of instruction of
either as a single strategy or in combination with other question generation are specific to learning the
reading strategies. Of these, 11 studies used the particular strategy and may not generalize to
“reciprocal teaching” method, and question generation standardized tests.
was part of a set of two or more strategies that were
taught. These studies were described in Table 10 Summary Evaluation of
above. Nineteen additional studies reviewed by Question Generation
Rosenshine et al. (1996) investigated instruction of There is strong empirical and scientific evidence that
question generation alone or in combination with instruction of question generation during reading
strategies not taught by reciprocal teaching methods. benefits reading comprehension in terms of memory
The Panel found 27 studies on question generation and answering questions based on text as well as
instruction. Table 16, on the following page, integrating and identifying main ideas through
summarizes the rationale, procedures, and assessment summarization. There is mixed evidence that general
of research studies on question generation instruction. reading comprehension improved on standardized
comprehension tests. Question generation may be best
Evaluation used as a part of a multiple-strategy instruction
program.
The main evaluation of question generation is based on
the meta-analysis of Rosenshine, Meister, and Chapman Story Structure
(1996) who employed the same criteria as Rosenshine
and Meister (1994) for selection of studies. A story is “an imaginative tale shorter than a novel but
with a plot, characters, and setting, as a short story.” A
Grade Level “story map” is “a time line showing the ordered
The study of question generation instruction begins with sequence of events in a text” or “a semantic map
grade 3 and has been carried out up to grade 9. The showing the meaning of relationships between events or
distribution of grade levels in this study of this kind of concepts in the text, regardless of their order.” (Harris
instruction is level 3, n = 3; level 4, n = 6; level 5, n = 4; & Hodges, 1995, pp. 243-244). Story structure refers to
level 6, n = 9; level 7, n = 4; level 8, n = 3; level 9, n = 2. the finding in discourse analysis that the content of
The modal level is grade 6. stories is systematically organized into episodes and that
the plot of a story is a set of episodes. Knowledge of
Experimenter Tests episodic content (setting, initiating events, internal
The respective effect sizes for multiple choice (n = 6), reactions, goals, attempts, and outcomes) helps the
short-answer (n = 14), and summary (n = 3) measures reader understand the who, what, where, when, and
were 0.95, 0.85, and 0.85. why of stories as well as what happened and what was
done.
Standardized Tests
Story structure instruction is a method by which the
The median effect size for 13 studies that used
teacher teaches the reader knowledge and procedures
standardized comprehension tests was 0.36. The
for identifying the content of the story and the way it is
median effect sizes for standardized vs. experimenter
organized into a plot structure. In addition to learning
tests are reported in Table 17 (following Table 16),
the episodic content, the reader can learn to infer causal
broken down by reciprocal teaching and other
and other relationships between sentences that contain
treatments. The magnitude of the median effect sizes in
the content. This learning gives the reader knowledge
Table 17 is approximately the same as that found for
and procedures for deeper understanding of stories and
reciprocal teaching of multiple strategies. There is an
allows the reader to construct more coherent memory
overlap of studies here so that the similarity is likely a
representations of what occurred in the story.
result of common studies. It is of interest that although
there is a positive effect size for standardized tests, only
3 out of 13 are statistically significant. Experimenter
tests fare better here because 16 out of 19 are
TABLE 16
QUESTION GENERATION INSTRUCTION
The goal of question generation is Teachers ask children to generate Experimenter tests
to teach readers to become questions during the reading of a - Quality of questions
independent, active, self- passage. The questions should integrate generated
questioners. The assumption is that information across different parts of the - Question answering
readers will learn more and passage.
construct better memory Standardized comprehension
representation when self- questions Teachers ask children to evaluate their tests
are asked while reading. questions about whether the questions
covered important material, were
Integrative questions that capture integrative and could be answered based
large units of meaning should on what is in the text.
improve reading comprehension
and memory of text by making Teachers provide feedback on the quality
readers more active while reading. of the questions asked or assist students
in answering the questions generated.
Question generation is often a part
of a multiple- strategy program Teachers teach the students to evaluate
such as reciprocal teaching. whether their questions covered
important information, whether the
Question generation should questions were integrative, and whether
increase students' awareness of they themselves could answer the
whether they are comprehending questions.
text.
O THER
TREATMEN TS
N umber Significant 3 of 7 9 of 12
TABLE 18
STORY STRUCTURE INSTRUCTION
Instruction is aimed at teaching the Teachers teach students to ask and Experimenter tests
student how stories and their plots answer five questions: - Retell the story (recall)
are organized into episodes. - Short- answer questions
1. Who is the main character?
Readers know a great deal about 2. Where and when did the story occur?
the content and structure of stories 3. What did the main characters do?
as a genre. However, training in 4. How did the story end?
how stories and their plots are 5. How did the main character feel?
organized into episodes can aid a
reader in understanding the who, Students learn to identify the main
what, where, when, and why of characters of the story, where and when
narratives. the story took place, what the main
characters did, how the story ended, and
Stories often entail problems that how the main characters felt.
are faced by people, and they
provide a context in which students Students learn to construct a story map
can learn about problemsolving by recording the setting, problem, goal,
experiencing the lives of others. action, and outcome over time.
Asking and answering the questions
of who, what, when, where, and Students construct a story map while
why, as well as learning about reading stories. Some mapping
problems and their solutions, are procedures require recording the setting,
useful procedures that are trans- problem, goal, action, and outcome
situational and apply to stories as information.
well as to real life.
The Panel found 17 studies on story structure The assumption in teaching students how to summarize
instruction. Table 18, which follows Table 17, what they read is that most students do not summarize
summarizes the rationale, procedures, and assessment well. The central aim of most summarization instruction
of research studies on story structure instruction. is to teach the reader how to identify the main or
central ideas of a paragraph or a series of paragraphs.
Evaluation
Summarization training is effective. It can be
Grade Level transferred to situations requiring general reading
Research on story structure instruction begins in grade comprehension, and it leads to improved written
3, n = 2, but increases in grade 4, n = 8 (four studies on summaries. Summarization training can make students
poor readers). This trend continues into grade 5, n = 7 more aware of the way a text is structured and how
(on poor readers), and grade 6, n = 2 (on poor readers). ideas are related. If asked to summarize, students have
to pay closer attention to the text while they read. They
Experimenter Tests also learn to spend more time on reading and trying to
The main kinds of tests used to evaluate experimental understand what they read. In some instances, training
training on story structure are recall (n = 10 successes increases the quality of students’ note taking and recall
and 1 failure in grade 5 among normal readers), of major information (Rinehart, 1986).
question answering on the stories (n = 8 successes, and
1 failure in grade 5 among normal readers), and The Panel found 18 studies on summarization
identifying the elements of a story structure (n = 5 instruction. Table 19, on the following page,
successes and 2 failures: 1 in grade 3 and 1 in grade 5, summarizes the rationale, procedures, and assessment
both with normal readers). All studies on poor readers of research studies on summarization instruction.
report improvement on experimenter tests. Evaluation
Standardized Tests Grade Level
Three studies report the use of standardized tests Summarization instruction studies are rare below grades
following training in story structure. There were two 5 and 6. Of those reporting information on grades
successes and one failure (grade 5, normal readers). studied, we found one level 3 and one level 4. There
Summary Evaluation of Story were four and nine studies on grades 5 and 6,
respectively. There was one study at the high school
Structure Instruction
level. Summarization often presupposes writing as well
Instruction in the content and organization of stories as reading skill. This may be one reason for its use for
improves comprehension of stories as measured by the upper elementary school grades.
ability of the reader to answer questions and recall what
was read. This improvement is more marked for less Experimenter Tests
able readers. More able readers may already know The majority of the studies reported improvement of the
what a story is about and therefore do not benefit as quality of summaries (n = 11). Other studies reported
much from the training. However, this kind of improved recall of what was summarized (n = 7) and
instruction aids both kinds of readers. improved question answering (n = 4). No negative
findings were reported.
Summarization
Standardized Tests
A summary is “a brief statement that contains the
essential ideas of a longer passage or selection” (Harris Standardized tests were rarely used. Only two studies
& Hodges, 1995, p. 247). To be able to create a reported using them on 6th graders; one succeeded and
summary of what one has just read, one must discern the other failed in increasing comprehension.
the most central and important ideas in the text. One
also must be able to generalize from examples or from
things that are repeated. In addition, one has to ignore
irrelevant details.
TABLE 19
SUMMARIZATION INSTRUCTION
TABLE 20
TEACHER PREPARATION ON COMPREHENSION INSTRUCTION
The aim of teacher Teachers undergo preparation in multiple Experimenter tests Fidelity to
preparation is to instruct strategies and explanation of strategies. treatment by teachers:
teachers in teaching reading - Do teachers learn and
comprehension strategies in Teachers are instructed in strategic teach the strategies in
the classroom context and in reading techniques and a collaborative which they were
natural interaction with transactional approach to reading trained?
students. informational texts. - Videotape pre- and
posttests
Teachers are prepared to make decisions - Reading sessions
and explain mental processing associated
with reading skills as strategies. Comprehension by students:
- Do students learn and
Self- evaluative workshops are often used practice the strategies
for learning and feedback. Teachers also taught?
learn from the use of transcripts of - Do students show gains in
lessons, videos, and post- lesson reading comprehension?
interviews.
Awareness of lesson content
Achievement in content
learning
TABLE 21
VOCABULARY INSTRUCTION AND RELATION TO COMPREHENSION
The aim of vocabulary Teacher models being a "word detective," Experimenter Tests
instruction is to use looking for contextual clues to find word - Word meanings
instruction and reciprocal meaning, a synonym, or an antonym by - Cloze tests
teaching methods to teach analyzing words and word parts and by
strategies for discovering looking at surrounding text description for Standardized tests
the meanings of unfamiliar clues to meaning.
words.
Teachers elaborate on word meanings and
Intensive vocabulary use them in diverse contexts, adding activities
instruction is designed to to extend use of learned words beyond the
promote word knowledge classroom.
that will enhance text
comprehension. The learning tasks provide definitions,
knowledge, fluent access to word meanings,
context interpretation, and story
comprehension.
Vocabulary Instruction and Relation to story comprehension. The author reports success in
Comprehension learning of the words and use of word meanings and in
increased story comprehension. In addition, there is a
Vocabulary knowledge is correlated with reading study by Tomeson and Aarnouste (1998), who applied
comprehension (see the Comprehension I report). The reciprocal teaching methods to teach vocabulary to 4th
rationale and procedures for teaching vocabulary are grade students. Students learned to derive word
found in Beck, Perfetti, and McKeown (1982). meanings from text, but transfer to more general
The instruction of vocabulary and assessment of reading comprehension as assessed by a Dutch
learning vocabulary with respect to comprehension can standardized test was not successful.
show whether this correlation is, in fact, causal. Summary Evaluation of Vocabulary
Although the first section of the subcommittee report
Instruction and Relation to
shows that vocabulary can be acquired through
Comprehension
instruction, few of those studies examined whether
successful instruction of vocabulary leads to increased More experimental studies on the relationship between
comprehension. Four studies were found on learning vocabulary and reading comprehension are
vocabulary-comprehension instruction. Table 21, which needed. There is a high correlation between vocabulary
follows Table 20, summarizes the rationale, procedures, knowledge and comprehension. Is there a causal
and assessment of research studies on vocabulary and direction between learning vocabulary and improving
its relation to comprehension instruction. reading comprehension? Furthermore, vocabulary
learning is a part of normal content area learning.
Evaluation
Instruction in vocabulary in content areas may lead to
The Panel found two studies by McKeown (1983, better reading and listening comprehension and to
1984) on teaching vocabulary that also assessed improvement in course achievement. This is a promising
students on comprehension. These 4th grade students area of research because it bridges early reading skill
were tested on word meanings, Cloze procedures, and development and later comprehension training.
Appendix B
This Appendix summarizes information on three Table 22, on the following page, provides information
questions: on the 16 categories of instruction to answer these
questions. For each category, there are sections that
• What are the claims in the literature about the
describe the effects claimed by the researchers, the
effectiveness of instruction on comprehension?
grade levels that were studied, and ways in which the
• What grades have been studied? method might be taught in a classroom setting.
• What are some of the implications for instruction in
the classroom?
TABLE 22
RELEVANCE OF INSTRUCTION
ELEVANCE
TABLE 22
RELEVANCE OF INSTRUCTION (CONTINUED)
ELEVANCE
TABLE 22
RELEVANCE OF INSTRUCTION (CONTINUED)
ELEVANCE
TABLE 22
RELEVANCE OF INSTRUCTION (CONTINUED)
ELEVANCE
TABLE 22
RELEVANCE OF INSTRUCTION (CONTINUED)
ELEVANCE
TABLE 22
RELEVANCE OF INSTRUCTION (CONTINUED)
ELEVANCE
No studies on preparation
of teachers or students on
the use of imagery in
reading or content areas
have been done.
TABLE 22
RELEVANCE OF INSTRUCTION (CONTINUED)
ELEVANCE
TABLE 22
RELEVANCE OF INSTRUCTION (CONTINUED)
ELEVANCE
M ultiple Strate gie s There is very strong 3 to 8 Teachers can be trained in the use
empirical, scientific of multiple strategy instruction in
evidence that the instruction natural reading or content areas.
of more than one strategies Current programs of transactional
in a natural context leads to research are promising examples
the acquisition and use of of this.
these reading strategies and
transfers to standard Fidelity to treatment by both
comprehension tests. teachers and students is desired
and should be studied.
Preparation of teachers in
the use of multiple Studies need to be done on when,
strategies in interactive where, and how to implement
instruction has been strategy instruction in natural
successful (see Teacher instructional contexts.
Preparation below).
Teachers could be trained on
multiple reading strategy instruction
in- service or pre- service.
TABLE 22
RELEVANCE OF INSTRUCTION (CONTINUED)
ELEVANCE
Prior Knowle dge The activation of relevant 2 to 6 Teacher teach content areas in a
world knowledge helps variety of ways which provide the
children understand and kind of knowledge that readers
remember what they read. can later activate to understand the
current text. Prior knowledge
The activation of prior studies indicate that prior learning
knowledge occurs naturally or learning that precedes reading
in contexts where subject enhances comprehension of what
content is taught by the is read. In this sense, reading
teacher and readers then about a subject after learning about
read text that relates to it in other ways would be a part of
what has been learned. a program of instruction in a
content area.
It is not clear that this
procedure has to be Research on how learning content
explicitly taught, especially prior to reading about it and its
in content areas. benefits needs to be studied.
TABLE 22
RELEVANCE OF INSTRUCTION (CONTINUED)
ELEVANCE
TABLE 22
RELEVANCE OF INSTRUCTION (CONTINUED)
ELEVANCE
Que s tion Ge ne ration There is strong empirical 3 to 9 Question generation should be part
and scientific evidence that of a program of instruction of
instruction of question reading comprehension strategies
generation during reading in a natural reading or content area
benefits reading context.
comprehension in terms of
memory and answering Teachers can be taught to ask
questions based upon text readers to generate questions and
as well as integrating and to provide feedback in these
identifying main ideas contexts. Students can learn to
through summarization. generate and find answers to their
There is mixed evidence own questions.
that general reading
comprehension is improved Fidelity to treatment by teachers
on standardized and students needs to be assessed.
comprehension tests. The relation of successful learning
Question Generation may needs to be related to content area
be best used as a part of a achievement as well as
multiple strategy instruction standardized tests.
program.
Question Generation
enables the student to be
actively involved in reading
and to be motivated by his
own queries rather than
those of the teacher in
question answering.
TABLE 22
RELEVANCE OF INSTRUCTION (CONTINUED)
ELEVANCE
Que s tion Ans we ring Instruction of Question 3 to 8 Question asking by teachers and
Answering leads to an question answering by students is a
improvement in answering part of natural reading and content
questions after reading area instruction. It should be
passages and in strategies explicitly taught to teachers with
of finding answers. the addition that they give
feedback on answers and
elaborate the feedback in the
context of the text or content area
being taught.
TABLE 22
RELEVANCE OF INSTRUCTION (CONTINUED)
ELEVANCE
TABLE 22
RELEVANCE OF INSTRUCTION (CONTINUED)
ELEVANCE
Instruction of summarization
improves memory for what
is read, both in terms of
free recall and answering
questions.
TABLE 22
RELEVANCE OF INSTRUCTION (CONTINUED)
ELEVANCE
TABLE 22
RELEVANCE OF INSTRUCTION (CONTINUED)
ELEVANCE
COMPREHENSION III
Teacher Preparation and Comprehension Strategies Instruction
literature on cognitive strategy instruction for reading Four studies met these criteria. A detailed outline of
comprehension has yielded valuable information, it has each of the selected studies, organized to permit
not provided a satisfactory model for effective comparison across studies, is presented in Appendix A.
instruction as it occurs in the classroom.
Our Panel subcommittee reviewed the research in
The area within comprehension strategy instruction that reading comprehension instruction broadly and also
currently seems to have the most potential for moving selected certain specific topics for a deeper focus, e.g.,
the field along is teacher preparation. In this report, the vocabulary and teacher preparation for teaching reading
NRP discusses four studies in which teachers are comprehension strategies. It should be noted that there
trained to teach strategies and in which the focus is the are other relevant aspects of comprehension instruction,
effectiveness of that training on students’ reading. Four for example, instruction in listening comprehension and
studies is not a large number; but it is not surprising that in writing, that were not addressed. In addition, the
only a few relevant studies have been done. Interest in Panel subcommittee did not focus on special populations
the topic is rather new, and preparing teachers to such as children whose first language is not English and
deliver effective strategy instruction is a lengthy children with learning disabilities. It did not review the
process. research evidence concerning special populations and
thus cannot say that its conclusions are relevant to
Methodology them.
Database Consistency With the Methodology of the
The NRP searched the ERIC and PsycINFO National Reading Panel
databases to locate relevant studies conducted since The methods of the NRP were followed in the conduct
1980. The search terms used were “comprehension,” of the literature searches and the examination and
“strategy,” and “instruction.” There were 453 articles. coding of the articles obtained. A formal meta-analysis
In addition, the Panel searched using the terms “direct was not possible because of the small number of studies
explanation” and “teacher explanation”; this added 182 identified. However, comprehensive summaries
nonoverlapping items. Recent research reviews were according to NRP guidelines for each of the four
also examined: Lysynchuk, Pressley, d’Ailly, Smith, and studies appears in Appendix B.
Cake (1989), Pressley (1998), Rosenshine and Meister
(1994), and Rosenshine, Meister, and Chapman (1996); Results
these reviews did not identify any relevant studies that
the searches had not revealed. The results of the selected studies suggest that, in fact,
good teacher preparation can result in the delivery of
Analysis instruction that leads to improvements in students’
reading comprehension. However, the variations among
To be included, a study had to be
the four studies to be discussed here raise questions
• Focused on the preparation of teachers for about what the best approach to teaching teachers to do
conducting reading comprehension strategy strategy instruction might be.
instruction.
There have been two major approaches to
• Published in a scientific journal. comprehension strategy instruction: Direct Explanation
• Empirical. (DE) and Transactional Strategy Instruction (TSI). Two
studies that represent each approach are described.
• Experimental using random assignment or quasi-
experimental with initial matching on the basis of Direct Explanation
reading comprehension scores.
The Direct Explanation approach was designed to
• Comprehensive in reporting the complete set of improve on the standard direct instruction approach to
results of the study. (Ancillary articles that focused strategy instruction used in most of the early studies, in
on specific aspects of the same database were not which students are simply taught to use one or several
included but are listed in the References.) strategies as described above. Arguing that direct
instruction was insufficient because it did not attempt to received no further training. The results of this study
provide students with an understanding of the reasoning indicated that students of teachers who received
and mental processes involved in reading strategically, training in the use of the explanation model had
Duffy, Roehler, and colleagues (1986) developed the significantly greater awareness of (1) what strategies
DE approach. In this approach, teachers do not teach were taught, (2) why they are important, and (3) how
individual strategies but focus instead on helping they are used than did students of the comparison
students to (1) view reading as a problem-solving task teachers.
that necessitates the use of strategic thinking and
The Duffy et al. (1986) study thus demonstrated the
(2) learn to think strategically about solving reading
effectiveness of training teachers, and it showed that
comprehension problems. The focus in DE is on
explicit explanations by teachers can lead to greater
developing teachers’ ability to explain the reasoning and
general awareness among students of reading
mental processes involved in successful reading
strategies. However, the question of the extent to which
comprehension in an explicit manner, hence the use of
students were able to apply these strategies and ways
the term “direct explanation.” The implementation of
of thinking to their actual reading practice, that is,
DE requires specific and intensive teacher training on
whether the use of such methods leads to significant
how to teach the traditional reading comprehension
improvements in reading comprehension performance,
skills found in basal readers as strategies, for example,
was not answered positively. The treatment and the
to teach students the skill of how to find the main idea
comparison classrooms did not differ on the posttest
by casting it as a problem-solving task and reasoning
administration of the comprehension subtest of the
about it strategically.
Gates-MacGinitie Test.
Duffy, G. G., Roehler, L. R., Meloth, M. S.,
Duffy and colleagues (1986) did find, however,
Vavrus, L. G., Book, C., Putnam, J., &
that students of the treatment teachers spent
Wesselman, R. (1986). The relationship
significantly more time answering the items on
between explicit verbal explanations during
the comprehension test than did the other
reading skill instruction and student awareness
students. This suggested to them that perhaps
and achievement: A study of reading teacher
these students were being more thoughtful and
effects. Reading Research Quarterly, 21(3),
strategic in their reading.
237-252.
There is little point in adapting new teaching methods if
The first study done by Duffy and Roehler’s research
they are not shown to be effective in improving actual
team investigated whether training teachers to be
performance. Thus, the 1986 study by Duffy and
explicit in their teaching of reading strategies would be
colleagues cannot be considered conclusive about the
effective in increasing the explicitness of their verbal
value of training teachers to provide explicit
explanations and whether this explicitness would be
explanations about how to read strategically. However,
related to students’ meta-cognitive awareness of
the results were promising enough to persuade the
strategies and to their achievement. Twenty-two
same research team to undertake another study, similar
teachers were randomly assigned to either the
to this one in many respects, but incorporating a more
treatment or the control condition. Treatment teachers
elaborate program of teacher preparation.
were trained to use an explanation model that was
designed to help them explain reading strategies Duffy, G. G., Roehler, L. R., Sivan, E.,
explicitly to their 5th grade students in low-level reading Rackliffe, G., Book, C., Meloth, M. S., Vavrus,
groups. After an initial training session, the treatment L. G., Wesselman, R., Putnam, J., & Bassiri,
teachers received 10 hours of additional training spaced D. (1987). Effects of explaining the reasoning
throughout the school year. During these training associated with using reading strategies.
sessions, the explanation model was described, and Reading Research Quarterly, 23(3), 347-368.
teachers designed lessons according to the model. Their
teaching was observed and discussed on four
occasions. Control teachers participated in a workshop
on classroom management at the start of the study and
In a 1987 study (Duffy et al.), as in the Duffy and actually reading connected text, and (2) described the
colleagues 1986 study, there was random assignment of reasoning employed when using the strategies. In
teachers to condition. Treatment teachers were shown contrast, students of control teachers were unable
how to provide explicit explanations, in this case to 3rd to do so.
grade low-level reading students. In addition, the
The 1987 study also used standardized measures to
teachers were trained to analyze the skills prescribed in
assess students’ reading performance. The
their basal reading texts and to recast these skills as
comprehension and word skills subtests of the Stanford
problem-solving strategies. In essence, the emphasis in
Achievement Test (SAT) were used. Overall, students
this study was on the effects of training teachers to
of the treatment teachers outperformed the others on
provide students with explicit descriptive information
the posttest. This difference was significant for the
about the types of reasoning and mental processes that
word skills subtest but was not significant for the
are used strategically by skilled readers, as opposed to
comprehension subtest. A second standardized test, the
simple prescriptions of how to perform the basal text
Michigan Educational Assessment Program (MEAP),
skills. Included in the 12 hours of training were one-on-
was administered as a delayed posttest, to assess
one coaching, collaborative sharing among the teachers,
whether the overall advantage of students of treatment
observation of lessons and feedback, and videotaped
teachers persisted over time. It was found that even 5
model lessons. Comparison teachers were trained in
months after the instruction ended, students of the
classroom management and used management
trained teachers had significantly higher reading scores
principles throughout the study.
than students of the control teachers.
The effectiveness of this approach was measured in
The results of these two investigations of the DE
terms of both student awareness and student
approach to comprehension strategy instruction suggest
achievement. Student awareness of strategic reasoning
that although this approach is clearly useful for
was assessed in interviews conducted both immediately
increasing student awareness of the need to think
following lessons and at the end of the yearlong
strategically while reading, the effects on actual reading
treatment. As in the Duffy and colleagues (1986) study,
comprehension ability are less clearcut. As noted
the results indicated that, compared with students of
above, both of the Duffy and colleagues studies
untrained teachers, the students of trained teachers had
produced only mixed results on the standardized
higher levels of awareness of specific reading
measures of reading performance. It should be noted,
strategies, as well as a greater awareness of the need
however, that the 1987 study reported that many of
to be strategic when reading.
their lessons were oriented toward acquisition of word-
The fact that students have high awareness of the level processes and not to what are usually considered
reasoning associated with strategic reading does not comprehension processes.
necessarily mean that they are proficient in using such
strategies and better in reading comprehension. Duffy Transactional Strategy Instruction
et al. (1987) designed an achievement measure to The TSI approach includes the same key elements as
assess both students’ ability to use the basal skills they the DE approach, but it takes a somewhat different
had been taught and the degree to which their view of the role of the teacher in strategy instruction.
responses reflected the reasoning associated with using Whereas emphasis in DE is on teachers’ ability to
skills as strategies. Results indicated that there was no provide explicit explanations, the TSI approach focuses
difference between students of treatment and control not only on that but also on the ability of teachers to
teachers in the ability to use the skills. However, the facilitate discussions in which students (1) collaborate to
students of treatment teachers were found to have a form joint interpretations of text and (2) explicitly
greater ability to reason strategically when reading. discuss the mental processes and cognitive strategies
Results on a task involving paragraph reading also that are involved in comprehension. In other words,
indicated that students of treatment teachers although TSI teachers do provide their students with
(1) reported that they used such reasoning when
explicit explanations of strategic mental processes used described ways in which teachers and students typically
in reading, the emphasis is on the interactive exchange behave during remedial reading instruction and then
among learners in the classroom, hence use of the term described contrasting behaviors that characterize or
“transactional.” promote active reading. The teachers were also given a
set of principles for fostering active reading through
In both DE and TSI, teachers explain specific strategies
reading instruction with specific teaching techniques for
to students and model the reasoning associated with
each principle. Each treatment teacher was also
their use. Both approaches include the use of
assigned a previously trained teacher for peer support.
systematic practice of new skills, as well as scaffolded
There were seven comparison teachers, who received
support, in which teachers gradually withdraw the
no training.
amount of assistance they offer to students. Perhaps
the most salient distinction to be made between DE and In the intervention, both teacher groups taught reading
TSI is the manner in which the different emphases of comprehension for 3 months, using expository texts.
the two approaches (explanation vs. discussion) result The instruction in treatment classrooms emphasized
in differences in the level of collaboration among both direct explanation and collaborative discussion. To
students that takes place in each approach. In the DE evaluate the effects of the TSI approach, the phonics,
approach, strategy instruction is primarily conducted by structural analysis, and reading comprehension subtests
the teacher. In contrast, the TSI approach is more of the Stanford Diagnostic Reading Test were
collaborative: Although explicit teacher explanation is an administered. There was no difference from pretest to
important part of this approach, TSI is designed for posttest in the performance of students of the trained
learning to occur primarily through the interactive and untrained teachers on the phonics and structural
transactions among the students during classroom analysis subtests. However, significantly more students
discussion. of the trained teachers (80%) made gains on the
reading comprehension subtest than did students of the
Anderson, V. (1992). A teacher development
other teachers (50%), suggesting that preparation given
project in transactional strategy instruction for
the teachers was effective in improving reading
teachers of severely reading-disabled
comprehension performance. The amount of gain was
adolescents. Teaching and Teacher Education,
not reported.
8(4) 391-403.
Brown, R., Pressley, M., Van Meter, P., &
Anderson (1992) worked with experienced teachers of
Schuder, T. (1996). A quasi-experimental
severely reading-disabled adolescent students. The
validation of transactional strategies instruction
students ranged from grades 6 through 11, but three-
with low-achieving second-grade readers.
quarters of them had incoming reading levels of grade 3
Journal of Educational Psychology, 88(1),
or below. The teachers were randomly assigned to a
18-37.
treatment or control condition. The nine treatment
teachers received three 3-hour sessions of training in Over the past decade, Pressley and associates have
the use of the TSI approach, held at intervals during the developed a transactional strategy instruction program
period during which the actual reading intervention with called Students Achieving Independent Learning
the students was going on. Special features of (SAIL). In SAIL, reading processes are taught as
Anderson’s teacher preparation included (1) the strategies through direct explanation, teacher modeling,
involvement of the teachers as coresearchers who coaching, and scaffolded practice. An important feature
were part of the development of the project and (2) the of the program is its emphasis on collaborative
availability of a previously trained peer coach for each discussion among teacher and students, including
teacher throughout the project. extended interpretive discussions of text, with these
discussions emphasizing student application of
In their training, the teachers were given a list of
strategies. A goal of the SAIL program is for students
changes, or “shifts,” that need to be made in most
to develop more personalized and integrative
classrooms for more active reading to be fostered. This
understanding of text.
list of 20 teacher shifts and 12 student shifts first
A yearlong study by Brown, Pressley, Van Meter, and effect sizes were substantial, suggesting that these
Schuder (1996) provides evidence of the effectiveness initial attempts to provide effective instruction for
of the TSI approach as exemplified by the SAIL teachers in reading comprehension strategy training are
program. In this study, SAIL was contrasted with a promising and worth following up.
more traditional approach to reading instruction. There
It is encouraging to see that random assignment is
was no specific teacher preparation within the context
indeed feasible in these real-life classroom situations.
of this study; the five SAIL teachers had all been
This statement is not intended as a criticism of Brown
previously trained and had at least 3 years of
and colleagues’ quasi-experiment, which was done
experience as SAIL teachers. The five comparison
carefully and which, in fact, posed a question that could
teachers had even more years of teaching experience
not be tested in a true experiment: What is the effect of
than the SAIL teachers had, but they had no SAIL
a particular model of instruction (TSI) delivered by
training. The students in this study were in 2nd grade;
teachers experienced and committed to it, working in
all were reading below grade level at the beginning of
the context of schools also committed to that approach?
the study.
This is an important question. But most of the relevant
The SAIL teachers and comparison teachers were research questions do not demand a quasi-experimental
matched on a variety of measures to form five pairs. In design, and therefore a much better choice would be a
each pair of classrooms, data were collected on six true experiment. Sometimes researchers argue that
low-achieving students from each classroom who were school administrators refuse to allow random
matched on the basis of their reading comprehension assignment because it disrupts their schools. Perhaps
scores. Thus, Brown and colleagues (1996) did the researchers should make serious and sincere efforts to
careful matching required when doing a quasi- find schools that will cooperate, because they do exist;
experiment. and researchers should also help the field by making an
effort to educate school administrators about random
Students’ strategy awareness was assessed through
assignment and other important design standards.
interviews. Students of SAIL teachers reported more
awareness of comprehension and word-level strategies These comments should not be taken as implying that it
than did students of comparison teachers is easy to do classroom-based naturalistic studies of the
(operationalized as the number of strategies they type discussed here. It is difficult, and the difficulty
claimed to use during reading). In an evaluation of story should not be minimized. Such research cannot be
recall, the SAIL students did better on literal recall of undertaken without substantial funding and adequate
story content and also were more interpretive in their institutional support. It also requires collaboration among
recalls. On a think-aloud task, SAIL students used more researchers; school personnel, including both teachers
strategies on their own than did the other students. and administrators; and parents, which does not come
Student reading achievement was also assessed, using about quickly—it requires time and effort. And doing
the comprehension and word skills subtests of the this type of research takes commitment and energy.
Stanford Achievement Test. Over the course of the The research team must remain motivated and
study, students of the SAIL teachers showed greater effective during a lengthy developmental phase and
improvement than the students of the other teachers, then during the study itself. Moreover, a high-quality
and at posttest, they significantly outperformed the study of this type has probably been preceded by
others on both subtests. descriptive and correlational work. The emphasis on the
importance of experimental studies should not be
Discussion interpreted as negating the valuable contributions of
Every one of these studies reported significant these other research paradigms in preparing to do
differences, and although none of them reported effect intervention research.
sizes, they provided enough information so that effect Of course, any evaluation of these instructional
sizes could be calculated for most of the effects. The approaches is limited by the fact that these studies
cannot easily be compared. They differed in terms of
specific purpose, teacher preparation method,
intervention, type of student (age, reading level, etc.), What they have shown, and this is an important new
control group, and other characteristics. Nevertheless, direction in which to take our research efforts, is that
taken together, the studies do indicate that instructional intensive instruction of teachers can prepare them to
methods that generate high levels of student teach reading comprehension strategically and that such
involvement and engagement during reading can have teaching can lead students to greater awareness of
positive effects on reading comprehension. The what it means to be a strategic reader and to the goal of
classroom procedures in each of the studies required improved comprehension.
substantial cognitive activity on the part of the students.
Also, these studies demonstrate that providing teachers Implications for Reading Instruction
with instruction that helps them use such methods leads General guidelines for teachers that derive from the
to students’ awareness of strategies and use of research evidence on comprehension instruction with
strategies, which can in turn lead to improved reading normal children include the suggestions that teachers
comprehension. help students by explaining fully what it is they are
These findings beg the question as to what it is, in fact, teaching: what to do, why, how, and when; by modeling
that makes for effective strategy instruction. Is it the their own thinking processes; by encouraging students
teacher preparation? (If so, how extensive does it have to ask questions and discuss possible answers among
to be? Would the teachers maintain their instructional themselves; and by keeping students engaged in their
effectiveness without the supports inherent in an reading via providing tasks that demand active
ongoing study?) Is it the use of direct explanation and/or involvement.
collaborative discussion when teaching students? Is it The current dearth of comprehension instruction
the particular strategies that are taught, or would a research at the primary grade level should not lead to
broader repertoire of instructional activities also be the conclusion that such instruction should be neglected
effective? Is it a combination of some or all of these during the important period when children are mastering
possibilities or of other factors not mentioned here? phonics and word recognition and developing reading
Clearly, more research is warranted on this topic. In fluency.
light of the findings to date, one can expect that further
work in this area will yield valuable knowledge In evaluating the effectiveness of strategy instruction in
concerning optimal conditions for improvement in the classroom, the primary focus must be not on the
reading comprehension. students’ performance of the strategies themselves.
The appropriate assessment is of the students’ reading
Thus, the results of the research to date represent achievement and, in addition, other outcome measures
significant progress in our understanding of the nature such as how interested students are in reading and how
of reading comprehension and of how to teach it. There satisfied teachers are with their instructional methods.
is much more to learn, of course. What we must
remember is that reading comprehension is extremely Implementation of effective comprehension instruction
complex and that teaching reading comprehension is is not a simple matter; substantial teacher preparation is
also extremely complex. The work of the researchers usually required for teachers to become successful at
discussed here makes this clear. They have not teaching comprehension.
recommended an “instructional package” that can be
There is a need for greater emphasis in teacher
prescribed for all students. They have not identified a
education on the teaching of reading comprehension.
specific set of instructional procedures that teachers
Such instruction should begin at the preservice level,
can follow routinely. Indeed, they have found that
and it should be extensive, especially with respect to
reading comprehension instruction cannot be routinized.
teaching teachers how to teach comprehension
strategies.
8. The research base is scanty with respect to the 3. Can teachers maintain their proficiency after their
development of effective methods of vocabulary own preparation to teach comprehension has been
instruction, especially methods that incorporate completed?
direct instruction, how these might vary across age
4. Does the fact that teachers are involved in an
and reading levels and abilities, and how vocabulary
ongoing research study make a difference in their
training can be integrated optimally with other types
performance?
of comprehension instruction.
9. Research is needed on how writing is related to Other Important Concerns
reading comprehension.
1. Teacher characteristics
10. It will be important to develop further the use of How does a teacher’s age, amount of teaching
videotapes, technology in general, and other experience, type of preservice education, or other
techniques for teacher preparation. characteristics affect success in comprehension
instruction? Which components of successful teacher
11. There is little evidence from cost-benefit analyses
preparation programs are the effective ones? What
to determine the amount of gain in student
characteristics of the teacher preparation itself (its
achievement (and other outcome measures) relative
focus, its intensity, its timing) affect the success of a
to the cost of implementing a reading
teacher preparation program?
comprehension instructional program.
12. With respect to comprehension strategy instruction 2. Reader characteristics
and teacher preparation: How do a student’s age, reading level, learning ability,
proficiency in English, or other characteristics affect
Comprehension Strategy Instruction: success in comprehension instruction?
Maintenance and Transfer
3. Text characteristics
1. Do teaching comprehension strategies have lasting
Does the difficulty level of the texts used in instruction
effects on students?
make a difference?
2. Do the effects generalize to other reading
Can one expect transfer from one text genre to another
situations, such as content area instruction?
(e.g., from narrative to expository text)?
3. Can comprehension instruction be done
successfully within the context of content area 4. Task characteristics
instruction? What characteristics of the instruction delivered to the
students are the effective ones? The direct explanation?
Teacher Preparation The collaborative discussion? The particular strategies
1. How much teacher preparation is required for and tasks taught to the students? The amount of
successful performance? instruction? The active involvement on the part of the
students? Other factors?
2. How should teacher preparation be conducted at
the preservice and at the inservice levels?
Casazza, M. E. (1993). Using a model of direct Duffy, G. (1991). What counts in teacher
instruction to teach summary writing in a college education? Dilemmas in educating empowered
reading class. Journal of Reading, 37(3), 202-208. teachers. In J. Zutell & S. McCormick (Eds.). Learning
factors/teacher factors: Literacy research and
Collins, C. (1991). Reading instruction that instruction. Fortieth Yearbook of the National Reading
increases thinking abilities. Journal of Reading, 34(7), Conference (pp. 1-18). Chicago: National Reading
510-516. Conference.
Collins Block, C. (1993). Strategy instruction in a Duffy, G. (1993). Rethinking strategy instruction:
literature-based reading program. Elementary School Four teachers’ development and their low achievers’
Journal, 94(2), 139-150. understandings. Elementary School Journal, 93(3), 231-
247.
Duffy, G. (1997). Powerful models or powerful Pressley, M. (1998). Reading instruction that works:
teachers? An argument for teacher-as-entrepreneur. In The case for balanced teaching. New York: The
S. Stahl & D. Hayes (Eds.), Instructional models in Guilford Press.
reading (pp. 351-363). Mahwah, NJ: Lawrence
Erlbaum Associates. Pressley, M., El-Dinary, P. B., Gaskins, I., Schuder,
T., Bergman, J., Almasi, J., & Brown, R. (1992).
Ellis, E. S. (1993). Integrative strategy instruction: Beyond direct explanation: Transactional instruction of
A potential model for teaching content area subjects to reading comprehension strategies. Elementary School
adolescents with learning disabilities. Journal of Journal, 92(5), 513-555.
Learning Disabilities, 26 (6), 358-383, 398.
Pressley, M., Johnson, C., Symons, S., McGoldrick,
Fairhurst, M. A. (1981). Satisfactory explanations J., & Kurita, J. (1989). Strategies that improve
in the primary school. Journal of Philosophy of children’s memory and comprehension of text.
Education, 15(2), 205-214. Elementary School Journal, 90(1), 3-32.
Frager, A., & Hahn, A. (1988). Turning Pressley, M., & Wharton-Mcdonald, R. (1997).
contemporary reading research into instructional Skilled comprehension and its development through
practice. Reading Horizons, 28(4) 263-271. instruction. School Psychology Review, 26(3), 448-466.
Hahn, A. (1987). Developing a rationale for Pressley, M., Wharton-McDonald, R., Rankin, J.,
diagnosing and remediating strategic-text behaviors. Mistretta, J., Yokoi, L., & Ettenberger, S. (1996) The
Reading Psychology, 8(2), 83-92. nature of outstanding primary grades literacy
instruction. In E. McIntyre & M. Pressley (Eds.),
Heller, M. F. (1986). How do you know what you Balanced instruction: Strategies and skills in whole
know? Metacognitive modeling in the content areas. language (pp. 251-276). Norwood, MA: Christopher-
Journal of Reading, 29(5) 415-422. Gordon.
Herrmann, B. A. (1988). Two approaches for Pressley, M., Yokoi, L., Rankin, J., Wharton-
helping poor readers become more strategic. Reading McDonald, R., & Menke, D. (1997). A survey of the
Teacher, 42(1), 24-28. instructional practices of grade-5 teachers nominated as
effective in promoting literacy. Scientific Studies of
Herrmann, B. A. (1989). Characteristics of explicit Reading, 1(2), 145-160.
and less explicit explanations of mathematical problem
solving strategies. Reading Research and Instruction, Roehler, L., & Duffy, G. (1991). Teachers’
28(3), 1-17. instructional actions. In R. Barr, M. Kamil, P.
Mosenthal, & P. D. Pearson (Eds.), Handbook of
Lysynchuk, L., Pressley, M., d’Ailly, H., Smith, M., reading research, Vol. 2 (pp. 861-883). White Plains,
& Cake, H. (1989). A methodological analysis of NY: Longman.
experimental studies of comprehension strategy
instruction. Reading Research Quarterly, 24(4), 458- Roehler, L., Duffy, G., & Warren, S. (1988).
470. Adaptive explanatory actions associated with effective
teaching of reading strategies. Dialogues in literacy
Palincsar, A.S., David, Y. M., Winn, J. A., & research: Thirty-seventh yearbook of the National
Stevens, D. D. (1991). Examining the context of Reading Conference (pp. 339-345). Chicago, IL:
strategy instruction. RASE: Remedial and Special National Reading Conference.
Education, 12(3), 43-53.
Rosenshine, B., & Meister, C. (1994). Reciprocal Rosenshine, B., Meister, C., & Chapman, S. (1996).
teaching: a review of the research. Review of Teaching students to generate questions: A review of
Educational Research, 64(4), 479-530. the intervention studies. Review of Educational
Research, 66(2), 181-221.
Rosenshine, B., & Meister, C. (1997). Cognitive
strategy instruction in reading. In S. Stahl & D. Hayes Schuder, T. (1993). The genesis of transactional
(Eds.), Instructional models in reading. (pp. 85-107). strategies instruction in a reading program for at-risk
Mahwah, NJ: Lawrence Erlbaum Associates. students. Elementary School Journal, 94(2), 183-200.
Teacher Duffy et al. (1986) Duffy et al. (1987) Anderson (1992) Brown et al. (1996)
Participants
Ye ars o f No t re p o rte d . No t re p o rte d . No t re p o rte d . The SAIL te ache rs had an ave rag e o f 10.4 ye ars
e xp e rie nce o f g e ne ral te aching e xp e rie nce , and all o f the m
had taug ht in the SAIL p ro g ram fo r b e twe e n 3
and 6 ye ars.
Student Duffy et al. (1986) Duffy et al. (1987) Anderson (1992) Brown et al. (1996)
Participants
State s No t re p o rte d , Mid we st USA One Mid we ste rn state No t re p o rte d One mid -Atlantic state
National Reading Panel
re p re se nte d
Numb e r o f No t re p o rte d . Tre atme nt g ro up : 9 No t re p o rte d . Numb e r no t re p o rte d ; all scho o ls we re in the
Appendices
d iffe re nt Co ntro l g ro up : 8 same d istrict.
scho o ls All scho o ls we re in the same
d istrict.
Teacher Duffy et al. (1986) Duffy et al. (1987) Anderson (1992) Brown et al. (1996)-
Participants
Ye ars o f No t re p o rte d . No t re p o rte d . No t re p o rte d . The SAIL te ache rs had an ave rag e o f 10.4 ye ars
e xp e rie nce o f g e ne ral te aching e xp e rie nce , and all o f the m
had taug ht in the SAIL p ro g ram fo r b e twe e n 3
and 6 ye ars.
g ive n b ase line sco re s o n the ir The autho rs state that "p re p aring te ache rs to
classro o m manag e me nt skills b e co me co mp e te nt transactio nal strate g ie s
(hig h, me d ium, lo w). instructo rs is a lo ng -te rm p ro ce ss; the re fo re , we
fe lt we co uld no t rand o mly assig n te ache rs,
Re se arche rs the n rand o mly p ro vid e p ro fe ssio nal d e ve lo p me nt, and wait fo r
assig ne d te ache rs within e ach te ache rs to b e co me e xp e rie nce d in te aching
manag e me nt le ve l to e ithe r the SAIL in a re alistic time frame ."
tre atme nt o r co ntro l g ro up .
Student Duffy et al. (1986) Duffy et al. (1987) Anderson (1992) Brown et al. (1996)
Participants
State s No t re p o rte d , Mid we st USA One Mid we ste rn state No t re p o rte d One mid -Atlantic state
re p re se nte d
Numb e r o f No t re p o rte d . Tre atme nt g ro up : 9 No t re p o rte d . Numb e r no t re p o rte d ; all scho o ls we re in the
d iffe re nt Co ntro l g ro up : 8 same d istrict.
scho o ls All scho o ls we re in the same
d istrict.
Grade 5th g rad e . 3rd g rad e . Stud e nts rang e d fro m 6th thro ug h 11th 2nd g rad e .
g rad e .
Reading level Lo w-le ve l re ad ing g ro up s. Lo w-le ve l re ad ing g ro up s. Se ve re ly re ad ing d isab le d . Re ad ing b e lo w 2nd g rad e le ve l.
Setting Larg e urb an scho o l d istrict. Ele me ntary scho o l classro o ms in No t re p o rte d . Uncle ar.
an urb an scho o l d istrict in the
Mid we st.
Exceptional All stud e nts sco re d mo re than 1 "The ind ivid uals in the lo w g ro up s "All b ut a ve ry fe w had b e e n d iag no se d as No ne re p o rte d .
4-135
learning ye ar b e lo w g rad e le ve l in re p re se nte d the typ ical rang e o f le arning d isab le d ," and mo re than 75% o f
characteristics re ad ing achie ve me nt. re ad ing d ifficultie s asso ciate d with the m had inco ming re ad ing le ve ls o f g rad e
lo w-le ve l re ad ing g ro up s in urb an 3 o r b e lo w.
ce nte rs. Mainstre ame d sp e cial
e d ucatio n stud e nts, immig rant
child re n with se ve re lang uag e
p ro b le ms, and stud e nts with
b e havio ral d iso rd e rs we re all
includ e d ."
Selection No ne re p o rte d . No ne re p o rte d . No t re p o rte d . Only six stud e nts in o ne SAIL class me t
restrictions e lig ib ility re q uire me nts, so the re se arche rs
d e cid e d to use six matche d p airs in e ach
National Reading Panel
Appendices
All English Ye s. Ye s, altho ug h the autho rs no te that Ye s. Ye s.
speaking? the samp le includ e d "immig rant
child re n with se ve re lang uag e
p ro b le ms."
Duratio n o f stud y Duffy et al. (1986) Duffy et al. (1987) Anderson (1992) Brown et al. (1996)
Brief description Duffy et al. (1986) Duffy et al. (1987) Anderson (1992) Brown et al. (1996)
of instructional
approach
Dire ct e xp lanatio n (DE) with DE with a fo cus o n e xp laining the TSI with a fo cus o n p ro g re ssive shifts o f TSI with a fo cus o n e valuating the e ffe ctive ne ss
a fo cus o n the use o f an re aso ning asso ciate d with skill te ache r atte ntio n to ward fo ste ring active o f an e xisting TSI p ro g ram.
e xp lanatio n mo d e l fo r and strate g y usag e . re ad ing .
te aching strate g ie s. The TSI ap p ro ach co ntains all the e le me nts o f
4-136
Ap p ro ach co ntains all the The TSI ap p ro ach co ntains all the e le me nts DE and also includ e s e xte nd e d d iscussio ns that
The DE ap p ro ach includ e s e le me nts o f DE b ut also re q uire s o f DE and also includ e s e xte nd e d e mp hasize jo int co nstructio n o f te xt
d ire ct e xp lanatio n o f te ache rs to analyze the skills d iscussio ns that e mp hasize jo int inte rp re tatio ns and stud e nt strate g y usag e .
strate g y usag e , mo d e ling , p re scrib e d in b asal te xts and to co nstructio n o f te xt inte rp re tatio ns and
syste matic p ractice , and re cast the se skills as stud e nt strate g y usag e .
scaffo ld ing . p ro b le mso lving strate g ie s.
Specific elements Duffy et al. (1986) Duffy et al. (1987) Anderson (1992) Brown et al. (1996)
of instructional
approach
Teacher analysis Ye s Ye s No No
of skills in basal
textbook; -
recasting these
skills as
strategies?-
Dire ct e xp lanatio n Ye s Ye s Ye s Ye s
o f strate g y usag e
(What is the
strate g y? Whe n
can it b e use d ?
Ho w is it d o ne ?)
Mo d e ling ? Ye s Ye s Ye s Ye s
Syste matic Ye s Ye s Ye s Ye s
p ractice ?
Scaffo ld ing ? Ye s Ye s Ye s Ye s
Exte nd e d No No Ye s Ye s
d iscussio ns that
e mp hasize jo int
co nstructio n o f te xt
inte rp re tatio ns and
stud e nt strate g y
usag e ?
Stud e nt cho ice o f No No Ye s. (Te ache rs and stud e nts co llab o rate d No
re ad ing mate rials? o n cho ice o f te xts.)
Complete Duffy et al. (1986) Duffy et al. (1987) Anderson (1992) Brown et al. (1996)
description of
instructional
approach and
curricular
emphasis
4-137
The curriculum in this stud y This re se arch is b ase d o n the The te ache r d e ve lo p me nt mo d e l stud ie d in "The p urp o se o f SAIL is the d e ve lo p me nt o f
co mp rise d the skills asse rtio n that "b e cause p o o r this re se arch is b ase d o n the p rincip le s o f ind e p e nd e nt, se lf-re g ulate d me aning -making
p re scrib e d in Ho ug hto n- re ad e rs lack und e rstand ing o f the TSI. fro m te xt."
Mifflin and Ginn b asal strate g ic nature o f re ad ing ,
te xtb o o ks fo r use with lo w- instructio n ne e d s to p lace g re ate r Acco rd ing to the autho r, TSI is a me tho d o f The SAIL p ro g ram use s a TSI ap p ro ach to
le ve l re ad ing g ro up s in the e mp hasis o n the d e ve lo p me nt o f te aching re ad ing that e mp hasize s te aching re ad ing co mp re he nsio n to lo w-
p o st-p rimary g rad e s, such p o o r re ad e rs' ab ility to re aso n "transactio ns o r ne g o tiatio ns that o ccur p e rfo rming stud e nts.
as id e ntifying main id e as, strate g ically." amo ng te ache r and stud e nts, and stud e nts
d rawing co nclusio ns, using and stud e nts while wo rking to g e the r to Acco rd ing to the autho rs, "the sho rt-te rm g o al o f
g lo ssarie s, and d e co d ing . Acco rd ing to the autho rs, "it may d e te rmine te xt me aning ." TSI is the jo int co nstructio n o f re aso nab le
Fo r the p urp o se s o f this b e ne ce ssary whe n wo rking with inte rp re tatio ns b y g ro up me mb e rs as the y ap p ly
stud y, skills are no t vie we d p o o r re ad e rs fo r te ache rs to The vie w o f te ache r e d ucatio n p re se nte d in strate g ie s to te xts. The lo ng -te rm g o al is the
as rule s to b e me mo rize d e xp lain e xp licitly, in co nsiste nt this stud y invo lve s a p ro g re ssive shift o f the inte rnalizatio n and co nsiste ntly ad ap tive use o f
as p ro ce d ural alg o rithms. ways o ve r e xte nd e d instructio nal te ache r' s atte ntio n. strate g ic p ro ce ssing whe ne ve r stud e nts
National Reading Panel
Inste ad , the y are taug ht as p e rio d s, the me ntal p ro ce ssing e nco unte r d e mand ing te xt. Bo th g o als are
strate g ie s o r fle xib le p lans asso ciate d with [a g ive n] strate g y, The first stag e shifts atte ntio n fro m o ve rt p ro mo te d b y te aching re ad ing g ro up me mb e rs
Appendices
fo r re aso ning ab o ut ho w to whe n it can b e use d , and ho w to p e rfo rmance o f tasks to the und e rlying to co nstruct te xt me aning b y e mulating e xp e rt
re mo ve b lo ckag e s to ap p ly it in a fle xib le manne r." co mp re he nsio n p ro ce sse s. The ne xt stag e re ad e rs' use o f co mp re he nsio n strate g ie s."
me aning . Rathe r than b e ing shifts fro m te ache r q ue stio ning , mo d e ling ,
ap p lie d auto matically, skills In p articular, the autho rs are and e xp laining to stud e nts carrying o ut
are ap p lie d tho ug htfully, inte re ste d in the re latio nship the se p ro ce sse s.
co nscio usly, and ad ap tive ly. b e twe e n the e xp licitne ss o f
te ache r strate g y e xp lanatio ns o n The final stag e shifts fro m stud e nts' carrying
the o ne hand and stud e nt strate g y o ut active p ro ce sse s und e r te ache r
aware ne ss and re ad ing ab ility o n g uid ance to the ir assump tio n o f that
the o the r. re sp o nsib ility the mse lve s.
Reports of the Subgroups
Materials Duffy et al. (1986) Duffy et al. (1987) Anderson (1992) Brown et al. (1996)
Basal re ad ing te xtb o o ks; Se co nd g rad e b asal re ad ing A to tal o f 135 sing le -p ag e , e xp o sito ry te xts It is no t e ntire ly cle ar what te xts we re use d
d ifficulty no t re p o rte d . te xtb o o ks. was p re p are d , and it was le ft to the d uring the co urse o f the scho o l ye ar. The thre e
te ache rs and stud e nts to d e cid e which o f te xts use d in the stud y fo r asse ssme nts we re
the te xts the y wishe d to re ad .ÑTe xts we re illustrate d sto rie s fro m trad e b o o ks, with
d rawn and e d ite d (p rimarily sho rte ne d ) fro m numb e rs o f wo rd s and re ad ab ility le ve ls as
a varie ty o f "re al te xt" so urce s (e .g ., Cricke t fo llo ws:
Mag azine , Op e n Co urt Pub lishing ). 341 wo rd s; 2.4
512 wo rd s; 2.2
Re ad ab ility le ve ls rang e d fro m g rad e s 2 to 129 wo rd s; 3.9
8, with the majo rity o f te xts at g rad e s 4 and
5.
What were treatment Duffy et al. (1986) Duffy et al. (1987) Anderson (1992) Brown et al. (1996)
teachers taught?
Tre atme nt te ache rs we re Tre atme nt te ache rs we re taug ht to mo d ify the A se t o f 20 te ache r shifts and 12 stud e nt shifts was The tre atme nt (SAIL) te ache rs we re
taug ht to e mp hasize the curricular and instructio nal skill p re scrip tio ns p re se nte d to the tre atme nt te ache rs. The shifts no t traine d sp e cifically fo r this
me ntal p ro ce ssing o ne d o e s o f the b asal te xt so that the e mp hasis was o n re p re se nt chang e s that ne e d to b e mad e in o rd e r stud y; ho we ve r, the y all had
whe n using the skills the me ntal p ro ce ssing invo lve d in using skills to fo ste r mo re active re ad ing . e xte nsive e xp e rie nce (i.e ., 3 o r
p re scrib e d in the b asal as strate g ie s. mo re ye ars) te aching in the SAIL
te xtb o o k. The te ache rs we re This list o f shifts first d e scrib e s ways in which p ro g ram.
traine d to talk to stud e nts Sp e cifically, tre atme nt te ache rs we re taug ht to te ache rs and stud e nts typ ically b e have in re me d ial
ab o ut the re aso ning o ne ad ap t the ir b asal te xt instructio n in the re ad ing se ssio ns, and the n p ro vid e s a co ntrasting
d o e s whe n e nco unte ring a fo llo wing ways: list o f b e havio rs that characte rize o r p ro mo te active
b lo ckag e to me aning , ho w re ad ing .
the skill b e ing taug ht can b e Be cause b asal te xtb o o ks o fte n p re se nt
ap p lie d to re mo ve a p re scrib e d skills as iso late d me mo ry-b ase d The se t o f stud e nt shifts that was p re se nte d to
p articular b lo ckag e , and the tasks, tre atme nt te ache rs we re taug ht to te ache rs includ e d the fo llo wing as d e sire d g o als:
me ntal ste p s o ne fo llo ws re cast the p re scrib e d skills as p ro b le m- Particip ating in re ad ing to le arn ne w info rmatio n;-
whe n using the skill. so lving strate g ie s b y analyzing the co g nitive trying to re ad d ifficult o r unfamiliar mate rial;
and me taco g nitive co mp o ne nts o f the skill. fo cusing o n co llab o rating with the g ro up in re ad ing
Tre atme nt te ache rs we re to ld se ssio ns;
to p re se nt skills no t within Be cause the te aching sug g e stio ns in the re ve aling and inve stig ating e rro rs in re ad ing ;
the co nte xt o f wo rkb o o k b asal te xt te ache r' s g uid e e mp hasize d ire cting e ffo rt to ward e xp laining ho w to arrive at
e xe rcise b ut within the p ro ce d ural skill e xe rcise s and d rill, tre atme nt co rre ct answe rs;
co nte xt o f the use o f tho se te ache rs we re taug ht to sup p le me nt the se atte mp ting to take o n the ro le o f the te ache r;
4-139
skills in actual re ad ing sug g e stio ns with mo d e ling o f the co g nitive asking q ue stio ns;
situatio ns. and me taco g nitive acts invo lve d in p e rfo rming re acting to te xt;
the skills. p ro vid ing mo d e ls fo r o the rs;-g iving e lab o rate d
To assist in the ir p lanning , re sp o nse s;
te ache rs we re taug ht to Te ache rs we re taug ht "to e xp lain e xp licitly, in fo cusing o n le arning fro m the re ad ing ; and
o rg anize the ir instructio nal co nsiste nt ways o ve r e xte nd e d instructio nal se e king challe ng e s in thinking .
talk into a five -ste p le sso n p e rio d s, the me ntal p ro ce ssing asso ciate d
fo rmat: intro d uctio n, with [a g ive n] strate g y, whe n it can b e use d , Te ache rs we re also g ive n a se t o f p rincip le s fo r
mo d e ling , g uid e d and ho w to ap p ly it in a fle xib le manne r." fo ste ring active re ad ing thro ug h re ad ing instructio n,
inte ractio n, p ractice , and with sp e cific te ache r te chniq ue s fo r e ach p rincip le .
ap p licatio n. To he lp te ache rs Te ache rs we re taug ht "to p re se nt the ir Particular atte ntio n was g ive n to :
use the le sso n p lan, the y e xp lanatio ns to stud e nts as d e scrip tive o f p ro ce d ure s fo r making thinking e xp licit b y thinking
we re taug ht ho w to mo d e l what g o o d re ad e rs d o , rathe r than as alo ud , and fo r turning o ve r re sp o nsib ility fo r this to
National Reading Panel
the me ntal p ro ce ssing p re scrip tio ns to b e p ro ce d urally ap p lie d in all stud e nts, co llab o rative p ro b le m-so lving , as we ll as
re ad e rs d o b y "talking o ut situatio ns." acce ssing , ap p lying and e valuating stud e nts'
Appendices
lo ud " ab o ut the ir o wn use o f e xisting and alte rnative p ro b le m-so lving strate g ie s
the skill, d ire ct atte ntio n to Tre atme nt te ache rs we re no t p ro vid e d with "up g rad ing " q ue stio ning b y b o th te ache rs and
the salie nt fe ature s o f the scrip ts fo r te aching skills in this way. Inste ad , stud e nts to b e le ss co nte nt-sp e cific, and mo re
skill, re fo cus stud e nt the y use d the info rmatio n fro m the re se arch fo cuse d o n the use o f strate g ie s, turning
atte ntio n d uring inte ractio ns, inte rve ntio n se ssio ns to d e ve lo p the ir o wn q ue stio ning and the e ntire re ad ing se ssio n o ve r to
re vie w, p ro vid e p ractice , e xp lanatio ns fo r e ach le sso n. stud e nts, and incre asing stud e nt talk and
and he lp stud e nts ap p ly the d e cre asing te ache r talk d uring re ad ing d iscussio ns.
skill in co nne cte d te xt.
Reports of the Subgroups
How did treatment Duffy et al. (1986) Duffy et al. (1987) Anderson (1992) Brown et al. (1996)
teacher training
All te ache rs atte nd e d an Tre atme nt te ache rs we re to ld that the p urp o se The training o f the tre atme nt te ache rs invo lve d thre e The SAIL te ache rs we re no t
initial o rie ntatio n me e ting in o f the p ro je ct was to stud y te ache r se ssio ns o f 3 ho urs e ach, he ld at o ne mo nth traine d sp e cifically fo r this
No ve mb e r. Sub se q ue nt to e xp lanatio n. inte rvals while the te ache rs we re co nd ucting stud y; ho we ve r, the y all had
the initial me e ting , the re ad ing se ssio ns with the ir stud e nts. e xte nsive e xp e rie nce (i.e .,
tre atme nt te ache rs re ce ive d The y re ce ive d six 2-ho ur training se ssio ns in 3 o r mo re ye ars) te aching in
10 ho urs o f training o n ho w the co urse o f o ne acad e mic ye ar. In the se training se ssio ns, tre atme nt te ache rs we re the SAIL p ro g ram.
to inco rp o rate e xp licit instructe d in p rincip le s and te chniq ue s fo r fo ste ring
e xp lanatio ns into the ir The se se ssio ns e mp hasize d :-ho w to make active re ad ing . The training mo d ule includ e d the
o ng o ing re ad ing skill d e cisio ns ab o ut re casting p re scrib e d b asal fo llo wing e le me nts and te chniq ue s:
instructio n. This training te xt skills as strate g ie s;
e mp hasize d : ho w to d e cid e o n e xp licit state me nts ab o ut Re se arch invo lve me nt:-The tre atme nt te ache rs
the strate g y b e ing taug ht, whe n it wo uld b e p articip ate d in d iscussio ns ab o ut the stud y
ho w to re cast p re scrib e d use d , and ho w to d o the me ntal p ro ce ssing p ro ce d ure s. "Eve ry e ffo rt was mad e to make
b asal te xt skills as strate g ie s invo lve d ; te ache rs fe e l the y we re a p art o f the d e ve lo p me nt
use ful whe n re mo ving ho w to o rg anize the se state me nts into a and e vo lutio n o f the p ro je ct."
b lo ckag e s to me aning s, ho w le sso n fo rmat that p ro g re sse d fro m an
to make e xp licit state me nts intro d uctio n, to mo d e ling , to inte ractio n Te aching shifts:
ab o ut the re ad ing skill b e ing b e twe e n te ache r and stud e nts, to clo sure . As d e scrib e d ab o ve , a se t o f 20 te ache r shifts and
4-140
What was the Duffy et al. (1986) Duffy et al. (1987) Anderson (1992) Brown et al. (1996)
intervention for the
control group?
The co ntro l g ro up re ce ive d Tre ate d -co ntro l te ache rs we re to ld that the The co ntro l te ache rs we re to ld that the y wo uld The co ntro l te ache rs
a p re se ntatio n o n e ffe ctive p urp o se o f the stud y was to valid ate at the re ce ive the same training as the tre atme nt te ache rs re ce ive d no sp e cial training ;
classro o m manag e me nt. 3rd g rad e le ve l the re sults o f a p re vio us afte r the re se arch d ata we re co lle cte d . ho we ve r, the y we re all
(unre late d ) stud y invo lving classro o m "hig hly re g ard e d fo r the ir
National Reading Panel
Appendices
o n fo ur o ccasio ns fo llo wing The y re ce ive d thre e 2-ho ur training se ssio ns
the b ase line o b se rvatio n. o n using the manag e me nt p rincip le s In ad d itio n, the co ntro l g ro up
e mp lo ye d in the 1st g rad e stud y. had , o n ave rag e , a g re ate r
numb e r o f ye ars o f te aching
In the classro o m, the y fo llo we d the ir usual e xp e rie nce than the
instructio nal ro utine s re g ard ing b asal te xtb o o k tre atme nt te ache rs.
skill instructio n, while ad d ing the manag e me nt
p rincip le s o f the 1st g rad e stud y.
Reports of the Subgroups
The te ache rs we re unaware Ne ithe r the tre atme nt no r the co ntro l g ro up No t re p o rte d . No t re p o rte d .
that the two g ro up s re ce ive d was mad e aware o f the o the rs' e xiste nce .
d iffe re nt info rmatio n.
Bo th g ro up s o f te ache rs re ce ive d id e ntical
info rmatio n ab o ut ho w to imp le me nt an
Uninte rrup te d Sustaine d Sile nt Re ad ing
(USSR) p ro g ram and ho w to p re p are stud e nts
to take a stand ard ize d re ad ing te st.
Outcome measures Duffy et al. (1986) Duffy et al. (1987) Anderson (1992) Brown et al. (1996)
Student reading Gate s-MacGinitie Re ad ing Stanfo rd Achie ve me nt Te st (SAT): Stanfo rd Diag no stic Re ad ing Te st: Stanfo rd Achie ve me nt Te st
4-142
achievement: Te st: The co mp re he nsio n and wo rd skills sub te sts The p ho nics, structural analysis, and re ad ing (SAT):-The co mp re he nsio n
The co mp re he nsio n sub te st, we re use d . co mp re he nsio n-sub te sts we re use d and wo rd skills sub te sts
Le ve l D (d e sig ne d fo r use (PRE and POST). (PRE and POST). we re use d .
with g rad e s 4-6) was use d . Michig an Ed ucatio nal Asse ssme nt Pro g ram (PRE and POST).
(PRE and POST) (MEAP):
The MEAP was ad ministe re d five mo nths afte r Sto ry re call:
the tre atme nt e nd e d . Stud e nts we re aske d cue d
(DELAYED POST). and p icture
cue d re te lling q ue stio ns
ab o ut 2 sto rie s. This
me asure was d e sig ne d to
asse ss b o th re call skills and
the d e g re e to which
stud e nts we re inte rp re tive in
the ir re te lling o f the sto ry.
(POST).
Student strategy Le sso n inte rvie ws: Le sso n inte rvie ws: No t me asure d . Strate g y aware ne ss inte rvie w:
awareness: Imme d iate ly fo llo wing Imme d iate ly fo llo wing a re ad ing le sso n, stud e nts In Octo b e r and No ve mb e r, (i.e ., whe n SAIL
e ach o f the fo ur we re inte rvie we d to d e te rmine whe the r the y we re co mp o ne nts we re b e ing intro d uce d to SAIL
o b se rve d le sso ns co nscio usly aware o f what strate g y the te ache r taug ht stud e nts) and in March and Ap ril, a strate g ie s
sub se q ue nt to the d uring the le sso n (d e clarative kno wle d g e ), whe n to inte rvie w was ad ministe re d to all stud e nts
b ase line o b se rvatio n, use it (situatio nal kno wle d g e ), and ho w to use it p articip ating in the stud y. This inte rvie w tap p e d
stud e nts we re (p ro ce d ural kno wle d g e ). stud e nts' re p o rte d aware ne ss o f strate g ie s, as
inte rvie we d to (DURING). me asure d b y the numb e r and typ e s o f strate g ie s
d e te rmine whe the r the y claime d to use d uring re ad ing . It was also
the y we re Co nce p t inte rvie ws: d e sig ne d to me asure stud e nts' aware ne ss o f
co nscio usly aware o f At the e nd o f the ye ar, stud e nts we re inte rvie we d to whe re , whe n, and why to use strate g ie s.
what strate g y the me asure the ir aware ne ss o f the g e ne ral ne e d to b e (DURING).
te ache r taug ht d uring strate g ic whe n re ad ing .
the le sso n (POST). Stud e nts we re aske d the fo llo wing six o p e n-
(d e clarative e nd e d q ue stio ns, ad ap te d fro m the o ne s use d
kno wle d g e ), whe n to b y Duffy e t al. (1987):
use it (situatio nal
kno wle d g e ), and ho w What d o g o o d re ad e rs d o ?
to use it (p ro ce d ural
kno wle d g e ). What make s so me o ne a g o o d re ad e r?
(DURING).
What thing s d o yo u d o b e fo re yo u start to re ad a
4-143
sto ry?
What d o yo u d o whe n yo u co me to a wo rd yo u
d o no t kno w?
Student strategy No t me asure d . Sup p le me ntal Achie ve me nt Me asure (SAM): No t me asure d . Think-alo ud me asure :
usage: This me asure was d e sig ne d b y the e xp e rime nte rs to
d e te rmine whe the r stud e nts co uld p e rfo rm the Stud e nts we re sto p p e d at fo ur p o ints while
National Reading Panel
sp e cific skill tasks the y had b e e n taug ht (Part I), and re ad ing a d ifficult sto ry ind ivid ually with a
whe the r the ir ratio nale fo r cho o sing an answe r re se arche r, and aske d to d e scrib e the ir thinking
Appendices
re fle cte d the re aso ning asso ciate d with using skills as and the ir strate g y usag e .
strate g ie s (Part II).
(POST). (POST).
whe n it wo uld b e use d , info rmatio n. Te ache rs we re rate d o n the ir (a) re ad ing ,
the fe ature s to atte nd to , intro d uctio n to the le sso n, (b ) mo d e ling , (c) se tting re ad ing g o als b e fo re re ad ing ,
the se q ue nce to fo llo w, and the d iminishing assistance d uring inte ractio n, (d ) p ro b le m so lving d uring re ad ing ,
e xamp le s use d . e liciting o f stud e nt re sp o nse s, and (e ) summarizing to che ck
The se co nd asp e ct fo cuse d o n clo sure .ÑPart III fo cuse d o n the co he sio n b o th co mp re he nsio n,
the p e d ag o g ical me ans b y which within the le sso n and acro ss le sso ns.Ñ re fle cting o n re ad ing g o als afte r te xt,
the info rmatio n was co nve ye d , and
and includ e d 6 sub -cate g o rie s, stre ssing ne w le arning fro m te xt.
fo cusing o n the te ache rs use o f:
mo d e ling , Te ache r e ffe ctive ne ss was also
hig hlig hting , asse sse d b y rating stud e nts o n the
fe e d b ack, fo llo wing 8 d ime nsio ns:
re vie w, Tre ating re ad ing p ro b le ms o p e nly,
p ractice , and fo cusing o n ho w to so lve p ro b le ms,
ap p licatio n. e xp re ssing thinking ,
asking q ue stio ns,
g iving e lab o rate d answe rs,
taking te ache r ro le ,
fo cusing o n g ro up co llab o ratio n,
invo lve me nt in se ssio ns.
Results Duffy et al. (1986) Duffy et al. (1987) Anderson (1992) Brown et al. (1996)
Stud e nt Gate s-MacGinitie Re ad ing Te st: Stanfo rd Achie ve me nt Te st (SAT):-Stud e nts o f Stanfo rd Diag no stic Re ad ing Te st: Stanfo rd Achie ve me nt Te st (SAT):
re ad ing The re was no sig nificant tre atme nt te ache rs sco re d sig nificantly hig he r A sig nificantly hig he r numb e r o f Stud e nts o f tre atme nt te ache rs sco re d
achie ve me nt: d iffe re nce b e twe e n stud e nts in than stud e nts o f co ntro l te ache rs o n wo rd skills stud e nts o f tre atme nt te ache rs (ab o ut sig nificantly hig he r than stud e nts o f
the tre atme nt and co ntro l (ES = +1.63), b ut no t o n co mp re he nsio n (ES = 80%) mad e g ains o n the re ad ing co ntro l te ache rs o n the
classro o ms o n the +0.25). co mp re he nsio n sub te st than stud e nts co mp re he nsio n sub te st (ES = +1.70)
co mp re he nsio n sub te st at o f co ntro l te ache rs (ab o ut 50%). and the wo rd skills sub te st (ES =
p o stte st (ES = 0.24). Michig an Ed ucatio nal Asse ssme nt Pro g ram +1.67); the y also sho we d sig nificantly
(MEAP): The re was no sig nificant d iffe re nce in g re ate r imp ro ve me nt o n the se
Stud e nts in tre atme nt and co ntro l Stud e nts o f tre atme nt te ache rs sco re d the numb e r o f stud e nts o f tre atme nt me asure s o ve r the co urse o f the
classro o ms sp e nt e q ual amo unts sig nificantly hig he r than stud e nts o f co ntro l te ache rs and the numb e r o f stud e nts stud y.ÑSto ry re te lling q ue stio ns:
answe ring co mp re he nsio n te st te ache rs (ES = +1.33). o f co ntro l te ache rs who mad e g ains Stud e nts o f the tre atme nt (SAIL)
ite ms o n the p re te st, b ut o n the o n the p ho nics and the structural te ache rs re calle d mo re lite ral
p o stte st, tre atme nt stud e nts sp e nt analysis sub te sts. info rmatio n (Sto ry 1: ES = +0.69; Sto ry
sig nificantly mo re time answe ring 2: ES = +1.37) and we re sig nificantly
q ue stio ns (ES = +0.42). mo re inte rp re tive in the ir re te lling o f
the sto rie s (Sto ry 1: ES = +1.01; Sto ry
2: ES = +1.07) than we re stud e nts o f
co ntro l te ache rs.
4-145
Stud e nt Strate g y aware ne ss inte rvie w: Le sso n inte rvie ws: No t me asure d . Strate g ie s inte rvie w:
strate g y Stud e nts o f tre atme nt te ache rs Le sso n inte rvie w re sp o nse s o f stud e nts o f To ward the e nd o f the tre atme nt, the
aware ne ss: sco re d sig nificantly hig he r than tre atme nt te ache rs we re rate d sig nificantly stud e nts o f the tre atme nt (SAIL)
stud e nts o f co ntro l te ache rs o n hig he r than the re sp o nse s o f stud e nts o f te ache rs re p o rte d mo re aware ne ss o f
strate g y aware ne ss rating s (ES = co ntro l te ache rs. The se find ing s we re d ue to co mp re he nsio n (ES = +4.03) and
+1.39). sig nificantly hig he r rating s g ive n to stud e nts o f wo rd -le ve l strate g ie s (ES = +1.38)
tre atme nt te ache rs o n situatio nal kno wle d g e d uring the inte rvie w than d id the
(ES = +2.22) and p ro ce d ural kno wle d g e (ES = stud e nts o f co ntro l g ro up te ache rs.
+1.50). No d iffe re nce in re sp o nse rating s was
fo und b e twe e n g ro up s fo r d e clarative
kno wle d g e (ES = +0.84).
National Reading Panel
Appendices
te ache rs we re rate d sig nificantly hig he r than
the re sp o nse s o f stud e nts o f the co ntro l
te ache rs (ES = +1.15), thus sug g e sting that the
tre atme nt stud e nts we re mo re aware o f the
strate g ic nature o f re ad ing .
Reports of the Subgroups
Te ache r Te ache r e xp licitne ss Te ache r e xp licitne ss me asure : Vid e o tap e d te aching se ssio ns: Classro o m o b se rvatio ns:
e ffe ctive ne ss me asure : The tre atme nt te ache rs we re fo und to b e mo re The tre atme nt te ache rs sho we d The tre atme nt (SAIL) te ache rs we re fo und to
Acro ss all e xp licit in e xp laining the re aso ning asso ciate d with sig nificant imp ro ve me nts acro ss have taug ht sig nificantly mo re
o b se rvatio ns afte r using re ad ing skills as strate g ie s than the tre ate d - all 14 d ime nsio ns: co mp re he nsio n strate g ie s (ES = +5.48) and
the b ase line co ntro l te ache rs (ES = +1.67). mo re wo rd -le ve l strate g ie s (ES = +1.38) than
o b se rvatio n, Tre ating re ad ing p ro b le ms o p e nly co ntro l te ache rs.
tre atme nt te ache rs (ES = +3.80),
we re rate d as fo cusing o n ho w to so lve
sig nificantly mo re p ro b le ms (ES = +2.80),-p ro vid ing
e xp licit in the ir mo d e ls o f thinking (ES = +3.25),
e xp lanatio ns than te aching q ue stio n-asking (ES =
co ntro l te ache rs (ES +2.00), asking tho ug ht-p ro vo king
= +2.11). q ue stio ns (ES = +3.14),-allo wing
stud e nt co ntro l (ES = +2.08),
fo cusing o n g ro up co llab o ratio n
(ES = +2.56),-info rming stud e nts
o f le arning (ES = +2.35),
fo cusing o n te xt and le arning
ab o ut
re ad ing (ES = +2.52),
se tting re ad ing g o als b e fo re
re ad ing (ES = +3.99),
p ro b le m so lving d uring re ad ing (ES
= +5.73),
summarizing to che ck
co mp re he nsio n (ES = +1.90),
re fle cting o n re ad ing g o als afte r te xt
(ES = +2.21), and
stre ssing ne w le arning fro m te xt (ES
= +2.45).
+1.48),
taking te ache r ro le (ES = +2.74),
fo cusing o n g ro up co llab o ratio n (ES
= +2.46), and
invo lve me nt in se ssio ns (ES =
+2.14).
Appendices
co nd itio n. The re le vant d ata are no t
p re se nte d .
Appendix B:
Comprehensive Summaries Based on NRP Guidelines
The curriculum in this study comprised the skills 5. Design of the Study
prescribed in the Houghton-Mifflin and Ginn basal Random assignment of participants (teachers) to
textbooks for use with low reading groups in the treatments (randomized experiment), after a pretest of
postprimary grades, such as identifying main ideas, classroom management skills and stratification on this
drawing conclusions, using glossaries, and decoding. For dimension.
the purposes of this study, skills are not viewed as rules
to be memorized as procedural algorithms. Instead, they 6. Independent Variables
are taught as strategies, or flexible plans for reasoning a. Treatment variables
Treatment
about how to remove blockages to meaning. Rather
Describe all treatments and control conditions.
than being applied automatically, skills are applied
thoughtfully, consciously, and adaptively. All teachers attended an initial orientation meeting in
November. Subsequent to the initial meeting, the
The recasting of traditional reading skills as strategies is
treatment teachers received 10 hours of training on how
based on cognitive science research and on the
to incorporate explicit explanations into their ongoing
application of such research to reading comprehension.
reading skill instruction. This training emphasized:
The particular curricular goal for this study was for
• How to recast prescribed basal text skills as
readers, when they encounter meaning blockages, to (a)
strategies useful when removing blockages to
know what skills can be used as strategies for removing
meanings
the blockage, (b) select a specific strategy, and (c) use
that strategy to remove the blockage. • How to make explicit statements about the reading
skill being taught, when it would be used and how to
Treatment teachers, therefore, were trained to recast apply it
basal skills as strategies and to teach students in low
reading groups to use them when encountering meaning • How to organize these statements for presentation
blockages. to students.
Specifically, treatment teachers were taught to
How was the sample obtained? emphasize the mental processing one does when using
The teachers volunteered in response to a survey of all the skills prescribed in the basal textbook. The teachers
5th grade teachers of low reading groups in the district. were trained to talk to students about
The students were assigned to reading groups by
teachers as part of the participating school district’s • The reasoning one does when encountering a
policy of using the Joplin Plan to group 5th grade blockage to meaning
students homogeneously for reading. Student • How the skill being taught can be applied to remove
assignments to reading groups were made on the basis a particular blockage
of Stanford Achievement Test scores from the previous • The mental steps one follows when using the skill.
year and the recommendations of previous teachers. All
the low-group students in this study scored more than 1 That is, teachers were told to present skills not within
year below grade level in reading achievement. the context of workbook exercises but within the
context of the use of those skills in actual reading
Attrition: Not reported. situations.
4. Setting of the Study To assist in their planning, teachers were taught to
Elementary school classroom with low-group reading organize their instructional talk into a five-step lesson
students. format: introduction, modeling, guided interaction,
practice, and application. To help teachers use the
lesson plan, they were taught how to
• Model the mental processing readers do by “talking
out loud” about their own use of the skill
• Direct attention to the salient features of the skill Teacher/student ratio: Not reported
and how to refocus student attention during
Type of trainer (teacher): Classroom teacher
interactions
• Review Length of training given to trainers (teachers): See
above.
• Provide practice
Source of training: The researchers.
• help students apply the skill in connected text.
The five training sessions were conducted immediately Assignment of trainers (teachers) to group:
after school, beginning in late November and continuing Teachers were observed and given baseline
at about 1-month intervals through March. All the scores on their classroom management skills
training sessions except one were timed to occur (high, medium, low). This resulted in teachers
approximately 1 week before each scheduled round of being assigned to the following management
classroom observations. levels:
Each training session followed a four-stage sequence. “High” = 8
First, the teachers were provided with information about “Average” = 4
strategy instruction, and links were made to teachers’ “Low” = 2
background experiences in reading instruction, to basal Researchers then randomly assigned teachers within
textbook experiences, and to expected student each management level to either the treatment or
responses. Second, the researchers modeled strategy control group.
instruction and assisted teachers as they developed their
own instructional plans. Third, teachers read the Management ratings were made again at four
transcripts of their own previous lessons and student observation points during the year to validate the initial
interviews, and the researchers guided them in management ratings.
analyzing and critiquing the transcripts. Finally, the Teachers were also observed at the beginning of the
researchers provided teachers with oral feedback study to obtain a baseline measure of their skill
following each observation about the appropriateness of instruction to establish that all 22 teachers were
their explanations. This feedback was consistent with relatively equal in the explicitness of their explanations.
the information provided to teachers during training
interventions. Baseline data were unavailable for two teachers (1
treatment and 1 control).
The control group received a presentation on effective
classroom management. In addition, these teachers Cost factors: Not reported.
were observed teaching classes on four occasions
following the baseline observation. b. Moderator variables
List and describe other nontreatment independent
Was instruction explicit or implicit? Explicit. variables included in the analyses of effects: None
Difficulty and nature of texts used: Basal texts, reported.
difficulty not reported.
7. Dependent (Outcome) Variables
Was trainers’ fidelity in delivering treatment List processes that were taught during training and
checked? Yes, via classroom observation. measured during and at the end of training: See #6
above.
Properties of trainers (teachers)
Number of teachers who administered Student strategy awareness:
treatments:
Experimental = 11 Student awareness data for both treatment and control
Control = 11 classrooms were obtained in interviews with five
Total = 22 randomly selected low-group students from each
classroom immediately following each of the four
9. Result (for each measure) Number of classrooms providing the effect size
a. Name of Measure: Student strategy awareness information: Ns = 11 and 11
interview
Duffy et al. (1987)
Students of treatment teachers scored significantly
1. Reference
higher than students of control teachers on strategy
awareness ratings. Duffy, G. G., Roehler, L. R., Sivan, E., Rackliffe,
G., Book, C., Meloth, M.S., Vavrus, L. G., Wesselman,
Value of effect size: +1.39 R., Putname, J. & Bassiri, D. (1987). Effects of
explaining the reasoning associated with using reading
Type of summary statistics from which effect size
strategies. Reading Research Quarterly, 23(3), 347-368.
was derived: ANOVA
Number of classrooms providing the effect size 2. Research Question
information: Ns = 11 and 11 The purpose of this study was to investigate the effects
b. Name of Measure: Teacher explicitness of explaining the reasoning associated with using
reading strategies. Three specific research questions
Across all observations after the baseline were posed.
observation, treatment teachers were rated as
significantly more explicit in their explanations than • Can teachers learn to be more explicit in explaining
control teachers. the reasoning associated with using basal text skills
as strategies?
Value of effect size: +2.11.
• Can explicit teacher explanations increase low-
Type of summary statistics from which effect size group students’ awareness of both lesson content
was derived: ANOVA and the need to be strategic while reading?
Number of classrooms providing the effect size • Can explicit teacher explanations increase low-
information: Ns = 11 and 11 group students’ conscious use of skills as strategies
and lead, ultimately, to greater reading
c. Name of Measure: Student Achievement achievement?
There was no significant difference between students in
3. Sample of student participants
the treatment and control classrooms on the
States or countries represented: The Midwest (no
comprehension subtest at either pretest or posttest.
state given), USA
Value of effect size: 0.24.
Number of different schools: Treatment Group = 8;
Type of summary statistics from which effect size Control Group = 9
was derived: ANOVA
Number of different classrooms = 20
Number of classrooms providing the effect size
Number of student participants:
information: Ns = 11 and 11
Total: 148
Students in treatment and control classrooms spent
equal amounts of time answering comprehension test Treatment group: 71
items on the pretest, but on the posttest, treatment
students spent significantly more time answering Control group: 77
questions. Number per group: Ranged from 3 to 16 students per
Value of effect size: +0.42 class.
Type of summary statistics from which effect size Overall average: 7.4 per classroom.
was derived: t-test. Age: Not listed
Direct explanation (DE) with a focus on explaining the 5. Design of the Study
reasoning associated with skill and strategy usage. Random assignment of participants (teachers) to
treatments (randomized experiment). Each teacher’s
Duffy et al.’s approach contains all the elements of DE
pre-existing reading groups remained intact. Pretest
but also requires teachers to analyze the skills
measures revealed no significant differences between
prescribed in basal texts, and to recast these skills as
the participating groups of students.
problem-solving strategies.
This research is based upon the assertion that “because 6. Independent Variables
poor readers lack understanding of the strategic nature a. Treatment variables
Treatment
of reading, instruction needs to place greater emphasis Describe all treatments and control conditions.
on the development of poor readers’ ability to reason
strategically.” Treatment teachers were taught to modify the
curricular and instructional skill prescriptions of the
According to the authors, “it may be necessary when basal text so that the emphasis was on the mental
working with poor readers for teachers to explain processing involved in using skills as strategies.
explicitly, in consistent ways over extended instructional Specifically, treatment teachers were taught to adapt
periods, the mental processing associated with [a given] their basal text instruction in the following ways:
strategy, when it can be used, and how to apply it in a
flexible manner.”
• Because basal textbooks often present prescribed Treated-control teachers were told that the purpose of
skills as isolated memory-based tasks, treatment the study was to validate at the 3rd grade level the
teachers were taught to recast the prescribed skills results of a previous (unrelated) study involving
as problem-solving strategies by analyzing the classroom management for 1st graders. They received
cognitive and metacognitive components of the skill. three 2-hour training sessions on using the management
principles employed in the 1st grade study. In the
• Because the teaching suggestions in the basal text
classroom, they followed their usual instructional
teacher’s guide emphasize procedural skill
routines regarding basal textbook skill instruction, while
exercises and drill, treatment teachers were taught
adding the management principles of the 1st grade
to supplement these suggestions with modeling of
study.
the cognitive and metacognitive acts involved in
performing the skills. Neither the treatment nor the control group was made
• Teachers were taught “to explain explicitly, in aware of the other’s existence.
consistent ways over extended instructional periods, Both groups of teachers received identical information
the mental processing associated with [a given] about how to implement an uninterrupted sustained
strategy, when it can be used, and how to apply it in silent reading (USSR) program and how to prepare
a flexible manner.” students to take a standardized reading test.
• Teachers were taught “to present their explanations Was instruction explicit or implicit? Explicit.
to students as descriptive of what good readers do,
rather than as prescriptions to be procedurally Difficulty and nature of texts used: Basal reading
applied in all situations.” textbooks for the 2nd grade.
Treatment teachers were not provided with scripts for Was trainers’(teachers’) fidelity in delivering
teaching skills in this way. Instead, they used the treatment checked? Yes, by observations and
information from the research intervention sessions to checklists.
develop their own explanations for each lesson.
Properties of teachers/trainers:
Treatment teachers were told that the purpose of the
project was to study teacher explanation. They received Number of teachers who administered treatments:
six 2-hour training sessions in the course of one Treatment group = 10
academic year. These sessions emphasized how to
Control group = 10
• Make decisions about recasting prescribed basal
text skills as strategies. Total = 20
• Decide on explicit statements about the strategy Teacher/student ratio: Depended on class; ranged
being taught, when it would be used, and how to do from 1:3 to 1:16.
the mental processing involved.
Type of trainer: Classroom teacher.
• Organize these statements into a lesson format that
progressed from an introduction, to modeling, to Any special qualification of trainers (teachers)? No.
interaction between teacher and students, to Length of training given to trainers (teachers): 12
closure. hours (six 2-hour sessions over the course of the school
The training interventions also included one-on-one year).
coaching, collaborative sharing between the teachers, Source of training: The researchers.
specific feedback regarding observed lessons, and
videotapes of model lessons. Assignment of trainers to groups: Teachers were
already assigned to students at beginning of study.
Type of summary statistics from which effect size Type of summary statistics from which effect size
was derived: was derived:
MANOVA MANOVA
g. Number of classrooms providing the effect size Number of classrooms providing the effect size
information: information:
Ns = 10 and 10 Ns = 10 and 10
Name of measure: SAM: Part II “According to these GORP results, low-group students
who received explicit explanations about the reasoning
(reasoning associated with use of skills as
associated with using skills as strategies (1) reported
strategies)
that they used such reasoning when actually reading
Students of treatment teachers were significantly connected text and (2) described the reasoning
superior to students of control teachers in their employed when using the strategies.”
performance on Part II.
Teacher effectiveness:
Value of effect size: +1.67
j. Name of measure: Teacher explicitness measure
Type of summary statistics from which effect size
The treatment teachers were found to be more explicit
was derived:
in explaining the reasoning associated with using
MANOVA reading skills as strategies than the treated-control
teachers.
Number of classrooms providing the effect size
information: Value of effect size: +1.67
Ns = 10 and 10 Type of summary statistics from which effect size
was derived:
h. Name of measure:
ANOVA
GORP: Word meaning ratings
Number of classrooms providing the effect size
Students of treatment teachers scored significantly information:
higher on the word meaning subtest.
Ns = 10 and 10
Value of effect size: +1.51
Type of summary statistics from which effect size Anderson (1992)
was derived: 1. Reference
Anderson, V. (1992). A teacher development
MANOVA
project in transactional strategy instruction for teachers
Number of classrooms providing the effect size of severely reading-disabled adolescents. Teaching and
information: Teacher Education, 8(4) 391-403.
Ns = 10 and 10
2. Research Question
i. Name of measure: The purpose of this study was to test the effectiveness
GORP: Word recognition ratings of a teacher development model designed to provide
teachers with collaborative transactional strategies for
Students of treatment teachers scored significantly helping severely reading-delayed adolescents take a
higher on the word recognition subtest. more active approach to understanding informational
Value of effect size: +5.00 texts.
The research question addressed by this study is: Does Selection restrictions used to limit the sample of
the use of the TSI approach to reading instruction result participants: Not reported.
in positive changes in severely reading-delayed
Contextual information (concurrent reading
adolescent students’ reading performance?
instruction that participants received in their
3. Sample of Student Participants classrooms during the study): Not reported.
States or countries represented: Not reported. Description of curriculum/instructional approach:
Number of different schools: Not reported. TSI with a focus on progressive shifts of teacher
Number of participants (total, per group): attention toward fostering active reading. The TSI
approach contains all the elements of DE and also
Total: 83 includes extended discussions that emphasize joint
construction of text interpretations and student strategy
Per group: Ranged from 2 to 10 and was
usage.
“approximately equal” across groups.
According to the author, TSI is a method of teaching
Age: Not reported.
reading that emphasizes “transactions or negotiations
Grade: Ranged from 6 to 11. that occur among teacher and students, and students
and students while working together to determine text
Reading levels of participants:
meaning.”
Severely reading disabled: “All but a very few had been
The view of teacher education presented in this study
diagnosed as learning disabled.” More than 75% of the
involves a progressive shift of the teacher’s attention.
adolescent students in the study had incoming reading
levels of grade 3 or below. • The first stage is to shift the attention from overt
performance of tasks to the underlying
Setting: Not reported.
comprehension processes.
Pretests administered before to treatment: • The next stage shifts from teacher questioning,
At the beginning of the study, teachers in both an modeling, and explaining to students carrying out
experimental and a control group were videotaped these processes.
giving a reading lesson for approximately 30 minutes, • The final stage shifts from students’ carrying out
using one of two expository passages developed for the active processes under teacher guidance to their
study that were matched for difficulty but had different assuming that responsibility themselves.
content.
How was sample obtained?
In addition, students were given three subtests of the
Teachers were invited to volunteer via a letter from the
Stanford Diagnostic Reading Test (phonics, structural
participating board of education.
analysis, and reading comprehension).
Attrition:
The purpose of these steps was to establish pretest
baseline measures of teaching style and student ability. Experimental: 1 teacher
Special characteristics, if relevant: Control: 3 teachers
SES: Not reported. (originally, there were 10 teachers in each
group)
Ethnicity: Not reported.
Exceptional learning characteristics? 4. Setting of the Study
Small-group reading session, in which teachers work
Learning disabled: yes directly with students on the reading and understanding
Reading disabled: yes of informational text.
Treatment teachers received peer support and coaching Length of training given to teachers:
from previously trained teachers who attended the
Experimental teachers participated in three afternoon
training sessions and were available as needed for
sessions of 3 hours each, held at 1-month intervals
teachers.
while the teachers were conducting reading sessions
The control teachers were told that they would receive with their students.
the same training as the treatment teachers after the
Source of training: The researchers
research data were collected.
Assignment of trainers to groups: Random
Was instruction explicit or implicit? Explicit.
Cost factors: Not reported.
Difficulty and nature of texts used:
b. Moderator variables:
A total of 135 single-page, expository texts was
prepared, and it was left to the teachers and students to List and describe other nontreatment independent
decide which of the texts they wished to read during the variables included in the analyses of effects: None
approximately 20 reading sessions in which they would reported.
engage.
7. Dependent (Outcome) Variables
Texts were drawn and edited (primarily shortened)
from a variety of “real text” sources, e.g., Cricket Student reading achievement:
Magazine. Stanford Diagnostic Reading Test:
Readability levels ranged from grades 2 to 8, with the The phonics, structural analysis, and reading
majority of texts at grades 4 and 5. comprehension
(Because the intervention included a particular subtests were used.
emphasis on identifying reading problems and sharing
problem-solving strategies, all texts were somewhat (PRE and POST).
challenging so that problems would arise during Teacher effectiveness:
reading.)
Videotaped classroom observation:
Was trainers’ (teachers’) fidelity in delivering
treatment checked? Teachers were videotaped giving a reading lesson for
approximately 30 minutes. (PRE and POST). A rating
Yes: experimental teachers were videotaped 3 times scale was developed using the teacher and student
during the study (pretest, intervention, and posttest). shifts as a base. Teachers were rated on the following
Properties of trainers (teachers): 14 dimensions:
10. Setting reading goals before reading The treatment teachers showed large significant
improvements across all dimensions.
11. Problem-solving during reading
Value of effect size:
12. Summarizing to check comprehension
1. Treat reading problems openly: +3.8
13. Reflecting on reading goals after text
2. Focus on how to solve problems: +2.80
14. Stressing new learning from text.
3. Provide models of thinking: +3.25
Teacher effectiveness was also assessed by rating
students on the following eight dimensions: 4. Teach question-asking: +2.00
1. Treating reading problems openly 5. Ask thought-provoking questions: +3.14
2. Focusing on how to solve problems 6. Allow student control: +2.08
3. Expressing thinking 7. Focus on group collaboration: +2.56
4. Asking questions 8. Inform students: +2.35
5. Giving elaborated answers 9. Focus on text and learning about reading: +2.52
6. Taking teacher role 10. Set reading goals before reading: +3.99
7. Focusing on group collaboration 11. Problemsolve during reading: +5.73
8. Involvement in sessions. 12. Summarize to check comprehension: +1.90
8. S: Involvement in session: +2.14 Number of different schools: Not reported; all schools
in the same district.
Type of summary statistics from which effect size
was derived: t-tests Number of different classrooms: 10.
Number of classrooms providing the effect size Number of participants (total, per group):
information: Ns = 9 and 7 SAIL group = 30
Control group = 30
Name of measure: Videotaped teaching sessions:
Total = 60
Teaching incidents involving problem-solving and
collaboration Number per group = 6
Treatment teachers showed a far greater percentage of The SAIL and non-SAIL reading groups were matched
teaching incidents that involved problem-solving and on the basis of school demographic information and the
collaboration at posttest than at pretest. No statistical students’ fall standardized test performances (see
test is presented. below).
Name of measure: Videotaped teaching sessions: Age: Not reported.
Student and teacher talk
Grade: Second.
There was a significant increase (t-test) in student talk
Reading levels of participants: Reading below second
and a decrease in teacher talk in the treatment
grade level.
condition. The relevant data are not presented.
Setting: Not reported.
Brown, Pr
Brown, essley,, et al. (1996)
essley
Pressley
Pretests administered before treatment:
1. Reference
Brown, R., Pressley, M., Van Meter, P., & Schuder, J. Comprehension subtest of the SAT. (Primary 1, Form J;
(1996). A quasi-experimental validation of transactional Grade level 1.5 to 2.5); administered in late November
strategies instruction with low-achieving second-grade or early December.
readers. Journal of Educational Psychology, 88(1), 18- Special characteristics:
37.
SES: Not reported.
2. Research Question
Ethnicity: Not reported.
The purpose of this research was to evaluate the
effectiveness of the Students Achieving Independent Exceptional learning characteristics: None, other
Learning (SAIL) program. Three hypotheses were than reading below grade level.
examined:
Selection restrictions used to limit the sample of
Participating in SAIL would enhance reading participants:
comprehension as measured by a standardized test.
Only six students in one SAIL class met eligibility
After a year of SAIL instruction, there would be clear requirements so the researchers decided to use six
indications of students learning and using strategies. matched pairs in each classroom as the basis of
comparison.
Students would develop deeper, more personalized, and
interpretive understandings of text after a year of Contextual information (concurrent reading
SAIL. instruction that participants received in their
classrooms during the study): Not reported.
3. Sample of Student Participants
Description of curriculum/instructional approach:
States or countries represented: Mid-Atlantic state
(unnamed), United States
The SAIL program uses a TSI approach to teaching • Think aloud as they practice applying
reading comprehension to low-performing students. The comprehension strategies during reading instruction.
TSI approach contains all the elements of DE and also Overreliance on any one strategy is discouraged. In
includes extended discussions that emphasize joint general, students are taught that getting the overall
construction of text interpretations and student strategy meaning of text is more important than understanding
usage. every word.
“The purpose of SAIL is the development of When SAIL instruction occurs in reading groups, it
independent, self-regulated meaning-making from text.” differs in a number of ways from more conventional
Students are taught to adjust their reading to their reading group instruction:
specific purpose and to text characteristics. Prereading discussion of vocabulary is eliminated in
According to the authors, “the short-term goal of TSI is favor of discussion of vocabulary in the context of
the joint construction of reasonable interpretations by reading.
group members as they apply strategies to texts. The The almost universal classroom practice of asking
long-term goal is the internalization and consistently comprehension-check questions as students read in
adaptive use of strategic processing whenever students group (e.g., Mehan, 1979) is rarely observed in
encounter demanding text. Both goals are promoted by transactional strategies instruction groups (Gaskins et
teaching reading group members to construct text al., 1993). Instead, a teacher gauges literal
meaning by emulating expert readers’ use of comprehension as students think aloud after reading a
comprehension strategies.” text segment.
SAIL teachers are taught to achieve the goals of TSI There are extended interpretive discussions of text, with
through: these discussions emphasizing student application of
• Direct explanations strategies to text.
• Modeling Although reading group is an important SAIL
• Coaching component, the teaching of strategies extends across
the school day, during whole-class instruction, and as
• Scaffolded practice. teachers interact individually with their students.
In addition, SAIL teachers are taught to facilitate Reading instruction is also an across-the-curriculum
extended discussions of text, which emphasize student activity.
application of strategies to text comprehension. How was sample obtained?
In the SAIL reading program, students are taught The five SAIL teachers exhausted the pool of 2nd
strategies for adjusting their reading to their specific grade teachers in the district with extensive experience
purpose and to text characteristics. Specifically, (i.e., 3 or more years) teaching in the SAIL program.
students are instructed to: The comparison teachers were recommended by
• Predict upcoming events principals and district reading specialists.
5. Design of the Study The article states that there were “informal
observations of the comparison teachers over the year,
Quasi-experimental, in that teachers and students were
[which] confirmed that they were more eclectic in their
not randomly assigned to conditions.
approach to reading instruction than the SAIL
The authors state that “preparing teachers to become teachers . . .” However, it does not indicate whether
competent transactional strategies instructors is a long the SAIL teachers were also observed.
process; therefore, the Panel felt that it could not
Properties of trainers (teachers):
randomly assign teachers, provide professional
development, and wait for teachers to become Number of trainers (teachers):
experienced in teaching SAIL in a realistic time frame.”
SAIL group = 5
However, as noted above, each of the SAIL groups
was matched with a comparison group that was “close Control group = 5
in reading achievement level at the beginning of the Total = 10
study” (based on standardized test performance) and
from a school that was “demographically similar to the Teacher/student ratio:
school the school representing the SAIL group.” It is unclear how many students were in each teacher’s
class; however, the reading groups within each class
6. Independent Variables
that were compared had six students each, for a ratio of
a. Treatment variables 1:6.
Describe all treatments and control conditions:
Type of teacher: Classroom teacher.
The treatment (SAIL) teachers were not trained
Any special qualification of trainers (teachers)?
specifically for this study; however, they all had
extensive experience (i.e., 3 or more years) teaching in All the SAIL teachers had between 3 and 6 years of
the SAIL program. experience teaching in the SAIL program; therefore,
one may assume that they delivered the treatment
The control teachers received no special training;
effectively.
however, they were all “highly regarded for their
teaching abilities by district personnel.” In addition, the The SAIL teachers had an average of 10.4 years of
control group had, on average, a greater number of teaching experience compared to an average of 23.4
years of teaching experience than the treatment years for the comparison teachers.
teachers.
The authors acknowledge that given this difference,
Was instruction explicit or implicit? Explicit. “there is no way to separate out the effects that years
of experience may have had on the way teachers
Difficulty and nature of texts used:
taught their students.”
It is not entirely clear what texts were used during the
However, they state that readers should “bear in mind
course of the school year. The three texts used in the
that the comparison teachers were highly regarded for
study for group comparisons were illustrated stories
their teaching abilities by district personnel; therefore, if
from trade books, with numbers of words and
anything, their greater number of years of experience
readability levels as follows:
could be construed as an advantage.”
• 341 words; 2.4
Length of training given to trainers: Not reported.
• 512 words; 2.2
Source of training: Not reported.
• 129 words; 3.9 (used for a different measure than
the previous two). Assignment of trainers (teachers) to groups:
“The five SAIL teachers exhausted the pool of 2nd What do you think about before you start to read a
grade teachers in the district with extensive experience story?
teaching in the SAIL program.”
What do you do when you come to a word you do
Cost factors: Not reported. not know?
Moderator variables: What do you do when you read something that does
not make sense?
List and describe other nontreatment independent
variables included in the analyses of effects: None Student strategy usage:
reported.
Think-aloud measure:
7. Dependent (Outcome) Variables: Students were stopped at four points while reading a
Student reading achievement: difficult story individually with a researcher and asked
to describe their thinking and their strategy usage.
SAT:
(POST).
The comprehension and word skills subtests were used.
Teacher effectiveness:
(PRE and POST).
Classroom observation:
Story recall:
SAIL and non-SAIL teachers were observed teaching
Students were asked cued and picture-cued retelling
two story lessons and were compared in terms of the
questions about two stories. This measure was designed
number of strategies they taught in each lesson.
to assess both recall skills and the degree to which
students were interpretive in their retelling of the story. (DURING).
(POST).
8. Nonequivalence of groups
Student strategy awareness: Any reason to believe that treatment and control
Strategy awareness interview: groups might not have been equivalent before
treatments?
In October and November (i.e., when SAIL
components were being introduced to SAIL students) Although it is possible, because the groups were not
and in March and April, a strategies interview was randomly assigned, it is unlikely because of the careful
administered to all students participating in the matching done in the fall on both mean performance
study. This interview tapped students’ reported and variability on standardized reading comprehension
awareness of strategies, as measured by the number tests.
and types of strategies they claimed to use during
Were steps taken in statistical analyses to adjust for
reading. It was also designed to measure students’
any lack of equivalence?
awareness of where, when, and why to use
strategies. Not reported.
(DURING). 9. Result (for each measure):
Students were asked the following five open-ended Student reading achievement:
questions, adapted from the ones used by Duffy et
a. Name of measure: SAT: Comprehension
SAT
al. (1987):
Students of treatment teachers scored significantly
What do good readers do? What makes someone a
higher than students of control teachers on the
good reader?
comprehension subtest of the SAT.
What things do you do before you start to read a
story?
Value of effect size: + 1.70 Type of summary statistics from which effect size
was derived: t-test
Type of summary statistics from which effect size
was derived: t-test Number of classrooms providing the effect size
information:
Number of classrooms providing the effect size
information: Ns = 5 and 5
Ns = 5 and 5 Student strategy awareness:
b. Name of measure: SAT: Word skills
SAT e. Name of measure: Strategy awareness interview:
Students of treatment teachers scored significantly Comprehension strategies
higher than students of control teachers on the word Toward the end of the treatment, the students of the
skills subtest of the SAT. treatment (SAIL) teachers reported more awareness
of comprehension strategies during the interview
Value of effect size: +1.67 than did the students of control group teachers.
Type of summary statistics from which effect size Value of effect size: +4.03
was derived: t-test
Type of summary statistics from which effect size
Number of classrooms providing the effect size was derived: t-test
information:
Number of classrooms providing the effect size
Ns = 5 and 5 information:
c. Name of measure: Story recall: Literal Ns = 5 and 5
information
f. Name of measure: Strategy awareness interview:
Students of the treatment (SAIL) teachers recalled
Word-level strategies
more literal information than students of control
teachers. Toward the end of the treatment, the students of the
treatment (SAIL) teachers reported more awareness
Value of effect size: of word-level strategies during the interview than
Story 1: +0.69 did the students of control group teachers.
Type of summary statistics from which effect size Type of summary statistics from which effect size
was derived: t-test was derived: t-test
Number of classrooms providing the effect size Number of classrooms providing the effect size
information: information:
Ns = 5 and 5
Ns = 5 and 5
d. Name of measure: Story recall: Interpretation Student strategy usage:
Students of the treatment (SAIL) teachers were g. Name of measure: Think-aloud measure
significantly more interpretative in their retelling of the Students of treatment (SAIL) teachers applied
stories than were students of control teachers. significantly more strategies during the think-aloud task
than did the students of control teachers.
Value of effect size:
Value of effect size: +2.98
Story 1: +1.01
Type of summary statistics from which effect size
Story 2: +1.07
was derived: t-test
is possible to “train” teachers to use particular methods Cruickshank and Metcalf express this sentiment:
to teach, it seems more appropriate to educate teachers
Literature on the conduct, objectives, and the
in a professional context that will give them control over
effectiveness of training in teacher education is
a wide range of decisionmaking tools.
sparse . . . . Given the historic brouhaha over
The Panel also distinguishes between teacher education training in teacher preparation, it would be expected
(largely preservice or prior to certification) and that a considerable available related literature would
professional development (largely inservice or exist. Such is not the case (Cruickshank & Metcalf,
postcertification). The Panel has done this for two 1990, p. 491).
reasons. First, it is conceptually important to distinguish
between programs in which participants are essentially Database
full-time students and part-time teachers and those in To examine the research related to teacher education
which participants are full-time teachers and part-time and professional development, electronic searches were
students. The second reason is that the research fell performed on the ERIC, PsycINFO, OCLC World
into these distinct categories. Different concerns and Catalog, and OCLC Article First databases. The search
different research variables and outcomes were terms used and numbers of articles returned are
involved in the two different research literatures. included in Appendix A.
Despite the division, the Panel does believe they are
clearly related. The initial selection process identified more than 300
papers; many of these were nonexperimental and were
Taken together, the many theoretical formulations, therefore not included. The resultant set of studies was
empirical findings, and practical concerns suggest how then divided into two categories: research on preservice
important teacher education is in the teaching of and research on inservice or professional development.
reading. It was deemed appropriate to conduct an The criteria used were that preservice research was
analysis of the research on teacher education to primarily concerned with the training of prospective
determine what can be supported by research. teachers before certification or full-time work in
The analysis was guided by the three primary questions: classrooms, whereas inservice work was concerned
with teachers who were already teaching in school
1. How are teachers taught to teach reading? environments.
2. What do studies show about the effectiveness of To supplement the electronic searches, the
teacher education? bibliographies of the articles identified in the electronic
3. How can research be applied to improve teacher searches and a recent review of teacher education
development? research in reading (Anders, Hoffmann, & Duffy, 2000)
were examined for additional citations that did not
Two secondary questions were also posed prior to the appear in the electronic searches themselves.
analysis: Appropriate citations that had not been identified in the
1. What findings can be used immediately? electronic searches were added to the pool of research
studies to be examined. There were four studies
2. What important gaps remain in our knowledge? reviewed in the comprehension subgroup report on
preparing teachers to teach reading comprehension.
Methodology Those four studies were included in the teacher
There is a widespread belief that there is little research education analysis as well.
on teacher education, despite the great interest in the A total of 32 studies met the final criteria: 11 preservice
issue. and 21 inservice. Because of the way in which the
results of some of the underlying research was
reported, there were more articles than studies. That is,
there were two instances where two published papers There are also notable programs where teacher
reported on different aspects of the same research education or professional development is an important
project. An additional eight studies focused on inservice component of the intervention. Reading Recovery© is
on teaching for special education or learning disability one example of such a program; Success for All is
students. These have not been coded but are noted here another. However, most of the research studies on
as a subgroup of the inservice studies. these programs do not include measures of teacher
changes in their results. Again, as in most instructional
Analysis research, the focus is on the specific interventions and
It was determined that to conduct meta-analyses on student outcomes rather than teacher change. The
these data would be inappropriate because there is not Panel did not find studies that met the NRP criteria that
a critical mass of studies researching the same were in either of the two categories.
variables or theoretical positions. Moreover, although all One reason that teacher education has been ignored in
the studies do address the general problems of these research contexts is that researchers believe that
improving teacher education, the underlying rationales any changes in student outcomes are attributable to the
for the studies represent an eclectic mix of theories and intervention, which is, in turn, delivered by the
conceptualizations. participating teachers. This would logically imply that
Consistency With the Methodology of the teachers had learned to deliver the instruction in the
National Reading Panel way the research program dictated. This is, in part, the
criterion of fidelity to the intervention. However, the
The methods of the NRP were followed in the conduct issue goes well beyond fidelity of teaching to the many
of the literature searches and the examination and other variables that relate to teaching rather than to
coding of the articles obtained. Because a meta- learning.
analysis was deemed inappropriate, the data were
coded using a subset of the coding scheme adopted by Although these studies have not been analyzed as part
the NRP. These data are contained in Appendix B. of the pool of studies, they have some relevance to the
interpretation of the analysis. Consequently,
Some Additional Considerations in recommendations at the end of the analysis have been
Research on Teacher Education influenced by these concerns.
When research is conducted on instructional variables, Results
it is often the case that the participating teachers
receive instruction in the instructional interventions. For In the presentation of results, the research on
example, when comprehension strategy research is preservice teacher education has been separated from
conducted in classrooms, the instructors (either that on professional development with inservice
classroom teachers or the researchers) must be taught teachers. The Panel believes this is fundamentally
to conduct instruction in the appropriate manner. In this appropriate because different quality criteria and
sense, almost all of the research the NRP has identified outcome measures can be applied to the research
contains some elements relative to teacher education. studies. In particular, the criteria of success are
However, in these circumstances, the focus is almost different for the two sets of studies.
exclusively on student outcomes, without detailed data That is, for preservice studies, the focus is almost
on changes in teacher behaviors. Although the NRP entirely on changing teacher behavior, without a
recognizes the importance of the more general form of concomitant focus on the outcomes of students who are
teacher education and professional development, it (eventually) instructed by those teachers. The Panel
determined that these factors would not be included in found no instances of research in the pool that
the current analysis because of the lack of teacher continued with preservice teachers as they moved into
performance data. full-time teaching positions. There is no inherent reason
why this is the case. The reasons seem, instead, to be
pragmatic and related to the complexities of research
that would be introduced in attempting to follow
teachers into full-time teaching. Although the lack of • General methods: Directed Reading-Thinking
student data limits the conclusions one can draw about Activities (DRTA); teaching word recognition skills;
the results of this research, it does provide an important Directed Reading Activity (DRA); Informal
background for other teacher education and Reading Inventory (IRI) (4)
professional development research. If teacher
• Materials: Estimating readability levels; teacher
behaviors cannot be transformed by changes in the
decisionmaking and awareness of materials (2)
curriculum in preservice programs, it is unlikely that
teacher behaviors can be changed later. • Others: Case method; study skills; theoretical
orientations to reading (3)
For inservice research, the ultimate test of success is
whether students benefit from instruction delivered by The majority of the preservice studies reviewed
teachers as a result of that intervention. Consequently, (10 of 11) reported improvements in teacher knowledge.
the Panel invoked a strong criterion that student Of these ten, two reported mixed or modest effects.
outcomes must be part of the research on inservice Only one study, which looked at the accuracy of
teachers. However, another criterion is also critical. If teachers in estimating the readability levels of materials,
there is no change in teachers as a result of the did not report any effect from having either theoretical
intervention, it is not possible to attribute changes in knowledge of reading or teaching experience, or both,
student outcomes to the teacher development compared with a control group with neither theoretical
intervention. Other factors must be invoked to account knowledge nor teaching experience.
for the changes in students. Consequently, the NRP
must have both teacher changes and student changes to The duration of the studies reviewed here ranged from
agree that inservice interventions are effective. 5 to 6 weeks to about a year, which corresponds closely
Although the Panel believes that preservice and to the structure of university-based coursework.
inservice research form two different bodies of work, Although these studies show that preservice courses
they are related in that preservice does provide improved prospective teachers’ knowledge, there is no
evidence for the efficacy of producing teacher change. way of knowing whether this increased knowledge
Those changes can be important in designing inservice actually translates into effective teaching because none
interventions. of the studies reports data on the teachers after their
participation in the experimental program.
Preservice Studies
In the NRP sample, no studies of larger scale
Eleven preservice studies met the criteria for this interventions at the program level were found. For
portion of the NRP analysis. These preservice studies, example, there were no experimental studies that
with coded information, are grouped in Table 1 in looked at changes in the format of teacher education
Appendix B. Table 2 in Appendix B lists two studies that programs like the use of professional development sites
involved preservice interventions as well as inservice or the use of standards-based programs.
interventions. Most of the preservice research (ten
studies) focused on elementary reading instruction. Two Inservice Studies
(of the ten) studies had a broad range of grade samples, There were 21 inservice studies that met the criteria for
spanning grade levels from K through 8 and 1 through this review. These studies are listed in Appendix B:
6. For one study it was not possible to determine the Coding of Studies. There are four groupings: studies that
grade level. involved both inservice and preservice interventions
The content of the teacher education in these studies is (Table 2), studies that measured only teacher outcomes
a primary variable in distinguishing among studies. The (Table 3), studies that measured both teacher and
11 studies can be classified into the following four student outcomes (Table 4), and studies that measured
categories. For each category, the number of studies is only student outcomes (Table 5).
indicated in parentheses.
• Comprehension and strategy instruction:
Questioning techniques (2)
The first analysis of the data was to determine the It appears to be the case that the emphasis is on
grade levels of the teachers who participated in the specific methods of teaching reading, rather than the
inservice work. For 18 of the studies, it was possible to general methods that characterize preservice research.
do so. Because the studies often involved multiple grade There is much less emphasis on the general aspects of
levels, there was a total of 70 different samples of teaching reading. Three studies investigated ways in
teachers represented in the 18 studies. These data are which to improve teacher attitudes, reflecting the needs
represented in Figure 1 on the next page. of teachers on the job.
It is evident that the inservice instruction is targeted at Effectiveness of Inservice Instruction
the elementary grades with approximately equal
emphasis. The numbers of studies across grades 1 Only 11 studies in the NRP pool measured both teacher
through 5 are equal. There are far fewer studies at the and student outcomes. Six other studies measured only
middle and high school grade levels, with only a single teacher outcomes, whereas four measured only student
study at each of the high school grades. outcomes. As noted above, it is necessary to have both
teacher and student outcomes to be able to determine
A second analysis examined the focus of inservice whether teacher education is effective. If it is, it must
instruction for teachers of reading. Compared with the change both teacher and student behavior. That is,
work in preservice programs, inservice instruction teachers must adopt new ways of teaching and students
seems to be more eclectic, ranging from training in must show appropriate improvement if the results are to
specific methods (e.g., how to use reading groups) to be attributed to the new ways of educating teachers.
more extensive instruction encompassing ways to teach
reading, classroom management, and lesson design. The The measures of teacher change and student outcomes
topics fell into the following categories, with the number used in this body of research were a combination of
of studies indicated in parentheses. informal, researcher-designed assessments and
standardized evaluations. As a generalization, the
• Comprehension and strategy instruction: teacher outcome measures were all researcher-
Higher order questioning, explicit instruction in using designed, whereas the student measures tended to be
reading skills strategically; questioning and student- standardized instruments. At times, student outcomes
teacher interactions; Transactional Strategy were measured with a combination of researcher-
Instruction (TSI); questioning and response designed and standardized measures. Given that the
guidance cues (8) researchers designed the treatments, standardized
measures of outcomes often did not exist, necessitating
• General methods: Skills vs. Language Experience
the development of researcher-designed instruments.
Approach (LEA); DRA; whole language; phonics,
question-and-answer, and giving feedback; teaching Another set of analyses examined the duration of the
a language arts/integrated curriculum (5) project and the number of hours of instruction delivered.
Figure 2 presents the data on the duration of projects.
• Classroom management: Small groups; reading
groups; conducting cooperative learning activities; Of the 21 studies, only 4 had durations of 6 months or
using performance assessment; translating less. However, the duration of the project is not
Madeline Hunter’s Instructional Theory Into necessarily the crucial variable. Where possible, the
Practice, focusing on effective classroom total amount of time spent in instruction was also
management, motivation and lesson design (5) examined. It was possible to determine the number of
hours of instruction in 11 studies. For many of the
• Impr oving teachers’ attitudes: Teaching writing
Improving
studies, the number of hours of instructional intervention
as a process to facilitate change in teachers’
is not specified; these studies were not included in this
attitudes to language; improving content area
analysis. Often what are reported are phrases like “a
teachers’ skills and attitudes to teaching reading;
monthly meeting” or “weekly workshops.” No attempt
enthusiasm training. (3)
was made to interpret these; only those studies for which Figure 3 shows that for the 12 studies for which
unambiguous determinations could be made were instructional time could be determined, the greatest
analyzed. The data for instructional time are presented in number of hours of instruction was 60. The majority of
Figure 3. the studies (8 of 12) presented 15 or fewer hours of
instruction.
Report
Reports of the Subgroups
Report
Chapter 5: Teacher Education and Reading Instruction
Studies Reporting Positive Changes in However, three long-term inservice programs reported
Teacher Outcomes by Talmage, Pascarella, and Ford (1984), Miller and
Ellsworth (1985), and Duffy and coworkers (1987a)
Seventeen out of the 21 studies reviewed measured showed gains by teachers and significant or partial
teacher outcomes. Fifteen of these studies reported achievement gains by students. Because of this
significant or modest improvements in teachers’ discrepancy, the Panel could find no relationship
knowledge or practice. Out of the fifteen studies that between the amount of instructional time (or duration of
measured student outcomes, 13 reported improvements programs) and student outcomes. This may be a
in student achievement. One clear trend in the data is function of the limited number of research studies for
that where teacher outcomes showed significant which the Panel could make the relevant
improvement, so did student achievement. In studies determinations.
where no gains are reported for the teachers, no gains
are reported for the students in the same study. In It is difficult to compare the studies reviewed here in
general, one can conclude that inservice professional terms of the duration of instruction that the teachers
development does lead to improved teacher knowledge received. Hence, it is not possible to draw specific
and practice and improved student achievement. conclusions about the relationship between length or
Because the content of each of these studies is widely intensity of instruction and outcomes. The duration of
divergent, it is not possible to reach a specific the inservice intervention depends on the specific
conclusion about the content of instruction. objectives and requirements of the program. Sometimes
the intervention consisted of the dissemination by mail
Studies Reporting No Change in Teacher of a manual (Coladarci & Gage, 1984) or two meetings
Outcomes and the discussion of a teaching manual (Anderson,
Evertson, & Brophy, 1979). It could take the form of a
Three studies (Coladarci & Gage, 1984; Morrison,
series of workshops or meetings spread over 2 days
Harris, & Auerbach, 1969; Stallings & Krasavage,
(Scheffler, Richmond, & Kazelskis, 1993) or a year
1986) reported no change in teacher outcomes, in at
(Shepard, Flexer, Hiebert, Marion, Mayfield, & Weston,
least some of the conditions in the research projects. In
1996) or three workshops spread over 3 summers
two of these studies, where student outcomes were
(Spanjer and Layne, 1983). It could also take the form
measured, student achievement did not improve either.
of a systematic 3-year staff development program
A closer look at these studies reveals two interesting (Stallings, Robbins, Presbrey, & Scott, 1986; Stallings &
points. First, one study (Coladarci & Gage, 1984) did Krasavage, 1986). The studies do not report the
not involve any formal instruction for teachers. Instead, duration of the intervention in a consistent manner:
teachers in the treatment group were given “teacher some report the number of hours of instruction,
education packets” consisting of materials on a diverse whereas others report the overall duration of the project
range of topics, including behavior management, large- or duration of the staff development program.
group instruction, use of question-and-answer, phonics,
Two other issues were difficult to assess. The Panel
questioning, and feedback strategies.
was unable to determine the amount of resources
Second, all three studies were long-term projects. The (personnel, equipment, and materials) from the reports
study in which teachers received no formal instruction of the research. This amount would have a direct
lasted about a year. The other two were 3 years in bearing on the ultimate effectiveness of the
duration. Morrison and colleagues (1969) caution interventions. It was also not possible to find any
against using short-term results to validate teacher experimental research on inservice professional
education efforts because, in the course of their 3-year development that related to the issues surrounding
study, they found that teachers and administrators standards-based education.
reverted to what they had been doing before the project
The NRP did not conduct a separate analysis of the
began. Stallings and Krasavage (1986), at the end of
research on preparation of teachers for comprehension
their 3-year study, also reported that teacher and
instruction. An extensive analysis of this research is
student outcome measures actually declined although
included in the report from the comprehension
gains by teachers and students were reported during the
subgroup.
first 2 years of the study.
improvement in their teaching; this improvement leads Directions for Further Research
directly to higher achievement on the part of their
students. These findings were demonstrated in an What important gaps remain in our
overwhelming majority of the research studies knowledge?
reviewed.
Perhaps the most apparent feature of the research
However, there is insufficient research to draw exact analyzed in this study is that there are significant gaps in
conclusions about the content of teacher education and our knowledge of teacher education and development
professional development programs. Rather, a wide across the board. Part of the difficulty is that high-
range of techniques and content seemed to produce quality teacher education research is expensive and
improvement in teaching and in student outcomes. The requires intensive collaborative efforts from all the
body of research on these topics is fragmented when it stakeholders. In subsequent sections, the Panel details
comes to this level of questioning. There are studies of what it considers the most important questions that need
specific methods of teacher education with specific to be resolved.
content as well as more general studies that offer no
The Panel found no studies in the sample that
guidance on content.
addressed questions related to the development of
Teacher attitudes do change as a result of intervention standards. Therefore, it makes no conclusion about the
in both preservice and inservice contexts. This is an efficacy of establishing either content standards for
important finding because it is the predisposition of students or for teaching teachers on the basis of those
teachers to change that makes change possible. Without standards. Many of the interventions clearly include
a change in attitude, it is extremely difficult to effect elements that are also contained in many standards-
changes in practice. Most of the research that based programs. However, too many other factors are
measured attitudes demonstrated that attitudes did involved to be able to attribute causal relationships.
change as a result of the interventions, indicating that at
The Panel also found that the reporting of studies was
least one of the major prerequisites for teacher change
inconsistent. Many studies were not described in
can be taught.
sufficient detail to make comparisons. Foremost was a
Teacher practices improve as a result of education, but lack of consistent attention to the amount of instruction
it is not clear for how long these changes are sustained. and the frequency of instruction in the description of the
Teachers may use the new methods only when studies, which makes it difficult to tell whether it was
observed. Although some of the studies in this sample reasonable to expect either success or failure in
were long term, exceeding 2 years, there is little individual studies. Some studies reported only the
evidence on the sustainability of the interventions. That number of sessions, others only the amount of
is not to say that the interventions were not sustained, instruction, and still others neither.
but that in most of the studies there was simply no
Another important oversight was a description of the
evidence presented that spoke to this issue.
resources (personnel, time, money, facilities, etc.)
Student achievement outcomes can be improved as a required to implement particular programs. It was often
result of teacher development. For inservice studies that impossible to tell what it would take to implement some
measured both teacher and student outcomes, this was of the interventions. Consequently, no assessment could
a clear finding. These studies represent the most be made about the cost-effectiveness of most of the
effective types of research, recognizing the need to programs or interventions.
assess both teachers and students. However, even in
There is a large body of nonexperimental literature that
these studies, sustainability of the student improvements
addresses teacher education issues. Under the
is an issue that was not addressed.
guidelines established for the review, this literature was
to be used to help interpret findings from the analysis of
the experimental literature. However, because of the to having control over their own domains and often do
lack of convergence in the experimental research, the not want to relinquish control to any outside influences.
Panel was unable to bring this nonexperimental Moreover, new “alliances” need to be formed. For
literature to bear on the current analysis. example, to answer the questions about effectiveness of
preservice education, graduating teachers will need to
The NRP believes that the nonexperimental literature is
be followed as they assume teaching jobs. Those who
a rich source for future research programs. Teacher
do the preparation of teachers will have to work with
education research involves particularly complex
persons in the new locations where the graduates work.
problems. Doing the research is expensive and time
(Because schools rarely hire teachers en masse, the
consuming. Therefore, one particular contribution of the
alliances may have to span districts or other geographic
nonexperimental literature may be to provide a source
locations to be able to study teachers in sufficient
of problems to be studied under more controlled
numbers.)
conditions. That is, the descriptive literature could be
brought to bear to reveal current practices, variables, To accomplish the kind of reforms that accompany
and so forth, that seem promising (or not) under general teacher education improvement requires years of
conditions. Such insights could guide research that looks sustained effort at keeping all elements of the system in
more closely at causal relationships or in more specific balance. All of this must take place against a backdrop
situations. In addition, the Panel refers the reader to the where the participating individuals may change over the
conclusions of the Text Comprehension report, in the course of a research project. Placed against the other
belief that the principles underlying them apply more demands (tenure, teaching, publication) on many
broadly to other subject areas and could also serve to academic researchers, commitments to the long-term
guide future research in teacher preparation. nature of teacher education research often seem
daunting.
The small set of experimental studies reviewed does not
allow us to address all the questions that originally In addition to the appropriate resources, stronger and
guided the analysis. Some of these remain unanswered more coherent conceptualizations of teacher education
because of the eclectic nature of the work found. Many and professional development are needed. These
are unanswered because they were not addressed conceptualizations need to combine research from a
specifically in the experimental body of research. There wide variety of perspectives and paradigms to provide
was a great deal of nonexperimental research that fell the most coherent description of teacher education
outside the scope of the experimental domain examined. possible. Such conceptualizations will guide research in
This research addresses a few of the relevant questions more systematic ways, rather than allowing the highly
that are listed below, but not all and certainly not eclectic forms of investigations that characterize
definitively. A general conclusion here is that although current teacher education research. There are excellent
we have a great deal of knowledge about teacher examples of good teacher education research; more are
education, much more remains to be learned. needed, as is better reporting of the results as they are
disseminated so that subsequent research can build on
Many of the questions are unanswered because of the
completed research rather than begin anew with each
resource intensity of teacher education research. It
effort.
takes a great deal of time and money to do teacher
education research in ways that will yield appropriate We need to find out how teachers can be supported over
answers. It takes a commitment from stakeholders, and the long term to ensure sustained implementation of new
it takes a great deal of coordination among them. methods or programs, as well as the sustainability of
Rarely do all of these elements come together in a way student achievement. There is a trend in the research
that admits of experimental research. analyzed that suggests that teachers may revert to their
original methods of teaching; it is important to determine
However, simply providing money and time is
how best to have teachers maintain any improvements
insufficient. High-quality teacher education research
they make in their teaching abilities.
must bring together persons who are engaged in quite
different endeavors in school contexts. They are used
Another problem that needs to be addressed in teacher Because teacher education is a labor-intensive
education research is the precise nature of the endeavor, new ways of instruction need to be developed
interventions. In the literature the NRP analyzed, there that make it possible for instruction to be more
is only sparse information on the precise content of effective. In the sample of studies, the Panel found a
what teachers were taught. Rather, there is a mix of total of seven preservice and inservice research studies
techniques, methods, theories, and materials that are that used various forms of technology to improve
often confounded with each other in the instructional teacher education. This is a promising direction.
contexts. Some of the instructional methods focus on Computer technology has made the use of video
teacher attitudes while others focus on the use of modeling and simulation even more available than it has
specific materials. This question should be addressed in been. The use of either simulated or real teaching
a systematic way. cases, linked with appropriate instruction, can provide
supplemental experiences to classroom instruction in
There is a need to develop and refine the ways in which
teaching.
we study the link between teacher education and
student outcomes. Only a few inservice studies looked The list of questions that remains is a long one.
at both teacher and student outcomes. None of the However, there is a growing consensus on many
preservice research made the link between teacher elements of the problems in teacher education and
outcomes and ultimate student performance. Although professional development. The technology to improve
all the inservice research that reported improved teacher knowledge and performance exists. Positive
teacher outcomes also reported improvement in student changes in teacher education have been demonstrated
achievement, there is no evidence that this is true for by a wide variety of interventions. Further studies are
preservice programs. needed to address the problems that remain.
Cruickshank, D. R., & Metcalf, K. K. (1990). Shulman, L. S. (1986). Those who understand:
Training within teacher preparation. In W. R. Houston Knowledge growth in teaching. Educational
(Ed.), Handbook of Research on Teacher Education Researcher, 15(2), 4-14.
(pp. 469-497). NY: Macmillan Publishing Company.
Appendix A
Studies Analyzed
Anderson, L. M., Evertson, C. M., & Brophy, J. E. Copeland, W. D., & Decker, D. L. (1996). Video
(1979). An experimental study of effective teaching in cases and the development of meaning making in
first-grade reading groups. Elementary School Journal, preservice teachers. Teaching and Teacher Education,
79(4), 193-223. 12(5), 467-481.
Baker, J. E. (1977). Application of the in-service Duffy, G. G., Roehler, L., Meloth, M. S., Vavrus, L.
training/classroom consultation model to reading G., Wesselman, R., Putnam, J., & Bassiri, D. (1986).
instruction. Ontario Psychologist, 9(4), 57-62. The relationship between explicit verbal explanations
during reading skill instruction and student awareness
Block, C. C. (1993). Strategy instruction in a and achievement: a study of reading teacher effects.
literature-based reading program. Elementary School Reading Research Quarterly, 21(3), 237-252.
Journal, 94(2), 139-51.
Duffy, G. G., Roehler, L. R., Sivan, E., Rackliffe,
Book, C. L., Duffy, G. G., Roehler, L. R., Meloth, G., Book, C., Meloth, M. S., Vavrus, L. G., Wesselman,
M. S., & Vavrus, L. G. (1985). A study of the R., Putnam, J., & Bassiri, D. (1987). Effects of
relationship between teacher explanation and student explaining the reasoning associated with using reading
metacognitive awareness during reading instruction. strategies. Reading Research Quarterly, 22(3), 347-368.
Communication Education, 34(1), 29-36.
Dupuis, M. M., Askov, E. N., & Lee, J.W. (1979).
Brown, R., El-Dinary, P.B., Pressley, M., & Coy- Changing attitudes toward content area reading: the
Ogan, L. (1995). A transactional strategies approach to content area reading project. Journal of Educational
reading instruction (National Reading Research Research, 73(2), 65-74.
Center). Reading Teacher, 49(3), 256-58.
Greenberg, K. H., Woodside, M. R., & Brasil, L.
Brown, R., Pressley, M., Van Meter, P., & Schuder, (1994). Differences in the degree of mediated learning
T. (1996). A quasi-experimental validation of and classroom interaction structure for trained and
transactional strategies instruction with low-achieving untrained teachers. Journal of Classroom Interaction,
second-grade readers. Journal of Educational 29(2), 1-9.
Psychology, 88(1), 18-37.
Hoover, N. L., & Carroll, R. G. (1987). Self-
Coladarci, T., & Gage, N. L. (1984). Effects of a assessment of classroom instruction: an effective
minimal intervention on teacher behavior and student approach to inservice education. Teaching and Teacher
achievement. American Educational Research Journal, Education, 3(3), 179-191.
21(3), 539-555.
Johnson, C. S., & Evans, A. D. (1992). Improving
Conley, M. M. W. (1983). Increasing students’ teacher questioning: A study of a training program.
reading achievement via teacher inservice education. Literacy Research, 10.
Reading Teacher, 36(8), 804-808.
Klesius, J. P., Searls, E. F., & Zielonka, P. (1990). Stallings, P., Robbins, P., Presbrey, L., & Scott, J.
A comparison of two methods of direct instruction of (1986). Effects of instruction based on the Madeline
preservice teachers. Journal of Teacher Education, Hunter model on students’ achievement: findings from a
41(4), 34-44. follow-through project. Elementary School Journal,
86(5), 571-587.
Levin, B. B. (1995). Using the case method in
teacher education: The role of discussion and Streeter, B. B. (1986). The effects of training
experience in teachers’ thinking about cases. Teaching experienced teachers in enthusiasm on students’
and Teacher Education, 11(1), 63-79. attitudes toward reading. Reading Psychology, 7(4),
249-259.
Miller, J. W., & Ellsworth, R. (1985). The
evaluation of a two-year program to improve teacher Talmage, H., Pascarella, E. T., & Ford, S. (1984).
effectiveness in reading instruction. Elementary School The influence of cooperation learning strategies on
Journal, 85(4), 485-496. teacher practices, student perceptions of the learning
environment, and academic achievement. American
Morrison, C., Harris, A. J., & Auerbach, I. T. Educational Research Journal, 21(1), 163-179.
(1969). Staff after-effects of participation in a reading
research project: a follow-up study of the craft project. Tyre, B. B., & Knight, D. W. (1972). Teaching
Reading Research Quarterly, 4, 366-395. word recognition skills to preservice teachers: An
analysis of three procedures. Southern Journal of
Olson, M. W., & Gillis, M. (1983). Teaching reading Educational Research, 6(3), 113-122.
study skills and course content to preservice teachers.
Reading World, 23(2), 124-133. Wedman, J. M., Hughes, J. A., & Robinson, R. R.
(1993). The effect of using systematic cooperative
Reid, E. R. (1997). Exemplary Center for Reading learning approach to help preservice teachers learn
Instruction (ECRI). Behavior and Social Issues, 7(1), informal reading inventory procedures. Innovative
19-24. Higher Education, 17(4), 231-241.
Scheffler, A. J., Richmond, M., & Kazelskis, R. Wedman, J. M., & Moutray, C. (1991). The effect
(1993). Examining shifts in teachers’ theoretical of training on the questions preservice teachers ask
orientation to reading. Reading Psychology, 14(1), 1-13. during literature discussions. Reading Research and
Instruction, 30(2), 62-70.
Shepard, L. A., Flexer, R. J., Hiebert, E. H.,
Marion, S. F., Mayfield, V., & Weston, T. J. (1996). Wedman, J. M., & Robinson, R. (1988). Effects of
Effects of introducing classroom performance a decision-making model on preservice teachers’
assessments on student learning. Educational decision-making practices and materials use. Reading
Measurement: Issues and Practice, 15(3), 7-18. Improvement, 25(2), 110-116.
Spanjer, R. A., & Layne, B. H. (1983). Teacher Westermark, T. I., & Crichlow, K. A. (1983). The
attitudes toward language: Effects of training in a process effect of theoretical and situational knowledge of reading
approach to writing. Journal of Educational Research, on teachers’ estimates of readability. Reading
77(1), 60-62. Psychology, 4(2), 129-139.
Stallings, J., & Krasavage, E. M. (1986). Program Wham, M. A. (1993). The relationship between
implementation and student achievement in a four-year undergraduate course work and beliefs about reading
Madeline Hunter follow-through project. Elementary instruction. Journal of Research and Development in
School Journal, 87(2), 117-138. Education, 27(1), 9-17.
Appendix B
Search Details
Se arch Te rms Us e d and Numbe r of Article s Re turne d: Nove mbe r 12, 1998
Preservice reading
teachers
De ve lo p me nt
Staff De ve lo p me nt 84 0 53 14 3 75 17 0 445
Re ad ing 213 5 44 85 27 84 76 0 25 51
Writing 94 5 9 19 5 27 35 0 9 43
Lite racy 55 3 6 11 9 25 26 0 6 22
Appendices
Co p e land , W.D.,
Tr: Ho w te ache rs inte rp re te d
& De cke r, D.L. Gro up -b ase d d iscussio n Mo re than o ne -third o f the to p ics
(me aning -making ) vid e o d ata &
(1996). Te aching e xp e rie nce with vid e o o f DRTA d iscusse d we re ad o p te d , transfo rme d , o r
Q No Ye s Pre 9 4* imp ro ve d afte r g ro up d iscussio n.
and Te ache r with 4th g rad e stud e nts 6 cre ate d b y the re sp o nd e nts to d e scrib e
Analyze d b y to p ics.
Ed ucatio n,12(5), we e ks the ir o wn me aning -making 3 we e ks late r.
St: No ne
467-481.
Ye s + Tr: 1) Surve y o f Stud y Hab its & Exp e rime ntal g ro up had sig nificant g ains
Rand o m Attitud e s o n surve y o f stud y hab its and attitud e s and
Pre 121
Olso n, M.W., & assig n-- Le arning stud y skills and 2) Vo cab ulary: Stanfo rd Diag no stic co urse co nte nt. The co ntro l g ro up tuto re d
National Reading Panel
n=?
Gillis, M. (1983). me nt o f co nte nt (to p ic: fund ame ntals o f No t Re ad ing Te st child re n in an e le me ntary scho o l. This
E No attritio n
Re ad ing Wo rld , te ache rs re ad ing ) co ncurre ntly. 1 state d 3) Co mp re : Stanfo rd DRT sug g e sts that p re se rvice te ache rs lack
Appendices
no t
124-133. in se me ste r (infe rre d ) 4) Re se arche r-d e sig ne d MCQ p ro ficie nt re ad ing hab its b ut can imp ro ve
re p o rte d
e xisting co urse co nte nt te st with le sso ns o n stud y skills taug ht at the
p ro g ram St: No ne same time as co nte nt instructio n.
Reports of the Subgroups
Ye s. 3
Que stio ning : Lite ral, te xt-b ase d
g ro up s (A,
infe re ntial q ue stio ns (TBIQ) and Bo th Gro up s A & B g e ne rate d TBIQ with
B, C).
Prio r Kno wle d g e -b ase d hig he r o ve rall fre q ue ncy & p e rce ntag e
We d man, J.M., Assig nme nt
infe re ntial q ue stio ns (PKBIQ) A: than C. As mo re p ractice was p ro vid e d ,
& Mo utray, C. b y GPA. Tr: Writte n q ue stio ns & aud io
Lite ral & infe re ntial q ue stio ns fre q ue ncy o f TBIQ incre ase d fo r all
(1991). Re ad ing Eq ual Pre re co rd o f q ue stio ns g e ne rate d
E No taug ht. B: Lo we r & hig he r le ve l J r e le g ro up s' q ue stio ns. Gro up B p e rfo rme d
Re se arch and numb e rs o f 36 d uring d iscussio ns
q ue stio ns in b asals we re sig nificantly hig he r than A o r C fo r TBIQ.
Instructio n, 30(2), lo w, St: No ne
taug ht. C: No instructio n in All g ro up s fo rmulate d PKBIQ. Gro up C
62-70. mid d le , &
q ue stio ning . Instructio n o n aske d mo re g e ne ral q ue stio ns than A o r
hig h.
classro o m use s o f lite rature . 8- B.
Pro b ab ly
we e k p ro je ct.
rand o mize d
Table 1: Pre s e rvice Studie s (continue d)
Particip ants' d e cisio n-making p ractice s and
mate rials use chang e d fro m p re te st to
We d man, J.M., & Te ache rs' kno wle d g e o f Tr: Me asure s o f d e cisio nmaking
p o stte st. Po stte st tho ug ht units re late d to
Ro b inso n, R. re ad ing and its e ffe cts o n (p ro b le m-so lutio n and analysis b y
Pre d e cisio ns ad d re sse d p ractice s re q uiring
(1988). Re ad ing Q No Ye s d e cisio nmaking p ractice s and E le "tho ug ht units" and p e rce p tio n) (3-
27 instructio nal re e valuatio n o r chang e . The
Imp ro ve me nt, aware ne ss o f mate rials. 1 p o int scale ).
numb e r o f mate rials me ntio ne d sub stantially
25(2), 110-116. se me ste r (infe rre d ) St: No ne
incre ase d fro m p re te st to p o stte st.
Q No Ye s Pre N=35 Final p hase o f the stud y K-8 vid e o tap e d to e nsure co nsiste ncy
De ve lo p me nt in as p up ils. The curre nt stud y sug g e sts that
e xamine d chang e s in b e twe e n re ad ing instructio n and
Ed ucatio n, 27(1), the me tho d s co urse s and the stud e nt
o rie ntatio n fro m p re -co urse wo rk re sp o nse s to TORP.
9-17. te aching e xp e rie nce are o nly mo d e stly
to p o st-stud e nt te aching . St: No ne
re late d to chang e s in the the o re tical
Duratio n: ab o ut 3 se me ste rs
o rie ntatio ns o f p re se rvice te ache rs.
National Reading Panel
Appendices
Reports of the Subgroups
Pre &
Autho rs, Date , & Co ntro l: Pre / Typ e Of Te ache r Training & De p e nd e nt Me asure s: Find ing s
Exp / Quasi Po st: Gr
Pub lishe r Ye s/No In-S Duratio n Te ache r (Tr) & Stud e nt (St)
Ye s/No
Ye s. No t
Op p o rtunity to re ad , write , and d iscuss a
re p o rte d
case affe cte d te ache rs' thinking ab o ut the
whe the r the re
case . Fo r e xp e rie nce d te ache rs,
was rand o m
Le vin, B.B. Case me tho d in te ache r d iscussio n was a catalyst fo r re fle ctio n
Q assig nme nt.
(1995). e d ucatio n. and me taco g nitio n. Le ss e xp e rie nce d
Particip ants Each g ro up Tr: analyse s o f writte n
Te aching and 8 p re 2 case s: te aching writing in 4th te ache rs and p re se rvice te ache rs we re
fro m had e q ual Ye s 4* re sp o nse to case s
Te ache r 16 in g rad e Re ad ing & writing ab o ut ab le to clarify and e lab o rate the ir thinking .
e xisting numb e rs o f St: No ne
5-28
Ed ucatio n,11(1),- case vs. re ad ing , d iscussing , Only re ad ing and writing ab o ut a case
p ro g ram. stud e nt,
63-79. & writing . Ab o ut 5 we e ks p ro vid e d little stimulus fo r te ache rs to
b e g inning ,
e lab o rate the ir und e rstand ing o r incre ase
and
the ir p e rsp e ctive s o n issue s in the case .
e xp e rie nce d
te ache rs.
1-9. St: No ne
q ue stio n d yad variab le s. the m to think mo re d e e p ly thro ug h
Duratio n: 3 ye ars. Mo re than re p hrasing and g iving clue s. Limitatio n:
Appendices
60 ho urs o f training . small samp le .
Reports of the Subgroups
Te aching writing as a
p ro ce ss to facilitate a Tr: Te ache rs' attitud e s we re The p o stte st me an was sig nificantly
Sp anje r, R.A., &
chang e in te ache rs' attitud e asse sse d using the Lang uag e g re ate r than the p re te st me an. The
Layne , B.H.
to lang uag e . Wo rksho p 38 e le m.- Inq uiry (Fro g ne r, 1969) inve nto ry. p ro ce ss ap p ro ach to writing may
(1983). Ye s.
In curriculum ad ap te d fro m mid -sch.; The instrume nt co ve rs stand ard s influe nce te ache rs' attitud e s to ward
Jo urnal o f Q No Te ache rs
78 Be rke le y' s Bay Are a Pro je ct 41 se c.- in using Ame rican Eng lish. and lang uag e (i.e ., le ss rule b o und &
Ed ucatio nal o nly
(1977). Duratio n: 3 ye ars. p o st-se c. o n lang uag e stud y & te aching . p re scrip tive , mo re se nsitive to
Re se arch, 77(1),
3 wo rksho p s o ve r 3 1 missing p re te st sco re , n=78 usag e acco rd ing to p urp o se and
60-62.
summe rs. Training ho urs no t St: No ne . co nte xt).
re p o rte d .
5-31
National Reading Panel
Appendices
Reports of the Subgroups
79(4), 193-223. assig ne d to we re he ld . Training ho urs: the tre atme nt p ro b ab ly had e ffe cts o n
achie ve me nt.
co ntro l o r No ne . stud e nt achie ve me nt, b ut o the r
tre atme nt. e ffe cts (e .g ., scho o l e ffe cts) canno t
b e rule d o ut.
Pre &
Autho r/s, Date , Ex p / Co ntro l: Typ e o f Te ache r De p e nd e nt Me asure s: Find ing s
Po st: Pre / In-S Gr
& Pub lishe r Quasi Ye s/No Training & Duratio n Te ache r (Tr) & Stud e nt (St)
Ye s/No
o f Ed ucatio nal assig ne d . stud e nts ye ar. Training ho urs: no t re te lling and se q ue ncing o f 2
mo re inte rp re tive in the ir re te lling o f
Psycho lo g y, 88 re p o rte d . The e xp e rime ntal sto rie s. c) Think-alo ud task to
the sto rie s. Stud e nts o f tre atme nt
Appendices
(1), 18-37 g ro up te ache rs d id no t d e te rmine whe the r stud e nts
te ache rs re p o rte d mo re aware ne ss o f
re ce ive training fo r this we re mo re te xt- o r re ad e r-
co mp re he nsio n and wo rd -le ve l
stud y. All had e xte nsive b ase d in the ir re sp o nse s to
strate g ie s than d id the stud e nts o f
p rio r e xp e rie nce with TSI. p ro b e s. d ) Stand ard ize d
co ntro l te ache rs.
sub te sts o f re ad ing
co mp re he nsio n and wo rd
skills (Stanfo rd Achie ve me nt
Te st [SAT]).
Reports of the Subgroups
Te ache r e d ucatio n
p acke ts (TEP) b y
Crawfo rd e t al., 1978
we re g ive n to tre atme nt
As an e xp e rime nt, this stud y faile d to
Ye s; and co ntro l g ro up
co rro b o rate the p o sitive re sults
p re and te ache rs. TEP co nsiste d Tr: Classro o m o b se rvatio n p re -
Co lad arci, T., & o b taine d p re vio usly in similar
p o st fo r o f a) b e havio r manag e - and p o sttre atme nt. Ob se rvatio n
Gag e , N.L. classro o m-b ase d e xp e rime nts.
te ache rs me nt & d iscip line b ) re co rd s yie ld e d ro ug h
(1984). To ward the e nd o f the scho o l ye ar,
& larg e -g ro up instructio n, e stimate s o f the e xte nt to
Ame rican the e xp e rime ntal g ro up te ache rs d id
E Ye s stud e nt- In 32 use o f Q&A, & p ho nics 4,5,6 which te ache r b e havio r
Ed ucatio nal no t sho w ap p re ciab ly g re ate r
s. 28 e xe rcise s in re ad ing ; c) re fle cte d TEP
Re se arch co nfo rmity to the TEP
classe s q ue stio ning and re co mme nd atio ns St:
5-34
Co mp re he nsio n
instructio n (lite ral,
infe re ntial, critical, and
cre ative ). Includ e d hig he r
o rd e r q ue stio ning Grad e 6
Te ache rs b e ne fite d (e vid e nt fro m
te chniq ue s. No te : mate rials
Co nle y, M.M.W. Ye s. Te ache rs Tr: Ong o ing fo rmative q ualitative d ata d uring inse rvice
Ye s, fo r Stud e nts we re all b lack & we re
(1983). Re ad ing in e ach scho o l e valuatio n. St: Gate s- e valuatio ns and fe e d b ack), b ut mo re
E stud e nts In 32 fro m lo w so cio e co no mic use d ;
Te ache r, 36(8), we re rand o mly MacGinitie Re ad ing Te st imp o rtant we re sig nificant
o nly b ackg ro und s. The y we re stud e nts
804-808. assig ne d (le ve l E) co mp re he nsio n g ains fo r stud e nts.
se le cte d b e cause the y we re
re ad b e lo w the natio nal ung rad e d
no rm fo r the ir ag e le ve l.
Duratio n: 6 mo nths.
Ab o ut 10-15 ho urs o f
training .
Table 4: Ins e rvice Studie s With Te ache r and Stude nt Outcome M e as ure s (continue d)
De p e nd e nt Me asure s:
Autho r/s, Date , Ex p / Co ntro l: Pre & Po st: Typ e o f Te ache r Find ing s
Pre / In-S Gr Te ache r (Tr) & Stud e nt
& Pub lishe r Quasi Ye s/No Ye s/No Training & Duratio n
(St)
1) Ye s:
Base line
Te ache rs who we re traine d we re rate d
d ata o n
sig nificantly hig he r than tho se in the
te ache r
Duffy, G.G., co ntro l g ro up in e xp licit strate g y
e ffe ctive ne ss Tr: Rating s o f te ache rs'
Ro e hle r, L.R., instructio n. Stud e nts in the e xp e rime ntal
thro ug h instructio nal e xp lanatio ns
Me lo th, M.S., Exp licit instructio n and g ro up s sho we d sig nificantly mo re
o b se rvatio ns (transcrip ts) St: Rating s o f
Vavrus, L.G., e xp lanatio n in using re ad ing aware ne ss o f re ad ing strate g ie s. But
+ p o st- "aware ne ss" afte r le sso ns
Bo o k, C., skills strate g ically. Lo w the re we re no achie ve me nt g ains in
Ye s; rand o mly tre atme nt (transcrip ts): 5 stud e nts
Putnam, J., & E In 22 re ad ing g ro up s. Duratio n: 7 5 co mp re he nsio n. The q ualitative d ata fro m
assig ne d o b se rvatio ns. inte rvie we d p e r te ache r.
We sse lman, R. mo nths. 1 me e ting & 3 g o o d te ache rs and 3 le ss e ffe ctive
2) Ye s: Gate s-MacGinitie Re ad ing
(1986). Re ad ing p re se ntatio n + 10 ho urs o f te ache rs sho we d the fo rme r p ro d ucing
stud e nts Te st (1978) Time take n
Re se arch training . sig nificantly g re ate r g ro wth in
we re fo r co ntro l and tre atme nt
Quarte rly, 21(3), achie ve me nt. Stud e nts in the tre atme nt
me asure d g ro up s to d o the te st.
237-252. g ro up to o k lo ng e r to co mp le te the
p re - and
p o stte st.
p o st- o n
5-35
Putnam, J., & assig ne d . stud e nts Lo w re ad ing g ro up s. inte rvie ws (imme d iate ly tre atme nt te ache rs sco re d sig nificantly
Bassiri, D. Duratio n: 1 acad e mic ye ar. fo llo wing a le sso n) & hig he r than stud e nts o f co ntro l te ache rs
Appendices
(1987). Training ho urs: 12 co nce p t inte rvie ws (at the o n MEAP. Stud e nts o f tre atme nt te ache rs
Re ad ing e nd o f the ye ar) d ) sco re d hig he r also in -Le sso n inte rvie ws
Re se arch Sup p le me ntal -Co nce p t inte rvie ws -SAM (Part 2 o nly,
Quarte rly, 22 (3), Achie ve me nt Me asure no t Part 1) -Mo d ifie d GORP te st.
347-368. (SAM) [re se arche r-
d e sig ne d ] e ) Mo d ifie d
Grad e d Oral Re ad ing
Parag rap h (GORP)
Table 4: Ins e rvice Studie s With Te ache r and Stude nt Outcome M e as ure s (continue d)
Reports of the Subgroups
De p e nd e nt Me asure s:
Autho r/s, Date , Ex p / Co ntro l: Pre & Po st: Typ e o f Te ache r Find ing s
Pre / In-S Gr Te ache r (Tr) & Stud e nt
co mp ariso n o f
stud e nts. stud e nt re ad ing inte re sts. b e havio rs in all six are as than d id a
p articip ating and no n-
Duratio n: 2 ye ars. Training samp le o f no np articip ating te ache rs (N =
p articip ating te ache rs'
ho urs: no t re p o rte d . 17). A p o stho c analysis sho we d that
stud e nts.
the re we re sig nificant d iffe re nce s in
stud e nt achie ve me nt at 0.05 le ve l.
Tr: Te ache rs we re
The co ntro l g ro up sho we d so me g ains,
o b se rve d p re - and
b ut no t as much as the e xp e rime ntal
p o sttraining . Variab le s
g ro up . Training le d to incre ase d le ve ls o f
includ e vo cal d e live ry,
Stre e te r, B.B. o b se rvab le te ache r e nthusiasm. Only
e ye s, g e sture s,
(1986). Enthusiasm training fo r o ne o f the fo ur d ime nsio ns o f the stud e nt
mo ve me nts, facial
Re ad ing Ye s. Te ache rs te ache rs. Vid e o tap e s use d me asure sho we d sig nificant chang e .
In e xp re ssio ns, wo rd
Psycho lo g y: An E we re rand o mly Ye s fo r p o stco nfe re ncing . 1-5 The re was a d ro p in the "Exp re sse d
19 se le ctio n, acce p tance o f
National Reading Panel
Inte rnatio nal assig ne d Duratio n: 2 we e ks. Re ad ing Difficulty" d ime nsio n, sho wing
id e as, and o ve rall
Quarte rly, 7(4), 10 ho urs o f training . le ss p e rce ive d d ifficulty with re ad ing .
e ne rg y.
Appendices
249-259 He nce , te ache rs' hig he r le ve ls o f
St: Attitud e s to re ad ing
e nthusiasm p o sttraining had an e ffe ct o n
me asure d b y the SRA
stud e nts' attitud e s to re ad ing .
Primary Le ve l (p re and
p o st).
Reports of the Subgroups
Tr: Classro o m
o b se rvatio ns, inte rvie ws
and p re - and Co o p e rative le arning strate g ie s can b e
p o stme asure s o f te aching le arne d e ffe ctive ly b y te ache rs thro ug h
p ractice s and te ache r lo ng -te rm inse rvice p ro g rams. The re
Talmag e , H.,
attitud e s we re use d . we re p o sitive e ffe cts o f te ache r
Pascare lla, E.T., Ye s.
St: 1) Stud e nts' e xp e rie nce with co o p e rative g ro up ing o n
& Fo rd , S. Te ache rs Incre asing te ache rs' skills in
p e rce p tio ns o f the ir stud e nt p e rce p tio ns o f co o p e ratio n.
(1984). and co nd ucting co o p e rative
Ye s. No t In classro o m le arning The re we re so me e ffe cts o n stud e nts'
Ame rican Q stud e nts le arning activitie s. 2-6
rand o m 107 e nviro nme nt we re re ad ing sco re s b ut no t fo r lang uag e arts.
Ed ucatio nal (e xce p t Duratio n: 3 ye ars. Training
o b taine d . So me amb ig uity still e xists in acco unting
5-38
Pre &
Autho r/s, Date , & Ex p / Co ntro l: Typ e o f Te ache r De p e nd e nt Me asure s: Find ing s
Po st: Pre / In-S Gr
Pub lishe r Quasi Ye s/No Training & Duratio n Te ache r (Tr) & Stud e nt (St)
Ye s/No
Tr: No ne
St: a) Io wa Te st o f Basic Skills was Exp e rime ntal stud e nts sco re d
ad ministe re d p o stte st. It was no t sig nificantly hig he r than co ntro ls o n the
re p o rte d whe the r this was use d at p o stte sts fo r re ad ing co mp re he nsio n,
p re te st. vo cab ulary, and to tal b atte ry sco re s. No
b ) Ob se rvatio ns: The last le sso n sig nificant d iffe re nce s we re fo und
Re se arch
Strate g y instructio n in a taug ht in e ach e xp e rime ntal and b e twe e n the two g ro up s' sco re s o n the
assistants
stud e nt-ce nte re d co ntro l class was vid e o tap e d and Eng lish g rammar p o stte st.
Blo ck, C. (1993). Ye s. Classe s we re
curriculum, i.e ., stud e nt rate d fo r le ve ls o f co mp re he nsio n On the b asis o f vid e o tap e d le sso n
Ele me ntary we re Ye s (fo r use d .
E cho ice o f o b je ctive s and 2-6 and thinking ab ilitie s se e n in o b se rvatio ns, rate rs ranke d stud e nts in
Scho o l Jo urnal, rand o mly stud e nts) No . no t
mate rials. Duratio n: 8 d iscussio ns. e xp e rime ntal classe s as "b e tte r thinke rs"
94(2), 139-151. assig ne d . re p o rte d .
mo nths. Training ho urs: c) Stud e nts' se lf-e ste e m, id e a than co ntro ls. Exp e rime ntal stud e nts d id
352
5-39
no t re p o rte d . g e ne ratio n, and re fle ctive thinking b e tte r than co ntro ls o n me asure s o f se lf-
stud e nts
ab ility we re asse sse d p re - and e ste e m, id e a g e ne ratio n, ab ility to
p o stte st. transfe r thinking skills to re al-life
d ) Re aso ning ab ility was situatio ns, re fle ctive thinking , re aso ning ,
me asure d using the Califo rnia and p ro b le mso lving .
State De p artme nt o f Ed ucatio n
State wid e Asse ssme nt Te st (1989).
Appendices
re ad ." 1-ye ar stud y. co mp re he nsio n and wo rd stud y skills.
(1995). e xisting 12 in Bro wn, Pre ssle y, Van Me te r, &
Training ho urs: no t Te ache rs b e lie ve d it incre ase d stud e nts'
Re ad ing Te ache r, classro o ms stud e nts Schud e r (1996), Jo urnal o f
re p o rte d . se lf-co nfid e nce and e njo yme nt as
49(3), 256-258. Ed ucatio nal Psycho lo g y, 88(1),
re ad e rs, imp ro ving inte ractio ns amo ng
18-37.
stud e nts d uring re ad ing . Te ache rs also
fo und it challe ng ing to te ach stud e nts to
use a re p e rto ire o f strate g ie s.
Reports of the Subgroups
ho urs.
Appendix D
Standards
The 1989 NCATE Approved Curriculum Guidelines 8. Programs include opportunities to study,
of the ACEI for the basic programs for the analyze, and practice effective models of
preparation of elementary education teachers include classroom management in campus and field-
the following standards: (Note that indicators are based settings and to engage in a gradual
provided for Standard 13, the standard dealing with increase in responsibility.
literacy.)
9. Programs should provide study and
1. Programs should provide teacher candidates experiences for critically selecting and using
with an understanding of the roles of materials, resources, and technology
elementary school teachers and the appropriate to the age, development level,
alternative patterns of elementary school cultural and linguistic backgrounds, and
organization. exceptionalities of students.
2. Programs should provide study and 10. Programs should provide for indepth study in
experience concerning the role of the at least one academic discipline by including
teaching profession in the dynamics of significant course work beyond the
curriculum change and school improvement. introductory level to reflect processes of
inquiry and research.
3. Programs should include study and
experiences, throughout the professional 11. Programs should develop understandings of
studies sequence that link child development positive health behaviors, movement skills,
to elementary school curriculum and and physical fitness to allow teacher
instruction. candidates to provide appropriate health
education and physical education experiences
4. Programs should develop the teacher
for students.
candidates’ capacities to organize and
implement instruction for students. 12. Programs should prepare teacher candidates
to become confident in their ability to do
5. Programs should include study and
mathematics and to create an environment in
application of a variety of developmentally
which students become confident learners
appropriate experiences that demonstrate
and doers of mathematics.
varied approaches to knowledge construction
and application in all disciplines. 13. Programs in the area of students’ literacy
development should be designed to help
6. Programs should include study and
teacher candidates create experiences for
application of current research findings about
their students in reading, writing, and oral
teaching and learning.
language. These programs should stress the
7. Programs should provide a well-planned integration of reading, writing, and oral
sequence of varied clinical/field experiences language with each other and with the
with students of different ages, cultural and content areas of the elementary school
linguistic backgrounds, and exceptionalities. curriculum.
These experiences should connect course
Program emphasis include study of and
content with elementary school practice.
experiences with:
13.1 The cognitive and linguistic 13.11 The literature of childhood including
foundations of literacy development (a) knowing a range of books, (b)
in students knowing how to share literature with
students, and (c) knowing how to
13.2 Ways of promoting vocabulary
guide students to respond to books in
growth in students
a variety of ways
13.3 The flexible use of a variety of
13.12 Promoting creative thinking and
strategies for recognizing words in
expression, as through storytelling,
print
drama, choral/oral reading,
13.4 Teaching of the conventions of imaginative writing, and the like.
language needed to compose and
14. Programs in science for teacher candidates
comprehend oral and written texts
should focus on academic, personal, social,
(e.g., text structure, punctuation,
and career applications of the biological,
spelling)
earth, and physical sciences and should
13.5 The strategies readers can use to develop skills in instruction to promote these
discover meaning from print and to understandings and positive attitudes among
monitor their own comprehension students and youth.
13.6 The ways listening, speaking, 15. Programs should prepare teacher candidates
reading, and writing relate to each to translate knowledge and data-gathering
other and to the rest of the processes from history and the social sciences
elementary curriculum into appropriate and meaningful social studies
experiences for students.
13.7 Identifying and developing
appropriate responses to differences 16. Programs should prepare teacher candidates
among language learners (e.g., to translate knowledge of and experience in
linguistic, sociocultural, intellectual, the visual and performing arts into
physical) appropriate experiences for students.
13.8 Communicating with parents The 1983 NCATE Approved Curriculum Guidelines
concerning the school language of the International Reading Association for advanced
program and developmentally programs in reading education follow in this report,
appropriate language experiences at but readers should be aware that IRA has published a
home 1998 revision of the standards for reading
professionals. The 1998 standards will be applied to
13.9 Speaking and writing that vary in programs of institutions currently seeking
form, subject, purpose, audience, accreditation or continuing accreditation.
point of view, tone, and style
Competencies required of candidates from those
13.10 Ways to promote reading, writing, institutions presently approved are the following:
and oral language for personal
growth, lifelong learning, enjoyment, 1. Philosophy of Reading Instruction: Reading
and insight into human experience is a complex, interactive, and constructive
process.
4.4 Believes that all students can learn 5.6 Understands the importance of
to read and share in the language development in relation to
communication process reading and writing.
4.5 Recognizes the importance of using 6. Knowledge of the Reading Process
reading in positive ways in the
6.1 Perceives reading as the process of
classroom
constructing meaning through the
4.6 Recognizes the value and importance interaction of the reader’s existing
of creating a supportive and positive knowledge, the information
environment for literacy learning suggested by the written language,
and the context of the reading
4.7 Recognizes the importance of giving
situation
learners opportunities in all aspects of
literacy as readers, authors, and 6.2 Is aware of relationships among
thinkers reading, writing, listening, and
speaking
4.8 Recognizes the importance of
implementing literacy programs 6.3 Has knowledge of emergent literacy
designed to meet the needs of and the kinds of experiences that
readers rather than imposing support literacy
prescribed, inflexible programs
6.4 Is aware that reading develops best
4.9 Recognizes the importance of through activities that embrace
building on the strengths of concepts about the purpose and
individual learners rather than function of reading and writing and
emphasizing weaknesses. the conventions of print
5. Language, Development, Cognition, and 6.5 Understands the role of models of
Learning thought that operate in the reading
process
5.1 Understands that language is a
symbolic system 6.6 Is able to explain the model various
word recognition, vocabulary, and
5.2 Understands major theories of
comprehension strategies used by
language development, cognition, and
fluent readers
learning and uses them to implement
a well-planned and comprehensive 6.7 Understands the role of
reading program metacognition in reading
5.3 Is aware of the linguistic, 6.8 Has knowledge of the importance for
sociological, cultural, cognitive, and reading in language development;
psychological bases of the reading listening ability; cognitive, social, and
process emotional development; and
perceptual motor abilities
5.4 Is aware of the physical, emotional,
social, cultural, environmental, and 6.9 Understands the nature and multiple
intellectual factors on learning, causes of reading disabilities
language development, and reading
6.10 Understands the relationship of
5.5 Understands dialect variations and phonemic, morphemic, semantic,
respects linguistic differences and syntactic systems of language to
the reading process.
10.1 Understands how to design, select, 12.5 Teaches word recognition through
modify, and evaluate materials that the use of context, word analysis,
reflect curriculum goals, current and syntactic cueing strategies
knowledge, and the interests,
12.6 Helps students learn that word
motivation, and needs of individual
recognition strategies aid
learners
comprehension
10.2 Understands the structure and
12.7 Helps students learn effective
content of various texts used for
techniques and strategies for the
instruction
ongoing development of vocabulary
10.3 Understands and uses new
12.8 Helps students analyze information
instructional technologies
presented in a variety of texts
10.4 Understands methods for
12.9 Helps students connect prior
determining whether materials are
knowledge with new information
clear and appropriate for individual
students. 12.10 Assists students in assuming control
of their reading
11. Knowledge of Instructional Strategies—
Teaching Strategies 12.11 Helps students use new technology
and media effectively.
11.1 Provides direct instruction and
models what, when, and how to use 13. Demonstrate Knowledge of Assessment
reading strategies with narrative and Principles and Techniques
expository texts
13.1 Recognizes assessment as an
11.2 Models questioning strategies ongoing and indispensable part of
reflective teaching and learning
11.3 Employees strategies to encourage
and motivate students to pursue and 13.2 Recognizes and understands that
respond to reading and writing for assessment must take into account
personal growth and fulfillment the complex nature of reading,
writing, and language and must be
11.4 Teaches effective study strategies
based on a range of authentic
12. Learning Strategies literacy tasks using a variety of texts
12.1 Helps students learn and apply 13.3 Is able to conduct assessment that
comprehension strategies for a involves a consideration of multiple
variety of purposes indicators of learner progress and
that takes into account the context of
12.2 Helps students monitor their
teaching and learning
comprehension and reading
processes 13.4 Is knowledgeable about the
characteristics and appropriate
12.3 Understands and helps students learn
applications of widely used and
and apply reading comprehension
evolving assessment approaches
strategies in the content areas
13.5 Uses information from norm-
12.4 Helps students gain understanding of
referenced tests, criterion-referenced
the conventions of language and
tests, formal and informal
literacy
inventories, constructed-response
measures, portfolio-based
Background Methodology
Despite the current intense interest in computer A database had previously been developed on this topic.
technology, there has been relatively little systematic That database covered the period from 1986 to 1996
research into problems of involving computers or other and included all the studies on technology and literacy
technologies. Several factors seem responsible for the (e.g., writing as well as reading). Because this database
limited research on computers in literacy contexts. First, had been developed by a combination of electronic and
many reading researchers did not and do not consider hand searching all of the journals, it was deemed
technology to be a mainstream topic. That is, they often expedient to use the database and update it with more
believe that reading instruction can be delivered only by recent work. Only those studies that met the criteria of
a human. Others believe that technology must be the National Reading Panel were included.
considered in the overall context of reading instruction.
Those in the latter category believe that other problems Results and Discussion
in reading instruction should be attended to before issues There is a small body of research on the problems of
of technology are addressed. These general impressions computer technology and reading. The work that met
are reinforced by some of the factors described in the the National Reading Panel requirements is substantially
following paragraphs. smaller. Many of the research studies that have been
Until recently, computers did not have all (or even most) published are explorations of capabilities of computers,
of the capabilities that were needed to implement a comparisons of computers with traditional tasks, word
complete program of reading instruction. A primary lack processing, and learning. Very few of these studies
among these capabilities was the inability to comprehend directly examined the effects of using computer
oral reading and judge its accuracy. Another lack was technology for reading instruction.
the inability of computers to accept free-form responses A total of 21 studies was found, representing
to comprehension questions, leading to reliance solely experimental manipulations of problems across the entire
on recognition tests such as multiple-choice formats. spectrum of reading instruction. As a first step to further
The situation is currently very different, with most new analysis, the problems addressed by these studies were
computers capable of speech recognition, as well as a categorized. The largest group of studies (six) included
those that studied the addition of speech to computer-
presented text. There were two studies that examined
the effects of vocabulary instruction, two more that
looked at word recognition instruction, and two that Implications for Reading Instruction
investigated comprehension instruction, broadly defined.
One study examined spelling, and two studies examined Although the Panel is encouraged at the reported
the effects of broad programs on learning to read. These successes in using computer technology for reading
last studies looked at the delivery of reading instruction instruction, there are relatively few specific instructional
by comprehensive software that covered many, if not applications to be gleaned from the research. It is clear
most, elements of reading instruction. that some students can benefit from the use of
computer technology in reading instruction. In particular,
Conclusions studies on the addition of speech to print suggest that
this may be a promising alternative, especially given the
It is extremely difficult to make specific instructional
powerful multimedia computers now available and those
conclusions based on the small sample of experimental
being developed. In addition, the use of hypertext and
research available. One conclusion is that it is possible
word processing appear to hold promise for application
to use computer technology for reading instruction. All
to reading instruction.
the studies in the analysis report positive results. The six
studies that examined the addition of speech to print Directions for Further Research
presented on computers suggest that this may be a
promising alternative, particularly in light of the The reported successes to date in using computer
powerful multimedia computers now available. technology for reading instruction indicate that this is an
area that needs a great deal of additional exploration.
There are two other trends that should be examined
There are many questions that still need to be
more systematically. A small, but growing, body of
addressed and many areas in which research does not
research examines the use of hypertext in learning
exist. Particularly striking in its absence is research on
environments. Although this is technically not reading
Internet applications as they might be incorporated in
instruction, it is possible that hypertext might be used in
reading instruction. Another area is the use of computer
instructional contexts to some advantage.
technology to perform speech recognition. Although
A second area outside the scope of the current review great strides have been made in this technology, there
is that of using computers for word processing. Given have been no recent studies of speech recognition
that instruction in reading is most efficacious when applied to reading instruction, despite its increasing use.
combined with writing instruction, the use of word Finally, the issue of multimedia presentations has not
processing has the potential to make reading instruction been addressed in the context of reading instruction.
more effective. There are many questions that remain about the
efficacy of multimedia incorporated in reading
instruction.
Introduction Second, for much of the time since the initial reports of
computerized reading instruction, computers did not
Reading is based on the technology of writing and have all (or even most) of the capabilities that were
printing. The history of reading instruction reflects a needed to implement a complete program of reading
recurrent interest in the application of other instruction. A primary lack among these capabilities
technologies (Kamil & Lane, 1998; Kamil, Intrator, & was the inability to comprehend oral reading and judge
Lane, 2000). (The “other technologies” include, for its accuracy. Another lack was the inability of computers
example, reading pacers, tachistoscopes, and to accept free-form responses to comprehension
television.) The use of computers in reading instruction questions, leading to sole reliance on recognition tests
dates only to the mid-1960s with the work of Suppes, like multiple choice formats.
Atkinson, and their colleagues. For example, Atkinson
and Hansen (1966-1967) published the first report of Lack of those capabilities meant that computer
the use of computers in teaching reading. This technology often was considered useful only as a
pioneering work in demonstrating the efficacy of using supplement to conventional instruction, rather than as a
computers to deliver reading instruction set the way for primary delivery system. As a supplemental device, at
much of the subsequent research. Although there were best, it occupied a less prominent position in the problem
debates about whether or not they were really teaching space of reading researchers. Indeed, because
reading at the time (Spache, 1968-1969; Atkinson, computer software was relatively incapable of speech
1968-1969), such public debates no longer seem to recognition or text comprehension, there were only a
exist. few activities that the computer seemed to be capable
of handling independently. At least in the early history
Despite the current intense interest in computer of computers and reading, this was reflected in the
technology, there has been relatively little systematic translation of things like paper and pencil worksheets to
research in problems of involving computers or other the computer screen. The situation is currently very
technologies. Kamil and Intrator (1998) conducted an different, with most new computers capable of speech
extensive review of the research in literacy and recognition, as well as a host of multimedia presentation
technology and found that between 1986 and 1996 there capabilities. Artificial intelligence is beginning to make
were only 350 published research journal articles that inroads into software for instruction, and systems for
reported investigations of reading and writing. The text comprehension are fairly sophisticated, even on
yearly proportion of these technology studies was home computers.
relatively constant over that time period, ranging from
2% to 5% of the total of all research articles on reading A third consideration in the history of computers and
and writing. These totals reflect all research on reading has been the cost factor. With the introduction
computers and other technologies, not simply of microcomputers, the steady decline in prices,
instructional research. accompanied by a steady increase in capabilities, has
produced computers that cost only a few hundred
Several factors seem responsible for the limited dollars. These machines can easily outperform the
research on computers in literacy contexts. First, many machines of a decade ago. Most new computers are
reading researchers did not and do not consider capable of presenting audio and video, controlling
technology to be a mainstream topic. That is, many external devices, and being expanded. They have
believe that reading instruction can be delivered only by substantial amounts of memory and a great deal of
a human. Others believe that technology must be external storage capacity. In addition, there are low-
considered in the overall context of reading instruction; cost printers, scanners, cameras, and a host of other
they believe that other problems in reading instruction peripherals that can be attached, typically for far less
should be attended to before issues of technology. even a few years ago. All of these make unbelievable
These general impressions were reinforced by some of some of the original predictions that computers would
the factors described in the following paragraphs. never be cost effective in classrooms.
Finally, there was often resistance to the idea that a A review of the research on computer technology and
computer could deliver reading instruction. One reading was undertaken to document the trends in this
important reason for this simply seems to be the age-old area. To accomplish this task, the first step was to
debate about whether teaching reading is an art or a interrogate both the ERIC and PsycINFO databases.
science. Software, to match teacher performance, must Any journal research article that matched the
be adaptable to a very broad range of responses from descriptors of technology, computers, reading, writing,
learners. It must be able to analyze responses to or literacy was listed.
questions, separating correct from incorrect; respond to
Queries were generated in the form of SUBJECT
idiosyncratic responses in appropriate ways; and bring
READ# and SINCE 1986 not YEAR = 1996 and
multiple methods to bear on pedagogical contexts.
DOCTYPE = research and DOCTYPE = journal
Computers are still unable to do many of these activities
article and S = technology. Other queries were
today, despite advances in hardware and software. This
composed to cover similar topics in reading, writing,
problem is not limited to the use of computers to deliver
speaking, listening, and literacy. Both “technology” and
reading instruction. It is endemic to much current
“computer” were used as qualifiers. The Panel decided
software. Despite dramatic developments in learning
that single descriptors would yield a more liberal
theory and software design, this seems to be the most
sampling of articles, even though such a procedure
serious impediment to progress. The rapid pace of
yields more “false positives.” (For example, using
technological innovation in both hardware and software,
literacy as a descriptor yielded many studies of science
however, suggests that this issue is being addressed.
literacy and computer literacy that did not deal with
At the same time, a different sort of development has reading or writing.) These queries yielded a total of 965
caused a renewed interest in computer technology. The articles in 159 different journals.
development of the Internet and the linking of schools
In a preliminary hand search of the journals, evidence
and school computers to it have combined to provide a
was found that there were articles that did not appear in
new interest in computer usage. The kinds of
the ERIC or PsycINFO databases. Consequently, each
information resources available have provided a stimulus
of the 159 journals was hand-searched for relevant
for renewed efforts to deliver instruction of all sorts,
articles that were missed or missing from the database
including reading, by computer. Coupled with the much
interrogation. In addition, many of the articles in the
greater capability of computers and major advances in
original set did not meet the criteria of true research
software, use of the Internet has led to a dramatic new
reports about literacy and technology. For example,
wave of interest in using computers in reading
some of the articles were mere speculation; others
instruction.
were about computer literacy rather than reading. Still
The current review was undertaken to examine the others were commentaries arguing for or against the
research that used computers to deliver reading efficacy of technology interventions in literacy learning.
instruction in an effort to determine what the results are, The Panel applied a simple criterion to include or
what the potential is, and what questions remain. exclude articles. To be included, an article had to deal
with the areas laid out above and had to be based on an
Methodology empirical data collection. However, reviews of research
studies based on empirical studies were included.
Database
Because the original search was conducted prior to the
The Technology Subcommittee began its task by using a end of 1996, the Panel included any available 1996
database that was assembled by Kamil and Intrator issues of journals in the hand search.
(1997). This database was deemed an appropriate
Subsequent additions to the database were made by
starting point because it was assembled by a
queries of the INSPEC database and hand-searching
combination of electronic searches and exhaustive hand
similar to that described above. This yielded an ultimate
searches of all the journals that appeared in the electronic
pool of 350 studies. Information on all relevant articles
searches. The following paragraphs describe the methods
was entered into a Filemaker Pro database. Each
used in the creation of the database in that study.
article was assigned a value for number of pages, Studies that were about word processing were not
literacy type, technology type, subject population, considered further, because many or most of these did
special population characteristics, problem, platform, not involve any connection with reading. Finally, studies
methodology, findings, recommendations, and quality. of special populations, non-native speakers of English,
or adult readers were deemed inappropriate for further
For the present analysis, the studies in the database
analysis. (A number of studies dealt with learning to
were filtered to identify a subset of experimental or
read in a second language, for example, and fell outside
quasi-experimental instructional studies. Of the 350
the scope of the charge to the National Reading Panel.)
studies, a total of 92 investigated reading using
This produced a final pool of only 21 studies.
experimental or quasi-experimental methods. Of the 92,
only 47 studies were instructional. Studies that merely Analysis
compared computerized versions of a task with
conventional versions were not counted as instructional. The 21 studies represent experimental manipulations of
Studies that merely examined effects of the computer, problems across the entire spectrum of reading. As a
unless attended by some instructional intervention, were first step to further analysis, the problems addressed by
also eliminated from the pool. What this last criterion these 21 studies were categorized. This procedure
did was to remove a few studies that simply translated ultimately yielded seven categories. The largest group
existing materials for use in a computer presentation. of studies (six) studied the addition of speech to
Studies that did not deal with computer technologies computer-presented text. There were two studies that
were also eliminated. (In the original database, for examined the effects of vocabulary instruction, two
example, there were studies that examined instructional more that looked at word recognition instruction, and
uses of television.) two that investigated comprehension instruction, broadly
defined. One study examined spelling, and two studies
examined the effects of broad programs on learning to
read. The last studies looked at the delivery of reading
instruction by comprehensive software that covered
5
Number of Studies
0
W o rd R eco g n itio n V o cab u lary S p eech C o m p reh en sio n S p ellin g G en eral O th er
Pro b lem S tu d ied
many, if not most, elements of reading instruction. One Reading Panel. Even though these numbers are low,
study examined the learning of picture-word they are in agreement with the conclusions of Kamil
relationships and was not classified. Figure 1 presents and Intrator (1997), Kamil and Lane (1998), and Kamil,
these data graphically. Intrator, and Kim (2000) that there has been a dearth of
research on problems in technology and literacy.
Consistency With NRP Methods According to Kamil and Intrator (1997), there was no
Meta-analysis was judged inappropriate because the 21 significant increase in published research on technology
studies were spread across the entire spectrum of and literacy over the time span from 1986 to 1996.
variables and across populations ranging from preschool
to high school. The distribution of the final pool of
studies as a function of grade levels is included as
6
Number of Studies
0
P re K P rim ary E le m e n t ary M id d le S c h o o l H ig h S c h o o l
G rad e L e v e l
Finally, the NRP looks to the promise of hypertext as an • Word processing is a useful addition to
application for the future. A small, but steadily reading instruction.
increasing, cluster of studies points the way toward
potentially important applications revolving around A very large portion of the database involved studies of
hypertext and hypermedia. There were 12 studies that word processing. Because writing is often part of
involved hypertext in the assembled database, but they reading instruction, the findings concerning word
do not adequately reflect the growing interest in the processing are relevant, even though the studies fell
topic. Many of the studies do not meet the experimental outside the criteria for analysis. Word processing has
or instructional criteria, but they will provide important many benefits for writing, particularly in its close match
data as researchers and practitioners conceptualize new with process writing approaches. Although the
ways to apply hypermedia in reading and learning to implication has not been experimentally tested (in terms
read. It will be those applications that must be of its effect on reading instruction), this seems likely to
researched and validated for use in reading instruction. occur in the future. One implication seems to be that
Hypertext and hypermedia may also involve developing word processing alone is unlikely to make a difference;
new modes of instruction for students to use them it must be embedded in other instruction.
effectively. What is most exciting about this trend is that • Multimedia computer software can be
it represents truly new ways of applying computer used for reading instruction.
technology to reading and reading instruction.
There are many unanswered questions about the
Implications for Reading Instruction efficacy of multimedia learning. All of the conditions
under which multimedia learning is more effective than
There are few implications for practice that can be conventional learning are not known. However, there
drawn from the small set of instructional studies in the appear to be many students who benefit from the
database. What is important is that there are uses for addition of multimedia instruction to a conventional
the computer that do impinge on reading instruction. curriculum. One example that was tested in several
The following is an attempt to draw some of these studies was the addition of speech (computerized or
implications, with the caveat that the implications are not) to the instructional context. When multimedia
clearly tentative and need to be verified by continued software is available and appropriate, it should be
research. exploited.
• Computers can be used for some • Computers do have a motivational use
reading instructional tasks. in reading instruction.
Although there are only a few experimental studies that Although there were no experimental instructional
are relevant to this point, they do report successes. studies that supported this implication explicitly, the
What is clear is that as computer software becomes motivational aspects of computers should not be
more capable, the opportunities for computers to be overlooked. This effect may diminish as computers
used in reading instruction will expand. become ever more common. For the time being, they
At the very least, computers can provide opportunities still retain some motivational advantage over
for students to interact instructionally with text for conventional instruction.
greater amounts of time than they can if only • Hypertext has a great deal of potential
conventional instruction is provided. Although there was in reading instruction.
no research that provided a general rule for determining
what works, careful selections from available software There is a growing interest in hypertext because of its
can provide additional instructional assistance in potential to allow the reader to control some of the
classrooms. Although there is a publication bias to presentation of text, determining what to read at various
report only positive differences, there were no junctures in the text. Another potential is the use of
instructional studies in which the computer did not hypertext to assist the reader who is having difficulty
provide a significant addition to the instructional context. with a passage. Despite the fact that there were no
experimental instructional studies on this topic that met
the NRP criteria, the application of hypertext concepts One of the most striking findings of this analysis is that
to reading and reading instruction seems to have a great there is a surprising lack of published research. For
deal of potential. The use of hypertext and hypermedia whatever reason, the volume of published research has
on the Internet almost mandates the need to address this not kept pace with the interest in computer technology.
issue in reading instruction. In the meantime, hypertext, Research is urgently needed to answer these and other
particularly coupled with Internet access, seems to have questions that will affect the penetration of computer
been adopted in many classrooms, regardless of the lack technology into conventional reading instruction.
of research.
1. What is the proper role for integration of computers
Directions for Further Research in reading instruction? In what contexts can they be
used to either replace or supplement conventional
It is inappropriate to separate the applications of instruction?
computer technologies to reading instruction into a set
of issues about technology and a set about reading. The 2. What are the conditions under which multimedia
Panel believes that technology is not a problem to be presentation is useful or desirable in reading text?
studied in and of itself. The problem is, rather, how the 3. What are the requisite characteristics of software to
technology is applied to specific problems in reading teach reading?
instruction. To that end, the following questions are
offered as among the most important ones to be 4. What is the appropriate mix of reading and writing
answered by future research. They are neither simple instruction delivered by computer?
nor easily answered. Answering them will involve 5. How can professional development programs be
issues as complex as professional development for structured to help teachers effectively integrate
teachers and as simple as the utility of drill and practice computer solutions with instruction?
exercises.
6. How are the effects of computer usage in pedagogy
Research on these topics needs to be relatively most effectively measured? Do conventional
independent of specific computer platforms and assessments measure all of the learning that takes
software because the rapidity of innovation makes place in computer environments?
specific choices obsolete in short time periods. One
argument for not conducting more research has been 7. What is the utility of hypertext in instructional
that the technology outpaces the research. However, contexts?
not all of the important questions are dependent on the
8. How can Internet resources be incorporated in
state of technological innovation.
reading instruction?
The Panel believes that the following list of questions
represents relatively short-term needs for today and Overall Conclusions
shortly beyond the horizon of current development. The current analysis has found general agreement in the
Some effort should be directed at conceptualizing new experimental literature that computer technology can be
uses for computer technology—uses that will augment used to deliver a variety of types of reading instruction
conventional reading instruction in beneficial ways. The successfully. There has been relatively little research in
list does not include questions that may become this important area and, consequently, many unanswered
important in the future—such as the role of literacy in a questions remain.
much more graphically oriented world. These may not
be researchable, but the implications of these The rapid development of capabilities of computer
developments need to be systematically explored by technology, particularly in speech recognition and
research. multimedia presentations, promises even more
successful applications in literacy for the future. To be
certain that these new developments are incorporated in
instruction as efficiently as possible, it is important that
research be initiated to answer the questions that have
not been addressed to date.
Reinking, D. (1988). Computer-mediated text and Tobias, S. (1988). Teaching strategic text review by
comprehension differences: The role of reading time, computer and interaction with student characteristics.
reader preference, and estimation of learning. Reading Computers in Human Behavior, 4, 299-310.
Research Quarterly, 23, 484-498.
Weber, W., & Henderson, E. (1989). A computer-
Reinking, D., & Rickman, S. S. (1990). The effects based program of word study: Effects on reading and
of computer-mediated texts on the vocabulary learning spelling. Reading Psychology, 10, 157-171.
and comprehension of intermediate grade readers.
Journal of Reading Behavior, 22, 395-411. Wise, B. (1992). Whole words and decoding for
short-term learning: Comparisons on a talking-computer
Reitsma, P. (1988). Reading practice for beginners: system. Journal of Experimental Child Psychology, 54,
Effects of guided reading, reading-while-listening, and 147-167.
independent reading with computer-based speech
feedback. Reading Research Quarterly, 23, 219-235. Wise, B., Olson, R., & Treiman, R. (1990).
Subsyllabic units in computerized reading instruction:
Roth, S., & Beck, I. (1987). Theoretical and Onset-rime vs. postvowel segmentation. Journal of
instructional implications of the assessment of two Experimental Child Psychology, 49, 1-19.
microcomputer word recognition programs. Reading
Research Quarterly, 22, 197-218.
Appendix A
Studies Analyzed
Barker, A. B., & Torgeson, J. K. (1995). An Gore, D., Morrison, G., Maas, M., & Anderson, E.
evaluation of computer-assisted instruction in (1989). A study of teaching reading skills to the young
phonological awareness with below average readers. child using microcomputer assisted instruction. Journal
Journal of Educational Computing Research, 13, 89-103. of Educational Computing Research, 5, 179-185.
Bass, G., Ries, R., & Sharpe, W. (1986). Teaching Icabone, D., & Hannaford, A. (1986). Comparison
basic skills through microcomputer assisted instruction. of two methods of teaching unknown reading words to
Journal of Educational Computing Research, 2, 207- fourth graders: Microcomputer and tutor. Educational
219. Technology, 26, 36-39.
Chambless, J., & Chambless, M. (1994). The Kolich, E. (1991). Effects of computer-assisted
impact of instructional technology on reading/writing vocabulary training on word knowledge. Journal of
skills of 2nd grade students. Reading Improvement, 31, Educational Research, 84, 177-182.
151-155.
Norton, P., & Resta, V. (1986). Investigating the
Chang, L., & Osguthorpe, R. (1990). The effects of impact of computer instruction on elementary students’
computerized picture-word processing on reading achievement. Educational Technology, 26, 35-
kindergartners’ language development. Journal of 41.
Research in Childhood Education, 5, 73-84.
Olson, R. K., Foltz, G., & Wise, B. W. (1986).
Davidson, J. (1994). The evaluation of computer- Reading instruction and remediation with the aid of
delivered natural speech in the teaching of reading. computer speech. Behavior Research Methods,
Computers and Education, 22, 181-185. Instruments, and Computers, 18, 93-99.
Davidson, J., Coles, D., Noyes, P., & Terrell, C. Reinking, D. (1988). Computer-mediated text and
(1991). Using computer-delivered natural speech to comprehension differences: The role of reading time,
assist in the teaching of reading. British Journal of reader preference, and estimation of learning. Reading
Educational Technology, 22, 110-118. Research Quarterly, 23, 484-498.
Davidson, J., Elcock, J., & Noyes, P. (1996). A Reinking, D., & Rickman, S. S. (1990). The effects
preliminary study of the effect of computer-assisted of computer-mediated texts on the vocabulary learning
practice on reading attainment. Journal of Research in and comprehension of intermediate grade readers.
Reading, 19, 102-110. Journal of Reading Behavior, 22, 395-411.
Gillingham, M., Garner, R., Guthrie, J., & Sawyer, Reitsma, P. (1988). Reading practice for beginners:
R. (1989). Children’s control of computer-based reading Effects of guided reading, reading-while-listening, and
assistance in answering synthesis questions. Computers independent reading with computer-based speech
in Human Behavior, 5, 61-75. feedback. Reading Research Quarterly, 23, 219-235.
Roth, S., & Beck, I. (1987). Theoretical and Wise, B. (1992). Whole words and decoding for
instructional implications of the assessment of two short-term learning: comparisons on a talking-computer
microcomputer word recognition programs. Reading system. Journal of Experimental Child Psychology, 54,
Research Quarterly, 22, 197-218. 147-167.
Tobias, S. (1988). Teaching strategic text review by Wise, B., Olson, R., & Treiman, R. (1990).
computer and interaction with student characteristics. Subsyllabic units in computerized reading instruction:
Computers in Human Behavior, 4, 299-310. Onset-rime vs. postvowel segmentation. Journal of
Experimental Child Psychology, 49, 1-19.
Weber, W., & Henderson, E. (1989). A computer-
based program of word study: Effects on reading and
spelling. Reading Psychology, 10, 157-171.
MINORITY VIEW
Joanne Yatvin, Ph.D.
Oregon Trail School District, Sandy, Oregon
The charge from Congress to the National Reading 7. What important gaps remain in our knowledge of
Panel (NRP) was to “assess the status of research- how children learn to read, the effectiveness of
based knowledge, including the effectiveness of various different instructional methods for teaching reading,
approaches to teaching children to read.” In explicating and improving the preparation of teachers in
that charge, the National Institute of Child Health and reading instruction that could be addressed by
Development (NICHD), which convened the Panel, additional research?
listed seven questions for the Panel to address. They
From this charge, it seems reasonable to infer that
were:
Congress’s goal was to settle the “Reading Wars,”
1. What is known about the basic processes by which putting an end to the inflated rhetoric, partisan lobbying,
children learn to read? and uninformed decisionmaking that have been so
widespread and so detrimental to the progress of
2. What are the most common instructional
reading instruction in America’s schools. Clearly, the
approaches in the United States to teach children to
main thrust of the charge is toward determining which
learn to read? What are the scientific underpinnings
of the many teaching methods used in schools, and
for each of these methodologic approaches, and
promoted by advocates, really work best.
what assessments have been done to validate their
underlying scientific rationale? What conclusions Whether a review of the existing reading research
about the scientific basis for these approaches does literature could have provided answers to all of
the Panel draw from these assessments? Congress’s questions, the Panel’s obligation was to dig
in and find out. I am filing this minority report because I
3. What assessments have been made of the
believe that the Panel has not fulfilled that obligation.
effectiveness of each of these methodologies in
From the beginning, the Panel chose to conceptualize
actual use in helping children develop critical
and review the field narrowly, in accordance with the
reading skills, and what conclusions does the Panel
philosophical orientation and the research interests of
draw from these assessments?
the majority of its members. At its first meeting in the
4. Based on the answers to the preceding questions, spring of 1998, the Panel quickly decided to examine
what does the Panel conclude about the readiness research in three areas: alphabetics, comprehension,
for implementation in the classroom of these and fluency, thereby excluding any inquiry into the fields
research results? of language and literature. After some debate, members
agreed to expand their investigations to two other areas:
5. How are teachers trained to teach children to read, computer-linked instruction and teacher preparation.
and what do studies show about the effectiveness Five subcommittees were formed, and within the
of this training? How can this knowledge be applied chosen areas, each selected a number of topics of
to improve this training? interest. As work on the initial choices of topics
6. What practical findings from the Panel can be used proceeded, however, it became apparent that the Panel
immediately by parents, teachers, and other had insufficient time and support personnel to cover all
educational audiences to help children learn to read, it had identified. Ultimately, the Panel subgroups
and how can the conclusions of the Panel be produced reviews of the research on the following
disseminated most effectively? topics: phonemic awareness, phonics, fluency,
comprehension strategies, vocabulary development,
1
Minority View
computer technology and reading instruction, teacher schools and parents. (In order to be specific about
preparation in general, and teacher preparation to teach topics the panel did not cover, I have included two lists
comprehension strategies. In addition, the Panel in Appendix B.) The panel chose not to pursue any of
developed a set of criteria and procedures for these approaches.
evaluating reading studies, which all subgroups used and
Furthermore, to have fully answered its charge, the
which the Panel hopes will serve as future guidelines
Panel needed to assess the implications for practice
for other researchers.
growing out of research findings. As a body made up
These reviews show comprehensive and painstaking mostly of university professors, however, its members
work by the subcommittees. They will prove valuable, I were not qualified to be the sole judges of the
think, to other experimental researchers as they seek to “readiness for implementation in the classroom” of their
expand the body of knowledge on those topics and fill in findings or whether the findings could be “used
the gaps. On the other hand, the reviews are of limited immediately by parents, teachers, and other educational
usefulness to teachers, administrators, and policymakers audiences.” Their concern, as scientists, was whether
because they fail to address the key issues that have or not a particular line of instruction was clearly enough
made elementary schools both a battleground for defined and whether the evidence of its experimental
advocates of opposing philosophies and a prey for success was strong. What they did not consider in most
purveyors of “quick fixes.” And, unfortunately, the cases were the school and classroom realities that
reviews are of even less use to parents because they make some types of instruction difficult—even
do not touch on early learning and home support for impossible—to implement. Outside teacher reviewers
literacy, matters which many experts believe are the should have been brought in to critique the Panel’s
critical determinants of school success or failure. conclusions, just as outside scientists were to critique its
processes. Despite repeated suggestions that this be
To have properly answered its charge, the Panel had
done, it was not.
to look at the field of reading both horizontally and
vertically, examining the basic theoretical models of In fairness to the Panel, it must be recognized that the
reading, the methods that grow out of them, and the charge from Congress was too demanding to be
processes of learning that begin in infancy and continue accomplished by a small body of unpaid volunteers,
through young adulthood. (See Appendix A for working part time, without staff support, over a period
definitions and descriptions of the three models of a year and a half. (The time Congress originally
underlying methods of instruction in American schools allotted was only 6 months.)
today.) The scientific basis for each of these models
Congress did not realize—and the Panel itself did not
needed to be examined, then the effectiveness of the
fully comprehend at the beginning of its labors—how
methods they have generated. The research on
large, uneven, and intractable the field of reading
language development, pre-reading literary knowledge,
research really is. The Panel’s preliminary electronic
understanding of the conventions of print, and all the
searches of databases uncovered thousands of articles
other experiences that prepare young children to learn
on some topics, hundreds on others, only a handful on
to read also demanded the Panel’s attention. And
some. Their completed reviews on several topics
finally, the changing needs and strategies of adolescent
disclosed that the critical question of generalizability
readers called for a review of the existing research.
(i.e., Does a skill or strategy taught and learned carry
If the Panel could not cover the whole field—as, in fact, over to new experiences?) often was not answered by
it could not because of time and resource limitations—it researchers. The reviews show, in addition, that
should have concentrated on topics of highest interest questions relevant to the success of an instructional
and controversy in the public arena. Or, as technique, such as “how much” to teach and “when,”
professionally distasteful as the task might have been, it were not even examined in most studies.
should have assessed the validity of the claims of
various commercial programs being sold as cure-alls to
2
Minority View
Also in fairness to the Panel, I must acknowledge that a of research on some of the topics are inconclusive.
few of the topics I have identified as neglected are They will not hear the Panel’s calls for more and more
included in some of the reports. Still, they receive only fine-tuned research. Ironically, the report that Congress
peripheral attention when public interest demands much intended to be a boon to the teaching of reading will
more. In the review on phonemic awareness, for turn out to be a further detriment.
example, the critical question of whether all children
As an educator with more than 40 years of experience
need special training in phonemic awareness was not
and as the only member of the NRP who has lived a
addressed, even though several studies suggest that
career in elementary schools, I call upon Congress to
many children grasp the concept and are able to apply it
recognize that the Panel’s majority report does not
through ordinary reading instruction. Other topics of
respond to its charge nor meet the needs of America’s
interest, such as students’ need for “direct instruction,”
schools. In spite of the Panel’s diligent efforts and its
appear in reviews only as assumptions about successful
valuable findings on a select number of instructional
practices, but are never tested against their
practices, we still cannot answer the first and most
philosophical opposites.
central question of the charge: “What is known about
In the end, the work of the NRP is not of poor quality; it the basic processes by which children learn to read.”
is just unbalanced and, to some extent, irrelevant. But We still do not know what types of instruction are
because of these deficiencies, bad things will happen. suitable for different ages and populations of children.
Summaries of, and sound bites about, the Panel’s We still do not know the relative effectiveness of the
findings will be used to make policy decisions at the three models of reading as bases for instruction. We do
national, state, and local levels. Topics that were never not even know whether the existing body of research
investigated will be misconstrued as failed practices. can answer those questions. Therefore, I ask Congress
Unanswered questions will be assumed to have been not to take actions that will promote one philosophical
answered negatively. Unfortunately, most policymakers view of reading or constrain future research in the field
and ordinary citizens will not read the full reviews. They on the basis of the Panel’s limited and narrow set of
will not see the Panel’s explanations about why so few findings.
topics were investigated or its judgments that the results
3
Minority View
Appendix A
Definitions
Word Identification Model of Reading Reader motivation is a part of this model, but it is seen
The word identification model hypothesizes that readers mostly as an external factor: What must the teacher do
read by matching letters to sounds, then blending sounds to move children to read this story and do the
into pronounceable words. In asserting that children accompanying activities?
who have mastered the skills of decoding “can read
anything,” it separates word pronunciation from word Integration of Language and
understanding and defines the former as reading. Thinking Model of Reading
Instructional materials evade the issue by using mostly According to this model, children begin acquiring the
decodable words in stories that reflect familiar life knowledge and skills needed for reading long before
experiences of children and have only literal meanings. they face the challenge of decoding print. Even at the
earliest stages of reading, they are able to use what
Although proponents of this model recognize that they know about language, literature, and the world to
readers need vocabulary knowledge and skills of perform multiple operations in dealing with a text.
analysis and interpretation to understand advanced and Reading means not only recognizing words and knowing
specialized materials, they believe that the job of their meanings, but also understanding how they fit into
developing those skills properly belongs in subject a context of grammatical structure, speech phrasing and
matter classes. Getting students to understand the main intonation, literary forms and devices, and print
idea of a short story, for example, is the business of the conventions.
literature teacher, not the reading teacher, and is better
left to middle and high school grades. Because readers bring their own skills and knowledge
to any text, and because written language is redundant,
This model does not consider the factor of reader they are able to orchestrate their own reading
motivation. At all levels the reader is viewed as a experiences. When one skill or knowledge source is
passive recipient of content. Children should learn to weak in relation to a particular text, such as life
read because adults want them to. They should experience would be in reading about the history of a
remember the facts in a text and accept the teacher’s foreign country, stronger skills, such as vocabulary, may
interpretation of meaning. Because of these beliefs, carry the reader through. In this model, learning to read
there are few attempts to make reading an interesting and reading to learn are inseparable.
or rewarding experience for children.
Although this model also recognizes the need for reader
Word Identification Plus Skills Model of Reading strategies in dealing with more difficult texts, it views
In this model, learning to read is a two-tier process. The strategies as the products of individual needs and
first tier is very much like that of the previous model, purposes, sometimes devised by the reader and
except that it defines reading as understanding words as sometimes prompted or provided by others at the point
well as pronouncing them. Children are able to read of need. Motivation, then, needs to be intrinsic. The
sentences, paragraphs, and whole texts by stringing teacher’s job is to create or allow situations where
together the pronunciations and meanings of individual children want to read and are willing to work hard at it.
words.
Learning to read in this model involves “others” in many
The second tier of the process is “reading to learn.” As ways. Readers expand their vocabularies and
readers gain speed and automaticity in recognizing background knowledge through listening to the teacher
words and verbalizing sentences naturally, they free up read stories aloud and conversing with their peers. They
their mental abilities to deal with larger vocabulary loads adopt and adapt strategies modeled by others. They
and implied meanings. However, because this model, modify their understanding of texts by listening to what
like the first, views readers as recipients of content, others have to say. At the same time, roles continually
they need direct instruction in comprehension strategies. change: the questioner is questioned, and the explainer
Through instruction, readers learn how to deal with is corrected. Thus, social interaction is a necessary
different kinds of texts and their increasing length, component of this model.
complexity, and subtlety.
4
Minority View
Appendix B
Below are two lists of topics not investigated by the My List of Topics of Public Concern
National Reading Panel. The first is drawn from a • Direct instruction
survey of leaders in reading from across the United
• Use of decodable texts
States done by the International Reading Association
(Reading Today, December 1999). These leaders were • Embedded skills instruction
asked to identify what topics they perceived to be “hot” • Reading aloud to children
in the field today. The second list is my own view of
topics that teachers and parents are concerned about, • Invented spelling
either because they are now in wide use or are being • Use of predictable texts
advocated for inclusion in the reading curriculum. • Early language development (vocabulary, grammar,
International Reading Association List and literary language)
of “Hot” Topics • Integrated reading and writing
• Balanced reading instruction • Home-teaching programs
• Decodable text • Access to quality literature
• Direct instruction • Whole-class instruction
• Early intervention • Scripted instruction
• Performance assessment • Teacher modeling
• Standards • Children’s understanding of print conventions
• State/national assessment
• Volunteer tutoring
5
Minority View
Appendix C
Joanne Yatvin
February 27, 2000