Académique Documents
Professionnel Documents
Culture Documents
The Hype
Effectiveness research is now at the point of sophistication wherein black-box outcomes studies are no longer acceptable. Mowbray, Holter, Teague and Bybee, 2003 89,900 Google hits on October 10, 2009, for the phrase, what works best for whom
2
Lofty Goals
What social programs, policies, and interventions work? For whom do they work, and under what conditions? And why do they workor fall short? Preface to Learning More from Social Experiments, edited by Howard Bloom
(34,000 Google hits on book title)
3
Thesis
It is better not to attempt to measure fidelity in GRTs It is counter-productive to try to answer questions about efficacy (intervention effects under ideal conditions) in a trial designed to measure effectiveness (intervention effects under realistic conditions) Other forms of measurement about the intervention process in the hopes of learning more about alternate interventions are also vain and wasteful
6
Outline
Opportunities & perspectives The preconditions for useful fidelity measurement Operational challenges in fidelity measurement Statistical issues in the estimation of fidelityadjusted intervention effectiveness A case study Another perspective
Opportunities
Many educational, social, and behavioral interventions are complex
Multidimensional Incorporate aspects of culturally accepted best practices (traditions and fads) Require the participation of trained intervenors and of intervention subjects over extended periods of time Can never be detailed enough to handle every eventuality
8
Cynics Perspective
If there is failure:
Blame the subjects Blame the intervenors Disqualify or discount the work of control intervenors who by virtue of superior skill, appear to infringe on the developers recipe, possibly merely by implementing the culturally accepted best practices
9
10
Operational Challenges
Choice of informant
Subject Intervenor Trainer/ senior intervenor adviser Neutral observer
Intervenor Informant
Likely to think they are doing just fine if asked to summarize their fidelity
Let's Begin with the Letter People: ECE
24 22 20 18 16 14 12 10 8 6 4 2 0 1 2 3 Project Director rating 4 5
24 22 20 18 16 14 12 10 8 6 4 2 0 1 2 3 Project Director rating 4 5
Frequency
Frequency
Frequency
Frequency
14
15
Trainer/Advisor Informant
Can have vested interest Blind neither to treatment status nor outcome outlook Possible to read the writing on the wall and rate the intervenors with unfavorable average outcomes as having low fidelity, thereby protecting the fidelity-adjusted effectiveness of the intervention
16
17
Neutral Observer
Very costly Need staff who fully understand the intervention model Need extensive training for consistent rating Usually need travel Results in strong pressure for additional clustering
18
Statistical Issues
Most advocates of fidelity measurement have unwarranted optimism about the ability of statisticians to do anything useful with the data Of course, one can always hunt for the statistician who will provide rosy promises of artful analyses
20
22
Heroic Assumption #1
Can only render the mediating role of one (one!) unidimensional summary of fidelity By definition, Z is an instrumental variable for the effect of X on Y if the only effect of Z on Y is through X In other words, one must be able to rule out a priori that there could be any effects of Z on Y that do not run through X In the context of fidelity-adjusted effect estimation, this means that there is a unique plausible summarization of fidelity
23
24
Heroic Assumption #2
If one relies upon the adequacy of covariate measurement, one quickly runs up against sample size problems A typical group randomized trial will have only a few dozen intervenors per arm (maybe just one or two dozen, and I have seen less than one dozen)
26
27
28
CASE STUDY
29
CLIO
Randomized field trial of curricula for Even Start Centers 5 arm study 4 active, 1 control Three fidelity measurements:
Local Even Start center director Curriculum designer Neutral observer
30
Relationship between developer-rated fidelity and emergent child English literacy (arm A2)
Outcome
Fidelity
33
Relationship between developer-rated fidelity and emergent child English literacy (arm B2)
Outcome
Fidelity
34
Relationship between developer-rated fidelity and emergent child English literacy (arm A1)
Outcome
Fidelity
35
Relationship between developer-rated fidelity and emergent child English literacy (arm B1)
Outcome
Fidelity
36
Relationship between developer-rated fidelity and emergent child English literacy (control)
Outcome
Fidelity
37
Relationship between observer-rated fidelity and emergent child English literacy (arm A2)
Outcome
Fidelity
38
Relationship between observer-rated fidelity and emergent child English literacy (arm B2)
Outcome
Fidelity
39
Relationship between observer-rated fidelity and emergent child English literacy (arm A1)
Outcome
Fidelity
40
Relationship between observer-rated fidelity and emergent child English literacy (arm B1)
Outcome
Fidelity
41
Relationship between observer-rated fidelity and emergent child English literacy (control)
Outcome
Fidelity
42
44
Other voices
Peter Schochet, Mathematica Policy Research, in a recent IES white paper, final line: Thus, these classroom practice mediators may be of little help in confirming the studys conceptual model and identifying teacher practices that are most associated with student learning gains.
45
Josh Angrist
Instrumental Variables Methods in Experimental Criminological Research: What, Why, and How? 2004. Journal Of Experimental Criminology. Especially noteworthy is the fact that, in marked contrast with an unfortunate trend in education research, criminologists do not appear to have been afflicted with what social scientist Tom Cook (2001) calls sciencephobia. This is a tendency to eschew rigorous quantitative research designs in favor of a softer approach that emphasizes process over outcomes.
46