Académique Documents
Professionnel Documents
Culture Documents
S ST Ud AJISU E A IN N N OJEf , N I U d
qBnaqan1uuall)
r{freeseuIerfosroJ selnuue^es
Copr.right O 2008 by Princeton University Press
I,ullished by Princcton University Press,41 \Tilliam Streer,Princeton, New Jersel'08'540
ln the United l(ingdom: Princeton tlniversitv Press, 3 N{arket Place, Wbodstock,
Oxfordshire OX20 1SY
press.pilnceton.edLr
10 9 8 7 6 5 4 3 2 1
To Sarah Elizabeth Gingrich Firebaugh-
louing daughter-in-law, engineer,athlete,
enuironmental actiuist, womdn of faith-
uhose life was cut short by an errdnt driuer.
We miss you.
7007023942
Contents
Preface xt
\eierences 241
. ,:,1ex 253
Preface
Although chapters5 and 6 (coveringrules 5 and 6) are the most challeng- S eve r alot hcr indir r . i- : '
ing chapters in the book and could be omitted in some undergraduate tl-risbook in signific:rni ., .,
courses,the persistentreader wili be rewarded since rules 5 and 6 reflect ar A crdem ia Sinicl . r n. :: ' .
the current turn of social researchtoward issuesof causalinference(rule those lectures,As rhe 1.,
-5)and the analysisof longitudinal data (ruie 6). agementand suggesriir:1 l
The rules are liberallv illustrated with applicationsfrom economics, Mattl-rews,and Sal Or, ,:-.:
polirical science,sociologl',and relatedfields.I sometimesrely on exam- arrd pro vided euur r r r cr . , .
ples fror-r-rm\- own rvork in sociology, since I know that research best. S atvi kaChalasr ni,D. i. -
Equatior.rs are necessarv, but I keep them to a minimum, and in any caseI Frisco, David Johnso: . l.
presenr:lrgumentsand examplesthat rely more on intuition than on for- good, Alan Sica.. inJ l-- -
mal proofs. The book is much heavier on examplesand thought experi- w ere es peciallldet . r i . -
menrs rhan on formal statistics.Although the book is geared toward ron U niver sit yPr c. . ; - -
quantitative methods, the rules (and certainly the principles behind them) me to s peakm or e Jr : =- -
applv to qualitativeresearchas well. Fi na lly.I woLr l. l ir L=
It is fair to say that, for the most part, I learned the sevenrules either in bot-,k.but her supp, , r : . . ,
the processof my orvn researchor from my erperiencesas a journal edi- numero ust o m ent l( ] n.
tor. I did not learn them in graduate school-at least not in a codified
form. I expect to hear from other social scientistswho are in a hurry to
tell me about the rules that I missed.I concedethe point: It would be fool-
hardy to claim that the sevenrules here are exhaustive. (So send me your
additional rules,in casethere is a secondedition of this book.) What I do
claim is that the sevenrules are timely and important. Adherence to these
rules would materiaily improve current researchin the social sciences.
The idea for this book was hatched after a week-long seriesof lectures
I presented to faculty and others at Academia Sinica (Taipei, Taiwan) in
January 2005. The theme of the lectures was "Let method be the servant,
not the master." At the end of the week severalparticipants in the course
encouragedme to publish the lectures. My first thought was to carve out
severaljournal articles from the lectures. Upon further reflection, how-
ever, it became apparent that a seriesof articles would not do-I would
need a book. I resistedthe idea at first but later warmed to it, in part due
to the encouragementof Tim Sullivan at Princeton University Press.And
s o r his book w a s b o rn .
An early draft was written in sabbatical facilities kindly provided by
Harvard University. I thank Peter Marsden and Mary'Waters for inviting
me to spendthe 2004-05 academicyear at Harvard, and for providing me
with an office with such a grand view of Cambridge. I thank the other
'William
occupants of JamesHall, and Christopher \X/inship in particular,
for stimulating conversationsabout matters sometimesrelevant (and some-
times not) to the issuesof this book. I also thank Susan\felch, dean of
the College of the Liberal Arts at Penn State,for her efforts in providing a
timely and welcomesabbaticalto focus on a number of researchprojects.
Preface . xi i i
The FirstRule
OF SURPRISE
THERE SHOULDBE THE POSSIBILITY
IN SOCIALRESEARCH
s.cial Survel'). l-r,en if r,ou plan to collect vorlr own srlrvL.ydata, yorr
should consult sr-rchprior survevsearlv in your project, if c,,riyfor guid
ance on hou. to u.ord vour questions.
other reseirrchis ruled out becauseit is r-urethical. TcrjLrr'p aheadto r.rr
e r amp l eth ilr is c enr r alin a s trb s e q u e ndti s c u s s i o no f c a usal i ry(chaprcr5),
rl'e did not kno\\', until a ferv decadesago, whether or rot sr-r-rokingcalrscs
lung carrcer-solle expertssaid it did, some said that tl.recvidencervasin
conclusive.Scienrificalh.,the best wav to settlethe disputewor-rldhave been
to .rssignindividualsr:rndomlvto one of two groupsra smoking group and
a nonsmokinggroup. In tenns of sciencc,then, the question.Lril.l b.
srveredb1'doing an experiment.hr practice,however,such an exDerlment ^n-
rvould haveraisedseriousethical quesrionsregarclingrln indiviclual.sright
to smoke (or not smoke),Thc important point hereis that ethicalconsicler_
atlons are irnportant-important cnough to rcndcr somc t)-pcsof rcsearch
un doabl e . r
\7hirt, then, d.es a researchable qrestion look likci Researchable ques-
ti<lnsin the s.cial sciencestend to be questionsthat are too \pe-
cific (for exarnple,about a specific individual or cvenr)'either n.r to<l grand
( "w hl 'i rre t her er v ir r s ?" ) T
. h e c l u e s ti .n" w h y i s my u rcl c an al cohol i c?"
is a gclodresearchquestion becausein the final anrrlysisit is imp.ssr-
'ot
ble to predict the behavior of a given individual; social lar.vsare nor dcrer-
n"rinistic.(Besides,the question isn't very interesting to anyone outsicle
vour farnilv.)It is possiblcto make that questionresearchableby gencral-
izing it to alc.holics in ge'eral. \wecould ask, for example,r,vhysomepeo-
ple tend to be more prone to alcoholisnrthan others are. To addrcssthat
questiorl, we firsr ask what characteristicsdistinguish alcoholics from
nonalcoholics.h.rother words, we ask what individual or communitr.trairs
correldtelvith alcohoLsm.
In the social scienceswbdt questionsgenerally are easier to answer
thitn why questions.As Lieberson(198.i, p.2I9, italics removed)purs ir,
"Empirical data can rell us what is happeningfar nrore readily than they
can tell us whv it is happcnrng."consider alcoh.lisrn again. with appro-
priate data it is a straightforward natter to find the correlate, .rf
h.lisrn, that is, to find the characteristicsthat distinguishalcoholics"1..,- from
others. \ffhether thesecorrelatesare cduses<lf alcohoiism is another mrrrrer
(w e take u p t he c aus alit y q u e s ti o n u n d e r ru l e 5 , " C ompare ri ke rvi th
lI n large p a r t b e c a u s e
o f f a mo tr s ( o r ir tfa n io u s)e xp e r im e n tso r ,e r the past century that
h arm ed ht t ma n s u b j c c t so r p l a c cdtlr e n ra t r isk o f h a r n r , u n ive r siticsn o rv
m,ri ntai ngui dcl i nes
regarding ar]y investrgation involving hurniln subjects. At the outsct of ttrrr
,..sJ,rrchvou
th , t l tlJ 6, v 111 . 1r ,;,1r r r r r n i r e r r r t r ' ' rir r r titL r tio lr llr e vit.g .h o lr J tlRRr fi,r
thc fr,,ce,l urr,, t,, i ,,1.
lo$' in nrur ou'n research.The central pnnciples are that participation b1, hun.,a,,
subjects
shoulclbe voluntarv. that anonvmitv or confidcnti;rlitv ot subjectsshor,rlcl be mailtainecl, arrcl
that the benefirsof fhe reseirrchshould out.,l,eighrrnl' foreseeablerisks to thc subjects.
! o Chapter.l
like"). But that does not make the correlates uninformative. Even if we
want to determine causes,generallythe first step is to find correlates,since
knowing the what gives us clues about rhe why.
Because"mere description" is sometimesdevaluedin the social sciences,
it is important to underscorethe point that "what" questionscome before
"why" questions.Factscome first, as SherlockHolmes statedfarnouslyin
his warning to Dr. Vatson that "It is a capital mistake to theorizein ad-
vanceof the facts" ("The Adventure of the SecondStain"). To punctuate
the detective'spoint, later I describean instancewhere social theorizing
went astravLrvattemptingto accountfor "rising global inequality" when
in fact global inequalitvwas not nsing. Obviously we needto get the facts
right about -,r,hatthere is to explain before we concoct an explanation. As
the first marim of Galileo's Discorsstates,"description first, explanation
s ec ond"( s eeP e a rl2 0 0 0 , p p . 3 3 4 -3 5 ).
Getting the facts right is critical, second, becauseaccurate description
sometimesis itself the end goal. Consider the problem of motion. Me-
dieval attempts to understand motion grappled with the nature and ori-
gins of motion. Current understandingtakes motion as given and attempts
to describeit preciselyand accurately.As Andrew Abbott (2003' p. 7)
polnts oul:
In this instanceprogress was stalled until the question was switched from
the unknowable (why things move) to the knowable (how they move). \7e
have returned, then, to the central point of this section: that a research
question must first of all be answerable.The secondrequirement is that it
is interesting.
INrnRn,srrNcQunsrroNs
. -:: .'-.:i to that conversatlon. anyone else interested in your research is a telltale sign that you need to
, : -.,: ,i irs scientificrelevance: rethink the project. Rethinking the project does not necessarilymean that
- - :i.iision in the social sci- you need to abandon your idea. But it does mean that you need to rework
:, : . , .:: i i i s s oc iallyim por t a n t; your question to make it interesting to others.
( o Chapter1
The issue,then, is how to reviseyour researchquestion to make it more surpri set hat dist ingur ihi: . -
interesting to others. (I assumehere that you have done the necessarypre- vour soc ialr esear chpr - ( ,: , . .
liminary review of other researchon the topic, so you have a basic idea of of surprise,you need ro I' r: - '
what othershave found before you.2)First you should discussthe project forehand. Sometin-res rc\-: r\ -
with others to find out why ir is uninteresrillg to them. Most likely they searsh uninteresting) l.r,; ..- -
will say that your question is uninrerestingeither becausewe already -
are measured,So if orhs:r ..,
kr-rowthe answer, or becausewe would not care about the answer even if to make sure that \-olrr -:-:. -
we knew it. That is: resul t.In exam iningf hr ; : : . , -
o The no-surprises objection: We won't learn anything from the re- i l cti on. f or exam ple.ir - ' -
searchbecausethe answer either is already well docume'ted by prior tw o con cept s,so t hat r h; : : .. ' .
research or is preordained by the research method. In other words, ticlnship between the frr', .
the researchis gratuitous, sincewe know the answer before we do the A l ternat ively.per h. r p.: : -- .
research. prior researchlyou'r'el.t.:' --
. The "so what" objection: The answer to the researchquestion either ment, however, it is rnr: , '
has no relevancefor social sciencetheory or for everydaylife, or mat- f ront well-estqblishetl t';.;; ; . - -
ters to such a small handful of people that the question is trivial. that resear cherhcve s r r - r . : '. .--
' el f can st im ulat er e\ eJr - : - .
"\flhy is my uncle an alcoholic?" is a good example of a trivial research
the opp osit e.Bccauser - i: , - -
question. No matter how important the answer might be to you, the ques-
rnixed, inconclusir.e.0r ., :l
tion is trivial to social scientistsbecausethe answer marrersto such a .-
u,hereresearchis u-ell
small handful of people. To make the question inreresting we musr gener- -s:,,-^
firlls into that categorr'.f i:: r ,
alize it. Instead of asking why your uncle is an alcoholic, we ask what jection, sinceprior findin-. : -
general traits distinguish alcoholics from others. So the triviality problem
piestof circumstances. \'(,-. .,
generally is solved by casting a wider net in the researchquestion.
l oncl usion and shou'. u : - . : -
Irrelevanceis also solved by casting a wider net. In truth, of course,few
i s great pot ent inl f , 'r . r . - ' - -
questions are completely irrelevant; relevanceis a matter of degree.As ed-
much researchbut littlr :: : .
itor of the American Sociological Reuiew I found thar one of rhe most fre-
That s it uat ionis qr r ir c- ". ',
quent complaints of reviewers was "\fhy should I care about the results
aonsens us on r esear eh: : * '- ,
of this paper?" As social scientistswe care about researchwhen the re-
rvel l -es t ablished f indinc. : - --
sults speak to ongoing conversarionsand debaresin our field. The more
pri se.S o if you helicr t '.: - ^
such connections we can make, the more interesting our results tend to
stratedthat married f-(,: :
be. so it is important to show how our researchquesrion links to theories
rrnmarr iedpeople.r hcr . . . .
and other empirical work.
marital statusand happ,, ,.-
The other objection (no-surprises)is consequentialbecause,as rule 1 -..
some new direction 1L-\,-.: -: :, -
states,surprise is a hallmark of social research.It is the possibility of
t'ou belieueprior rese.;' .
2 In early stagesof your literature
search you should look particularly for revier.varticles. terestingand significaflrr:-: -- -
A good review article will provide ( l ) a theoretical grid or template for thinking about the prior research.There li :- :- --
important issuesin the field, (2) an overview of key findings and key unresolved issues,and
ti ndi ngsin t he soci. r \l ! 1. - : . -
(3) a description of the most i'rfluential and important studies in the area. The
Journal of those findings.
Economic Perspectiuesand the Joumal of Economic Litercture are yaluable sourcesof sum-
marv articles of the latest findings in particular areas of economics. In the other social sci- In sho r t , if you ul: : : :
ences review articles are available in the Annual Reuiew serjes (e,g., Annual Reuieu, of establishedfindings. \ o,r i :. : -
sociology). \(/hen beginning a new researchproject it is a good idea to scan the table ofcon- r[1'/extendthe findittg:. t - '.
t e n t s o f t h e A n n tu lRe u ie w to se e if te ce n tsu m m a r ya rti cl esar:eavai l abl e,andgofrornthere.
.rnd i nduct ivewor k: Fr : . : '
The FirstRule o J
flaw in prior research (deduction), then you show, empirically. that the others rate you on lelel , : :,
flaw matters (induction). vou may 6nd t hat nr , r r r : . : - .
The other option is to clarify or extend establishedfindings. The clari- you measurehappines.. ) -,
fication of findings is deductive(or largely so), involving further concep- that marriage is rel:rred :. .
tualization and theorizing. The extensionof fir-rdingsis inductive, involving different ways. In orhcr ',',
further data analysis.I considereach of theseoptions in turn. lenge ar' old findir-rg.\r-,.- .:
' '.'..empirically, that the others rate you on level of happiness.Very possibly you will be wrong-
you may find that marriage is associatedwith happinessno matter hou'
. -, -i r r r dings. T he c l a ri - you measure happiness. Your study nonethelesscontributes by showrng
'.r nq t ur r herc onc e p - that marriage is related to happinessevenwhen we measure happinessin
'-- . ' r J r r c t iv e,
inv olv i n g different ways. In other words, when you use a new approach to chal-
lenge an old finding, you add to knowledge whether you are right or
B ox 1.1
- ..: J c r i uf pr oof r e s ts
' '.. .: s er iollsenoug hto Case Study: Rethinking the Findings of Rcsearch on Global
- -..::, Ir-rsot-ta1 anecdotes Income Inequality
: :r i- r f d t o t he es ta b -
. : ::. ri ri'on't do to say Contrary to some earlier research,recent str-rdiesof global income
. - :; : eoplc I k now" o r inequality find that, when China and India are given their due weight,
global income inequality is not rising (Bourguignon and Morrisson
- . . ,'-:-r.llots of rnarried
- ,.-- , r it r he m ar k , y o u 2002; Firebaugh 2003; Goesling 2001; Melchior and Telle 2001;
r ::'.:ri\' preciselywhere Sala-i-Martin 2006; Schulz 1998). (To be clear, there is massive
- : ,.:-:red adequately,or global income inequality in the world today-but it is not growing
: .:: lr t ) blem s wit h th e rapidly, as once feared.)The "yes it is/no it isn't" story of rising global
income inequality provides a good example of the importance of
: -:.:,rl.Lrshed social sci- knowing what it is that you are trying to explain before you try to
.' - r r c , I r c h. E ar ly a c - explain it. In a number of popular articles and books, inequality
- -' : ,.: J r am at ic allyin th e has been linked to economic globalization, one of the hortest topics
in social science,to fashion a growing literature on globalization
and the "explosion" of global income inequality. But the argumenr
--. .',;'''ar. recent reSearCh
' : '.'. .rt .rll over that pe- that globalization is the cause of rising global income inequality is
-:
wrongheaded-it tries to erplain a "fact" that isn't.
'- -r. r r . h is eas ilych a l - It is instructive to seewhere the earlier researchwent wrong. In-
: ' : - :.i r r lv dubiousab o u t come inequality refers to the disproportionate distribution of income
r - -- -,,uld ac c ountf or th e across individuals or households. Social scientists have devised
ways to measure disproportionality in income (Allison 1978). By
- . :'. ,,n tl-refinding about
- ' , -:-.rng is bas edon s e l f- applying thesemeasuresto income data over time we can determine
':..c\ e that self-reported whether income inequality is rising or falling for a country-or even
- : -.-:rLqirtbe that, because the whole world.
...::ic.'I.the married are Global income inequality refers to the disproportionate distribu-
-
: .::. or not, and the un- tion of income across all the world's citizens, where each citizen is
, : : ''.:.ther t hey ar e or n o t, given equal weight. \flith regard to the discussion here, the impor-
.: . - r :: rh.rt accountsfor the tant point to note is that the claim of rapidly growing global income
inequality in earlier research was subject to challenge because it
j-ir cnttque mattersem- rested on a seriesof dubious assumptions and incomolete analvses.
::i. \'ou rvould hypothe- These included:
:tiness disappearswhen
:.'-i.leu'a1's,such as how (continued on next page)
10 . Chapter1
rcuntinttcdfrom PreuiuuspoKc)
o Extrapolation
from resultsfor the tails of the income distribu-
tion. Some earlier studies concluded that global income ine-
quality is rising rapidly on the basis of a comparison of income
trends for the very richest and very poorest countries. But the
vast majority of the world's people live somewhere else. \fe
cannot reach defensibleconclusions about the inequality rrend
for all the world's population by examining income trends only
for those who live in the very richest and very pooresr regrons.
. Unequal weighting of indiuiduals. Early studies often weighted
all countriesequally.Ho-,vever, becauseglobal income inequal-
ity refersto the unequaldistnbution of income acrossindiuid-
uals-and manv more individuals live in poor countries (e.g.,
China) l'here incomesare grorving rapidly than in poor coun-
tries u'here incomes are gror,r.ingslowly (Firebaugh2003)-
the failure ro \\reighrcounrriesby their population produced
misleadingresultsin those studies.
o Extrapolation from historical trends. Global income inequality
rose rapidly over the nineteenthcentury and the first half of the
twentieth (Bourguignon and Morrisson 2002), as the \il/esrtook
off economically and Asia and Africa lagged behind. Global in-
come inequality rose rapidly becausethe richest regions of the
world were the world's growth leadersand the poorest regions
tended to grow slowly, if at all. In recent decades,however,the
world has changed.Now many of the fastest-growingeconomies
are in poorer regions of the world (Garrett 2004)-a poinr not
fully taken into account in some early studies.Becauseincome
inequality is now declining across counrries, rhe longstanding
growth in global income inequality has halted. (At the same
time, income inequality has increasedwithin many countries.
Declining inequality across counrries and rising inequality
within them implies a "new geography of global income ine-
quality" in which countries are receding in rmportance as eco-
nomic units: seeFirebaught20031.)
T h c Fi rstR ul e . l
parochial school, and in your review of the literature you find that no
one has examined that issue separatelvfor parochial schools, which
(you suspect)tend to be more conservativethan other colleges.In that
case you might design a survev for freshmen and seniors in your
school to seeif you find the same result for your college (better yet,
administer your survey in a broad range of parochial collegesin addi-
tion to your own). Becauseno one has studiedthis issuefor parochial
schools,you can be surprised b,vthe results.The results of your study
likely would be of interest to a number of people,including adminis-
trators and others in parochial schools, educational sociologists,and
those who study hor,v political attitudes are formed. (A word of
warning here: subpopulationssuch as "my friends" or "my sororiry
sisters" are too narrow. You might begin by pretestingyour survey on
your friends, but you do not want to end there. Not many people are
interested in the political attitudes of your friends or those in your
sorority. But a number of people are interested in the political atti-
tudes of studentsin parochial colleges,a much more substantial sub-
set of the generalcollege population.)
. IJsea new population.In this casethe issueis whether or not earlier
research findings apply to some new population. You might want to
know u'hether there is an associationbetween marriage and happi-
nessin the caseof arrangedmarriages.To find out, you could inves-
tigate the effect of marriage on happinessin societieswhere arranged
marriagesare more common than they are in the United States.
o Use a different time period. Here the issueis whether or not prior re-
searchfindings apply to a different era. In some studiesthat new time
period might be historical: '$ilasthere a positive associationbetween
marriage and happinessin the nineteenth-centuryUnited States,or has
that associationcome about more recently?Historical data to address
such questionsoften are lacking, however,so rnost replication studies
of this type are updating studies-studies that seek to update earlier
researchwith more recentdata. For example,studiesof racial attitudes
in the United Stateshave consistently found that white southernersare
more likely than whites outside the South to disapproveof marriage
between blacks and whites, to disapprove of racial integration of
schools,and so on. Many of those studieswere done in the 1970s and
1980s (for example, Middleton 1976;Tuch 1987), however,so peri-
odically new studiesappear (for example,J. S. Carter 2005) to deter-
mine how quickly regional differencesare disappearing(if at all).
c fJsea new population or subpopulation to test a specific hypotbesis.
In this caseyou choose a population or subpopulation that provides
a test of some hypothesis. For example, you might hypothesize that
the positive relationship between marriage and happinessis basedon
The FirstRule . I.l
: - : -: lcrilture You find that no companionship,so you expect iittle or no effect of marriagc on hap
' :--rrochialschools,which pinesswhere long periods of separation are comrnon. You rr-righttest
--- .
',:-- ri ,. :l'.,.r1 orhercolleges.In that this hypothesis by estimating the effect of marriagc on happiness
- .j -.,.:-.: rrtr ] d s enior sin y o u r among those in the navy (where long periods of spousal separarion
r - , .' : ',(lrlr college(betteryet, are commonplace) and comparing that effect with the marriage effect
' :.;:,, ; hr alc ollegesin a d d i - in branches of the military where separation is not the norm. Note
- - -
: - .. - . . - :..:i t r s is s uef or P ar o c h i a l that in this instance a noneffect (no association between rnarrrage
: - :- l '.:-resultsof Your studY
-: and happinessamong those in the navy) provides support for your
, - -: - - .: r. e. inc ludingad m i n i s - companionship explanation of the general marriage-happinessrela-
a- -. :: ....::. , - r n,s1l oc iologis tsa, n d tionship.
1--- r: -.:: iornled, (A word of
- ::::; r J s " or " m y s o ro ri tY In short, one possibility for well-establishedresearch rcsults is to see
-- '.
-: - , - - , -l :.:r s t illg v our s urv e yo n whether or not old results apply in new contexts (strategicsubpopulations,
: . : - ':::. \ ot m anYP eo p l ea re new populations, or different tirne periods).
Another possibility is to extend or refine the cawsalexplanations of the
:: - - -.: ::.c i- r ds or t hos e in y o u r
"i :::.i :i in t he P olit ic a la tti - observedempiricai patterns found in prior research.The social sciences
: - -:
. . --' .-:: : lor e s ubs t anti asl u b - are pregnant with claims about what causeswhat. The evidencewe have
:- :
for these causal claims, however, is often very weak. A critical task in the
:
or not e a rl i e r social sciencesis more and deeperresearchon causal relations. There are
rr ,: r- : -- ., ..
" r ilet her severalgeneralstrategiesfor doing so. The major possibilitiesare discussed
: : - .. ,:.:. ,r'1. \bu might want to
' - .'.:::1 m.lf riage and haPPi- below.
:: ::
-: :: : ' - -:-.; O Llt v
. O U CO ul di n Ve S -
CAUSAL C H AIN S: IN T R OD U C E M ED IAT IN G VAR IABLES T O AC C OU N T
::: : -- -l rvhere arranged
- 'alaties FOR R ESU LT S OF PR IOR ST U D IES
- - ' . - . .:: .:-.ihe Unit ed S t at e s .
: - - . . . . - .'.r lc t lr c ror not pr i o r re - The idea here is to developtheories that identify the mechanismsor parh-
-' :t'. .'- - ::le studiesthat new time ways by which some independent (explanatory,exogenous)variable X af-
:-r :: - I :l i l v c t ls s O c iat iObe n tw e e n fects some dependent (outcome, endogenous) variable Y. One way to
r,:'.:: -. --r:-.:!1rv United States,or has account for the effect of X on Y is to find some set of variables Q that me-
' : :: , : ' . i Hi sr or r c aldat a t o a d d re s s diate the effect of X on Y, that is, X -+ Q + Y. This strategy is often em-
: ::. i\) Illost replicationstudies ployed in social scienceto account for the effects of ascribed traits such as
--- . .. - .:: :h.rt seekto update earlier gender or race on outcome variables such as earnings. It is well knorvn,
t : , .'.:-:... studiesof racial attitudes for example, that women tend to earn lessthan men in the United States.
'
:-t r ,.--.ilhet n'hite southerners are Explanations of this gender gap in earnings typically focus on mediating
i-: ) .::: :o disaPProveof marriage or intervening variables Q in this chain: gender -+ Q -+ earnings. In this
: -,',::: ic of racial integration of chain Q might include things like interrupted careers (women are more
*: i. -.-.:: \\ er€clonein the 1970sand likely to have interrupted work careers,and that's one reason for gender
differences in earnings) and hours worked (men are more likely than
:' ,, : ': ': :. J. S. Car t er2005) t o d e te r- women to work full-time). One way to make a researchcontribution, then,
' : - : - . ::: .| sa ppear ing( if at al l ). is to find important mediating variables that have been missed or under-
| : : . - . 1 .' t :' , tes ta s pec if ichy po th e s i s . valued in prior research.
1: .. ; .ll.population that provides The mediating variables strategy works best when reverse causation
:...j:-: .'ou might hypothesizethat can be ruled out. That is why the strategy is often popular for explaining
- .-. , : : .,i-. the effects of ascribed traits such as qender and race. In the illustration
:e a nd happines sis ba s e do n
14 . Chapter1
above, for example, it is clear that gender affects hours worked, not vrce
versa.
Sometimescausal direction is not so obvious. In the caseof marriage
and happiness,it is not clear which con-resfirst, marriage or happiness
(seereversecausationbelow). In other rvords,there could be a kind of se-
lection effect in which those who are selectedir-rtornarriagetend to be
happier than those who are not selectedinto n-rarriage.Even if marriage is
not selectivewith respectto happinessper se, marriage could be selective
with respectto possiblecorrelatesof happiness,such as physicai attractive-
ness,or cluses of happiness,such as physical health. If in fact the beautiful
and the healthy are more likely to marry and more likely to be happy, then
at least some of the associationof marriage u'ith happinessis due to the as-
sociation of marriage with attractivenessand health. Thr-rsrnarriage could
be associatedwith happinessevenif marriagehas no cdttsqleffect on hap-
prness,
This observation-that nonmediating variables could account for the
empirical associationbetweenmarriage and happiness-leadsto another
possibility for extendingprior research.
F O R R E S U L TS OF PRIOR STUDIf,S
The idea here is the same as in the mediating variables case:'We want to
develop theories that explain rvir-vr'veobserve an association between X
and Y. In the caseof nonmediating variables,though, the mechanismsare
not causal. Sometimesthe term sptn'iousdssociationis used to describeas-
sociationsbetweenX and Ythat can be accountedfor by their associations
with other variables,where those other variablesare not causalmechanrsms
linking X and Y. (For example, X and Y might be associatedbecause
some third variable Z causesboth of them. Or X and Y might be associ-
ated becauseX is associatedwith other variablesthat are causesof Y.)
The term spurious here means that there is no causal link from X to Y. It
does not mean that the associationof X and Y itself is a mirage. Ifhen we
say, for example, that the associationof marriage and happinessis spuri-
ous, we do not mean that, on secondthought, married people aren't hap-
'What
pier after all. we mean is that getting married typically won't make
you happier. (If the distinction is unclear, it should become clearer with
the discussionof causalityin rule 5, "Compare like with like.")
Often an associationbetween X and Y is partly causal and partly not,
so some of the so-calledcontrol uariablesthat are added to a model to ac-
count for an associationof interest are mediating and some are not. Ifhat-
everthe situation-whether the associationof X and Y is due to mediating
variables or nonmediating variables or a combination of the two-it is
nearly impossible to overstatethe importance of careful conceptualization
The FirstRule . 1-i
AN D Q U AL I F Y , R E S U LT S O F P RIOR STUD1ES
C H E C K F OR R EVERSE CAUSATION
: t:Ji S er r r lier ,m os t n o - useful to think about how the study of religious inheritance could be
- ,:: rn happines swil l b e linked to the study of classinheritance,which has been studied in much
. -:: -:.;-llong thosein other greater depth. Are the processesof classinheritanceand religious inher-
- ,i. or r rc om par in grh e itance similar? Are parents in the United Statesmore effective in trans-
.. =: i. , r P nlat ionswe , see mitting their classstanding or their religious beliefsl To my knowledge
'"' -:'r.rtor variableframc- no one has tackled that question (I supposebecauseir is difficult to find
. .. .ri\\ popul;rtionsand a common metric for comparing the two types of inheritance).The at-
' tempt to span the two fields could have high payoff, however, in terms
- ::. r ! lent s in P ar o c h i a l 'We
: : r'. u'e could append of both substanceand method. could generalizethe issuemore-and
' " i -:-trJon clther colleges make it even more interesting-by asking what overarching principles
. r - . - : . . ^ .vdrr<
. ^-:rb l e . govern the transmissionof beliefs,attitudes,values,and statusesacross
- _ililufrdtrrr5
- :: J.rt.1should not be a generations.For example,are parentsgenerallymore successfulin trans-
-. -. ::rermine whetherthe mitting their beliefs and values (political, religious, and so on) or their
.-, ,-:.: i of nay y m ar ri a g e s material statuses(such as their social class)to the next generation?In
j i ::-'.:l-o111 two Samples- synthesizingstudiesof this sort, prior findings for particular fields of in-
Lrr in the navy-and quiry become the touchstones for more general theories and research
about human behavior.
r nt er es t ing: r n d re s e a rc h rrb re
r' e, r' ri k e l ' th e i ssuehas bee' studi ed
ber of rimcs, s. tl.rereis cxrensir.erescarch . num_
,,n i, ni..".ii."iri.'*.n.rn,r,
difficult, ther, ro fircl a feirsirrlercsearc-rr
c.lucstlon
r !'v s r r v r r rhat
( r r or ris
J rir,porrant
r r r Pu vet
ha s ro t be c, stLr cr ied
That dilenrmapr'babl' erprai's ri.h' researchers-in-trairring
stru,qglcto find a rcsL.arcitro1.i.-.11.,.,. sornerirncs
tr.r-in vai. to fi.d o,r'ar.rar,,ng
rluesri.n rhat n. .ne has inresrigated
tefirre. Trre exarr.rple, ,ru. are i._
to make,thepoi'r rhat rl.t.b.rt soci;rlresearch "t
:;rd;d m'sr oftcn is research
rrcshperspectives
a'ctnewi'sightsto old and_niinu,,rgu..o.
:ji.::#:r,li
S r r . r c r r nc A S AMt,rr.
F,irst.princiTtle
of sampring: you cl.n't need ro eat
rhe whore ox ro know
that it is tough.
The FirstRule . 19
.;- 1135[e en stu die d a n um - SamuelJohnson is often credited (probably incorrectly) with this color-
: -rlready. It is generallY ful way to express the idea-later applied to the sampling of human
,: ,:r thirt is imPortant Yet populations-that characteristicsof the whole can be inferred from char-
acteristicsof the part. All sampling is based on this principle'
, .;:. irt-rt'.rinin gsoln c t im c s The same laws of probability that underlie the first principle of sarn-
' r"' rind a n itttere s t ing pling also lead directly to the first principle of sample sile.
- ': r'\.lnlPles above are ln-
First principle of sample size: For practical purposes,very large popula-
: -.r iltost often is research tions do not require larger samplesthan smaller populations do.
. ,.J .rnc1continuing areas
Suppose we want to know, with a certain degree of precision, which of
two candidateswill carry the statesof California and\Tyoming in a pres-
idential election. If the race is equally competitive in both states, we do
not need a larger sample for California than we do for the state of
'lfyoming,
even though California has seventy times more people than
'Sfyoming
Wyoming does.aIn fact, if the race is closer in than it is in Cal-
. .. ..::J ro cleterminewhom
' '' ' t:- lr rpl1l31iot-tyour results ifornia, then we would need a larger sample to predict the winner in
'-.'-rr . d in dr awing c o n - Wyoming.
The first principle of sample size implies an important corollary.
. ::.'.tcould be overlYam-
. - - -'-i : r ior ldr v ide wor-rl db e Sample size corollary:'We can make confident generalizationsabout a
, ' -. ' '- ,n lv A r ner ic anc o l l e g e large population from a sample containing only a tiny fraction of that
-' : i.lrrple, you need to population.
, :-:-:Jl \.
Although students generally have little trouble grasping the first principle
. r r r ' c dt o det er m i n ea
of sampling, they often are surprised by the first principle of sample size
:: r s'l|l1?le-from that
., and its corollary. It seemsto defy common sensethat anyone could deter-
- -.> lr r r lsabout t he e n ti re
mine what millions of Americans are thinking by talking to merely 1,500
- --:r : ih. lt P oP r iiat io ni,t i s
of them. Yet that is exactly what pollsters try to do.
. I iopulation (or the sub-
' George C. \7allace, four-term governor of Alabama and unsuccessful
- : : :: nrirtcipleof samPling,
- I r' -':r.presentative sample' third-party candidate for president in 1968, provides the most memo-
rable attacks on the reliability of political polls and the logic of polling.
- -:. .-.-..clmPletelyat random
'. : ': :.ti!et PoPulationhas an
aNote that the principle states
- - - : :... ^\ san-rPle selectedin for practical purposes. Strictly speaking, all elseequal, we
to the finite population correction
ryglq1neg4 ejlightly larger san.rplein California, due
.' ::,t;tic e, t hough, a s i mP l e rt(N-ny(N-t), rvhere n is sample size and N is size of the population. (This correction is
" -, ,,rher sampling methods neededbecausethe standard error approacheszero as z approachesN.) In practice z is usu-
- - :-. r(rr es ellt at iv e.or v i rtu - ally so small relative to Nthat the correction is trivial. According to U.S. Census Bureau es-
.- '.-:. :ri oi rhese methods are timates for 2003, California had 35.48 million people and Wyoming had 0.50 million
people. Thus for a sample size of 1,000, the finite population correction factor is 0.999986
'- ' . --.:r.1 I s eeno needt o c o v e r
for California and 0.998999 for Wyoming. In other words, becausea sample of 1,000 con-
. :..i rs emphasizesome over- stitutes a larger fraction of the population in Wyoming than in California, the standard er-
ror shrinks more in'Wyoming (by a factor <tf 0.998999) than in California, where the
standard error is reduced only by a factor of 0.999986. The practical implication is that, all
'--:
- r ' ..1tthe whole ox to know elseequal, we would need a sample size of 1,001 for California to match the precision of the
results for a sample of 1.000 in Wvomins.
20 . Chapter1
most often the case)covary wirh other causesor correlatesof the depend- and a minority population .::
ent variable. In general, the more bighly correlated the cduses,the more estimates of differences be:'.'.
caseswe need to sepdrate tbeir effects. sample of casesfrom each :
To illustrate, suppose you want to tesr a theory about only-children. representativefor the PoP;.::
Your theory argues that children reared in a home without siblings grow encesabout that overarch::-: :
up in a very different environment from that experiencedby children with sampl esto m ake t hem r eP: . . . :
siblings, and that the difference in childhood environment implies differ- lustrate, imagine we find. ::.
ent adult outcomes-perhaps only-children tend to be more self-confident ers, that adults who \\-ere , :
as adults, and more likely to assumeleadershippositions. You want ro ex- averagewhile other adults -:..
amine this theory by doing a qualitative study of 100 American adulrs, ucational difference is ts'i' .
some of whom have siblings and some of whom do not. .Weassumethat ti on for a ll adult s is 11, . ej -
N= 100 is a fixed upper limit-time and budget consrraints will not allow weights of the two group: .: :
you to collect data on more than 100 individualstotal. In short , t he use oi 5f r . r :- :
One issue you must worry about is obtaining enough variance on your populationsis often strilt-i,- :
explanatory variable. The problem arisesbecauseonly about 5 percent of groups, particularlv in th; -:
American adults are only-children. So in a random sample of 100 adults, sample size may he nto.:..:
you expect about 5 only-children-hardly enough to make confident in- oversamplingof this [\ Fs ,..
ferencesabout the differences berween only-children and others. The so- small samples:the lack (rt :
lution, as before, is to use stratified sampling: Divide the population into C onsi dcra lr er nat ivcc\ f t - "
those who are only-children and those who are not, and randomly select no-sibs (no siblings) i-Ind: -'
50 adults from each group. In other words, you deliberately oversample ri st, for ex am ple,m ight 'l: : - . .
only-children. In that way you increasethe variance on your independent fi rst-bornef f ecrin dt r {t t : - . .
variable(let'scall it no-sibs)more than five-fold, from0.0475 to 0.25 (the senceof siblings,but bcr : :
variancefor a dichotomy is n(1-n), where n is proportion in one of the lated with the variable first-'r
categories). children. The correlation :-. ',
By increasing the number of only-children from 5 to 50, you increase children are only-childre: --.:,
the reliability of your estimatesof Y (your dependentvariable) for only- the bi rth-or der ef f ect f r , : : : -
children. Of course, you also reduce the reliability of your estimatesof y principle. But to do that ',',. -
for others, since you have reduced the sample of others from 95 down to groups, not two-onlr.-;:-: :
50. But the gain in reliability in moving from 5 to 50 casesmore rhan off- not first born-and comi.,-:: :
sets the loss in reliability in moving from 95 to 50. You can have more l f i t i s no-sibst hat m . t r r r : - :
confidence in your estimatesof growp differenceswhere the groups con- group should differ from l :
sist of 50 each than in estimateswhere N= 5 for one group and N= 95 for i f i r i s fi rst - bor nt hat m lr t ; : - -
the other. should be between the :.:.:
But, you ask, doesnor oversamplingfly in the face of the notion of rep- for fi rst bor n wit h or r r i: - - '
resentativesampling? It was the oversampling of Republicans that led to first born.
the infamous Literary Digest blunder on rhe 1936 presidential election.
How are we to reconcile the advice here to oversamplewith the earlier -Some
m erhodol ogi .t: .rr- :-
discussionabout the importance of using representativesamples? mi ni sti c rath er than probebrL.:::. .
The key is to note that the aim of the Literary Digest poll was to draw tion. If so, then small sample. ; - : -,
to eliminate a different erpl.::.-:: -
inferencesabout a single population (voters in the 1936 election), whereas
r [q9l ) poi nts out, s ma]l \ - -- .
the aim of the only-child researchis to draw inferences about differences here becauseI believethere rs ..-:.: :,
between two populations, a majority population (adults who had siblings) social research (seerule 5 t.
The First Rule . 25
and a minority population (adults who were only-children). For the best
estimates of differences between populations, we want a representative
sample of casesfrom each population. The sample does not need to be
representativefor the populations merged unless we want to draw infer-
encesabout that overarching population-then we need to reweight our
samplesto make them representativeof the overarching population. To il-
lustrate, imagine we find, in our 50/50 sample of only-children and oth-
ers, that adults who were only-children attain 14 years of education on
averagewhile other adults average12 years. Then our estimate of the ed-
ucational difference is two years, but our estimate of the averageeduca
t ion for a ll adult s is 12( .9 5 )+ 1 4 (.0 5 )= 1 2 .1 y e a rs ,re flecti ngthe rel ati ve
weights of the two groups in the overarching adult population.
In short, the use of stratified random sampling to oversample smaller
popul a ti o nsis of t en s t r ate g i cfo r c o mp a ri n gm i n o ri ry groups ro maj ori ty
groups, particularly in the case of qualitative research, where the total
sample size may be modest. In the final analysis, though, purposive
oversampling of this type cannot alleviate a fundamental problem with
small samples:the lack of power to adjudicatecompeting explanations.T
Consider alternative explanations for an association between the variable
no-sibs (no siblings) and some outcome variable Y. A birth-order theo-
rist, for example, might argue that any only-child effect that you find is a
first-born effect in disguise. In other words, what matters is not the ab-
sence of siblings, but being born first. And no-sibs is inherently corre-
lated with the variable first-born, since only-children are also first-born
children. The correlation nonethelessis not perfect, since some first-born
children are only-children and some are not-so it ls possible to separate
the birth-order effect from the only-child effect empirically, at least in
principle. But to do that we need to divide our sample further into three
groups, not two-only-children, first born among those with siblings,
not first born-and compare the outcome variable Y for the three groups.
If it is no-sibs that matters for the outcome, then Y for the only-children
group should differ from Y for the other two groups. On the other hand,
if it is first-born that matters for the outcome variable, then the difference
should be between the first born and others: Y should be the same
for first born rvith or without siblings, and differ for those who are not
first born.
The important point here is the demand placed on data by the need to
examine alternative explanations, even when we stratify on explanarory
variables. The more explanations there are, the more such comparisons
we need to make; the more comparisonswe make, the more finely we must
stratify; the more finely we strarifv, the smaller the subsamples upon
which our comparisons are made. The solution to this dilemma is a large
overall sample. That is what I mean rvhen I say that a larger sample has
greater "analytic power"-a larger sample permits more comparisons.
and with more comparisons you can investigatemore explanations.
Without the luxury of a large sample to provide the basis for one com-
parison after another, qualitative researchers very often must decide
which comparisons are strategic,and selecta sample accordingly. It helps
to stratify on explanatory variables and oversample the smaller popula-
tions within those strata, but a stratified sample helps only up to a poinr:
In the final analysis there are limits to the number of strata you can use
when the overall sample is small. As a result, qualitative studies generally
are not well-suited for omnibus tests of many.'different explanations for
some observed association. Qualitative methods are well-suited for pro-
viding thick description that can help place quantitative results in proper
context. Qualitative methods sometimescan also provide precise critical
tests-tests that often are difficult with conventional quantitative ap-
proaches. Indeed, through strategic data selection, qualitative approaches
often extend or correct quantitative studies, as rve discover subsequently
with examplesof rule 3, "Build reality checksinro your research."
I s M T I NT Nc F U LS o c rA LR rs s e n c n P o s s r err?
October 7,1903, is famous in aviation history for the tesr thar failed. As
news reporters watched expectantl,v,test pilot Charles Manly assumedhis
position aboard a curious-looking airship perched on a houseboat in the
Potomac River. To avoid fatal landings, the plan was to crash into the wa-
ter after demonstrating the possibilitv of human flight in a heavier-than-
air machine. As propeller wheels whirred a foot from Manly's head, a
mechanic cut the cable holding rhe carapult, and the airship tumbled ig-
nominiously into the warers of the Potomac.
That spectacular failure, along with a second well-publicized plunge
into the icy Potomac two months later, resulred in public ridicule of the
airship's designer,the distinguishedphysicist and astronomer Samuelpier-
pont Langley. Armed with a $50,000 grant from the ITar Department
and another $20,000 from the Smithsonian Institution (which combined
would be worth over $1.5 million in today's money), Langley had set out
to build a heavier-than-air aircraft piloted by humans. His highly visible
The First Rule . 27
-:: : : ,:-:i o n dat a by t he ne e dto failures added credibility to Lord Kelvin's famous claim, eight years ear-
: .i .: '.'.: stratifl.'on explanatory lier, that "Heavier-than-air flying machines are impossible."
-: : - : . :-.: ::u r e : uCh c om pa fi s o n s After Langley's firsr failed attempt a New York Times editorial had
! .r : ::--.... rhe more finelywe must this to say about the prospects of human flight ("Flying Machines Which
:- . ; : - .,,..c: th e s ubs am plesu p o n Do Not Fly," New York Times, October 9,1903)t
: i," : r :,r this dilemma is a large
.,r:.- - :.:'. :har a larger samplehas [k] might be assumedthat the flying machine which will really fly might
.:. -- : . : ; J: : : l!5 mo re COmp ar iSO nS. be evolved by the combined and contir-ruousefforts of mathernati
' -: - (rre exnla na tion s. cians and mechaniciansin from one million to ten million years. . . .
'. . :....
No doubt the problem has attractions for those it inreresrs, but to
: : :. ..;. rhebasisfor onecom-
'"; ; . - -: . : '..rr often must decide the ordinary man it would seem as if effort might be employed more
profitably.
!;
-i : :,.::rrle accordingly.It helps
::i-,-:.rie the smaller popula- In one of the ironies of history, as thesewords were being penned \Wilbur
: -': - ^ :.:: onlv up t o a p o i n t: and Orville'Wright were preparing their own launch in a remote area on
: r -..r-.r:lof strata you can use the coast of North carolina. The first manned flight did nor take place a
:. - --'--ri.rtilestudiesgenerally million years later, but just a little more rhan two months after the Times
::crent explanationsfor editorial. On December1,7,1903, Orville and'sfilbur took turns piloting
r : . - i: .;'e $-ell-suitedfor pro- a flying machine (that cost about $1,000 to build-$22,000 in today's
r : -: : .:..-:::ilatile resultsin proper money) over the sand dunes of Kitty Hawk. With no governmenrfinansial
:,: .;- .r.:r provide precisecritical backing and little fanfare, two determined bicycle builders from Ohio
: . , :. ":::uonal quantitative ap- had proved Lord Kelvin and the New York Tinres wrong,
:-z '.: :-: -:1.qr-ralitative approaches In academiathere are Lord Kelvins among us today who make similar
-* : : . .r:'.'.edis c ov ers ubs eq u e n tl y assertionsabout the impossibility of meaningful social research. Let me
- l.:- : : : ! (l V O U f f e s e a f c h . " be very clear: I am not talking about those who say that good social re-
search is difficult, or rhat social researchis often badly done, or thar some
questions about human behavior are inherently unanswerable. (I happen
to agree with all those assertions.)Rather, I am talking about those who
are skepticalabout the possibility of discoveringgeneralprinciplesof hu-
. :,,r rhe test that failed. As man behavior. In the view of such skeptics, either such principles don't
i::rles Manly assumedhis exist, or the principles are so "contextualized" that we cannot generalize
,:.J on a houseboatin the from situation to situation. Empirical nihilism of this sort tends to be most
\\'asto crash into the wa- prominent in schools of thought containing a "post" prefix: postmod-
, - - '_r. "'b'_
rl i 'eht -in
__
*a heav
' ier -th a n - ernism, postpositivism,poststructuralism,post-Kuhnian philosophy of sci-
:::: , -': '!rt irom Manly's head, a ence,and so on.
:- : : - : . ,: J the air s hip t um ble d i g - In its most extreme version, empirical nihilism in the social sciences
t- . denies the possibility of discovering even regularities in human behavior.
:-- i . . , :J ri ell- public iz edpl u n g e That position is obviously silly. consider the life insurance industry. Ac-
-_ i ;- ^PuuuL
, , k t ; ^rI. idic ule o f th e ru ari al tabl espresumeknow l edgeabout regul ari ri esi n death r "ur . r .Al-
'rr
-i
:: :t -. lstronomer SamuelPier-
'. - though we cannor predict when specific individuais will die, we do know
!-,: ::,tm the 'war Department that women tend to outlive men (in most Western countries anyway), that
-:
-:..ritution (which combined nonsmokers tend to outlive smokers, and that it's good to exerciseand eat
': - - I ^-^l^-, L^l
:r\)ne\'),
-----,\ Langleyhad set out
^^- ^,.-
your vegetables.or consider government efforts to monitor economic
:'. r'limans,His highly visible growth and control inflation. If the discoveryof generalprinciples beanng
28 . Chapter1
on economic behavior were beyond the ken of humans, the Federal Re-
serve Board would have no reason to meet to discuss alterations in the
prime rate.
Empirical nihilism is nor new, nor is it unique to the social sciences.
Before attempting to discover principles about the world of narure,
physicistsand other natural scientistspresupposethat observablenatu-
ral phenomena,such as motion, are real, and that they are arnenableto
investigation. The ancient Greek philosopher Zeno of Elea believedother-
wise. Zeno, like his teacher Parmenides,vierved the world of senseas an
illusion. To defend that view, Zeno used paradoxes to demonstrate the
impossibility of motion. One paradox can bv illustrated by imagining an
arrow in flight. At each mornent in rime, rhe arrow is located at a specific
position. In subsequentmoments, the arrorv is at rest for the same reason.
So the arrow is at rest at all moments. Thus if time is a continuous series
qf moments there is no time for the arrow ro move, and motion is impos-
sible. The paradores are challenging-thev have engaged philosophers
and otherssincethey were introduced by Zeno almost 2,500 yearsago-
and important, since they undermine the con\ienrional view that the sen-
sory world is real. (The arrow paradox and other paradoxes of motion
credited to Zeno are generallythought to have been resolvedby calculus
and Newtonian physics, although quantum mechanics has complicated
that resolution in some instances.)
Is the social world real? Social sciencer,r'ouldbe a strangechoice of vo-
cation for someonewho answered that question negatively.Yet if we press
the social contextualization argument far enough rve arrive at an appar-
ent denial of the social world. Substitute the term "context" for the rerm
"moment in time" and you see that the contextualization argument is
similar to Zeno's argument about the arro\\'. In effect Zeno viewed each
moment as a separatecontext and concluded that the physical world is an
illusion. If social acts are so radically contextualized that there are no
general principles to discover, then one u'onders what is the glue that
holds human societiestogether. The existenceof routine, predictable hu-
man interaction suggeststhat there is something "there" in the social
world, even as the existence of motion suggeststhat there is something
"there" in the physicalworld.
The best responseto empirical nihilism is to ignore it and do the re-
search. The aim of this book is to help both novice and experiencedre-
searchers"do the research." I have tried to make the book eminently
practical.I provide examplesof good social research,as well as examples
of the other kind. obviously I believe that meaningful social research is
possible-imperfect research, but research that can be valuable nonethe-
Iess.(I say "can be valuable" becausethe findings of social researchare
subyectto misuse.In the worst-casescenario,social sciencecould be used
The FirstRule . 29
-:::n.ut q,t he F eder a lR e - by authoritarian governments for the purpose of large-scalesocial engi-
- .;.rss :rlterationsin the neering, which generally has been a miserable failure.) The pitfalls of so-
cial researchmay be many, but so too are the potential benefits.The seven
: -- --.: i o t he s oc ial s c ien c e s . rules are intended to move us closer to that Dotential in our social re-
- , - - ..: :n e u' or ld of nat u re , searchprojects.
. . . : i : :h. r r obs er v ablen a tu -
: -- .--r: thev ar e ar nenab l eto
- : * :.. ,.'iE leabeliev edoth e r- Sul.uenv
: : - ::- .. or ld of S ens ea Sa n
. . 1---\r'i ro demonstrate the There should be the possibility of surprise in social research.Social re-
. .::.1. r eJ
bv im aginin ga n search requires that researchersremain open to new evidence,whether or
- : .'- .,,-- locatedat a specific not that evidencesupports their cherished theories or beliefs. This open-
, - -: :::: f or t he s am er ea s o n . ness leads to the possibility of surprise. Openness also makes research
. .r uor.rtinuousseries more exciting. If we know the results beforehand, what is the impetus for
. - ;. .:rd motion is imPos- the research?
': - ',,: :lrg.lged philosoPhers An important first step for effective research is choosing an appropri-
, -. - : :r ,,s r3. 500 y ear sag o - ate researchquestion-one where surpriseis possible.Researchquestions
-- :..... ::l vieu, that the sen- should be both researchable and interesting.Researchquestionsare unin-
-
1 - i :.:: :J r adox es of m o ti o n teresting when there is no possibility of learning anything new, lvhen
- r -:::-. r es oh' edbY c alc u l u s there is no perceivedrelevanceto theory or everydaylife, or when the an-
- :, -.,..ai.sharscomPlicated swer is significant only to a very small number of people.
Becauseimportant social issuesof the day typicaliy have been investi-
.- - i: -1itrange choiceof vo- gated in one form or another by social scientists,most often researchevi-
: - - .::.'.ti\.el\'. Yet if we press dence already exists on the question you want to investigate.In that case
:- .-.-.-...a .lrrive at an appar- you need to design your research to ensure that surprise is possible. The
Jrlnfext" for the term first step is to distinguish well-establishedfindings from the other kind. By
: : 1.:,.,1l1zi1tion
argument ls "the other kind" I mean findings that are mixed or inconclusive, or find-
:-- -' .:::;t Z eno v iewed e a c h ings that are new and have not yet been replicated in subsequentstudies.
. ,.: :re
-:: - . P hv s ic alwor ld i s a n Surprise ls possible in those instances,becausein those instancesthere are
- - '-, :-.'. :ze d t hat t ner e are n o lingering questions about earlier results. Hence your research,if propcrly
-,: 'i::: \\'ht1tis the glue that
executed, contributes to knowledge.
-i'. - -: : :, ,tltine,predictablehu- By contrast, if prior findings are widely viewed as settled, you con-
: "there" in the social tribute little or nothing by merely repeating the analysesof others. Instead
::r.rt there is something you must either challenge the earlier findings or extend/clarify them. If
there is a good reason to doubt the results of prior studies, then of course
:i n or e it and do t h e re - it is worthwhile to challenge them with new research. In that case,
: - :-. .,:Je and experienced re- though, the burden of proof rests on you to identify the flaws in pnor re-
: ' ::,:.{- the book eminently search that are serious enough to cast doubt on their findings (flaws that
: ::-:--.:.h. as rvell as examples you will correct in your research).The other strategy-to extend or clar-
.: r':.r:lrrgful social researchls ify earlier findings-can be implemented by examining results for a sub-
- : .:: nonethe-
-.rn be valuable population or for a different population or time period. New understanding
- :.:.:s of social researchare of old findings can also be gained by introducing mediating variables or
. . . ';ial sciencecould be used
moderating variablesor nonmediating variables (or some combination of
30 . Chapter1
StudentExerciseson Rule 1
;- - . . ,:-::rine. Then you Selecta
. ,', --rckresourcesto collect
. .i..;,r:c of a san-rple is not its
-i- -.r.,-.:ns am plesar e pr op e rl y
; . . . r-
The exercisesfor chapter 1 use 1972-2004 data from the General Social
: - - ..i:,rnsabout large PoPula-
,i- ,- -. '. ... 1-fonl,v a tiny fraction Survey (GSS).The GSS is a probability sample of noninstitutionalized
- : ::.:;i pl e s ar e t he s am efo r English-speakingindividuals, age eighteenand older, in the United States'
-: From 1972 to 1.994 the GSS was an annual survey (with a few years
\-: : : : : ::sear c h,howev er ,u s u -
- :.-. :r ..:mples.Stratifiedran- missed); after 1994 it has been conducted in even-numberedyears' The
r :. ,:'.:- populations might sample sizes of each wave vary from about 1,500 for the annual surveys
-r to about 3,000 for the biennial surveys.
r - - - .. .-. . :l r he aim is t o t es t a n
,..''.::. limit your ability to To do the exercisesbelow, go to the web site http://sda'berkeley.edu,
--1r.
click on "SDA archive," and follow the instructions below. The instruc-
:-: : : :: :r. Fc lrt his r eas on,q u a l -
t es t sof m u l - tions should generateall the statisticsyou need to answer the questions,so
: :: :.:--: '-r m nibus
t-
you should not need to do any calculationsby hand. Severalquestionsask
:: l-
about the statisticalsignificanceof associations.For theseexercisesuse the
.01 significancelevelas the criterion for statisticalsignificance,that is, when
the p-value is .01 or less,we consider the associationto be statisticallysig-
nificant. To assistyou in completing the assignments,variable namesare rn
boldface in the computer assignmentsin this book. For answersto selected
problems, go to http://press.princeton.edu/titles/85 93.html.
Note well: (1) The figures given below are for the 1'972-2004 GSS,the
most recent GSS data available when this book was written. If you are
working with more recent data, to replicate exactly the figures given be-
low you will need to use the "selection filter" in the SDA program to re-
move years aker 2004. (2) The instructions below are based on the
format used by SDA at the time this chapter was written, in 2006. That
format might change slightly in subsequentyears, so some of the specific
directions below might be out of date, but you still should be able to fol-
low the logic of the instructions to obtain the tables you need to answer
the questions.
The GSS asks the following question, denoted abany, about abortion:
"Pleasetell me whether or not you think it should be possible for a preg-
nant woman to obtain a legal abortion if the woman wants it for anv rea-
son?" [possibleanswers:yes, no]. This question is askedafter a seriesot
32 . Chapter1
"Run the table" should give you a 2x2 table with abany as the row
variable and sex as the column variable. Nore the legend for the table
under "Frequency Distribution." The numbers in your table should in-
dicate that there are 11,389 male respondentsand 14,7rs female re-
spondents, for a total N of 26,104 cases.Your table should say that
41. 5 per c ent o f m a l e re s p o n d e n rsa n d i 9 . 4 percenr of femarerespon-
dents responded ves, that a woman should have the legal right to an
abortion for any reason.You should also find that 55.1 percent of those
who agree (that abortion should be a iegal right for any reason) are
women, and 44.9 percent of those who agree are men (secondrow of
t able) .
:' :. ' :\amples: shouldabortion Now look at the figures under the heading "summary statistics." The
. J :: - : \\ .lnt any more children? statisticslabeled ett) R, Somers' d, gamma, taw-b, and tau-a rcfer to mea-
, , ,s o in c ont ex tth e
:: - . . . . : .-.;1 '31.?) sures of association,that is, they refer to how strongly thE tw<.rvariables
i: : :'- l.'galright to an abortion are related. In this casethe figures are close to zero) since the association
between sex and abany is small, as you can tell from the percentap5es.
- r.-t :- -: .rreabany and sex (male,
Question 2. (a) Is the genderdifferencein abany statisticallysignificant?
To determine this, look at the p-valuesfor Chi-square (labeled Chisq(P) in
- -r- -....rch ir . e, t"hen go t o th e the printout).p=.00 in the printout means that the p-value is less than
.0O5-which is highly significant. The smaller the p-value, the more con-
:--.: -:.' :-iecn. tvPe in abanY fot
fident we can be that an associationwe observe in our data is not due to
: : : - .-.:In lar iable,
sampling error.
;: rr: rr :-iou'n bor for "'Weight"
'::>-rm ples . " (b) Is the genderdifferencevery large?What doesthat suggestabout statrs-
l. i- ,
'.'. both column percent- tical significanceand substantiuesignificance(how large an effect is)?
: : :
(c) Are you surprised by the direction of the gender difference here? Ex-
:-. : - .: ^ ,-'res if t hey ar e no t a l -
plain.
- tl- ..-; :re d r oP - downm en u to
\--1L l)Lll) .
Question 3. Are women more likely (than men) to approve of abortion
for any of the specific reasons investigated in the GSS, such as when a
. gives 1'ou the wording
woman is pregnant as the result of rape? To find out, return to the cross-
tabulation program and replace abany with abrape as the row variable
:: (dependentvariable),then "run tables" again. To avoid confusion, this
:-: .,.:-nenu to indicatethat
time suppressthe row percentages(that is, ask only for column percent-
ages).Repeat using abdefect, abnomore, abhlth, abpoor, and absingle as
the row variable. \X/hat do you find?
: :. :re legend for the table Question 4. One thing we learn from these results is that, for Americans,
"
- -' r : :: : n vour table should in- the reason for an abortion matters. \il/hile most Americans believe that
: : -: : : -.:: ,rn d 14, 715 f em al e re - abortion should be legal for some reasons, such as when a woman's
- l: : ': ).:l r ta ble s hould s ay th a t health is endangered,more than half believethat there are conditions un-
- - - r r-r.ent of f em ale r es p o n - der which abortion should not be legal.
-:,- ;r...-:r.rvethe legal right to an (a) Use the appropriate percentagesin your seven tables to rank-
.. . i: . : rh ar 55. 1 per c entof th o s e order the sevenconditions here (pregnancy is due to rape, etc.) from
. :i. . . - rrght for any r eas on ) a re "strongest approval for legal abortion" to "least approval for legal
.*, -:-:::. .rre men (secondrow of abortion."
(b) Is the ranking exactly as you would have predicted, or are you sur-
prised by some of the results?
tr: .: : ' .:--1 1.5 per c entv er s us3 9 .4
Question 5. Now cross-tabulateabany (row) and race (column). The total
. : - --:'.:, ' 100 percentwithin cate- N should be 26,104 cases.
,:: i::.'.:: rffect abortion attitudes, (a) Are race differencesstatisticallysignificant?(Again, use p<.01 as the
cutoff point.)
.\ .:
-iitlq). (b) Are the differencesin the direction you expected?Explain.
34 . Chapter1
The GSS variable sexfreq asks, "About how often did you have sex dur-
ing the last 12 months? [not at all, once or rwice, once a month, 2-3
times a month, weekly, 2-3 times per rveek,4+ times per week]."
In this exercisewe use the cross-tabularion program to investigatethe
associationbetween marriage and frequencv of sex, then to investigatethe
association between hunting and sexual frequency. The first thing you
want to do is convert the GSS variable marital rvith five categories(mar-
ried, divorced, widowed, separated,never married) into a new variable
with just two categories,married and not married (which includes sepa-
rated couples).Let's call that new variable married. Return to the original
screenwhere you selected"frequenciesor cross-tabulation," but this time
select "recode" to create the variable married, as follows:
r Under "createvariables,"click on "recode variables."
r Type in married for "name for the nerv variable to be created."
. For "Replacethat variable,if it alreadvexists?"click on "Yes."
. Type in marital for "Var 1" under "Name of existing variables to
use for the recode."
r For row 1 under "Output variable,"ti'pe "1" for "valuer" "married"
for "label," and "1" for "Var 1." For the secondrow, type "0" for
"valuer" "not married" for "label," and "l--i" for "Var 1."
o Click on "start recoding." If you have done this correctly, you should
have 20,674 not married casesand 25,828 married cases.
we can also use the GSS to investigate the association between hunting
and frequency of sex. The hunting variable in the GSS is hunt: ,,Do you
(or does your [husband/wife]) go hunting? [respondent hunts, spouse
hunts, both, neither]." Repeat the procedures you used to recode marital
The First Rule . 35
- : :'- - ' :,rr "r'aluer" "married" Question 8. (a) Give two explanations for the association between who-
hunts and sexfreq.
. - : . - . r:-,rod r ow, t y pe " 0" fo r (b) Think about the nature of the two explanations you have given. Do
:- : " --i " fo r " V ar 1. " your explanations imply that the association is causal-that is, do your
. : : ": :.^.:S CoflectlY,You ShOuld
explanations imply that sexfreq will increase for people who take up
: - t -' -' ::--..i r ied
c as es .
hunting?
, : -: -,':-.-:ulat ion"s c r een ,a n d (c) If you were to test your first explanation, would the test include non-
:,i:::.: t-::.r sexfreq as the row mediating variables?Mediating variablesin a causal chain? Moderaring
- -:: : I :-,-ii<esure you focus on variables that condition the effect of whohunts on sexfreq? Reversecau-
-- : ::-::i ,1 qe5. O t her wis e ,s e - sation?Explain.
: : r : : : r.. \,l u r t ot al N S hO u l db e (d) \fhat about a test of your second explanation-would that test in-
clude nonmediating variables? Mediating variables in a causal chain?
-., - : i ,: . c,u don' t s eep- v al u e s , Moderating variables that condition the effect of whohunts on sexfreq?
Reversecausation?Explain.
r:: :i: .::i .erual frequencY-do
':- : - '-: I' r her em or e v ar i a n c e
.:
,: rhree sentences.
Is this
The SecondRule
LOOK FOR DIFFERENCESTHAT MAKE A DIFFERENCE.
AND REPORTTHEM
:.FILRENCE,
You CeN'r ExplerN A VARIABLEwITH A CoNsrRNr
- : ".-::: n Jifferences that make the twentieth century (fig.2.1)- The generalpicture is that of a secular de-
- j '::::t'tces afe and why they cline in fertility that was interrupted by a stunning increasein fertility in
'.:
1967. From 1.965 to 1'967 fertlltty in Romania nearly doubled, from an
- r-, i:.rrChfor differencesthat averageof about 1.9 children per woman to an averageof about 3.7 chil-
, im nt t r r anc e- a n d I dren per woman (Bradatan and Firebaugh 2007).
'i
; - - ..: - ..:e related,but they are How are we to accountfor the 7967 baby boom in Romania?Cultural
r -- - : ng, htlegsand lef t l e g s explanations aren't promising, since people'sviews of ideal family size dg
: .^: :nJ a sizablecorrelation not change overnight. Nor is it likely that the jump was due to changes
::: \ . ,l u'ould find a very big in Romania's population composition (for example, an influx of high-
- - ^ ::.i nr . B is ef f ec r sar e n o r fertility immigrants) since a nation's population composition usually does
not change suddenly either. In this instance' the most promising route is
-: -:i r:r. Ther e ar e ins t a n c e s to examine the historical context.'What eventswere occurring in Romania
: : .:.r' reasons.Yet in those at this point in time?
38. Chapter2
4.0
3.5
3.0
2.0
t.)
t.0
0.5
0
1956 1960 1965 1970 1975 1980 1985 1990 1995 2ooo
Figure 2.1. Total Fertility Rate in Romania, 1956-2000. source: Romanian
DemographicYearbook,2001.
From figure 2.1 we nore that fertility had declined sharply in the de-
cadebefore 1967. rn 1956 Romanian women were giving birth to almost
three babies,on average,over their childbearingyears.By the early 1960s
fertility had dropped below replacemenr level, meaning that Romanians
were not having enough children to replace themselves.Hence, as Ro-
manian government officials well understood, if that fertility rate contin-
ued the population of Romania would be destined to level off and then
decline.
It is not surprising, then, that the communist government took stepsro
try to boost the fertility rate. First a media campaign was launched in
scanteia, the official newspaperof the Romanian communist party. Tyo-
ically the first two pages of scanteia were devored to political atrd eco-
nomic news: foreign dignitaries visiting Romania, .r.*..o.ro-ic successes
of the Romanian communist Party, and so on. That focus chanqed some-
what in the summer of 1966, however, as articles abo.rt m"teinitv as a
woman's duty to the nation also began to occupy centerutag. in th.
newspaper.On August 4, 1966, Ursula Schiopu,a psychologist,argued
that "feminine charm is regeneratedby maternity" (s,canteii, Attgust 4,
1966, p.2). Two weekslater there appeareda first-pagearticle on schools
and children, and on August 21 another article about "children's love."
In a September19 article titled "Maternity's Advantages," a physician ar-
gued that pregnancyheals acne, eczemas,depression,and melancholy. on
september 23, women who ask for abortion were called "superficial" and
legislation was blamed for letting them obtain abortions so easilv.and the
The SecondRule . 39
In the example we have just considered, our aim was to account for the
1967 baby boom in Romania. In other words, we were trying to find
the cause or causesof an effect. Increasingly in social research,however,
we want to isolate the effect of a particwlar cause,We start witl-r the sause
and try to design an experiment or some other method to determine the
magnitude of that effect on some outcome variable.
This strategyis commonplacein medical research.Before a new drug
for the treatment of asthma is approved for the general public, cxperi-
ments are performed to determine how much effect, if any, that drug has
on asthma, and what side effects the treatment might have. Or-to use
an example that is discussedin detail in chapter 5-we might want to es-
timate the causal effect of smoking on lung cancer. Smoking is not the
only cause of lung cancer, of course. But there is not much we can do to
control many of the other causes, and smoking is something we have
l Over time Romanians figured out \"'aysto circumvent the restrictions-for example, bv
bribing doctors to say the mother's life u'as endangered, so a legal abortion could be
performed-so fertility began to track back down, as we see in figure 2.1. The abortion las-
was rescinded in 1990, and since then fertility in Romania has fallen to one of the los'est
levelsin the world.
40 . Chapter2
.i.:-:-,:- in the causaleffectof well could be a significant factor in domestic violence.) The point of over-
sampling is not to magnify a minute effect in the population, but to en'
I r::: r', ^. \lrhoughit would be sure that there is enough variance in the sample to provide a fair test of
-- - .: ':r--:l;rton, or dorneStiC vi- the effect of a cause.
: -: - : :iJ States,the most in-
':: :. : .,:.-.;n.rlrle to public policy
.. -'i : r-: ,f rrling to understand SrzEvpnsus Starrsrtcer SrcNrprceNcr,
-. - .:t i r v r in ir s ent ir e ty -
-: - - ' ' - '' . , 'f mA nVCA US eS O f, i n Before we can talk about the size of an effect, we want to have some sense
, : -,-- I : :nlt.h about-effective of how confident we can be that the apparent effect observedin our sam-
.-i : '. il: 'll a q o ne Caus ewell. ple is not due to sampling error. That of course is the purpose of conven-
. : - ' . .:,-:rrm iz et he v ar ianc eo f tional significancetests in the social sciences.ln practice, however, social
:' --: ' .',i a .t ninc r eas ev ar ia n c e scientiststoo often conflate statistical significanceand substantivesignifi-
-: .,: - . ::1: rhe targetpopulation
- cance,resulting in deemphasison the size of effects. Generally researchers
test the null hypothesis that the effect is zero. When the null is rejected,
;-.'j : -.: ',lusal Yariable,so we researchersconclude that there is an effect in the population. When thc
i -i j ,.: .;.r1e. To illustrate,think null is not reiected, researchersconclude that there is no effect.
',: , : , -,:: ro determine if there is Researchersoften appear to forget, however,that statistical significance
, :: .r. -\lthough cohabita- is a function of sample size as well as size of the actual effect in a popula-
-
: . - :ime heterosexualmar- tion. Thus a sample coefficient can be insignificant for two very different
reasons: becausethere is no effect in the population, or becausewe have
- , - : - :' . more than ten tO One
,r.nles nreL e rrn 9 ner - estimated it so imprecisely, due to a small sample' (This problem leads
: \: . : : : . l rcn i n a s im pler and o m some statisticiansto recommend the use of larger p-values with smaller
-: - : samples.)
- 'r-rb i t 'c oded 1 f or c oh a b -
- - . = '.r,)S 2. B y s am plinga n Even worse, when findings are statistically significant, significancetests
:- - I :.. \\'c r r iple t he v ar ia n c e sometimes are misused to blur important differences in the sizes of ef-
i fects. This blurring occurs when researchersfail to distinguish statistical
-: :::lling its variance,we
- . :: ..-.:.i.:tsthe consequences of and substantivesignificance,resulting in a flattened world where there are
only two outcomes,effects and noneffects.The world is flattened because
, --':- : -. .lrffcrence"you need to statistically significant effects are all treated the same-stupendous ef-
-. i : ::.,rr re s ear c h.B ec aus ei t fects, minute effects, and all effects in between are dumped into the same
i . : ': -:,t. rhe effect of a partic- basket. The reader wants commentary on the size of the effect. Vhat she
- . r: i-1mple.To test properly often is given instead is an asterisk indicating how strongly the sample
. : ::. '.,ru might need to over- data contradict the null hypothesisthat the effect is zero in the population.
- r - r -:. i . es pec iallywhen y o u r That practice reduces the effectivenessof social research, as McCloskey
and Zlliak 11996) and Zlliak and McCloskey (2004) have forcefully
: : .--:: !e of oversamplingis to noted.
:, .,.:.. -:t losing sight of the ac- So the first point I want to make regarding the size of effects is "Talk
about the size of effects in your report." In most contexts such advice
- :. ::: >:,ries.cohabitationcould would be gratuitous, but it seems necessary here, given the apparent
: : . , : " ::rc L' nit ed S t at es ,s i n c e trained incapacityfor describingeffect sizesin some fields of social science.
'We return, then, to the question "How big is big?" It is useful to dis-
r : r - - - .',r rhrt. t ln f ac t , t hou g h .
: - . :r-so cohabitation very tinguish relative size and absolute size (although the two are related of
42 . Chapter2
course).In the caseof relative effect size the other causesprovide a ready-
made referencepoint. How large is the effect of X on Y relative to the ef-
fects of other causesof Y? In the caseof absolute effect size, researchers
must sometimes search outside their model for an appropriate yardstick
or referencepoint-some sort of benchmark to assistintuition. This is not
always easy to do, and perhaps for that reason it is not always done in
social research.
In the case of relative size we are interesredin which predictors have the
strongest effects. What are the most important determinants of one's so-
cioeconomic status (SES)in the United States(Jenckset al. 1972)? Educa-
tional attainment? Intelligence?Your parents' SES? Racial or ethnic
identity?Ambition? Hard work? Luck?
Determining the relative sizes of effects is particularly important in
testing theories sincein social sciencetheory very often it is effect size that
distinguishesone theory from another. There are, for example, numerous
hypothesesand theoriesin social sciencethat attempt to explain why some
countries are so rich and others so poor. Sometheories emphasizenatural
resourcesor geography, some emphasizeculture, some emphasizepoliti-
cal systemsand the importance of appropriate policies, some empha-
size the pattern of technology diffusion, and still others emphasrze
international relations or colonialism or exploitation. These causes are
not mutually exclusive, and cultural theories of developmentdo not say,
for example, that political systemsdon't mamer at all-just that culture rs
primary. Thus tests of theories very often are tesrs of the relative sizesof
effects.
It is not surprising, then, that much of the empirical work in the social
sciencesturns on the relative importance of causes.\X/hat ls surprising is
how frequently one finds a disjuncture between theory and findings in re-
searcharticles, with the findings secrion giving equal billing to all statisti-
cally significant effects, as if all statistically significant effects are the
same.But size matters, and effective researcharms readerswith the infor-
mation neededto distinguish bigger effects from smaller ones.
There are two situations to consider: the case where the explanatory
variableshave a common metric, and the casewhere they do not. It is eas-
ier to compare effectswhen the explanatory variableshave the samemetric
(dollars,years,etc.). Supposewe want to know, among married couples,
whether family size (the number of children a couple has) is linked more
closely to the number of siblings the mother has or the number of siblings
'We
the father has. might hypothesize,for example, that individuals who
The SecondRule . 43
a :. : :.-:: -al rs espr ov ide ar e a d Y- grew up in large families tend to have more children than individuals who
grew up in small families, but that the family size experienced by a
:- : r -: -\ o n l' r elat iv e t o t h e e f-
. : :,- a -,.:; eif ect size,researchers woman in her youth is more consequential for her fertility than the fam
' ,i:' r : .rr appropriate yardstick ily size experiencedby her husband.
'With
- j-. appropriate data on completed fertility for couples,we could test
-r.i ri t int uit ion.T his is n o t
fr ' : . : - :: i 5 not alway s don e i n the hypothesis by regressingnumber of children for a couple on number
of siblings in the wife's family and number of siblings in the husband's
family. Because the explanatory variables have the same metric, it is
straightforward to compare their effects. If size of the wife's family tends
' .i '.i t)\ -\lE r nr C to be more consequential for the number of children a couple has, then
the slope for the variable wifesibs should be steeperthan the slope for the
,..rich predictorshave the variable husbandsibs.
'-: i;rerminants of one's so- It is useful to think more concretely about how we would graph the re-
i-:-..kset aL l'972)? Educa- sults in two-dimensional space. Although generally we need a separate
- :.' SFS ? Rac ial or et h n i c graph for each explanatory variable, the graphs can be superimposedwhen
explanatory variables have a common metric. Here, for example, the
rr:-i: - :.liticularlY imPortant in graphs for B*r, dfld Ft,u,bo,dcan be placed on the sameX-Y plane because
'. : the X-axis iJ the same for both variables. By superimposing graphs and
:':'. rrftenit is effect sizethat
comparing the steepness of the slopes,we can seeimmediately which vari-
l: : -: .,1 :.i o r ex am P le,num e ro u s
. , , L . , SOme
^ - . * l ^ : - why able has the greater effect.
: :: -:,:-,::::lpt tO explain "^-.
-': ::i()ries emPhasizenatural The issue of which graphs can be superimposed and which cannot is
'r :
:: , - :,.:J. some emPhasizePoliti- central, then, to the comparison of effect sizes. Consider a different sort
:r: : : ,.-:. --- ^,-l;-i-" ies ,s
polic "^-o em p h a -
om e o-^ho- of comparison, one where we compare the effect of the same X across
-. ,.-.i sti l l ot her s em ph a s i z e different groups. \Wemight want to know, for example, whether the effect
-:
: : , : . ::.]ti o n.T hes e c aus e sa re of religiosity on altruistic behavior differs across religious groups (for ex-
ample, Christians, Jews, Muslims, and Hindus), where religiosity is mea-
-: : :: : jeveloPmentdo not sa5
'i --rir:i ,..1ell-just that culture is sured by an index based on frequency of attendanceat religious services'
:: ::: i::!i of the relativesizesof participation in other forms of religious activity such as prayer, the inten-
sity of beliefs, and so on. The key here is that X is the same' so the X-axis
: :-: :::rl i ri ca l lv or k in t he s o c i a l is the same, so the graphs can be superimposed. It is straightforward to
-.-. ' - .:...1\. V ' har ls s ur pr is i n gi s merge the separateX-Y planes (one for each of the religious groups) and
:,:: i ::r iheorv and findingsin re- compare the size of the slopes,even if the units of measurementhave no
- : : rquai billing to all statisti- clear intuitive meaning.
-
. :. . -; . . . r:g ni fic antef f ec t sare th e As a third example, suppose we want to know which has the greater
'!(r:-:-. ...:::rs readerswith the infor- effect on a woman's fertility-her religiosity, or the number of siblings
:: , . . : ,:-.sma llerones . she has. In that casewe could regressfertility on religiosity and number
:: : : :,.-.e il'here the explanatory of siblings, but the slopes would be more difficult to compare. The two
,-: -,:.: .'.irerethev do not. It is eas- X-Y planes could not be merged becausereligiosity and number of sib-
:: : . .- ...:'i chav s er he s am em e tri c lings are not measuredin the same units.
r i r.r ',.,.among married couples, As these three examples demonstrate, one way to think about the size
: ::- .: ;Ouple has) is linked more comparison issue is to think about how to combine separateX-Y planes
:::: :-.-1! or the number of siblings (one for each of the explanatory variables)into a common plane. Because
: : :\..:.:tple.that individuals who the outcome variable Y is the same for all Xs, the Y-axis is the same for
44 . Chapter2
each of the separateX-Y planes, so the metric for Y doesn't affect com-
binability. But the metric for X does. S7herethe metrics differ, we must
calibrate the different merrics to compare the effects. That is the subject
of the next section.
Sometimes you find yourself in the happy circumstance that your ex-
planatory variables have the same merric, so you can compare the coeffi-
cients directly. In the caseof longitudinal srudies,for example,the variables
might all be expressedas growth rates. or perhaps all the explanatory
variables are expressedin years, or in dollars, or as a simple count, such
as number of siblings.
Very often, however, explanatory variablesdo not come ready-madein
the same metric. In that caseyou must calibrate the various metrics of the
explanatory variables to gauge the relative sizesof the effects. In the case
of policy analysis,for example, ir mav be possible ro convert variables to
a common standard by asking how much it costs to change each of the
variables by one unit.
Perhapsthe most common practice, though, is to calibrate metrics by
converting them either to standarddeviation units or to percentagechange.
The first strategy results in coefficientsrhar are called standardized coeffi-
cients, and the second results in coefficienrs rhar reflect elasticities. The
approachesare somewhat similar. In rhe caseof standardizedvariableswe
ask how many standard deviations we expect an outcome variable y to
change for a one-standard-deviationchange in an explanatory variable X.
In the case of elasticitieswe ask what percentagechange we expecr in y
for a 1,percent change in X.
For the sake of illustration, I will focus on standardization. Because
standardizedvariableshave a standard deviation of 1.0, a one-unit change
in a standardized variable refers ro a one-standard-deviationchange in
that variable. Thus if all our explanatory variables are standardized, the
slope for any given explanatory variable reflects the expected change in
the dependentvariable given a one-standard-deviationchange in that ex-
planatory variable, controlling for the other explanatory variables in the
model. In this way we can compare the size of effects directly by compar-
ing the sizesof the slopes-assuming it makes senseto think of standard-
deviation changesas having equivalent theoretical meaning for the Xs.
Standardization is common in studies where variables lack a natural
metric. such variables are commonplace in anthropology, education and
testing, evaluation studies, psychology, political science,and sociology.
The SecondRule . 45
. tor Y doesn't affect corrr- Measures of individual traits such as altruism, intelligence, self-esteem,
::rL'mctrics differ, we must authoritarianism, prejudice) tolerance' risk-taking, and religiosity' as well
Jflects.That is the subject as national traits such as degree of political freedom, very often are in
dexes that are constructed from the results of test items or from the rat'
ings of experts. In such instances the common unit of measurement
acrossthe indexes is the standard deviation of the tests or the ratings'
' 1-1 1a DLtl By using standard deviations as the unit, with standardized variables
we in essencecompare effects on the basis of rankings on diffErent vari-
ables.Where distributions are the samefor explanatory variables-normal
i: : : ' jt:aunlstan ce th at you r ex - or approximately normal, for example-a one-standard-deviationchange
:r i. . . ),r can compare the coeffi- in one X is, in terms of change in rank, comparable to a one-standard-
i i. - :.::. frtr example, the variables deviation change in another X. On the basis of standardizedscores,then,
r::. : :::haps all the explanatory we can compare the relative ranking of individuals across variables. We
' : .::.. ,i as i1 simple count, such can ask, for example, whether people who score high on a measureof re-
ligiosity also tend to score high on a measureof tolerance-which is what
r .:. j , iror come ready-made in we mean when we say that there is an associationbetween religiosity and
. .r -:.:..:' :he r-arious metrics of the tolerance.
:: Introductory statistics textbooks often warn students not to confuse
:: : -: - a to convert variablesto the association of X and Y with the effect of X on Y. That warning bears
-- - - : , i i s to c hangeeac h o f th e on the discussionhere becausethe regressionof one standardizedvari-
able on another yields the measure of association r. That's not what we
:. :- -, -... :s ro c alibr at em et r i c sb y want when we are comparing effects. As a general rule, then, standardi-
:: - - :'.:: or to percentageChange. zatron is not a reliable way to solve the different metrics problem when
: : - ,, : -: ::,lIed standardized coeffi- comparing the size of effects. Other methods should be used where pos-
-- :-i: ::-..lireflect elasticities.The sible.
.-.: - .,.: : .tarrdardizedvariableswe Consider a simple example. Supposewe collect a large random sample
: :', ::-: -rr1outcome variable Y to that includes data on individuals' earnings (in thousandsof dollars), edu-
-:- ' rr erp lanat or yv ar ia b l eX. cation (years completed, 0 to 20), and background socioeconomic status
- -.
: ': - - - .:.:-s eh. r ngewe ex pec ti n y (an index of the socioeconomic status of the respondent's family of ori-
gin, with values ranging from 0 to 100). Supposefurther that the regres-
: ,,. . :: sr.lndardization.Because sion of earnings on education and socioeconomicbackground (SES)yields
-: ..:. :r oi 1.0, a one-unit change this equation. where standard errors are in parentheses:
, -.-.:-'-:-,.l.rrd-deviation change in
= 2.0 Education+0.4 SES+ a.
Earnings (2.1)
: :-. , .:.---i.les are standardized,the
- : . : : . :,:. rh e ex pec t edc han g ei n (0 . 1 ) (0 . 0 2 )
:-i . : - -'-.': i r.rrr onc hangein t ha t e x - From thesefindings it isn't obvious whether education or SESbackground
:-.:: :rplanatory variablesin the has the greater direct effect on earnings. The issue turns on the calibra-
: :.:: : cffectsdirectly by compar- tion of SESunits with education units. If we say one SESunit equals one
i r-=.lr:: ienseto think of standard- education unit (one year of education), then we conclude that educarion
I - - : - . - .'tl me : lninqf or t he X s . is five times more important than socioeconomic background since the
:.;: ',.,:::revariableslack a natural education coefficient is five times larger. The claim of equivalent units is
-1.,^^ ^r ^- , . - , luc ar io na n d problematic, though, becauseSESranges from 0 to 100 whereas educa-
r. : ,.::ical science,and sociology. tion ranges from 0 to 20. Some might sa5 then, that we should compare
46 . Chapter2
individuals at the highest and lowesr levels on the two variables. Using
that method, we find that education and socioeconomicsrarusare equally
important, since the predicted difference in earnings is $40,000 more for
those with 20 years of education as opposed to those with no education
(controlling for SES),and $40,000 more for those in the highest SES
category as opposed to those in the lowest SEScategory (controlling for
education).
The problem with using the range for comparison is that we don't
know how realistic that comparison is. Perhapsmost individuals cluster
tightly in the middle of the SES distribution, with no one in the highest
and lowest categories of SES, whereas individuals are more evenly dis-
persed across years of education. Standardization takes into account this
difference in dispersion. In other words, by standardizing variables we
obtain summary coefficients that take into account both the steepnessof
t he or iginals l o p ea n d th e d i s p e rs i o no f e x p l anaroryvari abl esi n the parti c-
ular sample that we have. The dependenceof standardizedcoefficients on
sample variances is important to nore, since that dependencecan affect
the generalizability or external ualidity (Campbell and Stanley 1966) of
your results using standardized coefficients. The issue arises when vari-
ancesin the sample deviatefrom variancesin the population to which you
want to generalize.
Of the three bad options above-compare the unstandardized coeffi-
cients for education and income, compare predicted Y-valuesusing the
high and low valuesof education and income, and compare coefficients
for education and income as srandardized variables-the comparison of
standardized coefficients is the least likely to mislead. Generally, though,
there are better options, and you should work hard to find them. Where
a common metric is not readily available, you should attempt to supple-
ment your discussionof coefficientsize with a more targeted discussron
that employs concretecomparisons.That is the topic of the next section.
:a: t\\'o variables.Using for the income and marital status dummy variables, we compare the dif-
. :-()micstatusare equallY ferencein the expectedvalues (means)of Y for the statuses"narried" and
(n,
:.r:. ...:s\ c r n n n n morefor
. - 540,000
- -. rs ^n,o "not married" with the difference in the expectedvalues of Y for the sta-
tuses "above-averageincome" and "below-average income."
- . ' , -')i e in t he highest S ES Dichotomizing a continuous variable to compare its effect with rhar of
',' .-: r : ) ( c ont r ollin gfo r
--'.I.g or \ ' a true dichotomy is a simple example of a more generalstrategythat could
be called substantiueprofiling: the translation of model coefficients into
: - , .:-r-rrisonis that we don't substantivelymeaningful results by repordng predicted Y-values across a
, : . - -. . .! :ros t indiv idualsc lu s te r range of strategicX-values. Effects of two variablesare compared by not-
. .,..:::'r no one in the highest ing the size of the differencesin the predicted valuesfor Y at the values se-
:*,rls are more evenly dis- lected for each of the Xs. Profiling in this way gives substantiveflesh to the
: :' rekes into account this coefficientswhether or not the variableshave a common metric.
. . .- r.- r : - : - ^ ,v .dL
^-rdu
: ^Ll e sw e The use of substantiveprofiling for determining the relative sizesof ef-
-.:..U.1LUt41116
.-- -.:rrboth the steepness of fects is most effective when you can defend your choice of values for the Xs
- -', . ar iahlesin t he pa rti c - as representing comparable differences on the Xs. Most often the com-
: -:,.:-.Jrrdized coefficientson parison is somewhat loose, since it is difficult to determine precisely what
: .'.,..rdependencecan affect differences are comparable when variables are measured in dif{icult-to-
- -:'.j l a nd S t anley1' 96 6 )o I calibrate metrics (for example, years and dollars). An inexact comparison
. .: t::u e ar is eswhen v a ri - generally is better than no comparison at all, however' sincethe aim of the
: - . :opulation to which You exerciseis to assistthe reader'sintuition in gauging which effects are the
larger ones.
:.r; unstandardized coeffi- Probably the most common example of substantiveprofiling is the sim-
,i :cte d Y - v aluesus in g th e ple comparison of differences in means (for categorical Xs) or expected
.:rd compare coefficients values of Y for strategicvalues of X (for continuous Xs)' Imagine a regres-
: ,-.i',es-thecomparison of sion analysis of annual salariesfor university faculty where we find that
::--.'e.rd. Generally,though, the coefficientfor gender(a dummy variablecoded female=1) is b=-1.0
. :,rrd to find them. Nfhere with a standarderror a=0.1. If salary is coded in thousandsof dollars,
>:rruld attempt to supple- and we have controlled for other determinants of faculty salary such as
. :tore targeteddiscussron academicfield, researchproductivity, successin obtaining researchfund-
-,,pi . of t he nex t s ec ti o n . ing, teaching evaluations, experience, and so on, then we can say that
"Other things equal, faculty salariesare on average$1,000 lessfor women
than for men."2
]. L O \ I P A R IS ON S It is hard to evaluatewhether a gender gap of $1,000 is big or small in
-
rhe absenceof context: $i.000 compared to what? If the issue is the size
:. :r3trics is informal or im-
: llrze a continuousvariable 2ln many instancesa great deal of information can be imparted very simply by translat-
'We
: :-'. . . -:chas marriage. might ing results from statistical language into substantive language, as in the previous sentence.
- r r.ri: s e has a s r eate re f- This translation is easyrvhen the dependentvariable is already measuredin some readily un-
: .:-r:\\'eJ. rve might first create derstandablemetric, such as dollars or percentages.Someresearchersmay hesitateto include
such translations in their report for fear of insulting the intelligence of their audience.That
: : - -e ri ith above-average in-
point may be valid when the audience is statistically sophisticatedand gender is not a focus
-- ,:e. and regresshappiness
of the analysis.If gender is a focus of the study, however,it is important to state the effect in
: : :-,::::ri status.(A dummy vari- plain English, if only as a starting point for a discussionof whether the gender effect is large
-: - .:. ;omparing the coefficients or small.
48 . Chapter2
::::- : ::t- other determinantsof effects while missing the five-hundred-pound gorilla in their midst. Even
seasonedresearcherssometimesmake that mistake. In fact, "missing the
i; : .... :,,,..the effectof publishing five-hundred-pound gorilla" was a complaint I received fairly often from
',..:-: : . l: l.lrger or smaller than re v i e w ersduri ng my sti nt as a i ournaledi tor.
A final point: It is possible to miss the interesting story by focusing lust
- - -,.'.: Hou large is it compared
''r' r:i-l: : :: .ompared to the effect on means or on predicted Ys more generally. Sometimesthe "differcncc
:- - : -r; ro I humanitiesdePart- that makes a difference" lies in dispersionrather than in averages.Consider
the question of whether working women are happier than housewivestn
': - i*:. :..J.' of t he $1, 000 g e n d e r
.: : : - ,-: r,rP to gaps for other di-
::::- -: -'-::.1.
een associateprofessors B ox 2.L
' -.\,l- ner r m ent s an d fa C -
Examples of Telling Verbal Comparisons in Gauging Effect Sizes
-,1 . .- :: riessorson averagemake
. 'r. - : ::rs differenceshrinks to
The following claims-accurate or not-provide good exarnplesof
: : :.:..:,ir productivity, teaching comparisons that are memorable becausethey help readers gauge
.:-. i- -,-::": unaccounted-fOr gender the size of observedeffects.
i'f :: - :lr unaccounted-forgaP in
o C)n the effect of religious attendance c-tnlife expectancy. On
::.. .. -r:re1 let the reader decide
i:: -:'-r :1rr. evenwith all our con- the basis of longitudinal data from a national sample of the
': j " ,.: ::rcldeldictate that average U.S. population,Hummer et al. (1.999,p.277) find substantial
:- i- .: : ,r humanities faculty by differencesin life expectancy by religious attendance:"For the
::: :,: .. -1 percentas large as the overall population, the life expectancygap [at age 201between
' : : .-'..::rering versushumanities those who attend more than once a week (62.9) and those who
never attend (55.3) is over sevenyears, similar to the female-
male and white-black gaps in U.S. life expectancy."
-: : rhe gendergap, we com-
. On gender differences in the effect of marriage on life ex-
: : - . ...lLlesof Y for selectedval-
:i 'r rhe regressionmodel) for pectancy. "Other factors constant, nine out of ten married
- - -'c di i ier enc ef or eng i n e e r- women alive today at age 48 would still be alive at age 65; by
-. . ..,,i rh th e inc om e dif f e re n c e contrast, eight out of 10 never-marriedwomen would survive
:-..\\ e can benchmark using to age 65. The corresponding comparison for men reveals a
::: ,:sk. for exarnple,how the more pronounced difference: nine out of ten for the married
:-: i:nLler gap in relatedfields, group versussix out of ten for those who were never married"
, --:h school teachers.Or we (\Waiteand Lehrer 2003, p.256).
:: -:::.s todal' comparesto the . On the effect of race on employment Using an employment
audit where job applicants were similar on other background
. . :: characteristics,Devah Pager (2002) found that callbacks were
- rre\t for readersto deter-
r . i: i: - r >!-eex am plesin box 2 .1 ). much higher for applicants without felony convictions. Among
:,:l -:. :r,ir]\' researchersappear to both white and black applicants, a felony conviction reduced
:: - . :l,.ri' often meaningful com- one'schancesof a callback by one-half or more (from 34 per-
- cent down to 17 percent for white applicants and from 1,4 per-
- :. - .:- rssueis not just that re-
i: :-' rhat will leave a lasting cent down to 5 percent for black applicants). As expected,
- :::
-.:: :- :-:-.rbsent the disciplineim-
-r:: - :: hecome firated on small (continued on next page)
50 . Chapter2
(continuedfrom preuiouspage)
CiS . a biennial survey of Yet that comparison masks a telling difference in happiness between
. .,-,qcther, how would you housewives and working women. Compared to women who are in the
', rrll i:rfevery happy, pretty paid labor force, housewivesare significantly more likely to report being
' "pretty happy" as 2, either very huppy or not too happy: 3.1 percent more housewivesreport
-.5-i.
i:ii of American adults rs being very hnppy (35.3% versus 32.2%) and 3.4 percent more house
. rui closer to the former. wives report being not too happy (13/% versus10.0%). As a result, the
i*: \\'orlen who work full- cross-tabulation of happiness by housewife status yields a Chi-square
in averagehapprness: value (a test of statistical independence)that is highly significant, indicat-
-:.ice
L v for housewives(seethe ing that happinessfor American women is nol independent of their loca-
tion in the workplace versusthe home. Here, however, the difference that
makes a difference is not the averagelevel of happiness,but the uariance
on happinessin the two groups.
:,:tre,lfrom preuious page)
The moral is that good social research looks at r,ariancesas weli as at
means.And in presentingfindings-whether basedon variancesor means-
.:rplovment prospects good research most often relies on pictures as well as on words. That is
. - pcrc entc allbac kr a te th e topi c of the nexr secti on.
. i 'rqust hc l4 per c e n t
: :'' :!: J s onv ic t ion.I n t h i s
: :-;i s of a c allbac k m o re Vlsuer PnnsENterroNop REsurts
.-:;.-,it'r,tncbisement in the A good graph is often worth a thousand words of text in a research arti-
:.., n. and ex-felonsshould cle. Graphic displays are pictures of numbers. The aim is to make large
:'.:J ir.rterms of civil rights data sets coherent by revealing the data without distorting what the data
.- ,.\'.\'er!the issuehas pro- have to say. Edward Tufte's The Visual Display of Quantitatiue Informa-
.-r::on as a whole. On the tion (19831 is a landmark work for presenting data graphically, and it is
-:-:.:istics, disenfranchised worth quoting from the book's introduction (p. 8):
:- rc.rvilr for Democratic
'. )ie (Uggen and Manza Data graphics visually display measured quantities by means of the
combined use of points, lines, a coordinate system, numbers, symbols,
:'.. -rnd er-felons currently
words, shading, and color. The use of abstract, non-representational
:-: :::rg age population, and
pictures to show numbers is a surprisingly recent invention. . . . It was
: : .-:iit ten presidential elec-
- not until 1750-1800 that statistical graphics-length and area to show
: ::; i'oting-age population,
'. :.-..ibeen a decisivefactor quantity, time-series,scatterplots,and multivariate displays-were in-
vented, long after such triumphs of mathematical ingenuity as loga-
:^ ,',\'flbr-the projectionsof
rithms, Cartesiancoordinates,the calculus,and the basicsof probability
.:.,-::George\7. Bush would
theory.
.::; rhus the election,if dis-
,,:: io I'ote in Florida; and "Excellence in statistical graphics," Tufte writes (p. 13), "consists of
: . : -:r(rns(w on by Dem oc ra ts complex ideas communicated with clarity, precision, and efficiency." The
::::. :ispectively)might have best way to make the point is through examples. Box 2.2 provides two
:: .="els of felon disenfran- examples focusing on displays depicting trends in American society (see
Tufte for other examples).
52 . Chapr er2
Box2.2
of Trends
Visual Presentations
o l5O
a'-
o 1nn
E
c I
l
f
F
50
I L
nr r '
" 1930 1940 1950 1960 1970 1980 1990
1930-199-5.
Example 1. Prison Adrnissionsby Race in the United S rares,
8'
Sour ce: http://purl.access.gpo.gov/GPO/LPS2026
Porrcy IupoRreucE
rTo be
sure, a ban on vehrcular travel
*ourd nor save 42,000 rives, since
would find alrernative means of t.rnrpn.tutinn Ame.cans
;;;l* carry the risk of cleath. The ba'
54 . Chapter2
There is another reason why small effects can be important for policy:
Sometimes major determinants cannot be manipulated, or they are un-
known, so we focus policies on the small effects that are within the reach
of policy and that we know something about. Examples are commonplace.
'We
know, for example, that many diseaseshave a genericcomponent that
we cannot do much about. Hence policies focus on what we can more
readily control, such as the promotion of health pracricesthat are known
to foster good health and reduce the risk of disease(exercise,eat your
vegetables,don't smoke). We promote rhese practices even though in
some instancestheir effects are relatively small compared to one's genetic
endowment. Consider physical exercise.There is some debate over jusr
how much one's life expectancy is increasedby regular exercise-the an-
swer might be "not much." Yet the effect, even if small, is important,
since people desire long and healthy lives, and amount of physical exer-
c is eis s om eth i n gw e c a n c o n tro l .
would no doubt dramatically reduce the number of miles Americans travel, however, and
that alone shor-rldreduce the number of deaths.In any case,the point is that there are limits
to the price Americans will pay ro reduce deaths.ril/e are not u'illing to ban cigarette smok-
ing, for example, eventhough we knou'that a smoking ban would improve the overall health
of Americans and savethousandsof lives each year.
The SecondRule . 55
: ::: - .: , ,rn be important for policy: affected by how much money you make relatiue to your peers. In othet
: r: -- r.:tipulated,or theY are un- words, income gains produce no gains in overall happinessin a society be-
: . :-. : -:s that ar e wit hin t he r e a c h causeour standardskeep rising as we try to "keep up with the Joneses"'
::" -:. i remples are commonPlace. This argument has profound implications. In a rich country that is
t::: 1.,:- .: I geneticcomponentthat growing richer, individuals become engagedin what Frank (1"997) calls a
' ;consumption arms race" akin to a military arms race-That is, if your con-
- :i .-ilSOn What we can more
'':-'. 11 frilctises that are known sumption lowers my happiness,and incomes are rising, then I must con-
: - - . -rrsJs e ( ex er c is e.eat y o u r sume more just to keep up, resulting in a never-ending"hedonic treadrnill"
:: :-: : : rra cti c es ev en t houg h i n for individuals (Layard 2005, chap. 4). This treadmill could haveperYerse
;- . ..:'-,,. Jt)mpxredto one'sgenetic effects for society in generalas well as for individuals, since we could end
.: l:.: . ls some debate over just up consuming more than is socially optimal.
...' r'. reqular erercise-the an- To test the hedonic treadmill theory with data at the individual level,
: , i. : ' ,:r1 i f s m all, is im porta n t, Laura Tach and I analyzed survey data for over sixteen thousand respon-
.. .. ,- -).lrolrntof physical exer- dents in the 1'972-2002 General Social Surveys (Firebaugh and Tach
'We
2005). examined responsesto the generalhappinessquestion in the
GSS ("Taken all together, how would you say things are thesedays. . . .").
Note that the preliminary statement "Taken all together" invites respon-
dents to think of happinessin terms of a durable trait rather than a tem-
porary emotion or mood-which is what we wanted, since our focus was
..: : :-.: lrrtplicationsfor theorY as on happinessas an enduring individual trait rather than a temporafy state
-:. r : :\.lrlple, is fairlY modest in o, of course,it might be argued that individuals don't really know
::: - r :'. happinessis nonetheless whether they are huppy or not-is self-reported happinessa valid indica-
-ood.
-:. ::: :ssumed that income has tor of one's true feelings?A number of studies have addressedthe issue,
-- -. - .::-i .tetion or haP P ine sos n e with the weight of the evidenceindicating that a respondent is very often
:: : :---.:"\'hat haPPierthan Poorer in the best position to judge his or her own happiness(Konow and Earley
. : - -r.i:J articlethree decadesago, 2003 ).
,:: :. :tomic growth imProve the Becausethe hedonic treadmill requires a specific type of relative in-
.: '. : '...,,rksi nc e,E as t er lin( 1 9 7 4 , come effect-one where keeping up with the Jonesesmeans continually
--:- .r-,,.\t not in rich countries'" In increasing one's own income, becausewe can be sure that the Jonesesare
- -^,: \\est is not being translated increasing theirs-we used age as the basis for our peer groups. Age is
:r:riss. as demonstrated,for strategic for testing the hedonic treadmill hypothesis, since average in-
:: : ... -.'r'elof haPPinessin JaPan comes in the United Statesrise over most of the working lifespan. So as
-.::: r-.a(lmes have grown substan- Americans we must continually increaseour income to keep up with the
Joneseswho are our age peers.
--.-:. :r in hts book LuxurY Feuer: Do, then, Americans compare themselvesto other Americans the same
:.- r-.-i -'r99t. There ls an income ef- age, resulting in a hedonic treadmill as individuals try to keep up eco-
. : . : : . , : i nco m eef f ec t :HaP P i n e s si s nomically with their same-agepeers?In line with the treadmill hypothe-
sis, we found that the higher the income of others in one's age group' the
lower one's happiness,other things equal (we controlled for the individ-
.. \mericans travel, however, and
ual's own income, age, physical health, education, marital status, and so
.:. rhe point is that there are limits
:..,,rii rlling to ban cigarette smok-
on). The relative income effect we detectedwas modest, however. Health
: ri ould improve the overall health and marital status have much stronger effects on happinessthan relative
income does (at least as we were able to measurerelative income).
56 . Chapter2
CoNcrusroN
The exercisesbelow use 7972-2004 data from the General Social Survey.
The GSS is a probability sample of noninstirutionalized English-speaking
individuals, age eighteen and older, in the United states. Fro^ i972 to
1994 the GSSwas an annual survey (with a few years missed); after 1994
it has been conducted in even-numberedyears. The sample sizes of
each
wave vary from about 1,500 for the annual surveysto about 3,000 for the
biennial surveys.
To do the exercisesbelow, go to the web site http://sda.berkeley.edu.
The instructions below should generateall the statisiics you need ro an-
swer the questions,so you should not need to do any calcuiations by hand.
Use the .01 significancelevel for questions about statistical signihcance.
Again, variable names are in boldface in the computer assignmentsin this
book, and answersto selectedproblems are provided at http://press.prince
ton.edu/titles/8593.html.
Note: The student exerciseson rule 1 (at the end of chapter 1) use the
samedata and web site. The site is user-friendly,so if you did the chapter 1
exercisesyou should have no trouble doing the exerciseshere. To avoid re-
dundancy,the instructions below include only the key commands;I do not
give step-by-stepinstructions as I did in rhe exercisesfor rule 1. Recall that
the figures given below are for the 1972)004 GSS, the most recent GSS
data available when this book was writren. so if you are uorkins with
more recent data, to replicate the figures exactly you will need to ise the
"selection fiber" in the sDA program to remoue years
after 2004. Also re-
call that the format in the SDA web site might have changed somewhat
sincethesedirections were wrirren, so some of the specificdiiections below
might be our of dare, but you still should be able to follow the logic below
to obtain the tablesyou need ro answer the questions.
The GSSasks the following question, denoted happy: "Taken all together,
how you say things are these days-*o.trd-you say th"t y"ou
_would
"r.
very happy, pretty happy, or not too happy?" Note that the question asks
about happinessin general,not about happinessin a specificiomain, such
as happinesswith one's job or marriage or financial situation.
The SecondRule . 59
(b) Give a plausible explanation for why the association differs for men
and women.
Now use the Berkeley SDA web site to determine whether men or women
tend to be happier in the United States.
. Cross-tabulate happy-re (row) with sex (column). If you select
"Oversamp-\Xieight for black oversamples" along with the other
options chosen above, you should get 43,326 cases.Select3 decimal
points for statistics.
associationsare large, which are medium, which are small, and which
are trivial?
Question B. Before doing the analysis, make your best guessabout what
percentageof American adults (1980-2004)think that:
o homosexuality is always wrong
. premarital sex is always wrong
. sex between unmarried teens is always wrong
. extramarital sex is always wrong
(a) \X/ritedown your ranking, from type of sex you think is most frowned
upon to type of sex that is most tolerated.
(b) Then, for each of the above, write down whether you think the per-
centage of Americans saying "always wrong" has increased,declined, or
stayedabout the samesince1980.
The next step is to use the recode command to create a variable called
decadefrom the GSS variable year. You want to enter three values undEr
"Value" for the output variable: 7980, L990, and 2000. For 1980, enter
::- 1:: '.:'J happinessin the United "1980s" under "Label" and "1980-1989" under "Var 1." For 1990, en
i , i . .,.:rrchor.reof the following ter "1990s" under "Label" and "1'990-1'999" under "Var 1." For 2000'
' ',:-,. :::' .rseand happiness(older enter "2000s" under "Label" and "2000-2004" under "Var 1." Click
. - : .: r iiJtive association'To see "start recoding." You should have 14,241 casesfor the 1980s, I3,ZZ3
- , . - '. ,;rl s.60s , 70s ( or old e r)- casesfor the 1990s, 8,394 casesfor 2000-2004, and 10,652 casesof "no
...-- - \ crr l9-code them with data" (theseare the respondentsin the earlier surveys).
: L'a!. re. \\'hat do you find? The final step is to cross-tabulateeach of the sex variablEswith decadc
'Weight
(column variable). Select "'$7t for black oversamples" from the
dropdown menu, and remove the check beside "color coding." Select"Pie
Chart(s)" under "chart options" and then run the table.
.:.:ls to visually represent Question 9. (a) Compare the percentageswith your guessesin question 8.
l.l:t Quart€r century' \tre (b) Then summarize the results in one or two paragraphs.There are two
big issuesto discuss:first, the level of Americans' approval or disapproval
of premarital sex, teenagesex, homosexuality, and extramarital sex; sec-
ond, the trends in theseattitudes. The pie charts should be especiallyhelp-
-.: L.estguessabout what fu l i n seei ngthe trends.
-. i t h; lt: (c) Which resultssurprisedyou the most?
::S
The first step in any analysisof quantitative data is to check means, vari
ances,and correlations for plausibility. Experienced researchersknow to do
this but are often eagerto "run some models" before carefully checkingthe
data. Researchers are especiallylikely to skip this stepwhen using secondary
survey data from large survey firms, where the data presumably are already
clean.It is nonethelessa good idea routinely to look at the means'variAncesl
and rangesfor each of the variablesin our analysis.Are the m€ansand vari-
: ::,':.:l.cof a shiP from the
: : ::!.'-;ri,rr r . T r iangulat i o n i s ancesreasonable?Do all the valuesfall within a plausiblerange?
o f t he bas eof a t ri a n - You should eramine scatterplots for outliers and nonlinearities in the
- -:i :
early stagesof your analysis.For surprising correlations-ones that are
: :::: t ase,then we can cal-
smaller or larger than expected, or not in the predicted direction-
:Js. u'e use elementarY
scatterplotscan revealif there are singlecasesor clustersof casesthat might
inown distance from a
account for the unexpectedresult.
use It is especially important throughout the analysis to keep track of the
:: : - : -: ^-' .o ci al s c ient is t swho,
a studY as a item nonresponses,that is, missingentriesbecausea respondentfailed to
: :..:- :r.' method in
r'\'c more confident answer a question. How are those missing values coded? How do you
:: - ::.:: can be
want to code the responsecategory "ns1 5s19"-is it coded as missing, or
i- : - 1 .l l : 'onc lus ion.
as neutral? Item nonresponsesare a potential pitfall in any analysis, and
. - :: - .:'. -'k ru le. I n t he c haP te rI
inattention to missing values can lead to serious blunders. (It is also im-
tr *r : -:S mor e r out inelyinto s o -
portant to pay arrentlonto missingdata due to nonrespondents-ind:id-
:: ': - : . . : ::l -Ih ods is one aP P r o a c h ,
uals who were chosen for the sample but who fail to participate. This
: : : ::,:.litv checksapply to data
issue is discussedin the next chapter.)
. ,. '.:-j \ome aPPIYto checks on
r.:i-:.--:>tl-rervord "check" would
M ISSIN G D AT A
::-. :'. rieck" deliberately'to em- TRAD IT ION AL M ET H OD S F OR D EALIN G W IT H
j-1. :: ::: (tur analysisoccasionally If you encounter missing entries in your study-as you will sooner or
-.. * -,.::'.. :rslancesthe reality checks later-the first question to ask is how much missing data there is. That
:-,:: : ..:iriencedresearcherssome- question is first because,as Paul Allison (2002) observes,a standard old
s t udies .I nd e e d , method calledlistwise deletion is an attractive option for dealing with in-
- r :, r .> .rro their
- ir-, - : : i ,f res ear c ht hat has g o n e complere data, so long as the amount of missing data is not intolerable.
: : .: in rhe hoPe that as social Listwise deletion (also known as cdsewise deletion or complete cdse
-: ::-tistakes as from our suc- analysis) removesindividuals with incomplete information. Although that
-*
::: .. ::om researchin sociologY, appearsto be very wasteful, listwise deletion has severalimportant advan-
a rre s t-
-r - : : iili \ p ro v ide t he m os t tages.First, it can be used for any kind of statisticalanalysis,from ordinary
: . =- . ; r l i n eso f r es ear c hinv o l v i n g regressionto the most sophisticatedmethods, with no special software re-
. l-. . : stance swher e it was aP Pro - quired. You use the methods that you had planned to use, with off-the-
. . -r: lr.]tea Point , I hoP e au th o rs shelf software, but you include only the caseswith complete data. Second'
:r: ; . . . .:i tvof their wor k t hat c o m- although your standard errors will tend to be larger becauseyou are using
less information, you will obtain appropriate estimatesof those standard
66 . Chapter3
with mean substitution, since that also "makes up data." The difference
is that mean substitution makes up worse data.)
The second point is that there are proper and improper ways to im-
pute missing values from the data. tWealready noted that the old strategy
of mean substitution is a bad idea. It's a bad idea becausewe end up
turning a variableinto a constant for respondentswith missingvalueson
that variable. As a result, we fail to control for the variable over that por-
tion of the sample where the data are missing. Mean imputation pro-
ducesbiasedestimatesof regressioncoefficients,variances,and standard
erfofs.
Newer imputation methods differ from earlier ones by deliberately rn-
troducing random variation as part of the imputation procedure (Rubin
1987; Schafer 1997\. Multiple imputation (MI) uses Monte Carlo tech-
niques to replace the missing values with m> 1 simulated versions.There
are three steps:imputation, analysis,and pooling.
. ImPutation. Random draws from some distribution are used to fill in
the missing entries. (The distribution is selected to inject the right
amount of uncertainty to produce unbiased estimates of variances
and standarderrors: seeAllison 2002, chap.5.) The missing entries
are filled in z times, resulting in m completedata sets,where m typ-
ically is small (say 3-10).
o Analysis. Each of the m data setsis analyzed,resulting in z analyses.
. Pooling.The m analysesare combined into a final result.
:: .-.i;regate analyses are As a general rule there is no special difficulty in interpreting the results
:\- -- .' :: rates, fatios, and So of studies with composite measureswhen the composites are all formed
' ii 1' :- ::l,lt are COmpOSedOf the sameway (Kuh and Meyer 1955; Firebaughand Gibbs 1985). An ex-
: :,.:: j-:'!ire, for example, is ample would be studies where all the variables are expressedas percent-
r:rl: - :-::1erate iS nUmber Of ages, such as percent single-parent households and percent poor in a
::r- - -: .ensus tract iS num- neighborhood, or as per capita measures,such as income per capita and
:--.i ::-,.:. lomPosite measures nurses per capita in a country. The denominator, population size, is the
:" : . , - -:. l. ' S e afCh . samefor all variables.Becausea researcherusesper capita measuresto ad-
just for the fact that the units (censustracts, schools,countries, and so on)
vary in size, there would be no need for such a denominator if all Lrnits
:. 'rom preuious page)
'.,.lh of per capita . 6 > 0. Conclusion: In the short run, foreign investment boosts
: ,:eign investment economi cgrow th i n poor countri es.
. y<0.Conclusion: In the long run, poor countries are better
-:rgn investment at
:; bv other factors off economically without foreign investment.
. Bnu,,",oror)0.Conclusion: In the short run, domestic invest-
were the same size. In effect, by using per capita measures,a researcher
uses a mathematical operation (division) to control for differencesin the
populations of units. Alternatively, a researchercould use raw totals rather
than per capita measures,and control for population by adding popula-
tion size as a control variable in a regressionmodel. Whichever method is
used-controlling for population by division (per capita method) or by
residualization2 (regression method)-there should be no special diffi-
culty in interpreting the results.3
Sometimesresearchersperform analyseswhere the numerator of one
variable is the denominator of another variable. In that case, you are not
using division in a consistent fashion to conrrol for population size, so re-
sults are not always easy to interpret. The general point is that the use of
dissimilar composite variables (where, for example, some variables are
per capita measuresand others are raw counts, or where the numerator of
one variable is the denominator of another) in some instancescan produce
resultsthat are tricky to interpret.
Reality checks are particularly important when you estimatemodels
containing dissimilar compositevariables.In the caseof dissimilar com-
posites, it is often useful to try to express the model in alternarive
forms. The aim is to see if there are different ways to express the same
model that will aid your intuition in interpreting the model's coeffi-
cients.
Consider this example, again from the cross-countryregressionliter-
ature. On the basis of an analysis of income growth for about five dozen
poor non-\Westerncountries from 1960 to 1998, Giovanni Arrighi, Bev-
erly Silver, and Benjamin Brewer (2003, hereafter ASB) claim thar in-
dustrialization does not promote economic advancement in poor
countries. "For most [poor] countries," they write (p. 18), "industrial-
ization [has] turned out to be an ineffectual means of economic ad-
vancement." This claim is questioned by Alice Amsden (2003), who
notes the contradiction between the ASB claim and the strong associa-
tion between industrialization and income growrh in poor countries in
recent decades.
I r',: :: ::leasL1res, a researcher Amsden appears to be correct: Using the same countries as ASB, and
: . :: . ;or differer-rces in the weighting by population size, I found a whopping correlation of r=0.98
:--: , user awt ot als r at h e r between industrial growth and income growth over the period 1960 to
-..:
: : : . : ::1'\nbi' adding popul a- 1998. How do we account, then, for the ASB claim that industrialization
' - : \\-hichevermethod is has not benefitedpoor countries economically? Amsden suggeststhat the
: : - r:: ..rp i ta m et hod) or b Y ASB finding must be due to a "bug in their program" (2003, p. 34). In
:t; : . ...,.it'c no special diffi- their reply, however, ASB state that they double-checkedthEir computer
program and are satisfiedthat their finding is not due to a programming
,:i ., : -: :he numerator of one errof.
: :: : -r :hat case,you are not It turns out that ASB appear to be correct on this point: The problem is
--.- ::,rp ulat ion s iz e.s o re - not their computer program. The problem lies in their model. To under-
: i :r: i- -- :o i n t is t hat t he us e o f stand where the confusion comes from, begin with the essential model
r :, . , , r1 1.c.som e v ar iablesare that ASB estimate:
r- : : i. '',','5eret he num er at oro f (Y/P)'=Br+B, (M/Y),'+
e, (3.2)
: . : -.: lllstancesCanPrOdUCe
where Y denotesincome (total national output), P denotespopulation, M
:: 1 : , .-:1 \'OUes t im at em ode l s denotes manufacturing output, prime denotes growth rate, and the sub-
. - - : -. - ;,rseo f dis s im ilarc om- script I identifiesthe individual case(here,the country). Hence (Y/P),'is
::: - . : -.. nodel in alt er nat i v e rate of growth of per capita income in the i th poor country and (M/Y),' is
:::-: '.-:.'! i fO expressthe Same rate of growth of manufacturing output as a fraction of total output in the
r : :: - : : : . :l g the m odel' s c oef fi - i th poor country. For this model, ASB report a slope of zero for fifty-eight
poor countries, L960-1,980, as well as a slope of zero for fifty-nine poor
- - ' . -'-L rl l n tr vr egr es s ionlit e r- countries, L980-1998.
-: :: '.'.ih for about five dozen Now let's rewrite the ASB model. Becauserate of growth of a ratio
: - - - '. C,iovanniArrighi' Bev- A/B is the rate of growth of A minus the rate of growth of B, we can write
(Y/P I ,' as Y l -P ,' and 1M/Y l ,' as M,' - Y ' . Thus w e can rew rj te equat ion
" - : r: , irr:f -\S B ) c laim t hat i n -
.i Jr.rn c em ent in P o o r 3.2 as;
- - .: . ,.:::. rp. 1B ) , " indus t r ia l - (Yi'-P i')=Fo+F.,(M,'-Y,')+e,. (3.3)
:-: ---:, ::1-.111S Of eCOnomicad- 'lfhen
- , . ,- -\msden ( 2003) , wh o expressedthis way, it is clear that this model does not actually esti-
: - -: :.-.,.-rldthe strong associa- mate the economic effect of industrialization in poor countries.In plain En-
-: r - '.',::ti n P oor c ount r iesi n glish, the ASB model statesthat the ertent to which income growth exceeds
(or lags behind) population growth is a function of the extent to which
manufacturing growth exceeds(or lags behind) income growth. Thus the
coefficient F, reflects the effect of segregated manufacturing growth on
:. of X on Y controlling for economic growth in poor countries. Because Fris zero, we conclude that
.: :- : rrhc res i d u a l ( X - X ) , w h e re
the difference between income growth rate and population growrh rare is
' - - ^ . : . : egre s s i o t o
] f X o n p o p ula -
uncorrelated with the difference between manufacturing growth rate and
I aomposlte measuresper se. income growth rate (growth rate of total output)-precisely as we would
. :. . -: . : c omm i t t i n g t h e " e c o l o g ica l expect if industrialization boosts economic performance.Successfulindus-
r: - . .:: : : om ag g r e €a a t e - l erveel la t i o n - trialization means that manufacturing growth boosts not just manufactur-
I ' :- '- : :: ilent at i c a s s u m p t i o nt h a t r e -
ing but other economic sectorsas \.ell, so that manufacturing growth rate
r - r ' : . r : '. -i: ric lu al le v e l ( s e et h e s e c tio n
does not greatly outrun rate of growth for total output.
76 . Chapter3
In short, Amsden and ASB are talking about different things. For the
countries that ASB analyzed, the correlation between industrial growth
and income growth is close to r= 1.0 for the 1'965-1'998period, so Ams-
den is correct in her claim that industrialization benefits poor countries.
Poor countries that are industrializing clearly tend to grow faster econom-
ically than poor countries that are not. Yet it is also true, as the ASB results
indicate, that successfulindustrialization means manufacturing growth
that links to growth in the rest of the economy.
The bigger lesson here is that reality checks are important for models,
especially in the case of models with dissimilar composite variables. So-
cial science researchersshould try to express models formally, and to
search for alternative expressionsthat facilitate interpretation of empiri-
cal results.
The reality checks describedto this point are internal, in the sensethat
they can be done by researcherswithout reference to other data sets or
results from other studies.Thus we can check our data for dubious val-
ues, we can check our measuresfor consistencywith each other and with
the conceptsthey are intended to capture, and (sometimes)we can ex-
press our models in alternativeforms to check the interpretation of our
results.
In this part of the chapter I turn to methods for evaluatingthe sensibil-
ity of social scienceresults by comparing them with results drawn from
other types of data and methods. I have in mind purposiue checks for the
sensibility of our results. The checks might involve thought experiments,
simulations, or the direct comparison of quantitative and qualitative re-
search. I give severalexamples below from the research of others. These
examplesdo not exhaust the possibilities but are intended instead to srim-
ulate thinking in this area.
In describing these methods I do not mean to suggestthat current re-
search practice ignores external reality checks. The most obvious reality
check is the comparison of one's results with the results of prior research,
and that is done routinely. Journal articles in the social sciencestypically
include a discussion-of-resultssection, for example, and the comparison
of one's results with the results of other studies is standard fare in that
'We
section. But we can do better. need to devote greater attention to the
development of cleverer and more direct reality checks for our conclu-
slons.
Th e T h i r d R t r l e ,
1.,:,,drua1StudY of Adolescent
the Research
Is ,lt.tltiracial? Assessing Other Examples of Multiple-Method
--i set is ideal for such a
, J.,,
Multiple-method studies employ more than one type of analysis.often
... i;;;.; parents'for adoles-
these studies draw on data from more than one source as well. The kev is
.l-. st comParingParent-based that the multiple data setsand methods are used in a single s/zdy. Multiple-
fini considerablefluid-
,,'.i si,-,-, merhod studiesdiffer from what I call replication studiesin rule 4. Replica-
: r,.:.sisof resPonsesadolescents
tion studies apply the same metbods to different samples. In replicarion
= On the basis of re-
',.',,-rito.iul. studies the aim is to see if identical analysesyield similar resuits for
..,...t of vouth are multiracial'
82 . Chapter3
CoNcI-uorNGREMARK
. : lustratedby research
qii,tl Reuiew (Jacobs
.i fPre s ident A br aha m
::, 'nr diversesources-
' : - : : : . . :.-.:n tso f Linc oln in hi s -
- : : : :;hs-artz and Schuman
as a
' - : - :-,,l n's r eput at ion
r :- ',.:. for t hat r eP ut at i o n
. -ol n as s av ior of th e
i:a
':r: , : .'..:hodsare emPloYedin
- : i : : -- :. ,ri the ot her .T hus , fo r
: .: -' i n ter v ieweda s m a l l
: : - : :...1r studYof s oc ial t i e s
-
:r. l: i.>Lrl t and A bigail S ag u Y
-: :,:: :: .!jnducting a statistical
r :.-.- - rhe United Statesand
r : - :-.:! irst. In a recent studY
:'. :. l .j+). for example, re-
s u'ith a seriesof re-
:..::.r.ie
: : : :-t:rilies over a Period of
: ,. . , . .::-.i ;cdinc r eas inglyin t h e
' . - -: - .. \\-up qualit at iv ec o m-
: i -',:: :. (ln the Phenomenonof
on Rule 3
StudentExercises
1 (ron rvEnvoNl)
AssrcNrvrE'Nt
Question 1. Vhat is the correlation between DGI and Gini for these
twenty-four market economies?[ANswrn: r=-.16.] Doesthat correlation
provide strong support for the argument that government intervention in
the economy is associatedwith lower levels of income inequality in capi-
talist countries? Defend your answer.
'$7hile
it is ciear that income inequality was (and is) higher in Latin Amer-
ican countries such as Bolivia, BrazII, and Peru than in Scandinaviancoun-
tries such as Norway and Sweden, estimates of incorne inequality for
1,960arc far from perfect.To gaugethe consequences of this unreliability
for correlations involving Gini, usethe random number generatorin your
computer program to create a standard normal deviate R (that is, a nor-
mally distributed random variable that has a mean of zero and a standard
deviation of 1.0). For tbe tu,enty-four market econornies,urseR to simu-
late the effect of measurement error on the Gini by creating three new
variables,as follows:
c Newginijl = Gini(l + .01R)
. Neugini2S = Gini(l+.25R)
o NeuginiS0 = Gini(l+.50R)
Note that each of the Netugini variables adds random error to Gini by
multiplying Giniby 1+ftR, where A is a constant and R is a random vari-
able with a mean o{ zero and a standald deviation of 1.0. You are adding
error randomly to each of the twenty-four countries individually. Con-
sider Argentina, for example. You are multiplying the stated Gini value
for Argentina (0.42) by (1+AR), where R is a normally distrihuted ran-
donr variable rvhose averagevalue is zero. Hence you expect the Netugini
value for Argentina to be larger than Gini half the time and smalier than
Gini half the time. The same logic holds for the values of Newgini versus
Gini for the remaining tu'enty-three countries.
TheThi rd R u lv . ):
Question 4. (a) Correlate Netugini0l with DGI for the rwenty-four mar-
:; '. : . : 1f K G RO U N I ) ket econornies.Is the correlation larger or smaller than it was using Ginl?
(b) Repeatnineteenmore times, using a new R for each trial. Calculate
the mean of the twenty correlations o{ Newgini}T with DGL Is the mean
:. : - . :isher in Latin Amer- of the Nerugini}l-DGl correlations similar to the original correlation of
'' ::' re. t ndinav ianc ou n - GiniwithDGI?
- -r: - : :lJolTletnequalitY for
- -: - .::-..:i of this unreliabilitY Question 5. Norv introduce greater measurementerror on the income rn-
- - - - " .':l 'crq ener at or in 1' o u r equaiity measure by using Netugini2S. Repeat the procedures you used
for Neugini0l to generaretwenty correlations of Newgini2S with DGI.
-....ri c R r t hat is , a no r-
:- : '-: , .-. ,i ze r o and a s t and a rd (a) Is the mean of the Newgini2S-DGI correlations similar to the original
, , ':. - ' : - ' ': , rit ie s , U S e R t O S i m U- correlation of Gini with DGl?
' , : | ;ttii i. ' c r e a t i n g t h r e e n e lv (b) What about the uariance of the correlations: Do the correlations with
DGI vary more with greater measurementerror (Netugini2S) or lessmea-
surement error (Neargini} 7) ?
This chapter also servesto reinforce one of the major themes of the
book-that progress in social researchwill come mainly from better rc-
searchdesign,not from statisticalwizardry (seerule 7),THe parallel analy-
sis of multiple data sets opens new possibilities, and we should be
thinking now about how best to exploit those possibilities in social re-
search.
rN SocIer RBsnaRcn
Souncns oF UNCERTATNTy
- -. : ' -. :lstural follow-uP to It is easy to be lulled into a false senseof security about uncertainty rn
'We
.. : : - , - R,rl e 3 adv is esY ou t o social research. are very consciousof the possibility of sampling error
' . i - . . :: --11l r' - us ingot her in - and generally devote a great deal of attention to it in our research.Stan-
dard statistical packages all report standard errors, and we dutifully
- , - : -.:i crent m et hodsand
::*- - -.: :r th e pr ev iousc haP - gather that information and report it in tables in our researchreports.
i - * rr:r::lt Sortsof data lead The false sense of security arises when we forget about the other
sourcesof uncertainty and assumethat by ruling out sampling error we
- . (Same measUrest have ruled out all uncertainty. In many instances sampling error is not
-: .-, ., )iS
, : - : . '.:..:-l srd if f er ents am pl e s even the most important source of uncertainty in social research.Indeed,
: j - - -.:-'-rre s ult s .I n r eP lic a - for very large data sets-which are becoming more common in the social
:..t -:- ::iJ sin-rilarresultsfor sciences-sampling error becomesrelatively less in'rportant as a source of
- : :..,uql- r tof as a s pec i a l uncertainty. Nonsampling error, by contrast, generally is not relieved by
-: -
-: - j:.:. . ,lnalvsisof multiPle larger samples.
I want to make two main points in this chapter. The first is that we
' : - . - ,r:ten it is difficult to should take a holistic approach to uncertainty in social research.Robert
Groves et al. (2004) describe the move toward a total suruey error para-
- * - : -- -:' rr.1 l.F or t hat r eas o n
:\ : - : .r,rqlestudiesthat re- digm in survey methodology. The virtues of such a paradigm are apparent
, -: -: ' :l l be m or e ex am P l e s for all types of observational studies in the social sciences,not just survey
analyses.We should aim to developand refine atotal uncertainty perspec-
' . , -, - c an b e tive that is based on all sources of error, not just sampling error. To be
- :]l (rr eon what
- i - . .. - .,..ri r'd s hould bec ome sure, sampling error receivesa lot of attention in social research.\(e give
'We
it a lot of attention in part becausewe can: have methods for cstimat-
' : - -....r -*ersincreasesin the
- -- :.: r ferv decadesago ing the uncertainty arising from the use of a probability sample. Other
- - r - - :l -.i a nd s oc iologYan , d sourcesof uncertainty are harder to quantify, but they are often no less
r: -l - *-,:': fare in economtcs, important than sampling error. We need to find better ways to gauge the
:: - : , j .'.tas et sgener allya re effects of these other sourcesof uncertainty as well.
:.:: - . ,. -:.:rr'eask as social sci- The second big point in this chapter is that replication prouides a
'i I r--: .:':i l S St udieshav e be e n meansfor gauging toldl uncertainty.This is the triangularionpoint again.
- - - :: - ...::;het t er longit udin a l except that here the triangulation involves the same methods applied to
- - - --: , :: ,r Je s ignsdes c r ibedi n different data sets.The juxtaposition of results from identical analysesof
:- : , , . . ..rr o f P ar alleldat a s e ts parallel data setsin a single study should provide more realistic appraisals
::: : -. 1-. i >et S C U m U l a t e . of the uncertainty in our conclusions than do significancetests alone. If
92 . Chapter4
so, then comparison of results across data sets might become a star-rdard
feature of articlesin social sciencejournals in the future, iust as significance
tests are today,
It is useful to begin with an inventory of the sourcesof uncertainty in
observationalstudiesin the social sciences.Clifford Clogg and Aref Daiani
(7997) identify six separatesourcesof uncertainty in the modeling of social
statistics."Our main point," they write (pp. 12-13' italicsadded),"is that
we must seriouslyconsider the uncertainty in inferencescreated along the
way by the data collection processesso common in social research." Their
thesis,and mine, is that uncertainty "crops up all along the way" (p' 15) in
data collection due to numefous problems that are encountered,and deci-
sions that must be made,in the processof collecting data. Thus evenif there
were no samplingerror, the use of the samemodels and estimationmethods
acrossdifferent data setsmost likely would deliver somewhat different re-
sults. Any given data set, then, is subjectto uncertainty about how much (if
at all) the conclusionswere affected by the data collection methods'
'$(here possible,then, it is imporrrnt to compare results across data
sets.Of coufse,we hope to find consistencyin our results,since similar
rWhererve
results across data setsincreasesconfidencein our conclusions'
find inconsistent results, we must temper our conclusions accordingly, or
find reasonsfor the divergence.In some instanceswe might be successful
in pinpointing the major sources,and in other instancesw-emight not be
able to do so.
\(hat are the sourcesof uncertainty associatedwrth data coiiection?1
For convenienceI focus on survey data, but the generalprinciples apply to
other types of social sciencedata as well. My accounting schemerelies
heavily on discussionsin Kish (1987), Clogg and Dajani (1991), and
.We want to focus on sources of uncertainty other
Groves et al. (2004).
than sampling error since methods for estimating and evaluating sampling
error are alreadywell known. In typical studieswe know much lessabout
the narure of the other sourcesof uncertainty arising from the data col-
lection process.
Let me make the point as cleariy as I can. Imagine we have two iarge
probability samples(say of 100,000 caseseach) of the same population.
The samplesare collected by two first-rate survey firms. l)ue to the size of
each sample,samplingerror should be minimal in either case.Yer rhe tyr,o
data sets could yield different results even if they used exactly the same
questionnaire.\ilihat might account for the differences?
I The p;oblen of uncertaintv due to data collectior.ris inclependentof other major prob-
lens q.e face in social research,such as the problem of omittecl variables bias {discussedin
chapter 5). So erlrr-free data n'ould not solve all the problems n'e face in social research.
Even n,ith error-free data, for example, lve rvould still need to find rvays to compare Jike
u ' i t h l i k e n ' h e n m a kin g ca u sa lin fe r e n ce s( r u le .5 ) .
The FourthRulc . 93
Target Population
[coverageerror]
i
Frame Population
Isamplingerror]
+
SelectedSample
I
[nonresponseerror]
V
CollectedSampie
Figure4.1. Sourcesof
ExclusionError in Data
Collection
There is the possibility of error at each step, so each step adds uncer-
tainty about your results. 'We examine this uncertainty for each of the
stepsin turn. It is convenient to consider the steps in reverseorder, begin-
ning with selectedsample -->collected sample, since as analysts our infer-
encesmove in opposition to the direction of the arrows. In other words,
as analysts,we move in reverseorder to the winnowing process,using ob-
servations based on the collected sample to draw conclusions about the
target population.
Typically the sample you obtain differs from the sample you selectedbe-
cause some individuals who are selectedare never contacted (they might
be difficult to locate, for example), and some of those contacted decline to
participate. The failure of some selectedunits to respond (for whatever
reason)is called unit nonresponse(Groveset aL.2004, p. 45) or total non-
response(Kish 1987, fig. 2.1,.1.).Itis important to distinguishunit nonre-
sponse from item nonresponse-the failure of participants to answer
particular questions-a subject addressedin the discussionof missing val-
ues in chapter 3. Although both involve missing data, item nonresponseis
often addressedusing data imputation methods, as describedin chapter 3.
In the caseof unit nonresponse,however,there is no information about the
The FourthRule . ej
error). As a practical matter, sampling error becomes less worrisome in resp0nserateSCOritr:: --:
the social sciencesas we accumulatelarger data sets.Nonresponse error is ticipate that s;rrnpi;:^:,
a different matter. Responseratesappeafto be decliningin the United States data ser s,hr r r , r li, ', . -
and elsewhere(Groveset aL.2004, chap. 6). Large samplesor not, nonre- mofe s ev erer c l u€ tt, ,-.
(continuedfrom preuiouspage)
on next Pdge)
cr,tntinued. (continued an next page)
98 . Chapter 4
(continuedfrom PreuiuusPdg()
The frame population is only as good as the list from which it is derived.If
we used u pno". book to selectour sample, for example, we would miss
those who do not own phones as well as owners whose numbers afe not
listecl.This would not pose a problem if our researchwas on behalf of the
phone company, and they were interested only in the characteristicsof
iustomers with-listed numbers. As social researchers,though, we typically
are interestedin a more generalpopulation, such as all adults in some re-
gion, not just peopie with listed phone numbers.
Exclusion error due to incomplete sampling frames is called couerage
errol (Gtoves er al. 2004). Lists are never complete, and even fairly com-
plete lists are soon out of date, particularly in geographicallymobile so-
cieties. Sometimes exclusions are dictated by the nature of the
population. Consider,for example,the U.S. adult population (age eight-
e..r older). At first blush U.S. adults might appearto be an easypop-
"r-rd
ulation to demarcateand sample. But some adult U.S. citizens afe ln
relativelyinaccessibleplaces,such as in prison, or serving in the military
The FourthRule . 99
:.1 iront PreuiousPdge) overseas.Some do not speak English. Some are homelessand have no
postal address.Hence obtaining a perfectly representativesample of the
:ri rflrl at each income U.S. adult population is more complicatedthan it might at first appcar.
of the scatter- Survey firms have limited budgets. To get the rnost for the dollar, com-
- .: srde
.- :r.jcnts (the missing promises must be made. The frame population for the U.S. General
:.. :rr l-iale come from
Social Survey, for example, consists of the noninstitutionalized L,r-rglish-
of speakingadult population. (Spanishinterviews were added in 2006.)
. -r orlr e\tilni.lte
: -' . ' iI('/t'r'g / o ur esti- In some instances,then, the exclusion is purposive, arising from bud-
getary limits, as in the caseof the absenceof institutionalized adults in the
)- - ;.:. :.; is c;utsallYrelated GSSframe population. In other instancesexclusions occur becauscof dif-
ficulty in identifying the target population very precisely. Consider polls
. ,:: .,: '- -::l : r' trffectsParticiPa-
::ritive regardlessof before elections,for erample. The target population here is voters, since it
:"J IO O\'€testimate the is voters who will determine the outcome of the election. But voting has
. : .'.. -\loreover'unless not yet taken place, so it is impossible to identify the members of the tar-
:'r relatedto X; that get population with complete precision. For our frame population, then,
: -t,\\er)incomelevels. we want to use those who are most likely to vote. How do we determine
::::ted. that? One possibility is to use voter registration lists. But even if we had
accessto up-to-date lists, the lists are rarely complete, and some registered
. ::-ige error. coverage voters don't vote while others who aren't registereddo.
'..r:i.rbleX when non-
In short, each of the steps from target population to frame population
. : , niasedestimatesof to selectedsampleto collected sampleexcludespeople,and erclusion adds
:,.:r is causallYrelated uncertainty. Some individuals are excluded becauseof incomplete lists for
the frame population (coverageerror), some are excluded because they
weren't selectedfor the sample,and some exclude themselves(unit nonre-
sponse).
\fhile all exclusions add uncertainty, some exclusions are more prob-
: ER R O R lematic than others. The key is whether the consequencesof the exclusron
:: :l n"hich it is derived. If can be modeled, as in the caseof random exclusion or some other kind of
:\.1mple' we would mrss exclusion that relies on the laws of probability. Probabilistic exclusion
. s hose numbers are not results in uncertainty, but a type of uncertainty that can be estimated-
-..;h u-ason behalf of the that's what significancetests are all about. We could say, then, that prob-
:ir the characteristicsof abilistic exclusion results in "manageableuncertainty" in the sensethat
-:rs. though' we tYPicallY we can at least quantify our level of uncertainty. Coverageerrors and unit
-. ;. all adults in some re- nonresponse errors, by contrast, are likely to be nonprobabilistic and
thus more elusive.With nonprobabilistic exclusion it is more difficult to
:rrnes is called couerage bound our uncertainty: Sfe are lesscertain about the level of uncertainty.
:-c-te.and evenfairlY com- Becauseconventional standard errors do not reflect nonprobabilistic
:.'osraPhicallY mobile so- exclusion error, conventional confidence intervals understate the uncer-
-
hrl the nature of the tainty in our inferences(Clogg and Dajani 1,991),sometimesdramatically
. - : ,:;.ilt PoPulation(age eight- so. From a total error perspecrive,it is important to go beyond sampling
: : - : - . l PPe art o be an eas yP o P- error to try to gauge the level of uncertainty. That is why replication is
.: , - . .rJul t U . S . c it iz ensare i n valuable. It is perhaps the best tool we have for providing a sober assess-
: :: . i .-. o r ser v ingin t he m il i ta rY ment of the total level of uncertainty in our inferences.
100 . Chapter4
short-sightedio {ri.:.f::
Measwrement Ertor as ct Source of Uncertainty tempt s t O Calt ur r l
In addition to exclusion efror, there are other sourcesof uncertainty that errof.
'Within
can arise in the processof data collection. surveysthemselves,the I t is t inr c nu\ \ lL, : . . .
greatest source tf uncertainty involves question wording' An apparently
Lrno..n, change in wording sometimes prompts a notable change tn re-
sponses,so survey ,.r."r.h.rs are loathe to modernize the wording of a Ilr,uslRartc-lN:Tu o \.l"
'lrhen wording is
question that has a long tradition in prior surveys. new
To ilir r st r at et hc sr r r . -
introduced (for erample, changing the word "blacks" to "African Amer-
cussecl.I dcseril'c .; . -
icans',) the time-seriesis usually spliced by using two forms of the survey
volves the pressrnL.
in the transition survey,with respondentsrandomly assignedto one of the declinrng.[r rurns , .:: :
rwo forms. In that way it is possible to determine if the wording change sure d.The decline . -
itself has affectedresponses. ci ti z enswho ar e p, , , :
The concern about precisewording is fueled by a number of famous ex- ci on dar a r han ir i. . . ,-
amples in the history of tutu.yt. ln 1'992,for example, a stir was created from householdsur , - . .
*hen a Roper poll commissionedby the American Jewish committee was In t his case,t h. 'n. . : - .
said to haue fouttd that22 percent of Americans thought the 1939-1945 i cs seekst o under r l. , : . _
Holocaust neverhappened.Here is the question: "Does it seempossibleor 2003; Tr iplet t lqq- . :
does it ,eem imposslble to you that the Nazi extermination of the Jews changein world po\ .---,
never happenedi" A subsequentpoll, using clearer language, found that sual obser vat ionr crr . , ,
not
fewer than one American in 200 said the Holocaust "definitely" did abjectconditions.Tl:; :
happen, and about one in 50 said it "probably" did not happen-
off mar er iallyt h. r nr - - -
pr.r..rring a far different picture from the earlier 22 percent figure' coun t r iesis a ver eJ . . .
'
Anothir potential source of uncertainty involves the general content are those r hat havc . : -
and ordering of questions in the survey. For example, a question about cons um e.Poor eor r : : : .
tolerance to*"rd atheistsmight elicit somewhat different responsesif em- val u ablecom m odir ir . .
bedded in a general survey on tolerance than in a survey on religious be- How do we lr ( . r : . : -
liefs. Moreover, respondents' answers to a question might be influenced pl i fy. ler 'ssuppo\ c \ , , : -
by the immediately preceding quesrions. In the 1972 and 1985 General goods and sen. ices. l: : -.
Social surveys, for example, a question about general happinesswas pre- w ei ghr dif f er enrgo, , - . .
ceded by a similar question about the respondent'smarital happiness,and estimatethe total tj/t.;1:
Smith (1990) found that this ordering may have influenced responsesto ti on in a socier r '.O : , '
the general happinessquesrion-a survey-context effect. In some in- retail side, eir her I . . . -
stanJes,then, two ,,1ru.yr with the same question may eli"it somewhat have pr oduced. or h. . .
different results becauseotber questionsin the survey are different. have sold. Alt er nau. . . .
Responsesfor some questions might be particularly sensitiveto world
ev enr sat t h e ti m e o f th e s u rv e y ,re s u l ti n gi n anothersourceof uncertai nty' rOf c ours e.s etri rre.r.-..-
Responsesto questions about suicide, for example, might be sensitivein reki ng . Fi rs r.qual rrr r.r.--- ..
\ame qual i rv .Ir i ' harJ - .
the ihort term to highly publicized suicides, such as Marilyn Monroe's
l i rrn;] O.l aafor s ome l , . .
suicide in August t962.This is also a type of context effect, but the con- i ererrc esi n rel ari r.ep-r-,.
text here is time, not the other items in the questionnaire' di ffere nc esi n qual i rr. \l -.
In short, there are many sourcesof uncertainty in social researchother pri cer. W e ex pec t hl rr.rr:.,.
than sampling error. Thus a preoccupation with sampling error alone is checp in c ountri esrr hc rc -
The FourthRule . 1 I
this caseyou would survey consumefs to determine how much they have mofe recent purcir,r:::
c ons um ed. foun d t hat a r eeai_: - .
using eirher approach, we find that living standards are improving for crea sedr epor r eJc\ ; : . . -
the worid as a whole, as well as in most regions of the world (sub-Saharan perc entf or all pr r ". - : .
Africa is the notable exception in recent decades).Typically, however, the sum pt ion som ef in- - . '
production-basedestimatesare higher than the consumption-basedesti- tinguished.In orhe: ,,, '
mates: Production data generally suggestsomewhat higher levels of wel- surv ey,Sur ver s, r l -
fare and somewhatmore rapid growth in welfare than do the consumption householdm cr r r l'<r ,. : .
data. To shed light on the possible sources of these discrepancies,Iet us permit t edr o spc. r k -
consider the consumption and production data in light of the exclusion affecf resulrs.r\ft'!
'We
errors and measurementerrors described above. begin with the con- Even wit h p€r f uJr i. :
sumption data. exclusioner r or . \ \ : - , .
national exper r Jr rr r - . . .
N A T I o N A L E X IENDIT URE ( co Nsu u lr lo N) su R vevs
country, dfld somr . . -
One way to estimate consumption is to ask people about their expendi- Hence there is rhe : ..
tures. The number of such surveys available to the research community hous eholdlisr . . r . r , - - - - -.
has mushroomed in recent years. For example' Angus Deaton (2005' ous e xclusionpr ul. . : : - - .
table 1) locates 557 surveys fuom I27 different countries that can be used is especiallyproblei.r..:: -
t o es t im at ea v e ra g ec o n s u mp ti o n o r a v eragei ncome per person.There the leasr iikelr r o : , - .
were only three such surveyscollected in 1979, representing9 percent of R ava llion 2001, r . i"- .
the world's population;in 1,998there were 57 surveys,covering53 coun- great erit em nor r r c. : . .
tries that are home to 70 percent of the world's population. that m ost househ, , l,: , . .
One source of uncertainty is measurementerfor in the household sur- In shor t .m osr ( \ : : - : .
veys. In contrast to the highly scripted sufveys of national production, ti mat esbasedon h, , : . - '.
there is no strict protocol for data collection for national expenditure sur- l i kel y t o f or ger ir cr : - . : -
veys. Hence discrepanciescan arise due to measufement differences. In cl ai m it em s r har r hr . . '
some surveys respondents keep a diary of their expenditures. In other stateincom e and r h. . :
surveysrespondentsare asked to recall expenditures.The length of the re- ternat iveest im at crr '- ' ,
call period might matter since respondents are more likely to remember N at ional pr odu, l -
tw o d at a set spr ot iJ- . : . :
In addition-and most important-a country's price strl-lctufefor goods and servicests re-
isolatethe effect of i-..::
lated to the country's level of economic developmentitself. Labor is abundant and cheap in samein each c, r se . r , .
poor countries, so labor-intensive goods and services (e.g., haircuts) are cheaper in poor measur et he r r r ur cr i, r,:, -.
countries than in rich countries. Capital, on the other hand, is scarce in poor countrie\, so coincide.Yer rhe tl.u; .
goods that require more capital than labor tend to be relatilely expensivein poor countrles.
R ava llion ( 2003. p. br -
Becauseforeign exchangerates are basedlargely on the trade of capital-intensivegoods that
disfavor poorer countries, exchange rates cannot be relied upon for setting international
on ho useholdexpen; : . -
prices to compare living standardsacrosscountries. As a result, scholarshave developedspe- they a r e obr ained. "T- . -
cial currency indexes, called purchasing power parity (PPP)measufes,based on prices from lustrate in concrctc r,,.,
an extensivemarket basket of goods and services,not just on prices of goods that nations can re su lt in d iscr cp. i.:. --
trade (Nuxoll 1994; Summers and Heston 1991). Almost all studies now use PPP measures
for comparing material well-being across countries, in line with instructions in the United N AT IONA L P R O D U C TI O \ .:
Nations Systemof National Accounts (7993,para.1.38): "Vhen the objective is to compare
the volumes of goods and servicesproduced or consumed per head, data in national curren- In the t it le of an ar r : ; . ,
cies must be converted into a common currency by means of purchasing power parities." R avallion ( 2003r r . x. '
T h e F 6 r :rthR ul c o l rti
N AT I O N AL PR O D U C T IO N DATA
ap5ree?" The term "national accounts" here refers to production data and
the term "surveys" refers to consumption data from household surveys.
Consumption data are collected from consumers. Production data are
collected from producers, or from retailers. There is no standard con-
sumption survey used by all countries, nor do all countries field con-
sumpuon surveys on a regular basis. By contrast, virtually all countries
attempt to estimate their total annual economic output using highly
scripted methods developedby the United Nations. National accounts are
compiled following the protocols spelled out in the 1993 version of the
Systems of National Accounts or SNA93 (United Nations 1993). To
the extent that national accounting practices differ acrosscountries' then'
the fault lies with uneven implementation, not with uneven standards,
since the standardsare set internationally.
Household or private consumption is estimated from national account
data as a residual (Deaton 2005; Triplett 1'997). Total domestic produc-
r ion is es t ima te dfo r e a c hc o m m o d i ty ,th e n governmentconsumpti on.i n-
vestment, intermediate consumption, and net exports are subtracted out
to estimate private consumption. Domestic food production is estimated
by weighting the acreageof land under cultivation (from agricultural cen-
suses)by an estimate of average yield per acre (from crop-cutting sur-
veys).This figure is important becausethe food that is produced for home
us ec ons t it u te sa l a rg ep o rri o n o f th e to ta l consumpti oni n poor countri es.
The key point to note here is that there are ample opportunities for er-
ror in each step of the estimation process(Deaton 2005). Let's begin with
the estimation of total production. As defined by SNA93, production in-
cludes all goods whether they are exchangedor not' so food produced for
your own consumption counts. In the caseof services,however, exchange
or work within the household, such as care of children, typically does not
count. As societiesbecomericher, within-household exchangesthat didn't
count before might be replaced by market exchangesthat now count (for
example, someone is paid to care for children), so the rate of welfare
growth is likely to be exaggeratedby national account estimates.
There are other issuesthat make it difficult to estimate national pro-
duction, even for statisticians who are highly trained in following the
U.N. protocols. One is the problem of economic activity that is concealed
from authorities to avoid taxation or regulation (or prosecution, if the ac-
tivity is illegal). Becauseproducers typically have greater incentive than
consumersto concealsuch activity, concealedeconomic activity is expected
to bias production estimatesof income downwardly more than it does ex-
penditure survey estimatesof income. This differencebetr,veennational ac-
count (production) and household survey (consumption) estimates of
income is likely to be more than offset, though, by the failure of household
surveysto include the imputed rent of homeowners'Homeowners of course
The FourthRule . 1i,{
that theseexclusion errors and measurementerrors are random, samplees- procedures,so thar -rr', - i-:
rimates will provide unbiased estimatesof population parametersbut the errce sur t hc dat u. r r ul , - . ; :
variance of the sample estimateswill increase.Even in the best of circum- coursc.exact ly id. r r r : -,
stances,then, r.l'eexpect exclusion errors and measurementerrors across Hence compromises :---.. .
data sets to add uncertainty to our results-uncertainty that is not cap- searcherhas control rr',:r r-
tured by the usual estimatesof standard errors (Clogg and Dajani 1991.). As noted above. hr,',', :. : .
This is the sort of uncertainty that we want to gauge by replication studiestypicalll' anal. zr
across data sets within a given study. The explicit purpose of such inter- rcsult sucr osst laI u : r r r , '.=
nal replication is to provide a method for gauging total uncertainty aris- nal rcplicalioz \ l r , r r r J\ .
ing from the data collection process itself. \7e want to compare results l n t hc caseol elr r r
from the identical analysisof different data sets.'Wewant the replications data point and useCoD',:11
to be done in parallel, by the same researchers,to make the analysesas dency and disper sior r
similar as possible. beenformalized Lrsins-r ii-:r-
Internal replication is especiallyuseful in the caseof large data setswhere anal ysis.M oder n m cr . ; - . :
sampling error is not a major problem but other types of exclusion error sucha s G eneG lass 1*- - . - -
could be. Examples of internal replication are nonethelesshard to find. mul tiple st udiesr o r r r .-
Replication acrossdata setsrequiresdata setswith overlappingcontent. Un- mi gh t want t o kr r , . . . . - -
derstandably, funding agencieswant to support new and unique data, not achievement,and u'h.::-.: - ,
repetitivedata sets.As a result the overlapof content acrossmaior social sci- The lit er at ur er c\ : ; . - -
ence data sets is often lessthan ideal for extensivereplication work. can seenas a kind oi : : . , - :
One solution might be to piggyback on surveys done by independent much more standirrdrz.* .- -
polling organizationssuch as Gallup and Harris. Ideally this would be done l i terat ur er evier r s.h. r '. : : - -
when different polling agenciesare in head-to-headcompetition to make on met a- analyt icnr . r h - . .
accuratepredictions about a common target population, as occurs, for ex- narrat ivelook at ) on] . - . - - - '
ample, in the polls leading up to an election. By enabling identical analyses sul tsfr om a neu' r t r r i.
of parallel data sets,the addition of economic and social indicatorsto these sti tut et he dat a.Thu. : . - . . : - ,
simultaneous polls should provide leverage for investigating the effects of as si z eof t he sam plq.: - .- - . :
the nonprobabilistic exclusion errors discussedin this chapter. are used. qualit r or : - .- . . -
As it stands now, however, it is hard to find studies that perform paral- reviewed journal, u'h.:.-.: :'
lel analysesof different data sets as described here. Hopefully this mode sourceas the independ.':.: ,:
of researchwill become more common in the future. In the meantime, it studiesthemselves ore r:; ,: -
is instructive to consider a more common mode of research, called meta- pri or st udies.or ar lc. r . : - ,
analysis, which also uses multiple data sets in a single study. The aim in ro rest on much lnorr - "
meta-analysis,however,is not the parallel analysesof a limited number of fi nds in m osr ot her r r : . . ,- .
data setsbut the averaging of effects acrossmany data setsto obtain more Met a- analysisis h. . : , - :
dependableestimatesof effect sizes. the i s sueof nonr esiJr ' .
evi dencer hat a f ar hcr . : . -
hi s ch ildr en'sacaden: : . , - -
Mr.re-ANervs r s: SvNlrrnsrzruc Rnsurrs FoRMALLyACRoss Srurrss hi s vi sit swit h his chr i- : , - - -
l ated issues,Paul \ r : . : : : - .
To gauge uncertainty, ideally we want to analyzedifferent data sets rn a anal y sisof 63 st uJi. . -.
single study becausein a single study we can make every effort to run qual if y: "To be ir r . lu- =, . - : -
identical analysesacross the data sets.Our goal then is identical analytic i n fat her - absent
hou. . :
The FourthRule . 1'-,-
' i , ' : -rri n g! and ( c ) P r es e n t issue,Amato and Gilbreth report "fail safe N values" using procedurcs
ijr suggestedby Rosenthal (1,979).This statistic refers ro the number of nex'
: . '. . . . : ct q lT eS um m ar iZ i n g
': :, ..::ta bles "( P . 561) . T h e studies that would be required to reduce a significant mean effect size to
nonsignificanceif all the new studies had null results. Amato and Gilbreth
r-r: i . ,;lte the studies that
,- :. : : :t?3d dat a bas es( S oc i o - also compare the effect sizesfor large and small samples.This servesas
another check for publication bias, as follows: Supposeauthors are reluc-
r: .-- .;-.c.supplementedthe
tant to submit (and editors are reluctant to accept)articles in which results
" - .: . ; . :'.:,,t th e ar t ic lesunc o v -
j:- , -rlic'Il.e PaPerSand dis- are not statistically significant. If so, it follows that published correlations
- - - : : : :.r Public at ions . should tend to be larger in small samplesthan in larger samples,srncea
:,:-.',: :,,r bo)'s and girls, the larger correlation is required to attain statistical significance in a small
': :- : - '-::l l l es. M eas ur esof t h e sample. Thus larger effect sizesin published studies with smaller samples
:. .' ::: -.lregorized as school may indicate the existenceof publication bias.
-: .--:-- ::rrblems (for examPle, Quality of the study might matter, of course, and we would want to
give more weight to the findings of the better studies.Amato and Gilbreth
: : - .: :t : :or erample, low self-
test for the effect of study quality by coding each study as 0 (no) or 1
(yes) on the following characteristics:whether families were selectedran
.. : -, -: .:::i rn gs of c los enes san
, d
,i :r .r - :Lr chi i dr en' sP r oblems , domly, whether the sample size is at least 100, whether the dependent
,;"-- :,rne\\'ork, providing ex- variable had a reliability coefficient of 0.8 or grearer, whether the study
employed control variables, and whether the study used different infor-
mants for the independentand dependentvariables.By codifying the stud-
:r : -.: ;orrelation coefficient r
-. ies in this way they were able to assesswhether the quality of the study
: : .... .:'.rdiesthat used control
moderated the strength of the association between paternal involvement
::- . :irns of other statistics,
.: :: .;- instancesAmato and and offspring outcome. Our confidence that the effects are real is en-
hanced if we find that the better studies find the srronger results.
.r , ' r,r-. 1985) and Rosenthal
Finally, it should be noted that meta-analysisis not useful where a lit-
: ..:.:-:..-..e of the associationsof erature has gone off the rails. Recall, for example, the fatally flawed
i: :-,--:' of the four indePendent cross-country studies of foreign investment effects described in chapter 3
:r: : :he dimensions of father (where foreign investment and domestic investment were measured very
::: :-.:>of closeness, and author- differently). Meta-analysis would not have helped in that case,since con-
':.: :; ,l.rild outcomes,with au- clusions were basedon a fundamental misinterpreration of the model be-
i i:::': :r)siti\-eeffect.FrequencY ing estimated.A meta-analysiswould simply have averagedthe erroneous
::: ,,.r:r,.-)r1 rvith child outcomes' interpretations.
::::: :a) Saugethe rObustnesS of
:-.:>. or the "file drawer Prob-
:.j: ,-;l,rs .r'hen researchersPut SuuueRy: Youn CoNrTDENCEINrr,Rvars Ans Too Nennow
=: :::,..:tsubmittingthem for Pub-
.: ::-.i ;hild outcomes'one might One key to the scientific method is the validation of one's results with re-
;-r::rii their findingswhen there peated trials. In the case of nonexperimental social research, "repeated
.:.: :. ...sslikell'to accePta PaPer trials" often means different samples.Yet it is rare to seethat type of val-
..:-:,':t b,iasfavoring studiesthat idation in sociological research articles (though the findings might be
:: , - .. . j u 'ei l -b eing' replicated in subsequentresearch),and in my reading in the other social
meta- sciencesit appearsto be relatively uncommon in those other disciplinesas
-: ::r:rs published,then
:: : : : : : :e .t si z es T
. o addr es sth e well. I am hard-pressedto find examples of substantial literatures where
110 . Ch a p te r4
parallel analysesin a single study have become an integral part of the re-
search process. If my observations are correct, we will know that social
researchersare coming to grips with the uncertainty issuewhen the report-
ing of identical analysesof parallel data sets becomesas commonplace in
researcharticles as the reporting of significancetests is today.
Identical analyses(samemeasures,models, and estimation methods) of
parallel data sets(different samplesof the sametarget population) can yield
divergentresultsdue to exclusion error and measurementerror. Of the "big
three" erclusion errors-coverage error, sampling error, and nonresponse
error (error due to unit nonresponse)-only sampling error is reflected in
conventional standarderrors. And measurementerror is not reflectedat all
in conventional standarderrors. As a result, confidenceintervals understate
the true level of uncertainty.
Unlike sampling error and measurement error, which often can be
modeled using probability theory, coverageerror and nonresponse error
generallycannot be modeled or are very difficult to model. To gauge total
uncertainty, then, we must often proceed inductively, that is, by examining
more data. That's where internal replication-the identical analysisof par-
allel data sets by the same researchteam-comes in. Realistically,replica-
tion may not be possible for most social researchtoday. But that should
not prevent us from being alert to the possibility. Rule 4 is a call for social
scientists to become more opportunistic regarding replication. Perhaps
internal replication will become a hallmark of social research later in the
twenty-first century, replacing today's norm of relegating replication to
follow-up studies.
A PPEN DI X
StudentExerciseson Rule 4
The exercisesbelow use data from the American National Election study
(ANES). ANES data are collected every rwo years around both presiden-
tial and midterm elections. Each survey is carried out in two waves, one
wave before the election and one after the election. Much like the GSS,
the ANES contains a multistage probability sample of noninstitutional-
ized individuals in the United States, age eighteen or older on Election
Day of each national election year.
To do the exercisesbelow,3 go to the web site http://sda.berkeley.edu.
As with previous exercises,the instructions below should generateal1the
statisticsyou need to answer the questions, making hand calcr-rlationsun-
necessary.Again, remember that variable names are in boldface. Note
well: The student exercisesat the end of chapters 7 and 2 use rhe same
web site. The instructions below include only the key commands; I do not
give step-by-stepinstructions. The fomrat in the sDA rveb site rnight have
changed somewhat since these directions were written, but ,vou still
should be able to follow the logic below to obtain the tables vou need ro
answer the questions.
One might think, therefore, that public support for the ERA declined Wc f ind sim ilar r : ,
as well in the face of these arguments. But the data are lessthan definitive trenr,Js. Let's brgin ri :-
on that issue,as we seefrom thesetrends in Gallup and Roper polls span- quer ied r espor r dcn[ :. . .
ning the period March 1976 to June 1982. v83 3) : "An ef ior r i. :
According to the Gallup polls, the percentagein favor declined from sti tut ionwhich u, - u - ' -
70 percent in 1,976 to 62 percent six years later. On the basis of the sex,I)o yt-luirppr()\t'
Gallup series,then, it appears that support for the ERA in fact eroded Constitution?"
over the late 1970s. But the Roper polls indicate no such erosion of sup- . From the SD\ -
port for the amendment (and perhaps even a slight increasein support).
ur chive" lr nd r i. :
r You will r r eetri , ' - .
-
Question 1. The question wording differs for the Gallup and Roper polls. f r om your anir '. ': .
Observe that the Gallup poll says that the Equal Rights Amendment
sponsesam or 't !: - :
"would give women equal rights and responsibilities." The Roper ques-
t r eatt hoseii'h,, -
tion says nothing of equal responsibilities for women. Do you think the -.
ing.
addition of "and responsibilities"in the Gallup question is the decisive . Fr om t he r eeoc- . , '
difference that explains the divergencein the Gallup and Roper trends, or f or "Nam e ior : .
are there other differencesin the polls that you think are more important? "Nam e( s) t - l1y\ . - : :
Defend your answer. lect "yes" inr '-
t ion. For r oir I
"approve" for " .--
PublicOpinion Surveyson the ERA, 1976-1982 t ype "2" f or ". : .
I . " Under "\ \ h. : :
Gallup Polls
ables( if an\ - . : " : .
Have you heard or read about the Equal Rights Amendment to the Constitution
( This will aLr r ( , : '. i
which would give women equal rights and responsibilities?Do you favor or
oppose this amendment? ing. ) Then e1r . ^
shouldhave- {.i. . -
3/1976 5/1978- 7/1980" 7/1981* 12/1981 6/1982* o To cr oss- t abul. r :.- .
Favor aio/
65% 65% 66% 63% 62% menu ar rhe rop : '
Oppose 30% 35% 35% 34% 37Y" 38"/" cies clr cr ossr . r b: :. :
var iableol' ir r r r : r -
"Question asked only of those who had heard or read about the ERA.
st udy)as t he . , ' - -
Roper Polls cent age5. 5elc. r "\
The various State Legislaturesare now voting on an amendment to the United Under "Char r , ': :
StatesConstitution which would assurewomen equal rights under the law. As char t " {unlet r t
I'm sure you know, there is a lot of controversy for and against this amendment. f ault " m ode. . r . '.
How do you personally feel about it-are you in favor of the Equal Rights
t he r able" r o , , - - i. :
Amendment or opposed to it?
valid cases,oi r '.: -
1 2 /1 9 7 7 7/1978 10/1979 12/1981 in opposit ion. 'l- -
In Favor 67% 69% 69% 71% aSome readersmigh: .,.
Opposed 33% 3 r% 31% 29"/o
for ANES. As discusse.i:r :
Note: Only valid responsesare shown Data provided by the iPoll Databank of the Roper neces s aryto rec ons truai:a:
Center for Public Opinion Research. oversampled.(Recall rh.r: ::
TheFourthRule . tt:
:: , r .i ,.i stion is t he dec is i v e o From the recode screen(found under "Create variables"), type in era
, ..: lnd Roper trends,or for "Name for the new variable to be created." Type in v833 under
: -...i Aremore imPortant? "Name(s) of existing variablesto use for the recode." Be sure to se-
lect "yes" for the "Replace that variable,if it already exists?" op-
tion. For row 1 under "Output variable," type "1" for "value,"
"approve" for "label," and "1" for "Var 1." For the secondrow,
type "2" for "valuer" "disapprove" for "labelr" and "5" for "Var
1." Under "\7hat to do with unspecifiedcombinations of input varr-
ables (if dny)," make sure "Convert them to MD code" is ielected.
-.nr to the Constitution
(This will automatically code the "Don't Know" responsesas miss-
:.i Do 1'oufavor or
ing.) Then click on "Start recoding" to convert v833 to era. You
should have 4,589 valid cases.
1211981 6/ 19 8 2 r e To cross-tabulatethe recoded variable with survey year, return to the
62% menu at the top of the home page. Select"Analysis," then "frequen-
63%
37% 38% cies or crosstabulations." Becauseera is the outcome or deoendent
variable of interest here, enter era as the row variable and v4 lyear of
: ::r. L,RA. study) as the column variable, and be sure ro ask for the column per-
centages.Select"No weight" under the "'Weight" drop-down menu.
::'..-;rdmentto the United Under "Chart options" use the drop-down menu to indicate "no
::.ihts under the law. As chart" (unless you want a chart). Leave the other options in "de-
: .rglinst this amendment. fault" mode, as you have done in previous assignments.Click "Run
: ,,i the Equal Rights the table" to obtain the cross,tabulation.You should have 4,589
valid cases,of whom 3,215 are in support of the ERA and 1,,374 are
I ; t 19 7 9 1 2/1 981 in opposition.a Print out your results.
GSS Resrzlrs
Next let's seewhat the trend looks like using the General Social Survey.In
1977 and 1982 respondentswere asked "Have you heard or read about
the Equal Rights Amendment?" For those who answered yes, the GSS
variable era asks simply, "Do you strongly favor, somewhat favor, some,
what oppose, or strongly oppose this amendment?"
. You will notice that there are four responsecategories.To allow for
an easiercomparison with the ANES, collapse the era variable into a
dichotomy scored 1 for "Strongly favor / Somewhat favor" and 2 for
"Somewhat oppose/ Strongly oppose."
o From the recode screen,type in era2 for "Name for the new variable
to be created." Type in era under "Name(s) of existing variablesto use
for the recode." For row 1 under "Output variable," type "1" for
"valuer" "favof" for "label," and "1 -2" for "Var 1." For the second
row, type "2" for "valuer" "oppose" for "labelr" and "3 -4" for "Yar
1." Make sure to select "yes" for the "Replace that variable, if it al-
ready erists?" option. Then click on "Start recoding" to convert era
to eta2. You should have 2,765 valid cases.
r From the "frequenciesor cross-tabulations" screen,enter era2 as the
row variable (dependentvariable) and year as the column variable (in-
dependent variable). Select "oversamp-weight for black oversam-
ples" in the "lWeight" menu and retain the same options as in previous
examples.Then click "Run the table" to obtain the cross-tabulation.
You should have a total of 2,025 respondentswho favor the ERA and
749 who oppose it. (If your numbers differ, you probably selecteda
different weighting option. )
Your results should show different trends for the ANES and GSS data.
On the basis of the ANES data you would conclude that support for the
Equal Rights Amendment declined sharply after 1,976,from 80.8 per-
cent approval in 1976 down to 61.3 percent approval in 1980. On the
basisof the GSSdata, by conrrasr,you would conclude that support for
the ERA did not change from 1977 to 1982. (Note that the Chi-square
value indicates no association between year and era2 in the GSS data-
in other words, there was no statistically significant change in era2 from
1977 t o 1, 9 8 2 .)
case of the ANES data eramined here, there are no oversamples,and the weighting scheme
is intended instead to weight respondents as necessary to obtain a sample that looks in the
aggregdte like the U.S. popularion with respect to age, educational artainment, and geo-
graphical location. Becausethe provided weights adjust for nonresponse,using them would
take away some of the heuristic value of this exercise.
The FourthRule . I 1.5
'$7hat
accounts for the large discrepancy between the GSS and ANF,S
results? Two possibilities can be ruled out at the beginning. First, sam-
pling error is highly unlikely. For samples this large, it is ertremeiy un-
i. : i ir: Ge ner alS oc ialS ur v eYIn
'
or r ead ab o ut likely that differencesthis big are due entirely to sampling error. Second,
, -i: - . r. .'ou h ear d
' :: ,r,1\-r.lnsweredYes,the GSS year of melsuremenl is also unlikely. Consider the differcnces in the ap-
proval rates basedon the ANES in 1980 and the GSSin 1982; 61.3 per-
a , :-.1 ,:. somewhatfavor, some-
_ _ -_----r: "
cent versus 72.8 percent Although historical eventscould account for a
shift of this magnitude over a two-year period, that appearsunlikely here,
::i: arL. categories' To allow for since the Gallup results (above) show no evidence of radical swings in
i.. - ..:lse the era variable into a public opinion on the ERA from 1980 to 1982.
::.' . flttlervhat favor" andZ for
Question 2. (a) Describe the critical differences in the wording of the
! \i.
:-l : : "\ame for the new variable questions in the ANES and GSS.
\, :': r of existing variablesto use (b) Describe the differences in the population actually surveyed. (Recall
-,-\--:r',rtlariabler" type "1" for
that the GSS question uses the filter, "Have you heard or read about the
- i -1" :or ''Var 1." For the second
Equal Rights Amendment?" whereas the ANES does not.)
r- : : "irbel." and "3 -4" for "Yar (c) Either difference-the difference in wording, or the difference in the
:-.: '-it:place that variable'if it al- population surveyed-could plausibly account for some or all of the dis-
r ">:.:rr recoding" to conveft era crepancy in the two trends.'S7hichdifference do you think is more critical
here? Defend your answer.
screen,enter era2 as the
::-.: r.rr ls the column variable (in-
..;1-;-11'sight for black oversam- AssrcNurNr 2: Estruerno Vorr, TunNour UsINc Drrrtnr,Nt
::.1 ::- sJmeoptionsas in previous Sel.plrNc Fneuss
: : : .rt'tain the cross-tabulation'
::: r.;-:'rtsrvho favor the ERA and In 2000 the ANES experimentedwith drawing respondentsfrom two dif-
:.:. :-::er. r'ou ProbablYselecteda ferent sampling frames. In the first method (referred to here as the FTF
frame, for "face-to-face"), a traditional multistage probability sample de-
sign was used to select respondents for home interviews. In this design,
=.... :,-rrthe ANES and GSS data' large regions are selectedprobabilistically, then subregions are selected
for the
-.,: ;onclude that suPport within the chosen larger regions, and so on, down to the household level.
-::: '. -.rfrer1'976, ftom 80'8 Per-
In the final step, a single respondent (an adult) is randomly selectedfrom
:. : - . : t aPPro v alin 1980' O n th e the chosen housing units. The FTF sample includes 1,001 respondentsin
for
- ,,. :iJ concludethat suPPort the preelection wave. All FTF interviews were conducted in respondents'
1-. L \ote that the Chi-square
homesin face-to-faceinterviews.
.. \.3r :r-rdera2 in the GSS data- In the second method, random-digit-dialing (RDD) was used to select
i .'. ..:iificant changein era2 from respondents,and all interviews were conducted over the telephone.In the
caseof RDD, then, the frame population excludesindividuals who do not
have telephones.The 2000 RDD sample includes 806 respondentsin the
:::.lllrples, and the weighting scheme
preelection wave of the survey.s
' :: - - ::. , obt ain a s a m p l e t h a t l o o k s in tb e
5For further explanation of the RDD and FTF designs used in the 2000 ANES, go to
:: : .t::. .ducational attainment, and geo-
- - :- .::: iL)rnonre S p o l l s eu, s i n g t h e m wo u ld ANES 2000, click on "codebook" at the top of the SDA analysis page, then click "Intro-
ductions" in the left frame. Finally, click "Sample Design" from the introduction menu.
116 . Cha p te4r
Use the Berkeley SDA web site to examine the differences in voting
turnout between the two sampling frames describedabove.From the SDA
homepage, click on "SDA archive," then go to the American National
Election Study (ANES) 2000 data and follow these instructions:
o Recode vL24I into a dichotomous variable, coded 1 for respondents
who voted and 2 for respondentswho did not vote, as follows: From
the recode screen,type in vote2000 for "name for the new variable
to be created." Type in v1241 under "Name(s) of existing variables
to use for the recode." For row 1 under "Output variable," type "1"
for "valuer" "voted" for "label," and"4" for "Var 1." For the sec-
ond row, type "2" for "valuer" "did not vote" for "labelr" and "1-3"
for "Var 1." Then click on "Start recoding" to convert v1,24L to
vote2000.You should have 1,554 valid cases.
o Cross-tabulate vote2000 with the sampling frame variable, denoted
v4. Insert vote2000 as the row variable (dependentvariable)and v4
as the column variable (explanatory variable). In the "'Weight"
drop-down menu, select"No weight." Use the options for previous
6The wording may seem a bit lengthn but this is designed to avoid what
survey re-
searcherscall "social desirability effects." Becausepeople often feel ashamed to admit that
they failed to vote, this rvording aims to avoid falsely affirmative answers from rhese non-
voting respondentsby giving them other, less stigmatizing reasonsrhat they might not have
voted. And by calling attention to a specific election, they attempt to avoid memory recall
errors by forcing respondentsto think carefully.
The FourthRule . 11-
(b) Is this type of exclusion likely to bias the RDD versusFTF results
on reported voting in a positive direction for RDD (higher reported
rates of voting in RDD) or in a negative direction for RDD (lower
reported voting in RDD)? Explain.
(c) AdditionallS the FTF sample may find it harder to reach the rela-
tively well-off and older people who live in gated communities, further
differentiating the composition of the two samples.This type of exclu-
sion is likely to exacerbatethe difference in the RDD and FTF reported
rates of voting. Explain why this is the case.
Measurement error
(d) Peopleinterviewed over the telephone may give less thought to the
question, or it might be easierto lie over the telephone. Is this type of
measurement error likely to bias the RDD versus FTF results on re-
ported voting in a positive direction for RDD (higher reported rates of
voting in RDD) or in a negativedirection for RDD (lower reported vot-
ing in RD D )? E x p l a i n .
4 1;;ffn:ffix;'
(b) Give le"rt riu,,
"t
*llljT#:H"ffi'.1?,,:il.j:::i*:tri;llti?
rares, *.",,ffi ',i:h:',ili :::iJ"l "', dirr...;;.;;i,'.,0o,,,.
and/or
th.high.,;;r,."i.i;;;;;l;;.ffi i::,:Tili:J:X',?X:n":
the argumentsyou maclern
assignment2.
. -. ;1.'el'l-sftof contracting
nor
- :-L!ractlung cancer, ConnErerroN AND Ceuserrry
r-. -rresusceptibleto lung
The first law of causality that students are taught in introductory starisric
- :- r: :! not t he only r ea s o n
classesis "No causation without correlation." Causality, then, implies
: : .,,ci .1 1r es ear c hm aY b e
correlation. But the reverseis not true, as students learn from the second
: ...:.::rlteob jectiveis to esti-
l4v7-"Q6r1slation does not prove causation." Thus students are sternly
:', . .ince questionsof what
warned: "Thou shalt not infer causation from correlation.,'
:- : : '-.r.: the battle involves de-
Many homespun examplescan be given of the secondlaw. There is, for
.; ,\ irh regard to rising in-
example, a nearly perfect correlarion between length of individuals' right
::.:.::tPle.is the dilation of
and left legs. Yet we would not conclude that the length of the right leg
: : : :-..:Jle.or at the bottom of
causesthe length of the left leg, or vice versa.Nor would we infer that the
.: -.'. i:1(r\\' the answersto these
absenceof gray hair is causally related to having babies, even though we
- :: :(r amelioratethe conse- observethat women without gray hair are more likely to have babies than
,:.:i 1! for the rich and stag-
' '. .r..fu l f or f as hioningta x women with gray hair.
': r:rSotlSfor the rising ine- The classictextbook example of a noncausal correlation is a reported
positive associationbetween presenceof storks and presenceof babies.
In one version there is said to be a significant positive relationship be-
: .:: -.r,.rslleffects are the same for
tween the prevalence of nesting storks and the prevalence of human
-
. :or good overviews of litera- births in villages in southern Germany (the explanation being that houses
with a newborn are preferred by nesting storks becausethey tend to be
722 . Chapter5
<- ex
#1,:::T,:[ir1,-u,.h"].;;;;;i;;;;:iy,,]lrH,; ll.:"11;
th..nn,..g;;;;""#ilil:?*:","JrTii:tri,i,}fl *:*".,,x",:,il1
berweenthe cancer-ana
.""i."r gl3rir',"i,r-' regard
to thcir contactwith
,t['f,",*:::.ff:ilru1*l-m'.1..';;;;;',it".r'"can ca.ncerts spread by creatures
such as ratsithe ,";;;; ;;^^:::'^il|'rner,
theconstipa,,"" of
,',ijill llt:tt ,cancer)'
'rttit '*lii"iuntu., 1r,,,.r,
;,T,*i"Tff '_I!:'JJ';"fi::';ff ;,'i:l';ff
i'k::;::i::]
wasnot,t",i.ri."rif'r:;'J#fri:;f: sroup,b,t thedirference
on
rvinrerm
sor,'"okin rT""'
gu.r.,','
r,..r."ll"i'iT'iol
iT'lii'iilfiil,Xl
udethat" h""; ;;;'i' -.,n..,,,,
J.tfl,j;.:'"cl somererario,,o
"1i.,
A secondlandma_rkstudy
on the.possibledang.erof
a decaderater in science. smoking appeared
R"y-o.,d'p-.-""ri a medicar
Hopkins, had carefully _ri*lr.J statisticiJn at
h;;il records of hundreds Johns
lies in the Baltimore area. of fami_
Unlike the retrospective
study, the pearl study ."", L.mbaril-Doering
p.orf..ii"",'foffo*ing
time (Pearl 193S). Thus pearj 6,g13 subjecrsover
;;;;;lr'
jlfit'ftIl:.*l:
$li;:lll*l"ll.,,rsmokers,-r;;;::[Tu':,i
smokers r'"ar"*"'i:ffi'Tl*:iill?l*""1- 'r'i'ii" 'iI""rnr"""'
had lower survival rates smokers,.who rurn
rhan nonsmokers]::*a.
r-uk.r,li,,.d
o"rrrr"'rir,, ;-o"lxT* ;:*ll ffiT:
,"jff:lJ,.
The pearrstudy was folrowed
by two rnuchlargerprospective
of- smokingand lune t:n:..., stuiries
onein Englandand o'e in
In Engrand'Richarl the U'itecJsrares.
iJ a'ratTa'itt, sentquesrionnaires
9^"1
smokinghabitsto all 6Q000 about
ing repliesfrom about ,he medicarprofession, recerv_
-.-1..r'"r
10,0.09.ii;;#;data from tire British,"g,r.ro-
:ff ffif;'j;,.*:, f,,eddearh,,;.,
;;;^ r""g.n;._.#',_,,i1^,
"o
*-i*l
mond and Daniel Ho* fi #rTi;*?lr.:: **#;:r unrted st-ates,cuyler Ham-
so-G
eo,,-r*'
-*:,:i..ll-?Tr
i",r,:il #'iffi',ili :'ffi41.;
..ti",-^,:;!^':;_:',:
; l,**:: ;:l;:;
reportedresurts
vervsim'ar ,o ,rr. o"iiiitiir-,rrarn*r,
over thenextfour
jfn r"lrg.,*",'* ^,'r.,.*,i,*, r,igrl..
f';i: jltif ;:.T:., ro..i.
"u,
f24 . Chapter5
Box 5.1
A Frank Statementto CigaretteSmokers
ir statisticalcorrelation In light of what we know now about the close link between srnoking
.;:!er's Digesr published and lung cancer,it might be hard to imagine such a "frank statement" ap-
...-rrningof the dangersof 'We
: pearing in today's leading media, evenwere such advertising allowed.
'- - :1i public beganto ques- must remember that, a half century ago, the case against smoking was
! :.rolrsht,tobacco compa- based almost entirely on statistical associationsthat were open to alterna
i r {Jfett€ Smokers" that tive interpretations. Hence tobacco companies were not out of line in
,.: lur-rdred other newsPa-
(continuedfrom ltreuiouspage)
'We
believethe products we make are not injurious to health.
\7e always have and always will cooperate closely with those
whose task it is to safeguard the public health. For more than 300
-< Ttntes and years tobacco has given solace, relaxation, and enjoyment to
:--::ri ' 4, 1954 mankind. At one time or another during those years critics have
held it responsible for practically every disease of the human
:,rr e given wide public- body. One by one these chargeshave been abandonedfor lack of
:ome way linked with evidence.
Regardless for the record of the past, the fact that cigarette
:::!roflx1 standing,these smoking today should even be suspectedas a causeof a serious dis-
,^.:he field of cancer re- easeis a matter of deep concern to us.
i::rous medicalresearch, Many people have asked us what we are doing to meet the pub-
-. ,:ld be disregarded or lic's concern arousedby the recent reports. Here is the answer:
'We
1. are pledging aid and assistanceto the research effort into
'..: interestto call atten-
all phasesof tobacco use and health. This joint financial aid
:-.earch scientistshave
will of course be in addition to what is already being con-
, ,i theseerperiments.
tributed by individual companies.
2. For this purpose we are establishinga joint industry group con-
rdicatesmany possible sisting initially of the undersigned.This group will be known as
Toeecco INoustny RsseeRcH CoumrrtrE.
.luthorities regarding 3. In charge of the researchactivities of the Committee will be a
scientist of unimpeachable integrity and national repute. In
is one of the addition there will be an Advisory Board of scientistsdisinter-
ested in the cigarette industry. A group of distinguished men
:.:-1:.rre smoking with the from medicine, science,and education will be invited to serve
,: :, i .1n\-one of many other on this Board. These scientistswill advise the Committee on
-,...::n.of the statisticsthem- its researchactivities.
i -: i :tiis t s .
This statementis being issued becausewe believethe people are en-
basic responsibility, titled to know where we stand on this matter and what we intend to
business. do about it.
1954 to note that "there is no proof that smoking is one of the causes[of
lung canccr]." As late as 1.959the eminent and colorful statisticianSir
Ronald A. Fisher published a book emphasizing that, in the absenceof
randomized experimentswith human subjects,the correlation of smoking
(continuedfrom preuiouspage)
BENsoN& HEDGES
JosephF. Cullman, Jr., President
Lenus & BRoTHERCouneNl INc.
\7. T. Reed,Jr., President
R. J. RTvNoLDSToBACCoConpeNy
E. A. Darr, President
:: . . .:rusality from those what the cancer rate would have been for the smoking group had they not
:": -. I l: rhirt correlationsdo smoked. That assumption is plausible since by random assignment we
- :.. : correlation between have alleviated the effects both of selectionbias3and of initial differences
: , : :-I\\ ' c en t he r wo v a ri - between smokers and nonsmokers that might account for subsequentdif-
r obs er v at i o n a l
t : - : . . ,.:-..:,'. litin ferencein cancer rates.
: , ' . . "':.'. r( ) ( aus es , I f indi v i d - To summarize: The problem in estimating causal effects is that we can-
'$fhat we
- i - - ':.1 ',1ririd, then our work not simultaneouslyobserveX and not-X for a given individual.
-- - - : i 'rLd be m uc h c lose rto would like to have is a reverse-Xuniversein which, for example, smokers
- -- - : ,- ::. ior examPle'to deter- in our universe are nonsmokers in the other universe. In the absenceof
-- :: - : if indiv id u a l s such a universe we attempt to simulate it. The most obvious simulation
i-: ':- '-:)lgnedr -rrningssome to attend method is the controlled experiment, where the researcherrandomly as-
:-, ,: ri e fr om c ollege at te n - signs subjectsto treatment and control groups. But there are other meth-
'- : r : :,.-: l l o t onlY t he P r ob l e m ods, and the remainder of this chapter focuseson them. The idea in each
,- :- ::1. differencesbetween method is to compare like with like.
: . -'ut also the Problem of
: - - - : '* here individuals self-
TVpssop Srnerr,crns FoR CoMpARINGLIKE wITH LIKE
:- -:- ;. ': L,i;tsin observational
The oblective in causal analysis is to estimate a causal effect from an ob-
served correlation. To estimate a causal effect from an observed correla-
- .,'- -. .:-.:ntis designedto com- tion, we must compare like with like to rule out other explanations for a
: .- :-': ..-.J;,rnt r ol gr oupst o d i ffe r correlation between X and Y:
t - ::- t o x we c anm o re 1. Omitted-variables: Individuals' differenceson X are related to other
-:l :l l ent
-: .' :-. Ii-i other wordst random causesof Y.
- :rrrolledex P er im e n tsto 2. Selection:Individuals selectX (or others selectX for them, as in the
,: -. , ::elation does not Prove caseof tracking in schools) on the basis of expectations about Y.
: . , :: ,..ndno scruples,and we In the classic experimental design we avoid bias, first, by random
i :::' , .-.: irouP Or tO a nOnSmOk- selection of units before the treatment X (to avoid selection bias) and,
: r:r r: -lge sixteen.on your stx- second, by random assignment of units into the treatment (to avoid
'We
: , -: - i ieterrnine whether or not omitted-variables bias). also would like a large N, to guarantec that
: . . :-: SroupYou must begin to our chances of sampling error are vanishingly small. \fhat we want, in
.; :::-.:: \ ou want to or not; like- brief, is a researchdesign that makes indiuiduals in treatment and control
groups indistinguishable from each other except for (a) tredtment status
-: .. . : ::,,h i b it ed f r om s m o k i n g ,
,-:rscvary the level among and (b) completely random shocks to the outcome uariable Y.
- : : ,
:: :'.-11fPack each daY, some So there are three key ingredients in causal analysis:random selection
'.:
i-r -. : i-.-o n t he ex P ec t at io nth a t of units, random assignmentof units (to a purported cause), and sample
lighter size. Chapters 1 and 4 addressedthe issue of random selection of units,
- :::-:,: iung cancer than
r ..:'.. s e focus on the two-group 3Selectionbias here rvould mean that individuals basetheir decision to smoke in part on
: ^ . :: :he ra t esof lung c anc e ra re their actual propensity to contract lung cancer, u.hich seemsfarfetched. In many other in-
-:
:r:. I : r"'ould claim that smoking stances,however, selection bias is very likely in observational data, as in the case of college
-r , : r' i : Jl ri m on t he as s um p ti o n attendance(choosing to go to college is based in part on how likely you are to benefit from
:: . --: is a reliable measureof co lle se).
130 . Chapter5
and the sample size issue is addressedin those chaprers and elsewherein
the book. In this chapterI focus on avoiding omitted-variablesbias through
the random assignmentof units, or through some alternative strategy that
mimics the effect of random assignment.Randomization is the means to
an end; the end here is the comparison of like with like. Once the subjects
are selected,how do we craft our study to try to make individuals in the
treatment and control groups indistinguishable except for treatment sta-
tus and random shocks to Y? The most common method is regression.
Regressionis a parametric approach, meaning that it tries to compare like
with like by using estimated paramerersto adjust for differencesbetween
treated and nontreated individuals.
Although regression aims, in a very general sense,to "match" on the
confounding variables,other methods do so more explicitly by the pairing
of similar units. The aim is to compare individuals in the treatment and
control group who have the same or very similar values on confounding
variables. One way to match individuals is on the basis of measuredvari-
ables,using either eract matching or matching with propensity scores(de-
scribed later). A second method is to match a unit on itself through
multiple observations (for example, two or more observations per indi-
vidual for investigationsof causal individual-levei effects, or observations
on two or more family membersfor causal family effects).For this match-
ing method, unlike the first method, we do not need to measure all the
causal variables in order to compare like with like.
A third way to match is through random assignmenrto the treatment
and control groups. Although random assignmentis not always included
in discussions of matching methods, the matching or "balancing" of
treatment and control groups is precisely what random assignment aims
to do. Indeed, random assignmenris a particularly attractive method for
matching becauseit harnessesthe power of probability in the service of
matching. To be sure, as researcherswe are not always able to randomly
assignunits. Nonetheless,as we will seesubsequently,it is sometimespos-
sible to exploit naturally occurring random assignment.
MercnrNc vERSUS
LoorrNc ron DrppnRr,NcEs
i l r: lh.rprersand elsewherein Consider again the 1928 Lombard-Doering study of the determinanrs
..r": -..ned-r'ariables biasthrough of cancer. That study compared a control group of 217 individuals
,-:.-. : .le alternative strategy that without cancer with 277 cancer patients, where the two groups werc
is the means to
:, i..:-.;,rtt.iizatien matched on the basis of age and sex. Note that, contrary to the standard
: r : ...rrh like. Once the subjects experiment, the "control group" in this caseis a category of the outcome
: ::., :() make individuals in the variable Y, not the treatment variable T. Moreover-and more to thE
, r: :1 . a\cept for treatment sta- point about looking for differences-the cancer and control groups were
-: - - 'r,)n me r hor lis r egr es s i o n , not matched on the basis of hypothesizedcausesof cancer.At that trmc it
:rl r-r :irar it tries to comparelike was thought that cancer might be linked to constipation, carried by ver-
- .,; .l:t for differencesbetween min, or spread from victim to victim. If those theories are correct, we
would expect to find differences between the two groups with regard to
i r :: : - t - se nseto, " m at c h" on th e laxativeuse, housingconditions,and contact with cancervictims, respec-
: : ,re explicitly by the pairing tively. To bring data to bear on the theories, then, we would not want to
, :.-: .,:iu.rls in the treatment and remove the differencesa priori by matching on those characteristics.
: . :. - : -rl -rrra l u es on c onf oun d i n g The overarching point here is that we need to understand whether our
: .: :. :he basisof measuredvari- obiective is to discover the causesof an effect or to estimate the effect of
:ir :'i ',r'ithpropensityscores(de- a cause.If we want to know if T, U, and V are likely causesof some out-
-.:-.: .r unit on itself through come, then we cannot match on T, U, and V. To locate the possiblecauses
: r'rrrr observationsper indi- of lung cancer, for example, we want to know how lung cancer victims
:*,,.-.:i. cl effects,or observations differ from those who do not have lung cancer.This harks back to rule 2,
.... :.. :- -r' cffects).For this match- "Look for differencesthat make a difference," as well as to the first law
: i :- ,r need to measureall the of causality, "No causation without correlation." In exploratory studies,
: \ -- Ng ,
we do not want to match on potential causes.
' : ' -.-..ignmentto the treatment If, on the other hand, our aim is to estimate the effect of a cause,then
.i . i i: r -n t i s not alway sinc lu d e d we want to match or "compare like with like" to eliminate the confound-
: - : : - -.-::h i n go r " balanc ing " o f ing effects of other causes.In the social sciences,regressionanalysis is by
- , : :.rndomas s ignm enta i m s far the most common strategy for eliminating the effects of other causes.
'We
::.: --...rrll attractivemethod for begin, then, r,vith the regressionstrategy.
:: : ::,rb.rbility in the serviceof
: ::: : ',i irlrvaysable to randomly
-- . ^-: --l sntl r. i t is s om er im esp o s - THn SreNoaRDREGRESSToN
METHoD FoR CoupenrNc
: :: -:s!lqnment. Lrr<nwrtn Lrrs
:: 'red. for example, to a the @sdoes not help us much, since the gls and the ps generally differ
.,. :., removethe confound- (dr* B, etc.)when the Xs and V/s are correlated.
So when the measured and unmeasured causesof Y are correlated, as
: r:rerallv the casein social is almost always the case with observational data, the SLR approach is
problematic. The standard practice of regressingY on its measuredcauses
: - :. -\, unm eas ur edc a u s a l is, we say, subject to omitted-variablesbias, This is all very familiar to re-
. : ..,: s Jcnot edby a, searchers:We know that we are all too often estimating truncated models
such as model 5.2, and we know that this isn't ideal since our estimates
( s.1 )
are subject to omitted-variables bias when the Xs and Viisare correlated.
: - : : : :::. .tn it ( indiv idual,c o u n - The SLR model continues to dominate many fields of social sciencede-
- :-,,.: .-.-::,-imean of zeto and is spite classic warnings about its limitations. Notable critics of standard
. .rn overall constant and regression analyses in social research include econometrician Edward
.tf J apt ur esdif f er e n c e s Leamer (1983), who writes about taking the "con" out of "economet-
: :rnd not otherwise ac- rics," and sociologistStanleyLieberson(1985, chap.2),who warns about
-:.. , -::1da, are conflated in the dangers of the usual control variable approach in regressionmodels.
. : ,..::1p :r neldat a, aS we s e e As Lieberson notes, adding control variables willy-nilly or ritualistically,
- i:. -:-ro\\- \'ector of parame- with little thought given to how their addition changesthe interpretation
- - -.:::i lectors of variables. of other variablesin the model, can have disastrousconsequences.
. -: 'r - B,X r i+ ' ' ' ' + F p X p p Some observersgo further in their criticism of standard regression in
. ,,: .:i l es , and y W , is s h o rt- social research. Statistician David Freedman, for example, argues that
' -::: Q is t he num be r o f regression-though useful for summarizing linear associations among
variables-is ill-suited for finding causal relationships in social research,
: -:.: i:lon between measured no matter how much we dress up the method. He writes: "I see no cases
'.1... in which regressionequations, let alone the more complex methods, have
The model is very gener-
- succeededas engines for discovering causal relationships [in social re-
:: .:. i:ta sincewe rarely can
- . : . : . n. \ OU Ca nn ot estim at e search]. . . . As I read the record, correlational methods have not deliv-
:: :-: :he truncated model: ered the goods. nfle need to work on measurement,design, theory. Fancier
statisticsare not likely to help much" (1997, pp. 1I4, 757).
/s 7)
I agree about the "need to work on measurement,design, theory" (the
:. : =: : : . :.ssi o n ( S LR) .T he ide a i s theme of chapter 7),but not with the implication rhar we would do better
-.- ,.:: ,:-imanv of thosecausesas to abandon regression methods altogether in causal analysis. The prob-
:. - : : :ri o d i n m os t of t he s o c i a l lem lies not in regression itself, but in reliance on regression control
:r: : ::-: -: social reseafch. (residualization) as the primary or only method for matching like with
like. The crux of the matter is what to do about the "W's problem" in
-,ri
equation 5.1. There are two general approaches or grand strategiesfor
-<51 if7-7fgg1r trying to minimize omitted-variablesbias:
':ir::: 1 Striltegy are well knOwn.
1. Add-a-regressor approach: Move variables from W to X, that is,
-. :,:r '.i hen u'e do, we might not
measuremore of the causal variables.
::: : i . r : )' on the mea su red caus es
2. Design approach: Reduce the correlation between the V/s and Xs
':: '.1.:. rie in effect estimate the
through researchdesign.
: - . ::.i c.tusal effect of X, on Y,
i: r.; ,.-.:lr.1dof B, we estimate d3; Until recently observational research has relied most heavily on the
::.:: '.:-iJs unbiasedestimates
of add-a-regressorapproach. In this chapter I suggest that we follow the
734 . Chapter5
lead of recent statistical and econometric work and try something else:
ti rne.Th e subscr ipt/ is r : : - : : -
Let's focus our attention on the creation of research designsthat reduce first term in the panel rirr:::
or eliminate the correlation of the Xs with the Ws. On this I think Freed- measurementpoints rha: .:: -
man would agree. Yet, as I show in the examples that follow, regression rviseascountedfor br ti.; .'.r ,
may still play an integral role in causal analysesin the social sciences, is not needcdfor cross-ir:: . i
since research designs developed to eliminate the correlation of the Xs ro Y , X a nd W, t hc p; r ,
with the Ws very often still rely on regressionto estimarethe key causal pendentvar iablest o cha. i: ', : '
parameters. Importantly, though, this more tailored use of regression rc B and 7t he m odel ai- , '. '. : - .
analysis plays to a strength of regression-summarizing the information time-
in data-rather than to a weakness-matching like with like. To i l l u st r at ct hc hr ir . ; - ' - -
A major shortcoming of the conventional add-a-regressorapproach is ' *
X s at two point s in r ir r ';
that we never know how successfulwe have been in identifying ancrmea- . - : -
i 5.3) w e can wr it e our : : . -
suring all of the causes,or at least most of the important ones. For causal
time 2 as follows:
analysis, then, we would prefer the second option above, the design ap-
proach. Random assignmentto X is one such design method. Yet random Yl = ar + a- + B, X, , - , r , .\ \
assignmentis not the only possibiliry and, when random assignmentto X
Y,r= arl a,l p.X,. - ,r\\ , -
is not feasible-as is frequently the casein anthropology, economics, po-
litical science,and sociology-social researchersare prone to scurry back The critical advantag.- .. ::
too quickly to the comfortable add-a-regressorapproach. A central mes- individual changeover :r:-: - ,
sageof this chapter is that we need to think more creatively about how to from Y,r:
design our researchto compare like with like when classicrandom assign- Y ,r-Yu=@z- a, ) - P. \ . - p
m ent is out o f th e q u e s ti o n .
=( ( r z- or t - B. \ _ - )
Social researchershave at their disposal a number of strategiesthat,
whether they realize it or not, are rooted in rule 5. 'We begin with the +02- 7r \ \ ' -
most obvious: Compare individuals with themselves. E qua r ion5. 4 st at ei r r . : : - - . -
or), bY change in ) "s : : - . . . , . - - .
W ,r-W ,r . r espect i\ el\. : -
Collpenruc Llxe wrrrr Lrrn tnnoucH Flxno-ETFECTS
METHoDS unmeas ur edcausest F. - F . '
term (e,r - qr ) . Fr or r rce; . . :
First-Difference Models: Swbtracting Out the Effects of the W s disappearin t hr - - . - - :
ConfoundingVariables
r the Ws ar e consr r r - - : l
The first-difference method compares like with like by matching individ- c rhe pffcrtc of rh- \\ . -. -
uals to themselves.To match individuals to themselves,we musr have
measuresof individuals at two or more points in time, that is, we must l n shor t ,a sim pleiir - : - - - - . - .
have panel data (more on panel models in chapter 5). Consider the elabo- confounding effectsot ..--...:.., -
ration of the standard regressionmodel to permit measuresof individual ables that are const2lnt: ' -, :
units (people,corporations, nations, etc.) at multiple time points: effects over time. Thi. :. ,. :
unmeasuredcausesver\ i: :-
Y i, = a, + d i + F } i r+ y fl ,r+ e o (s.3
) or approxim at elvso. \ , ', : : : - :
- .- : : ' -
i ncl uded if f icult - t o- r nr . i)
Note that the panel form of the regression model adds the subscript I
work ethic, and sense(-\:- .- j
(for "time") to each term in the linear regressionmodel (5.1) excepr a,,
cri mi nal just ice.cr r m in. - - : -
which needs no subscript / becauseit reflects effects that are stable over individual differencesr:r :: -': -
-l'heFifrh Rrrle . 1-l-i
Yi t= dt+ a. + B r X . r + ! 1W ,r+ e ,,
- : - --, : : \ freas ur e.F or r es ea rc h speak of a constant causal effect, but if B, + Brthen the causal effect itself
:-: . . - -:.i nceof dif f er ing " c o r- changesover time. This might complicate our conclusions, to be sure, but
: : : : , : . -Lrfl S t loty, et dif f ic ul t to the point is that with first-difference models we can at least test for timc-
varying causal effects (more on this in chapter 6).
- :-:- -.:-: -rle.rsuretraits include
Iil :- : t -. -'.tahas a c ount r y ' sc u l - It is surprising, then, that social scientistsdo not use first-difference
,- : . . --..r;ces st o s eapor t sa, n d and other types of fixed-effectsmodels more often (Allison 2005; Ha-
-, : : : --i .,i thes eendur ingc h a r- laby 2004). One likely reason is habit. Until the Iast few decades
- *r: : -'- .::-larcristicsare constant cross-sectiondata setswere the norm. As panel data sets becornernore
routine, first-difference and related models should become mare stan-
: ,i: - . :: ,ni tfpe of a m or e g e n - dard as well.
. - - i : Halaby 2004; Hs i a o As with all methods, the first-difference method is no panacea; there
-::- ; iiffers from the usual are costs involved. In the first place, first-difference models remove only
i::::i :-..\s. To avoid omitted- the stable effects of unchanging Vi/s.So you cannot rely on differencing
.:: -- r - ::-.\ method, the unmea- as a foolproof method for sweeping away the effects of all types of
' , :- : -: :re asu r edc aus esof Y . In unmeasured causal variables. First-difference models are designedfor in-
. - : : - : : :. ,mi tte d- v ar iablesbia s i s stanceswhere there are important causesof Y that are stable and hard to
-: . .: : ::lects are stableover the measure.
::.i -d i ffer enc em odelsa re A second cost is loss of variance on the measured causes. First-
-:- r - . .
:: :. r .:.. \\e might call this the differencing (and other fixed-effects methods, such as adding dummy
variables for individuals) reduces the variance in X. When we regress y
on X for a cross-sectionof individuals, we use the variance in X acrossall
: : : : r : : . -: r'nodelr eP lac est he t YP - individuals. \fhen we measure the same individuals at a later point in
-:, r-: : J .rn du nm eas ur edc au s e s
j time, we add change in X over time for individuals lxiz-X) to the vari-
r' : ,:.-,;:::tt' assumption that the ance in X. Thus when we regressY on X for a panel of individuals mea-
- : , : - :'.>fAnt ef f ec t s . sured at two points in time, the variance in X consistsof change in X over
::;e of the first-difference time for individuals as well as variance in X across individuals. By re-
: rt-: : nonexperimentalsocial moving the variance in X across individuals, first-difference models trun-
:. :- i- . .:i regr es s ionanaly s is w
, e cate the variance in X. Truncating variance on independent variables
important causes, or typically is not beneficial since,as noted in rule 2, we need variance to ex-
-::- ..-. :he
::;..,::; .r'ith the causeswe have plain variance. Yet truncation may be beneficial here becausewith first-
:! ! -. -.:: , ,ns. The first-difference difference models you generally have less, but "higher-quality," variance.
.: - :::. leveland effectsof the The variance is higher quality becauseit has been purged of the enduring
:- : : : : -::l p ti o n t hat unm eas u re d causal effects of unmeasuredfixed traits.
-r : : : ::a l l the ex am plesof th e Let me elaborate.Becausean individual contributes to variance in X. -
: . r ::trions,and the topogra- X,, only if X changesfor that individual, causal results for first-difference
: ::.: .::: -.:trPtionthat thosecauses models depend heavily on how much Y changesfor individuals whose X
r:- - , - .. Lrn i tsT. his f eat ur egi v e s changes over time. So the practical implication of variance truncarion
..: : - ::-,\ regressionmodels in here is that results for first-difference models depend heavily on how
much Y increases(declines)for individuals whose X increases(declines)
-:--.: ,.: :,r rest whether the causal over time. Consider, for example, the effect of southern residence on
. ,: : : Ii u'e r egr es sY , r - Y , , o n racial attitudes. Differencing means that our estimate of the causal effect
: ,'r. :>:rmatesthe difference be- of southern residencedependson how much (and in what direction) atti-
:.::l:---ters are the samewe can tudes changed for individuals who moved into or out of the South.
I -l8 . (.hapter-5
'We
can difference our :-::-
Sibling Models
analogousto the diffe:.r.
Sibling models attempt to compare like with like by matching individu- data. Alt hough t he . l. r : . ,- .
als to strategic other individuals-their siblings-who are likely to be ual tra ir sr equir esp; r nc.- . :
similar in multiple (and often unobservable) ways. Sibling fixed-effects acteri st icsr equir e' t ih. : .
models, or simply sibling models, are used to remove family environ- S upposeour sam pl. -
ment effects that are the same for all children in a family. A sibling l i ngs.Call t hesedlt r r . : - '
model can be thought of as a contextual model where family is the con- of the siblings( r and, 'i . -
text. Contextual models assume that the characteristics of contexts- call thesedata sampleB. ','.
families, neighborhoods,schools,and so on-matter, just like individual (Note that the sibline :- :.
traits do. By collecting data on individuals with shared environments- which we can general:2. .
children in the same family or in the same neighborhood, for example- the subscript i indere s -..-.:
we are able to difference our consranr unmeasured effects of the shared l i ngs a r e denot edr . 1 . i: '- :
environment. A pplyingequar ioni':
Becausesibling models are a type of fixed-effect model, the effects of
Y ,o =ao+ a- u+Br \ . - ; .
stable confounding variablescan be removed without measuring the vari-
ables.In light of that feature, Guang Guo and Leah Vanlfey (1999) use Y,o=a"t a,r+ B"X.u - ? ,
sibling models to determinewhether the inverseassociationbetweenfamily
N ow subt r agrY. , t : . .
size and children's intellectual development is causal. In brief, therr an-
swer is no, it is not; the "dumber by the dozen" concern (Zalonc 1975)ris Y ,o-Y. o=( a, - a
ID 1A Ib :1
- lJ
without merit (see Guo and Van'Wey 1999 along with commentary by (l,ruW,,- yA\\',{ - :...
Phillips 1.999 and by Downey et a|. 1999). (As we note subsequently,Dal-
ton Conley and RebeccaGlauber [2005] investigatethe same issue using Keep in mind that )-. - I
an instrumental variables approach. Interestingly, Conley and Glauber the differencesfor the :..:.
find causal effects that are smaller than those found in traditional regres- in the same familr-.
sion approachesbut nonethelessare not zero.) \7e have then a s:::-.: ,
To see how the differencing method works for sibling models, con- di fferencem odels de: - : .: :
sider again the standardlinear regressionmodel (equation 5.1). In addi- gui sh f am ily- levelc. r u. . -
tion to distinguishing observed from unobserved causes, we also equation 5.4 abor.e.re\,.:.:
distinguish family characteristicsfrom ir-rdividualcharacteristics,so the
Y ,o-Y, n=@i, - o. . . - 13
sibling model consistsof four classesof causal variables:measuredand
unmeasured individual traits, and measured and unmeasured family +( t n- 71 \ \ . .
characteristics.Of course,if the context were neighborhood instead of
family, we would substitute measured and unmeasured neighborhood - r \ Bor t n- O ''
characteristicsfor measured and unmeasured family characteristics. + t ^t t A t t t _. i
' I rB ; I
Our essentialconclusions about sibling models apply to other types
of contextual models (neighbor models, school classmatemodels, etc.) From th e lasrlinesof : - - . :
as well. unmeasuredfamilv ch,::.:, .
\X/eadd the superscript Fam to distinguish family variables and coeffi- model when:
cients from individual traits and coefficients. Thus the seneral sibline
model is: 'To s i mpl i fy equl rrnr ' .
Iecred.If that a> s ul rrprr,' - . '
Y,=o+ a,+ BXr+ yw,+ BF"-X,F"-+ 7 F"''wiF"'+ q (s.8) conclusions.
The Fifth Rule . 1,11
'We
can difference out family characteristicsfor sibling pairs in a manner
analogous to the differencing out of individual characteristicswith panel
i.' L.vmatching individu- data. Although the data demands differ-the differencing out of individ-
r - r . -:-\r'ho are likely to be ual traits requires panel data, whereasthe differencing out of family char,
: :: ,,:r.. Si bling f ix ed- ef fe c ts acteristicsrequires sibling pairs-the logic is the same.
. :-nto\ e f anr ily env ir o n - Supposeour sampleconsistsof i=7,2, . . . , N children who have sib-
-- : -
:- * r:-. i11a family. A sibling lings. Call these data sample A. Imagine we collect the same data for one
i - -: ',,,here family is the con- of the siblings (randomly selected)for each of the children in sample A;
. - : . - .r-.,tEri s t icof s c ont ex t s - call thesedata sampleB. !7e have, then, N matchedpair:sof (A, B) siblings.
- ---'.,-i cr. j us t lik e indiv id u a l (Note that the sibling model rules out only-children-so the population ro
. ., :: .h.rred environments- which we can generalizeis children with siblings, not all children.) Now
: -: - -.orh ood, f or ex am pl e - the subscript I indexes a sibling pair, not an individual; the individual sib-
--:' .: -.rr.deffectsof the shared lings are denoted iA and iB.
Applying equation 5.8 to samplesA and B in turn, we have:
:- . . : - . : :-.t model, t he ef f ec t so f
Y,o= cro* a,o+ BoX*+ yAW iA+ BAt"-X,ou"- * Zo F,-W.^F"- + a^
, : - , riLrur measuringthe vari-
.,--; :rh Van!fley (1999) use Y,u= ar+ c.i|+ PBXIB+TsW,r+ Br.u-X,uuo- * 7, Fu.W,nFu-+ e,u (5,9)
. ::ir ',ri,tcietionbetweenfamily
Now subtract to from Y,r:s
:-r : -.r'-iSil.In brief, their an-
: r -',nCern( Z ajonc 1975 ) rs Y,"-Y io=(a u- a;a)+ (Buxru- FaX,o)+ (f, t"-X,ut"- - fo Fu-xiAFu-)
+
'-- - :-,: u'ith commentary by
(y"w - zow,o)+ (yro"'w*u"- - 7ot"-w,ou"-)* (e,u- e,x) (5.10)
--: ' : iiot€ subsequentlY,Dal- "
- . : :.1 :cthe s am eis s ueus i n g Keep in mind that Y,r- Y,o is the Y difference for the ith sibling pair. Thus
: :: r i: ' j -r. Conley and G lau b e r the differencesfor the family variablesdenote differencesbetween siblings
-. : -. rd in traditional regres- in the same family.
: I7e have then a simple first-difference model, similar to the 6rst-
.. , i: . . rL)rsrb lingm odels , c o n - difference models described earlier, but with additional terms to distin-
- ' : . e quat ion5. 1) . I n ad d i - guish family-level causes from individual-level causes.Proceeding as in
- . - r)rfVed c aus es , we a l s o equation 5.4 above,rewrite (5.10) as:
.-. . :,.,-.1 char ac t er is t ic ss,o th e
Y,u- Y,o= la,u- a,) + ft(XiB - X ,o)+ (Br- Fo)X,o + yr(W.u - W^)
,r -- .,. ...lriables:measuredand
-
:- ': - -ti tJ u nm eas ur edf ami l y + (Tn- Ta)w ro+Bur"-(x.BF"- - xiAFu-)
,: . i : : , ::i ghbor hood ins t ea do f
:- - -.:::llL'i.lsured neighborhood + (B sF^ - F etu')X,ooo-* zrFu'(w*Eu* -'wiAFun')
-=- -. . - =; i e rn ily c har ac t er is ti c s .
+ (yBF" - zoF"-)W,oFu'+ (e,o- t,o) (5.11)
-,: -' *.,s aPPl,vto other tYPes
i. i- - . Jl ass m at em odels , e tc .) From the last lines of equation 5.11 we seeimmediately that the effects of
unmeasuredfamily characteristics(the W,tu') are eliminared in the sibling
.- ,- :-'.:irilvr-ariablesand coeffi- model when:
:- . : : . . Th Lrst he gener als ib l i n g
5To simplify equation 5.10, we assume aA=aB since samplesA and B
are randomlv sc
lected. If that assumption is false we could restore the term ar-aA without affecur.rgour
,.-.-
i '--\\' ( 5 .8 ) conclusions.
",
142 . Chapter5
o W is the same for the sibling pairs, that is, 'W ='1X/
rF"- ,o'^^, ard
. the effect of V/ is constant over time, that is, y"r^- = yor^ .
Observe the policy implications here: If income effects are not causal,
then raising the incomes of the poor will be ineffective in improving their
children's life chances.So it is important to know whether familv income
114 . Chapter5
:* ,. :ir Jifferences
in family en- environmentswith respectto Q. Then the causalmodel for A Scsoorrxc
would be:
r :", :.r: rhe problemof unmea- A ScHoouNc,= Bo+ B, A Fel.trvlNcoMEi+4 A CnrroTnaIrs,
- - ..: J.rtaconsistsof 328 sib- + 7A Fel -rrvQ,+ a, ( - 5. 131
-.-,':* -) The dependentvariable
'- -' .: :hrsrr.rodel: where A FerltrvQ, is sibling difference on family characteristic Q. tsl'
omitting A Fer,rrrvQ,in equation 5.13, our estimateof the family income
effect will be biased when A FamtrvlNcoun is correlated with A F.q'rtI-
j ( 5. 1 2 )
rvQ,. In general, the greater the correlation, the greater will be the bias.
' r:, I rus -\ S c uoor , r Nc ,is th e The sameprinciples apply to the biasir"rgeffect of unmeasured indiuidual-
^ .nl cr edf or r he r r h s i b - leuel causaleffectsthat do not "difference out," that is, WiB lWiA in equa-
-:: - : ihe difference between tion 5.11. There may be unmeasureddifferencesin the personaltraits of
-- j:- i:J -\ CutroTnetrs,, and siblings that affect their differencesin years of school achieved (perhaps,
r- , - ...rl of income when they for example, we have not measured motivation adequately).If those un-
,.r: -: :::e caseof twins there is observeddifferencesin siblings'personaltraits are correlatedwith differ-
-:: :u'ins are the same age. ences in their exposure to different levels of family income-which
-:- r
ri: Duncan et al. (p. 411) say is unlikely-then our estimateof the family in-
-- -::rerdllv is not zero, and
' i ., ::las over time produces come effect will be biased.
It is time now to summarize. The key idea in sibling models (as in stan-
::- -::: and younger children
' -: -- i r o o t his as s um pti o n dard first-difference models) is to difference out the effects of confound'
--
ing causeswithout differencing out tbe effectsyou want to inuestigate-So
" i-: r-. iel has no measure of you want siblings to be alike and different: alike on unobserved causes,
' - . . .- nme nt m eas ur esis n o t and different on the causesyou are invesrigating.Thus sibling models are
-:: -.:r :liminated the effect of most effective when your purpose is to estimate the causal effect of some
-:: :r.-:rr for the effect of dif- feature of family environments and:
:: -' *: . -\s shown above, dif-
I . Fami l y envi ronmentschangefor the causesyou are i nvest igat ing.
: : r r : -t : rti u nm eas ur edc au s a l Becauseyou cannot explain a variable with a constant (chapter 2),
-,'. -
"- :: : hen yrF"' zaF"' (equa- siblings must be exposed to different family environments for the
ir '*: : -.'.edo not need to mea- causesyou are tryi ng to esti mate.
- :: ,..-r:.icteristics are the same 2. Key unobserved features of farnily environments are stable over
-1i. :r:r,': on Y for both samples time so siblings are exposed to the same featureswhen growing up.
The sibling model will difference out these effects (W.BF"--'WiAFu'').
-:: - ': -: : ::si i cs,then, we m us t a s -
3. Siblings are randomly assignedto samples A and B so we expect
-: :', '- :rrr bclth siblingsas they
that the average effects of unobserved causes will be the same tn
- - . . , , -::ri sti c of int er es t ,ho w - both samples(yn=laand yuu^^=I4F"-)and that sibling differences
rtr :-: !.ii'nefor both siblings,or
in unobserved personal traits (W*-W,o) will be uncorrelated with
.i: .,.-:,:-.: !o estimate.In the Dun- the family-level characteristicsof interest.
' . . :: . : . '. .chooling,f or ex am p l e ,
4. Sibling differencesare the samewhether you have one sibling or five
: -': -.::.:.i nco m e,c hangeds i g - (or, if sibling differencesdo depend on family size,the effect can be
' : i: . . -:. of the f am ily env ir o n -
modeled).
- - :- i. r f.rmi l v c har ac t er is ti c - When these conditions are met, sibling models provide traction in the es-
,,.::: ::,lred in different family timation of family-level causal effects.
146 . Chapter-5
Exact Matcbing
observationaibio-
The textbook example of exact matching comesfrom
a clinical study where an lnvestlgator wants to
medical research.Imagine
assessthe efficacy .ri rr."r-."t X. for alleviating some symptom Y'
accessto two
There is no randomizedtrial, but the investigatordoeshave
profiles. One set consists of the demographic charac-
sets of individual
(X,), health histories (X'), and.current measurement of symp-
i.rirti.,
y were treated, and the other set consists .f the same
tom fo. tii,rr. who
lvho were not treated. Matching consists then of
information for those
from among the control group to provide matches
strategicsubsampling
The treatment ef-
on X.-and X, fu iniividuals in the treatment group'
calculating the average difference on y for the
i;;, i: ..ti_"t.d by
matched units.
I nex ac t m at c lr i rrg re s e a rc h e rs s i ftth ro u g h thecontrol stofi ndi ndi vi d-
cases
uals who match those in the treatmentgroup. Becausenonmatchlng
matching is most commonly used where there is an
are discarded,exact
a bundanc eof uni ts i n th e c o n tro l g ro u p fro mw h i chttl formmatches
the re-
with units in rhe treatmenrgroup. In prospectiveclinical studies
set through the discarding of control units can be cost
duction of the data
permits investigators to follow onll' the control units
effective becauseit
data
that matter (that is, the ones that match) rather than collecting
controls. In those types of studies researchers
over time on all the original
T h e Fi fthR ul e . l ,+ i
with limited budgets might decide to usc one-to-one
exact .ratc'i'g.
w here eac h r r eat edunit i s rn a tc l .reto
c r e ra c tl y ,.rr" .o r.,tr,rr.' ," i i . r"
eral, however,it is better to use a[ control units * ." -
that n'o,.t , .in.. ,t.'^,
techniquereducesthe varianceof estirnates.
Neither method-exact_matching nor one-to-onc
exact matching-is
very common in the social scierces,where the
curse dimensio,aliry
makesexact matchesproblernatic.(The te'n ..curse 'f
of clime'si<.r'ality,, i'
statisticsrefers to the<lifficulty of fitting a modcl
*rth ,,.r,rf folrrut. .,._
planatory variables.)with regard to thJiss,,e
of matchi'g, ,tr. rrro..
ables you have to match on, and the morc categories "".r-
there arc for each
variable' the more difficult it is to find exact
,naicrres.rypi."rii we want
to march individuals simurtaneouslyon multipr.
uariabrei,uni ort.n tn.
matching variableshave multiple categoriesjthe
confoundirrg uorr"bl.,
might be conrinu'us, for e"arnple;.As"a result,
..r."..h.., _i!nr.na up
discardingunits in the.treatedgi,rup, as well as
unirs ir ,rt" ."riror grorp,
for lack of a match. The-discaidingof .ur., rs
so severein many applica_
tions that onlv a small fracti<rnof the data can
be used and, if treated
units are among those discarded,the treatmenteffect
of inte..st might no
lo.ger be the trearmenr effect being estimated.
In other words, the
wh.olesalediscardingof casescould result in b.th
inefficientand l:iased
estimatesof the treatment effect.
mirtching does not require the exact matching of
-Fortunately, individ-
ual units. It sufficesinstead ro march ihe distributions
as.tor?tf porrr
ble, as we now see. ",
respect propensity scoring is similar to standard regressioncontrol with David Harding (201)-1.
: ^
covariates.Flowever, becausethe use of propensity scores permits us to neighborhoodeffecr.,
achieve balance by trial and error, whether or not we know the proper
functional form, the method offers an attractive alternative to standard The solr r t ionr r sL
J . -. .
regressionin instanceswhere there is uncertainty about the proper func- mQre control sul.i.,:,
tional form of the relationship between Y and its predictors (Harding idenrical to the cor::
2003). Hence the approach is less subject to bias arising from confound- treatment.. . . Tre.::::'.:
ing variables that are included, but in the wrong functional form. subjectchar acr er ir : :--
The general procedure is to summarize all the measured confounders treat m entand cor r r : . _
by a single variable called the propensity score and then to match the To use propensrlr':, ::
treated and nontreated units on the basis of the propensity scores.There di ffere ncesbet wecn. : . . - :
are four basic steps:(1) use logistic regression(or a similar method) to children in other neign: :-
regress the binary treatment variable X on the confounders to create a example, since childre:: ,,.
variable X; (2) match treated subjectsto controls on rhe basis of the esti- l i kel y t o live in po( ) r : . : r '
mated propensity sco.esX, for each unit (in "nearest-neighbormatching," likely to finish high scir,
for example, each treated unit is matched to the control unit with the mediate the effect of r..:-
most similar value of X); (3) verify that rnarching has balanced the treat- how ever ,sincet hat r vr'. : . -
ment and control groups; and (4) draw conclusions about causal effects effect.)Then we use log:.: .
on the basis of the differences on Y between the treated and control tor to regressthe "trear::-:
groups for the matched data. denotedX) on f am ilr i: : . -
If step 3 telis us that we have not balanced our data, then we try a poor neighbor hood.
again with a different specificationof our regression.That is the cardi- The regression of ,\ :
nal virtue of the propensity-scoremethod: With propensity scoreswe nei ghb or hoodyieldsi. . ' -
can try increasingly elaborate specifications (interaction terms, squared di stri bu t ionsof * lr he - . : .
terms, and so on) until we find a specificationthat balancesthe data. As distribution ofX for th.
-
Daniel Ho et al. (2007) point out, the specification works when it most instancesit makc. ..:
works, and when it doesn't work we keep trying. This is in contrast to for X where the treatnrr:.: ,
standard regression-control-by-residualization, where we are beholden overlap,estimatesof rh. ::.
to unverifiable assumptions about proper functional form for balancing tions about functional r, :-:
our data. to avoid by using prr)f.. ) :-
To flesh out the propensity-score method with an example, consider with covariates.
again the effect of childhood poverty on later educational attainment, ex- Next we match trearr.:.'
cept this time think about the effect of living in a poor neighborhood i ng one of a num ber , , i :
rather than the effect of living in a poor family. rWeknow that children Morgan and Winship. -
who grow up in poor neighborhoods are lesslikely than other children to for discussionof softii e:.
complete high school. \fhat we want to know is how much, if any, of desi redbalanceon . or ^: - - :
this association is causal, that is, how much the graduation prospects can use a quantile-qu,t:.:.:
would improve for children in poor neighborhoods had they grown up in confounders.Good b.r1::-.
nonpoor neighborhoods.
As noted earlier in this chapter, the problem is that we cannot observe
5 Where rhe pool ot .:. -
the counterfactual, since a single individual cannot simultaneously both f
R uhi n and Thomc s (l qq^. - - '
experienceand not experiencegrowing up in a poor neighborhood. Our
he rel evan t.A gai n. rari .rl .i . :: j
alternative is to have control subiects serve as the counterfactual. As di ri oni ng o n the c on:equ.; ...
The Fifth Rule . 1'19
i:: - : : - : :Jg re s s ionc ont r ol wi th David Harding (2003, p. 684) explainsin his propensity-scoreanalysisof
:' - ' .) .'ores permits us tt-r neighborhood effects:
. : - .-..)r\\e know the proper
The solution used here is to march each treated subject with one or
=:-...- , .: -rlternative to standard
'.-:* . r i'. rt out the proper func- more control subjects such that the treated subjects are, on average,
- | : . . - r:s p r edic t or s( Har d i n g identical to the control subjects on observablecharacterisricspnor ro
treatment. . . . Treatment and control groups are well matched when
-: : - .:. .--risingfrom confound- subject characteristicsthat affect the outcome are "balanced" in the
: i i ir :-:ncfl O nal f O r m .
treatmentand control groups.
-: i :--:' nlriSUr€d confounders
- -i.' -, -..nilthen to match the To use propensity scores,we first must find measuresof the pertinent
i ' t - : :ro pens it y s c or es T . he re differences between children who grow up in poor neighborhoods and
.. : : : . . -. ,; a s im ilar m et hod) to children in other neighborhoods.'Wewant to measure family income, for
- . . -r;,rn follnder st o c r eatea example, since children who grow up in poor neighborhoods are more
likely to live in poor families, and children from poor families are less
' - : .,:-!:-n er ghborm at c hin g ," likely to finish high school. ('Wedo not wanr to conrrol for variablesthat
-:- i ::.- control unit with the mediate the effect of neighborhood poverty on educational attainmenr,
- : : - - :'- i rl s balanc edt he t r e a t- however, since that would bias our estimate of the neighborhood poverry
-- .--. ,:r: about causal effects e ffect.)Thenw e use l ogi sti cregressi on
or someother appropri a r eesr im a-
-
-r :, . : : ::a tre at ed and c ontro l tor to regressthe "treatment" (growing up in a poor neighborhood, here
denoted X) on family income and other variablesassociatedwith living in
-. i . i- -r: ,ru r dat a, t hen we try a poor neighborhood.6
, , - - : : : ::>i Lfrr. T hat is t he c ard i - The regression of X on the relevant predictors of living in a poor
-. - 'i. :l Fropensity scoreswe neighborhood yields X, which we use to match units. We also use the two
'.:::;r.ti o n t er m s , s qua re d distributions of X lthe distribution of X for the treatment group and the
. :.-.-:lbalancesthe data. As
-:: distribution ofX for the control group) to check for adequateoverlap. In
- : - . : -...;.]ti o n wor k s when i t most instancesit makes senseto restrict the analysisto the range of values
::: : : :-.:.Th is is in c ont r as t to for -* where the treatment and control groups overlap. \7here there is no
: : : : -.'.,.her ewe ar e behold e n overlap, estimatesof the treatment effect depend on unverifiable assump-
,: : : - . , :.,,:trl for m f or balanc i n g tions about functional form of the effect-the very situation we are rying
to avoid by using propensity scoresinstead of standard regressionsontrol
:: 1 r '. ,.::1a n ex am ple,c ons i d e r with covariates.
.::: :: * j.ltional attainment,ex- Next we match treatment and control subjectson propensity scoresus-
: :-.: :1 a poor neighborhood ing one of a number of possible matching algorithms (see chapter 4 of
: :.-:--... \\e know that children Morgan and \Winship,2007, for examples;seeBecker and Ichino 2002,
: . : -: . . .:.r th an ot her c hildr e nto for discussionof software). To verify that our matching has produced the
r r ',.. ri how much, if any, of desired balance on confounders across treatment and control groups, we
----:. :re graduation prospects can use a quantile-quantile plot (Ho et al., in press) for each of the
.' - : - J' had t hey gr own up i n confounders. Good balance is more critical for variablesthat have a larse
effect on rhe outcome variable than it is for variablesthat have a small ef- andW inship,2007t.\f -, .
fect. If necessary,as noted above,X can be reestimated,using a different data do es not r ule oui ; . . s - - '
now see.
functional form for the confounders, until balance is achieved'
The final step is to estimatethe averagetreatment effect for the ueated
(or ATT) for each of the matched groups of individuals as the difference Matching as a Prcprocd-i-rrr:i)
between the treated and control subjects within each of the groups. To Mod"el Dependence
calculate the overall effect, it is usually a simple matter to take a weighted
averageof the treatment effects for each matched group' -Vodel dependenceeoir. ' ::
The propensity-scoremethod has severaladvantagesover standard re- causal ef f ect sin soci, r r : - : -
gression. First, as I have emphasized,successfulbalancing of the data on since estimatesof cau.., : :-
the basis of propensity scoreseliminates the need for strong assumptions searchersmake abour i-,: i -
about the functional form of the relationship between the outcome vart- tional form to use,rhe i.: : :
ables and the regressors.Second,inferencesare less likely to be made on so on, As a r esult ,t hr . i. r - ': - ,
the basis of extrapolation from treatment and control casesthat do not articlesgenerallyrepr.i: . .
overlap (note that extrapolation is especially dangerouswhen we are un- matcspossiblet r r r de r ....:-..-:
certain about the functional form of relationships). One of the virtues of esti mat esar e m odel- dci; . - . - ; tt .
the propensity-scoreapproach is that it brings the nonoverlapping cases l ectto r he suspicionof '. ; - . : .
problem out of the closet (in contrast to standard regression analysis, suspicionthat researchl:i , :
genialwith their favore; .. . ; .
where the problem generally is well hidden). Checking for adequateover-
genialwith their data.
lap of the distribution of the treatment group and the distribution of the
B eca usef avor ed- hr r r - : : -
control group is standard procedure in propensity-scoreanalysis. Under
search.r educingm odc. - : : : - -
regression,researchersmight or might not perform the necessarychecks.
researc hdesigns.To r . i . - . -
Third, comparing similar groups of treated and control casesis a more
intuitive method for control than the residualization method used by re- that researchersuse prL,:::- :
rrep, which Ho and
gression. This can be an important advantagewhen presenting results to -, ::.-.
a lay audience. Finallp propensity scores can result in smaller standard propensit yscor esr o r s: '. i_-: - :
errors despitethe removal of nonoverlapping cases(seeSmith 1997 for an possi ble;r hen, r o ohr . r : - -
example), and in some instancespropensity scorescan be useful for test- parametrictechniqueth-.. -,:
ing sensitivity to bias from unmeasured confounders (seeHarding 2003 gressi o n,logit , st r uct ur . i.. - _ _-
for an example). Preprocessing with F:, :: : -
It is important to stress that neither method-propensity scoring nor nri ni sh ingt he im pact ( ': : - '- .
regression-solves the V/s problem.'Whetherwe attempt to match by resid- er al., 2007). The balan;, = :
ualization on confounders, as in regression,or by balancing the distribu- sel f) req uir esf ewer r ssl- : -: :
tions of theX in the treatment and control groups, as in the propensity-score cessingthe treatment qr,,--: ..
approach, we can match only on observables.Becauseskepticscan always eonfoundingvar iahiecf . : . -
point to potentially confounding variablesthat you did not include in your parametric adjustments.::.r. _
analysis, both methods require strong theory. implicarionis that r\\ o ir. . : -- _ _
The most obvious limitation of propensity-score analysis relative to to addre sst he sam er esr - lr - . -, , :
regression is that the propensity-scoremethod yields no estimatesof the resul tsi f t hey successiu. .:-,. =-
confounders' effects on the outcome variable. (Fixed-effectsmethods are Jo not p r epr ocess t he d: : : . : - :-
subiect to the same criticism.) This limitation is more apparent than real
-
for causal analysis,however, since there is emerging consensusthat causal H o an d c o l l e a g u e s n o l c : - : - :
scores.to balance
and Winship,2007). Moreover, the use of propensity
use
data does ,-rtt ,,rle out the subsequent <lf anall.sis,as u'e
regrcssi<ln
now see.
7Ho and colleaguesnote that their approach is similar in spirit to ideas in Imat and van
T h o mas (2000)' among others'
Dy k (200 4 ) , R o s " J a . t m a n d Ru b in ( 1 9 8 4 ) , a n d Ru b in a n d
I52 . Chapter5
has been achieved,we can be more confident that small changes in the be able to incorporate::-.-:--
authors' model specificationwould not greatly alter the paper's empirical tinely in our researchci. :-
conclusions.
INstRuuEr'ital- VAI{I.\8 . -
CoupenrNc Lrrt wrtn Lrr<r rnRoucH NATURALLYOccURRING R.q,Npor,r As sr cx r t r r-:
Reunotvr AsstcNl-ENt
The i nst r um entvcr i, r l', .
The purpose of random assignmentin social researchis to create matched a sort of par t ial r r r t Jor ' . -
groups, so that we can compare apples with apples,like with like. Social causei n t he lV nr et hr , ; :
scientists in every field understand and appreciate the power of random self, that is randomizec.
assignment in controlled experiments. Individuals in an experiment are To fix ideas.supfr,-i '.
assignedrandomly either to a treatment group that receivesX or to a con- assignedto X. Supposc:-- :
trol group that does not receive X. The notion is that we want to vary X that is, X and Y each [-.-r -..;
to seewhat happens to some outcome Y. Of course, lots of things can go Due to random assignnr.:-:
wrong in an experiment with human subiects becausewith humans it is X nevertheless is corr..,:.=*
hard to hold constant all factors except those under investigation' Even is depictedin figure -i.1. .' ,
under laboratory conditions, factors that we don't want to vary might denotcsall the othcr c-:...:-
vary anyway during the course of the experiment (seeCampbell and Stan- causei t showsall t h. . "- -
ley's 119661classic statement on threats to internal and external validity Our o bject iveis t , r r - : "
in experiments). In an ideal experiment, though, X varies, other causesof of usr two var iabler\ . ; . -
j
Y do not vary, and-due to random assignment-we expect no initial dif- Y when (as here) the r',:: -.
ferences of consequencebetween the two groups' so we can infer that al ent to est im at ing' \ . , '
subsequentgroup differences on Y are due to the introduction of group pri nci plesof pat h . r n. r.. - -
differenceson X. and Y i s a f unct ion of r
\fhy do we expect random assignmentto produce initially equivalent j ust one link. X- - + ) '. r : -
groups? The answer lies in probability theory. In controlled experiments, correl a t iondoes r cf lc. . ,
we attempt to harness the power of probabillly to produce indistin- causality.
guishable groups. The point I want to make in this section is that random In ob ser vat ion. r. l.
l r:...
assignment is also found in nature, so in some instances observational to X, so in obsert',tti,,:.: -
studies can also harnessthe power of probability to compare like with than X-+ Y. When lhr:r :
like. correl a t iondoes nor i: ': :
Until recently, discussion of the random assignment of subjects has Fi gure5. 3 depicr ' t h. i: -
been largely relegatedto experimental research,since observational stud- 5.2), but conf oundinc' r -
ies in many areasof social sciencerarely consider how to incorporate ran- X , si nceC causesX. I r : - -
dom assignment in the research design. This state of affairs might have ( l ) the c ausalef f ccrt r ' - '
come about becauseof a misimpression that random assignment is out- both X a nd Y. As . r r c: . . . .
side nature's domain and must be manufactured by researchers:Either we
can randomly assign individuals, in which casewe do an experiment; or
we cannot randomly assignindividuals, in which casewe don't give the con-
cept of random assignmentfurther thought. As a result, nonexperimental
social researchhas been dominated by lessreliable methods for removing Fi o rrre i I Fi -- -
: : : - : :r-i t sma ll c hangesin th e be able to incorporate naturally occurring random assignmentmore rou-
.:::: , ,-::r rhe paper'sempirical tinely in our researchdesigns.
The instrument variable (IV) method for estimating causal effects involves
- r r: :: '-i.h is to creatematched a sort of partial random assignment.The random assignmentis partial be-
- - : : r'. h ke wit h lik e' S oc i a l causein the IV method it is incentiuesto take tredtment, not treatment it-
rr r i: - .::. the P ower of r ando m self, that is randomized.
-- . --,.- rn a n ex per im enta re To fix ideas, suppose X is a cause of Y, and individuals are randomly
t: ,-r r- -:i receivesX or to a con- assignedto X. Supposefurther that X and Y are standardized variables,
- 1 :. .: :i,ti \1-ewant to vary X that is, X and Y each has a mean of zero and a standard deviation of one.
: : ,.:).. iors of things can go Due to random assignment,X is uncorrelated with the other causesof Y.
- :, . . -.r.r\r r r it h hum ansit i s X neverthelessis correlated with Y, since it is a cause of Y. This situation
:- -: -.:-.jerinvestigation'Even is depictedin figure 5.2, where X-+Y denotesthat X causesY and er+ Y
i: .. j :' : \\'ant tO vafy might denotes all the other causesof Y. Figure 5.2 is called a path diagram be-
' , :: - : . : .re Cam P belland S t a n - causeit shows all the links or paths from X to Y.
r - i : i:- ..iI and external validity Our objective is to estimate B, the causal effect of X on Y. In the case
:: -: : . \ r.rries ,ot her c aus e so f of just two variablesX and Y, the slope B equals the correlation of X and
'- -'- --'... e expectno initial dif- Y when (as here)the variablesare standardized.So estirnatingB is equiv-
-: -.:>. so we can infer that alent to estimating r(X,Y), the correlation of X and Y. According to the
:-: . ::-- i n tro duc t ionof gr o u P principles of path analysis (Alwin and Hauser 1975), the correlation of X
and Y is a function of all the links between X and Y. In figure 5.2 there is
-: : : : :.l ce i nit ially equiv al e n t iust one link, X+ Y. So for the model depicted in figure 5.2 the bivariate
- . : -:' ,ntro lledex per im e n ts , correlation does re{lect the causal effect of X on Y: correlation implies
-,
:' -: :.:,', ro pr oduc e indisti n - causality.
tr: i r i .r>i e ction is t hat r and o m In observational data, however, individuals are not randomly assigned
. - : : :nsta nc es obs er v at io n a l to X so in obseruational data we expect links between X and Y other
'When
:: - ; r. . ,:i to c om par e lik e wi th than X-+ Y. there are links between X ar-rdY other than X+Y,
correlation does not imply causality (recall the secondlaw of causality).
,-, - ,.-:.iitnlent of subiectshas Figure 5.3 depicts that situation. In figure 5.3, X causesY (as in figure
:' :: : - -. .:n ceo bs er v at ionals t u d - 5.2), but confounding variable C also causesY, and C is correlatedwith
- : - :,: ilLl\\'to lncorporate ran- X, since C causesX. In figure 5.3, then, the correlation of X and Y reflects
, - . -:,r:<oi affairs might have (1) the causal effect from X to Y and (2) the effect of the confounder C on
' ::.:: :..:-.iotl assignmentrs out- both X and Y. As a result, in figure 5.3 (unlike figure 5.2) the correlation
:::-::: :'' researchers: Either we
-.- - . . , . .i I Jtr an ex per im ent:o r
',i : -. ,,rseu'e don't give the con-
- x ----------------* Y
:-: 1.: ,,-:esult, nonexperimental
::: :: .-::remethodsfor removing Figure5.2. ExampleWhereCorrelationr(X,Y) Is Causal
: :.:'.: : , considerhow we might Effectof X on Y
l -54 . Chapter5
y€- tv
rhroughout
rhis discus.:,
rhat our object iveis t o es: . -
rhe correlarions of Z :. :-
,.t,1,1,) =n.
In shor t , if we can t lr : : . : -
hrrr has no dir ccr cl lc. : l
nd r(X , Z) pr ovides ili_ : - - -
,\- Lr
can si mply divide r he . , : : . . '
tnd Z: F- r ( Y, Z) / r lX. Z -- :
Figure5.3. Example'Where
Correiation
r(X,Y)Is Nor Causal
Effectof X on Y ' tandard ized)t he I Y eiir lr r : :
u'ith the covarianceof -\ :.: : _-
:ot,(Y,Z)/cou(X,ZJ=[i,,: .,. _--.
of X and Y gives a biased estimareof B, the causal effect of X on y.
B,r.=F when cou(Zl s. = :-
(Note that if B=0 then figure 5.3 reducesto figure 5.1. So figure 5.1-
F{enceif we can 6nd ., . .-- ,.
"correlation of X and Y is [entirely] due to common qasss"-is a special
:hn cou( Y, Z) / cor 'qX. Z: : _:
caseof figure 5.3.) ,,n Y .qThat is r he lc. , qr,. : : . . . '
Random assignmenton X alleviatesthe problem by attempting ro re-
move the empirical link betweenX and C. If the link is removed so rhat
.\.{M PLE: E FFE C T O F 5 . \ \ 1 E -. : ' .
X and C indeed are independent,then r(X,Y)=B. The difficulty in ob-
servationalstudiesis that X is not randomly assigned.The instrumental Intui ti ve ly,inst r um en[ . r ' : - . -
variables strategy is to try to find a randomly assigned variable Z that rr.rsby using onlr p. r r r , : : . .
causesX but not Y (except indirectly, through X-see figure 5.4). Be- ,orrel atedwit h r he coni, : - -_
causeZ is randomly assigned,the correlation of Z with the confounder .imilar to the fixed effe.:. .-: -
C is zero (in expectedvalue), so the correlation r(X,Z) is the causal ef- Consider again the .r,.'.
fect Z-+X (denoted a in figure 5.4). BecauseZ causesX, we have the .ehool . In dozensof , r , : ; . .
causal chain Z-+X-+Y. That is key to the IV strategy, since in the :r.rtel l ect ual advant ages,: - .
causal chain Z -+ X -+ Y the correlation of Z and Y is aB, the product of 5ated whether childrcn ..':
the causal effect of Z on X and the causal effect of X on Y (we assume :.reed i n cognit ive der . . : . - -
:hat l arge f am ily size h- . . .: . : : -
Jer.el opmentis well e\ f r : : : : _
Y(__r 1975 Psychology Tod,i.. :.:.
p.769) advisespar enr i: "- r : -
:mportant to you, the n: *:
:h:rn tw o , , , becauser hr . . , :
,f i ntel l e ct ualf unct ionin: , '
R esear cherconsir s t enr . . - .
-;ore l ow er on t esr sof c : -
j :uer
x (€x yea r s of schoolin, :- - ,
8In statistical parlance, IV estimates are consistent but not unbiased (see Angrist and
Krueger 2001). An estimator is said to be consistentwhen estimatesconverge to the popula-
tion parameter as the sample slze lncreases.
eBecausethe IV method truncates the variance of the causal variable of interest. the
":: : Es timat ingB method generally does not work well with small samples.
156 . Chapter5
inverse association between family size and academic performance per- of familyvulues.
rh.,r' -
sistsevenwhen socioeconomicstatusis controlled (Blake 1989; Downey n o m i c sta tu s, i n te l l r g c- ,;,
1995. t able 3 ). of Y ).
The question,of course,is whether the associationsare causal. Skep- The causal leveragc::
tics can point to any number of W-variablesthat are difficult to measure, conditions: First, ser r::-. -
or to measure adequately, in standard regression approaches (Guo and must aff ecracader nr . . . . . .
Van'Wey 1999). Some potentially confounding factors, such as parents' first condition can be i..i-r ;
values and the intellectual climate of the home, are difficult to measure on the basisof theorr. \\ :
completely. If parents' values and intellectual climate of the home jointly to childbearing.Coniri. :
determine children's academic performance and decisions about family ported in Angrisr and f ', :-,
size, the observedassociationsare likely to be spurious. and one girl,37.2 perc.:.: -
Becauseof the difficulty of measuring confounders, researchersin or tw o g ir ls,43. 2 per , r : . : -
this field have sought alternativeways to relieve omitted-variablesbias. restrict the analysisro :::::
One possibility is to use sibling models (Guo and Vanlil/ey 1999), as censusfiguresfor 199i,,.-.
describedearlier. Another possibility is to use instrumental variables,as boy and one girl, 34.-1i.:,.
describedhere. boysor t wo gir ls, 40. - ; . - .
The trick for the instrumental variable approach is to find a ran- marri edwom en t he dir : =: - -
domly assigned Z such that Z -+X-+ Y. Here Y is school achievement The secondcondiric,-....
and X is family size. \7e want to know whether the observedeffect of controver sialbecausc: : *'-
X on Y is causal. To be more concrete, let's suppose Y is measured by si der,for exam ple.J. . - - ,
'What
failing a grade in school. we want to know is whether children in mix of children as an i:-.:: -:
large families would be less likely to fail a grade if they were in small ing (X) on women's l.rl : :
families. arguethat t her eis no. : : :
The problem is that family size (X) is not randomly assigned,so larger to affect labor force fr:: . :
and smaller families are likely to differ on any number of characteristics bearing, Mark Rosenz...,.-
other than family size. As a result we cannot infer a causal effect of X on coul d b e r he caset hr r r : : : - :
Y from an association of X and Y. To make causal inferenceswe seek a costsfo r sam e- sexanJ : . - - .
random Z that affects family size yet has no direct effect on a child's dren might be cheap. r r . - -
probability of failing a grade. share a bedroom), and :: ,
What we would like to find, then, is some sort of nature-provided ran- the mother works.
dorn assignment mechanism that affects family size but not children's A s th e Rosenzweigr : . ,
academic success(except indirectly', through its effect on family size). variables strategy is nor -'.:.
Conley and Glauber (2005) suggestthat the sex mix of children provides is the difficulty of findi:: :
such a random assignmentmechanism Z. Among women with two chil- causalvariableof interc::. '
dren, about one-half will have same-sexchildren and one-half will have omitted variablesbias a:.
mixed-sex children. Importantly, the "assignment" of same-sex versus It i s nor alwayseasvr o : : * '
mixed-sex children is random. So we expect women with two boys or two anisms that provide rhe s ::
girls to have the same preferencesin general as women with one boy and casefor an instrument.rl ',--,
one girl, with one important exception: If there is an underlying preference fects Y only t hr ough ir . - - '
for having at leastone child of eachsex,then women with same-sexchildren one should abandon rhc -.-
will be more likely to want a third child. In other words, among women need to do a berrerjoh : -
with rwo children, the sex mix of children-a random occurrence-affects ever it may be found in n.::-
decisions about childbearing (so Z-+X) but otherwise is independent forces researchersto rhr:.-. :
The Fifth Rule . 157
.-- ..-..;sr]]icPerformancePer- of family values, the intellectual climate of the home, parents' socioec<.r-
- :- :: Bl ak e 1989; Down e y nomic status, intelligence, and so on (so Z is unrelated to other causes
of Y).
-- -..,.uonsar e c aus al.S k e p - The causal leverageprovided by sex mix of children dependson two
:-..: -lrc difficult to measure, conditions: First, sex mix must be related to family sizel second,sex mix
.: ,:r eppro2ches(Guo and must affect academic successonly through its effects on family size. The
:: :-rctors,such as parents' first condition can be checkedempirically whereasthe secondis determined
^ :. -rrc difficult to measure on the basisof theory. \fith regard to the first condition, sex mix ls related
- .:rJIe of the home iointly to childbearing. Consider these figures from the U.S. censusfor 1980 (re-
.: :.; decisionsabout family ported in Angrist and Evans 1998, table 3): Among women with one boy
: i-- r f i O L l S . and one girl,37.2 percent had a third child; among women with two boys
- - .rrrJ er s ,r es ear c hersi n or two girls,43.2 percent had a third child, a differenceof 6 percent.If we
:', : Lrmitted-variables bias. restrict the analysisto married women, the difference is 6.8 percent. U.S.
.i- ::.rd Vannfley 7999), as censusfigures for 1,990 show the same patrern: Among women with one
i: 1:irulrefltal variables,as boy and one girl, 34.4 percent had another child; among women with two
boys or two girls, 40.7 percent,or 6.3 percentmore, had another child (for
r r : ', rrrr)Jch is t o f ind a r a n - married women the differenceis 7 percent).
: :: : : : t i s sc hool ac hiev eme n t The second condition, that Z affects Y only through X, is often more
, -: r-:r th e obs er v edef f ect o f controversial becauseit must be establishedon the basis of theory. Con-
. :i : : ,.:Fo se Y is m eas ur edb y sider, for example, Joshua Angrist and William Evans's (1998) use of sex
' : . .. ','.'is rvhether children in mix of children as an instrumenr to estimatethe causal effect of childbear-
-: ', ::.:ie if they were in small ing (X) on women's labor force participation (Y). While Angrist and Evans
argue that there is no compelling reason to expect the sex mix of children
: :.,.:-.i,rnlr'assigned,so larger to affect labor force participation decisionsaside from its effect on child-
bearing, Mark Rosenzweig and Kenneth Wolpin (2000) are skeptical. It
- - '. ::.: e causaleffect of X on could be the casethat there are important differencesin the childrearing
-.-r : -.'...>.rlinferenceswe seek a costs for same-sexand different-sex children (for example, same-sexchil-
-r : - :l re ct e f f ec t on a c hi l d ' s dren might be cheaper becausethey can wear hand-me-down clothes or
share a bedroom), and those costs might affect decisions about whether
,,,- r : :: of nature-providedran- the mother works.
:: r : . - ', .i ze b ut not c hildre n ' s As the Rosenzweig and \Wolpin critique underscores,thc instrumenral
,--- :: r'ifect on family size). variables strategy is not a panaceafor social research.The chief problem
:-.: -: '. :rir of children Provides is the difficulty of finding randomly assignedZs that are related to the
- ----. :-* \\'omen with two chil- causal variable of interest. Studies using instrumental variables to relieve
, :- r::':r lfld one-half will have omitted variablesbias are only as convincing as the instruments they use.
t:: l--rll:Ilt" of same-sexversus It is not always easyto find naturally occurring random assignmentmech-
'.-: ,,. :::<nwith two boys or two anisms that provide the sort of leverageone needs to make a compelling
'::: :. '.','omen with one boy and casefor an instrumental variable Z that measurably affects X and that af-
r i--:r: : :n underlying preference fects Y only through its effect on X. That difficulty does nor mean that
-:- '.,. :r':r ri'ith same-sexchildren one should abandon the attempt, however. In observational research we
: .. :t--r \\'ords,among women need to do a better job of harnessingthe effects of randomization wher-
::.- -,-:.:r.iJom occurrence-affects ever it may be found in nature, and the IV strategy,fallible as it is, at least
'. r--: ,,:henviseis independent forces researchersto think more concretely about how one mieht be able
158 . Chapter5
Cholera, which had first appeared in Europe in the early 1800s, came in
epidemic waves. Victims became sick suddenly and often died. From his
clinical study of the course of the disease,Snow conjectured that cholera
was caused by a living organism that humans contracted through food or
drink and then passedthrough the body to infect new victims through the
water supply.
Snow did a great deal of detectivework to find evidencefor his theory.
If miasmas in the air causethe disease,then propinquity alone should ac-
count for the way the diseasespreads.But if the diseaseis waterborne, he
reasoned that the use of a common water supply is the key. It turns out
that London at the time presented a naturally occurring random assign-
ment mechanism for testing his theory. Some areasof London were served
by more than one water company, so in some districts next-door neigh-
bors receivedtheir water from different sources.Of two major water com-
panies in London, the Lambeth company drew its water upstream in the
Thames River, above the main sewagedischarge points. The Southwark
The Fifth Rulc ' 159
and Vauxhall company, by contrast, drew its water below the sewagc
discharge points. In his own words, here is how Snow described the ex
periment:
Each company supplies both rich and poor, both large houscs and
! :\ . R \\DoM AsstcNrytnur small; there is no difference either in the condition or occupation of the
persons receiving the water of the different Companies. . . . As there is
no difference whatever in the housesor the people receiving the supply
--,r:::- ,/ tlte treatmenl is ran- of the two'$fater Companies, or in any of the physical conditions with
:'. rl lll(lf€ leveragefrom ran- which they are surrounded, it is obvious that no experiment could have
,, : : : : . li td ol rl l Yas s igned,as in been devised which would more thoroughly test the effect of water
I
-:. :: ) : h at p os s ibilit Y ,whic h supply on the progress of cholera than this, which circumstances
. ,:. l:e
: : treatment group. BY placed ready made before the observer.
:-.:.'.--- rhat nature necessarily The experiment, too, was on the grandestscale.No fewer than three
Ll':l-i: \\'ere not done deliber- hundred thousand people of both sexes,of every age and occupation,
: : . , , ,'rrl l ldo
d in a c ont r oile d
and of every rank and station, from gentlefolks down to the very
poor, were divided into two groups without their choice, and in most
t'::.:t litl bv illustration. David cases,without their knowledge; one group being supplied with water
::-. r::rltion, in the 1850s,that containing the sewage of London, and amongst it, whatever might
i:. =::i..tt.r| 1997, P. 29 4) Places have come from the cholera patients, the other group having water
: I rr| S : quite free from such impurity. (Snow 1855, pp. 74-75, quoted in
Freedman 1.991.,p. 297)
:.:\ .r. :rr.neand forget that germs
' : - .i rlicir resolution is Poor. Snow's example is particularly compelling becausehe was able to find full-
:.. l^. i.olation of such micro- blown "natural" random assignmenton the cause of interest-a rarity in
I . .:tfection theorY has some the social sciences.Social researchin this century will become more pow-
...::.:.>:resultsfrom "miasmas": erful to the extent that we can more routinely follow Snow's nineteenth-
: l:. century example in our investigationsof human behavior.
r : ': fh c earl Y 1800s ,c am e i n
::-:'..',.-rnd often died' From his
CoupenrsoN oF STRATEGTES
FoR Coru.peRrucLrrn wrrH LrrB
. i: ,i ;on;ectured that cholera
-:-: , ,nlr;1ctedthrough food or
Confounding variables are the bane of causal analysiswith observational
.. : : r: n ew vi c t im st hr ough t h e
data. The methods described in this chapter attempt to deal with the
problem by comparing like with like. To evaluate the methods, we con-
r. i i:rd evidencefor his theorY.
trast the strategiesthey use for comparison.
:: :: ,rinquity alone should ac-
In this section the key distinction is between X, the cause of interest,
: .: :.-.:Jiseaseis waterborne,he
and the Cs, the other causesconfounded with X. Earlier I distinguished
:r :-^rrlr is the keY.It turns out
measuredcauses(Xs) from unmeasuredcauses(V/s). Confounding vari-
- - ': . i ;trrri n g r andom as s ig n -
of London were served ables can be either measured or unmeasured,which I will denote C, and
-:-:.,1s
-,. C, respectively,where boldface is used to denote a vector of variables.So
. :-: .listricts next-door neigh-
,-:-::. [)f ttvo maior water com- there are three categories of variables to keep in mind: X, the specific
. J::'," lts \\'aterupstream in the causeof interest or "treatment variable"; C* measuredcausesof Y other
tban X; and C* unmeasuredcausesof Y.
:-:,:.::e Points. The Southwark
160 . Chapter5
T.q,srp
5.1.
y{-€y Comparisonof Strategies
for -\.
C:
in Observational
Studics
Strategyand Model
Propensity scores
Now consider figure 5.5, a stylized example of the confounding vari- B= tt, lt, where p, is thc :.r:.r
'Wewant to estimate the (average) Y for the treatment group -\ =
ablesproblem in observational data. B'
p . . .i s t h e m e a n o f Yfo r th e ;, ::-
causal effect of X on Y. The regressionof Y on X does not yield a good
g r o u p ( X =0 )
estimate of B since that method fails to separatethe causal effect of X on
Y from the causal effects of C on Y, where C is bolded to emphasizethat Instrumental variables
it denotes a vector of confounding variables (virtually always there ts Y , = a + P *, + e ,, w h e r eX i r .r '.:
more than one confounding variable). Figure 5.5 is a generalizationof fig- of Z.
ure 5.3 to the typical case of multiple confounding variables. In the gen-
eral case X and C are correlated, as indicated by the curved arrow
between X and C in figure 5.5. The curved arrow means that there are F i r e d - e f f e c t sm o d e l s( i l l u 'tr .r t- - - ,
causal effects from C to X, or common causesof X and C, or both. ;ifferencing method)
The key to estimating B lies in removing the confounding effects of the \Y,= a+ p AX,+ q, rvhere:
variables in C. The standard regressionstrategy is to regress Y on X and Y,r.-Yn for the itl.' L:::.:
log{r-log{, for rhc ::
C* the measured variables in C. Alternatively, we obtain the same re-
and X on Cn4,to create i in growth-rate moJ..
gi!tti"" coefficientsby ( 1 ) regressingY on C1u1,
Y,o-Y,o for the lrl: srb
and -*,-values of Y and X for individuals, predicted on the basis of the
models
measuredCs-and (2) regressingY,-t,onX,-*,. Thus the standardre-
AX. is defined in the same ri'.:.
gressionstrategycan be thought of as removing the effect of the measured
Key: B i s the caus aleffec t of \ ,.' : :
confounding variablesthrough residualization. Becausethe residuals {-
X i s the c aus alv ari abl eoi r.:r::.'
Y, and X,_X; are uncorrelatedwith those confounders, any associationbe-
Y is an outcome variabie,
t-ee" Y,-f, andX,-*,could not be due to the confounders that we have Z is an instrumental vrn.l. -
measured. So if we can accurately measure all the confounders' we can C is vector of confounder \ -:r : :.
use standard regressionto obtain an unbiased estimateof the causal effect C y i s vector of meas ured., :: . -,
of X on Y. That is the basic logic of the standard regression approach
( t able5. 1) .
In the case of propensity scores, as in standard regression, we must B eca usem easur ing. r . - -
measure confounders to control for their effects. Instead of controlling alwaysfeasible.one of ::: -'
through residualization,however,we control through matching on propen- to developalternatives::.::;a
sity scores,where propensity scoresrefer to the predicted probabilities of observational studies. O:.: ,
selectioninto the treatment group. Propensity scoresprovide an attractive random lyassignedr ': r : : ,
alternative to residualization because propensity scores typically elimi- i ndi rect ly,t hr ough ir s r : : - :
nate the need for strong assumptions about the functional form of effects w hereX is a linearir r ni: . -
and very often result in smaller standard errors despite the loss of cases. pendent of the orher ;:--::
The Fifth Rule . 161
There5.1
Comparisonof Strategies
for Alleviatingthe Effectsof Confounders
in Observational
Studies
. .:,. - : -rrd regression,we must Becausemeasuringall confounders,or eventhe most crucial ones,is not
: r :rr: -i !. Insteadof c ont r ol l i n g always feasible,one of the most pressingissuesin applied statisticshas been
.: : -: :gh matchingon propen- to develop alternative strategiesfor alleviating the effects of confounders in
--. 1i .' r ednr nhabilit ie so f observational studies. one strategy is to use an instrumental variable-a
:r: I :- ':esprovide an attractive randomly assignedvariable Z that affects X but not C and affects y only
' r r : ': r'' scor est y pic allY eli m i - indirectlS through its effect on X. In the IV strategy we regress Y on *,
, '-: : : :,:nctionalform of effects where X is a linear function of Z (tat:le 5. 1 ). The effect of X on y is inde-
:: : . :.'p i te t he los sof c as e s . pendent of the other causes of Y becauseX is a linear function of Z,
162 . Chapter5
where Z is independent of the other causesof Y (that is, the causesof Y correlationis all lr,eha,r .,
other than X). Moreover, because Z has no causal effect on Y except rhischaprer hcshecn:: :
t hr ough it s effe c to n X, th e s u h s ti tu ti o no f * for X doesnor bi asour esri - correlationdoesimpir ..
mate by mixing together the causal effects of X and Z. Therc is no inde- rhosecondirionshr -
Dendentcausal effect of Z to mix with the effect of X. p r o d u c ed r a r r .l o n r ,l 5 \l - ..- .r
In principle the instrumental variable approach is a very attractive alter- urul l y o ccr r r r ingr , r r . l' - .
native to standard regression control, since it suggestsa way to avoid matching.
omimed-variablesbias without the needto measurethe confounding causal C on sir , lsruglir r r h.
variables. (In that sense,the promise of the IV approach is similar to the $7hen we say r hlr "r : '
promise of controlled experiments and of fixed-effects models.) In prac- al l smo ker swill be \ i! . r ,
tice, though, it is often difficult to find defensibleinstrumental variables. avoi d i r . Rar her .r he - :
Another strategy for alleviating the effects of unobserved confounding smoker s ar e m or e ir k; :
variables is to use change models, growth-rate models, or sibling models. smoked and, sim ilar i'. _: : -
I group these three methods becausethey can be thought of as belonging cancerrhan if t hev h. r r - *
to a classof models called fixed-effectsmodels. Fixed-effectsmodels pro- ing on lung cancer.rhr:-....,.
vide traction for eliminating the effectsof confounding variablesby provid- rate w o uld have becn : : . .
ing two or more individual observationswhere X varies but C is constant. smokersif r hey had : r : . . - _
'When
we know that two or more observationshave the samevalues on C served,so they musr l.e :.: :
we in effect can match individuals on the confounding variables (for ex- to use the observedcr:.,:: :
ample, two siblings are matched on family environment since they have for smokershad ther : : .-
the samefamily environment; or the same individual at two points in time cer rates for srnoksr: . . -
is matched with herself on permanent individual traits). By using subtrac- smoked. I n ef f ecr \ \ '; . . ) : .
tion or some other method (Allison 2005) we can eliminate the confound- smokersas proxies it,: .:--
ing effect of unmeasured variables that are the same for the matched viously dependson r:: ::
observations(table 5. 1 ). smokersand nonsnrc.,k;:..,:
Becausefixed-effects models can alleviate the effects of unobserved smoki ng)r har leadr t , . . : - _-
confounding variables, they are attractive in a number of social science In the labor at or \ .\ \ - : -
applications. Indeed, as Charles Halaby (2004) argues,fixed-effectsmod- distinguishablear rhe tr,::..:
els are probably underutilized in social research.It is important ro note, u n its are ran dr_tntlt,r_. _.;. ,;. .
however, that the fixed-effects strategy assumes that X differs for the ment an d som e don'r . . , 1-
matched units (the matched units are siblings in the case of sibling mod- other is solely on the b-.-.:.
els, individuals measured at two points in time in the case of panel mod- researchertsypicallr Li. .
els, and so on), but that the variables in C are the same for the matched the caus eis r ar elr 'f ut r : - :
units. The fixed-effect method generally is most effective, then, in appli- ternati v em et hodsr o ln : : . _
cations where the causal variables of interest change over time (or differ common m et hod is r c! - . r .
across siblings, etc.) while the unmeasuredconfounders are stable. tempt to removethe cor:.::
set of ca usalvar iablesr , , : - .
both X (the causallari.r:- .
CoNc r us r oN the other causalvariab,lr... ,
si mi l ar b y m odelingr he
The idea in causal analysis is to find ways to circumvent the second law distinguishthem. The i,:--: -::.-
:.,
of causality, that correlation does not prove causation. 'Wewant a design w e can discer nt he ot he: . . . . . ,
or method that permits us to infer causality from correlation because l i abl y; ev enif we can r : . . , , , .
The Fifth Rule . 163
.:.:- : i :l-ratis, the causesof Y correlationis all we have (we cannot observecausalitydirectly). That's what
ir - - ':::.l i e f f ec t 0n Y ex c e p t this chapter has been about. \Wediscovered that, under specific conditrons,
'. - : .\ .'loes not bias our esti- correlation does imply causality. The trick in causal analysis is to create
- \ : ' -:,.idZ.There is no inde- those conditions by comparing like with like-through investigator-
--, -
'r
\- produced random assignment(as in a controlled experiment), through nar,
. : : - . - l' :s :r very attractive alter- urally occurring random assignment, or through some other means of
' -- : r':i9$eStS a WaY tO avoid matching.
- 'i . -..:. rhe c onf oundingc au s a l Consider again the question of whether smoking causes lung cancer.
\il/hen we say that "smoking causeslung cancer," we do not mean that
.- : . '. .:.rFro ac his s im ilar t o th e
: - , . . :--i i e cts m odels . )I n P ra c - all smokers will be victims of lung cancer,nor rhar all nonsmokerswill
: :-- : r -tlstrufitentalvariables. avoid it. Rather, the claim "smoking causeslung cancer" means that
'i:-i- : ,:nobservedconfounding smokers are more likely to contract lung cancer than if they had not
:--- ,: : :-..1l l els or, s iblingm od e l s . smoked and, similarly, that nonsmokers are less likely to contract lung
,.,- -:- :irought of as belonging cancer than if they had smoked. To investigatethe causal effect of smok-
'- .j: : f .red-effectsmodels Pro- ing on lung cancer,then, what we really want to know is what the cancer
' -: , :-.::nqvariablesby provid- rate would have been for smokers if they had not smoked, and for non-
-
i -:r: \. r'.rriesbut C is constant' smokers if they had smoked. Both of these rates are, by definition, unob-
':'. i'- ',':',.e the samevalueson C served,so they must be estimated.The first strategy that comes to mind is
-: - . r ,::rJingvariables(for ex- to use the observed cancer rates for nonsmokers to infer the cancer rates
- :' . .:,rnfi]€rt since theY have for smokers had they not smoked, and similarly to use the observedcan-
' : - .:-:.r].rrt r v o P oint sin t i m e cer rates for smokers to infer the cancer rates for nonsmokers had they
,: - .., :r.iits). By using subtrac- smoked. In effect we use smokers as proxies for nonsmokers and non-
' ., ,.,:-.elin.rinatethe confound- smokers as proxies for smokers. The reliability of the proxy method ob-
-
:: :- : : :. sam e f or t he m at c h e d viously depends on the reliability of the proxies-we must assume that
smokers and nonsmokers are alike with respectto the factors (other than
:- ,:: :r- effects of unobserved smoking) that lead to lung cancer.
: ., :--!rnlberof social science In the laboratory, we try to make the treatment and control groups in-
: ,::gues,fixed-effectsmod- distinguishableat the outset through random assignment.In other words,
-
r:': . : , r. It i s i m P or t antt o no te , units are randomly assigned to the cause units receive the treat-
:-- . -:i rhat X differs for the ment and some don't, and the placement of units inro one group or the
r - -- '. lhc caseof sibling mod- other is solely on the basis of chance.Outside psychology,however, social
: . : r:' the caseof Panelmod- researcherstypically use observational data, and random assignment to
- [ .:. ::e same for the matched the cause is rarely found in observational data. Thus we need to find al-
: -' .i :ffective, then, in appli- ternative methods to investigator-producedrandom assignment.The most
:::: -: - :'.-lnqeover time (or differ common method is regression using covariates; that is, researchersat-
:: - , . -: ',i n der sar e s t able. tempt to remove the contaminating effects of other variables by adding a
set of causal variablesto the regressionequation for Y, thus residualizing
both X (the causal variable of interest) and Y (the outcome of interest) on
the other causal variables.In effect we try to make dissimilar individuals
similar by modeling the effects of the causal variables (other than X) that
: 1 i : -.:.umvent t he s ec ondl a w distinguish them. The limitations of this merhod are well known. Even if
r . : - , --i -rti o n. ' $7e
want a de s i g n we can discern the other causes,we might not be able to measurethem re-
-: : t ::,)rn c or r elat ionbec a u s e liably; even if we can measure them reliably, we might not be confident
164 . Chapter5
a confidentiality statement. (You will need to create an ICPSR account if Now r:ecodemarst3r-
you do not already have one.) variablecalled married
-.
The directions for recoding variablesshould be familiar by now; if you (code das l) . Your enr : . : :
need a refresher,refer to previous chapters. As before, all variable names marri e d,l; 0, not nr . r : :. *
appear in boldface. to MD code,and clic\ '. -
Finally, recode healih
goc,d.guod. si. r [ islJ, t: .
Cross-sectional Relationships between Marital Statws, Health,
heal th_r eso t h; r hr l: , : . -
and Happiness (1995 Data)
the " Recoding,Rulc: ' .
Our dependentvariable,huppy, is a count of the number of days in the past satisfactory,3; 2. pc,,,:.-.
week respondentsfelt happy. Note that huppy refers to the L995 data, i s eonv er t edt o \ lD i *. .
happy2 refers to the 1998 data, and happy3 refers to the 2001 data. The is an ordinal varial.lt. ,',.
other variablesare similarly named. This protocol is important for the sub- pl i ci ty. You shouldh. :. .
sequentFE analysisbecauseit enablesus to create change scoresfor indi- 'Wecan now run rl--:
viduals simply by subtracting variables(e.g.,happydiff =happy3 -huppy). regression" from the "
-
The frequency distributions of huppy, happy2, and happy3 show that dent" box, then married ,.
ASOC respondentsare generally a happy bunch. In all three years the vast "
" R un regr ession. \ , , - : - -
majority of respondents report being happy all seven days of the past
week, and only a tiny fraction say that they did not feel h"ppy at all over Quest ion1. ( a) Do m . r : : .
that period. ple of mostly older -\:::.:
We first estimate a model using the 1995 cross-section.In this analysis (b) D e scr ibehos or ' : : , -
we regresshuppy on marital status and health. Strictly speaking, huppy causalef f ect sof nr . r : - . - .
is not a continuous variable since it is bounded by 0 and 7.The more im- or more om it t ed \ '. r :. : : . : -
portant problem, however, is that the distribution of huppy is heavily
skewed-as just noted, the vast majority of respondentsare in the highest C ri ti cs m ight poinr : : - .
category, "h"ppy all 7 days." So ordinarily we might dichotomize huppy abl es:whet herbec. r r r . -
and use logit or probit regression. For the sake of simplicity here, how- effectsof ear lv chilJl: -
ever,we use ordinary leastsquares(OLS) regression. "naturally" happr' .rr-: :-
\7e need to tidy up happy a bit before we can use it, since the variable happier people Are n-. ::
treats missing-datacodes (98 and 99) as valid responses.Becauseit is im- mai ntaina sr ablenr J: : , r .
possibleto have been happy for 98 or 99 of the past 7 days, we need to re- Our cr oss- see r iur r . r-. . - - ,
code this variable to eliminate those values. Select "Recode variables" w e ccn t ake advant . r : .
under the "Create new variables" menu, enter happy-re as the name of changein m ar it al st . r : - : .
your new variable (make sure that you selectthe option to replaceany ex- By restricringrhe .rrr.r. - .
isting variable with the same name if it already exists), and enter happy as el i mi na t er he ef f eer , , : - '
the name of the variable to recode. In the first row enter "0" for "Value," to be happierthan ori:':,
"no happy days" for "Label," and "0" for "Var 1." (You need to fill in
rows 2-8 for values 1-7 only if you want the value labels on your print-
Maital Status, Healtl:, .;,:
out.) Select"assign the value of input variable #1" in the "'S7hatto do with
unspecifiedcombinations of input variables(if any)" option (this is impor- For the 1995 ASO L u , : . : .
tant). Then scroll down to the "Optional specificationsfor the new vari- cises),we find thar -\:::.: ,
able" section, where you should enter "0" in the "Minimum valid value" Americanswho are fl1,r. :l
box and then "7" in the "Maximum valid value" box. Click "Start Recod- thosewho ar e in poor : . =
ing." You should now have 1,130 valid cases,with a valid range of 0 to 7. the curr ent chapt er . . r . - -
The Fifth Rule . 167
are hidden variablesthat causeboth health and happiness,or both marriage should have 1, 1l6 r - . .r : -
and happiness.Or perhaps there is reversecausation, as just noted in the as for marrdiff,
discussionof endogeneitybias. N ow r un t he f ir e; - . : : .
With FE methods we can gain leverageon the hidden variablesissue as der th e "Analy si<" n: . . .
well as on the reversecausation issue.The cost of FE models is that they and nolongermarried. n..
truncate variance becausethey use only the within-unit variance. In the vari ables.Click "Rur : . - :
caseof panel data for individuals, "within-unit variance" means individ-
ual change over time. Here individual change over time refers to change Question2. O r r r lr r 1. . -
in marital status and health status among Americans, with an oversam- (a) D o cs healr hsr r ll | . -
pling of those age 50 and older. Becausethe possibility of failing health pi ness?Conr par er hc I : .
and the death of one's spouse is greater among the elderly, this sample one sl openot ablr sr ir : : - .
likely provides more over-time variance on the key variablesthan would, reassuring?
for example, a sample that focused on adults who are age 30 to 50. (b) D oes m ar r iagesr : . . - .
The first step is to alter the variablesin the 2001 wave as above.The re- happi ness? Do vor r f : - '
coding and computing techniques are the same; only the variable names effectf or nolonger m ar r ,:.
differ. Recode happy3 into happy3_re (you should have 1,128 valid pi rressin t he cr oss- 'c. : . '
cases),marstat3 into married3 (you should have 1,118 valid cases),and a few sentencesof pl,:-:-.
health3 into health3_re(you should have 1.,137valid cases). riage on happiness.
Next, compute difference scores for happiness and health. Click on (c) R e f er hack t o ) o. . - .
"Compute a new variable" under the "Create variables" menu. In the omitted variables rhr:
"Expression to define the new variable" box, type happydiff=happy3 herealleviat es1or i. r r , - :
_re-happy_re. Selectthe option to replacethe variable if it already exists, (d) On r he basi. of , r l- . .i-
make sure that the "No" option is checked beside "Include numenc of mar it al st ar us. r nJ : : '
missing-data values in computations?," and leave everything else the
same.Click "Start Computing." You should have 1,114 valid cases.Re- Question3. Comp.rr.:-.
peat this procedure for marrdiff=married3-married (this should give How do you accounr:, : :
yor 1,116 valid cases)and for healthdiff=health3_re-health_re (this
should yield 1,131 valid cases).Keep a copy of the output becauseyou Quest ion4. ( a) Vhar : . : - ,
will need it for question 4 below. reported health from 1-- i
Observe that marrdiff has three values: -1 for those who losr a spouse or get worse?What f.:-;:
(due to death or divorce) from 1995 to 2001; 0 for those who did not port worse health:
change marital status (that is, remained married or unmarried); and 1 for (b) rXlhat percentage , : :
those who gained a spouse.To make that clearer in the analysis,let's cre- from 1995 r o 200 | : \ \ : , :
ate two dummy variablesfrom the marrdiff variable-one for being mar- ri ed? What per cenr Jss. . . .
ried in 1995 but not married in 2001 (nolongermarried), and another for (c) D o t he r esult sior - . .
being married in 2001 but not married in 1995 (newlymarried).Our ref- spondents?Explain.
erencegroup is those whose marital status did not change.
To create the dummy variables, return to the "Recode variables"
screen.Recode marrdiff into nolongermarried: you should recode scores Opl ronel Fr xso- Er r r ,
of -1 on marrdiff to scores of 1 on nolongermarried, and then marrdiff WHo Use SAS
scores of 0 or 1 to nolongermarried scores of 0. Then create newlymar-
ried: codes of 1 on marrdiff should remain 1 on newlymarried, but codes Thi s ex er ciseis desigr '.-
of 0 or -1 on marrdiff should become codes of 0 on newlymarried. You son' s e xcellentbook I : . .
The Fifth Rule . 159
:- - .::.:iss. or both marriage should have 1,116 valid caseson both new dummy variables-the silme
:l- ls just noted in the as for marrdiff.
Now run the fixed-effectsmodel. Click on "Multiple regression"un
..:Jen r-ariablesissueas der the "Analysis" menu, then enter happydiff as the dependenrvariable
' !-E nodels is that they and nolongermarried, newlymarried, and healthdiff as the independent
:---rnitt ar ianc e.I n t h e variables.Click "Run regression."You should have 1,091 valid cases.
'r. ':rr' r m c ansindivi d -
: -lnc refersto change Question 2. On the basis of your results for the FE model:
(a) Does health still have a statistically significant, positive effecr on hap-
-..:.>.\ \ it h an ov er s a rn -
., .-:1 i r.of f ailing hea l th piness?Compare the FE slopewith the slope in the cross-secrionmodel: Is
:-.. elderlr',this sample one slope notably steeper,or are they roughly the same size?Is this result
. '.:ri a blest han woul d , reassuring?
.:.:iJs e 30 t o 50. (b) Does marriage still have a statistically significant, positive effect on
, .,.-rre as above,The re- happiness?Do you find a positive effect for newlymarried and a negative
'- ', rhe variable names effect for nolongermarried? Given the positive effect of marr:iageon hap-
-..j h av e 1, 128 v a l i d pinessin the cross-sectionanalysis,are you surprisedby the FE results?In
-.l l Sv alidc as es ) , an d a few sentencesof plain English, summarize the apparent effect of mar-
.- !- !.1 \15/ . riage on happiness.
. .:nd health. Click on (c) Refer back to your answer to quesrion 1(b), above.For each of the
..-::-rl.les" menu. In the omitted variables that you listed, explain how the use of the FE method
.:: happydiff=happy3 here alleviates(or fails to alleviate)their confounding effects.
.::ie if rt alreadyexists, (d) On the basis of the FE results, are you more confidenr rhat the effects
..:.ie "]nclude numeric of marital statusand physical health on happinessare causal?Explarn.
,'.: evervthing else the
r_: - -. . 1.11+ r - alidc as es R . e- Question 3. Compare the R2 values for the cross-sectionand FE models.
i,"i: - :-l :rried (this should give How do you account for the fact that the FE model explains lessvariance?
r.. tj _ -. r
:h I re - healt h- r e ( t h i s
.: :r'-. orltput becauseyou Question 4. (a) What percentageof the respondentsexperiencechange in
reported health from 1995 to 2001? On average,did the health improve
or get worse? What percentagereporr better health? \X/hat percentagere-
- : :,,r th os e who did n o t port worse health?
-- i - : * : u nm ar r ied) ;and 1 f o r (b) \Vhat percentage of respondenrs report a change in marital status
r - : r -: - ,: the analYsis,Iet's cre- from 1995 to 2001 ? What percentagechanged from married to not mar-
::l -,- ..: .-one for being mar- ried? What percentagechanged from not married to married?
r'ur,ci;::r.r,1rriedt. and another for (c) Do the results for (a) and (b) seemplausible,given the age of the re-
spondents?Explain.
.: - * -: ; hange.
* : : ::-. "Rec ode v ar iable s "
irrrJ -- .hould recode scores OprroNer Flxro-Eprscrs ExERCTsE
FoR STuDENTS
:c':.::r.-rrried. :rnd then marrdiff Wuo Usr SAS
--. : . Tren createnewlymar-
. '. n.'*'lvmarried, but codes This exerciseis designedfor students who have accessto SAS. Paul Alli-
,l:- : ,rr.rnewlymarried. You son's excellent book Fixed Effects Regression Methods for Longitudi-
170 . Chapter5
nal Data Using SAS summarizes fixed-effects models and how to estr- A P P ENDI X
mate them with SAS.Chapter 2 of Allison's book contains a running ex-
ample of the use of fixed-effects methods to estimate the effect of family
poverty (a dichotomy) and child's self-esteem(a scale)on antisocial be- SelectedFunher R
havior (a 7-point scale)using a sampleof 581 children who were inter-
viewed in 1990,1.992, and 1994.
On cxperintentsatr,l t:,.:, -
C a m p b e l l , D o n a l d 1 .. _
L x p er i m e nt a l P4 - r 1 - 3 ,'.'
G h o s h . S., a n d C . R . R '
Analysis oI Exptr:..: .':
On fixed-effects r trrt,i, .: :
A l l i s o r r ,Ptu l . 2 0 tt i . F:r ' r : - -
i r g SAS.C a n . \t : \.:. - .
Halaby, Charles N. I , - -,
p r a c ti ce ."An n tttl R , . .. ..
On instrumental t',tri.;:.. ,
A n g r i s t. Jo sh u a D .. .r r .i .
s e a r ch fo r i d e n r i ti ..r :: .
Joutnal of Ecort,,ntt.' . -
On matching metho.l::
B e c k e r. S. O., a n d \. .- .
h r s e d o n p r o p e n \i r \'.. - ..
H o . D a ni e l E., Ko r r r k. I:- '
r n Pa, s n o n p a r a l n e r r r :-
. -:.- -
r i c c au sl l i n fe r e n ce .' . . '
M o r g a n. Sr e p h e nL .. .r n .,.
e f f e c ts: Pr o sp e cr ..l n J : :- .
and RcsearchJ i; i-a
R u h i n , Do n a l d B. l e - l . . -
t o r e m o veh i a s i n o h ..: .
l 9 - 9 . "U si n l tr :.:-
t o c o n tr o l b i a s i n ,,1 ....
A s s o ci a ti o n - 4 :i l q - l :
;:ls and how to esti- A PP E N D IX
' - :1t.1ifls a runnlng ex-
:'.-.rr.rhtseffect of farnily
:.-.le on antisocialbe- SelectedFurther Reading
^ . .i ren rvho were inter-
On experimentsand quasi-experimen!s:
Campbell, Donald T., and Julian C. Stanley. 7966. Experimentdl and Quasi-
Experimental Designs for Research.Boston: Houghton-Mifflin.
Ghosh, S., and C. R. Rao (editors).1996. Handbook ctf Statistics13: Design and
Analysis of Experimenls.Amsterdam:ElsevierScience.
On instrumental uariables:
Angrist, Joshua D., and Aian B. Krueger. 2001. "lnstrumental variablesand the
search for identification: From supply and demand to narural experimenrs."
Journal of Economic Perspectiues15:59-85.
On matching methods:
Becker, S. O., and A. Ichino. 2002. "Estimation of average rreatmenr effects
based on propensity scores." The STATA lournal 2:358-77.
Ho, Daniel E., Kosuke Imai, Gary King, and ElizabethA. Stuart. 2007. "Match-
ing as nonparametric preprocessingfor reducing model dependencein paramet-
ric causal inference." Political Analysis 15:199-236.
Morgan, StephenL., and David J. Harding. 2006. "Matching estimatorsof causal
effects: Prospects and pitfalls in theory and practice." Sociological Methods
dnd Research35:3-60.
Rubin, Donald B. I973. "The use of matchedsamplesand regressionadjustment
to remove bias in observational studies." Biometrics 29 185-203.
1979. "Using multivariate matched sampling and regression adiustmenr
to control bias in observational studies." Journal of the American Statistical
A ssuciat ion 74:3 18-28.
CHA P T E R 6 1.972 . Fr oml! / 1 16 1: - :
rng ycar s) isin, . cl9q l
sentat iveof t hc ch. r : - -' -
The Sixth Ruie samplef or each nq\ \ >. . -
tracking changein L- . r -
USEPANEL DATA TO STUDY INDIVIDUAL CHANGE AND 1972 G SSsam plc\ \ c: : : -
i n 195 4. So if t he G SS , ,
DAIA TO STUDY SOCIAL CHANGE
REPEATEDCROSS-SECTION
w oul d consistonlr , '- - -- :
B y select ingJ f r c. r . . : -
ci fi c i ndividualsor cr : ". .
moni r or ingindividu. : - -
Chapter 5 focused on causal effects-what they are, and how we might and l ar ge paneJd. r r . r. . : .
estimate them as social scientists.Investigating causesis not the only aim research.
of social research. Sometimes the primary aim of research in the social In view of the limi:--. 1
sciencesis to describekey featuresof the social world, and how those fea- designs,the bestd;1r.r.=:.
tures are changing, This chapter focuses on the question of how to study data setsroutineh' .ldi :: -
change. Although researchersfrequently study change at the individual ti ve sam plelsom er t f . r : -
level in order to investigatecausal relationships, the study of broad social the respondents.1rcI
changemost often is descriptive.Generally the aim in aggregate-levelstud- orher hybr id desic. : r ...
ies is to describe broad change and its correlates-to get the basic facts secti o nr "I am r eiar f : : : :
right. Both types of studiesare needed.Indeed,I seedescription and expla- study p r im anlr r cr r: . -
nation as complementary, since we need to know what there is to explain each time? The be.: ..:: : :
before we set about to provide an explanation. ten rely on panel J.r:.r
Rule 6 statesthat you should use panel data to study change at the indi- so there is good rc.r. - -
vidual level and repeatedcross-sectiondata to study broad social change, you are not interestal .: :-
such as change in societies.An example of the repeatedcross-sectionde- This chapter preSr:ri:
sign is the repeatedsurvey, where the same questions are asked from sur- pl oi t the panel and r : : . . :
vey to survey,but a new sample is selectedfor each survey.Many repeated availablein the socir. , -
surveys,such as the American National Election Study, are spacedat regu- of the m odelscan L. - : .
lar intervals (usually every year or every other year). Other repeated sur- The next sectionsers-.: -
veys,such as the Gallup and Roper polls on the Equal Rights Amendment, d if ferencebetr,recn F .. :
are spacedat irregular intervals (seeassignment1 in chapter 4).
Unlike repeatedsurveys,panel surveysfollow the sameindividuals over
time. Although repeatedsurveysand panel surveysboth enableresearchers A N eryr t c Dlr r ent st . - .
to study change over time, they are designedfor different purposes (Dun- C no ss- sr clor D'. . .
can and Kalton 19871. Panel surveys track how individuals change over
time-for exarnple, whether an individual has switched her political alle- S uppo sewe want t o : : i : i'i
giance from Republican to Democrat, or vice versa-whereas repeated on X a nd Y f or a \ . lr - ': :
surveys track aggregate change over time-for example, change in the ti me 2. What analrr i. . , -
proportion of Americans who identify themselvesas Democrats. ual sover t im e, a: ir r : : . . : . .
Repeatedsurveysand panel surveyshave become staplesof modern so- The panel dcsisn : : . - .
cial research.One of the best-known repeatedsurveysin the United States cross-sect ion design.I . : . : .
is the General Social Survev. an omnibus survey of attitude chanse since change;t hat is, \ \ 'q . . - - "
The Sirth Rule . L73
- -- , , :--red her political alle- Supposewe want to estimatethe relationship between X and Y from data
: -r :':-:'-11'l-ttttasrepeated on X and Y for a sample of N, individuals at time 1 and N, individuals at
change in the time 2. ril/hat analytic leveragedo we gain by follorving the same individ-
-r...I.nple,
: - -: r. .:: D em oc r at s . uals over time, as in the panel design?
: - : - .i -.;>tJP les of m oder n s o - The panel design offers two important advantagesover the repeated
:: , - :- - -:,:r's i n t he Unit ed S t a te s cross-sectiondesign.First, with panel data we can measuregross individual
j
.rrti t udec hanges in c e change; that is, we can measure all the change that is taking place for
174 . Chapter6
individuals (for example, how many individuals marry or remarry, and other terms are pirra:r-i.:
how many divorce, over some period). With repeatedsurveyswe observe ei fi c to t ir t r e/ ( r . r lr t . : : -
only net individual change, for example, change in the overall percentage and not otheru'ise.1J. -: '
married or divorced (though of coursewe can ask people about their mar- Y , and p, is t he ef f r r :
ital histories). effcet o[ X on ) dt';. :. .
The second advantage is that panel designs provide two or more de- the subscriptr.)
grees of freedom for each individual in the study. As a result, panel de- The t er m 7F, ho, c. : : .
signs enable us to use fixed-effects models for individuals. Recall that for the it h indir idu. r . ' -
with fixed effects we are able to remove the constant effects of enduring individual. In other r.. :-
individual traits, as described in chapter 5. This is an important virtue of enoughdegr eesof ir . . - -
panel data becausewith fixed-effectsmethods (for example, differencing) for each individr r . r :l : : - -
we can very often remove the effects of significant confounding variables greesof freedonrt,. u : '
even when we don't know what those variables are. Becausewe cannot To i l l ust r at e,suf ln. : .
apply those methods when we have only one data point for each individ- 1.000 individuals c. r , : . ,
ual, in the case of repeated cross-sectiondata we must actually measure N -1or 999 dum m r ", . '. :
enduring individual traits in order to remove their confounding effects. w e ha ve 1, 000x. 3= i-
This point harks back to what was called, in chapter 5, the "first-difference donr t o est im at ccqLl, r L.: .
advantage": The first-difference model replaces the standard regression for repeatedcross-:r.i "
assumption that measured and unmeasured causesare uncorrelated with ual s. r equir ing J. e9- *.
the less restrictive assumption of stability in the unmeasured causesand sectiondata, then. to., : :
their effects. i t si ncewe cdhnot r r : : .
In short, panel data sets, but not repeatedcross-sectiondata sets' per- fixed-effectsmethods.
mit you to use fixed-effects methods to control for unmeasured traits of
individuals (Liker, Augustyniak,and Duncan 1985). Becausethis point is
central to what follows, it is useful to write out a model that demonstrates TH nrE G ENEReIQ r r -
the panel advantage more formally and provides a point of comparison
for repeatedcross-sectionmodels. C hapt er6 is or ganizc; , -
Consider the simple casewhere X one of the causesof Y, changesover ti me. O ne Q uest ionf r ', - .- :
time for individuals, while the other causesof Y are constant over time. ci al change( aggr eg. r t r.-
To simplify notation, imagine that these constant traits-the ones that and socialchangeh) . , . .
confound our estimation of the effect of X on Y-can be captured by a cial change-
single variable F ("fixed traits") and that the effect of F on Y is constant 1. Changing- elle, :, . :
over time. Then we have a simple two-regressor model where Y is a func- The quest ion aic: .
tion of a variable X that changes over time (or a set of variables that changed0\ er t 'li; :
change over time-the model readily generalizesto multiple Xs) and a 2. Conuergence/tlti, ":
variable F that does not change over time: The quesr ionh. r . -
YI t.f '= a+ a+ y F + B X +e. (6.1) it changed in the ..',:
t'tttIt
Y', is the value of Y for the lth individual at time /, F, is the value of F for I Wi rh more than one . .: , .
the ith individual (there is no subscript / since F is the same for individual yF, (so v ou don' r i i reral l r ' :.. -
I regardlessof the time of measurement),X,r is the value of X for the ith ampl e, y ou c oul d ure .r fi r.--- ".
individual at time t, and e,,is the error for the ith individual at time t. The of dummy rari ahl eshere hr - r .
The SixthRule , 175
* - .-..irr\ or remafry, and other terms are parameters:a is an overall constantr a, is a constant spe-
:- : - , 'r: : : iLI f V C Y SW e O b S e f Ve
cific to time / (capturing change in Y that is constant across individuals
and not otherwise accounted for by the regressors),y is the effect of F on
- - i- -: : :|le o|erall Percentage
Y, and F, is the effect of X on Y at time /. Our aim is to estimate6,. (If the
: r - . . -:'rFl e aboutt heir m a r-
effect of X on Y does not vary over time, then F,- F, und we could drop
the subscript /.)
:. - -' :- '..:.l env o 0r m or e d e -
The term yF, holds the key. Because7 is a constant and F- is a constant
r- : . ' i ' \. a r es ult ,P aneld e -
for the ith individual, y F, can be captured by a dummy variable for the lth
-r : : - - .;.\i d uals . Rec all t h a t
individual. In other words, we don't need to measure F at all if we have
.-: - .,..,-.:eff ec t sof endur i n g
: - - . .',r-important virtue of enough degreesof freedom to capture yF,by the use of dummy variables
- ,i- - r\-1mple,differencing) for each individual in the studl'. With panel data we have the neededcle-
--. :- 'nfoundingv ar iab l e s greesof freedom to do that. With repeatedcross-sectiondata we do not.
.-
:' . : ' .,:'i . Be c aus ewe c ann o t To illustrate, supposewe have three rvavesof panel data consisting of
- : -,- . : :,:nt f or eac hindiv i d - 1 ,0 00 i ndi vi dual s each. B ecausethere are 1,000 i ndi vi dual s, we r r eed
: l -..i :Iac t uallYm eas u re N- 1 or 999 dummy variablesrto capture the effect of the yF,. Because
:::,
--
: . r ctxfounding effects. we have 1,000x3=3,000 data points, rve have plenty of degreesof free-
- - - ,rr:r i. the "first-difference dom to estimate equation 6.1 without measuring }. In the analogouscase
' : r . - . --:- .trndar d r egr es s i o n for repeatedcross-sectiondata, hou,ever,there are 3,000 separateindivid-
-:: - r, -:-,l re unc or r elat edwith uals, requiring 2,999 dummy variables.In the case of repeatedcross-
sectiondata, then, to control for the effect of F, rve must actually measure
: -:rlleilsufgd causesand
it since we cannot remove its effects through dummv variables or other
fixed-effects methods.
il : - -i ,i -:-ctl on dat a s et s ,pe r-
-:- : : tlnmeasuredtraits of
-
-.-: - -'i , Be c aus et his P oin ti s
THnEn GENp,nerQur,srroNs ABour CHeNcr.
:: , - . :' lel that demonstrates
:- - . - .: ;o1nsof c om par is o n
Chapter 6 is organized around three general questions about change over
: ' '.-. - -:-.:esof Y, changesover time. One questionfocuseson individual-levelchange,one focuseson so-
..:. - : ,-::L- ConstantOVertlme. cial change (aggregate-level
change),and one bridges individual change
,: - - . : ,::: i ri l it s - t he ones t h a t and social changeby asking how individual-levelchange"adds up" to so-
' -.en be captured by a cial change.
r :- : - '---:,rf F on Y is c ons ta n t 1. Changing-effect models for panel and repeated cross-sectionddta.
'--:-- - ':-. Jel n'here Yis a func- The question addressedhere is: Has the association of X and Y
. - : : ,i iet of variables that changed over time?
.-. ' , i -:(r m ult iple X s ) and a 2. Conuergence/diuergencemodels for repeated cross-section data.
The question here is: Has Y changed over time for societies,and has
( 6 .1 ) it changed in the samedirection and at the samepace for all groups?
,: : -:.: .. F is the value of F for I With more than one observation per individual, you have severaloptions for capturing
. --: - .- :l e sa m ef or indiv idu a l yF, (so you don't literally need to enter 999 dummies). For two waves of panel data, for ex-
'i ample, you could use a first-differenceequation, as described in chapter 5. I use the example
, - :-- ral u e of X f or t he l th
of dummy variableshere becauseI assumenrost readersare familiar with dummy variables.
- :- : : - :'.Ji i ' idualat t im e / . Th e
176 . Chapter6
3. Bridging indiuidual and social cbange. The question here is: How pl ane , so we assunr . : - '
much of the aggregatechange in Y can be accounted for by change (This ;rssunrprionr: : : .
in the population composition? si ons . )Tn equat ior r, - . '
y-i nt er ceptar r inr r : . :
The models differ in the way time is treated. Time is treated as cate-
gorical in changing-effectmodels and servesas the key subscript for F2 is the cffcct oi \'
w c a ssum gdoc: n', . . . . .
variablesin the models. In convergencemodels, time is treated as con-
scri pt ivepur pu5qs, - '. .... :
tinuous and servesas the primary X-axis. In models that bridge individ-
fectswe obsen'e. r r r . . , .
ual and social change, time servesas a subscript to obtain parameter
the change in r he c; '- - :
estimatesthat are used in a secondstep to estimateoverall change over
i ng X at r im e I in . l - : - : -
the time interval.
changing effecrs.
A sr at ist icall). 'iS: - , -
effecr of X on ) h. r . , .
CueNcrNc-EFFECT
MoDELS,PART1: Two PorNrs rN Trl.r
sider again the e,r.rr: .
chapt er 5 r ve nor eJ : : . , : -
Changing-effect (or changing-parameter)models ask whether the assocra-
fect of sout her nr c: Jr '-
tion of X and Y has changed over time. For example, has education be-
'What that ef f ect has c/ . '. r , : --.:
come a more important predictor of individuals' earnings? about strongestin t he So. : : : "
the association of race and earnings-are racial differences diminishing? ci ati o n of r egion i: - : :
Do Catholics still tend to have larger-than-averagefamilies, or has that decades.To deterrr:,:..
fertility difference disappeared?To fix basic concepts, we begin with the mi rri s hedwc ean. \ '. . : . :
caseof data collectedat just two points in time. regatjonist at r ir uJc.
stati sr icalllsignr t i; . : : : -
Case 7: Ttao-Vaue Panel Data witb Time-VatyingXs differencesin sesrc=-,.:-
gi on a s a dum m r \ '. - , r .-. .
Imagine you have two waves of panel data, and you are especially inter- ti al re gion t o br r r - - . . :
ested in the effect of a particular individual trait that changes over rime. hypot hesisr har r he . ". - -
\We denote that trait as X to distinguish it from the other observed van-
now than earlier.
ables,Vs (not to be confusedwith Vi/s,the unobseruedcausalvariablesdis- tV it h paneldlr , r . : : . - .
cussedin chapter 5). With two waves of panel data we can regresschange our m easur edver i. t b. . . :
in Y from time 1 to time 2 on changein X and change in V from time 1 to purge X and Y oi r r -
time 2, as in standard first-differencemodels. The aim in a changing-effect virtue of fi rst-di t'fcr.: , =
analysis(as the name suggests)is to determine whether the effect of X has purgingis a ber r er( . r '- ' -
changed from time 1 to time 2. To find out, we add X,, to the first-difference chapter5. hou er er .r : . . - .
model: on the indeperrderrt-,: ..
Y,r.- Yu= (oz- c-r)+ F2 (X i2- X iI)+ $z- F)X,, denceon r acialat r ir r : - : - .
fecr of sour her nr c\ r - : ' - -
+ y (Vi z -V ,r) + (e ,r- e ,r) (6 . 2 )
who moved inro or , :.:
To simplify we assume only one X and one conrrol variable V; the to estimatethe el-rs,: l
model easily generalizesto multiple Xs and Vs. X and V are time- ti me 2 in t he abser r - . '
varying regressors(later we consider the casewhere X is constant) that senceof m igr ar ior r... : : - -
can be either continuous or discrete; nonetheless,it is convenient to for ever yone.I nt er c. : . :-.
s peakof t he a p a ra me te rsi n e q u a ti o n6 .2 a s y-i nterceprsi n a regressi on first-difference mutic.: : -
The Sixth Rule . I77
on racial attitudes cbanged from time 1to time 2.That is the topic of the ln short: Although '.',. -
next section. gender qr Cat hulr ! u; . ': . . . ,
ferencingmcthods ro :': ^
i I w e a r e int er est clr: : . : *.
Case 2: Ttao-:WauePanel Data uith Time-Inuariant Xs
use the differencing - -,
Equation 6.2,1ike equation 6.1, removesthe confounding effectsof the F,- Equation 6,2 also c,li. r; -
the constant effects of fixed individual traits-without measuring those that var y over Linr c.L. . '.
traits. This is the feature of fixed-effectsmethods that makes them attrac- w ork for r epeat ed5ur .- -
tive: rWecan use fixed-effectsmethods to remove the effects of fixed indi- l ow i n divr dualsor r r : .
vidual differences,evenwhen we do not measurethose differences.It turns appro pr iat cf ur ir r rc: , . - . . .
out. however.that in some instancesthose are the effects of interest-and
we cannot estimatethe effectsof individual differencesif we have removed
them. Cqse 3: Repealed Ct'o-.-.-
In other words, differencingwill not work if we are interestedin the
One way to think abi.-:::r
enduring effects of fixed traits becausedifferencing removes the variance
Y has changcdor cr : . : :
of interest. But if we are interested in estimating change in effects-
the effecrof X on ) '. I : - . - .
which ls the oblective in the case of changing-effect models-then fixed-
i ncl uder im e in r hc n: : .
effects methods can be used even when the traits of interest are fixed.
of interest.
To illusrate, suppose we hypothesized (1) that Americans who are
l ma ginewe hnr c r . ',
"born Catholic" (that is, Americans who grow up in a Catholic home) are
the oth er adm inisr cr - : , - -
more likely to disapproveof abortion and (2) that the effect has weakened
focal variable X irn,i -: . -:
over time, reflecting the declining influence of official church doctrine '.
i zesto m ult ipleX' . i: . * -
among American Catholics. Fixed-effectsmethods can be used to exam-
or di s cr et e;noner hcl. . . . -
ine the secondhypothesis but not the first. Let Y be a measure of attitude
inrerceptsin a reg,rq.. - :
about abortion, coded so that high values reflect disapproval of abortion
one of the regressor\.-,.-
and low values reflect approval. Let X be a dummy variable for Catholic-
W e could est im . lr r: : : :
born (X-, =Xi2= 1 for those reared Catholic, 0 otherwise).Becausethe
lows:
term (X,r-X,,)drops out in equation6.2, we cannot use equation 6.2 to
test the hypothesisthat Americans who grew up Catholic are more likely Y ,,=d, t Br X, . ', \ ' -
to disapprove of abortion. The equation does shed light, however, on
whether the effect of Catholic upbringing changed from time 1 to time 2. Y ,z =0: I p, X, _- ', \ '-
The coefficient for X,, estimates(82- 91),so a negativecoefficient for X,,
The su bscr ipti indcr . - '
is consistentwith our hypothesisthat the effect of Catholic upbringing on
spondentin t he / t h : . : : . .
anti-abortion attitudes is weaker now (assumingthe effect was positive in
that the ef f ect ot , \ or , :
the first place).2
on Y a t t im e 1 is den, : . :
2 B e c a u s eC a th o lic u p b r in g in g is a fixe d tr a it fo r indi vi dual s,the term (X,r-X,,) drops noted Br.
out in equation 6.2 and we have:
: - - - - .r.ri is the topic of the In short: Although we cannot estimatethe effects of fixed traits such as
gender or Catholic upbringing with differencing methods, we can use dif-
ferencing methods to estimate the changing effect of such variables.Thus
if we are interested in estimating changesin the effect of gendcr, we can
r- I-. -- -r'+ Yr
use the differencing model of equation 6.2 when we haye panel dara.
r - ' .-:'.,.:.r ef fectsof the F,- Equation 6.2 also can be used to estimatethe changing effects of variables
:; :-- .::-.,,-! nlcasuringthose that vary over time. Fired-effects models such as equation 6.2 do not
'r:: ,:. :- -,:ilakes them attrac- work for repeatedsurvey data, however,sincerepeatedsurveysdo not fol-
-:- , : :ffects of fired indi- low individuals over time. As we now see, an interaction model is more
:i. - -- .: .ltfferences. lt turns appropriate for investigating changing effects with repeatedsurvey data.
!- - '- r ::::JIi of interest-and
I - -: t :"::. i i n ' e hav er em ov e d
Case 3: Repeated Cross-section Data
i r, : .'.: -'.ieinterestedin the
One way to think about the question of whether the associarionof X and
'l:l:-, : ; : anlO V e S t h e v a f l a n C e
Y has changed over time is to think of time as a variable that moderates
::r --r - in effects-
- -'l:tnge the effect of X on Y. Instead of using time as the basisfor differencing, we
: - r-:,i :l l odels - t hen f ix ed -
include time in the model as a r.ariableand interact it with the variable(s)
-- :
: :-., .' . illtcrest are fixed. of interest.
'-
: :-..: -\mericans who are Imagine we have two repeatedsur\-e\-s,one administered at time 1 and
:: .-r I .i Catholic home) are the other administeredat time 2. To simplifv the model we assumea single
: -: : :r: e ffec thasweak ene d focal variableX and a singlecontrol r-ariable\'; the model easilvgeneral-
-,!: : :i;ial church doctrine
izesto multiple Xs and Vs. The variablesX and \/ can be either continuous
- : i- -:.Jn b e us edt o ex am -
' or discrete;nonetheless,so that we can speak of the a parametersas )-
:: : r: .1measureof attitude intercepts in a regressionplane, it is often convenient to think of at ieast
' r : r : - r ;-sappr ov alof abor t io n
one of the regressorsas contlnuous.
: : - ':". .ariable for Catholic- 'We
could estimate the effects seDaratelvfor time 1 and time 2 as fol-
lows:
. , : - : -rtL )rus eequat ion6. 2 to
:: ,,. - : t :rholic are more likely Y i t= a t+ FtX ;t+ yV ,r+ e,, (6.3a)
' : :.
-rcd lrght, however' on
- : ' Y,r= ar+ BrXiz* 7V;:t €, (6.3b)
- -: : :ro m t im e 1 t o t im e 2 .
- : "r:-rtite coefficientfor X,, The subscript I indexesindividual, so {, is the value of Y for the lth re-
:-' : -- -r rrholicupbr ingingo n
spondent in the /th survey (t--1,2). Note that these equations assume
- - ' - - :-. e i fe c twas pos it iv ei n that the effect of X on Y might have changed over time : the effect of X
on Y at time 1 is denoted B. and the effect of X on Y at time 2 is de-
noted Br.
: .: .. :he term (X,, X,,) drops
One way to determine whether the effect has changed, then, is to esri- \7ith panel data se .;.: ...
mate separatemodels and compare coefficients.But there is a simpler way. a given set of inditic'-:: .
Consider again the issueof Catholic upbringing and abortion arrirudes.To whether the effecr ot -\
determine whether the associationof Catholic upbringing and attitude to- if we are interesre.l .: -
ward abortion has changed from one survey to the next, we can estrmate gencral populatior.r.r:-.. .
the parametersfor both equations in a single step by regressinga measure appro pr ir t e. This r r . : -
of abortion attitudeson a dummy variablefor time (7, coded 1 for time 2), fresh sarnplefclr errch.-.:,
a dummy variable for Catholic upbringing (X), a dummy variable repre- rcprescn t lt ive slt r r pl'
senting the interaction of Catholic upbringing and time (XxT,), and a chang ein t he cali- r . ll: : : , -
measureof the control variable V: metho d is likeiv r hr .h- : : . -
effectsof confor-rndinc', ..:
Yu= dr+ (cr2
- ar)T.+BrX,,+(82-B,)6 ,xTr) + yVo+ e, (6.4)
For r epeat edcr os: - : . . :
Equation 6.4 subsumesequations 6.3a and 6.3b (as you can see by efi ecrsr estBr + B, i< f : - : - .
substituting Tu=0 into equation 6.4 to reproduce 6.3a and Tz=7 rnto models as descr ihcJ. : .
equation 6.4 to reproduce6.3b). No important information is lost by es- 7997, chap.13 ). In sr:: * ,::
timating 5.4 in place of 5.3a and 6.3b-in facr, equarion6.4 is better be- to kno w. f or exar npl. . . , .- .
cause it gives you an easy way to determine whether or not observed for al l gr oups,r h. r rii. . , .'
change in the effect of X is statistically significant. Note that equarion 6.4 i s the s am ef or nr er r . r . ,
contains terms for the focal variable X, for time (Tr), and for the interac- and no nsout her ner . .- , : .
-
tion of X and time.3 not refer to gender -
-'-ir:-
The aim of equation 6.4 is to determine whether the effect of X on Y to time-there is .r :::--
has changed over time. The key is the interaction term, XxTr. The pa- know is whet hert lr c : i : - .
rameter for this interaction term is (82-81), change in the effect of X groups, in the s.rm. .'..:.. : -
from time 1 to time 2. Hence a statistically-significantcoefficient for the has rhe sameeffeer, :.
interaction term indicatesa change in the effect of X over the time period i cansas it doesior n: , : =
examined. In the language of interaction effects, rhe effect of X on y is S upposewe f inJ : : . . : : '
"moderated" by time. change?In the caseo: :.:. ,
In the case of Catholic upbringing and abortion, rhe parameter for i ng populat ionconr l. . :
the interaction term indicateswhether or nor the effect of Catholic up- ti me I t o t im e 2. ecl! ! :
bringing has changed over time. Recall thar equation 6.2 also esti- vesri g at ingchangeri: : - - -
mates Fz-F, except that equation 6.2 has the advantage of Statesin 1970 r.ersu:l
differencing out the effects of the confounder V without measuring it. ti al changein t he [ - . r -
Even if we measure V perfectly in equation 6.4, we do not expect this rul es out such ef ie. r : : - , .
equation and equation 6.2 to yield the same value for Br-B, because (thoughcom posir ior '. r ,- '
the meaning of Fz- Ft differs for the two methods, as rhe ,ri*t puru- enti al a t t r it ion) . I n r - . -
graph explains. changingpopular iun. _
The interaction model of equation 6.4 and the differencing model of observedchangein r h. , - - -
6.2both can be used to investigatechanging effects of X on Y. But the \fl e c onsiderr he . : : . - .
two models ask somewhat different questions about changing effects. detai l l a t er in t he chap: . . .
H ere th e im por r anr r , - . :
rA note on notation: a typically denotesy-intercept ri on i s t hat ir can ch. - .-' , :
and p typically denotesslope. Equa-
tions 6.3 and 6.4 use .r as the parameter for T and not X becauseT is always a categorical another .To det er m r n. - - .
variable in changing-effectmodels, whereas X could be either categorical or continuous. secti ondat a, r eest ir l. lt : I - :
The Sirth Rul.' . 1\
\fith panel data we can ask whether the effect of X on Y has changed for
a given set of individuals. Nfith repeatedcross-sectiondata u.e can ask
whether the effect of X on Y has changed in the generalpopulation. Thr-rs
if we are interested in describing change in some association in rhc
general population, the repeated survey/interaction method is the more
appropriate. This result should not be surprising since, by selecung a
fresh sample for each survey, repeated surveys generally provide a nors
representative sample of changing populations. If we are interested in
change in the causal effect of X on Y, however, the panel/differencing
method is likely the better choice, since differencing removesthe enduring
effects of confounding variables.
For repeated cross-sectiondata where X is continuous, the changing-
effects test B2+ B1 is the tesr for nonparallel slopesin analysisof covanance
models as described in statistics texts (for example, Agresti and Finlay
1997, chap.13). In standardanalysisof covariancemodelswe might want
to know, for example, whether the income return to education is the same
for all groups, that is, whether the effect of years of education on income
is the samefor men and women, for whites and nonwhites, for southerners
and nonsoutherners,and so on. In the caseof equation 6.4, "group" does
not refer to gendercategoriesor race categoriesor regional categories,but
'$fhat
to time-there is a time 1 group and a time 2 group. we want to
know is whether the effect of X on Y is the same for the time 1 and time 2
groups, in the same way that we might want to know whether education
has the same effect on income for African Americans and Hispanic Amer-
icans as it does for white Americans.
Suppose we find that the effect of X on Y has changed. IThy did it
change? In the caseof repeatedcross-sectiondata, one possibility is chang-
ing population composition-the population membership changes from
time 1 to time 2, especiallyif the time interval is large. For example, in in-
vestigating changes in the determinants of voting behavior in the United
Statesin 1970 versus2000. we would need to bear in mind the subsran-
tial change in the U.S. electorate over rhe period. The use of panel data
rules out such effects since the same individuals are followed over rlme
(though compositional effects on a small scale might occur due to differ-
ential attrition). In the case of repeated cross-section data, however,
changing population composition might account for some or all of the
observedchange in the effects of the Xs.
\7e consider the effect of changing population composition in more
detail later in the chapter, when we examine cohort replacement effects.
Here the important point to nore about changing population composr-
tion is that it can change the observed association of one variable rvith
another. To determine if that has happened we can, with repeatedcross-
section data, reestimate the association using a sample restricted ro the
782 . Chapter6
birth-cohorts present in the population at the initial measurementpoint.a thirty coefficienrs.s.' :-' :
(The term "birth-cohort" refers to people born the same year.) Larer in w i th m acr o- leleI r ar r . ,
this chapter I describe a method that isolates the contribution of chang- -.
w ard mult ilevelln. r i, . : - .
ing population composition to societal trends. As I show there, the
method can be used to estimate how much changing population compo- 'Vbat'Ve'Want
sition contributes to any trend in societS from declining voting rates to to Knr't
rising rates of online shoppingto changein norms about sex (to cite just \/hat ar e t lt e r r r . t r r r, . . .
a few examples). w hat c ondit ionsh. r r . - : . . .
In short: We can do quite a bit with data for just two points in time. chartg covEr [ ir uei q. , . :
That holds for both panel data and repeatedcross-sectiondata. Certainly feren cesin r at ss ui , r : . .... .
two points in rime is better than one. But multiple data waves are better voti ng f or Dcr r r u. r . r r r . .
still.'tThe next sectiontakes up that issue. -
w hy the ef f ectoi eJLr ., -
Thesesorts oi qrr..:
varyi ngm acr o- lc\ cl. ': . - :
CneNcrNc-EFFECTMoDELS,PART2: Murrrrp.vEL MoDELS levelanalysiswas delt-, :".
WITH TIME ASTHE CoN:tIx:t i t has pr oved useiLr:l : : -
merhodherewe think
This section describeshow to use multiple waves of data to examine the machiner y.includir r ,::: . . .
correlates of changing parameters.The question of how to exploit multi- approachinspired1., :- - -
ple waves of data is becoming more relevant as we cumulate longer trme- tion (seeDiPrerei.lnJ ( ---
seriesin the social sciences. effectsituationu hqr. ..
The two-wave changing-effect models described above are limited in
what they can do. The models are designedto detect, not explain, change
in individual-level associationsor causal effects. The major advantageof Tbe General Mtrltilet e.
"
the two-wave models above is that they make few data demands: Two To i nt r oducet he g. ': - - . , -
surveyssuffice. Increasingly in the social sciences,however, we have mul- gregat ion,wher e inc: . -
tiple surveys to exploit. The changing-effect models above quickly be- he extendedto firorcr: ,' .
come unwieldy as we add waves of data. If we want to srudy how the rooms nest edin s'h. ,
effect of X varies over (say) thirty surveys,we would have thirty regres, pl i cationsof r he nr ,', = . '
sion coefficients to compare, one for each survey, To summarize those consist sof a "lelcl l'
To fix basicconcepti.j -.
aTo illustrate, suppose we found a steadily rising gender gap in voting for
Democratrc arrdfwo level2 r cg, r . . . - ,
versusRepublican candidatesfrom the 1984 national election to the 2004 national election.
Part or all of this growth in the gender gap could be due to greater gender differer"rcesin vot- , -. , -,
L rv L L
I
l
/ ^", n-n\
rvt L
\ rvllLnul
ing among the new cohorts ofvoters (those rvho became eligible to vote between 1984 and
2004, that is, citizens born from 1967 to 1986). To examine the effect of changing popula- The levelI equatiur.
..,
tion composition, then, we could eramine the gender gap rvith and rvithout the new birth
cohorts. If gender differences diminish significantly when lve remove the nerv cohorrs, we
Y rl =8..+B,X
' \tl ' rl ;
-t
rvould ir"rferthat the cohorts recently added to the U.S. electoratecontributed to the growth j -
Y ,,i s t he valueof ) :-
in the gender gap.
i With three or nlore rvavesofpanel data it is possible
to bring evidenceto bear on the is-
val ue of X f or r hc : :
sue of whether the treatment effect in an experiment is better estimated bi' using change in from the standardrc: :..
Y (that is, Yr-Y,) as the outcome variable or by using regressionto conrrol for the initial acrosscont e\ t s, A) : : - - - . .
value of Y(that is, by including Y, as a conrrol variable in a rnodel of Yr). SeeAllison (1990) (nor j ust t he var i. r h. . -
and Morgan and Winship (2007, chap. 9).
F,, i s rhe slopet t hc . '- . - -
Thc S i rth R u ie . 1: r
- -- r.rsurement point,4 thirty coefficients,we might very well correlate the coefficientsthemscllcs
. . ::ilc \ ear. ) L ate r in with macro-level variables for the thirty measurementpoints-a step to-
- -: : ih utio n of ch an g- w a rd mul ti l evelanal ysi s.
.. r.. i shorv there, the
' - - -: :r)pu latio n co mpo -
WhatweVant to Know
,: - -.:r]q \-otlng rates to
.ri ss-\ (to cite ius t I7hat are the macro changesthat matter for the X-Y relationship, that is,
what conditions have changed over time to causethe X-Y relationship to
.;r.; - .--. :.r.o points in time. change over time? Ve might want to know, for example, why racial dif
:: . .,. j-: . -, n d ata . Ce rtain ly ferencesin rates of imprisonment have grown, or why the gender gilp in
t.::-:- \\'aves are better voting for Democratic candidateshas grown. Or we might want to know
why the effect of education on earningshas increasedin the United States.
These sorts of questionslead naturally to a muldlevel model with time-
varying macro-levelconditions as the contextual variables.Although multi-
l . l ,-,p r ls level analysiswas developedinitially to study the effectsof school contexts,
it has proved useful for the study of other contexts as well. To apply the
method herewe think of time as a context, so we can wheel in the multilevel
--riJ to exarline the machinern including multilevel software,to apply to change.The multilevel
- .'. t o er ploit m ulti - approach inspired by the school-effectsliterature can, with some modifica-
- ..:':. r l. t tlonger
e r ime - tion (seeDiPrete and Grusky 7990a, 1990b), be applied to the changing-
effect situation where it is time. not school,that forms the context.
,[ -,- : r ,:rrrveare limited in
, -r. :,rt er plain, c hang e
The General Multileuel Model
. -: ::r.ljor advantageof
, '. :,ria demands:Two To introduce the generalmultilevel model, let us focus on two levelsof ag-
\:-- - r -\ Pr ..
"
h.rr" mrtl-
gregation, where individuals are nestedin time. (The multilevel model can
::.. -rfror-equickly be- be extendedto more than two levels-for example,studentsnestedin class-
..::: to study how the rooms nestedin schoolsnestedin school districts, or four levels.Most ap-
j 1.
..r,.
\ \ e rh
r ' r "ri rtrr
) re
, L6'oLJ
rec
- plications of the model employ two levels,however.)The two-level model
-
T , summarizethose consistsof a "level 1" (micro) equation and a "level 2" (macro) equation.
To fix basic concepts,consider the simple caseof one level 1 regressor(X)
- - . ,r r i n g f o r D e m o c r a tic and two level2 regressors(2, and Z, refl,ectingmacro-levelconditions).
'-. I | ) + n a t i o n a l e l e c t i o n .
' - : . : : ie r d i f f e r e n c e si n v o t- revrl 1 (urcno) EeuArIoN
: 'le l.etlvecn 1984 and The level 1 equation is:
-' : o i e h r n s i n s o o o u l a
: -: . ' . i t h o u tt h e n e w b i rth
Y ,i= Foi+F, jX,i+ (i =1, 2, . . . , J contexts) ( 6.s
)
- -' . . t h e n e w c o h o r t s , w e ",i
: , :r::lbrlred to the growth Y,, is the value of Y for the lth individual in the lth context and X,, is the
uJl,.teof X for the ith individual in the fth context. Equation 6.5'differs
- . . i- r . ' et o h e r r o n r h e is-
. . : : -. : : : . 1 b V u s i n g c h a n g ein
from the standard regressionmodel in that effects are permitted to differ
. . : : ) io n t r o l f o r t h e i n i t i a l across contexts, as reflected by the subscript i attached to the parameters
i: : )' . i . S e eA l l i s o n ( 1 9 90 ) (not just the variables).Thus Bn, is the y-intercept for the ith surve-vand
fr, is the slope (the effect of X on Y) for the ith survey.
L84 . Chapter6
Becauseequation 5.5 permits effects to vary acrosscontexts, it is natu- for estimating multiler': -
ral to use the multilevel framework for changing-effect models where we R a udenbush199f r , t . . . - .
have measurementsat multiple points in time. The key is to think of time appropriate methods it,: .
'With in multilEvEl,ttoglsl5,fo: :
as the contefi. multiwave panel or repeatedcross-sectiondata, then,
we specify survey wave as the "group" and apply the multilevel modeling Although multilevel-:r :
approach to estimate the Bs in equation 6.5 that reflect the changing i ngl y pr act ical as nr . , . . .
effects over time. i mport ant t o em plr r t t r zr: '
To simplify matters, multilevel models generally use deviation scoresto tinue to lack porver it,r .
center X on zero. By centering X on zero, the y-interceprsin equation 6.5 is still limited in the nr(,-: .
are Ys (Y-means).6Thus Bo, is the mean of Y for the first survey,Bo, is the and a paucity of time ', .
mean of Y for the second survey, and so on. few variables.In the c:..
ampl e ,t her e ar e 46. i I
rBvsr 2 (r,recno)EeuArroN ti me cont ext s( sur ver '.. '
Becausewe are modeling both intercepts and slopes,we have an equation one everytwo years.In .::,
for each.For two Zs,the equationsare: time is the contert. -\t '-.-
still lack power.
(6.6a) 'Weturn our attenti(.,.-
Foi=Toot TorZri+ TozZzj+zn, (intercePts) :
(6.6b) chapt er .t he conver sc: l-
F r i= r r c + T rrZ ri + T ttZ z i + 2 ,, (s l o Pe s ) pol i ti c al indicar or sclt . '. -:
Note that the dependent variables in the level 2 equations are coeffi- in a society?
cients from the level 1 equations. The first equation models the level 1 in-
rerceptsas functions of the two Zs.The second equation models the level
1 slopesas functions of the two Zs. I follow standard multilevel notation C oN v t ncI NCE \ loD: -
(Bryk and Raudenbush 1992), where the first subscript for y identifiesthe
level 1 coefficient being used as the dependentvariable (0 for intercept, I S oci a l scient ist sh. t r . , -
for the slope for the first micro-level regressor,and so on), and the second and e t hnic dir isior r - . - . '
subscript for gamma is the conventional notation of 0 for intercept in that srates "ver sus"r cJ - - '. : --
equation,l for the slopefor Z, and so on. Substantively,then: and th e poor . r nJ - -
deepe ningdivisit 'n. " - - -
. 7e1is the effect of macro-level variable Z, onthe Ys across surveys,
i zati on") , m or al r r l: . =- . '
and y,r.is the effect of macro-level variable Z, on the K across sur- arrd g ender .Br ". 1. . : - '
veys. soci e t allevel.so r sf ; . '. ri -
. 7r, is the effect of variance in macrolevel variable Z, on variance in
by rule 6.
the regressionslopes,that is, on variance in the effect of X on Y over To f ind out if dt '. : . - :
time. Similarly, 7n is the effect of variance in Z, on variance in the w i th r epeat edsr uss- : :- .
effect of X on Y over time. In short, the model is designedto tell us uni que r cgr essionl, . : - .
whether cbange in the effect of X on Y over time is associatedwith groups. ( Alt er n. r r ir , . ,
change in macro conditions Z, and Z, group. A singler t r , ''.
Becausethe level 2 equations involve clustering effects that violate key ever.s incea singlr : : : *.
a b o u tth e e rro r te rm, o rd i n a ryleastsquaresi s not appropri ate
as s um pt ions encesin t he gr oup : - . - -
featu r eof t his m ( , J; --
6This follows from the fact that a regressionline goes through the point (X,Y), so when eachgr oup r ef lect .: t . - -
X i s c e n t e r e do n ze r o r h e y- in r e r ce p rit th c p o in r r 0 ,Yr. thought of as n LJr : - : . : - .
The Sirth Rule . IB-5
:- -. - .! -' )ltte rts,itisna tu- for estimating multilevel models (DiPrete and Grusky 1990a; Bryk and
i- : ' * :l: : . : nlod els whe re w e Raudenbush 1992). Severalcomputer packagesare available that employ
-: - :r:r sto thin ko f tim e appropriate methods for estimating coefficients and their standard errors
rt r " . -' -- r3 atiOn d ata , the n, in multilevelmodels (for example,Raudenbush2004).
,: :" . . .: -:-,:rltilevelmodeling Although multilevel analysesof replicated surveyswill becomc increas-
' * ' ' r: :::lect the changing ingly practical as we continue to cumulate such surveys over time, it is
important to emphasizethat our macro-levelmodelsnonetheless will con-
t:- . ' " . -: Jella tion sco res tO tinue to lack power for some time to come. The number t-rftime contexts
": - l:1-: li> ln e qu atiOn 6.5 is still limited in the most popular replicated surveysin the social sciences,
- - : i-l ilirve\-,
F62 iS the and a paucity of time contexts in turn limits the macro equation to just a
few variables.In the caseof the cumulative General Social Survey,for ex-
ample, there are 46,570 respondents(as of the 2004 survey)but only 25
time contexts (surveys),and the surveysare being added at the rate of only
i-: :r.. .,.: h: t |e an equat io n one every two years.In effect, N=25 for GSSmacro-levelequationswhere
time is the context. At the macro level, then, our changing-effectanalyses
sdll lack power.
( 6. 6 a ) 'We turn our attention now to the secondgeneral question posed in the
useful for addressing a wide range of issuesincluding, most notably, is- The qucst ion of c, , : ', . :
suesrelatedto fissuresin society. si gnsof t hc f s ; nd r lr r . '.
The convergencemodel provides a simple way to test hypothesesabour Y is diverging bctu'eer:::..:
the reduction or deepeningof social divisions in a society.For two groups are both positive,fo. e-.,
the model can be expressedas follows, with T (time) and G (group) as the basel i n egr oup on I inr : : ,
indeoendentvariables: ing over time-dir-erge :-.::
Y is convergingbetse;.. :
Y,,=u+ BT+ SrGr+QrtTxG.)+ e,, (6.7)
the trends have crossei ,
The first two terms in equation 6.7, a+BT, refer to the baselineor ref- Differenr group !(\ir .-..
erencegroup, where a and B are the y-intercept and slope,respectively. with different busclins
-
In contrast to the changing-effectmodel, the convergencemodel treats vergi ng,f or exam plc.r , . .
time as continuous. Because time is continuous in the convergence the baselinegroup. Br ..--
model, 7 has no subscript in equation 6.7. The groups, however, are vergenceor divergen.. : -
discrete, and numerical subscripts are used to distinguish them. Thus
ILLU ST R A TI O I \ : AI I IIL DI
G, is a dummy variable coded 1 for the secondgroup. The parameter B
SPEN D I N G O N E D L : C . \ T. ].
is the linear time trend for the baseline group, 6, is the difference in
y-intercepts between the second group and the baseline group, and $, S omeo bser ver sar su( : : . . :
is the difference in the slope for the second group and the baseline A meri cansociet r '- t ui: :
group. i n A mer ica,or gener r r ' -
T he c onv e rg e n c e s mul ti pl e groups (1,2,...,
mo d e l g e n e ra l i z e to G Marx has been suppl::-.:.
groups) in a straightforward manner: tions. The convergencs:r.
case of gener at ionr l\ '. . - - :
Y , , =a+ B T+ 6 rGr+ 6 ,G, + . . .+ 6 c c c + d r (TxGr)
young a nd r he old 'h, . . *
+ @, ( Tx G,)+ ...+ $ " (T x G" )+ e ,, (6.8) issuessuch as fundinc : :
In equation 6.8, as in equation 6.7, group 1 is the baselinegroup. Thus ti on. Considert his qu: : :
the parametersfor the other groups are in comparison to group 1. 'We are faced g'ith :-,.
be s olved easil' o: - ,
The Sign Test for Conuergence: CompaingYour Ss and 6s problem s,and f c, r . . , . , -
we're spcndingtoc, :'-.
To accommodateadditional groups, equation 6.8 adds dummy variables
ri ght am ounr , , . . . \ : ;
(G' etc.) to capture differences in intercepts, and interaction terms
ri ght ar r our r ron -.
(Tx C' etc.) to capture differencesin time trends. It is convenienrto code
cial Security?
T as zero for the first measurement,so a is the predicted value of Y for the
baselinegroup at the first measurementor wave of data. The 6s are dif- Thesei t em s invit e u> : : :
ferencesin y-intercepts between the baselinegroup and the other groups, di vi si o ns acr ossgenc: . : :
so 5, is the difference in y-intercepts between the first group and the sec- spending on educarro: ..
ond group, 5, is the difference in y-intercepts between the first group and S urveysince l97j. n: . . :
the third group, and so on. In other words, the 5s reflect initial group dif- (natsoc)has beenask- - ,
ferenceson Y (frtted by linear regression). Americansbelieveeirhr: .-
Similarly, the @sare differencesin slopes (linear time trends) berween money is spent on eJ. : , . . :
the first group and the other groups. Thus /, is the difference in linear cent be lievet hat t oo - . ,
trends for the first and secondgroups, @..is the difference in trends for the centagehas beendc. l: : . : -
first and third group. and so on. di vergr ngat t it ude: f , , r - - .
The Sixth Rule . 787
To find out, we apply the convergencemodel of equation 6.7 to nateduc and "natural iogarithr:-" .
and natsoc.Becausethose in the paid labor force are paying into Social Se- 10. To convert a lo-uirr....-
curity while retireesare withdrawing from it, a comparison of the attitudes logit coefficientof Ll.v1*.
of paid workers and retirees toward Social Security should be especially BE,;ause the depcni<..:
telling. A comparison of those two groups is strategicin the caseof spend- spendir r g, spendi'r
, ii i . -
s,
ing on public education as well, since both groups pay school taxes even rwo different wfl\ \ ,rrr..r-
though retireesare much lesslikely than paid workers are to have children pendentvariableis clicr, :
in public schools.If attitudes are determined by self-interest,then, we ex- right" or "too little." Ir :r
pect retireeswill be more likely than workers to favor increasedspending ing" versus"about rigir:"
for Social Securityand lesslikely than workers to favor increasedspending Retirees in fnct ilru -
for education.Moreover, if the deepening-divisionsargument has merit, we spendingt oo m uch ol. : . .
should find an exacerbation of these differences over time. of tabl e 6. 1) and less, . .
Table 6.1 reports results for spending on public education. I use logit umn 2) . The ohser r cJ ; - -
models. In a logit or "log-odds" model, the dependent variable is ex- ti sti callysignif canr
i . r n-
pressedas the natural logarithm of the odds of Y, where Y is a dichotomy above t he y- int er ccpt: :
l ogi ts below on "t t , o l: : : . .
1973, the first survel rr...:
TesrB6.1 ucati on.The f inding. ' -
ChangingAttitudestowardSpendingon U.S.PublicEducation: lievesthat we spend rc-tc, :-.
1,97
versus'Workers,
Retirees 3-2004
odds thar a wor ker b. ,
Too much spending Too little spending tion, and the odds that -.-
uersusabout right or uersusabout right or cati on is only ahour h. : -
too little spending too much spending rhat w e spendr oo lir r l,
(lolit) (losit) \il/ith regard to speni.: :
substantialgenerariona j
Convergencecoefficients -
- workers,initial year (5r)
Retirees +0.91.9 -0.613 The results in table 6.1 s
(.0ee) (.06e) suppor t f or spendinqr ': .
- workers,trends( frr)
Retirees -0.012 -0.001 trend (slope)for "roo nr.^-:
( .007) (.004) tle spending" (last rorr' ,-':::
group herew or ker s beclnr c lr . .
Baseline
Interceptfor workers(d) -) )9 +0.1.27 on educat ion, r nd ur ur . . . .
(.051) (.028) But retireesexhibit rhe .,::',
Trendfor workers(B) -0.028 +0.038 the trend differences c, -.
(.003) (.002) l n shor t : O ver r lr r p. . : .
for more spendingon F.::. -
N = 1 9 , 8 1 6 . S t a nd a r de r r o r s a r e in p a r e n th e se s.
Dataz 1973-2004 General Social Survey, adjusted for black oversamplestn 7982 and 1987. De- genera t ionaldif f er en, . . , - -
pendent variable is GSS variable nateduc. \l'orkers are respondents who are working full-time or portive than workers fc,: . -.
part-time in the paid labor force (codes 1 and 2 on the GSS variable wrkstat), and retirees are those ive today. Yet there :. :
coded 5 on wrkstat.
widening.
Model: logit(y,,\ - a + pT + SrGr+ Qr(T x G.)
The subscript "2" denotes group 2. Year is coded I973=0 so the intercepts pertain to 1973.
N ow let 'slook . r t r h: . : -
"Workers" is the baselinecategory and "retirees" is group 2, so 5, is the estimated difference be- ri ry l table 6. 2) . lf r h. r = .
tween retirees and workers in 1973 and @, is the estimated difference in the linear time trends for gested,then we shoul; ::
retireesversus workers. Trends are reported as averageannual change. w orkers over t he piiir : . , .
The SixthRule . 189
- '-: - iJirrtion 6,7 to natedsc and "natural logarithm" refers to the use of the basee instead of the base
r - '-, . : p.r\-il1g into SocialSe- 10. To convert a logit back to an odds, you take the antilog of the logit. A
- :- : - ::'.:rrison of the attitudes logit coefficientof 0.919, for example,implies an odds of e)'ete-2.5L
..,: -":-,.r :, should be esPeciallY Becausethe dependentvariable consists of three categories(too much
- - -: ', ::.:, i n t he c as eof s P e n d - spending,spendingis about right, and too little spending),I dichotomize Y
r- a- .":: I'1. school taxes even two different ways and report results for both. In the first model the de-
l,; : ....::) .rreto havechildren pendent variable is dichotomized as "too much spending" versus "about
-r- .:.:-:n ter c s tt,hen, we e x - right" or "too little." In the secondmodel the contrast is "too little spcnd-
:-- - :.-'.,rrincreasedspending ing" versus"about right" or "too much."
-.
,:.:-i - -..-','(rr increasedspending Retirees in fact are more likely than workers to say that we are
j - - .,::ument has merit, we spendingtoo much on public education in the United States(column 1
':- , -:: .:: iifilf . of table 6.1) and less likely to say that we are spendingtoo little (col-
. - : . . - ., ;d uc at ion.I us e lo g i t umn 2). The observeddifferencesbetween retireesand workers are sta-
: r -, i :r:ri dent v ar iableis e x - tistically significantand large. The y-intercept for retireesis 0.919 logits
::-' : .'.rer eY is adic hot om y above the y-intercept for workers on "too much spending" and 0.513
logits below on "too little spending." The y-interceptsrefer to valuestn
l 9 7 J ,1hs fi rst surveythar asksthe questi onubout spendi ngon publiced-
ucation. The findings indicate that, in 1'973, the odds that a retiree be-
lievesthat we spendtoo much on public educationis eo'ele-2.5times the
odds that a worker believesthat rve spend too much on public educa-
:,;{ Tcto little spending tion, and the odds that a retiree believesthat rve spendtoo little on edu-
tr' ot' ;= 0.54)that a rvorker believes
- | uersusabout right or c a ti on i s onl y about hal f the odds (s
i: too much spending that we spendtoo little.
(logit) \7ith regard to spendingon education,then, there does appearto be a
substantialgenerationaldifference,as leastas of 1973. But is it growing?
-0.613 The results in table 6.1 sr.rggest not. Among workers there was growing
(.06e) support for spending for public education, as indicated by the negative
-0.001 trend (slope) for "too much spending" and the positive slope for "too lit-
(.004) tle spending" (last rou, of table 6.1). In other words, over the time period
here workers becameless likely to believethat we are spending too much
+0.t27 on education and more likely to believe that we are spending too little.
(.028) But retireesexhibit the sametrends, as indicated by the nonsignificanceof
+0.03
8 th e tr end di fferences(drs).
(.002) In short: Over the past three decadesthere has been growing support
for more spending on public education in the United States.And there are
: : . . rnrp l e si n 1 9 8 2 a n d 1 9 8 7 ' De -
generational differences:Thirty years ago retirees tended to be less sup-
r -: .: r: r: : \ \ 'ho a r e w o r k i n g f u l l - tim e o r portive than workers for such spending, and retireesare still lesssupport-
: - : ' : \\ rkstat), and retireesare those ive today. Yet there is no evidence that the generational division is
widening.
Now let's look at the trend in attitudes about spending on Social Secu-
-- : - . : le in t e r c e p t sp e r t a i n t o 1 9 7 3 .
.- ,-: - I : s t h e e s t i m a t e dd i f f e r e n ceb e -
rity (table 6.2).If there is generationalpolarization, as some have sug-
- - -: - : - '. : : : : ; c t n t h e l i n e a r t i m e t r en d s fo r gested, then we should observe an increasing gap between retirees and
: . .:..:.
' workers over the past two decades.As expected, we find that in 1984
790 . Chapter6
T,tsrg 6.2
versus\)forkers,
ChangingAttitudestoward Spendingon SociaiSecurity:Retir:ees
1984-2004
Convergencecoefficients
Retirees-workers, initial year ( 6r) -0 .882 -0.345
( .171) (.06e)
Retirees- r.vorkers,trends ( $, ) + 0 .014 -0.003
( .014) (.006)
Baselinegroup
Intercept for workers (a) - z.-) r +0.062
(.053
) (.02e)
i
retirees were less likely than workers to believe that we spend too much
on SocialSecurity:According to the table 6.2 estimates,in 1.984the odds
that a retiree believesthat we spend too much on Social Security were less
than half the o{ds that a worker believesthat we spend too much on So-
cial Security (e 0 882=0.4).Contrary to the polarization thesis,however,this
gap did not grow over the next two decades.since @, is not statistically
significant. \7e conclude, then, that the time trends for retirees and work-
ers are parallel.
An interesting wrinkle appearsin our results when the dependentvari-
able is coded as spending too little on Social Security. Recall that retirees
rn 1984 were lesslikely to say that we spend too much on Social Security.
The wrinkle is that retirees were also less likely than workers to say that
we spend too little on Social Security. In other words, retirees were more
likely than workers to say that spending on Social Security is about right,
whereasworkers were more likely than retireesto say that the spending is
The S i xthR ul e . l9l
off one way or the other, either too high or too low.7 Interestingly, work-
:::-ej lersus workers, ers over the past two decadeshave become more likely to say that we
s p e n dtoo l i ttl e on S oci alS ecuri tytB = 9.922. hi ghl y si gni fi cant) Ycr
. r e-
tirees have also becomemore likely to say we spend too little, and the gap
Too little spending
t,ersusabout right or between retirees and workers has remained stable from 1984 to 2004
too much spending (dz=-0.003, not significant).As we saw, the gap is constant also in the
(logit) caseof "spend too much." In other words, there is neither divergencenor
convergence;the trends are parallel for workers and retirees.
\Wefind no evidenceat all, then, for the notion that generational divi
-0.345 'We
(.06e) sions are growing in the United States. do find evidenceof growrng
-0.003 support for increasedspendingfor both public education and Social Secu-
(.006) rity in the United States, at least over the time periods examined here
(1973-2004 in the caseof public education and 1984-2004 in the caseof
Social Security). Critically, though, this support has increasedin tandem
+0.062
for retireesand workers. Even with a sample size close to 20,000 for both
(.02e)
issues,we found no statistically significant differences in the trends for
+0.022
(.002) workers and retirees.There appearsto be no generationalwar around the
corner.
Finally, it is important to stressthat convergencemodels should not
: : ). rm p l e s i n 1 9 8 7 . D e pe n d e n t
--: rr g .i u l l - t i r n eo r p a r t -tir n ein be estimated blindiy, rvithout first becorning familiar with one's data and
: ': : : t i r e e s a r e t h o s e c o d e d 5 o n building reality checksinto one'sresearch,as dictatedby rule 3. It is pos-
sible, for example, that change in Y is not monotonic over the time
interval-Y might exhibit a secular increaseover the first part of the in-
- r r e r r ' e n r \n e r t a i n t o 1 9 8 4 .
i :. rhe estinrateddifference be- terval and a secular decline over the second part of the interval. Or Y
--1. . r h e l i n e a r t i m e t r e n d s fo r
might change monotonicallv for some groups but not for others. It is a
', r good idea, then, to graph the mean of Y over time before estimating con-
::-.::.
vergencemodels. In some instancesthese graphs will show that the time
interval needs to be broken into smaller periods or "periodized" for
-,: : : li-][ $'e spendtOOmuch analysis.
: - - : -t.:tl .l tesin
, 1984 t he od d s
whether the associationof X and Y has changedfor society in general.Re- public cducationquest:' : , -
peated cross-section data can be analyzed using either changing-effect (therewas no 2001 sLr : ', . ,
models or (if there are enough measurementpoints) convergencemodels. geststhat ir chungir r g- . - -.,-
Changing-effectmodels treat time as categoricaland use regressionmodels reportc dabuue,I r r r hr . . .
-
of the individual-levelrelationship of X and Y to ask whether the effect of cent of wor kcr s and . l s. . - .
X on Y has changed over time. Changing-effectmodels can be descriptive tle on public educarro:. -.
(has the association of X and Y changed over time?) or causal (has the worker:sand retirees.I:' :r.
causal cffcct of X on Y changed over time?). Convergencemodels are de- ures had jumped ro a l
scriptive models that focus on change in Y at the aggregatelevel. Conver- ti rees,a dif f er enceof 1- : - '
gencemodelstreat time as continuoustnot categorical,and use regression Although there is Dlliur .-:
to model the trend in Y over time. As the name suggests,convergencemod- publ i c educat ionr h. r u: . . : -
els ask whether differenr groups are converging (or diverging) on Y. port has incr easedcq, r . - .
When applied to social divisions, the convergencequestion (are group Thes e im pr essit r nr , . '. -
trends converging or diverging?) and the changing-effectsquestion (have S oci alSur veyconsi'r -
group differences changed?) are similar but not identical. A changing- of the changing- ef f cir- ' -
effect test is often a natural supplement to a convergencetest. For the ed-
Y ,,= o r +( a. - o, J- - . - '
ucation and Social Security examples above it would be useful to know
whether we would have reached the same conclusions about the stability Recall that the suf'.,: ::-
of generationaldifferenceshad we used a two-wave changing-effectmodel model. The indepenclci-:.':
instead of a convergencemodel based on all the relevant waves of GSS ous or d iscr et e.I n r hr , . . '
data. On the basis of data fbr the two endpoints, a changing-effect test w e code X as a . lLr r r r : : . . . .
tells us whether the difference in attitudes between workers and retirees is this model:
larger or smaller in the initial and final years. Although the two-wave '.
Y i ,= dt +\ az- a. , T - :
changing-effectmodel discards data for intervening years, it can still be a
powerful method becauseit is basedon individual-level data. Tabl e 6. 3 r epor r . ': . r -
The convergenceand changing-effecttests are not redundant. The con- GSSdata on attirud.> : ,,,,:
vergencemodel usesdata for intermediate years, as well as the endpolnrs, S tares.As bef or e.r hr : : : . - '
to determine if there is linear convergenceor divergence.If there is linear spendi n gver sr r slf ', ) Llr- - - -
convergence,the difference between retirees and workers will be smaller or too m uch. Resr r lr ..
in the final year than in the initial year. If there is linear divergence,rhe The changing-efiee r ::. -.
difference will be larger. The converse is not true. A larger difference at on publ ic educat ion\ \ . : . . -
the final year (ch?nging-effectmodel) does not necessarilymean that rhe i n 1973- 7S- wor kcr . : -
trends are still divdrging, since the trends might be nonlinear, with the di- too much on educ. r r r . , -. . ' .
vergenceoccurring earlier in the period. For the same reasons, a smaller (dr- du colum ns 1 an; I -
difference at the final year does not necessarilymean that the trends are the conver gence nr oJ. . . - . , -
still converging. In other words, while the results of a two-wave changrng- ti on 6.10 also squar e. . '.: :
effect model can tell you whether the effect of X on Y has changed over i n the m id- 1970sr r cr . . - '
time, it cannot tell you when the change occurred, and whether it is con- we spendtoo much on :-.-
tinuing. w orkers t o say t hr r \ \ q . - . . -
Consider again the issueof generationaldifferencesin support for public The key f inding in r : - . -
education in the United States,this time investigatedusing a changing- teraction term rel,et'-. ' .'
effect model. To enlargethe samplefor the changing-effectanalysis,I com- cati ng that r he dif f er . n. . l.
bine the data for 1973-7 5 (the first three surveysin which the spendingon sameas it was thirtr' \'e.rr: :,,
The Sixth Rule . 193
^ -i-:, r : : io.iet,vin general.Re- public educationquestionappeared)as well as the 2002 and 2004 surveys
-;- .- .: ;irher changing-effect (there was no 2003 survey). A quick inspection of the percentagessug-
-,:-. - ..:: aonvergence models' geststhat a changing-effectanalysiswill yield the samestory as the results
': : -." ::'.j rse regressionmodels reported above.In the combined 1973-75 surveys,for example,53.8 per-
,: r : '..< n'hether the effect of cent of workers and 38.8 percent of retireesthought that we spendtoo lit-
i-: --: -- ': .iel: can be descriPtive tle on public education, a difference of 15 percentage points between
j :' r rr-.ri or causal (has rhe workers and retirees. In the combined 2002 and 2004 surveys,those fig-
--' , r:qencemodels are de- ures had jumped to 76.0 percent for workers and 62.0 percent for re-
: - . ::srcgate level. Conver- tirees, a difference of 14 percentagepoints between workers and retirees.
'' - . : - -.-.i 1.. r r r Jus er egr es s i o n Although there is much greater support today for increasedspending on
-j:-' :' . -:-rts. ConvefgenCe mod- public education than there was three decadesago, the change in that sup-
::: -- :.rl \'!'r ging)O n Y . port has increasedequally among retirees and workers.
- :: (are group These impressions can be tested more formally. Becausethe General
- -::-..i qllestion
: - - r--', :-rffccts ques t ion( h a v e Social Survey consists of repeated surveys,I employ the interaction form
| . --. . ientical. A changing- of the changing-effectmodel:
: - - ' ::ience test. For the ed-
Yi ,= dt+ l az,-ar)T,+ B rX ,,+ @ 2- B r)i ' X i /71)+ e,t 6. 9)
r, : : , '-:iJ be useful to know
-: Recall that the subscripts L and 2 refer to time in the changing-effect
- , ..:.(rnsa bout t he s t abi l i ty
-, i, :.h .1nging- ef f ec t m o d e l model. The independentvariableX in equation 6.9 can be either contlnu-
- : : -.: :elevantwaves of GSS ous or discrete.In the erample under considerationhere, X is discrete.If
:-ir ::. .t changing-effecttest we code X as a dummv variable for retirees(X=1 for retirees)rve have
': -:r , : r:-.\\-orkersand retireesis this model:
: ,-- \lrhough the two-wave Yi ,= dt+ (az-arl T,+ B rreti rees+ l h- P t)(reti reesxT)\+si t ( 6. 10)
- i: - : -.r--ive ar s ,it c an s t ill b e a
:: : , .....-l e leldat a. Table 6.3 reports results for the application of equation 6.10 to the
::-:- , J -.(rtredundant.The con- GSS data on attitudes torvard spendir-rgon public education in the United
:: : , : -. ,...rl ell as t he endpoi n ts , States.As before, the dependent variables is dichotomized as too much
-: : - , I f t her e is lin e a r spending versus about right or too little, and too little versus about right
-::. .. : -:se nc e. will be smaller
.,..orkers or too much. Results closely resemble those for the convergencemodel.
- : : - . -. :. L i rr eardiv er genc eth , e The changing-effect model indicates that workers' support for spending
- - : ::..:. .\ larger differenceat on public educationwas substantiallyhigher in 2002-04 than it had been
:.cssaril-v mean that the in 1.973-75-workers in 2002-04 were less likely to say that we spend
' : lt ()nlin ea r, with rhe di- too much on education and more likely to say that we spend too little
: : ::'.. a,1mereaSOnS, a Smallef (a2- a p columns 1 and 2) . This result squareswith the trends observedin
::.,i. : :li.lfl that the trends are the convergencemodel. Results for the retirees dummy variable in equa-
r:--- r- ,)i a f\\'o-wavechanging- tion 6.10 also squarewith results from the convergencemodel. Retirees
:-. r .\ (rn )'has changedover in the mid-1970s were significantly more likely than workers to say that
'- - -- J..rnd uhet her it is c o n - we spend too much on public education and significantly less likely than
workers to say that we spend too little.
: ::: ::rics ir-rsupportfor public The key finding in table 6.3 is that Gl r- b ,1, rhe coefficient for the in-
: -.rt.J us ing a c hang i n g - teraction term retireesx T, falls well short of statistical significance,indi-
ysls,I com-
::!r-efiectanalvsis. cating that the difference between workers and retirees in 2002-04 is the
n'hich the spendingon sameas it was thirty yearsearlier. On the basisof this finding we conclude,
794 . Chapter6
Tasls 6.3
ChangingAttitudestoward Spendingon U.S.PublicEducation:Resultsfor
Model, 1973-7 5 versus2002-04.
Changing-Effect
Convergence
coefficients
Retirees-workers, initialyear(Br) +0.767 -0,610
(.140) (.106)
- workers,trends(Br* Br)
Retirees -0 .013 -0.002
(.00e) (.006)
group
Baseline
Interceptfor workers(&,) I )1
+0.154
(.0711 (.043)
Trendfor workers(42-Ar) -0.022 +0.034
(.004) (.002)
N=4,761. Standard errors are in parentheses.
Data:7973-75 and 2002-04 General Social Surveys. Dependent variable is GSS variable nate-
duc. Workers are respondentswho are working fulltime or part-time in the paid labor force (codes
1 and 2 on the GSS variable wrkstat), and retireesare those coded 5 on wrkstat.
M o d e l : l o g i t ( Y,,) - - a r + ( a .- a r ) T r + Br r e tir e e s+ ( 8 ,- Br)reti reesxT,, rvhere teti reesj s a dummy
v a r i a h l ec o d e d I fo r r e tir e e s.
In the changing-effect model, the subscripts I and 2 denote time. Year is coded 1973*75=0 so
the intercepts pertain to time 1. Note that all the variables,including time, are dummy variables,so
"linear trend" here is reflected by differences in intercepts. To facilitate comparison with results
from the convergencemodel (table 6.1), coefficients for workers'trend (a,-a,) and trend differ-
ence (Fr-Fr) were divided by 29 to convert from total to averageannual change over the period
1973-75 to 2002-04.
as before. that rhe rrends for workers and retireesare neither converging
nor divcrging. Viith the changing-effect model, however, the evidenceis
less direct since we do not know what the trend was between 1975 and
2O02-something that we can readily examine under the convergence
model. The evidencefrom the changing-effect model is also based on a
smaller sample.
In this instance, then, the changing-effect model and the convergence
model tell the samestory,with effectsizesthat are roughly similar (table6.4).
This will not always be the case.Becauseit usesall the waves of data, the
convergencemodel provides a more direct test of whether trends are con-
verging or diverging. On the other hand, changing-effectmodels are more
widely applicable in the social sciencessince they require only two waves
of data. They ask whether effects are the same at two different points in
time-a more delimited issuethan the issue of convergence.Convergence
The Sirth Rule . 195
Trnrp 6.4
: . --: - r: ,r: Resultsfo r ChangingAttitudestoward Spendingon U.S.PublicEducation:ComparingResults
for Convergenceand Changing-Effect
Models
r, - .. . .:. :. Too little sPending Too much spending Tto little spending
: " -' ': ' L'ersusubout right or uersusdbout right or uersu.sdbout right or
too little spending too little spending
i (logit) (logit) (logit)
Convergencecoefficients (both models)
- 0. 610 Retirees- workers, initial year
( . 106) Convergencemodel + 0.9I9 -0.613
- 0. 002 Changing-effect model + 0.767 -0.610
( . 006) Retirees- workers, trend difference
Convergencemodel
+ 0. 154 Changing-effect model
( . 043)
+ 0. 034 Baselinegroup (both models)
( . 002) Intercept for workers
Convergencemodel -2.29 +0.I27
Changing-effect model I tl
+0.154
-. -: . : ri; rb l e i s G S S v a r i a b len a te - Tr end f o r w o r k e r s ( a n n u a lc h a n g e r
: :r. in the paid labor force (codes Convergencemodel -0.028 +0.038
.: ' ,,n *'rkstat. Changing-effect model -0.022 +0.034
: , 1.. rlhere retirees is a dummY
No /e :Resul tsfromtabl es6.l and5..l .\=l 9.S l 5forconvergencernodel and4,76 l forc hangi ng-
': . . \ 'eir r i s c o d e d 1 9 7 3 - 7 5 = 0 so effect model.
-: . : : : : inre ,a r e d u m m y v a r i ab le s,so " Not statistically significant.
c o m p a r i s o nw i t h r e su lts
: --. . : , it a r e
- : : rnd ( a , - 4 1 ) a r - r dt r e n d d iffe r -
r: : -rinu a l c h a n g eo v e r t h e Pe r io d
models will becomeincreasingiyuseful in the social sciencesas we accu-
mulate more and more measurementpoints in our data sets.In the mean-
time, with changing-effectmodeis we can often learn quite a bit from just
r::i Jre neither converging two waves of data, particularly if the measurementinterval is appropriare
.:'.rr\\'c v eff ,he ev idenc ei s fo r th e changei n questi on.
-.J \\..rsbet."veen1975 and
:'.: Lrnderthe convergence
: :r'.odelis aiso based on a BnrncrNc INorvrouer AND SocrAL CneucE: EstruetrNc Cononr
Rp,plecrlrpNT EFFECTS
::. ,Jel irnd the convergence
'Where
: :, ,r.r9hlr-similar(table6.4). does social change come from? By social change I am referring to
. ,..11the waves of data, the change on some aggregateattribute for a societS such as change in per-
': $'hethertrends are con- centage of peopie who vote or percentageof people who attend church
:rl-effecrmodels are more regularly. There are two major proximate sources: Individuals can
:', require only two waves change, and the composition of a population can change (Ryder 1965).ln
,:-:r\\'o different Points ln other words, change in public attitudes could be due to changing public
-,:i\ ergence.convergence opinion or to changing publics.
196 . Chapter6
for example, that party preferenceis fixed for all individuals after age eight- S up posewe ha\ € iiir , .
een: Once a Republican always a Republican, and similarly for Democrats trni tsl individuals,r n : : . -
and Independents. Percent Republican could nonetheless increase within mon i n t he socialSur r - - .- .
birth-cohorts if Republicanstend to outlive others.Similarln the averageed- anal ysisas cum ular . J - . : :
ucation of a cohort could rise over time simply becausethe less-educated Lineardecomposirir)n
: ..
tend to die younger than the better-educated. Sfe could detect this sort of rnerge ddat a. and t lr c - . . .
effect with panel data but not with repeatedcross-sectiondata. Thus in in- threev ar iables{agc.\ : . : :
stanceswhere there is reason to believethat differential mortality or migra- The 6r st st ep ir r {', : - . .
tion is relatedto Y, it is a good idea if possibleto comparethe within-cohort year an d r espondent '.- , - , .
trend in repeatedsurveyswith the comparabletrend in panel data. the sake of dar a dr . p. . . .
cohort int o br oaderLr . : - :
Line ar D ecomp osition Metb o d l arl y i f t he sur verr . r r r j .
itself, however, one-\'--:
The cohort replacement accounring scheme divides total social change shorthandI r r ill r . f . , , lt , l
over some time period into (i) the part due to cohort replacementand (ii) as rhe appr opr iat cn'. : - -
the part due to aggregatedindividual change. The magnitude of the co- bi rth-y earand sur r . r . '- '
hort replacementeffect is determined by the size of cohorr differencesfor ti on of t he y- ir r t er cr F:..: .
the cohorts rotating in versus those dying off and the rate of cohort re- surveyand bir t h- re. r ". : .
placement. The contribution of aggregated individual change to social The next st epi< r o: - - - -
change is determined by the intracohort trend, that is, by the average vey (yeart hat r he r nJ ' *
change over time within cohorts.
Partitioning of this type makes senseonly when the pace of cohort re- Yu= Bor +B r , St t r t 'i^.
\..:'
placement is commensuratewith the pace of social change, so that cohort
E quation 6. || r ' r r c: : : :
replacementis a plausible sourceof the change.It would be pointlessto in-
val ue o f Y f or an in; - -
vestigatethe contribution of cohort replacementto the weekly or monrhly
the i nd ividualuJ:
swings in the popularity of a sitting U.S. president, for example, since the '. : : . -
From t he r esulr . , '- : -
ups and downs of presidential popularity happen at a much faster pace
changeon Y. To ASSI >: : ': -
than the pace o{aohort succession.It would also be pointless, ar the other
I to su r vey f is d<n: . . ,
extreme, to investigatethe contribution of cohort replacementto change
those meansb1 t akir - - -
in voting patterns from, say, 1910 to today, since there is no overlap of
and l ast SLlr V€VSnr - ,r '
voting cohorts-in a vacuoussenseall change since 1910 is attributable to
cohort replacementbecausethe replacementis complete. In the discussron E(Y,,)=Y,=F,,r-i )r.
that follows, we assumethat the pace of the social change of interest is in
ElY,r\=lr= B, r- r
fact commensuratewith the pace of cohort replacement,so that cohort re-
placement could sensibly account for some or all of the observed social SurueyYear, is ther-e;:::.
change. \7e also assumethat measurementintervals for the data are ap- theaverage yearol 1..::
propriate for the pace of social change under investigation, so that the for the Tth surver-.
data indeed capture the change. Socialchangein ) :: -
Linear decomposition provides a way to separate out the cohort re- Y from time 1 to rim. :.
placement component of overall social change. It is called linear decom-
position becauseit assumeslinear within- and cross-cohort change; other nY=Yr_Yr=Ft\;,
methods are possible when linearity assumptions are nor met (Firebaugh whereA denotes chrn!.. -
1989. 1997 \. time 7 (for example.; =
The Sirth Rule . 199
Equation 6.14 is the key equation in linear decomposition.On the ba- Finally, let me strc:: -.- ' :
sis of equation 6.14 we can partition social changeinro two components: nratesuur ces :. . .
ul clr lr r r gc.
using linear decompo:r::-
. The first component, BrLSurueyYeal, is the contribution of aggre-
look for rhe underlriui .....-:
gated individual change to overall social change from survey 1 to i ndi vi dualconver sion: - . : '
survey T. Recall that B, is the linear within-cohort slope (a weighted tion composition(asoi;.: .
averageof all the within-cohort slopes),or averageannual change is an irnportant first sr.: .'
in Y within cohorts. To get the total contribution of aggregated
individual changeto social change over the entire time inrerval, then,
we weight the average change per year (Fr) by A,SurueyYear,the
SuuuaRv
number of years from the first survey to the last survey.
r The second component, BrABirthYear, is the contribution of cohort
R ul e 6 dicr at est he r r >: - :
replacement to overall social change from the first survey to the last
use of r epeat edur osr - - ; - -
survey. B, is the linear cross-cohort slope, or averagedifference in Y
useful f or bot h descr : : : . . -
from one cohort to the next. To get the total contribution of cohort
over time, panel data .. :-i i
replacement to social change over the entire time interval, then, w-e
to rel no vet he enJt r r r n- : - - :
weight the averagecross-cohort change l!z) by ABirthYear, the dif- causalmodels.
ferencebetween the averageyear of birth for respondentsin the first P ane ldat a and r cp- '. : '.-
survey and the last survey. To illustrate, supposethe averageage for gate the f ir st gener . r .l '. . .-
respondentsis forty for a survey taken in 1980 and forty-two for a questj o n:Hcs t he . f r - , :
survey taken in 2005. The 1980 respondentson averagewere born ask w het hert he clic- :
i"J279rgd 2005 respondents on average were born in 1963, so al s.\X /it hr epeat edCf , : : - - : -
LBirtbYear=1963-1940=23 years.To esrimatethe contribution of has changedin t hc - - - . -
cohort replacementto social changein Y from 1980 to 2005, then, poi nts a r e r equir eJ : . .
we would multiply Frby 23.
tw o w a vesor t \ \ 'o ai, - - - - - -
The linear decomposition method has been used in a number of recent in- associ a t ion changeJ: : '-
vestigationsof social change (for example, Alwin 1996; Brewster and Pa- causeso r cor r ellt er . : : - .
davic 2000). The key assumption is that the effectsare linear and additive. mentpo int s.Wit h cr : , : -
In particular, equation 6.14 assumesthe same intracohort and intercohort models with time l' ::'.- - -
slopesat times 1 and T(compare equations6.12 and 6.13). We can check B y c ollecr inguf J. : : : - . .
the plausibility of our assumptionsby summing the trvo components.Gen- poi nt, repeat edcr o. r - . =- -
erally the two componentsdo not sum exactly to the differencebetween Y, ti gati ngsocialchanq. . l- , '-
andYn but the discrepancyshould not be large. If the components do not for addressingthe st; :-: .
sum approximately to Yr-Y' the problem could be survey-to-survey convergencequestion H,,-
bouncinessin the Y-trend. In that caseperhaps more reliable estimatesof al l groups?)and t hc s, '. . .
the overallchangein Y might be obrained by using regressionor moving i n Y ca n be accour r r . : -
averagesto smooth the trend in the Y. If a large discrepancyremains after popul at ion?)The
. enr - '.- - -
smoothing, the underlying assumptionsof the linear decomposition model interaction terms to i::::'-'
are dubious, and another decompositionmethod should be used. i ng on Y. The conr - : l'. -
The linear decomposition merhod is simple-making very few data be estim at edusing t r : . - , - '
demands-and elegant. It applies to the study of social change for binary regress ionof Y or r hi: : - - -
variables(changein proporrions or percentages) as well as to the study of The issuesin t hi. . : '. : : -
chanse for continuous variables. the quant it at ivest uJr
Thc Sixth Rule . 201
should continue to improve in the future as ongoing panel and repeated A P P E, NDI X
survey data sets steadily add new waves of data. It is likely, then, that the
issuesof this chapter-the investigationof changeor stability in individual-
leveleffects,the investigationof convergenceand divergenceof groups, and StudentExercise
s
the investigation of cohort replacement'scontribution to social change-
will be central to social researchin the twenty-first century.
A ssrcNvr Nr l: Cor i, . -
IN A TTITU D E S A B .I ..- .
[n most of t he sur r , . - -
labeled racmar:
Do you t hink r L. - ,
groes/Blacks/-\ir.-.. .
B eginningin l9Et r . . . . : -
as to whit e Am cr ir ', : ', :
cans sey that ther f.: . -
exercisewe f ocus o" ''
A s we will .:: -
'ee
w hi te adult s u hu . . . :
l astt hr eedecade. r. : - ' -
est trendsobser ve. : ' - -
declinewas dr ir '. n : - -
chan gingr heir vicr i . '
to answert hat qr . r . . : .
Th e exer ciseior - - : - : '.
trend f r om 19- l r , -
i- : - :-.spanel and repeated AP PE N D IX
: :; . . , .: .s l i k elr ' ,t hen, t hat th e
-- :-: : :.tabilit v in indiv idu a l -
,J: ',,- I - r',a:genceof groups, and StudentExerciseson Rule 6
, - - :. ,n to social change-
.: - - - ':-.i n tur ) - .
The exercisesagain use the 7972-2004 data from the General Social Sur-
vey. Go to the web site http://sda.berkeley.eduand follow rhe instrucrrons
below. By following the instrucions you should be able to generateall the
statisticsyou needto answer the questions.Variable namesare in boldface.
Note: Recall that the figures given below arc tor the 1972-2004 GSS,the
most recent GSSdata avaiiablewhen this book was written. Thus if you are
working with more recent data, to replicate the figures eractly you will
need to use the "seiection filter" in the SDA program to remove years after
2004. Also recall that the format in the SDA web site might have changed
somewhatsincethesedirections were written, so some of the specificdirec
tions below might be out of date. You nonethelessshould be able to follow
the logic below to obtain the tablesyou need to answer the questions.
In most of the surveys from 1972 to 2002 the GSS asked this question,
labeled racmar:
Do you think there should be laws against marriages between (Ne-
groes/Blacks/African
Americans)and whites? (yes/no)
Beginningin 1980, the questionwas posed to African Americansas well
as to white Americans. Horvever, only about 6 percent of African Ameri-
cans say that they favor laws against black-whire intermarriage, so in this
exercisewe focus on the trend for whites.
As we will see shortl,v,the GSS results indicate that the percentageof
white adults who say they favor such laws has declined sharply over the
last three decades,from 4 in 10 in 1972 to 1 in 10 today, one of the steep-
est trends observed in the GSS. It is important to determine whether the
decline was driven primanly by cohort replacement or by individuals
changing their views on interracial marriage. Our aim in this exerciseis
to answer that question.
The exerciseconsists of five steps: Recode racmar; look at the racmar
trend from 1972to 2002 (the question was not asked in 2004): creare a
Chapter6
variable called birthcohort and calculate change in the mean of birthco- shou ldobt ain ir r c{r r : :
hort from 1.972 to 2002; regress racmar (recoded) on birthcohort and for birthcohort.
year (year of survey); finally, estimatethe contributions of cohort replace-
ment and aggregatedindividual change by inserting the appropriate val- Question1. How do r',.-:
Which one reflectsrriti..:.
ues into equation 6.14, above. Here is some help on how to do that:
cohortchange?Erplairr,"
o Use the recode command (found in the pull-down menu under "Cre-
ate Variables") to create a new variable racmar2 that is coded 100 -
Question 2. (a) Use equ.,.:.
for those who think there should be laws against interracial marriage contributions of cohorr i i. . .,- ; -
and 0 for those who think there should not be such laws. (We use the 1972 - 2002de'lr r r c. . . . . . . :
100 instead of l becausewe want to use percentagesinstead of marriage.Round off r c".r::-: :-.
proportions-percentages are more convenient for the decomposi- ':
\flhat do you findr (HI\l: -
tion later.) For the 1972-2004 cumulative data set, you should find counts for about nvo-th^:-, - .
22,155 respondentswho say they opposesuch laws, and 6,629 who changeaccountsfor a1.,,-:: -: r-
say they favor such laws. (b) D o the t wo com pon. : ': - - - '
r The second step is to examine the trend over time by crosstabulating the percentagesfor racnrrrl ''
racmar2 (row variable) with year (column variable). Because we i q i mnorrnn r
want to restrict the analysisto whites, enter "race(1)" in the box be- i c) S umm ar izeyour r cr L: . : - . .
side "selection filter(s)." Select "No weight," and run the table. replacementis the larger r ":: -
Note the decline in the percentageof those who think there should
be laws against black-white intermarriage.
r Now use the "Compute a new variable" command under "Create A ssrcrl ' , t ENr2: CH r r e t r :
Variables" to create a variable called birthcohort, as follows: birth- P uerrc Epucer r ox: R. . - , , '
c ohor t -y e a r-a g e - 1 8 8 3 . T h e v a l u e " 1883" i s addedto setbi rthco-
hort at zero for the first birth-cohort in the sample. (The oldest The challengein this .r!) - - :-
people in the GSS cumulativedata were born in 1883.) We do this chaptera nd in t he nolr - : ' -
for conveniencein interpreting the y-intercept. table. You can use the E.:. . ,
o Next we need to find the mean of birth-cohort for 1972 and 2002,
l ogi t analysisis one ( r : : : : :-
the first and last years we are examining. Go to "Comparison of " A nal ysis") . M ake sur . : :
means" under "Analysis." Enter birthcohort as the dependent code the durnmy r-ari.ti,.. : .
variable and year as the row variable. Again use the filter "race(1)"
to restrict your analysis to whites. Also enter racmar2(0-100),
birthcohort(0-103), and year(1972-2002) as fiiters. These filters Hints
are necessaryto make the sample here consistentwith the sample . You will needto ::- *:
you will use in the next step,the regression.Your samplesizeshould
create two dumnr" ..:.: r - :
be 24, 6 7 7 . -
i s coded 1 f or "sp. : : : :
e Now go to "Multiple regression"under "Analysis." Estimateequa-
tion 6.11 by regressingracmar2 (dependentvariable) on year and
roo lit t le. " Tlt c . . . -. : .
rl e" and 0 f or ". r i'' ..: - - '
birthcohort.e Rememberto use the filter "race(1)" to restrict your e R ecodewr kst at t ( r a: : . . . : -
analysisto whites. Select"No weight," then "Run regression."You
for wr kst at =5 ani r . : : : . ' =
eln this casewe arrive at the same conclusion rvhether we use logistic regressionor Ol-S r C re at ea neu vr r i. : i'. : : - -
regression so, to simplify matters, the assignment asks for OLS even though the outcome to th e f ir st ) ear . I - - :
variable is dichotomous. o C re at et he int er r ; . . .'
t":ril
ChapterTitle . 205
:i: -r',.rre in the mean of birthco- should obtain a regressioncoefficientof -0.318 for year and -0.122
:r.rr :;;oded) on birthcohort and for birthcohort.
.: - -riihutions of cohort replace-
Question 1. How do you interpret the coefficients -0.318 and -0,722?
: :', ::lrcrting the appropriateval-
Which one reflects within-cohort change and which one reflects cross-
:---:.:clp on how to do that:
- cohort change?Explain what these coefficients mean, in plain English.
:. :-: :.ril-down menu under "Cre-
:: ..:.;- tacmar} that is coded 100 Question 2. (a) Use equation 6.14 and a hand calculator to estimate the
',: ,r',',i against interracial marriage contributions of cohort replacementand aggregatedindividual change to
,..; not be such laws. ('Weuse the 7972-2002 decline in support for laws prohibiting black-white inter-
--.
:' - :" use percentagesinstead of marriage. Round off your estimatesto the nearestone-tenth of a percent.
:: - ,irlenient for the decomPosi- V/hat do you find? (HINT: You should find that cohort replacemenrac-
-:-.,-.,r:i'edata set, you should find counts for about two-thirds of the decline and that aggregatedindividual
'
l:^, )\e such laws, and 6,629 who change accounts for about one-third of the decline.)
(b) Do the two componenrs sum approximately ro rhe observeddecline in
-:::-.i orer time by crosstabulating the percentagesfor racmar2 from 1972 to 2002? Explain why this result
.ir ;,rlLrmn variable). Becausewe is important.
:. : : i. . itte r "ra c e( 1) " in t he bo x b e - (c) Summarize your results in a paragraph. Are you surprised that cohort
-\ n'eight," and run the table. replacementis the larger contributor to the decline?Why or why not?
:: : :hose who think there should
This givesyou the variablesyou need to reproduce the results in table 6.1. C H A PTE, R 7
Statisticalestimatii,: :- .'-
and resear ch. lct il: . . : .
t() the researchqr.c-:. , .
-i.
designand qucstr,r;--. .
ti ce mer hodear rlr c-
parti c ularm er hoJ : : : , :
than thc method dr's:-,
l n this bt - r okI lr : .
ods bu t have lo. L: : :- -
phasison det iqr r, . : - - -
Ii ttl e value in lr s, - j : -
si gn right - \ o \ \ ': : . - . - . '
refining our estinr-r:
ti cs and econor "- : - - - '.
resear ch. Fin. r ll'. : '.
uni maginat ir e r c. ; . - - - -
searcht hesedr r r : . r \ . . - - -
resear chdesign. .T'- - - .
searchr odal li. . : : : : - : : . -
conj un ct ion r r ir h . : : . : . -
methods per se.l
In t har light . r r r . -
j
i s i mplicit in m u. l:
[O Sefv e a s a L O L I l l t i : .r :
-
wright2004,p.lt': ..-',
better r esear . h.) , , . : . , :
w ho wor ship at r : . '- : . -
thi nking t hat r ceh: ' - ,
subject matter, careful theorizing, and appropriate research design.2As to end on a positile :: :.
social researchersour aim should not be researchwith the most bedaz- scribe the domin.rni. I
zling estimation method-in effect making method the master-but re- pline, sociologv.
search with the most appropriate researchdesign and estimation method
for the question at hand. Often simplestis best. If percentageswill do, then
use percentages. Onsrs sr oN u't r H R: . , '
There are two ways that statisticalestimation can becomethe master rn
social research. The first is through the hegemony of a single statistical To gauge t he pr edr . . - :
technique or set of closely related techniques.Although social scienrists I coun t edt he u's \ , : - : : - -
give lip serviceto the principle that the tool should fit the researchdesign SociologicalRetic:; .--
and objective,in practicethere is a remarkablesameness in the tools used quantit at ivear t i'lcr : '- -
in much nonexperimental research.To determine the causesof some out- or regression-iiken.:: -,
come variable Y, researchersroutinely regressY on some set of other vari- study.
ables.The problem is not regressionitself, but the rourine use of regression, In c hapt er 5 I r r , ': - . ' '-
as if rcgressionwere the only tool available.When there is only a hammer pl ression as t hr pr r r t . '
in the tooibox, then everythingbeginsto look like a nail. The danger is that menti o lr edt her c.lr ; ; *. . .-
social researcherswill adjust their questions to fit the method, instead of (1985) havem adc iir r : ', : :
the other way around. " del i ver t he goo, 1i." : - -
A secondway that statistical method can becomethe master is through that, evet tif alu. t . . * . ,
unrealistic expectations about what statistics can do. Statistics becomes soci alr esear che. r r :: . - '
the master when we begin to believethat fancy statisticscan substitute for of soc ial r ese, r r ; h: ''. - - - -
hard thinking about data collection and researchdesign. The danger here ti oner s m r ghr l'c r : ', - - '- -
is that false hope in the remedial power of statisticscan undercut careful the " rc gr ess) on \ " ^
thinking about design. Consulting statisticians often complain about be- the hegem onrof , t . . : '- . -
ing contacted for statistical advice too late-afrer the data are collected. Y on X, socialr . - . . '. '- - -
Sfe cannot rely on statistical wizardry to overcome faulty data and re- thi nki ng about , , : : ', - :
search design. anal ys is. decision- t : . . . -
Unfortunately, social sciencemethods too often have become impedi- i ncapa cit yt o t hink . . - - -
ments to social researchtoday, along the lines just described.The knee- ci ous ef f ect sof oLr -
jerk use of regression methods and false hope in the remedial power of cial sciences.
statistical estimation methods both limit our imagination in research. A gu [ r . I st r ess : : . . . : : .
Quantitative research in the soC,ialsciencestoday tends to be longer on unthi nking useo1': . j'
statistical sophistication than on imagination. di fferentappr o. r eh -
My aim in this chapter is to provide examples to serve as a counter- cal w or k in r he : --
'oi
weight to the privileged position of the standard stand-alone regression to do the helr r ll. ' - .
sffategy in social research.The examples are meant to be suggestive,nor
exhaustive.They barely scratch the surface with regard to the options we
should be considering in social research. I intend the examples to be N e.ruRelr Y O cr L. . '.
positive-examples of what to do, not of what not to do. I want the book
N atural exper ini- : . : - - -
2In addition, there is the danger that reliance on statistical soci alr esear chcr:.. : : -
fixes can lead to a detelora-
tion of research qualit,v as researchersare beholden to techniques that they can easily use seemt hat nar ur . r l: i I - -
(due to user-friendly computer softr.vare)but don't fully understand. facr ra r e. But inr . r : . : : .
The SerenthRule . 209
:. - : - r:i r.j ear c h des ign. 2A s to end on a positive note. Before discussingthose examples,though' I cle-
-: - : i- ,',i rht he m os t beda z - scribe the dominance of regression analysis in one social sciencedisci-
- - .- :.' * lhe master-but re- pline, sociology.
- ::. - '-::Jestimationmethod
-'.'-- - lr:'llltlt$€S rvill do, then
Oe s sssroNw ITH R E GR E S S IoN
1:-lL'corTlcthe master in
To gauge the predilection for regression methods among sociologists,
'' .r . ingle \ r r lt is r i c a l
:^, Jughsocial scientists I counted the use of regression in the articles published in lJteAmertcqn
r :i rhe researchdesign Sociological Reuiew in 1999, the last year I was editor. Of the thirty-eight
':::ess rn the tools used quantitative articles published that year, all but four employed regression
r aJusesof some out- or regression-like methods (for example, event-history analysis) in their
. Jrlteset of other vari- study.
In chapter 5 I noted the problem with relying on standard linear re-
-.:.narlseof regression,
::lrrc is only a hammer gression as the primary, or onlv, tool for comparing like with like. As I
::-:r1.The dangeris that mentionedthere,Freedman1I991,1997), Leamer (1983), and Lieberson
::-.rmethod, insteadof (1985) have made similar points about the limited ability of regressionto
"deliver the goods." Here I want to make a somewhat different point:
: - :.: :hc masteris through that, even if always done well, the dominance of the regressionmodel in
social researchcan be stultif,ving;it limits our imagination.In some lines
- " -: ,. Statisticsbecomes
- ...::tJsc an s ubs t it ut ef o r of social researchthe regressionapproach is so domineeringthat practl-
'- - -:.r{r1. T he dangerhe re tioners might be hard-pressedeven to conceptualizeresearchoutside of
' -i ,: careful the "regressY on X" box. Clevernessand originality are the casualtiesof
-.1Ilundercut the hegemony of a single method.'$(hen the automatic routine is to regress
. ::.:- con.rplainabout be-
, :- :ir. data are collected. Y on X, social researchersquit looking for natural experiments and quit
:-- .--: i.tulty data and re- rh i nki ng about other opti ons-computer si mul ati on, decomposit ion
analysis,decision-treemethods, network analysis,and so on. That trained
: ::'- ::l re bec om eim ped i - incapacity to think outside the regression box is among the most perni-
' ...: Jcsc r ibed.T he k ne e - cious effects of our over-relianceon standard linear regression in the so-
.: ::--: remedial Power of cial sciences.
:r',.i l n. lliO nin r es earc h . Again, I stressthat the fault lies not with regression itself, but in the
. - , . .,-,,d, to be longer on unthinking use of regression,as if it were the only game in town. Often a
different approach would be better. Some of the most promising empiri-
:- :.r Sefve aS a COUnter-
cal work in the social sciencesusessomething other than regressionper se
: .:.rnd-aloneregression to do the heavy lifting, as we now see.
- ': .rr-ri () be s ugges t iv e,
not
i-- ::..irtl oPtionswe
::i.irtl to the options
- : ::; rh e er am P lest o b e Nerunerrv OccuRRrNc RaNoou AssrcNru.rNt,AcAIN
' : :,' Jo. I want the book
Natural experiments are potential gold mines for social research. Yet
social researchersrarely mine natural experiments. At first blush it might
.' .' -. - i.\ . i i n l e a d to a d e te ri o r a-
- '
seem that natural experiments are seldom exploited becausethey are in
- - . . - : : I n at tne v ca n ea sl ry USe
. - -.'.: fact rare. But interruptions of various sorts are commonplace in the social
2I0 . Chapter7
realm-new laws or policies, natural or human-made disasters,and so marri agesin dir or ; : , '- . ''
on-and many of these interruptions, with a little imagination, might be transmissionof n. . r- : . ,
exploited as natural experiments or quasi-experiments.More than a half on divorce itself ai :-:' :r -
century ago, Trygr.e Haavelmo (1,944)noted the "stream of experiments generat ion.hor r r . . . . . -
that nature is steadily turning out from her own enormous laboratory." l essi n vest m entin : : '. . :
Arguably, the relativescarcityof natural experimentsin social researchis tow ar d r nar r ilgc u . : : :
due to the lack of imagination of researchersmore than it is due to the w E ca n exper l . h. . - . : . .
lack of raw material in the social world. servi r t gt heir par cr . : . . .
\Wefirst encounterednaturall,voccurring random assignmentin the dis-
havi o r st hat r cdu, ; : . . : .
cussion of rule 5, r.vhererve noted that natural experiments are strategic Oth er expl. r n. n. , - - - -
for creating instrumental variables.Recall that the term "naturally occur- as thc pr incipirl t r . : -
ring" does not necessarilymean that nature made the assignments,but learning and morc - :i
that the assignmentswere not done for the sake of randomization, as one generalr ule singlcf-:': .:::" -.
would do in a controlled experimenr. In this secrion I describe more ex- dren. Less t im c or : - :
amples of naturally occurring random assignmentthat social researchers often rneansloir cr t - .. : r .
have been able to exploit. From theseexamples I hope to convince you of cation, financial pri:: -.i:
the value of thinking hard about rvhat a relevant natural expenment i ncreasingt he r isk ' ': ; .
would look like, and of doing this thinking early in your researchproject. The dccim at ior ,r . r : '
I do not mean to suggestthat you will always find an appropriate natural a natur alexper ir r r cr :. : . .
experiment (though of course you are more likely to find a natural exper- ent thar is t he ke. r : : ". ^
iment if you are looking for one). But the exercise itself is worthwhile: men ageseighrern '. '. :'
Even "dry searches" are useful becausethe discipline of thinking about morralit y r at e f or
natural experiments helps to sharpen your thinking about threats to the treate das a r elr r i\ . . - -
r eliabilit yof y o u r re s e a rc hfi n d i n g s . the me n who sr r r t : . i- - -
To stimulate thinking along these lines, I describe four investigations matter here, such .i: i-.: -
that have used natural experiments of some sort to gain leverageon a sig- dren to t he t r ear nr c: :- . . -. .
nificant issue in the social sciences.The first investigation addressesthe to w i dowhood- s. 1r : r :
question of how marital instability is transmitted across generations,the Ger m an childr cn -
second addressesthe question of birth-cohort effects, the third estimates and for r nedf ar nilr c, : : .
peer effects in schools, and the fourth addressesthe question of whether gel har dt{ |999r uir '- ' : - -
schooling in America is an equalizing force or merely a mechanism for re- Germln populr r t i, )n r-
producing classprivilege acr/ss generations.The issuesare diversein sub- chi l dr enhavc f ar . . l r , :
stance; what the examples have in common is their use of some type of and I 945. 77 per ecr . .*: .
natural experiment (along with regressionmethods) to gain analytic trac- in single-parentfam: ::
tion on a question that has proved difficult to answer using more conven- single-parentfamiL:. : -:.
tional methods. paren t s) Dieknr
. . r nr..: . . * :
vari atem odel usin: , . . - -
Random Assignment Based on National Tragedy: basel ine.O t her t h r r . . . . - .
Post-Woild War II Germany for respondent\lror: :
paren tf am ily.l/ r ir . . :: - . " -
The first example usesfamily structure in post-\7orld War II Germany to w ho gr ew up ir r . r . : . - ,
gain leverageon the causesof divorce. Divorce feeds divorce, that is, chil- ri sk of divor ce \ \ . 1: - : '. - : : -
dren whose parents are divorced are more likely themselvesto end their on the r eason ior . : : . , - .
The SeventhRule . 277
-
-- - -r--ridedisasters,and so marriages in divorce. Observershave offered various explanations for the
- - . . : .:rrg inat ion.m ight b e transmission of marital instability acrossgenerations.Some of thesefocus
- : 1 . : : . -J:,!s\.. { or e t han a ha l f on divorce itself as the factor that increasesthe risk of divorce in the next
r i :i i- - r:re .r m of ex per im e n ts generation. For example, respondentsfrom divorced families might placc
-,:. . ::rorlrQus laboratory." less investment in their own marriages becauseof more negativeattitudes
:', t,r I -' ::'Ii itl social researchis toward marriage that they acquired while growing up. In addition, smce
,-, :'- rl :: fhan it is due t o t h e we can expect children to learn conflict-resolution behaviors from ob-
serving their parents, children of divorce are more likely to acquire be-
:., -r-.signment in the dis- haviors that reduce the chancesof successin their own marriages.
:'.:i :i rnent s ar e s t r at e g i c Other explanations identify the absenceof a parent, not divorce itself,
: :rrm "naturally OCCur- as the principal factor. These explanationsplace less emphasison social
,:: :he assignments,but learning and more emphasison the obstaclesfaced by single parents. As a
: :.rndomization, as one generalrule single parents have lesstime and money to invest in their chil-
-:r i I describemore ex- dren. Less time often means less supervision and control; less rloncy
' : ::r.lt social researchers often means lolver investment in a child's education. Along with lessedu-
- .-.,:e to convinceyou of cation, financial pressure in the home can lead to earlier marriage, thus
. .1.: natural experiment
increasingthe risk of divorce.
- . our researchproject. The decimation of the German'World War II male population provides
.: ,:r1appropriate natural a natural experiment to test whether it is divorce or the absenceof a par-
: ' trnd a natural exper- ent that is the key transmitting mechanism. About one-fourth of German
- .. itself is worthwhile: men ages eighteen to forty were killed in the war. I7ith such a stunning
- :.rne of thinking about mortality rate for young men, it is plausible to assumethat death can be
. :-: ebout threats to the treated as a relativeli' random event,or at least that the men who died and
the men who survived did not differ greatly with respect to things that
- :.-:rr. four investigations matter here, such as their parenting skills. Thus the assignment of chil-
- :.rin leverageon a sig- dren to the treatment caregory-growing up in a single-parentfamily due
-:i : : - ::ri qation addr es s es
the to widowhood-can be vieu.ed as random, or nearly so.
';- ::: j ,:-JTOSS
generations, the German children of the World \War II generation have now grown up
- - '-.
-r.. rh e r hir d es r im ar e s and formed families of their orvn. Andreas Diekmann and Henriette En-
*- - :- -: - :.l - q ues t ionof whet h e r gelhardt 11999)use marital data from a 1988 random sample of the \fest
: : :::-\ e mechanismfor re- German population ages eighteen to fifty-five to investigare how those
-. - -: i r,re sar ediv er s eins u b - children have fared in their own marriages.For those born between 1933
- : : -.:'::Llseof some type of and 1945,77 percentgrew up in two-parent families, 16 percentgrew up
:.r.n s ei n e n e l
o....._.._ .yflc trac - in single-parent families due to war deaths, and 2 percent grew up in
: : :' -',,. uslng more conven- single-parentfamilies due to parental divorce (5 percent grew up with no
-r
parents).Diekmann and Engelhardt(table2) report the resultsof a multi-
variate model using children who grew up in a two-parent family as the
r ::-' t' . baseline.Other things equal, thev find no difference in the risk of divorce
for respondentsfrom a two-parent family and respondentsfrom a single-
parent family, i/the absenceof the parent is due to death. For respondents
r' ::-'1,, :,d \\ar II Germany to who grew up in a single-parent household due to divorce, however, the
: - : : : :-l i di to r c e, t hat is , c hil - risk of divorce was increasedby 180 percent.Thus divorce risk depends
: , : '. :rcmselvesto end their on the reason for a parent's absencel "risks are higher for children of
zIz . LnaDtef /
divorce than for respondents who grow up with a widowed parent" Constitution, rvhich :-.:. '
(p.792). This evidencesuggests,then, that it rs divorce that is transmitted. women were eligib'1.:
i nto rwo gr olr p\ - r - '
voti ng agc.ar r elJ ir ' . . .
Random Assignment Based on National Laws: Voting in
P ost-Ninete enth Amendment America A mendm enrchungc- . .
fect of hist or ic. r-l . ': - '
A venerablebody of work in the social sciencesassertsthat historical con- voti ng r at esof u, ': '
ditions presentwhen one "comes of age" have lasting effectson one's atti- vote wit h t he vot in: - . : : . .
tudes and behaviors (Mannheim 1927; Ryder 7965). Becausedifferent '
amendm entin lq- ^ : '
birth-cohorts are subjectedto different "slices of history" as they come of voting ratesof nrq:'.','.-
age, cohort differencesarise. As Norman Ryder (p. 844) explains, "Each of age on vot inc. I : \ ' -
new cohort makes fresh contact with the contemporary social heritage and of age bef or e\ \ 'onr : : ' -
carries the impress of the encounter through life." Or as Ron Lesthaeghe age male count er f . : . - : - .
and Johan Surkyn (1988, p. 40) put it, "Cohorts develop distincrive w e have st r ong r r '. : . - - - .
meaning-giving universesearly in life and seemto maintain thern through- tioning.
or-rtadulthood." \Wecall this embedding of history in cohorts a cohort ef- Fi rsrir is uscf Lr:l . .
fect: "Cohort effects occur wheneverthe past history of individuals exerts to vote t han m cn. \ : . ': : , '
an influence on their current behavior in a way that is not fully captured ment wom en in i. i. : . ' - - .
by an age variable" (Hobcraft, Menken, and Preston 1982, p. 10). to do so. I n t hq ": . : . : - '
Yet it is very difficult to determine how much cohort effects matter, or pol i ti cal scienuc" | : ' ":
evenwhether they exist at all. The problem turns on separatingcohort ef- ri am a nd Har olJ F
fects from age effects. Peoplewho come of age at the same time are the the 19 23 m a\ '() r . t - . - -
same age, and age together. So if we find, for example, that people who data for the * h, . '
were born in the 1940s differ from those who were born in the 1960s, it 1982, t able2: I ll; '-
is hard to know whether the difference is due to their age differencesor to for men and \\'onri::
their cohort differences(arising from their experienceof different histori- The ke1 que( ! . ' '- - -
cal conditions during their formative years). Adding data for other rime ti nuedt oer hil. it . '- -
periods only complicates the problem, since that introduces the possibil- and t2 ) whet herr : . : * . :
ity of what is called period effects, and cohort (year of birth) equals pe- K evi n Chcn and i. '-
riod (year of measurement)minus age (Glenn 2005). teenth Am en. lnr . : - : . - . '
Note that the opeqativemechanism for cohort effects is historical con- w ornell t hc ir t t pr , ': ; . : .
ditioning: Different/cohorts have different tastesor values or worldviews tel l i ng f acr or .lor i. '. : - . -
becausethey were exposed to different historical conditions during the i n vot e t ur nout . "
formative period of their lives. To separatecohort effects from age effects, Mor eover .\ \ 'c r \ : . , -
t hen, we need to i ma g i n ec i rc u m s ta n c e isn w hi ch i ndi vi dual si n a gi ven horts t hat is kerc- : :
society are randomly assignedto different historical conditions as they l angu ageof er p. r , : ' - - : '
come of age. By framing the issuesthis way, we see immediately that the j. : - :
S omewom en in
effects of historical conditioning-cohort effects-could be isolated from cenrs dur ing di. r : '- - '- -
age effects through a controlled experiment in which people of the same no mem or \ of Ji. : : . : - . ' -
age were randomly assignedto different historical conditions during their duri ng ef f ector r ' - - - . -
formative years. greate st exposur c: - '.
Can we find something resembling such an experiment in the real have "t he gr eat c. : . : - : "-
world? Consider the passageof the Nineteenth Amendment to the U.S. vote, wit h a 1c. . . - : - - . - -
The Seventh Rule . 213
.,. -:: \\'ith a widowed Parent" Constitution, which extended voting rights to women in all states-Before
.: : ;-.dir orce that is transmitted' women were eligible to vote, sex randomly separatedchildren at birth
into two groups-a group that would be eligible to vote upon reaching
voting age, and a group that would not be eligible to vote. The Nineteenth
:. L-ttt's:Voting in Amendment changed the historical conditions. To compare the lasting ef-
fect of historical conditioning on birth cohorts, then, we can compare the
ir:i-c: ilSserts that historicalcon- voting rates of women who came of age before women were eligible to
-
-: , r Lrsting effects on one's atti- vote with the voting rates of women who came of age after passageof the
; R . :cr 195-5).Becausedifferent amendment in 1920. By comparing the voting rates of women with the
':".-.s of history" as they come of voting rates of men who are the same age, we can separateout the effect
. R '. ier (p .844) ex plains ," E a c h of age on voting. If Nineteenth Amendment women-women who came
social heritageand of age before women could vote-are less likely to vote than their same-
- :-.:-nrporary
:::' lLfe." Or as Ron Lesthaeghe age male counterparts, and this gender gap disappearsfor later cohorts,
,. "Coho.tt develoP distinctive we have strong evidence for the enduring effects of a cohort's condi-
to maintain them through- tioning.
- ::::n
: : :lstorv in cohorts a cohort ef- First it is usefulto know whether,in the 1920s,women were lesslikely
: :-rr: history of individualsexerts to vote than men. Scattered evidence suggeststhat Nineteenth Amend-
:. -:-'.,..r\that is not fully captured ment women in fact were less likely to vote (than men) when first eligible
. :. : '. iP re sto n1 982, P . 10) . to do so. In the "first sample survey recorded in the annals of American
',.":-.ruchcohort effects matter, or political science" (Bennettand Bennett 1987, p. 158)' Charles E' Mer-
.:t :urns on separatingcohort ef- riam and Harold F. Gosnell (1924) discovereda substantial gender gap in
-: i Jge at the sametime are the the 1923 mayoral election in Chicago-a result consistent with turnout
::. :or example,that PeoPlewho data for the whole of Illinois for the 1920 national election (Kleppner
.. '...ho u'ere born in the 1960s' it 1982, table 2; Illinois was the only state to report vote turnout separarely
.; :-.. ro their age differencesor to for men and women).
-:r: c\perienceof different histori- The key questionsare (1) whether women of the amendmentera con-
:,'-:. . .\dding data for other time tinued to exhibit lower voting rates than men thirty to forty years later,
..r:: rhat introducesthe Possibil- and (2) whether the disparity disappearedamong later birth cohorts. As
. ;, ;v1rrt(-vearof birth) equals Pe- Kevin Chen and I explain (1995, p.978): "After passageof the Nine
t ' .: : lrl )0 05). teenth Amendment, historical conditions no longer 'taught' young
:': :,rhort effectsis historical con- women the impropriety of voting. So if historical conditioning is the
':.: :.lstesor valuesor worldviews telling factor, postamendmentcohorts should exhibit no sex differences
-: :..iorical conditions during the in vote turnout."
:=:. -,rhort effectsfrom age effects, Moreover, we expect a pattern of disappearinggender gaps across co-
:: : \ \ l ri ch i ndiv idualsin a g i v e n horts that is keyed to the passageof the Nineteenth Amendment. In the
::::': historical conditions as they languageof experiments,there are severallevelsof the "treatment effect":
.,,..:.'..\\'e see immediately that the Some women in fact were denied the right to vote, others were adoles-
-
.: ::iccts-could be isolatedfrom cents during disenfranchisement,others were children, and others have
:',:..: in rvhich PeoPleof the same no memory of disenfranchisement.If historical conditioning has an en-
': : i:llrlcal conditions during their during effect on cohorts, the effect should be greatestfor cohorts with the
greatest exposure to the conditioning; hence disenfranchisementshould
: .,--h an exPeriment in the real have "the greatest lasting effect on women actually denied the right to
Ame ndm entt o t he U .S' vote, with a lesser effect on women who were adolescentsduring the
' , .. : . en th
. - . "" , .1
2I4 . Chapter7
disenfranchisementera, followed by those who were children during that C l assr oom soao. , a. - _- ,
er a" ( F ir ebau g ha n d C h e n 1 9 9 5 , p .9 7 9 ). classmates speud.r !:;-. -:
Data from the 1952-1988 American National Election Study strongly truc pe eref f ect rir , ', . . . -
confirm thesepredictions. Even though the right to vote had been extended ful to concepnr , r ll/ .'-
to women more than thirty years-and in some instances,more than fifty ti on i s f he t r eat r nt L: -
years-before the electionsexamined here, women who came of age before uutculllewouhl h. t r r . : .
or during the Nineteenth Amendment era continued to vote at lower rates peerswhu $gr c Ju; : : .
than their male counterparts.And the stronger the historical "treatment," one' si nt er estis r n : : - : : : ,
the greater the difference.For men and women born before 1905, the odds The nt assirc r c. l- . - :
that a man voted is twice the odds that a woman did, controlling for age ol i n.r) SchoolDir i; . .
and age-squared,education, family income, region, party partisanship, how studentaulr ir rr . :
marital status, and election year (Firebaugh and Chen 1995, table 2). For l n an ef f or : tt o h, r l. i- . , - .
those born from 1,906to 1915, the odds were 43 percenrgreater for menl (as measur edl'r i- ; - - - ;
for those born from 1916 to 1,925,the odds were 28 percent greater for reduc ed- pr icelun. : - .: - - -
men.eVoting ratesare the samefor men and women born after 1925. popul a t ion.t he di. : : - : - .
In short, the ratification of the Nineteenth Amendment provided a nat- to 5 pe r centof ir : : : . .- : .
ural experiment to test the concept that the values of birth-cohorts are of the const Anrr c. . - : : : ^ -
shaped by their common youthful experiences-a central social science student sin t he dr . : : - : '
concept that is often assumedbut hard to demonstrate empirically. ti on du e eir hert . ' : . = - , - .
(H oxby and \ \ cir : : : - : ' -
For r esear . hcr . :. : . r . .
School Reassignments in Nofth Carolina ti gate p eer ef ici: . ^- - '
Our third example of the use of natural experiments in social research mentsin which . : . . - - : . -
comes from an econometric investigation of peer effects on student and Weingar r h.. r . - - : - - -
achievementby Caroline Hoxby and Gretchen Ifeingarth (2005). Peeref- dents in t he t hir - : - -
fects, like cohort effects, are often surmised but seldom proved. In addi- rhe 1994-95 s.h, : :
tion to the usual concernsabout measurementerror and omitted-variables ent var iableis : r r r ; : : '. :
bias, the empirical search for peer effects is complicated by the possibility ol i na' s end- of - qr . r ; =. . . . .
of self-selectionon the dependenrvariable. For example, adolescentswho permi t t he er ar r r r r : . . :
have an unmeasurgd propensity for delinquency might befriend other encech angesin p. . - . '- -
delinquency-prondadolescents,resulting in a positive associationbetrveen datasetincludesnr : - . -- . - .
individual's delinquency and peers' delinquency. The unwary researcher reducedpr ice lr r ni; : - - -
might interpret this associationas indicating a peer effect when in fact the i ncl ud esinf or r nlt r t ': .: . - ,
delinquency-prone individual would have committed the same acts of B ec ause t he pLr r p .
delinquencywithout the peers.There is also the possibility of a "reflection" prevent( or lt le, r . t '. .'
(Manski 1993) or "social multiplier" (Glaeser,Sacerdote,and Scheinkman the rea ssignm enr. \i: : : :
2003) effect. BecauseI am a peer to my peers,ifthere is a peer effect, then gradesin crou'ded s;:.
my outcomeswill affect their outcomes,which in turn will affect my our- i n l ess- cr or , r 'ded'r r , : - - . :
come, and so on in a continuing loop. This loop further complicates the rThe rel rri grrrnc T:.:- -.
analysis of peer effects. -
rel sri gnc d s tudent. :' - .'
pl e. or to honre-..l -,' , : -
eMoreover, this pattern of gender differences occurred di ffi cul t, horv erc r' ... F
only for voting, not for other
measuresof political involvement. with the reassignneni.
Rule .
The Seventh 215
::: ;hildrenduringthat Classroomsprovide a strategic place to search for peer effects because
classmatesspenda great deal of time together.The challengeis to s€parate
:.:-:ior-rStudystrongly true peer effects from selectioneffects and other types of biases.It is help-
:: h,rdbeenertended ful to conceptualize this in terms of an experiment where peer composi-
. -. . , : . , <5.nro re th an fift y tion is the treatment effect. Wl-ratwe want to know is whether a stlrdent's
' .,.i(r .ame of age before outcome would have been different had she been embeddedin a group of
: -r t,l \'ot€ at lowef rates peerswho were different on the characteristicsof interest (such as race, if
f .tl "tre clme nl "
. t r, r iu one's interest is in the effects of race segregationon some outcclme).
':. :.itore 190.5,the odls The massivereassignmentof students in the Sfake County (North Car-
: j. controlling for age olina) School District provides a grand narural experiment for studying
- :. Pilrrv partisanship, how student achievementis affected by the peer composition of the class.
.-:r 1 995,t able2) . Fo r In an effort to balance its schools on the basis of race and family income
:::-ent greaterfor men; (as measured by percentage of srudents participating in the free or
:: l\ percent greater for reduced-pricelunch program) in the face of a rapidly growing srudent
: :.(rrnalter 1,925. population,the district has,eachvear for more than a decade,assignedup
;:-.*:rentprovided a nat- to 5 percent of its studentsto different schools.rFor students,the result
.--. ,rf birth-cohorts are of the constant reshuffling is changing peer composition: 62 percent of
- : ---. Jintra l so cia l sclen c e studentsin the district have experienceda changein their peer composi-
'..
,.-
.ri! - - lPr r^i
Lrl .;^^
r Lar r).
l l ., tion due either to the reassigr.rrr.rent
of others or to their own reassignment
(H o x by and W ei ngarth2005, p. 15).
For researchers, the reshufflingpresentsa unique opportunity to lnves-
tigate peer effects becauseof the "very large number of r-raturalexperi-
: r :: : r-:l l rs i n s oc ial r es ear c h ments in which studentsexperiencenew peersin the classroom" (Hoxby
- :--r effects on student and Weingarth, abstract).Hoxb.v and \Teingarth use panel data for stu-
- . - '. ' .:r:ri .rrt h dents in the third through eighth grade in the \fake County district for
- ( 2005) .P eere f-
.: - - . i ..,Jo m pr ov ed.I n add i - the 1.994-95 school 1'earthrough the 2002-03 school year. The depend-
*:- :: : : rn d om it t ed- v ar iab l e s ent variable is student achievement,as measuredby scoreson North Car-
: - -'r .:-rted by the possibility olina's end-of-grade tests in reading and mathematics. The panel data
: I - :r ..:rple,adolescents who permit the examination of students' scores before and after they expen-
-.rrphrhef r iend ot h e r encechangesin peer composition. In addition to students' test scores,the
- - : - :..e a ss oc iat ion bet we e n dataset includes measures of race, gender, participation in the free or
r -: - , lh.' unrvary researcher reduced-pricelunch program, and parents' education. The datasetalso
- i r r::r .iiecr rvhen in fact the includesinformation to identify eachchild's classroom.
. - -- :-::::d the same acts of Becausethe purpose of the reassignmentis to relieve overcrowding and
: - : : --:rrl i rv of a " r ef lec t io n " prevent (or at least alleviate) segregationof schools by race and income,
:':.::. '., -:r[]ote, and Scheinkman the reassignmentsare not random. Specifically,students in more-crowded
:-:-.. : : -::i is a peereffect,then grades in crowded schools are more likely to be reassignedthan students
- - ' :-.rn u-ill affect my out- in less-crowded situations. And, given the aim of racial and income
- ' :-rherc om nlic at est h e
3The reassignmentstrategv for balancing schools
could be sabotagedif parents removed
reassignedstudents frorn the public schools (they might srvitch to private schools, for eram-
ple, or to home-schooling).The procedures used by the school district rnake noncompliance
rr voting, not for other difficult, however (seeHoxby and Weingarth 2005, p. 1,1),and most fan.riliesin fact comply
with the reassignment.
216 . Chapter7
balance, in any given year students of a particular race or a particular in- Cognitive inequ.llr;. -.;
come category are more likely to be reassigned. i s w ell docum cnt . J : : . ,
The reassignmentsare, however, conditionally random: Conditional children frorn fam'l:.. ',' :
on a student's race, ethnicity, lunch participation (the income measure), rai n m or e r t lu. ilt t r '' .
school, and grade level-variables included in the dataset-students are educa t ion,t t t t t r r r '
equally likely to be reassigned.In other words, for two students of the thi s ger r er ulSLS : . ,
same race with the same level of family income, those in the same school ti on -+ Childr en's: t :
and grade are equally likely to be reassigned.In particular, as Hoxby and l n a ddit ior rt o r . ': , . .
'Weingarth (table
2) show, once the reassignmentvariables (crowdedness frorrrh ighcr SE5I . i: . . . : - -
of school and so on) are controlled for, the probability of reassignmentis student sgencr . r ll'. r . . . . . -
independentof students' initial test scores.This independenceis critical to to hi g her abilr r i '*: - . . .
their investigation,sincetheir aim is to determinewhat effect peers'achieve- favora bleint er acr r :' . .
ment has on one's own achievement, and how that effect might differ age i n equalit ie'" r r : : - .-
acrossgroups (perhaps,for example, the presenceof high-achieving peers some obsen'er: hrr
benefitsonly other high achieversand has no effect or even a negativeef- from s er vingAs. r n: , . l
fect on low-achieving students). exace r bat einequr t l; : : . . .- '
Hoxby and \fleingarth conclude that there dre peer effects on students' sentsone of the mos: :--:
achievements,but these effects hinge on peers' achievementlevels,not on S E S advant age. or
-.-:-
their income level or their race. Quoting Hoxby and rWeingarth (p. 28; serve t he int er cst : : : . .
bracketed clausewas added for clarification): (B ow lesand G inr r - - - -
N o one disput - . : , - '
The vast majority of the apparent impact of a concentrationof racial
tO recelVem or c . l: - ^ - - '
minorities, ethnic minorities, or poor students[that is, studentsin the
tion to transntir a. -,r- -' ",
free or reduced lunch program] is really the effect of their achieve-
cl ass dif f er ences.D. . : : .
ment. Put another way, if we seetwo schools with the samedistribu-
nonethelessf un. t r ' ' .
tion of achievement(not merely the same mean), we should expect
l ow er SESst udent . .. : . - .
their students' achievement to evolve similarly in the future, even if
14) explain:
the schools have quite different racial, ethnic, and income composi-
t ions . E ve n if scht loi.. r : - . . '
i n s chooler t r ir t 'r . : . . . . .
In short, on ,he basis of the natural experiments provided by the Wake
vironments. Son',. - -
County rea(signments,Hoxby and lfeingarth find that what matters for
enc es,but t he di. . i; . - -
school achievement is the achieuement composition of one's peers, nor
even more Selerc. -..
r,vhetherthey are black or white or poor or not poor. It remains to be seen
qualit y schoolr r r - : "
whether other researchersare able to replicate their findings with other
child am endir r .gr l: - - - .
data, but the important point here is Hoxby and Ifeingarth's imaginative
vant agedst udenr - .. - '
use of a school district's reassignmentplan to provide analytic leverageon
the difficult issue of peer effects in schools. In short, to determin. '.'.-.
considert wo lear ni'r - . : '
ideal experiment, \\'e \,..
Using Seasonal Comparisons to Assess Scbooling's
and nonschoolcondir , : -
Impact on Inequality
schoolsfail to ler-elrh. :
The fourth example, a study by Douglas Downey, Beckett Broh, and Paul inequality in learnins :.:
von Hippel (2004\, also involves schools:"Are schoolsthe great equalizer? tions as it is under ti:: '
Rule .
The Seventh 277
' -.-,.. or a particular in- Cognitive inequality during the summer months and the school year." It
is well documented that, in the United Statesand other'Western nations,
' :.-.nJolr-r:Conditional children from families with higher SES(socioeconomicstatus) tend to ob-
. -: incornemeasure), tain more education than children from lower SES families. And one's
-..r:aser-studentsare education,in turn, is a major determinantof one's own SES.So we have
' : :r\ o students of the this general SES transmission model: Parents' SES--; Children's edusa-
- i: in the sameschool tion-+Children's SES.
.::.;u l a r . as Hox by and In addition to obtaining more years of schoolingon average,children
, ..:r-rhles (crowdedness from higher SESfamilies also ter-rdto receivebetter education.Higher SES
- :, of reassignmentis students generally attend schools rvith better resources,tl-reyare assigned
-,:::ndence is critical to to higher ability grolrps, and the1.and their parents typically have more
-.,: ;fiect peers'achieve- favorable interactions rvith teachers.Due to what one author calls "sav-
...: eiiect might differ age inequalities" in public education in the United States(Kozol 1991),
: ::ieh-achievingpeers some observershave concluded that education in the United States.far
-- ': cvena negaf iv eef - from serving as an equalizing force. servesinstead to reproduce or even
exacerbateinequalities.Indeed,some scholarsarguethat schoolingrepre-
r, .: efiectson students' sentsone of the most important channelsbv u.hich parents transmit their
' :-..ment levels,not on SES advantage(or disadr,antage)to their children. In that way schools
, ' - I \ \ ' e i n o a rrh tn
'r'
)g
- ui serve the interests primarilv of those at the top of the SES hierarchy
(B o wl esand Gi nti s 1976).
- No one disputesthe fact that children from higher SESfamilies tend
- :r.entration of racial to receivemore and better schooling.The issueis rvhetherschoolsfunc-
:: - : - ::r.1 its , s t udent sin t he
--. ,::r'.t o f t heir ac hiev e - tion to transmit classdifferencesfrom parentsto children or reduce such
class differences.Despite large disparities in resources,schools might
::.rthe same distribu-
- : -.:.:.:'. \\-e should expect nonethelessfunction to reduce disparities in skills between higher and
iower SESstudents.As Dorvner',Broh, and von Hippel (2004, pp. 613--
.: ::r rhe future, even if
14) explain:
.::J rnc om ec om pos i -
Even if schoois are unfair, they may serve as equalizersif the variation
'Sfake in school environrnentsis smallerthan the variation in non-schoolen-
::,,r ided by the
vironments. Some children may have relatively poor school experi-
.. ::rlt n-hat matters{d
' ,: ,rf one's peers, not ences,but the disadvantages in their non-schoolenvironmentsmay be
even more severe.As a result, a disadvantagedchild attending a low-
:. Ii remainsto be seen
quality school can enjov a larger "school boost" than an advantaged
: : hndings with other
child attendinga high-qualitl school.In this way schoolscan favor ad-
.,,.. :rS.rrth'simaginative
vantagedstudents,i'et still serveas equalizers.
:: ,rnalvticleverageon
In short, to determine r'vhetheror not schoolsserveas equalizers,we must
consider two learning environments: in school and out of school. In an
ideal experiment, we would randomly assign children to various school
: ': a -i
and nonschool conditions to try to separatetheir effects. The claim that
schools fail to level the playing field would be supported if we found that
. . :'..:ketr Broh, and Paul inequality in learning tended to be just as high under the school condi-
tions as it is under the nonschoolconditions.
218 . Chapter 7
Although such an experiment is not feasible,we can use summer vaca- On the basisof r hc. . :
tion for students as a natural experiment to separatethe effects of school vari ab leY will , . lr . lnl-
and nonschool environments on learning. During the school year, children able X.
are exposed both to school and nonschool environments. During the sum- D ecom posit ion ir : ', . .
mer, the large majority of children are exposed only to nonschool envi- change or v ai l J rrrc l r. : :
ronments. To separate out the school effect on learning, then, we can ti ons of subpopulit r i( ) r r:
compare rates of learning over the summer with rates of learning when qoul rtiugpr ocedr r r . . , , ,
school is in session.As Downey, Broh, and von Hippel (p. 616) observe, vari anceunder inr r r : - .
this design is "analogous to a crossoverdesign in health research,where the proximate sourar. : :-
the samepatients are exposed to different treatmentsin different periods." not con st it ut et he c- . . - : - . .:
To compare students'learning in the summer and during the schooryear, can point us t o r he l: : J: :
Downey, Broh, and von Hippel use test score data from a nationally repre- ways that overall ch:::. -
sentativesample of about 20,000 children who enteredkindergartenin the the subpopular ions: - . *
United Statesin 1998. By beginning when formal education first begins, tive, or policy inreres:.I:..
Dor.vney',Broh and von Hippel are able ro examine differences across chil- populations:The nrc,r: .::.
dren that are relativelyuncontaminatedby schoolprocesses.Consistentwith decomposition.
prior research,they find significant differencesin the reading and mathemat- To givc r sim plec\ r - . -
ics skills of higher and lower SESchildren as they begin their schoolingca- rate ri s es over sonr L : . '
reers. On the first day of kindergarten a child whose family SES is one cause o f t hat r ise. \ \ r '. ,...
standard deviation higher than that of another child will, on average,have poverty primarili' ant,,:.:
nearly a t\,vo-monthschool head start on that other child lp. 62a). young or t he oldi I . : : . .
The question is whether schools tend to exacerbatethose SESdispari- country? And so on.
ties. By comparing learning rates during kindergarten and first grade with D ecom posit ioni: l: : . - . - -
learning rates during the intervening summer, Downey, Broh, and von seenas m er ell der . r r p: -
Hippel find that schools in fact reduce the inequality among students of j ust to descr ibe.Bu! : : . :
differing SESbackgrounds. Children learn ar much more equal rares dur- w hat th er e is t o er pl. r . : . .
ing the school year than they do during summer vacation. As a result, the factsar e. [ r al50f , . . : .
even though the learning gap between SES groups grew during kinder- Decompositionhas : ,.
garten and first grade, the gap grew much more slowly when school was or rates,or what \\ c :-.._.-.
in sessionthan when it was not. In the language of an experiment, expo- anal ysisof var ianee. ". '. -
sure-rd.the"treatment" (school) reduced the disparity in learning rates for si s of i n equalir l. l : -
higher and lower SESchildren. In that sense,schools are "great equaliz- 'p. .
ti on. The well- knor .r . ,
ers" in America (Downey,Broh, and von Hippel, p. 633). Apparentlv the group medns(Blinder 1-- -.
disadvantagesthat lower SES children face in school, as savageas they decomposit ion.The l : : =. , -
might be, are less consequential for their learning than are the disadvan- di vi di ng socialchanc. . -
tages they face in their homes and neighborhoods. replacementand the p.r:: *,.
exampl elt he "gr oup ': ': . : .
poi nts i n t im e, sinc. i: : . , .
DEcouposrrroN Wonr rN THESocrer ScrrNcns ods, such as the Blintle:-, ,. ,
(as describedin chapri: : :
Regressionanalysisproducescoefficients(slopes)that summarizethe as- posi ti onof inequalr r r.'. : . - . .
sociations among variables.Under certain conditions these slopes are B ecausechapt er h *: . - -
estimatesof the averagecausaleffectsacrossindividuals in somepopulation. i n th i s chapr er I f , , . :.
The SeventhRule . 219
i: I :. ' : a. ln U S e S U m m e f V a Ca - On the basis of these slopes,then, w.ecan predicr how much an olltcolne
r -:. :i.iii the effectsof school variable Y will change on averagefor a one-unit change in a causal vari-
-. :-3 5choolyear,children able X.
-"--
: ...:r-nts.During the sum- Decomposition analysis, bv contrastJ accounts for the observed
rr. -:- :-.1\to nonschool envi- change or variance in some ourcome variable Y by isolating the contribu-
.w e
- = . - , . ii l-1- i l H ,
-.il
,L--
tl l g l l , C an tions of subpopulations ro rhe overall change or variance in y. The ac-
',:- :
r-1.t1's
of learning when counting proceduresrvill varv dependingon the narure of the changeor
',: - tr:pcl ( p. 616) obs er v e , variance under investigation, bur the objective is the same: to determine
::'i :- - ;ealth research,where the proximate sourcesof the in Y. The proximate sourcesmight
-:ri--:r- r> rn differentperiods." not constitute the causes per'ariance
se of the variance, but proximate soufces
-- - - ' -i Ju ri ng t he s c hooly ea r. can point us to rhe underlr'ing causes.There are an endlessnumber of
r: :.,:.:.::()nt a natronallyrepre- ways that overall change and variance can be partitioned, of course, so
- : r:::J kindergartenin rhe the subpopulationsshould be groups of particular theoretical, substan-
- ' --:.: cducation first begins, tive, or policy interesr. The firsr srep, then, is to identify the strategic sub-
t,: : -riicrencesacrosschil- populations:The more strategicthe subpopulations,the more relling the
*., . . : '-essesc.ons is t ent wi th d e c o mposi ti on.
To give a simple erample of decomposition,supposethe U.S. poverry
: -: .'.gin their schoolingca- rate rises over some period. As a first step toward understanding the
-)
- ,.. ,rse family SES is one cause of that rise, \\.e \\'ant to determine its source: Is the growth in
- r
r-j ' - : riill. o n avera ge . h av e poverty primarily among blacks or whites or Latinos? Is it among the
:: :-: ;:.iild@.624). young or the o1d?Is the increaseconcentratedin certain regions of the
: i , , :: r,tre those SESdispari- country? And so on.
I Icr I )-t ^' ^- r de with Decompositionis underutilizedin socialresearch,perhapsbecauseit is
-: --: . -:.1 (1lrU 5I .
decomposition of segregation.In keeping with the theme of this chaprer, the rate of change is the .:
the methods for the most part are not fancy, but they are appropriate for w ay of conccpt ualizingi: ': : r ', : '
the task at hand. I rely on examples from prior research, beginning with tive here becausetherr .- .,
the decomposition of variance and inequality. (ANOVA), for deccrrl;.,,.,,
-
mctter, th en, t o use of t : . . .
Decomposition of Variance and Inequali4t varianceof log(earninss .:..
nents.And, sincevarianct :
Someof the most compelling examplesof decomposition in social research betw een- class and u ir hir , - - . .,
involve the partitioning of variance or inequality into successivelysmaller
cl assi neq ualit y.r espe. r ir .
subpopulations, each contained or "nested" within a larger population.
Weeden and colleasue. ...,
The citizens of the world, for example, can be partitioned geographically:
The Theil index is approp: .-::
Individuals are nestedin local communiries;local communities are nestedin
usi ng al gebr a( Jenkins1"- . .
larger jurisdictions or regions; regions are nested in countries; and coun-
decompositions(of the r;- ..-
tries are nested in the world. Likewise, students are nested in classrooms
dicate that inequalitv s'i:r : :-
that are nestedin schoolsthat are nestedin school districts, and so on.
the wage inequalitv in ::'.
The decomposition of variance or inequality within and across nested grown more rapidlr'. hor..:'.::
groups is often very telling. Some inequality measures,for example, have
Iumpier as well as more i..-;
the property of additiue decomposabilily (Jenkins 1991), meaning that As the exampie of c:::-. -
total inequality can be partitioned exactly into its within- and between- w ork i n the social scicn. . . - . .
group components (for nested groups). This property has been used, for changesby apportionins ::-.
example, to partition global income inequality into the parr due to w e can uset he logic in r sr ; : - :
within-nation inequality and the part due to berween-nation inequality Consider,for example.h .'.
(recallthe discussionof global income inequalityin box 1.1 of chapter 1).
al l i ndi vidualsin r hc r i - -
As another example, \Teeden et aL. (2007) additively decompose the (with a fast computer)\\'. - -
growth of earnings inequality in the Unired States over recent decades,
culating somemeasllre(): -:-:-
7973-2004. Earlier analysesin economicsand sociologyhave focusedon
do not have that datl..r.r...: .,..
the causesof growing inequality. Weeden and her colleagues,in conrrasr,
al l the po pulousnet i, , n. - -
investigate the consequencesof growing inequality for the class structure '
i ty), and est im at esoi
in the United States. Nested-group decomposition is appropriate here, "'. . : '
strategyi n r his in: r . r r r.e.
since individual workers are nested in occupations that are nested in '
p o s r i l o n eq u a tl o n s to i !r ::i '1 i :
larger categoriesthat \Teeden and colleaguescall "big classes."The ques-
Goesl i ng2001; Schulr
tion is whether growing inequality is strengthening or weakening the big-
Fi nal l y consider
, inr . r - . . .:
class structure in the United States. More rapid growth in between
fundame nt alinequalr n : - . , .
big-classas opposed to within big-classinequality would indicate an in-
and Tul l apur kar200i. : . - - "
clining importance of class for earnings in the United States, that is, a
spanfor all individuLr l. -
lumpier earnings distribution with relatively stronger class distinctions.
tries over the secontlh.r,:
More rapid growth of the within-class component would indicate a de-
uariance in adult life sr,-, . ,,
clining relative importance of classfor earnings in the United States.
'Weeden among those who live i, :,-:
and colleagues use two standard measures of inequality to j apurkar2 005) .Var ianc; . - - ; - :
study the trend in earnings inequality. The first measure is simply the
than it is in other high-in; :--: - ,-
variance of logged earnings. The variance of the log is often used as a
France,Japan,and SueJc:- : : ,
measure of inequality becauseit meets the essentialrequirements for ine-
W hat is t he sour er ' : - -
quality measures.In particular, rhe variance of the log is constant when
classic study of differc:: ., -'
The SeventhRule , 221
' :Irrr neol t his c hapte r, the rate of change is the same for everyone-consistent with the standard
. ::t;\ Jre appropriatefor way of conceptualizinginequality (Allison 1978). The measureis attrac
re;rch, beginningwith tive here because there is a ready-made method, analysis-of-variance
(ANOVA), for decomposing variance. It is a relatively straigl,tforward
matter) then, to use off-the-shelf computer packages to decompose the
,:;', variance of log(earnings) into its between-classand within-class compo-
nents.And, sincevarianceof log(earnings)is a measureof inequality,the
-::- -r'r .lrion in sclcialresearch between-classand within-class components reflect between- and within-
c i - : - :t:()su c c es s iv ely
smaller classinequality, respectivell'.
r:- -. :r .l larger population. 'Weeden
and colleaguesalso use the Theil index to mcasure inequality.
:- -- - -:: o neJ geogr aphic a l l y : The Theil index is appropriate here becauseit is additively decomposable
.- . - ::rmunitiesare nestedin using algebra (Jenkins1995; Firebaugh2003, pp.127-28). Both setsof
-: - : -': - :-iaount r iesand; CO u n - decompositions(of the varianceof logged earnings,and of the Theil) in-
:-i:.:- ...:. nestedin classrooms dicate that inequality within occupationsremains the source of most of
the wage inequality in the Unired States.Between-classinequality has
- - : '. .',::hi n and ac r os snes te d grown more rapidly, however; as a result, the labor market is becoming
r -- : ::,rilS. for example,have lumpier as well as more unequal.
: -: ..s 1991) , m eaning t h a t As the example of earnings inequalit,vdemonstrates,decomposition
-: :. ii'ithin- and between- work in the social sciencescan be used to shed ncw light on observed
.- - -: ::rtv h as been us ed,fo r changesby apportioning the u'hole irrto its constituentparts. Sometimes
:: : - r: '. rn ro t he par t due to we can usethe logic in reverse,to trv to estimatethe whole from the parts.
-. -. ::i-.r cefl-Ildtion inequality Consider, for example, hon' one might estimate income inequality across
:*, : : .:: L .or 1. 1 of c hapt er1 ). all individuals in the rvorld. If rve had income data for everyone, then
- ::;irively decomposethe
- (with a fast computer) rve could estimateglobal income inequality by cal-
- :: - a:..-::iover recent decades, culating some measureof inequality such as the Theil index. Of coursewe
.: : --, ::ol o gv hav ef oc us edo n do not have that data, but rve do have estimatesof the averageincome for
- :-: -.:: ;Lrlleagues, in contrast, all the populous nations in the world (to estimatebetween-nation inequal-
-: - ,..: :-. ior the classstructure ity), and estimates of rvithin-nation inequality for many of them. The
-: i :. ,it is appropriate here, strategy in this instance, then, is to sum up to the total by using decom-
"- --: . :..,)r15that ar e nes t ed i n position equations to estimate the whole from the parts (for example,
' " The ques-
- -: - - .. rrg classes. Goesling2001; Schultz 1998).
-
- :: :1- :'.i (-lr\\:eaken-ngthe big- Finally, considerinequalitv in individuals'life spans-"arpluably the most
''. : : ..:.J growth in between fundamental inequalitv that exists among human populations" (Edwards
:' ri o uld indic at ean i n - and Tuljapurkar 2005, p. 665). Life expectancyat birth (that is, aueragelife
. * i-: t ri re d S t at es ,t hat is , a span for all individuals) convergedamong twenty-one industrialized coun-
: . -- :.!(r c las sdis t inc t io n s . tries over the second half of the twentieth century (White 2002). However,
.: .::-.: ri'ould indicate a de-
- uariance in adult life span, as measured by variance in the age at death
.r- - -: . :he United States. among those who live to age ten, has not converged (Edwards and Tul-
-..:: :--:-:.:Lires of inequality to japurkar 2005). Variancein the adult life span is higher in the United States
re.lsure is simply the than it is in other high-incomecountries,such as Britain, Canada,Denmark,
:-: ltrg rs often used as a France,Japan,and Sweden(Edwardsand Tuljapurkar 2005, fig. 5).
" : .r rrquir em ent sf or in e -
Vhat is the source of this U.S. exceptionalism? 'We know, from the
: :^.r-log is constant when classic study of differential mortality by Evelyn Kitagawa and Philip
222 . Chapter7
Hauser (1973), that there are substantialdifferencesin mortality rates (Reardon and FircL'.rr:...-
acfoss social classesand racial categoriesin the United States.As Kita- ro w hi ch t he lo. al . r r , '
gawa and Hauser (1973, p. 4) note, "The importance of socioeconomic or someot her popui: t : , - - '
differentials in mortality is that they point to the possibilities of reducing l f ou r aim is f o t s- . . . ;
morrality through the betterment of socioeconomicconditions in the pop- mtrst 6 r st det er nt 'ni ', ',- '
ulation." In addition, socioeconomic and racelethnic differencesin mor- schoul sunr . lsubur l. , i. ,r - . ,
tality might help explain the puzzle of the greater heterogeneityin adult suburb s?Wit hir r >*1, .
life spans in the United States. Greater heterogeneity in adult life spans ousl l th e r educt ion , : . . -
might be tied to greater heterogeneity in the U.S' population, since the i ted eff ect if nr or r - .
United States tends be more racially heterogeneousthan other rich na- Li kew i s er eduer t t gr ; - : ; - - .
tions. To test this coniecture, r'vecould decomposethe variance in adult l i trl e i m pact on or r r . : - -
life span by race. !(hat we want to know is whether the variance in adult of the s egr egat ionli. - ,
life span within race/ethnicgroups in the United Statesis similar in size to and the suburbs.
'Sfhatever In fashioningpoii; - - -
the variance for racially homogeneousgroups in other nations.
we discover regarding variance in age at death within racial groups, we first to decomposes.i:::..:
could probe further by decomposing the within-nation, within-race vari- of the s egr egat ion.: . . . '
ancesalong other key dimensionssuch as sex, social class,and causeof S eanR e ar don.John \ ': - .
death. The point is that successivepartitioning of the variance is likely to Thei l ' se nt r opvin. 1c. , ' - : -
tell us quite a bit about heterogeneityin adult life spans and reveal where overal l segr egar ion, . ,
between-aggr€gdte :ci:: i ::
we should look-and not look-for explanations of the greater life span
uncertainty in the United States. ti onal unir , su. lr . t . . -
raci al g r oup. \ er r : : - - - -
posedfur t her :t hr r - .
Decomp osition of Segregation Indexes ther pa r t it ioned ir : : - :
Segregationrefers to the extent to rvhich individuals' local environments di stri cts in t he sub, : : : - . ,
differ on the composition of some population trait (such as race or so- schoolswithin sch,-.,,- : .:-
cioeconomic status). School racial segregation'for example, refers to the C on siderf igur c - . . . . -
extent to which racial composition varies acrossschools.The more that i n U .S . m et r opolir . r r- , - - -
racial composition varies across schools, the more that a student's local
racial environment depends on the school that he or she attends, so
Total metrtri,
school racial segregationcan also be thought of as the association be-
tween race and school. From the raw data we can tell immediately when
school racial segregationis zero-that. would occur when racial composi-
tion is the same for all schools. so race and school are not related. We
could also tell immediately when there is maximum or complete Within-citr
segregation-all schoolswould be monoracial, with whites attending all-
white schools, blacks attending all-black schools, Latinos attending all-
Latino schools, and so on. Across schooi ;
In virtually all cases,of course,the racial segregationof schools is nei-
Within schooi ;
ther zero nor complete but falls somewherebetween the two extremes. In
that caseit may not be obvious, from the raw data, whether school segre-
gation is greater in city A or in city B. A number of segregationindexes Figure 7.1. Decon-,:
have been developedto solve this problem, including multigroup indexes Areas.Source:Re-,.::
Ruie .
The Sever.rth 223
, ": 1:- ,r: LII lTlortolity rates (Reardon and Firebaugh 2002). Segregationindexes measlrre the extent
- ' : to which the local environments of individuals differ, on averageJon racc
- ...:<.1 states. As I(ita-
iJ ot socloeconomlc or some other population characteristic.
- : . .. i .i l it iesof r educ in g If our aim is to reduce the overall racial segregationof schools,we
' , ,,rditionsir-rthe pop- must first determine where the segregation is located-between city
-
'' : Ji rt r r enc c sin m or- schoolsand suburban schoolsJBetweenschool districts within qities or
i i - 'i: i -.:ero geneit Yin adul t suburbs?'sfithin school districts in central cities or in suburbs? Obvi-
:: : - -: '::: i n adult lif e s pan s ously the reduction of segregationwithin school districts will have lim-
:-: - :rtPu lat ion,s inc et he ited effect if most school segregationlies between school districts.
- . . r : -: --- :h .r n ot her r ic h na - Likewise reducing segregationacrosssuburban school districts will have
:- -: i: :ite t':rriancein adult little impact on overall school segregationin a metropolitan area if most
. , - -..-: :he lirriance in adult of the segregationlies within the centrai city, or betweenthe central city
- - . - .-.:-. i . and the suburbs.
' im ilar in s iz et o
t-:. - :-..: nations. $Thatever In fashioning policies to alleviate school segregation,then, it is helpful
:,: : : - : ::1:l lacial grOuPS,we first to decompose segregationindexes to pinpoint the principal sources
: - .-: ,n . rr ir hin- r ac ev ar i - of the segregation,so that vou knou- u'here to concentrate your effort.
.; , . : ,1-1cl - as s ,and c aus eo f Sean Reardon, John Yun, and Tamela \{cNultv Eitle (2000) show that
- - : : :.-.:\'.1riance is likely to Theil's entropy index of segregation(H) can be additivelydecomposed,so
: * : l: :l-]flS and reveal where overall segregationcan be expressedas a sllm of within-aggregateand
-:: : rhe greater life span berween-aggregate segregation.rvhere"aggregate"refersto an organiza-
tional unit, such as school district, or to a demographicgroup, such as
racial group. Very often the n-ithin-aggregate component can be decom-
posed further; thus, for example,segregationr,vithina suburb can be fur-
ther partitioned into the part that is due to segregationacross school
.: :. .'..' local environments districts in the suburb, and the part that is due to segregationacross
:: - :-.: : such as race or so- schoolswithin school districts.
.: - . : .\Jmple, refersto the Consider figure 7.1, a fir.e-partdecomposition of school segregation
:r - -- :-a()ols .T he m or e t ha t in U.S. metropolitan areas.In this schemeoverall school segregationfor
:-: -' :: :i-r;rta student'slocal
* : -: . : :- o r s he at t ends ,s o
Total metropolitan school segregation,as measured by H
,* :-- : -r: the as s oc iat ionbe -
i : -.,," :.11immediatelywhen J
- - . -, . - 'r hcrtr ac ialc om pos i -
IWe Citiesvs. suburbs(I) e+
:- j -, '. rre not related.
': - ::'-'.rlmum or C0mplete (II) <+ Within-suburbsegregation
lfithin-city segregation (III)
,- : :: ri'hitesattendingall-
\i - ,. l,-rtinosattending all-
Across school districts (IIa) Across school districts (IIIa)
: -: : : : :-t:i on o f s c hoolsis ne i - \Within school districts (IIb) \X/ithin school districts (IIIb)
' : r .r-1 ::-:1i he tr v o ex t r em esIn
.
:: ' :..:-:-.'.rhetherschool segre-
- -:---:: ,l tegregationindexes Figure 7.1. Decomposition of School Segregationin U.S. Metropolitan
-. -- -,-...: multigroup indexes Areas.Source: Reardon, Yun, and Eitle (2000).
224 . Chapter7
metropolitan areasis first partitioned into three components: the part that TrYrv 7,l
is due to differences in racial composition of the central city as a whole C ompo nentof
s Sr 'hoo.
::::
yersus suburbs as a whole, the part that is due to differences in racial Metropoli tan A re.rs
composition across schools within central cities, and the part that is due
to differences in racial composition across schools within suburbs. The
second and third parts-within-city and within-suburb segregation-are
further apportioned into segregationdue to differing racial compositions
I. Segregationof citr r.. - -
across school districts and segregationdue to differing racial composi-
tions across scbools within school districts. So there are five components II. Within-city segres.i:.-
in all (I, IIa, IIb, IIIa, IIIb), and they sum to total segregationfor the I I a . A cr o sssch o o l .:: -
-
metropolitan area. I I h . W i th i n sch o .,li .: -
Note the differing policy implications for the components. Compo-
I I I . Wi t h i n - su b u r l .>q :- r- .:
nent I can be reduced only by convergencein racial compositions of I I I a . Acr o sssch o o l - .:'
cities and suburbs as a whole. Component II is reduced by convergence IIIb. \Tithin school c .::
in the racial composition of schoolswithin central cities. By decompos-
ing II further, we can determine whether significant reductions in school
segregationare possible by changing the racial composition of schools Sozrca:
Reardon.
\'Lrr:.
.::.: : :
within school districts (IIb), or whether changesacross school districts
will be required (IIa). The samereasoningappliesof coursefor reducing segreg at iont o t hs t , r . . . . : -
school segregation within suburban areas, where components IIIa and lessthan one-third rr: ::-:::
IIIb are strategic. S econd,r he lr r r sr - : . "- .
Simultaneousdecomposition along two dimensions is also possible nori ty c om posit i, r n, r - : - -
j .:
with H. For example, in their analysis of average school segregation in D i ffere nces
in r hc i. i, .
2L7 metropolitan areasin the United Statesin 1995, Reardon,Yun, and suburbscontributc. -',:-- -:, -
Eitle (2000) partition simultaneously by the five-part scheme above and overall white/blackH:.:..-
by race. In the case of race, they u'ant to know how much of the total school s would r edu,
school segregationis due to the white-minority divide and how much is (36.9%*3. 9oot in r h: . r . -
due to segregationamong minorities themselves(blacks, Hispanics, and Thi rd , segr egr t r r , '.
, r,... - -
Asians). There are two race components (segregationof whites from mi- quarter of all m er r ,':
norities, and segregation across minoritiesa) in their scheme, so the central cit ies, m ost : ; i: : jl
Reardon-Yun-Eitletwo-way decompositionyields 2x5= 10 components thern.The opposirerr ::.-
tf school segregationin U.S. metropolitan areas.Becauserheiruesrlt, ,o posi ti on acr ossscho -
nicely illustrate the value of decomposition work in the social sciences,I encesi n r acial cur r r p,. . :
reproduce them in table 7.1. reduce school segres-:.:
From this two-way decomposition we see,first, that about four-fifths of mated 1 8. 6 per cenr .
the overall school segregationin the 217 metropolitan areas arises from ln shorr, decomp,. :
the segregationof whites and minorities. This means that eliminating all in pinpointingu'her. :,
segregation among black, Hispanic, and Asian students would reduce pol i cy i n t er vent ionii. r '..
overall segregationon averageonly by about one-fifth. Reardon, Yun, and Y un-E idedecom poiir, -
Eitle (2000, p. 358) note that the contribution of black/Hispanic/Asian segregation within ';l'.' -
l essthan l0 per cenr , : :
aBecausewhites are considered a single race, there is a within-group segregationcom Srates,whereassegrcl.'::
ponent only for minorities (blacks, Hispanics, and Asians), and not for r'"'hites. for nearly25 per eer r,r : : -
The SeventhRuie . 225
we know that policies to reduce segregationwithin school districts have context ef f ect s.t he: . ''. - .
a potentially greater payoff (for the reduction of total segregation)when nation. This inter..:
applied to city school districts rather than to suburban school districts. w l tcrrwc f ushr ur r , . . : : .
few i ss t r es
of t t t , , , .
ior is affc,,tcelb, -. ',.... .
THr Errscrs oF SocIAL CoNrpxr
One of the central notions of social scienceis that social context matters. Context Effects .7-sQf7'. --;-.
Growing up in a crime-infestedpoor neighborhood, for example, surely is
Let' s begin br Jct , : : " - -
worse than growing up in a safe middle-classneighborhood. Note, criti-
key soc ial and c, : ' - -
cally, that this sort of effect is due to the child's neighborhood context,
nei ghbor hoodsh. r '- - -
not to characteristicsof the child or of the child's family. To determine
school sar e bet t eri. . : : - . - - -
the effects of neighborhoods, the telling comparison is not between poor
el s of ra cial \ eqr c! . : :
children and middle-classchildren, but betrveenpoor children who grow
therei s f l cont e\ t t r i: - . .
up in poor unsafe neighborhoods and equally poor children who grew up
a c t e r i s ti cs o f L o l i tL \:.
in more affluent, safer neighborhoods.
i n d e p e nd e r r r o l 'tl i . - ... - - .
Although it might seem virtually self-evidentthat children who grorv
a m p l c . th e r e i s ,r .. ^
up in bad neighborhoods will tend to fare worse in life than they would
d e n t s i n i n n cr cr tv r ."
have had they grown up in better surroundings, that claim has proved de-
s t u d e r i rs i n su L ,r r r I.'.::.
-,
vilishly difficult to demonstrateempirically. The major roadblock is selec- Moreform.rlll.- '- -
tion bias, a problem we encountered before, in our discussion of causal
/ contexts
{forrr.r"': ..'
inference (chapter 5). Becausepeople self-selectinto neighborhoods, it is
val ue on s ofl te i l Lr:- " .
hard to determine whether differenceswe observe across neighborhoods
andX denot er h. - . .
are due to the neighborhoods themselvesor to the way that people sort ,t
(tor eram ple. . r \ t - r - :
themselvesinto those neighborhoods. Returning to the example in the pffprt p v i c t< u' l t,' rt \'
previous paragraph, suppose we find that poor children in unsafe neigh-
Ieuclcauscs of \. :':. . '
borhoods fare worse than poor children in safe neighborhoods: That
put it. "The indiri-l:.r .
difference might reflect, not a neighborhood effect, but the effect of
the questionis u'h.:.:.--'.
unmeasured family characteristics (such as differences in parental com-
al so exe r t sso. iel . ', : . . . '
mitment to their children) that would induce some poor parents to sacri-
dent of r he inf lucnl- i - . - - -
fice more than other poor parents do to live in a safer neighborhood for
In a classicst uJ, : . '.
their children.
l eagues ( St ouf ier . t l
Context effects,then, are both important and contentious in the social " prevalenceof vr l. r c. .
sciences.There is, for example, the question of whether racial segregarion
havi orsof Anr er ie. r r r)
(a contextual property) has negativeconsequencesfor minority groups-
that i nexper i5'q; sl. r , . : : - -
and, if so, what kinds of segregationlead to what kinds of harm. With tho. .r!au\
.".1, i,,. ,, .
LrrL/ vvl...
'* rr ivl !'.--
respect to residential segregation,for example, Douglas Massey (2004)
recently has claimed that the segregationof races leads to poorer health
among African American adults. \fith respect to school segregation, ' l t i s important r(' r' :.
there is longstanding interest in the effect of racial segregationon the ac- rrf popul lti on (utnp,' ;tt: '
ow q ro di f fererrc es i rr .rr r-
ademic achievement of minority students (Card and Rothstein 2006;
are hi ghe r i n c ountrr . l rl : -
Coleman et aL.1966;Charles,Dinwiddie, and Massey2004; Hoxby and ferencei n ferri l i rr i ' J L,. :
Weingarth2005). Becausesome of the burning issuesof our day turn on rhe w onten i n c uurrtrr'- \ :: -
Rule .
The Severrth 2)7
:-:r school districts have context effects,there has been a reslrrgenceof interestlately in their estr-
- :,,tal segregation)when mation. This interest is rvelcome. 'Wc must "let method be the servant"
- -.:ian schooldistricts. when we fashion our methods to addressissuesof import, and there are
few issuesof more irnport in the social sciencestl'ranhow hunran behav-
ior is affectedby social conrexr.
units-an indication of the influence of the vererans (who believed the most other sar e poor ir J--
inexperienced soldiers were not ready for combat) on the inexperienced w here ut her sir r e lr ul l.
soldiers. el assi rr a r r r iJcllc-1.
. n.
Interest in context effects among social scientiststoday centers on the classneighborhood.
effect of peers, schools,and neighborhoods on life chances,especiallyfor I usea st u. l1 f lc, Lt.,-
the yonng (Borlas1995; Cutler and Glaeser1997;Hoxby and Weingarth and n study f r r t m . o. 'i" '
2005; Massey and Denton 1993; Mouw and Entwisle 2006). The intel-
F XAC T R BA TI N C C O \ TI \i' .
lectual forebear for much of this research is the so-called Coleman Re-
port on the stateof American educationin the 1960s.In one of the largest In addressingthe quesr:,::
and most influential social sciencestudies ever (the sample consisted of E dw ard G laeserllvu-
more than 150,000 students),JamesColeman and colleagues(1966) found raci alor et hnicgr or r p\- -
that school effects on student achievementare relatively small compared groups in question. "C':-..:
to the effects of family background, and that disadvantagedblack stu- w ri te (p. 827) , "especL- r - .
dents profit from schooling in racially integrated classrooms.The surpri- within a segregatedsr(r-.: :
sing finding that school effects are relatively unimportant prompted a S o the q uest ionof r r h. : : ', - -
school-effectsresearch literature (searchingfor such effects) that contin- ring to a raciailv sepirr.r;* -
ues today. The finding that black students may benefit from school rnte- i s an ernpir icalqt r . . r i,'r '. , ''
gration served as a catalyst for the implementation of busing to achieve U.S. cities,Cutler :rnd C'.,-.
racial integration. That conclusion also served to boost interest in re- sinceblacks in more Ssgi;i,
search on the social consequencesof residential and school segregarion of schooling,emplovnre;::..
(researchthat continues to flourish today). regatedcities.Becauseb....-
In thinkingrabout the effects of social context, it is important ro note these three outcomes C..: ,
that some ch/racteristics of contexts have a counterpart at the individual context (racial segreq.ru,:'.
level and some do not. A city, for example, can be characterizedby aver- l eveleff ectof heing l. l. r . .
age household income, or by the percentageof residentswho are white or
who are renters.'Wecan call theseaggregatiueproperties. Other character- .\IIT IGAT INC C O N TE . \ 1 ' L I : :
: -: ,: -:,1ns(rvho believedthe most others are poor is less alienating than being poor in a neighborhood
: - ^'".: on thc inexperienced where others are not poor. Similarly, one might be better off to be n-riddle
classin a middle-classneighborhoodthan to be middlc classin an upper-
r :. ": ...:is roday centerson the classneighborhood.
,rr - rr ihances,especiallyfor I use a study from economicsto illustrateexacerbatingcolltcxt cffccts
- . :. . ---. H oxb . vand' W einga rth and a study from sociology to illustrate mitigating contcxt effects.
.- - " .,rsl e2006) . T he inte l -
EXACERBATING CONTEXT EFFECT
- : - : . '-^ rs.In one of t he lar g e s t In addressingthe question "Are ghettos good or bad?" David Cutler and
:-: : :r :1e SampleconSiStedof Edward Glaeser (1997) begin bv noting that the spatial separation of
-i- . , -: ( 1966) f ou n d racial or ethnic groups could have either positive or negativeeffects on the
-i )l l eagues
- .rri rell s m all c om pare d groups in question. "Ghettos mav have benefits as well as costs," they
- - -'. -.i .l J\ent aged blac k s ru - write (p. 827), "especiallyif thev allow for mixing acrossincome classes
:: :- - : : :;l a ssro om s . T he s ur p ri - within a segregatedgroup and for positive spilloverswithin that group."
: - . :'. rpor t ant pr om pt ed a So the questionofwhether ghettos(usedin the nonpejorativesense,refer-
. . ! , ,L !^l:f^^r.
, ' ! ! rr/ r +l^
r,,at con tl n- ring to a racially separatedcommunin,) are good or bad for their residents
r: .-: :.nefit from school inte- is an empirical question. On the basis of their analysisof censusdata for
.. o f hr r s inot o ac hie v e U.S. cities,Cutler and Glaeserconcludethat segregationis bad for blacks,
: . : : . : : :r) boos t int er es tin r e - since blacks in more segregatedcities are significantly worse off (in terms
. j :- i. , . :nd sc hool s egr ega ti o n of schooling, emplovment, and single parenthood) than blacks in lessseg-
regated cities. Becauseblacks are also worse off at the individual levei on
-::'.:. lt is important to note these three outcomes(Cutler and Glaeser,table IV), the effect of social
-
- : -::rnart
' ' _ r *_ _ at t he indiv idu a l context (raciai segregation)in this instance exacerbatesthe individual-
:. -.:.:-. :.- characterizedby aver- leveleffect of being black.
i: : -..idents who are white or
M ITIGAT IN G C ON T EXT EF F EC T
:: , --.' :erties. Other character-
rr .: -, . ..r.h as t he jur is dic t io n a l JamesDavis's(1966) studv "The Campusas a Frog Pond" providesa nice
::- - - r ::n { i ndiv idualc har ac te r- example of a mitigating context effect. The term "frog pond" here refer-
: : . -: - : i vi dual lev el( S elv inan d encesthe saying that it is better to be a big frog in a little pond than a big
frog in a big pond. We expect frog pond effects where "successis judged
: :: :crties can be classifiedas by relativestandingin the social group, not by standingin the total popu-
-::: : rr:rcerbatingeffects,the lation" (Davis 7966, p.25). The collegecampus, Davis suggests,is one
' :- - - ':.c the s am es ign. I n t h e such contert, since student grades tend to be about the same across
-. ::: :le individual-leveleffect schools, regardlessof selectivity.This tendency of collegesto grade on a
curve has real-lifeconsequences becauseit implies that a studentwill tend
:. ...'. >rienceinvolve the possi- to obtain higher gradesin a lesscompetitive college, and gradesaffect ca-
-::'.-: \fle know, for ex- reer decisions.
-ifects.
: :-r::: on individuals;but does Davis's central concept-that what matters is one's reldtiue rank on X
:: :.j.::::, trttlladvantage? And, at (for example, one's ability relative to others on the campus)-has been ap-
:r::.:-:-i!-$'ork in the sameway, plied widely, from accounting for revolution (that revolution is fueled by
-r- :-. rndependentof the effect relative, not absolute, deprivation) to explaining satisfactionwith income
:i.: :::;ctS instead mitigate the (what matters is one's income relative to the income of one's peers). For
',., r :
-r neighborhood where our purposeshere, the critical point is that X,,(the individual value) and X
230 . Chapter7
(the value for the conrexr) have opposing effects. In Davis,s example, ca- \\-illlace.(In both surve\':. i - -
reer aspirationsare boosted by high academicability at the individual level .rad voted for'Wallacc.r
but lowered by high academicability among an individual's peer group. The naiveexplanationr: .
1968 election,rvhites\\ ci ' -
Context Effects as Nuisance heavi l ybl a ck dist r ict s.I n , ':
j or rol e i n t hat elect ion.. \ . : :
In the last section we observedthat, iust as individuals can be distin- lace,their presenceaffectr.: : :'
guished on the basisof their values on some attribute X, so too neighbor- .' -
the proper multilevelmocl.
hoods and other social conrexrs can be distinguished on the basis of the .Y,,(race of respondentli -- -:
prevalenceof that attribute among those in each context. Becauseit could rhe odds of voting for \\: ,-
be said that thesediffering propertiesof social conrextsput the "social" The problem is thitt \\'- ..-.--
in social research,determining the effect of social context is fundamental rhe individual-leveleffect :r-,
to social research.Increasingly,social researchis multilevel, neasuring both l eve ls.I n t he cir se( ) : i. ; .
properties of conrexrs as well as properties of individuals. In many in- ..'
that we cannot obserle t
stancesthe goal of the researchis to estimarecontext effects. amine data aggregate d br :-..
Unfortunately, data in the social sciencesdo not always permit us to postelectionsurvevs.For :-.'
separatecontext effects from individual-level effects. In those insrances sinceavailabledata vel\ L\:.:
context effects can be a nuisance. consider again rhe case of Governor time to poll individual'.
George'Wallace's bid for the presidencyin the 1968 U.S. election(chaprer Context effectsArc a n'.. :.'- - i
1). Becauseof Wallace'sreputation as a segregationist, we would nor ex- to estimatethem (irnd th-.. : . '.
pect ril/allaceto garner many votes among blacks. Yet, when we look at the most comfiton occur:.. ':
the vote in the 1968 election, we find that George Wallace received a inferencesabout indrtidl:,'. ^: - -
greater share of the vote in regions with higher percenragesof blacks. ti mesresear cherignors . : . '. . "
Across congressional districts in the South, for example, the correlation sions about individutl. :: :- .-
was r=0.55 betr,veenpercenr black and percent who voted for Wallace W i l l i am S. Robinson 1- i -- :
(Firebaugh 1978).'we see the same pattern acrosslarger regions in the
calleda fallacy because.i rr't, :' -
United States.\7allace's srrongestshowing by far was in the south, where shipsat the aggregatelir.. :: ::
blacks constitute nearly 25 percent of the population. In the East, where ci al conte xthasan el'lc, : .
blacks make up only about 10 percent of the population, \Tallace fared w e should avc, iddr . t , ': : - -
much worse, capturing a much smaller share of the vote (lessthan 6 per- aggregaterelationships.i:'i. , --
cent of the vote in the East versusmore than 20 percent in the South).6 show, relationship'.tr rlt= " - - -
The naive erplanation is that blacks must have been more inclined than posite directions.
other racial groups to vote for \fallace. This explanation is quickly ruled Aggregationbias is ,rr-.:-. :
out by postelection survey data. In one postelection survey, the 196g , t he Pr oblr : '
sci encesbut ::-
American National Election Study (ANES), we find that, of the g7 black and epidemiologl'-)u'h.':. - ,:
respondentswho said they voted, not a single respondent had voted for growing technical liter.r:,... -
\X/ailace.Among the other respondenrswho
vored, fully 1 in 8 said they solutionsto the probler:.
had voted for rwallace.The GSS resulrs for 1972 and 1973 confirm the
ANES findings: of the 237 black vorers, only 2 said they had voted for
C nrrrcer TEsr s r N Socl. . -
6These figures are based
on the 1972-73 General Social Sur'e1'. To complete the ac-
count, \(allace recervedless than 10 percent of the vote in the west and Midwest, u.here In this book I haverepe;:.:
blacks constitute roughly 8-9 percent of the total N in the 1972-73 CSS.
ods and m ult ipledat . r . . - '
Rule .
The Seventh 23I
-'' I-t.rvis'serample, ca- Wallace. (In both surveys,11.3 percent of respondentsoverall said they
. rt i h c indiv idunllev e l had voted for Wallace.)
,, The naive erplanation is off the mark becauseof context effects. In the
- rdual'speer group.
1968 election,whites were more likely to vote foI'Wallaceif t|ey livcd irr
heavily black districts.In other words, social context effectsplayed a ma-
jor role in that election.Although blacksthemselvesdid not vote for'Wal-
-. . :Juals can be distin- lace, their presenceaffected the propensity of wbites to do so. Thus, with
--' -,.:- X, so too neighbor- the proper multilevel model we would find opposing signs for the effect of
-. ,.'.;.1on the basisof the X,, (race of respondent is black) and X, (percent black for the region) on
- - - :-.ic\t.Becauseit could the odds of voting for \(allace.
-r\rS nr r rt he " s oc ia l " The problem is that we cannot always separatethe context effect from
,- - ,ntext is fundamental the individual-level effect becausewe do not always have accessto data at
. :l rrrlt ilev el.
m eas ur i n g both levels.In the caseof elections,for example,the secretballot means
- - r rJuals .I n m any in - that we cannot observehow individuals actually vote; at best we can ex-
. :\teffec t s .
- amine data aggregatedby precincts,or we can (in someinstances)rely on
- - . : .rin'ays permit us to postelection survevs. For historical data we often are even more limited,
: : ' :.i 5 . l n t hos e ins t anc e s since available data very often are aggregated,and we cannot go back in
- . - :'. rhe case of Governor time to poll individuals.
Context effectsare a nuisancewhen we do not have the appropriatedata
- . - : : :ri st.w e would not e x - to estimatethem (and thus must simply assumethat they do not exist). In
: , - - -. \et, when we look a t the most common occurrenceof this problem, a researcherwants to draw
: r:L- Wallace receiveda inferencesabout individual behavior on the basis of aggregatedata. Some-
-,, :ercentagesof blacks. times researchersignore the possibility of context effectsand drarv conclu-
' i- rr:,inrple,the correlation sions about individuals from aggregatedata anyway' committing what
- - - - .,.ho voted for \fallace rWilliam S. Robinson (1950) called the ecologicalfalldq'. That practiceis
: - : .. l .rrgerr egionsin t h e called a fallacy becauseit makes the problematic assumption that relation-
'. - '.,..rs in the South,where ships at the aggregatelevelmirror relationshipsat the individual level.If so-
: . :.::(rnIn . t he E as t ,whe re cial context has an effect, however,there is said to be aggregationbias, and
: - : . :'.l l .rtion, ' W allacfear e d we should avoid drarving conclusions about individual-level effects from
-: : ::'.- r'ote (lessthan 6 per- aggregaterelationships. Indeed, as the 1968 presidential election results
- - :-:.ent in the South).6 show, relationshipsat the individual and aggregatelevelmay evenbe in op-
- , : .-.cn rr.roreinclined than posite directions.
- : . : ,.:rition is quickly ruled Aggregation bias is an issuein virtually all the social and behavioral
' . . : :-i i ol l S ur v ey ,t he 196 8 sciences,but the problem is particularly acute in fields (such as history
- " I rl r:t. of r he 87 blac k and epidemiology) q'here data often are not available for individuals. A
:: :..ttl d ent had v ot ed f o r growing technical literature (for example, King 7997) continues to seek
: ,:. fu l l r . 1 in 8 s aid t he y solutionsto the problem.
- -- - -:nd 1973 confirm the
: ' - :,.-rdthey had voted for
CRrrrcer Trsrs rN Socrer Rnsr,eRcn
: '. : r. T o c o m p l e t e t h e ac-
I rsr and Midrvest, where In this book I have repeatedlyemphasizedthe importance of multiple meth-
- T C;S S .
ods and multiple data sets in social research.In chapter 3, for example,
232 . Chapter7
I cited a number of examplesof multiple methods used as validity checks- scores rand ior r 'r r"
testing the plausibility of the claim that Bush lost 10,000 votes in the was Steepesttor n',r- ',',
Florida panhandle, using ethnographic data to interpret survey responses that i n 1993 black , : . .
on race, using foilow-up in-depth interviews to investigateabuse wirhin Qrral i 6 cat ior r Test t i .
farnilies.hr chapter4, I extolled the virtues of identical analysesof paral- scoredin t hc t t - , pi: , , . :
lel data sets. thirty yearsearlier.\,.^: - -
In this section I want to return to the reality check rule and the repli- the A F'Q T ear ned, . -
cation rule to make the point that progress in social research is often a the nrtionll r \ o , l- i
matter of progressivelyeliminating rival explanations by a seriesof criti- seoreslor hlrt.kr i: -
cal observations or tests. In some instances a single critical test might earni n gsf or hln. \ - : '-
suffice, as in the earlier example (chapter 5) of John Snow's classicinves- The quest ionoi, ' : .
tigation of how cholera is spread. More often, multiple causes are in- question of r','hat is ;,.
volved, and progress in sorting our those causesis often slow and halting. subj ectof a som er r : : : . .- ,
To illustrate the latter, consider the longstanding line of research on the first demonstrated::- :
black-white test score gap in U.S. schools.I begin with three observa- and sinceth.n .r.,.rp,,.
' -.
tions: (1) The gap is real. (2) The gap is importanr. (3) The causeof the starts.a s Jencks. 1r 'J
gap is the subject of a vigorous debate today in the social sciences.
First, the test score gap is real. Although the black-rvhite gap has nar- B y now r hiser '. - '. , -
rowed considerably in recent decades-the black-white reading gap was bl ac k- whit eg, . r p - .
cut by almost one-half, and the math gap by one-third, from 197I to The debat e
'r . r r r .
1996-the average black student in the United Statesstill scores below al l y pr ecipit et c.. : '
r,,i l ., -^-..t..^
ruil)/ rg)(r\fu.
r L,.-
r'1.. -.. :
three-fourths of white students in the United Stateson most srandardized
tests (Jencksand Phillips 1,998a,p. 1). On some tests the gap is even end t-rfrhe deb-rt. : .
greater, with more than half of black Americans scoring lower than 85 It ha pper r ed, ls. r : . . '
percent of white Americans. The black-white test score gap is no longer much can 11-gfqr, :i
disputed, since it is found in study after study (for example, Bali and Al- E du calionalR. : ; - . .
varez 2003; Hedges and Nowell 1998; National Center for Education
Most l ikelv t her c . r : . -
Statistics2007). The gap appearsearly, even before children enter kinder-
senseth e t esrgJp f i: 1. '-
garten (Fryer and Levitt 2004; Wesr, Denton, and Reaney 2001), and it
as note d above.BLr :: : .
persists into adulthood. There are similar, though typically smaller, resr
cases:Tr y t o f ind ir ': - '
score gaps between white students and Latino students.For our purposes
That i s r he appr or r eh: . - .
here, it sufficesto focus on the black-white gap.
possrb leexplanlt ion- '
The black-white test score gap is enormously important because "re-
ducing the black-white test score gap would do more to move America to- o On e hypot hesi.r . :
ward racial equality than any politically plausible alrernative" (Jencksand gap. If so, then r.r.
Phillips 1998b, p. 51). Scholarssuch as ChristopherJencksand Meredith tively constairr , .
Phillips believe that is the case becauseof pretty strong evidencethat, in ".
scoresfor bl,rck s:-.
today's world, reducing cognitive inequality would go a long way toward raisedin u,hite f;::'
reducing economic inequality. On the basis of recent data on test scores
and earnings from the National Longitudinal Survey of Youth (NLSY), -A mo np
\r l rdel r. \\ -
for example, we find that the black-white earnings ratio for men (black
than rvl t i te \tudent\ t' , r-:-
earnings as a fraction of white earnings for men in their thirties) increased poi nr our . equ:rl i z i ng. rt.t .- -
for men with lower-than-averagetest scores, for men with average test ment as well.
The SeventhRule . 233
.: Lrsed as validity checks- scores) and for men with higher-than-averagetest scores, The ir-rcrease
.(.)st10,000 votes in the was steepestfor men with higher-than-averagetest scoresrhowev.r, ..,
-rerpretsurveyreSponses that in 1993 black men who scoredin the top half of the Armed Forces
investigateabusewithin Qualification Test (AFQT) earned 96 percent as much as white men who
. .:rrri ci lanaly s esoI par a l - scoredin the top half of the test. This is in sharp conrrast to the results
thirty years earlier, when black men scoring abovethe national averagcor1
;heck rule and the repli- the AFQT earned only 65 percent as much as white men scoring above
: ,cial researchis often a the national averageon the AFQT. In today's world, then, equalizing test
,::''rrsb v a s er iesof c r it i - scores for blacks and rvhites should go a long way toward equalizing
-.:rqle critical test might earningsfor blacks and whites in the United States.T
,rn Snow'sclassicinves, The question of hou to reduce the test score gap of courseturns on the
. nrLrltiplecausesare in- question of what is c:tusing the gap. The cause of the gap, in turn, is the
. ,rtfen slow and halting. subjectof a sometimesheateddebatein the social sciences.The gap was
- ilne of researchon the first demonstrated in the testing of U.S. Army recruits for \World 'S7arI,
r:,n with three observa- and since then empirical researchon the subject has proceededin fits and
. ...:.r3 ) T he c aus eof t h e starts,as Jencksand Phillips (1998a, p. vii) explain:
- .r;k-rvhite gap has nar- By now this cycle has becomepredictable.First someoneclaims that the
. -..'.l.rrtereading gap was black-white gap is largely innate. Then skeptics challenge the claim.
:-:-thi rd, f r om 1, 971t o The debate starts off largely unencumberedby evidence,bur it eventu-
::-rtc-sstill scoresbelow ally precipitatesa number of useful empirical studies.The issuers not
::: ()n most standardized fully resolved,but the number of plausible explanationsis smaller at the
:-i tests the gap is even end of the debatethan at the beginning.This happenedin the 1920s.
. .coring lower than 85 It happened again in the 1970s, after Arthur Jensenpublished "How
much can we boost IQ and scholastic achievement?" in the Haruard
-. - .!ore g ap is n o lo ng er
: : cxample,Bali and Al- EdwcationalReuieu,in 1969.It seemsto be happeningagain.
,.C.enterfor Education
Most likely there are multiple causes of the test score gap, and in that
::children enter kinder-
"- sensethe test gap puzzle differs from the puzzle of how cholera is spread,
RcJnev200 | t , and i t
as noted above. But the methodological principles are the same in both
::l rr pically smaller,test
cases:Try to find critical testsof the predictionsof various erplanations.
,.:.nts. For our purposes
That is the approachthar has beenused with partial successin evaluating
possibleexplanationsior the test scoregap:
)rt ant be cau se " re-
: : ' . t a{
-: t ' ) ntove AmeriCa tO- e One hypothesis is that genetic differences account for the test score
.: :ernative" (Jencks and gap. If so, then racial differencesin rest scoresshould remain rela-
:: lcn' ks a nd Mere dirh tively constant over time, and there should be no difference in test
i::ong evidence that, in
scoresfor black studentsraisedin black families and black studenrs
.- ;r1 .r long way toward raisedin white families.But neither is true. As noted earlier.the test
r,.-:rr data on test scores
- '.:r' of Youth (NLSY), : Among students with the same t\\'elfrh-gradetesr
'-) r.rrio for men (black scores,black students are mctre likely
th a n r vhi te studentsto gradr:atefrom col l ege.Thus, asJencks and Phi l l i ps (1998 b, p.46)
::err rhirties)increased point out, equalizing test scoresshould rcclr-rce black-white disparities in educational attail-
:::rn $'ith average test m e n t a s w el l .
l3+ . L - h a n te r /
score gap has declined significantly in the United Statesin recenr de-
cades.Moreover, the preadolescenttest scoresof blacks are much
higher for blacksraisedin white families (Jencksand Phillips 1998a,
p.3), which is scarcelyconsistentwith the genericargumenr.Vir:tu-
ally all scholarsnow agreethat the causeslie elsewhere.
. Another hypothesislocatesthe causein the testsrhemselves: perhaps
the tests are unreliable and biased against some groups. If so, then
test scoresshould have little or no predictive ability of educational
and occupational attainment. But test scores do predict later life
chancesfor all racial groups, so for the most part researchershave
turned elsewherefor explanations of the gap.
e Another explanation focuseson differencesin school resources,as
rneasured,for example, by spendingper pupil. That explanation was
more compelling during the "separatebut equal" days of racial seg-
regation,where schooisfor whites and blacks may have been sepa-
rate but certainly were not equal.The explanationis lesscompelling
today becauseper pupil spending(someglaring exceptionsnotwith-
standing) on averageis about the same for whites and blacks (Jencks
and Phillips 1998a, p. 9). Equal spendingdoesnot necessarilyrrans-
late into equal educational quality, however, so the test score gap
may be due in large part to differencesin the way money is spent-
an important topic for further research.
. Other explanations focus on cultural differences. If cultural differ-
encesplay a significant role, then black children raised in white fam-
ilies should score higher than black children raised in black families
(as they do). Critical observationssuch as this strengthenthe casefor
the role of culture, but they do not tell us what cultural differences
matter. A particulariy influential cultural explanation, based largely
on ethnographic studies of minority students in schools (Fordham
and Ogbu 1986; Ogbu 2003), argues that black adolescentsare
characterizedby an "oppositional culture" that hinders their aca-
demic achievement.For example, in his classroom observations of
studentsin ShakerHeights, Ohio (an affluent suburb of Cleveland),
John Ogbu (2003) found that black students were more likely that
white students to arrive late to class, come without materials, and
disrupt lessons. Whatever the merit of the controversial claim that
black adolescentsexhibit an oppositional culture that undermrnes
their academic achievement(Ainsworth-Darnell and Downey 1998;
P. L. Carter 2005; Farkas, Lleras, and Maczuga 2002; Fryer and
Torelli 2005; Mickelson 1990; Tyson, Darity, and Castellino2005),
differences in adolescentculture alone cannot account for the resr
score gap since, as noted earlier, the gap appearswell before adoles-
cence.
The SeventhRule . 23,5
..' :..; L'r.rited Statesin recentde- This brief summary scarcelydoes iustice to the richnessof the literature
-:: - . : . ; t tres of blac k s ar e mu c h on the test scoregap. My purpose' however,is 1ot to provide an overview
-- -. Jcncksand PhilliPs 1998a, of researchon this issue,but to illustratehow social researchcrslook for
, .:.-.: -.- !],eneticargtlment. Virtu- critical observations or tests ttl evaluate rival explanations, peeling off
: - : . - r: l i e e l sewher e' various explanationsone by one as they fail to predict accuratelyacrossa
, -: :.r. teststhemselves: PerhaPs varietl'of settings.
: .-,-, ::>t some groups. If so, then
:-: : rrf i ve a bilit y of educ at i o n a l
r r : : . i i o r€s do pr edic t lat e r l i fe C o NcrusIoN
' : :..: :ttost part fesearchershave
_ _ --
.,-.r ''1n
F!!r,
Even a cursory comparison of empirical work in the social scienceiotlr
-. ., j: : . : c e s i n s c hool f es our ce sa, s nals today with published work fifty vears ago will show how far we have
'- ^ ' :rrpi l . Thar ex planlt ion w a s come in terms of the sophistication of our estimation methods. For the
.. ,-.:: r-rt cqual" days of racial seg- most part, the greater sophisticationis good. There is the danger, how
- ::.- :.lircksmay have been sepa- ever,that researchers will come to rely on estimationmethods as substi-
-. :\Fli.tflxtionis lesscompelling
- tutes for careful theorizing and conceptualization.An irnportant theme of
, .:'.. !lllring exceptionsnotwith- this chapter is that we cannot collnt on technicaltricks to remedy defec-
--::r; :,,r s'hites and blacks (Jencks tive theory or researchdesign.
:: -,l..-.:doesnot necessarilytrans- Another danger posed by increasingstatistical sophistication is that the
,' . .--\\\eler, so the test score gap tools of researchwill overshadowthe substanceof research."Let method
:--:: itLthe way money ls spent- be the servant, not the master" is meant to be a poilted remilder that the
:"1:- --. researchproblem should dictate the choice of the tool, not the other rvar-
-:r ;ifierences.If cultural differ- around. Statisticalmetl-rodsshould not be the motivators of our research.
r -:,. ,hildren raisedin white fam- Nor should statistical methods be an afterthought: Data design and es-
-. ,.- l.lren raisedin black families timation method go hand in hand. Sometimesresearchersdo not think
:,-. -..ls this strengthenthe casefor much about estimation until the data have been collected. That is r iris
: :;., us n'hat cultural differences mistake because,generailyspeaking,estimationtechniquesare ill-suitecl
,.. :-::.rl erplanation, basedlargely for overcominginadequaciesin researchdesign.Researchdesignis rtL'out
: :'. )iurlents in schools (Fordham collectingthe sort of data that enablesyou to estimatethe right rl-t1t1q. \t-r-
.:..:..;! rhat black adolescentsare tistical estimationis about providing reliableand preciseestimates. To erl-
.. , , . , r u re " th at hinder st heir a c a - sure reliableand preciseestimatesof the right thing, we must not tlir i'r.e
. :- :ris classroomobservationsof researchdesignand statisticalestimation.
,::- -rftluentsuburb of Cleveland), The secretsof the social world are not easily revealed,so \\'e ll.-d to
:-.\ ):'.ldentswere more likelY that find all the leversand apply all the leveragewe can. To advanc.-in ihe stl-
-.-:::. Cortl€without materials,and cial sciences, we need to hone our skills in the use of natur,rl .\perllllcllts
.: : ,l the controversialclaim that and decompositionsand critical observationsto enticethe sclcL.rl g 61lJ 1e
r ::-rrnol culture that undermines revealits secrets.Regressionanalysis-the workhorse l.nethodin :i.l; st,ci.rl
:.'. :lir-Darnell and Downey 1998; sciencesfor well more than a quarter century-should not L'ed:.;.rr.i"d.
,.. ...:rd-\,{aczuga2002; Fryer and but used more thoughtfully,and appliedstrategicalli'inr,,r.rtLrn.i:,,n u'ith
. . :-..Dirrity, and Castellino2005)' other approaches.Above all, what we need in sociirl res.r:;i.t tt.,.l.rvis
,r , ,:rc carnnotaccount for the test more imaeination.
::i i.lp .lppearswell before adoles-
.. "- . - . , ". t . * t t
A P P E NDI X v106 is cocle. l1= , ,
uhaugcLlt ssc. ' '. . ,t
\'arra[)-tc v ,/ U + - . . '
StudentExerciseson Rule 7 dr r t e, 2=r ot cJ .-.
nrr tv r.rtrdi ,l .rr,'
var iablecalled t hr r ; r . :
,, , l -: , -,g-V*l
1,..,,,,
d lllll l\ !.'..Y
-.
D em oer ; t , \ . ',
The lessonto be learned in this exerciseis that, due to contexrual effects, vot cd f ur . r t lr r : ; - : . '
relationships observed at the aggregatelevel do not necessarilyreflecr re- el ect ionslr or l I - : '
lationships at the individual level. In fact, as this exercise demonsrrares, for Georgc \\ .r -,
individual-level and aggregate-levelrelationship can even be in opposite fi lt er " t o i: t r l. r : :: - -
directions. e The ner t , r cl .
The more general point I wish to make, in line with rule 7. is that bl ack. Ent er r { 1or .
rnethod without substancecan be dangerous. In the 1968 election, re- ti o n. O b: cr r r : . ".
gions with a higher percenrageof blacks tended to vore at higher rates for no w \ O t e 1, , ' t '.
George Wallace. I7ithout an undersrandingof the issuesof that day, yt_ru l ^ . ^, - . , . ; . , 1, . . .
might be tempted to interpret that positive associarion to indicate that ri onsas \ oLr\ : - . - -
'Wallace.
blacks voted disproportionately for Yer, as you will see, black vo t er sin t lt . - . : " : . .
votes for'$Tallacewere extremely rare.
'
The routine should be familiar by now. Go to the web site http://sda Question l. \ \ cr '- -- .
.berkeley.eduand follow the instructions below. The instructions should GeorgeWall. r c. l. i :-. ..
enable you to generateall the staristicsyou need to complete the assign- o Th e t hir d . r - r - -
ments. Variable names are in boldface.
gion ot - , , . : "- - - ' .
Note: If you did the prior exercisesyou should be able to do the as-
Again selc. r '\
signment here. To avoid redundancy, the instructions below include only
toisolat elhc: : - -
the key commands. Recall that the formar in the SDA web site mieht have
pe r cent egehl. : . . '
changed somewhat since these directions were written, ,o ,o--. of the
Cent ages t o 'r ; . - , 1-
specific directions below might be out of date. You nonethelessshould be r Recodevlll r
able to follow the logic of the instructions to obtain the tables you need to
oh r ainedin r hr : '
complete the assignments.
i s coded "1" ir r 111
since bhck: . , i: . . . .
Northeast ir.rthc - .--'
AssrcNueNT 1: Groncs \X/errecp AND THE Brecr Vorn rN mu 1968 o Th
"'_
e f innl
"
sr'_teni.'
' -
Pnssrolxrrar ErpcrroN-Rpsurrs FRoM THEANES regionblack,1. , , 1. , : . '
1) as "Select iori-. r: . .
The aim is to compare the individual-level versus regional-level associa- rvei.htino n t -rrr R -. .
tion between race and vote for George rwallace in the 1968 presidential 1, 027,t he sAm c: -. : ''' -
election. From the SDA Berkeley web site, choose the 1948-2000 cumu- black and thirdpartr i
lative ANES data set. There are five basic steps:
Quest ion 2. ( a) \ or c : : ': -
r First recode the ANES race variable v106 into a variable called si gni ficant .O n r he l. . r - -
black, coded 1 for blacks and 0 for whites (everyoneelse).Currently, compa r e t he aggr es. r r ., '
Rule ,
The Sever.rth 237
- r :r. \\'eb site http://sda Question 1. 'ff/ereblacks more likelv or less likely than others to vote for
George'S7allace? statisticallysignificant?Is it ver,vstrong?
Is the associ:rtior.r
:tr - --: instructions should
. . , -^ t - , -
- -: _ ! ( 'lllPl s r s .Uht,c a s s l g n - . The third step is to cross-tabulateblack (rorv variable)br.v112 (re-
gion of countr\'-Northeast, North Central, South, and \West).
, : ,- i rc able to do the as- Again select"No u"eight" and insert v4(1968) in the selecrionfilrer,
-.: : - -: :s b elow inc ludeonly
to isolate the vear 1968. This cross-tabulationis neededto obtain
- : -: : --r\ u'eb site might have
percentageblack in each of the four regions.We will use those per-
.i ::: .' : : if en, SO SOme Of the
centagesto createa ne$.variableregionblack.
.::: : -. lonethelessshould be o Recode v'1.12to create regionblack by inserting the percentagesyou
-:.-..:':he tablesyou needto
obtainedin the prior step.To illustrate:The Northeast region, which
i s coded " 1" i n v112, shoul d be recodedas " 6.8" i n regi onblack,
since blacks constituted 6.8 percent of the respondentsfrom the
Northeast in the 1968 ANES.
. : - ,. \ \'olr. Ix l:g't 1,968 . The final step is to cross-tabulatethirdpartyvote (row variable) with
' .1 . - . r\ES regionblack.Isolatethe 1968 electionby enteringv4(1968\,black(0-
1) as "Selectionfilter," and choose"No weight" from the pull-down
: -- . .. :: j i onal- lev elas s oc ia - weighting menu. Run the table. The number of casesshould be
r r - : - :h e 1 968 pr es ident ia l 1.027. the samenumber as in the individual-levelcross-tabulationof
-- :: ::r 1948-2000 cumu- black and thirdpartyvote.
vote for George Nilallace with the individual-level association you ob- . Do the cross-talrul-..:l
tained previously. and aggregate-lercl . ,.
(b) The context effect here is mitigating-expiain what that means, sub- kl.-L
ur4rN/
\ vv rrrr
'.-ith
t-^ra
v vL! t,
t ,-
-1 '
The second assignmentis to examine George \il/allaceand the black vote Tabl e 1 in Angr ist r r ndK. , . . -
again, this time using GSS data insteadof data from the ANES. Because make use of natr,rralrrtiY:
you can use the instructions for the ANES as your guide, the instructions ies, describe (in onc or" :',,
below will be abbreviated. used.
o The 1958 election variableis called pres68 in the GSS,coded 1 for
vote for Hubert Humphrey (the Democratic candidate), 2 for vote
for Richard Nixon (the Republican), 3 for vote for George rJfallace,
and 4 for vote for someoneelse. You should recode this variable, as
before, as 1 for vote for \il/allaceand 0 otherwise. You can call this
new variable wallacevote.You should find that 1 1.5 percent voted for
Vallace, and 88.5 percent voted for Humphrey, Nixon, or some
fourth-party candidate.
. Similarly, recode the GSS race variable, cailed race, ro creare a new
variable black that is coded 1 for black and 0 for others. (you can see
the codesfor race by entering race in the "selected" box and clicking
on "View.") Of the 46,510 respondentsin the 1972-2004 cumula-
tive GSS,you will find that 6,399, or 13.8 percenr,are black. (Note
that this percentageis likely to be somewhat different in your subse-
quent tables that include pres68, since the use of pres68 restricts the
analysisto the 1968 election.)
. The GSS region has nine categories:1-2=Northeast; 3-4=North
Central; 5-7=South; 8-9='West. Use the recodecommand to creare
a new variable region4_gss that parallels the four-category region
variablein the ANES.
r Cross-tabulate black (row variable) with region4_gss (the GSS re-
gion variable)to obtain the percentageblack in each region. Insert
year (1972-1973) as "Selecrion filter."
. Create regionblack from the percentagesyou obtained in the cross-
tabulation of black with region4_gss.
The SeventhRuie . 239
Question 3. (a) Compare the GSS results with the results you obtained
from the ANES. N7hat are the individual-level and aggregatelevel values
of R for the GSSdata? For the ANES data?
(b) Compare the valuesof R: Is the individual-levelversusaggregate-level
discrepancyroughly the same size in both data sets, or is the discrepancy
notably larger in one of the data sets?
Abbott, Andrew. 2003. Methocls ctf Discc,n,ery:Heuristics for the Social Scient:es.
New York: W.'$7. Norton.
Aghion, Philippe,Eve Caroli, and Cecilia Garcia-Peiralosa. 1999. "lnequality and
economic growth: The perspectiveof the new growth theories." .lournal of Eco
nomic Literdture 37 :7615-50.
Agresti, Alan, and Barbara Finla1,.1,997. Statistical Methods for tlte Social Sci-
ences.Third edition. Upper SaddleRiver, NJ: PrenticeHall.
Ainsworth-Darnell, James \7., and Douglas B. Downey. 1998. "Assessing rhe
oppositional culture explanation for racial/ethnic differencesin school perfor-
mance." American SociologicalReuiew 63:536-53.
Allison, Paul. 1978. "Measures of inequality." Anterican SociologicalReuiew
43 : 8 6 5 - 8 0 .
1990. "Change scoresas dependentvariablesin regressionanalysis."Pp.
93-174 in SctciologicalMethodology 1990, ed. Clifford C. Clogg. Orford:
Basil Blackwell.
2002. Missing Ddtd. SageUniversity Paper Serieson Quantitative Appli-
cationsin the SocialSciences,07-136. ThousandOaks, CA: Sage.
2005. Fixed Effects Regression Methods for Longitudinal Datd Using
SAS. Car.v,NC: SAS Press.
Alwin, Duane F. 1996. "Coresidence beliefs in American society-1973 to
1997." Journal of Marriage and tbe Family 58:393-403.
Alwin, Duane F., and Robert M. Hauser. 1975. "The decompositionof effectsin
path analysis." American Sociological Reuiew 40:37-47.
Amato, Paul R., and Joar.rG. Gilbreth. 1999. "Nonresident fathersand children's
well-being: A meta-analvsis."Journal of Marriage and the Family 61:557-73.
Amsden, Alice H. 2003. "Good-by dependencytheory, hello dependencytheory."
Studies in Comparati t'e I tttentational D euelopment 38 :32-38.
Angrist,JoshuaD., and \\'illiam N. Evans.1998. "Children and their parents'la-
bor supply: Evidencefrom exogenousvariation in family size." Americdn Eco-
nomic Reuiew 88:450-77.
Angrist, Joshua D., and Alan B. Krueger. 2001. "lnstrumental variablesand the
search for identification: Frorn supply and demand to natural erperiments."
Journal of Economic Perspectit,es 15:69-85.
Arrighi, Giovanni, Bevellv J. Silver, and Benjamin D. Brewer. 2003. "Industrial
convergence,globalization, and the persistenceof the North-South divide."
Studies in Compdratit'e Irtterrtational Deuelopment 38:3-31.
Bali. Valentina A.. and R. \.lichael Alvarez.2003. "Schoolsand educationalout-
comes: What causesthe 'race gap' in student test scores?"Social ScienceQuar-
terly 84:485-507.
Becker, S. O., and A. Ichino. 2002. "Estimation of averagetreatment effects
basedon propensityscores."STATA Journal 2:358-77.
242 . References
Downey, Douglas. 1995. "\fhen bigger is not berter: Family size, parental Firebaugh,Glenrr. rr: i.-
resources,and children's educational performance." American Sociological Re- 1 9 7 2 - 1 q 8 4 : R e l r ,': . -
uieru60:746-61.. 25r-72.
Downey, Douglas, Beckett Broh, and Paul von Hippel. 2004. "Are schools the F i r e b a u g h .C l e r ttt..t: -
great equalizer?Cognitive inequality during the surnmer months and the school A m e r i ca n 5 o r l ./, r :. ..
year." American Sociological Reuiew 69:61,3-35. F i r e h a u g h .Gl e n r r ..'.:- .
Downey, Douglas, Brian Powell, Lala Carr Steelman, and Shana Pribesh. 1999. Anrerican: orr .r lr,J -
"Much ado about siblings: Change models, sibship size, and intellectual devel- i r r go, f r l r e Ar r r r r i ..r .
opment (comment on Guo and VanWey)." American Sociological Reuiew F i s h e r ,R o r r a l d.\. I 'r -
6 4:1 .93 -98 . and Boyd.
Duncan, Greg J., and Graham Kalton. 1987. "lssuesof designand analysisof sur- F nr dham \ i o n i rl rrr -'
veys across time." International Statistical Reuieu 55:97-1,1,7. a^^i --
vvPU L 6
., i.L .l,- .: -
Duncan, Greg J., \7. Jean Yeung, Jeanne Brooks-Gunn, and Judith R. Smith. F r a n k, R o b e r t H . I ""-
1998. "How much doeschildhood poverty affect the life chancesof children?"
.lournal 107:I S-ll-r-
American Sociological Reuiew 63:406-23. 1 9 9 9 .Ittr t:':
Easterlin, Richard A. 1974. "Does economic growth improve the human lot?" In t o n : Pr i n ce to n['::.., - ,
Nations and Households in Economic Growth: Essaysin Honour of Moses F r e e d ma n ,D a l i d . I"" .
Abramouitz, ed. P. A. David and M. \7. Reder. New York: Academic Press. 353-58 (reloinder : '
1996. Growth Triumphant: The Twenty-lirst Century, in Historical Per- O r f o r d : R a si lBl ,r ci ..''.
spectiue.Ann Arbor: University of Michigan Press. Ie 9 - . "f- ,, .- .,
2001. "Income and happiness:Towards a unified theory." Economic C atr s a l rt f i / i ( , i -i j . i : t'.
. - . ... srze , p are nta l Firebaugh, Glenn, and Ker.rnethE. Davis. 1988. "Trends in ar.rtiblackprejr,rclice,
' ": \;tti616ti'o1 P"-
1972-1984: Region and cohort effects." American Journal of Sociology 94t
25 1 - 7 2 .
- ".\re schoolsrhe
- Firebaugh, Glenn, and Jack P. Gibbs. 1985. "User's guide to ratio variables."
:.:is a nd th e scho ol Ameri can Socio logicaI Reuiew 5 0 :7 73-22.
Firebaugh, Gienn, and Laura Tach. 2005. "Relative income ancl happiness:Are
: . - :.:r.rPribesl-r.1999. Americans on a hedonic treadmill?" Paper presentedat thc 100'h annual meet
. .: J lntcllectualdevel- ing of the American SociologicalAssociation,Pl.riladelphia (August).
. : : ok'tg ica l Reu iew
Fisher,Ronald A. 19-59.Smoking: The Cancer Controuersy.Edinburgh: Oliver
and Boyd.
.
-:: .rndanalysisof sur- Fordham, Signithia, and john U. Ogbu. 1986. "Black students'school success:
-- _ i - .
Coping with the 'burden of acting white."' IJrban Reuiew 78:176-206.
' -:' : Ju dith R. Smith. Frank, Robert H. 1997. "The frame of referenceas a public good." Ecotrcntic
' . , r: of children ?"
-,.nces Journal 107:1832-47.
1999. Luxurl' Feuer:Money and Happiness in an Age of F.xcess.Prince-
' .: : hc hu man lo t?" In
ton: PrincetonUniversityPress.
: ..: Hottour of Moses Freedman,David. 1991. "Statisticalmodels and shoe leather." Pp.291-313 and
-,.: ^\;ademicPress.
353-58 (rejoinder)in SctciologicalMethodobgy 19c)1,ed. l'eter V. Marsden.
::::',, in Historical Per- Oxford: Basil Blackrvell.
1997. "From assocrationfo causation via regression." Pp. 113-61 in
--: : heon'." Eco no mic Causality in Crisisi Statistical Methods and the Search for Causal Knowledge
in the Social Sciences,ed. Vaughn R. McKim and Stephen P. Turner. Sourh
- . . . , . . : .in lfe sp an san d a Bend,IN: Universin'of Notre Dame Press.
' - ...::rr[ze d cou ntrie s."
Fryer, Roland G., and StevenD. Levitt. 2004. "Understanding the black-white test
score gap in the first two vears of school." Reuiew of Economics and Statistics
"Does oppositional
- 86:447-64.
':. :-.i11.SociologicalRe-
Fryer, Roland G., Jt., and Paul Torelli. 2005. "An empirical analysisof 'acting
wl.rite."' Cambridge,-\IA: NBER rWorkingPaperno. \( 11,334 (May 1),
' ':...r.\\hshington, DC:
Galton, Sir Francis. I874. English Men of Science:Nahtre dnd Nurture, London:
Macmillan.
. .l relationshipsfrom Garrett, Geoffre,v.2004. "Globalization's missing middle." Foreign Affairs (No-
-- -l
vember/December): 84-9 6 .
r' : . f lec ts." Pp.2 43 -62 Ghosh, S., and C. R. Rao, eds. 1996. Handbook of Statistics13: Design and
--. tlrford: Basil Black- Analysis of Experimenf-s.Amsterdam: Elsevier Scrence.
Glaeser, Edward L., Bruce L. Sacerdote,and .fose A. Scl-reinkman.2003. "The
. ,ist m en t." American social multiplier." .lournal of the European Economic Association 1:345-53.
Glass,Gene. 1.976."Prinan', secondary,and meta-analysisof research."Educa-
-'-rter Serieson Quan- tional Researcb5:3-8.
. '-:r.l(Jaks, CA: Sage. 1977. "lntegrating lindings: The meta-analysisof research." Reuiew of
,::.t ttt I ity. Cambridge:
Researchin Edrtcatic.trt 5:ii l--9.
Glenn, Norval D. 2005. Cohort Anall,srs.Secondedition. SageUniversityPaperSeries
. . , , red-h r.po the sis
b ias on QuantitativeApplications ir.rtl'reSocialSciences,
5. ThousandOaks, CA: Sage.
,::.il \Iethods and Re- Goesling, Brian. 2001. "Changing income inequalitieswithin and berween na-
tions: New evidence." American Sociological Reuiew 66:745-61,.
- : \ineteenth Amend- Groves, Robert M., Floyd J. Fou'ler Jr., Mick P. Couper, James M. Lepkowski,
:. ' .\rterican lournal of Eleanor Singer, and Roger Tour:rngeau.2004. SttrueyMethodology. Hoboken,
NJ: John'Wiley and Sons.
216 . References
Cuo, Guang, and Leah K. Van\7ey. 1999. "Sibship size and intellectual develop- J e n c k s.C h r i 'tu p l r cr . ..: -
ment: Is the relationship causal?" American Sociological Reuieut 64:169-87. S c or eGa p . \\r i l ::- :: -
Haavelmo, Trygve. 1944. "The probability approach in econometrics."Econo- .fencks,Christophcr..r : '.
metrica 12 (Supplement): 1-1 15. t e s t:C l u r i r r g tl '. l ...
Halaby, Charles N. 2004. "Panel models in sociologicalresearch:Theory inro ber/October)J 1:-4:-'-:
practice." Annual Reuiew of Sociology30 507-44. Jenkins,StephenP. 1"-
Harding, David. 2003. "Counterfactual rnodels of neighborhood effects: The ef- E c o n o tn i c l r t,.1 tt.:..:'.'
fect of neighborhood poverty on dropping out and teenagepregnancy." Ameri- A r m o n k, N \ : \1 . i '
too\- .'l
cdn Journal of Sociology 109:676-719. -
Harris, David R., and JeremiahJosephSim. 2002. "Who is multiracial?Assessing U . K., l e - l - S". - .
the complexity of lived race." American Sockiogical Reuietu 67:61.4-26. K a l d or , N i ch ,,l .r - . .' '- -
Hedges, I-arry V., and I. Olkin. 1985. Statistical Methods for Meta-Analysls. San n o m i c Stu ( l i .sI ::' :-
Die go : Acad em iePr es s . K a l t o n,Gr r tl r .r r r .I''
Hedges,Larry V., and Am,v Nowell. 1998. "Black-white rest score convergcnee lications.
since 1965." Pp. 149-81 in The Black-Wbite Test ScoreGap, ed. Christopher K i n g . C a n . l ( ) "- . :
'Washington, '.
Jencks and Meredith Phillips. DC: Brookings Institution. t tr ..- - -
P r i n ce to r ['r
Ho, Daniel E., Kosuke Imai, Gary King, and ElizabethA. Stuart. 2007. "Match- K i n g . Gr r r . R ,,l ';- - , : .
ing as nonparametric preprocessingfor reducing model dependencein paramet- ScientificlrtfL,..::. . : ..
ric causal inference." PctliticalAnalysis 15:199-236. K i s h . Le sl i e . l .') ^i . \
Hobcraft, John, Jane Menken, and Samuel Preston. 1982. "Age, period, rnd 1 9 8 ,. - \i .;::.:
cohort effects in demography: A review." Population Index 48:4-43. K i r r g ,r w l . tr c ' \1 .
Holland, Paul W. 1986. "Statisticsand causalinference."lournal of the Amcrican United Stttts: -. ,. :
StatisticaI Association 81,:94 5-7 0. UniversitvPre...
Hoxby, Caroline M., and Gretchen \Weingarth. 2005. "Taking race out of the K l e p pn e r , Pa u l . i ': -
equation: School reassignmentand the structure of peer effects." Manuscript, t U r no U I. l , ,tt-.:.;.
Department of Economics, Harvard University. Retrieved March 2006. K o n o w , Ja n t.'. .: - .
Hsiao, Chcng. 2003. Analysis of Panel Dara. Secondedition. Cambridge:Cam- n o m i cu s Ir .r p r :.- i '
h ridg e Univer s it yPr es s . m o un t U n l \cr ..: : ,
Hummer, Robert A., Richard G. Rogers, Charles B. Nam, and Christopher G. Kozt-,1. Jonrth.lr'r,.
Ellison. 1999. "Religious involvemenr and U.S. adult mortality." Demographl, York: Cron'n.
36 :27 3-8 5. K u h , Ed r vi n . .r r '; -
ln.rai, Kosuke, and David A. van Dyk. 2004. "Causal inferences with general when the d.rr.r.i-. -
treatment regimes:Generalizingthe propensity score." Jourttal of the American L a y a rd .R i ch .r r .l .
Statistical Association 99 :854-66. gu1n.
Jaccard,James,and Robert Turrisi. 2003. Interaction Effects in Multiple Regres- Leamer, F.du'ar.l L. - - -
slon. Secondedition. SageUniversity Paper Serieson Quantitative Applicarions E c on o m i c R r '/;, i .
in the Social Sciences, 07-72.Thousand Oaks, CA: Sage. Lesthaeghe,Ron. .-.:- -
Jackman, Robert !(. 1980. "Keynesian governnent intervention and income ine- t h c or i e so ffe r - ' .- -
quality (Cornment on Stack, ASR, December 1978)." American Sociological L i e b er so n ,St:r n l re. - ' '
Reuiew 45:1.31.-37. a n d Th co r t. Bc:. -
Jacobs,Jerry A. 2005. "Multiple methodsin ASR." Footnotes(December).Wash- l 9 9 l . ":" ..
ington, DC: American Sociological Association. i n co m p l r l r tr q .:..- - .
Jencks, Christopher, r,vith Marshall Smith, Henri' Acland, Mary Jo Bane, David 307-20.
Cohen, Herbert Gintis, Barbara Heyns, and Stephan Michelson. 1972. Ine- L i k e r . Je ffr e r .5 L r r.\..- .-
quality: A Reassessmentof the Effect of Family and Schooling in America. o f ch a n g e :A. ", -
New York: BasicBooks. S c ie n ceR csc,r ,., : -
References . 247
, .-t t-,-,,,"t,.t-...1.,^_
Jencks, Christopher, and Meredith Phillips, eds. 1998a. The Black-White 'lest
:,.: i eu'64: 169- 87. 'Washington,
Score Cap. DC: Brookings Institurion,
:, :-r{)nle triCS."
Eco no - Jencks,Christopher,and Meredith Phillips. 1998b. "America's next achievemenr
test: Closing the black-white test score gap." The American Prospect (Scptem-
:\..rrch: Theory intcr ber/October)41.:44-5 3.
Jenkins,StephenP.1991. "The measurementof income ineqr-rality-" Pp. .3-.38in
' -:'rL)cleffects:The ef- Economic Inequdlity tmd Pouerty: International I'erspectiues,ed. Lars Osberg.
, *. ar.gnancy-" Anteri- Armonk, NY: M. E. Sharpe.
1995. "Accounting for inequality rrends: Decomposition analysesfor the
- : .:irir:rcial?Assessing U.K . , 1 9 77 - 8 6 . " E c o n o m i c a 6 2 : 2 9 - 6 3 .
.:-:r 6=:6 14 -26 . Kaldor, Nicholas. 1956. "Alternative theories of distribution." Reuiew of Eco-
' .\Iet,t-Analysrs.San nomic Studies23:83-100.
Kalton, Graham. 1,983.Introductiott to SurueySampllrug.BeverlyHills: SagePub-
i:ri SCOfe COnVergence licatiorrs.
'r-;p. ed. Christopher King, Gary. 1997. A Solutiott tct the Ecological Inference Problem. Princeron:
-. I1; tilu rion . Princeton Universitv Press.
i: -: -rr t. 10 07 . "Match- King, Gary, Robert O. Keohane,and SidneyYerba, 1994. Designing SocialInquiry;
,r,lrrr,-a
^ in norampt- Scientific Inference in Qualitatiue Researcb,Princeton: Princeton Universitv Press.
Kish, Leslie. 1965. Suruey Sampling. New York: John Wiley and Sons.
. i. ' . \ge , pe riod , an d 1987. Stotistical Desigtt for Research.New York: .|ohn \filey and Sons.
' : . . , 4S:-i-4 -3. Kitagawa, Evelyn M., and Philip M. Hauser. 1973. Differential Mortality in tbe
:, ':-tl oi the Anterican United States:A Study in SctcioeconomicEpidemiology. Cambridge: Harvard
University Press.
- . -. : lq ra ce o ut o f th e Kleppner, Paul. 1982. "'Were r,vomen to blame? Female suffrage and voter
- : rrccts," Ma nu script, turnout." lournal of lnterdisciplinary Historl, 12:621-43.
- \ l.rrch 20 06 . Konow, James, and Joseph Earle1'.2003. "The hedonic paradox: Is Homo Eco-
- - a rmhrirloe. Crm- nomicus happier?" Manuscript, Department of Economics, Lovola Mary
mount Universit_v.Retrieved December 2004.
-:. -rnd Christopher G, Kozol, Jonathan. 1991. Sat,ageInequalities: Children in America's Sc/.,t,r.,/s,New
::.:lrrr'." D emography York: Crown.
Kuh, Edwin, and Jol.rn R. Meyer. 1955. "Correlarion and regression esrimares
_ i - -n,- ,-< rr:ith oe..r" I when the data are ratios." Econometrica23:400-16.
:, ':.il ol the American Layard, Richard. 2005. Happiness..Lessonsfrom a New Science.Nerv York: Pen-
guin.
-.: :,: ,\htltiple Regres- Leamer, Edward E. 1983. "Let's take the cor-rour of economerrics."Amerrcan
..- :::.]liYeApplications Economic Reuiew 73:3 1-43.
Lesthaeghe,Ron, and Johan Surkyn. 1988. "Cultural dynamics and cconomic
: :.,)n .rnd income ine- theoriesof fertilit,vchange." Populationand DeuelopmentReuiew I4:I-45.
:,. :., i L',urSociological Lieberson, Stanley. 7985. Making It Count: The Improuement of Social Resedrch
and Theory. Berkeley: University of California Press.
;. -. L)rcember).Wash- 1991. "Small N's and big conclusions:An examination of the reasoning
in comparative studies based on a small number of cases." Social Forces 70:
. \. 1...:r. .lo Bane, David 307-20.
' .1 , rrlso n. 19 72 . Ine - Liker, Jeffrey, SueAugustyniak, and Greg Duncan. 1985. "Panel data and models
: -, . , litt.qin Ame rica . of change: A comparison of first difference and conventional models." Soclal
ScienceResearch14:80-101.
218 . Refelences
Little, R.J.A. 1992. "Regression with missing Xs: A review." Journal of tbe M u l l e r, Ed w a r d \. 1 - - r
ia I ion 8- : | 227- 37.
A tneri catt St a t i st ical Assoc SociologicalR.,rr, :, -
Lombard, Herbert L., and Carl R. Doering. 1928. "Cancer studiesin Massacl.ru- M y e r s . Sco tr l \[. l q - ^ ''
setts:2. Habits, characteristics,and environments of individuals rvith and with- p o r ta n ceo f fr m r l r J :': -
out cancer." New England lournal of Medicine 198:481-87. N a t i o n a l C e n r e r f.r r L - ...
Lott, John R., Jr. 2000. "Gore might lose a secondround: Media suppressedthe 2 0 0 /. R e p o r r 2 0 tl 1 - - .:
Bush vote." Philadelphia Inquirer, November 14, p. 23A. N o r p oth , H e l m u r . l q \-
Luhrr.ran,Reid. 2002. Race and Ethnicity in the United States: Our Differences I 9 8 0 s? " Pu h l i c u S':,::
and Our Roo/s. Fort Worth: Harcourt. N u x o ll , D a n i e l . l e e +. - ".
Mannheim, Karl. 1927 [1952]. "The problem of generations."Pp. 276-322 in in ornrrrth rcr". " 7..
Essayson the Sociologyof Knowledge. London: Rouledge and Kegan Paul. Oaxaca,Ronald. I 9--i. '1.I ,
McCloskey, Deirdre, and Stephen Zthak. 1996. "The standard error of rcgres- I n t er n a ti o n ,r lL c .':.,':.
sions." .lournal of Economic Literature 34:97-1,14. O g b u , Jo h n U . 2 0 ( l - 1 8. ..;, ,
Manski, Charles F.1993. "ldentification and endogenoussocial effects:The re- A c a d e m i cD i - sc,r {.r -,-, ;, '
flection problem." Reuiew of Economic Studies60:531-42. P a g e r .D e r a h . l r ) {) 1 . I
Massey, Douglas. 2004. "Segregationand stratification: A biosocial perspective." I n c ar ce r a ti o riro r r r r . I ' .
Du Bois Reu iew l: 7- 2- S. e r s ." Ph .D . d i \>( r r .l r :..
Massey, Douglas, and Nancy Denton. 1,993. American Apartheid: Segregation P a r r k er t,Fe l i r . l q - i . " I"-
and the Making of the Underclass.Cambridge: Harvard University Press. . ^ ., Vr
- . , - YCy
5Ur ^ r EV
^ ,,:l ^ ,..^ " ,... -
rUglllr.
Matthews, Robert. 2000. "Storks deliver babies (p=.008)." Teaching Stdttstlcs Pearl,Judea,2000, C,;r;-,-
22:36-38. Cambridge Unilcrsirr'f'-:,
.What '
Mayer, Susan. 1997. Money Can't Btry: The Effect of Parental lncome on P eerl,Raym ond.I ' 1. .
Children'sOutcomes.Cambridge:Harvard UniversityPress. M er edit h.199". .
P hi l l ips,
N4elchior,Arne, and Kjetil Telle. 2001. "Global income distribution 1965-98: k n o w a r td r l h l t r i c .: :
Convergenceand marginalisation."Forum for DeuelopmentStudies1:75-98. ican SociologicalRci :.,:. :
Merriam, Charles E., and Harold F. Gosnell. 1924. Non-uoting: Causesand R a u d e n b u sh Sr. e p h cr rI. - :
Methods of Control. Chicago: Universit.vof Chicago Press. l r g . L i n co l n u 'o .J. II , .-
Mickelson, Roslyn Arlin. 1990. "The attitude-achievement paradox among black Ravallion, Martin. I1.11-r :. \,
adolescents."Sociologl, of Education 63:44-61.. H o w w e l l d o n a r i o n .:.
Middleton, Russell.1976. "Regional differencesin prejudice." American Socio- Statistics.85:64-5--il.
logical Reuieu 41:94-117 . R e a r d o n .Se a nF.. .l n .t ( . . -
Milanovic, Branko. 2005. Vlorlds Apart: Measuring International and Global In-
B a t io n ." Pp . j j - ^ - .:
equdlity. Princeton: Princeton University Press. b e r g . Bo sto n :Bl r ,\r ,.,
Mirowsky, John, and CatherineE. Ross. 1999. "Economic hardship acrossrhe R e a r d o n .Se r n F...l u h :. .
life course."American Sociolctgical Reuieu 64:548-69. s l r u ctu r e o f sch o . . ..- .
Moffitt, Robert. 2005. "Remarks on the anal1,sisof causal relationships in popu- metropolitln-.1rc.l., -
lation research." Demography 42:9 l-108. R o b i n so n ,Vi l l i .r n r '. . - '
Morgan, StephenL. 2001. "Counterfactuals,causaleffect heterogeneitnand the trals." Americ,rrt 5, ::
Catholic schooleffect on learning." Socbktgy of Education 74:341-74. R o s e rr b a u m ,Pr u l R ,. .'*
N{organ, StephenL., and David J. Harding. 2006. "Matching estimatorsof causal p r o p e n si r vsco r e i :. .:'
effects: Prospects and pitfalls in theory and practice." Sociological Methods 41-55.
and Research35:3-60. |9 8 4 . "R c* - -
Morgan, Stephen 1.., and Christopher \finship. 2007. Counterfactuals and o n fh e p r o p e n \r t\ .- - -
Causal Inference: Methods and Principles for Social Research. Cambridge: 79:516-24.
Cambridge University Press. 19 8 5 . "( n r ::- -
Mouw, Ted, and Barbara Entwisle. 2006. "Residential segregationand interracial nl i no me t h n , l s rh : - -
r ""'b '-
friendship in schools." American Journal of Sociology 112:394-441. 39:33-38.
References . 249
- r'., " .lountaLof the Muller, Edward N. 199-5. "Economic determinantsof democrac1,."Amerrcan
SociologicalReuiew 60:805-2 1.
- . - -...; r c sin M as s ac h u - Myers, Scott M. 1996. "An interacrivemodel of religiosity inheritance:The im-
.: :.rL-;
s'ith andwith- portanceof farnily conrexr." American SociologicalReuiew 6l:8-i8 66.
National Center for Education Statistics. 2001, The Condition of Educatiott
' '' l :rr J s Llppr es sted
he 2001. Report 2001072. 'Washington,DC: NCES.
Norpoth, Helmut. 1987. "Under way and here to stay: Party realignment in the
'. .. :: Ottr Differences 1980s?" Public Opinion Q.uarterly 51:376-91.
Nuxoll, Daniel. 1994. "Differences in relative prices and internatiolal difterences
: .." I'p. 276-322 in in growth rates." The American Economic Reuiew 84:1423-36.
-: -.:rtlKegan Paul. oaxaca, Ronald. 1973, "Male-female wage differentials i' urban labor markets."
- "', .,.:clerror of regres- International Economic Reuiew1.4:693-709.
Ogbu, John U. 2003. Black American Students in an Afllttent Suburb: A Study of
- . :rrl effects:The re- Academic Disengagemenl.Mahway, NJ: Erlbaum.
__-. Pager, Devah. 2002. "The Mark of a Criminal Record: The Conseouencesof
- . oci a l p e r s p e c t i v e . " Incarceration for the Ernployment opportunities of Black a'd whire Job seek-
ers." Ph.D. dissertation, University of \(/isconsin, Maclison.
- . ; 't l: e t d : S e g r e g a t i o n Paukert, Felix. 1973. "Income distribution at differe't levels of development:A
. : :',er s i$' P r es s . survey of evidence." International Labour Reuiew 108:97-125.
- Tt:ching Statistics Pearl, Judea. 2000. Causality: Models, Reasoning, and Inferences. Cambridge:
Cambridge University Press.
' l'.irerttttlIncome on Pearl,Raymond. 1938. "Tobacco smoking and longevity." Science87:2L6-17.
Phillips, Meredith. 1999. "Sibship size and academic achievemenr:'What we now
- -::rf.utio n 1 96 5-9 8: know and u,'hatwe still need to knou. (comment on Guo and Van\7ey)." Amer-
i - ) f//.//r's Ii / J-Y 6. i can Sociologi cal Reuiew 64 : 188-92.
:-: ,tutg: Causesand Raudenbush, Stephen.2004. HLM6: Hierarchical Linear and Nonlinear Model-
lng. Lincolnu'ood, Il-: Scientific Software International.
' :.r:.1Lloxamong black Ravallion, Martin. 2003. "Measuring aggregatewelfare in developing counrnes:
How well do national accounts and surveys agree?" Reuiew of Economics and
- .:." .\nterican Socio- Statistics. 8 5 :645-52.
Reardon, SeanF., and Glenn Firebaugh.2002. "Measures of multi-group segre-
,.:.: ,,ttl .trtd Glob al In- gation." Pp. 33-67 in Sociological Methctdology,2002, ed. Ross M. Stolzen-
berg. Boston:Blackwell.
'" , rrrdship acrossthe Reardon, SeanF., John T. Yun, and Tamela McNulty Eitle. 2000. .,The changrng
structure of school segregation: Measurement and evidence of multiracial
: --:.rrionshipsin popu- metropolitan-area school segregarion,7989-199 5." D emogrdphy 37 :3 5 1-64.
Robinson, \Tilliam s. 1950. "Ecologicalcorrelationsand rhe behavior of individ-
- - _.-*..,,-,.^ir-. vr ^-,.1 +t -
,<rrrLrt uals." American Sociological Reuiew 15:351-57.
. .: ': -1 : 347- 74.
Rosenbaum, Paul R., and Donald B, Rubin. 1983. "The central role of the
: - r-ifimatorsof causal propensity score in observational studies for causal effects." Biometrika 76:-
' :rrtlctgical Methods 4i- 5 5 .
1984. "Reducing bias in obser:vationalstudies using subclassificatron
-
' ,rptterfactuals and on the propensity score." Journal of the American statistical Association
:.. -.i,r'cD. Cambridge: 79:516-24.
1985. "Construcring a control group using multivariate matched sam-
- :: :.r:ion :md interracial pling methods rhat incorporate the propensity score." American statistician
l: 19-1-'{'1
1. 39:33-38.
250 . References
Rosenthal,Robert. 1979. "The'file drawer problenr' and tolerancefor null re- S mi th,TomW. 1990."Tr r : -
srrlts." Psyc/:ologicalBulletin 86:638-41 . 1972- 1. ) 89C55. " t , \ r
1.994."Ptrametric measuresof effect size." Pp. 231-44 in The Hand- C )pini on R es e:rrc h C enr.:.
book of ResearchSt'nthesis,ed. Harris Cooper and Larry V. Hedges. New S n o w,Jo h n . ( 1 8 5 - t) 1 9 6 - i ,
York: RussellSageFoundation. New- York: Hafner.
Rosenzu'eig,\lark R., and Kenneth I. Wolpin. 2000. "Natr-rral'natural expen- Stack, Steven.1978. "Thc
-
ments' in econornics."Journal of Economic Literature 38:827-74. om,v on the degree of i:..
Rubirr, Donald B. 1973. "The use of matchedsamplesand regressionadjustmenr SociologicalReuieu,1) :''
to removebias in observationalstudies."Biometrics29:185-203. Stor-rffer,Samuel A., Eds.i:-
1979. "Using rnultivariatematched sampling and regressionadjustment and Robin M. '!(illirrr:.
to control bias in observational studies." Journal of the American Statisticdl Army Life. Volume 1. P: :
Association74:3 1.8-28. Sudman, Seymour, Norn.i.r:
1987. Mubiple Intputation for Nonresponse in Swrueys.New york: .|ohn about Answers: The .\;:
\7iley and Sons. ogy. Josse,v-Bass Sociel .'
Rubin, Donald B., and Neal Thomas. 1996. "Matching using estin'rared propen- B a ss.
sity scores:Relating theory to pracrice." Biometrics 52:249-64. Summers, Robert, :rnd -\j...:
2000. "Combining propensiry score matching with additional adjust- expanded set of interr-Li:
ments for prognostic covariates." Journal of the American Statistical Associa- Econonics 1 06:-12 -i. :.
tio n 9 5:5 73 -85. Tourengeau.Roger. L.rrr..
Russett,Bruce M., Ha1'wardR. Alker, Jr., Karl'W. Deutsch,and Harold D. Lass- of huuey Rcspon..c.C.i.:.
rvell. 1964. Vrorld Handbook of Political and Social Indicators. Ner,v Haven: Triplett, Jack E. 199-. ' \1.
Yale University Press. t h e r e se r r ch i ssu e r ." l .
Ryder,Norman B. 1965. "The cohort as a conceprin the study of socialchange." 9-42.
American SociologicalReuiew 30:843-61. T u c h , Ste r e r r .l q 8 - . "L .
Saia-i-Martin, Xavier. 2006. "The rvorld distribution of income: Falling ciologital Retiett il: i - -
poverty and. . . Convergence, period." Quarterly Journal of Econr,tmics T u f r e , Ed w a r d R . l 9 5 l . /
1.21.:351.-97. CT: GraphicsPress.
Schafer, Joe L. 1997. Analysis of lncomplete Multiuaridte Data. London: Chap- T y s o n. Ka r o l l n . \\ i l l :.r "'
man an d Ha ll. b l e ck th i n g ': U r .l e r '.r. -
hi " h " l:
schultz, T. Paul. 1998. "lnequality in the distriburion of personali'come in the " , ' h i " . , " ' . rc rr,
world: How it is changing and why." Journal of Population Ecctnomics lI: U g g e n . C h r i sto p h e r '.r ", '
307-44. C O l l se q tl e n C e t r\' l (
Schwartz, Barry, and Howard Schuman. 2005. "History, commemoration, and SuciulogicalRc'z/. i, ..- - -
belief: Abraham Lincoln in American memory, 1945-2001." Arnerican Socio- U n i r e d N cti o n '. l o o l . r ' .:
logical Reuiew 7 0 :l 8 3-203. U z z i . B r i a n . a n d R r .: . .'-
Seawright,Jason,and David Collier. 2004. "Glossary." Pp. 273-313 in Rethink- the corporarel.ru " :-
'$ 7 a i t e,
ing Social Inquiry: Diuerse Tools, Shared Standards, ed. Henry E. Brady and L i n d a J...r r r .1r .
David Collier. Lanham, MD: Row.man and Littlefield. l i oi nn i rr t l re I " t, --
m c n t R e u i e t(l q :l i t -'
Selvin,Hanan C., and \farren O. Hagstrom. 1963. "The empirical classificarron
'Waters,Mary
of formal groups." American Socblogical Reuiew 28:399-41I. C. 1q'':.
Sherman, Lawrence W. f992. Policing Domestic Violence: Experiments and Pluralism: Tltt t .
Dilemmas. New York: Free Press. We nd y F. Kr r ki r . \ - .
sherman, Lawrence'w., and Richard A. Berk. 1984. "The specificdeterrentef- nois Press.
fects of arrest for domestic assault." American Sociological Reuiew 49:26I-72. I9 9 9 . 3 7 ',- . ' ' ' '
Smith, Herbert L. 1997. "Matching with multiple conrrols ro estimate rrearment R e a l i ti cs.N o r ' \. :.
effects in observarionalstudies." Pp. 325-53 in Sociologicdl Methodology, ed. 1 0 0 1 . " fl .. .
Adrian E. Raftery. Boston: Blackwell. f r o m d e m o g r .r fr r ..' : - '
Referenccs . 2-51
::.lnce for null re- Smith, Tom \7. 1990. "Timely artifacts: A review of measurernentvariation in thc
1972-1989 GSS.' GSS Methodological Report no, 56. Chicago: National
++ h The Hand- Opinion ResearchCenter.
-:
' -' \'. Hed ge s.Ne w Snow,John. (18.5.5)1965. On the Mode of Commtmication of Cholerd.Reprint.
New York: Hafner.
. : . . : . - .,'n atu ral expe ri- Stack, Steven. 1978. "The effect of direct government involvement in the econ-
: - .l--- 4 . ()n1yon the degree of income inequality: A cross national study." Americatt
I -: -:. !iio n a dju strn en t SociologicalReuiew 43:8 80-8 8.
. . __r ,.). Stouffer,SamuelA., Edward A. Suchman,Leland C. DeVinney, Shirle,vA. Star,
- : --iiiiorl a diu stme nt and Robin M. \Villiams. 7949. Tbe Antericdrt Soldier: Adjusttnent During
-...;r'1i c,u1 Statistical Army Life. Volun're1. Princeton: Princeton University Prcss.
Sudman, Seyrnour,Norman M. Bradburn, and Norbert Schwarz. 1995, Thinking
'-.. \e'n, York: John about Answers: The Applicatbn of Cognitiue Processesto Suruey Methodol-
ogy. Jossev-Bass Social and Behavioral ScienceSeries.San Francisco:Jossey-
-
_
-.rinrrrrerl nrnnen- Ba s s .
r_.-r.-''
Surnmers,Robert, and Alan Heston. 1991. "The Penn\WorldTable (Mark.5): An
- - --b+.
:r .r.lditional adjust- expanded set of international comparisons, I950-1988." Qtnrterly Journal of
' : ;t-tt istical Associr.- E conomics 1.06:327-68.
Tourangeau, Roger, Lance J. Rips, and Kenneth Rasinski. 2000. The Psychology
. ,:rrlFiarold D. Lass- of SttrueyResponse.Cambridge: Cambridge University Press.
' ..: :.;:|)t s, Neu' Haven: Triplett, Jack E. 1997. "Nleasuring consumptior.r:The posr.-1973slowdown ar.rd
the researchissues." Lederal ReserueBdnk of St. Louis Reuiew (May{une):
9-42.
. :'. ,rf socialchange."
Tuch, Steven.1987. "Urbanism, region, and tolerancerevisited." Americdn So-
" ,jin co me:Fallin g ciological Reui ew 52:.504-10.
:,' ,:tl o l Econ omics Tufte, Edward R. 1983. The Visual Display of Q,uantitatiueInformation Cheshire,
CT: GraphicsPress.
London: Chap- Tysor.r,Karolyn, !(/illiam Darity Jr., and Domini R. Castellino.2005. "lt's not 'a
--r,;:.1.
black thing': Understanding the br-rrder.r of acting white and other dilemmas of
:::i()r1ill inCClme
in the high achievement." Am er i cdn Socio I ctgi cal Reuiew 7 0 :5 82-60 6.
: -;:: ,11LcotrcmiCS11l. Uggen, Christopher, and Jeff N4anza.2002. "Democraric contraction? Political
conseqLrences of felon disenfranchisementin the Llnited States." American
. , ,iltlremoration, and So ciological Reuiew 67 :77 7 -803.
i." .\tnerican Socio- United Nations. 1993. Systetnof National Accounts.Brussels:United Nations.
Uzzi, Brian, and Ryon l-ancaster.2004. "Embeddednessand price formation in
- _ ) - )l Jl n l ae tn l nR -
the corporate law market." American Sociological Reuiew 69:31.9-44.
:. H;urv E. Brady and Waite, Linda J., and Evelyn L. Lehrer. 2003. "The benefitsfrom marriage and re-
ligion in the United States:A comparative analysis." Population and Deuelop-
r ::::irical classification ment Reuiew 29 255-75.
'\X/aters,
_- I I I Mary C. 1998. "Multiple ethnic identity choices," Pp,28-46 in Beyond
': '. : E x p e r i m e n t s and Plttralism: The Conception of Groups and Group Identities in America, ed.
Wendy F. Katkin, Ned Landsman, and Andrea Tyree. Urbana: Universitv of Illi-
. : >lec i f i c d e t e r r e n t e f - nois Press.
'Vlest
1999. Black ldentities: Indian Immigrant Dreams and Amerrcan
-. ; . Rc t' i e u t 4 9 : . 2 6 7 - 7 2 .
i :, cslrnlate treatment Realities. Nerv York: Russell SageFoundation.
, : : -;i \Iethodology, ed. 2002. "The social construction of race and ethnicity: Some examples
from demography." Pp. 25-49 in American Diuersity: A Demographic Challenge
252 . References
for the Twenty-first Century, ed. Nancy A. Denton and Stewart E. Tolnay. Index
Albany: State University of New York Press.
2004. "Race, ethnicity and immigration in the Unired States."Pp.20-38
in Social Inequalities in Comparatiue Perspectiue,ed. Fiona Devine and Mary
C.'Waters.Malden, MA: Blackwell.
'Weeden,Kim, Kim Young-Mi,
Matthew Di Carlo, and David B. Grusky. 2007.
"Social class and earnings inequality." American Behauioral Scientist 50:
702-36. Abbon, Andrew, 4--i
'West, abortion: attitudes tos'.tr.l. : -:
Jerry, Kristin Denton, and Lizabeth Reaney. 2007. The Kindergarten Year:
Findings from the Early Childhood Longitudinal Study, Kindergarten Class of pol i cy i n R omani :r.i \-tr
1998-99 (NCES 2001-03). U.S. Deparrmentof Education,Narional Centerfor add-r-regrerrorapfr,' .r-:.. . -
Education Statistics.I(/ashington, DC: U.S. Governmenr Printing Office. addi rrv edec orrpo' .rl ' i l rr,.- -
advocac,vresearch,1
\fhite, Kevin M. 2002. "Longevity advances in high-income countries, 195-5-
age effe c ts ,212
96." Population and Deuelopment Reuiew 28:59-76.
'Winship, age pol a ri z ati on,18-i
Christopher, and StephenL. Morgan. 1999. "The esrimarion of causal
eggregr tedi nJ rr i .l rr.rl-:r.:: -.
effects from observational data." Annual Reuiew of Sociology 25:659-706. aggregcre-l ev el rn.rl r .i .. -
Zajonc, Robert B.7975. "Birth order and ir.rtelligence: Dumber b1' the dozen.', eggrege ri onbi as .I I l . r.. .. .
Psychology Todatt 8 :37-43. fal lacy
Ziliak, Stephen,and Deirdre McCloskey. 2004. "5r2e matters: The standard error .d o r p o .fi r Tp nr,rnerrie< " .
of regressionsin the American Economic Reuiew." Economic lournal V,ldtch A l l i son, P aul , 6-5-66. l 6---
1:3 31 -58 . Amato, Paul, 107-9
American Natior.ralElr;:: : .
111
A msden, A l i c e, l .{ --6
analytic porver. 2..3.lt'
Arrighi, Ciovirnni. --1--:
assuci a ti on:l rrd .-ru..L.
spuri ous ,14, 1l l . l r:
avai l ab l ec as eanah' s i s .::
balance of treatn.)cnr-r:
147. Seea/s o prop.:.
B erkel eyS D A { S urrc r
Anal ,v s i s ),3l
B i as.S eefav ored hr:,
vari a bl c sbi as rs el e;:
bi rth cohort, 181. \. .
black-white
tests.()i-
Brady,Henr,v,7---r
Bush,George\i',. --
cancer:and l l uori dl i r,
127n ; and,s mok i n:
box 5 .1 )
case'"visedeletion. {.i
causala nal y s i s ,l ogrc ,
causa l i ty
causal chains. 1.1-I -1
causald i rec ti on. 1-{
causal -proc esobs s .-r,..
Sren'art E. Tolnay. Index
Sriites."Pp. 20-38
r Devine and Marv
. :.:J B. Crusky. 2 00 7.
t '.;'.t,tnl Scientist 50:
Abbott, Andrerv, 4--5 cansality, l3-14, 120; and correlation,
.. . Kindergarten Year: abortiol.r:attituclestoward, 3 1-33, 180; 12 1-29 ; counterfactual approach to.
' :l:tt,lergarten Class of p o li cy i n R ornani a,38-40 122, 127-29; deterministic versus
add-a-regressorapproach, 133-34 probabi l i sti c,25n, 120; effect of a c aus e
: . \ational Centerfor
addrtive decompos:rbilit.v,220 versuscausesof an effect, 39, 130-31;
. ' : ; nring Office .
advocacy research, I lau's of, 121
-- ::. co un tries,19 55 -
a g e e ffects,2l 2 Central Limit Theorem, 105
a g e p ol ari zati on, 1B 5 change: aggregatedindividual, 197; cross-
': ::rim:rtion of causal
aggregatedindividual change, 197 cohort, 197; gross and net individual,
, qr'2 -5:6 .59 -70 6. a g g r egate-l evel anal vsi s,7 1, 17),, 231 173-74 intracohort, 197. Sepalso soci:rl
.::rl.erbr.'the dozen." aggregation bias, 23 l. Seealso ecological change
fallacy changing-effectmodel, 176; versus conver
-.:-:: The stanclarderror aggregativeproperties, 228 gencemoclcl, 191-9-5; antl rnultilevel
Allison, Pavl, 6 5-66, | 69-70 mcthods, 182-85; w i th pancl data,
'' ,itic lournal Vlatch
Amato, Paul, 107-9 1,76-7 9; with repeatedcross-sectron
American National Election Stud.v(ANES), data,179-82
111 chol era,l 58-59
Amsden, Ahce,74-76 cl assi cexperi mentaldesi gn,129
analytic porver, 23, 26 Clogg, Clifford, 92
Arrighi, Giovanni, 74-76 cohort effects, 212-14
association:and causality, 12 1-29 ; cohort replacement, 195-96; estimation of
spurior.rs,14, 722, 143 effecrs,| 9--20 l ; \rudent e\erc i s eson,
available case anal,vsis,66 203-s
cohort succession.Saecohort replacement
b a la nceof treatmentand control groups, C ol enun R eport,228
747. Seealso propensity scores collected sample,93-94; and unit nonre-
Berkeley SDA (Survey Documentation and sponse error, 94-9.5
An al vsi s),31 common metric, 42; converting variables
Bias. Sea favored-hypothesisbias; omitted- to,44-46
variables bias; selection bias comparing like with like. Seematching
birth ccrhort, 182. Seealso cohort effects complete caseanalysis,6-5
black-n'hite test score gap, 232-3.5 composite measures,72-74
Brady, Henry, 77-78 confidence intervals, 109-10
Bu sh , GeorgeW.,77-78 confoundi ngvari abl es,130, 132, 140-45,
159-62 (andi z tabl e 5.1)
cancer: and fluoridation of u'ater, C onl ey,D al ton, 140, 156
l2 7n; and smoki ng, 122-29 (and i n consumption surveys, 102-3
b o r.5.1) conrerr effecr'. 22o-2- : exrcerberi ng,
casewisedeletion, 65 versurmi ri gati ng.2291as nui sr nc e.
causal analysis,logic of, 127-29. Seealso 230-3r
ca usal i ty contextual ization, 27 -28
ca u salchai ns, 13-14 control variables, 14-15. Seealso
causal direction, 1,1 residualization
causal-processobservations,77-78 convergence,1 85; sign test for, 186-87
254 r Index
:I
"T:::j"li:119
:;i:i,i:::1,1ffi
:'J
Leamer, Edlvard, 133 r,ariables
. :- l-ieberson, Stanley, I .5, 133 overs:rmplingof minority groups, 22,
--
life sp:rns,clifferencesin. Sec inequalin' 2"4-25,40
orprincipre
ofsa'rpri'g'
18
. : ixllS::lnl'J!l;1"',0t
- . I 1. 1-2-73 L ite r ary,D i gest,21 P ager,D evah,49, 56
'. : : .. . q - 1 0 { l z b o x 1 . 1 ) L o n d on \vafercomp:.l ni es
i n the l 8-i 0s, pai nvi secl el eti on.66
- :. :or estimating, 1 5 8-59 panel data, 172-73,176-79; vers us
repeatedcross-sectionclata, 173-75
market share principle, 53 path analysis, 1-5
'l :' r L . o r 2 . 2 ) m a r r iage:and hi rppi ness,8-9.1'+,16,61, path di agram, 15, 153
. : r-.19.
16 1 166-69; and ser, 34 peer effects: on student achievement,
rnatching, 130; comparison of stratcqies 214-16 on happiness,-5.1-55
, i' t - { . i 1 . 1 6 6 - 6 9 ; a n d fo r ,159-62; exact, 1,16-47;usi ng fi red- peri od effects,212
-
- :r.'.:rjrtse,8-9,16,
: e ffectsmodel s, 134-461r:si ngpropensi t,v pol i ti cal pol l s, 19-21
' : J{rr.ises on, 58-62; scores,147-51; usi ng natural l l ' occurri ng popul ati oncomposi ti on,227n; effec rsof
, : :i : r ! h o u s e w r v e s , r a ndom assi gnment,1.52-59;usi ng changesi n, 181. S eea/so cohort
standard regression, I 3 1-32 replacement
256 . Index
povert,v.Seeglobal poverty reverseuniverse, 129 stati sti cals i gni fi c l nc r ' ..:.. .
p r e s i d e n t i ae
l l e ctio n :o f 1 9 3 6 , 2 1 , o l 7 9 6 8 , r e vie w arti cl es,6n significance,36. { 1
1 9 l .o f 2 0 0 0 , 7 7 ,7 9 Ro m a ni a, 37-39 Stouffer, Samr-iel.ll -
prior research,7; challengesto, 8-l l; srrati 6c J random ..r' r : .. - -
clarifications of, 1 1; ertensions of, sanrple, 1B; advantageof large samples, substan ti r.e profi l i ns . ;"-: -
11-r7 2 3 , 26, 129. represenfati veness
of, 21 ; synthes i z i ngs tudi es1r . -
problenr
of motion,
6,39-40 sirnple random, 18; stratified random,
p r o p e n s i t ys c o r es,1 4 7 - 5 1 , t6 0 - 6 1 1 ))-)i 40 target population. f -i---
preprocessingwith, 1.51--52 sa m p l esi ze:pri nci pl esof, 19-21; i n test score gap, 2.ll--r l
p s e u d o c o n t r o l s,l5 q u " ri i t;rri ve
\rudi e.. 2J-2{- Thei l i n der: r' i i nc ".., . --
sanrpling: principles of, 18-23. Seealsct segregation,2l.j
qualitative research,23-26, 7 9-82 sa mpl e rheory.irnportrl t.- ' . ' - ' -
sir n r p l i ngerror,94-97,105, I 10, 129 total survev error n--::.r--:- .
race/ethnicity, 80; and ernplovmenr, 49, sampling frame, 115. Seealso {rame triangr-rlation,6.{
5 6 - 5 7 ; s e l f - i d e n tifica tio n o f, 8 0 - 8 1 ; a n d population truncJted v .l rttnc a. , -
v o t e f o r G e o r g eWa lla ce ,2 3 0 - 3 1 , school reassignmentsin North Carolina, Tuite, Edu'ard, ,i I
236-39; and vote in 2000 presidential 274-16
election, 79 school segregation,components o1,225 lin uncertai ntyi n s oal f,.:::::-,
random assignment,128-30; eramples ta b le 7.1 ) data c ol l ec ti on.el --. -
o f n a t u r a l l r .o ccu r r in g , 1 5 2 - 5 9 , SDA (Survev Documentation and Analvsis), ment error, 100- I
2 0 9 - 1 8 ; a n d t he p o we r o f p r o b a b ilit.v, 31
t52 secondar,yanalysis,22
random san.rple.See sarnple segregation,components of, 225 (lr table
ratio variables.Seecomposite measures 7 .1 )
reality checks, 64; external, 76-82; selectedsanrple,93-94; ilnd sampling error,
internal, 65-76 9 5-97
R e a r d o n ,S e a n , 2 2 3 - 2 5 selection bias, 14, 128-29, 226
r e g r e s s i o nm e t h od , 1 3 1 - 3 2 , 1 6 1 ; cr itiq u e selection effect. Seaselection bias
o f , I t 2 - 3 4 : o hse ' sio nr vir h . 2 0 9 : p :r th se x: a rti rrrdc'to$ard,62.: hurrrrrrgcnJ
model of, 160 (}r figure .5.-5).Seea/so frequency of, 34-35; marital status and
residualization frequency of, 34
r e i i g i o s i t y ,1 5 sib lin g rnodel s,140-.+.5,161
repeatedcross-sectiondata, 172-7 3, smoking and cancer, 122-29 land inbox
179-80; r'ersuspanel data,173-75 5 .1 )
replication, 90; internal and external, Sn o r v,.fohn,1.58-59
106-7; versusnultiple--methodsresearch, social change, 172; deconrposition of,
8l 197-201- key questior.rsabout,
research:as an ongoing conversation!4i t7 5 -7 6
establisl'redfindings of, 7; ethics of, 3; social contert effects. Saacontext effects
o b j e c t i v e so 1 , 7 2 0 - 2 7 social multiplier effects, 214
researchdesign, 129, 1-52,23.5; as ke.yto "so-rvh:rt" objection to research,6
b e t t e r s o c i a lr e se a r ch ,9 1 ,1 6 4 ,2 0 7 ; n e e d spending on Social Security and education,
for morc imagination in, 207, 235; and attitudes about, 187-91 (and in tables
o m i t t e d - v a r i a b le b
s ia s, 1 3 3 - 3 4 6 .1 - 6.1)
researchquestions, 2; interest criterion, 4- spurious association, 14, 1,22, 113
I 8; researchabilitycriterion, 2-4 stability assumption (first-difference
residualization, 74n2; as matching m o del s),136
m e t h o d u s e d bv r e g r e ssio n7, 4 ,1 3 3 , standardization of variables,44-46
150 statistical estimation, 207; as servant to
responserates, 95-97 theorv and researchdesign, 207-8,
reversecausation, 16 235
Indcx . 2) /
statisticalsignificance
versussubstantive unit nonresporrse,
94-98 land in l:tox4.1),
signi fi cance,36,41 1 17; versusi tem nonresponse 94
,
Stouffer, Samuel, 227 tttrmearured!crrse5.5ce confound i ns
stratified randorn sample, 22, 40 variables
-: : i l r g e s a r n p l e s , substantiveprofiling, 46-49
- -: : : : - ' l t l r e n e SOSf , 2 l ; synthesizingstudies, 16-1 7 variance on explanatorv variables,37-39;
' . . . : . r t r f i e dr a n d o r n , qualitl,-quantity tradeoff, 138, 177 ;
target population, 93-94 stratified sampling to increasesize of,
. .. lq-21;in tcst score gap, 232-35 22-25,40-41
. t r_16 T h e il i ndex: of i nequal i ty,221; of voting rates: for George Wallace, bv race,
segregation,223 230-31, 236-39; surveyesti m aresof,
theory',importance of, 54-.56, 235 1 15- I 8; for rvomen versus rlen, 21,2-14
.129
ll.;. 110, total survel' error paradign.r,91
t i. i . l / - . oi r a m e triangulation, 64 \Wallace,George C., 19-20, 2"30-31,,
trlrncated variance, 137, 177 236-39
T u fte, E drvard,5l Waters, N{ary, 80-81
'Weedcn,
Kim,22t)
r : : L r ( ) n e n ros f , 2 2 5 ( i n uncertainty in social research,91; due to rX/right, Orville and Wilbur, 28
- data collection, 92-99; due to rneasure
: : : : . r t i o na n d A n a l y s i s) , m e nt error, 100-101 Zeno's paradoxes,28
- l- , . n f , 2 2 5 ( l n
table
: -l: ; n d s a r n p l i n ge r r o r ,
.-._r.) tlA
_
- : . Jri o n bi as
- - : . r l : h u n ti ng an d
..- I i: rr.irital st:ttus and
- :i. 161
,. -. l l-19 l a rd i n bo x
--
: : : o t l l P 6 1 5 i 1 i 961f ,
---
-. 1 , ,r ' r sr r b o u t ,
-: . . \ i . ' c o n f e x t e f f e c t s
- -: _ 111
- : , r ase i trch ,5
- r , : t f i r\ :rnd ed u ca ti on ,
9 7 t ,tntl iru tables
. . . j. ltt. 1 43
::. r-r1
ifierence