Académique Documents
Professionnel Documents
Culture Documents
Harassment Speaking
by Q&A: Initial about Privacy
Thoughts on and Publicity
Formspring.me [http://www.z
[http://www.zephoria.org/
about-
by-qa- privacy-
initial- and-
thoughts- publicity.html]
on-
formspring-
me.html]
Big Data:
Opportunities
for
Computation
al and Social
Sciences
Scott Golder recently wrote
blog post at Cloudera entitled
Scaling Social Science with
Hadoop
[http://www.cloudera.c
om/blog/2010/04/scalin
g-social-science-with-
hadoop/] where he
accounts for how social
scientists are using large
scale computation. He
begins with a delightful quote
from George Homans: The
methods of social science are
dear in time and money and
getting dearer every day. He
then turns to talk about the
trajectory of social science:
When
Homans
one of my
favorite
20th
century
social
scientists
http://www.zephoria.org/thoughts/archives/2010/04/17/big-data-opportunities-for-computational-and-social-sciences.html 1/22
15/11/2017 danah boyd | apophenia Big Data: Opportunities for Computational and Social Sciences
wrote
the above,
one of the
reasons
the data
needed to
do social
science
was
expensive
was
because
collecting
it didnt
scale very
well. If
conducting
an
interview
or lab
experimen
t takes an
hour, two
interviews
or
experimen
ts takes
two hours.
The
amount of
data you
can collect
this way
grows
linearly
with the
number of
graduate
students
you can
send into
the field
(or with
the
number of
hours you
can make
them
work!).
But as our
collective
http://www.zephoria.org/thoughts/archives/2010/04/17/big-data-opportunities-for-computational-and-social-sciences.html 2/22
15/11/2017 danah boyd | apophenia Big Data: Opportunities for Computational and Social Sciences
body of
knowledge
has
accumulat
ed, and
the low-
hanging
fruit
questions
have been
answered,
the
complexity
of our
questions
is growing
faster than
our
practical
capacity to
answer
them.
Things are
about to
change.
Though
social
scientists
care what
people
think its
also
important
to observe
what
people , do
http://www.zephoria.org/thoughts/archives/2010/04/17/big-data-opportunities-for-computational-and-social-sciences.html 3/22
15/11/2017 danah boyd | apophenia Big Data: Opportunities for Computational and Social Sciences
especially
if what
they think
they do
turns out
to be
different
from what
they
actually
do.
Increasingly, computational
scientists are having a field
day with Big Data. This is
exemplified by the web
science community and
highly visible in conferences
like CHI and WWW and
ICWSM and many other
communities in which I am a
peripheral member. In these
communities, Ive noticed
something that I find
increasingly worrisome
Many computational scientists
believe that because they
http://www.zephoria.org/thoughts/archives/2010/04/17/big-data-opportunities-for-computational-and-social-sciences.html 4/22
15/11/2017 danah boyd | apophenia Big Data: Opportunities for Computational and Social Sciences
http://www.zephoria.org/thoughts/archives/2010/04/17/big-data-opportunities-for-computational-and-social-sciences.html 5/22
15/11/2017 danah boyd | apophenia Big Data: Opportunities for Computational and Social Sciences
http://www.zephoria.org/thoughts/archives/2010/04/17/big-data-opportunities-for-computational-and-social-sciences.html 6/22
15/11/2017 danah boyd | apophenia Big Data: Opportunities for Computational and Social Sciences
challenges in doing
interdisciplinary work is
being about to account for
these differences, to know
what approach works best for
what question, to know what
theories speak to what data
and can be used in which
ways.
http://www.zephoria.org/thoughts/archives/2010/04/17/big-data-opportunities-for-computational-and-social-sciences.html 7/22
15/11/2017 danah boyd | apophenia Big Data: Opportunities for Computational and Social Sciences
http://www.zephoria.org/thoughts/archives/2010/04/17/big-data-opportunities-for-computational-and-social-sciences.html 8/22
15/11/2017 danah boyd | apophenia Big Data: Opportunities for Computational and Social Sciences
A%/category/academia]
=Bi 20O %2 ttp |
Fw
g%
2FCommentsppo are
ww. Fw %3
closed
20D zep
%2 rtun ww. A%
ata hori
Fw ities zep 2F
15
%3 comments
ww. %2 hori %2to
a.or
A% Data:
zep
Big 0for a.or Fw
g%
20O 2Fth
hori %2 g% ww.
Opportunities
ppo oug
a.or 0Co 2Fth zep
for
rtun hts
g% mpu oug hori
Computational
ities %2
2Fth tatio hts a.or
and Farc
%2
oug nal
Social %2 g%
0for hive
hts %2 Farc 2Fth
Sciences
%2 s%2 0an hive oug
0Co F20
Farc d% s%2 hts
mpu 10
hive 20S F20 %2
Brett Bobley
tatio %2
s%2 ocial 10 Farc
[http://ww
nal F04
F20 %2 %2 hive
w.diggingi
%2 %2
10 0Sci F04 s%2
ntodata.or
0an
%2 g] F17 enc %2 F20
d% %2
F04April es%2010
17th, F17at 10 10:30
%2 pm
20S Fbig 20- %2 %2
[http://www.zephori
ocial -%2 Fbig F04
F17
%2 a.org/thoughts/archi
0htt -
data %2
ves/2010/04/17/big-
0Sci -p% data F17
Fbig
data-opportunities-
-enc 3A -
opp %2
for-computational-
es& ortu
data %2 opp Fbig
and-social-
-bod F% ortu -
nitie
sciences.html#comm
y=h s-
opp 2Fw nitie data
ent-36853]
ttp for-
ortu ww. s- -
Thanks
%3
nitie zep
comfor this
for-danah.
oppI
think you make some
A% puta
s- hori com ortu
excellent points here. I
2F
for- a.or liked
tion
particularly putayournitie
%2
com
commentg% If tion
al- s-
were going
Fw
puta 2Fthattack
and-
to actually al- Big for-
Data,
ww. soci
tion the best
oug and- com solution
would be to combine
zep al-
al- hts soci puta
forces between social
hori
and- %2 al- tion
scie
scientists and
a.or
soci Farc scie
nces
computational al-
scientists.
g%
al-
I think.ht
hive nces and-
that bringing
2Fth
scie s%2
ml&
together .ht soci
different
disciplines
F20 isml&
oug title
nces really al-
key
here. I help run a grant
hts
.ht 10 t=Bi scie
=Bi
program called the
%2
ml&
Digging%2
g% into g%
Data nces
Farc
t=Bi F04 This
20D
Challenge. 20Dprogram
.ht
hive ata
g%
brings %2 ata ml&
together
interdisciplinary
s%2
20D F17 %3teams
%3 title
from the humanities,
F20 A%
ata %2 A% =Bi
social sciences, and
10
%3 Fbig 20O g%
20O
computer/information
%2
A% - to tackle
ppo
sciences ppo 20D
F04
20O datausing
rtun
questions rtunBigataData
approaches.
%2 ities
ppo - ities %3is to
My hope
http://www.zephoria.org/thoughts/archives/2010/04/17/big-data-opportunities-for-computational-and-social-sciences.html 11/22
15/11/2017 danah boyd | apophenia Big Data: Opportunities for Computational and Social Sciences
inspire%2
rtun
F17 projects
opp %2thatA% are
genuinely collaborative
%2 0for
ities ortu 0for 20O
across the disciplines
Fbig
%2 nitie %2 ppo
%2
that is, that raise really
- interesting
0for s- research
0Co 0Co rtun
data
%2 for- in mpu
mpu
questions ities
multiple
-
0Co com tatio %2
tatio
domains. As you
suggested,
opp
mpu puta we
nal nal need
0forto
develop the intellectual
ortu
tatio tion %2 %2
%2
apparatus to actually
nitie
nal al- Big0an
0an
analyze Data. 0Co
Im
s-hopingd%
%2 and- d% ofmpu
that some the
for-
0an soci
20S
projects were20S tatio
funding will
help with
com ocial
d% that
al- ocial nal
development. (see:
puta %2
20S scie %2 %2
http://www.diggingintoda
tion
ocial
ta.org0Sci
nces 0Sci 0an
al-
%2 .ht enc d%
enc
[http://www.diggingi
and-
0Sci ml es] ). Brett
es]
ntodata.org] 20S
soci
enc %2
(@brettbobley) ocial
al- 0 ]
es& %2
scie
s=S 0Sci
nces
cott enc
anonymous
.ht
%2 April 18th,
es&
2010 at 2:00
ml&
0Gol sour
am
ui=
der ce=
[http://www.zephori
2&tf
%2 dan
a.org/thoughts/archi
=1&
0rec ah+
ves/2010/04/17/big-
shv
entl boy
data-opportunities-
a=1
y% d+
for-computational-
]
20w %7
and-social-
rote C+a
sciences.html#comm
%2 pop
ent-36912]
0blo
As a CS grad student,
heni
g%what do you think Ia+
20p
should do to not make mak
ost
such mistakes ? Doing
%2courses in the dept+co of
Sociology ? other than
0at nne
that ?
%2 ctio
0Clo ns+
uder Kevin whe
K
a% re+
April 18th,
20e 2010 atnon2:27
ntitl am e+p
ed[http://www.zephori revi
%2a.org/thoughts/archi ousl
0%ves/2010/04/17/big- y+e
data-opportunities-
22S xist
for-computational-
calin ed&
g%and-social- sum
sciences.html#comm
20S mar
ent-36914]
ocial y=S
%2But mucking with Big
cott
Data alone is not
http://www.zephoria.org/thoughts/archives/2010/04/17/big-data-opportunities-for-computational-and-social-sciences.html 12/22
15/11/2017 danah boyd | apophenia Big Data: Opportunities for Computational and Social Sciences
jkd
[http://jac
ob.kramer-
duffield.co
m]
April 19th, 2010 at 12:42
pm
[http://www.zephori
a.org/thoughts/archi
ves/2010/04/17/big-
data-opportunities-
for-computational-
and-social-
sciences.html#comm
ent-37468]
Seconded.
http://www.zephoria.org/thoughts/archives/2010/04/17/big-data-opportunities-for-computational-and-social-sciences.html 15/22
15/11/2017 danah boyd | apophenia Big Data: Opportunities for Computational and Social Sciences
tricia wang
[http://ww
w.triciawa
ng.com]
April 20th, 2010 at 2:12
am
[http://www.zephori
a.org/thoughts/archi
ves/2010/04/17/big-
data-opportunities-
for-computational-
and-social-
sciences.html#comm
ent-37639]
This is a really great
article to compliment your
post dana looks at 3
different interdisciplinary
areas of research and one
of those areas is
ethnography in the IT
industry!
http://bit.ly/d5vakd
[http://bit.ly/d5vakd]
Barry, A., Born, G., and
Weszkalnys, G. Logics of
interdisciplinarity.
Economy and Society 37,
1 (2008), 20-49.
This paper interrogates
influential contemporary
accounts of
interdisciplinarity, in which
it is portrayed as offering
new ways of rendering
science accountable to
society and/or of forging
closer relations between
scientific research and
innovation. The basis of
the paper is an eighteen-
month empirical study of
three interdisciplinary
fields that cross the
boundaries between the
natural sciences or
engineering, on the one
hand, and the social
sciences or arts, on the
other.
marcus
[http://na]
April 20th,
2010 at
10:11 am
http://www.zephoria.org/thoughts/archives/2010/04/17/big-data-opportunities-for-computational-and-social-sciences.html 16/22
15/11/2017 danah boyd | apophenia Big Data: Opportunities for Computational and Social Sciences
[http://www.zephori
a.org/thoughts/archi
ves/2010/04/17/big-
data-opportunities-
for-computational-
and-social-
sciences.html#comm
ent-37724]
nice article. thanks. has
be pondering if
quantitative and
qualitative will ever co-
exist. sort of like debating
the existence of god in
society. everyone sees
the same things, yet each
has a very different
perspective of what they
are seeing. hope you are
well.
e. pyatt
April 20th,
2010 at 3:49
pm
[http://www.zephori
a.org/thoughts/archi
ves/2010/04/17/big-
data-opportunities-
for-computational-
and-social-
sciences.html#comm
ent-37798]
Another good article. Ive
seen similar problems in
linguistic research by non-
linguists (from biologists
in particular) and
linguists trying to answer
archaeological problems
(usually badly), so I
sympathize.
There may be promising
approaches in in
combining information
from Big Data and
ethnography, but the
researcher has to
understand the
methodologies BOTH
disciplines so as to not
create a too simplistic
model (e.g. more daily
contact = closer
relationtionship).
http://www.zephoria.org/thoughts/archives/2010/04/17/big-data-opportunities-for-computational-and-social-sciences.html 17/22
15/11/2017 danah boyd | apophenia Big Data: Opportunities for Computational and Social Sciences
zephoria
[http://dan
ah.org]
April 20th,
2010 at 9:49 pm
[http://www.zephori
a.org/thoughts/archi
ves/2010/04/17/big-
data-opportunities-
for-computational-
and-social-
sciences.html#comm
ent-37858]
CS grad student: Learn
social science
methodologies from social
scientists. Develop a taste
for the different
methodological
approaches, what
questions can be
addressed through what
means, etc. Find a social
scientist advisor who can
help acculturate you.
Jed Hallam
[http://roc
k-star-
pr.com/]
April 21st, 2010 at 5:08
am
[http://www.zephori
a.org/thoughts/archi
ves/2010/04/17/big-
data-opportunities-
for-computational-
http://www.zephoria.org/thoughts/archives/2010/04/17/big-data-opportunities-for-computational-and-social-sciences.html 18/22
15/11/2017 danah boyd | apophenia Big Data: Opportunities for Computational and Social Sciences
and-social-
sciences.html#comm
ent-37942]
Hey Danah,
This is a brilliant post
thank you! I started
writing up a comment
basically talking about
how Big Data is effecting
social media analysis and
tried to put a social media
spin on things and it
ended up being horrifically
long so I published it as a
post instead (here ->
http://rock-star-
pr.com/big-data-and-
social-media-analysis/
[http://rock-star-
pr.com/big-data-
and-social-media-
analysis/] ).
Thanks again!
Cornelius
[http://yna
da.com/]
April 21st,
2010 at 9:06 am
[http://www.zephori
a.org/thoughts/archi
ves/2010/04/17/big-
data-opportunities-
for-computational-
and-social-
sciences.html#comm
ent-37972]
Thanks for a very
insightful and timely post,
danah. I think many of
the issues you raise about
the suggestiveness of
data and its
representation will stay
with us for many years to
come. I also think they
will spill out beyond
compsci/socsci and
beyond scholarship in
general. Im referring to
the applications of
social/human data
modeling in things like
profiling, context-sensitive
ads and semantic web
technologies that infer
http://www.zephoria.org/thoughts/archives/2010/04/17/big-data-opportunities-for-computational-and-social-sciences.html 19/22
15/11/2017 danah boyd | apophenia Big Data: Opportunities for Computational and Social Sciences
without understanding
context. People will
mistrust these
technologies even if they
become very reliable over
time, and there will be a
significant need for critical
debate and mediation
between those who build
them and the public at
large, which is something
social scientists and
humanities scholars who
are tech literate can
hopefully help with.
One thought came up
when reading your piece
and thinking about
numerous other pieces on
Big Data Ive read over
time (e.g. Chris
Andersons article on The
end of theory:
http://www.wired.com/sci
ence/discoveries/magazin
e/16-07/pb_theory
[http://www.wired.c
om/science/discoveri
es/magazine/16-
07/pb_theory] ). Big
Data ultimately strikes me
as an incredibly male
idea. Quantitative data is
incredibly suggestive, esp.
when visualized. There is
the idea that it shows or
proves something that
precludes other
interpretations and it
conveniently provides
rhetorical ammunition to
get almost anything
across (when misused).
Being data literate is
essential.
eliot
[http://ww
w.eliotbate
s.com]
April 23rd, 2010 at 12:41
am
[http://www.zephori
a.org/thoughts/archi
ves/2010/04/17/big-
data-opportunities-
for-computational-
http://www.zephoria.org/thoughts/archives/2010/04/17/big-data-opportunities-for-computational-and-social-sciences.html 20/22
15/11/2017 danah boyd | apophenia Big Data: Opportunities for Computational and Social Sciences
and-social-
sciences.html#comm
ent-38464]
Danah, thanks for this
blog posting, it dovetails
in an uncanny way with
Tim OReilleys recent
keynote at the MySQL CE
2010 conference
http://www.youtube.com/
watch?v=WqLB99dA48k
[http://www.youtub
e.com/watch?
v=WqLB99dA48k]
about The Cloud and
the corporate use of data
in order to facilitate
consumer convenience
(while accumulating an
unprecedented amount of
data that can be analyzed
and monetized). It seems
that many of the
problems you note by
scholars without a social
science background who
make erroneous
assumptions using Big
Data boil down to nothing
more that logical flaws
(intentional fallacies,
proof by assertion, etc.)
Perhaps a couple
semesters of formal
logical training would
assist, in addition to more
social science
background!
Ben
Shneiderma
n
[http://ww
w.cs.umd.edu/~ben]
May 3rd, 2010 at 7:18 am
[http://www.zephori
a.org/thoughts/archi
ves/2010/04/17/big-
data-opportunities-
for-computational-
and-social-
sciences.html#comm
ent-41606]
Go danah! Bravo for this
thoughtful post.
Combining quantitative &
http://www.zephoria.org/thoughts/archives/2010/04/17/big-data-opportunities-for-computational-and-social-sciences.html 21/22
15/11/2017 danah boyd | apophenia Big Data: Opportunities for Computational and Social Sciences
qualitative methods to
from credible case studies
is the challenge as
Science 2.0 researchers
press forward to
understand the science of
the made world. Big data
is a great opportunity, but
ethnographic methods are
needed to make the
results meaningful.
Boris
Shakhnovich
[http://ww
w.iamscien
tist.com]
October 23rd, 2010 at
11:21 am
[http://www.zephori
a.org/thoughts/archi
ves/2010/04/17/big-
data-opportunities-
for-computational-
and-social-
sciences.html#comm
ent-656986]
I think that the future of
science is really in inter-
disciplinary research that
can be used to create
collaborations between
folks that produce data
and those that analyzie it.
For example, check out
the latest in collaborative
grants and RFAs posted
by organizations here:
http://www.iamscientist.c
om/rfas
[http://www.iamscie
ntist.com/rfas]
http://www.zephoria.org/thoughts/archives/2010/04/17/big-data-opportunities-for-computational-and-social-sciences.html 22/22