Académique Documents
Professionnel Documents
Culture Documents
2011,AmericanMarketingAssociation
Allrightsreserved.Cannotbereprintedwithouttheexpress
permissionoftheAmericanMarketingAssociation.
Modeling Multiple Relationships in Social Networks
Asim Ansari
1
Oded Koenigsberg
Florian Stahl
December 21, 2010
1
Asim Ansari (maa48@columbia.edu) is the William T. Dillard Professor of Marketing at the Columbia
Business School, 517 Uris Hall, 3022 Broadway, New York, NY 10027. Oded Koenigsberg is Barbara
& Meyer Feldberg Associate Professor at the Columbia Business School, and Florian Stahl is Assistant
Professor at University of Zurich. Please address all correspondence to the rst author.
Abstract
Firms are increasingly seeking to harness the potential of social networks for marketing purposes.
Marketers are therefore interested in understanding the antecedents and consequences of relation-
ship formation within networks and in predicting interactivity among users. In this paper we
develop an integrated statistical framework for simultaneously modeling the connectivity structure
of multiple relationships of different types on a common set of actors. Our modeling approach
incorporates a number of distinct facets to capture both the determinants of relationships and the
structural characteristics of multiplex and sequential networks. We develop hierarchical Bayesian
methods for estimation and illustrate our model via two applications. The rst application uses a
sequential network of communications among managers involved in new product development ac-
tivities and the second uses an online collaborative social network of musicians. Our applications
demonstrate the benets of modeling multiple relations jointly for both substantive and predic-
tive purposes. We also illustrate how information in one relationship can be leveraged to predict
connectivity in another relation.
1 Introduction
The rapid growth of online social networks has led to a resurgence of interest within marketing
for studying the structure and function of social networks. A better understanding of social net-
works allows managers to comprehend and predict economic outcomes (Granovetter 1985) and
in particular, enables marketers to interface with both external and internal actors. Online brand
communities, which are composed of users interested in particular products or brands, allow such
an external interface with customers. Such communities not only help rms to interact with cus-
tomers and prospects but also enable customers to communicate and exchange information with
each other, consequently increasing the value that can be derived from a rms products. Similarly,
rms forge alliances and enter into collaborative relationships with other rms for co-production
and social commerce (Stephens and Toubia, 2010, Van den Bulte & Wuyts, 2007) using inter-
organizational networks. Within the rm, intra-organizational networks of managers play a crucial
role in cross-functional integration, as is the case with networks of marketing and organizational
professionals engaged in new product development (Van den Bulte & Moenaert, 1998).
As pointed out by Van den Bulte and Wuyts (2007), network structure has implications for
power, knowledge dissemination and innovation within rms and for contagion and diffusion
among customers. Thus, understanding and predicting the patterns of interactions and relationships
among network members is an important rst step in their effective use for marketing purposes.
The focus of social network analysis is on a) explaining the determinants of relationship formation
b) identifying well connected actors and c) capturing structural characteristics of the network as
described by reciprocity, clustering, transitivity, and other measures of local and global structure
using a combination of assortative, relational and proximity perspectives (Rivera, Soderstrom and
Uzzi, 2010).
Actors belonging to a social network connect with each other using multiple relationships, pos-
1
sibly, of different types. In this paper we develop statistical models of multiple relations that yield
an understanding of the drivers of multiplex relationships and predict the connectivity structure of
such multiplex networks. Multiplexity of relationships can arise from different modes of interac-
tion or because of different roles individuals play within a network setting. For example, in many
online networks, members can form explicit friendship and business relations, exchange content
with each other and can communicate with each other. The different relationships that connect a
group of actors can differ not only in their substantive content but also in their directionality and in-
tensity. For instance, some relationships are symmetric in nature, where as others can be directed.
Some relationships involve the ow of resources thus necessitating a focus on the intensity of such
weighed connections. Finally, multivariate patterns of connections can also arise from viewing the
same relationship at different points in time, as is the case of sequential networks.
An understanding of multiplex patterns in network structures is important for marketers. For
instance, Tuli et al. (2010) nd that in a business-to-business setting increasing multiplexity in
relationships leads to an increase in sales and to a decrease in sales volatility to a customer. Mul-
tiplexity contributes to the total strength of a tie and increases the number of ways in which one
can reciprocate favors (Van den Bulte and Wuyts 2007). Thus, it is relevant for identifying inu-
ential actors such as opinion leaders in diffusion contexts and powerful executives within intra-
organizational networks. In analyzing sequential relationships, a multivariate analysis can help
investigate the impact of managerial interventions on the relationship structure of a network across
different points. Sequential dyadic interactions are also useful for understanding the dynamics
of power and cooperation within intra-organizational networks and in modeling long-term rela-
tionships between buyers and sellers in business markets (Iacobucci and Hopkins, 1992). Finally,
when marketers are interested in predicting relationship patterns, multiplexity allows leveraging
information from one relationship to predict connections on other relations.
A substantive understanding of multiplex relationships can be obtained by simultaneously an-
2
alyzing the multiple connections among the network actors. One can investigate whether these
multiple relations exhibit multiplex patterns that are characterized by the ow of multiple relations
in the same direction, or whether they represent patterns of generalized exchange where a tie in
one direction on one relation is reciprocated with a connection in the other direction using a differ-
ent relationships. Most models of social networks, however, analyze a single relationship among
network members. When the lens is trained on a single relationship, an incomplete understanding
of the nature of linkages can result. For instance, one cannot fathom whether individuals play a
similar role across multiple relationships. A joint analysis can also help in uncovering common
antecedents that impact relationships, and if some relationships exhibit unique patterns, then such
uniqueness can emerge only when multiple relationships are contrasted.
In this paper we develop an integrated latent-variable framework for modeling multiple rela-
tionships. We make a methodological contribution to the social networking literature both within
marketing and in the wider social sciences by offering a rich framework for modeling multiple
relations of different types. Our modeling framework has several novel features when compared
to previously proposed models for multi-relational network data. Specically, our framework can
a) model multiple relationships of different types (i.e., weighted, un-weighted, and also both undi-
rected and directed), b) model sequential relationships, c) leverage partial information from one
network (or relation) to predict connectivity in another relation, d) accommodate missing data in
a natural fashion, e) capture sparseness in weighted relationships, f) incorporate sources of dyadic
dependence, g) account for higher order effects such as triadic effects and h) include continuous
covariates. While some of the above aspects have been handled by previous models, we do not
know of any paper in the social network analysis literature that simultaneously incorporates all the
above aspects.
We illustrate the benets of our approach via two applications. Our rst application involves
sequential network data that studies network structure over two points in time. We reanalyze data
3
from Van den Bulte and Moenaert (1998) involving a network of R&D, marketing and opera-
tions managers who are engaged in new product development. The data contains communications
among these managers both before and after co-location of R&D teams. Our results show that
substantive conclusions can be impacted if the full generality of our framework is not utilized. We
also show how our methods can be used to leverage information from one relationship to predict
the connections in another relationship.
In our second application, we use data from an online social networking site involving the
interactions among a set of musicians. We model friendship, communications and music down-
load relationships within this network to show how a combination of directed & undirected, and
weighted & un-weighted relations can be modeled jointly. We analyze the determinants of these
relationships and assess the importance of our model components in capturing different facets of
the network structure. Our results show that artists exhibit similar network roles across the three
relationships and that these relationships are mostly inuenced by common antecedents. We also
show that when dealing with weighted relationships (e.g., music downloads), it is crucial to model
jointly both the incidence and intensity of such relationships, instead of just focusing on the inten-
sity, otherwise, prediction and recovery of structural characteristics suffers appreciably.
The rest of the paper is organized as follows. The next section provides a brief review of the
marketing and statistical literature on social networks. Section 3 presents the components of our
modeling framework and describes inference and identication of model parameters. Section 4 and
Section 5 describes the two applications. Section 6 concludes with a discussion of our contributions
and model limitations, and also outlines future research possibilities.
4
2 Literature Review
Social network data offers considerable opportunities for research in marketing, as has been iden-
tied by Van den Bulte and Wuyts (2007) in their expansive survey of the role and importance of
social networks in our eld. Most research in marketing on social networks falls into one of two
streams. Papers in the rst stream explore the impact of word of mouth on the behavior of others,
and are thus primarily concerned about the role of social interactions and contagion (Iyengar, Va-
lente and Van den Bulte 2010; Watts and Dodds 2007; Nair, Manchanda and Bhatia 2010; Trusov,
Bodapati and Bucklin, 2010). Papers in the second stream focus on modeling network structure.
Iacobucci and Hopkins (1992) is a pioneering contribution in this area. The focus here is on un-
derstanding the antecedents of relationship formation (Narayan and Yang, 2007) and in studying
how interventions inuence future connectivity (Van den Bulte and Moenaert, 1998). The current
paper adds to this second stream of research by offering a comprehensive framework for modeling
multivariate or sequential relations.
There is a rich history of statistical modeling of network structure within sociology and statis-
tics that spans more than 70 years. More recent approaches stem from the log-linear p
1
model
developed by Holland and Leinhardt (1981) which assumes independent dyads. As the p
1
model is
incapable of representing many structural properties of the data, two general ways of capturing the
dependence among the relationships have been proposed in the literature. The rst approach uses
Exponential Random Graph Models (ERGM) or p
X
ij1
, X
ji1
X
ij2,I
, X
ji2,I
X
ij2,S
, X
ji2,S
, i < j.
The relations can be further specied in terms of underlying latent variables. The latent variable
specication allows us to model these random variables in terms of dyad-specic and actor-specic
covariates.
Latent Variable Specication: We use underlying latent utilities u
ij1
, for modeling the existence
of a tie in the direction (i j) and u
ji1
in the reverse direction, for the rst relationship,
X
ij1
=
1, if u
ij1
> 0,
0, otherwise,
X
ji1
=
1, if u
ji1
> 0,
0, otherwise.
(1)
For the second relationship, let u
ij2
and u
ji2
represent the underlying utilities that characterizes the
existence of the relationship. We assume as before, that,
X
ij2,I
=
1, if u
ij1
> 0,
0, otherwise,
X
ji2,I
=
1, if u
ji2
> 0,
0, otherwise.
(2)
10
The truncated counts, conditional on a tie in a given direction, are modeled using a Poisson distri-
bution truncated at zero, i.e.,
X
ij2,S
tPoisson(
ij
), if u
ij2
> 0,
X
ji2,S
tPoisson(
ji
), if u
ji2
> 0.
(3)
where, the
ij
and
ji
are the rate parameters of the Poisson.
Covariates and Homophily: Each latent utility is composed of a systematic part involving the
observed covariates and a stochastic part that incorporates unobserved variables. We distinguish
between dyad-specic covariates and individual-specic covariates. Let x
d
ij1
and x
d
ij2
be vectors
of dyad-specic covariates that inuence the two relations. These allow for homophily, which
implies that individuals who share observable characteristics tend to form connections. We also
use individual-specic covariates for modeling directed relations. Let x
i1
, and x
i2
be the vector of
individual is covariates, respectively, for the two relations. The vector x
ij1
= (x
d
ij1
, x
i1
, x
j1
) then
represents all the covariates that impact the tie in the direction (i j) for the rst relation and
x
ji1
= (x
d
ji1
, x
j1
, x
i1
) represents the covariates for the reverse direction (i j). The sender and
receiver effects are important to model the asymmetry in the two directions. For the weighted rela-
tionship one can further distinguish between the incidence and intensity components. We therefore
use covariate vectors x
ij2,I
= (x
d
ij2,I
, x
i2,I
, x
j2,I
) and x
ji2.I
= (x
d
ji2,I
, x
j2.I
, x
i2,I
) for the binary
component, where, for example, x
d
ij2,I
contains a subset of the dyad-specic variables in x
d
ij2
that
impact intensity, and x
i2,I
is similarly a subset of x
i2
. We use analogously dened vectors x
ij2,S
and x
ji2,S
to model the Poisson rate parameters
ij
and
ji
.
Heterogeneity: The dyads cannot be considered independent as multiple dyads share a common
actor either as a sender or receiver. Accounting for such dependence is important for obtaining
proper inferences about substantive issues. We therefore use heterogeneous and correlated random
effects to account for the dependence structure. While for undirected relations a single random
effect is needed, for directed relationships, one can use two distinct random effects per actor to
11
distinguish between expansiveness, which is the propensity to send ties and popularity or attrac-
tiveness which is the propensity to receive ties. The expansiveness parameter
i
, captures the
outdegree which is the number of connections emanating from an individual i and the attractive-
ness parameter
i
, captures the indegree, which is the number of connections impinging upon an
individual. Thus for the directed binary relationship we use random effects
i1
and
i1
. We simi-
larly use
i2,I
and
i2,I
for the incidence component and
i2,S
and
i2,S
for the intensity equations
of the weighted directed relationship.
Let
i
= {
i1
,
i1
,
i2,I
,
i2,I
,
i2,S
,
i2,S
}. We allow these random effects to be correlated
across the relationships and assume that
i
is distributed multivariate normal
2
N(0,
) where
, 11
, 12
, 21
, 22
.
The diagonal sub-matrices capture the within relation covariation in the random effects within a
relation. A positive correlation between the random effects for a relation implies that popular in-
dividuals also tend to reach out more to others. The off-diagonal sub-matrices capture correlation
across relationships and help determine whether individuals exhibit similar tendencies across rela-
tions. If the off-diagonal sub-matrices in
ir
z
jr
.
3
In particular,
for a two relation model, we use two kernels, one for each relationship, represented generically as
z
ir
z
jr
. The latent vectors for each relationship are assumed to come from a relationship-specic
multivariate-normal distribution z
ir
N(0,
zr
). The dimensionality of the latent space can be
determined using a scree plot of the sum of the mean absolute prediction error of the entire triad
census versus the dimensionality, as is usually done in the multidimensional scaling literature.
4
Full Model: Bringing together all the components of the model, we can write the latent utilities
3
Other kernels such as those based on the Euclidean norma can also be used. We leave a detailed examination of
the pros and cons of using different kernel forms for future research.
4
The Bayes factor can also be used to determine dimensionality. However, it is difcult to compute in our model
given the high dimensional numerical integration that is involved in obtaining the likelihood for each observation. We
therefore opt for the predictive MAD criterion that focuses on triad census recovery.
13
and response propensities as follows:
u
ij1
= x
ij1
1
+
i1
+
j1
+z
i1
z
j1
+ e
ij1
,
u
ji1
= x
ji1
1
+
j1
+
i1
+z
i1
z
j1
+ e
ji1
,
u
ij2
= x
ij2,I
2,I
+
i2,I
+
j2,I
+z
i2
z
j2
+ e
ij2,I
,
u
ji2
= x
ji2,I
2,I
+
j2,I
+
i2,I
+z
i2
z
j2
+ e
ji2.I
,
log
ij
= x
ij2,S
2,S
+
i2,S
+
j2,S
+z
i2
z
j2
+ e
ij2,S
,
log
ji
= x
ji2,S
2,S
+
j2,S
+
i2,S
+z
i2
z
j2
+ e
ji2,S
.
(4)
We assume that the vector of all errors e
ij
is distributed multivariate normal N(0, ). Also,
i
N(0,
) and z
ir
N(0,
zr
), r.
Identication: Not all parameters of the model are identied. The error variance matrix has a
special structure because of scale restrictions on the binary utilities and because of exchangeability
considerations stemming from the fact that the labels i and j are arbitrary within a pair. As the
scale of the utilities of the binary responses cannot be determined from the data, the error variances
associated with the binary components are set to 1. In addition, symmetry restrictions on the
correlations stem from the exchangeability considerations and the resulting variance matrix can be
written as
=
X
ij1
X
ji1
X
ij2,I
X
ji2,I
X
ij2,S
X
ji2,S
X
ij1
1
1
2
3
4
5
X
ji1
1
3
2
5
4
X
ij2,I
1
6
7
8
X
ji2,I
1
8
7
X
ij2,S
2
9
X
ji2,S
2
(5)
The correlation parameter
1
captures the impact of common unobserved variables impacting the
14
binary relationship and also accounts for reciprocity. Similarly,
6
and
9
capture correlations for
the weighted relationship and also account for reciprocity within this relationship. The correlation
parameters
7
and
8
reect common unobserved variables that inuence both the incidence and
intensity equations of the weighted component of the second relationship, and are akin to selectiv-
ity parameters. Notice that the intensity equations have a common variance.
The expansiveness and attractiveness random effects are individual-specic and are thus sepa-
rately identiable from the equation errors which are dyad-specic. The latent positions also are
individual-specic, but as they enter the equations as interactions, they can be separately identied
from the random effects. However, as they appear in bilinear form, we can only identify these
subject to rotation and reection transformations. Finally, the covariance matrices
z1
and
z2
associated with the latent space parameters are restricted to be diagonal as their covariance terms
are not identied. Moreover, each matrix has a common variance term across all the dimensions
within a latent space.
Table 1 summarizes how the different model parameters can be related to substantive issues
of interest. Given that the model components work in tandem, a parameter may also be related
to other aspects apart from the one shown in the table. For example,
1
is needed to capture
reciprocity, but may also represent the impact of other shared unobservables.
3.2 Bayesian Estimation
We now describe briey our inference procedures. The likelihood for the model is computation-
ally complex. Conditional on the random effects and latent positions, the dyad-specic likelihood
requires numerical integration to obtain the multivariate normal c.d.f. Moreover, the unconditional
likelihood for the entire network requires additional multiple integration of very high dimensional-
ity because of the crossed nature of the random effects. The dependency structure of our model is
15
considerably more intricate than what is typically encountered in typical panel data settings in mar-
keting as one cannot assume independence across individuals or dyads for computing the uncondi-
tional likelihood. We therefore use Markov chain Monte Carlo methods involving a combination
of data augmentation and the Metropolis-Hastings algorithm to handle the numerical integration.
The data augmentation step allows us to leverage information from one relation to predict missing
data on other relations. The complexities involved in modeling multiple relations and the identi-
cation restrictions on the covariance matrix mean that the methods of inference for existing
latent space models, as outlined, for example, in Hoff and colleagues (2002, 2005), cannot be used
directly for our model. We therefore provide a full derivation of the posterior full conditionals in
Appendix 1.
4 New Product Development Network
In our rst application, we illustrate our modeling framework on sequential network data. We start
with a special case of our general modeling framework and handle the simpler situation of a single
directed binary relationship observed at two points in time. We, therefore, do not need the intensity
component in this application. We use the same data as in Van den Bulte and Moenaert (VdBM,
1998) on communications among members of different R&D teams and marketing and operations
professionals who are all involved in new product development activities. Here, we briey analyze
this dataset to revalidate the results in VdBM and to investigate whether our modeling framework
(which differs signicantly from that in VdBM) is better able to recover the structural characteris-
tics of the network, and whether it generates different conclusions or additional insights.
Data are available about communication patterns both before and after the R&D teams were
co-located into a new facility. The data set used in VdBM and the one we reanalyze here, comes
from a survey conducted in the Belgian subsidiary of a large U.S. telecommunication cooperation.
16
The data consists of two 22-by-22 binary (who talks to whom) sociomatrices, X
1
and X
2
, one for
1990 (before co-location) and one for 1992 (after the R&D teams were co-located in a separate
facility). The actors in both years are the same, 13 R&D professionals spread over four teams and
nine members of a steering committee consisting of seven positions in marketing and sales and two
in operations. The characteristic element x
ij,t
in each of the two matrices is 1 if i reports to talk to
j at least once a week in year t and zero otherwise. VdBM studied the impact of the co-location
intervention on the communication and cooperation patterns among the different R&D teams and
between R&D and marketing/operations by contrasting these patterns before and after co-location.
Barriers between R&D and marketing professionals resulting from differences in personality,
training or department culture imply that intra-team and intra-functional communication will be
more prevalent than cross-team and cross-functional communication. The idea of co-location was
to foster communication among the different R&D teams. VdBM proposed ve hypotheses to
investigate various substantive issues of interest. The rst two hypotheses test the communicative
implications of the barriers between R&D and marketing and relate to team- and function-specic
homophily effects. The third and fourth hypotheses focus on effects of co-locating R&D teams to
foster communication among these teams. In a fth hypothesis VdBM posit that co-locating R&D
groups may not just foster between group communication, but may even annihilate any difference
between within and between-group communication.
VdBM used p
1
log-linear models developed by Wasserman and Iacobucci (1988) to test their
hypotheses. Our approach differs from that of these earlier papers on several facets. First, in
contrast to the p
1
model, we do not assume dyadic independence. In our model, the dyads are in-
dependent conditional on the random effects, but are dependent unconditionally. Second, we allow
for individual-specic expansiveness and attractiveness parameters in contrast to group specic
parameters to yield a richer specication of heterogeneity. Finally, we account for higher order
effects via a latent space. The added generality of our model is consistent with the call by VdBM
17
for a new generation of models better able to handle triadic effects and other dependency issues.
4.1 Models and Variables
We estimated four models on the dataset. These include, Full: The full model involves all the
different components that form part of our modeling framework. These components include dyad-
specic variables, attractiveness and expansiveness random effects that are correlated both within
and across years, separate latent spaces for the two years, and correlated error terms for the utility
equations of the four binary variables characterizing a dyad.
Uncorr: This is a restriction of the full model. It assumes that the random effects and the utility
errors are correlated within a year but are uncorrelated across years. This is akin to having a
separate model for each year, and this offers limited leeway in modeling muliplexity.
Nozizj: This is a restriction of the full model such that the higher order terms that characterize the
latent space, (i.e., the z
i
z
j
terms) are not included. We use this model to assess whether using the
latent space results in better recovery of the triadic structure of the network, and whether it impacts
substantive conclusions.
Team: This model closely mimics the VdBMpaper within our modeling framework. In this model,
we restrict the expansiveness and attractiveness parameters to be the same for all individuals within
a group and also do not include the latent space.
4.2 Variables
The variables used in our investigation are the same as in VdBM and are reproduced below:
INTEAM
ij
= 1, if i and j are R&D professionals on the same team, 0, otherwise.
18
BETWTEAM
ij
= 1, if i and j are R&D professionals but in different teams, 0, otherwise.
INRD
ij
= 1, if i and j are R&D professionals, 0, otherwise.
INMKTOPS
ij
= 1, if i and j are both marketing or both operations executives, 0, otherwise.
4.3 Results
The four models were estimated using MCMC methods. Each MCMC run is for 250,000 iterations
and the results are based on 200,000 iterations after discarding a burn-in of 50,000.
4.3.1 Recovery of Structural Characteristics
We begin by comparing the above described models in their ability to recover the structural charac-
teristics of the network. Given our interest on modeling sequential relationships, we focus on those
aspects of the network structure that pertain to the two relations simultaneously. In particular, we
compute statistics involving the dyadic as well as transitivity patterns of interactions that span both
years.
Dyadic relationships in each year can be described as belonging to one of the following three
types: Mutual (M), Asymmetric (A) and Null (N). Looking across the two years, one can construct
the following ten possible combinations (Fienberg et al., 1985) of these types; NN, AN, NA,
MN, NM, AA, AA, AM, MA and MM. The names are self explanatory for most pairs. For
example, NN refers to a the number of dyads that are null in both years. Two patterns that require
greater explanation are AAand AA. The pair AArepresents a dyad in which one actor is connected
to the other in both years, but neither relationship is reciprocated. The pair AA represents a dyad
in which one actor initiates communication with the other in the rst year and the other actor
reciprocates by initiating in the second year, a kind of generalized exchange.
19
Table 2 reports the recovery of the sequential dyadic patterns. The columns report the absolute
deviations between the actual frequencies and those predicted by the different models. The last
row of the table reports the Mean Absolute Deviations (MAD) across all the patterns for each
model. It is clear that the uncorrelated model (Uncorr) which ignores cross-year linkages and
models the two years separately is signicantly worse in recovering the cross-relationship dyadic
patterns compared to the other models. All other models are roughly similar in their recovery with
Team being the best. This indicates that it is important to model the two relations jointly to be able
to recover the cross-relation dyadic patterns, as all the models, except for Uncorr, accommodate
correlations across the two relations.
We also investigate transitive patterns spanning both years to understand how well the model
recovers extra dyadic effects. This is needed to assess whether adding the latent space is impor-
tant. Eight such transitivity effects are possible. These are {X
ij1
, X
jk1
, X
ik2
}, {X
ij1
, X
jk2
, X
ik1
},
{X
ij1
, X
jk2
, X
ik2
}, {X
ij2
, X
jk1
, X
ik2
}, {X
ij2
, X
jk2
, X
ik1
}, {X
ij2
, X
jk1
, X
ik1
} {X
ij1
, X
jk1
, X
ik1
}
and {X
ij2
, X
jk2
, X
ik2
}. For the sake of brevity, we do not include a full table of results which is
available from the authors. We nd that the Full model performs signicantly better that all other
models in recovering these transitive patterns (MAD=34.43). The Full model and Uncorr model
(MAD =47.09) both of which include the latent space perform signicantly better than the other
two models Nozizj (MAD= 109.46) and Team (MAD= 164.69) which do not include the higher
order effects.
In summary, looking across all dyadic and triadic measures, we nd that the Full model always
does better than Uncorr, thus highlighting the need for joint modeling. The Full model also
performs better than all the other models in recovering the transitivity patterns, indicating that in
this application, the latent space is important for handling extra dyadic patterns. One can conclude
on the whole, that the Full model recovers best the structural characteristics of the network.
20
4.3.2 Parameter Estimates and Hypotheses
We begin by investigating whether the assumption of individual-level expansiveness and attrac-
tiveness parameters in our models has support. Figures 1 and 2 report the and values for the
22 managers for Year 1 and Year 2 respectively for our Full model. The labels for each point in
the gures represents the team name. It is apparent from these gures that while some members
of a group are clustered together, many groups exhibit considerable within-group heterogeneity.
This is particularly noticeable for the Marketing group (m) and the R&D groups (r
3
, r
1
, and r
4
)
which exhibit greater within-group variability. This suggests that the data support a richer charac-
terization of heterogeneity than what is possible with group-specic effects. We now move to the
parameter estimates.
The parameter estimates are summarized in Table 3. It is evident from the table that the pa-
rameters values differ substantially across the models in their magnitude and signicance, clearly
indicating that the different model components inuence substantive conclusions. Focusing on the
bottom part of the table, we see that models that do not include the latent space yield higher esti-
mates of the error correlations, possibly due to the confounding of variances across levels. We can
use the coefcients to infer the degree of support for the different hypotheses studied in VdBM.
Table 4 reports the extant of support for each hypothesis according to the different models. The
column named VdBMcontains results from Van Den Bulte and Moenaert (1998). All other entries
are computed from our reanalysis. Each entry for a particular model represents the probability that
the corresponding hypothesis is true under that model. A number of interesting differences across
the models are evident from this table.
The rst two hypotheses pertain to within and between team homophily effects. Comparing
the Full model with VdBM, we see that our model supports both H1 and H2, whereas, VdBM
found mixed support for these. In particular, we nd that H1b has signicant probability across all
21
our models indicating that R&D professionals tended to communicate predominantly with other
R&D folks before the move. The Full model as well as Nozizj and Uncorr suggest strong support
for Hypotheses 2 (H2a and H2b) in contrast to Team and VdBM. It appears that differences in
support for this hypothesis are driven by the extant of within group heterogeneity captured by
the model. Recall that both Team and VdBM assume that all members within a group share the
same attractiveness and expansiveness parameters. Figures 1 and 2 show, however, that there is
considerable heterogeneity in the recovered and parameters in the marketing group and we
nd that failure to model this heterogeneity comprehensively can affect substantive conclusions.
We also nd some differences in the support for the co-location hypotheses H3 and H4 across
the models. The full model Full and Uncorr has a lower probability associated with these hy-
potheses when compared to VdBM, Team and Nozizj. These differences can be explained by the
presence or absence of the latent space for capturing higher order effects and are also consistent
with the strong support for H1b in our model. Finally, all models yield no support for H5
5
. These
differences in supported hypotheses (H1b, H2a, H2b, H3 and H4) across models clearly demon-
strate that the different model components can impact the theoretical and substantive conclusions,
and that it is important to account and dyadic-dependence and higher order effects.
4.3.3 Leveraging Information Across Relationships
We now illustrate how our framework can be used to leverage information in one relationship to
predict relations in another. For instance, we assume that the data involving the entire marketing
groups is missing in the second year. In such a situation, the log-linear modeling framework
employed by previous researchers cannot be readily used, as the group-specic parameters that
are used in such models will not be available for the marketing group. However, for our models,
the natural reliance on data augmentation to obtain the utilities and the individual-specic random
5
For H5b, we found that the probability associated with INTEAM2 > BETWTEAM2 is 0.99 for all our models
22
effects when estimating parameters ensures that such missing data can be handled seamlessly.
In particular, the covariance matrix of the random effects can be used to leverage information
from year 1 to year 2 about these individual-specic parameters. We therefore estimate our Full
model and Uncorr on such a data set to see whether incorporating cross-relation linkages improves
predictions in such situations. Notice that in the Full model, the data on year 1 for the individuals
in the marketing group can be leveraged to predict relations in year 2. This is not possible in the
uncorrelated model Uncorr where the two years are modeled separately. Tables 1 and 2 (in the
Online Technical Appendix) report how well the cross-year dyadic and transitivity patterns are
recovered on such a data set with missing values for the marketing group. These tables report the
absolute deviations between the actual frequencies and those predicted by the Full and Uncorr
models and illustrate that the Full model does signicantly better in recovering these cross-year
relationships.
6
5 Online Social Network
In this application our focus is on modeling relationships of different types. We use a combination
of undirected and directed binary relationships and a directed weighted relation. In particular, we
show how it is important to model both the incidence and intensity of weighted relationships, as
conclusions and predictions depend crucially on this distinction. The data for this application come
from a Swiss online social network on which members can create a prole either as a user or as an
artist, and can then connect with other registered members via friendship relationships. The social
networking site offers different services to these two distinct user groups. While both groups can
publish user generated content such as blogs, photos or videos on their proles, only artists can, in
addition, publish up to a maximum of 30 songs on their prole.
6
We also investigated the role of demographics that were part of the data, but did not nd any signicant impact.
Our models can handle such continuous covariates, something that is not possible with a log-linear specication.
23
Artists use the different services offered by the platform to promote their music and concerts
and to seek collaboration with other musicians and bands. They establish friendship relations
with other artists, send personal messages and write public comments on other artists proles.
For entertainment and informational reasons, users, as well as artists, visit proles of different
artists and download songs. Artists engage in active promotion and relationship effort in the hope
that it can result in increased collaboration, communication, popularity, and song downloads. In
summary, the online networking site offers a platform that combines social networking services
with entertainment and communication services. There are four components of the data set (1)
member data, (2) friendship data, (3) communication data and (4) music download data. The
member data contains information collected at registration and includes stable variables such as
the registration date, date of birth, gender, city, or in the case of artists, their genre, and information
about their ofine concerts and performances before joining the network. In addition, the data also
contain information on the number of page-views of each members page on the network during
a given time period. The other data components pertain to our three dependent variables and are
described in greater detail below.
5.1 Data and Structural Characteristics
Our sample consists of 230 artists who created a prole on the network between February 1st
and March 31st 2007, provided information about their activities and characteristics, and uploaded
atleast one song on their prole. We model three types of relationships among these artists over the
course of the six months between April 1st and September 30th 2007.
7
These relationships include
1) friendship (f), 2) communications (c) and 3) music downloads (m). The dataset thus contains
three 230 230 matrices Y
f
, Y
c
and Y
m
, respectively, for these relationships.
7
Our data is not entirely representative of the entire network which consists of other artists who joined the network
subsequent to our data period. In addition, we focus only on the sub-network of connections involving artists, instead
of also considering fans, as this is consistent with the primary focus of the network in offering a platform for artists to
present themselves and seek collaborations.
24
5.1.1 Structural Characteristics
We now briey describe the structural characteristics of the three relationships for our set of artists.
1) Friendship: The friendship relation y
f
ij
is binary and undirected. It indicates whether a friend-
ship is formed between the pair {i, j} before the end of our data period. The network has 3,564
friendship relations among a maximum possible 26,335 connections, yielding a network density
of 13.53%.
2) Communications: The communication relation y
c
ij
is binary and directed and indicates whether
artist i sent a communication (direct message or comment) to artist j within the time period of
the data. We observe 4,575 communication relations, yielding a density of 8.68%. The relation
exhibits considerable reciprocity or mutuality (dened as the ratio of the number of pairs with
bidirectional relations to the number of pairs having at least one tie) equal to 30.9%. Artists vary in
their level of expansiveness (or outdegree) as measured by the number of artists they communicate
with and their popularity or receptivity i.e., the number of artists communicating with a given artist
(indegree). The indegree and outdegree distributions are highly skewed. The mean degree is 19.97.
The maximum and minimum for the indegree distribution are 203 and zero respectively, whereas,
the maximum and minimum for the outdegree distribution are 182 and zero respectively.
3) Music Downloads: The music downloads represent a directed and weighted relationship. Each
song download entry y
m
ij
is a count of the number of times that artist j listens to a song on artist
is prole and may include multiple downloads of the same song. As discussed in Section 3, we
distinguish between the incidence and intensity of music downloads. Focussing rst on incidence,
we nd that of the possible 52,670 ties our data contains only 17,912 binary ties, implying a density
of 34%. The reciprocity is 39.1%. The outdegree of an artist is the number of other artists who
download from that artist and the indegree is the number of other artists from whom the artist
downloads music. For the binary component, the mean degree is 17.89 and the maximum indegree
25
and outdegree are 213 and 135, respectively. As this is a weighted relation, we can also study the
intensity of connections. On average, each artist downloads songs on 59.98 occasions (including
multiple song downloads). The maximumweighted-indegree i.e., the number of songs downloaded
from an artist is 2,899, whereas, the maximum weighted-outdegree, i.e., the maximum number of
times a single artist listens to songs is 612. We also nd that the artist-specic degree statistics are
highly correlated across the three relationships.
5.2 Model Specications and Variables
We estimate a number of different variants of the full model (which we name Full Model from now
on) that we outlined in our model section (Section 3). Our null models impose different restriction
on the full model and are constructed to investigate how crucial the different components of the
full model are in capturing important aspects of the data generating process. The models are:
Full Model: The full model includes dyad-specic covariates (x
ij
) to accommodate homophily
and heterophily; artist-specic covariates (x
i
) and (x
j
) to account for asymmetry in responses;
artist-specic and relationship-specic sender and receiver parameters (
i
), to model heterogeneity
in expansiveness and receptivity; correlations in these random effects across relationships (
);
latent spaces of random locations for the three relations, (z
im
, z
ic
and z
if
), to capture higher
order effects; and correlations in errors (
m
,
c
, and
f
), to incorporate reciprocity within each
relationship. In addition, the full model uses a correlated hurdle count model to account for the
sparse nature of many network datasets.
Poisson: This model uses a Poisson distribution for the music download relationships, instead of
the correlated hurdle Poisson. Thus, the counts are modeled directly and we do not distinguish
between the incidence and intensity of counts. The model is otherwise identical to the full model
in all other respects.
26
Uncorr: In this model we treat the three relationships as independent. Thus, the artist-specic
random effects in
i
are assumed to be uncorrelated across the relationships, and therefore,
has
a block-diagonal structure. This model is thus equivalent to running three separate models on the
three relationships.
NoZiZj: Here, we do not include the higher order terms z
i
z
j
that characterize the latent space.
5.2.1 Variables
Each artist can be described using the variables detailed in Table 5 . We use these artist-specic
variables to compute dyad-specic variables. We use different combinations of the artist and dyad-
specic variables in explaining the three relationships. Details of the covariate specications for
the three relationships are given in Appendix 2. Table 6 shows descriptive statistics associated with
these covariates. The last two rows of Table 6 contain dyad-specic binary variables. The variable
Common Region is equal to 1 when both artists in a dyad are from the same region, otherwise it is
zero. The last variable Common Genre is coded in an analogous manner.
5.3 Results
The models are estimated using MCMC methods that are described in Appendix 1. Each MCMC
run is for 250,000 iterations and the results are based on a sample of 200,000 iterations after a
burn-in of 50,000.
5.3.1 Model Adequacy
Before discussing parameter estimates, we use a combination of predictive measures and posterior
predictive model checking (Gelman et al., 1996) to compare the adequacy of our models. This
27
involves generating G hypothetical replicated data sets from the model using the MCMC draws
and comparing these datasets with the actual dataset. These comparisons are made using various
test quantities that represent different structural characteristics of the network. If the replicated
data sets differ systematically from the actual data on a given test quantity, then the model does not
adequately mimic the structural characteristic that the test quantity represents. The discrepancy
between the replicated data sets and the actual data can be assessed using posterior predictive p-
value. This p-value is the proportion of the G replications in which the simulated test quantity
exceeds the realized value of that quantity in the observed data. An extreme p-value, (one that is
either close to 0 or 1, i.e, 0.05 or 0.95) suggests inadequate recovery of the corresponding test
quantity.
We use a number of test statistics associated with the weighted relationship to assess whether
the distinction between incidence and intensity is crucial. Table 7 shows the model adequacy
results for the music download relationship based on G = 10, 000 MCMC draws. Column 2 of
the table reports the value of the test statistics for the observed data and the other columns report
the posterior predictive mean and p-values for the different models. A few conclusions can be
readily drawn from the table. First, we see that Poisson which models the intensity directly using
a Poisson specication (as in Hoff 2005) does not recover any of the test statistics adequately as
almost all p-values in Column 6 are extreme. Furthermore, the posterior predictive mean values
for the test statistics (Column 5) are appreciably different from their counterparts in the actual
data. Second, we observe that none of the p-values associated with the other models are extreme,
and this indicates that modeling incidence and intensity separately is important for adequately
capturing the structural network characteristics associated with weighted relations. Finally, the
fact that the NoZiZj model performs almost as well as the Full model indicates that the latent
space is not crucial for the recovery of structural characteristics in this application.
The discussion above focussed on model adequacy for music downloads to highlight the con-
28
tribution of the correlated multivariate hurdle Poisson in modeling weighted relationships. The
results from the other two relationships show that all models (including Poisson) are similar in
their recovery of structural characteristics for these relationships. The model adequacy results are
based on in-sample simulations. We report the predictive performance of our model in the online
technical appendix. We nd that the Full model outperforms other models on almost all measures.
We also nd that the Poisson model does very poorly in predicting future activity. indicating that
there are signicant gains in modeling the incidence and intensity separately.
5.4 Parameter Estimates
We now discuss the parameter estimates based on the entire sample of six months. We focus on
the parameter estimates from the Full model. Estimates from other models mostly yield similar
qualitative conclusions and we do not include these for the sake of brevity. We begin with the
covariate effects.
5.4.1 Covariate Effects
Table 8 reports the posterior means and standard deviations of the coefcient estimates for the
three relationships. In interpreting this table, recall that all variables associated with the friendship
relation are dyad-specic and binary, whereas, for the other relations, some variables are dyad-
specic and some are artist-specic. Also, note that both components of the music relation have
the same set of covariates. The denitions for these covariates are available in Appendix 2.
For the sake of brevity we synthesize the results across all the relationships. There is clear
evidence of homophily and proximity, as for all the three relationships, the dyad-specic variables
CRegion and CGenre have a positive and signicant impact on the likelihood of forming connec-
tions. The positive coefcient for CRegion is consistent with the fact that a common language
29
and geographical proximity can enhance the likelihood of collaborative effort. The positive coef-
cient for CGenre means that pairs of artists who produce music in the same genre have a higher
propensity to form friendship connections, communicate with each other, and download music
from each other. We also nd that artists belonging to a band have a higher chance of forming
friendships (BothBand) and a higher probability of sending and receiving communications (SBand
and RBand).
We nd that measures of online popularity that are based on the total number of page views for
an artist (BothPopular, SPviews, DPviews and PPviews) positively inuence relationship forma-
tion. For example, the positive coefcients for DPviews in the music relation indicate that artists
with greater online popularity in this network have a greater likelihood of downloading songs from
other artists, and that they tend to download more music. In contrast, most measures of prior and
ofine popularity or experience (indicating audience size of concerts, or years of activity) do not
appear to impact the formation of online relationships within the network.
8
Finally, we see for
the music relationship that different coefcients inuence the incidence and intensity components
of the music relationship and thus these two facets are not isomorphic and need to be modeled
separately.
5.4.2 Covariance Structure of the Relationships
The covariance matrix
, Psychometrika, 61 (September),
401-425.
Wuyts, Stefan, Stefan Stermersch, Christophe Van den Bulte and Philip Hans Franses (2004)
Vertical Marketing Systems for Complex Products: A Triadic Perspective,, Journal of Marketing
Research, 41 (November), 479-487.
Watts, Duncan J., and Peter Sheridan Dodds (2007), Inuentials, Networks, and Public Opin-
35
ion Formation, Journal of Consumer Research, 34 (May), 441-458.
36
Figure 1
THE AND VALUES FOR THE MANAGERS IN YEAR 1: THE POINT LABELS INDICATE
TEAM MEMBERSHIP
r
1
r
1
r
1
r
1
r
2
r
2
r
2
r
3
r
3
r
4
r
4
r
4
r
4
m
m
m
m
m
m
m
o
o
3 2 1 1 2 3
1.5
1.0
0.5
0.5
1.0
1.5
Figure 2
THE AND VALUES FOR THE MANAGERS IN YEAR 2: THE POINT LABELS INDICATE
TEAM MEMBERSHIP
r
1
r
1
r
1
r
1
r
2
r
2
r
2
r
3
r
3
r
4
r
4
r
4
r
4
m
m
m
m
m
m
m
o
o
3 2 1 1 2 3
1.5
1.0
0.5
0.5
1.0
1.5
37
Table 1
DESCRIPTION OF PARAMETERS
Variable Effects
i
Expansiveness, Productivity
i
Attractiveness, Popularity or Prestige
z
i
1
,
6
,
9
Reciprocity
7
,
8
Selectivity
3
,
5
Generalized Exchange
2
,
4
Multiplexity
Heterogeneity
Table 2
APPLICATION 1: RECOVERY OF CROSS-RELATION DYADIC COUNTS
Pattern Full NoZiZj Uncorr Team
NN 7.51 6.44 12.67 7.04
AN 3.00 4.17 4.24 2.70
NA 4.25 4.90 4.82 5.50
MN 3.79 1.53 7.45 2.15
NM 0.81 2.65 6.07 1.43
AA 0.92 1.89 1.89 0.80
AA 1.97 0.38 4.84 0.02
AM 2.14 2.26 1.67 1.92
MA 0.10 0.80 0.84 0.77
MM 5.33 4.48 9.46 3.80
MAD 2.983 2.950 5.394 2.612
38
Table 3
APPLICATION 1: PARAMETER ESTIMATES FROM DIFFERENT MODELS
Full Nozizj Uncorr Team
INTERCEPT1 -1.48 -0.87 -1.23 0.48
INTEAM1 5.03 3.02 4.03 3.42
BETWTEAM1 1.60 0.88 0.84 1.42
INMKTOPS1 1.03 1.40 1.02 0.49
INTERCEPT2 -1.18 -0.99 -1.12 0.78
INTEAM2 3.51 3.01 3.17 3.79
BETWTEAM2 2.15 1.73 1.80 2.71
INMKTOPS2 1.23 1.61 1.02 0.07
1
0.59 0.83 0.60 0.79
2
0.43 0.64 - 0.63
3
0.26 0.57 - 0.53
6
0.48 0.72 0.53 0.67
Bold indicates that the 95% posterior interval does not span 0.
39
T
a
b
l
e
4
A
P
P
L
I
C
A
T
I
O
N
1
:
H
Y
P
O
T
H
E
S
I
S
S
U
P
P
O
R
T
E
D
B
Y
D
I
F
F
E
R
E
N
T
M
O
D
E
L
S
H
y
p
o
t
h
e
s
i
s
F
o
r
m
a
l
T
e
s
t
F
u
l
l
T
e
a
m
V
d
B
M
N
o
z
i
z
j
U
n
c
o
r
r
H
y
p
o
t
h
e
s
i
s
1
.
B
o
t
h
b
e
f
o
r
e
a
n
d
a
f
t
e
r
c
o
-
l
o
c
a
t
i
o
n
,
R
&
D
p
r
o
f
e
s
s
i
o
n
a
l
s
t
e
n
d
t
o
c
o
m
m
u
n
i
c
a
t
e
w
i
t
h
o
t
h
e
r
R
&
D
p
r
o
f
e
s
s
i
o
n
a
l
s
r
a
t
h
e
r
t
h
a
n
w
i
t
h
m
a
r
k
e
t
i
n
g
o
r
o
p
e
r
a
t
i
o
n
s
e
x
e
c
u
t
i
v
e
s
.
H
1
a
I
N
T
E
A
M
1
>
0
0
.
9
9
0
.
9
9
0
.
9
0
.
9
9
0
.
9
9
H
1
b
B
E
T
W
T
E
A
M
1
>
0
0
.
9
9
0
.
9
9
n
s
0
.
9
9
0
.
9
1
H
1
c
I
N
T
E
A
M
2
>
0
0
.
9
9
0
.
9
9
0
.
9
9
0
.
9
9
0
.
9
9
H
1
d
B
E
T
W
T
E
A
M
2
>
0
0
.
9
9
0
.
9
9
0
.
9
9
0
.
9
9
0
.
9
9
H
y
p
o
t
h
e
s
i
s
2
.
B
o
t
h
b
e
f
o
r
e
a
n
d
a
f
t
e
r
c
o
-
l
o
c
a
t
i
o
n
,
m
a
r
k
e
t
i
n
g
a
n
d
o
p
e
r
a
t
i
o
n
s
e
x
e
c
u
t
i
v
e
s
t
e
n
d
t
o
c
o
m
m
u
n
i
c
a
t
e
w
i
t
h
m
e
m
b
e
r
s
o
f
t
h
e
i
r
o
w
n
d
e
p
a
r
t
m
e
n
t
.
H
2
a
I
N
M
K
T
O
P
S
1
>
0
0
.
9
3
0
.
8
5
n
s
0
.
9
9
0
.
9
2
H
2
b
I
N
M
K
T
O
P
S
2
>
0
0
.
9
7
0
.
5
5
n
s
0
.
9
9
0
.
9
4
H
y
p
o
t
h
e
s
i
s
3
.
R
&
D
p
r
o
f
e
s
s
i
o
n
a
l
s
h
a
v
e
a
h
i
g
h
e
r
p
r
o
b
a
b
i
l
i
t
y
t
o
c
o
m
m
u
n
i
c
a
t
e
w
i
t
h
m
e
m
b
e
r
s
o
f
o
t
h
e
r
R
&
D
t
e
a
m
s
a
f
t
e
r
c
o
-
l
o
c
a
t
i
o
n
t
h
a
n
b
e
f
o
r
e
.
H
3
B
E
T
W
T
E
A
M
2
>
B
E
T
W
T
E
A
M
1
0
.
8
2
0
.
9
9
0
.
9
9
0
.
9
9
0
.
8
6
H
y
p
o
t
h
e
s
i
s
4
.
W
h
e
n
c
o
-
l
o
c
a
t
i
n
g
R
&
D
t
e
a
m
s
i
m
p
l
i
e
s
i
n
c
r
e
a
s
i
n
g
t
h
e
p
h
y
s
i
c
a
l
d
i
s
t
a
n
c
e
w
i
t
h
o
t
h
e
r
d
e
p
a
r
t
m
e
n
t
s
,
R
&
D
p
r
o
f
e
s
s
i
o
n
a
l
s
t
e
n
d
e
n
c
y
t
o
c
o
m
m
u
n
i
c
a
t
e
w
i
t
h
o
t
h
e
r
R
&
D
p
e
o
p
l
e
r
a
t
h
e
r
t
h
a
n
e
x
e
c
u
t
i
v
e
s
f
r
o
m
o
t
h
e
r
d
e
p
a
r
t
m
e
n
t
s
w
i
l
l
i
n
c
r
e
a
s
e
.
H
4
I
N
R
D
2
>
I
N
R
D
1
0
.
8
6
0
.
9
9
0
.
9
9
0
.
9
9
0
.
8
2
H
y
p
o
t
h
e
s
i
s
5
.
B
e
f
o
r
e
c
o
-
l
o
c
a
t
i
o
n
,
R
&
D
p
r
o
f
e
s
s
i
o
n
a
l
s
h
a
v
e
a
h
i
g
h
e
r
p
r
o
b
a
b
i
l
i
t
y
t
o
c
o
m
m
u
n
i
c
a
t
e
w
i
t
h
m
e
m
b
e
r
s
o
f
t
h
e
i
r
o
w
n
t
e
a
m
r
a
t
h
e
r
t
h
a
n
w
i
t
h
m
e
m
b
e
r
s
o
f
o
t
h
e
r
R
&
D
t
e
a
m
s
.
A
f
t
e
r
c
o
-
l
o
c
a
t
i
o
n
,
t
h
e
t
e
n
d
e
n
c
y
t
o
c
o
m
m
u
n
i
c
a
t
e
a
m
o
n
g
R
&
D
p
e
o
p
l
e
i
s
a
s
s
t
r
o
n
g
b
e
t
w
e
e
n
a
s
w
i
t
h
i
n
t
e
a
m
s
.
H
5
a
I
N
T
E
A
M
1
>
B
E
T
W
T
E
A
M
1
0
.
9
9
0
.
9
9
0
.
9
9
0
.
9
9
0
.
9
9
H
5
b
I
N
T
E
A
M
2
=
B
E
T
W
T
E
A
M
2
n
s
n
s
n
s
n
s
n
s
1
)
E
n
t
r
i
e
s
r
e
p
r
e
s
e
n
t
t
h
e
p
r
o
b
a
b
i
l
i
t
i
e
s
o
f
a
h
y
p
o
t
h
e
s
i
s
b
e
i
n
g
t
r
u
e
.
2
)
n
s
i
n
d
i
c
a
t
e
s
t
h
a
t
t
h
e
c
o
r
r
e
s
p
o
n
d
i
n
g
h
y
p
o
t
h
e
s
i
s
i
s
n
o
t
s
u
p
p
o
r
t
e
d
.
40
Table 5
APPLICATION 2: DESCRIPTION OF VARIABLES
Variable Description
Songs The # of songs available on the artists prole
Band Whether the artist belongs to a band. Equal to 1
if artist belongs to a band, 0, otherwise
Audience The audience size of the largest concert by the artist.
A median split yields 1 when the audience is > 700 and 0 otherwise.
Genre Genre of the artist. We distinguish between rock genres and non-rock genres.
Region The geographical region to which the artist belongs.
Artists belong to one of the 26 cantons in Switzerland.
These are aggregated into three regions: French, German and Italian.
PageViews The # of page views of the artists prole during the data period.
These page views originate from other registered members,
including the fans, or from Internet users from outside the social network.
YActive The # of years of activity of the artist in the music industry.
Table 6
APPLICATION 2: DESCRIPTIVE STATISTICS FOR ARTIST VARIABLES
Variable Observations Mean Median Std. Dev. Min. Max
Page Views 230 781.2 467 1069.8 43 10931
Songs 230 4.108 3 3.26 1 29
Audience 230 0.482 0 0.5 0 1
Band 230 0.704 1 0.457 0 1
Years Active 230 7.24 6 5.82 1 27
Common Region 26,335 0.628 1 0.483 0 1
Common Genre 26,335 0.564 1 0.495 0 1
41
T
a
b
l
e
7
A
P
P
L
I
C
A
T
I
O
N
2
:
P
O
S
T
E
R
I
O
R
P
R
E
D
I
C
T
I
V
E
C
H
E
C
K
I
N
G
F
O
R
M
U
S
I
C
D
O
W
N
L
O
A
D
S
F
O
R
T
H
E
F
O
U
R
M
O
D
E
L
S
D
a
t
a
F
u
l
l
U
n
c
o
r
r
N
o
Z
i
Z
j
P
o
i
s
s
o
n
B
i
n
a
r
y
:
M
e
a
n
p
-
v
a
l
u
e
M
e
a
n
p
-
v
a
l
u
e
M
e
a
n
p
-
v
a
l
u
e
M
e
a
n
p
-
v
a
l
u
e
N
u
l
l
D
y
a
d
s
2
3
,
3
7
6
2
3
,
4
2
0
0
.
7
9
2
3
,
4
1
6
0
.
7
7
2
3
,
4
2
0
0
.
8
0
2
3
,
2
7
5
0
.
0
3
A
s
y
m
m
.
D
y
a
d
s
1
,
8
0
2
1
,
7
8
3
0
.
3
5
1
,
7
8
5
0
.
3
6
1
,
7
7
7
0
.
3
0
2
,
1
3
4
1
M
u
t
u
a
l
D
y
a
d
s
1
,
1
5
7
1
,
1
3
2
0
.
2
4
1
,
1
3
4
0
.
2
6
1
,
1
3
7
0
.
2
8
9
2
5
0
R
e
c
i
p
r
o
c
i
t
y
0
.
3
9
0
.
3
9
0
.
4
4
0
.
3
9
0
.
4
5
0
.
3
9
0
.
5
1
0
.
3
0
0
T
r
a
n
s
.
T
r
i
a
d
s
5
1
,
6
9
2
5
0
,
5
2
3
0
.
3
5
0
,
7
1
0
0
.
3
2
4
9
,
5
3
4
0
.
1
6
4
4
,
3
0
3
0
I
n
t
r
a
n
s
.
T
r
i
a
d
s
1
3
6
,
2
3
0
1
3
3
,
3
9
8
0
.
2
7
1
3
3
,
3
2
1
0
.
2
6
1
3
5
,
0
1
2
0
.
3
9
1
1
7
,
4
9
4
0
M
e
a
n
D
e
g
r
e
e
1
7
.
8
9
1
7
.
5
9
0
.
1
8
1
7
.
6
2
0
.
2
1
7
.
6
2
0
.
2
1
1
7
.
3
2
0
.
0
4
S
t
d
.
I
n
d
e
g
r
e
e
1
7
.
9
2
4
1
8
.
1
0
.
6
5
1
8
.
0
7
0
.
6
2
1
8
.
1
6
0
.
6
9
1
6
.
6
1
0
S
t
d
.
O
u
t
d
e
g
r
e
e
3
1
.
6
1
3
1
.
0
6
0
.
1
4
3
1
.
0
9
0
.
1
6
3
1
.
0
7
0
.
1
4
2
7
.
7
4
0
D
e
g
r
e
e
C
o
r
r
.
0
.
8
9
8
0
.
8
9
3
0
.
3
0
.
8
9
0
.
3
1
0
.
8
9
3
0
.
3
2
0
.
8
9
6
0
.
4
5
Q
u
a
n
t
i
t
y
:
M
e
a
n
S
t
r
e
n
g
t
h
5
9
.
5
8
5
8
.
8
3
0
.
2
5
5
8
.
6
9
0
.
2
3
5
8
.
8
6
0
.
2
6
2
0
9
.
4
1
S
t
d
.
I
n
d
e
g
r
e
e
2
1
3
.
2
2
0
8
.
4
4
0
.
3
4
2
0
6
.
3
4
0
.
2
9
2
0
9
.
6
6
0
.
3
8
1
,
3
2
7
.
9
1
S
t
d
.
O
u
t
d
e
g
r
e
e
7
4
.
1
3
7
0
.
0
6
0
.
1
5
6
9
.
4
7
0
.
1
2
6
9
.
5
1
0
.
1
3
7
5
1
.
1
1
42
Table 8
APPLICATION 2: COEFFICIENT ESTIMATES FOR THE THREE RELATIONSHIPS FOR THE FULL
MODEL
Friendship Relationship Communications Relationship
Parameter Mean Std Parameter Mean Std
Intercept -2.732 0.167 Intercept -3.446 0.270
CRegion 0.341 0.050 CRegion 0.321 0.042
CGenre 0.150 0.033 CGenre 0.144 0.028
BothPopular 0.163 0.063 SPviews 0.015 0.004
BothNotPopular 0.015 0.087 SSongs 0.009 0.010
BothBigSongs 0.061 0.061 SBand 0.321 0.106
BothSmallSongs 0.016 0.065 SAudience 0.135 0.093
BothBand 0.640 0.153 SYActive -0.003 0.006
BothNoBand -0.257 0.163 RPviews 0.002 0.003
BothBigAudience 0.203 0.143 RSongs 0.013 0.009
BothSmallAud -0.134 0.143 RBand 0.464 0.154
BothLongActive -0.036 0.075 RAudience 0.043 0.138
BothShortActive 0.034 0.071 RYActive -0.004 0.006
Music Relationship: Incidence Music Relationship: Intensity
Parameter Mean Std Parameter Mean Std
Intercept -3.007 0.272 Intercept -2.024 0.269
CRegion 0.183 0.040 CRegion 0.235 0.050
CGenre 0.118 0.031 CGenre 0.152 0.038
PPviews 0.021 0.004 PPviews 0.011 0.003
PSongs -0.006 0.010 PSongs 0.023 0.009
PBand 0.106 0.104 PBand 0.169 0.086
PAudience 0.100 0.091 PAudience -0.005 0.074
PYActive -0.013 0.006 PYActive -0.003 0.006
DPviews 0.031 0.005 DPviews 0.034 0.006
DSongs 0.022 0.014 DSongs 0.021 0.017
DBand 0.062 0.141 DBand -0.055 0.149
DAudience -0.113 0.120 DAudience -0.157 0.120
DYActive -0.015 0.009 DYActive -0.012 0.011
Bold indicates that the 95% posterior interval does not span 0.
43
Table 9
APPLICATION 2: COVARIATION IN THE RANDOM EFFECTS
m
i1
m
i1
m
i2
m
i2
c
i
c
i
f
i
m
i1
0.397 0.378 0.234 0.222 0.268 0.500 0.499
(0.051) (0.056) (0.037) (0.049) (0.040) (0.064) (0.065)
m
i1
0.707 0.238 0.441 0.329 0.617 0.593
(0.091) (0.042) (0.073) (0.053) (0.086) (0.086)
m
i2
0.192 0.146 0.195 0.315 0.319
(0.034) (0.038) (0.033) (0.050) (0.052)
m
i2
0.489 0.188 0.313 0.288
(0.082) (0.048) (0.079) (0.081)
c
i
0.407 0.478 0.513
(0.052) (0.065) (0.067)
c
i
1.000 0.988
(0.109) (0.104)
f
i
1.031
(0.111)
Posterior standard deviations are in parenthesis.
44
Appendix 1: Full Conditional Distributions
1. The full conditional for precision matrix
1
|{
i
}) = Wishart
+ N,
i=1
i
+ R
1
, (6)
where, the prior for
1
is Wishart(
, R
). The quantities
and R
a
r
+ pN,
1
p
N
i=1
p
k=1
z
2
irk
+ b
1
r
. (7)
3. The full conditional for the coefcients is multivariate normal as we have a seemingly
unrelated regression system of equations conditional on knowing the latent variables. Form
the adjusted utilities (e.g., u
ij1
= u
ij1
i1
j1
z
i1
z
j1
) and adjusted log-rate parameters
by subtracting terms that do not involve from the latent dependent variables. We then
have the system of equations, v
{ij}
= X
{ij}
+ e
{ij}
, for an arbitrary pair {i, j}, where
e
{ij}
N(0, ). We can write the full conditional as
p(|{ v
{ij}
}) = N( ,
), (8)
where,
1
= C
1
+
{ij}
X
{ij}
1
X
{ij}
and =
C
1
+
{ij}
X
{ij}
1
v
{ij}
.
45
4. The full conditional for the heterogeneity parameter
i
is a multivariate normal. We again
begin by creating adjusted utilities and rate parameters, such as u
ij1
= u
ij1
x
ij1
j1
i1
z
j1
by subtracting all terms that do not involve
i
. Let v
{ij}
be the vector of adjusted
utilities for the three relationships. Then we have the system v
{ij}
=
i
+
{ij}
. Once again,
we can use standard Bayesian theory for the multivariate normal to obtain the resulting full
conditional
p(
i
|{ v
{ij}
}) = N(
i
,
), (9)
where,
1
=
1
+ (N 1)
1
, and
i
=
1
j=i
v
{ij}
5. The full conditional for z
i
that contains all the latent space vectors associated with an
individual i is multivariate normal. Creating adjusted utilities such as u
ij1z
= u
ij1
x
ij1
1
i1
j1
, we can form the vector of adjusted utilities and latent rate parame-
ters v
{ij}z
= Z
j
z
i
+ e
{ij}
, where, Z
j
is a an appropriately constructed matrix from the
latent space vector of actor j. This is a seemingly unrelated regression system. Given
the prior z
i
N(0,
z
), where,
z
is constructed from the different
zr
matrices, we
can write the full conditional as N( z
i
,
z
i
), where
1
z
i
= (
z
)
1
+
j=i
Z
1
Z
j
and
z
i
=
z
i
[
j=i
Z
j
()
1
v
{ij}z
]. The model depends upon the inner product of the latent
space vectors which is invariant to rotations and reections of the vectors. Thus visual rep-
resentations of these vectors require that they be rotated to a common orientation, This can
be done by using a Procustrean transformation as outlined in Hoff (2005).
6. The variance-covariance matrix of the errors for the weighted relationship, , has a special
structure as described in Section 3. Given this special structure, we follow the separation
strategy of Barnard, McCulloch and Meng, (2000) in setting the prior in terms of the stan-
dard deviations and correlations in . The covariance matrix can be decomposed into a
correlation matrix, Rand a vector s, of standard deviations, i.e., = diag(s)Rdiag(s),
where s contains the square roots of the standard deviations. Let contain the logarithms
of the elements in s. We assume a multivariate normal distribution N(0, I) for the non-
46
redundant elements of R, such that it is constrained to the subspace of the p dimensional
cube [1, 1]
p
, where p is the number of equations, that yields a positive denite correlation
matrix. Finally, we assume a univariate standard normal prior for the single log-standard
deviation in .
The full conditional distribution for the free element in the vector of log standard de-
viations of errors can only be written up-to a normalizing constant (recall that the
terms associated with the binary utilities in are xed to 0, for identication pur-
pose). Given our assumption of a normal prior for the single free element, we use a
Metropolis-Hastings step to simulate the standard deviation in . A univariate normal
proposal density can be used to generate candidates for this procedure. If
(t1)
k
is the
current value of kth component of , then a candidate value is generated using a ran-
dom walk chain
c
k
=
(t1)
k
+ N(0, ), where is a tuning constant that controls the
acceptance rate.
Many different approaches can be used to sample the correlation matrix R. Here, we
use a multivariate Metropolis step to sample a vector of non-redundant correlations in
R. We used adaptive MCMC (Atchade 2006) for obtaining the tuning constant so as to
ensure rapid mixing.
7. The full conditional distribution associated with the set of latent utilities and latent rate pa-
rameters in u
{ij}
is again unknown. We sample the utilities and log rate parameters using
univariate conditional draws. Sampling the utilities is straightforward, as these are truncated
univariate conditional normal draws. The log-rate parameters log
ij
and log
ji
are sam-
pled such that these are univariate normal draws if the corresponding observation involves
a zero count, and for an observations where a positive count is observed, we use a univari-
ate Metropolis step that combines the likelihood for a truncated Poisson distribution with a
conditional normal prior.
47
Appendix 2: Model Specications for Application 2
Covariates for the Friendship Relation
CRegion: CRegion is equal to 1 if both artists in a pair are from the same region, zero otherwise.
CGenre: CGenre is equal to 1 if both artists in a pair are from the same genre, zero otherwise.
BothPopular: BothPopular is equal to 1 if both artists in a pair are viewed (online) by more than
the population median, zero otherwise.
BothNotPopular: BothNotPopular is equal to 1 if both artists in a pair are viewed by less than the
population median, zero otherwise.
BothBigSongs: BothBigSongs is equal to 1 if both artists in a pair post more songs than the
population median, zero otherwise.
BothSmallSongs: BothSmallSongs is equal to 1 if both artists in a pair post less songs than the
population median, zero otherwise.
BothBand: BothBand is equal to 1 if both artists in a pair post are from a band, zero otherwise.
BothNoBand: BothNoBand is equal to 1 if both artists in a pair post are not from a band, zero
otherwise.
item BothBigAudience: BothBigAudience is equal to 1 if both artists in a pair had large
concerts with more than 700 spectators, zero otherwise.
BothSmallAudience: BothSmallAudience is equal to 1 if both artists in a pair had small concerts
with more than 700 spectators, zero otherwise.
BothLongActive: BothLongActive is equal to 1 if both artists in a pair had being active for more
than six years, zero otherwise.
BothShortActive: BothShortActive is equal to 1 if both artists in a pair had being active for less
than six years, zero otherwise.
48
Covariates for the Communication Relation
CRegion: CRegion is equal to 1 if both artists in a pair are from the same region, zero otherwise.
CGenre: CGenre is equal to 1 if both artists in a pair are from the same genre, zero otherwise.
SPviews: SPviews represents the number of sender paged views.
SSongs: SSongs represents the number of songs posted on the sender web page.
SBand: SBand is equal to 1 if the sender belongs to a band, zero otherwise.
SAudience: SAudience is equal to 1 if the sender performed in-front of an audience larger than
700 people, zero otherwise.
SYActive: SYActive is equal to 1 if the sender was active for more than 6 years, zero otherwise.
RPviews: RPviews represents the number of receiver paged views.
RSongs: RSongs represents the number of songs posted on the receiver web page.
RBand: RBand is equal to 1 if the receiver belongs to a band, zero otherwise.
RAudience: RAudience is equal to 1 if the receiver performed in-front of an audience larger than
700 people, zero otherwise.
RYActive: RYActive is equal to 1 if the receiver was active for more than 6 years, zero otherwise.
Covariates for the Music Download Relation
CRegion: CRegion is equal to 1 if both artists in a pair are from the same region, zero otherwise.
CGenre: CGenre is equal to 1 if both artists in a pair are from the same genre, zero otherwise.
PPviews: PPviews represents the number of provider paged views.
PSongs: PSongs represents the number of songs posted on the provider web page.
49
PBand: PBand is equal to 1 if the provider belongs to a band, zero otherwise.
PAudience: PAudience is equal to 1 if the provider performed in-front of an audience larger than
700 people, zero otherwise.
PYActive: PYActive is equal to 1 if the provider was active for more than 6 years, zero otherwise.
DPviews: DPviews represents the number of downloader paged views.
DSongs: DSongs represents the number of songs posted on the downloader web page.
DBand: DBand is equal to 1 if the downloader belongs to a band, zero otherwise.
DAudience: DAudience is equal to 1 if the downloader performed in-front of an audience larger
than 700 people, zero otherwise.
DYActive: DYActive is equal to 1 if the downloader was active for more than 6 years, zero other-
wise.
50
Online Technical Appendix
In this appendix we focus on the predictive performance of our model in the two applications.
We begin with Application 1. As reported in Section 4.2.3, Tables 1 and 2 illustrate how well the
cross-year dyadic and transitivity patterns are recovered when data are missing for the marketing
group. The entries in the table are the absolute deviations between the actual frequencies and those
predicted by the Full and Uncorr models. Focussing on the mean absolute deviations (last row),
we see that the Full model does signicantly better in recovering the cross-year relationships as it
is able to leverage information across the years.
We now look at predictions associated with the Online Music Network. We divide our entire
estimation sample into two equal portions. The rst portion (new calibration sample) includes re-
lationships formed in the rst three month period and the second sample (holdout sample) involves
relationships for the second three month period for the same set of artists. We re-estimate our three
models on the new calibration sample and make predictions on the holdout sample. Table 3 reports
the Mean Absolute Deviations (MAD) and Root Mean Square Errors (RMSE) of music downloads
predictions for the holdout period. Apart from overall predictive accuracy, we also compute pre-
dictions for the number of songs downloaded by and from each artist and also report the MAD and
RMSE for these two measures. It is clear from Table 3 that the Full model outperforms the other
models on almost all the measures. It is also striking that the Poisson model, which models the
intensity directly performs very poorly in predicting future activity. The fact that the Full model
outperforms Uncorr suggests that there are gains in modeling relationships jointly, and the fact
that NoZiZj yields similar predictions to the Full model, indicates that incorporating the latent
space does not add signicantly to the results in this application.
51
Table 1
APPLICATION 1: MISSING DATA - RECOVERY OF CROSS RELATION DYADIC PATTERNS
Pattern Full Uncorr
NN 9.52 10.53
AN 5.55 7.89
NA 5.83 3.33
MN 12.15 19.06
NM 1.62 5.46
AA 1.35 1.41
AA 3.73 6.16
AM 0.41 0.67
MA 0.06 2.28
MM 13.72 17.51
MAD 5.39 7.43
Table 2
APPLICATION 1: MISSING DATA - RECOVERY OF CROSS RELATION TRANSITIVITY
PATTERNS
Pattern Full Uncorr
{X
ij1
, X
jk1
, X
ik2
} 139.37 206.49
{X
ij1
, X
jk2
, X
ik1
} 136.48 197.72
{X
ij1
, X
jk2
, X
ik2
} 154.03 277.07
{X
ij2
, X
jk1
, X
ik2
} 184.62 275.46
{X
ij2
, X
jk2
, X
ik1
} 202.18 299.63
{X
ij2
, X
jk1
, X
ik1
} 145.01 212.41
{X
ij1
, X
jk1
, X
ik1
} 13.14 12.82
{X
ij2
, X
jk2
, X
ik1
} 140.01 252.19
MAD 139.35 216.72
Table 3
APPLICATION 2: HOLDOUT PREDICTIONS FOR MUSIC DOWNLOADS
Full Uncorr NoZiZj Poisson
Statistic MAD RMSE MAD RMSE MAD RMSE MAD RMSE
Overall 0.186 0.898 0.209 0.954 0.189 0.904 0.348 3.976
Downloads From 17.79 35.37 19.02 41.55 17.74 35.51 48.648 147.918
Downloads By 24.77 62.03 29.62 81.82 25.23 64.26 59.42 249.69
52