Académique Documents
Professionnel Documents
Culture Documents
1.
Introduction
2.
1276
3.
Estimators
i1
X
yj. +
DR
Pi1
j=1
pi
j=1
pj )yi.
(3)
where yi. is the total of the mi y-values in the network which includes the i-th selection. Note that
the edge units are not included in these totals, and
hence units not satisfying the criterion are incorporated in the estimate only if they are selected as
initial units. Also, the probabilities pi are calculated exclusive of selection as an edge unit. Thus to
estimate the population mean and variance without
replacement of networks the following equations are
used (Salehi and Seber 1997).
n
N et =
1 X
z ,
N n i=1 i
(4)
1 X
zi ,
=
N n i=1
(1
var(
N et ) =
(1)
n
1 X
var(zi )
N 2 n2 i=1
(5)
zi =
i1
X
j=1
yj +
(1
Pi1
j=1 pj )yi
pi
vd
ar(
N et ) =
(2)
X z
1
( i
N et )2
n(n 1) i=1 N
(6)
When the design is selection without replacement of clusters, the relevant probabilities pi are
unchanged, but the zi are z1 = yi. /pi , and, for
i = 2, 3, ..., n,
P
X
(1 jsi1 pj )yi.
zi =
yj. +
,
(7)
pi
js
i1
1277
Clust =
1
Nn
n
X
(8)
i=1
0
n
1 X
var(zi )
N 2 n2 i=1
I(so , d ) =
X zi
1
(
Clust )2
n(n 1) i=1 N
(10)
ei =
1
0
1
0
if
otherwise
g(so ) = d
M urthy
P (so )I(so , d )
(so )
so S
P (so )I(so , d )
so S
(14)
Unfortunately, for without replacement of clusters
equation 14 can not be reduced but for without
replacement of networks it reduces to the estimator
given in Salehi and Seber (1997):
M N et =
n
1 X P (s|i)
yi.
N i=1 P (s)
(15)
Let K denote the number of networks in the population. Let i denote the network which includes
unit i and wi represents the average value of a unit
in the network which contains unit i, that is
1 X
yj
(16)
wi =
mi
ji
var(
M N et )
K K
1 XX
mi mj 1
N 2 i=1 j<i
X P (s|i)P (s|j)
(wi wj )2
P
(s)
s3i,j
(17)
(11)
d = {(i, ei , yi ) : i s}
(13)
(9)
(12)
1278
vd
ar(
M N et )
n X
n
X
1
mi mj
N 2 P (s)2 i=1 j<i
(wi wj )2
(18)
Equation 14 for without replacement of clusters can
also be viewed as
M C = E[
Clust |d ]
(19)
= var(
Clust )
E[(
Clust
M C )2 ]
n
1 X
var(zi )
=
2
N n2 i=1
2
The estimators
M C and
M N et may be hard to
compute for large n. The number of different permutations we have to compute is n! for equation 14.
The number of calculations can be reduced by noting
that certain sets of permutations give the same value
of the estimator. Permutations that switch the order
of networks that do not meet the condition, are not
in the neighborhood of a network selected, that does
meet the condition, and have the same y-value give
the same value of the estimators. One such situation
is when the condition is yi > 0 and two networks of
value zero that arent edge units of an intersected
network can be interchanged. Although, this can
reduce the number of permutations to be looked at
considerably,
M C may still be complicated to compute. For calculating
M N et it is possible to reduce
the number of calculations still further (Salehi and
Seber 1997).
Acknowledgements
P (so )
Clust (so )
M C (so )
so S
(20)
An unbiased estimate of the variance is:
Support for this research was provided by the National Institutes of Health, National Institute on
Drug Abuse, grant RO1 DA09872, and the National
Science Foundation, grant DMS-9626102.
vg
ar(
M C )
X zi
1
(
Clust )2
n(n 1) i=1 N
4.
P
(s
o )I(so , d )
so S
P (so )I(so , d )
so S
2
M C
Clust (so )
(21)
A more efficient estimator of the variance is
vd
ar(
M C )
= E[g
v ar(
M C )|d ]
1
= P
P
(s
o )I(so , d )
so S
X
0
P (so )I(so , d )
5. Salehi, M.M. and Seber, G.A.F. (1997) Adaptive cluster sampling with networks selected
without replacement Biometrika 84 209-219.
X zi
1
(
Clust )2
n(n 1) i=1 N
1
so S P (so )I(so , d )
6. Thompson, S.K. (1990). Adaptive Cluster Sampling. Journal of the American Statistical Association 85 1050-1059.
P (so )I(so , d )
so S
M C
Clust (so )
so S
Bibliography
(22)
1279