Académique Documents
Professionnel Documents
Culture Documents
2010"
School of Information Science, Japan Advanced Institute of Science and Technology, Ishikawa 923-1292,
Japan
!Received 8 June 2010; accepted 23 August 2010; published online 4 November 2010"
Quantum mechanics for many-body systems may be reduced to the evaluation of integrals in 3N
dimensions using Monte Carlo, providing the Quantum Monte Carlo ab initio methods. Here we
limit ourselves to expectation values for trial wave functions, that is to variational quantum Monte
Carlo. Almost all previous implementations employ samples distributed as the physical probability
density of the trial wave function, and assume the central limit theorem to be valid. In this paper we
provide an analysis of random error in estimation and optimization that leads naturally to new
sampling strategies with improved computational and statistical properties. A rigorous lower limit to
the random error is derived, and an efficient sampling strategy presented that significantly increases
computational efficiency. In addition the infinite variance heavy tailed random errors of optimum
parameters in conventional methods are replaced with a Normal random error, strengthening the
theoretical basis of optimization. The method is applied to a number of first row systems and
compared with previously published results. 2010 American Institute of Physics.
#doi:10.1063/1.3488651$
I. INTRODUCTION
0021-9606/2010/133"17!/174120/16/$30.00
133, 174120-1
Downloaded 27 Sep 2012 to 131.111.62.210. Redistribution subject to AIP license or copyright; see http://jcp.aip.org/about/rights_and_permissions
174120-2
&
!2EL#!$dR/
&
!2dR,
!1"
! and R is
where the local energy is defined as EL#!$ = !1H
the 3N dimensional vector composed of the positions of all
electrons.
A general sampling approach to VMC consists of defining a random position vector R characterized by some distribution and drawing r independent samples of this random
variable. Writing the distribution as
Pg = "!2/w
!2"
!3"
' w !E L .
!4"
1 r1/2 1/2q2
e
,
2# *C*1/2
!5"
!6"
c11 = var#w$,
!7"
and
q2 = r
x1 $1
x2 $2
, +
T
C1
x1 $1
x2 $2
!8"
Downloaded 27 Sep 2012 to 131.111.62.210. Redistribution subject to AIP license or copyright; see http://jcp.aip.org/about/rights_and_permissions
Pr!u" =
+
&
&
100
x1 Pr!x2 = ux1,x1"dx1
+%
x1 Pr!x2 = ux1,x1"dx1 ,
!9"
.
/
!$2 $1u"2
r
2 !c11u2 2c12u + c22"
.
!10"
1
!u Etot"2
Pr!u" =
exp
,
-2# (E
2(E2
1
!11"
1!2ELdR
1!2dR
and variance
(E2 =
=
!12"
/ + , + , 0
1 1
$2
$2
c22 2
c +
r $21
$1 12
$1
c11
1 1!2/wdR1w!2!EL Etot"2dR
.
r
#1!2dR$2
!13"
Replacing the means by their unbiased estimates provides the total energy estimate
'wEL
Etot =
'w
!14"
r 'w2!EL Etot"2
.
r1
!'w"2
!15"
P (|X| > x)
174120-3
S( 32 , 0, 1, 0; 0)
105 2
10
N (0, 1)
100
104
2x
Downloaded 27 Sep 2012 to 131.111.62.210. Redistribution subject to AIP license or copyright; see http://jcp.aip.org/about/rights_and_permissions
174120-4
A. Standard sampling
!16"
' EL ,
!17"
!18"
The standard sampling choice also has some further deficiencies, as discussed in a previous paper,11 due to the necessary existence of a 3N 1 dimensional nodal hypersurface
where ! is zero. Such zeroes are usually accompanied by
singularities in the sampled local energy on these surfaces
!this is not necessarily the case, indeed is not the case for an
exact wave function, but is usually unavoidable", and the
analogous local quantities averaged to estimate expectation values for other operators. Such singularities are purely
an artifact of zeroes in Ps, and cause the distribution of estimates for many physical quantities to be a Stable distribution
with power law tails. For standard sampling the total energy
estimate is somewhat unusual in that CLT is valid.11,12 For
most other quantities, such as the derivatives used for forces
and in wave function optimization, this is not so.
B. Optimum sampling
1
1!2w!EL Etot"2dR
,
!EL Etot"2
1!2/wdR
!19"
!20"
(opt =
1
r1/2
&
!2*EL Etot*dR,
!21"
(s =
1
r1/2
/&
!2!EL Etot"2dR
1/2
!22"
Downloaded 27 Sep 2012 to 131.111.62.210. Redistribution subject to AIP license or copyright; see http://jcp.aip.org/about/rights_and_permissions
174120-5
value is sought by the MC calculation. A natural approximation is to replace the exact Etot with some previous nonexact
estimate. Unfortunately this worsens the statistics considerably, resulting in an estimate that does not statistically converge to a constant with increasing sample size.
An improved definition of optimum sampling is required. We take a self-consistent approach in order to allow
ll,u =
0 = 2(F2 1
+
1 1
1 1 ) 2
)
!c11" +
!c c11c22"
2 c11
r $1
)w
r $21 )w 12
)
2
!c 2Etotc12 + Etot
c11".
)w 22
!24"
)
)
!c11" +
!c 2E0c12 + E20c11",
0 = 2)w
)w 22
2
!25"
!23"
!26"
This distribution is optimum in the sense that it is the solution of the best available approximation to the equation that
defines optimum sampling.
In principle, the estimates could be calculated iteratively
by supplying !Etot , (F" from one calculation as !E0 , -" for the
next, and proceeding to convergence. Since !E0 , -" would not
converge to the exact estimate, the covariance matrix remains defined and the bivariate CLT is valid. In practical
application we go no further than the first iteration, using
relatively inaccurate total energies such as those provided by
an optimization process.
A poor value of !E0 , -" does not bias the total energy
estimate, which converges to the true value as long as - # 0.
Similarly, accurate values for E0 are not necessary as long as
this is reflected in the accompanying error, -, in that for large
- this sampling strategy becomes equivalent to standard sam-
Downloaded 27 Sep 2012 to 131.111.62.210. Redistribution subject to AIP license or copyright; see http://jcp.aip.org/about/rights_and_permissions
174120-6
!27"
!28"
bution provides a total energy estimate that is normally distributed and whose variance can be estimated. Sampling using Eq. !28" is referred to as efficient sampling in what
follows.
To compare the performance of efficient, standard, optimum, and Slater determinant !SD" sampling total energy estimates, these were calculated for an isolated all-electron carbon atom in the 3 P nonrelativistic ground state. A version of
the CASINO !Ref. 17" package was used, with modifications
introduced to enable generalized sampling. Of the variety of
trial wave functions available for use in QMC we employ the
form18,19
NCSF
!=e
J!R"
' /n.n!X",
!29"
n=1
Downloaded 27 Sep 2012 to 131.111.62.210. Redistribution subject to AIP license or copyright; see http://jcp.aip.org/about/rights_and_permissions
174120-7
102
PSD
!tot "
E
E (a.u.)
0.01
103
Ps
Popt
Ps
Pef f
0
(a)
"
E (a.u.)
Popt
50
Tcpu (h)
100
10
(b)
PSD
Pef f
10
100
Tcpu (h)
FIG. 2. Evolution of total energy estimates with computational time. !a" shows the evolution of confidence intervals for efficient, standard, and optimum
samplings. No confidence interval is available for SD sampling, and only the estimated value is shown. An arbitrary constant offset has been added for each
sampling strategy to aid clarity. !b" shows the evolution of standard errors for efficient, standard, and optimum samplings. For SD sampling the quantity shown
arises from evaluating the standard error estimate when it is incorrect to do so, and is unrelated to confidence intervals.
1/2
mum samplings show a consistent 0Tcpu
evolution with a
large variation in the prefactor between different sampling
strategies. For SD sampling the result of an incorrect application of Eq. !15" shows no consistent power law behavior
with sporadic discontinuous jumps introduced by outliers,
and although confidence intervals do exist for the underlying
distribution, they are not related to the quantity shown in the
figure. Sample sizes and estimates obtained by allocating a
fixed computational resource to each sampling strategy are
given in Table I. Compared to standard sampling, efficient
sampling increases the number of samples by &34, reducing
the error by &1/5, whereas optimum sampling decreases the
number of samples by &1/7, increasing the error by &2.
The performance improvement of efficient sampling
when compared to standard sampling varies between systems, and some limits are available. For r samples distributed
as Peff, and assuming that the cost of evaluating the PDF is
negligible when compared with evaluating the trial wave
function, the speedup with respect to standard sampling is
the ratio
T !2
Ts
2 !m 1"
+ 1.
Teff
T EL
!30"
Etot
Popt
Ps
Peff
PSD
430 096
3 193 192
106 829 024
136 971 968
237.8435!2"
237.8436!1"
237.843 61!2"
237.843 607
Downloaded 27 Sep 2012 to 131.111.62.210. Redistribution subject to AIP license or copyright; see http://jcp.aip.org/about/rights_and_permissions
174120-8
E=
&
&
f EL!a"dR/
&
&
f dR,
!32"
f dR,
g!a" =
'w!a"!EL!a" E"2
,
'w!a"
'w!a"EL!a"
,
E =
'w!a"
!33"
!34"
a sample surface drawn from a distribution of random surfaces. Optimization on this surface is usually unsuccessful
since it is not bounded from below, and that although the
energy estimate itself is zero-variance, its gradient is not.
Hermiticity of the Hamiltonian operator naturally provides a variety of estimates for the total energy gradient,
including a zero-variance estimate. The cost of choosing the
zero-variance gradient estimate is that its gradient !the Hessian" is not a symmetric matrix; hence the estimated gradient
is not itself a derivative of a sample surface.
For notational simplicity we introduce a new parameter
Downloaded 27 Sep 2012 to 131.111.62.210. Redistribution subject to AIP license or copyright; see http://jcp.aip.org/about/rights_and_permissions
174120-9
'w!1!i#EL#!$ Etot#!$$
#
!
!b"$
=
2
E!1"
i
'w
!35"
!36"
!37"
1
bn0
!bn + !bn+1".
+ 3bn+1
0
!39"
'wEL#!$
'w
-n+1 =
and monitoring convergence across cycles provide an effective control for this indefiniteness. In what follows we do not
take this into account explicitly, and assume that its effect
can be entirely accounted for by allowing for correlation between cycles and a modification of eopt that becomes negligible close to statistical convergence.
The distribution of errors of the estimated gradient function !whose zero is sought" and of the estimated matrix elements can be assessed as described in the Appendix. For
standard sampling, Stable distributions arise for estimates
with a mean, with x5/2 power law tails, and with no variance. For the gradient estimate this occurs even for a stationary nodal surface. The total energy estimate performs better,
in that it is Normal for a stationary nodal surface and for the
initial parameters of a cycle, but not otherwise. Similarly,
most of the matrix elements are samples drawn from a Stable
distribution with x5/2 tails.
Employing efficient or optimum sampling of Sec. I removes the nodal surface in the sampling distribution, hence
estimates of the total energy, total energy gradient, and matrix elements are all Normal.27
!40"
,
bn
where the parameters converge to those for which the estimated gradient is zero.
To ensure that such a method is robust we use the semiorthogonalization, rescaling, and level-shift modifications as
developed by Toulouse and Umrigar,26 which easily transfer
to generalized sampling, and normalize the diagonal of Sij to
unity to provide an energy scale for level-shift parameters.
Note that these modifications do not alter the self-consistent
solution, but are necessary to stabilize the method.
The convergence behavior of this approach requires
some attention. For the special case where the variational
freedom of the trial wave function includes the exact eigenstate, then it is clear that self-consistency occurs for a zero
gradient estimate and a minimum energy estimate at the
same parameter values. However, the two will not generally
coincide since the gradient estimate is not usually the gradient of the total energy estimate. Furthermore, the gradient
estimate is not the derivative of a sample surface so there is
no reason to expect a point to exist where the gradient is
zero. So, self-consistency need not occur. Limiting the optimization to a few !usually one" iteration within each cycle
!41"
!42"
where a0 is the exact minimum of the total energy with respect to parameter variation. It is immediately apparent that
eopt * 0 and E#eopt$ * 0. It is also clear that the random error
in optimum parameters is related to the random error in the
derivative of the minimized function, not to the random error
in the function itself.
Deriving the distribution of eopt is not simple, with the
primary difficulty due to inversion of the random vector
function, and here we offer a perturbative solution. We begin
by expanding the random vector function about the true
minimum, a0,
g!1"!a" = g!1"!a0" + g!2"!a0"!a a0" + ,
!43"
Downloaded 27 Sep 2012 to 131.111.62.210. Redistribution subject to AIP license or copyright; see http://jcp.aip.org/about/rights_and_permissions
174120-10
!44"
To simplify this expression further we rotate and rescale parameters such that E#g!2"!a0"$ = I, and expand the matrix/
vector product to first order in the deviation of each random
variable from its mean value to give
3a = g!1" + #g!2" I$E#g!1"$*a0 ,
!45"
!46"
a sum of the squares of N p correlated random variables. Provided that both g!1" and g!2" possess a mean and a variance,
then it follows that eopt possesses a mean and a variance
given by
E#eopt$ =
1
' E#3a2i $,
2 i
1
var#eopt$ = ' !E#3a2i 3a2j $ E#3a2i $E#3a2j $",
4 ij
!47"
!48"
2
E#3a2i 3a2j $ = !CiiC jj + 2C2ij"/ropt
,
which may be introduced into Eq. !47" to give the first two
moments of eopt as
E#eopt$ = !2ropt"1 ' Cii ,
i
var#eopt$ =
2 1
!2ropt
"
'ij
!49"
C2ij .
Downloaded 27 Sep 2012 to 131.111.62.210. Redistribution subject to AIP license or copyright; see http://jcp.aip.org/about/rights_and_permissions
174120-11
!tot (a.u.)
E
-37.6
-37.7
-37.8
0
10
cycle
20
FIG. 3. Evolution of total energy estimates with optimization for allelectron C and using efficient sampling. Four optimization processes are
shown using 384, 1480, 14 800, and 80 000 samples in order of decreasing
Etot. All total energy estimates are generated using 2 200 000 samples.
The approach was applied to a group of first row systems, the isolated atoms, homonuclear diatomic molecules,
diatomic hydrides, and the diatomic molecules LiF, CN, CO,
and NO. This collection of systems was chosen to allow
comparison of total energy estimates with recently published
ab initio total energies and accurate approximate total energies in literature.
For any VMC calculation the achievable statistical and
systematic error is unavoidably a compromise between finite
computational resources and the incompleteness of available
wave function parametrization. To make our compromise we
chose to allocate the same computational resource to each
system, divided between optimization and estimation as in
the previous section.29 Trial wave functions of the form described in Sec. II, Eq. !29", were constructed with expansion
orders chosen to include as much variational freedom as possible while obtaining a usefully low statistical error.
For both Jastrow and backflow the number of parameters
was set so as to employ polynomials two-orders less than the
onset of numerical error in polynomial evaluation. Multideterminant expansions were constructed as sums of uncoupled
spatial and spin symmetry eigenstates, that is as CSFs, with
optimized coefficients. For isolated atoms the CSF expansion
was obtained using numerical orbitals from ATSP2K !Ref. 22"
as for C in Sec. I. For the diatomic molecules no MCHF
-75.05
-75.055
-75.055
!std (a.u.)
E
!ef f (a.u.)
E
-75.05
-75.06
-75.065
(a)
-75.06
10
cycle
15
20
-75.065
(b)
10
cycle
15
20
FIG. 4. Evolution of total energy estimates with optimization for O, for both standard and efficient sampling. !a" shows the evolution of estimates using
standard sampling and !b" shows the evolution of estimates using efficient sampling, with the same total computational cost of each. The gray regions are the
confidence interval for the final total energy estimate constructed using an average of converged parameters and equal computational cost.
Downloaded 27 Sep 2012 to 131.111.62.210. Redistribution subject to AIP license or copyright; see http://jcp.aip.org/about/rights_and_permissions
174120-12
TABLE II. Summary of parameters for the isolated atoms and diatomic molecules. Experimental bond lengths
used are from Ref. 31 for the homogeneous diatomic molecules and from Ref. 32 for the rest.
Li
Be
B
C
N
O
F
Ne
Li2
Be2
B2
C2
N2
O2
F2
Ne2
LiH
BeH
CH
NH
OH
FH
LiF
CN
CO
NO
Ne
D
!a.u."
3
4
5
6
7
8
9
10
6
8
10
12
14
16
18
20
4
5
7
8
9
10
12
13
14
16
5.051
4.65
3.005
2.3481
2.075
2.283
2.668
5.84
3.0139
2.5372
2.1163
1.9581
1.8324
1.7328
2.9553
2.2144
2.1321
2.1746
Term
S
S
2
P
3
P
4
S
3
P
2
P
1
S
1 +
5g
1 +
5g
3
5g
1 +
5g
1 +
5g
3
5g
1 +
5g
1 +
5g
1 +
5
2 +
5
2
6
3
5
2
6
1 +
5
1 +
5
2 +
5
1 +
5
2
6
1
Ndet
NCSF
Nparam
502
137
521
550
458
472
553
499
120
39
61
69
35
12
7
6
31
30
25
21
25
61
62
35
37
53
86
29
43
52
25
41
29
14
34
11
16
17
10
8
6
5
15
18
13
9
13
13
14
8
9
14
326
339
368
377
350
366
354
324
275
252
257
258
251
249
247
246
452
455
450
446
450
450
451
445
446
451
and approximate exact total energies from the sources specified in the table.
The lowest variational total energy estimates for first
row isolated atom in literature appear to be those of Brown et
al.23 They employed standard sampling, linear optimization,
and trial wave function of the same form as Eq. !29" but with
less variational freedom, mostly due to a smaller multideterminant expansion. Our VMC results consistently fall between the VMC and DMC estimates of this paper, are consistently closer to DMC than VMC estimates of this paper,
and provide a small but significant recovery of between 0.1%
and 1.5% more correlation energy at VMC level.
Toulouse and Umrigar31 also provided isolated atom energies using both VMC and DMC, based on linear optimization of trial wave function using standard sampling. Their
trial wave functions were composed of RHF or MultiConfigurational Self-Consistent Field !MCSCF" orbitals represented using a Slater basis set, with orbital relaxation, a
Jastrow factor, and one or two CSFs. Our VMC energies
consistently improve on the estimates in this paper, providing
between 0.3% and 12.5% more correlation energy than the
VMC estimates, and between 0.0% and 5.6% more correlation than DMC estimates. The improvement is entirely consistent with the contribution to correlation provided by backflow and multideterminant expansions discussed in Ref. 23.
Toulouse and Umrigar31 also provided VMC and DMC
total energies for the first row homonuclear diatomic molecules, with trial wave functions constructed in the same
Downloaded 27 Sep 2012 to 131.111.62.210. Redistribution subject to AIP license or copyright; see http://jcp.aip.org/about/rights_and_permissions
174120-13
TABLE III. Total electronic energies for isolated atoms and diatomic molecules. The table presents VMC estimates obtained as described in this
paper !EVMC". Approximate exact total energies for isolated atoms are from
Ref. 33 for Li and Ref. 34 otherwise. Approximate exact total energies are
from Ref. 31 for Be2, B2, C2, and Ne2, and from Ref. 35 otherwise.
Li
Be
B
C
N
O
F
Ne
Li2
Be2
B2
C2
N2
O2
F2
Ne2
LiH
BeH
CH
NH
OH
FH
LiF
CN
CO
NO
EVMC
!a.u."
Eexpt
!a.u."
Ecorr
!%"
27.478 052!2"
214.667 243!3"
224.653 29!1"
237.843 61!2"
254.586 41!4"
275.060 58!5"
299.726 23!8"
2128.9299!1"
214.983 98!4"
229.318 85!7"
249.3837!1"
275.8893!2"
2109.4981!3"
2150.2843!4"
2199.4819!4"
2257.8517!5"
28.069 08!2"
215.244 07!2"
238.468 42!5"
255.208 63!8"
275.7222!1"
2100.4446!2"
2107.4177!2"
292.6755!2"
2113.2845!3"
2129.8457!3"
27.478 06032
214.667 36
224.653 91
237.8450
254.5892
275.0673
299.7339
2128.9376
214.9951
229.3380
249.415
275.9265
2109.5421
2150.3267
2199.5303
2257.8753
28.0704
215.2468
238.4788
255.2227
275.7371
2100.4592
2107.4344
292.7250
2113.3261
2129.9047
99.981!5"
99.876!4"
99.503!8"
99.11!1"
98.52!2"
97.39!2"
97.64!3"
98.02!3"
91.00!3"
90.60!4"
90.34!4"
92.83!4"
91.99!6"
93.57!6"
93.60!5"
96.99!6"
98.41!2"
97.08!3"
94.78!3"
94.24!3"
95.29!4"
96.24!4"
96.22!4"
90.10!5"
92.23!6"
90.27!5"
manner as the isolated atoms but allowing larger multideterminant expansions for some of the molecules. The difference
between results is not as consistent as for the isolated atoms.
Our VMC results recover between 6.7% less and 5.4% more
correlation energy, with a consistent improvement as electron
number is increased and the crossover occurring between B2
and C2. The same trend occurs when comparing with DMC
results, with our VMC energies higher than the DMC results
of Ref. 31 for all molecules except Ne2. This is consistent
with the isolated atom behavior and the observation that our
multideterminant expansion is inferior in that it is not obtained from a self-consistent calculation. This reduction in
correlation energy is apparent for the small systems only,
with the contribution to correlation energy provided by backflow becoming dominant as electron number increases.
Curtiss et al.36 provided total energies for a large set of
molecules arrived at using the Gaussian-4 !G4" theory, a
state-of-the-art perturbation method. These provide a useful
context for the VMC results presented here, provided we
bear in mind that the comparison is less clear cut since G4
errors may be positive or negative due to it being a nonvariational method. Of the 26 VMC results in this paper G4
total energies are available for 23. Of these, our VMC total
energies are below !above" those of G4 for 17 !6" of the
systems, with the VMC recovering between 25.8% more and
4% less correlation energy than the G4 results. The isolated
Downloaded 27 Sep 2012 to 131.111.62.210. Redistribution subject to AIP license or copyright; see http://jcp.aip.org/about/rights_and_permissions
174120-14
standard sampling informs us that although VMC total energy estimates for fixed parameters are normally distributed,
almost all of the quantities that appear in optimization are
not.
Two new sampling strategies are presented, both of
which ensure a Normal random error wherever possible. Optimum sampling is obtained by deriving the sample distribution for which the Normal random error is as small as possible for a given sample size. This naturally provides a lower
limit to the random error possible for a given system, and an
accurate approximation to this distribution is derived.
Efficient sampling is designed by attempting to avoid a
large part of the computational cost of providing samples
using the Metropolis algorithm. The resulting sample distribution function is computationally cheap to evaluate, has the
correct properties for estimates and optimization functions to
be normally distributed, and does not perform significantly
worse than standard or optimum sampling for a given
samples size.
Of the two, efficient sampling provides the most useful
improvement over standard sampling by increasing the number of samples possible for a fixed computational resource by
an order of magnitude and significantly decreasing the available random error. In addition, the analysis of optimum and
efficient samplings suggests improved statistical properties
for optimization when compared to standard sampling. However, no convincing numerical evidence that this is the case
for real systems was obtained, mainly due to the greater
computational cost of standard sampling, the small random
error in optimization compared to estimation, and that any
rigorous test would require a statistically significant number
of optimization calculations to be compared. Perhaps the
most useful results provided by this analysis is a theoretical
justification for averaging sets of optimized parameters in
order to reduce the optimization random error.
Variational quantum Monte Carlo using efficient sampling and optimization was applied to evaluate the electronic
total energy for 26 isolated atoms and molecules, using trial
wave function with a great deal of variational freedom and a
fixed and modest computational budget divided equally
among calculations. At VMC level more than 97% of correlation energy was recovered for first row isolated atoms, and
more than 90% for a small set of diatomic molecules, results
that significantly improve on many previously published
VMC results and approaching DMC for some systems.
As systems increase in size the efficient sampling strategy is likely to become less advantageous, but this has not
occurred for the systems considered so far. When this does
occur it should be possible to employ further sampling distributions that perform better for these systems, such as optimum sampling, or the distribution used by Attaccalite and
Sorella9 for a dense extended system.
ACKNOWLEDGMENTS
The authors thank R. J. Needs of the University of Cambridge for many useful discussions. Financial support was
provided by the Special Coordination Funds for Promoting
Science and Technology, Promotion of environmental im-
&
P!R"xL!R"d3NR
!A1"
1
r
' xL!R".
!A2"
&
x=xL!R"
P!R" 3N1
d
R,
*$RxL*
!A3"
Downloaded 27 Sep 2012 to 131.111.62.210. Redistribution subject to AIP license or copyright; see http://jcp.aip.org/about/rights_and_permissions
174120-15
!A4"
1
#b0 + b1s + $,
sn
!A5"
&
P!s,T"
. .
ds 3N1
d
T,
dxL
!A6"
so in the limit s 0,
P!x" 0
sm
1/*sn+1*
!A7"
+ ,
1
*x*
!m+n+1"/n
!A8"
The existence of positive and/or negative tails, and their possible equality, is entirely decided by the continuity of P!R" at
the hypersurface of interest, and the symmetry of the singularity in xL. An example of this analysis arises for the total
energy estimate via standard sampling, for which !m , n"
= !2 , 1", hence P possesses asymptotic tails that decay as x4,
and the CLT is valid for the estimate since the second moment exists.
For the generalized sampling a normally distributed estimate requires a valid bivariate CLT, and this is decided by
the existence of the covariance matrix. The random estimate
is
Z=
Y 'yi
,
=
X 'xi
!A9"
with each (yi) and (xi) independent and identically distributed, with variables in the two sets independent for i # j,
co-distributed as P!y , x" for i = j. The validity of the bivariate
CLT requires the covariance matrix of P!y , x" to exist, and
since this matrix is positive definite, we only require all diagonal matrix elements to existthe variance of both yi and
xi. Consequently, we only need to consider the distribution of
each random variable independently of the other, given by
Eq. !A3" for xi and the equivalent expression for yi, and the
univariate analysis given above can be applied. Note that yi
and xi are parametrically related to each other !via the underlying random spatial variable" so P!y , x" is nonzero only
on a line in two dimensional space, but this does not prevent
the existence of the covariance matrix or the continuity of
P!Y , X" as long as more than one sample appears in each
sum.
An example arises for optimization with standard sampling, for which !m , n" = !2 , 2" for both of !yi , xi" as soon as
the nodal surface moves away from its starting position. So,
Downloaded 27 Sep 2012 to 131.111.62.210. Redistribution subject to AIP license or copyright; see http://jcp.aip.org/about/rights_and_permissions
174120-16
Downloaded 27 Sep 2012 to 131.111.62.210. Redistribution subject to AIP license or copyright; see http://jcp.aip.org/about/rights_and_permissions