Académique Documents
Professionnel Documents
Culture Documents
MAKSIM OKS
Department of Industrial Engineering and Operations Research, University of California,
Berkeley, California 94720, max@ieor.berkeley.edu
FRANCOIS OUSTRY
INRIA Rhone-Alpes ZIRST, 655 avenue de l’Europe 38330 Montbonnot Saint-Martin, France,
francois.oustry@inria.fr
Classical formulations of the portfolio optimization problem, such as mean-variance or Value-at-Risk (VaR) approaches, can result in a
portfolio extremely sensitive to errors in the data, such as mean and covariance matrix of the returns. In this paper we propose a way to
alleviate this problem in a tractable manner. We assume that the distribution of returns is partially known, in the sense that only bounds
on the mean and covariance matrix are available. We define the worst-case Value-at-Risk as the largest VaR attainable, given the partial
information on the returns’ distribution. We consider the problem of computing and optimizing the worst-case VaR, and we show that these
problems can be cast as semidefinite programs. We extend our approach to various other partial information on the distribution, including
uncertainty in factor models, support constraints, and relative entropy information.
Received September 2001; revision received July 2002; accepted July 2002.
Subject classifications: Finance, portfolio: Value-at-Risk, portfolio optimization. Programming, nonlinear: semidefinite programming,
dualing, robust optimization.
Area of review: Financial Services.
x x
5. There exist ∈ n and v ∈ R such that + M − px dx
Rn 1 1
w/2
2 T
+
v − x̂ w 0 (11) where A B = TrAB denotes the scalar product in the
w T /2 v space of symmetric matrices, M = M T is a Lagrange mul-
tiplier matrix, and is the indicator function of the set
Let us comment on the above theorem. The equivalence
between Propositions 1 and 2 will be proved below, but = x −xT w (13)
can also be obtained as an application of the (exact) multi-
variate Chebyshev bound given in Bertsimas and Popescu Because 0, strong duality holds (Smith 1995, Bonnans
(2000). This implies that the problem of optimizing the and Shapiro 2000). Thus, the original problem is equivalent
VaR over w ∈ is equivalent to to its dual. Hence, the worst-case probability is
√
minimize
w T
w − x̂T w subject to w ∈ (12) !wc = inf !M (14)
M=M T
As noted in the introduction, this problem can be cast
as a second-order cone programming problem. SOCPs are where !M is the dual function
special forms of SDPs that can be solved with efficiency
close to that of linear programming (see, e.g., Lobo et al. !M = sup p M
p∈Rn
1998).
The equivalence between Propositions 1 and 3 in the = M + sup x − lxpx dx
above theorem is a consequence of duality (in an appropri- p Rn
subject to /+ 0 /− 0 + 0 − 0
3.1. Uncertainty in the Factor’s Mean and
+ −− w/2 Covariance Matrix
0 w = /− −/+ (29)
w T /2 v The simplest case is when we consider errors in the mean-
covariances of the factors. Based on a factor model, we may
As noted before, the above formulation allows us to opti- be interested in “stress testing,” which amounts to analyz-
mize the portfolio over w ∈ : It suffices to let w be a ing the impact of simultaneous changes in the correlation
variable. Because is a polytope, the problem falls in the structure of the factors, on the VaR. In our model, we will
SDP class. assume (say, componentwise) uncertainty on the factor data
In the case when the moments are exactly known, that is, S and fˆ. For a fixed value of the sensitivity matrix A, and
+ =
− =
and x̂+ = x̂− = x̂, the above problem reduces of the diagonal marix D, we obtain that the corresponding
to Problem (3) as expected (with the correct value of of worst-case VaR can be computed exactly via the SDP
course). To see this, note that only the variables v, =
+ − − and /− − /+ =w play a role. The optimal value maximize −xT w
of is easily determined to be ww T /4v, and optimizing
subject to x̂ = Afˆ
= D +ASAT f− fˆ f+
over v > 0 yields the result.
We should also mention that the worst-case VaR can
x − x̂
be similarly computed and optimized when we have com- S− S S+ 0
x − x̂T
2
ponentwise bounds on the mean x̂ and second-moment
matrix S, specifically, where f± and S± are componentwise upper and lower
bounds on the mean and covariance of factors. A simi-
x− x̂ = Ex x+ S− S = ExxT S+ (30) lar analysis can be performed with respect to simultaneous
changes in D, S and fˆ. Likewise, portfolio optimization
where x+ x− and S+ S− are given vectors and matrices,
results are similar to the ones obtained before.
respectively, and the inequalities are understood componen-
twise. A derivation similar to above shows that in this case
the worst-case VaR can be optimized via the SDP in vari- 3.2. Uncertainty in the Sensitivity Matrix
ables M+ M− , /+ /− and w: One may also be looking at the impact of errors in the sen-
opt sitivity matrix A, on the VaR. As pointed out by Linsmeier
V = minM+
+ −M−
− +
2 v +/T+ x+ −/T− x− and Pearson (1996), the mean-variance approach to Value-
subject to /+ 0 /− 0 M+ 0 at-Risk can be used to analyze the risk of portfolios con-
taining possibly very complex instruments such as futures
0 w/2 contracts, exotic options, etc. This can be done using an
M− 0 M+ −M− + 0
w T /2 v approximation called risk mapping, which is a crucial step
in any practical implementation of the method.
w = /− −/+ ∈ (31) In general, one can express the return vector of a portfo-
lio containing different instruments as a function of “market
3. FACTOR MODELS factors,” such as currency exchange rates, interest rates,
Factor models arise when modelling the returns in terms underlying asset prices, and so on. Those are quantities for
of a reduced number of random factors. A factor model which historical data are available and for which we might
expresses the n-vector of returns x as follows: have a reasonable confidence in mean and covariance data.
In contrast, most complex instruments cannot be directly
x = Af + u (32) analyzed in terms of mean and covariance.
The process of risk mapping amounts to approximating
where f is a m-vector of (random) factors, u contains resid- the return vector via the decomposition (32). In essence,
uals, and A is a n × m matrix containing the sensitivities the factor model is a linearized approximation to the actual
of the returns x with respect to the various factors. If S return function, which allows use of mean-variance analysis
(resp. fˆ) is the covariance matrix (resp. mean vector) of the for complex, nonlinear financial instruments.
factors, and u is modeled as a zero-mean random variable Because the factor model is a linearized approximation
with diagonal covariance matrix D, uncorrelated with f , of the reality, it may be useful to keep track of linearization
then the covariance matrix (resp. mean vector) of the return errors via uncertainty in the matrix of sensitivities A. In
vector is
= D + ASAT (resp. x̂ = Afˆ). fact, instead of fitting one linear approximation to the return
El Ghaoui, Oks, and Oustry / 551
vector, one may deliberately choose to fit linear approxi- the worst-case VaR given in Equation (33) can be com-
mations that serve as upper and lower bounds on the return puted (optimized) via the following SDP in variables /
vector. The risk analysis then proceeds by analyzing both (and w ∈ ):
upper and lower bounds, for all the instruments considered.
Thus, it is of interest (both for numerical reasons and for
min /1 + /2 + /3 − w T A0 fˆ
more accurate modelling) to take into account uncertainty 2
in the matrix A. /1 In+m Cw dw
We assume that the statistical data S, D, and fˆ are per-
subject to Cw
T
/2 I l ew
0 (36)
fectly known, with S 0 and D 0, and that the errors
in A are modeled by A ∈ , where the given set describes dwT ewT /3
the possible values for A. We are interested in computing
(or finding bounds on) and optimizing with respect to the where
portfolio weight vector w the worst-case VaR
S 1/2 ATi
Cw = M1 w ··· Ml w with Mi = 1 i l
S 1/2 AT 0
Vwc w = max
w − w T Afˆ (33)
A∈ D1/2 S 1/2 AT0
2 w T Ai fˆ
dw = w ei w = − 1 i l
Ellipsoidal uncertainty. We first consider the case when D1/2
A is subject to ellipsoidal uncertainty:
Norm-bound uncertainty. We now consider the case
l when
l
= A0 + ui Ai u ∈ R
u
2 1 (34)
i=1 = A + L;R ; ∈ Rl×r
;
1 (37)
n×m
where the given matrices Ai ∈ R , i = 0 l, determine
where A ∈ Rn×m , L ∈ Rn×l , and R ∈ Rr×m are given and ;
an ellipsoid in the space of n × m matrices.
is an uncertain matrix that is bounded by one in maximum
The worst-case VaR then expresses as −w T A0 fˆ + -w,
singular value norm. The above kind of uncertainty is use-
where
ful to model “unstructured” uncertainties in some blocks
-w = max
Cwu + dw
2 + ewT u (35) of A, with the matrices L, R specifying which blocks in
u
2 1 A are uncertain. For details on unstructured uncertainty in
matrices, see Boyd et al. (1994). A specific example obtains
for an appropriate matrix Cw and vectors dw, ew, by setting L = R = I, which corresponds to an additive
linear functions of w that are defined in Theorem 4 below. perturbation of A that is bounded in norm but otherwise
Our approach hinges on the following lemma, whose proof unknown. The following result follows quite straightfor-
can be found in El Ghaoui et al. (2000). wardly from Lemma 1.
Lemma 1. Let C ∈ RN ×l , d ∈ RN , e ∈ Rl and 8 0 be
Theorem 5. When the distribution of returns obeys to the
given. An upper bound on the quantity
factor model (32) and the sensitivity matrix is only known
to belong to the set given by Equation (37), an upper
- = max
Cu + d
2 + eT u
u
2 8 bound on the worst-case VaR given in Equation (33) can be
computed (optimized) via the following SDP in variables /
can be computed via the following SDP: (and w ∈ ):
/ 1 IN C d
min /1 + t + /3 − w T Afˆ
2- min /1 + 82 /2 + /3
C
T
/ 2 Il e 0 2
T
dT eT /3 /1 In+m dw C C
subject to
T T
dw /3 e eT
The following theorem is a direct consequence of the
above lemma, in the case 8 = 1. It shows that we can t wT L
not only compute but also optimize an upper bound on 0 (38)
LT w Il
the worst-case VaR under our current assumptions on the
distribution of returns. where
Theorem 4. When the distribution of returns obeys to the S 1/2 AT S 1/2 RT −Rfˆ
factor model (32) and the sensitivity matrix is only known dw = w C= e=
D 1/2
0
to belong to the set given by (34), an upper bound on
552 / El Ghaoui, Oks, and Oustry
Proof of Theorem 5. Defining Hypercube support. First consider the case when the
1/2 T 1/2 T support is the hypercube > = #xl xu , where xl xu are
S A S R
dw = w C= given vectors, with xl < x̂ < xu . Theorem 1 is extended as
D1/2 0 follows.
−Rfˆ Theorem 6. When the probability distribution of returns
e= rw = LT w has known mean x̂ and covariance matrix
, its support is
included in the hypercube > = #xl xu , and is otherwise
we may express the worst-case VaR as −w T Afˆ + -w, arbitrary, we can compute an upper bound on the worst-
where case Value-at-Risk by solving the semidefinite programming
problem in variable x:
-w =
max
C;T rw + dw
2 + eT ;T rw
;
1
maximize −xT w
=
max
Cu + dw
2 + eT u
x − x̂
u
2
rw
2
subject to 0
x − x̂T
2
min /1 +
rw
22 /2 + /3
2
2 xl x̂ −x
2 xu xl x xu (39)
/1 In+m C dw where
is given in (8).
CT / 2 Ir e
0 We will not present the proof of Theorem 6 here because
dwT eT /3 it is similar to the proof of Theorem 1. Interested readers
may refer to El Ghaoui et al. (2000) for details. If we let
where the last inequality is derived from Lemma 1, with xl = − and xu = +, the last inequalities in (39) become
8 =
rw
2 . Using Schur complements we may rewrite void, and the problem reduces to the one obtained in the
the linear matrix inequality in the last line as case of no support constraints. Thus, the above result allows
T us to refine the condition obtained by simply no taking into
/1 In+m dw C C
account the support constraints. Contrarily to what hap-
T T
dw /3 e eT pens with no support constraints, there is no “closed-form”
solution to the VaR, which seems to be hard to compute
where = 1//2 . Introducing a slack variable t /2 ·
exactly; but computing an upper bound is easy via SDP.
rw
22 =
rw
22 /, we can rewrite the objective as
In fact, Problem (39) can be expressed as an SOCP,
/1 + t + /3 , where t is such that t
rw
22 /. The latter
which makes it amenable to even faster algorithms. Again,
inequality can be written as the linear matrix inequality in
we stress that while this approach is the best in the case of
the theorem. (We note that this constraint is a second-order
cone constraint and its structure should be exploited in a known moments, or with independent polytopic uncertainty
numerical implementation of the theorem.) (as in §2.3), the SDP formulation obtained above is more
Note: Goldfarb and Iyengar (2001) consider uncertainty useful with general convex uncertainty on the moments.
structures in which the uncertainty in the mean is inde- When
0, the SOCP formulation is
pendent of the uncertainty in the covariance matrix of the maximize −xT w
returns. This assumption leads to the terms ew in (35) and
e in (38) being equal to zero. As a result, the expressions in subject to
−1/2 x − x̂
2
2
Lemma 1 and Theorems 4 and 5 become exact, for exam-
2 xl x̂ −x
2 xu xl x xu (40)
ple, the worst-case VaR can be computed and optimized
exactly in this case. Moreover, for the specific uncertainty Let us now examine the problem of optimizing the VaR
structures Goldberg and Iyengar consider, they are able to with hypercube support information. We consider the prob-
formulate these problems as SOCPs. lem of optimizing the upper bound on the worst-case VaR
obtained previously:
4. EXTENSIONS AND VARIATIONS
opt = min max −xT w
V
In this section, we examine extensions and variations on w∈ x
the problem. We assume throughout that x̂ and
are
given, with
0. The extension to moment uncertainty is
x − x̂
subject to 0
straightforward. x − x̂T
2
We now restrict the allowable probability distributions to We can express the inner maximization problem in a dual
be of given support > ⊆ Rn and seek to refine Theorem 1 form (in SDP sense), as a minimization problem. This leads
accordingly. to the following result.
El Ghaoui, Oks, and Oustry / 553
Theorem 7. When the distribution of returns has known argued that such a worst-case scenario is unrealistic. In this
mean x̂ and covariance matrix
, its support is section, we seek to enforce that the worst-case probability
included in the hypercube > = #xl xu , and is other- distribution has some degree of smoothness. The easiest
wise arbitrary, we can optimize an upper bound on the way to do so is to impose a relative entropy constraint
worst-case Value-at-Risk by solving the SDP in variables with respect to a given “reference” probability distribution.
w t u v /u l *u l : We will assume that the probability distribution of
returns satisfies the following assumption, and is other-
opt = min
+
2 v − x̂T w +
2 xuT /u −xlT /l
V wise arbitrary. We assume that the distribution of returns,
+*uT xu − x̂−*lT xl − x̂ while not a Gaussian, is not “too far” from one. Precisely,
we assume that the Kullback-Leibler divergence (negative
subject to
relative entropy) satisfies
w +/l −/u +*u −*l /2
dP
w +/l −/u +*u −*l T /2 v KLP P0 = log dP d (43)
dP0
0 (41)
where d 0 is given, P is the probability distribution of
In the above, when some of the components of xu (resp. xl ) returns, and P0 is a nondegenerate Gaussian reference dis-
are + (resp. −), we set the corresponding components tribution, that has given mean x̂ and covariance matrix
of /u *u (resp. /l *l ) to zero.
0. (Note that a finite d enforces that the distribution
Again, if we set /u = /l = *u = *l = 0 in (41), we recover of returns P is absolutely continuous with respect to the
the expression of the VaR given in (11), which corresponds Gaussian distribution P0 .)
to the exact conditions when first and second moments are We prove the following theorem.
known, and no support information is used. Theorem 9. When the probability distribution of returns is
Ellipsoidal support. In many statistical approaches, such only known to satisfy the relative entropy constraint (43),
as maximum-likelihood, the bounds of confidence on the and the mean x̂ and covariance matrix
of the refer-
estimates of the mean and covariance matrix take the form ence Gaussian distribution P0 are known, the entropy-
of ellipsoids. This motivates us to study the case when the constrained Value-at-Risk is given by
support > is an ellipsoid in Rn :
> = x x − xc T P −1 x − xc 1 V w =
d
1/2 w
2 − x̂T w (44)
i j −
nom i j 8
nom i j
Figure 1. Worst-case VaR of the nominal and robust Figure 3. Worst-case VaR of the nominal and robust
portfolios, as a function of the size of data portfolios, as a function of the probability
uncertainty, 8. level
.
0.025 0.03
0.02
0.02
0.015
0.01
0.01
0.005 0
0 2 4 6 8 10 12 14 16 18 20 0 5 10 15 20 25
Relative uncertainty size ρ (%) Epsilon (%)
556 / El Ghaoui, Oks, and Oustry
6. CONCLUDING REMARKS , . 2000. Optimal inequalities in probability theory:
A convex optimization approach. Working paper, MIT/
Our results can be summarized as follows. The problem of
INSEAD. Submitted to Oper. Res.
computing the worst-case VaR, or optimizing it, takes the
Black, F., R. Litterman. 1992. Global portfolio optimization.
general form Financial Analysts J. 28–43.
minimize - x Bonnans, J. F., A. Shapiro. 2000. Perturbation Analysis of Opti-
mization Problems. Springer Series in Operations Research.
x − x̂ Springer-Verlag, Berlin, Germany.
subject to x x̂
∈
0 Boyd, S., L. El Ghaoui, E. Feron, V. Balakrishnan. 1994. Linear
x − x̂T 2
Matrix Inequalities in System and Control Theory, Vol. 15.
where - is the support function of a convex set, that Studies in Applied Mathematics. SIAM, Philadelphia, PA.
describes the set of admissible portfolio allocation vectors; Costa, O., A. Paiva. 2001. Robust portfolio selection using linear-
the set
reflects the partial information (moments bounds matrix inequalities. J. Econom. Dynam. Control. To appear.
and support) we have on the distribution of returns, and El Ghaoui, L., M. Oks, F. Oustry. 2000. Worst-case value-at-
risk and robust asset allocation: A semidefinite program-
the risk factor depends on the chosen optimization model
ming approach. Technical report M00/59, ERL, University
(entropy-constrained or moment-constrained).
of California, Berkeley, CA.
The optimal variables
x̂ in the above problem are Goldfarb, D., G. Iyengar. 2001. Robust portfolio selection prob-
selected to be the most prudent when facing data uncer- lems. CORC technical report, TR-2001-05. Submitted to
tainty. The duality between the portfolio weights and the Math. Oper. Res.
worst-case probability distribution information (x̂
) is Halldórsson, B. V., R. Tütüncü. 2000. An interior-point method
reminiscent of the duality in option pricing problems, for a class of saddle-point problems. Submitted to J. Optim.
between the optimal hedging strategy (for replicating the Theory Appl.
price of an option) and the risk-neutral probability measure Hiriart-Urruty, J., C. Lemaréchal. 1993. Convex Analysis and Min-
(Musiela and Rutkowski 1997). imization Algorithms. Springer-Verlag, Berlin, Germany.
As noted in §2.1, the above formulation has a determin- Linsmeier, T. J., N. D. Pearson. 1996. Risk Measurement: An
istic interpretation, in which the returns are only known to introduction to value-at-risk. Technical report 96-04, OFOR,
belong to a union of ellipsoids of the form University of Illinois, Urbana-Champaign, IL.
Lobo, M. S., S. Boyd. 1999. The worst-case risk of a portfolio.
x
2
x − x̂x − x̂T In preparation.
, M. Fazel, S. Boyd. 2000. Portfolio optimization with linear
where the shape matrix
and center x̂ are unknown- and fixed transaction costs and bounds on risk. Submitted to
but-bounded, and the problem is to allocate resources in Oper. Res.
a “min-max,” or game-theoretic, manner. Our SDP solu- , L. Vandenberghe, S. Boyd, H. Lebret. 1998. Second-order
tion illustrates a kind of “certainty equivalent principle” cone programming: Interior-point methods and engineering
by which a problem involving probabilistic uncertainty has applications. Linear Algebra Appl. 284 193–228.
an interpretation, and an efficient numerical solution, as a Luenberger, D. G. 1999. Investment Science. Oxford University
deterministic game. Press, Oxford, U.K.
The numerical tractability of the above problem depends on Markowitz, H. 1952. Portfolio selection. J. Finance 7(1)
the structure of the sets ,
. We have identified some prac- 77–91.
Musiela, M., M. Rutkowski. 1997. Martingale Methods in
tically interesting cases when these sets result in a tractable,
Financial Modelling, Vol. 36. Applications of Mathemat-
semidefinite programming problem, namely componentwise ics, Stochastic Modelling and Applied Probability. Springer,
and ellipsoidal bounds. In the case of support constraints Berlin, Germany.
on the distribution of returns the problem does not seem to Nesterov, Y., A. Nemirovsky. 1994. Interior Point Polynomial
be tractable, but we have shown how to compute an upper Methods in Convex Programming: Theory and Applications.
bound on the worst-case VaR via semidefinite programming. SIAM, Philadelphia, PA.
Pearson, N. D., X. Ju. 1999. Using value-at-risk to control risk
ACKNOWLEDGMENTS taking: How wrong can you be? J. Risk 1(2).
Pritsker, M. 1997. Evaluating value-at-risk methodologies: Accu-
The authors acknowledge the valuable input from racy vs. computational time. J. Financial Services Res.
Alexandre d’Aspremont, Dimitris Bertsimas, Stephen 12(2/3) 201–242.
Boyd, Sanjiv Das, Darrell Duffie, Miguel Lobo, and Ioana Saigal, R., L. Vandenberghe, H. Wolkowicz, eds. 2000. Hand-
Popescu. They thank Xavier Burtschell and Aniruddha Pant book on Semidefinite Programming and Applications. Kluwer
for their help with numerical implementations. Academic Publishers, Dordrecht, The Netherlands.
Smith, J. 1995. Generalized Chebyshev inequalities: Theory and
REFERENCES applications in decision analysis. Oper. Res. 43 807–825.
Sturm, J. 1998. SeDuMi: A Matlab Toolbox. Version 1.03.
Berstimas, D., I. Popescu. 1999. On the relation between option Vandenberghe, L., S. Boyd. 1996. Semidefinite programming.
and stock prices: A convex optimization approach. Work- SIAM Rev. 38(1) 49–95.
ing paper #4085, Sloan School of Management, MIT, , . 1999. SP Version 1 1 Software for Semidefinite Pro-
Cambridge, MA. gramming, User’s Guide.