Académique Documents
Professionnel Documents
Culture Documents
Ravi Shukla
Contents
1 Notation 2
2 Definitions 2
3 Mean-Variance Frontier 3
13 Pricing Error 20
1
These notes are a compilation of some basic ideas in asset pricing. The motivation
behind these notes is to have the rigorous derivations and related concepts available
in a consistent notation in one place. Many of the concepts in mean-variance analysis
can be found in Merton (JFQA, September 1972) and Roll (JFE, 1977; JF, September
1978; JFQA, December 1980).
The first two sections set up the notation and provide the definitions of some of
the common concepts. Sections 3 through 9 do the mean-variance analysis and derive
various forms of the capital asset pricing equations. Section 10 provides a criticism of
the mean-variance analysis. Section 11 onwards are devoted to the development of the
arbitrage pricing theory.
1 Notation
Let the market have n risky assets. Then we define the following notation:1
ΣT = Σ and rank (Σ) = n. The rank condition guarantees that there are no linear
dependencies among asset returns and that there is no risk-free asset among these n
assets.
2 Definitions
A risk-free portfolio is defined to have weights Wf such that:
WfT ΣWf = 0,
WfT ` = 1,
WfT M = rf > 0.
2
Orthogonal (zero covariance) portfolios Z and Z are defined to have weights
WZ and WZ such that:
T
WZ
ΣWZ = 0,
T
WZ
` = 1,
T
WZ ` = 1
WT ΣW = 0,
T
W ` = 0,
WT R̃ = WT M = 0.
M = f (Σ)
Quadratic utility function or joint normality of asset returns transforms the problem
to:
Max
F (WT M, WT ΣW), (2.3)
W
s.t. WT ` = 1, (2.4)
where F is the function obtained after the transformation. For risk-averse investors,
F1 > 0 and F2 < 0 where Fk denotes the partial derivative with respect to the k th
argument, k = 1, 2.
3 Mean-Variance Frontier
Following the discussion above we will assume that the normality or the quadratic
utility function assumption is satisfied, so that we can limit our analysis to mean and
variance only. We will also assume unanimity of beliefs so that the problem of ag-
gregation is avoided.2 Realizing the properties of F , we can write the problem for
2 This is not very crucial to our results. Several authors have shown that the mean-variance analysis holds
3
risk-averse investors as:
Min
σ 2 = WT ΣW,
W
s.t. WT M = µ,
WT ` = 1,
where µ is the desired level of the portfolio expected return. The solution to this prob-
lem will result in a parametric equation for the efficient frontier. Writing the Lagrangian
the problem is transformed to:
Min
WT ΣW − λ1 (WT M − µ) − λ2 (WT ` − 1).
W
The first order conditions give us:
From (3.1),
1h i
WT = λ1 MT + λ2 `T Σ−1 ,
2
so that by post-multiplying,
1 1
WT M = λ1 MT Σ−1 M + λ2 `T Σ−1 M = µ
2 2
1 T −1 1 T −1
WT ` = λ1 M Σ ` + ` Σ ` = 1
2 2
Define
to get
1 1
λ1 B + λ2 A = µ,
2 2
1 1
λ1 A + λ2 C = 1,
2 2
which can be solved to give
4
where D = BC − A2 > 0.3 Substituting for λ1 and λ2 in (3.4) we get:
Cµ − A T B − Aµ T
T
W = M + ` Σ−1 . (3.7)
D D
This defines the composition of the mean-variance frontier portfolio corresponding to
expected return µ. Now we want to get the equation for the mean-variance frontier. For
that we have to eliminate W from the equations. Post-multiplying equation (3.8) by
ΣW we get:
Cµ − A T B − Aµ T
WT ΣW = M W+ ` W
D D
but MT W = WT M = µ, `T W = WT ` = 1 and WT ΣW = σ 2 , so that,
Cµ − A B − Aµ
σ2 = µ+ ,
D D
Cµ2 − 2Aµ + B
= ,
D
which is a parabola in (µ, σ 2 ) plane.4
Since
dσ 2 2(Cµ − A) A
= =0 ⇒ µ= ,
dµ D C
A 1
the minimum variance portfolio in (µ, σ 2 ) plane is located at ( C , C ). The composition
T 1 T −1
of the minimum variance portfolio is given by W = C ` Σ . The efficient frontier
is the part of the parabola for which a higher variance (σ 2 ) is associated with a higher
2
dσ 2 2(µC − A) 2C(µ − C
A
)
= = ,
dµ D D
2(µ − µ) A 1
= where µ= and σ2 = ,
Dσ 2 C C
we get µ > µ for the efficient portion of the frontier.
3 Since Σ−1 is positive definite, the quadratic form (AM − B`)T Σ−1 (AM − B`) > 0. Which can
be simplified to:
A2 MT Σ−1 M + B 2 `T Σ−1 ` − 2ABMT Σ−1 ` > 0,
A2 B + B 2 C − 2A2 B > 0,
B(BC − A2 ) > 0,
−1
and since B > 0 (it is a quadratic form of Σ ) we get the desired result. Also note that C > 0 for the
same reason as B.
4 For the purposes of plotting the curve, the following form of the equation is more useful:
r
A D 2 D
µ= ± σ − 2.
C C C
5
Let us examine some of the properties of the frontier in the (µ, σ) plane. The
frontier is a hyperbola in this plane.
dσ dσ dσ 2 1 dσ 2
= 2
= ,
dµ dσ dµ 2σ dµ
µ−µ
= ,
Dσσ 2
which means that the efficient portion of the frontier has the positive slope and
d2 σ 1 d2 σ 2 1 −1 dσ dσ 2
= + ,
dµ2 2σ dµ2 2 σ2 dµ dµ
1 d2 σ 2 1 1 dσ 2 dσ 2
= − ,
2σ dµ2 2σ 2 2σ dµ dµ
" 2 2 #
1 d2 σ 2 1 dσ
= − ,
2σ dµ2 2σ 2 dµ
" 2 #
1 2 1 2(µ − µ)
= − 2 ,
2σ Dσ 2 2σ Dσ 2
1 2 2 (µ − µ)2
= − 2 ,
2σ Dσ 2 σ D2 σ 4
1 (µ − µ)2
= 0 if = ,
Dσ 2 σ2 D2 σ4
or Dσ 2 σ 2 = (µ − µ)2 .
This condition is not met by all µ and σ. This implies that the efficient frontier in
(µ, σ) plane is not a straight line. The efficient frontier is a straight line only when Σ
is not positive definite which happens when either there is a risk-free asset or the assets
are perfectly (positively or negatively) correlated.
6
The first order conditions:
MT − λ1 (2WT Σ) − λ2 (`T ) = 0, (4.1)
T 2
W ΣW = σ , (4.2)
WT ` = 1. (4.3)
From (4.1)
1 h T i
WT = M − λ2 `T Σ−1 ,
2λ1
so that by post-multiplying,
1 h T i
WT M = µ= M − λ2 `T Σ−1 M,
2λ1
1 h T i
and WT ` = 1 = M − λ2 `T Σ−1 `.
2λ1
Simplifying using the definitions of A, B and C from above:
2λ1 µ + λ2 A = B,
and 2λ1 + λ2 C = A.
BC − A2
⇒ λ1 =
2(µC − A)
µA − B
and λ2 =
µC − A
Substituting these in equation (4.4) and simplifying we get the same form as (3.8).
7
T T
B`T −AMT −1
where GT = CM D−A` Σ−1 and HT = D Σ .
Comparing (5.2) and (5.1) we get:
x(WPT − WQ
T
) = µGT + HT − WQ
T
.
WT = µδ(WPT − WQ
T
) − α(WPT − WQ
T T
) + WQ . (5.3)
Comparing (5.3) with (5.2) and realizing that WP and WQ must be independent
of µ, we get:
GT = δ(WPT − WQ
T
),
and HT = T
WQ − α(WPT − WQ
T
),
which can be solved to give:
1+α T
WPT = G + HT = µP GT + HT ,
δ
T α T
and WQ = G + HT = µQ GT + HT
δ
with
1+α T
µP = WPT M = G M + HT M,
δ
1 + α CB − A2 BA − AB
= + ,
δ D D
1+α
= . (5.4)
δ
Similarly,
T α
µQ = W Q M = , (5.5)
δ
from which
1 µQ
δ= and α= .
µP − µQ µP − µQ
WP and WQ are not unique. For different values of α and δ different pairs of
funds may be selected. Note that funds P and Q can generate all frontier portfolios, not
just the efficient ones. Also note from the expressions for WP and WQ and comparing
them with (5.2) that funds P and Q lie on the mean-variance frontier but not necessarily
on the efficient part of it.
8
6 Properties Of The Funds
WPT ΣWP = µP GT ΣWP + HT ΣWP ,
CMT − A`T B`T − AMT
= µP WP + WP ,
D D
CMT WP − A`T WP B`T WP − AMT WP
= µP + ,
D D
1+α 1+α
1 + α C( δ ) − A B − A( δ )
= + ,
δ D D
C(1 + α)2 − Aδ(1 + α) Bδ − A(1 + α)
= + ,
Dδ 2 Dδ
or,
C(1 + α)2 + Bδ 2 − 2Aδ(1 + α)
σP2 = . (6.1)
Dδ 2
T
WQ ΣWQ = µQ GT ΣWQ + HT ΣWQ ,
α C( αδ ) − A B − A( αδ )
= + ,
δ D D
or,
2 Cα2 − 2Aαδ + Bδ 2
σQ = . (6.2)
Dδ 2
Combining (6.1) and (6.2) and simplifying we get:
C + 2(αC − Aδ)
σP2 = σQ
2
+ .
Dδ 2
Finally,
WPT ΣWQ = µP GT ΣWQ + HT ΣWQ ,
CMT − A`T B`T − AMT
= µP WQ + WQ ,
D D
CµQ − A B − AµQ
= µP + ,
D D
C( 1+α 1+α
δ )( δ ) − A( δ ) + B − A( δ )
α α
= ,
D
Cα2 − 2Aαδ + Cα + Bδ 2 − Aδ
= , (6.3)
Dδ 2
or,
2 Aδ − Cα
σP Q = σQ − . (6.4)
Dδ 2
If we restrict both the funds to be efficient portfolios and without loss of generality
let µP > µQ ≥ µ = C A
and σP2 > σQ 2
, then δ > 0 and α ≥ Aδ C . Then from (6.4)
σP Q = 0 requires:
2 Aδ − αC
σQ = <0
Dδ 2
9
which is impossible. This means that there are no two funds that lie on the efficient
portion of the frontier and are orthogonal.
CMT − A`T −1
WPT − WQ
T
= (µP − µQ ) Σ .
D
which leads to:
" #
A D WPT − WQT
MT = `T + Σ. (7.4)
C C µP − µQ
10
This is the answer to our pricing objective alluded in section 2. Post-multiplying
(7.4) with the weight vector for an arbitrary portfolio X, we get:
A T D WPT ΣWX − WQ T
ΣWX
M T WX = ` WX + ,
C C µP − µQ
which leads to the following form of the asset pricing equation:
A D Cov(r̃P , r̃X ) − Cov(r̃Q , r̃X )
µX = + . (7.5)
C C µP − µQ
From (8.1)
λ
WT = (MT − `T rf )Σ−1 . (8.3)
2
Multiplying (8.3) by M, we get:
λ λ
WT M = (MT − `T rf )Σ−1 M = (B − Arf ), (8.4)
2 2
where A and B are as defined in section 3. Combining (8.2) and (8.4) and simplifying
we get:
2(µ − rf + WT `rf )
λ= . (8.5)
B − Arf
Post-multiplying (8.3) by ` we get:
λ λ
WT ` = (MT − `T rf )Σ−1 ` = (A − Crf ). (8.6)
2 2
11
Substitute for WT ` from (8.6) in (8.5) and simplify to get:
2(µ − rf )
λ= . (8.7)
B − 2Arf + Crf2
WT gives us the portions of wealth invested in the risky assets. (1−WT `) portion
of the wealth is invested in the risk-free asset. Now,
µ − rf
σ 2 = WT ΣW = [MT − `T rf ]W,
B − 2Arf + Crf2
(µ − rf )2
= .
B − 2Arf + Crf2
Since σ 2 > 0, (B − 2Arf + Crf2 ) > 0. Now we get the equation of the frontier as:
q
µ − rf = ±σ B − 2Arf + Crf2 ,
[MT − `T rf ]
WT = (µ − rf )UT where UT = Σ−1
B − 2Arf + Crf2
and,
wf = 1 − (µ − rf )UT `.
x = δ(µ − rf ) + 1 − α, δ 6= 0,
12
so that equation (8.9) becomes:
T T T T T T
W = δ(µ − rf )(WP − WQ ) + (1 − α)(WP − WQ ) + WQ .
Using the same arguments as in section 5, we get for the risky assets:
δ(WPT − WQ
T
) = UT ,
and (1 − α)(WPT − WQ
T T
) + WQ = 0,
which give us
αUT
WPT =
δ
and
T (α − 1)UT
WQ = ,
δ
and for the risk-free asset:
α(A − rf C)
wfP = 1 − WPT ` = 1 − ,
δ(B − 2Arf + Crf2 )
T (α − 1)(A − rf C)
wfQ = 1 − WQ `=1− .
δ(B − 2Arf + Crf2 )
If we want the fund Q to be the risk-free asset and fund P to be made up of risky
assets only, we get from the condition for the fund Q: wfQ = 1 and WQ = 0 which
imply α = 1. From the condition on fund P we get wfP = 0 which, together α = 1,
gives us:
A − rf C
δ= ,
B − 2Arf + Crf2
since δ 6= 0 we get rf 6= A
C. Under these conditions we get:
[MT − `T rf ] −1
WPT = Σ (8.10)
(A − rf C)
[MT − `T rf −1 B − rf A
µP = WPT M = Σ M= , (8.11)
(A − rf C) A − rf C
and
µP − rf
Var(r̃P ) = WPT ΣWP = . (8.12)
A − rf C
For P to be an efficient portfolio, µP > rf . Then from nonnegativity of Var(r̃P )
A
we get rf < C .
13
Now for an arbitrary portfolio X with weights WX ,
[MT WX − `T WX rf ] µX − rf
Cov(r̃X , r̃P ) = WPT ΣWX = = .
(A − rf C) A − rf C
which can be combined with (8.12) to give the security market line of the capital asset
pricing model:
1 ∂µM ∂(1/σM )
+ (µM − rf ) − λ` = 0, (9.1)
σM ∂WM ∂WM
and
T
WM ` = 1 (9.2)
Simplify (9.1):
2
1 ∂(1/σM ) ∂σM ∂σM
M + (µM − rf ) 2 = λ`,
σM ∂σM ∂σM ∂WM
1 −1 1
M + (µM − rf )( 2 )( )2ΣWM = λ`,
σM σM 2σM
1 (µM − rf )
M− 3 ΣWM = λ`. (9.3)
σM σM
14
T
Pre-multiply by WM to get:
T
WM M (µM − rf ) T T
− 3 WM ΣWM = λWM `,
σM σM
µM (µM − rf ) 2
− 3 σM = λ,
σM σM
rf
⇒ λ = .
σM
Substitute this value of λ in (9.3) to get:
1 (µM − rf ) rf
M− 3 ΣWM = `,
σM σM σM
(µM − rf )
⇒ M− 2 ΣWM = rf ` (9.4)
σM
T
Multiply (9.4) by WX the weights of an arbitrary portfolio X, to get:
T µM − rf T T
WX M− 2 WX ΣWM = rf WX `,
σM
µM − rf
⇒ µX − 2 σXM = rf ,
σM
which gives us the capital asset pricing model:
σXM
µX = rf + (µM − rf ) 2 .
σM
Now we can solve for the composition of the market portfolio WM . Rewrite (9.4)
as:
µM − rf
2 WM = Σ−1 (M − rf `).
σM
15
viz. (7.3), (7.5), (8.13) and (9.5) simply state that there is a linear tradeoff between the
expected return and a measure of risk. The risk is measured by some combination of the
covariance of the asset’s returns with that of the mutual funds. The verifiability of the
entire mean-variance analysis, therefore, depends on the empirical measurability of the
funds. Theoretically, we must consider all the assets in the economy while obtaining
the funds. Empirically, this is an impossible task. The true verification of the theory is,
therefore, not possible. This is the main concern of Roll’s critique of the mean-variance
capital asset pricing models.
Another problem with the theory is the justifiability of the assumptions that lead to
the suitability of analysis in the mean-variance framework. Imposing quadratic utility
functions on the investors is very difficult to justify. There may be some justification
for the distributional assumption. Empirical analysis by Fama and many others pro-
vides some hope that asset returns may belong to the stable paretian family. Recently,
Chamberlain has done some theoretical work outlining the kind of distributions that
permit mean-variance analysis. But one cannot say with confidence that the distribu-
tional assumptions fit the data.
There is also the question of justification of expected utility maximization. As-
sumption of rationality on part of the investors so that they fulfill all the axioms for
measurable utility functions has been under fire from time to time. Apparently it is
much easier to maintain these assumptions while considering a narrow range of wealth
than a wide one.
Finally, every test of the asset-pricing result involves a joint test of the assumption
of informational efficiency of financial markets.
The major discontent with the mean-variance analysis on the theoretical front has
been as discussed in the first two paragraphs. On the empirical side, there have been
questions about the degree of systematic noise not incorporated in the model. The
commonly known arguments on this front surfaced in the form of the size effect.
There has been a general awareness for a long time that a better model is needed.
Multiperiood models of Merton, Breeden etc. were efforts in this direction but they
never gained as much popularity. Ross (1976,1977) capitalized on the popular mul-
tifactor asset returns model and combined that with a simple implication of market
efficiency to arrive at another linear, approximate asset pricing model called the arbi-
trage pricing model. The biggest plus with the arbitrage pricing model is that it can be
tested without looking at the entire population of assets.
Here δ̃s are the economy-wide common independent factors that affect the returns
of all the assets in the economy to different degrees. βik is the sensitivity of the ith as-
16
set’s returns to the k th common factor. ˜i is the idiosyncratic or the unique component
of the asset return that is not related to any of the common factors. In the matrix form
we can write:
˜ + ε̃ε
R̃ = M + B∆ (11.2)
R̃ and M are the n × 1 realized and expected return vectors for the assets, respec-
tively, as defined in section 1.
β11 β12 . . . β1k
β21 β22 . . . β2k
B= . .. .. ..
.. . . .
βn1 βn2 ... βnk
ηT ` = 0 (11.3)
The portfolio is called arbitrage portfolio for another reason. It is created in such a way
that it has no factor related or common risk i.e.:
η T B = ZT
k (11.5)
ηT R̃ = η T M + η Tε̃ε (11.6)
17
idea behind APT is that the investors will choose arbitrage portfolios such that the
idiosyncratic component of the random return (and, therefore, the risk) is minimized.
In the limit, if the number of assets is very large (tending to infinity), this component
will be (tending to) zero. Therefore, we can approximate the cashflow as:
η T R̃ ' η T M (11.7)
The above equation is saying that on the arbitrage portfolio constructed above the
future cashflow is not random. Instead, it is a determinate amount equal to η T M. In
frictionless efficient markets, a zero net position with a zero factor risk and insignificant
(approximately zero) idiosyncratic risk should provide insignificant (approximately
zero) cashflows or else there would be arbitrage opportunities in the economy. Ross
motivates this transition using the law of large numbers. Others have used limit or
sequence economies for the same purpose. As a result equation (11.7) can be written
as:
Now combining equations (11.3), (11.5) and (11.8) one sees that vector η is ex-
actly orthogonal to the k column vectors B and the unit vector ` and is approximately
orthogonal to vector M. If we had exact orthogonality with all the vectors, we would
know that the vectors `, M and the k column vectors of B lie in a k + 1 dimensional
space. As a result any of the vectors could be expressed as an exact linear combination
of the remaining k + 1 vectors. Since, we do not have exact orthogonalities, the linear
combination relationship would only be an approximate one. As a result we write:
M ' γ0 ` + BΓ (11.9)
Here γ0 and Γ = [γ1 γ2 . . . γ3 ]T are constants that satisfy (11.9). In expanded form,
we can write for an asset i:
or,
M = γ0 ` + BΓ + u (11.12)
where ui is the pricing error in the ith asset. Several interpretations of the γs have
been offered by different authors (Ross; Ingersoll; Admati & Pfleiderer). The simplest
of them all is by Ross. Each γk ∀ k is a risk premium demanded by a ‘factor fund’
portfolio. If we were to construct k + 1 funds such that the first fund would be totally
independent of (orthogonal to) the other k and each of the remaining k funds would
exactly mimic the k factors, then γ0 would be the expected return on the first fund
(subscripted zero) while γk ∀ k > 0 would be the risk premia associated with such
18
fund portfolios. The risk premium is defined with respect to γ0 i.e., γk = µk − µ0
where µk is the expected return on the k th fund and µ0 = γ0 . If there is a risk-free
asset in the market then µ0 and γ0 equal the return on the risk-free asset. In this notation
the pricing equation (11.10) can be written as:
For testing purposes, the mean-variance models require an expectation model. APT
circumvents that problem. Substitute (11.11) in (11.1) to get:
(a) The asset pricing relationship (11.11) is an approximate one. The approximation
error is not due to estimation or such other empirical issues. The pricing error is
the recognition of the fact that assets do have non-diversifiable idiosyncratic risk
and investors do expect to be compensated for assuming that risk. As to how small
this pricing error should be will, of course, depend on the characteristics of the
investor and that of the economy. Several authors have provided bounds for this
pricing error (Dybvig; Grinblatt & Titman).
(b) The number of factors is not exactly known. The issue can not be resolved em-
pirically. Any text-book on factor analysis will tell you that in factor extraction
exercise, the number of factors extracted could be a very personal matter. Also,
the number of factors (based on any test of statistical significance) will increase as
the number of variables being analyzed is increased. There is significant amount
of empirical evidence on this issue relating to APT also (Dhrymes, Friend & Gul-
tekin; Trzcinka).
(c) There is also the related issue of the effect of inability to extract all the factors on
the properties of the error component of the return generation process. Essentially,
the errors will indeed have cross-sectional dependencies. It is not clear if the pric-
ing error bound will be as small as it is in the case of perfectly uncorrelated error
terms. Some authors have done work on this problem and have shown APT to be
rigorous under approximate factor structure (Ingersoll; Connor; Chamberlain &
Rothschild).
19
(d) Finally, there is the issue of identification of these ‘pervasive economic factors’.
No major published work has appeared in this area. This is simply because, draw-
ing from the factor analytic research in other academic areas, the task is almost
impossible.
13 Pricing Error
We know that the approximation in (11.8) will be a good one for some investors while
it may not be as good for others depending on their utility functions and wealth levels.
This section provides a unique definition of no-arbitrage and derives an expression for
the pricing error.
Let us denote the utility function of the investor by U (W ). We define the no-
arbitrage condition as certainty equivalent of the random cashflow being equal to zero.
If this condition were to be violated, it would be possible to trade the arbitrage portfolio
for a nonzero value. Therefore,
where W0 is the initial total wealth of the investor. Substituting for η R̃ from (11.6)
and using the condition (13.1) we get:
realizing that E(η Tε̃ε) = 0 and using Jensen’s inequality and concavity of the utility
function we get:
W0 ≤ W0 + η T M (13.5)
or,
ηTM ≥ 0 (13.6)
This implies that the required (expected) return on the arbitrage portfolio will be pos-
itive, rather than zero. As yet we have no implication for the individual pricing errors
but the equations above can be used to determine a zero error in pricing. An exact
expression for the zero error can be obtained if we assume an exact functional form for
the utility function. Let us take U (W ) = −e(−AW ) then equation (13.3) can be written
as:
T T
−e(−AW0 ) = −e[−A(W0 +η M)]
E(e(−Aη ε)
ε̃
) (13.7)
20
or,
T T
e(Aη M)
= E(e(−Aη ε)
ε̃
) (13.8)
Now, if we assume a specific density function for ε̃ε, we can arrive at an explicit ex-
pression. Let us assume it to be a multivariate normal with mean vector zero and the
variance-covariance matrix Φ, then by the use of the moment generating function for
the normal distribution we get:
T 2
1
η T Φη)
e(Aη M)
= e( 2 A (13.9)
or
1 T
ηTM = Aη Φη (13.10)
2
This can be written as:
1
η T (M − AΦη) = 0 (13.11)
2
Equation (13.11), therefore, should be used in the derivation of the pricing equation
instead of equation (11.8). If we use this equation, we get the following exact pricing
equation:
1
M = γ0 ` + BΓ + AΦη (13.12)
2
or,
1
µi = γ0 + βi1 γ1 + βi2 γ2 + . . . + βik γk + Aη i Var(˜
i ) (13.13)
2
This gives us the pricing error from the usual approximate APT. If the investor is
risk-averse then realizing that Φ is diagonal consistent with the original factor model
with independent error terms, we realize that the pricing error will have the same sign
as the amount of wealth invested in the asset to form the arbitrage portfolio! As we saw
in section 11, n − k − 1 different arbitrage portfolios can be formed by the investors,
and as a result, different investors will have different signs and magnitudes of η i . This
seems to indicate that the pricing error (or the premium demanded for bearing the
idiosyncratic risk) will be different for different investors. Uniformity, and therefore
equilibrium, can be imposed by aggregation or some other assumption. We attempt a
mean-variance approach in section 15.
Alternately, we can derive an expression by using the Taylor expansion.
U (W0 + η T M + η Tε̃ε) = U (W0 ) + (η T M + η Tε̃ε)U 0 (W0 )
1
+ (η T M + η Tε̃ε)2 U 00 (W̃0 )
2
where W̃0 is in (W0 , W0 + η T M + η Tε̃ε). Substituting this in (13.3) and taking ex-
pectations we get:
1 U 00 (W̃0 )
η T M = − η T Φη 0
2 U (W0 )
21
00
(W̃0 )
If we define − UU 0 (W 0)
= Ã we get the same form as (13.10) and the rest of the deriva-
tion is same as above with the difference that the pricing error is now random and the
best we can do is find a bound.
Both Dybvig and Grinblatt & Titman arrive at pricing error bounds under different
market conditions and assumptions . The above pricing error expression has direct
resemblance to these bounds.
B2 = B1 Θ
ε̃ε2 = ε̃ε1
Φ2 = Φ1
R̃2 − M2 = ˜ 2 + ε̃ε2
B2 ∆
= B1 ΘΘ−1 ∆ ˜ 1 + ε̃ε1
= ˜
B1 ∆1 + ε̃ε1
= R1 − M1
22
And now equation (14.2):
T
Σ2 ˜ 2∆
= B2 E(∆ ˜ )BT + Φ2
2 2
˜ T Θ−1 )ΘB1 + Φ1
˜ 1∆
= B1 ΘE(Θ−1 ∆ 1
T
˜ 1∆
= B1 E(∆ ˜ )BT + Φ1
1 1
= Σ1
˜ are unobservable, one solution is indistinguishable from the other. This
Since the ∆s
issue of identification is essentially a scaling problem.
Some order can be instated by arbitrarily specifying some property of ∆. ˜ It is
customary to set the variance-covariance matrix equal to the identity matrix i.e.
˜∆
Ω = E(∆ ˜ T) = I (14.3)
Now this identification problem is resolved because we cannot find Θ 6= I such that:
T T
˜ 2∆
Ω2 = E(∆ ˜ 1∆
˜ ) = ΘE(∆ ˜ )ΘT = ΘIΘT = I
2 1
Ω2 = E(∆ ˜ T)
˜ 2∆
2
T
˜ 1∆
= TE(∆ ˜ )TT
1
= TITT
= I
23
And finally, (14.4):
Σ2 = B2 B T
2 + Φ2
= B1 TTT BT
1 + Φ1
= B1 B T
1 + Φ1
= Σ1
This problem is in the nature of rotation of the factors. It is not clear if a property
of B or another property of ∆˜ can be specified to control for this problem.
If we solve the problem as stated here, it can be easily verified that the only solution
is the trivial solution η = Zn . The way to prevent this from happening is to force the
norm of η to be nonzero i.e. add one more constraint η T η = 1 where the norm
has been arbitrarily scaled to unity. This quadratic problem, however, is extremely
difficult to solve. Therefore, we devise another simpler, linear constraint. As long
as we can force one element of the η to be nonzero, we have a non-trivial solution
at hand. However, it should be realized that by doing this we may be obtaining a
minimum under an artificial constraint. It is quite likely that forcing another element
to be non-zero would have led the arbitrarily set element to be zero and a ‘better’
solution. Therefore, the minimization process will have to be done in two steps. First,
find minimum variance by forcing one element of η to be non-zero. In the second step,
choose the global minimum from among all these solutions. Therefore, we impose the
additional constraint:
η T ei = 1
where ei is a n×1 unit vector with 0 in all positions but the ith which has a 1. The non-
zero element of η has been arbitrarily scaled to be 1. The Lagrangian for the problem
24
is:
Min
η T Φη − (η T B − ZT T T
k )λ1 − (η `)λ2 − (η ei − 1)λ3
η
λ1 is a k × 1 vector while λ2 and λ3 are scalars. The first order conditions are:
T
2η T Φ − λT T T
1 B − λ2 ` − λ3 ei = ZT
n (15.1)
ηTB = ZT
k (15.2)
ηT` = 0 (15.3)
η T ei = 1 (15.4)
η T B = λT T −1
1B Φ B + λ2 `T Φ−1 B + λ3 eT
i Φ
−1
B = ZTk (15.6)
T T T −1 T −1 T −1
η ` = λ1 B Φ ` + λ2 ` Φ ` + λ3 ei Φ ` = 0 (15.7)
T −1
η T ei = λT
1B Φ ei + λ2 `T Φ−1 ei + λ3 eT
i Φ
−1
ei = 2 (15.8)
A = `T Φ−1 B
[1×k]
AT = BT Φ−1 `
[k×1]
B = BT Φ−1 B
[k×k]
C = `T Φ−1 `
[1×1]
and also,
−1
Ei = eT
i Φ B
[1×k]
EiT = BT Φ−1 ei
[k×1]
Fi = eT
i Φ
−1
` = `T Φ−1 ei
[1×1]
−1
Gi = eT
i Φ ei
[1×1]
where [· · · ] denotes the dimensionality of the variables. Now equations (15.6) through
(15.8) can be written as:
λT
1 B + λ2 A + λ3 Ei = ZT
k (15.9a)
λT T
1 A + λ2 C + λ3 Fi = 0 (15.9b)
λT T
1 Ei + λ2 Fi + λ3 Gi = 2 (15.9c)
25
To solve these equations, it will be convenient to write them as:
B T λ1 + AT λ2 + EiT λ3 = Zk (15.10a)
Aλ1 + Cλ2 + Fi λ3 = 0 (15.10b)
Ei λ1 + Fi λ2 + Gi λ3 = 2 (15.10c)
The solution to these equations is obtained using the Cramer’s rule as:
T
B Zk EiT
A 0 Fi
Ei 2 Gi
λ2 = (15.11)
T DT
B A Zk
A C 0
Ei Fi 2
λ3 = (15.12)
D
where D is the determinant:
T
B AT EiT
D = A C Fi
Ei Fi Gi
These then give us the particular solution η i by substituting in (15.5). Here the sub-
script i denotes that the solution is for the particular arbitrary ei . The global solution
would be the one with the lowest variance among all the solutions, i.e.
η = ηi s.t. ηT
i Φη is minimized. (15.14)
26