Académique Documents
Professionnel Documents
Culture Documents
Econometrics 1
Part 2: Volatility Modelling and Forecasting
Sigit Wibowo
April 7, 2015
Contents
1 Motivations
1.1 Stylised Facts in Financial Data . . . . . . . . . . . . . . . . .
1.2 Types of Non-linear Models . . . . . . . . . . . . . . . . . . .
1.3 Non-linearity Tests . . . . . . . . . . . . . . . . . . . . . . . .
2
2
4
5
2 EWMA
2.1 EWMA Specication . . . . . . . . . . . . . . . . . . . . . . .
2.2 The Advantages . . . . . . . . . . . . . . . . . . . . . . . . . .
7
7
7
3 ARCH
7
3.1 ARCH Specication . . . . . . . . . . . . . . . . . . . . . . . . 7
3.2 ARCH Effect Tests . . . . . . . . . . . . . . . . . . . . . . . . . 9
3.3 Problems with ARCH() Models . . . . . . . . . . . . . . . . . 10
4 GARCH
10
4.1 GARCH Specication . . . . . . . . . . . . . . . . . . . . . . . 10
4.2 The ML Estimation . . . . . . . . . . . . . . . . . . . . . . . . 12
5 ARCH/GARCH Model Extensions
18
5.1 EGARCH Model . . . . . . . . . . . . . . . . . . . . . . . . . . 19
5.2 GJR-GARCH Model . . . . . . . . . . . . . . . . . . . . . . . . 19
5.3 GARCH-in Mean Model . . . . . . . . . . . . . . . . . . . . . 21
6 Volatility Forecasting
6.1 Overview . . . . . . . . . . .
6.2 Volatility Forecasting . . . . .
6.3 The Use of Volatility Forecasts
6.4 Testing Non-linear Restrictions
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
21
21
22
23
24
25
References
[1] Chris Brooks Introductory Econometrics for Finance, 2nd edition. Cambridge University Press, 2008.
[2] Ronald J. Wonnacott and Thomas H. Wonnacott Econometrics, 2nd
edition. John Wiley & Sons, Inc., 1979.
Timetable
Week
Topics
References
[1], Ch. 8
[1], Ch. 6, 7
[2], Ch. 18
Motivations
1.1
= + + ... + +
(1)
or
= +
where we assumed that (0, )
What is volatility?
Volatility is simply dened as standard deviation
Variance is often preferred because it measures in the same units
as original data
More denition:
Conditional volatility
=
( )
=
Realised volatility
[+ ]
Implied volatility, e.g. Black-Scholes model:
= (, , , , )
Annualised volatility, e.g.
252
3
rjkse
10
2002
10
2003
2004
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
rsnp500
5
0
5
2002
2003
2004
Non-linear Models
Campbell, Lo and MacKinlay (1997)
A non-linear data generating process can be written
= ( , , , ...)
(2)
= ( , , ...) + ( , , ...)
(3)
1.2
1.3
Non-linearity Tests
= + + + ... + +
Other tests: the BDS test and the bispectrum test
Heteroscedasticity Revisited
A structural model can be written as follows:
= + + + +
where (0, )
The variance of errors is assumed to be constant and is known as homoscedasticity
or ( ) =
5
6
Time-varying Risk
Premium
Multivariate GARCH
GARCH-in-Mean
Diagonal-Vec MGARCH
Dynamic Conditional
Correlation (DCC)
Generalized ARCH
ARCH
Exponential Smoothing
Moving Average
Simple Regression
Historical Average
ARCH Class
Conditional Volatility
Constant Correlation
Asymmetry
Smooth Transition
Exponentially Weighted
Moving Average
Auto-Regressive Moving
Average (ARMA)
Threshold
Auto-Regressive
Historical Standard
Deviation
Asymmetric Power
GARCH
Quadratic GARCH
Exponential ARCH
Frationally Integrated
Auto-Regressive Moving
Average (ARFIMA)
Integrated
Auto-Regressive Moving
Average (ARIMA)
Random Walk
Volatility Forecast
Models
Stochastic Volatility
EWMA
2.1
EWMA Specication
EWMA
Exponentially Weighted Moving Average
= (1 ) ( )
(4)
where
denotes the estimate of the variance for period and also becomes
the forecast of future volatility for all periods
denotes the average return estimated over the observations
denotes the decay factor which determines how much weight is given
to recent versus older observation
2.2
The Advantages
EWMA
Exponentially Weighted Moving Average
Two advantages:
Volatility is likely inuenced by recent events, which carry more
weights
The effect on volatility of a single observation declines at an exponential rate as weights attracted to recent events fall
ARCH
3.1
ARCH Specication
ARCH
Autoregressive Conditionally Heteroscedastic
If heteroscedasticity exists, use a model which does not assume the variance is constant
Recall the denition of the variance :
= ( | , , , ...)
= [( ( )) | , , ...]
The variance is usually assumed to be ( ) = 0
= ( | , , ...)
= [ | , , ...]
ARCH
Autoregressive Conditionally Heteroscedastic (contd)
What could the current value of the variance of the errors possibly depend upon?
Previous squared error terms
This leads to the AutoRegressive Conditionally Heteroscedastic model
for the variance of the errors:
(5)
= +
The equation (5) is known as an ARCH(1) model
ARCH
Autoregressive Conditionally Heteroscedastic (contd)
The full model can be written as
= + + ... + + ,
where = +
8
(0, )
(6)
The ARCH model can be extended into the general case where the error
variance depends on lags of squared errors:
= + + + ... +
(7)
= + + ... + + ,
= +
(0, )
+ ... +
(8)
ARCH
Autoregressive Conditionally Heteroscedastic (contd)
Instead of (8), we can write
= + + ... + +
=
= +
(0, 1)
(9)
(8) and (9) are different way to express exactly the same model
(8) is easier to understand
(9) is required to simulate an ARCH model
3.2
2. Square the residuals, and regress them on own lags to test for ARCH
of order , for example
= + + + ... + +
where is iid. Also obtain from this regression
3. The test statistic is dened as or the number of observations multiplied by the coefcient of multiple correlation from the last regression
and is distributed as a ()
Testing ARCH Effect
(contd)
4. The null and hypotheses are
3.3
10
GARCH
4.1
GARCH Specication
= + +
(10)
= + +
= + +
Substituting into (10) for
= + + + +
= + + + +
(11)
= + + + + + +
= + + + + + +
= 1 + + + 1 + + +
An innite number of successive substitutions would yield
= 1 + + + 1 + + +
Therefore, the GARCH(1,1) model can be written as an innite order
ARCH model
11
= + + + ... +
+ + + ... + +
= +
+
=
( ) =
1 ( + )
(12)
when + < 1
Non-stationarity in variance is given by + 1
The conditional variance forecasts will not converge on their unconditional value as the horizon increases
Integrated GARCH is given by + = 1
12
4.2
The ML Estimation
= + + ,
(0, )
= + +
2. Specify the log-likelihood function to maximise:
1
1 ( )
log(2) log( )
2
2 =
2 =
3. The computer will maximise the function and generate parameter values
and their standard errors
Maximum Likelihood
Parameter Estimation
For simplicity, lets consider the bivariate regression case with homoscedastic errors:
= + +
Assume that (0, ) then ( + , )
Therefore, the probability density function for a normally distributed
random variable with the mean and variance is given by
1
| + , =
exp
(13)
13
Maximum Likelihood
Parameter Estimation (contd)
The implication of (13)
Successive values of would trace out the familiar bell-shaped
curve
Since the assumption of is iid, then will also be iid
The joint pdf for all the s can be expressed as a product of the individual
density functions:
, , ..., | + , = | + ,
| + , ...
| + ,
= | + ,
(14)
Maximum Likelihood
Parameter Estimation (contd)
Substituting into equation (14) for every from equation (13)
, , ..., | + , =
exp
=
2
(15)
Maximum Likelihood
Parameter Estimation (contd)
The typical situation we have is that the and are given and we want
to estimate , ,
14
( , , ) =
exp
(16)
2 =
2
Maximum likelihood estimation comprises of choosing parameter values ( , , ) that maximise this function
We want to differentiate (17) w.r.t. , , , but (17) is a product containing terms
Maximum Likelihood
Parameter Estimation (contd)
Since max () = max log(()), we can take logs of (17)
Using the various laws for transforming functions containing logarithms,
we get the log-likelihood function,
= log
1 ( )
log(2)
2
2 =
which is equivalent to
1 ( )
log log(2)
2
2
2 =
(17)
Maximum Likelihood
Parameter Estimation (contd)
Differentiating (17) w.r.t. , , , we get
1 ( )2 1
=
2 =
(18)
1 ( )2
=
2 =
(19)
1
1 ( )
2 2 =
(20)
15
Maximum Likelihood
Parameter Estimation (contd)
Setting (18)-(20) to zero to minimise the functions, and putting hats
above the parameters to denote the maximum likelihood estimators
From (18),
( ) = 0
= 0
= 0
1
1
= 0
(21)
Maximum Likelihood
Parameter Estimation (contd)
From (19),
( ) = 0
= 0
= 0
=
= ( )
= +
=
=
Maximum Likelihood
Parameter Estimation (contd)
16
(22)
From (20),
( )
=
=
1
( )
=
(23)
(0, )
= + + ,
= + +
=
1
1 ( )
log(2) log( )
2
2 =
2 =
()
=
=
+ +
18
(0, 1)
5.1
EGARCH Model
| |
(24)
+
log ( ) = + log ( ) +
The advantages of the model:
Because the model has log( ), will be positive even if the parameters are negative
The model accounts for the leverage effects: if the relationship between volatility and returns is negative, will be negative
19
5.2
GJR-GARCH Model
= + + +
where
1,
=
0,
(25)
if < 0
otherwise
A GJR-GARCH model is convariance stationary if and only if the parameter restrictions are satised and + + + < 1
For a leverage effect, > 0
+ 0 and 0 is required for non-negativity
The T-GARCH Model
Zakoian (1994) proposed TARCH(, , ) model
TARCH(1,1,1) can be written as follows:
= + | | + | | +
+ 0
where
1,
=
0,
20
if < 0
otherwise
(26)
0.2 0.3 0.4 0.5 0.6 0.7 0.8
GARCH
GJR-GARCH
-2
-1
5.3
GARCH-in Mean
The idea is to model the return of a security which is partly determine
by its risk
Engle, Lilien and Robins (1987) proposed the ARCH-M specication
= + +
(0, )
= +
(27)
6
6.1
Volatility Forecasting
Overview
21
GARCH is able to model the volatility clustering effect because the conditional variance is autoregressive
Such models can be also used to forecast volatility
We could show that
| , , ... = ( | , , ...)
Therefore modelling will give us models and forecasts for as well
Variance forecasts are additive over time
6.2
Volatility Forecasting
= +
(0, )
= + +
We need to generate forecasts of
+ | , + | , ..., + |
(28)
+ = + +
(29)
+
+
(30)
= +
= +
22
+
+
+
+
+
+
(31)
(32)
, = + +
Given , , the 2-step ahead forecast for made at time can be obtained by taking the conditional expectations of (30)
, = + (+ | ) + ,
(33)
(+ | ) = +
,
Therefore, the 2-step ahead forecast is given by
, = + , + ,
(35)
= + + ,
Forecasting Variances Using GARCH Models
(contd)
, = + + +
= + + ,
= + + + + ,
= + + + + ,
23
(36)
, = +
+ +
(37)
6.3
= (, , , , )
Conditional betas
, =
,
,
where
= hedge ratio
= correlation coefcient between change in spot price () and
change in futures price ( )
= standard deviation of
= standard deviation of
For time-varying covariance and correlation, the hedge ratio is
=
24
,
,
6.4
= 2( ) ()
Diagrammatic Representation
25
()
A
()
B
()
.
26
27
28
125
JPY
120
115
110
105
2003
2
2004
2005
2006
2007
2004
2005
2006
2007
RJPY
1
0
1
2
2003
29
Density
JPY
0.100
N(s=5.69)
0.075
0.050
0.025
100.0
102.5
Density
105.0
RJPY
N(s=0.44)
2.0
107.5
110.0
112.5
115.0
117.5
120.0
122.5
125.0
127.5
1.5
1.0
0.5
2.5
2.0
1.5
1.0
0.5
0.0
30
0.5
1.0
1.5
2.0
2.5
31
32
RJPY
Fitted
2
2003
5
2004
2005
2006
2007
2004
2005
2006
2007
2004
2005
2006
2007
r:RJPY (scaled)
5
2003
CondSD
0.6
0.4
2003
0.50
Forecasts
RJPY
0.25
0.00
0.25
0.50
2007624
0.0850
71
78
715
71
78
715
CondVar Forecasts
0.0825
0.0800
0.0775
0.0750
2007624
33
34
35
36
References
[1] Daniel B. Nelson Generalized Autoregressive Conditional Heteroskedasticity. Journal of Econometrics, Vol. 31, No. 2 (1991), 307?327.
[2] Robert F. Engle Autoregressive Conditional Heteroscedasticity with Estimates of the Variance of
United Kingdom Ination. Econometrica, Vol. 50, No. 4 (1982), 987-1008
[3] Robert F. Engle, David M. Lilien and Russell P. Robins Estimating Time Varying Risk Premia in the
Term Structure: The Arch-M Model. Econometrica, Vol. 55, No. 2 (1987), 391-407.
[4] Lawrence R. Glosten, Ravi Jagannathan, David E. Runkle On the Relation between the Expected
Value and the Volatility of the Nominal Excess Return on Stocks. The Journal of Finance, Vol. 48,
No. 5 (1993), 1779-1801.
[5] Daniel B. Nelson Conditional Heteroskedasticity in Asset Returns: A New Approach. Econometrica,
Vol. 59, No. 2 (1991), 347-370.
37