ThesisPDF Available

Modeling regime shifts Markov switching models Hidden Markov Model Applications

Authors:

Abstract

Regime shifts is an important aspect of financial economics, since some series have a tendency to fluctuate abruptly, such that the new trend stays for a long period. The Markov switching model captures volatility persistence, time varying correlations, skewness and kurtosis. The thesis considers discrete time space models using Hidden Markov models to study regime switching properties of the time series. The Hamilton (1988) model is referred, following the assumption that current regime is only dependent on the regime one time ago; then incorporated the extensions in regime switching models. The Hidden Markov models capture the nonlinear patterns, considering stochastic volatility, after the identification of the regimes, the probability denoting the changes in regimes is also estimated. Since the financial time series exhibits conditional variance that is not handled by traditional regression models, therefore autoregressive conditionally heteroscedastic model techniques are used. The regime switching technique extends GARCH model taking into account the variance structure considering economic states are different in the long term such that the shifts is governed by Markov chain process. The thesis describes Markov chain model, highlighting the State Space method and Kalman Filter. Then the Finite Markov mixture distribution is outlined, with emphasis on extensions of Markov switching models. Later studied the structural break in the variance of growth rate of GDP for the economies; USA, Pakistan, India and Japan. Further paper tries to capture linkages between national stock markets of six different economies taking into account the degree of dependence among markets. The real interest rate of Pakistan structural breaks were assessed, working on volatility clustering, interest rate trend is itself unrelated representing a mixture of distribution.
Modeling regime shifts
Markov switching models
Hidden Markov Model Applications
Fahad Javed Malik
Dept of Economics and Finance & Dept of Mathematical Sciences
Institute of Business Administration, Karachi
This thesis is submitted for the degree of
Bachelor of Science
Economics and Mathematics
Dr. Adnan Haider
Dr. Mohammad Nishat
Academic Program Coordinator Thesis Supervisor
Assistant Professor Professor and Associate Dean
Institute of Business Administration, Karachi. Institute of Business Administration, Karachi.
June 2016
Centre for Business and Economics
Research
Dr. Adnan Haider Dr. Mohammad Nishat
This thesis is dedicated to my dear parents.
Declaration
I hereby declare that except where specific reference is made to the work of others, the contents
of this dissertation are original and have not been submitted in whole or in part for consideration
for any other degree or qualification in this, or any other university. This dissertation is my o w n
work and contains nothing which is the outcome of work done in collaboration with others,
except as specified in the text and Acknowledgments. This dissertation contains fewer than
18,000 words including appendices, bibliography, footnotes, tables and equations and has fewer
than 15 figures.
Fahad Javed Malik
June 2016
Acknowledgements
Attaining this opportunity, I would like to thank all those individuals without whom my educa-
tion over the preceding years and particularly this thesis would not have been what they have
been.
First of all my thanks go to Professor Dr Muhammad Nishat for bringing the topic of this
thesis to my attention, for all the helpful advice throughout the time I worked on it. Moreover, I
am most grateful to him for enabling me to spend a Summer Semester in University of Oslo
and BI Norwegian Business School a year ago, to attend a research program in Norway.
Prof. Dr Muhammad Nishat patience, guidance and motivation as supervisor of this thesis,
for which I would like to thank very much indeed. Moreover, I am also heavily indebted to
Prof. Dr Adnan Haider for all the helpful advice and encouragement which he have given me
over the years. I would like to express my sincere gratitude to supervisors, for their insightful
comments and support and for giving me the opportunity to complete my thesis.
My special thanks for most helpful and influential discussions on the subjects of this thesis
go to Prof Dr. Javed Iqbal and Prof. Dr Muhammad Sheraz. For introducing me into mathemat-
ical research and all the helpful things that I learned from the very interesting work together
with them I am most grateful to Prof. Dr Nasir Touheed.
Moreover, I gratefully acknowledge the privilege of having been supported financially by
Institute of Business Administration Karachi via Merit Scholarship throughout my studies.
Furthermore, a special thank you is due to all my friends who made life and, in particular,
university life much more enjoyable. Last but not at all least I would like to thank especially
my family for simply everything.
Abstract
Regime shifts is an important aspect of financial economics, since some series have a tendency
to fluctuate abruptly, such that the new trend stays for a long period. The Markov switching
model captures volatility persistence, time varying correlations, skewness and kurtosis. The
thesis considers discrete time space models using Hidden Markov models to study regime
switching properties of the time series. The Hamilton (1988) model is referred, following the
assumption that current regime is only dependent on the regime one time ago; then incorporated
the extensions in regime switching models. The Hidden Markov models capture the non-
linear patterns, considering stochastic volatility, after the identification of the regimes, the
probability denoting the changes in regimes is also estimated. Since the financial time series
exhibits conditional variance that is not handled by traditional regression models, therefore
autoregressive conditionally heteroscedastic model techniques are used. The regime switching
technique extends GARCH model taking into account the variance structure considering
economic states are different in the long term such that the shifts is governed by Markov
chain process.
The thesis describes Markov chain model, highlighting the State Space method and Kalman
Filter. Then the Finite Markov mixture distribution is outlined, with emphasis on extensions of
Markov switching models. Later studied the structural break in the variance of growth rate of
GDP for the economies; USA, Pakistan, India and Japan. Further paper tries to capture linkages
between national stock markets of six different economies taking into account the degree of
dependence among markets. The real interest rate of Pakistan structural breaks were assessed,
working on volatility clustering, interest rate trend is itself unrelated representing a mixture of
distribution.
Table of contents
List of figures xiii
List of tables xv
Nomenclature xvii
1 Introduction 1
1.1 Markovchains................................... 1
1.2 Estimation.................................... 2
1.3 Smoothing.................................... 5
1.4 Time-varying transition probabilities . . . . . . . . . . . . . . . . . . . . . . . 7
1.5 State Space Methods and Kalman Filter . . . . . . . . . . . . . . . . . . . . 8
1.6 HiddenMarkovModels............................. 9
1.7 Hidden Markov Model Parameter Estimation . . . . . . . . . . . . . . . . . . 11
2 Markov Switching Models 17
2.1 Finite Markov Mixture Distributions . . . . . . . . . . . . . . . . . . . . . . . 17
2.2 Statistical Modeling Based on Finite Markov Mixture Distributions . . . . . 18
2.2.1 The Basic Markov Switching Model . . . . . . . . . . . . . . . . . . 18
2.2.2 The Markov Switching Regression Model . . . . . . . . . . . . . . 19
2.3 Nonlinear Time Series Analysis Based on Markov Switching Models . . . . . 19
2.4 The Markov Switching Autoregressive Model . . . . . . . . . . . . . . . . 20
2.4.1 ModelDenition............................ 20
2.5 Markov Switching Dynamic Regression Models . . . . . . . . . . . . . . . . . 21
2.6 Markov Switching Conditional Heteroscedasticity . . . . . . . . . . . . . . . 22
2.6.1 Switching ARCH Models . . . . . . . . . . . . . . . . . . . . . . . 23
List of figures
3.1 Real GDP growth rate GARCH(1,1) Model: Conditional Variance . . . . . . 36
3.2 Markov switching heteroskedasticity model Variance: India . . . . . . . . . . . 37
3.3 Markov switching heteroskedasticity model Variance: Pakistan . . . . . . . . 38
3.4 Markov switching heteroskedasticity model Variance: USA . . . . . . . . . . 38
3.5 Markov switching heteroskedasticity model Variance: Japan . . . . . . . . . 39
4.1 Markov switching model: Filtered probability India . . . . . . . . . . . . . . . 51
4.2 Markov switching model: Filtered probability Germany . . . . . . . . . . . . 52
4.3 Markov switching model: Filtered probability UK . . . . . . . . . . . . . . . 52
4.4 Markov switching model: Filtered probability Pakistan . . . . . . . . . . . . 53
4.5 Markov switching model: Filtered probability USA . . . . . . . . . . . . . . 53
4.6 Markov switching model: Filtered probability China . . . . . . . . . . . . . 54
5.1 Markov switching model Filtered probability Pakistan Real Interest Rate . . . 63
5.2 Markov switching model: Pakistan Real Interest Rate Actual versus Fitted Plot 64
List of tables
3.1 DescriptiveStatistics ............................... 31
3.2 Real GDP growth rate: GARCH(1,1) Model Estimation . . . . . . . . . . . . 34
3.3 Diagnostic Tests: GARCH(1,1) Model Estimation . . . . . . . . . . . . . . . 35
3.4 Real GDP growth rate: Markov Switching Heteroskedasticity Model estimation 36
4.1 Descriptive Statistics: Stock Market Return . . . . . . . . . . . . . . . . . . 46
4.2 Correlation of Stock Prices . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
4.3 Stock return: Markov switching heteroscedasticity model . . . . . . . . . . . 48
4.4 Estimated Probability of remaining in the same State . . . . . . . . . . . . . 49
4.5
Concordance Correlation Coefficient between probabilities of High volatility state
49
4.6
Concordance Correlation Coefficient between probabilities of Low volatility state
50
5.1 Estimates: Real Interest Rate Linear Model and Markov Switching Model . . . 61
5.2 Estimates: Real Interest Rate Transition Probabilities and Expected Duration 62
A.1 Chapter 3: Correlation Analysis . . . . . . . . . . . . . . . . . . . . . . . . 73
A.2 Chapter 5: Descriptive Statistics . . . . . . . . . . . . . . . . . . . . . . . . 73
A.3 Chapter 5: Real Interest Rate Unit Root Test . . . . . . . . . . . . . . . . . . 74
Nomenclature
Acronyms / Abbreviations
BiNom Binomial Distribution
EGARCH Exponential GARCH
TGARCH Threshold GARCH
AR Autoregressive
ARCH Autoregressive Conditional Heteroskedasticity
ARMA Autoregressive–Moving-Average
BSE Bombay Stock Exchange Index
CAPM Capital Asset Pricing Model
SSE Shanghai Stock Exchange
EM Expectation–maximization
FTSE Financial Times Stock Exchange Index
GARCH Generalized Autoregressive Conditional Heteroskedasticity
GARCH M GARCH-in-mean
HMM Hidden Markov models
IID Independent and identically distributed
JB Jarque Bera statistic
xviii Nomenclature
KSE Karachi Stock Exchange Index
LB Ljung-Box statistic
MCMC Markov chain Monte Carlo
MLE Maximum-Likelihood Estimation
MSAR Markov Switching Autoregressive
MSM Markov Switching Model
NYSE New York Stock Exchange Index
RSV Regime Switching Stochastic Volatility
SSM State Space Method
SWARCH Markov Switching Autoregressive Conditional Heteroskedasticity
VAR Vector Autoregression
Chapter 1
Introduction
The canonical a probabilistic model for temporal or sequential data is what’s called a Markov
model and Markov models are named after Andrey Markov who was a Russian mathematician
who did some work in stochastic processes in the late 1800 and early nineteen hundreds. The
idea behind a Markov model in exploits a very deep intrinsic fact about the real world; that
is the future is independent of the past given the present. In easier terms, if you know the
exact state of the world right now and you want to utilize that knowledge to predict the future
then knowing something about the past of the world 1 second ago will not be helping you in
predicting the future because you know everything that you want to know is already encoded in
the current state of the world.
1.1 Markov chains
Markov chain is a stochastic process defined as set of states
{St,t=0,1, ...}
using finite integer
values denoted by i,j. The probability of any future state value
St+1
is equivalent to j, therefore
the conditional probability distribution of any future state value
St+1
, given the past states
S0,S1, ..., St1
and the present state
St
is only dependent on the present state and independent of
the past states.
P{St+1=j|St=it,St1=it1,S1=i1,S0=i0}=P{St+1=j|St=it}=pi j (1.1)
pi j
is the transition probability from i to j, that is the probability of the next state will be j with
immediately preceding state is i. Assuming there are N states, thena transition matrix can be
2Introduction
used to describe all the transitions.
M=
p11 p12 . . . p1N
p21 p22 . . . p2N
. . . . . .
. . . . . .
pN1pN2. . . pN N
(1.2)
The transition probability
pi j
is non-negative, every state including the current state must transit
into some state such that: N
j=1
pi j =1,i=1,2, ..., N(1.3)
The above matrix signifies one-step transition probabilities. The one step case can be extended
to n-step transitions which is a result of multiple one-step transitions. A two-step transition
probability can described as
PSt+2=j|St=i
that is the summation of transition probabilities
from state i to all states and then similarly from all the states into state j, that is:
N
k=1
P{St+2=j|St+1=k}P{St+1=k|St=i}
In general, the n-step transition probability matrix can be written:
P{St+n=j|St=i}=pn
i j (1.4)
The multi-step transition probabilities can be calculated using the Chapman-Kolmogorov
equation:
pm+n
i j =
N
k=1
pn
ik pm
k j,i,j=1,2, ...N(1.5)
1.2 Estimation
The Markov chain process can be estimated through filtering by taking the joint conditional
probabilities of the future states as a joint conditional probabilities function of current states
and the transition probabilities. It is expressed as filtering because the conditional probabilities
of current states as input, that go through the procedure of filtering or system of dynamic
transformation that is transition probability matrix to derive the output in the form of conditional
1.2 Estimation 3
probabilities of the future states. Suppose, there is a simple two state Markov chain:
yt=µ1S1+µ2S2+εt(1.6)
where
S1=1
in state 1 and 0 otherwise,
S2=1
in state 2 and 0 otherwise, and the white noise
residual is
εt
. The joint probability of
yt
and
St
that transits over time is need to be calculated.
It can be derived in two steps. First, the conditional probability of being in state
st
, given
information available at time t-1 that is PSt=st|yt1.
PSt=st|yt1=PSt=st|St1=st1×PSt1=st1|yt1(1.7)
Second, the probability of being in state
st
, given information available at time t
PSt=st|yt
,
hence calculating the joint probability distribution of Stand ytthat is:
fyt,St=st|yt1
=fyt|St=st,yt1×PSt=st|yt1
=fyt|St=st,yt1×PSt=st|St1=st1×PSt1=st1|yt1
(1.8)
Then the joint probability of ytand Stcan be calculated as:
PSt=st|yt=fyt,St=st|yt1
fyt|yt1(1.9)
Let
yt
be N state Markov chain process, with autoregression of order
r
and residual
εt
and is
a function of exogenous variable
xt
and its lags known as dynamic process of autoregression.
When
yt
is
r
order autoregression, then joint conditional probability of the current state and
r
previous states, given information including all lag values of rperiods before 0; therefore:
PSt=st,St1=st1, ..., Str=str|t1(1.10)
is being considered, with
t1=yt1,yt2,..., ytr,xt1,xt2, ..., xtr
is the available infor-
mation at time t-1. The filtering procedure is being used again to determine the updated version
of probability of being in state
st
, based on information available at time t-1, by deriving joint
4Introduction
density of Ytand St:
fyt,St=st,St1=st1, ..., Str1=str1|t1
=fyt|St=st,St1=st1, ..., Str1=str1,t1
×pSt=st,St1=st1, ..., Str1=str1|t1
=fyt|St=st,St1=st1, ..., Str1=str1,t1
×PSt=st|St1=st1)×PSt1=st1, ..., Str1=str1|t1
(1.11)
Then the probability density function of ytis calculated:
fyt|t1
=
N
st=1
N
st1=1
...
N
str=1
fyt,St=st,St1=st1, ..., Str1=str1|t1(1.12)
It is not yet the filter output, hence calculating the non-serial correlation residual:
PSt=st,St1=st1, ..., Str1=str1|t
=fyt,St=st,St1=st1, ..., Str1=str1|t1
fyt|t1
(1.13)
Hence the output of the filter is:
PSt=st,St1=st1, ..., Str=str|t
N
str1=1
PSt=st,St1=st1, ..., Str1=str1|t(1.14)
Above is the probability of the states at time t, and based on current information it can be
obtained as:
PSt=st|t=
N
st=1
N
st1=1
...
N
str=1
PSt=st,St1=st1, ..., Str=str|t(1.15)
Similarly the likelihood function can be derived:
Lθ=
T
t=1
fyt|t1;θ(1.16)
1.3 Smoothing 5
where
θ
is the vector of parameters. Techniques like Gibbs sampling and the EM algorithm can
be used to estimate log likelihood function however maximum likelihood is still a convenient
and most appropriate method. Based on the assumption of two state Markov chain process,
with a normally distributed residual then the maximum likelihood function can be extended as
follows:
Lθ=
T
t=1
fyt|t1;θ
=
T
t=1
2
st=1
fyt|St=st,yt1;θ×PSt=st|St1=st1
=
T
t=1
2
st=1
2
st1=1fyt|St=st,yt1;θ×PSt=st|St1=st1
×PSt1=st1|yt1
=
T
t=11
2πσε
expytµ12
2σ2
ε×p11 ×P
tL 1+p21 ×P
tL 2
+1
2πσε
expytµ12
2σ2
ε×p21 ×P
tL 1+p22 ×P
tL 2
(1.17)
where P
tL 1=PSt1=1|yt1and P
tL 2=PSt1=2|yt1.
1.3 Smoothing
Smoothing is a process of inferring about the present states, using future information. It is
the probability of the states at time t given current information available, therefore revising
PSt=st|t
. Therefore,
PSt=st|T
replacing
t
by
T
in the probability. In case there
are no lags in
yt
, then smoothing is carried out in two steps; otherwise three steps and including
one approximation when there are lags in yt.
6Introduction
For simplification setting St=stto St:
PStr, ..., St,St+1|T
=PStr+1, ..., St,St+1|T×PStr|Str+1, ..., St,St+1,T
=PStr+1, ..., St,St+1|T×PStr|Str+1, ..., St,St+1,t
=PStr+1, ..., St,St+1|T×PStr, ..., St+1,t
PStr+1, ..., St,St+1|T
=PStr+1, ..., St,St+1|T×PStr, ..., St,t×PSt+1|St
PStr+1, ..., St,St+1|t
(1.18)
The second inequality is equivalent: PStr|Str+1, ..., St,St+1,T
=PStr|Str+1, ..., St,St+1,tiff
fyt+1,Tt |Str,Str+1, ...St,St+1,t
=fyt+1,Tt |Str+1, ...St,St+1,t(1.19)
Because T l =Tthence it follows:
PStr|Str+1, ...St,St+1,T=PStr|Str+1,...St,St+1,t,T t
=fyt+1,StrTt |Str+1, ...St,St+1,t
fyt+1,Tt |Str+1, ...St,St+1,t
=fyt+1,Tt |Str, ...St,St+1,t×PStr|Str+1, ...St,St+1,t
fyt+1,Tt |Str+1, ...St,St+1,t
(1.20)
Then summing it up over
St+1=1,2, ..., N
to get a smoothed states with residual with no serial
correlation is derived.
PStr, ..., St|T=
N
St+1=1
PStr, ..., St,St+1|T(1.21)
The above equation of smoothed states is derived when no lagged
yt
exist however when there
are lags then similar equation is set by summation:
PSt|T
N
st=1
N
st1=1
...
N
str=1
PSt,St1, ..., Str|T(1.22)
1.4 Time-varying transition probabilities 7
1.4 Time-varying transition probabilities
The extension of Markov chain process is addition of time-varying transition probabilities that
makes the model more flexible and can be defined as:
P{St+1=j|St=it|t+1}=pi j(t+1)(1.23)
And the transition probability matrix is given as:
P(t) =
p11(t)p12(t). . . p1N(t)
p21(t)p22(t). . . p2N(t)
. . . . . .
. . . . . .
pN1(t)pN2(t). . . pN N (t)
(1.24)
The time varying transition probabilities are used in binary models in the form of logit and
probit models. The exponential function and cumulative distribution function are symmetric
therefore deviation from the mean value will increase the probability. The time varying functions
are similar to smoothed transition models, however the advantage of time varying transition
probabilities is within the interval
0,1
. The logit function of transition probabilities is:
pi j(t) = 1
1+exp{−tβ
i j}(1.25)
where
β
i j
is the coefficient vector of the set of endogenous and exogenous variables. In simpler
terms, tβ
i j =ωi j0γi j yt1then we have:
pi j(t) = 1
1+exp{ωi j 0γi jyt1
(1.26)
The above equation has direct relation and implications, for instance symmetric responses are
only related to amount of deviation from central or equilibrium values no matter what is the
direction; however in asymmetric effect direction or sign of deviation is important.
8Introduction
1.5 State Space Methods and Kalman Filter
The state space models are used in dynamic time series models containing unobserved variables.
The simplest SSM is first order vector autoregressive model with:
yt=Γyt1+wt(1.27)
where the states
yt,i
is determined from the past states
yt1,j
,
j=1,2, ..., p
for
i=1,2, ..., p
and time
t=1,2, ..., n
. In order to ensure completeness,
wt
are
p×1
Gaussian white noise
vectors with covariance
Q
. The initial value of the state process is
y0
taken from variables being
normally distributed with
µ0
mean vector and covariance matrix. Though the state vector is not
observed but after some observed transformations, the equation can be written as:
zt=Atyt+vt(1.28)
Next, estimates are derived for particular point in time using Gaussian assumptions on unob-
served state vector.Based, on the initial conditions with mean
y0|0=µ0
and covariance
P
0|0=0
at time t=1,2, ..., n.
yt|t1=Γyt1|t1(1.29)
P
t|t1=ΓP
t1|t1Γ+Q(1.30)
yt|t=yt|t1+KtztAtyt|t1(1.31)
Then the Kalmn gain matrix is given as:
Kt=P
t|t1A
tAtP
t|t1A+R1(1.32)
and the covariance matrix P
t|tis given as:
P
t|t=IKtAtP
t|t1(1.33)
The innovation vector of the basic state vector equation can be defined as:
vt=ztAtyt|t1(1.34)
1.6 Hidden Markov Models 9
and its covariance as
t
=AtP
t|t01A
t+R(1.35)
This is called Kalman smoother with initial conditions
yT|T
and
P
T|T
. Then the smoother
algorithm can be explained as:
yt1|T=yt1|t1+Jt1yt|Tyt|t1(1.36)
P
t1|T=P
t1|t1+Jt1P
t|TP
t|t1J
t1(1.37)
Jt1=P
t1|t1+ΓP
t|t11(1.38)
Therefore during the filtering process of the smoothing algorithm, values are generated as
yt|t
and
P
t|t
. Assuming parameters are known, however if parameter are required to be determined;
it can be done by maximizing innovation form of likelihood function. The likelihood function
can be expressed as:
logL=1
2
T
t=1
log |
t
Θ| −1
2
T
t=1
v
tΘ1
tΘvtΘ(1.39)
where
Θ
is the vector of unknown parameters to emphasize upon the dependence of log
likelihood function on the parameters of the model.
1.6 Hidden Markov Models
The basic concepts of Hidden Markov Models will be related to state space models, the simplest
model involving specific number of counts is Poisson process. Variables are independently
and identically distributed, hence the variance is equivalent to mean. We can generate specific
number of counts by using two different Poisson processes with different means, then one of the
means are selected by using a random process known as emission process. Given the emission
process is a Markov chains, then serial dependence along with over dispersion will be found
in the observed counts. The word hidden itself signifies that states cannot be only estimated
using the observed data, there also exist hidden or latent data following a Markov chain process.
Therefore in Hidden Markov Model unobserved sequence of states following Markov chain
with finite state space, hence the probability of any observation at any time t is dependent upon
the current state.
10 Introduction
The Hidden Markov Model that is a mixture of densities over time, can also be used as an
alternative to kalman filter process which is factor analysis over time. The Kalman filter
involves linear measurements including linear evolution of states whereas Hidden Markov
Model represents a nonlinear evolution and measurement of one state at a time. In Hidden
Markov Mode, the Viterbi algorithm to estimate the most likely state sequence while the Baum-
Welch forward-backward algorithm is used to estimate the parameters following Expectation
Maximization. In order to derive HMM we include the following:
Hidden states: The states of a system that can be described by a Markov process,
Observable states: The states of the process that are visible, i.e. measurable,
Pi-Vector: Contains the probability that the model is in one of the hidden states at the
initial time,
State transition matrix: Contains the probability that a hidden state will evolve to another
state, given the previous state,
Emission probability matrix: Contains the probability that a particular measurable state
can be observed, provided that the model is in one of the hidden states.
A hidden Markov model is a standard Markov process augmented by a set of measurable states
and several probabilistic relations between those states and the hidden states. In developing
the algorithms for the model, we write the joint probability over hidden
yt
and observed
xt
states, and then use the Markov property to simplify it. This property allows us to assume that
all information about the history of the states is summarized by the value of the state at the
previous time step. The total probability is,
pyT,xT=py1
T
t=2
pyt|yt1
T
t=2
pxt|yt(1.40)
where
yT
denotes
y1,y2, ..., yT
. Following three matrices are used in characterization of
HMM:
Ai,j=pyt=i|yt1=j
Bt,i=pxt|yt=i
πt,i=pyt=i
(1.41)
1.7 Hidden Markov Model Parameter Estimation 11
These three equations define a HMM, hence referred as a set
λ={A,B,π}
; further we also
require following definitions:
γt,i=pyt=i|xT(1.42)
t,i j =pyt=i,yt1=j|xT(1.43)
αt,i=pyt=i|xt(1.44)
κt,i=pxt|xt1(1.45)
βt,i=pxt+1, ..., XT|yt=i
pxt+1, ..., XT|xt(1.46)
1.7 Hidden Markov Model Parameter Estimation
The likelihood function over N iterations is used to determine the parameters:
Q=
N
n=1ZdyTpyT|xT,nlogpy1
T
t=2
pyt|yt1
T
t=1
pxn
t|yt
=
nZdy1py1|xT,nlog py1
+
n
T
t=2Zdytdyt1pyt,yt1|xn
tlog pyt|yt1
+
n
T
t=1Zdytpyt|xn
tlog pxn
t|yt
=
n
t
γn
1,ilogπ1,i+
n
T
t=2
i,j
n
t,i j log Ai,j+
n
T
t=1
i
γn
t,ilogBt,i
(1.47)
Baum-Welch Algorithm (1970) is used to maximize the likelihood function simplified above,
since hidden states are known. The result will be equivalent to Expectation Maximization
Algorithm.The above equation will be solved by adding two constraints:
i
π1,i=1 (1.48)
i
Ai,j=1j(1.49)
12 Introduction
Finally, using Lagrange multiplier the updated objective function is:
L=Qλπ
i
π1,i1
j
λjAi,j1(1.50)
In order to maximize, the derivative of the objective function is calculated and equated to zero:
L
∂ π1,i
=
n
γn
1,i
π1,iλπ=0πnew
1,i=λπ
N
N
n=1
γn
1,i(1.51)
Thus Lagrange coefficient is derived from constraint relation as:
πnew
1,i=1
N
N
n=1
γn
1,i(1.52)
Similarly for matrix A:
L
Ai,j
=
n
T
t=2
n
t,i j
Ai,jλj=0 (1.53)
Anew
i,j=λj
NT1
N
n=1
T
t=2
n
t,i j
=N
n=1T
t=2n
t,i j
N
n=1T
t=2γn
t1,j
(1.54)
where
n
t,i j =γn
t1,j
representing the ratio of expected number of transitions from state j to state
i, divided by number of transitions from state j.
Next, it is determination of emission probabilities that can expressed as:
pxt|yt=i;θ=Bt,iθ(1.55)
where θcan be termed as:
θnew =argmax
θ
N
n=1
T
t=1
i
γn
t,ilogBt,iθ(1.56)
Here we are only concerned with discrete output, therefore parameters are probability masses
for k states of the output, xt:
Bk,i=pxt=k|yt=i(1.57)
1.7 Hidden Markov Model Parameter Estimation 13
Assuming time is independent, and must satisfy the constraint:
k
Bk,i=1i(1.58)
Similarly, we again use Lagrange multiplier to show that:
Bnew
k,i=n,ts.t.sn
t=kγn
t,i
N
n=1T
t=1γn
t,i
(1.59)
The matrix B is the expected number of times the system is in state i, there is k that is divided
by the expected number of times the system is in state i; the maximization algorithm is now
completed.
After maximization step of the algorithm, quantities
t,i j
and
γt,i
are computed. It is achieved
by computing αt,i,κt,βt,ias follow:
αt=pyt|xt=yt1pxt|ytpyt|yt1pyt1|xt1
pxt|xt1(1.60)
αt,i=jBt,iAi,jαt1,j
κt
(1.61)
Therefore the forward variable αis obtained:
αt=py1|x1=px1|y1py1
y1px1|y1py1(1.62)
α1,i=B1,iπ1,i
jB1,jπ1,j
(1.63)
Then for calculating κ:
κt=pxt|xt1=
yt
yt1
pxt|ytpyt|yt1pyt1|xt1(1.64)
κt=
i,j
Bt,iAi,jαt1,j(1.65)
14 Introduction
and the initial value is:
κ1=px1
=
y1
px1|y1py1(1.66)
Now through backward recursions, we initiate as follows:
βt1=pxt, ..., xT|yt1
pxt, ..., xT|xt1
=ytpxt, ..., xT,yt|yt1
pxt|xt1pxt+1,..., xT|xt
=ytpxt, ..., xT|ytpyt|yt1
pxt|xt1pxt+1,..., xT|xt
=ytpxt|ytpxt+1, ..., xT|ytpyt|yt1
pxt|xt1pxt+1,..., xT|xt
(1.67)
Then
βt1,j=iBt,iβt,iAi,j
κt
(1.68)
Afterwards, this recursion is started, that is:
βT1,j=pxT|yT1
pxT|xT1
=yTpxT|yTpyT|yT1
κT
(1.69)
Similarly, for initializing:
βT1,j=iBT,jAi,j
κT
(1.70)
βT,j=1 (1.71)
1.7 Hidden Markov Model Parameter Estimation 15
The recursions of γtand tcan be calculated using the values of αt,κt,βt:
γt=pyt|xt
=pyt,xT
pxT
=pxt+1, ..., xT,yt|xtpxt
pxt+1, ..., xT|xtpxt
=pyt|xtpxt+1,..., xT|xt
pxt+1, ..., xT|xt
(1.72)
Hence γt,iis attained:
γt,i=αt,iβt,i(1.73)
And the final recursion:
t=pyt,yt1|xT
=pxt, ..., xT|ytpyt|yt1pyt1|xt1
pxt|xt1pxt+1,..., xT|xt1
(1.74)
Finally, the expectation step is derived:
t,i j =Bt,iβt,iAi,jαt1,j
κt
(1.75)
If the convergence is achieved, then EM-algorithm is an alternate to M-step and E-step.
Chapter 2
Markov Switching Models
The chapter starts with the definition of a finite Markov mixture distribution, whose properties
are studied in some detail and introduces the basic Markov switching model and deals with its
extensions.
Finite mixture models are extended to deal with time series data that exhibit dependence over
time. Broadly speaking, this is achieved by substituting the discrete latent indicator
Si
introduced
as an allocation variable for finite mixture models by a hidden Markov chain. This leads to
a surprisingly rich class of nonlinear time series models that solve a variety of interesting
problems in applied time series analysis.
2.1 Finite Markov Mixture Distributions
Let
yt=1, ..., T
denote a time series of T univariate observations taking values in a sampling
space
y
which may be either discrete or continuous. As common in time series analysis,
yt=1, ..., Tis considered to be the realization of a stochastic process {Yt}T
t=1.
It is assumed that the probability distribution of the stochastic process
Yt
depends on the
realizations of a hidden discrete stochastic process
St
. The stochastic process
Yt
is directly
observable, whereas
St
is a latent random process that is observable only indirectly through
the effect it has on the realizations of
St
. A simple example is the hidden Markov chain model
Yt=µSt+εtwhere εtis a zero mean white noise process with variance σ2
18 Markov Switching Models
2.2 Statistical Modeling Based on Finite Markov Mixture
Distributions
Researchers have found Markov mixture models increasingly useful in applied time series
analysis.
2.2.1 The Basic Markov Switching Model
Assume that a time series
y1, ..., yT
is observed as a single realization of a stochastic process
Y1,..., YT
. In the basic Markov switching model the time series
y1, ..., yT
is assumed to be a
realization of a stochastic process
Yt
generated by a finite Markov mixture from a specific
distribution family:
Yt|StTθSt(2.1)
where
St
is a hidden K state Markov chain, where
Yt
satisfies the assumption; Conditional
on knowing
S= (S0, ..., ST)
, the random variables
Y1,..., YT
are stochastically independent.
For each
t1
, the distribution of
Yt
arises from one out of K distributions T
θ1, ...,
T
θK
,
depending on the state of St
Yt|St=kTθk(2.2)
The basic Markov switching model found widespread applications in many practical areas
including bioinformatics, biology, economics, finance, hydrology, marketing, medicine, and
speech recognition. Various terminology became usual to denote models based on hidden
Markov chains. Because one may choose Markov mixtures of any discrete distribution, it is
possible to model many different types of discrete valued time series data, for example, binary
time series by
PYt=1|St=πSt(2.3)
time series of bounded counts by a Markov mixture of binomial distributions,
Yt|StBiNomnt,πSt(2.4)
or time series of unbounded counts by a Markov mixture of Poisson distributions,
Yt|StPµSt(2.5)
2.3 Nonlinear Time Series Analysis Based on Markov Switching Models 19
An important feature of applying Markov mixture models to discrete-valued time series is the
ease with which autocorrelation is introduced, and the properties of the marginal distribution
are easily analyzed. The basic Markov switching model has been generalized in several ways as
outlined in the following sections.
2.2.2 The Markov Switching Regression Model
An early attempt at introducing Markov switching models into econometrics in order to deal
with time series data that depends on exogenous variables is the switching regression model of
Goldfeld and Quandt (1973), which extends the switching regression model (Quandt, 1972)
described earlier in subsection 2.2.1. Whereas Quandt (1972) assumes that
St
is an i.i.d. random
sequence, Goldfeld and Quandt (1973) allow explicitly for dependence between the states by
modeling St as a two-state hidden Markov chain. The general Markov switching regression
mode is:
Yt=xtβSt+εt,εtN0,σ2
ε,St(2.6)
where
St
is a hidden Markov chain and
xt
is a row vector of explanatory variables including
the constant.For discrete-valued explanatory variables, the Markov switching regression model
will suffer from the same identifiability problems as the standard finite mixture of regression
models.
2.3 Nonlinear Time Series Analysis Based on Markov Switch-
ing Models
In practical time series analysis, an important aspect is properties of the marginal distribution of
Yt
as well as properties of the one-step ahead predictive density
pyt|yt1,ϑ
, implied by the
chosen time series model. Typical stylized facts of the marginal distribution of practical time
series are asymmetry and nonnormality with rather fat tails, and autocorrelation not only in the
level Ytbut also in the squared process Y2
t.
It is well known that standard ARMA models (Box and Jenkins, 1970) often are not able to
capture stylized facts of practical time series. Some unrealistic features of ARMA models based
on normal errors are normality of the predictive as well as the marginal density, linearity of the
expectation EYt|yt1,ϑand homoscedasticity of VarEYt|yt1,ϑ.
This chapter discusses Markov switching models that constitute another very flexible class of
20 Markov Switching Models
nonlinear time series models and are able to capture many features of practical time series by
appropriate modifications of the basic Markov switching model introduced in subsection 2.2.1.
Section 2.4 deals with the Markov switching autoregressive model and Section 2.5 considers
the related Markov switching dynamic regression model and later highlights that Markov
switching models give rise to very flexible predictive distributions. Section 2.6 deals with
Markov switching conditional heteroscedasticity and switching ARCH models are introduced.
Later the section studies further extensions, namely hidden Markov chains with time-varying
transition probabilities and hidden Markov models for longitudinal data and multivariate time
series.
2.4 The Markov Switching Autoregressive Model
Markov mixture model introduces autocorrelation in the process
Yt
even for the basic Markov
switching model, where conditionally on knowing the states, the process
Yt
is uncorrelated. In
this section the Markov switching autoregressive model is introduced that deals with autocorre-
lation in a more flexible way than the basic Markov switching model.
2.4.1 Model Definition
The standard model to capture autocorrelation is the AR(p) model,
Ytµ=δ1Yt1µ+... +δpYtpµ+εt(2.7)
where εtN0,σ2
εwhich is equivalent to model:
Yt=δ1Yt1+... +δpYtp+ζ+εt(2.8)
where
ζ=µ1δ1... δp
. An important extension of the basic Markov switching model
is the Markov switching autoregressive (MSAR) model, where a hidden Markov chain is
introduced into model and through the work of Hamilton (1989) who allowed for a random
shift in the mean level of process (2.7) through a two-state hidden Markov chain:
YtµSt=δ1Yt1µSt1+... +δpYtpµStp+εt(2.9)
2.5 Markov Switching Dynamic Regression Models 21
An important alternative to model (2.9) was suggested by McCulloch and Tsay (1994b), who
introduced the hidden Markov chain into (2.8) rather than into (2.7), by assuming that the
intercept is driven by the hidden Markov chain rather than the mean level:
Yt=δ1Yt1+... +δpYtp+ζSt+εt(2.10)
Although the parameterization (2.7) and (2.8) are equivalent for the standard AR model, a model
with a Markov switching intercept turns out to be different from a model with a Markov switch-
ing mean level. In (2.9), after a one-time change from
St1
to
St̸=S+t1
, an immediate mean
level shift from
µSt1
to
µSt
occurs. Both models violates assumption;
Yt|St=k
T
θk
as the
one-step ahead predictive density
pyt|yt1,St,ϑ
depends upon past values
yt1
. For a model
with switching mean level it is evident from (2.9) that the predictive density pyt|yt1,St,ϑ
depends not only on
St
but also on the past values
St1, ..., Stp
of the hidden Markov chain
fulfilling only assumption;
pyt|yt1,St,ϑ=pyt|yt1,St, ..., Stp,ϑ
. On the other hand
for a model with switching intercept the predictive density
pyt|yt1,St,ϑ
depends only
on
St
and such a process fulfills the stronger condition;
pyt|yt1,St,ϑ=pyt|yt1,St,ϑ
.
As a result, econometric inference for an MSAR model with switching intercept is not more
complicated than for the basic Markov switching model, whereas for an MSAR model with
switching mean inference on the hidden Markov chain St is far more involved.
The assumption that the autoregressive parameters switch between the two states implies differ-
ent dynamic patterns in the various states, and introduces asymmetry over time. Asymmetry
over time between the states is introduced also through the hidden Markov chain as different
persistence probabilities imply different state duration. Subsequently the notation MS(K)-AR(p)
is used occasionally to denote a Markov switching autoregressive model with K states and
autoregressive order p. A more subtle notation that also differentiates between homo- and
heteroscedastic variances, switching in the mean level or in the intercept as well as between
invariant and switching autoregressive parameters is introduced in Krolzig (1997).
2.5 Markov Switching Dynamic Regression Models
An important extension both of Markov switching autoregressive models and the Markov switch-
ing regression model, discussed in section 2.5, is the Markov switching dynamic regression
22 Markov Switching Models
model.
YtµStztβ=δ1Yt1µSt1zt 1β+... +δpYtpµStpztpβ+εt(2.11)
where the regression coefficient
β
is considered to be unaffected by
St
. In the following
dynamic regression model all parameters, including the regression coefficient
β
, are affected by
endogenous regime shifts following a hidden Markov chain:
Yt=δSt,1Yt1+... +δSt,pYtp+ztβSt+ζSt+εt(2.12)
For estimation it is useful to view this model as a Markov switching regression model as in
Subsection 2.2.1, without distinguishing between endogenous variables, exogenous variables,
and the intercept:
Yt=xtβSt+εt(2.13)
where
xt=yt1...ytpzt1...ztd 1
. In the mixed-effects Markov switching dynamic regression
model only certain elements of the parameter
βSt
in (2.13) actually depend on the state of the
hidden Markov chain, and others are state independent (McCulloch and Tsay, 1994b):
Yt=xf
tα+xr
tβSt+εt(2.14)
where
xf
t
are those columns of
xt
that correspond to the state-independent parameters
α
whereas
the columns of xr
tcorrespond to the state-dependent parameters.
2.6 Markov Switching Conditional Heteroscedasticity
Markov switching models are often used by researchers to account for specific features of
financial time series such as asymmetries, fat tails, and volatility clusters.
To deal with skewness and excess kurtosis in the unconditional distribution of daily stock returns
standard finite mixtures of normal distributions have been applied quite frequently (Fama, 1965;
Granger and Orr, 1972; Kon, 1984; Tucker, 1992). Such a modeling approach, however, is
appropriate for time series data only if the processes
Yt
and
Y2
t
do not exhibit autocorrelation;
Volatility clustering implies persistence of states of high volatility and leads to the rejection of
standard time series models in favor of time series models that allow the conditional variance
2.6 Markov Switching Conditional Heteroscedasticity 23
Varyt|yt1,ϑ
to depend on the history
yt1,yt2
,... of the observed process such as the
autoregressive conditionally heteroscedastic (ARCH) model.
2.6.1 Switching ARCH Models
A simple model to capture volatility clusters in financial time series is the ARCH model (Engle,
1982) which may be written as
Yt=σtεtεtN0,1
σ2
t=γt+α1Y2
t1+... +αmY2
tm
(2.15)
with γt=γ. Alternately,
Yt=γthtεt
h2
t=1+α1
γt1
Y2
t1+... +αm
γtm
Y2
tm
(2.16)
The two parameterizations are equivalent if
γt=γ
, however they generate different processed if
γt
is time dependent. The switching ARCH model results by allowing time dependence of
γt
through a hidden K-state Markov chain
γt=γSt
. Such a switching parameter was introduced by
Hamilton and Susmel (1994) into parameterization (2.17):
Yt=γsthtεt
h2
t=1+α1
γst1
Y2
t1+... +αm
γStm
Y2
tm
(2.17)
whereas Cai (1994) introduced a two-state and Kaufmann and Fr
¨
uhwirthSchnatter (2002) a
K-state switching parameter into parameterization (2.16):
Yt=σtεt
σ2
t=γSt+α1Y2
t1+... +αmY2
tm
(2.18)
Gray (1996) introduced switching into all coefficients of the ARCH process, represented by
(2.16):
Yt=σtεt
σ2
t=γSt+αst,1Y2
t1+... +αst,mY2
tm
(2.19)
24 Markov Switching Models
The switching ARCH model may be combined with a Markov switching autoregressive model
for the mean equation that includes the same hidden Markov chain (Gray, 1996):
Yt=ζSt+δSt,1Yt1+ut
ut=σtεt,εtN0,1
σ2
t=γSt+αSt,1u2
t1+... +αSt,mu2
tm
(2.20)
The switching ARCH model has been extended by including a leverage effect into the ARCH
specification (Hamilton and Susmel, 1994; Kaufmann and Fr
¨
uhwirth-Schnatter, 2002) to deal
with asymmetries in the marginal distribution:
Yt=σtεt,εtN0,1
σ2
t=γSt+α1y2
t1+... +αmy2
tm+ρdt1y2
t1
(2.21)
where dt=1 if yt0.dt=0 if yt>0 and ρ>0.
2.6.2 Switching GARCH Models
Francq et al. (2001) consider the following switching GARCH(m, n) model, where all coeffi-
cients are switching,
Yt=σtεt,εtN0,1
σ2
t=γSt+αSt,1y2
t1+... +αSt,my2
tm+δSt,1σ2
t1+... +δSt,nσ2
tn
(2.22)
By recursive substitution it becomes evident that the predictive density
pyt|yt1,ϑ
depends
on the whole history of
St
. For the switching GARCH(1, 1) model, for instance, the variance of
the predictive density reads:
σ2
t=γSt+αSt,1y2
t1+δSt,1γSt1+αSt1,1y2
t2+δSt,1γSt1γSt2+αSt2,1y2
t3+...
Thus the model obeys only the weakest assumption that the observation density
pyt|yt1,St,ϑ
depends on yt1and all past states of St.
2.6.3 Time-Varying Transition Matrices
Whereas the transition matrix ζof the hidden process Stis time invariant under assumptions;
2.6 Markov Switching Conditional Heteroscedasticity 25
St
is an irreducible, aperiodic Markov chain starting from its ergodic distribution
η=
η1,η2, ..., ηK:
PS0=k|ζ=ηk
The stochastic properties of
St
are sufficiently described by the (K
×
K) transition matrix
ζ
, where each element
ζjk
of
ζ
is equal to the transition probability from state j to state k:
ζjk =PSt=k|St1=j,j,k∈ {1, ..., K}
St
is a first-order homogeneous Markov chain with arbitrary transition matrix
ζ
, which
need not be irreducible or aperiodic, and starts from an arbitrary distribution
p0=
p0,1, ..., p0,Kthen:
p0,k=pS0=k
For a two-state Markov switching model, the transition probabilities ζSt1,Stmay be reparame-
terized through a logit model in the following way:
ζSt1,St=expκSt1,1
1+expκSt1,1,St̸=St1
A univariate exogenous variable ztmay then be included:
ζSt1,St=expκSt1,1+ztκSt1,2
1+expκSt1,1+ztκSt1,2,St̸=St1(2.23)
Note that the transition probability
ζSt1,St
not only depends on
zt
, but also on the state of
St1
.
The logit transform could be substituted by another increasing function F(·),
ζSt1,St=FκSt1,1+ztκSt1,2,St̸=St1(2.24)
A model with time-varying transition matrices may be estimated through the EM algorithm
(Diebold et al., 1994) or through MCMC methods (Filardo and Gordon, 1998).
References
[1]
Ahlgren, N. and Antell, J. (2002). Testing for cointegration between international stock
prices. Applied Financial Economics, 12(12):851–861.
[2]
Andersen, T. G., Davis, R. A., Kreiss, J.-P., and Mikosch, T. V. (2009). Handbook of
financial time series. Springer Science & Business Media.
[3] Ang, A. and Bekaert, G. (2002). Regime switches in interest rates. Journal of Business &
Economic Statistics, 20(2):163–182.
[4]
Bhar, R. and Hamori, S. (2003a). Alternative characterization of the volatility in the growth
rate of real gdp. Japan and the World Economy, 15(2):223–231.
[5]
Bhar, R. and Hamori, S. (2003b). New evidence of linkages among g7 stock markets.
Finance Letters, 1(1).
[6]
Bhar, R. and Hamori, S. (2005). State-space models (i). Empirical Techniques in Finance,
pages 83–104.
[7]
Bhar, R. and Hamori, S. (2006a). Empirical techniques in finance. Springer Science &
Business Media.
[8]
Bhar, R. and Hamori, S. (2006b). Hidden Markov models: applications to financial
economics, volume 40. Springer Science & Business Media.
[9]
Bhar, R. and Hamori, S. (2007). Analysing yield spread and output dynamics in an
endogenous markov switching regression framework. Asia-Pacific Financial Markets, 14(1-
2):141–156.
[10]
Bollerslev, T. (1986). Generalized autoregressive conditional heteroskedasticity. Journal
of econometrics, 31(3):307–327.
[11]
Bollerslev, T., Chou, R. Y., and Kroner, K. F. (1992). Arch modeling in finance: A review
of the theory and empirical evidence. Journal of econometrics, 52(1-2):5–59.
[12]
Cecchetti, S. G., Lam, P.-S., and Mark, N. (1988). Mean reversion in equilibrium asset
prices.
[13]
Chu, C.-S. J., Santoni, G. J., and Liu, T. (1996). Stock market volatility and regime shifts
in returns. Information Sciences, 94(1):179–190.
68 References
[14]
Clements, M. P., Franses, P. H., and Swanson, N. R. (2004). Forecasting economic
and financial time-series with non-linear models. International Journal of Forecasting,
20(2):169–183.
[15]
Corhay, A., Rad, A. T., and Urbain, J.-P. (1993). Common stochastic trends in european
stock markets. Economics Letters, 42(4):385–390.
[16]
Diebold, F. X., Lee, J.-H., and Weinbach, G. C. (1994). Regime switching with time-
varying transition probabilities. Business Cycles: Durations, Dynamics, and Forecasting,
pages 144–165.
[17]
Drost, F. C. and Nijman, T. E. (1993). Temporal aggregation of garch processes. Econo-
metrica: Journal of the Econometric Society, pages 909–927.
[18]
Durland, J. M. and McCurdy, T. H. (1994). Duration-dependent transitions in a markov
model of us gnp growth. Journal of Business & Economic Statistics, 12(3):279–288.
[19]
Elliott, R. J. and Mamon, R. S. (2003). A complete yield curve description of a markov
interest rate model. International Journal of Theoretical and Applied Finance, 6(04):317–
326.
[20]
Engle, R. F. (1982). Autoregressive conditional heteroscedasticity with estimates of the
variance of united kingdom inflation. Econometrica: Journal of the Econometric Society,
pages 987–1007.
[21]
Eun, C. S. and Shim, S. (1989). International transmission of stock market movements.
Journal of financial and quantitative Analysis, 24(02):241–256.
[22]
Fama, E. F. (1975). Short-term interest rates as predictors of inflation. The American
Economic Review, 65(3):269–282.
[23]
Fama, E. F. and Gibbons, M. R. (1982). Inflation, real returns and capital investment.
Journal of Monetary Economics, 9(3):297–323.
[24]
Filardo, A. J. (1994). Business-cycle phases and their transitional dynamics. Journal of
Business & Economic Statistics, 12(3):299–308.
[25]
Franses, P. H. and Van Dijk, D. (2000). Non-linear time series models in empirical finance.
Cambridge University Press.
[26]
Frühwirth-Schnatter, S. (2006). Finite mixture and Markov switching models. Springer
Science & Business Media.
[27]
Garbade, K. and Wachtel, P. (1978). Time variation in the relationship between inflation
and interest rates. Journal of Monetary Economics, 4(4):755–765.
[28]
Garcia, R. and Perron, P. (1996). An analysis of the real interest rate under regime shifts.
The Review of Economics and Statistics, pages 111–125.
References 69
[29]
Glosten, L. R., Jagannathan, R., and Runkle, D. E. (1993). On the relation between the
expected value and the volatility of the nominal excess return on stocks. The journal of
finance, 48(5):1779–1801.
[30]
Goldfeld, S. M. and Quandt, R. E. (1973). A markov model for switching regressions.
Journal of econometrics, 1(1):3–15.
[31]
Gray, S. F. (1996). Modeling the conditional distribution of interest rates as a regime-
switching process. Journal of Financial Economics, 42(1):27–62.
[32]
Grossman, S. J. and Shiller, R. J. (1980). The determinants of the variability of stock
market prices.
[33]
Hamilton, J. D. (1989). A new approach to the economic analysis of nonstationary time
series and the business cycle. Econometrica: Journal of the Econometric Society, pages
357–384.
[34]
Hamilton, J. D. (1990). Analysis of time series subject to changes in regime. Journal of
econometrics, 45(1):39–70.
[35]
Hamilton, J. D. (1993). 9 estimation, inference and forecasting of time series subject to
changes in regime. Handbook of statistics, 11:231–260.
[36]
Hamilton, J. D. (1996). Specification testing in markov-switching time-series models.
Journal of Econometrics, 70(1):127–157.
[37]
Hamilton, J. D. and Lin, G. (1996). Stock market volatility and the business cycle. Journal
of Applied Econometrics, 11(5):573–593.
[38]
Hamilton, J. D. and Raj, B. (2013). Advances in Markov-Switching Models: Applications
in Business Cycle Research and Finance. Springer Science & Business Media.
[39]
Hamilton, J. D. and Susmel, R. (1994). Autoregressive conditional heteroskedasticity and
changes in regime. Journal of Econometrics, 64(1):307–333.
[40]
Hamori, S. (2000a). The transmission mechanism of business cycles among germany,
japan, the uk and the usa. Applied Economics, 32(4):405–410.
[41]
Hamori, S. (2000b). Volatility of real gdp: some evidence from the united states, the
united kingdom and japan. Japan and the World Economy, 12(2):143–152.
[42]
Hansen, L. P. and Singleton, K. J. (1982). Generalized instrumental variables estimation of
nonlinear rational expectations models. Econometrica: Journal of the Econometric Society,
pages 1269–1286.
[43]
Harding, D. and Pagan, A. (2002). Dissecting the cycle: a methodological investigation.
Journal of monetary economics, 49(2):365–381.
[44]
Harding, D. and Pagan, A. (2003). A comparison of two business cycle dating methods.
Journal of Economic Dynamics and Control, 27(9):1681–1690.
70 References
[45]
Harding, D. E., Pagan, A., et al. (1999). Knowing the cycle. Melbourne Institute of
Applied Economic and Social Research, University of Melbourne.
[46]
Jochum, C. (1999). Volatility spillovers and the price of risk: Evidence from the swiss
stock market. Empirical Economics, 24(2):303–322.
[47]
Kasa, K. (1992). Common stochastic trends in international stock markets. Journal of
monetary Economics, 29(1):95–124.
[48]
Kim, C.-J. and Nelson, C. R. (1998). Business cycle turning points, a new coincident
index, and tests of duration dependence based on a dynamic factor model with regime
switching. Review of Economics and Statistics, 80(2):188–201.
[49]
Kim, C.-J. and Nelson, C. R. (1999). Has the us economy become more stable? a bayesian
approach based on a markov-switching model of the business cycle. Review of Economics
and Statistics, 81(4):608–616.
[50]
Kim, C.-J. and Nelson, C. R. (2001). A bayesian approach to testing for markov-switching
in univariate and dynamic factor models. International Economic Review, 42(4):989–1013.
[51]
Kim, C.-J., Nelson, C. R., et al. (1999). State-space models with regime switching:
classical and Gibbs-sampling approaches with applications, volume 2. MIT press Cambridge,
MA.
[52]
Kim, C.-J., Nelson, C. R., and Piger, J. (2004). The less-volatile us economy: a bayesian
investigation of timing, breadth, and potential explanations. Journal of Business & Economic
Statistics, 22(1):80–93.
[53]
Kim, C.-J., Nelson, C. R., and Startz, R. (1998). Testing for mean reversion in het-
eroskedastic data based on gibbs-sampling-augmented randomization. Journal of Empirical
finance, 5(2):131–154.
[54]
Kim, D. and Kon, S. J. (1994). Alternative models for the conditional heteroscedasticity
of stock returns. Journal of Business, pages 563–598.
[55]
Kim, I.-M. and Maddala, G. (1991). Multiple structural breaks and unit roots in exchange
rates. In Econometric Society Meeting at New Orleans.
[56]
Lam, P.-s. (1990). The hamilton model with a general autoregressive component: estima-
tion and comparison with other models of economic time series: Estimation and comparison
with other models of economic time series. Journal of Monetary Economics, 26(3):409–432.
[57] Lam, P.-s. and Mark, N. C. (1988). Mean reversion in equilibrium asset prices.
[58]
Lee, S. H., Sung, H. M., and Urrutia, J. L. (1996). The impact of the persian gulf crisis on
the prices of ldcs’ loans. Journal of Financial Services Research, 10(2):143–162.
[59]
Maddala, G. S. and Kim, I.-M. (1998). Unit roots, cointegration, and structural change.
Number 4. Cambridge University Press.
References 71
[60]
Malliaris, A. G., Urrutia, J. L., et al. (1992). The international crash of october 1987:
causality tests. Journal of Financial and Quantitative Analysis, 27(3).
[61] Malliaris, A. G., Urrutia, J. L., et al. (1998). Volume and price relationships: hypotheses
and testing for agricultural futures. Journal of Futures Markets, 18(1):53–72.
[62]
Mamon, R. S. and Elliott, R. J. (2007). Hidden markov models in finance, volume 4.
Springer.
[63]
McCarthy, J. and Najand, M. (1995). State space modeling of linkages among international
markets. Journal of Multinational Financial Management, 5:1–9.
[64]
McConnell, M. M. and Perez-Quiros, G. (1998). Output fluctuations in the united states:
what has changed since the early 1980s? FRB of New York Staff Report, (41).
[65] Neal, R., Rolph, D. S., and Morris, C. (2001). Interest rates and credit spread dynamics.
[66]
Nelson, C. R. and Schwert, G. W. (1977). Short-term interest rates as predictors of
inflation: On testing the hypothesis that the real rate of interest is constant. The American
Economic Review, 67(3):478–486.
[67]
Nelson, D. B. (1991). Conditional heteroskedasticity in asset returns: A new approach.
Econometrica: Journal of the Econometric Society, pages 347–370.
[68]
Perron, P. (1990). Testing for a unit root in a time series with a changing mean. Journal of
Business & Economic Statistics, 8(2):153–162.
[69]
Quandt, R. E. (1958). The estimation of the parameters of a linear regression system
obeying two separate regimes. Journal of the american statistical association, 53(284):873–
880.
[70]
Quandt, R. E. (1972). A new approach to estimating switching regressions. Journal of the
American statistical association, 67(338):306–310.
[71]
Satchell, S. and Knight, J. (2011). Forecasting volatility in the financial markets.
Butterworth-Heinemann.
[72]
Schaller, H. and Norden, S. V. (1997). Regime switching in stock market returns. Applied
Financial Economics, 7(2):177–191.
[73] Taylor, S. J. (2007). Modelling financial time series.
[74]
Turner, C. M., Startz, R., and Nelson, C. R. (1989). A markov model of heteroskedasticity,
risk, and learning in the stock market. Journal of Financial Economics, 25(1):3–22.
[75] Walsh, C. (1982). Interest rate volatility and monetary policy.
[76]
Walsh, C. E. (1984). Interest rate volatility and monetary policy. Journal of Money, Credit
and Banking, 16(2):133–150.
72 References
[77] Wang, P. (2008). Financial econometrics. Routledge.
[78]
Wells, C. (1996). The kalman filter in finance (advanced studies in theoretical and applied
econometrics, vol 32).
Appendix A
Table A.1 Chapter 3: Correlation Analysis
India JAPAN USA Pakistan
India 1 -0.13006 0.145842 -0.63359
JAPAN 1 0.021093 0.154037
USA 1 -0.2651
Pakistan 1
Table A.2 Chapter 5: Descriptive Statistics
Real Interest Rate
Mean -0.406993
Median 0.571534
Maximum 12.126104
Minimum -30.046879
Std. Dev. 5.350612
Skewness -2.088641
Kurtosis 10.321620
Jarque-Bera 1850.411571
Probability 0.000000
74
Table A.3 Chapter 5: Real Interest Rate Unit Root Test
Unit Root Test t-Statistic Prob.*
Augmented Dickey-Fuller test statistic -4.40837 0.00030
Phillips-Perron test statistic -5.55596 0.00000
*MacKinnon (1996) one-sided p-values.
ResearchGate has not been able to resolve any citations for this publication.
Book
This book offers the opportunity to study and experience advanced empi- cal techniques in finance and in general financial economics. It is not only suitable for students with an interest in the field, it is also highly rec- mended for academic researchers as well as the researchers in the industry. The book focuses on the contemporary empirical techniques used in the analysis of financial markets and how these are implemented using actual market data. With an emphasis on Implementation, this book helps foc- ing on strategies for rigorously combing finance theory and modeling technology to extend extant considerations in the literature. The main aim of this book is to equip the readers with an array of tools and techniques that will allow them to explore financial market problems with a fresh perspective. In this sense it is not another volume in eco- metrics. Of course, the traditional econometric methods are still valid and important; the contents of this book will bring in other related modeling topics that help more in-depth exploration of finance theory and putting it into practice. As seen in the derivatives analysis, modern finance theory requires a sophisticated understanding of stochastic processes. The actual data analyses also require new Statistical tools that can address the unique aspects of financial data. To meet these new demands, this book explains diverse modeling approaches with an emphasis on the application in the field of finance.
Article
This article revisits the relationship between callable credit spreads and interest rates. The authors use cointegration to model the time series of corporate and government bond rates and draw inference about how credit spreads evolve after a shock to government rates using a bootstrapped standard error methodology. They find little evidence that unexpected changes to government rates lead to a significant change in future credit spreads. These results hold for both large positive and negative shocks, as well as after conditioning on the prevailing interest rate environment.
Book
This new edition of Forecasting Volatility in the Financial Markets assumes that the reader has a firm grounding in the key principles and methods of understanding volatility measurement and builds on that knowledge to detail cutting-edge modelling and forecasting techniques. It provides a survey of ways to measure risk and define the different models of volatility and return. Editors John Knight and Stephen Satchell have brought together an impressive array of contributors who present research from their area of specialization related to volatility forecasting. Readers with an understanding of volatility measures and risk management strategies will benefit from this collection of up to date chapters on the latest techniques in forecasting volatility. * Leading thinkers present newest research on volatility forecasting *International authors cover a broad array of subjects related to volatility forecasting *Assumes basic knowledge of volatility, financial mathematics, and modelling.