
Modeling regime shifts Markov switching models Hidden Markov Model Applications

June 2016


Thesis for: BS Economics and Mathematics

Authors:

Monash University (Australia)



Modeling regime shifts

Markov switching models

Hidden Markov Model Applications

Fahad Javed Malik

Dept of Economics and Finance & Dept of Mathematical Sciences

Institute of Business Administration, Karachi

This thesis is submitted for the degree of

Bachelor of Science

Economics and Mathematics

Dr. Adnan Haider
Academic Program Coordinator
Assistant Professor
Institute of Business Administration, Karachi

Dr. Mohammad Nishat
Thesis Supervisor
Professor and Associate Dean
Institute of Business Administration, Karachi

June 2016

Centre for Business and Economics Research


This thesis is dedicated to my dear parents.

Declaration

I hereby declare that except where specific reference is made to the work of others, the contents of this dissertation are original and have not been submitted in whole or in part for consideration for any other degree or qualification in this, or any other university. This dissertation is my own work and contains nothing which is the outcome of work done in collaboration with others, except as specified in the text and Acknowledgments. This dissertation contains fewer than 18,000 words including appendices, bibliography, footnotes, tables and equations and has fewer than 15 figures.

Fahad Javed Malik

June 2016

Acknowledgements

I would like to take this opportunity to thank all those individuals without whom my education over the preceding years, and particularly this thesis, would not have been what they are.

First of all, my thanks go to Professor Dr Muhammad Nishat for bringing the topic of this thesis to my attention and for all his helpful advice throughout the time I worked on it. I am also most grateful to him for enabling me to spend a Summer Semester at the University of Oslo and BI Norwegian Business School a year ago, to attend a research program in Norway.

I would like to thank Prof. Dr Muhammad Nishat very much indeed for his patience, guidance and motivation as supervisor of this thesis. I am also heavily indebted to Prof. Dr Adnan Haider for all the helpful advice and encouragement he has given me over the years. I would like to express my sincere gratitude to my supervisors for their insightful comments and support and for giving me the opportunity to complete my thesis.

My special thanks for most helpful and influential discussions on the subjects of this thesis go to Prof. Dr Javed Iqbal and Prof. Dr Muhammad Sheraz. I am most grateful to Prof. Dr Nasir Touheed for introducing me to mathematical research and for all the helpful things that I learned from our very interesting work together.

Moreover, I gratefully acknowledge the privilege of having been supported financially by the Institute of Business Administration Karachi via a Merit Scholarship throughout my studies.

Furthermore, a special thank you is due to all my friends who made life and, in particular, university life much more enjoyable. Last but not at all least, I would like to thank especially my family for simply everything.

Abstract

Regime shifts are an important aspect of financial economics, since some series have a tendency to fluctuate abruptly, with the new trend persisting for a long period. The Markov switching model captures volatility persistence, time-varying correlations, skewness and kurtosis. The thesis considers discrete-time models using Hidden Markov models to study the regime switching properties of time series. It refers to the Hamilton (1988) model, following the assumption that the current regime depends only on the regime one period earlier, and then incorporates extensions of regime switching models. Hidden Markov models capture nonlinear patterns while allowing for stochastic volatility; after the regimes are identified, the probabilities of changes in regime are also estimated. Since financial time series exhibit conditional variance that is not handled by traditional regression models, autoregressive conditionally heteroscedastic model techniques are used. The regime switching technique extends the GARCH model by taking into account a variance structure in which economic states differ in the long term, such that the shifts are governed by a Markov chain process.

The thesis describes the Markov chain model, highlighting the State Space method and the Kalman Filter. Then the finite Markov mixture distribution is outlined, with emphasis on extensions of Markov switching models. Next, structural breaks in the variance of the GDP growth rate are studied for four economies: USA, Pakistan, India and Japan. The thesis further tries to capture linkages between the national stock markets of six different economies, taking into account the degree of dependence among markets. Finally, structural breaks in the real interest rate of Pakistan are assessed, working on volatility clustering, with the interest rate series represented as a mixture of distributions.

Table of contents

List of figures
List of tables
Nomenclature

1 Introduction
1.1 Markov chains
1.2 Estimation
1.3 Smoothing
1.4 Time-varying transition probabilities
1.5 State Space Methods and Kalman Filter
1.6 Hidden Markov Models
1.7 Hidden Markov Model Parameter Estimation

2 Markov Switching Models
2.1 Finite Markov Mixture Distributions
2.2 Statistical Modeling Based on Finite Markov Mixture Distributions
2.2.1 The Basic Markov Switching Model
2.2.2 The Markov Switching Regression Model
2.3 Nonlinear Time Series Analysis Based on Markov Switching Models
2.4 The Markov Switching Autoregressive Model
2.4.1 Model Definition
2.5 Markov Switching Dynamic Regression Models
2.6 Markov Switching Conditional Heteroscedasticity
2.6.1 Switching ARCH Models

List of ﬁgures

3.1 Real GDP growth rate GARCH(1,1) Model: Conditional Variance
3.2 Markov switching heteroskedasticity model Variance: India
3.3 Markov switching heteroskedasticity model Variance: Pakistan
3.4 Markov switching heteroskedasticity model Variance: USA
3.5 Markov switching heteroskedasticity model Variance: Japan
4.1 Markov switching model: Filtered probability India
4.2 Markov switching model: Filtered probability Germany
4.3 Markov switching model: Filtered probability UK
4.4 Markov switching model: Filtered probability Pakistan
4.5 Markov switching model: Filtered probability USA
4.6 Markov switching model: Filtered probability China
5.1 Markov switching model: Filtered probability Pakistan Real Interest Rate
5.2 Markov switching model: Pakistan Real Interest Rate Actual versus Fitted Plot

List of tables

3.1 Descriptive Statistics
3.2 Real GDP growth rate: GARCH(1,1) Model Estimation
3.3 Diagnostic Tests: GARCH(1,1) Model Estimation
3.4 Real GDP growth rate: Markov Switching Heteroskedasticity Model estimation
4.1 Descriptive Statistics: Stock Market Return
4.2 Correlation of Stock Prices
4.3 Stock return: Markov switching heteroscedasticity model
4.4 Estimated Probability of remaining in the same State
4.5 Concordance Correlation Coefficient between probabilities of High volatility state
4.6 Concordance Correlation Coefficient between probabilities of Low volatility state
5.1 Estimates: Real Interest Rate Linear Model and Markov Switching Model
5.2 Estimates: Real Interest Rate Transition Probabilities and Expected Duration
A.1 Chapter 3: Correlation Analysis
A.2 Chapter 5: Descriptive Statistics
A.3 Chapter 5: Real Interest Rate Unit Root Test

Nomenclature

Acronyms / Abbreviations

BiNom Binomial Distribution

EGARCH Exponential GARCH

TGARCH Threshold GARCH

AR Autoregressive

ARCH Autoregressive Conditional Heteroskedasticity

ARMA Autoregressive–Moving-Average

BSE Bombay Stock Exchange Index

CAPM Capital Asset Pricing Model

SSE Shanghai Stock Exchange

EM Expectation–maximization

FTSE Financial Times Stock Exchange Index

GARCH Generalized Autoregressive Conditional Heteroskedasticity

GARCH M GARCH-in-mean

HMM Hidden Markov models

IID Independent and identically distributed

JB Jarque Bera statistic


KSE Karachi Stock Exchange Index

LB Ljung-Box statistic

MCMC Markov chain Monte Carlo

MLE Maximum-Likelihood Estimation

MSAR Markov Switching Autoregressive

MSM Markov Switching Model

NYSE New York Stock Exchange Index

RSV Regime Switching Stochastic Volatility

SSM State Space Method

SWARCH Markov Switching Autoregressive Conditional Heteroskedasticity

VAR Vector Autoregression

Chapter 1

Introduction

The canonical probabilistic model for temporal or sequential data is the Markov model, named after Andrey Markov, a Russian mathematician who worked on stochastic processes in the late 1800s and early 1900s. The idea behind a Markov model exploits a deep structural assumption about the real world: the future is independent of the past given the present. In simpler terms, if you know the exact state of the world right now and want to use that knowledge to predict the future, then knowing the state of the world one second ago will not help, because everything you need to know is already encoded in the current state.

1.1 Markov chains

A Markov chain is a stochastic process defined over a set of states $\{S_t, t = 0, 1, \ldots\}$ taking finite integer values denoted by $i, j$. The conditional probability distribution of any future state $S_{t+1}$, given the past states $S_0, S_1, \ldots, S_{t-1}$ and the present state $S_t$, depends only on the present state and is independent of the past states:

$$P\{S_{t+1} = j \mid S_t = i_t, S_{t-1} = i_{t-1}, \ldots, S_1 = i_1, S_0 = i_0\} = P\{S_{t+1} = j \mid S_t = i_t\} = p_{ij} \tag{1.1}$$

Here $p_{ij}$ is the transition probability from $i$ to $j$, that is, the probability that the next state will be $j$ given that the immediately preceding state is $i$. Assuming there are $N$ states, a transition matrix can be used to describe all the transitions:

$$P = \begin{pmatrix} p_{11} & p_{12} & \cdots & p_{1N} \\ p_{21} & p_{22} & \cdots & p_{2N} \\ \vdots & \vdots & \ddots & \vdots \\ p_{N1} & p_{N2} & \cdots & p_{NN} \end{pmatrix} \tag{1.2}$$

The transition probability $p_{ij}$ is non-negative, and every state, including the current state, must transit into some state, such that:

$$\sum_{j=1}^{N} p_{ij} = 1, \quad i = 1, 2, \ldots, N \tag{1.3}$$

The above matrix gives the one-step transition probabilities. The one-step case can be extended to $n$-step transitions, which result from multiple one-step transitions. A two-step transition probability $P\{S_{t+2} = j \mid S_t = i\}$ is the sum, over all intermediate states $k$, of the probability of moving from state $i$ to state $k$ and then from state $k$ to state $j$:

$$P\{S_{t+2} = j \mid S_t = i\} = \sum_{k=1}^{N} P\{S_{t+2} = j \mid S_{t+1} = k\}\, P\{S_{t+1} = k \mid S_t = i\}$$

In general, the $n$-step transition probability can be written:

$$P\{S_{t+n} = j \mid S_t = i\} = p^{n}_{ij} \tag{1.4}$$

The multi-step transition probabilities can be calculated using the Chapman-Kolmogorov equation:

$$p^{m+n}_{ij} = \sum_{k=1}^{N} p^{n}_{ik}\, p^{m}_{kj}, \quad i, j = 1, 2, \ldots, N \tag{1.5}$$
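The relations (1.3) to (1.5) can be checked numerically. The sketch below is only an illustration: the two-state matrix `P` and the helper name `n_step` are assumptions made for the example and do not come from the thesis. It treats the $n$-step matrix as the $n$-th matrix power of the one-step matrix.

```python
import numpy as np

# Hypothetical two-state transition matrix (illustrative values only).
# Row i holds p_ij, so each row sums to one, as required by (1.3).
P = np.array([[0.9, 0.1],
              [0.3, 0.7]])

def n_step(P, n):
    """n-step transition matrix of (1.4), computed as the n-th matrix power."""
    return np.linalg.matrix_power(P, n)

# Two-step probability of moving from state 1 to state 2,
# summing over the intermediate state k.
p12_two_step = sum(P[0, k] * P[k, 1] for k in range(P.shape[0]))

# Chapman-Kolmogorov (1.5): the (m+n)-step matrix is the product
# of the n-step and m-step matrices.
m, n = 2, 3
lhs = n_step(P, m + n)
rhs = n_step(P, n) @ n_step(P, m)
```

Using a matrix power keeps the example short; for long horizons an eigendecomposition of `P` would be an alternative.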

1.2 Estimation

The Markov chain process can be estimated through filtering, which expresses the joint conditional probabilities of the future states as a function of the joint conditional probabilities of the current states and the transition probabilities. The procedure is called filtering because the conditional probabilities of the current states are the input, which passes through a dynamic transformation, the transition probability matrix, to produce the output in the form of conditional probabilities of the future states. Suppose there is a simple two-state Markov chain:

$$y_t = \mu_1 S_1 + \mu_2 S_2 + \varepsilon_t \tag{1.6}$$

where $S_1 = 1$ in state 1 and 0 otherwise, $S_2 = 1$ in state 2 and 0 otherwise, and $\varepsilon_t$ is a white noise residual. The joint probability of $y_t$ and $S_t$, which evolves over time, needs to be calculated. It can be derived in two steps. First, the conditional probability of being in state $s_t$, given the information available at time $t-1$, is $P(S_t = s_t \mid y_{t-1})$:

$$P(S_t = s_t \mid y_{t-1}) = \sum_{s_{t-1}=1}^{2} P(S_t = s_t \mid S_{t-1} = s_{t-1}) \times P(S_{t-1} = s_{t-1} \mid y_{t-1}) \tag{1.7}$$

Second, the probability of being in state $s_t$ given the information available at time $t$, $P(S_t = s_t \mid y_t)$, is obtained by first calculating the joint probability distribution of $S_t$ and $y_t$:

$$\begin{aligned}
f(y_t, S_t = s_t \mid y_{t-1})
&= f(y_t \mid S_t = s_t, y_{t-1}) \times P(S_t = s_t \mid y_{t-1}) \\
&= f(y_t \mid S_t = s_t, y_{t-1}) \times \sum_{s_{t-1}=1}^{2} P(S_t = s_t \mid S_{t-1} = s_{t-1}) \times P(S_{t-1} = s_{t-1} \mid y_{t-1})
\end{aligned} \tag{1.8}$$

Then the updated probability of $S_t$ given $y_t$ can be calculated as:

$$P(S_t = s_t \mid y_t) = \frac{f(y_t, S_t = s_t \mid y_{t-1})}{f(y_t \mid y_{t-1})} \tag{1.9}$$

where $f(y_t \mid y_{t-1}) = \sum_{s_t=1}^{2} f(y_t, S_t = s_t \mid y_{t-1})$.

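Let $y_t$ be an $N$-state Markov chain process with autoregression of order $r$ and residual $\varepsilon_t$, which is also a function of an exogenous variable $x_t$ and its lags, known as a dynamic autoregressive process. When $y_t$ follows an $r$th-order autoregression, the joint conditional probability of the current state and the previous $r$ states, given the information set containing all values lagged up to $r$ periods, is considered:

$$P(S_t = s_t, S_{t-1} = s_{t-1}, \ldots, S_{t-r} = s_{t-r} \mid \Omega_{t-1}) \tag{1.10}$$

where $\Omega_{t-1} = \{y_{t-1}, y_{t-2}, \ldots, y_{t-r}, x_{t-1}, x_{t-2}, \ldots, x_{t-r}\}$ is the information available at time $t-1$. The filtering procedure is used again to update the probability of being in state $s_t$, based on the information available at time $t-1$, by deriving the joint density of $y_t$ and the states:

$$\begin{aligned}
& f(y_t, S_t = s_t, S_{t-1} = s_{t-1}, \ldots, S_{t-r-1} = s_{t-r-1} \mid \Omega_{t-1}) \\
&\quad = f(y_t \mid S_t = s_t, S_{t-1} = s_{t-1}, \ldots, S_{t-r-1} = s_{t-r-1}, \Omega_{t-1}) \\
&\qquad \times P(S_t = s_t, S_{t-1} = s_{t-1}, \ldots, S_{t-r-1} = s_{t-r-1} \mid \Omega_{t-1}) \\
&\quad = f(y_t \mid S_t = s_t, S_{t-1} = s_{t-1}, \ldots, S_{t-r-1} = s_{t-r-1}, \Omega_{t-1}) \\
&\qquad \times P(S_t = s_t \mid S_{t-1} = s_{t-1}) \times P(S_{t-1} = s_{t-1}, \ldots, S_{t-r-1} = s_{t-r-1} \mid \Omega_{t-1})
\end{aligned} \tag{1.11}$$

Then the probability density function of $y_t$ is calculated by summing over all the states:

$$f(y_t \mid \Omega_{t-1}) = \sum_{s_t=1}^{N} \sum_{s_{t-1}=1}^{N} \cdots \sum_{s_{t-r-1}=1}^{N} f(y_t, S_t = s_t, S_{t-1} = s_{t-1}, \ldots, S_{t-r-1} = s_{t-r-1} \mid \Omega_{t-1}) \tag{1.12}$$

This is not yet the filter output. The joint state probabilities are first updated with the observation $y_t$:

$$P(S_t = s_t, S_{t-1} = s_{t-1}, \ldots, S_{t-r-1} = s_{t-r-1} \mid \Omega_t) = \frac{f(y_t, S_t = s_t, S_{t-1} = s_{t-1}, \ldots, S_{t-r-1} = s_{t-r-1} \mid \Omega_{t-1})}{f(y_t \mid \Omega_{t-1})} \tag{1.13}$$

Hence the output of the filter is:

$$P(S_t = s_t, S_{t-1} = s_{t-1}, \ldots, S_{t-r} = s_{t-r} \mid \Omega_t) = \sum_{s_{t-r-1}=1}^{N} P(S_t = s_t, S_{t-1} = s_{t-1}, \ldots, S_{t-r-1} = s_{t-r-1} \mid \Omega_t) \tag{1.14}$$

This is the joint probability of the current and lagged states. The probability of the state at time $t$, based on current information, is obtained by summing over the lagged states:

$$P(S_t = s_t \mid \Omega_t) = \sum_{s_{t-1}=1}^{N} \cdots \sum_{s_{t-r}=1}^{N} P(S_t = s_t, S_{t-1} = s_{t-1}, \ldots, S_{t-r} = s_{t-r} \mid \Omega_t) \tag{1.15}$$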

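Similarly, the log-likelihood function can be derived:

$$L(\theta) = \sum_{t=1}^{T} \log f(y_t \mid \Omega_{t-1}; \theta) \tag{1.16}$$

where $\theta$ is the vector of parameters. Techniques like Gibbs sampling and the EM algorithm can be used to estimate the model; however, maximum likelihood remains a convenient and appropriate method. Under the assumption of a two-state Markov chain process with a normally distributed residual, the log-likelihood function can be expanded as follows:

$$\begin{aligned}
L(\theta) &= \sum_{t=1}^{T} \log f(y_t \mid \Omega_{t-1}; \theta) \\
&= \sum_{t=1}^{T} \log \sum_{s_t=1}^{2} f(y_t \mid S_t = s_t, y_{t-1}; \theta) \times P(S_t = s_t \mid y_{t-1}) \\
&= \sum_{t=1}^{T} \log \sum_{s_t=1}^{2} \sum_{s_{t-1}=1}^{2} f(y_t \mid S_t = s_t, y_{t-1}; \theta) \times P(S_t = s_t \mid S_{t-1} = s_{t-1}) \times P(S_{t-1} = s_{t-1} \mid y_{t-1}) \\
&= \sum_{t=1}^{T} \log \Bigg\{ \frac{1}{\sqrt{2\pi}\,\sigma_\varepsilon} \exp\!\left[-\frac{(y_t - \mu_1)^2}{2\sigma_\varepsilon^2}\right] \times \left(p_{11} P_{tL1} + p_{21} P_{tL2}\right) \\
&\qquad\qquad + \frac{1}{\sqrt{2\pi}\,\sigma_\varepsilon} \exp\!\left[-\frac{(y_t - \mu_2)^2}{2\sigma_\varepsilon^2}\right] \times \left(p_{12} P_{tL1} + p_{22} P_{tL2}\right) \Bigg\}
\end{aligned} \tag{1.17}$$

where $P_{tL1} = P(S_{t-1} = 1 \mid y_{t-1})$ and $P_{tL2} = P(S_{t-1} = 2 \mid y_{t-1})$.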

1.3 Smoothing

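Smoothing is the process of inferring the states at time $t$ using future information. Filtering gives the probability of the states at time $t$ given the information currently available, $P(S_t = s_t \mid \Omega_t)$; smoothing revises it to $P(S_t = s_t \mid \Omega_T)$, replacing $\Omega_t$ with $\Omega_T$ in the probability. If there are no lags in $y_t$, smoothing is carried out in two steps; otherwise it takes three steps, including one approximation.

For simplicity, write $S_t$ for $S_t = s_t$:

$$\begin{aligned}
P(S_{t-r}, \ldots, S_t, S_{t+1} \mid \Omega_T)
&= P(S_{t-r+1}, \ldots, S_t, S_{t+1} \mid \Omega_T) \times P(S_{t-r} \mid S_{t-r+1}, \ldots, S_t, S_{t+1}, \Omega_T) \\
&= P(S_{t-r+1}, \ldots, S_t, S_{t+1} \mid \Omega_T) \times P(S_{t-r} \mid S_{t-r+1}, \ldots, S_t, S_{t+1}, \Omega_t) \\
&= \frac{P(S_{t-r+1}, \ldots, S_t, S_{t+1} \mid \Omega_T) \times P(S_{t-r}, \ldots, S_t, S_{t+1} \mid \Omega_t)}{P(S_{t-r+1}, \ldots, S_t, S_{t+1} \mid \Omega_t)} \\
&= \frac{P(S_{t-r+1}, \ldots, S_t, S_{t+1} \mid \Omega_T) \times P(S_{t-r}, \ldots, S_t \mid \Omega_t) \times P(S_{t+1} \mid S_t)}{P(S_{t-r+1}, \ldots, S_t, S_{t+1} \mid \Omega_t)}
\end{aligned} \tag{1.18}$$

The second equality holds, that is, $P(S_{t-r} \mid S_{t-r+1}, \ldots, S_t, S_{t+1}, \Omega_T) = P(S_{t-r} \mid S_{t-r+1}, \ldots, S_t, S_{t+1}, \Omega_t)$, if and only if

$$f(y_{t+1}, \Omega_{Tt} \mid S_{t-r}, S_{t-r+1}, \ldots, S_t, S_{t+1}, \Omega_t) = f(y_{t+1}, \Omega_{Tt} \mid S_{t-r+1}, \ldots, S_t, S_{t+1}, \Omega_t) \tag{1.19}$$

Because $\Omega_{Tt} = \Omega_T - \Omega_t$, it follows that:

$$\begin{aligned}
P(S_{t-r} \mid S_{t-r+1}, \ldots, S_t, S_{t+1}, \Omega_T)
&= P(S_{t-r} \mid S_{t-r+1}, \ldots, S_t, S_{t+1}, \Omega_t, \Omega_{Tt}) \\
&= \frac{f(y_{t+1}, S_{t-r}, \Omega_{Tt} \mid S_{t-r+1}, \ldots, S_t, S_{t+1}, \Omega_t)}{f(y_{t+1}, \Omega_{Tt} \mid S_{t-r+1}, \ldots, S_t, S_{t+1}, \Omega_t)} \\
&= \frac{f(y_{t+1}, \Omega_{Tt} \mid S_{t-r}, \ldots, S_t, S_{t+1}, \Omega_t) \times P(S_{t-r} \mid S_{t-r+1}, \ldots, S_t, S_{t+1}, \Omega_t)}{f(y_{t+1}, \Omega_{Tt} \mid S_{t-r+1}, \ldots, S_t, S_{t+1}, \Omega_t)}
\end{aligned} \tag{1.20}$$

Summing over $S_{t+1} = 1, 2, \ldots, N$ then gives the smoothed states (with a residual free of serial correlation):

$$P(S_{t-r}, \ldots, S_t \mid \Omega_T) = \sum_{S_{t+1}=1}^{N} P(S_{t-r}, \ldots, S_t, S_{t+1} \mid \Omega_T) \tag{1.21}$$

The above equation for the smoothed states applies when no lagged $y_t$ exist; when there are lags, a similar equation is obtained by summing over the lagged states:

$$P(S_t \mid \Omega_T) = \sum_{s_{t-1}=1}^{N} \cdots \sum_{s_{t-r}=1}^{N} P(S_t, S_{t-1}, \ldots, S_{t-r} \mid \Omega_T) \tag{1.22}$$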


1.4 Time-varying transition probabilities

The Markov chain process can be extended by adding time-varying transition probabilities, which make the model more flexible and can be defined as:

$$P\{S_{t+1} = j \mid S_t = i, \Omega_{t+1}\} = p_{ij}(t+1) \tag{1.23}$$

The transition probability matrix is given as:

$$P(t) = \begin{pmatrix} p_{11}(t) & p_{12}(t) & \cdots & p_{1N}(t) \\ p_{21}(t) & p_{22}(t) & \cdots & p_{2N}(t) \\ \vdots & \vdots & \ddots & \vdots \\ p_{N1}(t) & p_{N2}(t) & \cdots & p_{NN}(t) \end{pmatrix} \tag{1.24}$$

Time-varying transition probabilities are specified using binary response models in the form of logit and probit models. The logistic function and the normal cumulative distribution function used in these models are symmetric, so a deviation from the mean value increases the probability. The time-varying functions are similar to smooth transition models; the advantage of this specification is that the transition probabilities always lie within the interval $[0, 1]$. The logit function of the transition probabilities is:

$$p_{ij}(t) = \frac{1}{1 + \exp\{-\Omega_t \beta'_{ij}\}} \tag{1.25}$$

where $\beta'_{ij}$ is the coefficient vector of the set of endogenous and exogenous variables. In simpler terms, if $-\Omega_t \beta'_{ij} = \omega_{ij0} - \gamma_{ij}\, y_{t-1}$, then we have:

$$p_{ij}(t) = \frac{1}{1 + \exp\{\omega_{ij0} - \gamma_{ij}\, y_{t-1}\}} \tag{1.26}$$

The above equation has direct implications: symmetric responses depend only on the amount of deviation from the central or equilibrium value, whatever its direction, whereas for asymmetric effects the direction or sign of the deviation matters.
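The logistic specification in (1.26) can be sketched for a two-state chain. The example is an assumption-laden illustration: the names `logistic_prob` and `transition_matrix`, the dictionary layout, and the coefficient values are hypothetical. Only the two "stay" probabilities are modeled with the logit form; the off-diagonal entries are set so that each row sums to one.

```python
import numpy as np

def logistic_prob(omega0, gamma, y_lag):
    """Transition probability of the form 1 / (1 + exp(omega0 - gamma * y_lag))."""
    return 1.0 / (1.0 + np.exp(omega0 - gamma * y_lag))

def transition_matrix(y_lag, params):
    """Two-state time-varying transition matrix driven by the lagged observation.
    params maps "p11" and "p22" to hypothetical (omega0, gamma) pairs."""
    p11 = logistic_prob(*params["p11"], y_lag)
    p22 = logistic_prob(*params["p22"], y_lag)
    return np.array([[p11, 1.0 - p11],
                     [1.0 - p22, p22]])

# Hypothetical coefficients: a higher lagged value makes state 1 more
# persistent and state 2 less persistent.
params = {"p11": (-2.0, 0.5), "p22": (-1.0, -0.5)}

P_low = transition_matrix(-1.0, params)
P_mid = transition_matrix(0.0, params)
P_high = transition_matrix(1.0, params)
```

Because the logistic function maps any real argument into the unit interval, every entry of the matrix remains a valid probability whatever the value of the lagged observation.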


1.5 State Space Methods and Kalman Filter

State space models are used for dynamic time series models containing unobserved variables. The simplest SSM is the first-order vector autoregressive model:

$$y_t = \Gamma y_{t-1} + w_t \tag{1.27}$$

where the state $y_{t,i}$ is determined from the past states $y_{t-1,j}$, $j = 1, 2, \ldots, p$, for $i = 1, 2, \ldots, p$ and time $t = 1, 2, \ldots, n$. To complete the specification, the $w_t$ are $p \times 1$ Gaussian white noise vectors with covariance $Q$. The initial value of the state process is drawn from a normal distribution with mean vector $\mu_0$ and covariance matrix $\Sigma_0$. The state vector is not observed directly, but only through a transformation with added noise, so the observation equation can be written as:

$$z_t = A_t y_t + v_t \tag{1.28}$$

Next, estimates are derived for a particular point in time using Gaussian assumptions on the unobserved state vector. Starting from the initial conditions with mean $y_{0|0} = \mu_0$ and covariance $P_{0|0} = \Sigma_0$, for $t = 1, 2, \ldots, n$:

$$y_{t|t-1} = \Gamma y_{t-1|t-1} \tag{1.29}$$

$$P_{t|t-1} = \Gamma P_{t-1|t-1} \Gamma' + Q \tag{1.30}$$

$$y_{t|t} = y_{t|t-1} + K_t \left(z_t - A_t y_{t|t-1}\right) \tag{1.31}$$

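Then the Kalman gain matrix is given as:

$$K_t = P_{t|t-1} A'_t \left[A_t P_{t|t-1} A'_t + R\right]^{-1} \tag{1.32}$$

and the covariance matrix $P_{t|t}$ is given as:

$$P_{t|t} = \left[I - K_t A_t\right] P_{t|t-1} \tag{1.33}$$

The innovation vector of the basic state vector equation can be defined as:

$$v_t = z_t - A_t y_{t|t-1} \tag{1.34}$$

and its covariance as

$$\Sigma_t = A_t P_{t|t-1} A'_t + R \tag{1.35}$$

The recursions above constitute the Kalman filter. The Kalman smoother starts from the initial conditions $y_{T|T}$ and $P_{T|T}$ delivered by the filter. The smoother algorithm can then be written as:

$$y_{t-1|T} = y_{t-1|t-1} + J_{t-1} \left(y_{t|T} - y_{t|t-1}\right) \tag{1.36}$$

$$P_{t-1|T} = P_{t-1|t-1} + J_{t-1} \left(P_{t|T} - P_{t|t-1}\right) J'_{t-1} \tag{1.37}$$

$$J_{t-1} = P_{t-1|t-1}\, \Gamma' \left[P_{t|t-1}\right]^{-1} \tag{1.38}$$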

The filtering pass of the smoothing algorithm therefore generates the values $y_{t|t}$ and $P_{t|t}$. So far the parameters have been assumed known; if they need to be determined, this can be done by maximizing the innovations form of the likelihood function. Up to an additive constant, the log-likelihood can be expressed as:

$$\log L = -\frac{1}{2} \sum_{t=1}^{n} \log \left|\Sigma_t(\Theta)\right| - \frac{1}{2} \sum_{t=1}^{n} v_t(\Theta)'\, \Sigma_t(\Theta)^{-1}\, v_t(\Theta) \tag{1.39}$$

where $\Theta$ is the vector of unknown parameters, written explicitly to emphasize the dependence of the log-likelihood function on the parameters of the model.
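The filter recursions (1.29) to (1.35) and the innovations log-likelihood (1.39) can be sketched as follows. This is a minimal illustration under several assumptions that are not in the thesis: the observation matrix `A` is held constant over time, the function name `kalman_filter` is hypothetical, and the example data come from a simulated scalar local-level model with arbitrary variances.

```python
import numpy as np

def kalman_filter(z, Gamma, A, Q, R, mu0, Sigma0):
    """Kalman filter for y_t = Gamma y_{t-1} + w_t, z_t = A y_t + v_t.
    Returns filtered means, filtered covariances and the innovations
    log-likelihood (additive constant omitted)."""
    n, p = len(z), len(mu0)
    y_filt = np.zeros((n, p))
    P_filt = np.zeros((n, p, p))
    loglik = 0.0
    y_prev, P_prev = mu0, Sigma0
    for t in range(n):
        y_pred = Gamma @ y_prev                      # state prediction
        P_pred = Gamma @ P_prev @ Gamma.T + Q        # prediction covariance
        v = z[t] - A @ y_pred                        # innovation
        S = A @ P_pred @ A.T + R                     # innovation covariance
        K = P_pred @ A.T @ np.linalg.inv(S)          # Kalman gain
        y_prev = y_pred + K @ v                      # filtered state
        P_prev = (np.eye(p) - K @ A) @ P_pred        # filtered covariance
        y_filt[t], P_filt[t] = y_prev, P_prev
        loglik += -0.5 * (np.log(np.linalg.det(S)) + v @ np.linalg.solve(S, v))
    return y_filt, P_filt, loglik

# Simulated scalar local-level model with hypothetical variances.
rng = np.random.default_rng(1)
n = 200
Gamma = np.array([[1.0]])
A = np.array([[1.0]])
Q = np.array([[0.1]])
R = np.array([[1.0]])
state = np.cumsum(np.sqrt(0.1) * rng.standard_normal(n))
z = (state + rng.standard_normal(n)).reshape(n, 1)

y_filt, P_filt, loglik = kalman_filter(z, Gamma, A, Q, R,
                                       mu0=np.zeros(1), Sigma0=np.eye(1))
```

The explicit matrix inverse keeps the code close to the formulas; a linear solve would be numerically preferable for larger systems.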

1.6 Hidden Markov Models

The basic concepts of Hidden Markov Models are related to state space models. The simplest model for count data is the Poisson process, in which the variables are independently and identically distributed and the variance equals the mean. Counts can instead be generated from two Poisson processes with different means, with one of the means selected at each time by a random process, referred to here as the emission process. If the emission process is a Markov chain, then the observed counts exhibit serial dependence along with overdispersion. The word hidden signifies that the states cannot be observed directly in the data; they form hidden or latent data following a Markov chain process. Therefore, in a Hidden Markov Model an unobserved sequence of states follows a Markov chain with finite state space, and the probability of any observation at time $t$ depends on the current state.

The Hidden Markov Model, which is a mixture of densities over time, can also be used as an alternative to the Kalman filter, which is factor analysis over time. The Kalman filter involves linear measurements and a linear evolution of states, whereas the Hidden Markov Model represents a nonlinear evolution with measurement of one state at a time. In a Hidden Markov Model, the Viterbi algorithm is used to estimate the most likely state sequence, while the Baum-Welch forward-backward algorithm is used to estimate the parameters following Expectation Maximization. An HMM consists of the following components:

• Hidden states: The states of a system that can be described by a Markov process,

• Observable states: The states of the process that are visible, i.e. measurable,

• Pi-Vector: Contains the probability that the model is in one of the hidden states at the initial time,

• State transition matrix: Contains the probability that a hidden state will evolve to another state, given the previous state,

• Emission probability matrix: Contains the probability that a particular measurable state can be observed, provided that the model is in one of the hidden states.
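Given these components, the Viterbi algorithm mentioned above can be sketched for a discrete-output HMM. The sketch is illustrative only: the function name `viterbi`, the parameter values, and the observation sequence are hypothetical. It assumes the index convention `A[i, j]` = probability of moving to state `i` from state `j` and `B[k, i]` = probability of observing symbol `k` in state `i`, so the columns of both matrices sum to one, and it assumes all probabilities are strictly positive so that logarithms are finite.

```python
import numpy as np

def viterbi(obs, pi, A, B):
    """Most likely hidden state sequence for a discrete-output HMM.
    A[i, j] = p(state i at t | state j at t-1); B[k, i] = p(symbol k | state i)."""
    T, K = len(obs), len(pi)
    logA, logB, logpi = np.log(A), np.log(B), np.log(pi)
    delta = np.zeros((T, K))            # best log-probability of a path ending in each state
    back = np.zeros((T, K), dtype=int)  # best predecessor of each state
    delta[0] = logpi + logB[obs[0]]
    for t in range(1, T):
        # scores[i, j]: best path ending in state j at t-1, followed by a move to i
        scores = logA + delta[t - 1][None, :]
        back[t] = scores.argmax(axis=1)
        delta[t] = scores.max(axis=1) + logB[obs[t]]
    path = np.zeros(T, dtype=int)
    path[-1] = delta[-1].argmax()
    for t in range(T - 2, -1, -1):      # trace the best predecessors backward
        path[t] = back[t + 1, path[t + 1]]
    return path

# Hypothetical two-state, two-symbol model (columns of A and B sum to one).
pi = np.array([0.5, 0.5])
A = np.array([[0.9, 0.2],
              [0.1, 0.8]])
B = np.array([[0.8, 0.1],
              [0.2, 0.9]])
obs = np.array([0, 0, 0, 1, 1, 1])

path = viterbi(obs, pi, A, B)
```

Working in logarithms avoids numerical underflow on long sequences, which is why the products of probabilities appear here as sums.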

A hidden Markov model is a standard Markov process augmented by a set of measurable states and several probabilistic relations between those states and the hidden states. In developing the algorithms for the model, we write the joint probability over the hidden states $y$ and the observed states $x$, and then use the Markov property to simplify it. This property allows us to assume that all information about the history of the states is summarized by the value of the state at the previous time step. The total probability is

$$p(y^T, x^T) = p(y_1) \prod_{t=2}^{T} p(y_t \mid y_{t-1}) \prod_{t=1}^{T} p(x_t \mid y_t) \tag{1.40}$$

where $y^T$ denotes $y_1, y_2, \ldots, y_T$ and $x^T$ denotes $x_1, x_2, \ldots, x_T$. The following three quantities are used to characterize an HMM:

$$\begin{aligned}
A_{i,j} &= p(y_t = i \mid y_{t-1} = j) \\
B_{t,i} &= p(x_t \mid y_t = i) \\
\pi_{t,i} &= p(y_t = i)
\end{aligned} \tag{1.41}$$

These three equations define an HMM, which is therefore referred to as the set $\lambda = \{A, B, \pi\}$. We also require the following definitions:

$$\gamma_{t,i} = p(y_t = i \mid x^T) \tag{1.42}$$

$$\Omega_{t,ij} = p(y_t = i, y_{t-1} = j \mid x^T) \tag{1.43}$$

$$\alpha_{t,i} = p(y_t = i \mid x^t) \tag{1.44}$$

$$\kappa_t = p(x_t \mid x^{t-1}) \tag{1.45}$$

$$\beta_{t,i} = \frac{p(x_{t+1}, \ldots, x_T \mid y_t = i)}{p(x_{t+1}, \ldots, x_T \mid x^t)} \tag{1.46}$$

1.7 Hidden Markov Model Parameter Estimation

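The expected log-likelihood $Q$, summed over $N$ observation sequences indexed by $n$, is used to determine the parameters:

$$\begin{aligned}
Q &= \sum_{n=1}^{N} \int dy^T\, p(y^T \mid x^{T,n}) \log \left[ p(y_1) \prod_{t=2}^{T} p(y_t \mid y_{t-1}) \prod_{t=1}^{T} p(x^n_t \mid y_t) \right] \\
&= \sum_{n} \int dy_1\, p(y_1 \mid x^{T,n}) \log p(y_1) \\
&\quad + \sum_{n} \sum_{t=2}^{T} \int dy_t\, dy_{t-1}\, p(y_t, y_{t-1} \mid x^{T,n}) \log p(y_t \mid y_{t-1}) \\
&\quad + \sum_{n} \sum_{t=1}^{T} \int dy_t\, p(y_t \mid x^{T,n}) \log p(x^n_t \mid y_t) \\
&= \sum_{n} \sum_{i} \gamma^n_{1,i} \log \pi_{1,i} + \sum_{n} \sum_{t=2}^{T} \sum_{i,j} \Omega^n_{t,ij} \log A_{i,j} + \sum_{n} \sum_{t=1}^{T} \sum_{i} \gamma^n_{t,i} \log B_{t,i}
\end{aligned} \tag{1.47}$$

The Baum-Welch Algorithm (1970) is used to maximize the function simplified above, since the hidden states are not observed. The result is equivalent to the Expectation Maximization Algorithm. The above expression is maximized subject to two constraints:

$$\sum_{i} \pi_{1,i} = 1 \tag{1.48}$$

$$\sum_{i} A_{i,j} = 1 \quad \forall j \tag{1.49}$$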

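Finally, using Lagrange multipliers, the updated objective function is:

$$L = Q - \lambda_\pi \left( \sum_{i} \pi_{1,i} - 1 \right) - \sum_{j} \lambda_j \left( \sum_{i} A_{i,j} - 1 \right) \tag{1.50}$$

To maximize, the derivative of the objective function is calculated and set equal to zero:

$$\frac{\partial L}{\partial \pi_{1,i}} = \sum_{n} \frac{\gamma^n_{1,i}}{\pi_{1,i}} - \lambda_\pi = 0 \;\Rightarrow\; \pi^{new}_{1,i} = \frac{1}{\lambda_\pi} \sum_{n=1}^{N} \gamma^n_{1,i} \tag{1.51}$$

The Lagrange coefficient is then derived from the constraint relation, giving:

$$\pi^{new}_{1,i} = \frac{1}{N} \sum_{n=1}^{N} \gamma^n_{1,i} \tag{1.52}$$

Similarly for matrix $A$:

$$\frac{\partial L}{\partial A_{i,j}} = \sum_{n} \sum_{t=2}^{T} \frac{\Omega^n_{t,ij}}{A_{i,j}} - \lambda_j = 0 \tag{1.53}$$

$$A^{new}_{i,j} = \frac{1}{\lambda_j} \sum_{n=1}^{N} \sum_{t=2}^{T} \Omega^n_{t,ij} = \frac{\sum_{n=1}^{N} \sum_{t=2}^{T} \Omega^n_{t,ij}}{\sum_{n=1}^{N} \sum_{t=2}^{T} \gamma^n_{t-1,j}} \tag{1.54}$$

where $\sum_{i} \Omega^n_{t,ij} = \gamma^n_{t-1,j}$ has been used. The update is the expected number of transitions from state $j$ to state $i$, divided by the expected number of transitions from state $j$.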

Next, the emission probabilities are determined. They can be expressed as:

$$p(x_t \mid y_t = i; \theta) = B_{t,i}(\theta) \tag{1.55}$$

where $\theta$ is updated as:

$$\theta^{new} = \arg\max_{\theta} \sum_{n=1}^{N} \sum_{t=1}^{T} \sum_{i} \gamma^n_{t,i} \log B_{t,i}(\theta) \tag{1.56}$$

Here we are only concerned with discrete output, so the parameters are the probability masses for the $k$ states of the output $x_t$:

$$B_{k,i} = p(x_t = k \mid y_t = i) \tag{1.57}$$

These are assumed to be independent of time and must satisfy the constraint:

$$\sum_{k} B_{k,i} = 1 \quad \forall i \tag{1.58}$$

Similarly, we again use a Lagrange multiplier to show that:

$$B^{new}_{k,i} = \frac{\sum_{n,t \,:\, x^n_t = k} \gamma^n_{t,i}}{\sum_{n=1}^{N} \sum_{t=1}^{T} \gamma^n_{t,i}} \tag{1.59}$$

The updated entry $B^{new}_{k,i}$ is the expected number of times the system is in state $i$ while the output is $k$, divided by the expected number of times the system is in state $i$. The maximization step of the algorithm is now complete.

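After the maximization step of the algorithm, the quantities $\Omega_{t,ij}$ and $\gamma_{t,i}$ are recomputed. This is achieved by computing $\alpha_{t,i}$, $\kappa_t$ and $\beta_{t,i}$ as follows:

$$\alpha_t = p(y_t \mid x^t) = \frac{\sum_{y_{t-1}} p(x_t \mid y_t)\, p(y_t \mid y_{t-1})\, p(y_{t-1} \mid x^{t-1})}{p(x_t \mid x^{t-1})} \tag{1.60}$$

$$\alpha_{t,i} = \frac{\sum_{j} B_{t,i} A_{i,j}\, \alpha_{t-1,j}}{\kappa_t} \tag{1.61}$$

The forward recursion is initialized with:

$$\alpha_1 = p(y_1 \mid x_1) = \frac{p(x_1 \mid y_1)\, p(y_1)}{\sum_{y_1} p(x_1 \mid y_1)\, p(y_1)} \tag{1.62}$$

$$\alpha_{1,i} = \frac{B_{1,i}\, \pi_{1,i}}{\sum_{j} B_{1,j}\, \pi_{1,j}} \tag{1.63}$$

Then $\kappa$ is calculated as:

$$\kappa_t = p(x_t \mid x^{t-1}) = \sum_{y_t} \sum_{y_{t-1}} p(x_t \mid y_t)\, p(y_t \mid y_{t-1})\, p(y_{t-1} \mid x^{t-1}) \tag{1.64}$$

$$\kappa_t = \sum_{i,j} B_{t,i} A_{i,j}\, \alpha_{t-1,j} \tag{1.65}$$

and the initial value is:

$$\kappa_1 = p(x_1) = \sum_{y_1} p(x_1 \mid y_1)\, p(y_1) \tag{1.66}$$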

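The backward recursion is derived as follows:

$$\begin{aligned}
\beta_{t-1} &= \frac{p(x_t, \ldots, x_T \mid y_{t-1})}{p(x_t, \ldots, x_T \mid x^{t-1})} \\
&= \frac{\sum_{y_t} p(x_t, \ldots, x_T, y_t \mid y_{t-1})}{p(x_t \mid x^{t-1})\, p(x_{t+1}, \ldots, x_T \mid x^t)} \\
&= \frac{\sum_{y_t} p(x_t, \ldots, x_T \mid y_t)\, p(y_t \mid y_{t-1})}{p(x_t \mid x^{t-1})\, p(x_{t+1}, \ldots, x_T \mid x^t)} \\
&= \frac{\sum_{y_t} p(x_t \mid y_t)\, p(x_{t+1}, \ldots, x_T \mid y_t)\, p(y_t \mid y_{t-1})}{p(x_t \mid x^{t-1})\, p(x_{t+1}, \ldots, x_T \mid x^t)}
\end{aligned} \tag{1.67}$$

Then

$$\beta_{t-1,j} = \frac{\sum_{i} B_{t,i}\, \beta_{t,i}\, A_{i,j}}{\kappa_t} \tag{1.68}$$

The recursion is started at the final time step:

$$\beta_{T-1,j} = \frac{p(x_T \mid y_{T-1})}{p(x_T \mid x^{T-1})} = \frac{\sum_{y_T} p(x_T \mid y_T)\, p(y_T \mid y_{T-1})}{\kappa_T} \tag{1.69}$$

so that the initialization is:

$$\beta_{T-1,j} = \frac{\sum_{i} B_{T,i}\, A_{i,j}}{\kappa_T} \tag{1.70}$$

$$\beta_{T,j} = 1 \tag{1.71}$$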

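The quantities $\gamma_t$ and $\Omega_t$ can be calculated using the values of $\alpha_t$, $\kappa_t$ and $\beta_t$:

$$\begin{aligned}
\gamma_t &= p(y_t \mid x^T) \\
&= \frac{p(y_t, x^T)}{p(x^T)} \\
&= \frac{p(x_{t+1}, \ldots, x_T, y_t \mid x^t)\, p(x^t)}{p(x_{t+1}, \ldots, x_T \mid x^t)\, p(x^t)} \\
&= \frac{p(y_t \mid x^t)\, p(x_{t+1}, \ldots, x_T \mid y_t)}{p(x_{t+1}, \ldots, x_T \mid x^t)}
\end{aligned} \tag{1.72}$$

Hence $\gamma_{t,i}$ is obtained:

$$\gamma_{t,i} = \alpha_{t,i}\, \beta_{t,i} \tag{1.73}$$

And the final recursion:

$$\Omega_t = p(y_t, y_{t-1} \mid x^T) = \frac{p(x_t, \ldots, x_T \mid y_t)\, p(y_t \mid y_{t-1})\, p(y_{t-1} \mid x^{t-1})}{p(x_t \mid x^{t-1})\, p(x_{t+1}, \ldots, x_T \mid x^t)} \tag{1.74}$$

Finally, the expectation step is derived:

$$\Omega_{t,ij} = \frac{B_{t,i}\, \beta_{t,i}\, A_{i,j}\, \alpha_{t-1,j}}{\kappa_t} \tag{1.75}$$

The EM algorithm alternates between the E-step and the M-step until convergence is achieved.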
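The E-step and M-step described in this section can be put together in a short sketch for a discrete-output HMM with a single observation sequence ($N = 1$). It is an illustration under assumptions, not the thesis's implementation: the function names `e_step` and `m_step`, the simulated data, and the starting values are hypothetical. The code follows the index convention $A_{i,j} = p(y_t = i \mid y_{t-1} = j)$ and $B_{k,i} = p(x_t = k \mid y_t = i)$, so the columns of `A` and `B` sum to one.

```python
import numpy as np

def e_step(obs, pi, A, B):
    """Scaled forward-backward pass.
    A[i, j] = p(y_t = i | y_{t-1} = j); B[k, i] = p(x_t = k | y_t = i)."""
    T, K = len(obs), len(pi)
    alpha = np.zeros((T, K))
    beta = np.ones((T, K))                       # beta at the final time is one
    kappa = np.zeros(T)
    unnorm = B[obs[0]] * pi
    kappa[0] = unnorm.sum()                      # p(x_1)
    alpha[0] = unnorm / kappa[0]
    for t in range(1, T):                        # forward recursion
        unnorm = B[obs[t]] * (A @ alpha[t - 1])
        kappa[t] = unnorm.sum()
        alpha[t] = unnorm / kappa[t]
    for t in range(T - 1, 0, -1):                # backward recursion
        beta[t - 1] = A.T @ (B[obs[t]] * beta[t]) / kappa[t]
    gamma = alpha * beta                         # smoothed state probabilities
    Omega = np.zeros((T, K, K))                  # smoothed transition probabilities
    for t in range(1, T):
        Omega[t] = (B[obs[t]] * beta[t])[:, None] * A * alpha[t - 1][None, :] / kappa[t]
    return gamma, Omega, np.log(kappa).sum()     # last value: log-likelihood of obs

def m_step(obs, gamma, Omega, n_symbols):
    """Re-estimate pi, A and B from the E-step quantities."""
    pi_new = gamma[0]
    A_new = Omega[1:].sum(axis=0) / gamma[:-1].sum(axis=0)[None, :]
    B_new = np.zeros((n_symbols, gamma.shape[1]))
    for k in range(n_symbols):
        B_new[k] = gamma[obs == k].sum(axis=0)   # expected visits to i with output k
    B_new /= gamma.sum(axis=0)[None, :]
    return pi_new, A_new, B_new

# Simulate a sequence from a hypothetical two-state, two-symbol HMM.
rng = np.random.default_rng(0)
A_true = np.array([[0.9, 0.2], [0.1, 0.8]])
B_true = np.array([[0.8, 0.1], [0.2, 0.9]])
T = 300
states = np.zeros(T, dtype=int)
for t in range(1, T):
    states[t] = rng.choice(2, p=A_true[:, states[t - 1]])
obs = np.array([rng.choice(2, p=B_true[:, s]) for s in states])

# EM iterations from arbitrary starting values.
pi = np.array([0.5, 0.5])
A = np.array([[0.7, 0.4], [0.3, 0.6]])
B = np.array([[0.6, 0.3], [0.4, 0.7]])
logliks = []
for _ in range(25):
    gamma, Omega, ll = e_step(obs, pi, A, B)
    logliks.append(ll)
    pi, A, B = m_step(obs, gamma, Omega, n_symbols=2)
```

The list `logliks` records the log-likelihood at each iteration, which gives a simple convergence check: under EM it should never decrease.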

Chapter 2

Markov Switching Models

The chapter starts with the definition of a finite Markov mixture distribution, whose properties are studied in some detail, then introduces the basic Markov switching model and deals with its extensions.

Finite mixture models are extended to deal with time series data that exhibit dependence over time. Broadly speaking, this is achieved by replacing the discrete latent indicator $S_t$, introduced as an allocation variable for finite mixture models, with a hidden Markov chain. This leads to a surprisingly rich class of nonlinear time series models that solve a variety of interesting problems in applied time series analysis.

2.1 Finite Markov Mixture Distributions

Let $\{y_t,\ t = 1, \ldots, T\}$ denote a time series of $T$ univariate observations taking values in a sampling space that may be either discrete or continuous. As is common in time series analysis, $\{y_t,\ t = 1, \ldots, T\}$ is considered to be the realization of a stochastic process $\{Y_t\}_{t=1}^{T}$.

It is assumed that the probability distribution of the stochastic process $Y_t$ depends on the realizations of a hidden discrete stochastic process $S_t$. The stochastic process $Y_t$ is directly observable, whereas $S_t$ is a latent random process that is observable only indirectly through the effect it has on the realizations of $Y_t$. A simple example is the hidden Markov chain model $Y_t = \mu_{S_t} + \varepsilon_t$, where $\varepsilon_t$ is a zero-mean white noise process with variance $\sigma^2$.
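To make the hidden Markov chain model concrete, the following sketch simulates a series whose mean level switches with a latent state. It is a minimal illustration, not the thesis's own code: the function name, the Gaussian noise, the uniform draw of the initial state and the numerical values are all assumptions.

```python
import random


def simulate_hidden_markov_mean(T, P, mu, sigma, seed=0):
    """Simulate Y_t = mu[S_t] + eps_t for t = 1, ..., T.

    P[j][k] is the probability of moving from state j to state k
    (each row sums to one); eps_t is Gaussian white noise with
    standard deviation sigma.
    """
    rng = random.Random(seed)
    K = len(mu)
    state = rng.randrange(K)  # assumption: uniform draw of the initial state
    states, ys = [], []
    for _ in range(T):
        state = rng.choices(range(K), weights=P[state])[0]
        states.append(state)
        ys.append(mu[state] + rng.gauss(0.0, sigma))
    return states, ys


# Hypothetical two-regime example: a persistent low-mean and high-mean state.
P = [[0.95, 0.05], [0.10, 0.90]]
mu = [0.0, 3.0]
states, ys = simulate_hidden_markov_mean(200, P, mu, sigma=1.0)
```

Only `ys` would be available to an analyst; `states` plays the role of the latent process that has to be inferred.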


2.2 Statistical Modeling Based on Finite Markov Mixture Distributions

Researchers have found Markov mixture models increasingly useful in applied time series

analysis.

2.2.1 The Basic Markov Switching Model

Assume that a time series $y_1, \ldots, y_T$ is observed as a single realization of a stochastic process $Y_1, \ldots, Y_T$. In the basic Markov switching model the time series $y_1, \ldots, y_T$ is assumed to be a realization of a stochastic process $Y_t$ generated by a finite Markov mixture from a specific distribution family:

$$Y_t \mid S_t \sim \mathcal{T}(\theta_{S_t}) \tag{2.1}$$

where $S_t$ is a hidden K-state Markov chain and $Y_t$ satisfies the following assumption: conditional on knowing $S = (S_0, \ldots, S_T)$, the random variables $Y_1, \ldots, Y_T$ are stochastically independent. For each $t \geq 1$, the distribution of $Y_t$ arises from one out of K distributions $\mathcal{T}(\theta_1), \ldots, \mathcal{T}(\theta_K)$, depending on the state of $S_t$:

$$Y_t \mid S_t = k \sim \mathcal{T}(\theta_k) \tag{2.2}$$

The basic Markov switching model has found widespread application in many practical areas, including bioinformatics, biology, economics, finance, hydrology, marketing, medicine, and speech recognition, and a varied terminology has come into use for models based on hidden Markov chains. Because one may choose Markov mixtures of any discrete distribution, it is possible to model many different types of discrete-valued time series data, for example, binary time series by

$$P(Y_t = 1 \mid S_t) = \pi_{S_t} \tag{2.3}$$

time series of bounded counts by a Markov mixture of binomial distributions,

$$Y_t \mid S_t \sim \mathrm{BiNom}(n_t, \pi_{S_t}) \tag{2.4}$$

or time series of unbounded counts by a Markov mixture of Poisson distributions,

$$Y_t \mid S_t \sim \mathcal{P}(\mu_{S_t}) \tag{2.5}$$


An important feature of applying Markov mixture models to discrete-valued time series is the

ease with which autocorrelation is introduced, and the properties of the marginal distribution

are easily analyzed. The basic Markov switching model has been generalized in several ways as

outlined in the following sections.
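As a rough illustration of how a hidden Markov chain induces autocorrelation in a discrete-valued series, the sketch below simulates the binary model (2.3) and computes the lag-one sample autocorrelation. The function names, the choice to start the chain in state 0 and the parameter values are assumptions made only for this example.

```python
import random


def simulate_binary_markov_mixture(T, P, pi, seed=0):
    """Simulate a binary series with P(Y_t = 1 | S_t = k) = pi[k].

    P[j][k] is the transition probability from state j to state k.
    """
    rng = random.Random(seed)
    state = 0  # assumption: the chain starts in state 0
    states, ys = [], []
    for _ in range(T):
        state = rng.choices(range(len(pi)), weights=P[state])[0]
        states.append(state)
        ys.append(1 if rng.random() < pi[state] else 0)
    return states, ys


def lag1_autocorrelation(ys):
    """Lag-one sample autocorrelation (undefined for a constant series)."""
    n = len(ys)
    mean = sum(ys) / n
    var = sum((y - mean) ** 2 for y in ys)
    cov = sum((ys[t] - mean) * (ys[t - 1] - mean) for t in range(1, n))
    return cov / var


# Hypothetical persistent two-state chain with very different success rates.
P = [[0.95, 0.05], [0.05, 0.95]]
states, ys = simulate_binary_markov_mixture(2000, P, pi=[0.1, 0.9])
rho1 = lag1_autocorrelation(ys)
```

Given the states the draws are independent, so any serial correlation in `ys` comes from the persistence of the hidden chain.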

2.2.2 The Markov Switching Regression Model

An early attempt at introducing Markov switching models into econometrics, in order to deal with time series data that depend on exogenous variables, is the switching regression model of Goldfeld and Quandt (1973), which extends the switching regression model of Quandt (1972). Whereas Quandt (1972) assumes that $S_t$ is an i.i.d. random sequence, Goldfeld and Quandt (1973) allow explicitly for dependence between the states by modeling $S_t$ as a two-state hidden Markov chain. The general Markov switching regression model is:

$$Y_t = x_t \beta_{S_t} + \varepsilon_t, \qquad \varepsilon_t \sim N(0, \sigma^2_{\varepsilon, S_t}) \tag{2.6}$$

where $S_t$ is a hidden Markov chain and $x_t$ is a row vector of explanatory variables including the constant. For discrete-valued explanatory variables, the Markov switching regression model will suffer from the same identifiability problems as the standard finite mixture of regression models.
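Under model (2.6), each regime implies a normal density for $Y_t$ with its own regression coefficients and error variance. The sketch below evaluates these state-conditional densities for one observation; the function name and the numbers are hypothetical, and values of this kind are what a filtering recursion would take as its emission likelihoods.

```python
import math


def regime_densities(y_t, x_t, betas, sigmas):
    """Density of y_t under each regime of the switching regression (2.6).

    x_t is the row vector of regressors (including the constant),
    betas[k] the coefficient vector and sigmas[k] the error standard
    deviation of regime k.
    """
    dens = []
    for beta_k, sigma_k in zip(betas, sigmas):
        mean_k = sum(x * b for x, b in zip(x_t, beta_k))
        z = (y_t - mean_k) / sigma_k
        dens.append(math.exp(-0.5 * z * z) / (sigma_k * math.sqrt(2.0 * math.pi)))
    return dens


# Hypothetical example: one regressor plus a constant, two regimes.
x_t = [2.0, 1.0]
betas = [[1.0, 0.0], [0.5, 1.0]]
sigmas = [1.0, 2.0]
dens = regime_densities(2.0, x_t, betas, sigmas)
```

In this example both regimes predict the same mean, so the densities differ only through the regime-specific variance.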

2.3 Nonlinear Time Series Analysis Based on Markov Switching Models

In practical time series analysis, important aspects are the properties of the marginal distribution of $Y_t$ as well as the properties of the one-step ahead predictive density $p(y_t \mid \mathbf{y}^{t-1}, \vartheta)$ implied by the chosen time series model, where $\mathbf{y}^{t-1} = (y_1, \ldots, y_{t-1})$ denotes the observed history. Typical stylized facts of the marginal distribution of practical time series are asymmetry and nonnormality with rather fat tails, and autocorrelation not only in the level $Y_t$ but also in the squared process $Y_t^2$.

It is well known that standard ARMA models (Box and Jenkins, 1970) often are not able to capture the stylized facts of practical time series. Some unrealistic features of ARMA models based on normal errors are normality of the predictive as well as the marginal density, linearity of the expectation $E(Y_t \mid \mathbf{y}^{t-1}, \vartheta)$, and homoscedasticity of the variance $\mathrm{Var}(Y_t \mid \mathbf{y}^{t-1}, \vartheta)$.

This chapter discusses Markov switching models, which constitute another very flexible class of nonlinear time series models and are able to capture many features of practical time series through appropriate modifications of the basic Markov switching model introduced in Subsection 2.2.1. Section 2.4 deals with the Markov switching autoregressive model, and Section 2.5 considers the related Markov switching dynamic regression model and highlights that Markov switching models give rise to very flexible predictive distributions. Section 2.6 deals with Markov switching conditional heteroscedasticity, where switching ARCH models are introduced. The same section then studies further extensions, namely hidden Markov chains with time-varying transition probabilities and hidden Markov models for longitudinal data and multivariate time series.

2.4 The Markov Switching Autoregressive Model

A Markov mixture model introduces autocorrelation in the process $Y_t$ even in the basic Markov switching model, where, conditional on knowing the states, the process $Y_t$ is uncorrelated. This section introduces the Markov switching autoregressive model, which deals with autocorrelation in a more flexible way than the basic Markov switching model.

2.4.1 Model Deﬁnition

The standard model to capture autocorrelation is the AR(p) model,

$$Y_t - \mu = \delta_1 (Y_{t-1} - \mu) + \cdots + \delta_p (Y_{t-p} - \mu) + \varepsilon_t \tag{2.7}$$

where $\varepsilon_t \sim N(0, \sigma^2_\varepsilon)$, which is equivalent to the model:

$$Y_t = \delta_1 Y_{t-1} + \cdots + \delta_p Y_{t-p} + \zeta + \varepsilon_t \tag{2.8}$$

where $\zeta = \mu (1 - \delta_1 - \cdots - \delta_p)$. An important extension of the basic Markov switching model is the Markov switching autoregressive (MSAR) model, in which a hidden Markov chain is introduced into the AR model. It follows the work of Hamilton (1989), who allowed for a random shift in the mean level of process (2.7) through a two-state hidden Markov chain:

$$Y_t - \mu_{S_t} = \delta_1 (Y_{t-1} - \mu_{S_{t-1}}) + \cdots + \delta_p (Y_{t-p} - \mu_{S_{t-p}}) + \varepsilon_t \tag{2.9}$$


An important alternative to model (2.9) was suggested by McCulloch and Tsay (1994b), who

introduced the hidden Markov chain into (2.8) rather than into (2.7), by assuming that the

intercept is driven by the hidden Markov chain rather than the mean level:

$$Y_t = \delta_1 Y_{t-1} + \cdots + \delta_p Y_{t-p} + \zeta_{S_t} + \varepsilon_t \tag{2.10}$$

Although the parameterizations (2.7) and (2.8) are equivalent for the standard AR model, a model with a Markov switching intercept turns out to be different from a model with a Markov switching mean level. In (2.9), after a one-time change from $S_{t-1}$ to $S_t \neq S_{t-1}$, an immediate mean level shift from $\mu_{S_{t-1}}$ to $\mu_{S_t}$ occurs, whereas in (2.10) the mean level approaches its new value gradually through the autoregressive dynamics. Both models violate the assumption $Y_t \mid S_t = k \sim \mathcal{T}(\theta_k)$, as the one-step ahead predictive density $p(y_t \mid \mathbf{y}^{t-1}, \mathbf{S}^t, \vartheta)$, with $\mathbf{S}^t = (S_0, \ldots, S_t)$, depends upon the past values $\mathbf{y}^{t-1}$. For a model with switching mean level it is evident from (2.9) that the predictive density depends not only on $S_t$ but also on the past values $S_{t-1}, \ldots, S_{t-p}$ of the hidden Markov chain, fulfilling only the assumption

$$p(y_t \mid \mathbf{y}^{t-1}, \mathbf{S}^t, \vartheta) = p(y_t \mid \mathbf{y}^{t-1}, S_t, \ldots, S_{t-p}, \vartheta).$$

On the other hand, for a model with switching intercept the predictive density depends only on $S_t$, and such a process fulfills the stronger condition

$$p(y_t \mid \mathbf{y}^{t-1}, \mathbf{S}^t, \vartheta) = p(y_t \mid \mathbf{y}^{t-1}, S_t, \vartheta).$$

As a result, econometric inference for an MSAR model with switching intercept is not more complicated than for the basic Markov switching model, whereas for an MSAR model with switching mean, inference on the hidden Markov chain $S_t$ is far more involved.
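The difference between a switching mean level (2.9) and a switching intercept (2.10) is easiest to see with the noise switched off. The sketch below generates both paths for $p = 1$ from the same state sequence; the function name, the starting value and the parameter values are assumptions chosen for illustration, with the intercepts set to $\zeta_k = \mu_k (1 - \delta)$ so that both models share the same regime means.

```python
def msar1_paths(states, mu, delta, eps):
    """Paths of two MS-AR(1) models driven by the same states and shocks.

    Switching mean (2.9):      Y_t - mu[S_t] = delta * (Y_{t-1} - mu[S_{t-1}]) + eps_t
    Switching intercept (2.10): Y_t = delta * Y_{t-1} + zeta[S_t] + eps_t
    with zeta[k] = mu[k] * (1 - delta).
    """
    zeta = [m * (1.0 - delta) for m in mu]
    prev_state = states[0]
    prev_mean = prev_int = mu[prev_state]  # assumption: start at the regime mean
    y_mean, y_int = [], []
    for s, e in zip(states, eps):
        cur_mean = mu[s] + delta * (prev_mean - mu[prev_state]) + e
        cur_int = delta * prev_int + zeta[s] + e
        y_mean.append(cur_mean)
        y_int.append(cur_int)
        prev_mean, prev_int, prev_state = cur_mean, cur_int, s
    return y_mean, y_int


# Hypothetical example: one regime change at the fourth observation, no noise.
states = [0, 0, 0, 1, 1, 1]
y_mean, y_int = msar1_paths(states, mu=[0.0, 2.0], delta=0.5, eps=[0.0] * 6)
```

In this noise-free example the switching-mean path jumps to the new level in the period of the regime change, while the switching-intercept path moves toward it step by step.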

The assumption that the autoregressive parameters switch between the two states implies different dynamic patterns in the various states, and introduces asymmetry over time. Asymmetry over time between the states is also introduced through the hidden Markov chain, as different persistence probabilities imply different state durations. Subsequently the notation MS(K)-AR(p) is used occasionally to denote a Markov switching autoregressive model with K states and autoregressive order p. A more subtle notation that also differentiates between homo- and heteroscedastic variances, switching in the mean level or in the intercept, as well as between invariant and switching autoregressive parameters, is introduced in Krolzig (1997).

2.5 Markov Switching Dynamic Regression Models

An important extension of both the Markov switching autoregressive model and the Markov switching regression model, discussed in Subsection 2.2.2, is the Markov switching dynamic regression model. A simple example is:

$$Y_t - \mu_{S_t} - z_t \beta = \delta_1 (Y_{t-1} - \mu_{S_{t-1}} - z_{t-1} \beta) + \cdots + \delta_p (Y_{t-p} - \mu_{S_{t-p}} - z_{t-p} \beta) + \varepsilon_t \tag{2.11}$$

where the regression coefficient $\beta$ is considered to be unaffected by $S_t$. In the following dynamic regression model all parameters, including the regression coefficient $\beta$, are affected by endogenous regime shifts following a hidden Markov chain:

$$Y_t = \delta_{S_t,1} Y_{t-1} + \cdots + \delta_{S_t,p} Y_{t-p} + z_t \beta_{S_t} + \zeta_{S_t} + \varepsilon_t \tag{2.12}$$

For estimation it is useful to view this model as a Markov switching regression model as in Subsection 2.2.2, without distinguishing between endogenous variables, exogenous variables, and the intercept:

$$Y_t = x_t \beta_{S_t} + \varepsilon_t \tag{2.13}$$

where $x_t = (y_{t-1} \; \cdots \; y_{t-p} \;\; z_{t1} \; \cdots \; z_{td} \;\; 1)$. In the mixed-effects Markov switching dynamic regression model only certain elements of the parameter $\beta_{S_t}$ in (2.13) actually depend on the state of the hidden Markov chain, and others are state independent (McCulloch and Tsay, 1994b):

$$Y_t = x_t^f \alpha + x_t^r \beta_{S_t} + \varepsilon_t \tag{2.14}$$

where $x_t^f$ are those columns of $x_t$ that correspond to the state-independent parameters $\alpha$, whereas the columns of $x_t^r$ correspond to the state-dependent parameters.
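As a small illustration of the stacked regressor in (2.13), the helper below assembles $x_t$ from lagged observations, the exogenous variables and the constant. The function name and the 0-based indexing are assumptions for this sketch only.

```python
def design_row(y, z, t, p):
    """Return x_t = (y_{t-1}, ..., y_{t-p}, z_{t1}, ..., z_{td}, 1).

    y is the list of observations, z[t] the list of d exogenous variables
    at time t, and t a 0-based index with t >= p.
    """
    lags = [y[t - lag] for lag in range(1, p + 1)]
    return lags + list(z[t]) + [1.0]


# Hypothetical example with p = 2 lags and d = 1 exogenous variable.
y = [1.0, 2.0, 3.0, 4.0]
z = [[10.0], [20.0], [30.0], [40.0]]
x_3 = design_row(y, z, t=3, p=2)
```

Splitting such a row into the columns with fixed coefficients and those with switching coefficients gives the two parts $x_t^f$ and $x_t^r$ of (2.14).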

2.6 Markov Switching Conditional Heteroscedasticity

Markov switching models are often used by researchers to account for specific features of financial time series such as asymmetries, fat tails, and volatility clusters.

To deal with skewness and excess kurtosis in the unconditional distribution of daily stock returns, standard finite mixtures of normal distributions have been applied quite frequently (Fama, 1965; Granger and Orr, 1972; Kon, 1984; Tucker, 1992). Such a modeling approach, however, is appropriate for time series data only if the processes $Y_t$ and $Y_t^2$ do not exhibit autocorrelation. Volatility clustering implies persistence of states of high volatility and leads to the rejection of standard time series models in favor of time series models that allow the conditional variance $\mathrm{Var}(Y_t \mid \mathbf{y}^{t-1}, \vartheta)$ to depend on the history $y_{t-1}, y_{t-2}, \ldots$ of the observed process, such as the autoregressive conditionally heteroscedastic (ARCH) model.

2.6.1 Switching ARCH Models

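A simple model to capture volatility clusters in financial time series is the ARCH model (Engle, 1982), which may be written as

$$\begin{aligned}
Y_t &= \sigma_t \varepsilon_t, \qquad \varepsilon_t \sim N(0, 1), \\
\sigma_t^2 &= \gamma_t + \alpha_1 Y_{t-1}^2 + \cdots + \alpha_m Y_{t-m}^2
\end{aligned} \tag{2.15}$$

with $\gamma_t = \gamma$. Alternatively,

$$\begin{aligned}
Y_t &= \sqrt{\gamma_t h_t}\, \varepsilon_t, \\
h_t &= 1 + \frac{\alpha_1}{\gamma_{t-1}} Y_{t-1}^2 + \cdots + \frac{\alpha_m}{\gamma_{t-m}} Y_{t-m}^2
\end{aligned} \tag{2.16}$$

The two parameterizations are equivalent if $\gamma_t = \gamma$; however, they generate different processes if $\gamma_t$ is time dependent. The switching ARCH model results from allowing time dependence of $\gamma_t$ through a hidden K-state Markov chain, $\gamma_t = \gamma_{S_t}$. Such a switching parameter was introduced by Hamilton and Susmel (1994) into parameterization (2.16):

$$\begin{aligned}
Y_t &= \sqrt{\gamma_{S_t} h_t}\, \varepsilon_t, \\
h_t &= 1 + \frac{\alpha_1}{\gamma_{S_{t-1}}} Y_{t-1}^2 + \cdots + \frac{\alpha_m}{\gamma_{S_{t-m}}} Y_{t-m}^2
\end{aligned} \tag{2.17}$$

whereas Cai (1994) introduced a two-state and Kaufmann and Frühwirth-Schnatter (2002) a K-state switching parameter into parameterization (2.15):

$$\begin{aligned}
Y_t &= \sigma_t \varepsilon_t, \\
\sigma_t^2 &= \gamma_{S_t} + \alpha_1 Y_{t-1}^2 + \cdots + \alpha_m Y_{t-m}^2
\end{aligned} \tag{2.18}$$

Gray (1996) introduced switching into all coefficients of the ARCH process represented by (2.15):

$$\begin{aligned}
Y_t &= \sigma_t \varepsilon_t, \\
\sigma_t^2 &= \gamma_{S_t} + \alpha_{S_t,1} Y_{t-1}^2 + \cdots + \alpha_{S_t,m} Y_{t-m}^2
\end{aligned} \tag{2.19}$$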
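That the two switching ARCH parameterizations coincide for a constant $\gamma$ but differ once $\gamma$ switches can be checked numerically. The sketch below computes the conditional variance of $Y_t$ for an ARCH(1) specification under (2.18) and under (2.17); the function name and all numbers are hypothetical.

```python
def arch1_variances(y, states, gamma, alpha):
    """Conditional variances of Y_t for t = 1, ..., T-1 (0-based) under
    the two switching ARCH(1) parameterizations.

    additive:       sigma_t^2 = gamma[S_t] + alpha * y_{t-1}^2            (2.18)
    multiplicative: gamma[S_t] * h_t,
                    h_t = 1 + alpha / gamma[S_{t-1}] * y_{t-1}^2          (2.17)
    """
    additive, multiplicative = [], []
    for t in range(1, len(y)):
        y2 = y[t - 1] ** 2
        additive.append(gamma[states[t]] + alpha * y2)
        h_t = 1.0 + alpha / gamma[states[t - 1]] * y2
        multiplicative.append(gamma[states[t]] * h_t)
    return additive, multiplicative


# Hypothetical data: the regime changes between the first two observations.
y = [1.0, -1.0, 0.5]
states = [0, 1, 1]
same_add, same_mult = arch1_variances(y, states, gamma=[0.5, 0.5], alpha=0.4)
sw_add, sw_mult = arch1_variances(y, states, gamma=[0.5, 2.0], alpha=0.4)
```

With equal `gamma` values the two lists agree up to rounding; with switching `gamma` they differ in the period right after the regime change, because (2.17) rescales the lagged squared observation by the ratio of the current to the previous regime's scale.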


The switching ARCH model may be combined with a Markov switching autoregressive model

for the mean equation that includes the same hidden Markov chain (Gray, 1996):

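$$\begin{aligned}
Y_t &= \zeta_{S_t} + \delta_{S_t,1} Y_{t-1} + u_t, \\
u_t &= \sigma_t \varepsilon_t, \qquad \varepsilon_t \sim N(0, 1), \\
\sigma_t^2 &= \gamma_{S_t} + \alpha_{S_t,1} u_{t-1}^2 + \cdots + \alpha_{S_t,m} u_{t-m}^2
\end{aligned} \tag{2.20}$$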

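The switching ARCH model has been extended by including a leverage effect in the ARCH specification (Hamilton and Susmel, 1994; Kaufmann and Frühwirth-Schnatter, 2002) to deal with asymmetries in the marginal distribution:

$$\begin{aligned}
Y_t &= \sigma_t \varepsilon_t, \qquad \varepsilon_t \sim N(0, 1), \\
\sigma_t^2 &= \gamma_{S_t} + \alpha_1 y_{t-1}^2 + \cdots + \alpha_m y_{t-m}^2 + \rho\, d_{t-1}\, y_{t-1}^2
\end{aligned} \tag{2.21}$$

where $d_t = 1$ if $y_t \leq 0$, $d_t = 0$ if $y_t > 0$, and $\rho > 0$.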

2.6.2 Switching GARCH Models

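Francq et al. (2001) consider the following switching GARCH(m, n) model, where all coefficients are switching,

$$\begin{aligned}
Y_t &= \sigma_t \varepsilon_t, \qquad \varepsilon_t \sim N(0, 1), \\
\sigma_t^2 &= \gamma_{S_t} + \alpha_{S_t,1} y_{t-1}^2 + \cdots + \alpha_{S_t,m} y_{t-m}^2 + \delta_{S_t,1} \sigma_{t-1}^2 + \cdots + \delta_{S_t,n} \sigma_{t-n}^2
\end{aligned} \tag{2.22}$$

By recursive substitution it becomes evident that the predictive density $p(y_t \mid \mathbf{y}^{t-1}, \mathbf{S}^t, \vartheta)$ depends on the whole history of $S_t$. For the switching GARCH(1, 1) model, for instance, the variance of the predictive density reads:

$$\sigma_t^2 = \gamma_{S_t} + \alpha_{S_t,1} y_{t-1}^2 + \delta_{S_t,1} \left( \gamma_{S_{t-1}} + \alpha_{S_{t-1},1} y_{t-2}^2 \right) + \delta_{S_t,1} \delta_{S_{t-1},1} \left( \gamma_{S_{t-2}} + \alpha_{S_{t-2},1} y_{t-3}^2 \right) + \cdots$$

Thus the model obeys only the weakest assumption, namely that the observation density $p(y_t \mid \mathbf{y}^{t-1}, \mathbf{S}^t, \vartheta)$ depends on $\mathbf{y}^{t-1}$ and all past states of $S_t$.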
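The path dependence of the switching GARCH(1, 1) variance can be made visible with a short recursion. The sketch below is only an illustration under assumptions: the starting value `sigma2_0`, the function name and the parameter values are hypothetical.

```python
def switching_garch11_variance(y, states, gamma, alpha, delta, sigma2_0):
    """Conditional variances of a switching GARCH(1, 1) model.

    sigma2[0] is set to the assumed starting value sigma2_0; for t >= 1
    sigma2[t] = gamma[S_t] + alpha[S_t] * y[t-1]**2 + delta[S_t] * sigma2[t-1].
    """
    sigma2 = [sigma2_0]
    for t in range(1, len(y)):
        s = states[t]
        sigma2.append(gamma[s] + alpha[s] * y[t - 1] ** 2 + delta[s] * sigma2[t - 1])
    return sigma2


# Two hypothetical state paths that end in the same regime but differ earlier.
y = [1.0, -2.0, 0.5]
gamma, alpha, delta = [0.1, 0.5], [0.1, 0.3], [0.8, 0.5]
path_a = switching_garch11_variance(y, [0, 0, 1], gamma, alpha, delta, sigma2_0=1.0)
path_b = switching_garch11_variance(y, [0, 1, 1], gamma, alpha, delta, sigma2_0=1.0)
```

Both paths share the same data and the same current state at the last observation, yet their final variances differ, because the lagged variance carries the earlier regimes forward.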

2.6.3 Time-Varying Transition Matrices

The transition matrix $\zeta$ of the hidden process $S_t$ is time invariant under either of the following assumptions:

• $S_t$ is an irreducible, aperiodic Markov chain starting from its ergodic distribution $\eta = (\eta_1, \eta_2, \ldots, \eta_K)$:

$$P(S_0 = k \mid \zeta) = \eta_k$$

The stochastic properties of $S_t$ are sufficiently described by the $(K \times K)$ transition matrix $\zeta$, where each element $\zeta_{jk}$ is equal to the transition probability from state $j$ to state $k$:

$$\zeta_{jk} = P(S_t = k \mid S_{t-1} = j), \qquad \forall j, k \in \{1, \ldots, K\}$$

• $S_t$ is a first-order homogeneous Markov chain with arbitrary transition matrix $\zeta$, which need not be irreducible or aperiodic, and starts from an arbitrary distribution $p_0 = (p_{0,1}, \ldots, p_{0,K})$, where

$$p_{0,k} = P(S_0 = k)$$

This subsection relaxes time invariance. For a two-state Markov switching model, the transition probabilities $\zeta_{S_{t-1},S_t}$ may be reparameterized through a logit model in the following way:

$$\zeta_{S_{t-1},S_t} = \frac{\exp(\kappa_{S_{t-1},1})}{1 + \exp(\kappa_{S_{t-1},1})}, \qquad S_t \neq S_{t-1}$$

A univariate exogenous variable $z_t$ may then be included:

$$\zeta_{S_{t-1},S_t} = \frac{\exp(\kappa_{S_{t-1},1} + z_t \kappa_{S_{t-1},2})}{1 + \exp(\kappa_{S_{t-1},1} + z_t \kappa_{S_{t-1},2})}, \qquad S_t \neq S_{t-1} \tag{2.23}$$

Note that the transition probability $\zeta_{S_{t-1},S_t}$ depends not only on $z_t$ but also on the state of $S_{t-1}$. The logit transform could be substituted by another increasing function $F(\cdot)$,

$$\zeta_{S_{t-1},S_t} = F(\kappa_{S_{t-1},1} + z_t \kappa_{S_{t-1},2}), \qquad S_t \neq S_{t-1} \tag{2.24}$$

A model with time-varying transition matrices may be estimated through the EM algorithm

(Diebold et al., 1994) or through MCMC methods (Filardo and Gordon, 1998).
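For a two-state chain, the logit parameterization (2.23) turns one value of the exogenous variable into a full transition matrix. The following sketch shows this mapping; the function names and the coefficient values are hypothetical, and states are indexed 0 and 1 for convenience.

```python
import math


def switch_probability(prev_state, z_t, kappa):
    """Probability of leaving prev_state, eq. (2.23).

    kappa[j] = (kappa_{j,1}, kappa_{j,2}) holds the logit intercept and
    the coefficient on z_t for previous state j.
    """
    eta = kappa[prev_state][0] + z_t * kappa[prev_state][1]
    return math.exp(eta) / (1.0 + math.exp(eta))


def transition_matrix(z_t, kappa):
    """Two-state transition matrix implied by z_t; rows index S_{t-1}."""
    rows = []
    for j in (0, 1):
        p_switch = switch_probability(j, z_t, kappa)
        row = [0.0, 0.0]
        row[1 - j] = p_switch
        row[j] = 1.0 - p_switch
        rows.append(row)
    return rows


# Hypothetical coefficients: a larger z_t makes leaving state 0 more likely
# and leaving state 1 less likely.
kappa = [(-2.0, 1.0), (-1.0, -0.5)]
P_low = transition_matrix(0.0, kappa)
P_high = transition_matrix(2.0, kappa)
```

Replacing the logistic function in `switch_probability` with another increasing function mapping into (0, 1) gives the more general form (2.24).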

References

[1]

Ahlgren, N. and Antell, J. (2002). Testing for cointegration between international stock

prices. Applied Financial Economics, 12(12):851–861.

[2]

Andersen, T. G., Davis, R. A., Kreiss, J.-P., and Mikosch, T. V. (2009). Handbook of

ﬁnancial time series. Springer Science & Business Media.

[3] Ang, A. and Bekaert, G. (2002). Regime switches in interest rates. Journal of Business &

Economic Statistics, 20(2):163–182.

[4]

Bhar, R. and Hamori, S. (2003a). Alternative characterization of the volatility in the growth

rate of real gdp. Japan and the World Economy, 15(2):223–231.

[5]

Bhar, R. and Hamori, S. (2003b). New evidence of linkages among g7 stock markets.

Finance Letters, 1(1).

[6]

Bhar, R. and Hamori, S. (2005). State-space models (i). Empirical Techniques in Finance,

pages 83–104.

[7]

Bhar, R. and Hamori, S. (2006a). Empirical techniques in ﬁnance. Springer Science &

Business Media.

[8]

Bhar, R. and Hamori, S. (2006b). Hidden Markov models: applications to ﬁnancial

economics, volume 40. Springer Science & Business Media.

[9]

Bhar, R. and Hamori, S. (2007). Analysing yield spread and output dynamics in an

endogenous markov switching regression framework. Asia-Paciﬁc Financial Markets, 14(1-

2):141–156.

[10]

Bollerslev, T. (1986). Generalized autoregressive conditional heteroskedasticity. Journal

of econometrics, 31(3):307–327.

[11]

Bollerslev, T., Chou, R. Y., and Kroner, K. F. (1992). Arch modeling in ﬁnance: A review

of the theory and empirical evidence. Journal of econometrics, 52(1-2):5–59.

[12]

Cecchetti, S. G., Lam, P.-S., and Mark, N. (1988). Mean reversion in equilibrium asset

prices.

[13]

Chu, C.-S. J., Santoni, G. J., and Liu, T. (1996). Stock market volatility and regime shifts

in returns. Information Sciences, 94(1):179–190.


[14]

Clements, M. P., Franses, P. H., and Swanson, N. R. (2004). Forecasting economic

and ﬁnancial time-series with non-linear models. International Journal of Forecasting,

20(2):169–183.

[15]

Corhay, A., Rad, A. T., and Urbain, J.-P. (1993). Common stochastic trends in european

stock markets. Economics Letters, 42(4):385–390.

[16]

Diebold, F. X., Lee, J.-H., and Weinbach, G. C. (1994). Regime switching with time-

varying transition probabilities. Business Cycles: Durations, Dynamics, and Forecasting,

pages 144–165.

[17]

Drost, F. C. and Nijman, T. E. (1993). Temporal aggregation of garch processes. Econo-

metrica: Journal of the Econometric Society, pages 909–927.

[18]

Durland, J. M. and McCurdy, T. H. (1994). Duration-dependent transitions in a markov

model of us gnp growth. Journal of Business & Economic Statistics, 12(3):279–288.

[19]

Elliott, R. J. and Mamon, R. S. (2003). A complete yield curve description of a markov

interest rate model. International Journal of Theoretical and Applied Finance, 6(04):317–

326.

[20]

Engle, R. F. (1982). Autoregressive conditional heteroscedasticity with estimates of the

variance of united kingdom inﬂation. Econometrica: Journal of the Econometric Society,

pages 987–1007.

[21]

Eun, C. S. and Shim, S. (1989). International transmission of stock market movements.

Journal of ﬁnancial and quantitative Analysis, 24(02):241–256.

[22]

Fama, E. F. (1975). Short-term interest rates as predictors of inﬂation. The American

Economic Review, 65(3):269–282.

[23]

Fama, E. F. and Gibbons, M. R. (1982). Inﬂation, real returns and capital investment.

Journal of Monetary Economics, 9(3):297–323.

[24]

Filardo, A. J. (1994). Business-cycle phases and their transitional dynamics. Journal of

Business & Economic Statistics, 12(3):299–308.

[25]

Franses, P. H. and Van Dijk, D. (2000). Non-linear time series models in empirical ﬁnance.

Cambridge University Press.

[26]

Frühwirth-Schnatter, S. (2006). Finite mixture and Markov switching models. Springer

Science & Business Media.

[27]

Garbade, K. and Wachtel, P. (1978). Time variation in the relationship between inﬂation

and interest rates. Journal of Monetary Economics, 4(4):755–765.

[28]

Garcia, R. and Perron, P. (1996). An analysis of the real interest rate under regime shifts.

The Review of Economics and Statistics, pages 111–125.


[29]

Glosten, L. R., Jagannathan, R., and Runkle, D. E. (1993). On the relation between the

expected value and the volatility of the nominal excess return on stocks. The journal of

ﬁnance, 48(5):1779–1801.

[30]

Goldfeld, S. M. and Quandt, R. E. (1973). A markov model for switching regressions.

Journal of econometrics, 1(1):3–15.

[31]

Gray, S. F. (1996). Modeling the conditional distribution of interest rates as a regime-

switching process. Journal of Financial Economics, 42(1):27–62.

[32]

Grossman, S. J. and Shiller, R. J. (1980). The determinants of the variability of stock

market prices.

[33]

Hamilton, J. D. (1989). A new approach to the economic analysis of nonstationary time

series and the business cycle. Econometrica: Journal of the Econometric Society, pages

357–384.

[34]

Hamilton, J. D. (1990). Analysis of time series subject to changes in regime. Journal of

econometrics, 45(1):39–70.

[35]

Hamilton, J. D. (1993). Estimation, inference and forecasting of time series subject to
changes in regime. Handbook of Statistics, 11:231–260.

[36]

Hamilton, J. D. (1996). Speciﬁcation testing in markov-switching time-series models.

Journal of Econometrics, 70(1):127–157.

[37]

Hamilton, J. D. and Lin, G. (1996). Stock market volatility and the business cycle. Journal

of Applied Econometrics, 11(5):573–593.

[38]

Hamilton, J. D. and Raj, B. (2013). Advances in Markov-Switching Models: Applications

in Business Cycle Research and Finance. Springer Science & Business Media.

[39]

Hamilton, J. D. and Susmel, R. (1994). Autoregressive conditional heteroskedasticity and

changes in regime. Journal of Econometrics, 64(1):307–333.

[40]

Hamori, S. (2000a). The transmission mechanism of business cycles among germany,

japan, the uk and the usa. Applied Economics, 32(4):405–410.

[41]

Hamori, S. (2000b). Volatility of real gdp: some evidence from the united states, the

united kingdom and japan. Japan and the World Economy, 12(2):143–152.

[42]

Hansen, L. P. and Singleton, K. J. (1982). Generalized instrumental variables estimation of

nonlinear rational expectations models. Econometrica: Journal of the Econometric Society,

pages 1269–1286.

[43]

Harding, D. and Pagan, A. (2002). Dissecting the cycle: a methodological investigation.

Journal of monetary economics, 49(2):365–381.

[44]

Harding, D. and Pagan, A. (2003). A comparison of two business cycle dating methods.

Journal of Economic Dynamics and Control, 27(9):1681–1690.


[45]

Harding, D. E., Pagan, A., et al. (1999). Knowing the cycle. Melbourne Institute of

Applied Economic and Social Research, University of Melbourne.

[46]

Jochum, C. (1999). Volatility spillovers and the price of risk: Evidence from the swiss

stock market. Empirical Economics, 24(2):303–322.

[47]

Kasa, K. (1992). Common stochastic trends in international stock markets. Journal of

monetary Economics, 29(1):95–124.

[48]

Kim, C.-J. and Nelson, C. R. (1998). Business cycle turning points, a new coincident

index, and tests of duration dependence based on a dynamic factor model with regime

switching. Review of Economics and Statistics, 80(2):188–201.

[49]

Kim, C.-J. and Nelson, C. R. (1999). Has the us economy become more stable? a bayesian

approach based on a markov-switching model of the business cycle. Review of Economics

and Statistics, 81(4):608–616.

[50]

Kim, C.-J. and Nelson, C. R. (2001). A bayesian approach to testing for markov-switching

in univariate and dynamic factor models. International Economic Review, 42(4):989–1013.

[51]

Kim, C.-J., Nelson, C. R., et al. (1999). State-space models with regime switching:

classical and Gibbs-sampling approaches with applications, volume 2. MIT press Cambridge,

MA.

[52]

Kim, C.-J., Nelson, C. R., and Piger, J. (2004). The less-volatile us economy: a bayesian

investigation of timing, breadth, and potential explanations. Journal of Business & Economic

Statistics, 22(1):80–93.

[53]

Kim, C.-J., Nelson, C. R., and Startz, R. (1998). Testing for mean reversion in het-

eroskedastic data based on gibbs-sampling-augmented randomization. Journal of Empirical

ﬁnance, 5(2):131–154.

[54]

Kim, D. and Kon, S. J. (1994). Alternative models for the conditional heteroscedasticity

of stock returns. Journal of Business, pages 563–598.

[55]

Kim, I.-M. and Maddala, G. (1991). Multiple structural breaks and unit roots in exchange

rates. In Econometric Society Meeting at New Orleans.

[56]

Lam, P.-s. (1990). The Hamilton model with a general autoregressive component: Estimation
and comparison with other models of economic time series. Journal of Monetary Economics, 26(3):409–432.

[57] Lam, P.-s. and Mark, N. C. (1988). Mean reversion in equilibrium asset prices.

[58]

Lee, S. H., Sung, H. M., and Urrutia, J. L. (1996). The impact of the persian gulf crisis on

the prices of ldcs’ loans. Journal of Financial Services Research, 10(2):143–162.

[59]

Maddala, G. S. and Kim, I.-M. (1998). Unit roots, cointegration, and structural change.

Number 4. Cambridge University Press.


[60]

Malliaris, A. G., Urrutia, J. L., et al. (1992). The international crash of october 1987:

causality tests. Journal of Financial and Quantitative Analysis, 27(3).

[61] Malliaris, A. G., Urrutia, J. L., et al. (1998). Volume and price relationships: hypotheses

and testing for agricultural futures. Journal of Futures Markets, 18(1):53–72.

[62]

Mamon, R. S. and Elliott, R. J. (2007). Hidden markov models in ﬁnance, volume 4.

Springer.

[63]

McCarthy, J. and Najand, M. (1995). State space modeling of linkages among international

markets. Journal of Multinational Financial Management, 5:1–9.

[64]

McConnell, M. M. and Perez-Quiros, G. (1998). Output ﬂuctuations in the united states:

what has changed since the early 1980s? FRB of New York Staff Report, (41).

[65] Neal, R., Rolph, D. S., and Morris, C. (2001). Interest rates and credit spread dynamics.

[66]

Nelson, C. R. and Schwert, G. W. (1977). Short-term interest rates as predictors of

inﬂation: On testing the hypothesis that the real rate of interest is constant. The American

Economic Review, 67(3):478–486.

[67]

Nelson, D. B. (1991). Conditional heteroskedasticity in asset returns: A new approach.

Econometrica: Journal of the Econometric Society, pages 347–370.

[68]

Perron, P. (1990). Testing for a unit root in a time series with a changing mean. Journal of

Business & Economic Statistics, 8(2):153–162.

[69]

Quandt, R. E. (1958). The estimation of the parameters of a linear regression system

obeying two separate regimes. Journal of the american statistical association, 53(284):873–

880.

[70]

Quandt, R. E. (1972). A new approach to estimating switching regressions. Journal of the

American statistical association, 67(338):306–310.

[71]

Satchell, S. and Knight, J. (2011). Forecasting volatility in the ﬁnancial markets.

Butterworth-Heinemann.

[72]

Schaller, H. and Norden, S. V. (1997). Regime switching in stock market returns. Applied

Financial Economics, 7(2):177–191.

[73] Taylor, S. J. (2007). Modelling ﬁnancial time series.

[74]

Turner, C. M., Startz, R., and Nelson, C. R. (1989). A markov model of heteroskedasticity,

risk, and learning in the stock market. Journal of Financial Economics, 25(1):3–22.

[75] Walsh, C. (1982). Interest rate volatility and monetary policy.

[76]

Walsh, C. E. (1984). Interest rate volatility and monetary policy. Journal of Money, Credit

and Banking, 16(2):133–150.


[77] Wang, P. (2008). Financial econometrics. Routledge.

[78]

Wells, C. (1996). The kalman ﬁlter in ﬁnance (advanced studies in theoretical and applied

econometrics, vol 32).

Appendix A

Table A.1 Chapter 3: Correlation Analysis

            India    Japan       USA         Pakistan
India       1        -0.13006    0.145842    -0.63359
Japan                1           0.021093     0.154037
USA                              1           -0.2651
Pakistan                                      1

Table A.2 Chapter 5: Descriptive Statistics

Statistic      Real Interest Rate
Mean           -0.406993
Median         0.571534
Maximum        12.126104
Minimum        -30.046879
Std. Dev.      5.350612
Skewness       -2.088641
Kurtosis       10.321620
Jarque-Bera    1850.411571
Probability    0.000000

Table A.3 Chapter 5: Real Interest Rate Unit Root Test

Unit Root Test                            t-Statistic    Prob.*
Augmented Dickey-Fuller test statistic    -4.40837       0.00030
Phillips-Perron test statistic            -5.55596       0.00000

*MacKinnon (1996) one-sided p-values.
