ArticlePDF Available

Delay propagation network in air transport systems based on refined nonlinear Granger causality

Authors:

Abstract and Figures

To probe deeply into the underlying mechanism of flight delay propagation, we construct the delay propagation networks among airports via refined nonlinear Granger causality, which is more fitting on nonlinear airport delay data and obtain more precise causal estimation. Specifically, the causal analysis method is used to determine the delay propagation relationship between airports. The delay propagation relationship of these airport pairs constitutes an airport delay propagation network, in which the nodes and edges of the network, respectively, represent the specific airport and the delay propagation relationship between airports. Complex network theory is then applied to analyse the global structure of delay propagation networks, indicating that small airports are inclined to cause or exacerbate delays. In addition, the airlines’ delay propagation networks display that the delay propagation by large airlines spread wider while the delay propagation by small airlines spread faster. These findings can be employed to design strategies for delay propagation dampening.
Content may be subject to copyright.
Delay Propagation Network in Air Transport Systems Based on
Refined Nonlinear Granger Causality
Ziyu Jiaa,b,c, Xiyang Caia, Yihan Hua, Junyu Ji a, and Zehui Jiaod
aSchool of Computer and Information Technology, Beijing Jiaotong University, Beijing,
China; bBeijing Key Laboratory of Traffic Data Analysis and Mining, Beijing, China; cKey
Laboratory of Intelligent Passenger Service of Civil Aviation, CAAC, Beijing, China;
dComputer Science & Engineering, University of California San Diego, La Jolla, CA, USA
ARTICLE HISTORY
Compiled October 17, 2023
ABSTRACT
To probe deeply into the underlying mechanism of flight delay propagation, we con-
struct the delay propagation networks among airports via refined nonlinear Granger
causality, which is more fitting on nonlinear airport delay data and obtain more pre-
cise causal estimation. Specifically, the causal analysis method is used to determine
the delay propagation relationship between airports. The delay propagation rela-
tionship of these airport pairs constitutes an airport delay propagation network, in
which the nodes and edges of the network respectively represent the specific airport
and the delay propagation relationship between airports. Complex network theory
is then applied to analyze the global structure of delay propagation networks, indi-
cating that small airports are inclined to cause or exacerbate delays. In addition, the
airlines’ delay propagation networks display that the delay propagation by large air-
lines spread wider while the delay propagation by small airlines spread faster. These
findings can be employed to design strategies for delay propagation dampening.
KEYWORDS
Granger causality; delay propagation; complex network; conditional mutual
information
1. Introduction
Flight delays seriously affect the safety and cost efficiency of air transportation systems
and gradually become a global challenge (Zhang et al. 2019; Duytschaever 1993; Cook
and Tanner 2011). Initial flight delays are attributed to a variety of reasons, such
as extreme weather and air traffic control. Usually, propagation delays occur due to
connected resources and the most common resource is the aircraft (Kafle and Zou
2016; Hao et al. 2014). For example, if the same aircraft is scheduled to fly multiple
legs, earlier delays affect subsequent missions of the same aircraft (Lan, Clarke, and
Barnhart 2006). Besides, the crews shift among diverse aircrafts, making flight delay
to disseminate to the other flights (Beatty et al. 1999; Wang et al. 2017). Hence, the
small initial delay often results in the large delay propagation (Daqing et al. 2014;
Meng and Zhou 2011).
CONTACT Ziyu Jia. Email: ziyujia@b jtu.edu.cn
In order to alleviate the large delay propagation, many researchers have made a
lot of efforts on the local dynamics of some flights, airports or airlines (Wang et al.
2020b,a; Evler et al. 2020; uvercin, Ferhatosmanoglu, and Gedik 2020; Lee, Marla,
and Jacquillat 2020). For example, Beatty et al. (1999) and Wu and Law (2019) pro-
posed the idea of delay multiplier to quantify delay propagation. In addition, as the air
transport system is a complex system, some researchers have built large and complex
networks for analysis to fully grasp the characteristics of this system(Barrat et al.
2004; Guimera et al. 2005; Opsahl et al. 2008; Wuellner, Roy, and D Souza 2010;
Fleurquin, Ramasco, and Eguiluz 2013). Campanelli et al. proposed a novel method
including slot reallocation and swapping for the European aviation network (Campan-
elli et al. 2014). Afterward, they evaluated the impact of disruptions on European and
American aviation networks based on simulated delay propagation (Campanelli et al.
2016). The above work not only assists air managers to understand dynamic air trans-
portation, but also gives theoretical guides for the development of air management
strategies. However, few studies have considered the interdependence of delay time
series to study delayed propagation. Exploring the systematic framework of causal-
ity between airports is challenging. In recent years, researchers have begun to use
causal analysis to explore complex systems. The classic Granger causality test (GC)
is increasingly utilized in various fields including physiology (Kugiumtzis et al. 2017;
Kugiumtzis and Kimiskidis 2015) and finance (Papana et al. 2017b,a). Causal analy-
sis can explore interactions in dynamical systems, which helps reveal the underlying
mechanisms of complex systems.
The air transport systems are the representative complex systems. So far, some
studies have explored the delayed propagation of air transport systems through causal
analysis methods. These efforts provide a new perspective for studying the delay prop-
agation mechanism between airports. For example, using the classic linear GC method
(Granger 1969), Zanin, Belkoura, and Zhu (2017) and Du et al. (2018) studied delay
propagation by delay time series and construct delay networks between airports. Nev-
ertheless, because of the complexity of present transport system, the flight delay time
series is usually high-dimensional and non-linear. The traditional GC method is a com-
mon model to deal with linear time series (Maziarz 2015). Therefore, the information-
based nonlinear GC (NGC) can be used as an alternative method to analyze delay
time series, which is a common method for measuring the causality of nonlinear time
series (Faes, Nollo, and Porta 2011). However, the information-based nonlinear GC
produces bias in evaluating high-dimensional conditional mutual information (CMI)
(Wibral, Vicente, and Lizier 2014). Hence, it is challenging to accurately detect the
causality of multivariate time series.
To solve the above challenge, we propose refined nonlinear Granger causality
(RNGC) , which uses a low-dimensional approximation CMI (LA-CMI) in calculation
to obtain a more precise estimation of nonlinear multivariate causality. The RNGC
overcomes the curse of dimensionality by the LA-CMI. Then, its validity is verified
on the simulated multivariate time series. Besides, we use the RNGC to detect the
delay propagation of airport systems. We show the causality framework of delay prop-
agation, which relies on the interdependence of nonlinear airport delay time series.
To understand the delay interaction relationship among airports, we build the global
structure of delay propagation networks, which exhibiting the range and direction of
propagation. The results of the study indicate the importance of small airports, which
are inclined to cause or exacerbate delays. In addition, the analysis results show that
the delay propagation by large airlines spread wider while the delay propagation by
small airlines spread faster. These insights help aviation managers comprehend the
2
mechanisms of delay propagation among airports and provide theoretical support for
reducing flight delays. We will open source all data sets and codes.
The remainder of this article is designed as follows. We put forward a RNGC method
based on information theory in Section 2. In Section 3, we demonstrate the accuracy
of the RNGC method through simulation experiments. In Section 4, we describe the
data sets on the air transport system and report the results of using complex network
theory to analyze data sets. And Section 5 summarises the major conclusions.
2. Methods
Traditional nonlinear Granger causality estimates the causality of nonlinear systems
based on information theory. Generally, the traditional method uses CMI to measure
the causal relationship between time series. However, for high-dimensional time se-
ries, the estimation of CMI usually produces bias. Therefore, the calculated causality
is inaccurate. To solve this problem, we propose a low-dimensional approximation
method to estimate CMI, and then detect more accurate causality. First, the tradi-
tional nonlinear Granger causality is shown in this section. Then, the low-dimensional
approximation CMI is presented. Finally, the RNGC method is proposed.
2.1. Nonlinear Granger causality
The nonlinear Granger causality based on information theory, an extension of tradi-
tional Granger causality, has been extensively employed to estimate causality in nonlin-
ear systems. A whole dynamical system consists of Ksubsystems X, Y, Z1, . . . , ZK2,
where Xdenotes a driving subsystem, Ydenotes a target subsystem, and Zdenote
the other subsystems. The multivariate causality from Xto Yon Zis represented
generically as:
NGCXY|Z=IYn;XL
n|YL
n, ZL
n
=HYn|YL
n, ZL
nHYn|XL
n, Y L
n, ZL
n,(1)
where XL
n={Xn1,· · · , XnL},YL
n={Yn1,· · · , YnL}, and ZL
n=
{Zn1,· · · , ZnL}are variable sets that represent historical states. Ldenotes max-
imum lag range for each variable. The N GCXY|Zmeasures the information supplied
by the XL
nabout Yn.
The key to compute nonlinear Granger causality NGCXY|Zby nonuniform embed-
ding method is to construct candidate embedding vectors VnM=XL
n, Y L
n, ZL
n.
We generate an empty candidate set V(0)
n=, and then the construction process is
introduced as follows:
In the first step s= 1, we choose the candidate vector V(1)
nM, which is tested
by a criterion:
V(1)
n=W(1)
n= argmax
WnM
I(Yn;Wn).(2)
In the step s > 1, the candidate vector is expanded via adding W(s)
n, which
provides most information for Ynwhen V(s1)
n={W(1)
n, . . . , W (s1)
n}already
3
exists. W(s)
nis tested via a CMI criterion:
W(s)
n= argmax
WnM\V(s1)
n
IYn;Wn|V(s1)
n.(3)
The choice loop is stopped when W(s)
ndoes not meet significance test. Then, this
candidate set is obtained as Vn=V(s1)
n.
The NGC measure is computed as:
NGCXY|Z=HYn|VY
n, V Z
nH(Yn|Vn).(4)
where Vn={VX
n, V Y
n, V Z
n},VX
n, V Y
nand VZ
nVncome from variables X, Y, and
Z, respectively.
2.2. The Proposed Method
2.2.1. The motivation for applying the LA-CMI
The formation process of candidate vector is regarded as the feature selection process
based on information theory (Han et al. 2018). The selected candidate vector provides
the most information for the target variable and minimizes redundancy information
between the selected variable set. Therefore, each step needs to estimate the accu-
rate CMI and select the best variable. Nevertheless, as candidate vector dimension
increases, the curse of dimensionality appears due to the increase of state space di-
mension, which leads to the gradually inaccurate estimation of CMI (Wibral, Vicente,
and Lizier 2014; Jia et al. 2019). To tackle the issue, we propose a LA-CMI to improve
the NGC method in the following sections.
2.2.2. Low-dimensional approximation for CMI
A major problem for feature selection research is how to accurately estimate the high-
dimensional CMI. The researchers have done a lot of excellent work. For example, the
JMI criterion (Yang and Moody 2000), the MRMR criterion (Peng, Long, and Ding
2005), and the DISR criterion (Meyer and Bontempi 2006). Moreover, Brown et al.
(2012) proposed a parameterized general criterion based on the above criteria:
J(Ws) = I(Ws;Yn)ϵX
WjV
I(Ws;Wj) + ηX
WjV
I(Ws;Wj|Yn) (5)
where the different criteria relay on the values of ϵand η, such as the JMI criterion
is gained when ϵ=η= 1/|V|; the MRMR is gained when ϵ= 1/|V|,η= 0. Fur-
thermore, Vinh et al. (2016) proposed the RelaxMRMR to improve the traditional
criteria. Specifically, higher-order feature interplays are studied within this RelaxM-
RMR by relaxing specific conditional independence assumptions.
The innovativeness of RelaxMRMR compared to Equation (5) is the I(Ws;Wi|Wj),
4
which contains the second-order interplays between features.
J(Ws) = I(Ws;Yn)ϵX
WjV
I(Ws;Wj)+ηX
WjV
I(Ws;Wj|Yn)θX
Wi,WjV;i=j
I(Ws;Wi|Wj),
(6)
where ϵ=η= 1/|V|and θ= 1/|V|(|V| 1).
In our work, the above RelaxMRMR criterion in Equation (6) is utilized as the
replacement for high-dimensional CMI. We define the target variable Ynand candidate
vector V=W1, W2,...W(s1), an expression for low-dimensional approximation of
CMI is as follows:
Wn=argmax
WsM\V
I(Ws;Yn)1
|V|X
WjV
I(Ws;Wj) + 1
|V|X
WjV
I(Ws;Wj|Yn)
1
|V|(|V| 1) X
Wi,WjV;i=j
I(Ws;Wi|Wj)
(7)
2.2.3. LA-CMI for nonlinear Granger causality
We present the Refined-NGC (RNGC) method, which utilizes the LA-CMI criterion
when the candidate vector dimension is high. We set a criterion control factor mto
determine when the low-dimensional approximate CMI criterion is used. We generate
an empty candidate set V(0)
n=. Then, the details of the RNGC method are as
follows.
In the first step s= 1, we choose the candidate vector V(1)
nM, which is tested
by a criterion :
V(1)
n=W(1)
n= argmax
WnM
I(Yn;Wn).(8)
In the step 1 < s m, the candidate vector is expanded via adding W(s)
n, which
provides most information for Ynwhen V(s1)
n={W(1)
n, . . . , W (s1)
n}already
exists. W(s)
nis tested via a standard CMI criterion (Equation 3).
In the step s>m, the low-dimensional approximation CMI criterion (Equation
7) is used for embedding.
The choice loop is stopped when W(s)
ndoes not meet significance test. Then, this
candidate set is obtained as Vn=V(s1)
n.
The proposed RNGC measure:
RNGCXY|Z=HYn|VY
n, V Z
nH(Yn|Vn).(9)
where Vn={VX
n, V Y
n, V Z
n},VX
n, V Y
nand VZ
nVncome from variables X, Y, and
Z, respectively.
5
3. Simulation study
In this section, we employ simulation experiments to validate the effectiveness of
RNGC method. The proposed RNGC method is compared to three traditional causal-
ity methods (GC , TE, and NGC) on the Henon maps. To guarantee the robustness
of the experiment, 100 realizations are generated for the Henon maps. Each causal
strength obtained is the average of 100 realizations. In addition, the significance level
αis set to 0.05 in the significance test for all methods (Kugiumtzis 2013).
X1
X2
X3XK-1
Xk
Figure 1. The true causality network of this Kcoupled Henon maps.
We define the Kcoupled Henon maps (Kugiumtzis 2013):
xi,t =1.4x2
i,t1+ 0.3xi,t2for i= 1, K
1.4(0.5C(xi1,t1+xi+1,t1) + (1 C)xi,t1)2+ 0.3xi,t2for i= 2, . . . , K 1
(10)
where Cdenotes coupling strength. In addition, true causality of this coupled Henon
maps is shown in Figure. 1. Table 1 presents three evaluation metrics obtained on this
K= 5 coupled Henon maps for varying sequence length n. The results present that the
RNGC measure performs better than other methods. Moreover, the proposed RNGC
measure is greater than 0.9 in all evaluation metrics. As the time series changes, the F1
score of the proposed RNGC measure does not change much. Therefore, the proposed
RNGC measure is less influenced by time series length and maintains a high accuracy
to detect coupling. To study the influence of different Kon causal detection, we fix
n= 256 and change the variable Kto get the evaluation metrics as shown in Table
2. As the number of variables increases, the accuracy of detecting coupling decreases.
However, the proposed RNGC method still performs better than other methods in
terms of various evaluation metrics. In addition, the results indicate that the GC
method is not accurate because the GC method may be more suitable to linear time
series. Therefore, the RNGC method is more fitting to process nonlinear delay time
series.
4. Case study of air transport system
In the section, we first introduce the air transport system data sets. We then construct
the airport delay time series and show the nonlinearity of the delay time series. Finally,
we construct and investigate the delay propagation networks.
6
Table 1. The evaluation metrics from the coupled Henon maps with varying length
n. The experimental parameters are α= 0.05, K= 5, and C= 0.9.
Sensitivity Specificity F1 score
n= 256
GC (Granger 1969) 0.901 0.937 0.859
TE (Montalto, Faes, and Marinazzo 2014) 0.993 0.957 0.934
NGC (Faes, Nollo, and Porta 2011) 0.985 0.960 0.938
RNGC 0.988 0.964 0.941
n= 512
GC (Granger 1969) 0.993 0.887 0.845
TE (Montalto, Faes, and Marinazzo 2014) 1.000 0.942 0.916
NGC (Faes, Nollo, and Porta 2011) 1.000 0.944 0.919
RNGC 1.000 0.952 0.930
n= 1024
GC (Granger 1969) 1.000 0.866 0.825
TE (Montalto, Faes, and Marinazzo 2014) 1.000 0.902 0.865
NGC (Faes, Nollo, and Porta 2011) 1.000 0.913 0.878
RNGC 1.000 0.944 0.919
Table 2. The evaluation metrics from the coupled Henon maps with varying K.
The experimental parameters are α= 0.05, n= 256, and C= 0.9.
Sensitivity Specificity F1 score
K= 5
GC (Granger 1969) 0.901 0.937 0.859
TE (Montalto, Faes, and Marinazzo 2014) 0.993 0.957 0.934
NGC (Faes, Nollo, and Porta 2011) 0.985 0.960 0.938
RNGC 0.988 0.964 0.941
K= 10
GC (Granger 1969) 0.911 0.904 0.755
TE (Montalto, Faes, and Marinazzo 2014) 0.978 0.945 0.864
NGC (Faes, Nollo, and Porta 2011) 0.979 0.947 0.869
RNGC 0.980 0.950 0.875
K= 15
GC (Granger 1969) 0.936 0.909 0.711
TE (Montalto, Faes, and Marinazzo 2014) 0.965 0.959 0.847
NGC (Faes, Nollo, and Porta 2011) 0.966 0.961 0.854
RNGC 0.967 0.963 0.861
4.1. Data description
The dataset is given by Civil Aviation Administration of China (CAAC), compris-
ing flight status information in China from 1 to 31 December 2016. In addition, the
dataset includes 313,477 flights, 205 airports, and 39 airlines. The information avail-
able for each flight is the aircraft registration number, scheduled departure time, actual
departure time, scheduled arrival time, actual arrival time, airline International Air
Transport Association (IATA) code, departure airport IATA code, and arrival airport
IATA code.
Table 3. Information for each flight.
Available information for each flight
Aircraft registration number Actual arrival time
Scheduled departure time Airline IATA
Actual departure time Departure airport IATA
Scheduled arrival time Arrival airport IATA
7
4.2. Time series preprocessing and analysis
To capture the delayed disseminate among airports, we construct the arrival delay
time series (DTS) for all airports, which describes on-time performance concerning all
airports. Each time point in the formed DTS denotes average flight delay time per
unit hour. Specifically, the average DTS for i-th airport is defined as:
Di(p, q) = P(Ti
aTi
s)
Ni(p, q),1p31,0q23,(11)
where Di(p, q) is average delay time for i-th airport on day pand hour q. And Ti
aand
Ti
sare the actual and scheduled flight arrival time, respectively. Ni(p, q) is the number
of flights for the i-th airport per hour. To confirm the nonlinearity of DTS, we employ
the Lyapunov exponent. Generally speaking, a dynamic system can be regarded as a
nonlinear system when the largest Lyapunov exponent is positive (Wolf et al. 1985).
The largest Lyapunov exponent of the data is 0.15, which indicates that the flight
delay system exhibits significant nonlinearity.
4.3. Network metrics
In this study, complex network is used to investigate the causality of delayed propa-
gation in air transport systems.
Degree. The degree of each airport represents the number of remaining airports with
which it has a delayed propagation relationship. The in-degree of airport iindicates
the number of remaining airports affecting airport i. Conversely, the out-degree is
obtained. The network average degree denotes each airport affects others on average.
Assortativity coefficient (Newman 2002). A positive value of assortativity coefficient
indicates that the network is an assortative network. In the assortative network, the
airports prefer to link preferentially toward other airports with the similar degree.
Conversely, it is called disassortative network.
Link density. The larger the link density value, the more the number of network
links. That is to say, when the link density value is large, delays are easily propagated
through the network.
Average clustering coefficient (Watts and Strogatz 1998). The average aggregation
coefficients is computed as fellow after getting local aggregation coefficients for all
nodes. And the larger the value of average aggregation coefficient, the higher degree
of aggregation for airport nodes within this network.
Modularity (Newman 2004). The modularity can measure the strength of the net-
work community structure. The entire network is made up of several communities.
The spread of delays among airports within each community is relatively dense. In
contrast, the spread of delays among communities is relatively sparse (Arenas et al.
2007). The higher modularity, the higher regionalization degree of delay propagation.
Efficiency (Latora and Marchiori 2001). The efficiency of the network reflects the
ease of delay propagation between two airports. The greater the value of efficiency,
the faster the delay propagate to entire network.
8
4.4. Analysis of air transport system
4.4.1. Basic characteristics for delay propagation network
The basic characteristics for delay propagation network constructed by using delay
time series are analyzed. The proposed RNGC method is utilized to construct a delayed
propagation network, which contains 205 airport nodes as shown in Figure. 2. The
larger the node’s degree of network, the larger the nodes. Conversely, the smaller the
node’s degree of network, the smaller the nodes. Figure. 2 presents some small airports
with larger nodes, such as GMQ and XYI.
Figure 2. The delay propagation network between airports. The links denote delay propagation between
airports. The degree of a node is proportional to its size.
0 1 0 2 0 3 0 4 0
0
2 0
4 0
6 0
8 0
PEK
C A N
N L H
G M Q
XYI
O utdegree
N u m b e r o f fl i g h t s ( 1 E 3 )
(a) Out-degree and the number of flights
for each airport.
0 1 0 2 0 3 0 4 0
0
2
4
6
PEK
C A N
W N Z
L H W
Indegree
N u m b e r o f fl i g h t s ( 1 E 3 )
(b) In-degree and the number of flights for
each airport.
Figure 3. The relationship between degree and the number of flights at the airports. GMQ, NLH, and XYI
are small airports. PEK and CAN are large airports.
Figure. 3 further reflects the relationship between flight number and airport node’s
degree, where Figure. 3(a) shows the relationship between out-degrees of nodes and
the number of flights are inversely related; that is to say, the node’s degree with a
larger number of flights is small, and it is consistent with other air transport networks
(Cook, Tanner, and Zanin 2013; AhmadBeygi et al. 2008; Xiao et al. 2020; Jetzki 2009;
Jia et al. 2020). Figure. 3(b) reflects the relationship between in-degrees of nodes and
9
the number of flights and it is obvious that there is no obvious relationship between
them. However, most of the in-degrees of nodes with a larger number of flights are still
small.
Table 4. Delay propagation network topological metric
Topological metric Delay propagation relational
network metrics value
Average value of 1000 random
network metrics
Average degree 2.62 2.62
Link density 0.01 0.01
Assortativity coefficient -0.09 -0.01
Average clustering coefficient 0.02 0.01
Modularity 0.63 0.42
Efficiency 0.39 0.32
In summary, airports with a small number of flights in the delay propagation network
often experience delays. These airports are usually small airports, and the aviation
resources and mitigation measures of these small airports may not be as good as those
of large airports. Hence, small airports may increase the spread of delays.
Table 4 shows the network topological indicators calculated from the delay prop-
agation network and the average values of 1000 random networks constructed from
the delay propagation network. From the value of the average degree 2.62, it is known
that each airport spreads delays to multiple other airports. The value of the link
density is 0.01 indicating that the delay propagation network is sparse. The average
clustering coefficient, modularity, and efficiency of the network are higher than those
of the random networks, which indicates that the delay propagation network has a
strong clustering trend, relatively obvious community structure. In other words, there
are some sub-networks (network communities) with more internal connections, but
the connections between sub-networks are relatively sparse. In fact, the community
structure is in the constructed delay propagation network. Therefore, airports in each
network community may easily spread delays to each other. However, the assortativity
coefficient of the delay propagation network is lower than that of the random networks,
which shows that the network has higher disassortative. That is, the nodes with the
larger degree tend to link those with smaller degree in the network. Because the de-
gree of nodes is inversely related to the number of flights, there is delay propagation
between airports with more flights and airports with fewer flights.
4.4.2. Airline characteristics for delay propagation network
To investigate the differences in delay propagation for each airline, the airline’s monthly
delay propagation network is constructed in this section. The constructed airline net-
works are equivalent to stratifying the delay propagation network used in the section
4.4.1 according to the airlines. All airlines are ordered by flight number and the top
10 airlines are selected. The IATA codes are CZ, MU, CA, HU, ZH, MF, SC, 3U, GS,
and FM respectively.
Figure.4 shows that 3U airline has fewer flights, but total delay time is similar to
that of CA airline. Figure.5 shows the number of nodes and links for each airline’s
delay propagation network. It is obvious that the number of links and nodes for three
large airline networks is greater than those of remaining small airlines. Because the
three major airlines involve more cities and flights, the delay spread is larger.
The efficiency and assortativity coefficient for each airline’s delay propagation net-
work are illustrated in Figure.6. Figure.6(a) presents the efficiency of delay propagation
networks, which indicates that the delay propagation of small airlines is more efficient.
10
C Z M U C A H U Z H M F S C 3 U G S F M
0
1 0
2 0
3 0
4 0
5 0
N u m b e r o f f l i g h t s ( 1 E 3 )
I A T A c o d e f o r a i r l i n e s
N u m b e r o f f l i g h t s
S u m m a r y o f a r r i v a l d e l a y s
0
1 0
2 0
3 0
S u m m a r y o f a r r i v a l d e l a y s ( 1 E 4 m i n )
Figure 4. The sum of the number of flights and delay time for each airline in December 2016.
(a) The number of nodes of each airline’s delay prop-
agation network.
(b) The number of links of each airline’s delay prop-
agation network.
Figure 5. The number of nodes and links of each airline’s delay propagation network.
However, the delay propagation of large airlines is inefficient, because the network is
large or there are resources to mitigate delays. As shown in Figure.6(b), Each small
airline usually has a small assortativity coefficient, especially MF, SC and GS airlines.
And the efficiency of delay propagation networks are improved in small airlines.
5. Conclusion
In the research, we propose a refined causal analysis approach to reveal the delay
propagation among airports. The proposed RNGC method overcomes the curse of
dimensionality for the traditional method by using the low-dimensional approximation
CMI, and accurately detects the causality of simulated time series. In addition, in
actual applications, the delay propagation networks are established based on the DTS
of each airport. Then, network analysis metrics are utilized to reflect the macroscopic
presentation of delayed propagation.
Understanding the mechanism of delayed propagation can assist managers to de-
velop aviation planning. We adopt complex network theory to investigate this mech-
anism. The results of the analysis emphasize the importance of small airports, which
are inclined to cause or exacerbate delays. In addition, the analysis results also show
11
(a) The efficiency of each airline’s delay propagation
network.
(b) The assortativity coefficient of each airline’s delay
propagation network.
Figure 6. The efficiency and assortativity coefficient of each airline’s delay propagation network.
that the delay propagation by large airlines spread wider while the delay propagation
by small airlines spread faster. Therefore, it is necessary to consider the characteristics
of the airline’s delay propagation when there is an emergency of flight delays.
These insights are employed to develop air traffic schemes to mitigate or suppress
the delayed spread. The research based on delay propagation networks to detect de-
layed propagation can be extended. For instance, the delay propagation networks con-
structed in this study are unweighted networks. In the future, the edge weights can be
considered because the edge weights reflect the degree of causality.
Disclosure statement
No potential conflict of interest was reported by the authors.
Acknowledgment
This project was funded by the Fundamental Research Funds for the Central Univer-
sities (Grant No.2020YJS025) and partially supported by the Zhejiang Lab’s Interna-
tional Talent Fund for Young Professionals.
References
AhmadBeygi, Shervin, Amy Cohn, Yihan Guan, and Peter Belobaba. 2008. “Analysis of the
potential for delay propagation in passenger airline networks.” Journal of air transport
management 14 (5): 221–236.
Arenas, Alex, Jordi Duch, Alberto Fern´andez, and Sergio omez. 2007. “Size reduction of
complex networks preserving modularity.” New Journal of Physics 9 (6): 176.
Barrat, Alain, Marc Barthelemy, Romualdo Pastor-Satorras, and Alessandro Vespignani. 2004.
“The architecture of complex weighted networks.” Proceedings of the national academy of
sciences 101 (11): 3747–3752.
Beatty, Roger, Rose Hsu, Lee Berry, and James Rome. 1999. “Preliminary evaluation of flight
delay propagation through an airline schedule.” Air Traffic Control Quarterly 7 (4): 259–270.
Brown, Gavin, Adam Pocock, Ming-Jie Zhao, and Mikel Luj´an. 2012. “Conditional likelihood
12
maximisation: a unifying framework for information theoretic feature selection.” Journal of
machine learning research 13 (Jan): 27–66.
Campanelli, B, P Fleurquin, VM Egu´ıluz, JJ Ramasco, A Arranz, I Etxebarria, and C Ciruelos.
2014. “Modeling reactionary delays in the European air transport network.” Proceedings of
the Fourth SESAR Innovation Days, Schaefer D (Ed.), Madrid .
Campanelli, Bruno, Pablo Fleurquin, Andr´es Arranz, Izaro Etxebarria, Carla Ciruelos,
V´ıctor M Egu´ıluz, and Jos´e J Ramasco. 2016. “Comparing the modeling of delay propaga-
tion in the US and European air traffic networks.” Journal of Air Transport Management
56: 12–18.
Cook, Andrew, Graham Tanner, and Massimiliano Zanin. 2013. “Towards superior air trans-
port performance metrics–imperatives and methods.” Journal of Aerospace Operations 2
(1-2): 3–19.
Cook, Andrew J, and Graham Tanner. 2011. “European airline delay cost reference values.” .
Daqing, Li, Jiang Yinan, Kang Rui, and Shlomo Havlin. 2014. “Spatial correlation analysis of
cascading failures: congestions and blackouts.” Scientific reports 4: 5381.
Du, Wen-Bo, Ming-Yuan Zhang, Yu Zhang, Xian-Bin Cao, and Jun Zhang. 2018. “Delay
causality network in air transport systems.” Transportation research part E: logistics and
transportation review 118: 466–476.
Duytschaever, DIRK. 1993. “The development and implementation of the eurocontrol central
air traffic flow management unit (cfmu).” The Journal of Navigation 46 (3): 343–352.
Evler, Jan, Michael Schultz, Hartmut Fricke, and AJ Cook. 2020. “Development of stochastic
delay cost functions.” 10th SESAR Innovation Days .
Faes, Luca, Giandomenico Nollo, and Alberto Porta. 2011. “Information-based detection of
nonlinear Granger causality in multivariate processes via a nonuniform embedding tech-
nique.” Physical Review E 83 (5): 051112.
Fleurquin, Pablo, Jos´e J Ramasco, and Victor M Eguiluz. 2013. “Systemic delay propagation
in the US airport network.” Scientific reports 3: 1159.
Granger, Clive WJ. 1969. “Investigating causal relations by econometric models and cross-
spectral methods.” Econometrica: Journal of the Econometric Society 424–438.
Guimera, Roger, Stefano Mossa, Adrian Turtschi, and LA Nunes Amaral. 2005. “The world-
wide air transportation network: Anomalous centrality, community structure, and cities’
global roles.” Proceedings of the National Academy of Sciences 102 (22): 7794–7799.
uvercin, Mehmet, Nilgun Ferhatosmanoglu, and Bugra Gedik. 2020. “Forecasting Flight De-
lays Using Clustered Models Based on Airport Networks.” IEEE Transactions on Intelligent
Transportation Systems .
Han, Min, Weijie Ren, Meiling Xu, and Tie Qiu. 2018. “Nonuniform state space reconstruction
for multivariate chaotic time series.” IEEE transactions on cybernetics 49 (5): 1885–1895.
Hao, Lu, Mark Hansen, Yu Zhang, and Joseph Post. 2014. “New York, New York: Two ways
of estimating the delay impact of New York airports.” Transportation Research Part E:
Logistics and Transportation Review 70: 245–260.
Jetzki, Martina. 2009. “The propagation of air transport delays in Europe.” Master’s thesis,
RWTH Aachen University, Airport and Air Transportation Research .
Jia, Ziyu, Youfang Lin, Zehui Jiao, Yan Ma, and Jing Wang. 2019. “Detecting causality in
multivariate time series via non-uniform embedding.” Entropy 21 (12): 1233.
Jia, Ziyu, Youfang Lin, Yunxiao Liu, Zehui Jiao, and Jing Wang. 2020. “Refined nonuniform
embedding for coupling detection in multivariate time series.” Physical Review E 101 (6):
062113.
Kafle, Nabin, and Bo Zou. 2016. “Modeling flight delay propagation: A new analytical-
econometric approach.” Transportation Research Part B: Methodological 93: 520–542.
Kugiumtzis, Dimitris. 2013. “Direct-coupling information measure from nonuniform embed-
ding.” Physical Review E 87 (6): 062918.
Kugiumtzis, Dimitris, and Vasilios K Kimiskidis. 2015. “Direct causal networks for the study
of transcranial magnetic stimulation effects on focal epileptiform discharges.” International
journal of neural systems 25 (05): 1550006.
13
Kugiumtzis, Dimitris, Christos Koutlis, Alkiviadis Tsimpiris, and Vasilios K Kimiskidis. 2017.
“Dynamics of epileptiform discharges induced by transcranial magnetic stimulation in ge-
netic generalized epilepsy.” International journal of neural systems 27 (07): 1750037.
Lan, Shan, John-Paul Clarke, and Cynthia Barnhart. 2006. “Planning for robust airline op-
erations: Optimizing aircraft routings and flight departure times to minimize passenger
disruptions.” Transportation science 40 (1): 15–28.
Latora, Vito, and Massimo Marchiori. 2001. “Efficient behavior of small-world networks.”
Physical review letters 87 (19): 198701.
Lee, Jane, Lavanya Marla, and Alexandre Jacquillat. 2020. “Dynamic disruption management
in airline networks under airport operating uncertainty.” Transportation Science 54 (4):
973–997.
Maziarz, Mariusz. 2015. “A review of the Granger-causality fallacy.” The journal of philosoph-
ical economics: Reflections on economic and social issues 8 (2): 86–105.
Meng, Lingyun, and Xuesong Zhou. 2011. “Robust single-track train dispatching model under
a dynamic and stochastic environment: a scenario-based rolling horizon solution approach.”
Transportation Research Part B: Methodological 45 (7): 1080–1102.
Meyer, Patrick E, and Gianluca Bontempi. 2006. “On the use of variable complementarity
for feature selection in cancer classification.” In Workshops on applications of evolutionary
computation, 91–102. Springer.
Montalto, Alessandro, Luca Faes, and Daniele Marinazzo. 2014. “MuTE: a MATLAB toolbox
to compare established and novel estimators of the multivariate transfer entropy.” PloS one
9 (10): e109462.
Newman, Mark EJ. 2002. “Assortative mixing in networks.” Physical review letters 89 (20):
208701.
Newman, Mark EJ. 2004. “Fast algorithm for detecting community structure in networks.”
Physical review E 69 (6): 066133.
Opsahl, Tore, Vittoria Colizza, Pietro Panzarasa, and Jose J Ramasco. 2008. “Prominence and
control: the weighted rich-club effect.” Physical review letters 101 (16): 168702.
Papana, Angeliki, Catherine Kyrtsou, Dimitris Kugiumtzis, and Cees Diks. 2017a. “Assessment
of resampling methods for causality testing: A note on the US inflation behavior.” PloS one
12 (7): e0180852.
Papana, Angeliki, Catherine Kyrtsou, Dimitris Kugiumtzis, and Cees Diks. 2017b. “Financial
networks based on Granger causality: A case study.” Physica A: Statistical Mechanics and
its Applications 482: 65–73.
Peng, Hanchuan, Fuhui Long, and Chris Ding. 2005. “Feature selection based on mutual in-
formation: criteria of max-dependency, max-relevance, and min-redundancy.” IEEE Trans-
actions on Pattern Analysis and Machine Intelligence (8): 1226–1238.
Vinh, Nguyen Xuan, Shuo Zhou, Jeffrey Chan, and James Bailey. 2016. “Can high-order
dependencies improve mutual information based feature selection?” Pattern Recognition 53:
46–58.
Wang, Yan-Jun, Ya-Kun Cao, Chen-Ping Zhu, Fan Wu, Ming-Hua Hu, Baruch Barzel, and
HE Stanley. 2017. “Characterizing departure delays of flights in passenger aviation network
of United States.” arXiv preprint arXiv:1701.05556 .
Wang, Yanjun, Yakun Cao, Chenping Zhu, Fan Wu, Minghua Hu, Vu Duong, Michael Watkins,
Baruch Barzel, and H Eugene Stanley. 2020a. “Universal patterns in passenger flight depar-
ture delays.” Scientific reports 10 (1): 1–10.
Wang, Yanjun, Hongfeng Zheng, Fan Wu, Jun Chen, and Mark Hansen. 2020b. “A Com-
parative Study on Flight Delay Networks of the USA and China.” Journal of Advanced
Transportation 2020.
Watts, Duncan J, and Steven H Strogatz. 1998. “Collective dynamics of ‘small-
world’networks.” nature 393 (6684): 440.
Wibral, Michael, Raul Vicente, and Joseph T Lizier. 2014. Directed information measures in
neuroscience. Springer.
Wolf, Alan, Jack B Swift, Harry L Swinney, and John A Vastano. 1985. “Determining Lyapunov
14
exponents from a time series.” Physica D: Nonlinear Phenomena 16 (3): 285–317.
Wu, Cheng-Lung, and Kristie Law. 2019. “Modelling the delay propagation effects of multiple
resource connections in an airline network using a Bayesian network model.” Transportation
Research Part E: Logistics and Transportation Review 122: 62–77.
Wuellner, Daniel R, Soumen Roy, and Raissa M D Souza. 2010. “Resilience and rewiring of
the passenger airline networks in the United States.” Physical Review E 82 (5): 056101.
Xiao, Yinhong, Yaoshuai Zhao, Ge Wu, and Yizhen Jing. 2020. “Study on delay propagation
relations among airports based on transfer entropy.” IEEE Access 8: 97103–97113.
Yang, Howard Hua, and John Moody. 2000. “Data visualization and feature selection: New
algorithms for nongaussian data.” In Advances in neural information processing systems,
687–693.
Zanin, Massimiliano, Seddik Belkoura, and Yanbo Zhu. 2017. “Network analysis of Chinese
air transport delay propagation.” Chinese Journal of Aeronautics 30 (2): 491–499.
Zhang, Mingyuan, Xuting Zhou, Yu Zhang, Lijun Sun, Ming Dun, Wenbo Du, and Xianbin
Cao. 2019. “Propagation Index on Airport Delays.” Transportation Research Record 2673
(8): 536–543. https://doi.org/10.1177/0361198119844240.
15
... As for interpretability and complexity, linear models assume a linear combination of variables, while nonlinear models relax this assumption; as for stability, linear methods are usually more stable, while nonlinear methods are more flexible but can suffer from overparameterization. Studies traditionally infer Granger causality among time series using linear methods (see Belkoura and Zanin, 2016;Zanin et al., 2017;Wang et al., 2020), ignoring the nonlinearity in data. Recent studies relax the linear assumption as in Xiao et al. (2020) and Jia et al. (2022). The nonlinear method is preferable to the linear method when constructing a delay propagation network if (a) the delay time series follow a nonlinear structure and (b) the linear method fails to extract causality from nonlinear data. ...
... Unfortunately, estimating it from high-dimensional time series is challenging, as noted by Runge et al. (2012) and Vinh et al. (2016). Xiao et al. (2020) and Jia et al. (2022) use a low-dimensional approximation of high-dimensional estimation and construct the delay propagation network with the transfer entropy inference. The multilayer perceptron is a new method to construct the delay propagation network considering the nonlinear data structure and supplementing the current research. ...
... Zanin et al. (2017) consider whether to discard the negative delays; they modeled based on time series with all flights to include propagation of early arrivals. Belkoura and Zanin (2016), Xiao et al. (2020) and Jia et al. (2022) also include all flights without removing punctual and early arrival flights. The discussion in the following focuses on the hourly average arrival delays with only delayed flights. ...
Article
Recent studies have explored the delay propagation network among airports by inferring Granger causality among time series using linear methods. Granger causality indicates whether delays at one airport can help predict delays at itself or other airports and thus help to understand the interconnection among airports. This paper makes inferences about the Granger causality relation among airports by applying a nonlinear multilayer perceptron method to the arrival delay time series in Europe and China. We find that the propagation results can be overestimated with data input containing early arrival flights, besides delayed flights. Europe presents a smaller magnitude of delay propagation than China, considering only delayed flights, indicated by the link densities of the networks with the same penalty parameter. Unlike some recent research findings, our results suggest that large airports have more out-degree and more impact in the delay propagation network. These results can help predict and understand delay propagation in daily operations or simulation environments.
... As of now, the application of self-supervised learning methods is mostly focused on image, speech, and semantic data. In these fields, sufficient labeled data are available, making supervised learning already highly competitive (Jia et al., 2022a;Liang et al., 2023). In contrast, in areas where labels are hard to obtain, such as physiological signal data, self-supervised learning has greater potential (Krishnan et al., 2022). ...
Article
Full-text available
Introduction The significant role of emotional recognition in the field of human-machine interaction has garnered the attention of many researchers. Emotion recognition based on physiological signals can objectively reflect the most authentic emotional states of humans. However, existing labeled Electroencephalogram (EEG) datasets are often of small scale. Methods In practical scenarios, a large number of unlabeled EEG signals are easier to obtain. Therefore, this paper adopts self-supervised learning methods to study emotion recognition based on EEG. Specifically, experiments employ three pre-defined tasks to define pseudo-labels and extract features from the inherent structure of the data. Results and discussion Experimental results indicate that self-supervised learning methods have the capability to learn effective feature representations for downstream tasks without any manual labels.
... The success or failure of innovation ultimately depends on the ability of a country to adjust its socioeconomic paradigm to meet the requirements and possibilities of the technological economic paradigm. Since the 1990s, with the emergence of a large number of emerging technologies such as information and communication technology and Internet technology, the increasing diversification of customer needs and the increasing concern for social equity and sustainable development [16], the connotation and boundary of innovation has been constantly expanded, with user innovation, destructive innovation, open innovation, business model innovation, service innovation, mass innovation, frugal innovation and low-cost innovation. The research on market driven innovation such as green innovation has developed rapidly, and innovation research has become the focus of the theoretical, economic and industrial circles [17]. ...
Article
Full-text available
China has a vast territory, with an abundance of natural resources. Industrial investment of provinces promulgating and implementing state policies and measures on the basis of the various regions and industries are slowly widening the gap, which leads to significant differences in the level of industrial development. The imbalance between development at both provincial and municipal levels has become more prominent. The state has formulated a policy of coordinated development of the regional economy. There are differences between regions and industries in the demand for technological innovation of high-tech industry. This article is from the regional perspective, industry perspective and the perspective of the two. This paper, through the research of technology innovation and output data of China’s 31 provinces, municipalities and autonomous regions, finds the delivery rate is an independent variable of the average number of employees, R&D funds, R&D internal expenditure, external spending, new product sales income, the absorption of funding expenditure, technical transformation of funds for fixed assets, and gross profit as a technological low-carbon innovation efficiency measure. Using factor analysis, multiple regression methods and SPSS software, from the perspective of two regions and industries, enables empirical analysis of the influencing factors of this variable.
... As the development of new energy in rural areas is not at a mature stage, all rural power grids should establish a unified data specification to eliminate information islands and realize data resource sharing; deeply study the application of information technology and networking [28][29][30][31][32] in rural power grid management under the background of new energy, and formulate the specifications and practical standards of integrated information systems for rural power grid management; actively apply mature information security technologies and products at home and abroad to do a good job in rural power information security. ...
Article
Full-text available
In China’s economic and social development planning, dual carbon goals and modern rural constructions have become new keywords. It is important to realize the green transformation of energy systems and complete the construction of a modern rural power grid based on data analysis under the guidance of dual carbon goals. Based on the systematic summary of the rural energy structure, this paper assesses the current situation of the scientific and technological development of rural power grids, analyzes the main problems faced by the development of rural power grids, and gives suggestions on the path of energy transformation and the path of rural power grid development. The results show that rural energy transformation plays an important role in the implementation of the dual carbon goals, and the relationship between the long-term and short-term goals of rural energy transformation, top-level design and differentiated development, government regulation and market forces, development stage and energy consumption expenditure must be handled well. In addition, as an important support, rural power grid enterprises should seize the opportunities brought by the construction of rural power grids under the dual carbon goals, make rural power a technology to be valued, and promote the further development of new energy-related technologies.
... In recent years, brain-computer interface gradually gain widespread attention from researchers [10,12,15,29]. Since EEG signals can directly reflect human brain activities, they become one of the frequently used data for emotion brain-computer interfaces. ...
... With continuous breakthroughs in wireless network connectivity, the commercialization of wide-area networks, such as NB-IoT, Long Range (LoRa) and 5G, is accelerating, and the integration of socioeconomic and information technology is expanding. IoT application technology will experience significant development across various industries, including livelihood, health care, education, transportation and electricity [17][18][19][20][21]. ...
Article
Full-text available
Solar energy, as a prominent clean energy source, is increasingly favored by nations worldwide. However, managing numerous photovoltaic (PV) power generation units via wired connections presents a considerable challenge. The advent of the Internet of Things (IoT) and cloud service technologies has facilitated the creation of an efficient and convenient PV grid-connected management system. This paper investigates IoT technology and PV grid-connected systems, integrating wireless sensor network technology, cloud computing service platforms and distributed PV grid-connected systems. We propose a Zigbee wireless network featuring ad hoc network functionality and Narrow Band Internet of Things (NB-IoT) smart gateway with multi-protocol and multi-device support. This system enables the collection and uploading of PV grid-connected system data to cloud service platforms, addressing daily operation and maintenance as well as intelligent management of distributed PV power stations. Furthermore, it expedites the aggregation and analysis of PV power generation data, streamlining power marketing across various regions.
Article
Full-text available
Methods used to infer causal relations from data rather than knowledge of mechanisms are most helpful and exploited only if the theoretical background is insufficient or experimentation impossible. The review of literature shows that when an investigator has no prior knowledge of the researched phenomenon, no result of the Grangercausality test has any epistemic utility due to different possible interpretations. (1) Rejecting the null in one of the tests can be interpreted as either a true causal relation, opposite direction of the true causation, instant causality, time series cointegration, not frequent enough sampling, etc. (2) Bi-directional Granger causality can be read either as instant causality or common cause fallacy. (3) Non-rejection of both nulls possibly means either indirect or nonlinear causality, or no causal relation.
Article
Full-text available
WRKY transcription factors play significant roles in plant stress responses. In this study, a class III WRKY gene ScWRKY5, was successfully isolated from sugarcane variety ROC22. The ScWRKY5 was a nucleus protein with transcriptional activation activity. The ScWRKY5 gene was constitutively expressed in all the sugarcane tissues, with the highest expression level in the stem epidermis and the lowest in the root. After inoculation with Sporisorium scitamineum for 1 d, the expression level of ScWRKY5 was significantly increased in two smut-resistant varieties (YZ01-1413 and LC05-136), while it was decreased in three smut-susceptible varieties (ROC22, YZ03-103, and FN40). Besides, the expression level of ScWRKY5 was increased by the plant hormones salicylic acid (SA) and abscisic acid (ABA), as well as the abiotic factors polyethylene glycol (PEG) and sodium chloride (NaCl). Transient overexpression of the ScWRKY5 gene enhanced the resistance of Nicotiana benthamiana to the tobacco bacterial pathogen Ralstonia solanacearum, however the transiently overexpressed N. benthamiana was more sensitive to the tobacco fungal pathogen Fusarium solani var. coeruleum. These results provide a reference for further research on the resistance function of sugarcane WRKY genes.
Article
Full-text available
State-space reconstruction is essential to analyze the dynamics and internal interactions of complex systems. However, it is difficult to estimate high-dimensional conditional mutual information and select the optimal time delays in most existing nonuniform state-space reconstruction methods. Therefore, we propose a nonuniform embedding method framed in information theory for state-space reconstruction. We use a low-dimensional approximation of conditional mutual information criterion for time delay selection, which is effectively solved by the particle swarm optimization algorithm. The obtained embedded vector has relatively strong independence and low redundancy, which better characterizes multivariable complex systems and detects coupling within complex systems. In addition, the proposed nonuniform embedding method exhibits good performance in coupling detection of linear stochastic, nonlinear stochastic, chaotic systems. In the actual application, the importance of small airports that cause delay propagation has been demonstrated by constructing the delay propagation network.
Article
Full-text available
Recent studies have characterized the structures of air transport network in different countries and regions using complex network metrics. These studies coincided with the trend of increasingly available large empirical flight datasets that enable researchers to investigate the dynamics of the system, such as the propagation of flight delay. However, linking network structure with network dynamics remains a challenging task. In this paper, we proposed a method to construct flight delay networks from operational data. We provided a detailed comparison of the key structural properties of the flight delay networks in the United States and China. The comparisons of betweenness centrality of delay networks and flight networks show the advantage of the proposed method. We further found that airports in similar geographical locations do exhibit similar delay patterns in both countries. To explore the underlying mechanisms, the Multifractal Detrended Fluctuation Analysis (MF-DFA) is applied to the flights’ delay time series at both the airport level and network level. Singularity spectra analyses reveal the fundamental characteristics of the airport systems and air transportation system. Our findings contribute to the understanding of structure and dynamics of air transportation systems.
Article
Full-text available
To understand the mechanism of delay propagation from the perspective of multiple airports, constructing delay propagation relation (DPR) networks among airports is a novel analysis method. The latest method is to use transfer entropy to mine the delay propagation relation among airports. However, the transfer entropy will produce bias due to estimating high-dimensional conditional mutual information (CMI) in the delay propagation scenario. In this paper, we propose the low-dimensional approximation of CMI for transfer entropy (LTE) to address the above issue. By applying this improved algorithm, the delay propagation relation among airports can be more accurately explored and a more accurate DPR network can be obtained. For this network, we apply the complex network theory and its related indicators to provide systematic analysis about delay propagation. The results of case analysis show that in the delay propagation interactions among airports, large airports always receive delays and small airports propagate delays outward. Meanwhile, delays propagate more efficiently in the aviation system of smaller airlines. These results can provide some theoretical supports for making measures to reduce delay propagation.
Article
Full-text available
Departure delays are a major cause of economic loss and inefficiency in the growing industry of passenger flights. A departure delay of a current flight is inevitably affected by the late arrival of the flight immediately preceding it with the same aircraft. We seek to understand the mechanisms of such propagated delays, and to obtain universal metrics by which to evaluate an airline’s operational effectiveness in delay alleviation. Here we use big data collected by the American Bureau of Transportation Statistics to design models of flight delays. Offering two dynamic models of delay propagation, we divided all carriers into two groups exhibiting a shifted power law or an exponentially truncated shifted power law delay distribution, revealing two universal delay propagation classes. Three model parameters, extracted directly from dual data mining, help characterize each airline’s operational efficiency in delay mitigation. Therefore, our modeling framework provides the crucially lacking evaluation indicators for airlines, potentially contributing to the mitigation of future departure delays.
Article
Full-text available
Causal analysis based on non-uniform embedding schemes is an important way to detect the underlying interactions between dynamic systems. However, there are still some obstacles to estimating high-dimensional conditional mutual information and forming optimal mixed embedding vector in traditional non-uniform embedding schemes. In this study, we present a new non-uniform embedding method framed in information theory to detect causality for multivariate time series, named LM-PMIME, which integrates the low-dimensional approximation of conditional mutual information and the mixed search strategy for the construction of the mixed embedding vector. We apply the proposed method to simulations of linear stochastic, nonlinear stochastic, and chaotic systems, demonstrating its superiority over partial conditional mutual information from mixed embedding (PMIME) method. Moreover, the proposed method works well for multivariate time series with weak coupling strengths, especially for chaotic systems. In the actual application, we show its applicability to epilepsy multichannel electrocorticographic recordings.
Article
Air traffic disruptions result in flight delays, cancellations, passenger misconnections, and ultimately high costs to aviation stakeholders. This paper proposes a jointly reactive and proactive approach to airline disruption management, which optimizes recovery decisions in response to realized disruptions and in anticipation of future disruptions. The approach forecasts future disruptions partially and probabilistically by estimating systemic delays at hub airports (and the uncertainty thereof) and ignoring other contingent disruptions. It formulates a dynamic stochastic integer programming framework to minimize network-wide expected disruption recovery costs. Specifically, our Stochastic Reactive and Proactive Disruption Management (SRPDM) model combines a stochastic queuing model of airport congestion, a flight planning tool from Boeing/Jeppesen and an integer programming model of airline disruption recovery. We develop a solution procedure based on look-ahead approximation and sample average approximation, which enables the model’s implementation in short computational times. Experimental results show that leveraging even partial and probabilistic estimates of future disruptions can reduce expected recovery costs by 1%–2%, as compared with a myopic baseline approach based on realized disruptions alone. These benefits are mainly driven by the deliberate introduction of departure holds to reduce expected fuel costs, flight cancellations, and aircraft swaps.
Article
Estimating flight delays is important for airlines, airports, and passengers, as the delays are among major costs in air transportation. Each delay may cause a further propagation of delays. Hence, the delay pattern of an airport and the location of the airport in the network can provide useful information for other airports. We address the problem of forecasting flight delays of an airport, utilizing the network information as well as the delay patterns of similar airports in the network. The proposed “Clustered Airport Modeling” (CAM) approach builds a representative time-series for each group of airports and fits a common model (e.g., REG-ARIMA) for each, using the network based features as regressors. The models are then applied individually to each airport data for predicting the airport’s flight delays. We also performed a network based analysis of the airports and identified the Betweenness Centrality (BC) score as an effective feature in forecasting the flight delays. The experiments on flight data over seven years using 305 US airports show that CAM provides accurate forecasts of flight delays.
Article
This paper explores the propagation effect of flight delays among airports in the aviation system and proposes a new measure, the propagation index, to effectively analyze the interrelationship among airports in relation to flight delays. This index quantifies the effect of delay propagation by measuring the causality among delay time series. To assess the effectiveness of the proposed index on airport delays, three neural network-based regression models are built. The comparative experiments demonstrate that the propagation index proposed is highly correlated with observed airport delays.