An illustrative example of adaptive prediction

Source publication

Online Data-Driven Adaptive Prediction of Train Event Times

Article

Full-text available

Aug 2014

This paper presents a microscopic model for accurate prediction of train event times based on a timed event graph with dynamic arc weights. The process times in the model are obtained dynamically using processed histori-cal track occupation data, thus reflecting all phenomena of railway traffic captured by the train describer systems and preprocess...

Context 1

... moving average smoothing method is used to incorpo- rate the prediction error observed during the train run into future predictions until the next stop. A schematic example of adaptive prediction is given in Fig. 5. The running train departed from station A and in the situation from the figure has just cleared the j th out of m blocks to station B where it is scheduled to stop. The gray solid line starting at station A represents the predicted running time of the train based on the actually registered departure delay. For the sake of clarity, for ...

View in full-text

Context 2

... l is a parameter l ∈ {1, ..., j−1} that specifies the length of the moving average. Parameters l and m are calibrated separately for each train type. The red dotted line in Fig. 5 denotes the adjusted prediction of running times to station B. By applying this adaptive prediction strategy, the continuous delay sources of the conflict-free run of a single train (e.g. due to particular driving style or defective rolling-stock) as well as temporary speed restrictions (due to infrastructure malfunctions or ...

View in full-text

The Impact of Attitudes and Perceptions on High Speed Rail Usage Uptake in Taiwan and the Shanghai Area

Article

Full-text available

Mar 2017

China has the world's most extensive high speed rail (HSR) network. The modernized railway network began commercial operations in April 2007. On the other side of the strait, Taiwan High Speed Rail was inaugurated in January 2007. The newly introduced travel mode has over time significantly changed the travel patterns in these areas. This study aim...

Sea-level rise impacts on transport infrastructure: The notorious case of the coastal railway line at Dawlish, England

Article

Full-text available

Feb 2016

Future climate change is likely to increase the frequency of coastal storms and floods, with major consequences for coastal transport infrastructure. This paper assesses the extent to which projected sea-level rise is likely to impact upon the functioning of the Dawlish to Teignmouth stretch of the London to Penzance railway line, in England. Using...

Explainable train delay propagation: A graph attention network approach

Article

Apr 2024

Explaining train delay propagation using influence factors (to find the determinants) is essential for transport planning and train operation management. Due to high interpretability to train operations, graph/network models, e.g., Bayesian networks and alternative graphs, are extensively used in the train delay propagation/prediction problem. In these graph/network models, nodes represent train arrival/departure/passage events, whereas arcs describe train headway/ running/dwelling processes. However, previously proposed graph/network models do not have edge weights, making them incapable of apperceiving the diverse influences of factors on train delay propagation/prediction. The train dwelling, running, and headway times vary over time, space, and train services. This potentially makes these factors have diverse strengths on train operations. We innovatively use the Graph Attention Network (GAT) to model the train delay propagation. An attention mechanism is used in the GAT model, allowing the GAT model to have arcs with diverse weights (learned from data). This enables the GAT model to discern the nodes' diverse influences; thus, with the learned importance coefficients, the model can be distinctly explained by the influencing factors. Further, the model's accuracy is expected to be improved, because the GAT model (with the attention mechanism) can pay more attention (represented by the learned weights) to the significant factors/nodes. The proposed GAT model was calibrated on operation data from the Dutch railway network. The results show that: (1) the influence factors contribute diversely to the delay propagation, and the train headway is the determinant of train delay propagation; (2) the accuracy of the proposed GAT model is significantly improved (because of the attention mechanism), compared against other state-of-the-art graph/network models. In a word, the proposed GAT method improves the interpretability of delay propagation and the accuracy of delay prediction.

Data‐driven train delay prediction incorporating dispatching commands: An XGBoost‐metaheuristic framework

Article

Full-text available

Dec 2023
IET INTELL TRANSP SY

Train delays can significantly impact the punctuality and service quality of high‐speed trains, which also play a crucial role in affecting dispatchers with their decision‐making. In this study, a data‐driven train delay prediction framework was proposed and strengthened by considering the impact of dispatching commands and the mechanisms of train delay propagation using XGBoost. Four metaheuristic algorithms were utilized to fine‐tune its hyperparameters. A vast dataset comprising 1.9 million records spanning 38 months of train operation data was utilized for feature extraction and model training. The model's accuracy was evaluated using three statistical metrics, and a comparison of the four tuning frameworks was performed. To emphasize the model's interpretability and its practical guidance for train rescheduling, the relationship of dispatching commands, delay propagation and delay prediction was validated by combining the theory and practical results, and a SHAP (SHapley Additive exPlanations) analysis was used for a clearer model explanation. The results revealed that distinct XGBoost‐Metaheuristic models exhibit unique effects in different criteria, yet they all demonstrated high accuracy and low prediction errors, thereby revealing the potential of using machine learning for train delay prediction, which is valuable for decision‐making and rescheduling.

A review of data-driven approaches to predict train delays

Article

Full-text available

Mar 2023

Identifying the rail operating features associated to intermodal freight rail operation delays

Article

Feb 2023

Intermodal freight rail operations represent a complex stochastic system that is impacted by disruptions and disturbances from diverse causes like extreme weather events, unplanned upstream network delays, equipment failures, labor actions, and intra-railyard inefficiency, which in turn generate delays in travel times. Understanding and predicting the delays caused by the occurrence of these disruptions and disturbances holds the potential to limit their system-wide schedule impact through early-warning prompting mitigating actions. This paper presents the training of a suite of supervised machine learning models using classification algorithms to predict the delay times caused by the occurrence of disruptions and disturbances in intermodal freight rail operations, and the most suitable model in terms of the evaluation metrics (e.g., AUC, recall, and F1-score) was used to explore the major predictors of the delays caused by disturbances and disruptions (using the Morris method). The supporting dataset includes intermodal freight rail operations with origin the central station of the freight rail network of CFL, the National Railway Company of Luxembourg, in the intermodal hub of Bet-tembourg, connecting several EU countries terminals forming a pan-European network. Results reveal that the CatBoost implementation of the gradient boosting machine model outperforms other ML models in terms of the selected metrics. Additionally, results suggest that the train weight, train length, number of TEU, weight per wagon, distance between stations, and the month of operation are key features to predict the delays caused by the occurrence of disruptions and disturbances in the freight operations in the studied rail network. The outcome of the study suggests that longer and more heavily loaded trains are related to the occurrence of trip delays, and this insight can be used to optimize the freight operations of the National Railway Company of Luxembourg.

Prediction of high-speed train delay propagation based on causal text information

Article

Full-text available

Sep 2022

The delay-causing text data contain valuable information such as the specific reasons for the delay, location and time of the disturbance, which can provide an efficient support for the prediction of train delays and improve the guidance of train control efficiency. Based on the train operation data and delay-causing data of the Wuhan–Guangzhou high-speed railway, the relevant algorithms in the natural language processing field are used to process the delay-causing text data. It also integrates the train operating-environment information and delay-causing text information so as to develop a cause-based train delay propagation prediction model. The Word2vec model is first used to vectorize the delay-causing text description after word segmentation. The mean model or the term frequency-inverse document frequency-weighted model is then used to generate the delay-causing sentence vector based on the original word vector. Afterward, the train operating-environment features and delay-causing sentence vector are input into the extreme gradient boosting (XGBoost) regression algorithm to develop a delay propagation prediction model. In this work, 4 text feature processing methods and 8 regression algorithms are considered. The results demonstrate that the XGBoost regression algorithm has the highest prediction accuracy using the test features processed by the continuous bag of words and the mean models. Compared with the prediction model that only considers the train-operating-environment features, the results show that the prediction accuracy of the model is significantly improved with multiple regression algorithms after integrating the delay-causing feature.

Train traffic control in merging stations: a data-driven approach

Preprint

Full-text available

Aug 2022

Railway operations are subject to deviations from the planned schedule, i.e., delays. In those situations, timely and high-quality control actions are needed to reduce the impacts of delays on the networks. Existing studies mainly used prescriptive techniques (e.g., mathematical programming, heuristics) to solve the train traffic control problem during interruptions. These methods have limitations in the strong reliance of few deterministic parameters prescriptively or normatively determined beforehand; exponential increase of complexity when considering multiple aspects or larger cases; low transferability because of the assumptions used and unmodeled effects; and little understandability by the practitioners, which hinders their acceptance in practice. Based on decision graphs, this study is able to analyze and exploit past realization data to provide decision support for traffic control, in case of delayed trains in merging-line stations (multiple lines merge as one line). The brand-new perspective is to use realized data, to learn the historical traffic control actions, their resulting effects (i.e., the delay reductions) so that decisions taken by human dispatchers can be explained and proactively suggested, in case of delayed conditions. The model is applied to case studies with train traffic realization data from two stations with multiple lines merging in the Swiss railway network. The method quickly determines the stochastic effects of the two possible decisions at merge points, and is able to identify which factors are most useful, to determine the best outcome. The experimental results show that the traffic control rule obtained from the proposed model is superior to two standard rescheduling methods.

Asynchronously updated predictions of electric vehicles' connection duration to a charging station

Conference Paper

Jul 2022

A review of train delay prediction approaches

Article

Jun 2022

Railway operations are vulnerable to delays. Accurate predictions of train arrival and departure delays improve the passenger service quality and are essential for real-time railway traffic management to minimise their further spreading. This review provides a synoptic overview and discussion covering the breadth of diverse approaches to predict train delays. We first categorise research contributions based on their underlying modelling paradigm (data-driven and event-driven) and their mathematical model. We then distinguish between very short to long-term predictions and classify different input data sources that have been considered in the literature. We further discuss advantages and disadvantages of producing deterministic versus stochastic predictions, the applicability of different approaches during disruptions and their interpretability. By comparing the results of the included contributions, we can indicate that the prediction error generally increases when broadening the prediction horizon. We find that data-driven approaches might have the edge on event-driven approaches in terms of prediction accuracy, whereas event-driven approaches that explicitly model the dynamics and dependencies of railway traffic have their strength in providing interpretable predictions, and are more robust concerning disruption scenarios. The growing availability of railway operations data is expected to increase the appeal of big-data and machine learning methods.

Using a Machine Learning Approach to Predict the Thailand Underground Train’s Passenger

Article

Full-text available

May 2022
J ADV TRANSPORT

In today’s world, data has become an asset for businesses. Many sectors use data technology to advance their businesses. Building management is one of the processes on which numerous studies have been conducted to assist building users. Thailand has progressed in terms of transportation infrastructure and public transportation. The Metropolitan Rapid Transit (MRT) system has more than one hundred million users per year. However, crowding is a concern in the present since crowding creates a problem and reduces customer pleasure. The goal of this research is to create a machine learning model for forecasting passenger demand over time. In addition, standard data collecting equipment was used to collect data from the Metropolitan Rapid Transit (MRT) Purple Line. This line has a total of 16 stations. Station name, date, day, month, period, number of passengers, holidays, weekends, and weather are among the nine factors. Analysis approaches included the analysis phase, classification, and regression algorithm. However, the regression algorithm’s accuracy is poor and therefore cannot be used. Before using machine learning classification methods, the K-means was used to cluster the types of passengers. In addition, for this investigation, three classification methods were used: artificial neural network, random forest, and decision tree. Furthermore, the findings revealed that the artificial neural network has a high predicting accuracy. The accuracy value stated is more than 0.85 for demand over time.

Data-driven stochastic model for train delay analysis and prediction

Article

Apr 2022

İsmail Şahin

A homogeneous Markov chain model is proposed to make delay analysis and prediction for near future train movements in a non-periodic single-track railway timetable setting. The prediction model constitutes two principal processes, namely sectional running and conflict resolution, which are represented by the stochastic recovery and deterioration matrices, respectively. The matrices are developed using a data-driven approach. Given the initial delay of a train at the beginning of the prediction horizon, its delay within the horizon can be estimated by vector and matrix operations, which are performed for individual processes separately or in combination of the processes. A baseline linear model has also been developed for comparison. The numerical tests conducted give consistent and stable predictions for train delays made by the Markov model. This is mainly because of that the Markov model can capture uncertainties deep in the horizon and respond to variations in train movements.

An illustrative example of adaptive prediction

Contexts in source publication

Similar publications

Citations