ArticlePDF Available

Maximizing daily rainfall prediction accuracy with maximum overlap discrete wavelet transform‐based machine learning models

Wiley
International Journal of Climatology
Authors:

Abstract and Figures

Rainfall is an important phenomenon for various aspects of human life and the environment. Accurate prediction of rainfall is crucial for a wide range of sectors, including agriculture, water resources management, energy production, disaster management and many more. The ability to predict rainfall in an accurate fashion enables stakeholders to make informed decisions and take necessary actions to mitigate the impacts of natural disasters, water scarcity and other issues related to rainfall. In addition, advances in rainfall prediction technologies have the potential to contribute to sustainable water management and the preservation of water resources by providing the necessary information for decision‐makers to plan and implement effective water management strategies. Hence, it is important to continuously improve the accuracy of rainfall prediction. In this paper, the integration of the maximum overlap discrete wavelet transform (MODWT) and machine learning algorithms for daily rainfall prediction is proposed. The main objective of this study is to investigate the potential of combining MODWT with various machine‐learning algorithms to increase the accuracy of rainfall prediction and extend the forecast time horizon to 3 days. In addition, the performances of the proposed hybrid models are contrasted with the models hybridized with commonly used discrete wavelet transform (DWT) algorithms in the literature. For this, daily rainfall raw data from three rainfall observation stations located in Turkey are used. The results show that the proposed hybrid MODWT models can effectively improve the accuracy of precipitation forecasting, based on model evaluation measures such as mean square error (MSE) and Nash‐Sutcliffe coefficient of efficiency (CE). Accordingly, it can be concluded that the integration of MODWT and machine learning algorithms have the potential to revolutionize the field of daily rainfall prediction.
This content is subject to copyright. Terms and conditions apply.
RESEARCH ARTICLE
Maximizing daily rainfall prediction accuracy with
maximum overlap discrete wavelet transform-based
machine learning models
Kübra Küllahcı| Abdüsselam Altunkaynak
Department of Civil Engineering Hydraulics and Water Resources Division, Istanbul Technical University, Maslak, Turkey
Correspondence
Kübra Küllahcı, Department of Civil
Engineering Hydraulics and Water
Resources Division, Istanbul Technical
University, Maslak 34469, Istanbul,
Turkey.
Email: onerk@itu.edu.tr
Abstract
Rainfall is an important phenomenon for various aspects of human life and
the environment. Accurate prediction of rainfall is crucial for a wide range of
sectors, including agriculture, water resources management, energy produc-
tion, disaster management and many more. The ability to predict rainfall in an
accurate fashion enables stakeholders to make informed decisions and take
necessary actions to mitigate the impacts of natural disasters, water scarcity
and other issues related to rainfall. In addition, advances in rainfall prediction
technologies have the potential to contribute to sustainable water management
and the preservation of water resources by providing the necessary information
for decision-makers to plan and implement effective water management strate-
gies. Hence, it is important to continuously improve the accuracy of rainfall
prediction. In this paper, the integration of the maximum overlap discrete
wavelet transform (MODWT) and machine learning algorithms for daily rain-
fall prediction is proposed. The main objective of this study is to investigate the
potential of combining MODWT with various machine-learning algorithms to
increase the accuracy of rainfall prediction and extend the forecast time hori-
zon to 3 days. In addition, the performances of the proposed hybrid models are
contrasted with the models hybridized with commonly used discrete wavelet
transform (DWT) algorithms in the literature. For this, daily rainfall raw data
from three rainfall observation stations located in Turkey are used. The results
show that the proposed hybrid MODWT models can effectively improve the
accuracy of precipitation forecasting, based on model evaluation measures
such as mean square error (MSE) and Nash-Sutcliffe coefficient of efficiency
(CE). Accordingly, it can be concluded that the integration of MODWT and
machine learning algorithms have the potential to revolutionize the field of
daily rainfall prediction.
Received: 27 October 2023 Revised: 29 March 2024 Accepted: 28 May 2024
DOI: 10.1002/joc.8530
This is an open access article under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License, which permits use and distribution in any
medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.
© 2024 The Authors. International Journal of Climatology published by John Wiley & Sons Ltd on behalf of Royal Meteorological Society.
Int J Climatol. 2024;122. wileyonlinelibrary.com/journal/joc 1
KEYWORDS
hybrid, machine learning, maximum overlap discrete wavelet transform, prediction,
preprocessing, rainfall, wavelet
1|INTRODUCTION
Rainfall prediction is important for several reasons, par-
ticularly in terms of its impact on hydrology and water
resources. Accurate rainfall prediction can help in flood
forecasting and management; by predicting heavy rainfall
events, appropriate measures can be taken to minimize
flood damage (Bezak et al., 2016; Bui et al., 2019). It is
also important in water resources planning and manage-
ment during drought conditions (Deo et al., 2018;
Mouatadid et al., 2018) in maximizing crop yields and
reducing water usage (Hartmann et al., 2016). Accurate
rainfall prediction is essential for managing water
resources such as lakes, reservoirs and rivers. It can help
authorities make decisions about water storage, release
and distribution (Serinaldi & Kilsby, 2012), and, in effi-
cient water supply management by predicting the avail-
ability of water for various uses (Ali et al., 2018; Bagirov
et al., 2017; Zeynoddin et al., 2018). Rainfall prediction is
also critical for the management of hydropower genera-
tion. Accurate prediction of rainfall patterns can facilitate
the optimization of the generation of hydropower and
ensure the stability of the electrical grid (Haddad, 2011).
In precipitation forecasting, numerical models based
on physical mechanisms of natural events, statistical
models and their combinations have been used for years
to examine the relationship between precipitation and
geographic coordinates such as latitude and longitude
(Chegaar & Chibani, 2001) and other atmospheric
parameters such as temperature, humidity, pressure and
wind speed (Ali et al., 2018; Giebel & Kariniotakis, 2017;
Shahrban et al., 2016; Yu et al., 2016). Numerical models
for precipitation forecasting require substantial amounts
of data, leading to high computational costs (Gouda
et al., 2019; Mousavi et al., 2017). The statistical models
offer a methodology of extracting the characteristic fea-
tures of historical rainfall time series and using these
characteristics to forecast future trends in rainfall (Ashby
et al., 2005). On the other hand, importantly, Chong et al.
(2020) highlighted that certain statistical techniques may
be inadequate for predicting rainfall due to the tendency
of historical data to undergo significant transformations
in a relatively brief period of time.
The utilization of artificial intelligence and machine
learning algorithms in rainfall forecasting research has
emerged as a significant approach for modelling complex,
nonlinear phenomena, over the past few decades
(Altunkaynak & Küllahcı,2022; Chadalawada
et al., 2017; Küllahcı& Altunkaynak, 2023a,2023b;
Mandal & Jothiprakash, 2012; Wang & Altunkaynak,
2012) These cutting-edge technologies have proven to be
highly effective in accurately predicting rainfall patterns,
and overcoming the limitations of traditional statistical
methods (Altunkaynak & Nigussie, 2017; Jaiswal &
Malhotra, 2018). By combining multiple individual
models or techniques, these hybrid methods have the
potential to produce more robust and accurate predic-
tions, leading to improved outcomes and advancements
in the prediction field (Heidary & Abad, 2021; Küllahcı&
Altunkaynak, 2023a,2023b; Li et al., 2018; Ouyang
et al., 2016; Pandey et al., 2019; Partal & Ki¸si, 2007; Solgi
et al., 2014; Song et al., 2021; Yin et al., 2023; Zhao
et al., 2021). A selection of studies on the utilization of
both machine learning and signal processing techniques
in the prediction of rainfall time series can be found in
Table 1.
In the present contribution, we introduce an original
approach to rainfall analysis. We apply the maximum
overlap discrete wavelet transform (MODWT) signal
decomposition algorithm to enhance the accuracy of
daily rainfall predictions and extend the prediction time
horizon. Additionally, hybrid MODWT models are con-
trasted with hybrid discrete wavelet transform (DWT)
models. To the best of authors' knowledge, the MODWT
algorithm, as a signal decomposition method has not
been used in conjunction with different prediction
modelling methods. This study represents a pioneering
effort by integrating the MODWT (maximal overlap dis-
crete wavelet transform) with six distinct prediction tech-
niques. These methods include Artificial Neural
Networks (ANN), K-Nearest Neighbours (K-NN),
Extreme Learning Machine (ELM), Fuzzy Logic,
XGBoost (eXtreme Gradient Boosting) and a Deep Learn-
ing approach known as Long-Short Term Mem-
ory (LSTM).
The motivation of this study is to research and find
answers to the following three questions:
1. Can the MODWT improve rainfall predicting perfor-
mance when integrated with ML algorithms?
2. Can MODWT integrated with machine learning algo-
rithms achieve higher prediction accuracy compared
to models integrated with DWT in daily rainfall
prediction?
2KÜLLAHCIand ALTUNKAYNAK
3. In case of the success of the integrated models, which
MODWT hybridized machine learning method pro-
vides the best performance for daily rainfall
prediction?
These questions are investigated to assess the poten-
tial of the MODWT method through a daily rainfall-
predicting case study in Turkey. In exploring answers to
these questions, it is also expected that in addition
to evaluating the potential of MODWT for precipitation
forecasting, MODWT can be leveraged towards other
important hydrological forecasting applications
(e.g., groundwater level, evaporation, stream flow, water
quality).
The remainder of this study is structured as follows:
section 2provides a brief overview of the study area, data
and the methods employed in the analysis; section 3pre-
sents the key findings and a discussion of their implica-
tions, and section 4offers concluding remarks and
suggestions for future research directions.
2|MATERIALS AND METHODS
2.1 |Study area and data
The present study utilizes daily precipitation data from
three distinct precipitation observation stations, namely
Diyarbakır, Şanlıurfa and Adıyaman, located in the
Southeastern Anatolia region of Turkey. The Diyarbakir,
Şanlıurfa and Adıyaman stations, designated with the
numbers 17280, 17270 and 17265 are located at coordi-
nates 3753050.300N4012009.700 E, 3709038.900N
3847010.700E and 3745019.100 N3816039.000E, respec-
tively. The data used in this study was obtained from the
Turkey State Meteorological Service (MGM, 2020) and
consisted of daily precipitation measurements. Figure 1
depicts the geographic positions of the meteorological
stations employed in the investigation. The observational
data from three stations cover a time period of 51 years,
from January 1970 to June 2021. The time series of daily
rainfall data for each of the stations are depicted in
Figure 2ac, respectively. Table 2provides descriptive sta-
tistics for both model calibration and test data sets. The
division of data into training and test sets is a critical step
in machine learning algorithms, as it facilitates accurate
model performance evaluation. During the training
phase, the model learns the most suitable parameters that
capture patterns and relationships within the training
data. These parameters enable the model to make predic-
tions based on input features derived from the test data
by establishing internal representations and decision
boundaries. Subsequently, the model's predictions are
compared with the actual target values in the test data to
assess its performance. In this study, the k-fold cross-
validation method was employed to ensure model
TABLE 1 A few of the rainfall time series studies related to the hybrid usage of machine learning and signal processing technique.
References
Study
area
Temporal
scale
Decomposition
method Prediction method
Performance
evaluation
Feng et al. (2015) China Monthly DWT SVM R, RMSE, MAE, NSE
Altunkaynak and Nigussie
(2015)
Turkey Daily DWT, SA MLP RMSE, CE, SS
Amiri et al. (2016) Iran Monthly DWT ANN MAE, RMSE, SDR, IA
Tao et al. (2017) China Monthly EMD LSSVM NSE, RAE
Ghamariadyan et al. (2019) Australian Monthly DWT ANN RMSE, MAE, d
r
,R
Bojang et al. (2020) Taiwan Monthly SSA LSVR, RF RMSE, NSE
Wu et al. (2021) China Monthly,
annual
DWT ARIMA, LSTM RMSE, MAE, R
2
Wang et al. (2021) China Monthly WPD BPNN, GMDH,
ARIMA
RMSE, MAE, R, NSE
Singh et al. (2024) India Monthly DWT ANN RMSE, CE, R
2
Narimani et al. (2022) South
Korea
Daily SSA, EMD LightGBM, XGBoost RMSE, NSE, MAE, R
2
Abbreviations: ARIMA, autoregressive integrated moving average; BPNN, back-propagation neural network; CE, coefficient of efficiency; d
r
, refined index of
agreement; EMD, empirical modal decomposition; GMDH, group method of data handing; IA, index of agreement; LSSVM, least squares support vector
machine; LS-SVR, least-squares support vector regression; MAE, mean absolute error; MLP, multilayer perceptron; NSE, NashSutcliffe; R, correlation
coefficient; RAE, relative absolute error; RF, random forest; RMSE, root-mean-square error; SA, season algorithm; SDR, standard deviation of residuals; SS,
skill score; SSA, singular spectrum analysis; WPD, wavelet packet decomposition.
KÜLLAHCIand ALTUNKAYNAK 3
accuracy and optimize hyperparameters. Cross-
validation, particularly k-fold cross-validation, assesses
model accuracy and determines optimal hyperpara-
meters. This technique requires splitting the data into
multiple subsets (folds) for both training and validating
the machine-learning model. In this study, daily rainfall
data is partitioned into two parts: training and testing.
The first 28 years of observed daily rainfall data, consti-
tuting 55% of the total data were allocated for model cali-
bration (from January 1970 to May 1997). The remaining
23 years of observed daily rainfall data, representing 45%
of the dataset (from May 1997 to June 2021), were
reserved for evaluating model performance. To ensure a
robust assessment of model accuracy, k-fold cross-
validation with k=5 is employed. This approach
enhances the reliability of our model evaluation by sys-
tematically rotating through different subsets of data for
training and validation, thereby reducing the risk of over-
fitting and providing a more comprehensive assessment
of model generalization performance.
2.2 |Discrete wavelet transform
vs. maximal overlap discrete wavelet
transform
This section provides an introduction to the traditional
discrete wavelet transform (DWT) and the maximal over-
lap discrete wavelet transform (MODWT). Additionally,
the proposed MODWT decomposition technique is pre-
sented in this section.
Wavelet analysis is a commonly utilized technique for
preprocessing signals. It was introduced, in contrast to
Fourier analysis (Daubechies, 1990), to extract both tem-
poral and frequency information and to overcome certain
limitations of stationary and nonstationary time series
modelling. The fundamental concept behind the discrete
wavelet transform (DWT) algorithm involves applying
both high-pass and low-pass filters to the original signal
simultaneously, followed by a downsampling operation.
This approach allows for the separation of the signal into
different frequency bands, with the low-pass filter
capturing the low-frequency components (called approxi-
mation) and the high-pass filter extracting the high-
frequency components (called detail).
The maximal overlap discrete wavelet transform
(MODWT) is a modified version of the discrete wavelet
transform (DWT) that is specifically engineered to
achieve a more comprehensive signal decomposition,
particularly for nonstationary signals (Percival &
Wladen, 2000). Unlike the DWT, the MODWT uses a fil-
ter bank that has a longer impulse response and an over-
lap between adjacent sub-bands, which allows for a more
accurate and complete decomposition of a signal into dif-
ferent frequency bands. In the MODWT, the filters are
designed to be maximally decimated, which means that
the subsampling step is delayed until the end of the
decomposition process, thus ensuring that all of the data
is used in the decomposition. The resulting sub-bands
have the same length as the original signal, and the over-
lap between adjacent sub-bands allows for a more accu-
rate reconstruction of the original signal. The details and
FIGURE 1 Study area.
[Colour figure can be viewed at
wileyonlinelibrary.com]
4KÜLLAHCIand ALTUNKAYNAK
comparison of DWT and MODWT can be found in Cor-
nish et al. (2006).
For a daily rainfall signal R=Rt,t=0,1,,k1g
f, ini-
tially, the primary-stage approximations and details
should be calculated,
Dj,nX
k1
t=0
~
h
j,tWj,n+tmod k,ð1Þ
Aj,n=X
k1
t=0
~
g
j,t
~
Vj,n+tmod k:ð2Þ
The elements of jth level MODWT scaling and wave-
let coefficients Vjand Wjcan be written as, respectively,
Vj,n=X
k1
t=0
~
g
j,tRntmod kj=1,2:3,,L,ð3Þ
Wj,n=X
k1
t=0
~
h
j,tRntmod k,ð4Þ
where kis the length of rainfall signal, ~
g
j,tand
~
h
j,tare jth
level low- and high-pass filters yielded by periodizing ~
gj,t
FIGURE 2 Time series of
the daily rainfall data obtained
from (a) Station 17280
(Diyarbakır), (b) Station 17270
(Şanlıurfa) and (c) Station 17265
(Adıyaman). [Colour figure can
be viewed at
wileyonlinelibrary.com]
KÜLLAHCIand ALTUNKAYNAK 5
and
~
hj,tto length k, respectively, and ~
gj,tand
~
hj,tare jth
level MODWT low- and high-pass filter.
Ultimately, the original rainfall time series signal can
be expressed in relation to the approximations and details
as follows:
RtðÞ=X
L
j=1
Dj+Aj:ð5Þ
Figure 3shows a flowchart of three-level MODWT for
rainfall time series.
2.3 |Artificial neural network
Artificial neural networks (ANNs) are simplified mathe-
matical models that capture various aspects of the func-
tions and structure of the human brain. Although ANNs
are originated from preliminary studies focused on devel-
oping mathematical models inspired by biological sys-
tems, their development has been influenced by various
mathematical and computational principles beyond
direct emulation of biological systems (Anderson
et al., 2001; Argatov, 2019; Avramidis & Wu, 2007;
Niarakis, 2022). ANNs are composed of several key com-
ponents, including
The input layer often termed the input vector com-
prises input elements representing independent vari-
ables. Ideally, the number of neurons in the input
layer aligns with the number of inputs, but this is not
always necessary or optimal. Neural networks possess
the ability to process inputs of varying dimensions
thanks to techniques like feature engineering or
dimensionality reduction. These methods enhance the
network's flexibility and accuracy by effectively trans-
forming and extracting meaningful information from
the input data, enabling more robust and efficient
learning.
Hidden layers in artificial neural networks play a piv-
otal role in processing input data to create more mean-
ingful representations, allowing the network to
capture intricate relationships. While there is no fixed
rule governing the optimal configuration of hidden
layers, their design significantly impacts the network's
TABLE 2 Related statistical
properties for rainfall stations.
Station Data Min Max Mean Standard deviation Skewness
Diyarbakır Total 0.00 71.60 3.35 5.33 3.29
Training 0.00 71.60 2.27 5.24 3.48
Testing 0.00 58.60 4.02 5.43 3.11
Şanlıurfa Total 0.00 90.50 3.47 5.74 3.64
Training 0.00 64.70 3.10 5.64 3.80
Testing 0.00 90.50 4.61 5.83 3.50
Adıyaman Total 0.00 105.90 4.44 7.11 3.47
Training 0.00 80.10 3.53 6.93 3.14
Testing 0.00 105.90 4.31 7.30 3.80
FIGURE 3 A flowchart of
three-level MODWT for rainfall
time series. [Colour figure can
be viewed at
wileyonlinelibrary.com]
6KÜLLAHCIand ALTUNKAYNAK
performance. Traditionally, trial-and-error methods
have been employed to determine the ideal number of
hidden layers and neurons per layer, as mentioned by
(Altunkaynak, 2007). However, it is crucial to note that
alongside these empirical techniques, more systematic
approaches like grid search, random search, Bayesian
optimization, gradient-based optimization and evolu-
tionary optimization have gained prominence for
hyperparameter tuning (Bergstra & Bengio, 2012; Joy
et al., 2016; Sergeyev et al., 2017). These methods offer
a more structured and efficient means of exploring the
vast architectural space, leading to improved model
performance and convergence.
Weighted connections between nodes in adjacent
layers, which allow information to flow between the
layers.
An output layer comprising one or more elements that
represent the dependent variable(s) being predicted by
the network (Walczak, 2014).
For a comprehensive understanding of artificial neu-
ral networks, it is essential to have knowledge about acti-
vation functions. Activation functions are mathematical
operations applied to the output of each neuron in a neu-
ral network to determine its output behaviour. In an arti-
ficial neural network, the sum of the products of inputs
and their corresponding weights is calculated, and
finally, an activation function is applied to obtain the out-
put of that layer and provide it as input to the next layer.
Therefore, the selection of a suitable activation function
can significantly impact the effectiveness of a neural net-
work in solving a specific problem. While numerous
types of activation functions can be used in artificial neu-
ral networks, the most preferred functions are linear,
hyperbolic tangent, sigmoid and step functions (Bueno &
Salmeron, 2009). In recent years, activation functions
such as ReLU, Leaky ReLU, Parameterized ReLU and
SoftMax have gained popularity in the literature. The
ReLU activation function, in particular, is favoured for its
mathematical simplicity, ease of derivative computation
leading to accelerated training processes, mitigation of
the vanishing gradient problem, and ability to induce
sparsity by zeroing out negative inputs. These character-
istics contribute to its widespread adoption in neural net-
work architectures across various domains (Hayou
et al., 2019; Sharma et al., 2020).
2.4 |Fuzzy Logic
Fuzzy Logic, developed by Lotfi A. Zadeh in the 1960s,
allows for graded evaluation rather than strict true or
false determination (Zadeh, 1965,1968,1978) It finds
wide applications in engineering problems such as con-
trol systems, pattern recognition, image processing and
decision-making (Altunkaynak, 2010; Özger & Şen, 2007;
Şen & Altunkaynak, 2004; Xiong et al., 2001) The algo-
rithm involves fuzzification, inference and defuzzifica-
tion processes. The algorithm involves fuzzification,
inference and defuzzification processes. Two main fuzzy
inference methods are Mamdani and Takagi-Sugeno. The
selection of appropriate parameters in TS fuzzy logic
modelling can be a challenging task (Mamdani, 1974;
Takagi & Sugeno, 1985). Therefore, ANFIS (Adaptive
Network-based Fuzzy Inference System) methodology,
originally introduced by Jang and Roger (1993), is com-
monly utilized to estimate the parameters of the member-
ship and consequent functions.
2.5 |K-Nearest Neighbour
K-Nearest Neighbour (KNN) is a widely used machine
learning algorithm for both classification and regression
tasks, known for its ease of applicability. KNN assigns
the label or value of the majority class among its
knearest neighbours for classification tasks or computes
the average of the values of its knearest neighbours for
regression tasks. KNN is classified as a nonparametric
algorithm since it does not presuppose any hypotheses
regarding the underlying distribution of the data (Fix &
Hodges, 1951). It is also a lazy learning algorithm, which
means that KNN does not have a training stage in the tra-
ditional sense. KNN stores the entire training dataset,
and at the time of inference, it computes predictions
directly (Hellman, 1970). KNN identifies the knearest
neighbours to the new observation based on a distance
metric.
The K-NN algorithm requires the computation of the
distance metric between a predicted data point and
the known data points in the training set. While there
exists a plethora of distance metrics, this study will focus
on Euclidean distance that is commonly employed in the
K-NN algorithm. The Euclidean distance dbetween two
point x,yin a multidimensional space can be calculated
using Equation (6) as follows:
dx,yðÞ=Xn
i=1xiyi
jj
2

1
2,ð6Þ
where (xi,yi) are the variables of vectors xand y, respec-
tively, in the two-dimensional vector space, nis the num-
ber of variables and dis the Euclidean distance. The
common use of Euclidean distance in K-NN is due to its
simplicity and effectiveness, as it considers differences in
all data dimensions. Selecting the optimal number of
KÜLLAHCIand ALTUNKAYNAK 7
neighbours (k) is crucial for K-NN's performance, which
is achieved through cross-validation. k-fold cross-
validation partitions the data into training and test sets
across multiple folds, allowing for thorough model evalu-
ation. By testing various kvalues during each fold, the
impact on performance metrics such as accuracy and pre-
cision can be analysed, aiding in the selection of the opti-
mal kvalue.
2.6 |Extreme Learning Machine
Extreme Learning Machine (ELM) is a type of
machine learning algorithm that belongs to the family of
artificial neural networks (ANNs). ELMs are designed to
address some of the limitations of traditional ANNs, such
as long training times, the need for fine-tuning, and the
risk of overfitting. The Extreme Learning Machine (ELM)
proposed by Huang et al. (2006) is characterized by a
feedforward neural network architecture with a single
hidden layer consisting of one neuron. For more detailed
information, refer Gumaei et al. (2019) and Huang
et al. (2015,2011).
2.7 |Long-Short Term Memory Neural
Network
Long Short-Term Memory (LSTM) is a specialized variant
of recurrent neural networks (RNN) that is specifically
engineered to address the issue of vanishing gradients, a
persistent challenge encountered in conventional RNNs.
By Hochreiter and Schmidhuber (1997), like traditional
RNNs, LSTMs are designed to model sequential data by
maintaining a hidden state that captures the current
contextof the input sequence. However, LSTMs differ
from traditional RNNs in that they incorporate memory
cells, which allow the network to selectively store and
retrieve information over long periods of time. The mem-
ory cell is controlled by various gates, which regulate the
flow of information into and out of the cell, allowing
the LSTM to learn and model long-term dependencies in
the input sequence (Lecun et al., 2015).
2.8 |Extreme Gradient Boosting
(XGBoost)
Ensemble modelling involves the generation of models
through tree-based methods such as random forests, extra
trees, adaptive and gradient boosting techniques
(Friedman, 2001). This study uses XGBoost to benchmark
a multi-stage ensemble model, enhancing performance
compared to gradient-boosted decision trees (GBDT).
XGBoost creates decision trees faster due to its paralleli-
zation feature, optimizing loop steps for efficient execu-
tion and handling missing values efficiently (Chen &
Guestrin, 2016; Li et al., 2019; Wang et al., 2020).
2.9 |Performance evaluation criteria
The evaluation of machine learning methods often relies
on multiple performance metrics, which allow for a com-
prehensive and reliable assessment of their predictive
capabilities. In the present study, we employed four dis-
tinct diagnostic metrics to compare the performance of
the methods under consideration, namely the mean
square error (MSE), coefficient of efficiency (CE),
mean absolute error (MAE) and correlation coefficient
(R). Each of these metrics captures different aspects of
the model's accuracy and ability to fit the data, and their
combined use enables a more robust evaluation of the
machine learning algorithms. Table 3presents the equa-
tions and intervals for the performance metrics employed
in this study.
The expressions presented here are commonly used to
evaluate the performance of prediction models. In these
expressions, Pprepresents the predicted values, Porepre-
sents the observed values, Ppand Porepresent the mean
of the predicted and observed values, respectively, and
nrepresents the number of samples.
3|RESULTS AND DISCUSSION
3.1 |Model development
This study proposes a combined model of machine learn-
ing algorithms with data processing tools MODWT to
improve daily rainfall prediction accuracy. Hybrid
MODWT models are developed to improve the prediction
accuracy of daily rainfall for three stations in Turkey over
an extended time horizon of up to 3 days. In addition to
the hybrid MODWT models, independent models and
hybrid DWT models are also created for comparison pur-
poses. The present study aims to develop an enhanced
prediction framework and obtain predictive results for
daily rainfall at three stations in Turkey, with an
extended time horizon of up to 3 days. To achieve this
objective, a series of methodological steps are performed,
which can be summarized as follows:
1. Determining the number of previous inputs for the
predictive model is a fundamental step in timeseries
research. In order to identify the lags that are
8KÜLLAHCIand ALTUNKAYNAK
correlated with the original series, autocorrelation
functions (ACFs) are generated. Subsequently, an arti-
ficial neural network (ANN) model is applied, taking
into account the lag times obtained from the ACFs.
The purpose of this step was to determine the optimal
number of lags to be used in the model, which is criti-
cal for achieving an accurate and reliable prediction
of the target variable. By considering different lag
times acquired from the ACFs, the ANN model is able
to effectively identify the lag times that are most
strongly correlated with the original series, thereby
optimizing the predictive performance of the model.
2. In the modelling stage, the stand-alone ML algo-
rithms, ANN, Fuzzy, KNN, ELM, LSTM and XGBoost
are developed to predict daily rainfall. Then, the
MODWT decomposition technique is incorporated
into the stand-alone models to improve the prediction
accuracies. In the last step of the modelling stage, dis-
crete wavelet transform (DWT) is included in the
independent models in order to make a comparison
with the hybridized MODWT model results.
3. Finally the accuracy of all models is evaluated with
respect to the performance metrics. Moreover, the
scatter and Taylor diagrams are presented to provide
the intercomparison of the generated models.
The flowchart of the current research is shown in
Figure 4. The dashed lines in the figure correspond to the
primary phases and preprocessing steps involved in
developing a machine-learning model. On the other
hand, the solid lines depict the operational stages of the
model.
3.1.1 | Stage 1: Lag time determination
Optimizing the lag time is an utmost important compo-
nent of effective time series forecasting. The lag time
denotes the delay between the occurrence of an event and
the manifestation of its impact in the time series data. The
appropriate lag time is determined by identifying the opti-
mal number of past observations to be incorporated into
the model to achieve accurate predictions. The literature
describes two common methods for input selection: a trial
and error approach to determine the optimal number of
lagged time series data for the best forecast performance,
and using the autocorrelation function (ACF) method to
identify the most correlated lagged variables.
In this study, the ACF method was used to select
input variable combinations, which is a frequently used
approach in hydrological forecasting studies (Kothe
et al., 2019; Mislan et al., 2015). The ACF method identi-
fied that lag time-1 has the highest autocorrelation, as
illustrated in Figure 5. However, during model develop-
ment, the Artificial Neural Network (ANN) algorithm
was employed to determine whether lags larger than
1 have a positive impact on the model. The general archi-
tecture of the employed Artificial Neural Network (ANN)
algorithm is as follows: it comprises three layers, each
consisting of 50 neurons. The activation function utilized
throughout the network is the tanh. The optimization
algorithm employed is Adam, with a learning rate set at
0.001. Additionally, the training process iterates
250 epochs. This configuration is selected to ensure a
structured and robust framework for neural network
modelling, allowing for effective learning and adaptation
to complex patterns within the data.
Representing other stations, the results of the model
for station 17280 are presented in Table 4.
Initially, a lag time of daily (t) is deemed as a single
input for the analysis, after which additional lag times
are sequentially integrated up to a maximum of six con-
secutive lag times. Table 4shows that the inclusion of
more lag times leads to an increase in the model's perfor-
mance from one to two lag times. (CE
(t)
:0.54,
CE
(t1,t)
:0.60). Furthermore, as evidenced by the
TABLE 3 Performance evaluation
criteria. Metrics Maximum Minimum Equation
MSE 0MSE=1
nP
n
i=1
Ppi Poi
2
CE 1 −∞
CE=1P
n
i=1
Ppi Poi
ðÞ
2
P
n
i=1
Ppi Poi
ðÞ
2
MAE 0MAE=1
nP
n
i=1
Ppi Poi
R11
R=P
n
i=1
Ppi Ppi
ðÞ
×Poi Poi
ðÞ

2
ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
P
n
i=1
Ppi Ppi
ðÞ
2

s×ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
P
n
i=1
Poi Poi
ðÞ
2
r
2
6
6
6
6
4
3
7
7
7
7
5
KÜLLAHCIand ALTUNKAYNAK 9
presented values in Table 4, it was observed that the
inclusion of more than two lag times did not contribute
to the accuracy of the models. Based on these observa-
tions and in accordance with the ACF results, the num-
ber of time lags was determined to be two.
3.1.2 | Stage 2: Time series decomposition
Wavelet decomposition is a signal processing technique
that decomposes a signal into a set of wavelet basis
functions. The basis functions are generated by dilating
and translating a single mother wavelet function, which
is usually chosen to have compact support in both the
time and frequency domains. Despite numerous efforts in
the literature, identifying a universal wavelet type that is
applicable to all time series remains a challenging task
due to the lack of established guidelines for mother wave-
let selection. In this respect, this study employed four dis-
tinct wavelet types, namely Symlets wavelets, Daubechies
wavelets, Fejer-Korovkin and Coiflet wavelets, to deter-
mine the optimal wavelet type that best matches the
FIGURE 4 Flowchart of the
model development processes.
[Colour figure can be viewed at
wileyonlinelibrary.com]
10 KÜLLAHCIand ALTUNKAYNAK
observations. To identify the optimal level of decomposi-
tion, three to six levels were tested for each mother wave-
let. The ELM model was selected for this process due to
its fast processing capability. Hyperparameters used in
the ELM model are number of neurons in input layer,
35, number of neurons in hidden layer, 35, activation
function, tanh, and learning rate, 0.001.
Table 5presents the bold values in the results that
indicate the optimal combination of wavelet function and
decomposition level for daily rainfall prediction. The
results demonstrate that the Symlets wavelets produce the
highest accuracy, while the Fejer-Korovkin wavelets yield
the lowest accuracy. Furthermore, among the various
decomposition levels tested for Symlets wavelets, the third
level achieves the best results, with an MSE of 0.84 and an
CE of 0.97166. These results were used in subsequent pre-
dictions for daily rainfall by combining Symlets wavelet
decomposition with various machine learning methods.
Figure 7illustrates the obtained subseries (called Level
1, Level 2 and Level 3) and the approximation through
Symlets wavelet decomposition, respectively.
In order to be able to compare the MODWT model
results obtained during the modelling phase, the time
series is further decomposed into subseries with DWT in
stand-alone models. As in MODWT, different wavelet
types and levels have been tested and the best perfor-
mance has been obtained for the Haar wavelet and three
subseries.
FIGURE 5 Autocorrelation function graph of three stations. [Colour figure can be viewed at wileyonlinelibrary.com]
TABLE 4 The performance of
different lag-time for Station 17280.
Model Input combination Output
Training Test
MSE CE MSE CE
ANN tt+1 16.94 0.46 15.74 0.54
ANN t1,tt+1 15.69 0.51 13.76 0.60
ANN t2, t1,tt+1 15.49 0.50 13.94 0.59
ANN t3, t2, t1,tt+1 15.76 0.50 14.24 0.58
ANN t4, t3, t2, t1,tt+1 15.25 0.51 14.01 0.59
ANN t5, t4, t3, t2, t1,tt+1 15.50 0.51 14.36 0.58
ANN t6, t5, t4, t3, t2, t1,tt+1 15.40 0.51 14.31 0.58
Note: Bold indicates the prediction results obtained using models with different inputs and the data of the
model that gives the highest prediction.
KÜLLAHCIand ALTUNKAYNAK 11
3.1.3 | Stage 3: Prediction models
In this stage, six different machine learning
(ML) techniques were applied for daily rainfall prediction
in Turkey, including ANN, Fuzzy logic, K-NN, ELM,
XGBoost and LSTM hybridized with the MODWT signal
decomposition technique. The training process in
machine learning algorithms engages learning and
adjusting their parameters based on input data. The tun-
ing of hyperparameters is a crucial step in evolving reli-
able ML models. Tuning hyperparameters reduces
overfitting and enhances the model's generalizability to
new data (Bardenet et al., 2013). Selecting the best hyper-
parameters is also a significant component of improving
the accuracy of the model. There are various methods for
hyperparameter selection and finding the optimal solu-
tion. Grid search and random search are among the
many methods developed for hyperparameter optimiza-
tion. Random search methods randomly select different
hyperparameter values for a specified number of itera-
tions, while grid search explores all potential values
within a predefined range of hyperparameters through
trial and error to find the best solution. In this study, a
grid search technique with five-fold cross-validation was
used to explore the possible values of hyperparameters
(Kuhn & Johnson, 2013). This approach is particularly
advantageous when dealing with a large number of
hyperparameters or a high-dimensional search space,
offering improved efficiency compared to an exhaustive
random search (Yu & Zhu, 2020). Table 6contains the
necessary hyperparameters and trial intervals for each
algorithm. During grid search, each combination of
hyperparameters on the grid is evaluated by training and
validating the model using the cross-validation set. By
thoroughly exploring all possible combinations of
TABLE 5 The performance of different mother wavelets and
their corresponding different decomposition levels for daily rainfall
prediction.
MODWT function Level
Test
MSE CE
Symlets wavelets 3 0.84 0.97166
4 0.88 0.97033
5 0.86 0.97079
6 0.88 0.97031
Daubechies wavelets 3 1.00 0.96626
4 1.10 0.96282
5 1.09 0.96308
6 1.00 0.96601
Fejer-Korovkin wavelets 3 1.64 0.94427
4 1.62 0.94502
5 1.61 0.94551
6 1.68 0.94308
Coiflets wavelets 3 0.98 0.96691
4 1.04 0.96470
5 0.86 0.97070
6 0.89 0.96699
Note: Bold indicates the wave type and properties that give the best results
by examining the models obtained in different wave types and categories in
order to decide on the wave type to be used in the proposed MODWT
separation method.
TABLE 6 Hyperparameters of models for all algorithms.
Method Parameter Value
ANN Network type Feed Forward Neural
Network with Back
Propagation
Neurons used for
hidden layer
[3, 15, 35]
Activation function [sigmoid, tanh, linear]
Optimization
algorithm
[adam, sgd, rmsprop]
Learning rate [0.001, 0.01, 0.1]
Number of epochs [100, 500, 1000]
Batch size [16, 64, 128]
KNN Max_k [50]
Distance metric [euclidean, manhattan,
minkowski]
Weights [Uniform, distance]
ANFIS Number of fuzzy sets [2, 3, 5]
Membership function [Triangular, Trapezoidal,
Gaussian]
Training algorithm [gradient descent,”“least
squares,”“hybrid]
XGBoost Number of trees [1, 100]
Maximum depth [1, 30]
Learning rate [0.1, 0.95]
ELM Number of neurons in
input layer and
hidden
[20, 35, 100]
Activation functions [sigmoid, tanh, sine,
radial]
Learning rate [0.001, 0.01, 0.1]
LSTM Number of Hidden
units
[100, 250, 500]
Dropout rate [0.1, 0.2, 0.3]
Initial learning rate [0.001, 0.005, 0.01]
Activation functions [sigmoid, tanh, relu]
Backpropagation
algorithm
[adam, sgd, rmsprop]
12 KÜLLAHCIand ALTUNKAYNAK
hyperparameters within the defined grid, grid search aids
in identifying the best set of hyperparameters that yield
the highest performance metric.
3.2 |Model results
3.2.1 | Stand-alone models results
Table 7displays the results of the stand-alone models in
terms of different performance indicators. Evident from
Table 7is, as the prediction time increases, it can be
observed that the MSE and MAE values increase, while
the CE and R values decrease. As the correlation value
decreases with increasing lead times, it is reasonable to
expect that the predictive performance of the models
will be lower for a wider future time horizon. Based on
themodelresults,thehighestperformanceforT+1
lead time at station 17280 is observed in the LSTM
model with CE
LSTM
=0.51, while the lowest perfor-
manceisfoundintheFuzzymodelwithCE
Fuzzy
=0.41.
Fortheothermodels,theobtainedresultsarebetween
these two values close to each other. When the results
for station 17270 are examined, the highest perfor-
mance is observed in the ANN model with
CE
ANN
=0.61, while the lowest performance is seen in
the Fuzzy model with CE
Fuzzy
=0.53. For the remaining
models, obtained results are close to each other between
these two values. Finally, when the results for station
17265 for T+1 lead time are considered, the highest
performanceisobservedintheLSTMmodelwith
CE
LSTM
=0.43, while the Fuzzy model provided lowest
performance with CE
Fuzzy
=0.35. Overall, it is observed
that for T+1 lead time, only a few models achieve the
acceptable success criterion of CE =0.5 at all three sta-
tions, while the other models perform below this value.
Altunkaynak (2010)notedthatthereisnowidely
accepted standard for evaluating model performance
based on the CE value. However, according to Donigian
and Love (2012) and Altunkaynak (2010) CE values fall-
ing within the ranges of 0.650.75 and 0.750.85 can be
deemed fair and good, respectively. Furthermore, a CE
value exceeding 0.85 is regarded as indicative of very
good model prediction performance. For the subsequent
lead times (T+2) and (T+3), the prediction perfor-
mances of the models are below the acceptable level,
indicating that stand-alone models need to be used in
conjunction with preprocessing techniques to achieve
high-accuracy prediction results for daily rainfall at the
three stations.
TABLE 7 The comparative performance evaluation of stand-alone machine learning models using selected indicators.
Station 17280 (test) 17270 (test) 17265 (test)
Models MSE CE MAE RMSE CE MAE RMSE CE MAE R
t+1 ANN 14.76 0.50 1.59 0.71 13.76 0.60 1.44 0.77 30.81 0.42 2.11 0.65
Fuzzy 17.26 0.41 1.97 0.64 16.08 0.53 1.79 0.73 34.51 0.35 2.67 0.60
K-NN 15.09 0.49 1.56 0.70 14.59 0.57 1.40 0.76 30.97 0.42 2.11 0.65
ELM 15.07 0.49 1.62 0.70 13.85 0.59 1.44 0.77 31.74 0.41 2.18 0.64
XGBoost 15.95 0.46 1.67 0.68 14.93 0.56 1.51 0.75 31.90 0.40 2.22 0.63
LSTM 14.53 0.51 1.43 0.72 14.18 0.58 1.29 0.77 30.35 0.43 1.95 0.66
t+2 ANN 18.38 0.38 2.08 0.61 18.40 0.46 1.95 0.68 38.06 0.29 2.78 0.54
Fuzzy 21.12 0.28 2.50 0.53 21.56 0.37 2.35 0.61 41.72 0.22 3.24 0.47
K-NN 18.38 0.38 2.08 0.61 19.50 0.43 1.88 0.65 38.84 0.27 2.71 0.52
ELM 18.85 0.36 2.19 0.60 18.88 0.45 2.03 0.67 38.89 0.27 2.89 0.52
XGBoost 19.55 0.34 2.18 0.58 20.56 0.40 2.07 0.63 39.75 0.26 2.92 0.51
LSTM 17.95 0.39 1.97 0.63 17.96 0.47 1.78 0.69 37.96 0.29 3.00 0.55
t+3 ANN 20.59 0.30 2.36 0.55 21.13 0.38 2.26 0.62 43.02 0.19 3.41 0.44
Fuzzy 23.20 0.21 2.78 0.46 24.71 0.27 2.65 0.53 45.73 0.14 3.70 0.38
K-NN 20.79 0.30 2.31 0.54 22.60 0.34 2.19 0.58 42.38 0.21 3.06 0.46
ELM 20.98 0.29 2.52 0.54 24.91 0.27 2.63 0.52 45.12 0.15 3.54 0.40
XGBoost 21.72 0.26 2.50 0.51 23.60 0.31 2.39 0.55 43.16 0.19 3.30 0.44
LSTM 20.76 0.30 2.29 0.55 20.98 0.38 2.27 0.62 41.26 0.23 3.10 0.48
KÜLLAHCIand ALTUNKAYNAK 13
3.2.2 | Hybrid MODWT machine learning
models results
After obtaining the results from the stand-alone models
the daily rainfall time series data are decomposed using
the maximum overlap discrete wavelet transform
(MODWT), which is an alternative to the commonly used
discrete wavelet transform (DWT) preprocessing algo-
rithm. In this study, the daily time series data are divided
into four subseries (bands), with one being approxima-
tion and the remaining three being detail bands
(Figure 6).
FIGURE 6 Plots of three
bands and the approximation
decomposed by MODWT for
station 17280. [Colour figure can
be viewed at
wileyonlinelibrary.com]
TABLE 8 The comparative performance evaluation of hybrid MODWT machine learning models using selected indicators.
Station 17280 (test) 17270 (test) 17265 (test)
Models MSE CE MAE RMSE CE MAE RMSE CE MAE R
t+1 MODWT-ANN 0.81 0.97 0.39 0.99 0.78 0.98 0.36 0.99 1.71 0.97 0.53 0.98
MODWT-Fuzzy 1.95 0.93 0.82 0.97 1.27 0.96 0.39 0.98 1.96 0.96 0.56 0.98
MODWT-K-NN 1.26 0.96 0.48 0.98 1.45 0.96 0.47 0.98 2.96 0.94 0.66 0.97
MODWT-ELM 0.84 0.97 0.39 0.98 0.90 0.97 0.37 0.99 1.96 0.96 0.56 0.98
MODWT-XGBoost 1.04 0.96 0.47 0.98 1.31 0.96 0.47 0.98 2.32 0.96 0.65 0.98
MODWT-LSTM 0.87 0.97 0.47 0.99 1.09 0.97 0.45 0.99 1.92 0.96 0.58 0.98
t+2 MODWT-ANN 5.33 0.82 0.96 0.91 4.61 0.86 0.85 0.93 11.39 0.79 1.33 0.89
MODWT-Fuzzy 5.65 0.81 0.99 0.90 5.45 0.84 0.88 0.92 11.50 0.78 1.38 0.89
MODWT-K-NN 5.58 0.81 1.02 0.91 5.10 0.85 0.92 0.93 11.73 0.78 1.40 0.89
MODWT-ELM 5.09 0.83 0.95 0.91 4.75 0.86 0.87 0.93 11.13 0.79 1.33 0.90
MODWT-XGBoost 5.45 0.82 1.01 0.9 5.10 0.85 0.93 0.93 11.56 0.78 1.41 0.89
MODWT-LSTM 5.38 0.82 1.02 0.92 4.45 0.87 0.85 0.94 10.92 0.80 1.57 0.90
t+3 MODWT-ANN 5.98 0.80 1.00 0.90 5.75 0.83 0.99 0.92 13.28 0.75 1.52 0.87
MODWT-Fuzzy 6.72 0.77 1.13 0.88 6.67 0.80 1.04 0.90 13.77 0.74 1.53 0.87
MODWT-K-NN 6.71 0.77 1.20 0.88 6.42 0.81 1.10 0.91 13.71 0.74 1.59 0.87
MODWT-ELM 6.35 0.78 1.13 0.89 9.28 0.73 1.41 0.86 20.09 0.62 2.02 0.79
MODWT-XGBoost 6.77 0.77 1.22 0.88 6.43 0.81 1.12 0.91 13.55 0.75 1.60 0.87
MODWT-LSTM 6.14 0.79 1.15 0.89 5.78 0.83 1.08 0.92 12.19 0.77 1.46 0.88
14 KÜLLAHCIand ALTUNKAYNAK
ML models are applied to each subseries which are
divided into training and testing sets. The subseries
are predicted up to a lead time of 3 days. The results are
presented in Table 8.
Upon evaluation of the results of hybrid MODWT
models, it is found that CE values for t+1 lead time ran-
ged between 0.93 and 0.98 for all three stations. These
results clearly indicate that the hybrid MODWT models
provide perfect accuracy for t+1 lead time. The CE value
ranges between 0 and 1, and values close to 1 are indica-
tive of excellent performance. When evaluating hybrid
MODWT models for t+1 lead time, the MODWT-ANN
model achieved the lowest MSE and highest CE values
for the three stations. However, it is evident that other
models also performed well with CE values being rela-
tively close. When comparing the hybrid MODWT
models with stand-alone models, it is observed that the
CE values for t+1 lead time increased from approxi-
mately 0.500.97 with the hybrid models. The findings of
stand-alone models indicate that they are not sufficient
for even predicting t+1 lead time, whereas the hybrid
MODWT models exhibited excellentperformance with
CE values above 0.97. According to the prediction results
for t+2 lead time in Table 8, the CE values range from
0.78 to 0.87 for all three stations. When the results for
t+2 lead time are examined based on the prediction
algorithm, the highest performance with a CE value of
0.83 is obtained with the MODWT-ELM model for station
17280, the highest performance with a CE value of 0.87 is
obtained with the MODWT-LSTM model for station
17270, and finally, the highest performance with a CE
value of 0.80 was obtained with the MODWT-LSTM
model for station 17265. Upon analysis of the results
obtained for the t+3 lead time, it is found that the
hybrid models exhibit goodperformance, with CE
values reaching up to 0.83 across all three stations. As the
time lag between the input data and the predicted values
increases, there is a slight decline in the models' predic-
tive performance owing to the diminishing correlation
with delays, or time steps. Nevertheless, the findings indi-
cate that the hybrid MODWT models constitute a viable
means of enhancing predictive accuracy, as well as
extending the temporal horizon of reliable precipitation
forecasting for up to 3 days.
3.2.3 | Hybrid DWT machine learning
models results
As part of the final stage of modelling, the results of the
hybrid DWT model are presented for comparison with
the hybrid MODWT model. First, the daily rainfall time
series is divided into four subseries (bands) using DWT,
with one approximation and three details. Similar to the
other models, the subseries are separated into training
and testing sets, and the hybrid DWT model was applied
to each subseries to generate individual predictions for
lead times ranging from 1 to 3 days. The MSE, CE, MAE
and Rvalues of the hybrid DWT models are calculated
for the testing (calibration) phase, for lead times ranging
from 1 to 3 days, at three stations. These results are pre-
sented in Table 9. When examining the results of the
hybrid DWT model (Table 9), the CE values for the t+1
lead time are obtained between 0.71 and 0.73 for station
17280, between 0.72 and 0.80 for station 17270, and
between 0.80 and 0.86 for station 17265. The results indi-
cate that the hybrid DWT models performed better than
stand-alone models, but lagged behind the hybrid
MODWT models. For the hybridized DWT model, the
t+2 lead time CE values ranged between 0.69 and 0.47
for the three stations, while for the t+3 lead time, all CE
values were below 0.5. The decomposition of daily rain-
fall time series into certain frequencies and scales using
the widely used DWT decomposition method in the liter-
ature has contributed positively to the prediction perfor-
mance and improved the prediction accuracy. However,
the results of the model show that the DWT hybrid
model could not achieve the performance success of the
hybrid MODWT model. It is recognized that
the MODWT hybrid model provides a reliable and suffi-
cient contribution to increasing and improving the pre-
diction accuracy, especially for the t+2 and t+3 lead
times.
Moreover, the assessment of the outcomes is facili-
tated by employing Taylor diagrams, which furnishes a
comprehensive evaluation of the models' performance
across multiple dimensions, thereby allowing for the pre-
cise determination of their level of accuracy. The findings
presented in Figure 7demonstrate that the stand-alone
models exhibited lower levels of accuracy compared to
the hybrid-DWT models. Remarkably, the hybrid-
MODWT models outperformed all other models across
all lead times. Although the Taylor diagrams provide
insights into the accuracy of the predicted time series,
they do not offer information about the distributions of
the predicted and observed time series. Hence, it is essen-
tial to consider the distributional characteristics of the
time series to gain a comprehensive understanding of
the model performance. Additionally, scatter plots were
employed to assess the statistical significance of the
results.
A scatter plot in statistical analysis constitutes a visual
depiction of the correlation between two distinct vari-
ables. It typically consists of a horizontal axis represent-
ing one variable and a vertical axis representing the
other, plotting data points as individual points on
KÜLLAHCIand ALTUNKAYNAK 15
the graph. The scatter diagram allows for visualizing pat-
terns or trends in the data, such as a positive or negative
correlation between the two variables. It can be helpful
for identifying outliers or clusters in the data, as well as
for identifying potential relationships or dependencies
between the variables. Scatter diagrams for the models
for station 17280 for lead time t+1 are shown in
Figure 8and provide a visualization of the relationship
between the observed data and the corresponding precip-
itation forecasts. Moreover, the 45diagonal line (also
known as the 1:1 line) is utilized as an effective explor-
atory tool to assess the degree of concordance between
the observed daily rainfall data and the corresponding
predictions generated by the proposed models. Based on
the analysis of Figure 8, it can be observed that all of the
stand-alone models yielded suboptimal performances in
t+1 prediction, and predictions indicated a poor correla-
tion between the predicted and observed rainfall values.
However, the hybrid-DWT model demonstrated an
improvement in the performance of the stand-alone
models due to wavelet decomposition. As expected, the
t+1 prediction had the best results. However, this
enhancement was not sufficient for accurate rainfall pre-
diction, as the scatter plots for t+1 showed inadequate
dispersion of data points. Figure 8revealed that the
hybrid MODWT model outperformed the hybrid DWT
models, exhibiting significant improvements in t+1 lead
time. Consequently, it can be inferred that the hybrid
MODWT model produced the best-scattered results
among the proposed models.
In summary, the proposed hybrid MODWT models
showed superior performance in forecasting future rain-
fall for all time spans (1, 2 and 3 days) compared to other
models evaluated.
4|CONCLUSION
This study has aimed to compare the performance of var-
ious hybrid models for daily rainfall prediction by incor-
porating preprocessing methods, MODWT and DWT,
with machine learning algorithms to improve accuracy
and extend the lead time of prediction. The accurate pre-
diction of daily rainfall is crucial for effective water
resource management, flood forecasting and agricultural
planning, among other applications. However, due to the
complex and nonlinear nature of meteorological systems,
accurate rainfall prediction remains a challenging task.
Hybrid models, which combine multiple techniques to
improve prediction accuracy, have shown promising
results in previous studies and are increasingly being uti-
lized in rainfall prediction research. Therefore, this study
TABLE 9 The comparative performance evaluation of hybrid DWT machine learning models using selected indicators.
Station 17280 (test) 17270 (test) 17265 (test)
Models MSE CE MAE RMSE CE MAE RMSE CE MAE R
t+1 DWT-ANN 7.88 0.73 1.16 0.86 6.80 0.80 0.98 0.89 8.14 0.85 1.19 0.92
DWT-Fuzzy 8.90 0.70 1.32 0.84 9.61 0.72 1.29 0.85 10.53 0.80 1.45 0.90
DWT-K-NN 8.57 0.71 1.22 0.84 8.34 0.76 1.16 0.87 8.20 0.85 1.16 0.92
DWT-ELM 8.30 0.72 1.21 0.85 8.03 0.76 1.10 0.88 7.66 0.86 1.21 0.93
DWT-XGBoost 8.68 0.71 1.22 0.84 8.90 0.74 1.17 0.86 8.64 0.84 1.22 0.91
DWT-LSTM 8.05 0.73 1.17 0.55 8.21 0.76 1.20 0.88 7.63 0.86 1.16 0.93
t+2 DWT-ANN 13.22 0.55 1.78 0.74 13.39 0.61 1.64 0.78 25.33 0.53 2.36 0.71
DWT-Fuzzy 15.71 0.47 1.94 0.68 14.43 0.58 1.73 0.76 20.64 0.61 2.09 0.78
DWT-K-NN 14.10 0.52 1.92 0.72 14.30 0.58 1.79 0.76 17.72 0.67 1.85 0.81
DWT-ELM 13.39 0.55 1.78 0.74 13.60 0.60 1.64 0.78 17.74 0.67 1.87 0.81
DWT-XGBoost 14.21 0.52 1.84 0.72 15.21 0.55 1.76 0.74 18.39 0.66 1.94 0.81
DWT-LSTM 13.45 0.54 1.83 0.74 13.37 0.61 1.59 0.78 16.76 0.69 1.88 0.82
t+3 DWT-ANN 16.49 0.44 2.06 0.66 15.20 0.56 1.88 0.75 22.51 0.58 2.21 0.76
DWT-Fuzzy 17.28 0.41 2.12 0.64 16.69 0.51 1.98 0.72 26.77 0.50 2.59 0.71
DWT-K-NN 17.62 0.40 2.19 0.64 14.30 0.58 1.79 0.76 22.25 0.58 2.22 0.76
DWT-ELM 16.56 0.44 2.03 0.66 19.25 0.43 2.19 0.66 28.27 0.47 2.55 0.68
DWT-XGBoost 17.76 0.40 2.11 0.63 17.68 0.48 2.04 0.69 22.92 0.57 2.32 0.76
DWT-LSTM 16.66 0.44 2.05 0.66 18.02 0.47 2.09 0.71 21.47 0.60 2.18 0.77
16 KÜLLAHCIand ALTUNKAYNAK
aimed to contribute to the ongoing efforts to improve
rainfall prediction accuracy and extend the lead time of
prediction by comparing the performance of the hybrid
MODWT model. Based on the findings of the study, it
can be inferred that several conclusions can be drawn:
From the results of this study, it can be inferred that
the stand-alone machine learning methods had inade-
quate prediction performance as indicated by the per-
formance metrics. However, the implementation of the
discrete wavelet transform (DWT) showed some
improvement in the prediction accuracy. On the other
hand, the hybrid DWT approach did not meet the
expected accuracy levels, particularly when predicting
for time horizons t+2 and t+3, while the hybrid
MODWT models revealed a considerable increase in
model accuracies up to 3 days at all stations.
The MODWT decomposition method is compatible
with various machine learning algorithms, as evi-
denced by the similar performance results obtained
from the six different algorithms used in the study.
This suggests that the MODWT method is a suitable
technique for use in different estimation algorithms,
thereby contributing to the development of more accu-
rate and effective predictive models.
All the hybrid MODWT models are found as the best
model according to the Taylor diagrams and scatter
plots.
The findings of the diagnostic assessment criteria indi-
cate that as the prediction lead times increase, there is
FIGURE 7 The Taylor diagrams of employed models' errors for station 17270, considering prediction horizons of (a) t+1, (b) t+2 and
(c) t+3. [Colour figure can be viewed at wileyonlinelibrary.com]
KÜLLAHCIand ALTUNKAYNAK 17
a decrease in the values of CE and R, while the values
of MSE and MAE increase for all of the stations. This
can be attributed to the decreasing autocorrelation,
which in turn affects the accuracy of the predictions.
These results suggest that longer prediction lead times
may require more sophisticated modelling techniques
that account for the decrease in autocorrelation over
time in order to achieve more accurate and reliable
predictions.
The success of the developed MODWT model high-
lights its potential for accurate long-term prediction and
suggests that it may be applied to various scientific fields
where such predictions are of importance. As such, it is
recommended that further investigations be carried out
to explore the potential of the MODWT model for accu-
rately predicting various hydrological variables in earth
sciences, and for improving predictions across various
time scales and regions. However, it is important to note
that this study is limited by the use of machine learning
applications, and the authors intend to address this limi-
tation by adopting multi-stage modelling approaches in
their future research on rainfall time series.
AUTHOR CONTRIBUTIONS
Kübra Küllahcı:Methodology; validation; visualization;
formal analysis; software; data curation; conceptualiza-
tion; writing original draft; resources; investigation.
Abdüsselam Altunkaynak: Writing review and edit-
ing; project administration; supervision; investigation.
ACKNOWLEDGEMENT
We sincerely thank the Turkey Meteorological Service for
providing precipitation data.
FIGURE 8 Scatter plots of models of lead time (t+1) for station 17280 (for the test set) (a) Stand-Alon models, (b) Wavelet models and
(c) MODWT models (the red line represents the 1:1 perfect line). [Colour figure can be viewed at wileyonlinelibrary.com]
18 KÜLLAHCIand ALTUNKAYNAK
CONFLICT OF INTEREST STATEMENT
The authors declare no conflicts of interest.
DATA AVAILABILITY STATEMENT
The data that support the findings of this study are avail-
able from the corresponding author upon reasonable
request.
ORCID
Kübra Küllahcıhttps://orcid.org/0000-0003-4699-5878
REFERENCES
Ali, M., Deo, R.C., Downs, N.J. & Maraseni, T. (2018) Multi-stage
hybridized online sequential extreme learning machine inte-
grated with Markov chain Monte Carlo copula-bat algorithm
for rainfall forecasting. Atmospheric Research, 213, 450464.
Available from: https://doi.org/10.1016/j.atmosres.2018.07.005
Altunkaynak, A. (2007) Forecasting surface water level fluctuations
of Lake van by artificial neural networks. Water Resources Man-
agement, 21(2), 399408. Available from: https://doi.org/10.
1007/s11269-006-9022-6
Altunkaynak, A. (2010) A predictive model for well loss using
Fuzzy Logic approach. Hydrological Processes, 24, 24002404.
Available from: https://doi.org/10.1002/hyp.7642
Altunkaynak, A. & Küllahcı, K. (2022) Transfer precipitation learn-
ing via patterns of dependency matrix-based machine learning
approaches. Neural Computing and Applications, 34(24),
2217722196. Available from: https://doi.org/10.1007/S00521-
022-07674-8/FIGURES/5
Altunkaynak, A. & Nigussie, T.A. (2015) Prediction of daily rainfall
by a hybrid wavelet-season-neuro technique. Journal of Hydrol-
ogy, 529, 287301. Available from: https://doi.org/10.1016/j.
jhydrol.2015.07.046
Altunkaynak, A. & Nigussie, T.A. (2017) Monthly water consump-
tion prediction using season algorithm and wavelet transform
based models. Journal of Water Resources Planning and Man-
agement, 143(6), 04017011. Available from: https://doi.org/10.
1061/(asce)wr.1943-5452.0000761
Amiri, M.A., Amerian, Y. & Mesgari, M.S. (2016) Spatial and tem-
poral monthly precipitation forecasting using wavelet trans-
form and neural networks, Qara-Qum catchment, Iran.
Arabian Journal of Geosciences, 9, 421. Available from: https://
doi.org/10.1007/s12517-016-2446-2
Anderson, P.G., Klein, G., Oja, E., Steele, N.C., Antoniou, G.,
Mladenov, V. et al. (2001) Neural networks and their applica-
tions: introduction. Informatica (Ljubljana), 25(1), 1.
Argatov, I. (2019) Artificial neural networks (ANNs) as a novel
modeling technique in tribology. Frontiers in Mechanical Engi-
neering,5,19. Available from: https://doi.org/10.3389/fmech.
2019.00030
Ashby, S.A., Taylor, M.A. & Chen, A.A. (2005) Statistical models for
predicting rainfall in the Caribbean. Theoretical and Applied
Climatology, 82(12), 6580. Available from: https://doi.org/10.
1007/s00704-004-0118-8
Avramidis, S. & Wu, H. (2007) Artificial neural network and mathe-
matical modeling comparative analysis of nonisothermal diffu-
sion of moisture in wood. Holz Als Roh-Und Werkstoff, 65(2),
8993. Available from: https://doi.org/10.1007/s00107-006-
0113-0
Bagirov, A.M., Mahmood, A. & Barton, A. (2017) Prediction of
monthly rainfall in Victoria, Australia: clusterwise linear
regression approach. Atmospheric Research, 188, 2029. Avail-
able from: https://doi.org/10.1016/j.atmosres.2017.01.003
Bardenet, R., Brendel, M., Kégl, B. & Sebag, M. (2013) Collaborative
hyperparameter tuning. In: Proceedings of the 30th international
conference on machine learning, ICML 2013, Vol. 28. New York,
NY: ACM, pp. 858866.
Bergstra, J. & Bengio, Y. (2012) Random search for hyper-parameter
optimization. Journal of Machine Learning Research, 13,
281305.
Bezak, N., ˇ
Sraj, M. & Mikoˇ
s, M. (2016) Copula-based IDF curves
and empirical rainfall thresholds for flash floods and rainfall-
induced landslides. Journal of Hydrology, 541, 272284. Avail-
able from: https://doi.org/10.1016/j.jhydrol.2016.02.058
Bojang, P.O., Yang, T., Pham, Q.B. & Yu, P. (2020) Applied sciences
linking singular spectrum analysis and machine learning for
monthly rainfall forecasting.
Bueno, S. & Salmeron, J.L. (2009) Benchmarking main activation
functions in fuzzy cognitive maps. Expert Systems with Applica-
tions, 36(3), 52215229. Available from: https://doi.org/10.
1016/j.eswa.2008.06.072
Bui, D.T., Khosravi, K., Shahabi, H., Daggupati, P., Adamowski, J.F.,
Melesse, A.M. et al. (2019) Flood spatial modeling in northern
Iran using remote sensing and GIS: a comparison between evi-
dential belief functions and its ensemble with a multivariate
logistic regression model. Remote Sensing, 11(13), 1589. Available
from: https://doi.org/10.3390/rs11131589
Chadalawada, J., Havlicek, V. & Babovic, V. (2017) A genetic pro-
gramming approach to system identification of rainfall-runoff
models. Water Resources Management, 31(12), 39753992.
Available from: https://doi.org/10.1007/s11269-017-1719-1
Chegaar, M. & Chibani, A. (2001) Global solar radiation estimation
in Algeria. Energy Conversion and Management, 42(8), 967973.
Available from: https://doi.org/10.1016/S0196-8904(00)00105-9
Chen, T. & Guestrin, C. (2016) XGBoost: a scalable tree boosting
system. In: Proceedings of the ACM SIGKDD international con-
ference on knowledge discovery and data mining. New York, NY:
ACM, pp. 785794. Available from: https://doi.org/10.1145/
2939672.2939785
Chong, K.L., Lai, S.H., Yao, Y., Ahmed, A.N., Jaafar, W.Z.W. & El-
Shafie, A. (2020) Performance enhancement model for rainfall
forecasting utilizing integrated wavelet-convolutional neural
network. Water Resources Management, 34(8), 23712387.
Available from: https://doi.org/10.1007/s11269-020-02554-z
Cornish, C.R., Bretherton, C.S., & Percival, D. B. (2006) Maximal
overlap wavelet statistical analysis with application to atmo-
spheric turbulence. Boundary-Layer Meteorology, 119, 339374.
Daubechies, I. (1990) The wavelet transform, time-frequency locali-
zation and signal analysis. IEEE Transactions on Information
Theory, 36(5), 9611005. Available from: https://doi.org/10.
1109/18.57199
Deo, R.C., Salcedo-Sanz, S., Carro-Calvo, L. & Saavedra-Moreno, B.
(2018) Drought prediction with standardized precipitation and
evapotranspiration index and support vector regression models.
In Integrating disaster science and management (pp. 151174).
Elsevier.
KÜLLAHCIand ALTUNKAYNAK 19
Donigian, A.S. & Love, J.T. (2012) Sediment calibration procedures
and guidelines for watershed modeling. Proceedings of the
Water Environment Federation, 2003(4), 728747. Available
from: https://doi.org/10.2175/193864703784828345
Feng, Q., Wen, X. & Li, J. (2015) Wavelet analysis-support vector
machine coupled models for monthly rainfall forecasting in
arid regions. Water Resources Management, 29, 10491065.
Available from: https://doi.org/10.1007/s11269-014-0860-3
Fix, E. & Hodges, J.L. (1951) Discriminatory analysis. In: Nonpara-
metric discrimination: consistency properties. USAF School of
Aviation Medicine, Randolph Field.
Friedman, J. (2001) Greedy function approximation: a gradient
boosting machine. Annals of Statistics, 29(5), 11891232.
Ghamariadyan, M., Imteaz, M.A. & Mekanik, F. (2019) A hybrid
wavelet neural network (HWNN) for forecasting rainfall using
temperature and climate indices. IOP Conference Series: Earth
and Environmental Science, 351, 012003. Available from:
https://doi.org/10.1088/1755-1315/351/1/012003
Giebel, G. & Kariniotakis, G. (2017) Wind power forecastinga
review of the state of the art. Renewable Energy Forecasting,
2017, 59109.
Gouda, S.G., Hussein, Z., Luo, S. & Yuan, Q. (2019) Model selection
for accurate daily global solar radiation prediction in China.
Journal of Cleaner Production, 221, 132144. Available from:
https://doi.org/10.1016/j.jclepro.2019.02.211
Gumaei, A., Hassan, M.M., Hassan, M.R., Alelaiwi, A. &
Fortino, G. (2019) A hybrid feature extraction method with reg-
ularized extreme learning machine for brain tumor classifica-
tion. IEEE Access, 7, 3626636273. Available from: https://doi.
org/10.1109/ACCESS.2019.2904145
Haddad, M.S. (2011) Capacity choice and water management in
hydroelectricity systems. Energy Economics, 33(2), 168177.
Available from: https://doi.org/10.1016/j.eneco.2010.05.005
Hartmann, H., Snow, J.A., Stein, S., Buda, S., Zhai, J., Jiang, T.
et al. (2016) Predictors of precipitation for improved water
resources management in the Tarim River Basin: creating a
seasonal forecast model. Journal of Arid Environments, 125, 31
42. Available from: https://doi.org/10.1016/j.jaridenv.2015.
09.010
Hayou, S., Doucet, A. & Rousseau, J. (2019) On the impact of the
activation function on deep neural networks training. In: 36th
international conference on machine learning, ICML 2019. New
York, NY: ACM, pp. 47464754.
Heidary, M. & Abad, K.F.H. (2021) A wavelet-based model for
determining asphaltene onset pressure. Natural Resources
Research, 30(1), 741752. Available from: https://doi.org/10.
1007/s11053-020-09753-w
Hellman, M.E. (1970) The nearest neighbor classification rule with
a reject option. IEEE Transactions on Systems Science and
Cybernetics, 6, 179185. Available from: https://doi.org/10.
1109/TSSC.1970.300339
Hochreiter, S. & Schmidhuber, J. (1997) Long short-term memory.
Neural Computation, 9(8), 17351780.
Huang, G., Huang, G.B., Song, S. & You, K. (2015) Trends in
extreme learning machines: a review. Neural Networks, 61, 32
48. Available from: https://doi.org/10.1016/j.neunet.2014.
10.001
Huang, G.B., Wang, D.H. & Lan, Y. (2011) Extreme learning
machines: a survey. International Journal of Machine
Learning and Cybernetics, 2(2), 107122. Available from:
https://doi.org/10.1007/s13042-011-0019-y
Huang, G.B., Zhu, Q.Y. & Siew, C.K. (2006) Extreme learning
machine: theory and applications. Neurocomputing, 70(13),
489501. Available from: https://doi.org/10.1016/j.neucom.
2005.12.126
Jaiswal, A. & Malhotra, R. (2018) Software reliability prediction
using machine learning techniques. International Journal of
Systems Assurance Engineering and Management, 9(1), 230
244. Available from: https://doi.org/10.1007/s13198-016-0543-y
Jang, J. & Roger, S. (1993) ANFIS: adaptive-network-based fuzzy
inference system. IEEE Transactions on Systems, Man and
Cybernetics., 23, 665685. Available from: https://doi.org/10.
1109/21.256541
Joy, T.T., Rana, S., Gupta, S. & Venkatesh, S. (2016) Hyperpara-
meter tuning for big data using Bayesian optimisation. In: 23rd
International Conference on Pattern Recognition (ICPR). IEEE,
pp. 25742579. Available from: https://doi.org/10.1109/ICPR.
2016.7900023
Kothe, S.K., Ayare, B.L., Bhange, H.N. & Patil, S.T. (2019) Rainfall-
runoff prediction based on artificial neural network: a case
study Priyadarshini watershed. International Journal of Current
Microbiology and Applied Sciences, 8(5), 13281334. Available
from: https://doi.org/10.20546/ijcmas.2019.805.151
Kuhn, M. & Johnson, K. (2013) Applied predictive modeling with
applications in R, Vol. 26. Heidelberg: Springer.
Küllahcı, K. & Altunkaynak, A. (2023a) Integrating wavelet decom-
position and stacking ensemble learning for accurate daily rain-
fall forecasting. In: ICHEAS 4th international conference on
health, engineering and applied sciences April 14 - 16, 2023 Dubai.
Küllahcı, K. & Altunkaynak, A. (2023b) Enhanced rainfall predic-
tion performance via hybrid empirical-singular-wavelet-fuzzy
approaches. Environmental Science and Pollution Research, 30,
5809058108.
Lecun, Y., Bengio, Y. & Hinton, G. (2015) Deep learning. Nature,
521(7553), 436444. Available from: https://doi.org/10.1038/
nature14539
Li, G., Ma, X. & Yang, H. (2018) A hybrid model for monthly pre-
cipitation time series forecasting based on variational mode
decomposition with extreme learning machine. Information
(Switzerland), 9(7), 177. Available from: https://doi.org/10.
3390/info9070177
Li, W., Yin, Y., Quan, X. & Zhang, H. (2019) Gene expression value
prediction based on XGBoost algorithm. Frontiers in Genetics,
10, 17. Available from: https://doi.org/10.3389/fgene.2019.
01077
Mamdani, E.H. (1974) Application of fuzzy algorithms for control
of simple dynamic plant. Proceedings of the Institution of Electri-
cal Engineers., 121, 1585. Available from: https://doi.org/10.
1049/piee.1974.0328
Mandal, T. & Jothiprakash, V. (2012) Short-term rainfall prediction
using Ann and Mt techniques. ISH Journal of Hydraulic Engi-
neering, 18, 2026. Available from: https://doi.org/10.1080/
09715010.2012.661629
Mislan, H., Hardwinarto, S., Sumaryono, W. & Aipassa, M. (2015)
Rainfall monthly prediction based on artificial neural network:
a case study in Tenggarong Station, East
Kalimantan-Indonesia. Procedia Computer Science, 59, 142151.
Available from: https://doi.org/10.1016/j.procs.2015.07.528
20 KÜLLAHCIand ALTUNKAYNAK
Mouatadid, S., Raj, N., Deo, R.C. & Adamowski, J.F. (2018) Input
selection and data-driven model performance optimization to
predict the standardized precipitation and evaporation index in
a drought-prone region. Atmospheric Research, 212, 130149.
Available from: https://doi.org/10.1016/j.atmosres.2018.05.012
Mousavi, S.M., Mostafavi, E.S. & Jiao, P. (2017) Next generation pre-
diction model for daily solar radiation on horizontal surface
using a hybrid neural network and simulated annealing method.
Energy Conversion and Management,153,671682. Available
from: https://doi.org/10.1016/j.enconman.2017.09.040
Narimani, R., Jun, C., Saedi, A., Bateni, S.M. & Jeill, O. (2022) A
multivariate decompositionensemble model for estimating
long-term rainfall dynamics. Climate Dynamics, 61, 16251641.
Available from: https://doi.org/10.1007/s00382-022-06646-x
Niarakis, A. (2022) Addressing barriers in comprehensiveness,
accessibility, reusability, interoperability and reproducibility of
computational models in systems biology. Briefings in Bioinfor-
matics, 23, 111.
Ouyang, Q., Wenxi, L., Xin Xin, Y., Zhang, W.C. & Ting, Y. (2016)
Monthly rainfall forecasting using EEMD-SVR based on phase-
space reconstruction. Water Resources Management, 30(7),
23112325. Available from: https://doi.org/10.1007/s11269-016-
1288-8
Özger, M. & Şen, Z. (2007) Prediction of wave parameters by using
fuzzy logic approach. Ocean Engineering, 34(34), 460469.
Available from: https://doi.org/10.1016/j.oceaneng.2006.03.003
Pandey, P.K., Tripura, H. & Pandey, V. (2019) Improving prediction
accuracy of rainfall time series by hybrid SARIMAGARCH
modeling. Natural Resources Research, 28(3), 11251138. Avail-
able from: https://doi.org/10.1007/s11053-018-9442-z
Partal, T. & Ki¸si, Ö. (2007) Wavelet and neuro-fuzzy conjunction
model for precipitation forecasting. Journal of Hydrology,
342(12), 199212. Available from: https://doi.org/10.1016/j.
jhydrol.2007.05.026
Percival, D.B. & Wladen, A.T. (2000) Wavelet methods for time series
analysis. Cambridge: Cambridge University Press.
Şen, Z. & Altunkaynak, A. (2004) Fuzzy Awakening in Rainfall-
Runoff Modeling. Nordic Hydrology, 35(1), 3143. Available
from: https://doi.org/10.2166/nh.2004.0003
Sergeyev, Y.D., Kvasov, D.E. & Mukhametzhanov, M.S. (2017)
Emmental-type GKLS-based multiextremal smooth test prob-
lems with non-linear constraints. In: Battiti, R., Kvasov, D. &
Sergeyev, Y. (Eds.) Learning and intelligent optimization. LION
2017. Lecture notes in computer science, Vol. 10556. Cham:
Springer, pp. 383388.
Serinaldi, F. & Kilsby, C.G. (2012) A modular class of multisite
monthly rainfall generators for water resource management
and impact studies. Journal of Hydrology, 464465, 528540.
Available from: https://doi.org/10.1016/j.jhydrol.2012.07.043
Shahrban, M., Walker, J.P., Wang, Q.J., Seed, A. & Steinle, P. (2016)
An evaluation of numerical weather prediction based rainfall
forecasts. Hydrological Sciences Journal, 61(15), 27042717.
Available from: https://doi.org/10.1080/02626667.2016.1170131
Sharma, S., Sharma, S. & Athaiya, A. (2020) Activation functions in
neural networks. International Journal of Engineering Applied
Sciences and Technology, 4(12), 310316. Available from:
https://doi.org/10.33564/ijeast.2020.v04i12.054
Singh, S., Kumar, D., Vishwakarma, D. K., Kumar, R., &
Kushwaha, N. L. (2024) Seasonal rainfall pattern using coupled
neural network-wavelet technique of southern Uttarakhand,
India. Theoretical and Applied Climatology.117.
Solgi, A., Nourani, V. & Pourhaghi, A. (2014) Forecasting daily pre-
cipitation using hybrid model of wavelet-artificial neural net-
work and comparison with adaptive neurofuzzy inference
system (case study: Verayneh Station, Nahavand). Advances in
Civil Engineering, 2014, 279368. Available from: https://doi.org/
10.1155/2014/279368
Song, C., Chen, X., Pan, W. & Jin, H. (2021) Combining time vary-
ing filtering based empirical mode decomposition and machine
learning to predict precipitation from nonlinear series. Journal
of Hydrology, 603, 126914. Available from: https://doi.org/10.
1016/j.jhydrol.2021.126914
Takagi, T. & Sugeno, M. (1985) Fuzzy identification of systems and
its applications to modeling and control. IEEE Transactions on
Systems, Man and Cybernetics., SMC-15, 116132. Available
from: https://doi.org/10.1109/TSMC.1985.6313399
Tao, L., He, X. & Wang, R. (2017) A hybrid LSSVM model with
empirical mode decomposition and differential evolution for
forecasting monthly precipitation, 18, 159176. Available from:
https://doi.org/10.1175/JHM-D-16-0109.1
Walczak, S. (2014) Artificial neural networks. Comprehensive Bio-
medical Physics,6,117. Available from: https://doi.org/10.
1016/B978-0-444-53632-7.01101-1
Wang, K.-H. & Altunkaynak, A. (2012) Comparative case study of
rainfall-runoff modeling between SWMM and fuzzy logic
approach. Journal of Hydrologic Engineering,17,283291. Avail-
able from: https://doi.org/10.1061/(asce)he.1943-5584.0000419
Wang, L., Chongzhi, W., Tang, L., Zhang, W., Lacasse, S., Liu, H.
et al. (2020) Efficient reliability analysis of earth dam slope sta-
bility using extreme gradient boosting method. Acta Geotech-
nica, 15(11), 31353150. Available from: https://doi.org/10.
1007/s11440-020-00962-4
Wang, W., Yujin, D., Chau, K., Chen, H. & Liu, C. (2021) A com-
parison of BPNN, GMDH, and ARIMA for monthly rainfall
forecasting based on wavelet packet decomposition. Water,
13(20), 2871.
Wu, X., Zhou, J., Huaying, Y., Liu, D., Xie, K., Chen, Y. et al. (2021)
The development of a hybrid Wavelet-Arima-Lstm model for
precipitation amounts and drought analysis. Atmosphere, 12(1),
117. Available from: https://doi.org/10.3390/ATMOS12010074
Xiong, L., Shamseldin, A.Y. & O'Connor, K.M. (2001) A non-linear
combination of the forecasts of rainfall-runoff models by the
first-order Takagi-Sugeno fuzzy system. Journal of Hydrology,
245(14), 196217. Available from: https://doi.org/10.1016/
S0022-1694(01)00349-3
Yin, H., Zheng, F., Duan, H.F., Savic, D. & Kapelan, Z. (2023) Esti-
mating rainfall intensity using an image-based deep learning
model. Engineering, 21, 162174. Available from: https://doi.
org/10.1016/j.eng.2021.11.021
Yu, T., & Zhu, H. (2020) Hyper-parameter optimization: a review of
algorithms and applications. arXiv preprint arXiv:2003.05689.
Yu, W., Nakakita, E. & Jung, K. (2016) Flood forecast and early
warning with high-resolution ensemble rainfall from numerical
weather prediction model. Procedia Engineering, 154, 498503.
Available from: https://doi.org/10.1016/j.proeng.2016.07.544
Zadeh, L.A. (1965) Fuzzy sets-information and Control-1965. Infor-
mation and Control, 75, 118121. Available from: https://doi.
org/10.1080/00098650209599249
KÜLLAHCIand ALTUNKAYNAK 21
Zadeh, L.A. (1968) Fuzzy Algorithms. Information and Control, 12,
94102. Available from: https://doi.org/10.1016/S0019-9958(68)
90211-8
Zadeh, L.A. (1978) Fuzzy sets as a basis for a theory of possibility.
Fuzzy Sets and Systems,1,328. Available from: https://doi.
org/10.1016/0165-0114(78)90029-5
Zeynoddin,M.,Bonakdari,H.,Azari,A.,Ebtehaj,I.,
Gharabaghi, B. & Madavar, H.R. (2018) Novel hybrid lin-
ear stochastic with non-linear extreme learning machine
methods for forecasting monthly rainfall a tropical cli-
mate. Journal of Environmental Management, 222, 190
206. Available from: https://doi.org/10.1016/j.jenvman.
2018.05.072
Zhao, N., Ghaemi, A., Chengwen, W., Band, S.S., Chau, K.W.,
Zaguia, A. et al. (2021) A decomposition and multi-objective
evolutionary optimization model for suspended sediment load
prediction in Rivers. Engineering Applications of Computational
Fluid Mechanics, 15(1), 18111829. Available from: https://doi.
org/10.1080/19942060.2021.1990133
How to cite this article: Küllahcı, K., &
Altunkaynak, A. (2024). Maximizing daily rainfall
prediction accuracy with maximum overlap
discrete wavelet transform-based machine learning
models. International Journal of Climatology,122.
https://doi.org/10.1002/joc.8530
22 KÜLLAHCIand ALTUNKAYNAK
ResearchGate has not been able to resolve any citations for this publication.
Article
Full-text available
Hydrological data is crucial for accurate forecasting of precipitation which can be used for water resources planning and management. The purpose of this study is to develop a seasonal rainfall forecast model, using a hybrid wavelet-artificial neural network (WANN) model based on regression analysis to predict seasonal rainfall in Almora, Lansdown, Kashipur and Mukteswar region in Uttarakhand (India).The statistical results shows that the mean maximum rainfall was found to be 746.82 mm, 1586.58 mm, 1060.53 mm and 964.43 mm for Almora, Lansdown, Kashipur and Mukteswar, respectively. The models WANN-03 (Network 4–8-1), WANN-10 (Network 4–7-1), WANN-10 (Network 4–7-1) and WANN-15 (Network 4–8-1) were found to be the most efficient models for Mukteswar, Lansdown, Kashipur and Almora, based on the high coefficient of determination (R2) and coefficient of efficiency (CE) values and low root mean square error (RMSE) values that were obtained using each model. For each season, four WANN modelshave been developed (total of sixteen models) by varying the number of hidden neurons. The results shows that only one WANN model was not sufficient to predict the rainfall of all stations. Every station has a specific networked model which could model the data more precisely preciously. The findings illustrated that the hybrid model of WANN having Network (4–7-1) was found most superior model (R2 = 0.857, RMSE = 32.192 and CE = 0.846) for the Lansdown stations among all the stations.
Conference Paper
Full-text available
Rainfall prediction is a critical component of disaster management, as it provides advanced warning of potential floods and landslides caused by heavy daily rainfall. Accurate and reliable rainfall prediction can help to manage the water resources system, agriculture activity, early warning system, and flood risk. Moreover, daily rainfall prediction can provide information on the amount and distribution of rainfall, which is essential for managing water resources and minimizing the impact of floods and droughts. Prediction of daily rainfall with high accuracy is a challenge as a result of the complexity, non-linear and dynamic nature of rainfall. Recent advancements in machine learning methods have helped to develop various techniques that have generated accurate results in modeling rainfall prediction. Among these algorithms, ensemble methods, such as stacking, have been found to enhance the accuracy and robustness of predictions. This study investigates the application of a hybrid wavelet stacking ensemble machine learning algorithm in enhancing the accuracy of daily rainfall prediction and extending the lead time predictions up to three days. The proposed approach involves training multiple base models on the filtered and denoised time series data, followed by the combination of their predictions using a meta-model. In the study, four prevalent ML models, namely logistic linear regression, support vector regression, k-nearest neighbors, and decision tree are taken as base models. To combine the outputs from the base models, the meta-model is used as a second-layer learner to generate predictions. The findings of this study indicate that the hybrid wavelet stacking approach outperformed both the four base models and the stand-alone stacking model in predicting daily rainfall. The proposed model achieved a higher accuracy via assessment by several diagnostic metrics, demonstrating its effectiveness in capturing the complex patterns and relationships in the data. These results suggest that the hybrid wavelet stacking approach can provide a flexible and reasonable prediction framework for rainfall data, with potential applications in various fields, including disaster management.
Article
Full-text available
Rainfall is a vital process in the hydrological cycle of the globe. Accessing reliable and accurate rainfall data is crucial for water resources operation, flood control, drought warning, irrigation, and drainage. In the present study, the main objective is to develop a predictive model to enhance daily rainfall prediction accuracy with an extended time horizon. In the literature, various methods for the prediction of daily rainfall data for short lead times are presented. However, due to the complex and random nature of rainfall, in general, they yield inaccurate prediction results. Generically, rainfall predictive models require many physical meteorological variables and consist of challenging mathematical processes that require high computational power. Furthermore, due to the nonlinear and chaotic nature of rainfall, observed raw data typically has to be decomposed into its trend cycle, seasonality, and stochastic components before being fed into the predictive model. The present study proposes a novel singular spectrum analysis (SSA)-based approach for decomposing observed raw data into its hierarchically energetic pertinent features. To this end, in addition to the stand-alone fuzzy logic model, preprocessing methods SSA, empirical mode decomposition (EMD), and commonly used discrete wavelet transform (DWT) are incorporated into the fuzzy models which are named as hybrid SSA-fuzzy, EMD-fuzzy, W-fuzzy models, respectively. In this study, fuzzy, hybrid SSA-fuzzy, EMD-fuzzy, and W-fuzzy models are developed to enhance the daily rainfall prediction accuracy and improve the prediction time span up to 3 days via three (3) stations’ data in Turkey. The proposed SSA-fuzzy model is compared with fuzzy, hybrid EMD-fuzzy, and widely used hybrid W-fuzzy models in predicting daily rainfall in three distinctive locations up to a 3-day time horizon. Improved accuracy in predicting daily rainfall is provided by the SSA-fuzzy, W-fuzzy, and EMD-fuzzy models compared to the stand-alone fuzzy model based on mean square error (MSE) and the Nash-Sutcliffe coefficient of efficiency (CE) model assessment metrics. Specifically, the advocated SSA-fuzzy model is found to be superior in accuracy to hybrid EMD-fuzzy and W-fuzzy models in predicting daily rainfall for all time spans. The results reveal that, with its easy-to-use features, the advocated SSA-fuzzy modeling tool in this study is a promising principled method for its possible future implementations not only in hydrological studies but in water resources and hydraulics engineering and all scientific disciplines where future state space prediction of a vague nature and stochastic dynamical system is important.
Article
Full-text available
This study aims to present a novel decomposition–ensemble model that uses multivariate data. Two algorithms, Light Gradient Boosting Machine (LightGBM) and Extreme Gradient Boosting (XGBoost), were used to develop a new model for rainfall reconstruction using daily meteorological data from 2003 to 2017 in Seoul, South Korea. First, the dataset was decomposed by two decomposition methods: singular spectrum analysis (SSA) and empirical mode decomposition (EMD). Second, the input time series was constructed as trend terms, fluctuating terms, and noise components in the SSA method and as intrinsic mode functions (IMFs) in the EMD method. Finally, these decomposed datasets were used as the input sets for the ensemble models for training, testing, and evaluation to reconstruct long-term daily rainfall. Performance statistics indicated that SSA integrated with LightGBM improved the efficiency over other combinations (EMD–LightGBM, SSA–XGBoost, EMD–XGBoost) by a lower root-mean-square error (RMSE) and higher Nash–Sutcliffe efficiency coefficient (NSE). After selecting the best combination, support vector machine recursive feature elimination (SVM-RFE) was applied to select the best decomposed dataset to improve the SSA–LightGBM performance. Finally, the performance of the model was compared to Random Forest (RF) algorithm for robustness analysis. The results showed that using the SSA and LightGBM models reconstructed long-term daily rainfall more accurately, especially when coupled with SVM-RFE, which obtained values for the square of the correlation coefficient (R2), RMSE, NSE, and mean absolute error (MAE) of 0.92, 3.27, 0.91, and 0.99, respectively. In particular, the proposed decomposition–ensemble model (SSA–SVM-RFE–LightGBM) can be used to reconstruct long-term daily rainfall on a global scale.
Article
Full-text available
Accurate precipitation prediction is very significant for urban, environmental, and water resources management as well as mitigating the negative effects of drought and flood. However, precipitation prediction is a complex and challenging task which involves meteorological parameters that contain uncertainty. This study attempts to ease the complexity of the problem via proposing a correlation matrix approach. Covariance and correlation matrices are analytical tools that are widely used to identify the interrelationships and possible dependencies throughout the data. Correlation matrices have some advantages over covariance matrices. The main drawback of covariance matrices is their sensitivity to the measurement units of variables. The variables with relatively large variances will dominate the results of multivariate analysis when the covariance matrix is used. Accordingly, the covariance matrix fails to provide useful information when there exist large differences between variances of variables. On the other hand, besides their easy interpretable features, the results of different analyses obtained from correlation matrices can effectively be compared. Therefore, in this study, in order to improve the performances of the predictive models, interrelationships and possible dependencies among data obtained from eighteen precipitation observation stations located in the Upper Euphrates Basin of Turkey (1980–2010) is investigated using correlation matrix approach. Relatedly, dependencies between the stations are resolved by means of examining the correlation matrix and optimal model inputs (data of particular stations) are selected for each prediction scenario. The transfer precipitation learning was performed throughout the period from 1980 to 2010 for eighteen precipitation observation stations located in the Upper Euphrates. Three different data-driven models Fuzzy, K-nearest neighbors (KNN), and multilinear regression (MR) are developed based on the patterns of correlation matrix. Predictive powers of the models are compared by means of performance evaluation criteria, i.e., Nash–Sutcliffe efficiency, mean square error, mean absolute error, and coefficient of determination (R²). Results of this study show that all developed correlation matrix patterns-based Fuzzy, KNN, and MR models have high precipitation prediction performance. However, even though all model results are close to each other, Fuzzy model provided more accurate results with requiring data from a relatively low number of stations. Therefore, patterns of correlation matrix-based Fuzzy model is the most efficient and well-suited approach for precipitation prediction among all the developed models.
Article
Full-text available
Computational models are often employed in systems biology to study the dynamic behaviours of complex systems. With the rise in the number of computational models, finding ways to improve the reusability of these models and their ability to reproduce virtual experiments becomes critical. Correct and effective model annotation in community-supported and standardised formats is necessary for this improvement. Here, we present recent efforts toward a common framework for annotated, accessible, reproducible and interoperable computational models in biology, and discuss key challenges of the field.
Article
Full-text available
Urban flooding is a major issue worldwide, causing huge economic losses and serious threats to public safety. One promising way to mitigate its impacts is to develop a real-time flood risk management system; however, building such a system is often challenging due to the lack of high spatiotemporal rainfall data. While some approaches (i.e., ground rainfall stations or radar and satellite techniques) are available to measure and/or predict rainfall intensity, it is difficult to obtain accurate rainfall data with a desirable spatiotemporal resolution using these methods. This paper proposes an image-based deep learning model to estimate urban rainfall intensity with high spatial and temporal resolution. More specifically, a convolutional neural network (CNN) model called the image-based rainfall CNN (irCNN) model is developed using rainfall images collected from existing dense sensors (i.e., smart phones or transportation cameras) and their corresponding measured rainfall intensity values. The trained irCNN model is subsequently employed to efficiently estimate rainfall intensity based on the sensors’ rainfall images. Synthetic rainfall data and real rainfall images are respectively utilized to explore the irCNN’s accuracy in theoretically and practically simulating rainfall intensity. The results show that the irCNN model provides rainfall estimates with a mean absolute percentage error ranging between 13.5% and 21.9%, which exceeds the performance of other state-of-the-art modeling techniques in the literature. More importantly, the main feature of the proposed irCNN is its low cost in efficiently acquiring high spatiotemporal urban rainfall data. The irCNN model provides a promising alternative for estimating urban rainfall intensity, which can greatly facilitate the development of urban flood risk management in a real-time manner.
Article
Full-text available
Suspended sediment load (SSL) estimation is essential for both short- and long-term water resources management. Suspended sediments are taken into account as an important factor of the service life of hydraulic structures such as dams. The aim of this research is to estimat SSL by coupling intrinsic time-scale decomposition (ITD) and two kinds of DDM, namely evolutionary polynomial regression (EPR) and model tree (MT) DDMs, at the Sarighamish and Varand Stations in Iran. Measured data based on their lag times are decomposed into several proper rotation components (PRCs) and a residual, which are then considered as inputs for the proposed model. Results indicate that the prediction accuracy of ITD-EPR is the best for both the Sarighamish (R2 = 0.92 and WI = 0.96) and Varand (R2 = 0.92 and WI = 0.93) Stations (WI is the Willmott index of agreement), while a standalone MT model performs poorly for these stations compared with other approaches (EPR, ITD-EPR and ITD-MT) although peak SSL values are approximately equal to those by ITD-EPR. Results of the proposed models are also compared with those of the sediment rating curve (SRC) method. The ITD-EPR predictions are remarkably superior to those by the SRC method with respect to several conventional performance evaluation metrics.
Article
Full-text available
Accurate rainfall forecasting in watersheds is of indispensable importance for predicting streamflow and flash floods. This paper investigates the accuracy of several forecasting technologies based on Wavelet Packet Decomposition (WPD) in monthly rainfall forecasting. First, WPD decomposes the observed monthly rainfall data into several subcomponents. Then, three data-based models, namely Back-propagation Neural Network (BPNN) model, group method of data handing (GMDH) model, and autoregressive integrated moving average (ARIMA) model, are utilized to complete the prediction of the decomposed monthly rainfall series, respectively. Finally, the ensemble prediction result of the model is formulated by summing the outputs of all submodules. Meanwhile, these six models are employed for benchmark comparison to study the prediction performance of these conjunction methods, which are BPNN, WPD-BPNN, GMDH, WPD-GMDH, ARIMA, and WPD-ARIMA models. The paper takes monthly data from Luoning and Zuoyu stations in Luoyang city of China as the case study. The performance of these conjunction methods is tested by four quantitative indexes. Results show that WPD can efficiently improve the forecasting accuracy and the proposed WPD-BPNN model can achieve better prediction results. It is concluded that the hybrid forecast model is a very efficient tool to improve the accuracy of mid- and long-term rainfall forecasting.