ArticlePDF Available

Advancing model calibration and uncertainty analysis of SWAT models using cloud computing infrastructure: LCC-SWAT

Authors:
  • Environment and Climate Change Canada

Abstract and Figures

Calibration and uncertainty analysis of a complex, over-parameterized environmental models such as the Soil and Water Assessment Tool (SWAT) requires thousands of simulation runs and multiple calibration iterations. A parallel calibration system is thus desired that can be deployed on cloud-based architectures for reducing calibration runtime. This paper presents a cloud-based calibration and uncertainty analysis system called LCC-SWAT that is designed for SWAT models. Two optimization techniques, sequential uncertainty fitting (SUFI-2) and dynamically dimensioned search (DDS), have been implemented in LCC-SWAT. Moreover, the cloud-based system has been deployed on the Southern Ontario Smart Computing Innovation Platform's (SOSCIP) Cloud Analytics platform for diagnostic assessment of parallel calibration runtime on both single-node and multi-node CPU architectures. Unlike other calibrations/uncertainty analysis systems developed on the cloud, this system is capable of generating a comprehensive set of statistical information automatically, which facilitates broader analyses of the performance of the SWAT models. Experimental results on SWAT models of different complexities showed that LCC-SWAT can reduce runtime significantly. The runtime reduction is more pronounced for more complex and computationally intensive models. However, the reported runtime efficiency is significantly higher for single node systems. Comparative experiments with DDS and SUFI-2 show that parallel DDS outperforms parallel SUFI-2 in terms of both parameter identifiability and reducing uncertainty in model simulations. LCC-SWAT is a flexible calibration system and other optimization algorithms and asynchronous parallelization strategies can be added to it in future.
Content may be subject to copyright.
Advancing model calibration and uncertainty analysis
of SWAT models using cloud computing infrastructure:
LCC-SWAT
Masood Zamani, Narayan Kumar Shrestha, Taimoor Akhtar,
Trevor Boston and Prasad Daggupati
Q1
ABSTRACT
Calibration and uncertainty analysis of a complex, over-parameterized environmental models such as
the Soil and Water Assessment Tool (SWAT) requires thousands of simulation runs and multiple
calibration iterations. A parallel calibration system is thus desired that can be deployed on cloud-
based architectures for reducing calibration runtime. This paper presents a cloud-based calibration
and uncertainty analysis system called LCC-SWAT that is designed for SWAT models. Two
optimization techniques, sequential uncertainty tting (SUFI-2) and dynamically dimensioned search
(DDS), have been implemented in LCC-SWAT. Moreover, the cloud-based system has been deployed
on the Southern Ontario Smart Computing Innovation Platforms (SOSCIP) Cloud Analytics platform
for diagnostic assessment of parallel calibration runtime on both single-node and multi-node CPU
architectures. Unlike other calibrations/uncertainty analysis systems developed on the cloud, this
system is capable of generating a comprehensive set of statistical information automatically, which
facilitates broader analyses of the performance of the SWAT models. Experimental results on SWAT
models of different complexities showed that LCC-SWAT can reduce runtime signicantly. The
runtime reduction is more pronounced for more complex and computationally intensive models.
However, the reported runtime efciency is signicantly higher for single node systems. Comparative
experiments with DDS and SUFI-2 show that parallel DDS outperforms parallel SUFI-2 in terms of
both parameter identiability and reducing uncertainty in model simulations. LCC-SWAT is a exible
calibration system and other optimization algorithms and asynchronous parallelization strategies can
be added to it in future.
Key words |cloud computing, DDS, optimization, SUFI-2, SWAT
HIGHLIGHTS
LCC-SWAT: a cloud-based calibration and uncertainty analysis system for SWAT models.
Two optimization techniques SUFI-2 and DDS have been implemented in the LCC-SWAT.
LCC-SWAT is capable of generating a comprehensive set of statistical information.
Experiments showed that LCC-SWAT can reduce runtime signicantly.Q2
Masood Zamani
Narayan Kumar Shrestha
Taimoor Akhtar
Prasad Daggupati (corresponding author)
School of Engineering,
University of Guelph,
50 Stone Road West, Guelph, ON,
Canada
N1G 2W1
E-mail: pdaggupa@uoguelph.ca
Masood Zamani
St.Michaels Hospital,
University of Toronto,
1 Kings College Circle, Toronto, ON,
Canada
M5S 1A8
Trevor Boston
Greenland International Consulting Ltd,
120 Hume Street, Collingwood, ON,
Canada
L9Y 1V5
1© IWA Publishing 2020 Journal of Hydroinformatics |in press |2020
doi: 10.2166/hydro.2020.066
Uncorrected Proof
GRAPHICAL ABSTRACT
INTRODUCTION
Watershed models are widely used by water resources plan-
ners and managers in the decision-making process (Devia
et al. ;Leta et al. ). With easier access to parallel
computing power and the ready availability of higher-resol-
ution observed datasets (including climate data, water
quality data, etc.), the computational complexity of water-
shed models, and especially physically based distributed
and semi-distributed watershed models is increasing signi-
cantly (Yang et al. ). Moreover, complex physically
based watershed models are typically characterized by a
large number of parameters, and complex calibration objec-
tives that are highly non-linear and multi-modal (Beven &
Binley ;Abbaspour et al. ;Tolson & Shoemaker
). Hence, automatic parameter estimation of a complex
watershed model is often hindered by high-dimensionality
and multi-modality of the underlying calibration optimiz-
ation problem (Nossent et al. ), and thus, the
calibration process is computationally intensive (Ercan
et al. ). The calibration challenge is exacerbated further
by the need for understanding model uncertainty,
especially for models that simulate water quality (e.g., maxi-
mum daily pollutants loads) (Shirmohammadi et al. ;
Borah et al. ). The computationally intensive nature
of complex watershed models, thus, necessitates
optimization algorithms and frameworks that are computa-
tionally efcient and can use parallel computing resources
(Ahmadisharaf et al. ).
The Soil and Water Assessment Tool (SWAT) is a highly
popular watershed modeling tool (Arnold et al. ) that is
widely used for the development of complex, highly parame-
terized and computationally expensive watershed models
(Nossent et al. ;Borah et al. ). Many optimization
algorithms have been developed in the past literature for
addressing the computational challenge of calibrating
SWAT and other complex watershed models (Tayfur ).
For instance, Abbaspour et al. ()proposed Sequential
Uncertainty Fitting (SUFI-2) to efciently calibrate (within
a few thousand simulations) complex SWAT models.
Tolson & Shoemaker ()proposed the Dynamically
Dimensioned Search (DDS) method to calibrate complex
and high-dimensional (i.e., with many parameters) hydrolo-
gic and watershed models. Efcient Markov Chain Monte
Carlo (MCMC) methods, e.g., the Shufed Complex Evol-
ution Metropolis (SCEM-UA) method (Vrugt et al. )
and the DiffeRential Evolution Adaptive Metropolis
(DREAM) algorithm (Vrugt et al. ;Vrugt ), and
the Multiple-response Bayesian Calibration (MRBC) frame-
work (Han & Zheng ) that quantify input and
2M. Zamani et al. |A Linux-based cloud calibration system for SWAT (LCC-SWAT) Journal of Hydroinformatics |in press |2020
Uncorrected Proof
parameter uncertainty during calibration have also been
extensively applied to watershed problems. Given the
inherent multi-objective nature of watershed model cali-
bration (Gupta et al. ,;Yapo et al. ),
numerous multi-objective algorithms have also been pro-
posed and used for watershed model calibration and
uncertainty quantication. Some notable mentions are the
ParaSol (van Griensven & Meixner ), the Borg Multiob-
jective Evolutionary Algorithm (Borg-MOEA) (Hadka &
Reed ), the Pareto archived dynamically dimensioned
search (PA-DDS) (Asadzadeh & Tolson ) and the
Non-dominated Sorting Genetic Algorithm II (NSGA-II)
(Deb et al. ;Ercan & Goodall ). ParaSol (van
Griensven & Meixner ) also applies thresholds on
different objectives to lter/identify behavioral solutions
(Beven & Binley ) and, subsequently, quantify model
uncertainty.
The use of desktop/stand-alone computational
resources is less effective and, in some cases, not feasible
for automatic calibration and analysis of large-scale water-
shed models (especially distributed and semi-distributed
models) with complex physical domains and multiple
water resource issues such as water quality, droughts and
oods (Abbaspour et al. ;Arnold et al. ;Gupta
et al. ). Hence, parallel implementations of many cali-
bration frameworks and algorithms have been introduced
in the recent literature (Kan et al. ). Rouholahnejad
et al. ()implemented a parallel version of the SUFI-2
algorithm that is widely used for calibration of SWAT
models. SUFI-2 is part of SWAT-CUP (Abbaspour ),
which is a popular stand-alone program for calibration of
SWAT models. Ercan et al. ()implemented a parallel
version of DDS for SWAT calibration with deployment on
a windows-based cloud infrastructure. Joseph & Guillaume
()presented a parallel implementation of the DREAM
algorithm that is specically designed for parameter esti-
mation and uncertainty quantication of SWAT watershed
models. Bacu et al. ()developed grid-based architectural
components for SWAT (gSWAT) with up to 60 worker
nodes. The gSWAT with its inherent SUFI-2 algorithm was
used in a ne resolution SWAT model of the Black Sea
catchment (Rodila et al. ) in the scope of an EU/FP7
enviroGRIDS project (enviroGRIDS ) and in a large-
scale Danube River Basin project (Gorgan et al. ;
Rodila et al. ). The gSWAT computing infrastructure
was found to optimize the SWAT model when running in
parallel (Bacu et al. ). Zhang et al. ()also paralle-
lized the SWAT model itself (rather than the calibration
framework) by simultaneously simulating output for each
distributed model land unit (also called Hydrological
Response Unit, HRU).
The above-mentioned parallel calibration frameworks
clearly illustrate the value of parallel processing and high-
performance computing in addressing the challenge of cali-
brating large and complex watershed models (Humphrey
et al. ;Ercan et al. ;Astsatryan et al. ;Zhang
et al. ). However, many prior studies do not adequately
discuss two key aspects of parallel watershed model cali-
bration, i.e., (i) compatibility of parallel algorithm
implementations with different cloud platforms and (ii) the
impact of cloud infrastructure on computing efciency of
parallel algorithms and frameworks.
In general, commercialized cloud computing platforms
currently support either Windows or Linux operating sys-
tems, and in some cases, both operating systems are
supported. Thus operating system compatibility is an impor-
tant criterion in developing a cloud-based calibration
platform. In Humphrey et al. (), a cloud-based cali-
bration system for SWAT models was developed, based on
Microsoft Windows Azure (Chappell ). In addition, a
multi-component enterprise cloud service was developed
and studied for watershed calibration, and the virtual
machines (VMs) for the cloud platform were created using
Hadoop and Openstack which are open-source software
(Astsatryan et al. ;Zhang et al. ).
Hardware infrastructure of cloud platforms is also an
important factor that can signicantly impact speed-up
and efciency of frameworks where simulations are exe-
cuted in parallel. ODonncha et al. ()showed that
parallel performance/efciency of a uid dynamics model
varies signicantly on single versus multi-node architec-
tures. In their review of the parallel watershed calibration
literature, Kan et al. ()note that prior work on under-
standing the effectiveness of parallel watershed model
frameworks on different cloud platforms is very limited
and needs to be explored more in future. Motivated by this
need, we implemented a cloud-based watershed calibration
and uncertainty analysis system under Linux Operating
3M. Zamani et al. |A Linux-based cloud calibration system for SWAT (LCC-SWAT) Journal of Hydroinformatics |in press |2020
Uncorrected Proof
system using SOSCIPs cloud analytic platform (SOSCIP
).
The watershed calibration system proposed in this study
is called Linux-based Cloud Calibration system for SWAT
(LCC-SWAT) and is specically designed for complex water-
shed models developed using SWAT. LCC-SWAT includes
two parallel optimization methods SUFI-2 (Abbaspour
et al. ,) and DDS (Tolson & Shoemaker ).
The design of our cloud-based calibration system is compati-
ble with the commonly used stand-alone SWAT-CUP system
(Abbaspour ). We believe that this will encourage exist-
ing SWAT (and SWAT-CUP) users and modelers to test their
models using LCC-SWAT. LCC-SWAT has been added to
the Canadian Watershed Evaluation Tool (CANWET)
platform to provide efcient, automatic calibration and visu-
alization of SWAT models.
This paper also includes a comprehensive comparison of
parallel calibration results obtained from SUFI-2 and DDS
on the SWAT model of the Grand River Basin (6,542 km
2
)
in Ontario, Canada. To the best of our knowledge, parallel
implementations of DDS and SUFI-2 have not been compared
in the past. Moreover, the detailed runtime performance of
LCC-SWAT is evaluated on different cloud architectures
(i.e., single versus multi-node CPU systems), and by using
three SWAT models of increasing complexities and sizes
(19215,918 km
2
). We believe that the availability of such a
cloud-based system is an important contribution to watershed
modeling software and to the future implementation of
improved cloud-based calibration frameworks. In addition,
unlike other cloud-based systems, LCC-SWAT automatically
generates comprehensive statistical reports pertaining to the
SWAT model calibration and uncertainty analysis which facili-
tates more comprehensive analyses of the calibration
parameters and the overall model performance.
BACKGROUND
Soil and Water Assessment Tool
SWAT (Arnold et al. ) is a long-term continuous hydro-
logic and water quality model. It is one of the most widely
used models in the hydro-environmental domain (Arnold
et al. ). Being a semi-distributed and physically based
model, SWAT has a high number of parameters related to
hydrology, erosion and sediment transport, nutrients, pesti-
cides, fecal bacteria, among others (Leta et al. ),
making it one of the more complex and over-parameterized
hydro-environmental models (Nossent et al. ). For mod-
eling purposes, SWAT divides a watershed into several sub-
watersheds. A sub-watershed is further divided into Hydro-
logical Response Units (HRUs) which are unique
combination of land-use, soil and slope. An HRU is the com-
putation unit of the SWAT model (Arnold et al. ).
Cloud computing infrastructure
The cloud computing infrastructure on which the new water-
shed calibration system was deployed and tested and was
created by the Southern Ontario Smart Computing Inno-
vation Platform (SOSCIP) consortium. The calibration
system was designed and developed on the SOSCIPs cloud
analytic platform (SOSCIP ). The allocated cloud
resource contains two VMs, or nodes, using the Linux Oper-
ating system, each with 24 computational cores, or CPUs.
The calibration system has 196 GB of RAM and 2TB of a net-
works storage which is managed as centralized data storage
to retrieve and store data by VMs using Network File System
(NFS) in a faster and more efcient manner as compared to
that of distributed data storages. The calibration process
can be performed on a single or multiple VMs. In addition,
the proposed cloud-based calibration system does not have
a limitation for employing the maximum number of available
computational cores if the computational resource on the
cloud platform is extended.
Optimization algorithms
Sequential uncertainty tting (SUFI-2)
The SUFI-2 (Abbaspour et al. ,) method was devel-
oped to address the degree of all uncertainties quantied by
the p-stat measure which is the percentage of measured data
grouped by the 95% prediction uncertainty (95PPU). The r-
stat is another measure that quanties the strength of the
uncertainty analysis of a calibration from the average of
the 95PPU band divided by the standard deviation of the
measured data. The SUFI-2 method aims to detect the
4M. Zamani et al. |A Linux-based cloud calibration system for SWAT (LCC-SWAT) Journal of Hydroinformatics |in press |2020
Uncorrected Proof
majority of the measured data with the smallest uncertainty
band. The 95PPU is calculated at the 2.5 and 97.5% levels of
the cumulative distribution of an output variable obtained
by using Latin Hypercube Sampling (LHS) (McKay et al.
). Therefore, it eliminates the lowest performing 5% of
simulations. The value of the p-stat ranges between 0 and
100, and the value of r-stat ranges between 0 and innity.
A calibration that exactly corresponds to the measured
data has a p-stat of 1 and r-stat of 0. LHS is a statistical
method to sample evenly over the sample space from the
random parameter values of a multidimensional distri-
bution. An LH is considered as a predened number of
dimensions where each sample is the only one in each
axis-aligned hyperplane that contains the sample. LHS is
applied usually to reduce the computational time of running
Monte Carlo simulations which can decrease the processing
time by up to 50%.
Dynamically dimensioned search
DDS (Tolson & Shoemaker ) is a heuristic optimization
algorithm for calibration of watershed simulation models.
The DDS optimization method was introduced for cali-
bration problems that require a large set of decision
variables whose lower and upper values are predened.
DDS aims to nd an optimal solution within a user-dened
number of simulations or, in general, function evaluations.
Initially, the DDS algorithm explores globally the solution
space and changes the search domain gradually to local
searches when the number of function evaluations or simu-
lations is reached to a predened maximum number of
iterations. In each iteration, the changes in the search
domain are achieved dynamically and probabilistically by
reducing dimensions (or the number of decision values) in
search neighborhoods of a solution space. The decision
values are the parameters of a watershed model which are
adjusted. The probability that decision variable xis selected
in iteration iis computed as follows:
Px(i)¼1ln (i)
ln (m)(1)
where mis the maximum number of function evaluations
for a calibration. In Equation (1), the probability of adjusting
the value of a decision variable is reduced gradually by the
increases in the number of iteration i.
THE LINUX-BASED CLOUD CALIBRATION SYSTEM
FOR SWAT (LCC-SWAT) DEVELOPMENT
System design
Figure 1 provides an overview of the LCC-SWAT calibration
system workow/design, runs in the Ubuntu 18.04 operating
system. It has three core components. The rst component is
user-dened where a user creates a SWAT model, sets the
calibration parameters using SWAT-CUP protocols and
uploads the model and calibration setup les to the LCC-
SWAT system on the cloud using Secure File Transfer Proto-
col (SFTP). The second component is the parallel
optimization strategy. Two parallel optimization implemen-
tations are currently included in LCC-SWAT, which are
discussed further in the System implementationand Paral-
lelization of optimization algorithmssections. The nal
component of the system is the SWAT parallelization rou-
tines, which allow multiple SWAT simulations in the batch-
parallel mode. If the optimization algorithm is set up to run
nmodel runs in an iteration (i.e., nparameter sets are pro-
vided by the optimization algorithm), these are equally
distributed among available computing resources/cores.
System implementation
The entire LCC-SWAT framework, including the two
optimization methods (discussed in the Parallelization
of optimization algorithmssection), their batch-parallel
components and input/output communications with the
SWAT model were implemented in C þþ. The actual
model les are created and congured by the SWAT-
CUP program under the Windows operating system on a
personal computer. The entire model les are uploaded
by the user to LCC-SWAT and copied to the internal net-
work storage. VMs retrieve and store the original and
updated model les from the network storage. In order
to make the inputs and outputs of the parallel optimiz-
ation methods compatible with SWAT-CUP, we
5M. Zamani et al. |A Linux-based cloud calibration system for SWAT (LCC-SWAT) Journal of Hydroinformatics |in press |2020
Uncorrected Proof
implemented (and replicated) the SWAT-CUP modules
that edit model les and represent inputs and extract
outputs to/from the SWAT program according to the
desired time series of simulations, and other modules
that describe the calibration workow as dened in
SWAT-CUPsmanual(Abbaspour ). The Linux ver-
sion of SWAT program and SWAT-CUPsswat_edit
module were embedded and called as external routines
at each step of simulation during the optimization. A
number of SWAT-CUPs modules were not available for
the Linux operating system. The functionality of these
modules has to be replicated to work on Linux, e.g.,
SUFI_extract_rch. Hence, LCC-SWAT is compatible
with SWAT-CUP and users of LCC-SWAT can formulate
their SWAT model calibration using SWAT-CUP. The
optimization methods proposed in LCC-SWAT do not
employ a multi-objective function; however, multiple per-
formance metric values (e.g., R
2
, NSE) are stored for each
simulation during calibration for post-optimization analy-
sis. LCC-SWAT can calibrate concurrently multiple
watershed models by multiple users. However, the cali-
bration times are increased differently based on how the
cloud platform allocates the computational resources,
thesizeofmodelles and the number of users.
Figure 1 |Overview of the Linux-based Cloud Calibration system for Soil and Water Assessment Tool (LCC-SWAT) system design.
6M. Zamani et al. |A Linux-based cloud calibration system for SWAT (LCC-SWAT) Journal of Hydroinformatics |in press |2020
Uncorrected Proof
Parallelization of optimization algorithms
The design of the cloud-based calibration system that
employs the paralleled SUFI-2 algorithm is illustrated as fol-
lows. If nCPUs are employed in total to perform m
simulations, the input decision values generated by LHS
are grouped in kparameter sets where kis equal to m
divided by n. In other words, the calibration process is per-
formed in kiterations where nsimulations are performed in
each iteration. The results of simulations are collected and
aggregated in each iteration. Next, after a predened
number of simulations is reached, the optimal calibrated
values of the decision variables (or parameter set) are
assigned to the simulation with the best tness value as
shown in Figure 2. Lastly, a comprehensive statistical
report is computed for all simulations on the cloud platform
for more robust statistical analyses.
The design of the cloud-based calibration system that
uses iteratively the paralleled DDS optimization method is
shown in Figure 3. Initially, nrandom samples or parameter
sets are created with regards to the dened ranges of the
SWAT models decision variables, and constant nis also equal to the number of computational cores. The samples
are distributed to the clouds VMs, and each core performs
one simulation by using one of the parameter sets. The par-
ameter set xwhose simulation generates the best tness
value is identied. Then, the DDS optimization is performed
ntimes on parameter set xto create nnew parameter sets.
The next iteration is started by repeating the distribution of
the new parameter sets to the VMs. The calibration process
is terminated when the maximum number of simulations is
reached. Lastly, the parameter set xthat is identied in the
last iteration usually corresponds to the simulation with
the best tness evaluation in all iterations. By the paralleled
DDS optimization, a calibration performed with a high
number of simulations usually generates a more optimal par-
ameter set compared to that of a calibration with a lower
number of simulations.
Testing the cloud calibration system for different SWAT
models
In order to test the effectiveness of the developed cloud com-
puting system (i.e., LCC-SWAT) as a function of increasing
size and complexity of watersheds, three SWAT models
Figure 3 |An overview of the cloud calibration system using the Dynamically Dimen-
sioned Search (DDS) optimization algorithm.
Figure 2 |An overview of the cloud calibration system using the Sequential Uncertainty
Fitting (SUFI-2) optimization algorithm.
7M. Zamani et al. |A Linux-based cloud calibration system for SWAT (LCC-SWAT) Journal of Hydroinformatics |in press |2020
Uncorrected Proof
were set up, i.e., (i) a small agricultural Wigal Creek water-
shed (19 km
2
)(Zhang et al. ), (ii) a medium-sized
Grand River Basin (6,542 km
2
)(Kaur et al. ) and (iii) a
large-sized Canadian Great Lakes Basin (215,918 km
2
).
High spatial resolution spatial dataset (digital elevation
model, land-use and soil; Supplementary Table S1), daily
meteorological dataset (precipitation and temperature; Sup-
plementary Table S1) and crop-management data were
sourced from different agencies and used during the process.
The model setup resulted in 452, 2679 and 29449 HRUs,
respectively, for Wigal, Grand River and Canadian Great
Lakes SWAT models. Models were run for 1,000 simulations
in monthly timescale for a period of 12 years in the cloud plat-
form using the SUFI-2 optimization algorithm.
We tested LCC-SWAT on two criteria, i.e., (i) via analyz-
ing calibration runtime of the framework with varying
number of allocated processors (448) and nodes/VMs (1
2) and (ii) via analyzing comparative performance of the
two algorithms currently implemented in the framework
(DDS and SUFI-2).
The primary purpose of the calibration runtime compari-
son was to provide some insights and guidelines on
identifying optimal computing resource allocation for LCC-
SWAT. The proposed calibration system can be deployed
on multiple VMs or nodes with a predened number of pro-
cessors. The number of cores allocated for any calibration
job should be within the bounds specied in the MPICH2
package, an open-source implementation of the message pas-
sing interface (MPI) method for parallel programming in
Cþþ (used in LCC-SWAT). Moreover, core allocation for a
calibration job should also be within available computational
resource limitations. For instance, we deployed and tested
LCC-SWAT on the SOSCIP cloud platform, where two com-
puting nodes were available with 24 cores each (48 cores in
total). Hence, the maximum core allocation for calibration
was 48 in our experiments. Ideally, a calibration process
with msimulations (assuming all simulations can be executed
simultaneously, i.e., in a single batch) and ncore allocations
where mis greater than ncan be performed faster by specify-
ing n(on a single or multi-node system) to be equal to the
number of available physical cores (which were 48 for our
experiments). However, parallel runtime performance, typi-
cally, does not scale proportionally with the number of
allocated cores and can vary signicantly. Variations in
runtime performance can be attributed to multiple reasons,
e.g., physical cloud infrastructure and coding structure of
the underlying simulation, etc. (Hadka & Reed ).
Hence, we analyzed runtime performance of LCC-SWAT
with a different number of core allocations to (i) deduce opti-
mal number of core allocations for LCC-SWATs deployment
on SOSCIP and (ii) provide insights on deducing optimal
core allocation for deployment on other cloud infrastructure.
For this analysis, we tested the cloud-based calibration system
using nodes in regular increments of four cores at both single
and double node (VMs) congurations.
As different optimization algorithms have their own advan-
tages in terms of converging to the global optima of multi-
dimensioned parameter search, it is important that more than
one optimization algorithm is tested. Hence, the effectiveness
of the above-mentioned optimization algorithms (SUFI-2 and
DDS) was tested for the medium-sized Grand River Basin. Fol-
lowing a global sensitivity analysis using the SWAT-CUP, the 18
most sensitive SWAT parameters (Supplementary Table S2)
were considered to optimize monthly streamow measured
at Grand River near Marsville, one of an upstream streamow
gauging station of the Grand river for an 8-year time period
(20082015). The global sensitivity analysis regresses par-
ameters generated using the LHS methodology (McKay et al.
) against a chosen objective function. We conducted the
sensitivity analysis at monthly timescale using streamow at
the same Grand River near Marsville for the same 8-year
time period (20082015) using the NashSutcliffe Efciency
(NSE; Nash & Sutcliffe ) as the objective function. It
should be noted that SWAT inherently runs in a daily timescale,
as such, in our case, the daily simulations are aggregated in the
monthly timescale. The range (maximum and minimum) of
sensitive parameters values (Supplementary Table S2) were
chosen based on similar reported works in cold-climate
region basins (Faramarzi et al. ;Shrestha et al. ;
Zhang et al. ;Kaur et al. ).
RESULTS AND DISCUSSION
Optimal number of cores
Figure 4 shows the evolution of model runtime when utilizing
an increasing number of cores in single and double node
8M. Zamani et al. |A Linux-based cloud calibration system for SWAT (LCC-SWAT) Journal of Hydroinformatics |in press |2020
Uncorrected Proof
conguration for three separate SWAT models. As expected,
the relative runtimes were higher for more complex SWAT
models (e.g., Great Lakes Basin). Figure 4 indicates that utiliz-
ation of a larger number of cores reduced, in our case, the LCC-
SWAT calibration runtime. However, for both single and
double node congurations, improvement (i.e., reduction) in
calibration runtime continued until the utilization of 20
cores only. The runtime then increased when utilizing the
maximum (24) cores on a single node. The same trend was
also observed for calibration runs utilizing two nodes.
Hence, we found 20 coresin both single anddouble node allo-
cation scenarios to be the optimal core allocation.
As mentioned in the Testing the cloud calibration
system for different SWAT modelssection, runtime per-
formance of parallel frameworks may not be proportional
to the number of allocated cores, and this trend is also
observed for LCC-SWAT. For instance, with single node
deployment, we observed that LCC-SWATs runtime
performance deteriorated when allocated processors
increased from 20 to 24. One plausible reason for this
deterioration is that when all 24 processors are allocated,
the processorsresources are shared for the execution of
simulations and the operating systems internal tasks and
job scheduling. Thus, without idle processors, the concur-
rent execution of different tasks assigned to a nodes
processors causes overloads and results in time latencies
and increasing the overall runtime of the parallel cali-
bration. The computational runtime improvement for a
single node system may also be limited by hardware con-
guration. For instance, LCC-SWAT is deployed on a
cloud infrastructure where the physical cores on a machine
are doubled by hyper-threading technology to form logical
cores which share execution, memory and I/O resources.
Hence, given SWATs high I/O requirements (Zhang et al.
), runtime performance deterioration is expected for
the cloud infrastructure employed in this study.
Figure 4 |Runtime as the function of number of cores in single- and double-node conguration for the three Soil and Water Assessment Tool (SWAT) models. Also shown is the com-
putation time overhead factor.
9M. Zamani et al. |A Linux-based cloud calibration system for SWAT (LCC-SWAT) Journal of Hydroinformatics |in press |2020
Uncorrected Proof
We also observed deterioration in runtime performance
with the utilization of multiple nodes, i.e., the computational
time of a calibration run on two nodes was more than that of
the calibration on a single node with the same number of
total processors. This deterioration was due to expected net-
work latency and I/O operations required in
communication between multi-node systems. Moreover,
SWAT is an I/O intensive (Zhang et al. ) simulation
model, and thus, parallel communication bottlenecks arising
in parallel application of SWAT simulations in multi-node
systems can easily supersede the potential advantage of
the availability of additional cores.
Issues pertaining to linear scaling of runtime perform-
ance of SWAT parallelization frameworks, with the added
number of cores, are also observed in other studies (Ercan
et al. ;Zhang et al. ;Bacu et al. ). Furthermore,
use of an increased number of cores has been debated from
a cost-effective point of view. For instance, Ercan et al. ()
showed, with an experiment involving 256 cores, that use of
64 cores was the most desirable from the economical point
of view.
Figure 4 also reports calculation time overhead, calcu-
lated as the ratio of the runtime in any conguration to
the runtime for the optimal cores (20). This metric indicates
that a cloud-based computing system (regardless of cloud
infrastructure) could reduce calibration runtime of SWAT
signicantly (compared to desktop systems up to 8 cores).
From the perspective of computational time overhead, the
added value of such a cloud-based computing system is high-
lighted especially for the complex SWAT model, the Great
Lakes Basin. For this SWAT model, the computation time
overhead for all the congurations is lower than that
observed for the less complex model (e.g., Wigle Water-
shed). For example, for two nodes conguration and using
48 cores, the computation time overhead for the Wigle
Watershed was 3.64 which reduced to 2.00 and 1.87 for
the Grand River and Great Lakes Basin, respectively.
Comparison of SUFI-2 and DDS algorithms
As a test of the cloud calibration system, both SUFI-2 and
DDS algorithms were run for 1,000 simulations with NSE
(Nash & Sutcliffe ) as the objective function to optimize
monthly streamow at Grand River near Marysville. While
the NSE was the objective function, we also calculated
PBIAS and R
2
to assign a qualitative rating to model simu-
lation (Moriasi et al. ). Moreover, two performance
aspects were considered during the comparative analysis
of DDS and SUFI-2, i.e., (i) posterior parameter distributions
(using behavioral solutions) obtained from both algorithms
(see Section Analyzing posterior parameter distributions)
and (ii) calibration statistics (of best calibrations found)
and predictive uncertainty bounds obtained from both algor-
ithms (see Section Calibration statistics and predictive
uncertainty).
Analyzing posterior parameter distributions
Figure 5 shows the posterior distributions of the three most
sensitive parameters (identied during global sensitivity
analysis; see Section Testing the cloud calibration system
for different SWAT models), obtained from DDS and
SUFI-2. Following Moriasi et al. (), all solutions with
an NSE value more than 0.8 were considered as behavioral
solutions, and the posterior distributions (represented by
histograms in Figure 5) were estimated using behavioral sol-
utions only. The results in Figure 5 show that DDS was
successful in obtaining a narrower and more well-dened
parameter distribution for all three parameters. The DDS-
based posterior distribution of all parameters showed a
clear and high relative frequency (0.7), while the SUFI-2-
based posterior distribution (Supplementary Table S2)
seems to be spread in a wider range. In stochastic modeling
paradigm, the ability of an optimization algorithm to clearly
identify an optimal range parameter is important given the
issues related to parameter identiability (Chavent ),
especially for the SWAT model that is often regarded as
an over-parameterized model (Nossent et al. ). Thus,
the posterior distribution obtained from DDS is clearly
better.
The relative superiority of DDS can be attributed to the
algorithms iterative search dynamics, where, in each paral-
lel simulation batch, new candidate calibration solutions are
obtained by perturbing the best calibration found so far (see
Figure 3 and Section Dynamically dimensioned search).
Moreover, DDS only perturbs a subset of parameters in
each algorithm iteration, and the number of parameters to
be perturbed reduces as the algorithm progresses (see
10 M. Zamani et al. |A Linux-based cloud calibration system for SWAT (LCC-SWAT) Journal of Hydroinformatics |in press |2020
Uncorrected Proof
Section Dynamically dimensioned searchand Equation (1)).
This strategy is especially effective for calibration problems
with many parameters (Asadzadeh & Tolson ). SUFI-
2, on the other hand, uses LHS (McKay et al. ), which
is a uniform stochastic space-lling design, and thus,
posterior distributions obtained from SUFI-2 are more
uniform. However, if the budget of functions evaluations
for SUFI-2 is increased (from 1,000; which is not desirable
for computationally expensive SWAT models), posterior dis-
tributions of parameters will become more well-dened and
narrower.
Calibration statistics and predictive uncertainty
Figure 6 shows the 95% predictive parameter uncertainty
bands on the monthly streamow for the 8-year period
(20082015), for both DDS and SUFI-2. Owing to the
better identiability of parameters by the DDS algorithm
over the SUFI-2 algorithm, the DDS 95% predictive par-
ameter uncertainty band on monthly streamow also
Figure 6 |The 95% predictive parameter uncertainty (PPU) band on monthly streamow
of Grand River near Marsville. PBIAS: Percentage of bias, NSE: Nash-Sutcliffe
Efciency, and R2: Coefcient of determination.
Figure 5 |Posterior distribution of top three most sensitive parameters for all behavioral solutions from Sequential Uncertainty Fitting (SUFI-2) and Dynamically Dimensioned Search (DDS)
optimization algorithms. SMTMP: Snow melt temperature, CN2: Curve number for moisture condition II, and SFTMP: Snow fall temperature.
11 M. Zamani et al. |A Linux-based cloud calibration system for SWAT (LCC-SWAT) Journal of Hydroinformatics |in press |2020
Uncorrected Proof
consistently out-performed of the SUFI-2 algorithm
(Figure 6). While the p-stat (percentage of the observations
encapsulated in the 95% PPU band) values for both optimiz-
ation algorithms are fairly comparable, there is a signicant
difference in the r-stat (the thickness of the 95% PPU band).
The SUFI-2-based optimization resulted in a wider 95% PPU
band (r-stat ¼1.14), inferring to higher uncertainty in simu-
lated monthly streamow at Grand River near Marysville.
Furthermore, all the goodness-of-t statistics (pertaining to
the deterministic modeling paradigm) for the best-t simu-
lation also showed consistent underperformance of the
SUFI-2 algorithm (compared to DDS). Following Moriasi
et al. ()model performance criterion, the qualitative
rating for the SUFI-2 based simulated monthly streamow
is good, while the same for the DDS-based simulation is
very good.
As stated in the Sequential uncertainty tting (SUFI-2)
and Analyzing posterior parameter distributionssections,
SUFI-2 uses LHS (McKay et al. ), with uniform a priori
distribution of model parameter from the dened range, to
search for optimal solutions in a high-dimensional (i.e., with
18 parameters) parameter space. It is therefore evident that
SUFI-2 optimization may not always reach the neighborhood
of the global optima. Whereas the DDS algorithm initially
explores globally in the solution space and changes the
search domain gradually to local searches (by reducing the
number of parameters to be perturbed; see Section Dynami-
cally dimensioned search). Hence, there is a higher chance
that the DDS algorithm may nd solutions close to the
global optima. In our example case, the higher performance
of the DDS algorithm may be related to the above-stated
reasons. It should, however, be noted that further studies
are needed to explicitly conclude the added advantage of
one optimization algorithm over another. This cloud-based
calibration (and uncertainty analysis) system indeed offers a
platform to conduct such a computationally demanding task.
Recommendations, limitations and future perspective
of the work
It is well known that SWAT is a highly parameterized model
(Nossent et al. ), as such is highly I/O intensive. The
LCC-SWAT system is exible and can be deployed on any
cloud platform (and with the different number of available
computational cores). However, increasing the number of
cores may require the addition of more computational
nodes or VMs. Since multiple nodes may be linked together
by networks with I/O trafc affecting, overall computational
time, it is imperative that multi-node computational over-
head is considered before deploying LCC-SWAT (and
similar frameworks) on multi-node cloud infrastructure (as
indicated in the results discussed in the Optimal number
of coressection). When frameworks similar to LCC-
SWAT are deployed on multi-node cloud platforms, the
architecture of the networks storage determines signi-
cantly the data storage, data retrieval and computational
costs. Therefore, a very fast and dedicated network storage
is a great advantage (and is highly recommended for LCC-
SWAT deployment on multi-node systems) to boost the
high demands of data accesses by VMs during parallel
multi-node simulation runs in LCC-SWAT.
LCC-SWATs implementation is modular and, thus, can
incorporate extensions and modications in future, to
improve the systems ease-of-use and calibration perform-
ance. In this regard, three key extension/improvement
avenues are (i) inclusion of more optimization algorithms,
especially multi-objective algorithms for exploring calibration
trade-offs, (ii) asynchronous parallel implementation of exist-
ing and new algorithms, for enhancing runtime efciency and
(iii) implementation of an interactive user interface (the user
interface of LCC-SWAT is currently, console-based). Given
the computational time overhead induced by the cloud infra-
structure and high I/O intensive nature of SWAT simulations,
parallel efciency of the LCC-SWAT framework may benet
signicantly from the inclusion of asynchronous parallel
optimization algorithms (Zhabitskaya & Zhabitsky ).
The user interface of LCC-SWAT can be enhanced from a
console-based interaction to a more user-friendly visual inter-
face, and the interface enhancement can be used as a
blueprint when the LCC-SWAT system is planned to be distrib-
uted for client-based accesses through the internet.
Development of a friendly user-interface may be especially
important to enable broad uptake and use of the LCC-SWAT
system. Many research and public organizations (e.g., U.S.
Environmental Protection Agency (USEPA), Ontario Ministry
of Natural Resources and Forestry (OMAFRA), etc.), use
SWAT for watershed modeling (Francesconi et al. )and
watershed nutrient management. However, efcient and
12 M. Zamani et al. |A Linux-based cloud calibration system for SWAT (LCC-SWAT) Journal of Hydroinformatics |in press |2020
Uncorrected Proof
effective SWAT calibration remains a challenge for such
organizations. It is envisioned that LCC-SWAT could be
used as a cloud calibration service tool by such organizations
(especially organizations in Canada) in future.
SUMMARY AND CONCLUSIONS
In this study, a cloud calibration and uncertainty analysissystem
offering two paralleled optimization algorithms (SUFI-2
and DDS) is developed for SWAT models, and deployed on
the SOSCIP Cloud Analytic platform. The proposed cloud-
based system, called LCC-SWAT, is a key contribution to
the watershed calibration practice that allows parallel bench-
marking of alternate parallel optimization algorithms on
different cloud computing architectures. An illustration of
the potential application of LCC-SWAT in parallel bench-
marking of calibration algorithms is provided in this study
via a comparison of parallel SUFI-2 and parallel DDS with
a budget of 1,000 evaluations and with 20 cores each. Results
show that the performance of DDS is better than SUFI-2.
Results of performance benchmarking of the LCC-
SWAT system on both single (i.e., with one VM) and dual
node (i.e., with two VMs) architectures are also provided
with the application to three SWAT models of increasing
complexities. Although a maximum of 48 cores in two
VMs were available, results indicate that 20 cores in a
single virtual machine is an optimal conguration (as per
runtime perspective) for the cloud architecture tested in
this study. However, for more complex watershed models,
the runtime efciency of multi-node systems improves
since the computation time overhead reduces and core util-
ization improves. These results also indicated that an
asynchronous parallel implementation may further improve ef-
ciency and scalability of LCC-SWAT for multi-node systems.
Moreover, the design of LCC-SWAT is modular and exible;
thus, other single and multi-objective parallel algorithms can
be added to enrich the system for efciently solving future
large-scale watershed model calibration problems. Finally,
LCC-SWATwasalsosuccessfullyintegratedintothe
CANWETplatform. The platform is designed to facilitate the
use of modeling as a means of watershed management and
policy testing. Further information is available at https://www.
grnland.com/Greenland-Technologies-Group/CANWET.html.
ACKNOWLEDGEMENTS
The authors thank SOSCIP Smart Computing for Innovation
for providing advanced computing platform.
FUNDING
The corresponding author was supported with Natural
Sciences and Engineering Research (NSERC) discovery grant
(number: 2017-04400). Funding and access to parallel comput-
ing resources were provided to the project by SOSCIP Smart
Computing for Innovation. Greenland International Consult-
ing Ltd was an in-kind contributor to the research.
CONFLICTS OF INTEREST
The authors declare that there is no conict of interest.
AUTHOR CONTRIBUTION
P.D. and T.B. conceptualized the study. M.Z. developed the
LCC-SWAT platform. N.K.S. tested the platform for various
SWAT models for runtime, parameter identication, cali-
bration and uncertainty analysis. M.Z., N.K.S. and T.A.
drafted and revised the manuscript. P.D. and T.B. provided
their comments and revision.
DATA AVAILABILITY STATEMENT
Data cannot be made publicly available; readers should con-
tact the corresponding author for details Q3.
REFERENCES
Abbaspour, K. C.  SWAT Calibration and Uncertainty
Programs A User Manual. Swiss Federal Institute of
Aquatic Science and Technology (Eawag).
Abbaspour, K. C., Johnson, C. A. & van Genuchten, M. T. 
Estimating uncertain ow and transport parameters using a
sequential uncertainty tting procedure.Vadose Zone
Journal 3(4), 13401352.
13 M. Zamani et al. |A Linux-based cloud calibration system for SWAT (LCC-SWAT) Journal of Hydroinformatics |in press |2020
Uncorrected Proof
Abbaspour, K. C., Yang, J., Maximov, I., Siber, R., Bogner, K.,
Mieleitner, J., Zobrist, J. & Srinivasan, R.  Modelling
hydrology and water quality in the pre-alpine/alpine Thur
watershed using SWAT.Journal of Hydrology 333 (24),
413430.
Ahmadisharaf, E., Camacho René, A., Zhang Harry, X.,
Hantush Mohamed, M. & Mohamoud Yusuf, M. 
Calibration and validation of watershed models and
advances in uncertainty analysis in TMDL studies.Journal
of Hydrologic Engineering 24 (7), 03119001.
Arnold, J. G., Srinivasan, R., Muttiah, R. S. & Williams, J. R. 
Large area hydrologic modeling and assessment part I:
model development.Journal of the American Water
Resources Association 34 (1), 7389.
Arnold, J. G., Kiniry, J. R., Srinivasan, R., Williams, J. R., Haney,
E. B. & Neitsch, S. L.  Soil and Water Assessment Tool
Input/Output File Documentation, Version 2009. Agrilife
Blackland Research Center, Temple, Texas.
Arnold, J. G., Moriasi, D. N., Gassman, P. W., Abbaspour, K. C.,
White, M. J., Srinivasan, R., Santhi, C., Harmel, R. D., van
Griensven, A., Van Liew, M. W., Kannan, N. & Jha, M. K.
 SWAT: model use, calibration, and validation. American
Society of Agricultural and Biological Engineers 55 (4), 18.
Asadzadeh, M. & Tolson, B.  Pareto archived dynamically
dimensioned search with hypervolume-based selection for
multi-objective optimization.Engineering Optimization
45 (12), 14891509.
Astsatryan, H., Narsisian, W. & Asmaryan, S.  SWAT
hydrological model as a DaaS cloud service.Earth Science
Informatics 9(3), 401407.
Bacu, V., Mihon, D., Rodila, D., Stefanut, T. & Gorgan, D. 
Grid based architectural components for SWAT model
calibration. In HPCS 2011 - International Conference on
High Performance Computing and Simulation,48 July,
Istanbul, Turkey, pp. 193198.
Bacu, V., Nandra, C., Stefanut, T. & Gorgan, D.  SWAT model
calibration over Cloud infrastructures using the BigEarth
platform. In 2017 13th IEEE International Conference on
Intelligent Computer Communication and Processing (ICCP),
pp. 453460.
Beven, K. & Binley, A.  The future of distributed models:
model calibration and uncertainty prediction.Hydrological
Processes 6(3), 279298.
Borah, D. K., Ahmadisharaf, E., Padmanabhan, G., Imen, S. &
Mohamoud, Y. M.  Watershed models for development
and implementation of total maximum daily loads.Journal of
Hydrologic Engineering 24 (1), 03118001.
Chappell, D.  Introducing Windows Azure. David Chappell
and Associates.
Chavent, G.  On parameter identiability.IFAC Proceedings
Volumes 18 (5), 531536.
Deb, K., Pratap, A., Agarwal, S. & Meyarivan, T.  A fast and
elitist multiobjective genetic algorithm: NSGA-II.IEEE
Transactions on Evolutionary Computation 6(2), 182197.
Devia, G. K., Ganasri, B. P. & Dwarakish, G. S.  A review on
hydrological models.Aquatic Procedia 4, 10011007.
enviroGRIDS  The enviroGRIDS Project. Available from:
http://www.envirogrids.net/
Ercan, M. B. & Goodall, J. L.  Design and implementation of a
general software library for using NSGA-II with SWAT for
multi-objective model calibration.Environmental Modelling
& Software 84, 112120.
Ercan, M. B., Goodall, J. L., Castronova, A. M., Humphrey, M. &
Beekwilder, N.  Calibration of SWAT models using the
cloud.Environmental Modelling & Software 62, 188196.
Faramarzi, M., Srinivasan, R., Iravani, M., Bladon, K. D.,
Abbaspour, K. C., Zehnder, A. J. B. & Goss, G. G. 
Setting up a hydrological model of Alberta: data
discrimination analyses prior to calibration.Environmental
Modelling & Software 74,4865.
Francesconi, W., Srinivasan, R., Pérez-Miñana, E., Willcock, S. P.
& Quintero, M.  Using the Soil and Water Assessment
Tool (SWAT) to model ecosystem services: a systematic
review.Journal of Hydrology 535, 625636.
Gorgan, D., Bacu, V., Mihon, D., Rodila, D., Abbaspour, K. &
Rouholahnejad, E.  Grid based calibration of SWAT
hydrological models.Natural Hazards and Earth System
Sciences 12 (7), 24112423.
Gupta, H. V., Sorooshian, S. & Yapo, P. O.  Toward improved
calibration of hydrologic models: multiple and
noncommensurable measures of information.Water
Resources Research 34 (4), 751763.
Gupta, H. V., Kling, H., Yilmaz, K. K. & Martinez, G. F. 
Decomposition of the mean squared error and NSE
performance criteria: implications for improving
hydrological modelling.Journal of Hydrology 377 (1), 8091.
Gupta, H. V., Clark, M. P., Vrugt, J. A., Abramowitz, G. & Ye, M.
 Towards a comprehensive assessment of model
structural adequacy.Water Resources Research 48 (8). Q4
Hadka, D. & Reed, P.  Borg: an auto-adaptive many-objective
evolutionary computing framework.Evolutionary
Computation 21 (2), 231259.
Hadka, D. & Reed, P.  Large-scale parallelization of the Borg
multiobjective evolutionary algorithm to enhance the
management of complex environmental systems.
Environmental Modelling & Software 69, 353369.
Han, F. & Zheng, Y.  Multiple-response Bayesian calibration
of watershed water quality models with signicant input and
model structure errors.Advances in Water Resources 88,
109123.
Humphrey, M., Beekwilder, N., Goodall, J. L. & Ercan, M. B. 
Calibration of watershed models using cloud computing. In
2012 IEEE 8th International Conference on E-Science, pp.
18.
Joseph, J. F. & Guillaume, J. H. A.  Using a parallelized
MCMC algorithm in R to identify appropriate likelihood
functions for SWAT.Environmental Modelling & Software
46, 292298.
14 M. Zamani et al. |A Linux-based cloud calibration system for SWAT (LCC-SWAT) Journal of Hydroinformatics |in press |2020
Uncorrected Proof
Kan, G., He, X., Li, J., Ding, L., Hong, Y., Zhang, H., Liang, K. &
Zhang, M.  Computer aided numerical methods for
hydrological model calibration: an overview and recent
development.Archives of Computational Methods in
Engineering 26 (1), 3559.
Kaur, B., Shrestha, N. K., Daggupati, P., Rudra, R. P., Goel, P. K.,
Shukla, R. & Allataifeh, N.  Water security assessment of
the grand river watershed in Southwestern Ontario, Canada.
Sustainability 11 (7), 1883.
Leta, O. T., Nossent, J., Velez, C., Shrestha, N. K., van Griensven,
A. & Bauwens, W.  Assessment of the different sources of
uncertainty in a SWAT model of the River Senne (Belgium).
Environmental Modelling and Software 68, 129146.
McKay, M. D., Beckman, R. J. & Conover, W. J.  A
comparison of three methods for selecting values of input
variables in the analysis of output from a computer code.
Technometrics 21 (2), 239245.
Moriasi, D. N., Gitau, M. W., Pai, N. & Daggupati, P. 
Hydrologic and water quality models: performance measures
and evaluation criteria. 58 (6).
Q5
Nash, J. E. & Sutcliffe, J. V.  River ow forecasting through
conceptual models part I a discussion of principles.Journal
of Hydrology 10 (3), 282290.
Nossent, J., Elsen, P. & Bauwens, W.  Sobolsensitivity
analysis of a complex environmental model.Environmental
Modelling & Software 26 (12), 15151525.
ODonncha, F., Ragnoli, E., Venugopal, S., James, S. C. & Katrinis,
K.  On the efciency of executing hydro-environmental
models on cloud.Procedia Engineering 154, 199206.
Rodila, D., Bacu, V. & Gorgan, D.  Comparative parallel
execution of SWAT hydrological model on multicore and
grid architectures.International Journal of Web and Grid
Services 8(3), 304320.
Rodila, D., Ray, N. & Gorgan, D.  Conceptual model for
environmental science applications on parallel and
distributed infrastructures.Environmental Systems Research
4(1), 23.
Rouholahnejad, E., Abbaspour, K. C., Vejdani, M., Srinivasan, R.,
Schulin, R. & Lehmann, A.  A parallelization framework
for calibration of hydrological models.Environmental
Modelling & Software 31,2836.
Shirmohammadi, A., Chaubey, I., Harmel, R. D., Bosch, D. D.,
Muñoz-Carpena, R., Dharmasri, C., Sexton, A., Arabi, M.,
Wolfe, M. L., Frankenberger, J., Graff, C. & Sohrabi, T. M.
 Uncertainty in TMDL models.Transactions of the
ASABE 49 (4), 10331049.
Shrestha, N. K., Du, X. & Wang, J.  Assessing climate change
impacts on fresh water resources of the Athabasca River
Basin, Canada.Science of The Total Environment 601602,
425440.
SOSCIP  Cloud Analytics. Smart Computing for Innovation
(SOSCIP). Available from: https://www.soscip.org/
2017impactreport/platforms/.
Tayfur, G.  Modern optimization methods in water resources
planning, engineering and management.Water Resources
Management 31 (10), 32053233.
Tolson, B. A. & Shoemaker, C. A.  Dynamically dimensioned
search algorithm for computationally efcient watershed
model calibration.Water Resources Research 43 (1). Q6
van Griensven, A. & Meixner, T.  A global and efcient multi-
objective auto-calibration and uncertainty estimation method
for water quality catchment models.Journal of
Hydroinformatics 9(4), 277291.
Vrugt, J. A.  Markov chain Monte Carlo simulation using the
DREAM software package: theory, concepts, and MATLAB
implementation.Environmental Modelling & Software 75,
273316.
Vrugt, J. A., Gupta, H. V., Bouten, W. & Sorooshian, S.  A
Shufed Complex Evolution Metropolis algorithm for
optimization and uncertainty assessment of hydrologic
model parameters. Water Resources Research 39 (8). Q7
Vrugt, J. A., ter Braak, C. J. F., Clark, M. P., Hyman, J. M. &
Robinson, B. A.  Treatment of input uncertainty in
hydrologic modeling: doing hydrology backward with
Markov Chain Monte Carlo simulation.Water Resources
Research 44 (12). Q8
Yang, J., Jakeman, A., Fang, G. & Chen, X.  Uncertainty
analysis of a semi-distributed hydrologic model based on
a Gaussian Process emulator.Environmental Modelling &
Software 101, 289300.
Yapo, P. O., Gupta, H. V. & Sorooshian, S.  Multi-objective
global optimization for hydrologic models.Journal of
Hydrology 204 (1), 8397.
Zhabitskaya, E. & Zhabitsky, M.  Asynchronous differential
evolution. In: Mathematical Modeling and Computational
Science. MMCP 2011. Lecture Notes in Computer Science
(G. Adam, J. Buša & M. Hnatič, eds). Springer, Berlin.
Zhang, D., Chen, X. & Yao, H.  Development of a prototype
web-based decision support system for watershed
management. Water 7(2). Q9
Zhang, D., Chen, X., Yao, H. & James, A.  Moving SWAT
model calibration and uncertainty analysis to an enterprise
Hadoop-based cloud.Environmental Modelling & Software
84, 140148.
Zhang, Y., Hou, J., Cao, Y., Gu, J. & Huang, C.  OpenMP
parallelization of a gridded SWAT (SWATG).Computers &
Geosciences 109, 228237.
Zhang, B., Shrestha, N., Daggupati, P., Rudra, R., Shukla, R., Kaur,
B. & Hou, J.  Quantifying the impacts of climate change
on streamow dynamics of two major rivers of the Northern
Lake Erie Basin in Canada. Sustainability 10 (8). Q10
Zhang, B., Shrestha, N. K., Rudra, R., Shukla, R., Daggupati, P.,
Goel, P. K., Dickinson, W. T. & Allataifeh, N.  Threshold
storm approach for locating phosphorus problem areas: an
application in three agricultural watersheds in the Canadian
Lake Erie basin. Journal of Great Lakes Research.Q11
First received 23 April 2020; accepted in revised form 22 September 2020. Available online 20 October 2020
15 M. Zamani et al. |A Linux-based cloud calibration system for SWAT (LCC-SWAT) Journal of Hydroinformatics |in press |2020
Uncorrected Proof
Author Queries
Journal: Journal of Hydroinformatics
Manuscript: HYDRO-D-20-00066
Q1 Please indicate which authors, if any, are IWA members.
Q2 Please conrm whether the Highlights points used in the article are correct.
Q3 Please conrm whether the Data availability statement used in the article are correct.
Q4 Please provide missing page range for reference Gupta et al. 2012.
Q5 Please provide missing Journal title and page number for reference Moriasi et al. 2015.
Q6 Please provide missing page range for reference Tolson and Shoemaker 2007.
Q7 Please provide missing page range for reference Vrugt et al. 2003.
Q8 Please provide missing page range for reference Vrugt et al. 2008.
Q9 Please provide missing page range for reference Zhang et al. 2015.
Q10 Please provide missing page range for reference Zhang et al. 2018.
Q11 Please provide missing volume and page number for reference Zhang et al. 2020.
Disclaimer
This is the uncorrected version of your paper sent to you with the DOI that will be used for the published paper (Version of
Record). The uncorrected version will show online while the following services are applied to your manuscript; copyediting,
proofreading and typesetting. To see the most current version of your paper, please use the DOI provided.
... The regulation of ecosystems prone to flooding is influenced by the regulation of the wandering watershed in the region and the combined effect of complex and variable climate, hydrology, soil, land use, vegetation cover, and plant transpiration [1,22]. Therefore, a surface runoff simulation and calibration model should be adopted to simulate the real hydrological formation process in a complex environment, and more detailed climate, soil, land-use databases, and calibration methods should be used to improve the assessment accuracy of FRES supply [9,22,29]. Current methods of measuring the demand for FRES focus on drawing susceptibility or risk zoning maps [30][31][32][33] through flood susceptibility or risk analysis [17,[34][35][36]. Specifically, this includes economic loss calculation [6], inundated area simulation measurements [15], the index weight method [18], data mining and machine learning [30,31], the logistic regression model [33,37], the analytic hierarchy process [17,32,35], frequency ratio method [36] and the deep neural network method [38,39]. ...
... Land-use data were obtained based on the Data Center for Resources and Environmental Sciences, Chinese Academy of Sciences. According to the data requirements of the SWAT model, soil-type data were obtained based on HWSD soil database, and meteorological data were obtained based on CMADS dataset [29,42]. The data on water system, road, medical treatment, and firefighting facilities were downloaded based on AMAP and the universal electronic map of Shuijingzhu. ...
... Its simulation object is a watershed scale, which can simulate hydrology and related material migration and transformation by integrating the topography and geology, soil, land use, weather, and management measures of the basin. The SWAT model can calculate hundreds of sub-basins at the same time and has the characteristics of a flexible watershed and sub-basin and calculation structure [29]. At present, the model has become an indispensable tool in water conservation management planning and is used by some national and local government decision makers. ...
Article
Full-text available
Global climate change has led to flood disasters increasing in terms of frequency and damage caused, which seriously threatens urban and rural security. The flood regulation (FR) service function of the ecosystem plays an important role in mitigating flood disaster risk. Previous studies on flood regulation ecosystem services (FRES) are still lacking in a cross-scale assessment of supply and demand, refined simulation of regional complex hydrology, and application of spatial zoning management. Taking the Fujian Delta as an example, this study established a cross-scale research framework based on the social-ecosystem principle. The SWAT model was used to simulate the regional hydrological runoff and calculate the macro-scale supply of FRES. Taking patches of land as units, a flood risk assessment model was constructed to calculate the micro-scale demand for FRES for urban and rural society. Through a comparison of supply and demand across spatial scales, a zoning management scheme to deal with flood disaster risk was proposed. The results showed that: (1) The supply of FRES differed greatly among the sub-basins, and the sub-basins with low supply were mostly distributed in the lower reaches of Jiulong River and the coastal areas. (2) The demand for FRES was concentrated in high-density urban built-up areas. (3) By comparing the supply and demand of FRES in sub-basin units, 2153 km2 ecological space was identified as the primary ecological protection area, and 914 km2 cultivated land and bare land were identified as the primary ecological restoration area. (4) By comparing the supply and demand of FRES of land patch units, 65.42 km2 of construction land was identified as the primary intervention area. This study provides a decision-making basis for regional flood disaster management from the perspective of FRES.
... In general, there are four broad types of research methods for alleviating the computational burden associated with computationally expensive model applications: (1) utilizing metamodeling approaches (Chandra et al., 2020;Sun et al., 2015), (2) developing computationally efficient algorithms (Humphrey et al., 2012;Joseph and Guillaume, 2013), (3) opportunistically avoiding model evaluations (Razavi et al., 2010), and (4) utilizing parallel computing technologies and infrastructures (Yang et al., 2020;Huang et al., 2019;Wu et al., 2013Wu et al., , 2014Zamani et al., 2020). The first, second, and third ideas above share the same goal of reducing computational demand by using lightweight surrogate models, by decreasing the number of model simulations, and by terminating model execution early when the simulation result is poorer than expected, respectively. ...
... Among these methods, model parallelization is the most frequently adopted. It has been extensively applied to optimize the efficiency of generic modeling activities, such as model calibration (Zhang et al., 2013;Ercan et al., 2014;Gorgan et al., 2012), sensitivity analysis (Khalid et al., 2016;Hu et al., 2015), uncertainty analysis Wu and Liu, 2012;Zamani et al., 2020) and the identification of beneficial management practices (Liu et al., 2013). For spatially explicit models, it is possible to decompose a large-scale model into multiple smaller models and, in parallel, to simulate the independent smaller models to further improve model performance (hereafter, for simplicity, this spatial-decomposition and simulation method is referred to as the spatial-decomposition method). ...
Article
Full-text available
High-fidelity and large-scale hydrological models are increasingly used to investigate the impacts of human activities and climate change on water availability and quality. However, the detailed representations of real-world systems and processes contained in these models inevitably lead to prohibitively high execution times, ranging from minutes to days. Such models become computationally prohibitive or even infeasible when large iterative model simulations are involved. In this study, we propose a generic two-level (i.e., watershed- and subbasin-level) model parallelization schema to reduce the run time of computationally expensive model applications through a combination of model spatial decomposition and the graph-parallel Pregel algorithm. Taking the Soil and Water Assessment Tool (SWAT) as an example, we implemented a generic tool named GP-SWAT, enabling watershed-level and subbasin-level model parallelization on a Spark computer cluster. We then evaluated GP-SWAT in two sets of experiments to demonstrate the ability of GP-SWAT to accelerate single and iterative model simulations and to run in different environments. In each test set, GP-SWAT was applied for the parallel simulation of four synthetic hydrological models with different input/output (I/O) burdens. The single-model parallelization results showed that GP-SWAT can obtain a 2.3–5.8-times speedup. For multiple simulations with subbasin-level parallelization, GP-SWAT yielded a remarkable speedup of 8.34–27.03 times. In both cases, the speedup ratios increased with an increasing computation burden. The experimental results indicate that GP-SWAT can effectively solve the high-computational-demand problems of the SWAT model. In addition, as a scalable and flexible tool, it can be run in diverse environments, from a commodity computer running the Microsoft Windows operating system to a Spark cluster consisting of a large number of computational nodes. Moreover, it is possible to apply this generic tool to other subbasin-based hydrological models or even acyclic models in other domains to alleviate I/O demands and to optimize model computational performance.
... Identifying algorithms that are effective with a limited number of model evaluations provides tools that can both be used for large watersheds and for watershed models with a lot of spatial detail (both of which are more computationally expensive and hence the number of model evaluations will be limited). Moreover, such algorithms are also effective in multi-step/ sequential calibration scenarios, i.e., scenarios where calibration experts apply automatic algorithms for model calibration in multiple iterations, with changes made in parameter choice and range in each iteration (Xia et al., 2022a;Wu et al., 2021;Franco et al., 2020;Zamani et al., 2020). Thus, assessment and identification of suitable MO algorithms for expensive watershed problems and limited evaluation budgets is important. ...
... The first one is calibrating and validating the two models independently and integrating them for a forward run (Bailey et al. 2016;Chunn et al. 2019;Wei et al. 2019). The second procedure is calibrating the two models independently and recalibrating for selected parameters after integrating the models (Taie The SWAT model is commonly calibrated using either Sequential Uncertainty Fitting 2 (SUFI2) or GLUE (Arnold et al. 2012;Kumar et al. 2017;Zamani et al. 2021). On the other hand, MODFLOW calibration is performed commonly using a parameter estimation (PEST) interface (Doherty 2004). ...
Article
Full-text available
River-aquifer interaction is a key component of the hydrological cycle that affects water resources and quality. Recently, the application of integrated models to assess interaction has been increasing. However, calibration and uncertainty analysis of coupled models has been a challenge, especially for large-scale applications. In this study, we used PESTPP-IES, an implementation of the Gauss-Levenberg–Marquardt iterative ensemble smoother, to calibrate and quantify the uncertainty of an integrated SWAT-MODFLOW model for watershed-scale river aquifer interaction assessment. SWAT-MODFLOW combines the Soil and Water Assessment Tool (SWAT), a widely used watershed model, with a three-dimensional groundwater flow model (MODFLOW). The calibration performance of the model was evaluated, and the uncertainty in the parameters and observed ensemble, including the uncertainty in forecasting groundwater levels, was assessed. The results showed that the technique could enhance the model performance and reduce uncertainty. However, the results also revealed some limitations and biases, such as overestimating the groundwater levels in most monitoring wells. These biases were attributed to the limited availability of groundwater level in the first year of the calibration and the uncertainty in groundwater flow model parameters. The river-aquifer interactions analysis shows that water exchange occurs in almost all cells along the river, with most of the high-elevation areas receiving groundwater and flatter regions discharging water to the aquifer. The study showed that PESTPP-IES is a robust technique for watershed-scale river-aquifer modeling that can ensure model calibration and parameter uncertainty analysis. The findings of this study can be used to improve water resources management in watersheds and help decision-makers in making informed decisions.
Article
Full-text available
The Soil and Water Assessment Tool (SWAT) has been widely applied to simulate the hydrological cycle, investigate cause-and-effect relationships, and aid decision-making for better watershed management. However, the software tools for model dataset analysis and visualization to support informed decision-making in a web environment are not considered fully fledged and are technically intensive to implement. This study focuses on addressing these issues by establishing a tool and library (named PAVLIB4SWAT) that can largely reduce technical expertise requirements for developers to adopt and customize this work to their own demands. Specifically, we created PAVLIB4SWAT based on a Kepler.gl widget to visualize SWAT model data, including shapefiles from the watershed delineation process, model inputs, and simulated results via dynamic and interactive maps. We evaluated PAVLIB4SWAT through a Jinjiang watershed SWAT model use case to demonstrate its utility and ease of adoption. The case study shows that PAVLIB4SWAT can provide various geospatial analysis and mapping functionalities for SWAT models and can flexibly distribute visualized results as standalone offline web pages and web servers. In addition, PAVLIB4SWAT was designed as an open-source project and implemented purely in the Python programming language; thus, developers can easily adapt and customize it to suit their demands.
Preprint
Full-text available
River-aquifer interaction is a key component of the hydrological cycle that affects water resources and quality. Recently, the application of integrated models to assess the interaction has been increasing. However, calibration and uncertainty analysis of coupled models has been a challenge, especially for large-scale applications. In this study, we used PESTPP-IES, an implementation of the Gauss-Levenberg-Marquardt iterative ensemble smoother, to calibrate and quantify the uncertainty of an integrated SWAT-MODFLOW model for watershed-scale river aquifer interaction assessment. SWAT-MODFLOW combines the Soil and Water Assessment Tool (SWAT), a widely used watershed model, with a three-dimensional groundwater flow model (MODFLOW). The calibration performance of the model was evaluated, and the uncertainty in the parameters and observed ensemble, including the uncertainty in forecasting groundwater levels, was assessed. The results showed that the technique could enhance the model performance and reduce uncertainty. However, the results also revealed some limitations and biases, such as overestimating the groundwater levels in most monitoring wells. These biases were attributed to the limited availability of groundwater level in the first year of the calibration and the uncertainty in groundwater flow model parameters. The river-aquifer interactions analysis shows that water exchange occurs in almost all cells along the river, with most of the high-elevation areas receiving groundwater and flatter regions discharging water to the aquifer. The study showed that PESTPP-IES is a robust technique for watershed-scale river-aquifer modeling that can ensure model calibration and parameter uncertainty analysis. The findings of this study can be used to improve water resources management in watersheds and help decision-makers in making informed decisions.
Article
Automatic calibration (autocalibration) of models is a standard practice in hydrologic sciences. However, hydrologic modelers, while performing autocalibrations, spend considerable amount of time in data pre-processing, coding, and running simulations rather than focusing on science questions. Such inefficiency, as this paper outlines, stems from: (i) platform dependence, (ii) limited computational resource, (iii) limited programming literacy, (iv) limited model structure and source code literacy, and (v) lack of data-model interoperability in the so-called autocalibration process. By expanding and enhancing an existing web-based modeling platform SWATShare, developed for the Soil and Water Assessment Tool (SWAT) hydrologic model, this paper demonstrates a generalizable pathway to making autocalibration efficient via cyberinfrastructure (CI) solutions. SWATShare is a collaborative platform for sharing and visualization of SWAT models, model results, and metadata online. This paper describes the front and back end architectures of SWATShare for enabling efficient SWAT model autocalibration on the web. In addition, this paper also demonstrates three implementation case studies to validate the autocalibration workflow and results. Results from these implementations show that SWATShare autocalibration can produce streamflow hydrograph and parameters that are comparable with commonly used offline SWATCUP calibration outputs. In some instances, the parameter values from SWATShare calibration are more physically relevant than those from SWATCUP. Although the discussion in this paper is in the context of SWAT and SWATShare, the conceptual and technical design presented here can be used as an Open Science blueprint for similar CI-enabled developments in other hydrologic models, and more importantly, in other domains of Earth system sciences.
Article
A simulation optimization framework requires a substantial number of model simulations, which are computationally intensive and may be impractical when the model simulations are extremely time-consuming. This paper presents an improved Hadoop-based cloud framework to alleviate the computational burden of optimization. The framework parallelizes conventional sequential-model-based optimization techniques by concurrently orchestrating multiple model computations within Hadoop MapReduce. It guarantees the reliability of simulation optimization tasks by handling node failures without affecting the ongoing simulation. A case study, using Bayesian optimization to calibrate a SWAT model, achieved a speedup of nearly 55–58 when using 100 cores, demonstrating the efficiency of parallelizing the Bayesian optimization algorithm on the Hadoop-based cloud. Experiments in which computing nodes were dynamically increased or decreased demonstrated that the framework can automatically rebalance the workload across the remaining nodes. The framework is readily adaptable to other complex model applications that perform sequential-model-based optimizations or large-scale simulations.
Article
Full-text available
Water security is the capability of a community to have adequate access to good quality and a sufficient quantity of water as well as safeguard resources for the future generations. Understanding the spatial and temporal variabilities of water security can play a pivotal role in sustainable management of fresh water resources. In this study, a long-term water security analysis of the Grand River watershed (GRW), Ontario, Canada, was carried out using the soil and water assessment tool (SWAT). Analyses on blue and green water availability and water security were carried out by dividing the GRW into eight drainage zones. As such, both anthropogenic as well as environmental demand were considered. In particular, while calculating blue water scarcity, three different methods were used in determining the environmental flow requirement, namely, the presumptive standards method, the modified low stream-flow method, and the variable monthly flow method. Model results showed that the SWAT model could simulate streamflow dynamics of the GRW with 'good' to 'very good' accuracy with an average Nash-Sutcliffe Efficiency of 0.75, R 2 value of 0.78, and percentage of bias (PBIAS) of 8.23%. Sen's slope calculated using data from over 60 years confirmed that the blue water flow, green water flow, and storage had increasing trends. The presumptive standards method and the modified low stream-flow method, respectively, were found to be the most and least restrictive method in calculating environmental flow requirements. While both green (0.4-1.1) and blue (0.25-2.0) water scarcity values showed marked temporal and spatial variabilities, blue water scarcity was found to be the highest in urban areas on account of higher water usage and less blue water availability. Similarly, green water scarcity was found to be highest in zones with higher temperatures and intensive agricultural practices. We believe that knowledge of the green and blue water security situation would be helpful in sustainable water resources management of the GRW and help to identify hotspots that need immediate attention.
Article
Full-text available
This paper focuses on understanding the effects of projected climate change on streamflow dynamics of the Grand and Thames rivers of the Northern Lake Erie (NLE) basin. A soil water assessment tool (SWAT) model is developed, calibrated, and validated in a base-period. The model is able to simulate the monthly streamflow dynamics with 'Good' to 'Very Good' accuracy. The calibrated and validated model is then subjected with daily bias-corrected future climatic data from the Canadian Regional Climate Model (CanRCM4). Five bias-correction methods and their 12 combinations were evaluated using the Climate Model data for hydrologic modeling (CMhyd). Distribution mapping (DM) performed the best and was used for further analysis. Two future time-periods and two IPCC AR5 representative concentration pathways (RCPs) are considered. Results showed marked temporal and spatial variability in precipitation (−37% to +63%) and temperature (−3 • C to +14 • C) changes, which are reflected in evapotranspiration (−52% to +412%) and soil water storage (−60% to +12%) changes, resulting in heterogeneity in streamflow (−77% to +170%) changes. On average, increases in winter (+11%), and decreases in spring (-33%), summer (−23%), and autumn (−15%) streamflow are expected in future. This is the first work of this kind in the NLE and such marked variability in water resources availability poses considerable challenges to water resources planners and managers.
Article
Full-text available
Despite various criticisms of GLUE (Generalized Likelihood Uncertainty Estimation), it is still a widely-used uncertainty analysis technique in hydrologic modelling that can give an appreciation of the level and sources of uncertainty. We introduce an augmented GLUE approach based on a Gaussian Process (GP) emulator, involving GP to conduct a Bayesian sensitivity analysis to narrow down the influential factor space, and then performing a standard GLUE uncertainty analysis. This approach is demonstrated for a SWAT (Soil and Water Assessment Tool) application in a watershed in China using a calibration and two validation periods. Results show: 1) the augmented approach led to the screening out of 14–18 unimportant factors, effectively narrowing factor space; 2) compared to the more standard GLUE, it substantially improved the sampling efficiency, and located the optimal factor region at lower computational cost. This approach can be used for other uncertainty analysis techniques in hydrologic and non-hydrologic models.
Article
Hot-spots and hot-moments of phosphorus loads in an agricultural watershed depend not only on thewatershed characteristics but also on the type and intensity of storms. Not all storms will generate phos-phorus that can be considered problematic. A threshold storm is thus proposed and defined as the max-imum storm intensity in which the phosphorus generated in a watershed is below seasonal phosphorustolerance limit. To evaluate the threshold storm approach, separate Agricultural Non-point Source(AGNPS) models for three diverse small agricultural watersheds in southern Ontario, Canada were cali-brated for runoff volume, sediment yield, and total phosphorus and run for representative storms withincreasing return periods (2-year through 100-year). Results showed that in an upland watershed(Holtby), a 4.8-year early spring storm tend to generate phosphorus load above the threshold limit forthe season. The same for low-land watersheds (Wigle and Jeannette) were, respectively, 14.9-year and12.4-year. In all three watersheds, summer storms up to 100-year will fail to reach the seasonal tolerancelimit for phosphorus. The critical source areas, identified based on the threshold storms, were distributeduniformly across the watersheds. As a phosphorus problem is essentially a source problem, such a simpleyet robust approach to identify critical source areas of phosphorus can be useful in designing cost-effective best management practices
Article
Watershed models are widely used in total maximum daily load (TMDL) studies to predict the impacts of pollutant discharges on the biochemical functioning and assimilative capacity of water bodies. The reliability of a TMDL is therefore tightly linked with the predictive capability of these models. While there has been an increasing availability and application of watershed models for TMDL studies, guidelines for model evaluation, including recommendations for an appropriate selection and implementation of calibration, validation, and uncertainty analysis strategies, remain at present limited. The ASCE Environmental and Water Resources Institute (EWRI) TMDL Analysis and Modeling Task Committee was established in part to identify existing and emerging challenges encountered by water resources professionals during any phase of a TMDL development and to produce documentation to address these challenges. This paper reviews existing approaches for model calibration, validation, and uncertainty analysis, including r recommendations to establish baseline modeling practices to obtain a satisfactory watershed model.
Article
This paper reviews 14 prevalent watershed models for their capabilities, credibility, and suitability in total maximum daily load (TMDL) development and implementation. Brief descriptions of the models, including sources, capabilities, and applicability are presented. General information such as intended watershed and simulation types, simulated outputs, uncertainty analysis capabilities, graphical user interface, and availability are also presented. Mathematical bases of the hydrologic and water quality simulations, which indicate credibility, expected performance, and accuracy, and dictate model features (e.g., structure, input data, and parameters) are presented. Routing procedures, the backbones of the models, are compared and ranked. The Gridded Surface and Subsurface Hydrologic Analysis (GSSHA) and MIKE SHE (Système Hydrologique Européen) models rank high on overland and Storm Water Management Model (SWMM) on channel/pipe flow routing as the most accurate for representation of the physical processes and also the most numerically complex. The Generalized Watershed Loading Function (GWLF) and Spreadsheet Tool for the Estimation of Pollutant Load (STEPL) rank at the bottom on both the aspects. The rest of the models are in between, although the Dynamic Watershed Simulation Model (DWSM) is computationally efficient among kinematic wave models. Notable strengths and limitations of the models for TMDL development and implementation are presented. All these provide valuable information on the models, not readily available in a concise form elsewhere, to compare and help determine relative credibility and make informed selections for TMDLs and similar studies. Future research should focus on further comparisons of the models based on other key aspects such as simulation capabilities of processes, uncertainty analysis, required resources, and performances on watersheds followed by developing better models or improving existing ones by strengthening the weaknesses found. Robust physically based algorithms, uncertainty analysis capabilities, and use of remotely sensed and high-resolution data are recommended to be part of the model improvements.
Article
Two types of sampling plans are examined as alternatives to simple random sampling in Monte Carlo studies. These plans are shown to be improvements over simple random sampling with respect to variance for a class of estimators which includes the sample mean and the empirical distribution function.
Article
Large-scale, long-term and high spatial resolution simulation is a common issue in environmental modeling. A Gridded Hydrologic Response Unit (HRU)-based Soil and Water Assessment Tool (SWATG) that integrates grid modeling scheme with different spatial representations also presents such problems. The time-consuming problem affects applications of very high resolution large-scale watershed modeling. The OpenMP (Open Multi-Processing) parallel application interface is integrated with SWATG (called SWATGP) to accelerate grid modeling based on the HRU level. Such parallel implementation takes better advantage of the computational power of a shared memory computer system. We conducted two experiments at multiple temporal and spatial scales of hydrological modeling using SWATG and SWATGP on a high-end server. At 500-m resolution, SWATGP was found to be up to nine times faster than SWATG in modeling over a roughly 2,000 km² watershed with 1 CPU and a 15 thread configuration. The study results demonstrate that parallel models save considerable time relative to traditional sequential simulation runs. Parallel computations of environmental models are beneficial for model applications, especially at large spatial and temporal scales and at high resolutions. The proposed SWATGP model is thus a promising tool for large-scale and high-resolution water resources research and management in addition to offering data fusion and model coupling ability.
Article
Proper management of blue and green water resources is important for the sustainability of ecosystems and for the socio-economic development of river basins such as the Athabasca River Basin (ARB) in Canada. For this reason, quantifying climate change impacts on these water resources at a finer temporal and spatial scale is often necessary. In this study, we used a Soil and Water Assessment Tool (SWAT) to assess climate change impacts on fresh water resources, focusing explicitly on the impacts to both blue and green water. We used future climate data generated by the Canadian Center for Climate Modelling and Analysis Regional Climate Model (CanRCM4) with a spatial resolution of 0.22° × 0.22° (~ 25 km) for two emission scenarios (RCP 4.5 and 8.5). Results projected the climate of the ARB to be wetter by 21–34% and warmer by 2–5.4 °C on an annual time scale. Consequently, the annual average blue and green water flow was projected to increase by 16–54% and 11–34%, respectively, depending on the region, future period, and emission scenario. Furthermore, the annual average green water storage at the boreal region was expected to increase by 30%, while the storage was projected to remain fairly stable or decrease in other regions, especially during the summer season. On average, the fresh water resources in the ARB are likely to increase in the future. However, evidence of temporal and spatial heterogeneity could pose many future challenges to water resource planners and managers.