ArticlePDF Available

Regional Flood Frequency Analysis for a Poorly Gauged Basin Using the Simulated Flood Data and L-Moment Method

Authors:

Abstract and Figures

The design of hydraulic structures and the assessment of flood control measures require the estimation of flood quantiles. Since observed flood data are rarely available at the specific location, flood estimation in un-gauged or poorly gauged basins is a common problem in engineering hydrology. We investigated the flood estimation method in a poorly gauged basin. The flood estimation method applied the combination of rainfall-runoff model simulation and regional flood frequency analysis (RFFA). The L-moment based index flood method was performed using the annual maximum flood (AMF) data simulated by the rainfall-runoff model. The regional flood frequency distribution with 90% error bounds was derived in the Chungju dam basin of Korea, which has a drainage area of 6648 km2. The flood quantile estimates based on the simulated AMF data were consistent with the flood quantile estimates based on the observed AMF data. The widths of error bounds of regional flood frequency distribution increased sharply as the return period increased. The results suggest that the flood estimation approach applied in this study has the potential to estimate flood quantiles when the hourly rainfall measurements during major storms are widely available and the observed flood data are limited.
Content may be subject to copyright.
water
Article
Regional Flood Frequency Analysis for a Poorly
Gauged Basin Using the Simulated Flood Data and
L-Moment Method
Do-Hun Lee 1, * and Nam Won Kim 2
1Department of Civil Engineering, Kyung Hee University, Yongin 17104, Korea
2Department of Land, Water and Environment Research, Korea Institute of Civil Engineering and Building
Technology, Goyang 10223, Korea
*Correspondence: dohlee@khu.ac.kr
Received: 31 July 2019; Accepted: 16 August 2019; Published: 18 August 2019


Abstract:
The design of hydraulic structures and the assessment of flood control measures require
the estimation of flood quantiles. Since observed flood data are rarely available at the specific
location, flood estimation in un-gauged or poorly gauged basins is a common problem in engineering
hydrology. We investigated the flood estimation method in a poorly gauged basin. The flood
estimation method applied the combination of rainfall-runomodel simulation and regional flood
frequency analysis (RFFA). The L-moment based index flood method was performed using the annual
maximum flood (AMF) data simulated by the rainfall-runomodel. The regional flood frequency
distribution with 90% error bounds was derived in the Chungju dam basin of Korea, which has a
drainage area of 6648 km
2
. The flood quantile estimates based on the simulated AMF data were
consistent with the flood quantile estimates based on the observed AMF data. The widths of error
bounds of regional flood frequency distribution increased sharply as the return period increased. The
results suggest that the flood estimation approach applied in this study has the potential to estimate
flood quantiles when the hourly rainfall measurements during major storms are widely available and
the observed flood data are limited.
Keywords:
regional flood frequency analysis; L-moment; index flood; poorly gauged basin;
HEC-HMS
1. Introduction
Flood estimates are needed to design hydraulic structures and assess flood control measures.
Since flood data in the investigated location are rarely available, reliable estimation of flood quantiles
in un-gauged or poorly gauged basins is a common problem in engineering hydrology. In the past,
various regional flood frequency analysis (RFFA) methods have been applied to estimate the flood
quantiles in ungauged or poorly gauged basins [
1
,
2
]. The RFFA can be performed by pooling the flood
discharge data observed at dierent sites within a region. However, it is very dicult to perform
the RFFA in Korea because of limited regional flood discharge data. In practice, the flood quantiles
in Korea are generally estimated using the rainfall-based design event approach. The design event
approach is based on the transformation of design rainfall into design flood [3,4].
The joint probability approach has been proposed to resolve the limitations of the design
event approach. The joint probability approach considers probability-distributed inputs and model
parameters using a Monte Carlo simulation approach [
4
,
5
]. Continuous simulation approaches have
become more popular as a complement to the RFFA. The continuous simulation approach applies the
continuous model with long rainfall inputs to derive the flood frequency distribution [
6
8
]. These
methods might have computational constraints for applications in engineering practice.
Water 2019,11, 1717; doi:10.3390/w11081717 www.mdpi.com/journal/water
Water 2019,11, 1717 2 of 15
In this study, we explore the alternative flood estimation method, which applies the combination
of event rainfall-runomodel simulation and the RFFA. First, the rainfall-runomodel simulates
the regional flood data for major storm events. Subsequently, the L-moment based index flood
method is performed using the simulated annual maximum flood (AMF) sampled at many sites
within a homogeneous region. The flood estimation method developed in this study is motivated
by the following consideration. Rainfall data for major storms are widely available at the study
basin. The L-moment method provides less bias in flood estimation and is universally applied in the
literature [916].
Use of the simulated flood data in the RFFA was previously proposed by Kim et al. [
17
]. They
applied the RFFA to AMF data generated from the storage function runomodel. They called their
approach the spatial data extension method. Since the RFFA that employs the spatial data extension
method is based on the simulated AMF, it is important to assess the applicability of the method in the
RFFA. In this study, we aim to test the spatial data extension method using a rainfall-runomodel
and modeling strategy which are dierent from the previous study [
17
]. The proposed rainfall-runo
modeling strategy allowed us to generate the AMF data in a poorly gauged basin. In the present
context, the poorly gauged basin is the basin which has long-term flood data at one or two hydrometric
stations such that it is infeasible to apply RFFA using the observed AMF. The simulated AMF data
were used to derive a regional growth curve and index flood in the study basin. The uncertainty of the
regional growth curve was also estimated using the Monte Carlo simulation approach suggested by
Hosking and Wallis [
9
]. The estimated flood quantiles were evaluated using the AMF data observed
at the outlet and one interior location in the study basin. Our results were compared to those of the
previous study [17] to assess the applicability of the developed flood estimation method.
2. Materials and Methods
2.1. Study Region and Data
In this study, we investigated in the Chungju dam basin, which is located on the Namhan river in
Korea. The drainage area of the basin is 6648 km
2
, and the Chungju dam (CJD) is located at the outlet
of the basin (Figure 1). The average annual precipitation is 1330 mm, and 70% of the rain falls in the
monsoon season from June to September [
18
]. The main land use type of the basin is forests, which
occupy 82% and the agricultural area covers 13% of the basin.
Water 2019, 11, x FOR PEER REVIEW 2 of 15
In this study, we explore the alternative flood estimation method, which applies the combination
of event rainfall-runoff model simulation and the RFFA. First, the rainfall-runoff model simulates the
regional flood data for major storm events. Subsequently, the L-moment based index flood method
is performed using the simulated annual maximum flood (AMF) sampled at many sites within a
homogeneous region. The flood estimation method developed in this study is motivated by the
following consideration. Rainfall data for major storms are widely available at the study basin. The
L-moment method provides less bias in flood estimation and is universally applied in the literature
[9–16].
Use of the simulated flood data in the RFFA was previously proposed by Kim et al. [17]. They
applied the RFFA to AMF data generated from the storage function runoff model. They called their
approach the spatial data extension method. Since the RFFA that employs the spatial data extension
method is based on the simulated AMF, it is important to assess the applicability of the method in
the RFFA. In this study, we aim to test the spatial data extension method using a rainfall-runoff model
and modeling strategy which are different from the previous study [17]. The proposed rainfall-runoff
modeling strategy allowed us to generate the AMF data in a poorly gauged basin. In the present
context, the poorly gauged basin is the basin which has long-term flood data at one or two
hydrometric stations such that it is infeasible to apply RFFA using the observed AMF. The simulated
AMF data were used to derive a regional growth curve and index flood in the study basin. The
uncertainty of the regional growth curve was also estimated using the Monte Carlo simulation
approach suggested by Hosking and Wallis [9]. The estimated flood quantiles were evaluated using
the AMF data observed at the outlet and one interior location in the study basin. Our results were
compared to those of the previous study [17] to assess the applicability of the developed flood
estimation method.
2. Materials and Methods
2.1. Study Region and Data
In this study, we investigated in the Chungju dam basin, which is located on the Namhan river
in Korea. The drainage area of the basin is 6648 km2, and the Chungju dam (CJD) is located at the
outlet of the basin (Figure 1). The average annual precipitation is 1330 mm, and 70% of the rain falls
in the monsoon season from June to September [18]. The main land use type of the basin is forests,
which occupy 82% and the agricultural area covers 13% of the basin.
Figure 1. The Chungju dam basin and sites used for the RFFA (left). The divided sub-watershed is
based on water resource unit map defined in WAMIS (www.wamis.go.kr).
Figure 1 shows the rainfall and discharge stations used in this study. Major storm events for each
year were selected from 1987 to 2010 (Table S1 in Supplementary Materials). These storm events
would produce the maximum peak discharge in a given year. For some years, multiple storm events
are considered to sample the maximum peak discharge at various sites because the maximum peak
Figure 1.
The Chungju dam basin and sites used for the RFFA (left). The divided sub-watershed is
based on water resource unit map defined in WAMIS (www.wamis.go.kr).
Figure 1shows the rainfall and discharge stations used in this study. Major storm events for
each year were selected from 1987 to 2010 (Table S1 in Supplementary Materials). These storm
events would produce the maximum peak discharge in a given year. For some years, multiple
Water 2019,11, 1717 3 of 15
storm events are considered to sample the maximum peak discharge at various sites because the
maximum peak discharge at various sites depends on the spatial and temporal variability of rainfall
characteristics. The flood data were collected at two locations: The dam inflow data at the CJD and
the water level data measured at the Yeongchun (YC) station were converted into discharge using
rating curve. The studied basin is considered a poorly gauged basin because long-term flood data are
available only at two hydrometric stations. These hydrological data have been managed by K-Water
(http://www.kwater.or.kr) and WAMIS (http://www.wamis.go.kr). The other data used for the analysis
are the DEM, with 30 m resolution, a 1:25,000 land cover map constructed by Ministry of Environment,
and a 1:25,000 soil map constructed by National Institute of Agricultural Sciences. The hydrologic soil
groups classified for Korean soils are also available from the National Institute of Agricultural Sciences.
2.2. Generation of Regional Flood Data Using Rainfall-RunoModel
The simulation of regional flood data in this study is based on HEC-HMS [
19
]. The HEC-HMS
provides various components and models for rainfall-runomodeling. The event simulation of
the rainfall-runorelation requires models for rainfall loss, excess rainfall transformation, baseflow,
and channel flow routing. The loss of rainfall is based on the SCS curve number method. The
transformation of excess rainfall into runois calculated using the SCS and Clark unit hydrograph
methods. The reasons for using these unit hydrograph methods are twofold: (1) the number of
parameters in the unit hydrograph methods is small and (2) the parameters can be identified using the
empirical formula developed for the Chungju dam basin. A simple model with a reduced number
of parameters was suggested to simulate flood formation process to reduce the uncertainty of model
predictions [
20
,
21
]. The exponential recession model was applied for modeling baseflow components.
The Muskingum–Cunge model was used for channel flow routing. The Muskingum–Cunge model is a
physically based model and can be used for ungauged watersheds. The HEC-HMS manual provides
detailed descriptions for the equations of rainfall-runocomponents.
The first step in setting up HEC-HMS is to delineate sub-watersheds and stream networks. The
entire Chungju dam basin is divided into 57 sub-watersheds based on map of the standard watershed
(Figure 1). The standard watersheds in Korea are defined for the eective identification of watersheds
and the management of water resources (http://www.wamis.go.kr). The average areal rainfall for each
sub-watershed is defined by the Thiessen polygon method.
The parameters and conditions for each watershed and channel reach should be identified to
simulate the response of rainfall-runorelations. For a gauged watershed, a calibration can be
performed to identify the model parameters using the observed flow data. However, it is dicult to
identify the model parameters in an ungauged or poorly gauged watershed. Predicting ungauged
basins has been a great challenge for hydrology community [22,23].
The parameters related to SCS curve number model are the initial abstraction ratio (
λ
) and curve
number (
CN
). The empirical relationship between initial abstraction (
Ia
) and potential maximum
retention (S) is given as
Ia=λS(1)
The relationship between Sand CN is given as
S=25400
CN 254 (2)
The initial abstraction ratio in the original SCS curve number equation is defined as
λ
=0.2.
However, many studies suggested a dierent
λ
value of 0.05 [
24
,
25
]. We applied two dierent initial
abstraction ratios of
λ
=0.2 and
λ
=0.05. The use of
λ
=0.05 requires a new set of
CN
values [
25
]. The
CN values for λ=0.05 can be obtained from CN values for λ=0.2 as
CN(λ=0.05)= 100
1.879100/CN(λ=0.2)11.15 +1(3)
Water 2019,11, 1717 4 of 15
For the ungauged watershed, the curve number can be estimated from the
CN
table of
USDA-NRCS [
26
] based on land use/cover and soil information. The curve number might vary
from event to event, and the variation in the
CN
results from dierences in rainfall intensity and
duration, total rainfall, soil moisture conditions, cover density, stage of growth, and temperature [
26
].
The three classes of antecedent runoconditions (ARC) are defined in USDA-NRCS [
26
] to reflect the
variability in
CN
: ARC II for average conditions, ARC I for dry conditions, ARC III for wet conditions.
The parameters needed for Clark and SCS unit hydrograph models are the time of concentration
(
Tc
), storage coecient (
R
) and lag time (
TL
). Lee et al. [
27
] developed a regionalized empirical formula
for
Tc
and
R
in the Chungju dam basin. The empirical formulas are expressed as a function of channel
length (L) and channel slope (S):
Tc=0.306(L2/S)0.256
R=0.786(L2/S)0.195 (4)
The USDA-NRCS [26] suggests the relationship between TLand Tc:
TL=0.6Tc(5)
These empirical formulas are applied to estimate the parameters
Tc
,
R
and
TL
for
each sub-watershed.
The Muskingum–Cunge model for channel flow routing requires a description of channel cross
section, reach length, roughness coecient, and energy slope. The trapezoidal channel configuration
was assumed to be a representative cross section. The energy slope was estimated by the channel bed
slope. The various field investigation reports in WAMIS were used to estimate the principal dimension
of channel section, reach length, and Manning roughness coecients. A total of 42 reaches were
defined for setting up the Muskingum–Cunge model in the Chungju dam basin. One representative
cross section per reach was defined in setting up the Muskingum–Cunge model.
The exponential recession model includes a specification for the initial baseflow and an exponential
decay constant. The initial baseflow is assumed to be equal to an initial discharge at the start of the
storm event. The specific discharges (discharge per unit area) for each storm event were estimated
using initial discharges observed at the CJD and YC stations. The average specific discharge was
estimated to be 0.04 m
3
/s/km
2
in the Chungju dam basin. The initial baseflow for each sub-watershed
can be specified by multiplying the specific discharge by the area of a sub-watershed. The exponential
decay constant suggested by Pilgrim and Cordery [28] was used in the simulation.
A number of simulations for each storm event were carried out using a combination of two unit
hydrograph methods (Clark and SCS), two
λ
values (0.2 and 0.05), and two ARC levels (II and III). The
selections of these conditions and methods were based on conditioning the observed peak discharge
data of each storm event at the CJD and YC stations. After the regional flood data were simulated for
each storm event, the simulated AMF data at various sites were extracted by selecting the largest peak
discharge in a given year.
2.3. Regional Flood Frequency Analysis
2.3.1. Index Flood Method
The index flood method originally proposed by Dalrymple [
29
] is based on the main hypothesis
that the frequency distributions of sites within a homogeneous region are identical apart from a
site-specific scaling factor (index flood). The quantile function at a site
i
,
Qi(F)
is estimated by the
product of the index flood (µi) and regional growth curve as
Qi(F)= µiq(F)(6)
Water 2019,11, 1717 5 of 15
In the equation,
F
implies the non-exceedance probability. The regional growth curve
q(F)
is the
dimensionless frequency distribution common to all sites within a homogeneous region. The index
flood is usually estimated as the mean of the AMF data at a site
i
. The other location estimator (such as
a median) can be used as the index flood [
14
]. Bocchiola et al. [
30
] reviewed the various methods for
estimating index flood at gauged and ungauged watersheds.
2.3.2. L-moment
The L-moment has theoretical advantages over the conventional moments. In contrast to the
conventional moments, the L-moment is reliably applied for the regional frequency analysis [
9
]. The
L-moment can characterize a wide range of distributions. The location, scale, and shape estimators
of the L-moment are unbiased irrespective of the probability distribution. The L-moments are more
robust to the presence of outliers in the data and are related to the probability weighted moment
(PWM). Greenwood et al. [
31
] introduced the PWM of a random variable and the L-moment is defined
by the linear combinations of PWM. Using the location measure (
λ1
) and scale measure (
λ2
), Hosking
and Wallis [9] defined the L-moment ratios as:
L-CV: τ=λ2/λ1
L-skewness: τ3=λ3/λ2
L-kurtosis: τ4=λ4/λ2
For a distribution which takes only positive values, the range of L-CV is 0
τ
<1. The ranges of
L-skewness and L-kurtosis are given as 1<τ3<1 and 1<τ4<1.
2.3.3. Discordancy and Heterogeneity Measure
The statistical tests for AMF data need to be performed to check the adequacy of data used for the
RFFA. The randomness test is based on the randtests package in R software [
32
]. The Kendall package
(https://www.rdocumentation.org/packages/Kendall/versions/2.2) based on the method of Mann [
33
]
was applied for the trend test. The outlier test was performed by the Bulletin 17B method [
34
] and the
G-B method [
35
]. These tests indicated that the simulated AMF data satisfied all assumptions needed
for the RFFA.
A discordancy measure can be used to check that the data are valid for the regional frequency
analysis. It allows one to detect sites that are discordant with the group as a whole. Hosking and
Wallis [9] defined the discordancy measure for a site ias
Di=N
3(uiu)TC1(uiu)(7)
where
N
is the number of sites,
ui
=
[τiτi
3τi
4]
is the vector containing the L-moment ratios,
u
is the
un-weighted group average of
ui
, and
C
is the sample covariance matrix of sums of squares and
cross-products. The site
i
is considered to be discordant with the group when
Di
is greater than the
critical value. The critical values depend on the number of sites in the region. When the number of
sites in the region exceeds 15, the critical value is three [9].
A test of regional homogeneity is required for the regional frequency analysis to assess whether a
proposed region might be regarded as a homogeneous region. The heterogeneity measure was proposed
to estimate the degree of heterogeneity in a group of sites [
36
]. The heterogeneity measure compares
the between-site dispersion of the sample L-moment ratios for the region under consideration with that
of the simulated L-moment ratios for a homogeneous region. A simple measure of the between-site
dispersion of the L-moment ratios is the weighted standard deviation of the L-moment ratios.
The heterogeneity statistic (H) can be calculated as
H=VoµV
σV
(8)
Water 2019,11, 1717 6 of 15
where
Vo
is the weighted standard deviation of the observed L-CV, and
µV
and
σV
are the mean and
standard deviation of
Nsim
values of
V
calculated for each simulated homogeneous region. The Monte
Carlo simulation was used to generate
Nsim
realizations of a homogeneous region whose frequency
distribution was the kappa distribution with four parameters. The proposed region is considered to
be heterogeneous when
H
is suciently large. Hosking and Wallis [
9
] suggest that the region under
consideration is “acceptably homogeneous” if
H
<1, “possibly heterogeneous” if 1
H
<2, and
“definitely heterogeneous” if H2.
2.3.4. Selection and Estimation of Regional Growth Curve
There are dierent methods for selecting an appropriate regional frequency distribution that
describes the features of the sample data. The L-moment ratio diagram plots L-skewness versus
L-kurtosis. This plot can be used for a visual comparison of L-moment ratios between sample data and
candidate distributions. The selection of a frequency distribution based on the L-moment diagram is
not quantitative and rather subjective. However, there are some quantitative measures proposed in the
literature. The Zstatistic proposed by Hosking and Wallis [9] is defined as
ZDIST = (τDIST
4tR
4+B4)/σ4(9)
In this equation,
DIST
indicates any candidate distribution. The three parameter distributions
applied for the RFFA are the generalized logistic (GLO), generalized extreme value (GEV), generalized
normal (GNO), Pearson type III (PE3), and generalized Pareto (GP). The L-kurtosis of the fitted
distribution is indicated by
τDIST
4
, and
tR
4
is the regional average L-kurtosis weighted proportionally to
the record length of sites. The bias of
tR
4
is indicated by
B4
, and
σ4
is the standard deviation of
tR
4
. The
calculation of B4and σ4is based on the fitting of a kappa distribution and Monte Carlo simulation of
kappa distributed regions. Further details of calculating
B4
and
σ4
can be explained in Hosking and
Wallis [9]. Any candidate distributions are considered to be adequate fits when ZDIST1.64.
Kroll and Vogel [
37
] proposed the average weighted orthogonal distance (AWOD). The AWOD
measures the average weighted distance between sample L-moment ratios and theoretical L-moment
ratios of a given frequency distribution. The AWOD is defined as
AWOD =
N
X
i=1
nidi,N
X
i=1
ni(10)
where
di
is the orthogonal distance between the sample L-kurtosis at site
i
and the theoretical L-kurtosis
for a given distribution. A candidate distribution having the smallest AWOD value is considered as
the best distribution.
Once the appropriate regional frequency distribution is identified, the form of the regional growth
curve is known apart from the undetermined parameters. The regional L-moment algorithm is applied
to estimate the undetermined parameters using L-moment ratios [9].
2.3.5. The Accuracy of the Estimated Regional Growth Curve
Results of statistical analysis might be uncertain, and some assessment of uncertainty should
be performed. The Monte Carlo simulation approach was suggested by Hosking and Wallis [
9
] to
estimate the accuracy of the estimated regional frequency distribution. The simulations should agree
with the specific characteristics of the data from which the estimates are computed. Hence, the number
of sites, record length at each site, and the regional average L-moments ratios of the simulated region
are the same as those of the actual region. The simulated region might include heterogeneity and
inter-site dependence of the data. The heterogeneity measure computed from the simulated region
needs to be consistent with that computed from the actual region. The observed sample L-moment
ratios cannot be used as the population L-moment ratios of the simulated region because this will
Water 2019,11, 1717 7 of 15
result in a simulated region that has much more heterogeneity than the actual region [
9
]. Therefore,
the variations in the L-moment ratios of the simulated region are usually set to be less than that of
sample L-moment ratios of the actual region.
According to Hosking and Wallis [
9
], the overall accuracy measure for the estimated regional
growth curve is given as
RR(F)= 1
N
N
X
i=1
Ri(F)(11)
In this equation,
RR(F)
implies the regional average relative RMSE of the estimated regional
growth curve. The relative RMSE of the estimated regional growth curve at a site iaveraged over all
M
averaged over all simulations is given as
Ri(F)=
1
M
M
X
m=1(ˆ
qm
i(F)qi(F)
qi(F))2
0.5
(12)
In this equation,
ˆ
qm
i(F)
indicates the estimated growth curve of a site
i
at the m
th
repetition, and
qi(F)
implies the true growth curve of a site
i
. For a homogeneous region, the growth curves are
the same at all sites such that
ˆ
qm
i(F)
and
qi(F)
in Equation (12) can be replaced with
ˆ
qm(F)
and
q(F)
.
Hosking and Wallis [9] defined 90% error bounds for the estimated growth curve as
ˆ
q(F)
U0.05(F)q(F)ˆ
q(F)
L0.05(F)(13)
In this relation,
L0.05(F)
is some value below which 5% of the simulated values of
ˆ
q(F)/q(F)
lie,
and U0.05(F)is some value above which 5% of the simulated values of ˆ
q(F)/q(F)lie.
2.4. Performance Evaluation Measures
The coecient eciency (CE) and coecient of persistence (CP) were used as the evaluation
measure of the simulated results. Cheng et al. [
38
] suggested the use of these two measures for the
evaluation of real-time flood forecasting model. The CE and CP are defined as
CE =1Pn
t=1Qo
tQs
t2
Pn
t=1Qo
tQo
t2CP =1Pn
t=1Qo
tQs
t2
Pn
t=1Qo
tQo
tk2
Qo
t
is the observed value at time t,
Qo
t
is the average of the observed value,
Qs
t
is the simulated value at
time t. Since the average time of concentration for sub-watersheds of this study basin is about 4 h, the
CP is estimated using k =3 h. The CE and CP dier only in the denominator term. The CE and CP
values range from −∞ to 1. The model performance is the best when CE and CP values become one.
3. Results
3.1. Assessment of the Simulated Flood Discharge
To understand the adequacy of the simulated flood discharge, the simulation performance was
assessed at two locations. The assessment results for the simulated hydrographs were described in
Table S2 of the Supplementary Materials. For some storm events, the CE and CP values are not given in
Table S2 because some parts of the observed discharge hydrograph are not reliable. The computed CE
values for the simulated hydrographs ranged from 0.62 to 0.98 while the computed CP values ranged
from
5.41 to 0.74. The performance of the simulated hydrographs was satisfactory in terms of CE
measure. But the simulation results for low and medium peak discharges exhibited poor performance
with low CP values. The Supplementary Materials (Figures S1–S3) illustrate the comparison of the
Water 2019,11, 1717 8 of 15
simulated and observed hourly hydrographs selected from storm events with high, medium and low
peak discharges. The simulated and observed hydrographs were in good agreement for large flood
events (Figure S1) while the medium and small flood events (Figures S2 and S3) showed less agreement
between simulated and observed hydrographs.
Figure 2compares the observed AMF with the simulated AMF at CJD and YC locations. The
simulated AMF shows good agreement with the observed AMF. The estimated coecient of correlation
exceeds 0.98 for both locations. The computed CE value was 0.96 for the CJD location and 0.95 for the
YC location. It is noted that for very large flood events, there is a trade-oin simulation performance
between CJD and YC since the AMF is underestimated in CJD and overestimated in YC. This indicates
that there is a diculty in identifying the spatial parameters and conditions for a semi-distributed
rainfall-runomodel. Based on the assessment of the simulated flood responses at two locations, we
hypothesize that the simulated flood responses are a reasonable approximation to the actual flood
responses throughout the study basin.
Water 2019, 11, x FOR PEER REVIEW 8 of 15
ranged from 5.41 to 0.74. The performance of the simulated hydrographs was satisfactory in terms
of CE measure. But the simulation results for low and medium peak discharges exhibited poor
performance with low CP values. The Supplementary Materials (Figure S1–Figure S3) illustrate the
comparison of the simulated and observed hourly hydrographs selected from storm events with high,
medium and low peak discharges. The simulated and observed hydrographs were in good agreement
for large flood events (Figure S1) while the medium and small flood events (Figure S2 and S3) showed
less agreement between simulated and observed hydrographs.
Figure 2 compares the observed AMF with the simulated AMF at CJD and YC locations. The
simulated AMF shows good agreement with the observed AMF. The estimated coefficient of
correlation exceeds 0.98 for both locations. The computed CE value was 0.96 for the CJD location and
0.95 for the YC location. It is noted that for very large flood events, there is a trade-off in simulation
performance between CJD and YC since the AMF is underestimated in CJD and overestimated in YC.
This indicates that there is a difficulty in identifying the spatial parameters and conditions for a semi-
distributed rainfall-runoff model. Based on the assessment of the simulated flood responses at two
locations, we hypothesize that the simulated flood responses are a reasonable approximation to the
actual flood responses throughout the study basin.
Figure 2. Comparison of observed AMF and simulated AMF. R indicates the coefficient of correlation
and CE indicates the coefficient of efficiency. The blue dotted line indicates 1:1 line.
3.2. Regional Flood Frequency Distribution and its Uncertainty
The flood quantiles at any sites within a homogeneous region can be estimated by Equation (6).
The regional growth curve in Equation (6) was estimated using the RFFA based on the simulated
AMF data at the 20 sites shown in Figure 1. The watershed area of the sites is very diverse and ranges
from 109 km2 to 6648 km2. Hosking and Wallis [9] suggested that there is little gain in accuracy for
RFFA using more than 20 sites.
Identification of the homogeneous region can be performed by applying the discordancy
measure of Equation (7) and heterogeneity measure of Equation (8). The computed discordancy
values for 20 sites ranged from 0.09 to 2.06. Since the computed discordancy values were less than
the critical value of three, no sites were discordant with the group. The heterogeneity measure was
estimated as
H
= 0.59. The negative value of
H
might be related to a positive correlation between
AMF data at different sites. Therefore, the entire Chungju dam region was considered to be
acceptably homogeneous because
H
was less than one.
The goodness-of-fit measure was computed by Equation (9) to test whether the candidate
distributions fit the AMF data closely. Table 1 shows
Z
statistics calculated for five candidate
distributions. The
Z
values of three distributions (GLO, GEV, and GNO) are lower than the critical
Z
value of 1.64. For these distributions, Table 1 shows AWOD values calculated using Equation (10).
It appears that the GEV distribution has the lowest
Z
and AWOD values. The L-moment ratio
diagram is plotted in Figure 3 to compare the theoretical and sample L-moment ratios. The filled
circle dot in the figure indicates the regional average of sample L-moment ratios.
Figure 2.
Comparison of observed AMF and simulated AMF. R indicates the coecient of correlation
and CE indicates the coecient of eciency. The blue dotted line indicates 1:1 line.
3.2. Regional Flood Frequency Distribution and its Uncertainty
The flood quantiles at any sites within a homogeneous region can be estimated by Equation (6).
The regional growth curve in Equation (6) was estimated using the RFFA based on the simulated AMF
data at the 20 sites shown in Figure 1. The watershed area of the sites is very diverse and ranges from
109 km
2
to 6648 km
2
. Hosking and Wallis [
9
] suggested that there is little gain in accuracy for RFFA
using more than 20 sites.
Identification of the homogeneous region can be performed by applying the discordancy measure
of Equation (7) and heterogeneity measure of Equation (8). The computed discordancy values for 20
sites ranged from 0.09 to 2.06. Since the computed discordancy values were less than the critical value
of three, no sites were discordant with the group. The heterogeneity measure was estimated as
H
=
0.59. The negative value of
H
might be related to a positive correlation between AMF data at dierent
sites. Therefore, the entire Chungju dam region was considered to be acceptably homogeneous because
Hwas less than one.
The goodness-of-fit measure was computed by Equation (9) to test whether the candidate
distributions fit the AMF data closely. Table 1shows
Z
statistics calculated for five candidate
distributions. The
Z
values of three distributions (GLO, GEV, and GNO) are lower than the critical
Z
value of 1.64. For these distributions, Table 1shows AWOD values calculated using Equation (10). It
appears that the GEV distribution has the lowest
Z
and AWOD values. The L-moment ratio diagram is
plotted in Figure 3to compare the theoretical and sample L-moment ratios. The filled circle dot in the
figure indicates the regional average of sample L-moment ratios.
Water 2019,11, 1717 9 of 15
Table 1. Goodness-of-fit and AWOD statistics for candidate distributions.
Distribution Z Value AWOD
GLO 0.71 0.0321
GEV 0.66 0.0261
GNO 1.31 0.0284
PE3 2.5 -
GP 4.11 -
Water 2019, 11, x FOR PEER REVIEW 9 of 15
Table 1. Goodness-of-fit and AWOD statistics for candidate distributions.
Distribution Z Value AWOD
GLO 0.71 0.0321
GEV 0.66 0.0261
GNO 1.31 0.0284
PE3 2.5 -
GP 4.11 -
Figure 3. L-moment ratio diagram (left) and the comparison of regional growth curves (right). Kim is
the result of Kim et al. [17].
Once the homogeneity of the region was tested and the proper regional frequency distribution
was selected, the regional growth curve )(Fq can be estimated using a regional L-moment
algorithm explained in Section 2.3. The estimated growth curves are shown in the Figure 3. The
growth curves for GEV and GNO distributions were considered and compared with the result of the
previous study [17]. The estimated growth curves between GEV and GNO distributions were very
similar for the return period smaller than 100 years.
The Monte Carlo simulation approach explained in Section 2.3.5 was applied to assess the
uncertainty of the estimated growth curve. In the simulations, the following statistics between the
simulated region and the actual region were set to be the same: The record length at each site, the
regional average L-moments ratios and the average inter-site correlation. The range of L-CV and L-
skewness of a simulated region was set to 0.13, which is smaller than the range of sample L-moment
ratios. The average heterogeneity values computed from 500 simulated regions were
H
= 0.23 for
the GEV distribution and
H
= 0.39 for the GNO distribution. Since these values were slightly higher
than those of actual region, a little more heterogeneity was allowed in the simulation.
Figure 4 shows the regional growth curves with upper and lower error bounds. These error
bounds were estimated by Equation (13). Figure 5 shows the widths of the error bounds and the
regional average relative RMSE of the estimated growth curve computed by Equation (11). The
widths of the error bounds and the regional average relative RMSE between GEV and GNO
distributions were very similar for the return period smaller than 100 years. The width of the error
bounds and the regional average relative RMSE of the GEV distribution becomes greater than that of
the GNO distribution when the return period exceeds 100 years.
Figure 3.
L-moment ratio diagram (
left
) and the comparison of regional growth curves (
right
). Kim is
the result of Kim et al. [17].
Once the homogeneity of the region was tested and the proper regional frequency distribution
was selected, the regional growth curve
q(F)
can be estimated using a regional L-moment algorithm
explained in Section 2.3. The estimated growth curves are shown in the Figure 3. The growth curves for
GEV and GNO distributions were considered and compared with the result of the previous study [
17
].
The estimated growth curves between GEV and GNO distributions were very similar for the return
period smaller than 100 years.
The Monte Carlo simulation approach explained in Section 2.3.5 was applied to assess the
uncertainty of the estimated growth curve. In the simulations, the following statistics between the
simulated region and the actual region were set to be the same: The record length at each site, the
regional average L-moments ratios and the average inter-site correlation. The range of L-CV and
L-skewness of a simulated region was set to 0.13, which is smaller than the range of sample L-moment
ratios. The average heterogeneity values computed from 500 simulated regions were
H
=0.23 for the
GEV distribution and
H
=0.39 for the GNO distribution. Since these values were slightly higher than
those of actual region, a little more heterogeneity was allowed in the simulation.
Figure 4shows the regional growth curves with upper and lower error bounds. These error
bounds were estimated by Equation (13). Figure 5shows the widths of the error bounds and the
regional average relative RMSE of the estimated growth curve computed by Equation (11). The widths
of the error bounds and the regional average relative RMSE between GEV and GNO distributions
were very similar for the return period smaller than 100 years. The width of the error bounds and
the regional average relative RMSE of the GEV distribution becomes greater than that of the GNO
distribution when the return period exceeds 100 years.
Water 2019,11, 1717 10 of 15
m
Figure 4.
Regional growth curves with 90% error bounds. GEV distribution (
left
); GNO
distribution (right).
Water 2019, 11, x FOR PEER REVIEW 10 of 15
Figure 4. Regional growth curves with 90% error bounds. GEV distribution (left); GNO distribution
(right).
Figure 5. Width of error bounds and regional average relative RMSE of the estimated growth curves.
width of error bounds (left); regional average relative RMSE (right).
3.3. Assessment of Index Flood and Flood Quantiles
The index flood at any site can be estimated by averaging AMF data when the AMF data are
available at a site. The index flood needs to be estimated indirectly for ungauged watersheds where
AMF data are not available. In this study, the regionalized relation for the index flood was developed
using the scale invariance properties of the index flood with respect to watershed area. The previous
studies also applied the watershed area as the principal variable for predicting the index flood
[14,39,40]. The power law equation for the index flood is represented by considering the first order
moment of the peak discharge distribution within a homogeneous region [30].
)A(q = )1(qm
A
In this equation, A is the watershed area in km2,
m
is a scaling exponent, and )1(q is the
index flood associated with a unit watershed area. Using the simulated AMF data of 20 sites in Figure
1, the power law relation between )A(q and A is determined as
)A(q = 2.987 0.907
A A>100 km2
Figure 6 shows the developed index flood relation and the coefficient of determination ( 2
R) is
estimated as 0.99. Using the flood data generated from a storage function runoff model, Kim et al.
[18] developed the following index flood relation in terms of watershed area.
)A(q = 3.895 0.873
A
The scaling exponent of this study was slightly higher than that of Kim et al. [18]. The index
flood relations between this study and the previous study [18] are similar, and the differences in the
index flood slightly increase as the watershed area decreases. Table 2 shows the relative errors
between the observed and predicted index floods at the CJD and YC stations.
.
Figure 5.
Width of error bounds and regional average relative RMSE of the estimated growth curves.
width of error bounds (left); regional average relative RMSE (right).
3.3. Assessment of Index Flood and Flood Quantiles
The index flood at any site can be estimated by averaging AMF data when the AMF data are
available at a site. The index flood needs to be estimated indirectly for ungauged watersheds where
AMF data are not available. In this study, the regionalized relation for the index flood was developed
using the scale invariance properties of the index flood with respect to watershed area. The previous
studies also applied the watershed area as the principal variable for predicting the index flood [
14
,
39
,
40
].
The power law equation for the index flood is represented by considering the first order moment of the
peak discharge distribution within a homogeneous region [30].
q(A) = q(1)Am
In this equation,
A
is the watershed area in km
2
,
m
is a scaling exponent, and
q(
1
)
is the index
flood associated with a unit watershed area. Using the simulated AMF data of 20 sites in Figure 1, the
power law relation between q(A)and A is determined as
q(A)= 2.987 A0.907 A>100 km2
Figure 6shows the developed index flood relation and the coecient of determination (
R2
) is
estimated as 0.99. Using the flood data generated from a storage function runomodel, Kim et al. [
18
]
developed the following index flood relation in terms of watershed area.
q(A)= 3.895 A0.873
Water 2019,11, 1717 11 of 15
The scaling exponent of this study was slightly higher than that of Kim et al. [
18
]. The index flood
relations between this study and the previous study [
18
] are similar, and the dierences in the index
flood slightly increase as the watershed area decreases. Table 2shows the relative errors between the
observed and predicted index floods at the CJD and YC stations.
Water 2019, 11, x FOR PEER REVIEW 11 of 15
Figure 6. Index flood relation.
Table 2. Comparison of relative errors for index flood.
Station This Study Kim
CJD 3.24% 6.46%
YC 1.03% 3.24%
* Kim is based on the result of Kim et al. [18].
Figure 7 illustrates a comparison of the flood quantile relations at the CJD and YC stations. The
flood quantiles were computed using Equation (6) with the developed index flood and growth curve.
The probability of the ordered AMF data was estimated by the median plotting position formula
described in Stedinger et al. [41]. The median plotting position formula was also applied by Kjeldsen
et al. [12] to assess the proposed regional frequency distribution.
Figure 7. Comparison of flood quantiles. Kim is the result of Kim et al. [17].
The predictive performance for the estimation of flood quantile was assessed using the CE and
root mean square normalized error (RMSNE) statistic used in Salinas et al. [2]. Table 3 summarizes
the CE and the RMSNE statistic for the estimation of flood quantiles at two locations. All cases
exhibited satisfactory performance with CE values higher than 0.9. The RMSNE values predicted by
this study were lower than those predicted by Kim et al. [17].
Table 3. CE and RMSNE statistic for the estimation of flood quantile.
Station GEV GNO Kim
CJD 0.92 (0.13) 0.93 (0.13) 0.93 (0.15)
YC 0.94 (0.13) 0.94 (0.13) 0.91 (0.17)
* The values in the parenthesis indicate RMSNE. Kim is based on the result of Kim et al. [17].
Figure 6. Index flood relation.
Table 2. Comparison of relative errors for index flood.
Station This Study Kim
CJD 3.24% 6.46%
YC 1.03% 3.24%
* Kim is based on the result of Kim et al. [18].
Figure 7illustrates a comparison of the flood quantile relations at the CJD and YC stations.
The flood quantiles were computed using Equation (6) with the developed index flood and growth
curve. The probability of the ordered AMF data was estimated by the median plotting position
formula described in Stedinger et al. [
41
]. The median plotting position formula was also applied by
Kjeldsen et al. [12] to assess the proposed regional frequency distribution.
Water 2019, 11, x FOR PEER REVIEW 11 of 15
Figure 6. Index flood relation.
Table 2. Comparison of relative errors for index flood.
Station This Study Kim
CJD 3.24% 6.46%
YC 1.03% 3.24%
* Kim is based on the result of Kim et al. [18].
Figure 7 illustrates a comparison of the flood quantile relations at the CJD and YC stations. The
flood quantiles were computed using Equation (6) with the developed index flood and growth curve.
The probability of the ordered AMF data was estimated by the median plotting position formula
described in Stedinger et al. [41]. The median plotting position formula was also applied by Kjeldsen
et al. [12] to assess the proposed regional frequency distribution.
Figure 7. Comparison of flood quantiles. Kim is the result of Kim et al. [17].
The predictive performance for the estimation of flood quantile was assessed using the CE and
root mean square normalized error (RMSNE) statistic used in Salinas et al. [2]. Table 3 summarizes
the CE and the RMSNE statistic for the estimation of flood quantiles at two locations. All cases
exhibited satisfactory performance with CE values higher than 0.9. The RMSNE values predicted by
this study were lower than those predicted by Kim et al. [17].
Table 3. CE and RMSNE statistic for the estimation of flood quantile.
Station GEV GNO Kim
CJD 0.92 (0.13) 0.93 (0.13) 0.93 (0.15)
YC 0.94 (0.13) 0.94 (0.13) 0.91 (0.17)
* The values in the parenthesis indicate RMSNE. Kim is based on the result of Kim et al. [17].
Figure 7. Comparison of flood quantiles. Kim is the result of Kim et al. [17].
The predictive performance for the estimation of flood quantile was assessed using the CE and
root mean square normalized error (RMSNE) statistic used in Salinas et al. [
2
]. Table 3summarizes the
CE and the RMSNE statistic for the estimation of flood quantiles at two locations. All cases exhibited
satisfactory performance with CE values higher than 0.9. The RMSNE values predicted by this study
were lower than those predicted by Kim et al. [17].
Water 2019,11, 1717 12 of 15
Table 3. CE and RMSNE statistic for the estimation of flood quantile.
Station GEV GNO Kim
CJD 0.92 (0.13) 0.93 (0.13) 0.93 (0.15)
YC 0.94 (0.13) 0.94 (0.13) 0.91 (0.17)
* The values in the parenthesis indicate RMSNE. Kim is based on the result of Kim et al. [17].
4. Discussion and Conclusions
We investigated the flood estimation method, which applies the combination of a rainfall-runo
simulation and the RFFA. The regional flood data were generated using the developed rainfall-runo
model. We tested the simulated flood responses at the outlet and one interior location in the basin.
Due to the limited flood data, we could not test the simulated flood responses at other locations. The
complete validation of a rainfall-runomodel is very dicult and might not be feasible in a poorly
gauged basin. The rainfall-runomodel simulation might be the only alternative option for flood
estimation in a poorly gauged basin even though the accuracy of the simulated flood data is uncertain.
The simulation of a rainfall-runomodel contains various sources of uncertainty related to the inputs,
model structures, model parameters, and initial conditions. These uncertainties have not been fully
considered in this study. The methodology for uncertainty estimation of a rainfall-runomodel has
been suggested in the literature [
42
]. Future research is needed to investigate the uncertainty related to
how a rainfall-runomodel development aects the results of the RFFA.
The responses of simulated peak discharge based on the developed rainfall-runomodel were
shown to be reasonable. The accuracy of peak discharge is a major concern for the present study rather
than the accuracy of the entire hydrographs. Some observation related to the simulated hydrographs
is briefly discussed. The satisfactory performance for large flood events might be related to the fact
that the infiltration excess runomechanism becomes dominant. Consequently, the unit hydrograph
method implemented in HEC-HMS was able to properly predict the runoresponses resulting from
the infiltration excess runo. The unsatisfactory performance for small and medium flood events
might be due to a change in the dominant runoprocesses. The dominant runoprocess for small
and medium flood events might change from the infiltration excess runoto other runoprocesses
such as saturation excess overland flow or subsurface stormflow. It is well known that various runo
generation processes occur depending on the characteristics of rainfall, topography, soils, vegetation,
and land use [
43
]. The previous studies demonstrated the change in the dominant runoprocesses as
the flood magnitude changes [
44
,
45
]. For the accurate simulation of hydrographs, it might be needed
to consider the various runogeneration processes which the HEC-HMS cannot account for. For some
storm events, the time to peak between the simulated hydrographs and the observed hydrographs
are not consistent. This dierence might be attributed to the inaccuracy of time of concentration
specified in the simulation. We applied the empirical equation for time of concentration developed in
the study basin, but the empirical equation might contain some uncertainty. The major parameters
of the HEC-HMS aecting the flood responses are curve number, time of concentration, and storage
coecient. The poor performance of the simulated hydrographs for some storm events might be
related to the misspecification of these parameters along with the incorrect runogeneration process
considered in the simulation. In addition to the uncertainty of parameters and runoprocesses, there
is a need to investigate the uncertainty of the observed discharge data which aects the performance of
simulation results.
The entire Chungju dam basin is considered as a homogeneous region based on the estimated
discordancy and heterogeneity statistic. The regional average of sample L-moment ratios was the
closest to the GEV distribution, and sample L-moment ratios were evenly dispersed through the GEV
distribution. The examination of the L-moment diagram,
Z
statistic, and AWOD measure indicated
that the GEV distribution was identified as the best fit to the simulated AMF data. This result is
consistent with the previous study [
17
]. The dierence of growth curves between GEV and GNO
Water 2019,11, 1717 13 of 15
distributions grew slightly as the return period increased. The growth curves estimated from this
study are slightly lower than that of Kim et al. [
17
]. However, the assessment of flood quantiles
indicated that GEV and GNO distributions exhibited similar results such that it was dicult to identify
the optimal regional frequency distribution. Further investigation is necessary for how to select the
optimal regional frequency distribution among the candidate distributions. The widths of error bounds
of the regional growth curve based on the GNO distribution were smaller than those based on the
GEV distribution. The widths of error bounds estimated in the present study were comparable with
the previous study [
10
]. There are dierent approaches for estimating the uncertainty bounds of the
regional growth curve [
46
]. In future study, it might be meaningful to investigate the dierences of
uncertainty bounds of the regional growth curve between dierent approaches. The estimation of
extreme flood quantiles was shown to be highly uncertain because the error bounds for the estimated
growth curve significantly increased as the return period increased beyond 100 years.
The index flood was regionalized in terms of watershed area. The scaling exponent of the power
law equation for the index flood was found to be around 0.9. The relative errors of the index flood
evaluated at the CJD and YC stations were relatively small and the relative errors of the index flood for
this study were smaller than those of the previous study [
18
]. The flood quantile estimates based on
the simulated AMF data exhibited good agreement with those based on the observed AMF data. The
estimated flood quantiles of this study are slightly better than those of the previous study [
17
] that
were obtained for the same basin using a dierent rainfall-runomodel. The regional growth curves
between this study and the previous study [
17
] were very similar although the location of sites used
for the RFFA were dierent between two studies. This suggests that the results of RFFA are not largely
aected by the choice of the sites used for the RFFA. The results of this study suggest that the flood
estimation approach attempted in the present study might be a useful tool for the estimation of flood
quantiles in the basin where the hourly rainfall data for major storm events are widely available and
the observed flood data are sparse. The flood estimation method explored in this study is conceptually
simple and involves less uncertain procedures compared with the design event approach, which
transforms design rainfall into flood quantiles and has a wide application in engineering practice.
Supplementary Materials:
The following are available online at http://www.mdpi.com/2073-4441/11/8/1717/s1,
Figure S1: Observed and simulated hourly hydrographs for large flood events (a) the simulation period 5 August
2002 to 9 August 2002, (b) the simulation period 9 July 2006 to 19 July 2006. Figure S2: Observed and simulated
hourly hydrographs for medium flood event (the simulation period 20 July 1987 to 25 July 1987). Figure S3:
Observed and simulated hourly hydrographs for small flood event (the simulation period 17 August 2004 to
21 August 2004). Table S1: Simulation conditions and observed peak flow for each storm event. Table S2:
Simulation results for each storm event.
Author Contributions:
Conceptualization, N.W.K. and D.-H.L.; methodology, D.-H.L.; software, D.-H.L.; formal
analysis, D.-H.L.; writing—original draft preparation, D.-H.L.; writing—review and editing, N.W.K. and D.-H.L.;
funding acquisition, N.W.K.
Funding:
This research was supported by the Strategic Research Project [project number 20180374] funded from
Korea Institute of Civil Engineering and Building Technology.
Acknowledgments:
The Authors are grateful to anonymous reviewers for their constructive comments
and suggestion.
Conflicts of Interest: The authors declare no conflict of interest.
References
1.
GREHYS. Presentation and review of some methods for regional flood frequency analysis. J. Hydrol.
1996
,
186, 63–84. [CrossRef]
2.
Salinas, J.L.; Laaha, G.; Rogger, M.; Parajka, J.; Viglione, A.; Sivapalan, M.; Blöschl,G. Comparative assessment
of predictions in ungauged basins-Part 2: Flood and low flow studies. Hydrol. Earth Syst. Sci.
2013
,17,
2637–2652. [CrossRef]
3.
Ball, J.; Babister, M.; Nathan, R.; Weeks, W.; Weinmann, E.; Retallick, M.; Testoni, I. Australian Rainfall and
Runo: A Guide to Flood Estimation; Geoscience Australia: Canberra, Australia, 2019.
Water 2019,11, 1717 14 of 15
4.
Rahman, A.; Weinmann, P.E.; Hoang, T.M.T.; Laurenson, E.M. Monte Carlo simulation of flood frequency
curves from rainfall. J. Hydrol. 2002,256, 196–210. [CrossRef]
5.
Caballero, W.L.; Rahman, A. Development of regionalized joint probability approach to flood estimation:
A case study for Eastern New South Wales, Australia. Hydrol. Process. 2014,28, 4001–4010. [CrossRef]
6.
Blazkova, S.; Beven, K. Flood frequency estimation by continuous simulation for a catchment treated as
ungauged (with uncertainty). Water Resour. Res. 2002,38, 14-1–14-14. [CrossRef]
7.
Boughton, W.; Droop, O. Continuous simulation for design flood estimation—A review. Environ. Model. Softw.
2003,18, 309–318. [CrossRef]
8.
Moretti, G.; Montanari, A. Inferring the flood frequency distribution for an ungauged basin using a spatially
distributed rainfall-runomodel. Hydrol. Earth Syst. Sci. 2008,12, 1141–1152. [CrossRef]
9.
Hosking, J.R.M.; Wallis, J.R. Regional Frequency Analysis: An Approach Based on L-Moments; Cambridge
University Press: New York, NY, USA, 1997.
10.
Hussain, Z.; Pasha, G.R. Regional flood frequency analysis of the seven sites of Punjab, Pakistan, using
L-moments. Water Resour. Manag. 2009,23, 1917–1933. [CrossRef]
11.
Jingyi, Z.; Hall, M.J. Regional flood frequency analysis for the Gan-Ming River basin in China. J. Hydrol.
2004,296, 98–117. [CrossRef]
12.
Kjeldsen, T.R.; Smithers, J.C.; Schulze, R.E. Regional flood frequency analysis in the KwaZulu-Natal province,
South Africa, using the index-flood method. J. Hydrol. 2002,255, 194–211. [CrossRef]
13.
Kumar, R.; Chatterjee, C.; Kumar, S.; Lohani, A.K.; Singh, R.D. Development of regional flood frequency
relationships using L-moments for Middle Ganga Plains Subzone 1(f) of India. Water Resour. Manag.
2003
,
17, 243–257. [CrossRef]
14.
Lim, Y.H.; Voeller, D.L. Regional flood estimations in Red river using L-Moment-based index-flood and
bulletin 17B procedures. J. Hydrol. Eng. 2009,14, 1002–1016. [CrossRef]
15.
Noto, L.V.; La Loggia, G. Use of L-moments approach for regional flood frequency analysis in Sicily, Italy.
Water Resour. Manag. 2009,23, 2207–2229. [CrossRef]
16.
Saf, B. Regional flood frequency analysis using L-moments for the west mediterranean region of Turkey.
Water Resour. Manag. 2009,23, 531–551. [CrossRef]
17.
Kim, N.W.; Lee, J.E.; Lee, J.W.; Jung, Y. Regional frequency analysis using spatial data extension method:
I. An empirical investigation of regional flood frequency analysis. J. Korea Water Resour. Assoc.
2016
,49,
439–450. [CrossRef]
18.
Kim, N.W.; Lee, J.W.; Lee, J.; Lee, J.E. SWAT application to estimate design runocurve number for South
Korean conditions. Hydrol. Process. 2010,24, 2156–2170. [CrossRef]
19.
Feldman, A.D. Hydrologic Modeling System HEC-HMS: Technical Reference Manual; U.S. Army Corps of
Engineers Hydrologic Engineering Center: Davis, CA, USA, 2000.
20.
Aronica, G.T.; Candela, A. Derivation of flood frequency curves in poorly gauged Mediterranean catchments
using a simple stochastic hydrological rainfall-runomodel. J. Hydrol. 2007,347, 132–142. [CrossRef]
21.
Wagener, T.; Wheater, H.S.; Gupta, H.V. Rainfall-RunoModelling in Gauged and Ungauged Catchments; Imperial
College Press: London, UK, 2004.
22.
Hrachowitz., M.; Savenije, H.H.G.; Blöschl, G.; McDonnell, J.J.; Sivapalan, M.; Pomeroy, J.W.; Arheimer, B.;
Blume, T.; Clark, M.P.; Ehret, U.; et al. A decade of Predictions in Ungauged Basins (PUB)—A review.
Hydrol. Sci. J. 2013,58, 1198–1255. [CrossRef]
23.
Sivapalan, M. Prediction in ungauged basins: A grand challenge for theoretical hydrology. Hydrol. Process.
2003,17, 3163–3170. [CrossRef]
24.
D’Asaro1, F.; Grillone, G. Empirical investigation of curve number method parameters in the Mediterranean
area. J. Hydrol. Eng. 2012,17, 1141–1152. [CrossRef]
25.
Woodward, D.E.; Hawkins, R.H.; Jiang, R.; Hjelmfelt, A.T.; Van Mullem, J.E.; Quan, Q.D. Runocurve
number method: Examination of the initial abstraction ratio. In Proceedings of the World Water and
Environmental Resources Congress, Philadelphia, PA, USA, 23–26 June 2003. [CrossRef]
26.
USDA-Natural Resource Conservation Service (USDA-NRCS). National Engineering Handbook: Part 630
Hydrology; USDA: Washington, DC, USA, 2004.
27.
Lee, J.; Yoo, C.; Sin, J. Theoretical backgrounds of basin concentration time and storage coecient and their
empirical formula. J. Korea Water Resour. Assoc. 2013,45, 155–169. [CrossRef]
Water 2019,11, 1717 15 of 15
28.
Pilgrim, D.H.; Cordery, I. Chapter 9 flood runo. In Handbook of Hydrology; Maidment, D.R., Ed.; McGraw-Hill
Inc.: New York, NY, USA, 1993; pp. 9.1–9.42.
29.
Dalrymple, T. Flood Frequency Analyses, Manual of Hydrology: Part 3; U.S. Government Printing Oce:
Washington, DC, USA, 1960.
30. Bocchiola, D.; De Michele, C.; Rosso, R. Review of recent advances in index flood estimation. Hydrol. Earth
Syst. Sci. 2003,7, 283–296. [CrossRef]
31.
Greenwood, J.A.; Landwehr, J.M.; Matalas, N.C.; Wallis, J.R. Probability weighted moments: Definition
and relation to parameters of several distributions expressible in inverse form. Water Resour. Res.
1979
,15,
1049–1054. [CrossRef]
32.
Mateus, A.; Caeiro, F. An R implementation of several randomness tests. AIP Conf. Proc.
2014
,1618, 531–534.
[CrossRef]
33. Mann, H.B. Nonparametric tests against trend. Econometrica 1945,13, 245–259. [CrossRef]
34. McCuen, R.H. Hydrologic Analysis and Design; Pearson Prentice Hall: Upper Saddle River, NJ, USA, 2005.
35.
Grubbs, F.; Beck, G. Extension of sample sizes and percentage points for significance tests of outlying
observations. Technometrics 1972,14, 847–854. [CrossRef]
36.
Hosking, J.R.M.; Wallis, J.R. Some statistics useful in regional frequency analysis. Water Resour. Res.
1993
,29,
271–281. [CrossRef]
37.
Kroll, C.N.; Vogel, R.M. Probability distribution of low streamflow series in the United States. J. Hydrol. Eng.
2002,7, 137–146. [CrossRef]
38.
Cheng, K.S.; Lien, Y.T.; Wu, Y.C.; Su, Y.F. On the criteria of model performance evaluation for real-time flood
forecasting. Stoch. Env. Res. Risk Assess 2017,31, 1123–1146. [CrossRef]
39.
Kumar, R.; Chatterjee, C. Regional flood frequency analysis using L-moments for North Brahmaputra region
of India. J. Hydrol. Eng. 2005,10, 1–7. [CrossRef]
40.
Meigh, J.R.; Farquharson, F.A.K.; Sutclie, J.V. A worldwide comparison of regional flood estimation methods
and climate. Hydrol. Sci. J. 1997,42, 225–244. [CrossRef]
41.
Stedinger, J.R.; Vogel, R.M.; Foufoula-Georgiou, E. Chapter 18 Frequency analysis of extreme events.
In Handbook of Hydrology; Maidment, D.R., Ed.; McGraw-Hill Inc.: New York, NY, USA, 1993; pp. 18.1–18.66.
42. Beven, K.; Binley, A. GLUE: 20 years on. Hydrol. Process. 2014,28, 5897–5918. [CrossRef]
43. Beven, K.J. Rainfall-RunoModelling: The Primer; John Wiley & Sons, Ltd.: Chichester, UK, 2001.
44.
Jothityangkoon, C.; Sivapalan, M. Towards estimation of extreme floods: Examination of the roles of runo
process changes and floodplain flows. J. Hydrol. 2003,281, 206–229. [CrossRef]
45.
Sivapalan, M.; Wood, E.F.; Beven, K.J. On hydrologic similarity. 3. A dimensionless flood frequency model
using a generalised geomorphic unit hydrograph and partial area runogeneration. Water Resour. Res.
1990
,
26, 43–58. [CrossRef]
46.
Hailegeorgis, T.T.; Alfredsen, K. Regional flood frequency analysis and prediction in ungauged basins
including estimation of major uncertainties for mid-Norway. J. Hydrol. Reg. Stud.
2017
,9, 104–126. [CrossRef]
©
2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
... In Table 3, the homogeneity statistic was estimated as 1.175. Although this is higher than the values of 0.39 obtained in a previous study by Lee and Kim [23] in an entirely different region of Chungju basin, Korea, the value of = 1.175 obtained in this study is less than 2. Hence, according to Sine and Ayalew (2004) [18], the six river sub-basins of Anambra-Imo river can be declared homogeneous as = 1.175 < 2. Figure 3 shows the plot of annual peak flood against its return period to give the at-site FFC of each gauging station under study. From Table 4, the estimated flood quartiles for the different gauged catchments indicate that Rivers Otamiri and Ivo have similar values while the gauging sites of Imo River at Umuopara and River Ajali also have similar values. ...
... This result is valid for this specific region as it agrees with Dubey [15], who obtained R 2 =1 for RFFC developed for 16 gauging sites in Narmada basin, India. However, the model performance is rated lower than those obtained by Lee and Kim [23] and Dubey [15] in similar studies in which R 2 = 0.99 and 0.945 respectively. To buttress the validity of the model, the deficient model performance is linked to fewer number of small gauged catchments (A<1000km 2 ) used in this study due to scarce gauging sites in the region, in contrast to the 16 and 20 large gauging sites used in previous studies [15,23]. ...
... However, the model performance is rated lower than those obtained by Lee and Kim [23] and Dubey [15] in similar studies in which R 2 = 0.99 and 0.945 respectively. To buttress the validity of the model, the deficient model performance is linked to fewer number of small gauged catchments (A<1000km 2 ) used in this study due to scarce gauging sites in the region, in contrast to the 16 and 20 large gauging sites used in previous studies [15,23]. This is further confirmed in Figure 5 ...
Article
Full-text available
Hydrologic designs require accurate estimation of quartiles of extreme floods. But in many developing regions, records of flood data are seldom available. A model framework using the dimensionless index flood for the transfer of Flood Frequency Curve (FFC) among stream gauging sites in a hydrologically homogeneous region is proposed. Key elements of the model framework include: (1) confirmation of the homogeneity of the region; (2) estimation of index flood-basin area relation; (3) derivation of the regional flood frequency curve (RFFC) and deduction of FFC of an ungauged catchment as a product of index flood and dimensionless RFFC. As an application, 1983 to 2004 annual extreme flood from six selected gauging sites located in Anambra-Imo River basin of southeast Nigeria, were used to demonstrate that the developed index flood model: , overestimated flood quartiles in an ungauged site of the basin. It is recommended that, for wider application, the model results can be improved by the availability and use of over 100 years length of flood data spatially distributed at critical locations of the watershed. Doi: 10.28991/cej-2020-03091627 Full Text: PDF
... The development of L-moment is the most remarkable contribution to statistical hydrology. It is the most widely used approach in RFA and has been credited for its superior performance and statistical characteristics, just to mention a few [16,18,65,67,69,[71][72][73][74][75][76][77][78] Th.is approach has been adopted for this study for its robustness as they tend to suffer less effects of sampling variability, requires less computational power, they are more robust in the presence of outliers. Furthermore, they yield more efficient parameter estimates than the maximum. ...
Article
Full-text available
Futuristic rainfall projections are used in scale and various climate impact assessments. However, the influence of climate variability on spatial distribution patterns and characteristics of rainfall at the local level, especially in semi-arid catchments that are highly variable and are not well explored. In this study, we explore the influence of climate variability on the spatial distribution and rainfall characteristics at a local scale in the semi-arid Shashe catchment, Northeastern Botswana. The LARS-WG, Long Ashton Research Station Weather Generator downscaling method, three representative scenarios (RCP 2.6, RCP 4.5, and RCP 4.5), three trend detection methods (Mann-Kendall, Sen's slope, and innovative trend analysis) and L-moment method were used to assess climate change impacts on rainfall. Two data sets were used; one with 40 years of observed data from 1981-2020 and the other with 70 years from 1981-2050 (40 years of observed and 30 years of projected data from 2021-2050). Generally, the study found trend inconsistencies for all the trend detection methods. In most cases, Sen's Slope has a high estimate of observed and RCP 2.6, while ITA overestimates rainfall totals under RCP 4.5 and RCP 8.5. The trend is increasing for annual total rainfall in most gauging stations while decreasing for annual maximum rainfall. The catchment is homogeneous, and Generalized Logistic distribution is the dataset's best-fit distribution. Spatial coverage of a 100-year rainfall between 151-180 mm will be 81% based on observed data and 87% based on projected data under RCP 2.6 scenario when it happens. A 200-year rainfall between 196-240 mm under RCP 4.5 and 8.5 has high spatial areal coverage, at least 90% of the total catchment. The outcomes of this study will provide insightful information for water resource management and flood risk assessment under climate change. There is a need, however, to assess the transferability of this approach to other catchments in the country and assess the performance of other advanced modelling systems, such as machine learning, in this region.
... To overcome data limitations, hydrologists have proposed a regional flood frequency analysis (RFFA), which attempts to estimate design floods at an ungauged catchment based on the concept of a homogeneous region, which pools observed flood data from a group of similar catchments to estimate design floods at the ungauged catchment [38,39]. This method became more popular among researchers than physical models because it saves time and resources [40]. Probabilistic rational method (PRM) [41], multiple linear regression (MLR) [42,43], quantile regression techniques (QRT) [44,45], and index flood method (IFM) [46,47] are some of the most commonly used RFFA techniques. ...
Article
Full-text available
Flood is one of the most destructive natural disasters, causing significant economic damage and loss of lives. Numerous methods have been introduced to estimate design floods, which include linear and non-linear techniques. Since flood generation is a non-linear process, the use of linear techniques has inherent weaknesses. To overcome these, artificial intelligence (AI)-based non-linear regional flood frequency analysis (RFFA) techniques have been introduced over the last two decades. There are limited articles available in the literature discussing the relative merits/demerits of these AI-based RFFA techniques. To fill this knowledge gap, a scoping review on the AI-based RFFA techniques is presented. Based on the Scopus database, more than 1000 articles were initially selected, which were then screened manually to select the most relevant articles. The accuracy and efficiency of the selected RFFA techniques based on a set of evaluation statistics were compared. Furthermore, the relationships among countries and researchers focusing on AI-based RFFA techniques are illustrated. In terms of performance, artificial neural networks (ANN) are found to be the best performing techniques among all the selected AI-based RFFA techniques. It is also found that Australia, Canada, and Iran have published the highest number of articles in this research field, followed by Turkey, the United Arab Emirates (UAE), India, and China. Future research should be directed towards identification of the impacts of data quantity and quality, model uncertainty and climate change on the AI-based RFFA techniques.
... The well-established L-moments regional frequency analysis based on the L-moments approach (Hosking and Wallis, 1997), which is commonly used for extreme value analysis, such as flood frequency estimates for ungauged catchments (e.g., Nyeko-Ogiramoi et al., 2012;Drissia et al., 2019;Lee and Kim. 2019;Cassalho et al., 2019), is not usually applied to daily streamflow datasets. This is an obvious consequence of the important serial stochastic structure of daily streamflow (e.g., autocorrelation and seasonality), which prevents the use of Lmoment-based heterogeneity and goodness-of-fit measures. Nonetheless, if both serial dependence an ...
Article
Hydrological observations in Angola are quite scarce and, as such, the water allocation process, one of the main components of water resources management, is commonly undertaken with a high degree of uncertainty. It is, therefore, vital to increase our understanding of the hydrological processes, namely the frequency distribution of daily streamflow, in this part of southern Africa and validate efficient methods that facilitate the extrapolation of information from gauged to ungauged catchments. All the components of this study were carefully designed to address all the above-mentioned goals. This was achieved with the modelling of 121 flow-duration curve samples observed in different parts of the country, including large datasets (e.g., 1954/55 to 1968/69) but mainly focused on the hydrological years of 1967/1968 and 1973/1974 and the implementation of regional frequency analysis based on the L-moments approach. The frequency distribution of daily streamflow was approximated with nine different probability distribution functions considering different subsets of the daily streamflow time series: (i) daily streamflow (ii) different subsets of daily streamflow divided into two ‘flows seasons’, wet and dry, (iii) and the previous subsets transformed with the definition of a thirty-day time lag, thereby reducing the serial dependency of daily streamflow and enabling the use of the L-moments approach. Overall, the results enabled two probability distribution functions to be identified able to provide a remarkable approximation to all the above-mentioned daily streamflow datasets (the four-parameter Kappa and three-parameter Generalized Pareto distributions). Furthermore, the regional frequency analysis supported the prediction of daily streamflow quantiles for eight test catchments with impressive accuracy (Nash-Sutcliffe efficiency coefficient: μ = 0.86; σ = 0.10; Pearson correlation coefficient: μ = 0.97; σ = 0.02), clearly showing that this approach represents a sound alternative for the prediction of daily streamflow in ungauged catchments located in this region.
... RFA using L-moments is a popular method and has been used in several case studies around the world. For example; in Korea, Lee and Kim (2019); in Canada, Requena et al. (2017); in Norway, Hailegeorgis and Alfredsen (2017); in India, Alam et al. (2016); in China, Yang et al. (2010); in Iran, Mesbahzadeh et al. (2019); in Turkey, Aydo gan et al. (2016); and many more. Two important studies providing inter-comparison of various regional flood estimation procedures are by GREHYS (1996aGREHYS ( , 1996b. ...
Article
Full-text available
The study provides results of regional frequency analysis (RFA) using annual maximum peak flows (AMPF) of 36 sites located on various streams and rivers of Khyber‐Pakhtunkhwa, Pakistan. Assumptions of randomness, independent and identical distribution regarding AMPF at each site have been validated using different statistical tests. The region of 36 sites is heterogeneous as confirmed by L‐moments based heterogeneity measure. Therefore, it is subdivided into four homogeneous regions considering the most influential site characteristic among available using wards clustering method and Euclidean distance. To identify good‐fit‐regional distribution(s), from a set of popular three‐parameter distributions, L‐moment ratio diagram and |Z‐Dist| statistic are used as goodness‐of‐fit criteria. To obtain the most suitable distribution having robust properties, a simulation‐based assessment analysis is performed for each homogeneous region considering root mean square error and 95% error bounds of regional quantiles as accuracy measures. Due to non‐linearity (in the functional relationship between the mean of AMPF at various sites and their corresponding site characteristics) and the existence of multicollinearity between the site characteristics, radial basis function (RBF) network has been used for the estimation of quantiles at ungauged sites. The results show that the adopted methodology is useful for the estimation of quantiles at gauged and ungauged sites within the defined homogeneous regions.
... RFA using L-moments is a popular method of estimation and has been used frequently in various case studies worldwide. For example; in Canada, (Requena et al., 2017); in Norway, (Hailegeorgis and Alfredsen, 2017); in Iran, (Mesbahzadeh et al., 2019); in India, (Alam et al., 2016); in Korea, (Lee and Kim, 2019); in China, (Yang et al., 2010); in Turkey, (Aydoğan et al., 2016). GREHYS (1996a, b) provided a detailed comparison of several regional flood estimation procedures. ...
... To overcome this issue, LM derived by Hosking (1990) is a preferred choice. A large number of studies have used LM in regional frequency analysis while modeling annual maxima's, for instance (Lee and Kim 2019;Vivekanandan 2015;Drissia et al. 2019;Hussain 2017;Khan et al. 2019;Rutkowska et al. 2018). ...
Article
Full-text available
Modeling of extreme values like annual maxima’s is important in many applications. Pearson Type-3 (PE3) distribution is an important probability distribution, widely used for modeling of extreme values with a variety of estimation methods. The focus of this study is to assess the effects of three methods of estimation of parameters for PE3 distribution namely L-moments (LM), maximum likelihood estimation (MLE) and maximum product of spacing (MPS). Assessment is based on a two-step approach. The first step uses simulation experiments while the second is based on empirical analyses, by varying size and shape characteristics of the sample. The study concluded that the estimates using LM method have low bias in case of small sample and when data exhibits small to moderate skewness and kurtosis. MPS is a reasonable alternative and provides efficient estimates, especially when the data shows large skewness and kurtosis with small to moderate size of sample. MLE method is useful in case of very large sample size with low values of shape characteristics of data. The results of this study provide useful guidelines for fitting PE3 distribution, especially to extreme values.
... The RFA based on L-moments has been applied to describe low flow Dodangeh et al. 2014;Modarres 2008), flood (Atiem and Harmancioglu 2006;Aydogan et al. 2014;Kumar and Chatterjee 2008;Lee and Kim 2019;Meshgi and Khalili 2009;Parida et al. 1998;Shabri and Jemain 2013;Yang et al. 2010), drought (Lana et al. 2008;Abolverdi and Khalili 2010;Eslamian et al. 2012;Sarhadi and Heydarizadeh 2014;Ulah et al. 2020), and wave heights (Ma et al. 2006). To analyse the precipitation data the method of L-moments has been considered worldwide. ...
Article
Full-text available
Precipitation is the main water resource for agronomy. The frequency is one of the most significant characteristics of precipitation. The objectives of this study are (1) to identify the homogenous regions, (2) to determine the probability distribution that fits precipitation the best for each region, (3) to perform regional precipitation-frequency analysis by using L-moment methods, and (4) to derive the return levels for precipitation. In this study, the precipitation data collected from 24 meteorological stations for the period 1965–2013 over Serbia were analyzed. Three distributions [generalized extreme value (GEV), generalized Pareto (GPD), and generalized logistic (GLO)] based on three parameters (scale, shape, and location) are investigated. The L-moment method is applied to determine the parameters. Three independent precipitation regions (R1, R2, and R3) were studied. The homogeneity test indicates that the identified regions are homogenous. To confirm the goodness-of-fit for the selected three probability distributions, the Z-statistics was applied. Based on the obtained results, the GEV distribution best fits the precipitation data in the regions R1 and R2, while the GPD was selected for the region R3.
Article
Full-text available
Regional information on stream discharge is needed in order to improve flood estimates based on the limited data available. Regional flood estimation is fundamental for designing hydraulic structures and managing flood plains and water resource projects. It is essential for estimating flood risks during recurrent periods due to suitable distributions. Regional flood frequency analysis is crucial for evaluating design flows in ungauged basins, and can complement existing time series in gauged sites and transfer them to ungauged catchments. Hence, this study aims to perform a regional flood frequency analysis of the Genale–Dawa River Basin of Ethiopia using the index flood and L-moments approach for sustainable water resource management. Three homogeneous hydrological regions were defined and delineated based on homogeneity tests from data of 16 stream-gauged sites, named Region-A, Region-B, and Region-C. The discordancy index of regional data for L-moment statistics was identified using MATLAB. All regions showed promising results of L-moment statistics with discordance measures (discordance index less than 3) and homogeneity tests (combined coefficient of variation (CC) less than 0.3). L-moment ratio diagrams were used to select best fit probability distributions for areas. Generalized extreme value, log-Pearson type III, and generalized Pareto distributions were identified as suitable distributions for Region-A, Region-B, and Region-C, respectively, for accurately modeling flood flow in the basin. Regional flood frequency curves were constructed, and peak flood was predicted for different return periods. Statistical analysis of the gauged sites revealed an acceptable method of regionalization of the basin. This study confirms that the robustness of the regional L-moments algorithm depends on particular criteria used to measure the performance of estimators. The identified regions should be tested with other physical catchment features to enhance flood quantile estimates at gauged and ungauged sites. Henceforth, this study’s findings can be further extended into flood hazard, risk, and inundation mapping of identified regions of the study area. Furthermore, this study’s approach can be used as a reference for similar investigations of other river basins.
Article
Runoff prediction in ungauged and scarcely gauged catchments is a key research field in surface water hydrology. There have been numerous studies before and since the launch of the predictions in ungauged basins (PUB) initiative by the International Association of Hydrological Sciences in 2003. This study critically reviews and assesses the decadal progress in the regionalization of hydrological modeling, which is the major tool for PUB, from 2000 to 2019. This paper found that the journal publications have noticeably increased in terms of PUB in the past 7 years, and research countries have been expanded dramatically since 2013. The regionalization methods are grouped into three categories including similarity‐based, regression‐based, and hydrological signature‐based. There are more detailed researches focusing on the interdisciplinary and profound improvement of each regionalization method. Namely, tremendous efforts have been made and lots of improvements have been carried out in the parameterization domain for the post‐PUB period. However, there is still plenty of room to improve the prediction capability in data‐sparse regions (e.g., further verification and proof of multi‐modeling adaptation and uncertainties description). This paper also discusses possible research directions in the future, including PUB in a changing environment and better utilization of multi‐source remote‐sensing information. This article is categorized under: • Science of Water > Science of Water Abstract The concept map of hydrological regionalization methods for runoff prediction in ungauged catchments
Article
Full-text available
Study region: 26 boreal catchments (mid-Norway). Study focus: We performed regional flood frequency analysis (RFFA) using the L-moments method and annual maximum series (AMS) of mean daily streamflow observations for reliable prediction of flood quantiles. We used similarity in at-site and regional parameters of distributions, high flow regime and seasonality, and runoff response from precipitation-runoff models to identify homogeneous catchments, bootstrap resampling for estimation of uncertainty and regression methods for prediction in ungauged basins (PUB). New hydrological insights for the region: The rigorous similarity criteria are useful for identification of catchments. Similarity in runoff response has the least identification power. For the PUB, a linear regression between index-flood and catchment area (R2 = 0.95) performed superior to a power-law (R2 = 0.80) and a linear regression between at-site quantiles and catchment area (e.g. R2 = 0.88 for a 200 year flood). There is considerable uncertainty in regional growth curves (e.g. −6.7% to −13.5% and +5.7% to +24.7% respectively for 95% lower and upper confidence limits (CL) for 2–1000 years return periods). The peaks of hourly AMS are 2–47% higher than that of the daily series. Quantile estimates from at-site flood frequency analysis (ASFFA) for some catchments are outside the 95% CL. Uncertainty estimation, sampling of flood events from instantaneous or high-resolution observations and comparative evaluation of RFFA with ASFFA are important.
Article
Full-text available
Model performance evaluation for real-time flood forecasting has been conducted using various criteria. Although the coefficient of efficiency (CE) is most widely used, we demonstrate that a model achieving good model efficiency may actually be inferior to the naïve (or persistence) forecasting, if the flow series has a high lag-1 autocorrelation coefficient. We derived sample-dependent and AR model-dependent asymptotic relationships between the coefficient of efficiency and the coefficient of persistence (CP) which form the basis of a proposed CE–CP coupled model performance evaluation criterion. Considering the flow persistence and the model simplicity, the AR(2) model is suggested to be the benchmark model for performance evaluation of real-time flood forecasting models. We emphasize that performance evaluation of flood forecasting models using the proposed CE–CP coupled criterion should be carried out with respect to individual flood events. A single CE or CP value derived from a multi-event artifactual series by no means provides a multi-event overall evaluation and may actually disguise the real capability of the proposed model.
Conference Paper
Full-text available
The Initial Abstraction ratio (Ia/S, or λ) in the Curve Number (CN) method was assumed in its original development to have a value of 0.20. Using event rainfall-runoff data from several hundred plots this assumption is investigated, and λ values determined by two different methods. Results indicate a λ value of about 0.05 gives a better fit to the data and would be more appropriate for use in runoff calculations. The effects of this change are shown in terms of calculated runoff depth and hydrograph peaks, CN definition, and in soil moisture accounting. The effect of using λ=0.05 in place of the customary 0.20 is felt mainly in calculations that involve either lower rainfall depths or lower CNs.
Article
Full-text available
This paper reviews the use of the Generalized Likelihood Uncertainty Estimation (GLUE) methodology in the 20 years since the paper by Beven and Binley in Hydrological Processes in (1992), which is now one of the most highly cited papers in hydrology. The original conception, the on-going controversy it has generated, the nature of different sources of uncertainty and the meaning of the GLUE prediction uncertainty bounds are discussed. The hydrological, rather than statistical, arguments about the nature of model and data errors and uncertainties that are the basis for GLUE are emphasized. The application of the Institute of Hydrology distributed model to the Gwy catchment at Plynlimon presented in the original paper is revisited, using a larger sample of models, a wider range of likelihood evaluations and new visualization techniques. It is concluded that there are good reasons to reject this model for that data set. This is a positive result in a research environment in that it requires improved models or data to be made available. In practice, there may be ethical issues of using outputs from models for which there is evidence for model rejection in decision making. Finally, some suggestions for what is needed in the next 20 years are provided. © 2013 The Authors. Hydrological Processes published by John Wiley & Sons, Ltd.
Article
For the design of infrastructures controlling the flood events at ungauged basins, this study tries to find the regional flood frequencies using peak flow data generated by the spatial extension of flood records. The Chungju Dam watershed is selected to validate the possibility of regional flood frequency analysis using the spatially extended flood data. Firstly, based on the index flood method, the flood event data from the spatial extension method is evaluated for 22 mid/smaller sub-basins at the Chungju Dam watershed. The homogeneity of the Chungju dam watershed was assessed in terms of the different size of watershed conditions such as accumulated and individual sub-basins. Based on the result of homogeneity analysis, this watershed is heterogeneous with respect to individual sub-basins because of the heterogeneity of rainfall distribution. To decide the regional probability distribution, goodness-of fit measure and weighted moving averages method from flood frequency analysis were adopted. Finally, GEV distribution was selected as a representative distribution and regional quantile were estimated. This research is one step further method to estimate regional flood frequency for ungauged basins.
Conference Paper
In many statistic methods, including distribution-free methods, we assume to work with random samples. In this note, we present randtests: an R package implementation of several nonparametric randomness tests. After a brief description of the tests included in the package, we present an application to real data sets in the field of Agricultural.
Article
This study proposes proper forms of empirical formulas for the concentration time and storage coefficient based on their theoretical backgrounds and evaluates several existing empirical formulas by comparing them with the formula proposed in this study. Additionally, empirical formulas for the concentration time and storage coefficient of the Chungju Dam basin were derived using the forms proposed by considering their theoretical backgrounds, and compared with exiting empirical formulas. The results derived are summarized as follows. (1) The concentration time of a basin is proportional to the square of the main channel length, but inversely proportional to the channel slope, as the flood flow is generally turbulent. (2) The storage coefficient is proportional to the concentration time. (3) The comparison results with existing empirical formulas for the concentration time indicates that the empirical formulas like the Kirpich, Kraven (I), Kraven (II), California DoT, Kerby, SCS, and Morgali & Linsley are in line with the form proposed in this study. Among existing empirical formulas for the storage coefficient, the Clak, Russell, Sabol and Jung are found to be well matched to this study. (4) The application results to Chungju Dam basin indicates that among empirical formulas for the concentration time, the Jung, Yoon, Kraven (I), and Kraven (II) show relatively similar results to the observed in this study, but the Rziha shows abnormal results. Among the empirical formulas for the storage coefficient, the Yoon and Hong, Jung, Lee, and Yoon show somewhat reasonable results, but the Sabol shows abnormal results. In conclusion, the empirical formulas for the concentration time and storage coefficient developed in Korea are found to reflect the basin characteristics of Korea better.
Article
The curve number (CN) method is widely used as a technique for estimating surface runoff depth from rainstorms. This simply lumped method is based on the main parameter CN, which represents the lumped expression of basin absorption, and on a parameter that represents interception, infiltration during the early part of a storm, and surface depression storage, called initial abstraction. In this paper, CN is evaluated at the basin scale from rainfall-runoff multiday events, in the observation period 1940-1997 (recorded length mean equal to 20 years) for 61 Sicilian basins with three different methods: NEH4 method, asymptotic fitting method, and a least-squares method. A first analysis of Sicilian watershed behavior indicates a major occurrence of standard CN response (43 basins), as opposed to a complacent response (10 basins), and a few cases of violent behavior (3 basins). For basins with complacent behavior a modified formula of a runoff CN equation is proposed. The original assumption of the initial abstraction ratio (Ia=S or λ) equal to 0.20 is investigated for watersheds with standard and violent CN response, using "natural" and "ordered" rainfall-runoff data. Results indicate a median λ value of 0 for natural data and 0.05 for ordered data, according to recent worldwide studies. CN seasonal preliminary analysis indicates higher CNs in the dormant season versus in the growing season, whereas λ seasonal analysis indicates values close to 0 in both the dormant and growing seasons.
Article
Estimation of design flood in ungauged catchments is a common problem in hydrology. Methods commonly adopted for this task are limited to peak flow estimation, e.g. index flood, rational and regression-based methods. To estimate complete design hydrograph, rainfall–runoff modelling is preferred. The currently recommended method in Australia known as Design Event Approach (DEA) has some serious limitations since it ignores the probabilistic nature of principal model inputs (such as temporal patterns (TP) and initial loss) except for design rainfall depth. A more holistic approach such as Joint Probability Approach (JPA)/Monte Carlo Simulation Technique (MCST) can overcome some of the limitations associated with the DEA. Although JPA/MCST has been investigated by many researchers, it has been proved to be difficult to apply since its routine application needs readily available regional design data such as stochastic rainfall duration, TP and losses, which are largely unavailable for Australian states. This paper presents regionalization of the model inputs/parameters to the JPA/MCST for eastern New South Wales (NSW) in Australia. This uses data from 86 pluviograph stations and six catchments from NSW to regionalize the input distributions for application with the JPA/MCST. The independent testing to three test catchments shows that the regionalized JPA/MCST generally outperforms the at-site DEA. The developed regionalized JPA/MCST can be applied at any arbitrary location in eastern NSW. The method and design data developed here although primarily applicable to eastern NSW can be adapted to other Australian states and countries. Copyright © 2013 John Wiley & Sons, Ltd.