ArticlePDF Available

Determination and integration of appropriate spatial scales for river basin modelling

Authors:

Abstract and Figures

Appropriate spatial scales of dominant variables are determined and integrated into an appropriate model scale. This is done in the context of the impact of climate change on flooding in the River Meuse in Western Europe. The objective is achieved by using observed elevation, soil type, land use type and daily precipitation data from several sources and employing different relationships between scales, variable statistics and outputs. The appropriate spatial scale of a key variable is assumed to be equal to a fraction of the spatial correlation length of that variable. This fraction was determined on the basis of relationships between statistics and scale and an accepted error in the estimation of the statistic of 10%. This procedure resulted in an appropriate spatial scale for precipitation of about 20 km in an earlier study. The application to river basin variables revealed appropriate spatial scales for elevation, soil and land use of respectively 0·1, 5·3 and 3·3 km. The appropriate model scale is determined by multiplying the appropriate variable scales with their associated weights. The weights are based on SCS curve number method relationships between the peak discharge and some specific parameters like slope and curve number. The values of these parameters are dependent on the scale of each key variable. The resulting appropriate model scale is about 10 km, implying 225–250 model cells in an appropriate model of the Meuse basin meant to assess the impact of climate change on river flooding. The usefulness of the appropriateness procedure is in its ability to assess the appropriate scales of the individual key variables before model construction and integrate them in a balanced way into an appropriate model scale. Another use of the procedure is that it provides a framework for decisions about the reduction or expansion of data networks and needs. Copyright © 2003 John Wiley & Sons, Ltd.
Content may be subject to copyright.
HYDROLOGICAL PROCESSES
Hydrol. Process. 17, 2581– 2598 (2003)
Published online 11 August 2003 in Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002/hyp.1268
Determination and integration of appropriate spatial
scales for river basin modelling
M. J. Booij*
Department of Civil Engineering, University of Twente, PO Box 217, 7500 AE Enschede, The Netherlands
Abstract:
Appropriate spatial scales of dominant variables are determined and integrated into an appropriate model scale. This is
done in the context of the impact of climate change on flooding in the River Meuse in Western Europe. The objective
is achieved by using observed elevation, soil type, land use type and daily precipitation data from several sources
and employing different relationships between scales, variable statistics and outputs. The appropriate spatial scale of
a key variable is assumed to be equal to a fraction of the spatial correlation length of that variable. This fraction was
determined on the basis of relationships between statistics and scale and an accepted error in the estimation of the
statistic of 10%. This procedure resulted in an appropriate spatial scale for precipitation of about 20 km in an earlier
study. The application to river basin variables revealed appropriate spatial scales for elevation, soil and land use of
respectively 0Ð1, 5Ð3and3Ð3 km. The appropriate model scale is determined by multiplying the appropriate variable
scales with their associated weights. The weights are based on SCS curve number method relationships between
the peak discharge and some specific parameters like slope and curve number. The values of these parameters are
dependent on the scale of each key variable. The resulting appropriate model scale is about 10 km, implying 225– 250
model cells in an appropriate model of the Meuse basin meant to assess the impact of climate change on river flooding.
The usefulness of the appropriateness procedure is in its ability to assess the appropriate scales of the individual key
variables before model construction and integrate them in a balanced way into an appropriate model scale. Another
use of the procedure is that it provides a framework for decisions about the reduction or expansion of data networks
and needs. Copyright 2003 John Wiley & Sons, Ltd.
KEY WORDS river flooding; rainfall-runoff modelling; spatial scales; appropriateness; correlation length;
semi-correlation; variance reduction function; Meuse basin
INTRODUCTION
Global climate changes induced by increases in greenhouse gas concentrations are likely to increase
temperatures, change precipitation patterns and probably raise the frequency of extreme events (IPCC, 2001).
These changes may have serious impacts on society, e.g. on river deltas because of both sea level rise
and an increased occurrence of flooding events. Flooding events may cause enormous economic, social
and environmental damage and even loss of life. This necessitates the application of robust and accurate
flood estimation procedures to provide a strong basis for investments in flood protection measures with
climate change.
A broad palette of models is available for this purpose, ranging from simple, lumped black-box models
to complex, distributed models including lots of physics and mathematics. These include empirical models,
conceptual models and physically based models. These divisions are somewhat arbitrary and hybrid forms
exist in which different methods are combined. The complexity of models depends not only on the model
class to which they belong, but also on the processes incorporated, the process formulations used, and the
different space and time scales employed. In general, models should be sufficiently detailed to capture the
* Correspondence to: M. J. Booij, Department of Civil Engineering, University of Twente, PO Box 217, 7500 AE Enschede, The Netherlands.
E-mail: m.j.booij@ctw.utwente.nl
Received 10 June 2002
Copyright 2003 John Wiley & Sons, Ltd. Accepted 19 November 2002
2582 M. J. BOOIJ
dominant processes and natural variability, but not unnecessarily refined that computation time is wasted
or data availability is limited. It would seem that an optimum model complexity associated with minimum
total uncertainty exists, with a balance in uncertainties from input, model structure and parameters. This
raises the question of what such an appropriate model should look like given the specific modelling objective
and research area. More specifically, which physical processes and data should be incorporated and which
mathematical process formulations should be used and at which spatial and temporal scale to obtain an
appropriate model level?
Booij (2002a) has developed a model appropriateness procedure in which subsequently dominant processes,
appropriate scales and associated appropriate process formulations are determined. This study focuses on the
determination of the appropriate spatial scales. An appropriate scale of a variable is intuitively defined as a
scale which is sufficiently detailed to capture the variability of that variable, but not more than that. First
the dominant processes, important with respect to the impact of climate change on flooding in a particular
river basin, have to be defined. Next, these dominant processes can be linked to dominant variables to be
able to assess the appropriate scales for the different processes through these variables. In this way, the
appropriate variable scales are derived before model construction, which is believed to be more reasonable
than on the basis of model sensitivities (see Booij, 2002b). Only for the integration of these appropriate
variable scales to an appropriate model scale have model sensitivities been used, which is considered to
be inevitable.
Besides precipitation, three processes are recognized as being responsible for the relatively fast trans-
port of precipitation to the stream network during flood events (Maidment, 1992). These are infiltration
excess overland flow (Horton overland flow), saturation excess overland flow (Dunne overland flow) and
subsurface storm flow (interflow). The relative contributions of these three flood-generating processes to
a specific flood vary heavily between different watersheds and different precipitation events, and within
a specific watershed. Additionally, the soil moisture conditions at the beginning of a flood are of impor-
tance and should be accurately determined. These initial conditions are mainly influenced by the pre-
ceding precipitation, evapotranspiration and subsurface flow in the soil matrix. Moreover, the river flow
itself contributes to the occurrence and magnitude of floods. The dominant variables were derived from
these processes by Booij (2002a) on the basis of several studies (e.g. Mauser and Sch¨
adlich, 1998;
Veihe and Quinton, 2000) and are precipitation, elevation, soil type (texture and parent material) and
land use.
The emphasis will be on the appropriate spatial scale and thus, the appropriate time scale should be deter-
mined beforehand. The variables elevation, soil and land use are assumed to be time invariant on the temporal
domain considered (¾80 years), although land use may show a seasonal dependence and long-term trends.
Obviously, precipitation is strongly variable in time and should be modelled accordingly. The appropriate
temporal scale to assess the impact of climate change on flooding in the river basin considered has been
assessed at one day (Booij, 2002a). The appropriate spatial scale for extreme daily precipitation was deter-
mined at 20 km (Booij, 2002c) and will be used in the integration of the appropriate variable scales into an
appropriate model scale. The appropriate spatial scales for elevation, soil and land use are derived in this
study.
Several studies show that the ‘appropriate’ scales for elevation (e.g. Bruneau et al., 1995; Brasington
and Richards, 1998; Wolock and McCabe, 2000), soil moisture (e.g. Farajalla and Vieux, 1995; Western
et al., 1998; Cosh and Brutsaert, 1999) and land use type (e.g. Moody and Woodcock, 1995; Walsh
et al., 2001) vary considerably depending on geographical area (climate, vegetation), extent of the area
and support scale of the data. For example, it is found that the scales for soil moisture determined
by Farajalla and Vieux and Cosh and Brutsaert compare favourably (about 1000 m), in contrast to the
35–60 m estimated by Western et al. This may be due to incorrect, large sampling distances in the
former studies resulting in an overestimation of the ‘true’ appropriate scale. The spatial scales of land
use have been mainly assessed in relation to land use change; implications of spatial scales of land use
for hydrological modelling are rarely investigated. The extent and support scale are the basics of the
Copyright 2003 John Wiley & Sons, Ltd. Hydrol. Process. 17, 2581– 2598 (2003)
SPATIAL SCALES FOR RIVER BASIN MODELLING 2583
so-called scale triplet (Bl¨
oschl and Sivapalan, 1995) and can have a substantial effect on the estimation
of the correlation length and related appropriate scales (see Western and Bl¨
oschl, 1999). These scale
features should be taken into account when determining and integrating the appropriate scales for the key
variables.
The objective of this paper is therefore to determine the appropriate spatial scales of the dominant variables
elevation, soil type, land use and precipitation and integrate them into an appropriate model scale. Dominant
refers to the relative importance of these variables with respect to the impact of climate change on river
flooding. The River Meuse basin in France, Belgium and the Netherlands is the geographical area chosen
for this research purpose. The objective is achieved by using observed data from several sources and then
employing different relationships between scales, variable statistics and outputs. The results are presented and
discussed and conclusions drawn.
OBSERVED DATA
Elevation, soil, land use, precipitation and discharge data are used in this study. The sources and spatial
characteristics (spacing, extent) of the data are described below.
Elevation data are from a global digital elevation model (DEM) GTOPO30 (US Geological Survey, 1996)
and a continental United States digital elevation model US7Ð5MIN (US Geological Survey, 1995). These data
are distributed by the EROS Data Center Distributed Active Archive Center (EDC DAAC), located at the US
Geological Survey’s EROS Data Center in Sioux Falls, South Dakota.
GTOPO30 is a global data set covering the full extent of latitude and longitude with elevation values
ranging from 407 to 8752 m. The horizontal grid spacing is 30 arc-sec (1/120°or approximately 1 km) and
the vertical unit is metres above mean sea level. The source for the European GTOPO30 data is the Digital
Terrain Elevation Data (DTED) set. DTED is a raster topographic database with a horizontal grid spacing of
3 arc-sec (approximately 90 m). The source for the United States GTOPO30 data are USGS 1°DEMs with a
horizontal grid spacing of 3 arc-sec. The 30 arc-sec data were obtained from the 3 arc-sec data by selecting
one representative elevation value to represent the area covered by 100 full resolution cells (10 ð10). The
absolute vertical accuracy of GTOPO30 for Europe and the United States is š30 m linear error at the 90%
confidence level (US Geological Survey, 1993).
US7Ð5MIN is a data set for the continental United States. The horizontal grid spacing is 30 m and the
vertical unit is metres above mean sea level. The data sources are digitized cartographic map contour
overlays and scanned National Aerial Photography Program (NAPP) photographs. The absolute vertical
accuracy of US7Ð5MIN derived from photographs is 7 m or better (90%) and 815 m (10%) (US Geological
Survey, 1995).
Soil data are from the European Soil Database. The data are distributed by the European Soil Bureau
located at the Joint Research Centre’s Space Applications Institute in Ispra, Italy. Only data from the Soil
Geographical Database (King et al., 1994), part of the European Soil Database, are used. The horizontal scale
is 1 : 1 000 000 similar to a horizontal grid spacing of approximately 2Ð5 km, because the positional accuracy
is estimated at 0Ð5–5 km (0Ð5– 5 mm at scale 1 : 1 000 000). Two attributes from this database have been used,
namely Dominant Parent Material (MAT1) and Dominant Surface Textural Class (TEXT1).
Land use data are from the European Environmental Agency (EEA) NATLAN Database. The data are
distributed by the European Environmental Agency in Copenhagen, Denmark. Only data from the CORINE
Land Cover Database (Bossard et al., 2000), part of the EEA NATLAN Database, are used. The horizontal
grid spacing is 250 m. The CORINE database has 44 classes and is derived from Landsat and SPOT
satellite images.
Daily precipitation data from 39 stations in the Meuse basin in Belgium and France for the period 197099
were used (see Booij, 2002c). Daily discharge data at the basin outflow point (Borgharen) and at several
sub-basin outflow points were employed.
Copyright 2003 John Wiley & Sons, Ltd. Hydrol. Process. 17, 2581– 2598 (2003)
2584 M. J. BOOIJ
DETERMINATION AND INTEGRATION OF APPROPRIATE SPATIAL SCALES
Variables at different scales
The spatial variance is an important statistic describing the patterns of elevation, soil type and land use
type. The variance 2for a variable Zx is estimated as:
2D
n
xD1
[Zx ]2
n11
where nis the total number of points or cells in the spatial domain and is the average of Zx over n.In
general, it applies that the larger the area over which Zx is aggregated, the smaller will be the variance.
The relation between the point variance (subscript ‘p’) and the areally averaged variance (subscript ‘A’) can
be stated as follows:
2AD2p22
where 2is the variance reduction function decreasing with increasing area A. Its magnitude depends on the
spatial correlation structure of the variable, and the size and shape of the area. Rodriguez-Iturbe and Mejia
(1974) showed for a stationary isotropic spatial random field that 2is the expected value of the correlation
coefficient between any two points randomly chosen at distance h:
2DH
0
hfhdh3
where His the maximum distance within the area, (h) is the spatial correlation function and fh is the
probability density function (pdf) of the random variable h. Assuming a square area with sides a(and area
ADa2), the pdf is as follows (Ghosh, 1951):
fh D4h
a4h 4
where
h D1
2a22ah C1
2h20<h<a
h Da2sin1a
hcos1a
hC2ah2a21
2h2C2a2 a<h<H
Assuming an exponential spatial correlation function h Dexph/,whereis the spatial correlation
length, the spatial variance associated with an aggregation scale Acan be determined from the spatial point
variance. The determination of is done using variograms from geostatistics (continuous variables) and
semi-correlation functions (categorical variables).
The starting point for a variogram is the semi-variance between two points (xand xCh) at distance h
(e.g. Kitanidis, 1997):
gx, h D1
2[Zx Zx Ch]25
This semi-variance gx, h can be determined for all pairs of points in a specific area. An exponential variogram
model is then fitted to these semi-variances:
h D21exp h
 6
Copyright 2003 John Wiley & Sons, Ltd. Hydrol. Process. 17, 2581– 2598 (2003)
SPATIAL SCALES FOR RIVER BASIN MODELLING 2585
The sill 2is estimated directly from the variogram and the spatial correlation length for each area can
then be determined from the relation between and h. In this way, fitted variograms h for a specific area
are obtained.
Variograms cannot be used for categorical variables such as land use. The spatial variability of these
variables is described by introducing a so-called semi-correlation rŁas:
rŁs, h D
n
xD1
Zsx Ch
M
mD1
Zmx Ch
n7
where sis a specific category of Zx resulting in Zsx, m is an arbitrary category of Zx resulting in Zmx
and Mis the total number of categories. For each category s, the semi-correlation rŁcan be plotted as a
function of the separation distance hto obtain a semi-correlogram from which the spatial correlation length
for each category can be estimated.
The correlation length will be used to determine the appropriate scale and should therefore be meaningful
for hydrological applications, in particular under extreme conditions. For example, it does not make sense
to estimate correlation lengths associated with mountain ranges or large geological structures, as they are
not assumed to play a key role in extreme hydrological behaviour. On the other hand, scales related to hill
slopes and soil types may be very important for this kind of behaviour. These considerations may arise
when non-stationary variograms or semi-correlograms with a number of preferred scales are the result of the
geostatistical analyses. In that case, the scale associated with extreme hydrological behaviour, assuming local
stationarity at that scale (see e.g. Bl¨
oschl, 1999), will be employed. If necessary, this will be clarified for the
variable concerned.
Variables at appropriate scales
The appropriate spatial scale for a variable is dependent on its correlation structure and the application
area studied. Spatial scales should be sufficiently detailed to capture natural variability, but not unnecessarily
refined that computation time is wasted. The appropriate spatial scale can be determined by means of the
relations above given a specific appropriateness criterion. This criterion is based on the bias allowed in
estimating the variance of areally averaged variables from the variance of point variables. The higher this
permitted bias, the larger the appropriate scale. The determination of the appropriate scale is illustrated in
Figure 1, where the reduction of the variance is given as a function of dimensionless aggregation scale (a/).
The dotted line illustrates the determination of the appropriate scale assuming a permitted bias of 10%. This
arbitrary bias is assumed to be appropriate in this study and results in an appropriate scale for the variance
which is about 20% of the correlation length. The appropriate scale for return values was found to be 25%
of the correlation length assuming a bias of 10% (see Booij, 2002c). It thus depends on the relevant statistic
of which fraction of the correlation length represents the appropriate scale for a given bias.
Variables at the appropriate model scale
The appropriate scales for each key variable should be combined to derive an appropriate river basin model
scale. For variables and processes with smaller appropriate scales than this appropriate model scale, distribution
functions may be used to solve the subgrid scale variability or values and processes may be simply averaged.
For variables and processes with less variability, constant values may be used over several appropriate model
scales. The integration of separate appropriate variable scales towards an appropriate model scale is performed
by means of relations between key variable scales and the output variable of interest. The relative importance
Copyright 2003 John Wiley & Sons, Ltd. Hydrol. Process. 17, 2581– 2598 (2003)
2586 M. J. BOOIJ
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0.001 0.01 0.1 1 10
reduction
0.21
a/
l
Figure 1. Determination of the appropriate scale for a variable accepting a 10% bias
of the separate appropriate variable scales is dependent on the sensitivity of the model output to changes in
these scales. This sensitivity can be assessed by means of a sensitivity analysis, however when no specific
model is available another method should be employed. The SCS method (see e.g. Maidment, 1992) is used
for this purpose, because an approximate estimation is required and the model output of interest is the peak
discharge.
The peak discharge qpin the SCS method is derived from a triangular approximation to the hydrograph
shown in Figure 2 resulting from a rainfall excess intensity peof duration Tp(and volume PeDpeTp). The
lag Tlfrom the centroid of rainfall excess to the peak and the time of rise Tqto the peak are illustrated as
well. The base length of the hydrograph 2Ð67Tqis based on the study of many unit hydrographs (Maidment,
1992). The volume of runoff under the hydrograph Vpis derived with the basic SCS relationship
VpD[25Ð4Pe5Ð08fCN]2
25Ð4PeC20Ð3fCNwith fCND1000
CN 10 8
Tl
Tp/2
Tp
Tq
qp
pe
1.67Tq
Figure 2. SCS triangular hydrograph
Copyright 2003 John Wiley & Sons, Ltd. Hydrol. Process. 17, 2581– 2598 (2003)
SPATIAL SCALES FOR RIVER BASIN MODELLING 2587
where Vpand Peare in millimetres and CN is the well-known curve number dependent on soil type, land use
type and hydrologic condition of the land surface. CN values can be found in extensive tables (e.g. Maidment,
1992) and vary between 20 and 100. Equating the volume Vpto the volume 1
2qp2Ð67Tqin Figure 2, rearranging
and adjusting for units gives qp(m3s1):
qpD5Ð28AVp
0Ð5TpCTl
9
where Ais the catchment area (km2), Tpis in hours and Tlis in hours as follows (Kent, 1972):
TlDl0Ð8[1 CfCN]0Ð7
3Ð42S0Ð5
0
10
where lis the hydraulic length of the catchment (km) and S0is the overland slope in parts per 10 000.
The sensitivity of the peak discharge to changes in variable scales is assessed by introducing into
Equation (9) relationships between its parameters (Pe,CNandS0) and variable scales, instead of constant
parameters. In this way, relations between the peak discharge and the variable scales (e.g. for elevation) are
obtained. The relationships between the SCS parameters and variable scales are empirical ones acquired by
estimating the different parameters for variable fields with different spatial resolutions. If CN and S0are used
as indicators for respectively soil type/land use and topography, relations between CN and soil type/land
use scale, and S0and elevation scale can be incorporated into Equation (9). A precipitation–scale relation
can be directly implemented into Equation (8). In this way, relationships between the key variable scale
and the output variable are established. These relationships are used to assess the weights associated with
an appropriate variable scale. Finally, the weights are multiplied with the appropriate scales to obtain the
appropriate model scale.
It should be mentioned that this hypothesis (i.e. that the appropriate model scale is a linear combination
of appropriate scales of dominant variables) has not been tested. The hypothesis may not hold for small and
fast responding catchments with strong non-linear spatial interactions in rainfall-runoff processes. However,
in large river basins (like the Meuse) the methodology may give a good indication on the appropriate model
scale, which can be used in the model design. Moreover, the weights used in the methodology have been
obtained by non-linear relations between the model output and the variable scales, through the SCS method
and its parameters.
RESULTS AND DISCUSSION
Appropriate scale for elevation
West er n a nd B l ¨
oschl (1999) presented directives on spacing, support and extent when estimating the cor-
relation length. They gave for spacing (equal to support in this case) a maximum value of 20% of the true
correlation length. Assuming true correlation lengths of about 100–1000 m corresponding to realistic hill
slope lengths, it is obvious that DEMs with a resolution of 1 km cannot be used for this purpose. Therefore,
the spatial correlation length should be determined from the US DEM with a resolution of 30 m, on condition
that the elevation pattern of this DEM is representative for the Meuse basin. Unfortunately, DEMs with a
higher resolution than 1 km were not available for the Meuse basin. The geographical area chosen is a region
in the northeast of the United States (69 °W–68°W and 46 °N–47°N) with a similar elevation pattern and
elevation distribution as the Meuse basin.
Figure 3 shows the cumulative frequency distribution of elevations from US7Ð5MIN (US 30m DEM) for five
selected subregions of about 150 km2, GTOPO30 for the whole region (US 1000m DEM), GTOPO30 for the
15 main sub-basins of the Meuse basin (Tributary 1000m DEM) and GTOPO30 for the Meuse basin (Meuse
1000m DEM). The cumulative frequency distributions for the US region and the Meuse basin are similar,
Copyright 2003 John Wiley & Sons, Ltd. Hydrol. Process. 17, 2581– 2598 (2003)
2588 M. J. BOOIJ
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 100 200 300 400 500 600 700 800
Elevation (m)
Frequency (-)
US 30 m DEM
US 1000 m DEM
Tributary 1000 m DEM
Meuse 1000 m DEM
Amblève
Mehaigne
Figure 3. Cumulative frequency distribution of elevation for US 30m DEM, US 1000m DEM, Meuse tributary 1000m DEM and Meuse
1000m DEM
indicating that the US region has a similar elevation pattern and elevation distribution as the Meuse basin. The
cumulative frequency distributions for elevations derived from the US 30m DEM are steeper than those derived
from the Tributary 1000m DEM and thus in the latter DEMs larger elevation differences can be found. This can
be explained by the fact that the Tributary 1000m DEMs have been derived for larger areas (350–3200 km2)
than the US 30m DEMs (150 km2). A similar effect as for the extent has been observed for the support
scale, i.e. the smaller the support scale for a specific area, the larger the elevation differences. Large elevation
differences are particularly found for the sub-basins in the eastern part of the Meuse basin (e.g. the Ambl`
eve).
The mean, minimum and maximum elevation in the Meuse 1000m DEM are respectively 284, 43 and 676 m.
Figure 4 shows the semi-variance as a function of separation distance for the US 1000m DEM, the Meuse
1000m DEM and the US 30m DEM as double logarithmic plots. For the US 30m DEM, the semi-variance
has been derived for the five subregions for very small separation distances up to 2 km. Nine grid points
were selected in each subregion and for each grid point average semi-variances were calculated for a specific
separation distance (radius). The average semi-variance as a function of separation distance over 45 (5 ð9)
grid points is shown in Figure 4b. Similar approaches were used for the US 1000m DEM and Meuse 1000m
DEM, but then for nine grid points in the whole region and separation distances up to 20 km. The variograms
in Figure 4a are similar, although the one for the US region shows less spatial variability. Therefore, it can
be assumed that the US region is representative for the Meuse basin regarding elevation patterns and it is rea-
sonable to assume that the variogram for the US 30m DEM in Figure 4b is representative for the Meuse basin
as well. The variograms in Figure 4a are non-stationary and no clear preferred scales can be distinguished.
The variogram in Figure 4b may not be completely stationary (in particular for larger separation distances
than shown), but a correlation length can be derived if local stationarity is assumed. This correlation length
agrees well with hill slope scales important for hydrological behaviour.
The mean, minimum and maximum correlation length determined from the 45 US 30m variograms are
respectively 527, 272 and 1401 m. It can be seen that even for the minimum correlation length the resolution
of the US 30m DEM is sufficient. The mean correlation lengths computed from the US 1000m DEM and the
Meuse 1000m DEM were respectively 4138 and 4220 m. This overestimation of the correlation lengths could
be expected from the relations between spacing and apparent correlation described in Western and Bl¨
oschl
(1999). The appropriate scale for elevation can be determined with the criterion for appropriate scales given
earlier. Then, it was found that the appropriate scale for an appropriate description of the variability is about
20% of the correlation length. The appropriate scale for elevation in the Meuse basin is therefore 110 m.
This scale is in the same range as the scales recommended for DEMs in other studies (e.g. Brasington and
Richards, 1998).
Copyright 2003 John Wiley & Sons, Ltd. Hydrol. Process. 17, 2581– 2598 (2003)
SPATIAL SCALES FOR RIVER BASIN MODELLING 2589
1
10
100
1000
10000
100 1000 10000 100000
Separation distance (m)
Semi-variance (m2)
US 1000 m DEM
Meuse 1000 m DEM
(a)
1
10
100
1000
1 10 100 1000 10000
Separation distance (m)
Semi-variance (m2 )
US 30 m DEM
(b)
Figure 4. Semi-variance (m2) as a function of separation distance (m) for (a) US 1000m DEM and Meuse 1000m DEM and (b) US 30m
DEM
The methodology for the integration of scales requires relationships between key variable scales and the
output variable of interest. The key variable elevation is represented in Equation (10) by the slope. Therefore,
the average slope is derived from elevation maps with different resolutions. These different resolutions are
assumed to represent different data availability levels (e.g. at 1 km or 8 km resolution).
Appropriate scale for soil types
Figure 5 shows the frequency distribution of MAT1 and TEXT1 in the Meuse basin. The major dominant
parent materials are secondary limestone (18%, particularly in the southern part of the basin), residual and stony
loam (34%, in the Ardennes and central part) and loess (17%, in the northern part). The major surface textural
class is medium, which means 18% <clay <35% and sand >15% or clay <18% and 15% <sand <65%.
Variograms for the different soil types cannot be constructed, because soil data are categorical data.
Therefore, the semi-correlation is calculated and plotted versus separation distance in a semi-correlogram.
Figure 6 shows the semi-correlogram for the different classes of MAT1 and TEXT1. Obviously, the
general trend is a decreased semi-correlation with increased separation distance. This trend is approximately
exponential as confirmed by the high weighted mean (taking into account the frequencies) PPCCŁvalues of
0Ð93 for MAT1 and 0Ð98 for TEXT1. PPCCŁis defined as the correlation between minus the logarithm of the
semi-correlation and the distance, analogous to the probability plot correlation coefficient of Filliben (1975).
The semi-correlogram for secondary limestone in Figure 6a shows a slightly increasing semi-correlation with
increasing distance between 30 and 45 km. This may be attributed to large-scale spatial patterns of secondary
limestone. The semi-correlation decreases towards its ‘background’ frequency from Figure 5 at a distance
Copyright 2003 John Wiley & Sons, Ltd. Hydrol. Process. 17, 2581– 2598 (2003)
2590 M. J. BOOIJ
(a)
0.00
0.05
0.10
0.15
0.20
0.25
River alluvium
Limestone
Secondary
limestone
Marl
Secondary
marl
Secondary
clay
Residual clay
Clay with flints
Tertiary sands
Soft quartzy
sandstone
Residual loam
Stony loam
Sandy loam
Eolian loam
Loess
Acid crystalline
rocks
Granite
Schists
Frequency (-)
(b)
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
Coarse Medium Medium fine Fine No texture
Frequency (-)
Figure 5. Frequencies of (a) MAT1 and (b) TEXT1 in the Meuse basin
larger than the corresponding correlation length. The semi-correlograms can be used to derive the spatial
correlation lengths for the different soil types.
Figure 7 shows the spatial correlation lengths for the different classes of MAT1 and TEXT1. The correlation
lengths for MAT1 vary between 1 and 10 km for residual clay, clay with flints and schists to more than 30 km
for secondary limestone and secondary marl. The correlation lengths for TEXT1 vary between 15 and 35 km.
The horizontal grid spacing of the soil data is sufficient for most of the soil types, because their correlation
lengths are more than five times the grid spacing. The grid spacing is insufficient for the above-mentioned three
MAT1 classes with small correlation lengths and limestone. However, these MAT1 classes occupy less than
5% of the Meuse basin and therefore possible errors in estimated correlation lengths due to grid spacing can
be neglected. The weighted mean spatial correlation length for dominant parent material is about 25Ð1km.
Then, the appropriate scale for a good description of the parent material variability is about 20% of this
correlation length or 5Ð3 km. These figures are for the dominant surface textural class respectively 31Ð5km
and 6Ð6 km. The appropriate scale with respect to the parent material has been chosen to be representative
for the soil data, because it is the smallest one and it is supposed to be more suitable in the methodology for
the integration of appropriate scales. The correlation lengths for parent material and dominant surface textural
class are much larger than the correlation lengths for soil moisture found in the literature (50–1200 m). These
differences can be attributed to the additional variability for soil moisture introduced by e.g. topography and
vegetation on top of the variability of the parent material and soil texture itself.
The key variable soil is represented in Equations (8) (10) by the curve number CN. The average CN will
be derived from soil (MAT1) maps with different resolutions.
Copyright 2003 John Wiley & Sons, Ltd. Hydrol. Process. 17, 2581– 2598 (2003)
SPATIAL SCALES FOR RIVER BASIN MODELLING 2591
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 10000 20000 30000 40000 50000 60000 70000 80000
Separation distance (m)
0 10000 20000 30000 40000 50000 60000 70000 80000
Separation distance (m)
Semi-correlation (-)
River alluvium Limestone Secondary limestone Marl Secondary marl
Secondary clay Residual clay Clay with flints Tertiary sands Soft quartzy sandstone
Residual loam Stony loam Sandy loam Eolian loam Loess
Acid crystalline rocks Granite Schists
(b)
(a)
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Semi-correlation (-)
Coarse (clay < 18 % and sand > 65 %)
Medium (18% < clay < 35% and sand > 15%/ clay < 18% and 15% < sand < 65%)
Medium fine (clay < 35 % and sand < 15 %)
Fine (35 % < clay < 60 %)
No texture (histosols, ...)
Figure 6. Semi-correlation as a function of separation distance (m) for (a) MAT1 and (b) TEXT1
Appropriate scale for land use types
Figure 8 shows the frequency distribution of land use types in the Meuse basin. The land use types are
described in Bossard et al. (2000). In the Meuse basin, 29 of the 44 land use types in the CORINE database
are present. Urban, industrial and mining areas occupy about 9% of the space, agricultural areas 54%, forests
and scrubs about 36% and water-related areas about 1%. The Meuse north sub-basin has the largest fraction
of urban, industrial and mining areas (29%, not shown here), the northwestern Jeker and Mehaigne sub-basins
have the largest fraction of agricultural areas (80%) and the Viroin sub-basin has the largest fraction of forests
and scrubs (57%).
Land use data are categorical data as well and therefore semi-correlograms are computed for the different
land use types as shown in Figure 9. The correlograms can be approximated by exponential functions as
Copyright 2003 John Wiley & Sons, Ltd. Hydrol. Process. 17, 2581– 2598 (2003)
2592 M. J. BOOIJ
0
5000
10000
15000
20000
25000
30000
35000
River alluvium
Limestone
Secondary limestone
Marl
Secondary marl
Secondary clay
Residual clay
Clay with flints
Tertiary sands
Soft quartzy sandstone
Residual loam
Stony loam
Sandy loam
Eolian loam
Loess
Acid crystalline rocks
Granite
Schists
Spatial correlation
length (m)
0
5000
10000
15000
20000
25000
30000
35000
40000
Coarse Medium Medium fine Fine No texture
Correlation length (m)
(a)
(b)
Figure 7. Spatial correlation length (m) for (a) different MAT1 and (b) different TEXT1
confirmed by the reasonable weighted mean PPCCŁvalue of 0Ð89. Figure 9 shows expected features such as
the slow decrease of the semi-correlation with separation distance for broad-leaved forest and the fast decrease
of the semi-correlation with separation distance for continuous urban fabric. The ‘background’ frequency from
Figure 8 is reached at a large distance compared to the correlation length as shown for some land use types
with small correlation lengths (20 km).
Figure 10 shows the spatial correlation lengths for the different land use types in the Meuse basin. The
correlation lengths vary between 100 and 500 m for port areas, airports, rice fields and sclerophyllous veg-
etation to about 20 km for discontinuous urban fabric, broad-leaved forest and mixed forest. The horizontal
grid spacing of the CORINE land use data is sufficient for most of the land use types, except for the above-
mentioned four land use types with small correlation lengths and construction sites, sport and leisure facilities
and natural grassland. However, these seven land use types occupy less than 1% of the Meuse basin and
therefore possible errors in estimated correlation lengths due to grid spacing can be neglected. The weighted
mean spatial correlation length for land use is 15Ð7 km which gives an appropriate scale for a good description
of the land use variability of about 3Ð3 km. This appropriate scale is in the middle of the range of spatial
scales recommended for land use mapping and used in land use models.
The key variable land use is represented in Equations (8) (10) by the curve number CN as well. The
average CN will be derived from land use maps with different resolutions.
Copyright 2003 John Wiley & Sons, Ltd. Hydrol. Process. 17, 2581– 2598 (2003)
SPATIAL SCALES FOR RIVER BASIN MODELLING 2593
0
0.05
0.1
0.15
0.2
Continuous urban fabric
Discontinuous urban fabric
Industrial and commercial units
Road and rail networks
Port areas
Airports
Mineral extraction sites
Dump sites
Construction sites
Green urban areas
Sport and leisure facilities
Non-irrigated arable land
Rice fields
Fruit trees and berry plantations
Pastures
Annual crops
Complex cultivation patterns
Principally agriculture
Broad-leaved forest
Coniferous forest
Mixed forest
Natural grassland
Moors and heathland
Sclerophyllous vegetation
Transitional woodland-shrub
Inland marshes
Peatbogs
Water courses
Water bodies
Frequency (-)
Figure 8. Frequencies of land use types in the Meuse basin
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0 5000 10000 15000 20000
Separation distance (m)
Semi-correlation (-)
Continuous urban fabric Discontinuous urban fabric Industrial and commercial units Road and rail networks
Port areas Airports Mineral extraction sites Dump sites
Construction sites Green urban areas Sport and leisure facilities Non-irrigated arable land
Rice fields Fruit trees and berry plantations Pastures Annual crops
Complex cultivation patterns Principally agriculture Broad-leaved forest Coniferous forest
Mixed forest Natural grassland Moors and heathland Sclerophyllous vegetation
Transitional woodland-shrub Inland marshes Peatbogs Water courses
Water bodies
Figure 9. Semi-correlation as a function of separation distance (m) for different land use types
Integration of appropriate variable scales
Here, the appropriate scales for precipitation (20 km, see Booij, 2002c), elevation, soil and land use are
integrated to one appropriate model scale. This is achieved by applying the methodology described previously.
First, the relationships between the parameters from Equations (8)–(10) (e.g. the slope) and the spatial scales
Copyright 2003 John Wiley & Sons, Ltd. Hydrol. Process. 17, 2581– 2598 (2003)
2594 M. J. BOOIJ
0
5000
10000
15000
20000
25000
Continuous urban fabric
Discontinuous urban fabric
Industrial and commercial units
Road and rail networks
Port areas
Airports
Mineral extraction sites
Dump sites
Construction sites
Green urban areas
Sport and leisure facilities
Non-irrigated arable land
Rice fields
Fruit trees and berry plantations
Pastures
Annual crops
Complex cultivation patterns
Principally agriculture
Broad-leaved forest
Coniferous forest
Mixed forest
Natural grassland
Moors and heathland
Sclerophyllous vegetation
Transitional woodland-shrub
Inland marshes
Peatbogs
Water courses
Water bodies
Spatial correlation length (m)
Figure 10. Spatial correlation length (m) for land use types in the Meuse basin
of the key variables (e.g. the elevation) are considered. Second, these relationships are integrated into the
equations to obtain relations between the output and the spatial scales of the different key variables. The
output of interest is the peak discharge with a return period of 20 years, RV(20).
The key variable precipitation in Equation (8), Pe, should be the extreme precipitation with a return period
of 20 years, because a return period of 20 years is assumed for the discharge. The relation between precipi-
tation return values and scales has been theoretically considered and applied by Booij (2002c). This relation
can be used directly in Equation (8) to obtain a relationship between the output and spatial scales of extreme
precipitation.
0.0001
0.001
0.01
0.1
1
10 100 1000 10000 100000
Spatial scale (m)
Slope (-)
US 30 m DEM (5X)
US 1000 m DEM
Meuse 1000 m DEM
Figure 11. Average slope as a function of scale (m) for US 30m DEM, US 1000m DEM and Meuse 1000m DEM
Copyright 2003 John Wiley & Sons, Ltd. Hydrol. Process. 17, 2581– 2598 (2003)
SPATIAL SCALES FOR RIVER BASIN MODELLING 2595
The key variable elevation is represented in Equation (10) by the slope S0. The average slope as a func-
tion of scale derived from US 30m DEM, Meuse 1000m DEM and US 1000m DEM elevation maps and
their aggregated versions is given in Figure 11. Main features are the similarity of the US 1000m DEM and
Meuse 1000m DEM slope versus scale relationships and the apparent continuity of the relationships over
a broad range of scales (from 30 m to more than 30 km). The combination of these two aspects leads to
the assumption that the slope versus scale relationships from the US 30m DEM can also be used for the
Meuse basin and thus the appropriate scale for elevation (represented by slope) of about 100 m is covered by
these relationships. This coverage is necessary for the methodology for the integration of appropriate scales.
Namely, the scale–variable relation should at least cover the range between the appropriate variable scale
and the appropriate model scale or vice versa. This requirement has been checked after the determination of
the appropriate model scale. Another interesting feature in Figure 11 is the clear distinction between relations
for flat US regions (lower curves for US 30m DEM) and hilly US regions (upper).
The key variables soil and land use are represented in Equations (8)– (10) by the curve number CN. The
average, overall curve number for the Meuse basin was derived with Equations (8)– (10) and the variables
and parameters in Table I. The values for Peand qpwere derived keeping in mind the 20-year return period
and the scale for Pe, and the value for S0is estimated from the elevation field with the highest resolution
(1000 m). This results in an average CN of 77Ð1 for the soil and land use map with the highest resolution.
This average CN and standard tables (Maidment, 1992) have been used to estimate curve numbers for the
different soil and land use types. Finally, the average curve numbers as a function of map resolution for soil
parent material and land use could be determined (not shown).
The derived relationships between the SCS parameters and the scales of extreme precipitation, slope and
curve number (such as in Figure 11) can be implemented in Equations (8) (10) to obtain relationships between
the output and key variable scales. Figure 12 gives the dimensionless peak discharge [qp(spatial scale)/qp
(smallest spatial scale)] as a function of the spatial scale of the key variables. The relationships between SCS
parameters and variable scales were used as an intermediate step to obtain this relation between the peak
discharge (a function of SCS parameters) and variable scales. The peak discharge increases slightly with scale
for soils in contrast to the other key variables, which may be due to the spatial distribution of soil types.
Figure 12 has been used to assess the weights associated with an appropriate variable scale by comparing the
slopes of the different relationships. The larger the slope, the larger the weight which should be attributed to
a specific appropriate variable scale. The sum of the four weights (precipitation, elevation, soil type and land
use) is obviously equal to 1. The slope was determined for at least the range between the appropriate variable
scale and the appropriate model scale (checked a posteriori). The appropriate variable scales, the associated
weights and the resulting appropriate model scale are summarized in Table II. The appropriate model scale
for modelling the impact of climate change on flooding in the River Meuse is about 10 km. This scale will
result in 225–250 model cells or sub-basins for the Meuse basin.
Table I. Values for variables and parameters in
Equations (8)–(10)
Variable/parameter Unit Value
Pemm 44Ð6
qpm3/s 2390
Akm221 ð103
Tph24
lDpAkm 145
S0parts/10 000 176
CN – 77Ð1
Copyright 2003 John Wiley & Sons, Ltd. Hydrol. Process. 17, 2581– 2598 (2003)
2596 M. J. BOOIJ
0.0
0.2
0.4
0.6
0.8
1.0
1.2
0 5000 10000 15000 20000 25000 30000 35000
Spatial scale (m)
Dimensionless peak discharge
q
p
'
Precipitation
Elevation-Slope
Landuse-CN
Soil-CN
Figure 12. Dimensionless peak discharge q0
pas a function of the spatial scale (m) of the key variables
Table II. Appropriate variable scales, associated weights and appro-
priate model scale
Variable Appropriate variable scale Weight
(km) (–)
Precipitation 19Ð90Ð39
Elevation 0Ð10Ð26
Soil 5Ð30Ð21
Land use 3Ð30Ð14
Integrated 9Ð51Ð00
CONCLUSIONS
The idea of model appropriateness is introduced in a search for an optimum model complexity for a specific
research objective and area. The definition of an appropriate model is based on a consistency criterion implying
that a model should have scales and formulations at a balanced level. This criterion has been implemented in
a model appropriateness procedure, which was applied to assess the appropriate model scale. The processes
to be incorporated and the process formulations to be used were determined qualitatively by Booij (2002a).
The usefulness of the appropriateness procedure is thus in its ability to assess the appropriate scales
of the individual key variables before model construction and integrate them in a balanced way into an
appropriate model scale. The procedure will not result in a model for a specific situation with a prescribed
maximum uncertainty. Obviously, the model output uncertainty can be determined afterwards and compared
with accuracy criteria formulated by model users or policy makers. This output uncertainty should consist of
several uncertainties at a balanced level. Another use of the procedure is that it provides a framework for
decisions about the reduction or expansion of data networks and needs. Application of the framework to the
dominant variables in a specific situation may reveal where possible data inconsistencies (e.g. a too sparse
network) exist. Also, discrepancies between observations and model results may be explained with the help
of the procedure, as has been done for example by Booij (2002c). Obviously, the adequacy of the integration
function for a specific situation (the curve number method in this case) should be considered when applying
the appropriateness procedure.
Copyright 2003 John Wiley & Sons, Ltd. Hydrol. Process. 17, 2581– 2598 (2003)
SPATIAL SCALES FOR RIVER BASIN MODELLING 2597
The results may be generalized to other geographical areas and research problems. The appropriateness
procedure can in principle be applied to every arbitrary area in the world. The dominance of processes is area
dependent, but in general, similar variables play an important role in flood generating rainfall-runoff processes.
The appropriate scales of these variables depend on the region. For example, the appropriate spatial scale
for precipitation will be much smaller for tropical, mountainous areas with large orographical effects than
for temperate, relatively flat areas like the Meuse basin. Furthermore, the appropriate scales are influenced
by the size of the basin. Small catchments require smaller time scales than large basins and consequently
associated spatial scales will differ between small and large basins. The appropriate spatial scales were based
on a daily time scale in this research for the Meuse basin, whereas the determination of appropriate spatial
scales for a similar study in a research catchment of several square kilometres should be based on an hourly
or even minute time scale leading to smaller appropriate spatial scales. Similarly, scale analysis in a climate
change study for the Amazon may be based on a weekly or monthly time scale, probably resulting in larger
appropriate spatial scales. Appropriate process formulations will change with these changes in scale as they
are often scale dependent.
The type of research problem may influence the importance of processes and associated variables. For
example, when studying the impact of climate change on low flows, evapotranspiration and groundwater
flow become more important, whereas overland flow and subsurface storm flow may be less significant.
Consequently, other variables are dominant and other appropriate spatial and temporal scales will prevail.
Therefore, based on the results obtained here, only very general guidelines with respect to appropriate models
for other research objectives can be given.
ACKNOWLEDGEMENTS
The daily station climatological data for Belgium and France have been provided respectively by Luc
Debontridder from the KMI (Belgian Royal Meteorological Institute) and Christophe Dehouck from M´
et´
eo
France. Bob Jones of the European Soil Bureau and Malene Bruun of the European Environmental Agency
kindly supplied the soil and land use data, respectively. Eric Sprokkereef of RIZA gave the essential Meuse
basin data and Joop Gerretsen of Rijkswaterstaat Limburg made the discharge data available. The comments
of two anonymous reviewers and discussions with Kees Vreugdenhil of the University of Twente helped to
improve the manuscript substantially.
REFERENCES
Bl¨
oschl G. 1999. Scale issues in snow hydrology. Hydrological Processes 13: 2149 –2175.
Bl¨
oschl G, Sivapalan M. 1995. Scale issues in hydrological modelling: a review. Hydrological Processes 9: 251 –290.
Booij MJ. 2002a. Appropriate modelling of climate change impacts on river flooding . PhD thesis, University of Twente, Enschede.
Booij MJ. 2002b. Modelling the effects of spatial and temporal resolution of rainfall and basin model on extreme river discharge. Hydrological
Sciences Journal 47: 307 320.
Booij MJ. 2002c. Extreme daily precipitation in Western Europe with climate change at appropriate spatial scales. International Journal of
Climatology 22: 69–85.
Bossard M, Steenmans Ch, Feranec J, Otahel J. 2000. The revised and supplemented Corine land cover nomenclature. Technical Report 38,
European Environment Agency, Copenhagen.
Brasington J, Richards K. 1998. Interactions between model predictions, parameters and DTM scales for TOPMODEL. Computers &
Geosciences 24: 299– 314.
Bruneau P, Gascuel-Odoux C, Robin P, Merot Ph, Beven K. 1995. Sensitivity to space and time resolution of a hydrological model using
digital elevation data. Hydrological Processes 9: 69 –81.
Cosh MH, Brutsaert W. 1999. Aspects of soil moisture variability in the Washita ’92 study region. Journal of Geophysical Research 104:
19 751– 19 757.
Farajalla NS, Vieux BE. 1995. Capturing the essential spatial variability in distributed hydrological modelling: infiltration parameters.
Hydrological Processes 9: 55 –68.
Filliben JJ. 1975. The probability plot correlation coefficient test for normality. Technometrics 17: 111– 117.
Ghosh B. 1951. Random distances within a rectangle and between two rectangles. Bulletin of the Calcutta Mathematical Society 43: 17 24.
Copyright 2003 John Wiley & Sons, Ltd. Hydrol. Process. 17, 2581– 2598 (2003)
2598 M. J. BOOIJ
IPCC. 2001. Climate change 2001: The scientific basis. Contribution of Working Group I to the Third Assessment Report of the
Intergovernmental Panel on Climate Change, Houghton JT, Ding Y, Griggs DJ, Noguer M, van der Linden PJ, Dai X, Maskell K,
Johnson CA (eds). Cambridge University Press: Cambridge.
Kent KM. 1972. Travel time, time of concentration and lag. In SCS National Engineering Handbook, section 4 Hydrology . USDA:
Washington, DC.
King D, Daroussin J, Tavernier R. 1994. Development of a soil geographical database from the soil map of the European Communities.
Catena 21: 37– 56.
Kitanidis PK. 1997. Introduction to Geostatistics: Applications in Hydrogeology. Cambridge University Press: Cambridge.
Maidment DR (ed.). 1992. Handbook of Hydrology. McGraw-Hill: New York.
Mauser W, Sch¨
adlich S. 1998. Modelling the spatial distribution of evapotranspiration on different scales using remote sensing data. Journal
of Hydrology 212: 250 –267.
Moody A, Woodcock CE. 1995. The influence of scale and the spatial characteristics of landscapes on land-cover mapping using remote
sensing. Landscape Ecology 10: 363 379.
Rodriguez-Iturbe I, Mejia JM. 1974. On the transformation from point rainfall to areal rainfall. Water Resources Research 10: 729 735.
US Geological Survey. 1993. Digital elevation models . Data user guide 5. US Geological Survey: Reston.
US Geological Survey. 1995. Digital elevation model data from 7.5-minute quadrangle maps. US Geological Survey: Sioux Falls.
US Geological Survey. 1996. Global 30-arc second elevation data set . US Geological Survey: Sioux Falls.
Veihe A, Quinton J. 2000. Sensitivity analysis of EUROSEM using Monte Carlo simulation I: hydrological, soil and vegetation parameters.
Hydrological Processes 14: 915 –926.
Walsh SJ, Crawford TW, Welsh WF, Crews-Meyer KA. 2001. A multiscale analysis of LULC and NDVI variation in Nang Rong district,
northeast Thailand. Agriculture, Ecosystems & Environment 85: 47 –64.
Wes te rn AW , Bl ¨
oschl G. 1999. On the spatial scaling of soil moisture. Journal of Hydrology 217: 203– 224.
Wes te rn AW , B l ¨
oschl G, Grayson RB. 1998. Geostatistical characterisation of soil moisture patterns in the Tarrawarra catchment. Journal
of Hydrology 205: 20 –37.
Wolock DM, McCabe GJ. 2000. Differences in topographic characteristics computed from 100- and 1000-m resolution digital elevation
model data. Hydrological Processes 14: 987 –1002.
Copyright 2003 John Wiley & Sons, Ltd. Hydrol. Process. 17, 2581– 2598 (2003)
... The temporal and spatial scales should be detailed enough to capture the dominant processes and watershed characteristics variability. At the same time, computational time should be considered to avoid unnecessarily refined resolution (Booij, 2003). Temporal scaling entails time intervals of observations and temporal variability of processes, which determines the temporal resolution of the model and the duration of simulation period. ...
... Therefore, the primary variables selected are precipitation, soil type, elevation, and land use. Booij (2003) suggested the spatial scales of daily precipitation, soil, elevation, and land use as 19.9 km, 5.3 km, 0.1 km, 3.3 km, respectively. Based on the relative weight of these variables on flood events, a 9.5 km resolution was suggested to capture the flood model response under climate change. ...
Article
Full-text available
The main mission of natural resources conservation and management (NRCM) is to restore and protect the productivity of land through technical conservation practices, education, and outreach activities. Pursuit of this mission improves soil health, water quality, air quality, wildlife, wetlands, and local economy; and should promote partnership among farming community, and provincial and central government agencies. In other words, NRCM helps keep the entire ecosystem healthy by keeping farms, ranches, forest lands, rivers, lakes, wetlands, wildlife habitats, and the environment healthy. This is accomplished through planning and execution at the watershed scale or watershed management. Fundamental to watershed management is hydrology and hydrologic modeling. This paper attempts to sketch the role of hydrology in NRCM through watershed management.
... The major drawback of distributed models is their large demand for high-resolution data for the model set-up and operation and a CPU demand that rapidly grows with system resolution. The question about the optimal balance of spatial resolution and computational burden has therefore been a long-standing issue (not only) in the hydrological sciences (Melsen et al., 2016;Liu et al., 2016;Dehotin and Braud, 2008;Booij, 2003;Gharari et al., 2020). In this context, a range of methods has been proposed to address the computational problem; it can either be crushed by massive parallel computing (Kollet, 2010) or reduced by avoiding redundant computations. ...
... At last we have tested adaptive clustering with the example of a relatively simple conceptual hydrological model with limited internal variability. The performance of adaptive clustering in more advanced models such as MIKE SHE (Abbott et al., 1986), HydroGeoSphere (HGS;Brunner and Simmons, 2011;Davison et al., 2018), the Noah-MP land surface model (LSM; Niu et al., 2011), or the community land model (CLM; Lawrence et al., 2019) where computation times are indeed a challenge -remains to be demonstrated. On the one hand, the potential savings by adaptive clustering will increase with the level of process detail in a model. ...
Article
Full-text available
In this paper we propose adaptive clustering as a new method for reducing the computational efforts of distributed modelling. It consists of identifying similar-acting model elements during runtime, clustering them, running the model for just a few representatives per cluster, and mapping their results to the remaining model elements in the cluster. Key requirements for the application of adaptive clustering are the existence of (i) many model elements with (ii) comparable structural and functional properties and (iii) only weak interaction (e.g. hill slopes, subcatchments, or surface grid elements in hydrological and land surface models). The clustering of model elements must not only consider their time-invariant structural and functional properties but also their current state and forcing, as all these aspects influence their current functioning. Joining model elements into clusters is therefore a continuous task during model execution rather than a one-time exercise that can be done beforehand. Adaptive clustering takes this into account by continuously checking the clustering and re-clustering when necessary. We explain the steps of adaptive clustering and provide a proof of concept at the example of a distributed, conceptual hydrological model fit to the Attert basin in Luxembourg. The clustering is done based on normalised and binned transformations of model element states and fluxes. Analysing a 5-year time series of these transformed states and fluxes revealed that many model elements act very similarly, and the degree of similarity varies strongly with time, indicating the potential for adaptive clustering to save computation time. Compared to a standard, full-resolution model run used as a virtual reality “truth”, adaptive clustering indeed reduced computation time by 75 %, while modelling quality, expressed as the Nash–Sutcliffe efficiency of subcatchment runoff, declined from 1 to 0.84. Based on this proof-of-concept application, we believe that adaptive clustering is a promising tool for reducing the computation time of distributed models. Being adaptive, it integrates and enhances existing methods of static grouping of model elements, such as lumping or grouped response units (GRUs). It is compatible with existing dynamical methods such as adaptive time stepping or adaptive gridding and, unlike the latter, does not require adjacency of the model elements to be joined. As a welcome side effect, adaptive clustering can be used for system analysis; in our case, analysing the space–time patterns of clustered model elements confirmed that the hydrological functioning of the Attert catchment is mainly controlled by the spatial patterns of geology and precipitation.
... Some researchers have argued that complex physically based distributed models can simulate the observed streamflow better than simple conceptual models (e.g., Refsgaard 1996;Reggiani and Schellekens 2003), but others argue against this and concluded that complex models do not lead to better results (e.g., Booij 2003;Te Linde et al. 2008;Ponce and Hawkins 1996;Uhlenbrook 2003). Distributed physically based models are valuable when either the study involves spatial scenarios or there are observed data related to the hydrological variables at the local interior locations within the watershed (Pokhrel and Gupta 2011;Smith et al. 2012). ...
... It is advised that the selection criteria of model type should be based on the availability of the forcing data, watershed scale, driving processes, and application of the model. This can result in using a more simplified and accurate model for watershed streamflow simulation (Booij 2003). ...
Article
Full-text available
The prairie region in Canada has been characterized as a graveyard of hydrological models due to its challenging cold-regions processes and complex landscape with numerous land depressions that influence runoff pathways. Efforts were made at the small-basin scale to propose new algorithms and/or modify existing physically based hydrological models in order to achieve some semblance of a coherent mathematical runoff modeling system. To date, there has been very little research on modifying conceptual bucket-type models to include lateral pothole flow complexities for peak-flow estimation. In this study, the conceptual Hydrologiska Byråns Vattenbalansavdelning (HBV)- light model is modified to work in the prairies by incorporating a conceptual lateral-flow component to represent the pothole storage complexities. The modification of the HBV-light model resulted in a HYdrological model for Prairie Region (HYPR) that can be used for prairie streamflow simulation. The traditional HBV-light and HYPR conceptual models are tested on different pothole-dominated watersheds within the Qu’Appelle River Basin in Saskatchewan, Canada. The incorporation of a pothole storage-modeling component in HYPR results in a better streamflow simulation than that of HBV-light. Also, a new approach is proposed in this study to better identify the proper calibration period to arrive at a successful streamflow simulation. Although HYPR’s processes representation is simplified, the model shows potential for simulating the overall hydrograph and peak flows. HYPR shows strengths as a possible tool for operational and flood prediction purposes in the prairies, especially when data are limited. DOI: 10.1061/(ASCE)HE.1943-5584.0001922. © 2020 American Society of Civil Engineers.
... km 2 ). The spatial resolution of 30 m DEM can be deemed satisfactory (Booij 2003;Alwan et al., 2020;Mahmood et al., 2020), as it strikes a balance between data availability and computational efficiency, making it a reasonable choice for the study. Using a higher resolution dataset would result in a much larger amount of data, which could be difficult to process and analyze (Carrara et al., 1999;De Winnaar et al., 2007;Yilmaz 2009). ...
Article
Water scarcity is one of the most pressing concerns facing the world in the 21st century. The groundwater reserves are being depleted at a much faster rate than they can be replenished by precipitation. This problem can be avoided through the proper identification of rainwater harvesting sites and management of the water resources. Kumari River basin is a semi-arid, hard rock landscape where water scarcity has been a serious concern for decades. The main objective of this research is to identify suitable sites for surface rainwater harvesting (SRWH) in Kumari River basin using Geographic Information Systems (GIS) and the Analytic Hierarchy Process (AHP) techniques. To estimate the SRWH sites in the Kumari basin, 12 factors are taken into consideration. The suitability map is classified into 4 classes of which 20.26% show high suitability zones, 32.74% are moderately suitable zones, 29.40% are less suitable zones and 17.60% are unsuitable zones. The validation has been done by Receiver Operator Characteristic (ROC) curve using the 52 existing check dam sites, whose predictive accuracy value is 87.4 %, which indicates an excellent accuracy rate of the model. Through this study, sites for the construction of SRWH structures, including 32 suitable check dam sites, 17 earthen dam sites, 35 percolation tank sites, 24 farm pond sites, and 19 gulley plug sites, have been proposed. The findings of this study will help policymakers and planners in developing sustainable water resources management plans and scientific strategies to enhance irrigational potentiality.
... Also, to extract the runoff channels and watershed delineation, digital elevation model (DEM) in 30 m resolution (NASA) was used. It should be noted that spatial resolution is defined as a pixel (the smallest optical satellite image) on Earth, which is ranked low, medium, and high on a temporary basis, as imaging technology is constantly evolving (Booij, 2003). Thus, the 30 m resolution, meaning each pixel stands for a 30 m * 30 m area on the Earth. ...
... Several regions have successfully applied TRMM-3B42 and GPM-3IMERGDF satellite rainfall products as inputs for hydrological modeling of mountainous areas. Complex hydrological modeling does not always give better results (Booij, 2003;Linde et al., 2007). However, the selection of an appropriate model was also important to produce an accurate hydrological model response. ...
Article
Full-text available
In Indonesia, ground-based rainfall monitoring is uneven and sometimes lacks continuity especially in small watersheds, which makes hydrological modeling difficult. This paper aims to the performance evaluation of the HBV Light model from the manual measurement of rainfall (MMR), Global Precipitation Measurement (GPM-3IMERGDF), and Tropical Rainfall Measuring Mission (TRMM-3B42) as input for the hydrological model. The Hydrologiska Byrans Vattenbalansavdelning (HBV) Light hydrological model is applied to three small watersheds, namely Sampean Baru, Bedadung, and Mayang. The model’s performance evaluation is assessed based on the correlation between the average rainfall data for the satellite product area and the MMR product, the stationarity of the rainfall and discharge data, and the model accuracy. The model simulation results show that the MMR rainfall in all watersheds provides a better discharge response than the other two products. Meanwhile, the simulation model of the GPM-3IMERGDF satellite product is slightly better than TRMM-3B42. The stationarity test of rainfall and discharge data needs to be enforced before modeling.
... According to Van Dijk et al. (2007), a larger slope results in shallower soils, less infiltration and therefore more runoff. The impacts of topographic variation on hydrology were also investigated by, for instance, Booij (2003), Hudson and Colditz (2003) and Romshoo et al. (2012), with results similar to ours. ...
Article
Full-text available
This study investigated the influence of land use change and climate change on changes in streamflow in the Bengawan Solo River (BSR, 16 389 km²) basin, Indonesia. We applied the excess water and energy approach to three tributaries of the BSR, namely Pidekso, Madiun and Kening rivers, which represent the up-, mid-, and downstream parts of the BSR, respectively. This study used hydrological data from 1982−2013 and land use data for the years 1994 and 2013. The results show that from 1982 to 2013, the annual streamflow of the Pidekso and Madiun River increased, while in the Kening River it decreased. Based on attribution measures, the increasing streamflow trend in the Pidekso and Madiun rivers is mainly attributed to land use change, while the decreasing streamflow trend in the Kening River is mainly attributed to climate change. These conclusions are supported by statistical analyses and land use change analysis.
... Because of the relatively short calibration period and the fact that VIC has performed better in studies other than this one (Liang et al., 1994), the results must be considered as preliminary. The results, though, support the notion that even for a well documented river basin such as the Rhine, the available approaches are still far from providing a satisfactory representation of rainfall-runoff transformation and that more complex modelling does not always lead to better results (Booij, 2003;Uhlenbrook, 2003). Calibration and comparison of the model performance of VIC and HBV can be refined at the sub-basin level for the Rhine basin. ...
Article
Full-text available
The general idea exists among hydrologists that physically-based distributed modelling better represents observed discharges as compared to lumped model approaches. In this paper, the hydrological models HBV and VIC were compared for the Rhine basin by testing their performance for simulating discharge. Overall, the semi-distributed lumped conceptual HBV model performed much better than the distributed physically-based VIC model. It is argued here that, even for a well documented river basin, such as the Rhine, the available approaches are still far from providing a satisfactory representation of the rainfall-runoff transformation and that more complex modelling does not always lead to better results. Moreover, it is concluded that deviations between observed and simulated discharge in many cases seem not to result from a structural problem in model definition, but from errors or deviations in forcing data.
Article
Full-text available
Introduction One of the common methods for quantitative analysis using hydrological models is the watershed. A class of these models used to simulate rainfall-runoff process. A hydrological rainfall-runoff model with time series data integration, parameters, variables area and physical laws in the system catchment, simulation, runoff and other processes possible. Flood hydrograph is a graphical representation and major hydrological in the analysis. The response basin is directly related in form hydrograph With the management. In fact, watershed management will not be possible unless hydrological characteristics basin for understand the system predictable. HEC-HMS model is of process simulation models of rainfall - runoff process. HEC-HMS model provided by the Engineering Hydrology America's Army. This model also has the ability to run together integrated and distribution. Most research in the field of calibration of hydrological models with local parameters but if the parameters are calibrated at larger scales can be generalized the results to regions where no data.Some researchers have done research in this field have to go through that after identifying sensitive parameters, the parameters are calibrated as regional and according to local parameters, are obtained large scale parameters and calibrated. The aim of this study was to compare local and large scale calibration. Study area Kohsukhteh watershed is geographically positioned between 50 °40' to 51 ° 20 ' East longitude and 31° 20' North and 32 ° latitude is located. The study area is an area of 2783 square kilometers, minimum height of 1705 meters in area and the maximum height of 3398 meters and the average slope is 19%. The mean annual precipitation in Shahrekord synoptic stations 320 mm. Matarials and method The data preparation in the initial stage of extension HEC-GEOHMS, based on the digital elevation model (DEM) and through different stages, physical characteristics of the basin prepared and HEC-HMS model was introduced. Model parameters local and also by taking a fixed amount for each sub-basin in large scale, were calibrated. Parameters related to loss method, transfer method and routing calculation and inter the model. In the meteorological model and control specifications necessary mechanisms for run was done , in the time series section rainfall data of 6 hours and 6 hours discharge is used. In order to identify sensitive parameters, parameter sensitivity analysis was performed. To evaluate the performance of models with different scenarios, 6 events for the calibration and 2 event for validation of the model was chosen . Result and discussion basin model map was obtained using HEC-GeoHMS extension and the resulting map as the first step in modeling, HEC-HMS program has been entered. Sensitivity analysis on the parameters of the model show that Lag time parameter and initial abstraction has high sensitivity. Event 5 Dey 91 highest accuracy and 11th Farvardin 91 were least accurate local calibration. The reason for this difference may change the permeability soil in winter and spring. In the large scale calibration in the event 5 Dey 91 high performance is observed. In large scale calibration parameters After selecting parameters in the sensitivity analysis lag time, initial abestraction, curve number and impervious were calibration. In assessing performance results indicator Nash-Sutcliffe for local calibration 0.85 and large scale calibration 0.65, showed that the domain is configured to simulate. Conclusion The results of the model HEC-HMS rainfall-runoff model in the study, indicates that the use local calibration is more than appare accuracy large scale calibration and this is according to research Samaniego and et al (2010: 22) and Hundecha and Bardossy(2004:293).Nash-sutcliffe index in the calibration of large scale calibration less than variability local calibration. . Both calibration acceptable responses were in runoff while obtained parameters in large scale or public calibration is less sensitive location. This makes generalized calibration results in areas where no data increases. Thus, as the second part of this study, it was shown calibration using large scale parameters for distribution projects and research suitable but local method was more effective . The results Parajeka et al (2005: 168) is coordinated. In general, using large scale parameters save time that the results of Troy (2008: 8), Pokhrel and Gupta (2008: 11) and Beven (2001: 9) are consistent.
Article
Full-text available
Topographic characteristics computed from 100- and 1000-m resolution digital elevation model (DEM) data are compared for 50 locations representing varied terrain in the conterminous USA. The topographic characteristics are three parameters used extensively in hydrological research and modelling—slope (S), specific catchment area (As) and a wetness index computed as the logarithm of the specific catchment area divided by slope [ln(As/S)]. Slope values computed from 1000-m DEMs are smaller than those computed from 100-m DEMs; specific catchment area and the wetness index are larger for the 1000-m DEMs compared with the 100-m DEMs. Most of the differences between the 100- and 1000-m resolution DEMs can be attributed to terrain-discretization effects in the computation of the topographic characteristics and are not the result of smoothing or loss of terrain detail in the coarse data. In general, the terrain-discretization effects are greatest on flat terrain with long length-scale features, and the smoothing effects are greatest on steep terrain with short length-scale features. For the most part, the differences in the average values of the topographic characteristics computed from 100- and 1000-m resolution DEMs are predictable; that is, biases in the mean values for the characteristics computed from a 1000-m DEM can be corrected with simple linear equations. Copyright © 2000 John Wiley & Sons, Ltd.
Conference Paper
Full-text available
How good should a river basin model be to assess the impact of climate change on river flooding for a specific geographical area? The determination of such an appropriate model should reveal which physical processes should be incorporated and which data and mathematical process descriptions should be used at which spatial and temporal scales. A procedure for determining an appropriate model has been developed and applied to the above mentioned specific problem for the Meuse river in France, Belgium and the Netherlands. The model appropriateness procedure consists of three steps. First, the dominant processes and associated key variables are identified. Second, statistical analyses with respect to the key variables are performed, which result in appropriate spatial and temporal scales for each key variable and relationships between key variable scales and the output variable. These latter relationships are used to combine the appropriate scales to one appropriate model scale. In the third step, mathematical process descriptions consistent with these model scales are selected. The resulting appropriate components have been implemented in an existing modelling framework to obtain the appropriate model. Two additional models were constructed to assess the sensitivity of the results to model complexity. The appropriate spatial model scale turned out to be around 10 km with a daily time step. The model results became somewhat better with increasing model complexity. The general trend with climate change (doubling of the CO 2 concentration) is a small decrease (5 %) of the average discharge and a small increase (5-10 %) of discharge variability and extreme discharges. It was found that the uncertainties in extreme discharges with climate change are large and that those due to precipitation and extrapolation errors are the most important ones.
Thesis
Full-text available
How good should a river basin model be to assess the impact of climate change on river flooding for a specific geographical area? The determination of such an appropriate model should reveal which physical processes should be incorporated and which data and mathematical process descriptions should be used at which spatial and temporal scales. A procedure for determining an appropriate model has been developed and applied to the above mentioned specific problem for the Meuse river in France, Belgium and the Netherlands. The model appropriateness procedure consists of three steps. First, the dominant processes and associated key variables are identified. Second, statistical analyses with respect to the key variables are performed, which result in appropriate spatial and temporal scales for each key variable and relationships between key variable scales and the output variable. These latter relationships are used to combine the appropriate scales to one appropriate model scale. In the third step, mathematical process descriptions consistent with these model scales are selected. The resulting appropriate components have been implemented in an existing modelling framework to obtain the appropriate model. Two additional models were constructed to assess the sensitivity of the results to model complexity. The appropriate spatial model scale turned out to be around 10 km with a daily time step. The model results became somewhat better with increasing model complexity. The general trend with climate change (doubling of the CO 2 concentration) is a small decrease (5 %) of the average discharge and a small increase (5-10 %) of discharge variability and extreme discharges. It was found that the uncertainties in extreme discharges with climate change are large and that those due to precipitation and extrapolation errors are the most important ones. 1 INTRODUCTION Global climate change induced by increases in greenhouse gas concentrations is likely to increase temperatures, change precipitation patterns and probably raise the frequency of extreme events [IPCC, 2001]. This may have serious impacts on society, e.g. through an increased occurrence of flooding events. A broad palette of models is available to assess these impacts. In general, models should be sufficiently detailed to capture the dominant processes and natural variability, but not unnecessarily refined that computation time is wasted. Therefore, the question is what is an appropriate river basin model to assess the impact of climate change on river flooding for a specific geographical area? Different approaches with respect to model appropriateness have been suggested in literature. They can be classified according to the specific part of the model which is evaluated, such as the output, processes, formulations and scales. An example of a scale-related approach is the Representative Elementary Area (REA) concept in catchment hydrology [Wood et al., 1988]. A REA can be considered as an appropriate scale at which a simple description of the rainfall-runoff process could be obtained. However, this and many other approaches employ a specific model in their procedure. Moreover, they only consider a specific part of the appropriateness problem. Preferably, the components of an appropriate model are determined in an integrated way before model construction and application. The determination should be dependent on the research objective and area considered.
Article
The concept of scale can be used to quantify characteristic lengths of (a) a natural process (such as the correlation length of the spatial snow water equivalent (SWE) variability); (b) a measurement (such as the size of a snow density sample or the footprint of a satellite sensor), and (c) a model (such as the grid size of a distributed snow model). The different types of scales are denoted as process scale, measurement scale and model scale, respectively. Interpolations, extrapolations, aggregations, and disaggregations are viewed as a change in model scale and/or measurement scale. In a first step we examine, in a linear stochastic analysis, the effect of measurement scale and model scale on the data and the model predictions. It is shown that the ratio of the measurement scale and the process scale, and the ratio of the model scale and the process scale are the driving parameters for the scale effects. These scale effects generally cause biases in the variances and spatial correlation lengths of satellite images, field measurements, and simulation results of snow models. It is shown, by example, how these biases can be identified and corrected by regularization methods. At the core of these analyses is the variogram. For the case of snow cover patterns, it is shown that it may be difficult to infer the true snow cover variability from the variograms, particularly when they span many orders of magnitude. In a second step we examine distributed snow models which are a non-linear deterministic approach to changing the scale. Unlike in the linear case, in these models a change of scale may also bias the mean over a catchment of snow-related variables such as SWE. There are a number of fundamental scaling issues with distributed models which include subgrid variability, the question of an optimum element size, and parameter identifiability. We give methods for estimating subgrid variability. We suggest that, in general, an optimum element size may not exist and that the model element scale may in practice be dictated by data availability and the required resolution of the predictions. The scale effects in distributed non-linear models can be related to the linear stochastic case which allows us to generalize the applicability of regularization methods. While most of the paper focuses on physical snow processes, similar conclusions apply and similar methods are applicable to chemical and biological processes. Copyright (C) 1999 John Wiley & Sons, Ltd.
Article
A general methodology is developed for the transformation of point rainfall to areal rainfall. The reduction factor is shown to depend solely on the expected correlation coefficient between the point rainfall at two randomly chosen points in the area in consideration. The methodology can be used to characterize the input to rainfall-runoff models, and it includes the case in which multiple inputs are used in the model in the form of a subdivision in modules of the whole catchment. An example is worked out discussing the different methodologies for the estimation of total mean volume of rainfall over an area.
Article
The Washita '92 experiment, which took place in the Little Washita River watershed in Oklahoma, generated maps describing the volumetric surface soil moisture field during an 8-day drying period. The Little Washita is part of the Red River basin, a major tributary of the Lower Mississippi. A semivariogram analysis revealed that the surface soil moisture field of this 18km×25km study region as a whole was spatially nonstationary and exhibited certain fractal characteristics. However, for smaller subareas within the study region, it was determined that the moisture field could be considered stationary for distances smaller then 5 km, with ranges of approximately 1 km over which surface soil moisture was spatially correlated. A semivariogram analysis for each of the individual soil types over the entire Washita '92 region revealed that surface soil moisture was stationary within each soil type, again with a range of correlation of approximately 1 km. This indicates that soil moisture variability is strongly affected by soil type.