ArticlePDF Available

Influence of Irrigation Drivers Using Boosted Regression Trees: Kansas High Plains

Wiley
Water Resources Research
Authors:

Abstract and Figures

Groundwater levels across parts of western Kansas have been declining at unsustainable rates due to pumping for agricultural irrigation despite water‐saving efforts. Accelerating this decline is the complex agricultural landscape, consisting of both categorical (e.g., management boundaries) and numerical (e.g., crop prices) factors that drive irrigation decisions, making integrated water budget management a challenge. Furthermore, these factors frequently change through time, rendering management strategies outdated within relatively short time scales. This study uses boosted regression trees to simultaneously analyze categorical and numerical data against annual irrigation pumping to determine the relative influence of each factor on groundwater pumping across both space and time. In all, 45 key water use variables covering approximately 19,000 groundwater wells were tested against irrigation pumping from 2006 to 2016 across five categories: (1) management/policy, (2) hydrology, (3) weather, (4) land/agriculture, and (5) economics. Study results showed that variables from all five categories were included among the top 10 drivers to irrigation, and the greatest influence came from variables such as irrigated area per well, saturated thickness, soil permeability, summer precipitation, and pumping costs (depth to water table). Variables that had little influence included regional management boundaries and irrigation technology. The results of this study are further used to target the factors that statistically lead to the greatest volumes of groundwater pumping to help develop robust management strategy suggestions and achieve water management goals of the region.
This content is subject to copyright. Terms and conditions apply.
1. Introduction
Groundwater level decline of the High Plains Aquifer (HPA) underlying western Kansas has been exten-
sively analyzed with most efforts focused on irrigation related to one of four domains: (1) physical (e.g.,
climatic and atmospheric factors and underlying geology; Edwards,2016; MardanDoost etal.,2019; Whit-
temore etal.,2016), (2) agricultural (e.g., crop type, irrigation limits and management, cover crops, and
irrigation efficiency; Butler etal.,2016; Kisekka etal.,2017; Pfeiffer & Lin,2014), (3) socioeconomic (e.g.,
environmental policies, energy costs, irrigation efficiency, global markets, and crop prices; Hrozencik
etal.,2017; Sanderson etal.,2017; Sukcharoen etal.,2020), or (4) an integrated approach among the first
three categories (e.g., Haacker etal.,2019; Majumdar etal.,2020; Smidt etal.,2016). While these studies
provide a comprehensive overview of the complexity of factors that drive irrigation use, little is known
about the relationship between these factors across both space and time relative to irrigation pumping. For
example, studies have shown that irrigation efficiency drives groundwater use (Pfeiffer & Lin,2014), yet it
cannot conclusively be determined that efficiency is more or less influential than annual weather patterns
in driving total withdrawals from the aquifer. The same applies for management strategies versus crop
prices or any other combination of factors compared to others. So, while there is a clear understanding of
Abstract Groundwater levels across parts of western Kansas have been declining at unsustainable
rates due to pumping for agricultural irrigation despite water-saving efforts. Accelerating this decline
is the complex agricultural landscape, consisting of both categorical (e.g., management boundaries)
and numerical (e.g., crop prices) factors that drive irrigation decisions, making integrated water budget
management a challenge. Furthermore, these factors frequently change through time, rendering
management strategies outdated within relatively short time scales. This study uses boosted regression
trees to simultaneously analyze categorical and numerical data against annual irrigation pumping to
determine the relative influence of each factor on groundwater pumping across both space and time.
In all, 45 key water use variables covering approximately 19,000 groundwater wells were tested against
irrigation pumping from 2006 to 2016 across five categories: (1) management/policy, (2) hydrology,
(3) weather, (4) land/agriculture, and (5) economics. Study results showed that variables from all five
categories were included among the top 10 drivers to irrigation, and the greatest influence came from
variables such as irrigated area per well, saturated thickness, soil permeability, summer precipitation, and
pumping costs (depth to water table). Variables that had little influence included regional management
boundaries and irrigation technology. The results of this study are further used to target the factors that
statistically lead to the greatest volumes of groundwater pumping to help develop robust management
strategy suggestions and achieve water management goals of the region.
Plain Language Summary Water use for crops has lowered groundwater levels in western
Kansas. Past studies have shown that this water use is driven by many factors spanning policy, economics,
and the physical environment. Because of this complexity, it has been difficult to fully understand which
factors most drive irrigation use relative to each other. This study uses a machine-learning model to
rank the influence of 45 factors on irrigation pumping. These factors are analyzed over space (19,000
wells across western Kansas) and time (2006–2016). Based on this study, drivers to water use include
total irrigated area, summer rainfall, and depth to the water table. Factors that have little influence
include management district boundaries and irrigation system type. These results are used to make water
management suggestions for the region.
LAMB ET AL.
© 2021. The Authors.
This is an open access article under
the terms of the Creative Commons
Attribution License, which permits use,
distribution and reproduction in any
medium, provided the original work is
properly cited.
Influence of Irrigation Drivers Using Boosted Regression
Trees: Kansas High Plains
Susan E. Lamb1 , Erin M. K. Haacker2 , and Samuel J. Smidt1
1Soil and Water Sciences Department, University of Florida, Gainesville, FL, USA, 2Department of Earth &
Atmospheric Sciences, University of Nebraska-Lincoln, Lincoln, NE, USA
Key Points:
The relative influence of drivers
to irrigation pumping in western
Kansas is modeled from 2006 to 2016
using boosted regression trees
Site-specific factors are more
influential than regional policies,
and weather-related factors are more
influential at longer time scales
Irrigation decision making largely
follows resource availability and
operation cost considerations over
localized management schemes
Supporting Information:
Supporting Information may be found
in the online version of this article.
Correspondence to:
S. E. Lamb,
susan.lamb@ufl.edu
Citation:
Lamb, S. E., Haacker, E. M. K., & Smidt,
S. J. (2021). Influence of irrigation
drivers using boosted regression trees:
Kansas High Plains. Water Resources
Research, 57, e2020WR028867. https://
doi.org/10.1029/2020WR028867
Received 18 SEP 2020
Accepted 14 MAR 2021
10.1029/2020WR028867
Special Section:
The Quest for Sustainability
of Heavily Stressed Aquifers at
Regional to Global Scales
RESEARCH ARTICLE
1 of 16
Water Resources Research
the driving factors that lead to irrigation use, both individually and as a
group (e.g., Haacker etal.,2019; Smidt etal.,2016), uncertainty still exists
in understanding the spatiotemporal influence of how these factors work
together to influence irrigation decision making. Measurement is a hall-
mark of precision agriculture and other targeted management schemes
(e.g., deficit irrigation management; Rudnick etal.,2019), as farmers can
often better manage the resources that they can measure. The uncertain-
ty surrounding the relative influence (RI) of irrigation drivers leads to
mismanagement and continued groundwater level decline for the region
(Smidt etal.,2016).
Past studies analyzing the relationships between irrigation drivers across
space and time have been limited as drivers are both numerical (e.g., av-
erage temperature and crop price) and categorical (e.g., crop type and
Groundwater Management District [GMD]). One approach to overcom-
ing these methodological data challenges has been to analyze numerical
data by category and then compare results (e.g., on a GMD by GMD basis;
Whittemore etal., 2016). This can be useful for resource management,
since the boundaries of the GMDs correspond to some physical features
of the aquifer, and these analyses provide results at the same scale at
which management programs are enacted. However, this limits modeled
relationships by (1) preventing categorical data from interacting directly
with numerical data or (2) minimizing the scale at which drivers can im-
pact aquifer use. Fortunately, advances in machine-learning techniques,
namely boosted regression trees (BRT; Elith etal.,2008), have allowed
for improved analysis when grouping disparate data. BRT is a statisti-
cal ensemble that combines regression trees and data boosting to define
relationships between variables, including the simultaneous analysis of
categorical and numerical variables (Elith etal.,2008). When applied to
irrigation in western Kansas, BRT can accurately characterize the relative influence of sociophysical fac-
tors on annual irrigation pumping and offer strong predictive power for estimating future water use (Elith
etal.,2008; Hu etal.,2017).
Here, we use BRT analysis to quantify the predominant drivers to irrigation use in western Kansas from
2006 to 2016 to (1) improve understanding of how these drivers relate, (2) develop meaningful manage-
ment objectives based on these relationships, and (3) demonstrate BRT as a useful water management tool.
Synchronously, we analyze 45 irrigation drivers spanning five driver groups (management/policy, hydrol-
ogy, weather, land/agriculture, and economy) to quantify the predominant controls of irrigation pumping
across both space and time. We further isolate dominant driver trends to target the most influential social
and physical variables impacting western Kansas. The results of this analysis are then used to prioritize
management efforts across the region to balance agricultural production and groundwater level declines.
2. Methods
2.1. Site Description
Agriculture is the dominant land use across western Kansas. Approximately 94% of land cover was dedi-
cated to agricultural production from 2006 to 2016, and 77% of all cropland was composed of just six com-
modities: winter wheat (38%), corn (19%), sorghum (12%), soybeans (3%), alfalfa hay (3%), cotton (<1%), or
a double crop planting of the six (1%; USDA-CDL,2006–2016). During the same period, HPA groundwater
levels across the region declined by an average of 2.8m (Figure1; derived from Haacker etal.,2016). Despite
this decline, water withdrawals from the underlying HPA remain essential to the agricultural production
and cultural identity of western Kansas, especially due to the region’s low humidity, persistent winds, and
limited precipitation relative to crop water demands (Sanderson & Frey,2014).
LAMB ET AL.
10.1029/2020WR028867
2 of 16
Figure 1. High Plains Aquifer site map (a) and groundwater level
change for western Kansas from 2006 to 2016 (b; derived from Haacker
etal.,2016).
Water Resources Research
Groundwater used for irrigation in western Kansas largely sustains the expansive agricultural production
of the region and can add more than $500 million in annual revenue for the state compared to dryland-on-
ly production (excluding operation costs; Smidt etal.,2019). In 2005, the groundwater was valued at $1.2
billion (as natural capital; Fenichel etal.,2015). However, continued groundwater pumping in Kansas is
estimated to deplete nearly 40% of the underlying HPA in the next half century (Steward etal.,2013). This
depletion will inevitably force some farmers to convert to less productive dryland operations which can pro-
duce 2–4 times lower crop yields per area (Cotterman etal.,2018; Smidt etal.,2016; Steward etal.,2013).
Several strategies have been introduced to slow water loss and stabilize groundwater levels throughout
the region, including efficient technologies (Pfeiffer & Lin,2014), drought-resistant cultivars (Cotterman
etal.,2018), and localized management boundaries (Deines etal.,2019). For example, five GMDs have been
formed since the 1970s to develop and enforce local irrigation policies (Peck,2006). Yet, groundwater lev-
els have continued to decline across the region prompting the development of further management zones
within and across district boundaries (K.S.A. 82a-1036,1978; K.S.A. 82a-1041,2012). While some areas have
found success in these efforts (e.g., Deines etal.,2019), much of the region has yet to stabilize groundwater
levels (e.g., Haacker etal.,2019; Whittemore etal.,2018). A clear gap remains between the intended man-
agement of groundwater, the responding use of irrigation, and the understanding of how these interact
throughout the region.
Compounding this agricultural water management challenge is the socioenvironmental heterogeneities
that further influence water use practices across western Kansas (Figure2; Whittemore etal., 2018). For
example, GMDs 1, 3, and 4 receive less annual precipitation (43–58cm) than GMDs 2 and 5 (58–63 cm;
Whittemore etal.,2018). In regards to management, GMDs 1, 3, and 4 operate under “planned depletion”
strategies while GMDs 2 and 5 have more favorable groundwater recharge and enact a “safe yield” scheme
(Peck,2006; Whittemore etal.,2018). Other differences at the GMD level include a variability in average
monthly temperature, soil permeability, depth to water table, and interstate compacts (IC). Collectively,
through these heterogeneities, western Kansas is a unique case study of mixed physical and social variabil-
ity with high data resolution valuable for informing progressing agricultural water use and management
techniques in this region and elsewhere.
As part of the state’s water management agenda, the Kansas Department of Agriculture and Kansas Ge-
ological Survey have continually managed an open-access Water Information Management and Analysis
System (WIMAS) since 1996 with annual water use records for over 45,000 wells across the state. Of these
wells, 19,000 are located in western Kansas with access to the HPA. Associated data include information
such as well type, installation depth, depth to water, pumping amount, pumping allotment based on gov-
ernment regulation, irrigation system type, and crop type. No other region of the HPA has this type of data
resolution, likely even in private databases. Collectively, this combination of driver complexity and data
LAMB ET AL.
10.1029/2020WR028867
3 of 16
Figure 2. Example data layers for the western Kansas High Plains Aquifer region including dominant groundwater
management boundaries (a; Kansas Department of Agriculture), precipitation (b; PRISM, Oregon State University),
temperature (c; PRISM, Oregon State University), permeability (d; United States Geological Survey), saturated thickness
(e; Haacker etal.,2019), and depth to groundwater (f; derived from Haacker etal.,2016; United States Geological
Survey).
Water Resources Research
resolution make this location an ideal selection for using BRT analysis to better understand influences on
irrigation.
2.2. Data
Data used in this study can be summarized into independent variable (i.e., drivers to irrigation pump-
ing) and dependent variable (i.e., irrigation pumping) categories. Collectively, we analyzed 45 independent
variables and 1 dependent variable. The dependent variable is total annual irrigation pumping amount
reported on the well level. The independent variables were identified as influential to irrigation use based
on published literature and informal conversations with experts, colleagues, and stakeholders and are or-
ganized into five categories: management/policy, hydrology, weather, land/agriculture, and economics. All
data used in the analysis are summarized in Table1.
Well-specific data, such as annual pumping data and crop irrigated, for approximately 19,000 agricultural
irrigation wells in western Kansas for years 2006–2016 were downloaded through the WIMAS maintained
by the Kansas Department of Agriculture, Division of Water Resources and Kansas Geological Survey
(KDA-DWR & KGS, version 5). These data, which included GPS coordinates for each pumping well, were
then read into ArcGIS (version 10.5). Spatial driver data were downloaded as or converted to raster files
in ArcGIS. If downloaded, cell sizes were kept consistent with their original format. If converted from a
shapefile, we assigned a standard 11.1-m × 11.1-m cell size. Well density was produced in with a 1.11km by
1.11km cell size. Annual, statewide market crop value data for alfalfa, corn, sorghum, soybeans, and wheat
grown in Kansas were accessed from the United States Department of Agriculture and integrated into the
well data attribute table (USDA-NASS,2006–2016). Saturated thickness and water table elevation were cre-
ated using the methods described in Haacker etal.(2016). Depth to water table was created in ArcGIS using
the aforementioned water table elevation data and a digital elevation model from the United States Geolog-
ical Survey. The Right variable was generated by concatenating WIMAS data set variables relating to right
type as well as status of the water right file, water right, and point of diversion. The shapefiles for intensive
groundwater use control areas (IGUCAs) are public data and were requested from the Kansas Department
of Agriculture via an Open Records Request. All data were unique annually except for boundary data (e.g.,
GMD, Basin), hydraulic conductivity, and soil permeability as they were effectively static over the temporal
range of the study. No predetermined weights were applied to predictor variables in the analysis due to the
exploratory nature of the research. All data were spatially aligned using the NAD 1927 Geographic Coor-
dinate System and USA Contiguous Albers Equal Area Conic projected coordinate system. All driver data
values were then spatially attributed to each well annually from 2006 to 2016 and exported in spreadsheet
format (i.e., columns are variables and rows are individual wells) for use in the boosted regression tree
modeling.
2.3. Description of Boosted Regression Trees
Boosted regression trees draw on both statistical and machine-learning techniques to determine the rela-
tive influence of each predictor variable (i.e., independent variable) on a response variable (i.e., dependent
variable; Elith etal.,2008). Specifically, BRT relies on two algorithms: (1) regression trees and (2) boosting.
Regression trees are decision trees that have repeating binary splits to identify how a subset of predictor var-
iables relate to the response variable based on a defined predictor value as a split point (e.g., May precipita-
tion>35mm; Elith etal.,2008). Decision tree complexity (i.e., total number of trees and number of nodes in
a tree) is defined by the user. Each regression tree culminates with the calculation of a residual value. Boost-
ing then uses the results of a regression tree (i.e., residual) to improve the fit of the next tree (i.e., improve
the residual). This sequence progresses stage-wise through the model rather than stepwise, thus existing
trees are not changed but instead the model estimates are updated as new trees are added (Elith etal.,2008).
This process continues until the defined number of trees is reached or the residual has reached its optimum
value, at which point improvements in model estimates are negligible. A learning rate further defines the
contribution, or weight, of each tree to the model. Based on the results of the boosted regression trees, the
model can then quantify the relative influence of each predictor variable on the response variable. Refer to
Hastie etal.(2009) for more information and extended derivation of regression trees, Ridgeway(2007) for
boosting, and Elith etal.(2008) and Friedman and Meulman(2003) for an integration of the two.
LAMB ET AL.
10.1029/2020WR028867
4 of 16
Water Resources Research
In addition to its ability to analyze both numerical and categorical data, a BRT model was selected for this
study as it is not sensitive to outliers and missing data in predictor variables (Elith etal.,2008). BRT was
chosen over a random forest model, which tends to perform poorly when there are many low-influencing
variables (Hastie etal.,2009). Furthermore, statistical approaches such as generalized additive models were
not utilized as, unlike BRT, interactions between predictor variables are not automatically modeled (Elith
etal.,2008; Hastie etal.,2009).
LAMB ET AL.
10.1029/2020WR028867
5 of 16
Variable Description Units/scale Mean; standard deviation Source
Dependent variable
Amt_Irr Total annual irrigation pumping m3/year 204,479; 152,316 1
Independent variables
Management/policy
GMD Groundwater Management District Boundary Categorical 1
IGUCA Intensive Groundwater Use Control Area Boundary Categorical 2
RC Rattlesnake Creek Management Plan Boundary Categorical 3
IC Interstate compacts Boundary Categorical 3
Right Water rights Concatenated Categorical 1
Auth_Amt Authorized amount of irrigated water m3/year 326,391; 273,905 1
Auth_Rate Authorized pumping rate m3/min 3; 12 1
Auth_Area Authorized irrigated area ha/year 169; 262 1
Hydrology
Basin River basin Boundary Categorical 1
ST Saturated thickness Meters, 250-m × 250-m 36; 27 6
WT_Elev Interpolated water table elevation Meters, 250-m × 250-m 771; 171 6
K Aquifer hydraulic conductivity Contours, 0–90m/day Categorical 5
Weather
T01, T02, … Mean monthly temperature Celsius, 4-km × 4-km 13; 2 4
T_Annual Mean annual temperature Celsius, 4-km × 4-km 13; 1 4a
P01, P02, … Total monthly precipitation mm, 4-km × 4-km 48; 36 4
P_Annual Mean annual precipitation mm, 4-km × 4-km 48; 15 4a
Land/agriculture
SP Mean soil permeability Very slow to very rapid Categorical 3
System Irrigation technology Type Categorical 1
Area_Irr Reported annual irrigated area ha/year 58; 57 1
WD Well density Wells/km2, 0.01°×0.01° 0.4; 0.2 1
Crop Reported commodity Type Categorical 1
Economics
CV_Area Annual KS crop price per unit area $/ha Crop specific 5
Depth Depth to water table Meters, 250-m × 250-m 41; 28 3, 6a
Note. Mean and standard deviation of collected data from 2006 to 2016. Source key: (1) Water Information Management and Analysis, maintained by the Kansas
Department of Agriculture, Division of Water Resources and Kansas Geological Survey; (2) Kansas Department of Agriculture, Division of Water Resources –
Open Records Request; (3) United States Geological Survey; (4) PRISM Climate Group, Oregon State University; (5) United States Department of Agriculture;
(6) Haacker etal.(2016).
aIndicates that data were derived from these sources rather than utilized directly.
Table 1
Data, Descriptions, and Sources
Water Resources Research
2.4. Construction of BRT Model
We used foundational R code (R Development Core Team,2018) from Elith etal.(2008) to implement the
BRT. While this code utilized the gbm R package, we used its expanded version for this study, the dismo R
package (v. 3.5.2 for Mac and v. 3.5.1 and v.3.6.2 for Windows; Hijmans etal.,2017). This code was then
modified to fit the conditions of our data set.
In BRT analyses, five conditions must be predetermined: (1) the total number of trees to be used in the
analysis, (2) distribution of the loss function, (3) bag fraction (i.e., the amount of data randomly selected at
each step and not replaced), (4) the tree complexity (i.e., number of nodes in each tree), and (5) the learning
rate of the model (i.e., the influence of each tree in the model; a value of 0–1) (Elith etal.,2008). Here, we
did not set a limit to the number of trees and allowed the model run until it optimized the residual loss on
the response variable. We used a Laplace distribution because it is optimal for data sets with a continuous
response variable (Ridgeway,2007) and provides a more robust fit to the data compared to other distribu-
tion options (Lampa etal.,2014; Murphy,2012), which we believe is more suitable for agricultural data that
often have large variability. The bag fraction was set at 0.5. The learning rate was set to 0.05, and the tree
complexity to 24, as determined through a calibration and validation process.
We ran a BRT model using the 2016 data for learning rate (lr) values of 0.1, 0.05, and 0.01 and tree com-
plexity (tc) values of 2–18 in increments of 2. We utilized two values from models run on each combination
of parameters (1) a tenfold cross validation (CV) statistic and (2) r-squared statistic. For each combination
of parameters. We maintain that CV is an appropriate metric for evaluating the best model, as BRT differs
from other statistical methods in that there are no p values and degrees of freedom are difficult to identify
(Elith etal.,2008). Also, CV can be a more robust sensitivity analysis for machine-learning models rather
than Akaike information criterion (Hauenstein etal., 2017). For the r-squared analysis, we conducted a
BRT model on 50% of the data and used the function predict.gbm to predict the remaining 50% of data. This
prediction was plotted against the observed values to determine the r-squared statistic.
BRT is a stochastic technique, thus the CV and r-squared statistics change marginally between iterations
even if conducted on the same parameter combinations. Because of this, we utilized the mean CV statistics
for each learning rate, and the mean r-squared statistics for each tree complexity. Identifying the highest
mean CV and r-squared statistics, we determined that a learning rate of 0.05 and a tree complexity of 24
would produce the best performing model. Please reference the Supplemental Information for more detail
on the calibration and validation steps.
2.5. Application of BRT Models
Once parameters were identified, we conducted four groups of BRT models: (1) annually, (2) annually with
pumping normalized by irrigated area, (3) all years grouped together, and (4) all years grouped together
with pumping normalized by irrigated area. Annual models (Groups 1 and 2) utilized unique data for each
year, whereas aggregated models (Groups 3 and 4) examined all years of data simultaneously and time was
not distinguished as a variable. We normalized pumping by irrigated area in two of the groups to eliminate
an anticipated strong correlation between total pumping and total irrigated area. We also initially ran each
year as an individual BRT model to systematically flag the noninfluential variables using the gbm.simplify
function of the R code. This allowed us to manually reduce noise by eliminating noninformative variables
prior to our analysis. Note that the collective influence of variables will always add up to 100, and even
noninformative variables will be assigned a nonzero value in a BRT analysis, albeit quite small. We used the
simplified sets for Groups 1 and 2. We could not use the simplified sets for Groups 3 and 4 as noninformative
variables were flagged in some years but not others (Table2), leaving incomplete driver data sets for the
groups with years combined together. We also used only a randomly selected 25% of data for each variable
in Groups 3 and 4, as this still included over 2.7 million data points and challenged computing capabilities
(Dell Precision 5820 desktop computer, 2018, Windows 10, Intel(R) Xeon(R) W-2123 CPU @ 3.60 GHz,
32GB RAM). A summary of completed models is outlined in Table2.
LAMB ET AL.
10.1029/2020WR028867
6 of 16
Water Resources Research
3. Results
The relative influences of each irrigation driver on groundwater pumping amount for the Group 1 models
(annual, simplified) and Group 3 model (years combined, unsimplified) are displayed in Figure3; each
variable is further color coded by driver category.
Drivers are arranged by decreasing mean RI for Group 1 over the temporal range of the study, where the
boxes on the Group 1 boxplots are the interquartile range (IQR) and the whiskers extend to the minimum
and maximum values. Hollow points indicate RI outliers from Group 1 results. In Group 1, irrigated area
was the most dominant driver with a mean RI of 18.0%, followed by authorized amount, saturated thick-
ness, and authorized pumping rate (mean RI of 5.3%, 5.3%, and 4.7%, respectively). Localized management
drivers had the least influence on irrigation pumping, with GMDs accounting for 0.1% of influence, IGU-
CAs for 0.1%, and IC for 0.2%. The RI of weather-related drivers ranged from an 0.6% to 5.3%, with annual
precipitation being the greatest influencing weather variable by mean. Weather variables also had the great-
est occurrence of RI outliers across time, likely corresponding to extreme weather events. Regardless of the
month, precipitation had a higher median RI on irrigation pumping than did temperature. For manage-
ment/policy variables, well-scale policies were strong drivers to irrigation pumping whereas regional-scale,
LAMB ET AL.
10.1029/2020WR028867
7 of 16
Year Variables removed Number of trees CV (mean, standard deviation) Deviance (mean, standard deviation)
Group 1: 11 models, annual, simplified (gbm.simplify)
2006 4 (GMD, IC, IGUCA, RC) 2,150 0.834, 0.003 43.257, 0.379
2007 4 (GMD, IC, IGUCA, RC) 1,600 0.836, 0.004 40.124, 0.438
2008 2 (IC, RC) 1,650 0.862, 0.003 41.055, 0.333
2009 4 (GMD, IC, IGUCA, RC) 1,900 0.851, 0.002 38.424, 0.349
2010 4 (GMD, IC, IGUCA, RC) 2,000 0.857, 0.002 38.644, 0.273
2011 2 (GMD, RC) 2,000 0.874, 0.004 44.883, 0.387
2012 2 (IGUCA, RC) 1,800 0.859, 0.004 42.913, 0.420
2013 5 (GMD, IC, IGUCA, RC, Right) 1,450 0.860, 0.003 41.058, 0.404
2014 5 (GMD, IC, IGUCA, RC, Right) 1,800 0.860, 0.002 40.486, 0.371
2015 6 (GMD, IC, IGUCA, K, RC, Right) 1,600 0.833, 0.003 37.077, 0.188
2016 4 (GMD, IC, IGUCA, RC) 1,900 0.845, 0.003 36.923, 0.302
Group 2: 11 models, annual, simplified (gbm.simplify), normalized by area irrigated
2006 4 (GMD, IC, IGUCA, RC) 2,150 0.550, 0.030 0.315, 0.002
2007 4 (GMD, IC, IGUCA, RC) 2,150 0.631, 0.007 0.292, 0.002
2008 2 (IC, RC) 1,900 0.641, 0.014 0.296, 0.002
2009 4 (GMD, IC, IGUCA, RC) 2,350 0.617, 0.016 0.278, 0.003
2010 4 (GMD, IC, IGUCA, RC) 2,150 0.634, 0.005 0.282, 0.001
2011 2 (GMD, RC) 2,700 0.560, 0.007 0.336, 0.003
2012 2 (IGUCA, RC) 2,550 0.566, 0.007 0.327, 0.003
2013 5 (GMD, IC, IGUCA, RC, Right) 1,950 0.653, 0.009 0.301, 0.003
2014 5 (GMD, IC, IGUCA, RC, Right) 2,500 0.619, 0.006 0.294, 0.002
2015 6 (GMD, IC, IGUCA, K, RC, Right) 2,200 0.553, 0.040 0.278, 0.002
2016 4 (GMD, IC, IGUCA, RC) 2,450 0.632, 0.006 0.270, 0.002
Group 3: 1 model, combined years (25% from each year)
Combined years No removed variables 2,650 0.857, 0.002 41.199, 0.230
Group 4: 1 model, combined years (25% from each year), normalized by area irrigated
Combined years No removed variables 2,750 0.597, 0.027 0.310, 0.002
Table 2
BRT Models (Size, Cross Validation, and Deviance)
Water Resources Research
boundary level policies were not strong drivers. The Rattlesnake Creek Management Plan was also a district
boundary examined in this study but was classified as noninfluential by gbm.simplify across all years and
was not considered in the Group 1 model and thus excluded from Figure3. Generally, variables with the
greatest variance in RI across years were among top influencing variables, and those with the smallest
variance in RI across years were among low-influencing variables. With the exception of irrigated area,
all driver variables reported RI values of less than 10% in any given year, with most being less than 5% in
all years. All five variable categories are represented in the top nine influencing drivers, with the top three
drivers accounting for 28.7% of RI by mean across years.
As for Group 3 results (all years combined), the top three drivers included irrigated area with a RI of 21.7%,
saturated thickness with 5.2%, and annual precipitation with 4.8%. Outside the large RI increase for annual
precipitation, trends remained consistent between Groups 1 and 3. In addition to annual precipitation, irri-
gated area also had a notable increase in RI; combined, these increases resulted in the slight RI decrease for
most other variables as all contributing variables sum to 100%. The overall complexity of variables influenc-
ing irrigation pumping is further highlighted through all five categories contributing to the top nine driving
factors and nearly all drivers contributing less than 5% to the total influence on pumping.
The relative influences of each irrigation driver on groundwater pumping amount for the Group 2 models
(annual, simplified, and normalized by irrigated area) and Group 4 model (years combined, unsimplified,
and normalized by irrigated area) are displayed in Figure4; each variable is further color coded by driver
category.
LAMB ET AL.
10.1029/2020WR028867
8 of 16
Figure 3. Boxplots showing RI outputs from BRT runs conducted annually from 2006 to 2016 (Group 1). Black points
show RI from a single BRT run conducted on collective data from 2006 to 2016 (Group 3). Note scale change following
x-axis break. RI, relative influence; BRT, boosted regression trees.
Water Resources Research
Drivers are arranged by decreasing mean RI for Group 2 over the temporal range of the study, where whisk-
ers on the Group 2 boxplots are likewise set at minimum and maximum values and the box is the IQR, and
the hollow dots are outlier RI values. While the collective RI values still summed to 100%, normalizing
pumping by irrigated area allowed for more detailed characterization of other less influential variables.
Here, trends remained similar to those observed in Group 1, with the exception of relative shuffling of the
top five variables. In Group 2, saturated thickness was the top influencing driver (mean RI of 6.3%) and
authorized area was second (mean RI of 5.9%). Depth to water (cost to pump; mean RI of 5.7%) moved
ahead of authorized rate (mean RI of 5.4%) when compared to Group 1. Crop type experienced increased
variability across years, as did crop value per hectare.
Variables with the largest mean RI no longer observed the greatest variance in RI across years, although
generally larger RI values resulted in greater variance. All driver variables still reported RI values of less
than 10% in any given year, with most being less than 5% in all years. All five variable categories are rep-
resented in the top 10 influencing drivers, with the top three drivers accounting for 17.7% of RI by mean
across years.
In Group 4 (all years combined, normalized by irrigated area), the top three influencing drivers were satu-
rated thickness (6.2%), annual precipitation (6.0%), and crop type (4.9%). Similar to Group 3, annual precip-
itation had the greatest increase in RI, resulting in a decrease in RI for other top drivers. Outside the top 10
drivers, Group 4 results matched closely with the results of Group 2. All five categories were present in the
top 10 variables, and all variables had RI values of less than 8% with most contributing to less than 5% of the
total influence on groundwater pumping.
LAMB ET AL.
10.1029/2020WR028867
9 of 16
Figure 4. Boxplots showing RI outputs from BRT runs conducted annually from 2006 to 2016 on data normalized by
irrigated area (Group 2). Black points show RI from a single BRT run conducted on collective data from 2006 to 2016 on
data also normalized by irrigated area (Group 4). RI, relative influence; BRT, boosted regression trees.
Water Resources Research
While RI is a meaningful metric, it fails to characterize whether a given variable contributes to inhibits
pumping. This type of influence is displayed using partial dependence (PD) plots. Figure5 displays PD plots
for 12 example variables from Group 1 in 2012, where positive y-axis values indicate that corresponding
x-axis values are more likely to predict irrigation pumping (stronger prediction). The reverse is also true,
negative y-axis values indicate that corresponding x-axis values are less likely to predict irrigation pumping
(weaker prediction). The magnitude of these values describes the strength of the correlation. These func-
tions reveal the individual impact of the driver on irrigation pumping after the mean impact of the other
drivers in the model is considered (Elith etal., 2008). Because of this, individual variables with strong
interactions may not be well represented in these plots, but it remains a useful tool for understanding the
general relationship between a predictor variable and the response variable (Friedman,2001; Friedman &
Meulman,2003). Reference the Supplemental Information for examples regarding variable interactions.
Drivers are arranged in order of decreasing RI and can be summarized into three general patterns: (1)
increasing PD slopes, (2) decreasing PD slopes, and (3) nonlinear PD slopes. Variables with increasing PD
LAMB ET AL.
10.1029/2020WR028867
10 of 16
Figure 5. Partial dependence plots showing 12 influential variables in 2012 (Group 1), ordered by decreasing relative
influence. Tick marks along the inside of the x-axes represent the distribution of data in deciles.
Water Resources Research
slopes include irrigated area, authorized amount, saturated thickness, depth to water, authorized rate, and
water table elevation. For example, as the number of irrigated areas increased, so did the occurrence of
pumping, creating a positive PD slope. Variables with decreasing PD slopes included July, August, and Oc-
tober precipitation. Here, low precipitation values correlated with pumping activity and high precipitation
values did not correlate with pumping activity, leading to a negative PD slope. Variables with nonlinear
PD slopes included authorized area, well density, and soil permeability. These variables did not have a pre-
dictable PD pattern, showing both correlation with pumping and no pumping at variable points within the
range of their data values. This relationship is also common in categorical data which do not have linear
characteristics or behaviors to generate meaningful slope depictions.
Because outliers can seemingly skew the results of PD plots, decile marks are also included along the x-axis
of each variable to communicate the magnitude of data represented in the plot. To further demonstrate how
these data are interpreted, we plotted empirical cumulative distribution plots of driver data beneath three
example PD plots for three variables in 2012 (Group 1), where shaded values have positive y-axis values,
thus are correlated with pumping (Figure6). These plots also identify the threshold at which pumping is
likely to occur (or no longer occur). For example, pumping was likely to occur when saturated thickness was
more than 41-m, July precipitation less than 77-mm, or water table elevation more than 737-m.
4. Discussion
4.1. Correlative Relationships
In this study, we seek to confirm correlations between predictor variables and irrigation pumping rather
than infer causality. In discussion of results, we offer potential explanations for these correlations as we aim
characterize what drives irrigation use in this region. Beyond identifying strong correlations, the BRT model
also determines noisy variables. These variables have the lowest relative influence, indicating that they not
only are uncorrelated with pumping but also do not drive pumping. Therefore, future water management
efforts can be guided away from these low-influencing variables and redirected toward stronger drivers. This
is a valuable deduction as a wide range of variables impact irrigation decision making and any reduction of
noise helps target effective management.
Furthermore, some predictor variables in this study are endogenous and have potential to cause pumping,
result from pumping, or likely both. For example, water table depth is highly correlated to pumping in this
study which could be a cause of pumping and/or result from pumping. As this work focuses on irrigation
drivers, we propose possible reasons why water table depth could drive pumping without claiming certain
causality. Parsing causality from consequence becomes even more difficult with a variable like depth as it is
temporally nonautonomous. In this way, pumping in a given year can be impacted by the change in water
table depth from the previous year, which can then impact water table depth in the upcoming year. Depth
is unlike autonomous variables such as precipitation, which are not linked to their impact in prior growing
LAMB ET AL.
10.1029/2020WR028867
11 of 16
Figure 6. Partial dependence plots (top) with corresponding empirical cumulative distribution functions (bottom) to
show influence of driver on irrigation pumping along driver data ranges.
Water Resources Research
seasons (i.e., the amount of rain in a given year is not largely impacted by the amount of rain in the previous
year). Rather, depth can both be a driver and consequence of irrigation pumping. Even if this BRT model
was conducted using predevelopment water table depths rather than current measurements, this unique
relationship could not be fully captured.
4.2. Strong Versus Weak Drivers
Strong drivers were primarily those specific to the conditions of an individual well (e.g., total irrigated
area, authorized rate, authorized amount, authorized area, and well density). Weak drivers were primari-
ly those more representative of regional characteristics (e.g., groundwater doctrine, GMD, other localized
management boundaries, river basin, or hydraulic conductivity of the aquifer). This difference between
well-scale and regional influence can likely be attributed to individual farmers making decisions to irrigate
on a case-by-case basis within the regulations of their larger-scale governing frameworks. For example, a
water user more inclined to irrigate compared to a neighbor under the same regional governance will am-
plify well-specific drivers and buffer the influence of regional governance drivers, as regional drivers are
the same between two disparate users. This relationship likely explains why well-specific governance (e.g.,
authorized area) is a strong driver, whereas regional governance (e.g., state groundwater doctrine; Right),
which defines the well-specific governance, is a weak driver.
More directly, this relationship suggests that users are not limited or forced to change behavior by their re-
gional governance and may be self-electing to change behavior despite their legal rights to business-as-usu-
al. For example, GMDs 1, 3, and 4 have less groundwater supply but operate under “planned depletion”
doctrine, and GMDs 2 and 5 have greater groundwater supply under the more conservative “safe yield”
doctrine (Peck,2006; Whittemore et al.,2018). But this does not mean all farmers in planned depletion
zones are seeking to deplete the aquifer. This is especially true as GMD (and thus management scheme) was
a weak driver. Several grassroots movements have been observed throughout the study region to prioritize
water conservation given the limitation that regional governance does not adequately align conservation
goals, and the impacts of these movements on reducing water usage may also be captured here; water man-
agement decisions have been found to correlate with close network members such as families, friends, and
neighbors (Nian etal.,2020). Another contributing explanation as to why well-specific drivers are stronger
than regional drivers may be that Local Enhanced Management Areas (LEMAs) introduced in some por-
tions of the study area have enacted policies shown to reduce irrigation pumping within the frameworks of
larger-scale governing regulations (Deines etal.,2019; Whittemore etal.,2018). However, LEMAs were not
evaluated in this BRT model as the driver did not cover the full temporal range of the study.
Additionally, precipitation drivers were always stronger drivers to pumping than temperature drivers, and
growing season precipitation was more influential than off-season precipitation. This is not only logically
supported as adequate soil moisture is a critical metric to plant production (Basso & Ritchie,2014), but this
is also supported by the greater variability in seasonal values for precipitation compared to temperature.
Irrigation is often used as a tool for overcoming or reducing the negative impacts of seasonal variability
(Whittemore etal.,2016), where water applications can be used to buffer the impacts of drought conditions
or extreme temperatures (Basso & Ritchie,2014; Whittemore etal.,2016). As precipitation was more varia-
ble than temperature during this study, it is reasonable that it would be the stronger driver.
Irrigation technology, such as flood or center pivot, was not an influential variable despite the frequent
discussion that higher-efficient systems lead to water savings (e.g., Schaible & Aillery,2012). This may be
due to nearly 76% of all wells being high-efficiency LEPA systems (KDA-DWR & KGS, 2021), in which
case reduced variability within the driver has less influence on the pumping results. Also possible, this
low influence is because efficient irrigation has been documented not to result in water savings (Pfeiffer &
Lin,2014), as farmers can irrigate more area for a reduced price compared to inefficient technologies. This
behavioral response further supports total irrigated area being the predominant driver to pumping and the
growing understanding that efficient technology does not reduce water use as long as there are incentives
for farmers to irrigate more area (Pfeiffer & Lin,2014; Smidt etal.,2016). Interestingly, few drivers in this
study can be controlled directly by farmers, and those that can were among high-influencing variables (i.e.,
number of acres irrigated, crop type). Irrigation system type is the only farmer-controlled driver among
low-influencing variables.
LAMB ET AL.
10.1029/2020WR028867
12 of 16
Water Resources Research
4.3. Partial Dependence Behaviors
Most of the top influencing drivers followed a positive slope (i.e., as driver values increase, so does the
connection to pumping), with the exception of well density, crop type, and precipitation; well density and
precipitation followed a negative slope, and crop type was categorical. While the partial dependence of these
drivers was mostly as expected (e.g., dry conditions led to pumping), we did observe unexpected results in
moderately influential drivers. For example, the results of depth to water were opposite what seemed intu-
itive. Instead of greater depths leading to a negative pumping slope (i.e., increased pumping costs leading
to decreased pumping), we found greater depths to water led to more pumping. This is perhaps because
pumping is so heavily established in these well locations that increased costs are a necessary, or unavoid-
able, operational factor or because decreased well yields may demand greater time spent for a center pivot
to make its way around a field (Rad etal.,2020), potentially affecting the ratio between evaporation and in-
filtration. It may also be feasible that smaller farm behaviors, acting more in line with expected operational
costs, are buffered in this model by larger farms with larger operational budgets. In addition, as water table
depth changes, aquifer transmissivity can change nonlinearly, causing a ranging impact on wells in a region
(Korus & Hensen,2020). Another possible explanation is that variable is dominated by planned depletion
management schemes (Peck,2006), but this seems unlikely as most of the region is not under planned de-
pletion strategies. Likewise, increased well density did not result in a positive pumping slope. Instead, fewer
wells per area resulted in greater pumping. This may be in part because more wells per area can share the
water demand of a larger area compared to an individual well, so total pumping per well can be reduced.
However, this may be because fewer wells per area may be correlated to more individual landowners and
subsequently different operational decisions. In this case, the connection between pumping and fewer wells
per area may be attributed to smallholder farmers maximizing short-term profits or mitigating seasonal var-
iability risk through increased pumping (Whittemore etal.,2016); these ownership data were not available
specifically for each well and could not be evaluated in this study.
4.4. Irrigation and Climate
For Groups 1 and 2 (annual), the weather category was the last category listed among top variables, while
it ranked as the first category for Groups 3 and 4 (all years combined). Considering that irrigation can be
used as means of climate control (Whittemore etal., 2016), it was intriguing that weather variables were
not more influential in Groups 1 and 2. This could result for two reasons: (1) the total influence is shared
across many weather drivers, so a single driver is ultimately buffered: many weather drivers sum to have
a large influence, but no single driver is largely influential, and (2) weather-related variables in the study
region are sufficient for dryland agricultural production, as irrigation is used to capture incentives other
than baseline production. However, it seems unlikely the weather conditions are sufficient for dryland pro-
duction in western Kansas given regional water demands of the produced commodities and precipitation
patterns (Cotterman etal.,2018). Even with the increase of drought-resistant cultivars (Hu & Xiong,2014),
the shared influence of weather drivers is a likely explanation for the lack of highly influential weather
drivers in Groups 1 and 2. This makes logical sense as seasonal weather extremes are often short lived (U.S.
Drought Monitor,2020), not typically observed in repeated years with the same intensity (U.S. Drought
Monitor,2020), and are not always limiting to crop production as crops can partially rebound within a
season. We also found that the RIs of weather-related variables were not noticeably higher during drought
years within the study range (2011–2014; U.S. Drought Monitor,2020). This may point to the practice of
taking irrigated area out of production during abnormally dry conditions in order to meet higher irrigation
demands of the remaining fields (Deines etal.,2017; Nie etal.,2018).
Furthermore, weather variables in Groups 1 and 2 may be relatively weaker drivers due to the spread of
collective influence across many variables because seasonal extremes are combined into one variable in
Groups 3 and 4. In these groups, annual precipitation became the third most influential driver on irrigation
pumping. So, while annual precipitation may have less influence at the annual scale, its combined influence
at longer time scales (multiyear) on regional pumping is significant. This is further supported in Groups 1
and 2 where annual weather-related variables were more influential than monthly, just as combined years
were more influential than annual. Collectively, these relationships suggest that climate, acting at longer
LAMB ET AL.
10.1029/2020WR028867
13 of 16
Water Resources Research
time scales than weather, is likely to play a significant role on the pumping patterns across the region. Con-
sequently, climate change may reconfigure irrigation drivers in this region.
4.5. Management
This study highlights that irrigation decision making typically follows two questions: (1) how much is avail-
able to irrigate, both in water volume and land area (e.g., irrigated area, saturated thickness, authorized
area, authorized rate, and authorized amount), and (2) how much does it cost (e.g., depth)? Other drivers
or considerations appear to be marginal compared to the answers to these two questions. Observed trends
further indicate irrigation is a default behavior and is intensified by weather conditions and not necessarily
a result of weather conditions. So, while water use is an annual decision, compounding weather-related var-
iables appear to shape behaviors at longer time scales. Collectively, this means water conservation strategies
(even in planned depletion zones) would be better suited to focus on well-specific policies designed within
the framework of these two questions while stabilizing water use incentives over longer time scales.
As each driver category was represented in the top 10 most influential drivers in each model group, policies
must also be well rounded and account for the variations across categories through time, rather than em-
phasizing a suite of specific drivers. Water management is complex in this region and must be approached
as such to avoid unintended water use outcomes. The summed totals of each category for each model group
are reported in Table 3. The Weather category not only contributed the highest collective influence of all
variable categories but also contributed the highest number of variables in the model (26 out of 45). The
Economics category not only contributed the lowest collective influence of all categories but also had the
lowest number of variables in the model (2 out of 45). The contribution of the Land/Agriculture category
was about 2 times higher in Groups 1 and 3 than 2 and 4 due to the inclusion of total irrigated area in Groups
1 and 3.
5. Conclusion
Although many political, economic, and physical factors impact irrigation decision making in western Kan-
sas and elsewhere, characterization of their relative influence on pumping has largely remained unknown.
To quantify the influence of irrigation drivers, we utilized a BRT machine-learning technique on data across
space and time to characterize the impact of 45 drivers relating to five categories (management/policy, hy-
drology, weather, land/agriculture, and economy) on irrigation pumping from approximately 19,000 wells
across western Kansas from 2006 to 2016. BRT is a useful and informative tool for analyzing water use de-
cision making and can effectively capture both numerical and categorical variable relationships across both
space and time. In addition to total driver influence, BRT can also be used to understand the magnitude of
influence as well the conditions in which a user typically decides to stop irrigating. In the future, this tech-
nique can also be used with other models to improve their irrigation prediction (e.g., agent-based irrigation
models; Mewes & Schumann,2019). From this study, we have identified four main conclusions:
(1) Influences on irrigation use in this region are complex, as all five variable categories were represented
in the top 10 most influential variables under all modeling scenarios. In addition, the influence of many
LAMB ET AL.
10.1029/2020WR028867
14 of 16
Relative influence of drivers on irrigation pumping (%)
Management/policy Hydrology Weather Land/agriculture Economics
Group 1 15.0 9.0 42.2 27.7 6.6
Group 2 17.5 11.2 50.5 12.9 8.5
Group 3 11.7 9.2 42.8 30.6 5.7
Group 4 13.3 12.0 54.7 13.1 6.9
Note. Groups 1 and 2 do not sum to 100 as the reported values represent means across 11 models where Groups 3 and
4 are single values reported across one model.
Table 3
Relative Influence of Drivers by Category Across BRT Model Runs
Water Resources Research
drivers, like precipitation and crop value, varied from year to year. As a result of this complexity and
variability, effective policy should focus on comprehensive, multifaceted measures rather than targeting
individual, undesired behaviors.
(2) Well-specific drivers were considerably more influential to irrigation use than regional-specific drivers.
This relationship suggests irrigation applications are a user-by-user decision not largely influenced by
preexisting regulatory frameworks. Instead, water use decisions in this region are more a function of
maximizing crop production across disparate and self-motivated water conservation strategies.
(3) Decisions to irrigate can largely be summarized in response to two questions: (1) how much is available
to irrigate, both in water volume and land area (e.g., irrigated area, saturated thickness, authorized area,
authorized rate, and authorized amount), and (2) how much does it cost (e.g., depth to water, well yield
as a function of saturated thickness)? Other considerations contribute notably less to overall use.
(4) While influential in the short-term, weather-related factors have a greater influence at longer time
scales due to varying impact at shorter time scales (e.g., seasonal compared to annual time scales, an-
nual compared to multiannual time scales). This increased influence at longer time scales suggests
irrigation use in this region may be susceptible to changes in irrigation patterns and behaviors under
changing climate scenarios.
Conflict of Interest
The authors declare no conflicts of interest relevant to this study.
Data Availability Statement
Model data are available at http://www.hydroshare.org/resource/c0b6ebc880f54c92b1c4a633b0b85353
(Smidt,2020).
References
Basso, B., & Ritchie, J. T. (2014). Temperature and drought effects on maize yield. Nature Climate Change, 4(233). 233. https://doi.
org/10.1038/nclimate2139
Butler, J. J., Whittemore, D. O., Wilson, B. B., & Bohling, G. C. (2016). A new approach for assessing the future of aquifers supporting irri-
gated agriculture. Geophysical Research Letters, 43, 2004–2010. https://doi.org/10.1002/2016GL067879
Cotterman, K. A., Kendall, A. D., Basso, B., & Hyndman, D. W. (2018). Groundwater depletion and climate change: Future prospects of
crop production in the Central High Plains Aquifer. Climatic Change, 146, 187–200. https://doi.org/10.1007/s10584-017-1947-7
Deines, J. M., Kendall, A. D., Butler, J. J., & Hyndman, D. W. (2019). Quantifying irrigation adaptation strategies in response to stake-
holder-driven groundwater management in the US High Plains Aquifer. Environmental Research Letters, 14, 044014. https://doi.
org/10.1088/1748-9326/aafe39
Deines, J. M., Kendall, A. D., & Hyndman, D. W. (2017). Annual irrigation dynamics in the U.S. Northern High Plains derived from Landsat
satellite data. Geophysical Research Letters, 44, 9350–9360. https://doi.org/10.1002/2017GL074071
Edwards, E. C. (2016). What lies beneath? Aquifer heterogeneity and the economics of groundwater management. Journal of the Associa-
tion of Environmental and Resource Economists, 3, 453–491. https://doi.org/10.1086/685389
Elith, J., Leathwick, J. R., & Hastie, T. (2008). A working guide to boosted regression trees. Journal of Animal Ecology, 77(4), 802–813.
https://doi.org/10.1111/j.1365-2656.2008.01390.x
Fenichel, E. P., Abbott, J. K., Bayham, J., Boone, W., Haacker, E. M. K., & Pfeiffer, L. (2015). Measuring the value of groundwater and other
forms of natural capital. Proceedings of the National Academy of Sciences, 113(9), 2382–2387. https://doi.org/10.1073/pnas.1513779113
Friedman, J. H. (2001). Greedy function approximation: A gradient boosting machine. Annals of Statistics, 29(5), 1189–1232.
Friedman, J. H., & Meulman, J. J. (2003). Multiple additive regression trees with application in epidemiology. Statistics in Medicine, 22(9),
1365–1381. https://doi.org/10.1002/sim.1501
Haacker, E. M. K., Cotterman, K. A., Smidt, S. J., Kendall, A. D., & Hyndman, D. W. (2019). Effects of management areas, drought, and
commodity prices on groundwater decline patterns across the High Plains Aquifer. Agricultural Water Management, 218, 259–273.
https://doi.org/10.1016/j.agwat.2019.04.002
Haacker, E. M. K., Kendall, A. D., & Hyndman, D. W. (2016). Water level declines in the High Plains Aquifer: Predevelopment to resource
senescence. Groundwater, 54, 231–242. https://doi.org/10.1111/gwat.12350
Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning: Data mining, inference, and prediction (2nd ed.). New
York, NY: Springer.
Hauenstein, S., Wood, S. N., & Dormann, C. F. (2017). Computing AIC for black-box models using generalized degrees of freedom: A
comparison with cross-validation. Communications in Statistics - Simulation and Computation, 47(5), 1382–1396. https://doi.org/10.10
80/03610918.2017.1315728
Hijmans, R. J., Phillips, S., Leathwick, J., & Elith, J. (2017). Package “dismo”. CRAN. http://cran.nexr.com/web/packages/dismo/dismo.pdf
Hrozencik, R. A., Manning, D. T., Suter, J. F., Goemans, C., & Bailey, R. T. (2017). The heterogeneous impacts of groundwater management
policies in the Republican River Basin of Colorado. Water Resources Research, 53, 10757–10778. https://doi.org/10.1002/2017WR020927
Hu, H., & Xiong, L. (2014). Genetic engineering and breeding of drought-resistant crops. Annual Review of Plant Biology, 65, 715–741.
https://doi.org/10.1146/annurev-arplant-050213-040000
LAMB ET AL.
10.1029/2020WR028867
15 of 16
Acknowledgment
None.
Water Resources Research
Hu, Y., Quinn, C. J., Cai, X., & Garfinkle, N. W. (2017). Combining human and machine intelligence to derive agents’ behavioral rules for
groundwater irrigation. Advances in Water Resources, 109, 29–40. https://doi.org/10.1016/j.advwatres.2017.08.009
KDA-DWR & KGS, version 5. (2021). WIMAS, Water Information Management and Analysis System, 2006–2016. Kansas Geological Sur-
vey and the Kansas Department of Agriculture, Division of Water Resources. Retrieved from http://hercules.kgs.ku.edu/geohydro/
wimas/index.cfm
Kisekka, I., Oker, T., Nguyen, G., Aguilar, J., & Rogers, D. (2017). Revisiting precision mobile drip irrigation under limited water. Irrigation
Science, 35, 483–500. https://doi.org/10.1007/s00271-017-0555-7
Korus, J. T., & Hensen, H. J. (2020). Depletion percentage and nonlinear transmissivity as design criteria for groundwater-level observation
networks. Environmental Earth Sciences, 79, 382. https://doi.org/10.1007/s12665-020-09123-y
K.S.A. 82a-1036. (1978). Initiation of proceedings for designation of intensive groundwater use control areas; duties of chief engineer;
findings. Retrieved from https://www.ksrevisor.org/statutes/chapters/ch82a/082a_010_0036.html
K.S.A. 82a-1041. (2012). Local enhanced management areas; establishment procedures; duties of chief engineer; hearing; notice; orders;
review. Retrieved from https://www.ksrevisor.org/statutes/chapters/ch82a/082a_010_0041.html
Lampa, E., Lind, L., Lind, M., & Bornefalk-Hermansson, A. (2014). The identification of complex interactions in epidemiology and toxicol-
ogy: A simulation study of boosted regression trees. Environmental Health, 13(57). https://doi.org/10.1186/1476-069X-13-57
Majumdar, S., Smith, R., Butler, J., Jr., & Lakshmi, V. (2020). Groundwater withdrawal prediction using integrated multitemporal remote
sensing data sets and machine learning. Water Resources Research, 56, e2020WR028059. https://doi.org/10.1029/2020WR028059
MardanDoost, B., Brookfield, A. E., Feddema, J., Sturm, B., Kastens, J., Peterson, D., & Bishop, C. (2019). Estimating irrigation demand
with geospatial and in-situ data: Application to the High Plains Aquifer, Kansas, USA. Agricultural Water Management, 223, 105675.
https://doi.org/10.1016/j.agwat.2019.06.010
Mewes, B., & Schumann, A. (2019). The potential of combined machine learning and agent-based models in water resources management.
Hydrologie und Wasserbewirtschaftung, 63, 332–338. https://doi.org/10.5675/hywa_2019.6_2
Murphy, K. (2012). Machine learning: A probabilistic perspective. Cambridge, MA; London, UK: MIT Press.
Nian, Y., Huang, Q., Kovacs, K. F., Henry, C., & Krutz, J. (2020). Water management practices: Use patterns, related factors, and correla-
tions with irrigated acres. Water Resources Research, 56, e2019WR025360. https://doi.org/10.1029/2019WR025360
Nie, W., Zaitchik, B. F., Rodell, M., Kumar, S. V., Anderson, M. C., & Hain, C. (2018). Groundwater withdrawals under drought: Recon-
ciling GRACE and land surface models in the United States High Plains Aquifer. Water Resources Research, 54, 5282–5299. https://doi.
org/10.1029/2017WR022178
Peck, J. C. (2006). Groundwater management in Kansas: Brief history and assessment. The Kansas Journal of Law & Public Policy, 15(3),
441–466
Pfeiffer, L., & Lin, C.-Y. C. (2014). Does efficient irrigation technology lead to reduced groundwater extraction? Empirical evidence. Journal
of Environmental Economics and Management, 67, 189–208. https://doi.org/10.1016/j.jeem.2013.12.002
Rad, M. R., Brozović, N., Foster, T., & Mieno, T. (2020). Effects of instantaneous groundwater availability on irrigated agriculture and impli-
cations for aquifer management. Resource and Energy Economics, 59, 101129. https://doi.org/10.1016/j.reseneeco.2019.101129
R Development Core Team. (2018). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical
Computing. http://www.R-project.org
Ridgeway, G. (2007). Generalized boosted models: A guide to the gbm package. CRAN. https://cran.r-project.org/web/packages/gbm/vi-
gnettes/gbm.pdf
Rudnick, D. R., Irmak, S., West, C., Chávez, J. L., Kisekka, I., Marek, T. H., etal. (2019). Deficit irrigation management of maize in
the High Plains Aquifer region: A review. Journal of the American Water Resources Association, 55(1), 38–55. https://doi.
org/10.1111/1752-1688.12723
Sanderson, M. R., Bergtold, J. S., Heier Stamm, J. L., Caldas, M. M., & Ramsey, S. M. (2017). Bringing the “social” into sociohydrolo-
gy: Conservation policy support in the Central Great Plains of Kansas, USA. Water Resources Research, 53, 6725–6743. https://doi.
org/10.1002/2017WR020659
Sanderson, M. R., & Frey, R. S. (2014). From desert to breadbasket… to desert again? A metabolic rift in the High Plains Aquifer. Journal of
Political Ecology, 21, 516. https://doi.org/10.2458/v21i1.21149
Schaible, G., & Aillery, M. (2012). Water conservation in irrigated agriculture: Trends and challenges in the face of emerging demands.
USDA-ERS Economic Information Bulletin, 99. https://doi.org/10.2139/ssrn.2186555
Smidt, S. J. (2020). Irrigation drivers boosted regression; Kansas HPA. HydroShare. http://www.hydroshare.org/resource/
c0b6ebc880f54c92b1c4a633b0b85353
Smidt, S. J., Haacker, E. M. K., Kendall, A. D., Deines, J. M., Pei, L., Cotterman, K. A., etal. (2016). Complex water management in modern
agriculture: Trends in the water–energy–food nexus over the High Plains Aquifer. The Science of the Total Environment, 566–567(567),
988–1001. https://doi.org/10.1016/j.scitotenv.2016.05.127
Smidt, S., Kendall, A., & Hyndman, D. (2019). Increased dependence on irrigated crop production across the CONUS (1945–2015). Water,
11, 1458. https://doi.org/10.3390/w11071458
Steward, D. R., Bruss, P. J., Yang, X., Staggenborg, S. A., Welch, S. M., & Apley, M. D. (2013). Tapping unsustainable groundwater stores for
agricultural production in the High Plains Aquifer of Kansas, projections to 2110. Proceedings of the National Academy of Sciences, 110,
E3477–E3486. https://doi.org/10.1073/pnas.1220351110
Sukcharoen, K., Golden, B., Vestal, M., & Guerrero, B. (2020). Do crop price expectations matter? An analysis of groundwater pumping
decisions in Western Kansas. Agricultural Water Management, 231, 106021. https://doi.org/10.1016/j.agwat.2020.106021
USDA-CDL. (2006–2016). National Agricultural Statistics Service Cropland Data Layer. Retrieved from https://nassgeodata.gmu.edu/
CropScape
USDA-NASS. (2006–2016). Crop values annual summary. Retrieved from https://usda.library.cornell.edu/concern/publications/k35694332
U.S. Drought Monitor. (2020). National Drought Mitigation Center University of Nebraska-Lincoln, United States Department of Agricul-
ture, National Oceanic and Atmospheric Administration. Retrieved from https://droughtmonitor.unl.edu/About.aspx
Whittemore, D. O., Butler, J. J., & Wilson, B. B. (2016). Assessing the major drivers of water-level declines: New insights into the future of
heavily stressed aquifers. Hydrological Sciences Journal, 61, 134–145. https://doi.org/10.1080/02626667.2014.959958
Whittemore, D. O., Butler, J. J., & Wilson, B. B. (2018). Status of the High Plains Aquifer in Kansas. Kansas Geological Survey Technical
Series, 22, 1–28.
LAMB ET AL.
10.1029/2020WR028867
16 of 16
... A considerable number of methods have been used to explore drivers of irrigation water use, including linear regression analysis (Gibson et al., 2018;Monaco and Sali, 2018), principal component analysis (PCA) (Chen et al., 2020), factor decomposition models (Zou et al., 2018) and boosted regression trees (Lamb et al., 2021). Nevertheless, the dominant methods rely on relatively simple mathematical models, which can limit model's capacity in searching large numbers of key predictors of irrigation water usage. ...
Article
Full-text available
Farm-level seasonal irrigation water usage is often highly variable across time and space in an irrigation district. Identifying the driving factors of this variation can help researchers and managers understand the underlying efficiency of water usage, identify the sources of water waste and develop best irrigation practices to facilitate the development of more efficient and sustainable irrigated cropping systems. This study explored driving factors for the seasonal irrigation water usage across an irrigation district in southeastern Australia, using extensive climate, soil, cropping and water use data from 312 farms (a total of 1099 annual crop water use records). It focused on three irrigated crops (corn/maize, cotton and rice) for 2011-2019. Factors considered included climate, soils, irrigation practices and water allocation, from which key driving factors and their effects on the seasonal irrigation water usage were identified. A Bayesian hierarchical model averaging approach was developed to determine key predictors and estimate seasonal irrigation water usage. The Bayesian hierarchical modelling framework allowed seasonal irrigation water usage to be estimated at all crop-farm-year observation points simultaneously while also enabling different key predictors to be selected for different crop types. Results showed that the seasonal irrigation water usage was mainly driven by irrigation practices and soil, but the water use of different crop types was driven by different subsets of irrigation and soil predictors. Modelling was undertaken in natural log space and the model showed reasonable accuracy in estimating the seasonal irrigation water usage (R2 = 0.62). Our model is considered useful in 1) identifying drivers of variation of the seasonal irrigation water usage across time and space, and 2) suggesting potential adjustments to irrigation practices and quantifying their benefits to reduce irrigation water usage.
... However, this approach does not account for the complex interactions between climate, land use, and soil type. Machine learning-based solutions are able to ingest a variety of diverse datasets, including remote sensing and model-derived datasets; they also can account for complex relationships between input datasets and have been shown to provide reliable estimates (Lamb et al., 2021;Majumdar et al., 2020Majumdar et al., , 2022Wei et al., 2022). ...
Article
Full-text available
Study region The Mississippi Alluvial Plain (MAP) in the United States (US). Study focus Understanding local-scale groundwater use, a critical component of the water budget, is necessary for implementing sustainable water management practices. The MAP is one of the most productive agricultural regions in the US and extracts more than 11 km3/year for irrigation activities. Consequently, groundwater-level declines in the MAP region pose a substantial challenge to water sustainability, and hence, we need reliable groundwater pumping monitoring solutions to manage this resource appropriately. New hydrological insights for the region We incorporate remote sensing datasets and machine learning to improve an existing lookup table-based model of groundwater use previously developed by the U.S. Geological Survey (USGS). Here, we employ Distributed Random Forests, an ensemble machine learning algorithm to predict annual and monthly groundwater use (2014–2020) throughout this region at 1-km resolution, using pumping data from existing flowmeters in the Mississippi Delta. Our model compares favorably with the existing USGS model, with higher R2 (0.51 compared to 0.42 in the previous model), and lower root mean square error (RMSE) and mean absolute error (MAE)— 0.14 m and 0.09 m, respectively in our model, compared to 0.15 m and 0.1 m in the previous model. Therefore, this work advances our ability to predict groundwater use in regions with scarce or limited in-situ groundwater withdrawal data availability.
... Machine learning models have been widely applied in the prediction of physical hydrologic systems (e.g., Akrami et al. 2014;Guimarães Santos & Silva 2014;Alizadeh et al. 2017aAlizadeh et al. , 2017b. Methods including random forests, boosted regression trees, and artificial neural networks have been leveraged to identify climatic and governance factors that influence municipal and irrigation demand (Toth et al. 2018;Bolorinos et al. 2020;Fiorillo et al. 2021;Lamb et al. 2021). Short-term urban demand forecasting has also benefited from methods such as long short-term memory networks (Hu et al. 2019;Mu et al. 2020;Fu et al. 2022;Zanfei et al. 2022), neural networks (Huang et al. 2021;Huang et al. 2022;Liu et al. 2023), and hybrid approaches (Guo et al. 2022). ...
Article
Full-text available
Accurate models of water withdrawal are crucial in anticipating the potential water use impacts of drought and climate change. Machine learning methods can simulate the complex, nonlinear relationship between water use and potential explanatory factors, but rarely incorporate the hierarchical nature of water use data. This work presents a novel approach for the prediction of water withdrawals across multiple usage sectors using an ensemble of models fit at different hierarchical levels. Models were fit at the facility and sectoral grouping levels, as well as across facility clusters defined by temporal water use characteristics. Using repeated holdout cross-validation and a dataset of over 300,000 observations of monthly water withdrawal across 1,509 facilities, it demonstrates that ensemble predictions led to statistically significant improvements in predictive performance in five of the eight sectors analyzed. The use of ensemble modeling resulted in lower predictive errors compared to facility models in 65% of facilities analyzed. The relative improvement gained by ensemble modeling was greatest for facilities with fewer observations and higher variance, indicating its potential value in predicting withdrawal for facilities with relatively short data records or data quality issues.
... The first type is to incorporate some simplified soil/plant-based irrigation rules with predefined thresholds in the process-based models (de Rosnay et al., 2003;Haddeland et al., 2006;Lawston et al., 2015;Nie et al., 2022Nie et al., , 2018Ozdogan et al., 2010), such as the most widely used maximum allowable depletion (MAD) irrigation scheme with 50% based on soil moisture (Ozdogan et al., 2010). However, irrigation decisions are complicated by many factors, such as farmers' preferences and irrigation infrastructures, thus these frameworks for estimating irrigation water use accompany large uncertainties from farmers' irrigation behavior and process-based models (Foster et al., 2014;Lamb et al., 2021;D. Wang & Cai, 2009). ...
Article
Full-text available
Estimating irrigation water use accurately is critical for sustainable irrigation and studying terrestrial water cycle in irrigated croplands. However, irrigation is not monitored in most places, and current estimations of irrigation water use has coarse spatial and/or temporal resolutions. This study aims to estimate irrigation water use at the daily and field scale through the proposed model‐data fusion framework, which is achieved by particle filtering with two configurations (concurrent, CON, and sequential, SEQ) by assimilating satellite‐based evapotranspiration (ET) observations into an advanced agroecosystem model, ecosys. Two types of experiments using synthetic and real ET observations were conducted to study the efficacy of the proposed framework for estimating irrigation water use at the irrigated fields in eastern and western Nebraska, United States. The experiments using synthetic ET observations indicated that, for two major sources of uncertainties of ET difference between observations and model simulations, which are bias and noise, noise had larger impacts on degrading the estimation performance of irrigation water use than bias. For the experiments using real ET observations, monthly and annual estimations of irrigation water use matched well with farmer irrigation records, with Pearson correlation coefficient (r) around 0.80 and 0.50, respectively. Although detecting daily irrigation records was very challenging, our method still gave a good performance with RMSE, BIAS, and r around 2.90, 0.03, and 0.4 mm/d, respectively. Our proposed model‐data fusion framework for estimating irrigation water use at high spatio‐temporal resolution could contribute to regional water management, sustainable irrigation, and better tracking terrestrial water cycle.
... Earning the highest ranking of model inputs by the GWWP, underscores the combined impact of two neglected variables in previous studies, namely saturated thickness and the number of active high production wells, on GWL variations in an intensively irrigated region. We note that saturated thickness and depth to groundwater table, the former corresponding to GWWP and the latter representing GWD, are among the most influential irrigation drivers in western Kansas (Lamb et al., 2021). Both GWWP and GWD are indicators of groundwater development intensity (i.e., degree of pumping establishment) across the study area, offering a process-based justification for their top ranking. ...
Article
The steady overexploitation of the Ogallala Aquifer underlying the U.S. High Plains Region has put irrigated crop production at risk, particularly in the Southern and Central High Plains. To manage this issue properly, a data-driven modeling framework is developed and tested that is fast to employ and yet provides reliable long-term groundwater level (GWL) forecasts as a function of climatological and anthropogenic factors. The modeling framework uses the random forests (RF) technique in combination with ordinary kriging, and is tested in Finney County in southwest Kansas. The introduction of groundwater withdrawal potential as a new surrogate for pumping intensity enables the RF model to capture decline in groundwater depletion rate as the system progresses towards aquifer depletion and/or as a result of well retirement policies. The RF model is executed from 2017 to 2099 for 20 different downscaled global climate models (GCMs) for the two representative concentration pathways (RCP) scenarios of 4.5 and 8.5. The results show the aquifer will cease to support irrigated agriculture in most of the county by 2060 under status quo management and average climate conditions. Moreover, climate will likely shift the aquifer’s depletion time frame by 15 years or less in most of the study area. The long-term combined impact of well retirement plans and climate conditions on groundwater depletion trends imply well retirement policies do not lead to sustained groundwater savings. This study demonstrates the capacity of machine learning models to serve as a rapid assessment tool, informing policymakers about future groundwater availability in intensively irrigated regions and under different climate and management conditions.
... Because the factors that influence water use tend to be complex and nonlinear, there is increasing use of machine learning to model and predict water use. Methods including random forests, boosted regression trees, and artificial neural networks have been leveraged to identify climatic and governance factors that influence municipal and irrigation demand (Bolorinos et al., 2020;Fiorillo et al., 2021;Lamb et al., 2021;Toth et al., 2018). Short-term urban demand forecasting has also benefited from methods such as long short-term memory networks (Fu et al., 2022;Hu et al., 2019;Mu et al., 2020), neural networks (Huang et al., 2021), and hybrid approaches (Guo et al., 2022). ...
Preprint
Full-text available
Accurate models of water withdrawal are crucial in anticipating the potential water use impacts of drought and climate change. Machine-learning methods are increasingly used in water withdrawal prediction due to their ability to model the complex, nonlinear relationship between water use and potential explanatory factors. However, most machine learning methods do not explicitly address the hierarchical nature of water use data, where multiple observations through time are typically available for multiple facilities, and these facilities can be grouped in a variety of different ways. This work presents a novel approach for prediction of water withdrawals across multiple usage sectors using an ensemble of models fit at different hierarchical levels. A dataset of over 300,000 records of water withdrawal was used to fit models at the facility and sectoral grouping levels, as well as across facility clusters defined by temporal water use characteristics. Using repeated holdout cross validation, it demonstrates that ensemble predictions based on models learned from different data groupings improve withdrawal predictions for 63% of facilities relative to facility-level models. The relative improvement gained by ensemble modeling was greatest for facilities with fewer observations and higher variance, indicating its potential value in predicting withdrawal for facilities with relatively short data records or data quality issues. Inspection of the ensemble weights indicated that cluster level weights were often higher than sector level weights, pointing towards the value of learning from the behavior of facilities with similar water use patterns, even if they are in a different sector.
Preprint
Full-text available
Irrigated agriculture is the dominant use of water globally, but most water withdrawals are not monitored or reported. As a result, it is largely unknown when, where, and how much water is used for irrigation. Here, we evaluated the ability of remotely sensed evapotranspiration (ET) data, integrated with other datasets, to calculate irrigation water withdrawals and applications in an intensively-irrigated portion of the central United States. We compared irrigation calculations based on OpenET data with reported groundwater withdrawals from a flowmeter database and hundreds of farmer irrigation application records at three spatial scales (management area, water right group, and field). We found that ET-based calculations of irrigation exhibited similar temporal patterns as flowmeter data, but tended to be positively biased with substantially more interannual variability than reported pumping rate. Disagreement between ET-based irrigation calculations and reported irrigation was strongly correlated with annual precipitation. Agreement between calculated and observed ET was better for multi-year averages than for individual years across all spatial scales. The selection of an ET model was also an important consideration, as variability in calculated irrigation across an ensemble of satellite-driven ET models was larger than the potential impacts of conservation measures employed in the region. Linking individual wells to specific fields was challenging, but uncertainties in calculating irrigation depths due to the above-mentioned factors exceeded potential uncertainty from irrigation status and field boundary mapping. From these results, we suggest key practices for working with ET-based irrigation data that include accurately accounting for changes in root zone soil moisture for within-season applications, such as irrigation scheduling, and conducting an application-specific evaluation of sources of uncertainty. Remotely-sensed approaches have a high potential for improving scientific research and water resource management through improved spatial and temporal characterization of irrigation, but uncertainties must be resolved to fully realize this potential.
Article
Full-text available
Agricultural water management is increasingly prioritized throughout the world as producers are tasked with meeting growing crop demand while also managing environmental resources more sustainably. Likewise, agriculture is increasingly modifying the terrestrial water cycle. In response to these dynamics, the informal research discipline of agrohydrology continues to grow, fueled by a new era of rapidly evolving research tools and big data availability. While many researchers are actively invested in agrohydrology as a research topic, there remains a gap in formalizing this valuable discipline. This article aims to: (a) identify key research themes in agrohydrology, (b) conceptualize future research topics within each theme, and (c) estimate a timeframe before topics become pressing (i.e., before a topic becomes a limiting factor in advancing water management in an agricultural context). This commentary is meant to guide the trajectory of an evolving discipline of agrohydrology, the practice of agricultural water management at multiple nested scales, and the conversation of the invested public.
Article
Full-text available
Earth system models suggest that anthropogenic climate change will influence marine phytoplankton over the coming century with light-limited regions becoming more productive and nutrient-limited regions less productive. Anthropogenic climate change can influence not only the mean state but also the internal variability around the mean state, yet little is known about how internal variability in marine phytoplankton will change with time. Here, we quantify the influence of anthropogenic climate change on internal variability in marine phytoplankton biomass from 1920 to 2100 using the Community Earth System Model 1 Large Ensemble (CESM1-LE). We find a significant decrease in the internal variability of global phytoplankton carbon biomass under a high emission (RCP8.5) scenario and heterogeneous regional trends. Decreasing internal variability in biomass is most apparent in the subpolar North Atlantic and North Pacific. In these high-latitude regions, bottom-up controls (e.g., nutrient supply, temperature) influence changes in biomass internal variability. In the biogeochemically critical regions of the Southern Ocean and the equatorial Pacific, bottom-up controls (e.g., light, nutrients) and top-down controls (e.g., grazer biomass) affect changes in phytoplankton carbon internal variability, respectively. Our results suggest that climate mitigation and adaptation efforts that account for marine phytoplankton changes (e.g., fisheries, marine carbon cycling) should also consider changes in phytoplankton internal variability driven by anthropogenic warming, particularly on regional scales.
Article
Full-text available
Key Points Groundwater withdrawals are not actively monitored in most places of the world at a scale necessary to implement sustainable solutions Various multitemporal remote sensing data are integrated into a machine learning framework to effectively predict groundwater withdrawals The results over the High Plains Aquifer, Kansas, USA, show that this approach is applicable to similar regions having sparse in situ data
Article
Full-text available
Intensive use of groundwater for irrigation has depleted available supplies in many areas of the world. Depletion reduces well yields, which negatively impacts agricultural productivity and profitability. The impact of groundwater depletion on wells depends on the relative changes in saturated thickness (depletion percentage) and the rates of change in transmissivity during depletion (nonlinear transmissivity). In heterogeneous aquifers of variable saturated thickness, the impacts on wells vary in time and space. An observation well network designed to account for these spatiotemporal variations is a key part of achieving sustainable development of groundwater. This paper presents a framework for incorporating estimates of depletion percentage and nonlinear transmissivity into GIS-based multicriteria analysis for observation-well network design. The workflow includes (1) selection of criteria using stakeholder input, (2) assessment of data quality, (3) interpolation of hydrogeologic surfaces with prediction standard errors, (4) estimation of hydraulic conductivity values, (5) calculation of saturated thickness and transmissivity using 3 and 4 above, (6) mapping of well densities, and (7) normalization of raster values and calculation of a priority index map. The framework is widely applicable, but flexible to adapt to local needs. By incorporating depletion percentage and nonlinear transmissivity, the methodology can help management agencies prioritize monitoring locations in areas where future depletions could have the greatest impacts on well yields.
Article
Full-text available
This study describes the water management practice (WMP) use patterns by producers in Arkansas, USA, identifies the factors related to producers' choices among WMP groups, and examines the correlations between WMP uses and irrigated acreage. Using data from the 2016 Arkansas Irrigation Survey, WMPs are divided into four groups: field management, water flow control, water recovery/storage, and advanced irrigation scheduling practices. We find that about 77% of producers in the study area use two or more groups of WMPs to manage multiple aspects of irrigation, but that the factors that are associated with choices of WMPs vary by WMP group. Regression results show that the use of water flow meters, providing producers with education on the looming groundwater shortage problems and awareness of financial assistance available for conservation efforts, such as the state tax credits program, and use of WMPs by family members, friends, and neighbors are associated with increased use of WMPs. However, we find that producers that are older, have smaller farms, and rely more on groundwater are less likely to use some of the WMPs. Regression results also show that using water flow meters positively correlates with total irrigated acres, irrigated acres in rice, and irrigated acres in soybeans, and that for soybeans using more groups of WMPs is associated with a reduced extent of irrigated acreage. This study contributes to the small economics literature on WMPs and provides a more comprehensive picture of how producers use different WMPs to manage irrigation.
Article
Full-text available
Agenten-basierte Modelle ermöglichen es, komplexe dynamische Systeme nachzubilden und werden häufig dort eingesetzt, wo das Verhalten und das Zusammenspiel einzelner Akteure von Wichtigkeit sind. Besonders in gekoppelten, sozio-hydrologischen Modellen ist dieser Modelltypus verbreitet. Agenten-basierte Modelle leiden jedoch unter der starken Abstraktion und der mangelnden Dynamik in der Adaption an neue Um-gebungen, die nicht explizit im Regelwerk des Verhaltens der Agenten definiert sind. Um diesen Makel zu begrenzen wird hier ein wasserwirtschaftliches Agenten-basiertes Modell der Bewässerungsplanung um eine Lernfunktion erweitert, die mit Hilfe des maschi-nellen Lernens hydrologische Situationen erkennt, einordnet und die daraus resultierenden Reaktionen adaptiert. Damit kann ohne weitere Veränderungen der Randbedingungen und Modellkomponenten eine Strategieentwicklung der Agenten innerhalb dieser syn-thetischen Modellstudie beobachtet werden und der Ertrag aus der Bewässerungslandwirtschaft durch eine optimale Ausnutzung der verfügbaren Wasser Ressourcen unter Minimierung von Krankheitsrisiken verdoppelt werden.
Article
Full-text available
Efficient irrigation technologies, which seem to promise reduced production costs and water consumption in heavily irrigated areas, may instead be driving increased irrigation use in areas that were not traditionally irrigated. As a result, the total dependence on supplemental irrigation for crop production and revenue is steadily increasing across the contiguous United States. Quantifying this dependence has been hampered by a lack of comprehensive irrigated and dryland yield and harvested area data outside of major irrigated regions, despite the importance and long history of irrigation applications in agriculture. This study used a linear regression model to disaggregate lumped agricultural statistics and estimate average irrigated and dryland yields at the state level for five major row crops: corn, cotton, hay, soybeans, and wheat. For 1945–2015, we quantified crop production, irrigation enhancement revenue, and irrigated and dryland areas in both intensively irrigated and marginally-dependent states, where both irrigated and dryland farming practices are implemented. In 2015, we found that irrigating just the five commodity crops enhanced revenue by ~$7 billion across all states with irrigation. In states with both irrigated and dryland practices, 23% of total produced area relied on irrigation, resulting in 7% more production than from dryland practices. There was a clear response to increasing biofuel demand, with the addition of more than 3.6 million ha of irrigated corn and soybeans in the last decade in marginally-dependent states. Since 1945, we estimate that yield enhancement due to irrigation has resulted in over $465 billion in increased revenue across the contiguous United States (CONUS). Example applications of this dataset include estimating historical water use, evaluating the effects of environmental policies, developing new resource management strategies, economic risk analyses, and developing tools for farmer decision making.
Article
The Ogallala Aquifer is the main water resource for irrigated agricultural production in much of Western Kansas. It is hypothesized that as crop price expectations increase, producers will apply more water to increase yields in order to maximize profit. Using field-level panel data on groundwater pumped for irrigation in Western Kansas, this paper examines whether irrigated producers’ groundwater pumping decisions are consistent with the profit maximization framework by empirically testing if crop price expectations have a positive impact on the quantity of groundwater pumped. In general, the empirical results indicate that crop price expectations have no statistically significant impact on the quantity of groundwater pumped per acre. This suggests that groundwater pumping decisions are not consistent with the profit maximization framework and that irrigated producers consider groundwater as a fixed input possibly due to limited availability of groundwater in the area. Our econometric analysis also suggests that only a small portion of rainfall is effective.
Article
Groundwater is an important input for agricultural production in many parts of the world. Aquifer depletion has been shown to affect the rate that groundwater can be extracted from an aquifer. In this paper, we develop an analytical framework that accounts explicitly for the effects of limited instantaneous groundwater extraction rate (well capacity) on a producer's irrigation decisions. We show that limited well capacities can affect the producer's groundwater use and profit. We draw three important insights from these findings. First, we demonstrate that the price elasticity of demand for groundwater is higher for lower well capacities. Second, farmers' irrigation decisions are non-monotonic with respect to well capacity and climate conditions. Under a drier climate, producers with greater well capacities increase their groundwater use, and producers with lower well capacities reduce their water use. Third, through numerical analysis, we show that considering spatial heterogeneity in well capacities is important for estimating the cost-effectiveness and distributional impacts of groundwater management policies. Our results shed new light on the importance of extraction capacity for groundwater management policies and the potential impacts of climate change on agricultural production.
Article
Unsustainable groundwater mining threatens the economic stability of several regions around the world.Developing sustainable water-use policies and integrated water-management plans requires reliably predictingwater demand under variable weather and land-use conditions. Here, a water-budget model capable of esti-mating spatial and temporal variations in daily irrigation demand under variable weather and land-use scenariosis modified to consider irrigation management and groundwater pumping limitations. This model uses a com-bination of geospatial and in-situ measured data, including land-use and land-cover maps, crop-specific eva-potranspiration data, and weather data. A coupled sub-model limits pumping rates based on pump capacity andspatial and temporal variations in the saturated thickness of the aquifer. The ability of this water-budget modelto accurately estimate irrigation demand is demonstrated using the High Plains aquifer region of Kansas, USA, anaquifer that has undergone, and continues to undergo, significant depletion due to decades of irrigation. Themodel was calibrated with reported and measured water use for more than 1200fields, with ratios of simulatedannual irrigation demand to actual reported water use of 1.10, 0.78, 0.75, 0.93, and 1.40 for corn, sorghum,soybean, winter wheat, and alfalfa, respectively. Further simulations at a variety of scales, fromfield to multi-county levels, demonstrate that the developed water-budget model is capable of simulating the spatial andtemporal variability of irrigation demand.
Article
Unsustainable groundwater mining threatens the economic stability of several regions around the world. Developing sustainable water-use policies and integrated water-management plans requires reliably predicting water demand under variable weather and land-use conditions. Here, a water-budget model capable of estimating spatial and temporal variations in daily irrigation demand under variable weather and land-use scenarios is modified to consider irrigation management and groundwater pumping limitations. This model uses a combination of geospatial and in-situ measured data, including land-use and land-cover maps, crop-specific eva-potranspiration data, and weather data. A coupled sub-model limits pumping rates based on pump capacity and spatial and temporal variations in the saturated thickness of the aquifer. The ability of this water-budget model to accurately estimate irrigation demand is demonstrated using the High Plains aquifer region of Kansas, USA, an aquifer that has undergone, and continues to undergo, significant depletion due to decades of irrigation. The model was calibrated with reported and measured water use for more than 1200 fields, with ratios of simulated annual irrigation demand to actual reported water use of 1.10, 0.78, 0.75, 0.93, and 1.40 for corn, sorghum, soybean, winter wheat, and alfalfa, respectively. Further simulations at a variety of scales, from field to multi-county levels, demonstrate that the developed water-budget model is capable of simulating the spatial and temporal variability of irrigation demand.
Article
We use an 82-year record of water table data from the High Plains Aquifer to introduce a new application of segmented regression to hydrogeology, and evaluate the effects of droughts, crop prices, and local groundwater management on groundwater level trajectories. Across the High Plains, we find discernable regional cycles of faster and slower water table declines. A parsimonious Classification And Regression Tree (CART) analysis details correlations between select explanatory variables and changes in water table trajectories, quantified as changes in slope of well hydrographs. Drying relative to prior-year conditions is associated with negative changes in slope; in the absence of drying conditions, steep declines in commodity price are associated with positive changes in hydrograph slopes. Establishment of a groundwater management area is not a strong predictor for change in water table trajectories, but more wells tend to have negative changes in around the time of management areas are formation, suggesting that drought conditions are associated with both negative deflections in water table trajectory and enactment of management areas. Segmented regression is a promising tool for groundwater managers to evaluate change thresholds and the effectiveness of management strategies on groundwater storage and decline, using readily available water table data.