ArticlePDF Available

Influence of Irrigation Drivers Using Boosted Regression Trees: Kansas High Plains

Water Resources Research

May 2021
57(5)

DOI:10.1029/2020WR028867

License
CC BY 4.0

Authors:

Erin Haacker

University of Nebraska at Lincoln

Groundwater levels across parts of western Kansas have been declining at unsustainable rates due to pumping for agricultural irrigation despite water‐saving efforts. Accelerating this decline is the complex agricultural landscape, consisting of both categorical (e.g., management boundaries) and numerical (e.g., crop prices) factors that drive irrigation decisions, making integrated water budget management a challenge. Furthermore, these factors frequently change through time, rendering management strategies outdated within relatively short time scales. This study uses boosted regression trees to simultaneously analyze categorical and numerical data against annual irrigation pumping to determine the relative influence of each factor on groundwater pumping across both space and time. In all, 45 key water use variables covering approximately 19,000 groundwater wells were tested against irrigation pumping from 2006 to 2016 across five categories: (1) management/policy, (2) hydrology, (3) weather, (4) land/agriculture, and (5) economics. Study results showed that variables from all five categories were included among the top 10 drivers to irrigation, and the greatest influence came from variables such as irrigated area per well, saturated thickness, soil permeability, summer precipitation, and pumping costs (depth to water table). Variables that had little influence included regional management boundaries and irrigation technology. The results of this study are further used to target the factors that statistically lead to the greatest volumes of groundwater pumping to help develop robust management strategy suggestions and achieve water management goals of the region.

High Plains Aquifer site map (a) and groundwater level change for western Kansas from 2006 to 2016 (b; derived from Haacker et al., 2016).

…

Example data layers for the western Kansas High Plains Aquifer region including dominant groundwater management boundaries (a; Kansas Department of Agriculture), precipitation (b; PRISM, Oregon State University), temperature (c; PRISM, Oregon State University), permeability (d; United States Geological Survey), saturated thickness (e; Haacker et al., 2019), and depth to groundwater (f; derived from Haacker et al., 2016; United States Geological Survey).

…

Boxplots showing RI outputs from BRT runs conducted annually from 2006 to 2016 (Group 1). Black points show RI from a single BRT run conducted on collective data from 2006 to 2016 (Group 3). Note scale change following x‐axis break. RI, relative influence; BRT, boosted regression trees.

…

Boxplots showing RI outputs from BRT runs conducted annually from 2006 to 2016 on data normalized by irrigated area (Group 2). Black points show RI from a single BRT run conducted on collective data from 2006 to 2016 on data also normalized by irrigated area (Group 4). RI, relative influence; BRT, boosted regression trees.

…

Partial dependence plots showing 12 influential variables in 2012 (Group 1), ordered by decreasing relative influence. Tick marks along the inside of the x‐axes represent the distribution of data in deciles.

…

Figures - available from: Water Resources Research

This content is subject to copyright. Terms and conditions apply.

Access to this full-text is provided by Wiley.

Learn more

Content available from Water Resources Research

This content is subject to copyright. Terms and conditions apply.

1. Introduction

Groundwater level decline of the High Plains Aquifer (HPA) underlying western Kansas has been exten-

sively analyzed with most efforts focused on irrigation related to one of four domains: (1) physical (e.g.,

climatic and atmospheric factors and underlying geology; Edwards,2016; MardanDoost etal.,2019; Whit-

temore etal.,2016), (2) agricultural (e.g., crop type, irrigation limits and management, cover crops, and

irrigation efficiency; Butler etal.,2016; Kisekka etal.,2017; Pfeiffer & Lin,2014), (3) socioeconomic (e.g.,

environmental policies, energy costs, irrigation efficiency, global markets, and crop prices; Hrozencik

etal.,2017; Sanderson etal.,2017; Sukcharoen etal.,2020), or (4) an integrated approach among the first

three categories (e.g., Haacker etal.,2019; Majumdar etal.,2020; Smidt etal.,2016). While these studies

provide a comprehensive overview of the complexity of factors that drive irrigation use, little is known

about the relationship between these factors across both space and time relative to irrigation pumping. For

example, studies have shown that irrigation efficiency drives groundwater use (Pfeiffer & Lin,2014), yet it

cannot conclusively be determined that efficiency is more or less influential than annual weather patterns

in driving total withdrawals from the aquifer. The same applies for management strategies versus crop

prices or any other combination of factors compared to others. So, while there is a clear understanding of

Abstract Groundwater levels across parts of western Kansas have been declining at unsustainable

rates due to pumping for agricultural irrigation despite water-saving efforts. Accelerating this decline

is the complex agricultural landscape, consisting of both categorical (e.g., management boundaries)

and numerical (e.g., crop prices) factors that drive irrigation decisions, making integrated water budget

management a challenge. Furthermore, these factors frequently change through time, rendering

management strategies outdated within relatively short time scales. This study uses boosted regression

trees to simultaneously analyze categorical and numerical data against annual irrigation pumping to

determine the relative influence of each factor on groundwater pumping across both space and time.

In all, 45 key water use variables covering approximately 19,000 groundwater wells were tested against

irrigation pumping from 2006 to 2016 across five categories: (1) management/policy, (2) hydrology,

(3) weather, (4) land/agriculture, and (5) economics. Study results showed that variables from all five

categories were included among the top 10 drivers to irrigation, and the greatest influence came from

variables such as irrigated area per well, saturated thickness, soil permeability, summer precipitation, and

pumping costs (depth to water table). Variables that had little influence included regional management

boundaries and irrigation technology. The results of this study are further used to target the factors that

statistically lead to the greatest volumes of groundwater pumping to help develop robust management

strategy suggestions and achieve water management goals of the region.

Plain Language Summary Water use for crops has lowered groundwater levels in western

Kansas. Past studies have shown that this water use is driven by many factors spanning policy, economics,

and the physical environment. Because of this complexity, it has been difficult to fully understand which

factors most drive irrigation use relative to each other. This study uses a machine-learning model to

rank the influence of 45 factors on irrigation pumping. These factors are analyzed over space (∼19,000

wells across western Kansas) and time (2006–2016). Based on this study, drivers to water use include

total irrigated area, summer rainfall, and depth to the water table. Factors that have little influence

include management district boundaries and irrigation system type. These results are used to make water

management suggestions for the region.

LAMB ET AL.

This is an open access article under

the terms of the Creative Commons

Attribution License, which permits use,

distribution and reproduction in any

medium, provided the original work is

properly cited.

Influence of Irrigation Drivers Using Boosted Regression

Trees: Kansas High Plains

Susan E. Lamb1 , Erin M. K. Haacker2 , and Samuel J. Smidt1

1Soil and Water Sciences Department, University of Florida, Gainesville, FL, USA, 2Department of Earth &

Atmospheric Sciences, University of Nebraska-Lincoln, Lincoln, NE, USA

Key Points:

• The relative influence of drivers

to irrigation pumping in western

Kansas is modeled from 2006 to 2016

using boosted regression trees

• Site-specific factors are more

influential than regional policies,

and weather-related factors are more

influential at longer time scales

• Irrigation decision making largely

follows resource availability and

operation cost considerations over

localized management schemes

Supporting Information:

Supporting Information may be found

in the online version of this article.

Correspondence to:

S. E. Lamb,

susan.lamb@ufl.edu

Citation:

Lamb, S. E., Haacker, E. M. K., & Smidt,

S. J. (2021). Influence of irrigation

drivers using boosted regression trees:

Kansas High Plains. Water Resources

Research, 57, e2020WR028867. https://

doi.org/10.1029/2020WR028867

Received 18 SEP 2020

Accepted 14 MAR 2021

10.1029/2020WR028867

Special Section:

The Quest for Sustainability

of Heavily Stressed Aquifers at

Regional to Global Scales

RESEARCH ARTICLE

1 of 16

Water Resources Research

the driving factors that lead to irrigation use, both individually and as a

group (e.g., Haacker etal.,2019; Smidt etal.,2016), uncertainty still exists

in understanding the spatiotemporal influence of how these factors work

together to influence irrigation decision making. Measurement is a hall-

mark of precision agriculture and other targeted management schemes

(e.g., deficit irrigation management; Rudnick etal.,2019), as farmers can

often better manage the resources that they can measure. The uncertain-

ty surrounding the relative influence (RI) of irrigation drivers leads to

mismanagement and continued groundwater level decline for the region

(Smidt etal.,2016).

Past studies analyzing the relationships between irrigation drivers across

space and time have been limited as drivers are both numerical (e.g., av-

erage temperature and crop price) and categorical (e.g., crop type and

Groundwater Management District [GMD]). One approach to overcom-

ing these methodological data challenges has been to analyze numerical

data by category and then compare results (e.g., on a GMD by GMD basis;

Whittemore etal., 2016). This can be useful for resource management,

since the boundaries of the GMDs correspond to some physical features

of the aquifer, and these analyses provide results at the same scale at

which management programs are enacted. However, this limits modeled

relationships by (1) preventing categorical data from interacting directly

with numerical data or (2) minimizing the scale at which drivers can im-

pact aquifer use. Fortunately, advances in machine-learning techniques,

namely boosted regression trees (BRT; Elith etal.,2008), have allowed

for improved analysis when grouping disparate data. BRT is a statisti-

cal ensemble that combines regression trees and data boosting to define

relationships between variables, including the simultaneous analysis of

categorical and numerical variables (Elith etal.,2008). When applied to

irrigation in western Kansas, BRT can accurately characterize the relative influence of sociophysical fac-

tors on annual irrigation pumping and offer strong predictive power for estimating future water use (Elith

etal.,2008; Hu etal.,2017).

Here, we use BRT analysis to quantify the predominant drivers to irrigation use in western Kansas from

2006 to 2016 to (1) improve understanding of how these drivers relate, (2) develop meaningful manage-

ment objectives based on these relationships, and (3) demonstrate BRT as a useful water management tool.

Synchronously, we analyze 45 irrigation drivers spanning five driver groups (management/policy, hydrol-

ogy, weather, land/agriculture, and economy) to quantify the predominant controls of irrigation pumping

across both space and time. We further isolate dominant driver trends to target the most influential social

and physical variables impacting western Kansas. The results of this analysis are then used to prioritize

management efforts across the region to balance agricultural production and groundwater level declines.

2. Methods

2.1. Site Description

Agriculture is the dominant land use across western Kansas. Approximately 94% of land cover was dedi-

cated to agricultural production from 2006 to 2016, and 77% of all cropland was composed of just six com-

modities: winter wheat (38%), corn (19%), sorghum (12%), soybeans (3%), alfalfa hay (3%), cotton (<1%), or

a double crop planting of the six (1%; USDA-CDL,2006–2016). During the same period, HPA groundwater

levels across the region declined by an average of 2.8m (Figure1; derived from Haacker etal.,2016). Despite

this decline, water withdrawals from the underlying HPA remain essential to the agricultural production

and cultural identity of western Kansas, especially due to the region’s low humidity, persistent winds, and

limited precipitation relative to crop water demands (Sanderson & Frey,2014).

LAMB ET AL.

10.1029/2020WR028867

2 of 16

Figure 1. High Plains Aquifer site map (a) and groundwater level

change for western Kansas from 2006 to 2016 (b; derived from Haacker

etal.,2016).

Water Resources Research

Groundwater used for irrigation in western Kansas largely sustains the expansive agricultural production

of the region and can add more than $500 million in annual revenue for the state compared to dryland-on-

ly production (excluding operation costs; Smidt etal.,2019). In 2005, the groundwater was valued at $1.2

billion (as natural capital; Fenichel etal.,2015). However, continued groundwater pumping in Kansas is

estimated to deplete nearly 40% of the underlying HPA in the next half century (Steward etal.,2013). This

depletion will inevitably force some farmers to convert to less productive dryland operations which can pro-

duce 2–4 times lower crop yields per area (Cotterman etal.,2018; Smidt etal.,2016; Steward etal.,2013).

Several strategies have been introduced to slow water loss and stabilize groundwater levels throughout

the region, including efficient technologies (Pfeiffer & Lin,2014), drought-resistant cultivars (Cotterman

etal.,2018), and localized management boundaries (Deines etal.,2019). For example, five GMDs have been

formed since the 1970s to develop and enforce local irrigation policies (Peck,2006). Yet, groundwater lev-

els have continued to decline across the region prompting the development of further management zones

within and across district boundaries (K.S.A. 82a-1036,1978; K.S.A. 82a-1041,2012). While some areas have

found success in these efforts (e.g., Deines etal.,2019), much of the region has yet to stabilize groundwater

levels (e.g., Haacker etal.,2019; Whittemore etal.,2018). A clear gap remains between the intended man-

agement of groundwater, the responding use of irrigation, and the understanding of how these interact

throughout the region.

Compounding this agricultural water management challenge is the socioenvironmental heterogeneities

that further influence water use practices across western Kansas (Figure2; Whittemore etal., 2018). For

example, GMDs 1, 3, and 4 receive less annual precipitation (43–58cm) than GMDs 2 and 5 (58–63 cm;

Whittemore etal.,2018). In regards to management, GMDs 1, 3, and 4 operate under “planned depletion”

strategies while GMDs 2 and 5 have more favorable groundwater recharge and enact a “safe yield” scheme

(Peck,2006; Whittemore etal.,2018). Other differences at the GMD level include a variability in average

monthly temperature, soil permeability, depth to water table, and interstate compacts (IC). Collectively,

through these heterogeneities, western Kansas is a unique case study of mixed physical and social variabil-

ity with high data resolution valuable for informing progressing agricultural water use and management

techniques in this region and elsewhere.

As part of the state’s water management agenda, the Kansas Department of Agriculture and Kansas Ge-

ological Survey have continually managed an open-access Water Information Management and Analysis

System (WIMAS) since 1996 with annual water use records for over 45,000 wells across the state. Of these

wells, 19,000 are located in western Kansas with access to the HPA. Associated data include information

such as well type, installation depth, depth to water, pumping amount, pumping allotment based on gov-

ernment regulation, irrigation system type, and crop type. No other region of the HPA has this type of data

resolution, likely even in private databases. Collectively, this combination of driver complexity and data

LAMB ET AL.

10.1029/2020WR028867

3 of 16

Figure 2. Example data layers for the western Kansas High Plains Aquifer region including dominant groundwater

management boundaries (a; Kansas Department of Agriculture), precipitation (b; PRISM, Oregon State University),

temperature (c; PRISM, Oregon State University), permeability (d; United States Geological Survey), saturated thickness

(e; Haacker etal.,2019), and depth to groundwater (f; derived from Haacker etal.,2016; United States Geological

Survey).

Water Resources Research

resolution make this location an ideal selection for using BRT analysis to better understand influences on

irrigation.

2.2. Data

Data used in this study can be summarized into independent variable (i.e., drivers to irrigation pump-

ing) and dependent variable (i.e., irrigation pumping) categories. Collectively, we analyzed 45 independent

variables and 1 dependent variable. The dependent variable is total annual irrigation pumping amount

reported on the well level. The independent variables were identified as influential to irrigation use based

on published literature and informal conversations with experts, colleagues, and stakeholders and are or-

ganized into five categories: management/policy, hydrology, weather, land/agriculture, and economics. All

data used in the analysis are summarized in Table1.

Well-specific data, such as annual pumping data and crop irrigated, for approximately 19,000 agricultural

irrigation wells in western Kansas for years 2006–2016 were downloaded through the WIMAS maintained

by the Kansas Department of Agriculture, Division of Water Resources and Kansas Geological Survey

(KDA-DWR & KGS, version 5). These data, which included GPS coordinates for each pumping well, were

then read into ArcGIS (version 10.5). Spatial driver data were downloaded as or converted to raster files

in ArcGIS. If downloaded, cell sizes were kept consistent with their original format. If converted from a

shapefile, we assigned a standard 11.1-m × 11.1-m cell size. Well density was produced in with a 1.11km by

1.11km cell size. Annual, statewide market crop value data for alfalfa, corn, sorghum, soybeans, and wheat

grown in Kansas were accessed from the United States Department of Agriculture and integrated into the

well data attribute table (USDA-NASS,2006–2016). Saturated thickness and water table elevation were cre-

ated using the methods described in Haacker etal.(2016). Depth to water table was created in ArcGIS using

the aforementioned water table elevation data and a digital elevation model from the United States Geolog-

ical Survey. The Right variable was generated by concatenating WIMAS data set variables relating to right

type as well as status of the water right file, water right, and point of diversion. The shapefiles for intensive

groundwater use control areas (IGUCAs) are public data and were requested from the Kansas Department

of Agriculture via an Open Records Request. All data were unique annually except for boundary data (e.g.,

GMD, Basin), hydraulic conductivity, and soil permeability as they were effectively static over the temporal

range of the study. No predetermined weights were applied to predictor variables in the analysis due to the

exploratory nature of the research. All data were spatially aligned using the NAD 1927 Geographic Coor-

dinate System and USA Contiguous Albers Equal Area Conic projected coordinate system. All driver data

values were then spatially attributed to each well annually from 2006 to 2016 and exported in spreadsheet

format (i.e., columns are variables and rows are individual wells) for use in the boosted regression tree

modeling.

2.3. Description of Boosted Regression Trees

Boosted regression trees draw on both statistical and machine-learning techniques to determine the rela-

tive influence of each predictor variable (i.e., independent variable) on a response variable (i.e., dependent

variable; Elith etal.,2008). Specifically, BRT relies on two algorithms: (1) regression trees and (2) boosting.

Regression trees are decision trees that have repeating binary splits to identify how a subset of predictor var-

iables relate to the response variable based on a defined predictor value as a split point (e.g., May precipita-

tion>35mm; Elith etal.,2008). Decision tree complexity (i.e., total number of trees and number of nodes in

a tree) is defined by the user. Each regression tree culminates with the calculation of a residual value. Boost-

ing then uses the results of a regression tree (i.e., residual) to improve the fit of the next tree (i.e., improve

the residual). This sequence progresses stage-wise through the model rather than stepwise, thus existing

trees are not changed but instead the model estimates are updated as new trees are added (Elith etal.,2008).

This process continues until the defined number of trees is reached or the residual has reached its optimum

value, at which point improvements in model estimates are negligible. A learning rate further defines the

contribution, or weight, of each tree to the model. Based on the results of the boosted regression trees, the

model can then quantify the relative influence of each predictor variable on the response variable. Refer to

Hastie etal.(2009) for more information and extended derivation of regression trees, Ridgeway(2007) for

boosting, and Elith etal.(2008) and Friedman and Meulman(2003) for an integration of the two.

LAMB ET AL.

10.1029/2020WR028867

4 of 16

Water Resources Research

In addition to its ability to analyze both numerical and categorical data, a BRT model was selected for this

study as it is not sensitive to outliers and missing data in predictor variables (Elith etal.,2008). BRT was

chosen over a random forest model, which tends to perform poorly when there are many low-influencing

variables (Hastie etal.,2009). Furthermore, statistical approaches such as generalized additive models were

not utilized as, unlike BRT, interactions between predictor variables are not automatically modeled (Elith

etal.,2008; Hastie etal.,2009).

LAMB ET AL.

10.1029/2020WR028867

5 of 16

Variable Description Units/scale Mean; standard deviation Source

Dependent variable

Amt_Irr Total annual irrigation pumping m3/year 204,479; 152,316 1

Independent variables

Management/policy

GMD Groundwater Management District Boundary Categorical 1

IGUCA Intensive Groundwater Use Control Area Boundary Categorical 2

RC Rattlesnake Creek Management Plan Boundary Categorical 3

IC Interstate compacts Boundary Categorical 3

Right Water rights Concatenated Categorical 1

Auth_Amt Authorized amount of irrigated water m3/year 326,391; 273,905 1

Auth_Rate Authorized pumping rate m3/min 3; 12 1

Auth_Area Authorized irrigated area ha/year 169; 262 1

Hydrology

Basin River basin Boundary Categorical 1

ST Saturated thickness Meters, 250-m × 250-m 36; 27 6

WT_Elev Interpolated water table elevation Meters, 250-m × 250-m 771; 171 6

K Aquifer hydraulic conductivity Contours, 0–90m/day Categorical 5

Weather

T01, T02, … Mean monthly temperature Celsius, 4-km × 4-km 13; 2 4

T_Annual Mean annual temperature Celsius, 4-km × 4-km 13; 1 4a

P01, P02, … Total monthly precipitation mm, 4-km × 4-km 48; 36 4

P_Annual Mean annual precipitation mm, 4-km × 4-km 48; 15 4a

Land/agriculture

SP Mean soil permeability Very slow to very rapid Categorical 3

System Irrigation technology Type Categorical 1

Area_Irr Reported annual irrigated area ha/year 58; 57 1

WD Well density Wells/km2, 0.01°×0.01° 0.4; 0.2 1

Crop Reported commodity Type Categorical 1

Economics

CV_Area Annual KS crop price per unit area $/ha Crop specific 5

Depth Depth to water table Meters, 250-m × 250-m 41; 28 3, 6a

Note. Mean and standard deviation of collected data from 2006 to 2016. Source key: (1) Water Information Management and Analysis, maintained by the Kansas

Department of Agriculture, Division of Water Resources and Kansas Geological Survey; (2) Kansas Department of Agriculture, Division of Water Resources –

Open Records Request; (3) United States Geological Survey; (4) PRISM Climate Group, Oregon State University; (5) United States Department of Agriculture;

(6) Haacker etal.(2016).

aIndicates that data were derived from these sources rather than utilized directly.

Table 1

Data, Descriptions, and Sources

Water Resources Research

2.4. Construction of BRT Model

We used foundational R code (R Development Core Team,2018) from Elith etal.(2008) to implement the

BRT. While this code utilized the gbm R package, we used its expanded version for this study, the dismo R

package (v. 3.5.2 for Mac and v. 3.5.1 and v.3.6.2 for Windows; Hijmans etal.,2017). This code was then

modified to fit the conditions of our data set.

In BRT analyses, five conditions must be predetermined: (1) the total number of trees to be used in the

analysis, (2) distribution of the loss function, (3) bag fraction (i.e., the amount of data randomly selected at

each step and not replaced), (4) the tree complexity (i.e., number of nodes in each tree), and (5) the learning

rate of the model (i.e., the influence of each tree in the model; a value of 0–1) (Elith etal.,2008). Here, we

did not set a limit to the number of trees and allowed the model run until it optimized the residual loss on

the response variable. We used a Laplace distribution because it is optimal for data sets with a continuous

response variable (Ridgeway,2007) and provides a more robust fit to the data compared to other distribu-

tion options (Lampa etal.,2014; Murphy,2012), which we believe is more suitable for agricultural data that

often have large variability. The bag fraction was set at 0.5. The learning rate was set to 0.05, and the tree

complexity to 24, as determined through a calibration and validation process.

We ran a BRT model using the 2016 data for learning rate (lr) values of 0.1, 0.05, and 0.01 and tree com-

plexity (tc) values of 2–18 in increments of 2. We utilized two values from models run on each combination

of parameters (1) a tenfold cross validation (CV) statistic and (2) r-squared statistic. For each combination

of parameters. We maintain that CV is an appropriate metric for evaluating the best model, as BRT differs

from other statistical methods in that there are no p values and degrees of freedom are difficult to identify

(Elith etal.,2008). Also, CV can be a more robust sensitivity analysis for machine-learning models rather

than Akaike information criterion (Hauenstein etal., 2017). For the r-squared analysis, we conducted a

BRT model on 50% of the data and used the function predict.gbm to predict the remaining 50% of data. This

prediction was plotted against the observed values to determine the r-squared statistic.

BRT is a stochastic technique, thus the CV and r-squared statistics change marginally between iterations

even if conducted on the same parameter combinations. Because of this, we utilized the mean CV statistics

for each learning rate, and the mean r-squared statistics for each tree complexity. Identifying the highest

mean CV and r-squared statistics, we determined that a learning rate of 0.05 and a tree complexity of 24

would produce the best performing model. Please reference the Supplemental Information for more detail

on the calibration and validation steps.

2.5. Application of BRT Models

Once parameters were identified, we conducted four groups of BRT models: (1) annually, (2) annually with

pumping normalized by irrigated area, (3) all years grouped together, and (4) all years grouped together

with pumping normalized by irrigated area. Annual models (Groups 1 and 2) utilized unique data for each

year, whereas aggregated models (Groups 3 and 4) examined all years of data simultaneously and time was

not distinguished as a variable. We normalized pumping by irrigated area in two of the groups to eliminate

an anticipated strong correlation between total pumping and total irrigated area. We also initially ran each

year as an individual BRT model to systematically flag the noninfluential variables using the gbm.simplify

function of the R code. This allowed us to manually reduce noise by eliminating noninformative variables

prior to our analysis. Note that the collective influence of variables will always add up to 100, and even

noninformative variables will be assigned a nonzero value in a BRT analysis, albeit quite small. We used the

simplified sets for Groups 1 and 2. We could not use the simplified sets for Groups 3 and 4 as noninformative

variables were flagged in some years but not others (Table2), leaving incomplete driver data sets for the

groups with years combined together. We also used only a randomly selected 25% of data for each variable

in Groups 3 and 4, as this still included over 2.7 million data points and challenged computing capabilities

(Dell Precision 5820 desktop computer, 2018, Windows 10, Intel(R) Xeon(R) W-2123 CPU @ 3.60 GHz,

32GB RAM). A summary of completed models is outlined in Table2.

LAMB ET AL.

10.1029/2020WR028867

6 of 16

Water Resources Research

3. Results

The relative influences of each irrigation driver on groundwater pumping amount for the Group 1 models

(annual, simplified) and Group 3 model (years combined, unsimplified) are displayed in Figure3; each

variable is further color coded by driver category.

Drivers are arranged by decreasing mean RI for Group 1 over the temporal range of the study, where the

boxes on the Group 1 boxplots are the interquartile range (IQR) and the whiskers extend to the minimum

and maximum values. Hollow points indicate RI outliers from Group 1 results. In Group 1, irrigated area

was the most dominant driver with a mean RI of 18.0%, followed by authorized amount, saturated thick-

ness, and authorized pumping rate (mean RI of 5.3%, 5.3%, and 4.7%, respectively). Localized management

drivers had the least influence on irrigation pumping, with GMDs accounting for 0.1% of influence, IGU-

CAs for 0.1%, and IC for 0.2%. The RI of weather-related drivers ranged from an 0.6% to 5.3%, with annual

precipitation being the greatest influencing weather variable by mean. Weather variables also had the great-

est occurrence of RI outliers across time, likely corresponding to extreme weather events. Regardless of the

month, precipitation had a higher median RI on irrigation pumping than did temperature. For manage-

ment/policy variables, well-scale policies were strong drivers to irrigation pumping whereas regional-scale,

LAMB ET AL.

10.1029/2020WR028867

7 of 16

Year Variables removed Number of trees CV (mean, standard deviation) Deviance (mean, standard deviation)

Group 1: 11 models, annual, simplified (gbm.simplify)

2006 4 (GMD, IC, IGUCA, RC) 2,150 0.834, 0.003 43.257, 0.379

2007 4 (GMD, IC, IGUCA, RC) 1,600 0.836, 0.004 40.124, 0.438

2008 2 (IC, RC) 1,650 0.862, 0.003 41.055, 0.333

2009 4 (GMD, IC, IGUCA, RC) 1,900 0.851, 0.002 38.424, 0.349

2010 4 (GMD, IC, IGUCA, RC) 2,000 0.857, 0.002 38.644, 0.273

2011 2 (GMD, RC) 2,000 0.874, 0.004 44.883, 0.387

2012 2 (IGUCA, RC) 1,800 0.859, 0.004 42.913, 0.420

2013 5 (GMD, IC, IGUCA, RC, Right) 1,450 0.860, 0.003 41.058, 0.404

2014 5 (GMD, IC, IGUCA, RC, Right) 1,800 0.860, 0.002 40.486, 0.371

2015 6 (GMD, IC, IGUCA, K, RC, Right) 1,600 0.833, 0.003 37.077, 0.188

2016 4 (GMD, IC, IGUCA, RC) 1,900 0.845, 0.003 36.923, 0.302

Group 2: 11 models, annual, simplified (gbm.simplify), normalized by area irrigated

2006 4 (GMD, IC, IGUCA, RC) 2,150 0.550, 0.030 0.315, 0.002

2007 4 (GMD, IC, IGUCA, RC) 2,150 0.631, 0.007 0.292, 0.002

2008 2 (IC, RC) 1,900 0.641, 0.014 0.296, 0.002

2009 4 (GMD, IC, IGUCA, RC) 2,350 0.617, 0.016 0.278, 0.003

2010 4 (GMD, IC, IGUCA, RC) 2,150 0.634, 0.005 0.282, 0.001

2011 2 (GMD, RC) 2,700 0.560, 0.007 0.336, 0.003

2012 2 (IGUCA, RC) 2,550 0.566, 0.007 0.327, 0.003

2013 5 (GMD, IC, IGUCA, RC, Right) 1,950 0.653, 0.009 0.301, 0.003

2014 5 (GMD, IC, IGUCA, RC, Right) 2,500 0.619, 0.006 0.294, 0.002

2015 6 (GMD, IC, IGUCA, K, RC, Right) 2,200 0.553, 0.040 0.278, 0.002

2016 4 (GMD, IC, IGUCA, RC) 2,450 0.632, 0.006 0.270, 0.002

Group 3: 1 model, combined years (25% from each year)

Combined years No removed variables 2,650 0.857, 0.002 41.199, 0.230

Group 4: 1 model, combined years (25% from each year), normalized by area irrigated

Combined years No removed variables 2,750 0.597, 0.027 0.310, 0.002

Table 2

BRT Models (Size, Cross Validation, and Deviance)

Water Resources Research

boundary level policies were not strong drivers. The Rattlesnake Creek Management Plan was also a district

boundary examined in this study but was classified as noninfluential by gbm.simplify across all years and

was not considered in the Group 1 model and thus excluded from Figure3. Generally, variables with the

greatest variance in RI across years were among top influencing variables, and those with the smallest

variance in RI across years were among low-influencing variables. With the exception of irrigated area,

all driver variables reported RI values of less than 10% in any given year, with most being less than 5% in

all years. All five variable categories are represented in the top nine influencing drivers, with the top three

drivers accounting for 28.7% of RI by mean across years.

As for Group 3 results (all years combined), the top three drivers included irrigated area with a RI of 21.7%,

saturated thickness with 5.2%, and annual precipitation with 4.8%. Outside the large RI increase for annual

precipitation, trends remained consistent between Groups 1 and 3. In addition to annual precipitation, irri-

gated area also had a notable increase in RI; combined, these increases resulted in the slight RI decrease for

most other variables as all contributing variables sum to 100%. The overall complexity of variables influenc-

ing irrigation pumping is further highlighted through all five categories contributing to the top nine driving

factors and nearly all drivers contributing less than 5% to the total influence on pumping.

The relative influences of each irrigation driver on groundwater pumping amount for the Group 2 models

(annual, simplified, and normalized by irrigated area) and Group 4 model (years combined, unsimplified,

and normalized by irrigated area) are displayed in Figure4; each variable is further color coded by driver

category.

LAMB ET AL.

10.1029/2020WR028867

8 of 16

Figure 3. Boxplots showing RI outputs from BRT runs conducted annually from 2006 to 2016 (Group 1). Black points

show RI from a single BRT run conducted on collective data from 2006 to 2016 (Group 3). Note scale change following

x-axis break. RI, relative influence; BRT, boosted regression trees.

Water Resources Research

Drivers are arranged by decreasing mean RI for Group 2 over the temporal range of the study, where whisk-

ers on the Group 2 boxplots are likewise set at minimum and maximum values and the box is the IQR, and

the hollow dots are outlier RI values. While the collective RI values still summed to 100%, normalizing

pumping by irrigated area allowed for more detailed characterization of other less influential variables.

Here, trends remained similar to those observed in Group 1, with the exception of relative shuffling of the

top five variables. In Group 2, saturated thickness was the top influencing driver (mean RI of 6.3%) and

authorized area was second (mean RI of 5.9%). Depth to water (cost to pump; mean RI of 5.7%) moved

ahead of authorized rate (mean RI of 5.4%) when compared to Group 1. Crop type experienced increased

variability across years, as did crop value per hectare.

Variables with the largest mean RI no longer observed the greatest variance in RI across years, although

generally larger RI values resulted in greater variance. All driver variables still reported RI values of less

than 10% in any given year, with most being less than 5% in all years. All five variable categories are rep-

resented in the top 10 influencing drivers, with the top three drivers accounting for 17.7% of RI by mean

across years.

In Group 4 (all years combined, normalized by irrigated area), the top three influencing drivers were satu-

rated thickness (6.2%), annual precipitation (6.0%), and crop type (4.9%). Similar to Group 3, annual precip-

itation had the greatest increase in RI, resulting in a decrease in RI for other top drivers. Outside the top 10

drivers, Group 4 results matched closely with the results of Group 2. All five categories were present in the

top 10 variables, and all variables had RI values of less than 8% with most contributing to less than 5% of the

total influence on groundwater pumping.

LAMB ET AL.

10.1029/2020WR028867

9 of 16

Figure 4. Boxplots showing RI outputs from BRT runs conducted annually from 2006 to 2016 on data normalized by

irrigated area (Group 2). Black points show RI from a single BRT run conducted on collective data from 2006 to 2016 on

data also normalized by irrigated area (Group 4). RI, relative influence; BRT, boosted regression trees.

Water Resources Research

While RI is a meaningful metric, it fails to characterize whether a given variable contributes to inhibits

pumping. This type of influence is displayed using partial dependence (PD) plots. Figure5 displays PD plots

for 12 example variables from Group 1 in 2012, where positive y-axis values indicate that corresponding

x-axis values are more likely to predict irrigation pumping (stronger prediction). The reverse is also true,

negative y-axis values indicate that corresponding x-axis values are less likely to predict irrigation pumping

(weaker prediction). The magnitude of these values describes the strength of the correlation. These func-

tions reveal the individual impact of the driver on irrigation pumping after the mean impact of the other

drivers in the model is considered (Elith etal., 2008). Because of this, individual variables with strong

interactions may not be well represented in these plots, but it remains a useful tool for understanding the

general relationship between a predictor variable and the response variable (Friedman,2001; Friedman &

Meulman,2003). Reference the Supplemental Information for examples regarding variable interactions.

Drivers are arranged in order of decreasing RI and can be summarized into three general patterns: (1)

increasing PD slopes, (2) decreasing PD slopes, and (3) nonlinear PD slopes. Variables with increasing PD

LAMB ET AL.

10.1029/2020WR028867

10 of 16

Figure 5. Partial dependence plots showing 12 influential variables in 2012 (Group 1), ordered by decreasing relative

influence. Tick marks along the inside of the x-axes represent the distribution of data in deciles.

Water Resources Research

slopes include irrigated area, authorized amount, saturated thickness, depth to water, authorized rate, and

water table elevation. For example, as the number of irrigated areas increased, so did the occurrence of

pumping, creating a positive PD slope. Variables with decreasing PD slopes included July, August, and Oc-

tober precipitation. Here, low precipitation values correlated with pumping activity and high precipitation

values did not correlate with pumping activity, leading to a negative PD slope. Variables with nonlinear

PD slopes included authorized area, well density, and soil permeability. These variables did not have a pre-

dictable PD pattern, showing both correlation with pumping and no pumping at variable points within the

range of their data values. This relationship is also common in categorical data which do not have linear

characteristics or behaviors to generate meaningful slope depictions.

Because outliers can seemingly skew the results of PD plots, decile marks are also included along the x-axis

of each variable to communicate the magnitude of data represented in the plot. To further demonstrate how

these data are interpreted, we plotted empirical cumulative distribution plots of driver data beneath three

example PD plots for three variables in 2012 (Group 1), where shaded values have positive y-axis values,

thus are correlated with pumping (Figure6). These plots also identify the threshold at which pumping is

likely to occur (or no longer occur). For example, pumping was likely to occur when saturated thickness was

more than 41-m, July precipitation less than 77-mm, or water table elevation more than 737-m.

4. Discussion

4.1. Correlative Relationships

In this study, we seek to confirm correlations between predictor variables and irrigation pumping rather

than infer causality. In discussion of results, we offer potential explanations for these correlations as we aim

characterize what drives irrigation use in this region. Beyond identifying strong correlations, the BRT model

also determines noisy variables. These variables have the lowest relative influence, indicating that they not

only are uncorrelated with pumping but also do not drive pumping. Therefore, future water management

efforts can be guided away from these low-influencing variables and redirected toward stronger drivers. This

is a valuable deduction as a wide range of variables impact irrigation decision making and any reduction of

noise helps target effective management.

Furthermore, some predictor variables in this study are endogenous and have potential to cause pumping,

result from pumping, or likely both. For example, water table depth is highly correlated to pumping in this

study which could be a cause of pumping and/or result from pumping. As this work focuses on irrigation

drivers, we propose possible reasons why water table depth could drive pumping without claiming certain

causality. Parsing causality from consequence becomes even more difficult with a variable like depth as it is

temporally nonautonomous. In this way, pumping in a given year can be impacted by the change in water

table depth from the previous year, which can then impact water table depth in the upcoming year. Depth

is unlike autonomous variables such as precipitation, which are not linked to their impact in prior growing

LAMB ET AL.

10.1029/2020WR028867

11 of 16

Figure 6. Partial dependence plots (top) with corresponding empirical cumulative distribution functions (bottom) to

show influence of driver on irrigation pumping along driver data ranges.

Water Resources Research

seasons (i.e., the amount of rain in a given year is not largely impacted by the amount of rain in the previous

year). Rather, depth can both be a driver and consequence of irrigation pumping. Even if this BRT model

was conducted using predevelopment water table depths rather than current measurements, this unique

relationship could not be fully captured.

4.2. Strong Versus Weak Drivers

Strong drivers were primarily those specific to the conditions of an individual well (e.g., total irrigated

area, authorized rate, authorized amount, authorized area, and well density). Weak drivers were primari-

ly those more representative of regional characteristics (e.g., groundwater doctrine, GMD, other localized

management boundaries, river basin, or hydraulic conductivity of the aquifer). This difference between

well-scale and regional influence can likely be attributed to individual farmers making decisions to irrigate

on a case-by-case basis within the regulations of their larger-scale governing frameworks. For example, a

water user more inclined to irrigate compared to a neighbor under the same regional governance will am-

plify well-specific drivers and buffer the influence of regional governance drivers, as regional drivers are

the same between two disparate users. This relationship likely explains why well-specific governance (e.g.,

authorized area) is a strong driver, whereas regional governance (e.g., state groundwater doctrine; Right),

which defines the well-specific governance, is a weak driver.

More directly, this relationship suggests that users are not limited or forced to change behavior by their re-

gional governance and may be self-electing to change behavior despite their legal rights to business-as-usu-

al. For example, GMDs 1, 3, and 4 have less groundwater supply but operate under “planned depletion”

doctrine, and GMDs 2 and 5 have greater groundwater supply under the more conservative “safe yield”

doctrine (Peck,2006; Whittemore et al.,2018). But this does not mean all farmers in planned depletion

zones are seeking to deplete the aquifer. This is especially true as GMD (and thus management scheme) was

a weak driver. Several grassroots movements have been observed throughout the study region to prioritize

water conservation given the limitation that regional governance does not adequately align conservation

goals, and the impacts of these movements on reducing water usage may also be captured here; water man-

agement decisions have been found to correlate with close network members such as families, friends, and

neighbors (Nian etal.,2020). Another contributing explanation as to why well-specific drivers are stronger

than regional drivers may be that Local Enhanced Management Areas (LEMAs) introduced in some por-

tions of the study area have enacted policies shown to reduce irrigation pumping within the frameworks of

larger-scale governing regulations (Deines etal.,2019; Whittemore etal.,2018). However, LEMAs were not

evaluated in this BRT model as the driver did not cover the full temporal range of the study.

Additionally, precipitation drivers were always stronger drivers to pumping than temperature drivers, and

growing season precipitation was more influential than off-season precipitation. This is not only logically

supported as adequate soil moisture is a critical metric to plant production (Basso & Ritchie,2014), but this

is also supported by the greater variability in seasonal values for precipitation compared to temperature.

Irrigation is often used as a tool for overcoming or reducing the negative impacts of seasonal variability

(Whittemore etal.,2016), where water applications can be used to buffer the impacts of drought conditions

or extreme temperatures (Basso & Ritchie,2014; Whittemore etal.,2016). As precipitation was more varia-

ble than temperature during this study, it is reasonable that it would be the stronger driver.

Irrigation technology, such as flood or center pivot, was not an influential variable despite the frequent

discussion that higher-efficient systems lead to water savings (e.g., Schaible & Aillery,2012). This may be

due to nearly 76% of all wells being high-efficiency LEPA systems (KDA-DWR & KGS, 2021), in which

case reduced variability within the driver has less influence on the pumping results. Also possible, this

low influence is because efficient irrigation has been documented not to result in water savings (Pfeiffer &

Lin,2014), as farmers can irrigate more area for a reduced price compared to inefficient technologies. This

behavioral response further supports total irrigated area being the predominant driver to pumping and the

growing understanding that efficient technology does not reduce water use as long as there are incentives

for farmers to irrigate more area (Pfeiffer & Lin,2014; Smidt etal.,2016). Interestingly, few drivers in this

study can be controlled directly by farmers, and those that can were among high-influencing variables (i.e.,

number of acres irrigated, crop type). Irrigation system type is the only farmer-controlled driver among

low-influencing variables.

LAMB ET AL.

10.1029/2020WR028867

12 of 16

Water Resources Research

4.3. Partial Dependence Behaviors

Most of the top influencing drivers followed a positive slope (i.e., as driver values increase, so does the

connection to pumping), with the exception of well density, crop type, and precipitation; well density and

precipitation followed a negative slope, and crop type was categorical. While the partial dependence of these

drivers was mostly as expected (e.g., dry conditions led to pumping), we did observe unexpected results in

moderately influential drivers. For example, the results of depth to water were opposite what seemed intu-

itive. Instead of greater depths leading to a negative pumping slope (i.e., increased pumping costs leading

to decreased pumping), we found greater depths to water led to more pumping. This is perhaps because

pumping is so heavily established in these well locations that increased costs are a necessary, or unavoid-

able, operational factor or because decreased well yields may demand greater time spent for a center pivot

to make its way around a field (Rad etal.,2020), potentially affecting the ratio between evaporation and in-

filtration. It may also be feasible that smaller farm behaviors, acting more in line with expected operational

costs, are buffered in this model by larger farms with larger operational budgets. In addition, as water table

depth changes, aquifer transmissivity can change nonlinearly, causing a ranging impact on wells in a region

(Korus & Hensen,2020). Another possible explanation is that variable is dominated by planned depletion

management schemes (Peck,2006), but this seems unlikely as most of the region is not under planned de-

pletion strategies. Likewise, increased well density did not result in a positive pumping slope. Instead, fewer

wells per area resulted in greater pumping. This may be in part because more wells per area can share the

water demand of a larger area compared to an individual well, so total pumping per well can be reduced.

However, this may be because fewer wells per area may be correlated to more individual landowners and

subsequently different operational decisions. In this case, the connection between pumping and fewer wells

per area may be attributed to smallholder farmers maximizing short-term profits or mitigating seasonal var-

iability risk through increased pumping (Whittemore etal.,2016); these ownership data were not available

specifically for each well and could not be evaluated in this study.

4.4. Irrigation and Climate

For Groups 1 and 2 (annual), the weather category was the last category listed among top variables, while

it ranked as the first category for Groups 3 and 4 (all years combined). Considering that irrigation can be

used as means of climate control (Whittemore etal., 2016), it was intriguing that weather variables were

not more influential in Groups 1 and 2. This could result for two reasons: (1) the total influence is shared

across many weather drivers, so a single driver is ultimately buffered: many weather drivers sum to have

a large influence, but no single driver is largely influential, and (2) weather-related variables in the study

region are sufficient for dryland agricultural production, as irrigation is used to capture incentives other

than baseline production. However, it seems unlikely the weather conditions are sufficient for dryland pro-

duction in western Kansas given regional water demands of the produced commodities and precipitation

patterns (Cotterman etal.,2018). Even with the increase of drought-resistant cultivars (Hu & Xiong,2014),

the shared influence of weather drivers is a likely explanation for the lack of highly influential weather

drivers in Groups 1 and 2. This makes logical sense as seasonal weather extremes are often short lived (U.S.

Drought Monitor,2020), not typically observed in repeated years with the same intensity (U.S. Drought

Monitor,2020), and are not always limiting to crop production as crops can partially rebound within a

season. We also found that the RIs of weather-related variables were not noticeably higher during drought

years within the study range (2011–2014; U.S. Drought Monitor,2020). This may point to the practice of

taking irrigated area out of production during abnormally dry conditions in order to meet higher irrigation

demands of the remaining fields (Deines etal.,2017; Nie etal.,2018).

Furthermore, weather variables in Groups 1 and 2 may be relatively weaker drivers due to the spread of

collective influence across many variables because seasonal extremes are combined into one variable in

Groups 3 and 4. In these groups, annual precipitation became the third most influential driver on irrigation

pumping. So, while annual precipitation may have less influence at the annual scale, its combined influence

at longer time scales (multiyear) on regional pumping is significant. This is further supported in Groups 1

and 2 where annual weather-related variables were more influential than monthly, just as combined years

were more influential than annual. Collectively, these relationships suggest that climate, acting at longer

LAMB ET AL.

10.1029/2020WR028867

13 of 16

Water Resources Research

time scales than weather, is likely to play a significant role on the pumping patterns across the region. Con-

sequently, climate change may reconfigure irrigation drivers in this region.

4.5. Management

This study highlights that irrigation decision making typically follows two questions: (1) how much is avail-

able to irrigate, both in water volume and land area (e.g., irrigated area, saturated thickness, authorized

area, authorized rate, and authorized amount), and (2) how much does it cost (e.g., depth)? Other drivers

or considerations appear to be marginal compared to the answers to these two questions. Observed trends

further indicate irrigation is a default behavior and is intensified by weather conditions and not necessarily

a result of weather conditions. So, while water use is an annual decision, compounding weather-related var-

iables appear to shape behaviors at longer time scales. Collectively, this means water conservation strategies

(even in planned depletion zones) would be better suited to focus on well-specific policies designed within

the framework of these two questions while stabilizing water use incentives over longer time scales.

As each driver category was represented in the top 10 most influential drivers in each model group, policies

must also be well rounded and account for the variations across categories through time, rather than em-

phasizing a suite of specific drivers. Water management is complex in this region and must be approached

as such to avoid unintended water use outcomes. The summed totals of each category for each model group

are reported in Table 3. The Weather category not only contributed the highest collective influence of all

variable categories but also contributed the highest number of variables in the model (26 out of 45). The

Economics category not only contributed the lowest collective influence of all categories but also had the

lowest number of variables in the model (2 out of 45). The contribution of the Land/Agriculture category

was about 2 times higher in Groups 1 and 3 than 2 and 4 due to the inclusion of total irrigated area in Groups

1 and 3.

5. Conclusion

Although many political, economic, and physical factors impact irrigation decision making in western Kan-

sas and elsewhere, characterization of their relative influence on pumping has largely remained unknown.

To quantify the influence of irrigation drivers, we utilized a BRT machine-learning technique on data across

space and time to characterize the impact of 45 drivers relating to five categories (management/policy, hy-

drology, weather, land/agriculture, and economy) on irrigation pumping from approximately 19,000 wells

across western Kansas from 2006 to 2016. BRT is a useful and informative tool for analyzing water use de-

cision making and can effectively capture both numerical and categorical variable relationships across both

space and time. In addition to total driver influence, BRT can also be used to understand the magnitude of

influence as well the conditions in which a user typically decides to stop irrigating. In the future, this tech-

nique can also be used with other models to improve their irrigation prediction (e.g., agent-based irrigation

models; Mewes & Schumann,2019). From this study, we have identified four main conclusions:

(1) Influences on irrigation use in this region are complex, as all five variable categories were represented

in the top 10 most influential variables under all modeling scenarios. In addition, the influence of many

LAMB ET AL.

10.1029/2020WR028867

14 of 16

Relative influence of drivers on irrigation pumping (%)

Management/policy Hydrology Weather Land/agriculture Economics

Group 1 15.0 9.0 42.2 27.7 6.6

Group 2 17.5 11.2 50.5 12.9 8.5

Group 3 11.7 9.2 42.8 30.6 5.7

Group 4 13.3 12.0 54.7 13.1 6.9

Note. Groups 1 and 2 do not sum to 100 as the reported values represent means across 11 models where Groups 3 and

4 are single values reported across one model.

Table 3

Relative Influence of Drivers by Category Across BRT Model Runs

Water Resources Research

drivers, like precipitation and crop value, varied from year to year. As a result of this complexity and

variability, effective policy should focus on comprehensive, multifaceted measures rather than targeting

individual, undesired behaviors.

(2) Well-specific drivers were considerably more influential to irrigation use than regional-specific drivers.

This relationship suggests irrigation applications are a user-by-user decision not largely influenced by

preexisting regulatory frameworks. Instead, water use decisions in this region are more a function of

maximizing crop production across disparate and self-motivated water conservation strategies.

(3) Decisions to irrigate can largely be summarized in response to two questions: (1) how much is available

to irrigate, both in water volume and land area (e.g., irrigated area, saturated thickness, authorized area,

authorized rate, and authorized amount), and (2) how much does it cost (e.g., depth to water, well yield

as a function of saturated thickness)? Other considerations contribute notably less to overall use.

(4) While influential in the short-term, weather-related factors have a greater influence at longer time

scales due to varying impact at shorter time scales (e.g., seasonal compared to annual time scales, an-

nual compared to multiannual time scales). This increased influence at longer time scales suggests

irrigation use in this region may be susceptible to changes in irrigation patterns and behaviors under

changing climate scenarios.

Conflict of Interest

The authors declare no conflicts of interest relevant to this study.

Data Availability Statement

Model data are available at http://www.hydroshare.org/resource/c0b6ebc880f54c92b1c4a633b0b85353

(Smidt,2020).

References

Basso, B., & Ritchie, J. T. (2014). Temperature and drought effects on maize yield. Nature Climate Change, 4(233). 233. https://doi.

org/10.1038/nclimate2139

Butler, J. J., Whittemore, D. O., Wilson, B. B., & Bohling, G. C. (2016). A new approach for assessing the future of aquifers supporting irri-

gated agriculture. Geophysical Research Letters, 43, 2004–2010. https://doi.org/10.1002/2016GL067879

Cotterman, K. A., Kendall, A. D., Basso, B., & Hyndman, D. W. (2018). Groundwater depletion and climate change: Future prospects of

crop production in the Central High Plains Aquifer. Climatic Change, 146, 187–200. https://doi.org/10.1007/s10584-017-1947-7

Deines, J. M., Kendall, A. D., Butler, J. J., & Hyndman, D. W. (2019). Quantifying irrigation adaptation strategies in response to stake-

holder-driven groundwater management in the US High Plains Aquifer. Environmental Research Letters, 14, 044014. https://doi.

org/10.1088/1748-9326/aafe39

Deines, J. M., Kendall, A. D., & Hyndman, D. W. (2017). Annual irrigation dynamics in the U.S. Northern High Plains derived from Landsat

satellite data. Geophysical Research Letters, 44, 9350–9360. https://doi.org/10.1002/2017GL074071

Edwards, E. C. (2016). What lies beneath? Aquifer heterogeneity and the economics of groundwater management. Journal of the Associa-

tion of Environmental and Resource Economists, 3, 453–491. https://doi.org/10.1086/685389

Elith, J., Leathwick, J. R., & Hastie, T. (2008). A working guide to boosted regression trees. Journal of Animal Ecology, 77(4), 802–813.

https://doi.org/10.1111/j.1365-2656.2008.01390.x

Fenichel, E. P., Abbott, J. K., Bayham, J., Boone, W., Haacker, E. M. K., & Pfeiffer, L. (2015). Measuring the value of groundwater and other

forms of natural capital. Proceedings of the National Academy of Sciences, 113(9), 2382–2387. https://doi.org/10.1073/pnas.1513779113

Friedman, J. H. (2001). Greedy function approximation: A gradient boosting machine. Annals of Statistics, 29(5), 1189–1232.

Friedman, J. H., & Meulman, J. J. (2003). Multiple additive regression trees with application in epidemiology. Statistics in Medicine, 22(9),

1365–1381. https://doi.org/10.1002/sim.1501

Haacker, E. M. K., Cotterman, K. A., Smidt, S. J., Kendall, A. D., & Hyndman, D. W. (2019). Effects of management areas, drought, and

commodity prices on groundwater decline patterns across the High Plains Aquifer. Agricultural Water Management, 218, 259–273.

https://doi.org/10.1016/j.agwat.2019.04.002

Haacker, E. M. K., Kendall, A. D., & Hyndman, D. W. (2016). Water level declines in the High Plains Aquifer: Predevelopment to resource

senescence. Groundwater, 54, 231–242. https://doi.org/10.1111/gwat.12350

Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning: Data mining, inference, and prediction (2nd ed.). New

York, NY: Springer.

Hauenstein, S., Wood, S. N., & Dormann, C. F. (2017). Computing AIC for black-box models using generalized degrees of freedom: A

comparison with cross-validation. Communications in Statistics - Simulation and Computation, 47(5), 1382–1396. https://doi.org/10.10

80/03610918.2017.1315728

Hijmans, R. J., Phillips, S., Leathwick, J., & Elith, J. (2017). Package “dismo”. CRAN. http://cran.nexr.com/web/packages/dismo/dismo.pdf

Hrozencik, R. A., Manning, D. T., Suter, J. F., Goemans, C., & Bailey, R. T. (2017). The heterogeneous impacts of groundwater management

policies in the Republican River Basin of Colorado. Water Resources Research, 53, 10757–10778. https://doi.org/10.1002/2017WR020927

Hu, H., & Xiong, L. (2014). Genetic engineering and breeding of drought-resistant crops. Annual Review of Plant Biology, 65, 715–741.

https://doi.org/10.1146/annurev-arplant-050213-040000

LAMB ET AL.

10.1029/2020WR028867

15 of 16

Acknowledgment

None.

Water Resources Research

Hu, Y., Quinn, C. J., Cai, X., & Garfinkle, N. W. (2017). Combining human and machine intelligence to derive agents’ behavioral rules for

groundwater irrigation. Advances in Water Resources, 109, 29–40. https://doi.org/10.1016/j.advwatres.2017.08.009

KDA-DWR & KGS, version 5. (2021). WIMAS, Water Information Management and Analysis System, 2006–2016. Kansas Geological Sur-

vey and the Kansas Department of Agriculture, Division of Water Resources. Retrieved from http://hercules.kgs.ku.edu/geohydro/

wimas/index.cfm

Kisekka, I., Oker, T., Nguyen, G., Aguilar, J., & Rogers, D. (2017). Revisiting precision mobile drip irrigation under limited water. Irrigation

Science, 35, 483–500. https://doi.org/10.1007/s00271-017-0555-7

Korus, J. T., & Hensen, H. J. (2020). Depletion percentage and nonlinear transmissivity as design criteria for groundwater-level observation

networks. Environmental Earth Sciences, 79, 382. https://doi.org/10.1007/s12665-020-09123-y

K.S.A. 82a-1036. (1978). Initiation of proceedings for designation of intensive groundwater use control areas; duties of chief engineer;

findings. Retrieved from https://www.ksrevisor.org/statutes/chapters/ch82a/082a_010_0036.html

K.S.A. 82a-1041. (2012). Local enhanced management areas; establishment procedures; duties of chief engineer; hearing; notice; orders;

review. Retrieved from https://www.ksrevisor.org/statutes/chapters/ch82a/082a_010_0041.html

Lampa, E., Lind, L., Lind, M., & Bornefalk-Hermansson, A. (2014). The identification of complex interactions in epidemiology and toxicol-

ogy: A simulation study of boosted regression trees. Environmental Health, 13(57). https://doi.org/10.1186/1476-069X-13-57

Majumdar, S., Smith, R., Butler, J., Jr., & Lakshmi, V. (2020). Groundwater withdrawal prediction using integrated multitemporal remote

sensing data sets and machine learning. Water Resources Research, 56, e2020WR028059. https://doi.org/10.1029/2020WR028059

MardanDoost, B., Brookfield, A. E., Feddema, J., Sturm, B., Kastens, J., Peterson, D., & Bishop, C. (2019). Estimating irrigation demand

with geospatial and in-situ data: Application to the High Plains Aquifer, Kansas, USA. Agricultural Water Management, 223, 105675.

https://doi.org/10.1016/j.agwat.2019.06.010

Mewes, B., & Schumann, A. (2019). The potential of combined machine learning and agent-based models in water resources management.

Hydrologie und Wasserbewirtschaftung, 63, 332–338. https://doi.org/10.5675/hywa_2019.6_2

Murphy, K. (2012). Machine learning: A probabilistic perspective. Cambridge, MA; London, UK: MIT Press.

Nian, Y., Huang, Q., Kovacs, K. F., Henry, C., & Krutz, J. (2020). Water management practices: Use patterns, related factors, and correla-

tions with irrigated acres. Water Resources Research, 56, e2019WR025360. https://doi.org/10.1029/2019WR025360

Nie, W., Zaitchik, B. F., Rodell, M., Kumar, S. V., Anderson, M. C., & Hain, C. (2018). Groundwater withdrawals under drought: Recon-

ciling GRACE and land surface models in the United States High Plains Aquifer. Water Resources Research, 54, 5282–5299. https://doi.

org/10.1029/2017WR022178

Peck, J. C. (2006). Groundwater management in Kansas: Brief history and assessment. The Kansas Journal of Law & Public Policy, 15(3),

441–466

Pfeiffer, L., & Lin, C.-Y. C. (2014). Does efficient irrigation technology lead to reduced groundwater extraction? Empirical evidence. Journal

of Environmental Economics and Management, 67, 189–208. https://doi.org/10.1016/j.jeem.2013.12.002

Rad, M. R., Brozović, N., Foster, T., & Mieno, T. (2020). Effects of instantaneous groundwater availability on irrigated agriculture and impli-

cations for aquifer management. Resource and Energy Economics, 59, 101129. https://doi.org/10.1016/j.reseneeco.2019.101129

R Development Core Team. (2018). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical

Computing. http://www.R-project.org

Ridgeway, G. (2007). Generalized boosted models: A guide to the gbm package. CRAN. https://cran.r-project.org/web/packages/gbm/vi-

gnettes/gbm.pdf

Rudnick, D. R., Irmak, S., West, C., Chávez, J. L., Kisekka, I., Marek, T. H., etal. (2019). Deficit irrigation management of maize in

the High Plains Aquifer region: A review. Journal of the American Water Resources Association, 55(1), 38–55. https://doi.

org/10.1111/1752-1688.12723

Sanderson, M. R., Bergtold, J. S., Heier Stamm, J. L., Caldas, M. M., & Ramsey, S. M. (2017). Bringing the “social” into sociohydrolo-

gy: Conservation policy support in the Central Great Plains of Kansas, USA. Water Resources Research, 53, 6725–6743. https://doi.

org/10.1002/2017WR020659

Sanderson, M. R., & Frey, R. S. (2014). From desert to breadbasket… to desert again? A metabolic rift in the High Plains Aquifer. Journal of

Political Ecology, 21, 516. https://doi.org/10.2458/v21i1.21149

Schaible, G., & Aillery, M. (2012). Water conservation in irrigated agriculture: Trends and challenges in the face of emerging demands.

USDA-ERS Economic Information Bulletin, 99. https://doi.org/10.2139/ssrn.2186555

Smidt, S. J. (2020). Irrigation drivers boosted regression; Kansas HPA. HydroShare. http://www.hydroshare.org/resource/

c0b6ebc880f54c92b1c4a633b0b85353

Smidt, S. J., Haacker, E. M. K., Kendall, A. D., Deines, J. M., Pei, L., Cotterman, K. A., etal. (2016). Complex water management in modern

agriculture: Trends in the water–energy–food nexus over the High Plains Aquifer. The Science of the Total Environment, 566–567(567),

988–1001. https://doi.org/10.1016/j.scitotenv.2016.05.127

Smidt, S., Kendall, A., & Hyndman, D. (2019). Increased dependence on irrigated crop production across the CONUS (1945–2015). Water,

11, 1458. https://doi.org/10.3390/w11071458

Steward, D. R., Bruss, P. J., Yang, X., Staggenborg, S. A., Welch, S. M., & Apley, M. D. (2013). Tapping unsustainable groundwater stores for

agricultural production in the High Plains Aquifer of Kansas, projections to 2110. Proceedings of the National Academy of Sciences, 110,

E3477–E3486. https://doi.org/10.1073/pnas.1220351110

Sukcharoen, K., Golden, B., Vestal, M., & Guerrero, B. (2020). Do crop price expectations matter? An analysis of groundwater pumping

decisions in Western Kansas. Agricultural Water Management, 231, 106021. https://doi.org/10.1016/j.agwat.2020.106021

USDA-CDL. (2006–2016). National Agricultural Statistics Service Cropland Data Layer. Retrieved from https://nassgeodata.gmu.edu/

CropScape

USDA-NASS. (2006–2016). Crop values annual summary. Retrieved from https://usda.library.cornell.edu/concern/publications/k35694332

U.S. Drought Monitor. (2020). National Drought Mitigation Center University of Nebraska-Lincoln, United States Department of Agricul-

ture, National Oceanic and Atmospheric Administration. Retrieved from https://droughtmonitor.unl.edu/About.aspx

Whittemore, D. O., Butler, J. J., & Wilson, B. B. (2016). Assessing the major drivers of water-level declines: New insights into the future of

heavily stressed aquifers. Hydrological Sciences Journal, 61, 134–145. https://doi.org/10.1080/02626667.2014.959958

Whittemore, D. O., Butler, J. J., & Wilson, B. B. (2018). Status of the High Plains Aquifer in Kansas. Kansas Geological Survey Technical

Series, 22, 1–28.

LAMB ET AL.

10.1029/2020WR028867

16 of 16

Available via license: CC BY 4.0

Content may be subject to copyright.

Exploring key factors driving farm-level seasonal irrigation water usage with Bayesian hierarchical modelling

Article

Full-text available

Feb 2024
AGR WATER MANAGE

Farm-level seasonal irrigation water usage is often highly variable across time and space in an irrigation district. Identifying the driving factors of this variation can help researchers and managers understand the underlying efficiency of water usage, identify the sources of water waste and develop best irrigation practices to facilitate the development of more efficient and sustainable irrigated cropping systems. This study explored driving factors for the seasonal irrigation water usage across an irrigation district in southeastern Australia, using extensive climate, soil, cropping and water use data from 312 farms (a total of 1099 annual crop water use records). It focused on three irrigated crops (corn/maize, cotton and rice) for 2011-2019. Factors considered included climate, soils, irrigation practices and water allocation, from which key driving factors and their effects on the seasonal irrigation water usage were identified. A Bayesian hierarchical model averaging approach was developed to determine key predictors and estimate seasonal irrigation water usage. The Bayesian hierarchical modelling framework allowed seasonal irrigation water usage to be estimated at all crop-farm-year observation points simultaneously while also enabling different key predictors to be selected for different crop types. Results showed that the seasonal irrigation water usage was mainly driven by irrigation practices and soil, but the water use of different crop types was driven by different subsets of irrigation and soil predictors. Modelling was undertaken in natural log space and the model showed reasonable accuracy in estimating the seasonal irrigation water usage (R2 = 0.62). Our model is considered useful in 1) identifying drivers of variation of the seasonal irrigation water usage across time and space, and 2) suggesting potential adjustments to irrigation practices and quantifying their benefits to reduce irrigation water usage.

Improving crop-specific groundwater use estimation in the Mississippi Alluvial Plain: Implications for integrated remote sensing and machine learning approaches in data-scarce regions

Article

Full-text available

Feb 2024

Study region The Mississippi Alluvial Plain (MAP) in the United States (US). Study focus Understanding local-scale groundwater use, a critical component of the water budget, is necessary for implementing sustainable water management practices. The MAP is one of the most productive agricultural regions in the US and extracts more than 11 km3/year for irrigation activities. Consequently, groundwater-level declines in the MAP region pose a substantial challenge to water sustainability, and hence, we need reliable groundwater pumping monitoring solutions to manage this resource appropriately. New hydrological insights for the region We incorporate remote sensing datasets and machine learning to improve an existing lookup table-based model of groundwater use previously developed by the U.S. Geological Survey (USGS). Here, we employ Distributed Random Forests, an ensemble machine learning algorithm to predict annual and monthly groundwater use (2014–2020) throughout this region at 1-km resolution, using pumping data from existing flowmeters in the Mississippi Delta. Our model compares favorably with the existing USGS model, with higher R2 (0.51 compared to 0.42 in the previous model), and lower root mean square error (RMSE) and mean absolute error (MAE)— 0.14 m and 0.09 m, respectively in our model, compared to 0.15 m and 0.1 m in the previous model. Therefore, this work advances our ability to predict groundwater use in regions with scarce or limited in-situ groundwater withdrawal data availability.

Prediction of multi-sectoral longitudinal water withdrawals using hierarchical machine learning models

Article

Full-text available

Aug 2023

Julie Shortridge

Accurate models of water withdrawal are crucial in anticipating the potential water use impacts of drought and climate change. Machine learning methods can simulate the complex, nonlinear relationship between water use and potential explanatory factors, but rarely incorporate the hierarchical nature of water use data. This work presents a novel approach for the prediction of water withdrawals across multiple usage sectors using an ensemble of models fit at different hierarchical levels. Models were fit at the facility and sectoral grouping levels, as well as across facility clusters defined by temporal water use characteristics. Using repeated holdout cross-validation and a dataset of over 300,000 observations of monthly water withdrawal across 1,509 facilities, it demonstrates that ensemble predictions led to statistically significant improvements in predictive performance in five of the eight sectors analyzed. The use of ensemble modeling resulted in lower predictive errors compared to facility models in 65% of facilities analyzed. The relative improvement gained by ensemble modeling was greatest for facilities with fewer observations and higher variance, indicating its potential value in predicting withdrawal for facilities with relatively short data records or data quality issues.

Combining Remotely Sensed Evapotranspiration and an Agroecosystem Model to Estimate Center‐Pivot Irrigation Water Use at High Spatio‐Temporal Resolution

Article

Full-text available

Feb 2023
WATER RESOUR RES

Estimating irrigation water use accurately is critical for sustainable irrigation and studying terrestrial water cycle in irrigated croplands. However, irrigation is not monitored in most places, and current estimations of irrigation water use has coarse spatial and/or temporal resolutions. This study aims to estimate irrigation water use at the daily and field scale through the proposed model‐data fusion framework, which is achieved by particle filtering with two configurations (concurrent, CON, and sequential, SEQ) by assimilating satellite‐based evapotranspiration (ET) observations into an advanced agroecosystem model, ecosys. Two types of experiments using synthetic and real ET observations were conducted to study the efficacy of the proposed framework for estimating irrigation water use at the irrigated fields in eastern and western Nebraska, United States. The experiments using synthetic ET observations indicated that, for two major sources of uncertainties of ET difference between observations and model simulations, which are bias and noise, noise had larger impacts on degrading the estimation performance of irrigation water use than bias. For the experiments using real ET observations, monthly and annual estimations of irrigation water use matched well with farmer irrigation records, with Pearson correlation coefficient (r) around 0.80 and 0.50, respectively. Although detecting daily irrigation records was very challenging, our method still gave a good performance with RMSE, BIAS, and r around 2.90, 0.03, and 0.4 mm/d, respectively. Our proposed model‐data fusion framework for estimating irrigation water use at high spatio‐temporal resolution could contribute to regional water management, sustainable irrigation, and better tracking terrestrial water cycle.

Employing machine learning to quantify long-term climatological and regulatory impacts on groundwater availability in intensively irrigated regions

Article

Oct 2022
J HYDROL

The steady overexploitation of the Ogallala Aquifer underlying the U.S. High Plains Region has put irrigated crop production at risk, particularly in the Southern and Central High Plains. To manage this issue properly, a data-driven modeling framework is developed and tested that is fast to employ and yet provides reliable long-term groundwater level (GWL) forecasts as a function of climatological and anthropogenic factors. The modeling framework uses the random forests (RF) technique in combination with ordinary kriging, and is tested in Finney County in southwest Kansas. The introduction of groundwater withdrawal potential as a new surrogate for pumping intensity enables the RF model to capture decline in groundwater depletion rate as the system progresses towards aquifer depletion and/or as a result of well retirement policies. The RF model is executed from 2017 to 2099 for 20 different downscaled global climate models (GCMs) for the two representative concentration pathways (RCP) scenarios of 4.5 and 8.5. The results show the aquifer will cease to support irrigated agriculture in most of the county by 2060 under status quo management and average climate conditions. Moreover, climate will likely shift the aquifer’s depletion time frame by 15 years or less in most of the study area. The long-term combined impact of well retirement plans and climate conditions on groundwater depletion trends imply well retirement policies do not lead to sustained groundwater savings. This study demonstrates the capacity of machine learning models to serve as a rapid assessment tool, informing policymakers about future groundwater availability in intensively irrigated regions and under different climate and management conditions.

Prediction of multi-sectoral longitudinal water withdrawals using hierarchical machine learning models

Preprint

Full-text available

Sep 2022

Julie Shortridge

Accurate models of water withdrawal are crucial in anticipating the potential water use impacts of drought and climate change. Machine-learning methods are increasingly used in water withdrawal prediction due to their ability to model the complex, nonlinear relationship between water use and potential explanatory factors. However, most machine learning methods do not explicitly address the hierarchical nature of water use data, where multiple observations through time are typically available for multiple facilities, and these facilities can be grouped in a variety of different ways. This work presents a novel approach for prediction of water withdrawals across multiple usage sectors using an ensemble of models fit at different hierarchical levels. A dataset of over 300,000 records of water withdrawal was used to fit models at the facility and sectoral grouping levels, as well as across facility clusters defined by temporal water use characteristics. Using repeated holdout cross validation, it demonstrates that ensemble predictions based on models learned from different data groupings improve withdrawal predictions for 63% of facilities relative to facility-level models. The relative improvement gained by ensemble modeling was greatest for facilities with fewer observations and higher variance, indicating its potential value in predicting withdrawal for facilities with relatively short data records or data quality issues. Inspection of the ensemble weights indicated that cluster level weights were often higher than sector level weights, pointing towards the value of learning from the behavior of facilities with similar water use patterns, even if they are in a different sector.

Estimating irrigation water use from remotely sensed evapotranspiration data: Accuracy and uncertainties across spatial scales

Preprint

Full-text available

Feb 2024

Irrigated agriculture is the dominant use of water globally, but most water withdrawals are not monitored or reported. As a result, it is largely unknown when, where, and how much water is used for irrigation. Here, we evaluated the ability of remotely sensed evapotranspiration (ET) data, integrated with other datasets, to calculate irrigation water withdrawals and applications in an intensively-irrigated portion of the central United States. We compared irrigation calculations based on OpenET data with reported groundwater withdrawals from a flowmeter database and hundreds of farmer irrigation application records at three spatial scales (management area, water right group, and field). We found that ET-based calculations of irrigation exhibited similar temporal patterns as flowmeter data, but tended to be positively biased with substantially more interannual variability than reported pumping rate. Disagreement between ET-based irrigation calculations and reported irrigation was strongly correlated with annual precipitation. Agreement between calculated and observed ET was better for multi-year averages than for individual years across all spatial scales. The selection of an ET model was also an important consideration, as variability in calculated irrigation across an ensemble of satellite-driven ET models was larger than the potential impacts of conservation measures employed in the region. Linking individual wells to specific fields was challenging, but uncertainties in calculating irrigation depths due to the above-mentioned factors exceeded potential uncertainty from irrigation status and field boundary mapping. From these results, we suggest key practices for working with ET-based irrigation data that include accurately accounting for changes in root zone soil moisture for within-season applications, such as irrigation scheduling, and conducting an application-specific evaluation of sources of uncertainty. Remotely-sensed approaches have a high potential for improving scientific research and water resource management through improved spatial and temporal characterization of irrigation, but uncertainties must be resolved to fully realize this potential.

Forming the Future of Agrohydrology

Article

Full-text available

Dec 2023

Agricultural water management is increasingly prioritized throughout the world as producers are tasked with meeting growing crop demand while also managing environmental resources more sustainably. Likewise, agriculture is increasingly modifying the terrestrial water cycle. In response to these dynamics, the informal research discipline of agrohydrology continues to grow, fueled by a new era of rapidly evolving research tools and big data availability. While many researchers are actively invested in agrohydrology as a research topic, there remains a gap in formalizing this valuable discipline. This article aims to: (a) identify key research themes in agrohydrology, (b) conceptualize future research topics within each theme, and (c) estimate a timeframe before topics become pressing (i.e., before a topic becomes a limiting factor in advancing water management in an agricultural context). This commentary is meant to guide the trajectory of an evolving discipline of agrohydrology, the practice of agricultural water management at multiple nested scales, and the conversation of the invested public.

Anthropogenic climate change drives non-stationary phytoplankton internal variability

Article

Full-text available

Nov 2023
BG

Earth system models suggest that anthropogenic climate change will influence marine phytoplankton over the coming century with light-limited regions becoming more productive and nutrient-limited regions less productive. Anthropogenic climate change can influence not only the mean state but also the internal variability around the mean state, yet little is known about how internal variability in marine phytoplankton will change with time. Here, we quantify the influence of anthropogenic climate change on internal variability in marine phytoplankton biomass from 1920 to 2100 using the Community Earth System Model 1 Large Ensemble (CESM1-LE). We find a significant decrease in the internal variability of global phytoplankton carbon biomass under a high emission (RCP8.5) scenario and heterogeneous regional trends. Decreasing internal variability in biomass is most apparent in the subpolar North Atlantic and North Pacific. In these high-latitude regions, bottom-up controls (e.g., nutrient supply, temperature) influence changes in biomass internal variability. In the biogeochemically critical regions of the Southern Ocean and the equatorial Pacific, bottom-up controls (e.g., light, nutrients) and top-down controls (e.g., grazer biomass) affect changes in phytoplankton carbon internal variability, respectively. Our results suggest that climate mitigation and adaptation efforts that account for marine phytoplankton changes (e.g., fisheries, marine carbon cycling) should also consider changes in phytoplankton internal variability driven by anthropogenic warming, particularly on regional scales.

Framework for IOT Based Real-Time Monitoring System of Rainfall Water Level for Flood Prediction Using LSTM Network

Conference Paper

Jun 2023

Groundwater Withdrawal Prediction Using Integrated Multitemporal Remote Sensing Data Sets and Machine Learning

Article

Full-text available

Nov 2020
WATER RESOUR RES

Key Points Groundwater withdrawals are not actively monitored in most places of the world at a scale necessary to implement sustainable solutions Various multitemporal remote sensing data are integrated into a machine learning framework to effectively predict groundwater withdrawals The results over the High Plains Aquifer, Kansas, USA, show that this approach is applicable to similar regions having sparse in situ data

Depletion percentage and nonlinear transmissivity as design criteria for groundwater-level observation networks

Article

Full-text available

Aug 2020
ENVIRON EARTH SCI

Intensive use of groundwater for irrigation has depleted available supplies in many areas of the world. Depletion reduces well yields, which negatively impacts agricultural productivity and profitability. The impact of groundwater depletion on wells depends on the relative changes in saturated thickness (depletion percentage) and the rates of change in transmissivity during depletion (nonlinear transmissivity). In heterogeneous aquifers of variable saturated thickness, the impacts on wells vary in time and space. An observation well network designed to account for these spatiotemporal variations is a key part of achieving sustainable development of groundwater. This paper presents a framework for incorporating estimates of depletion percentage and nonlinear transmissivity into GIS-based multicriteria analysis for observation-well network design. The workflow includes (1) selection of criteria using stakeholder input, (2) assessment of data quality, (3) interpolation of hydrogeologic surfaces with prediction standard errors, (4) estimation of hydraulic conductivity values, (5) calculation of saturated thickness and transmissivity using 3 and 4 above, (6) mapping of well densities, and (7) normalization of raster values and calculation of a priority index map. The framework is widely applicable, but flexible to adapt to local needs. By incorporating depletion percentage and nonlinear transmissivity, the methodology can help management agencies prioritize monitoring locations in areas where future depletions could have the greatest impacts on well yields.

Water Management Practices: Use Patterns, Related Factors, and Correlations With Irrigated Acres

Article

Full-text available

Mar 2020
WATER RESOUR RES

This study describes the water management practice (WMP) use patterns by producers in Arkansas, USA, identifies the factors related to producers' choices among WMP groups, and examines the correlations between WMP uses and irrigated acreage. Using data from the 2016 Arkansas Irrigation Survey, WMPs are divided into four groups: field management, water flow control, water recovery/storage, and advanced irrigation scheduling practices. We find that about 77% of producers in the study area use two or more groups of WMPs to manage multiple aspects of irrigation, but that the factors that are associated with choices of WMPs vary by WMP group. Regression results show that the use of water flow meters, providing producers with education on the looming groundwater shortage problems and awareness of financial assistance available for conservation efforts, such as the state tax credits program, and use of WMPs by family members, friends, and neighbors are associated with increased use of WMPs. However, we find that producers that are older, have smaller farms, and rely more on groundwater are less likely to use some of the WMPs. Regression results also show that using water flow meters positively correlates with total irrigated acres, irrigated acres in rice, and irrigated acres in soybeans, and that for soybeans using more groups of WMPs is associated with a reduced extent of irrigated acreage. This study contributes to the small economics literature on WMPs and provides a more comprehensive picture of how producers use different WMPs to manage irrigation.

Das Potential der Kombination von maschinellem Lernen und Agenten-basierter Methoden in der Wasserbewirtschaftung

Article

Full-text available

Nov 2019

Agenten-basierte Modelle ermöglichen es, komplexe dynamische Systeme nachzubilden und werden häufig dort eingesetzt, wo das Verhalten und das Zusammenspiel einzelner Akteure von Wichtigkeit sind. Besonders in gekoppelten, sozio-hydrologischen Modellen ist dieser Modelltypus verbreitet. Agenten-basierte Modelle leiden jedoch unter der starken Abstraktion und der mangelnden Dynamik in der Adaption an neue Um-gebungen, die nicht explizit im Regelwerk des Verhaltens der Agenten definiert sind. Um diesen Makel zu begrenzen wird hier ein wasserwirtschaftliches Agenten-basiertes Modell der Bewässerungsplanung um eine Lernfunktion erweitert, die mit Hilfe des maschi-nellen Lernens hydrologische Situationen erkennt, einordnet und die daraus resultierenden Reaktionen adaptiert. Damit kann ohne weitere Veränderungen der Randbedingungen und Modellkomponenten eine Strategieentwicklung der Agenten innerhalb dieser syn-thetischen Modellstudie beobachtet werden und der Ertrag aus der Bewässerungslandwirtschaft durch eine optimale Ausnutzung der verfügbaren Wasser Ressourcen unter Minimierung von Krankheitsrisiken verdoppelt werden.

Increased Dependence on Irrigated Crop Production Across the CONUS (1945–2015)

Article

Full-text available

Jul 2019

Efficient irrigation technologies, which seem to promise reduced production costs and water consumption in heavily irrigated areas, may instead be driving increased irrigation use in areas that were not traditionally irrigated. As a result, the total dependence on supplemental irrigation for crop production and revenue is steadily increasing across the contiguous United States. Quantifying this dependence has been hampered by a lack of comprehensive irrigated and dryland yield and harvested area data outside of major irrigated regions, despite the importance and long history of irrigation applications in agriculture. This study used a linear regression model to disaggregate lumped agricultural statistics and estimate average irrigated and dryland yields at the state level for five major row crops: corn, cotton, hay, soybeans, and wheat. For 1945–2015, we quantified crop production, irrigation enhancement revenue, and irrigated and dryland areas in both intensively irrigated and marginally-dependent states, where both irrigated and dryland farming practices are implemented. In 2015, we found that irrigating just the five commodity crops enhanced revenue by ~$7 billion across all states with irrigation. In states with both irrigated and dryland practices, 23% of total produced area relied on irrigation, resulting in 7% more production than from dryland practices. There was a clear response to increasing biofuel demand, with the addition of more than 3.6 million ha of irrigated corn and soybeans in the last decade in marginally-dependent states. Since 1945, we estimate that yield enhancement due to irrigation has resulted in over $465 billion in increased revenue across the contiguous United States (CONUS). Example applications of this dataset include estimating historical water use, evaluating the effects of environmental policies, developing new resource management strategies, economic risk analyses, and developing tools for farmer decision making.

Do crop price expectations matter? An analysis of groundwater pumping decisions in Western Kansas

Article

Mar 2020
AGR WATER MANAGE

The Ogallala Aquifer is the main water resource for irrigated agricultural production in much of Western Kansas. It is hypothesized that as crop price expectations increase, producers will apply more water to increase yields in order to maximize profit. Using field-level panel data on groundwater pumped for irrigation in Western Kansas, this paper examines whether irrigated producers’ groundwater pumping decisions are consistent with the profit maximization framework by empirically testing if crop price expectations have a positive impact on the quantity of groundwater pumped. In general, the empirical results indicate that crop price expectations have no statistically significant impact on the quantity of groundwater pumped per acre. This suggests that groundwater pumping decisions are not consistent with the profit maximization framework and that irrigated producers consider groundwater as a fixed input possibly due to limited availability of groundwater in the area. Our econometric analysis also suggests that only a small portion of rainfall is effective.

Effects of instantaneous groundwater availability on irrigated agriculture and implications for aquifer management

Article

Nov 2019
RESOUR ENERGY ECON

Groundwater is an important input for agricultural production in many parts of the world. Aquifer depletion has been shown to affect the rate that groundwater can be extracted from an aquifer. In this paper, we develop an analytical framework that accounts explicitly for the effects of limited instantaneous groundwater extraction rate (well capacity) on a producer's irrigation decisions. We show that limited well capacities can affect the producer's groundwater use and profit. We draw three important insights from these findings. First, we demonstrate that the price elasticity of demand for groundwater is higher for lower well capacities. Second, farmers' irrigation decisions are non-monotonic with respect to well capacity and climate conditions. Under a drier climate, producers with greater well capacities increase their groundwater use, and producers with lower well capacities reduce their water use. Third, through numerical analysis, we show that considering spatial heterogeneity in well capacities is important for estimating the cost-effectiveness and distributional impacts of groundwater management policies. Our results shed new light on the importance of extraction capacity for groundwater management policies and the potential impacts of climate change on agricultural production.

Estimating irrigation demand with geospatial and in-situ data: Application to the high plains aquifer, Kansas, USA

Article

Jul 2019

Unsustainable groundwater mining threatens the economic stability of several regions around the world.Developing sustainable water-use policies and integrated water-management plans requires reliably predictingwater demand under variable weather and land-use conditions. Here, a water-budget model capable of esti-mating spatial and temporal variations in daily irrigation demand under variable weather and land-use scenariosis modified to consider irrigation management and groundwater pumping limitations. This model uses a com-bination of geospatial and in-situ measured data, including land-use and land-cover maps, crop-specific eva-potranspiration data, and weather data. A coupled sub-model limits pumping rates based on pump capacity andspatial and temporal variations in the saturated thickness of the aquifer. The ability of this water-budget modelto accurately estimate irrigation demand is demonstrated using the High Plains aquifer region of Kansas, USA, anaquifer that has undergone, and continues to undergo, significant depletion due to decades of irrigation. Themodel was calibrated with reported and measured water use for more than 1200fields, with ratios of simulatedannual irrigation demand to actual reported water use of 1.10, 0.78, 0.75, 0.93, and 1.40 for corn, sorghum,soybean, winter wheat, and alfalfa, respectively. Further simulations at a variety of scales, fromfield to multi-county levels, demonstrate that the developed water-budget model is capable of simulating the spatial andtemporal variability of irrigation demand.

Estimating irrigation demand with geospatial and in-situ data: Application to the high plains aquifer, Kansas, USA

Article

Jun 2019
AGR WATER MANAGE

Unsustainable groundwater mining threatens the economic stability of several regions around the world. Developing sustainable water-use policies and integrated water-management plans requires reliably predicting water demand under variable weather and land-use conditions. Here, a water-budget model capable of estimating spatial and temporal variations in daily irrigation demand under variable weather and land-use scenarios is modified to consider irrigation management and groundwater pumping limitations. This model uses a combination of geospatial and in-situ measured data, including land-use and land-cover maps, crop-specific eva-potranspiration data, and weather data. A coupled sub-model limits pumping rates based on pump capacity and spatial and temporal variations in the saturated thickness of the aquifer. The ability of this water-budget model to accurately estimate irrigation demand is demonstrated using the High Plains aquifer region of Kansas, USA, an aquifer that has undergone, and continues to undergo, significant depletion due to decades of irrigation. The model was calibrated with reported and measured water use for more than 1200 fields, with ratios of simulated annual irrigation demand to actual reported water use of 1.10, 0.78, 0.75, 0.93, and 1.40 for corn, sorghum, soybean, winter wheat, and alfalfa, respectively. Further simulations at a variety of scales, from field to multi-county levels, demonstrate that the developed water-budget model is capable of simulating the spatial and temporal variability of irrigation demand.

Effects of management areas, drought, and commodity prices on groundwater decline patterns across the High Plains Aquifer

Article

Jun 2019
AGR WATER MANAGE

We use an 82-year record of water table data from the High Plains Aquifer to introduce a new application of segmented regression to hydrogeology, and evaluate the effects of droughts, crop prices, and local groundwater management on groundwater level trajectories. Across the High Plains, we find discernable regional cycles of faster and slower water table declines. A parsimonious Classification And Regression Tree (CART) analysis details correlations between select explanatory variables and changes in water table trajectories, quantified as changes in slope of well hydrographs. Drying relative to prior-year conditions is associated with negative changes in slope; in the absence of drying conditions, steep declines in commodity price are associated with positive changes in hydrograph slopes. Establishment of a groundwater management area is not a strong predictor for change in water table trajectories, but more wells tend to have negative changes in around the time of management areas are formation, suggesting that drought conditions are associated with both negative deflections in water table trajectory and enactment of management areas. Segmented regression is a promising tool for groundwater managers to evaluate change thresholds and the effectiveness of management strategies on groundwater storage and decline, using readily available water table data.

Influence of Irrigation Drivers Using Boosted Regression Trees: Kansas High Plains

Abstract and Figures

Recommended publications

Applying machine learning on sensor data for irrigation recommendations: Revealing the agronomist’s...

Effects of management areas, drought, and commodity prices on groundwater decline patterns across th...

The High Plains Aquifer, USA—A Case Study

Sustainability of aquifers supporting irrigated agriculture: a case study of the High Plains aquifer...

Sustainable irrigation through local collaborative governance: Evidence for a structural fix in Kans...