ArticlePDF Available

A database of in situ water temperatures for large inland lakes across the coterminous United States

Authors:

Abstract and Figures

Water temperature dynamics in large inland lakes are interrelated with internal lake physics, ecosystem function, and adjacent land surface meteorology and climatology. Models for simulating and forecasting lake temperatures often rely on remote sensing and in situ data for validation. In situ monitoring platforms have the benefit of providing relatively precise measurements at multiple lake depths, but are often sparser (temporally and spatially) than remote sensing data. Here, we address the challenge of synthesizing in situ lake temperature data by creating a standardized database of near-surface and subsurface measurements from 134 sites across 29 large North American lakes, with the primary goal of supporting an ongoing lake model validation study. We utilize data sources ranging from federal agency repositories to local monitoring group samples, with a collective historical record spanning January 1, 2000 through December 31, 2022. Our database has direct utility for validating simulations and forecasts from operational numerical weather prediction systems in large lakes whose extensive surface area may significantly influence nearby weather and climate patterns.
This content is subject to copyright. Terms and conditions apply.
1
SCIENTIFIC DATA | (2024) 11:282 | https://doi.org/10.1038/s41597-024-03103-8
www.nature.com/scientificdata
A database of in situ water
temperatures for large inland
lakes across the coterminous
United States
Troy Sorensen
 ✉ speyKelleyKessler



and forecasting lake temperatures often rely on remote sensing and in situIn situ


the challenge of synthesizing in situ lake temperature data by creating a standardized database of




simulations and forecasts from operational numerical weather prediction systems in large lakes whose

Background & Summary
Accurately representing spatial and temporal variability of lake surface water temperatures in numerical
weather prediction (NWP) systems has been shown (particularly for Earth’s largest lakes) to improve short-
and long-term forecasts of regional precipitation, air temperature, and surface wind velocity14. us, realistic
representation of lake conditions is crucial for the development of the next generation of climate and weather
forecast models5,6. e database we introduce here was developed to support this advancement by providing in
situ validation data for a broader project sponsored by the National Oceanic and Atmospheric Administration
(NOAA) through its Joint Technology Transfer Initiative (JTTI). e parallel NOAA JTTI project is designed to
optimize representation of lake surfaces in the NOAA Unied Forecast System (UFS) by exploring the sensitiv-
ity of UFS lake models to alternative lake bathymetric data sets7. Specically, the NOAA JTTI project evaluates
potential impacts of a new global lakes bathymetric dataset (GLOBathy) on simulations of lake surface temper-
ature, and temperature depth proles, in UFS 1-D lake models8. It is informative to note that these models are
currently operationalized within NOAAs High-Resolution Rapid Refresh model, or HRRR9, which simulates
lake physics using a 1-D lake model included in the Community Land Model v4.510 with a 3-km horizontal
resolution, and 10 vertical (depth) layers.
Following an iterative in situ monitoring platform selection protocol (details below) we obtained near-surface
and subsurface lake temperature data from 134 sites across 29 lakes which (through the parallel NOAA study
referenced above) can be used to validate HRRR lake model simulations. We solicited and stored temperature
data at the highest temporal resolution available, which varies from site to site; at some sites, data is available at
sub-hourly resolution and, at others, at relatively coarse (e.g. semi-annual or monthly) resolution. Of the 134
1School for Environment and Sustainability, University of Michigan, Ann Arbor, 48104, USA. 2Department of
Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, 48109, USA. 3Coastal Survey
Development Laboratory, National Ocean Service, NOAA, Silver Spring, 20910, USA. 4Great Lakes Environmental
Research Laboratory, NOAA, Ann Arbor, 48108, USA. e-mail: trsoren@umich.edu


Content courtesy of Springer Nature, terms of use apply. Rights reserved
2
SCIENTIFIC DATA | (2024) 11:282 | https://doi.org/10.1038/s41597-024-03103-8
www.nature.com/scientificdata
www.nature.com/scientificdata/
sites in our database, 84 include temperature measurements across multiple depths, allowing for comprehensive
validation of HRRR 1-D lake column model simulations.
While the lakes presented here represent a subset of all lakes in the HRRR model, we believe that, because
they are among the model domains largest lakes (by surface area; see methods below), they might be expected
to have the most profound impacts on surrounding terrestrial weather and climate dynamics. We note that
the Laurentian Great Lakes are not included in the NOAA UFS study because they are represented through a
separate 3-D modeling framework11,12 operated through NOAAs National Ocean Service (NOS). In situ data
for validating Laurentian Great Lakes 3-D models is collected and utilized separately and specically for the
NOS modeling initiative, and is therefore not addressed here. Ultimately, the goal of our database is to provide
an organized, easily-accessible aggregation of in situ lake temperature prole data that can be used not only to
support validation for the NOAA UFS 1-D lake model experiments, but to serve as a resource for related lake
model validation and empirical data analysis studies as well (see Figs. 1 and 2).

We collected lake temperature data from a variety of sources, each requiring a dierent approach, ranging from
scraping online federal agency repositories to collaborating with and soliciting data from local water qual-
ity monitoring organizations. Federal agency repositories from which we collected data include the NOAA
National Data Buoy Center (NDBC)13, the United States Geological Survey (USGS) National Water Information
System (NWIS)14, and the Water Quality Portal (WQP) - a cooperative service maintained and sponsored by
USGS and the United States Environmental Protection Agency15. e temperature data we collected from local
organizations is unlikely to be included in the aforementioned federal repositories. It is informative to note that
any data we have collected for a given lake in our study may be aggregated across one or more of these sources
(for a summary, see Table1). It is also informative to note that data quality, spatiotemporal resolution, and tem-
poral continuity can vary greatly from source to source (see Fig.3); some sources provide quality-controlled data
throughout a lake’s depth prole at high temporal resolution, while others provide relatively sparse temperature
data collected by local ad hoc or citizen-based groups with little documentation on quality control methods.
Feedback from database users has highlighted that direct examination of the data signicantly aids in under-
standing its characteristics, especially for individual lakes or sites. Accordingly, we’ve included a scriptwithinthe
database repository to facilitate the creation of data plots for each site.
One of the most important design features of the parallel NOAA lake model simulation study made possible
by our database was a focus on evaluating historical lake temperature simulations in 29 of the largest (by surface
area) lakes across the continental United States (CONUS). A second important design feature of the NOAA lake
model study was a focus on assessing lake model simulation results from just one calendar year (following a
Fig. 1 Map of the coterminous United States indicating the location of all 29 lakes in our study (blue polygons;
magnied slightly to improve clarity). For 9 representative lakes, we also include an overlay of in situ monitoring
sites (each represented by a blue ‘x’), lake boundaries, and the corresponding 3 km × 3 km HRRR lake model
pixels (yellow grids). e 9 representative lakes are, from top le advancing clockwise, Lake Washington,
Flathead Lake, Devils Lake, Red Lake, Oneida Lake, Lake Okeechobee, Lake Tawakoni, Lake Mead, and Mono
Lake. See Fig.5 for corresponding details for all 29 lakes in our study.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
3
SCIENTIFIC DATA | (2024) 11:282 | https://doi.org/10.1038/s41597-024-03103-8
www.nature.com/scientificdata
www.nature.com/scientificdata/
model spin-up period) given the relatively high computational expense of running the HRRR model at CONUS
scale. erefore, the collective criteria for including a monitoring platform in our database is that it comes from
one of the largest lakes across CONUS for which there is at least one in situ temperature observation within a
recent calendar year. e selection of a common recent calendar year, in turn, is intended to maximize the total
number of temperature data points across the selected lakes and monitoring platforms.
e results of our manual and iterative selection process identied 2019 as the calendar year that maxi-
mizes the total number of in situ observations across the largest CONUS lakes. Based on our analysis of federal
databases and conversations with individual (i.e. local) database managers, it is our understanding that 2019
was (for the purposes of our study) an “optimal” year for aggregating lake temperature data because many in
situ monitoring platforms were discontinued in 2020 at the onset of the pandemic. As a result of this selection
criteria, our database includes data from monitoring platforms for which there was at least one measurement
(and, typically, many more measurements) in 2019. However, in order to support any future related empirical
and model validation studies we also included any and all data available over a historical period from January
1, 2000 through December 31, 2022, although data availability may vary greatly for years other than 2019 (see
Table2).
We used the R Statistical Soware (v4.2.116) to extract and store variables from each monitoring platform
including sample collection date and time (UTC), coordinate location, depth (m), and water temperature (C).
Details on our nal data formatting are included in the Data Records section. Details on how we extracted data
from each source are included in the subsections below, with related metadata summarized in Tables1, 2.
 e NDBC is located within NOAAs National Weather Service, and
is responsible for collecting, managing, and distributing meteorological and oceanographic data from a network
of buoys and coastal stations located in oceans, coastal waters, and large lakes (including the Great Lakes, which
are not included in this database). All data is quality controlled and publicly available here: (https://www.ndbc.
noaa.gov/).
 e USGS NWIS is a comprehensive database containing
a wide range of water-related data including streamow, groundwater levels, and water quality data including lake
temperatures. Lakes that met our study’s criteria were found manually using the online NWIS mapper (https://
maps.waterdata.usgs.gov/mapper/), and their data was accessed using the dataRetrieval17 package in R by spec-
ifying site identication numbers, desired date ranges, and the parameter code for water temperature. Note that
the WQP (described in detail below) includes data from the NWIS, but at a much lower temporal resolution.
 e WQP is a centralized repository maintained by the United States EPA
and the USGS, integrating data from multiple agencies and organizations18. Data from sites in the NWIS are
included in the WQP, but typically at a lower resolution. us data was extracted directly from the NWIS wher-
ever there was overlap with the WQP, and the WQP was instead used to extract data aggregated from other
sources.
Depth (m)
0
2
4
6
8
10
12
14
Temperature (°C)
Fig. 2 Representative example (from Oneida Lake, NY) of the relationship between in situ station locations
(top subgure; each represented by a blue ‘x’ and labeled with its site code), HRRR pixels (yellow squares), and
corresponding temperature data for each station in 2019.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
4
SCIENTIFIC DATA | (2024) 11:282 | https://doi.org/10.1038/s41597-024-03103-8
www.nature.com/scientificdata
www.nature.com/scientificdata/
WQP data can also be accessed in R via the dataRetrieval package, similar to procedures used with the
NWIS. Due to the immense amount of data provided by the WQP, users can search for sites that meet certain
criteria before requesting a specic sites data. We used this functionality to search for all sites of type “Lake”,
“Reservoir”, or “Impoundment” containing any water temperature data in the year 2019. Once we had a large
list of sites meeting this criteria, we used ArcGIS to lter out any sites not located on a lake over 30 km2. is
le us with a much shorter list of site names which we could then use to query the data of each site individually.
e format for reporting depths of observations varies across dierent sites within the WQP. Some sites were
not given a depth value and were instead reported as “near-surface”; we recorded these as a depth of 0.1 m. Other
sites report depth values to a very high precision (<0.1 m). We rounded these depth values to the nearest 1 m for
sites containing temperature values throughout a prole of 5 m or deeper, and to the nearest 0.5 m for sites with
a shallower prole of less than 5 m. In either case, values of 0 m were then shied to 0.1 m as the sensors included
in the WQP only record bulk temperature.
 In addition to the well-known and established described in the sections above, we gathered
data from a multitude of other websites and local sources, including the following:
• Flathead Lake - from the Flathead Lake Bio Station (FLBS) site (https://bs.umt.edu/apps/weather/) which
includes surface water temperatures at four sites. All sites have downloadable data dating back to 2011, and we
omitted periods of data reporting extremely egregious lake temperature values ( < 50 C).
• Lake McConaughy - provided directly by Nate Nielsen of the Central Nebraska Public Power and Irrigation
District (CNPPID). While data at Lake McConaughy is available in the WQP, we were able to obtain higher
spatial and temporal resolution data from the les shared with us directly by the CNPPID.
• Lake Mendota - provided online by the Space Science & Engineering Center at the University of Wiscon-
sin-Madison (https://metobs.ssec.wisc.edu/data_download/). We extracted the data at an hourly resolution,
though other resolutions are available for download as well.
• Mono Lake - provided directly by Dr. Robert Jellison of the University of California (UC) Santa Barbara and
the Mono Lake Committee (https://www.monolake.org/).
Lake Name Latitude Longitude Size (km2)
Data provider
NDBC NWIS WQP Other
Champlain 44.632771 73.301921 979.6 X X X
Clear 39.048421 122.808826 136.8 X
Devils 48.049741 98.978507 311.7 X
Flathead 47.893863 114.130567 463.1 X
Great Salt 41.169322 112.539431 3962.2 X X
Houston 29.975492 95.141341 22.5 X
Lewisville 33.121048 96.979931 51.4 X
Malheur 43.331553 118.792309 146.1 X
Marion 33.484218 80.315048 228.7 X
McConaughy 41.248835 101.792336 66.3 X X
Mead 36.090574 114.748057 90.2 X
Mendota 43.108566 89.419588 33.8 X
Mono 38.011535 119.015838 174.1 X
Okeechobee 26.949621 80.802608 1317.4 X
Oneida 43.206771 75.907717 193.7 X
Pontchartrain 30.18727 90.119927 1655.3 X X
Red 48.035195 94.916255 1123.5 X
Sakakawea 47.752826 102.184397 942.1 X
Sebago 43.861237 70.551793 105.1 X
Seneca 42.66719 76.920011 141.2 X
Tah oe 39.100468 120.034368 483.8 X
Tawakoni 32.881902 95.987976 111.2 X
Upper Klamath 42.428961 121.934868 271.4 X X
Utah 40.220629 111.824294 321.3 X
Walker 38.699737 118.71699 119.2 X
Washington 47.625051 122.25055 61.8 X
Winnebago 44.021601 88.409584 512.4 X
Winnipesaukee 43.609695 71.341321 116.6 X
Tab le 1. Metadata associated with each lake included in the database. “Latitude” and “Longitude” indicate the
center point of the lake and “Size” indicates the surface area as provided by the HydroLakes dataset27. e “Data
provider” column indicates which data source or sources provided data for each lake; see Table2 for select
summary statistics associated with the data for any given lake and data source.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
5
SCIENTIFIC DATA | (2024) 11:282 | https://doi.org/10.1038/s41597-024-03103-8
www.nature.com/scientificdata
www.nature.com/scientificdata/
• Oneida Lake - provided by Dr. Lars Rudstam at the Cornell University Biological Field Station19. While
Oneida Lake temperature data is also available through the Knowledge Network for Biodiversity (https://knb.
ecoinformatics.org/), the high resolution temperature prole data provided by Dr. Rudstam is not available
online.
• Pyramid Lake - provided by Jennessy Toribio, a sheries biologist at Pyramid Lake Fisheries (PLF). PLF data
is sampled monthly, and while there is no exact time specied in the raw data, the PLF stated that readings are
typically taken in the late morning. We therefore assigned a timestamp to each recording of 10am local time.
• Sebago Lake - publicly available through the Portland Water District (https://www.pwd.org/
sebago-lake-monitoring-buoy).
• Lake Tahoe - provided from two contacts; Dr. Gerardo Rivera at the National Aeronautics and Space Admin-
istration (NASA) Jet Propulsion Laboratory (JPL) provided data from three sites at an extremely high tem-
poral resolution, but only for shallow proles in September 2019. Dr. Shohei Watanabe of the UC Davis
Tahoe Environmental Research Center (TERC) provided monthly temperature proles dating back to 2010
throughout the entire depth prole of Lake Tahoe, but only at a single site. e original TERC data includes
temperature measurements at every meter to a depth of 480 m. Our nal database includes these temperature
measurements at meter intervals to a depth of 50 m, and at 10-meter intervals from 50 m to 480 m (the origi-
nal higher resolution data is available in the Raw_data section of our database).
• Lake Washington - provided online by King County, WA (https://green2.kingcounty.gov/lake-buoy/Data.
aspx). All values from March 2009 were omitted due to noticably incongruous data.
• Lake Winnipesaukee - provided online by the New Hampshire Department of Environmental Services
(https://www4.des.state.nh.us/rivertraksearch/search.html).

Our database is deposited in “Deep Blue Data, the University of Michigans institutional data repository20. It
can be accessed here: https://doi.org/10.7302/7gnd-mj10. e database contains sub-directories for each lake.
Within the sub-directory for each lake is the R script used to extract all data for that lake, a metadata table with
the latitude/longitude location and depth of each temperature sensor, and a directory containing the tempera-
ture data for each sensor in csv format with two columns for the date/time (in UTC) and the temperature (C).
Each sensor’s data lename is formatted as ABCXX_YY.csv where ABC is a three letter code for the lake, XX is
a unique numerical identier for the latitude/longitude location of the site, and YY is a numerical identier for
the depth of that sensor. A more detailed explanation of the directory structure is included in a README le
within the database.
Technical Validation
e data we collected from federal agency repositories (e.g. NDBC, NWIS, and WQP) and some local sources
were subjected to repository-specic quality control methods, each of which is described in detail in the respec-
tive repository’s literature and (if available) web-site. Regardless, we visually inspected all data at all depths
and removed data points or time periods periods with egregiously erroneous values for a very small number
of sites (as described in the Methods section). Additionally, to ensure overall data reliability, we validated our
in situ temperatures against remote sensed surface temperature data from the Moderate Resolution Imaging
Fig. 3 Representative time series (showing only 2018 through 2022 for clarity) of temperature data from ve
surface (or near-surface) sensors in our database. is time series underscores dierences in temporal resolution
and continuity across dierent sensors and lakes; a more comprehensive summary of temporal and spatial (i.e.
depth) resolution and continuity is included in Table2, and the database repository includes a script to visualize
the data of each individual sensor. Year labels on x-axis are positioned at the beginning of a calendar year.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
6
SCIENTIFIC DATA | (2024) 11:282 | https://doi.org/10.1038/s41597-024-03103-8
www.nature.com/scientificdata
www.nature.com/scientificdata/
Spectroradiometer (MODIS), which has a spatial resolution of approximately 1 km21. We used MODIS Terra
and Aqua land surface temperature products, which collectively provide up to four surface water temperature
observations per day, per 1-km grid cell, to validate our in situ observations.
We recognize there are challenges to comparing gridded remote sensing temperature data to in situ data,
especially along lake shorelines where there is a potential for land contamination22. To address this challenge,
we ltered (i.e. removed, prior to validation) in situ sites within a 1-km buer of any lake shoreline, leaving 79
of our total 134 in situ sites for validation. en, because the original intent of our database was to validate (in
the parallel NOAA study) HRRR lake models for only the calendar year 2019, we interrogated surface tem-
perature data for MODIS pixels corresponding to each selected in situ site location in calendar year 2019 via
NASA’s AppEEARS data portal (https://appeears.earthdatacloud.nasa.gov/). Specically, the products obtained
via AppEEARS were MOD21A1D.06123, MOD21A1N.06124, MYD21A1D.06125, and MYD21A1N.06126. It is
informative to note that the MODIS data obtained through AppEEARS has quality thresholds of 0 (poor), 1
Lake Name Sites Sensors Observations Depth Coverage Historical Coverage
NDBC data summary
Champlain 3 3 45981 1 m 2019–2020
Mead 2 2 285564 0.5 m 2016–2021
Pontchartrain 1 1 1155458 0.6 m 2008–2022
NWIS data summary
Champlain 1 1 15327 1.5 m 2019–2020
Great Salt 1 1 80214 0.1 m 2018–2022
Houston 1 4 64564 0.9 m–4.3 m 2014–2022
Malheur 1 1 18039 0.5 m 2018–2020
Seneca 1 3 39046 1.8 m–29.6 m 2018–2020
Upper Klamath 1 1 68402 1 m 2007–2022
WQP data summary
Champlain 10 407 4954 1 m–100 m 2000–2021
Clear 6 24 246 0.15 m–10 m 2014–2022
Devils 5 61 228 0.1 m–17 m 2000–2022
Great Salt 4 33 623 0.1 m–10 m 2005–2022
Lewisville 3 46 65 0.1 m–19 m 2006–2022
Marion 7 7 509 0.1 m 2014–2022
McConaughy 1 34 17 0.1 m–33 m 2017–2021
Okeechobee 15 15 1155 0.5 m 2016–2022
Pontchartrain 3 3 305 0.1 m 2008–2021
Red 10 85 1548 0.3 m–9 m 2000–2022
Sakakawea 6 201 431 0.1 m–54 m 2003–2020
Tawakoni 4 56 164 0.3 m–7.9 m 2011–2022
Upper Klamath 6 6 1468 0.1 m 2005–2022
Utah 6 47 457 0.1 m–4 m 2001–2022
Walker 3 52 134 0.1 m–20 m 2006–2022
Winnebago 3 37 197 1 m–21 m 2002–2022
Summary of data from other sources
Flathead 4 4 1864273 0 m 2012–2023
McConaughy 1 32 240 1 m–32 m 2010–2022
Mendota 1 22 20953 0.5 m–20 m 2019–2022
Mono 12 623 553 0.5 m–42 m 2018–2022
Oneida 4 50 1931 0 m–14 m 2000–2020
Pyramid 1 99 76 1 m–99 m 2011–2022
Sebago 1 14 35061 1 m–37 m 2018–2019
Tah oe 4 117 128926 0.5 m–480 m 2010–2021
Washington 1 57 19047 1 m–57 m 2008–2021
Winnipesaukee 1 1 53805 1 m 2016–2022
Tab le 2. Select summary statistics categorized by data source and then by each lake for which the source has
provided data. For each specied data source and lake, “Sites” indicates the number of unique monitoring
platform locations (i.e. latitude and longitude), “Sensors” indicates the total number of temperature sensors at
any depth across all platforms, “Observations” indicates the number of measurements taken though time across
all sensors, “Depth Coverage” indicates the shallowest and deepest sensors, and “Historical Coverage” indicates
the earliest and latest years for which data is available.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
7
SCIENTIFIC DATA | (2024) 11:282 | https://doi.org/10.1038/s41597-024-03103-8
www.nature.com/scientificdata
www.nature.com/scientificdata/
(marginal), 2 (good), or 3 (excellent) for each sensor reading, with each quality threshold corresponding to an
error of >2.0 K, 1.5–2.0 K, 1.0–1.5 K, or <1.0 K, respectively. We only used MODIS data with a quality threshold
of 2 or higher (i.e. reported error of <1.5 K) for our validation.
Fig. 4 Comparison between bulk surface (or near-surface) and skin temperature data from in situ platforms
and (respectively) the nearest MODIS pixel at four representative sites across calendar year 2019. Measurement
depths for in situ platforms are specied in the legend.
Washington
Flathead
Sakakawea
Devils
Red
Winnebago
Mendota
Seneca
Oneida
Champlain
Winnipesaukee
Sebago
Marion
Okeechobee
Pontchartrain
Houston
Tawakoni
Lewisville
McConaughy
Mead
Utah
Great Salt
Mono
Walker
Tahoe
Clear
Pyramid
Upper Klamath
Malheur
Fig. 5 For each lake in our study, a summary of the lake’s shoreline, associated HRRR model pixels (yellow
grids; 3 km × 3 km each), and location of in situ monitoring sites (represented by a blue ‘x’). Some panels (e.g.
Winnebago, Seneca, Champlain) also show HRRR pixels from adjacent water bodies that are not included in
our study.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
8
SCIENTIFIC DATA | (2024) 11:282 | https://doi.org/10.1038/s41597-024-03103-8
www.nature.com/scientificdata
www.nature.com/scientificdata/
A visual comparison between our in situ data and data from each corresponding MODIS pixel (see Fig.4 for
a representative time series from four sites) suggests that the in situ temperatures are generally consistent with
MODIS, with minimal pronounced visible bias. To supplement this visual comparison, we calculated the bias
(relative to MODIS) of each in situ data point, along with the root-mean-square error (RMSE) of each site and
the RMSE across all sites. Specically, for each MODIS data point, we identied the closest in situ value that was
collected within 3 hours of the MODIS observation. If there was no in situ data collected within 3 hours of the
MODIS observation, then that MODIS observation was not used for validation. is approach resulted in 1,808
pairs of in situ and MODIS temperatures. e RMSE and bias across all validation data pairs was 2.780 K and
0.023 K, respectively (with MODIS being slightly warmer on average), and a more detailed assessment of RMSE
and bias for each monitoring platform (Table3) indicates that bias is generally low, especially at sites for which
there is a high number of observations.

As described in the Data Records section, our database contains the R scripts that we used to extract and format
data for each lake. Additionally, the database contains example scripts for organizing and visualizing the data.
Received: 15 August 2023; Accepted: 28 February 2024;
Published: xx xx xxxx

1. Samuelsson, P., ourzeneva, E. & Mironov, D. e impact of laes on the European climate as simulated by a regional climate model.
Boreal Environmental Res earch 15, 113–129 (2010).
2. Balsamo, G. et al. On the contribution of laes in predicting near-surface temperature in a global weather forecasting model. Tel lus
A: Dynamic Meteorology and Oceanography 64, 15829 (2012).
Site Name Lake RMSE Bias NSite Name Lake RMSE Bias N
CHA01_00 Champlain 2.02 0.32 132 OKE12_00 Okeechobee 1.11 1.11 1
CHA02_00 Champlain 3.53 0.07 125 OKE15_00 Okeechobee 1.23 1.12 2
CHA03_00 Champlain 2.9 0.81 123 ONE02_00 Oneida 2.21 0.28 11
CHA08_00 Champlain 1.93 1.58 9 ONE03_00 Oneida 2.51 0.2 8
CHA09_00 Champlain 0.54 0.18 7 ONE04_00 Oneida 4.14 1.28 9
CHA10_00 Champlain 2.22 2 6 RED01_00 Red 2.2 0.61 12
CHA11_00 Champlain 3.15 0.33 5 RED02_00 Red 2.62 0.56 12
CHA12_00 Champlain 1.84 1.06 6 RED03_00 Red 2.23 0.89 11
CHA14_00 Champlain 3.35 1.09 7 RED04_00 Red 1.65 1.54 10
CLE02_00 Clear 1.52 1.4 2 RED05_00 Red 1.64 1.51 9
CLE06_00 Clear 1.09 1.09 1 RED06_00 Red 6.36 1.58 8
DEV03_00 Devils 1.41 1.41 1 RED07_00 Red 7.35 2.23 7
GSL02_00 Great Salt 1.35 0.64 3 RED08_00 Red 1.83 0.6 5
GSL04_00 Great Salt 2.94 2.65 4 RED09_00 Red 2.58 0.11 5
GSL05_00 Great Salt 1.93 1.68 3 RED10_00 Red 1.59 1.27 4
LEW02_00 Lewisville 1.44 0.03 3 SAK01_00 Sakakawea 0.7 0.14 2
MAR05_00 Marion 3.05 2.45 3 SAK03_00 Sakakawea 1.13 1.13 1
MAR06_00 Marion 2.11 2.11 1 SAK04_00 Sakakawea 2.21 0.81 4
MEA02_00 Mead 3.28 0.61 395 TAH01_00 Ta hoe 1.53 0.4 76
MEN01_00 Mendota 3.28 0.29 200 TAH02_00 Taho e 1.53 0.39 78
MON01_00 Mono 4.37 4.15 4 TAH03_00 Tahoe 1.53 0.61 84
MON03_00 Mono 2.32 2.24 3 TAH04_00 Ta hoe 1.53 0.13 3
MON04_00 Mono 2.09 1.7 2 UPK01_00 Upper Klamath 1.53 0.28 350
MON08_00 Mono 2.97 1.39 2 UPK06_00 Upper Klamath 1.53 0.46 8
MON10_00 Mono 2.16 1.89 3 UTA01_00 Utah 1.53 1.49 3
MON11_00 Mono 4.21 0.59 5 UTA03_00 Utah 1.53 1.64 3
OKE02_00 Okeechobee 0.69 0.68 2 UTA04_00 Utah 1.53 4.08 3
OKE04_00 Okeechobee 1.96 1.63 2 UTA05_00 Utah 1.53 1.54 3
OKE06_00 Okeechobee 0.45 0.45 1 UTA06_00 Utah 1.53 1.87 6
OKE07_00 Okeechobee 0.37 0.37 1 WAL03_00 Walker 1.53 1.4 2
OKE10_00 Okeechobee 1.26 0.98 3 WIN03_00 Winnebago 1.53 3.45 2
OKE11_00 Okeechobee 1.47 1.47 1
Tab le 3. Summary of root-mean-squared errors (RMSE) and bias (all in C) at each monitoring station based on
comparison between in situ and MODIS data. Number (N) of data points used for comparison is included for
reference. Only stations for which an RMSE and bias value could be calculated (see methods above) are listed.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
9
SCIENTIFIC DATA | (2024) 11:282 | https://doi.org/10.1038/s41597-024-03103-8
www.nature.com/scientificdata
www.nature.com/scientificdata/
3. Fujisai-Manome, A. et al. Improvements to lae-eect snow forecasts using a one-way air-lae model coupling approach. Journal
of Hydrometeorology 21, 2813–2828 (2020).
4. Xue, P. et al. Climate projections over the great laes region: using two-way coupling of a regional climate model with a 3-d lae
model. Geoscientic Model Development 15, 4425–4446 (2022).
5. Adrian, . et al. L aes as sentinels of climate change. Limnology and Oceanography 54, 2283–2297 (2009).
6. Benjamin, S. G. et al. Inland lae temperature initialization via coupled cycling with atmospheric data assimilation. Geoscientic
Model Development 15, 6659–6676, https://doi.org/10.5194/gmd-15-6659-2022 (2022).
7. Hsiao, W.-T. et al. ole of the tropics in state-dependent improvements of US west coast NOAA Unified Forecast System
precipitation forec asts. Geophysical Research Letters 49, e2021GL096447 (2022).
8. hazaei, B., ead, L. ., Casali, M., Sampson, . M. & Yates, D. N. GLOBathy, the global laes bathymetry dataset. Scientic Data 9,
36 (2022).
9. Dowell, D. C. et al. e High-esolution apid efresh (H): An hourly updating convection-allowing forecast model. Part I:
Motivation and system description. Weather and Forecasting 37, 1371–1395 (2022).
10. Oleson, . et al. Technical description of version 4.5 of the community land model (clm) https://doi.org/10.5065/D61W7M (2013).
11. Anderson, E. J., Schwab, D. J. & Lang, G. A. eal-time hydraulic and hydrodynamic model of the St. Clair, Lae St. Clair, Detroit
iver system. Journal of Hydraulic Engineering 136, 507–518 (2010).
12. Bai, X. et al. Modeling 1993–2008 climatology of seasonal general circulation and thermal structure in the Great Laes using
FVCOM. Ocean Modelling 65, 40–63 (2013).
13. Meindl, E. A. & Hamilton, G. D. Programs of the National Data Buoy Center. Bulletin of the American Meteorological Society 73,
985–993 (1992).
14. Blodgett, D., Lucido, J. & re, J. Progress on water data integration and distribution: a summary of select US Geological Survey data
systems. Journal of Hydroinformatics 18, 226–237 (2016).
15. ead, E. . et al. Water quality data for national-scale aquatic research: e Water Quality Portal. Water Resources Research 53,
1735–1745 (2017).
16.  Core Team. R: A Language and Environment for Statistical Computing ( Foundation for Statistical Computing, Vienna, Austria,
2022).
17. DeCicco, L. et al. dataetrieval: etrieval Functions for USGS and EPA Hydrology and Water Quality Data (2022).
18. Environmental Protection Agency & United States Geological Survey. Water quality portal, https://doi.org/10.5066/P9QUVJ.
19. udstam, L. Limnological data and depth profile from Oneida Lake, New York, 1975-present https://doi.org/10.5063/
F5061FT5068JH5064 (2021).
2 0. Sorensen, T.  . A database of in situ surface and subsurface water temperatures for large inland laes across the coterminous United
States. University of Michigan Deep Blue Data https://doi.org/10.7302/7GND-MJ10. (2023).
21. ilpatric, . A. et al. A decade of sea surface temperature from MODIS. Remote Sensing of Environment 165, 27–41 (2015).
22. Deng, X., Featherstone, W. E., Hwang, C. & Berry, P. A. M. Estimation of contamination of ES-2 and POSEIDON satellite radar
altimetry close to the coasts of Australia. Marine Geodesy 25, 249–271, https://doi.org/10.1080/01490410214990 (2002).
23. Hulley, G. & Hoo, S. MODIS/Terra Land Surface Temperature/3-Band Emissivity Daily L3 Global 1 km SIN Grid Day V061 https://
doi.org/10.5067/MODIS/MOD21A1D.061 (2021).
2 4. Hulley, G. & Hoo, S. MODIS/Ter ra Land Surface Temperature/3-Band Emissivity Daily L3 Global 1 km SIN Grid Night V061 https://
doi.org/10.5067/MODIS/MOD21A1N.061 (2021).
25. Hulley, G. MODIS/Aqua Land Surface Temperature/3-Band Emissivity Daily L3 Global 1 km SIN Grid Day V061 https://doi.
org/10.5067/MODIS/MYD21A1D.061 (2021).
26. Hulley, G. MODIS/Aqua Land Surface Temperature/3-Band Emissivity Daily L3 Global 1 km SIN Grid Night V061 https://doi.
org/10.5067/MODIS/MYD21A1N.061 (2021).
27. Messager, M. L., Lehner, B., Grill, G., Nedeva, I. & Schmitt, O. Estimating the volume and age of water stored in global laes using a
geo-statistical approach. Nature communications 7, 13603 (2016).
Acknowledgements
e authors thank the groups and individuals named in the Methods section who helped share lake temperature
data. e authors also thank Stan Benjamin, Eric Anderson, David Yates, Tanya Smirnova, Matthew Casali,
Mike Barlage, Bahram Khazaei, Nina Omani, and Eric James for helpful discussions related to HRRR model
simulation design and validation needs. is reportwas prepared by Troy Sorensenusing Federal funds under
award 018422from NOAA JTTI, U.S. Department of Commerce. e statements, ndings, conclusions, and
recommendations are those of the author(s) and do not necessarily reect the views of the NOAA JTTIor the
U.S. Department of Commerce.
Author contributions
T.S. gathered data, organized the database, developed scripts and methods, created figures, and wrote the
manuscript. E.E. issued feedback on the usability of the database, provided insight on methods and gures, and
assisted with ArcGIS. J.G.W.K. conceptualized the database and gathered data. J.K. provided insight on database
structure and assisted with scripts and gures. A.G. served as the Principal Investigator and provided overall
guidance and technical writing. All authors reviewed the manuscript. Note that while T.S. and E.E. completed
this work at the University of Michigan, T.S. is now aliated with the Colorado School of Mines and E.E. is now
aliated with North Carolina State University.

e authors declare no competing interests.
Additional information
Correspondence and requests for materials should be addressed to T.S.
Reprints and permissions information is available at www.nature.com/reprints.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and
institutional aliations.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
10
SCIENTIFIC DATA | (2024) 11:282 | https://doi.org/10.1038/s41597-024-03103-8
www.nature.com/scientificdata
www.nature.com/scientificdata/
Open Access This article is licensed under a Creative Commons Attribution 4.0 International
License, which permits use, sharing, adaptation, distribution and reproduction in any medium or
format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Cre-
ative Commons licence, and indicate if changes were made. e images or other third party material in this
article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the
material. If material is not included in the article’s Creative Commons licence and your intended use is not per-
mitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the
copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
© e Author(s) 2024
Content courtesy of Springer Nature, terms of use apply. Rights reserved
1.
2.
3.
4.
5.
6.
Terms and Conditions
Springer Nature journal content, brought to you courtesy of Springer Nature Customer Service Center GmbH (“Springer Nature”).
Springer Nature supports a reasonable amount of sharing of research papers by authors, subscribers and authorised users (“Users”), for small-
scale personal, non-commercial use provided that all copyright, trade and service marks and other proprietary notices are maintained. By
accessing, sharing, receiving or otherwise using the Springer Nature journal content you agree to these terms of use (“Terms”). For these
purposes, Springer Nature considers academic use (by researchers and students) to be non-commercial.
These Terms are supplementary and will apply in addition to any applicable website terms and conditions, a relevant site licence or a personal
subscription. These Terms will prevail over any conflict or ambiguity with regards to the relevant terms, a site licence or a personal subscription
(to the extent of the conflict or ambiguity only). For Creative Commons-licensed articles, the terms of the Creative Commons license used will
apply.
We collect and use personal data to provide access to the Springer Nature journal content. We may also use these personal data internally within
ResearchGate and Springer Nature and as agreed share it, in an anonymised way, for purposes of tracking, analysis and reporting. We will not
otherwise disclose your personal data outside the ResearchGate or the Springer Nature group of companies unless we have your permission as
detailed in the Privacy Policy.
While Users may use the Springer Nature journal content for small scale, personal non-commercial use, it is important to note that Users may
not:
use such content for the purpose of providing other users with access on a regular or large scale basis or as a means to circumvent access
control;
use such content where to do so would be considered a criminal or statutory offence in any jurisdiction, or gives rise to civil liability, or is
otherwise unlawful;
falsely or misleadingly imply or suggest endorsement, approval , sponsorship, or association unless explicitly agreed to by Springer Nature in
writing;
use bots or other automated methods to access the content or redirect messages
override any security feature or exclusionary protocol; or
share the content in order to create substitute for Springer Nature products or services or a systematic database of Springer Nature journal
content.
In line with the restriction against commercial use, Springer Nature does not permit the creation of a product or service that creates revenue,
royalties, rent or income from our content or its inclusion as part of a paid for service or for other commercial gain. Springer Nature journal
content cannot be used for inter-library loans and librarians may not upload Springer Nature journal content on a large scale into their, or any
other, institutional repository.
These terms of use are reviewed regularly and may be amended at any time. Springer Nature is not obligated to publish any information or
content on this website and may remove it or features or functionality at our sole discretion, at any time with or without notice. Springer Nature
may revoke this licence to you at any time and remove access to any copies of the Springer Nature journal content which have been saved.
To the fullest extent permitted by law, Springer Nature makes no warranties, representations or guarantees to Users, either express or implied
with respect to the Springer nature journal content and all parties disclaim and waive any implied warranties or warranties imposed by law,
including merchantability or fitness for any particular purpose.
Please note that these rights do not automatically extend to content, data or other material published by Springer Nature that may be licensed
from third parties.
If you would like to use or distribute our Springer Nature journal content to a wider audience or on a regular basis or in any other manner not
expressly permitted by these Terms, please contact Springer Nature at
onlineservice@springernature.com
ResearchGate has not been able to resolve any citations for this publication.
Article
Full-text available
Application of lake models coupled within earth-system prediction models, especially for predictions from days to weeks, requires accurate initialization of lake temperatures. Commonly used methods to initialize lake temperatures include interpolation of global sea-surface temperature (SST) analyses to inland lakes, daily satellite-based observations, or model-based reanalyses. However, each of these methods have limitations in capturing the temporal characteristics of lake temperatures (e.g., effects of anomalously warm or cold weather) for all lakes within a geographic region and/or during extended cloudy periods. An alternative lake-initialization method was developed which uses two-way-coupled cycling of a small-lake model within an hourly data assimilation system of a weather prediction model. The lake model simulated lake temperatures were compared with other estimates from satellite and in situ observations and interpolated-SST data for a multi-month period in 2021. The lake cycling initialization, now applied to two operational US NOAA weather models, was found to decrease errors in lake surface temperature from as much as 5–10 K vs. interpolated-SST data to about 1–2 K compared to available in situ and satellite observations.
Article
Full-text available
Warming trends in the Laurentian Great Lakes and surrounding areas have been observed in recent decades, and concerns continue to rise about the pace and pattern of future climate change over the world's largest freshwater system. To date, most regional climate models used for Great Lakes projections either neglected the lake-atmosphere interactions or are only coupled with a 1-D column lake model to represent the lake hydrodynamics. This study presents a Great Lakes climate change projection that has employed the two-way coupling of a regional climate model with a 3-D lake model (GLARM) to resolve 3-D hydrodynamics essential for large lakes. Using the three carefully selected Coupled Model Intercomparison Project Phase 5 (CMIP5) general circulation models (GCMs), we show that the GLARM ensemble average substantially reduces surface air temperature and precipitation biases of the driving GCM ensemble average in present-day climate simulations. The improvements are not only displayed from an atmospheric perspective but are also evident in the accurate simulations of lake temperature and ice coverage. We further present the GLARM projected climate change for the mid-21st century (2030–2049) and the late 21st century (2080–2099) in the Representative Concentration Pathway (RCP) 4.5 and RCP 8.5 scenarios. Under RCP 8.5, the Great Lakes basin is projected to warm by 1.3–2.1 ∘C by the mid-21st century and 4.1–5.0 ∘C by the end of the century relative to the early century (2000–2019). Moderate mitigation (RCP 4.5) reduces the mid-century warming to 0.8–1.8 ∘C and late-century warming to 1.8–2.7 ∘C. Annual precipitation in GLARM is projected to increase for the entire basin, varying from 0 % to 13 % during the mid-century and from 9 % to 32 % during the late century in different scenarios and simulations. The most significant increases are projected in spring and fall when current precipitation is highest and a minimal increase in winter when it is lowest. Lake surface temperatures (LSTs) are also projected to increase across the five lakes in all of the simulations, but with strong seasonal and spatial variability. The most significant LST increases occur in Lakes Superior and Ontario. The strongest warming is projected in spring that persists into the summer, resulting from earlier and more intense stratification in the future. In addition, diminishing winter stratification in the future suggests the transition from dimictic lakes to monomictic lakes by the end of the century. In contrast, a relatively smaller increase in LSTs during fall and winter is projected with heat transfer to the deep water due to the strong mixing and energy required for ice melting. Correspondingly, the highest monthly mean ice cover is projected to reduce to 3 %–15 % and 10 %–40 % across the lakes by the end of the century in RCP 8.5 and RCP 4.5, respectively. In the coastal regions, ice duration is projected to decrease by up to 60 d.
Article
Full-text available
Plain Language Summary To test whether a more accurate representation of tropical weather can lead to better extratropical forecasts 3–4 weeks in advance during boreal winter, retrospective forecasts (hindcasts) are performed with the tropics forced to closely match observational estimates. The precipitation at Weeks 3–4 lead times is improved over the United States (US) West Coast in an operational weather model in forced hindcasts compared to those without forcing. To diagnose the origin of these improvements, a machine‐learning method that subsets hindcasts by the similarity of their initial weather states is used. One subset that demonstrates larger improvements at Weeks 3–4 than the others features an initially strong low pressure system in the North Pacific. The greater improvements for this subset of hindcasts originate from an incorrect simulation of tropical precipitation in the non‐forced hindcasts. In particular, the forced hindcasts are better able to simulate the weakening of the North Pacific low pressure a few weeks into the prediction that is produced by atmospheric waves emanating poleward induced by tropical precipitation. These findings identify conditions under which correctly simulating tropical precipitation in the model is the most beneficial for Weeks 3–4 precipitation forecasts over the US West Coast during boreal winter.
Article
Full-text available
Waterbodies (natural lakes and reservoirs) are a critical part of a watershed’s ecological and hydrological balance, and in many cases dictate the downstream river flows either through natural attenuation or through managed controls. Investigating waterbody dynamics relies primarily on understanding their morphology and geophysical characteristics that are primarily defined by bathymetry. Bathymetric conditions define stage-storage relationships and circulation/transport processes in waterbodies. Yet many studies oversimplify these mechanisms due to unavailability of the bathymetric data. We developed a novel GLObal Bathymetric (GLOBathy) dataset of 1.4+ million waterbodies to align with the well-established global dataset, HydroLAKES. GLOBathy uses a GIS-based framework to generate bathymetric maps based on the waterbody maximum depth estimates and HydroLAKES geometric/geophysical attributes of the waterbodies. The maximum depth estimates are validated at 1,503 waterbodies, making use of several observed data sources. We also provide estimations for head-Area-Volume (h-A-V) relationships of the HydroLAKES waterbodies, driven from the bathymetric maps of the GLOBathy dataset. The h-A-V relationships provide essential information for water balance and hydrological studies of global waterbody systems.
Article
Full-text available
Aquatic systems are critical to food, security, and society. But, water data are collected by hundreds of research groups and organizations, many of which use nonstandard or inconsistent data descriptions and dissemination, and disparities across different types of water observation systems represent a major challenge for freshwater research. To address this issue, the Water Quality Portal (WQP) was developed by the U.S. Environmental Protection Agency, the U.S. Geological Survey, and the National Water Quality Monitoring Council to be a single point of access for water quality data dating back more than a century. The WQP is the largest standardized water quality data set available at the time of this writing, with more than 290 million records from more than 2.7 million sites in groundwater, inland, and coastal waters. The number of data contributors, data consumers, and third-party application developers making use of the WQP is growing rapidly. Here we introduce the WQP, including an overview of data, the standardized data model, and data access and services; and we describe challenges and opportunities associated with using WQP data. We also demonstrate through an example the value of the WQP data by characterizing seasonal variation in lake water clarity for regions of the continental U.S. The code used to access, download, analyze, and display these WQP data as shown in the figures is included as supporting information.
Article
Full-text available
Lakes are key components of biogeochemical and ecological processes, thus knowledge about their distribution, volume and residence time is crucial in understanding their properties and interactions within the Earth system. However, global information is scarce and inconsistent across spatial scales and regions. Here we develop a geo-statistical model to estimate the volume of global lakes with a surface area of at least 10 ha based on the surrounding terrain information. Our spatially resolved database shows 1.42 million individual polygons of natural lakes with a total surface area of 2.67 × 10⁶ km² (1.8% of global land area), a total shoreline length of 7.2 × 10⁶ km (about four times longer than the world's ocean coastline) and a total volume of 181.9 × 10³ km³ (0.8% of total global non-frozen terrestrial water stocks). We also compute mean and median hydraulic residence times for all lakes to be 1,834 days and 456 days, respectively.
Article
Full-text available
Critical water-resources issues ranging from flood response to water scarcity make access to integrated water information, services, tools, and models essential. Since 1995 when the first water data web pages went online, the US Geological Survey has been at the forefront of water data distribution and integration. Today, real-time and historical streamflow observations are available via web pages and a variety of web service interfaces. The Survey has built partnerships with Federal and State agencies to integrate hydrologic data providing continuous observations of surface and groundwater, temporally discrete water-quality data, groundwater well logs, aquatic biology data, water availability and use information, and tools to help characterize the landscape for modeling. In this paper, we summarize the status and design patterns implemented for selected data systems. We describe how these systems contribute to a US Federal Open Water Data Initiative and present some gaps and lessons learned that apply to global hydroinformatics data infrastructure.
Article
Full-text available
The impact of lakes on the European climate is considered by analysing two 30-year regional climate model (RCM) simulations. The RCM applied is the Rossby Centre regional climate model RCA3.5. A simulation where all lakes in the model domain are replaced by land surface is compared with a simulation where the effect of lakes is accounted for through the use of the lake model FLake coupled to RCA. The difference in 2m open-land air temperature between the two simulations shows that lakes induce a warming on the European climate for all seasons. The greatest impact is seen during autumn and winter over southern Finland and western Russia where the warming exceeds 1 °C. Locally, e.g. over southern Finland and over Lake Ladoga, the convective precipitation is enhanced by 20%-40% during late summer and early autumn while it is reduced by more than 70% over Lake Ladoga during early summer.
Article
The High-Resolution Rapid Refresh (HRRR) is a convection-allowing implementation of the Weather Research and Forecasting model (WRF-ARW) with hourly data assimilation that covers the conterminous United States and Alaska and runs in real time at the NOAA National Centers for Environmental Prediction. Implemented operationally at NOAA/NCEP in 2014, the HRRR features 3-km horizontal grid spacing and frequent forecasts (hourly for CONUS and 3-hourly for Alaska). HRRR initialization is designed for optimal short-range forecast skill with a particular focus on the evolution of precipitating systems. Key components of the initialization are radar-reflectivity data assimilation, hybrid ensemble-variational assimilation of conventional weather observations, and a cloud analysis to initialize stratiform cloud layers. From this initial state, HRRR forecasts are produced out to 18 h every hour, and out to 48 h every 6 h, with boundary conditions provided by the Rapid Refresh system. Between 2014 and 2020, HRRR development was focused on reducing model bias errors and improving forecast realism and accuracy. Improved representation of the planetary boundary layer, subgrid-scale clouds, and land surface contributed extensively to overall HRRR improvements. The final version of the HRRR (HRRRv4), implemented in late 2020, also features hybrid data assimilation using flow-dependent covariances from a 3-km, 36-member ensemble (“HRRRDAS”) with explicit convective storms. HRRRv4 also includes prediction of wildfire smoke plumes. The HRRR provides a baseline capability for evaluating NOAA’s next-generation Rapid Refresh Forecast System, now under development.
Article
Lake-effect convective snowstorms frequently produce high-impact, hazardous winter weather conditions downwind of the North American Great Lakes. During lake-effect snow events, the lake surfaces can cool rapidly, and in some cases, notable development of ice cover occurs. Such rapid changes in the lake-surface conditions are not accounted for in existing operational weather forecast models, such as the National Oceanic and Atmospheric Administration (NOAA)’s High Resolution Rapid Refresh (HRRR) model, resulting in reduced performance of lake-effect snow forecasts. As a milestone to future implementations in the Great Lakes Operational Forecast System (GLOFS) and HRRR, this study examines the one-way linkage between the hydrodynamic-ice model (the Finite-Volume Community Ocean Model coupled with the unstructured grid version of the Los Alamos Sea Ice Model, FVCOM-CICE, the physical core model of GLOFS) and the atmospheric model (the Weather Research and Forecasting model, WRF, the physical core model of HRRR). The realistic representation of lake-surface cooling and ice development or its fractional coverage during three lake-effect snow events was achieved by feeding the FVCOM-CICE simulated lake-surface conditions to WRF (using a regional configuration of HRRR), resulting in the improved simulation of the turbulent heat fluxes over the lakes and resulting snow water equivalent in the downwind areas. This study shows that the one-way coupling is a practical approach that is well suited to the operational environment, as it requires little to no increase in computational resources yet can result in improved forecasts of regional weather and lake conditions.