Degree Programme
Marine Sensorik (M.Sc.)
MASTER'S THESIS
Title
Automated Analysis Of Coastal Webcam Footage By
Means Of Machine Learning
Submitted by
Julia Steiwer
Supervising Examiner
Prof. Dr. Oliver Zielinski
Second Examiner
Prof. Dr.-Ing. habil. Torsten Schlurmann
Oldenburg, 02.02.2023
Abstract
This thesis aims to create a machine-learning approach for predicting wave heights and
water levels from webcam images. For this purpose, different models and input data types
were tested. Classification and regression tasks were performed with deep learning and
machine learning models. The water level prediction was divided into four models
depending on the sea state visible in the images: there are models for calm, smooth, slight,
and moderate seas, plus a model for wave height prediction. The best results were achieved
using supervised machine learning with Gaussian process regression. For the 2015 dataset,
the wave height prediction achieved an error of 11 cm, while the water level prediction
achieved a mean error of 26 cm. The errors for the individual water level prediction models
were 0 cm for calm seas, 14 cm for smooth seas, 12 cm for slight seas, and 1 cm for
moderate seas.
Contents
Abstract
Contents
List of Figures
List of Tables
List of Abbreviations
1. Introduction
2. Literature Review
2.1 Study Site
2.2 Preliminary Work
2.3 Machine and Deep Learning
2.3.1 Background Information
2.3.2 Application in Coastal Engineering
3. Materials and Methods
3.1 Webcam Technology
3.2 Data Sources
3.2.1 Wave Data
3.2.2 Water Level
3.3 Processing and Formatting Data
3.3.1 Preprocessing the Image Data
3.3.2 Creating Regression Data
3.3.3 Preprocessing Image Data for Classification
3.4 Classification and Regression with Deep Learning
3.4.1 Deep Network Designer
3.4.2 Network Example: ResNet18
3.4.3 Classification Task
3.4.4 Regression Task
3.5 Regression with Machine Learning
3.5.1 Regression Learner App
3.5.2 Model Example: Gaussian Process Regression (GPR)
3.5.3 Making Predictions with New Data
4. Results
4.1 Image Preprocessing
4.2 Classification Task
4.3 Regression Task
4.3.1 Regression with Deep Learning
4.3.2 Regression with Machine Learning
4.4 Predictions with 2015 Dataset
4.4.1 Wave Height Prediction
4.4.2 Water Level Prediction
Water Level for Calm Sea State
Water Level for Smooth Sea State
Water Level for Slight Sea State
Water Level for Moderate Sea State
4.5 Predictions with 2016 Dataset
4.5.1 Wave Height Prediction
4.5.2 Water Level Prediction
Water Level for Calm Sea State
Water Level for Smooth Sea State
Water Level for Slight Sea State
Water Level for Moderate Sea State
5. Discussion
5.1 Image Preprocessing
5.2 Classification Problem
5.3 Regression Problem
5.3.1 Regression with Deep Learning
5.3.2 Regression with Machine Learning
6. Conclusion
References
Scientific Publications
Weblinks
Pictorial Sources
Appendices
List of Figures
Fig. 1: Webcam image from Norderney, taken on 8th August 2015 at 10:10 UTC+2.
Fig. 2: Beach on Wangerooge after hurricane Zeynep.
Fig. 3: East Frisian Islands offshore of the German North Sea coast.
Fig. 4: Groyne field before the nourishment, 18 June 2019 at 9:00 UTC+2.
Fig. 5: Groyne field during the nourishment, 23 July 2019 at 9:00 UTC+2.
Fig. 6: Groyne field after the nourishment, 27 July 2019 at 9:00 UTC+2.
Fig. 7: Groyne field with placements of combined (1992) and conventional (1989) nourishments on Norderney.
Fig. 8: Webcam position on Norderney.
Fig. 9: View of the groyne field from the webcam, 18 July 2015 at 9:00 UTC+2.
Fig. 10: Webcam image with the analysis area of the sky framed in red (Reuter, 2022).
Fig. 11: Transformation stages of the cropped image of the groyne.
Fig. 12: Images from 4th May 2016 and 4th June 2016, and their superimposed image.
Fig. 13: Collage of water lines from the 8th to the 14th of January 2015.
Fig. 14: Tide curves of measured and calculated water levels.
Fig. 15: Scatter plots for water levels and wave heights.
Figure showing ML and DL as parts of AI.
Fig. 16: Original and over-segmented image.
Fig. 17: Original image and transformed image with ROI.
Fig. 18: Images from Farson river camera.
Fig. 19: Images taken by an ASV at different sea state levels.
Fig. 20: Camera angle of vision with positions of the camera, the wave buoy, and the water gauge.
Fig. 21: Image taken by the Norderney webcam on 20th August 2015 at 08:50 UTC+2.
Fig. 22: Webcam snapshot from the 18th February 2012 at 12:30 UTC+1.
Fig. 23: Webcam snapshot from the 15th September 2017 at 12:10 UTC+2.
Fig. 24: Measured vs linearly interpolated wave heights from 2015.
Fig. 25: Measured vs linearly interpolated wave heights from 2016.
Fig. 26: Tide gauge Norderney-Riffgat.
Fig. 27: Water levels from 2015, normalised to the 10-minute resolution of the image data.
Fig. 28: Water levels from 2016, normalised to the 10-minute resolution of the image data.
Figure showing the main preprocessing workflow.
Fig. 29: Webcam snapshot with the groyne framed in red.
Fig. 30: Webcam image with ROIs for brightness detection.
Fig. 31: ROIs for fog and snow detection.
Fig. 32: Screenshot of the linking table.
Fig. 33: Webcam image from 1st January 2015 at 13:30 UTC+1 and the image section cropped to the groyne.
Fig. 34: Plotted GLCMs for the full-size and the cropped image.
Figure showing the main regression workflow.
Figure showing the workflow for feature extraction.
Figure showing the workflow for water level prediction.
Fig. 35: Different image types used for the classification task.
Figure showing an average image.
Figure showing a difference image.
Fig. 36: Main window and model selection menu of the Deep Network Designer.
Fig. 37: Active main window of the Deep Network Designer.
Fig. 38: Deep Learning Network Analyzer.
Fig. 39: Import Image Data window of the Deep Network Designer.
Fig. 40: Example root-to-leaf branches for the mammal and vehicle subtrees.
Figure showing the input block of the ResNet18.
Figure showing an identity block of the ResNet18.
Figure showing an addition and thresholding block of the ResNet18.
Figure showing a convolutional block of the ResNet18.
Figure showing the classification block of the ResNet18.
Fig. 41: ResNet18 structure, simplified block version.
Fig. 42: Relative prediction speeds and accuracies of different pretrained networks.
Fig. 43: Different image types used for the classification task.
Fig. 44: Layer information for the linear model.
Fig. 45: Network architecture for a model with two inputs.
Fig. 46: Layer information for the model with two inputs.
Fig. 47: Layer graphs for the linear model and the model with two inputs.
Fig. 48: “New Session from Workspace” window of the Regression Learner app.
Fig. 49: Exemplary response plot.
Fig. 50: Exemplary plot for predicted vs true responses.
Fig. 51: Exemplary plot of residuals for each observation.
Fig. 52: Exemplary overview of the night images with outliers.
Fig. 53: Exemplary overview of the twilight images with outliers.
Fig. 54: Exemplary overview of the cropped daylight images with outliers.
Fig. 55: Exemplary overview of the bad weather images with outliers.
Fig. 56: Training plot for classification with normal images and 32 classes.
Fig. 57: Training plot for classification with normal images and four classes.
Fig. 58: Training plot for classification with edge images and four classes.
Fig. 59: Training plot for classification with difference images and 32 classes.
Fig. 60: RMSE and loss for the training of a ResNet18 for regression.
Fig. 61: RMSE and loss for the training of a regression network with two inputs.
Fig. 62: Distribution of the measured vs predicted wave heights in 2015.
Fig. 63: Scatter plot of predicted vs measured wave heights in 2015.
Fig. 64: Measured and predicted wave heights in 2015 plotted over time.
Fig. 65: Distribution of the measured vs predicted water levels in 2015.
Fig. 66: Measured and predicted water levels in 2015 plotted over time.
Fig. 67: Measured and predicted water levels for a two-week period in 2015.
Fig. 68: Distribution of predicted vs measured water levels in 2015 during calm sea conditions.
Fig. 69: Scatter plot of predicted vs measured water levels in 2015 during calm sea conditions.
Fig. 70: Predicted and measured water levels in 2015 during calm sea conditions over time.
Fig. 71: Distribution of predicted vs measured water levels in 2015 during smooth sea conditions.
Fig. 72: Scatter plot of predicted vs measured water levels in 2015 during smooth sea conditions.
Fig. 73: Predicted and measured water levels in 2015 during smooth sea conditions over time.
Fig. 74: Distribution of predicted vs measured water levels in 2015 during slight sea conditions.
Fig. 75: Scatter plot of predicted vs measured water levels in 2015 during slight sea conditions.
Fig. 76: Predicted and measured water levels in 2015 during slight sea conditions over time.
Fig. 77: Distribution of predicted vs measured water levels in 2015 during moderate sea conditions.
Fig. 78: Scatter plot of predicted vs measured water levels in 2015 during moderate sea conditions.
Fig. 79: Predicted and measured water levels in 2015 during moderate sea conditions over time.
Fig. 80: Distribution of the measured vs predicted wave heights in 2016.
Fig. 81: Scatter plot of predicted vs measured wave heights in 2016.
Fig. 82: Measured and predicted wave heights in 2016 plotted over time.
Fig. 83: Distribution of the measured vs predicted water levels in 2016.
Fig. 84: Measured and predicted water levels in 2016 plotted over time.
Fig. 85: Measured and predicted water levels for a two-week period in 2016.
Fig. 86: Distribution of predicted vs measured water levels in 2016 during calm sea conditions.
Fig. 87: Scatter plot of predicted vs measured water levels in 2016 during calm sea conditions.
Fig. 88: Predicted and measured water levels in 2016 during calm sea conditions over time.
Fig. 89: Distribution of predicted vs measured water levels in 2016 during smooth sea conditions.
Fig. 90: Scatter plot of predicted vs measured water levels in 2016 during smooth sea conditions.
Fig. 91: Predicted and measured water levels in 2016 during smooth sea conditions over time.
Fig. 92: Distribution of predicted vs measured water levels in 2016 during slight sea conditions.
Fig. 93: Scatter plot of predicted vs measured water levels in 2016 during slight sea conditions.
Fig. 94: Predicted and measured water levels in 2016 during slight sea conditions over time.
Fig. 95: Distribution of predicted vs measured water levels in 2016 during moderate sea conditions.
Fig. 96: Scatter plot of predicted vs measured water levels in 2016 during moderate sea conditions.
Fig. 97: Predicted and measured water levels in 2016 during moderate sea conditions over time.
Fig. 98: Daylight image falsely sorted into the twilight category, taken on 4th February 2015, 10:30 UTC+1.
Fig. 99: Daylight image with cloudy skies, falsely sorted into the twilight category, taken on 15th June 2015, 10:00 UTC+2.
Fig. 100: Night image wrongly classified as daylight image and cropped, taken on 27th January 2015 at 18:00 UTC+1.
Fig. 101: Original and low-light enhanced night image.
Fig. 102: Original and low-light enhanced cropped night image.
Fig. 103: Two difference images taken during different conditions.
Fig. 104: Two edge images taken during different conditions.
Fig. 105: RGB images which the edge images in Fig. 104 were derived from.
Fig. 106: Scatter plot for water levels from 2015 and 2016.
Fig. 107: Scatter plots for water levels from 2015 and 2016.
List of Tables
Tbl. 1: Thresholds for Daylight Detection.
Tbl. 2: GLCM of the full-size webcam image.
Tbl. 3: GLCM of the cropped webcam image.
Tbl. 4: Comparison of the GLCM properties.
Tbl. 5: RMSE for regression models without test data.
Tbl. 6: RMSEs and models for training on 28 feature columns.
List of Abbreviations
AI – Artificial Intelligence
ANN – Artificial Neural Network
ASV – Autonomous Surface Vehicle
BN – Bayesian Network
CF – Covariance Function
CNN – Convolutional Neural Network
CET, CEST – Central European (Summer) Time
CPT – Conditional Probability Table
CSV – Comma-Separated Values
DirP – Peak Direction
DL – Deep Learning
DRR – Disaster Risk Reduction
DWD – Deutscher Wetterdienst (German Meteorological Service)
ETD – Ebb-Tidal Delta
FCN – Fully Convolutional Network
FoV – Field of View
GLCM – Gray-Level Co-occurrence Matrix
GP, GPR – Gaussian Process, Gaussian Process Regression
GPU – Graphics Processing Unit
HIS – HIstory of Spectral parameters
KAR – Kernel Approximation Regression
k-NN – k-Nearest Neighbours
LR – Linear Regression
LSTM – Long Short-Term Memory
ML – Machine Learning
MLP – Multilayer Perceptron
MRMR – Minimum Redundancy Maximum Relevance
N/A – Not Available
NLWKN – Niedersächsischer Landesbetrieb für Wasserwirtschaft, Küsten- und Naturschutz (Lower Saxon State Department for Waterway, Coastal and Nature Conservation)
OTSU – Largest Inter-Class Variance, Otsu's Method
PCA – Principal Component Analysis
QP – Goda's Peakedness Parameter
R2 – Coefficient of Determination (R-squared)
ReLU – Rectified Linear Unit
RGB – Red-Green-Blue (Colour Space)
RMSE – Root Mean Square Error
RNN – Recurrent Neural Network
ROI – Region of Interest
SGDM – Stochastic Gradient Descent with Momentum
SPR – Source-Pathway-Receptor
SS – Significant Steepness
SSCQ – Sea State Characteristic Quantity
SSVM – Structured Support Vector Machine
SWH, HS – Significant Wave Height
TP – Peak Period
UTC – Coordinated Universal Time
WSA – Waterways and Shipping Administration
1. Introduction
Climate change is real, and the Earth is now about 1.1°C warmer than in the late 1800s; the
last decade (2011-2020) was the warmest on record [1]. Among its many effects are the
melting of polar ice caps and increasing sea levels [2]. Since 1880, the global mean sea level
has risen by 21–24 centimetres, and there are predictions that the sea level might increase at
least 0.3 and up to 2.2 metres from 2000 to 2100 [3]. On a planet whose surface is 70.8%
water and where two-thirds of the population live within 100 km of the coast (Kunz, 1991),
rising sea levels pose an immense threat to the safety of the people inhabiting coastal
catchment areas.
Monitoring changes to the coast and managing coastal defence measures are therefore
vital to ensure the safety of the people living in coastal areas. However, gathering and
evaluating data that reflect such changes, e.g. in beach width or water level, is often
time-consuming and expensive. In the past years, artificial intelligence (AI) and related
approaches such as machine learning (ML) and deep learning (DL) have become a cornerstone
of data science. These approaches have been applied in coastal, flood, and water
management, especially to detect water levels. For example, Hies et al. (2012) determined
the water depth in an open storm-water drainage canal in Singapore for water management
purposes using a pressure gauge and ML approaches. Guo et al. (2020) used a similar
approach that they applied to a scaled water gauge. Vandaele et al. (2021) used deep transfer
learning for water segmentation and water level prediction from river camera images, while
Chaudhary et al. (2020) created flood maps for water level prediction using smartphone
images and a DL approach. Besides the water level, other parameters relevant to coastal
engineering can be predicted with ML and DL approaches as well. Hoonhout et al. (2015)
determined the water line and beach width by applying image segmentation techniques to
webcam images. Wang et al. (2013) performed sea state detection using images and a
grey-level co-occurrence matrix (GLCM).
What is noticeable in the aforementioned papers is that all of the presented approaches use
either webcam or smartphone images. The developments in camera and lens technology,
especially in combination with the internet as a data-sharing platform, have increased the
volume of high-quality and continuous data available for analysis. Webcams, especially
those stationed at the coast, provide a continuous series of data, even of extreme events
where it would be too dangerous to send people out for measurements. On Norderney, a
webcam has been in use since 2012 and, even though the camera and angle of view were
changed at some points, it has been providing an almost continuous series of high-resolution
coastal images since then (Fig. 1). The beach, groyne, and ocean within these pictures can be used
to analyse various coastal parameters, for example, water level, sea state or wave height,
waterline position and more.
Fig. 1: Webcam image from Norderney, taken on 8th August 2015 at 10:10 UTC+2.
Norderney, as one of the East Frisian barrier islands fronting the Lower-Saxonian coast, plays
an integral role in protecting the main coastline from the effects of wind and waves. However,
the wave energy and storm event frequency have increased as a result of climate change and
extreme weather conditions becoming more common. In February 2022, three hurricanes hit
the German coast: Ylenia (16th and 17th Feb.), Zeynep (18th and 19th Feb.), and Antonia
(20th and 21st Feb.) [4]. These hurricanes damaged the beaches of the islands, which are part
of the coastal protection system and are used to push the surf zone seawards to protect the
dike base. On the coast of Wangerooge, for example, hurricane Zeynep caused massive
damage to the beach (Fig. 2), eroding around 90% of the sand [5].
Fig. 2: Beach on Wangerooge aer hurricane Zeynep.
These effects of storm surges and hurricanes add to the naturally occurring erosion caused
by the tidal current and prevailing wind conditions (Niemeyer, 1995), which negatively
impacts the barrier islands and makes more intensive coastal protection measures necessary.
A common method of beach protection at the barrier islands is nourishment with large sand
volumes (Kunz, 1991), especially after erosion induced by storm surges. However, it is
necessary to evaluate how long these measures last and whether they have a beneficial effect
on the protection of the coast; for this purpose, parameters such as beach width and waterline
position could be analysed to assess the performance of a measure. Webcams can capture
these parameters visually in their pictures, and AI-based approaches can combine the image
parameters with measured data to create automated routines for the detection of coastal
parameters from images.
It is therefore the aim of this thesis to develop an AI-based approach for the prediction of
coastal parameters, such as water level and wave height, from webcam images. The model
needs to be able to combine images and measurements (e.g. from wave buoys or tide
gauges) to gain knowledge about which features within an image correspond to certain
parameters and vice versa. The approach should be general enough that new image data
without matching measurements produces equally good prediction results as the data with
matching measurements. Errors should be low enough so that the approach can be used for
real coastal applications. Finally, the routines need to include pre-processing steps and the
option to retrain the model on new data without having to redo the entire model-building
procedure.
Within this thesis, chapter 2 presents information about the study site, the preliminary work,
and the state of the art. Chapter 3 describes the data, how it was pre-processed, and how it
was used for classification and regression tasks with both machine and deep learning.
Moreover, it provides insights into two examples, the ResNet18 deep learning and the GPR
machine learning models. Chapter 4 presents the results of the pre-processing, classification,
and regression, while chapter 5 discusses these results. The conclusion in chapter 6
summarises the findings of this thesis and offers an outlook for potential future research and
applications.
2. Literature Review
2.1 Study Site
Norderney is one of the East Frisian barrier islands, which are located offshore of the Lower
Saxonian part of the German North Sea coast (Fig. 3). To the east, the tidal inlet Wichter Ee
separates Norderney from its neighbouring island Baltrum; to the west, the broader
Norderneyer Seegatt separates Norderney from Juist. Norderney extends about 3 km further
north than Juist due to the tidal channel Dovetief, which is up to 20 m deep. This offset makes
the island more vulnerable to swell from westerly and north-westerly directions (Mittelstaedt, 2003).
Fig. 3: East Frisian Islands offshore of the German North Sea coast.
Sediment transport by the tidal current from the northern and western directions in the
German Bight causes Norderney, like the other East Frisian Islands, to drift eastward
(Niemeyer, 1995). Between 1650 and 1960, sedimentation caused by this continuous current
elongated the eastern end of Norderney by roughly 6 km [6]. The East Frisian Islands, unlike
the North Frisian Islands, are not part of the continental shelf and were formed during the
Pleistocene (about 2,580,000 to 11,700 years ago) and Holocene (the current geological
epoch). Their foundation consists of marine sediments and (sand) dunes. At the west end of
Norderney, the foundation consists of different layers of clay, marine clay, and loam
(Schlütz et al., 2021).
This rather “soft” Pleistocene shelf, in combination with the strong tidal currents, made
coastal management procedures necessary. Starting in 1858, the first revetment on the
German North Sea coast was constructed on Norderney (Behre & van Lengen, 1995; Kunz,
1991). The revetment consists of a combined system of shore-parallel structures and groynes.
While the groynes kept strong tidal currents away from the core of the island and the
shore-parallel structures prevented further dune erosion, both failed to prevent beach erosion
(Kunz, 1991). After reviewing this classic coastal protection approach, the Arbeitsgruppe
Norderney suggested in 1952 that the beach be restored.
Thus, the first beach nourishment on Norderney was carried out in 1951 (Kunz, 1991) to shift
the surf zone further seaward and protect the bank protection structures at the west end of
the island. Since 1951/1952, the nourishments have been repeated regularly every five to ten
years. The most recent (and 13th overall) nourishment was conducted in July 2019, with a
sand volume of 200,000 cubic metres taken from the Robbenplate, a sand bar northwest of
Bremerhaven [7]. The goal was to raise the beach level along a 1.8 km stretch covering
eleven groyne fields between groynes F and G1 (see Fig. 4-6 for before, during, and after
views, and Fig. 7 for the placements on the coast), with the new level being up to 2 m higher
than before the nourishment [8].
Fig. 4: Groyne field before the nourishment, 18 June 2019 at 9:00 UTC+2.
Fig. 5: Groyne field during the nourishment, 23 July 2019 at 9:00 UTC+2.
Fig. 6: Groyne field aer the nourishment, 27 July 2019 at 9:00 UTC+2.
Beach nourishments are an established method of coastal protection, as the broadening of
the beach pushes the surf zone further seaward, which in turn reduces the risk of a scarp
forming close to safety-relevant coastal structures such as revetments and embankments.
Besides conventional nourishments, combined nourishments, in which both the beach and
the shoreface are replenished, have been carried out on Norderney as well (Niemeyer et al.,
1997; Fig. 7).
Fig. 7: Groyne field with placements of combined (1992) and conventional (1989)
nourishments on Norderney (Niemeyer et al., 1997).
Measures such as traditional (Kunz, 1991) or combined (Niemeyer et al., 1997) nourishments,
as well as changes to the dune revetment (Thorenz & Blum, 2011), need to be evaluated with
regard to their ability to prevent (or at least lessen) beach erosion and protect the
foundation of the dykes. In coastal engineering, parameters that help evaluate the
performance of coastal management procedures are the water level, wave height, surf zone
width, and the rate of sedimentation vs erosion.
While wave buoys and tide gauges can measure changes in the wave field and the effect of
tidal currents on the beach, they cannot measure erosion. Although sedimentation and
erosion can be assessed with bathymetric methods such as lidar, these are often too costly
to be carried out over longer periods. However, developments in the field of camera
technology make it possible to gather long-term series of high-quality image data at
low procurement and manpower costs. Additionally, the developments in the field of
artificial intelligence and AI-based image analysis (see 2.3 Machine and Deep Learning) allow
for faster assessments once a suitable model has been established.
In 2012, a webcam, made by the manufacturers Keuschnig and Radlherr and commissioned
by the Norderney Zimmerservice, was installed at the western end of the island (Fig. 8). Its
angle of view covers a section of the groyne field that serves as part of the breakwaters
installed to protect the coast from erosion caused by tidal currents and waves. Given that the
camera has a fixed angle of view which has remained mostly unchanged since its installation
(see 3.1 Webcam Technology for details), the produced snapshot images are interesting from
a coastal engineering and management perspective.
Fig. 8: Webcam position on Norderney.
The snapshot images (e.g. Fig. 9) show the boardwalk, the beach, three groynes (the one to
the right being the most visible), the ocean, and the sky. These image regions, in combination
with data from nearby tide gauges or wave buoys, allow for image-based analysis of coastal
parameters such as wave height, water level, sedimentation, or surf zone width, making
Norderney an excellent choice of study site.
Fig. 9: View of the groyne field from the webcam, 18 July 2015 at 9:00 UTC+2.
2.2 Preliminary Work
Background Information
The preliminary work for this master's thesis is the bachelor's thesis “Analysis of Webcam
Footage from Norderney Beach” (original German title: Auswertung von Webcam-Bildern
eines Strandabschnitts auf Norderney) by Reuter (2022). The goal of that bachelor's thesis was
to calculate water level and wave height and to assess morphological changes of the beach
via analysis of webcam footage taken at the same site this master's thesis focuses on
(see 2.1 Study Site).
Due to obstructions of the view and changes of the camera, the timeframe from the 1st of
January 2015 to the 14th of October 2016 was used for the analysis, resulting in a dataset
of 92,000 images (compare 3.1 Webcam Technology). The webcam provides an image every
10 minutes, the water levels are measured every minute, and the wave heights have a
resolution of 30 minutes (compare 3.2 Data Sources). Since the data were linked based on
the image timestamps, short gaps in the wave measurements were linearly interpolated to
achieve a 10-minute resolution; for larger gaps in the data, analysis was not possible.
Bathymetric and topographic data for the evaluation of morphological changes on the beach
were not available.
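As a minimal sketch of this temporal alignment, assuming synthetic stand-in series and hypothetical variable names rather than Reuter's actual data, the measured values can be interpolated onto the 10-minute image grid in MATLAB:

```matlab
% Align wave heights (30-min resolution) with the 10-min image timestamps.
% All variables here are synthetic stand-ins, not Reuter's actual data.
tWave = datetime(2015,1,1) + minutes(0:30:1440)';  % wave measurement times
hWave = 0.5 + 0.3*rand(numel(tWave),1);            % wave heights [m]
tImg  = datetime(2015,1,1) + minutes(0:10:1440)';  % 10-min image grid

% Linear interpolation onto the image grid; longer gaps should be
% rejected rather than interpolated.
hAtImages = interp1(tWave, hWave, tImg, 'linear');

linkingTable = table(tImg, hAtImages, ...
    'VariableNames', {'Timestamp','WaveHeight'});
```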
Reuter developed a soware with five main steps. First, data were preprocessed by
combining images with measured wave heights and water levels and then removing images
with low brightness and incomplete data. Second, water levels were calculated from the
image and normalised using the measured data. Third, the height of incoming waves was
calculated by making use of the previously calculated water levels. Next, erosion was
evaluated by comparing images taken during ebb tide at different moments in time. Last, the
results were visualised: for water level and wave height, the calculated results were
presented as scatter plots, while erosion was shown using a comparison image.
Preprocessing
The preprocessing started with the creation of a linking table, which joined the image data
with the measured wave heights and water levels. The image and measured data were read
into MATLAB; smaller data gaps were linearly interpolated to achieve a 10-minute resolution.
The linking table consists of the path to an image and its timestamp, as well as the water
level and wave height measured at that time. Once the linking table had been created and
saved, the images taken during daylight hours were extracted. This extraction is necessary,
as images taken during night and twilight do not provide the contrast needed for image-based
analysis. Using the mean colour value of an ROI within the sky (Fig. 10), the images were
sorted into three categories: Daylight, Twilight, and Night. The ROI does not cover the entire
sky, in order to exclude the dark text and logo at the upper image border as well as passing
ships near the horizon, which could falsify the extraction results.
Fig. 10: Webcam image with the analysis area of the sky framed in red (Reuter, 2022).
The images were transformed from RGB, with almost 17 million colours, to 8-bit grayscale
with 256 grey levels, which reduced the processing workload of the program, as the threshold
between light and dark could be set to a single value. Images with a mean grey value below
120 were sorted into the Night category; a value greater than or equal to 155 resulted in an
image being sorted into the Daylight category; all images with values of at least 120 but
below 155 were sorted into the Twilight category. The images were then copied into
corresponding tables; the table containing the daylight images was used for analysis, while
the other tables were set aside for potential later use.
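A minimal sketch of this brightness-based sorting; the file name and the sky ROI coordinates are hypothetical placeholders, while the thresholds 120 and 155 are those given above:

```matlab
% Sort a webcam image into Night / Twilight / Daylight using the mean
% grey value of a sky ROI (file name and ROI coordinates are placeholders).
img  = imread('snapshot.jpg');
gray = rgb2gray(img);              % 8-bit grayscale, 256 grey levels
sky  = gray(1:100, 200:800);       % hypothetical sky region

m = mean2(sky);                    % mean grey value of the ROI
if m < 120
    category = "Night";
elseif m >= 155
    category = "Daylight";
else                               % 120 <= m < 155
    category = "Twilight";
end
```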
Water Level Calculation
The preprocessing step provided a table with the image names and corresponding measured
water levels; this table was used to determine the water level between the two groynes in
the images. The images were cropped to show only the area between the groynes and then
rotated so that the water line ran parallel to the lower image edge. To detect the water line,
the mean colour values of the water surface and of each image row were calculated; the
offset of the water line from the seafront (the lower image edge) was correlated with the
measured water levels. However, since the colour of the beach often did not differ enough
from the colour of the water (e.g. due to reflection or absorption of light by wet sand), this
approach was not pursued further.
The approach using mean colour values was altered to focus on an area with higher colour
contrast; for this purpose, the dark groyne on the right side of the image was chosen as the
ROI. Images from the Daylight table were rotated 45° counterclockwise to align the groyne
vertically and were cropped to the groyne and some of its surrounding area. The image was
then transformed in multiple stages (Fig. 11). First, the RGB image was converted to
grayscale (Fig. 11a), reducing the colour space from roughly 17 million to 256 colours and
thereby the computation time. A filter (not further specified by Reuter) was applied to
transform the grayscale images to black-and-white (Fig. 11b). For each image, a threshold
value was extracted from the water colour and used to decide whether a pixel is set to black
or white; the groyne was always set to black, while its surroundings were set to white. In
some cases, light reflections caused a false black colouration of areas within the image
(Fig. 11d). Various other filters were applied to enlarge the white area around the groyne
and isolate it from other black areas; the black areas outside the groyne were deleted
afterwards.
Fig. 11: Transformation stages of the cropped image of the groyne. (a) Grayscale image, (b)
black-and-white image, (c) image after application of filters, (d) a wave in the upper image
area is misclassified as part of the groyne (Reuter, 2022).
For the remaining black areas, the geometric centre was calculated (Fig. 11c). Only areas
whose geometric centre lay within the area of the visible groyne were considered for the
water level calculation. With an increasing water level, the water coverage of the groyne
increases as well; this fact was used to correlate the covered length with the measured water
level. For this purpose, the distance between the top edge of the image and the first image
row not covered by water was calculated; the number of rows corresponding to this distance
was then correlated with the measured data to allow a direct comparison.
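A minimal sketch of the row-counting idea, assuming an image in which binarization renders the groyne black and its surroundings white; the file name and binarization details are assumptions, not Reuter's documented choices:

```matlab
% Estimate groyne coverage by finding the first image row (from the top)
% that still contains visible groyne pixels.
img  = imread('groyne_crop.jpg');        % hypothetical cropped image
gray = rgb2gray(imrotate(img, 45));      % rotate so the groyne is vertical
BW   = imbinarize(gray);                 % groyne -> 0 (black), water -> 1

hasGroyne = any(~BW, 2);                 % rows containing black pixels
firstRow  = find(hasGroyne, 1, 'first'); % first row not covered by water

% firstRow (pixels from the top edge) is then correlated with the
% measured water levels to convert row counts into centimetres.
```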
As long as a part of the groyne is still visible, this approach for calculating the water level
works. Once the groyne is fully covered by water (at a water level of around 6.60 m), it can
only be concluded that the water level is high; the precise water level can no longer be
calculated.
Wave Height Calculation
According to Reuter, the calculation of the wave height was a more difficult task, since no
clear image-based feature exists for identifying the wave height. Therefore, multiple features
and approaches were tested and evaluated.
The first approach was based on the shadows produced by waves. An image region showing
the sea outside the surf zone was chosen for detecting wave shadows, with darker water
colour serving as the indicator for the presence of a shadow. Using a histogram of the
grayscale values, a threshold was chosen to differentiate between water with and without
shadows. Pixels below this threshold were counted; the larger the pixel area below the
threshold, the more shadow is visible. While this approach could be useful for images with a
strong contrast between shadowed and unshadowed water, it was not applicable to the
images from the Norderney webcam, as their contrast was too low. It was therefore not
possible to correlate the summed shadow area with the measured wave heights.
Another approach was based on the evaluation of sea foam caused by breaking waves. A
threshold differentiating the white sea foam from the surrounding sea was set based on grey
levels calculated as the weighted sum of the RGB channels (R = 0.299, G = 0.587, B = 0.114).
Diverse lighting conditions caused the threshold to vary strongly between images; therefore,
only images taken between 10:00 and 14:00 were evaluated, as the lighting conditions were
mostly consistent within this timeframe. Pixels above the threshold were recognised as sea
foam, and the total number of these pixels was correlated with the measured wave height.
While this approach provided usable results for very similar lighting conditions, it was
deemed too unstable for universal application.
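A minimal sketch of the foam-counting step; the ROI and threshold are assumptions, and the given weights are the ITU-R BT.601 luma weights that MATLAB's rgb2gray also uses (up to rounding):

```matlab
% Count sea foam pixels in a surf zone ROI as a wave height proxy.
img  = imread('surfzone.jpg');            % hypothetical image
gray = 0.299*double(img(:,:,1)) + ...     % weighted RGB sum from the text
       0.587*double(img(:,:,2)) + ...
       0.114*double(img(:,:,3));

roi       = gray(300:600, 100:900);       % placeholder surf zone region
threshold = 200;                          % hypothetical foam threshold
nFoam     = nnz(roi > threshold);         % number of bright (foam) pixels

% nFoam is then correlated with the measured wave heights.
```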
Given that the aforementioned brightness-based approaches would result in errors, Reuter
decided to predict the wave heights from the colour values of the surf zone instead. This new
approach consists of six steps. First, images from the Daylight table are read into the
workspace and converted to grayscale, in which sea foam appears light while wave shadows
appear dark. Then, the images are cropped to show only the surf zone; as the position of the
surf zone varies with the water level, an area large enough to contain all surf zone positions
was extracted. Afterwards, the extreme values of the grayscale image were extracted;
instead of using the minima and maxima directly, a tolerance range was applied to ensure
that e.g. people walking through the surf zone do not falsify the results. Next, the difference
between the extreme values was calculated: a high difference shows that both bright (sea
foam) and dark (trough) pixels are present, while a low difference corresponds to a uniform
water surface without waves. Lastly, the calculated differences are correlated with the
measured wave heights.
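A minimal sketch of the resulting contrast-range feature, using percentiles as one plausible way to implement the tolerance range (the percentile values and the crop are assumptions, not Reuter's documented choices):

```matlab
% Grayscale contrast range of the surf zone as a wave height feature.
img  = imread('snapshot.jpg');      % hypothetical daylight image
gray = double(rgb2gray(img));
zone = gray(250:650, :);            % placeholder surf zone crop

% Robust extremes: 1st/99th percentiles instead of raw min/max, so that
% isolated outliers (e.g. people in the surf zone) are ignored.
lo = prctile(zone(:), 1);
hi = prctile(zone(:), 99);

contrastRange = hi - lo;  % large range -> foam and shadows, i.e. waves
```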
Assessing Morphological Changes
The morphological changes due to erosion and sedimentation were evaluated for time
intervals between 2013 and 2020, as morphological changes occur on larger time scales than
changes in water level and wave height. Two approaches were used to visualise the erosion:
one superimposed images and highlighted differences through colour, the other created a
collage of the water lines from different images. Both approaches were applied to three
different intervals: monthly from January 2015 to October 2016 at a water level of 353 cm
± 2 cm, semiannually for May and November from 2013 to 2020 at the same water level, and
event-based for three storm surges between January 2015 and October 2016.
At a water level of 353 cm ± 2 cm, the water coverage of the beach is low, allowing a large
beach width to be assessed. Two images taken one month apart are superimposed to
highlight the differences between them (Fig. 12). In the superimposed image, identical pixels
appear grey, while differences are coloured. For the groyne, green pixels correspond to
erosion and magenta pixels to sedimentation; for the beach, on the other hand, green
corresponds to sedimentation and magenta to erosion. Comparisons near the water line and
within the surf zone are problematic, as these regions vary strongly with the incoming and
breaking waves. Additionally, it is not possible to quantify the movement of sand, as a thin
layer of sand provides the same colour information as a thick layer would under the same
lighting conditions. The constraint of using only images with the same lighting conditions
and water levels makes the image selection highly complex.
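This grey/green/magenta visualisation corresponds to standard false-colour image fusion, which MATLAB provides directly; a minimal sketch with hypothetical file names:

```matlab
% Superimpose two images taken a month apart: identical pixels appear
% grey, differences appear green/magenta ('falsecolor' is the default).
A = imread('beach_2016-05-04.jpg');   % hypothetical file names
B = imread('beach_2016-06-04.jpg');

C = imfuse(A, B, 'falsecolor');       % green/magenta difference composite
imshow(C);
```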
Fig. 12: Images from the 4th of May 2016 (top left), from the 4th of June 2016 (top right),
and their superimposed image (bottom) (Reuter, 2022).
The second approach revolved around creating a collage of cropped regions from multiple
images showing the surf zone and the water line (Fig. 13). Images with the same or a very
similar water level were used, and it was expected that only drastic changes in the water line
position would correspond to morphological changes.
Fig. 13: Collage of water lines from the 8th to the 14th of January 2015. The red line
corresponds to the water line, and the middle image shows the storm surge from the 11th of
January 2015 (Reuter, 2022).
Results
For the given image and water level data, the tide curve of the calculated water levels reflects
that of the measured data (Fig. 14) with an accuracy of 73.5% and an RMSE of 38.7 cm. The
coefficient of determination R² is 0.628 for the entire period under review, i.e. the calculated
water levels explain 62.8% of the variance of the measured data. Outliers are especially
prevalent at high measured water levels (Fig. 15, left); 9% of the calculated water levels lie
outside the first standard deviation.
Fig. 14: Tide curves of measured (blue) and calculated (red) water levels (Reuter, 2022).
The calculations of the wave heights did not provide results as good as those of the water
levels. The scatter around the regression line is much higher for the wave height calculations
than for the water level calculations (Fig. 15). Furthermore, R² is only 0.238, meaning that
less than a quarter of the variance of the measured values is explained by the calculated wave
heights. However, in 80.5% of cases the calculated values correctly reflect the general
magnitude of the measured wave heights.
Fig. 15: Scatter plots for water levels (le) and wave heights (right) (Reuter, 2022).
The assessment of morphological changes with the superimposed images showed no
changes for the monthly evaluation. Seasonal morphological changes were detected with
the semiannual evaluation: there was coastal erosion during the winter months and
sedimentation during the summer months. A semiannual evaluation using a collage was not
possible, as the camera and its angle of view were changed multiple times between 2013 and
2020. The event-based assessment with superimposed images detected both the
sedimentation caused by a beach nourishment in July 2019 and the erosion caused by storm
surges in the following winter. The collage approach did not detect any changes in the water
line before and after the storm surge of 11 January 2015. Overall, a quantitative evaluation
of morphological changes is not feasible using webcam images alone.
2.3 Machine and Deep Learning
2.3.1 Background Information
Machine learning (ML) and deep learning (DL) are both subsets of artificial intelligence (AI),
with DL in turn being a subset of ML (see the figure showing ML and DL as parts of AI,
simplified from [9]). AI itself is a branch of computer science that aims to replicate or
simulate human intelligence in machines (Hipwell & Alexander, 2022). There are four types
of AI, based on the type and complexity of the task the system can perform: reactive
machines, limited memory, theory of mind, and self-awareness [10]. Within the scope of this
thesis, the type of AI being worked with is limited memory, as the system stores data,
features, and past predictions in order to make new predictions on new data. For example,
the transfer learning applied in this thesis is an ML approach that makes use of
limited-memory AI [11].
In machine learning, algorithms are trained to identify connections between predictors (e.g.
image features) and responses (e.g. measured data) and to recognize patterns. This learned
information is then used to make predictions on new, unknown data. The algorithms
effectively write their own rules based on the information they have learned; however, the
user must provide the algorithm with suitable training data. What constitutes suitable data
depends on the ML type being used; ML is divided into supervised, unsupervised,
semi-supervised (a combination of supervised and unsupervised learning), and
reinforcement learning [12].
For unsupervised learning, the algorithm receives sample data but no target variables.
Dependencies and patterns are learned by the algorithm itself, and there is no “desired”
result provided by the user. Since this type of learning does not need labelled data, the effort
on the user side is minimal, and the data can be used in real time [13]. Unsupervised learning
is used for clustering (e.g. pattern and structure detection using k-means), associations (e.g.
finding correlations using FP-Growth), and dimensionality reduction (e.g. using PCA
[principal component analysis] or k-NN [k-nearest neighbours]). However, since the sample
data is used on its own without target variables, this approach cannot be used for
predictions, as these require known target outputs beforehand.
Reinforcement learning is a type of machine learning in which a system is trained within an
environment and has to make sequential decisions to interact with it. Depending on the cost
function, an action is either rewarded or punished [14]. Since this type of ML is mainly used
in robotics and does not work on static data (i.e. previously extracted features and measured
target variables), it was not applicable within the scope of this thesis.
Considering that the goal of this thesis is the prediction of wave heights and water levels
based on image features, the regression models were trained using supervised learning.
During the training stage, the models were provided with sample data (features derived from
webcam snapshots) and target variables (measured wave heights and water levels). Unlike
with unsupervised learning, training data has to be generated by the user by extracting
features from the input data (predictors) and combining them with their corresponding
target variables (responses). Supervised learning is used for regression (numeric output) and
classification problems (discrete output; i.e. probability of a class being true or false) [15].
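A minimal sketch of this supervised setup with Gaussian process regression, the model family that later performed best in this thesis; the feature matrix and responses are synthetic stand-ins, not the actual image features:

```matlab
% Supervised regression: image-derived features (predictors) paired with
% measured wave heights (responses). Data here are synthetic stand-ins.
rng(1);
X = rand(500, 3);                                  % stand-in feature matrix
y = 0.8*X(:,1) - 0.3*X(:,2) + 0.05*randn(500,1);   % stand-in wave heights [m]

mdl  = fitrgp(X, y);                % train a Gaussian process regression model
yHat = predict(mdl, rand(10, 3));   % predictions for new feature rows
```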
Deep learning makes use of artificial neural networks (ANNs), whose structure is inspired by
the human brain and which aim to mimic the way neurons communicate with one another.
The three main types of neural networks are multilayer perceptrons (MLPs), convolutional
neural networks (CNNs), and recurrent neural networks (RNNs). MLPs are feedforward
networks consisting of an input layer, one or multiple hidden layers, and an output layer [16].
Since most real problems are not linear, MLPs use sigmoid neurons rather than perceptrons.
This is crucial, since the graded response of sigmoid neurons ensures that small changes in
the weights and biases do not have a large effect on the output, unlike perceptrons with
binary logic, where a small change might flip the output from e.g. 1 to 0 [17].
CNNs are similar to feedforward networks but apply convolutions to identify patterns in
images, which is why they are often used for image and pattern recognition and computer
vision. RNNs make use of feedback loops and are mainly used on time series data to make
predictions of future events (e.g. stock prices) [16].
The main difference between machine and deep learning is that the latter, by making use of
ANNs, can work with unstructured data (e.g. images, text) by converting it to numerical
values. The extracted information (features) can then be used e.g. for pattern recognition or
predictions, making the approach similar to unsupervised learning. Deep learning is
especially useful for large datasets, as its performance keeps improving with the amount of
data available rather than saturating the way traditional ML approaches do. However, the
ability to process large amounts of unstructured data and to extract features automatically
comes with the downside of high computational requirements. Both the matrix convolutions
and the calculation of the neuron weights result in long computation times, making GPUs
necessary, both to speed up the process and to ensure that the hardware can handle the
computational load [18].
Within the scope of this thesis, CNNs (e.g. ResNet18, see 3.4.2 Network Example: ResNet18 for
details) were used for the deep learning approach. The networks were trained for both
classification and regression problems (see 3.4 Classification and Regression with Deep
Learning). The machine learning approach used supervised learning to solve a regression
problem (see 3.5 Regression with Machine Learning).
2.3.2 Application in Coastal Engineering
The literature research on applications of machine and deep learning to coastal imagery
revolved around the following questions: Which approaches exist and have been used for
coastal applications? Were images from webcams or coastal monitoring stations (e.g. Argus),
especially snapshot images, used as inputs? Can these approaches be used to determine
parameters such as water level and wave height? Given the MATLAB preprocessing scripts
from the preliminary work of Reuter (2022), are any of the presented approaches applicable
in MATLAB? Were any of the approaches applied to barrier islands, ideally with conditions
similar to those on Norderney? A paper reviewing various machine learning approaches for
coastal engineering purposes is Goldstein et al. (2019).
Within the review by Goldstein et al. (2019), three papers were presented that focus on the
application of machine learning approaches to barrier islands. As the study site of this
thesis (see 2.1 Study Site) is a barrier island, the application of ML to other barrier islands
could provide useful cues as to how such an approach could be designed. The papers by
Poelhekke et al. (2016), Plomaritis et al. (2018), and Gutierrez et al. (2015) all apply a
Bayesian network (BN) approach to data from barrier islands to evaluate the morphological
development and changes of the islands over time, especially in response to storm events.
Goldstein et al. (2019) criticise that these papers do not provide access to the code used to
develop the BNs. While structures, parameters, and testing and evaluation methods are
presented, the lack of direct access to the models makes reproduction and further
development of the results difficult, if not outright impossible. For example, Gutierrez et al.
(2015) state that the bin widths of the conditional probability tables (CPTs) for nodes were
determined by "subjectively balancing" the need to have enough bins, and that their primary
BN was “designed subjectively”, raising the question of how these steps could be
reproduced.
Plomaritis et al. (2018) reference the model developed and used by Jäger et al. (2018) for
coastal risk analysis. The presented BN is described in detail and the code is available
open-source [19]; it combines a source-pathway-receptor (SPR) model and disaster risk
reduction (DRR) measures to perform coastal risk analysis and support decision-making in
coastal risk management. The input data for this algorithm consist of hindcast and synthetic
extreme-event scenarios, information on land use, and vulnerability relationships; included
within this dataset are water levels, currents, wind conditions, and wave spectra. However,
the model does not use images as an input, and rather than predicting sea states or
numerical values for wave height and water level, it calculates hazard probabilities and the
effectiveness of DRR measures (Jäger et al., 2018).
Since none of the models developed in the aforementioned papers used images as data
input, it is questionable whether they would be suitable for the problem presented in this
thesis, where wave heights and water levels are to be predicted from coastal webcam
imagery. A more suitable approach was found in Hoonhout et al. (2015), where webcam
images were used to determine the water line, among other parameters, by means of image
segmentation. Hoonhout et al. (2015) used a manually annotated dataset from ArgusNL,
consisting of 192 images from four coastal camera stations along the Dutch coast. These
webcams take snapshot images every 30 minutes; they also store 10-minute mean, variance,
minimum, and maximum images, which, however, were not used in the paper. The images
were over-segmented, and nine classes were manually assigned; these initial classes were
then aggregated into five main classes: water, sand, vegetation, sky, and objects.
Over-segmentation was achieved with the SLIC algorithm, which creates superpixels
(Fig. 16), i.e. segments of similar pixels, to boost the number of features for classification.
A region-growing algorithm was used to post-process the over-segmentation and remove
scattered superpixels, as these can cause different regions to be classified as the same region.
Fig. 16: Original (le) and over-segmented (right) image (Hoonhout et al., 2015).
Superpixels have variance, patterns, texture, colour, shape, and topological relations, which
were used to generate new features; location, shape, and texture features were used as well.
In total, 1727 features from the categories position, intensity, shape, and texture were used
by the classification algorithm, an SSVM (structured support vector machine) that minimises
a regularised cost function. The algorithm was implemented in Python, and its performance
was measured as the percentage of correctly predicted classes. An average accuracy of 93.0%
with a standard deviation of 0.7% was achieved for 140,000 test instances. Beach width and
waterline position were determined as well, and variations of the latter could be determined
to O(10 m) (Hoonhout et al., 2015).
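MATLAB offers a SLIC-based over-segmentation out of the box; a minimal sketch of the superpixel idea (the image and segment count are placeholders, and this is not the Python/SSVM pipeline of Hoonhout et al.):

```matlab
% SLIC-style over-segmentation into superpixels, as used to boost the
% number of classifiable segments in coastal images.
img    = imread('coast.jpg');          % hypothetical coastal image
[L, n] = superpixels(img, 500);        % label matrix, actual segment count

BW = boundarymask(L);                  % superpixel boundaries
imshow(imoverlay(img, BW, 'cyan'));    % boundaries drawn on the image
```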
The above-mentioned papers present approaches mainly used to detect the water line within
an image. However, another parameter this thesis focuses on is the water level at the groyne
on the right side of the webcam images from Norderney (e.g. Fig. 9). Using a combined
search with the keywords “machine learning” AND “water level” AND “image processing”,
the goal was to find ML approaches that can identify the water level in an image. The vast
majority of papers matching these terms come from the discipline of water management,
where image-based water level detection is used for drainage channels or fuel tanks. The
methodology of these approaches often involves a scaled water gauge within the field of
view (FoV), from which the water level is determined, e.g. through image segmentation or
edge detection. While the images from Norderney do not feature a scaled gauge within their
FoV, the groyne, in combination with data from the tide gauge at Norderney-Riffgat, can be
used as a stand-in for the gauges of the water management papers. Notable examples are
Hies et al. (2012), Vandaele et al. (2021), and Guo et al. (2020).
Hies et al. (2012) determined the water depth in an open storm-water drainage canal in
Singapore for water management purposes. Images from a surveillance camera with a view
of a white pressure gauge were used to extract the water level. These images were
transformed to show an undistorted frontal view of the gauge, and the area from the bottom
of the channel to the top of the gauge was extracted as the ROI (Fig. 17). Edge detection
algorithms were applied to find the border between water and gauge; finally, a Hough
transformation was applied to find the longest straight line in the edge image, which
corresponds to the water level. The water depth was then interpolated from the water level
and previous field measurements. The system can return water depths in real time; during
the study, the camera data had a deviation of 1.1%, while that of the pressure gauge was
3.9%.
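A minimal sketch of this edge-plus-Hough idea in MATLAB; the image and parameters are placeholders, not those of Hies et al.:

```matlab
% Find the water line as the longest straight line in an edge image.
img   = imread('gauge_view.jpg');        % hypothetical rectified ROI
edges = edge(rgb2gray(img), 'canny');    % edge detection

[H, theta, rho] = hough(edges);          % Hough transform
peaks = houghpeaks(H, 1);                % strongest line candidate
lines = houghlines(edges, theta, rho, peaks);

% The detected line's row position is then mapped to a water depth
% using previous field measurements.
```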
Guo et al. (2020) used a similar approach, applied to a scaled water gauge. Their
preprocessing steps likewise consisted of tilt correction, edge detection, and Hough
transformation. Additionally, they used image segmentation based on the largest inter-class
variance (OTSU, i.e. Otsu's method), which determines the binarization threshold for
segmentation, as well as difference images. Water levels were classified using sparse
representation, resulting in an error of < 0.9 cm between calculated and measured water
levels.
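Otsu's threshold is available directly in MATLAB; a minimal sketch with a hypothetical image:

```matlab
% Binarize a gauge image with Otsu's method (maximum inter-class variance).
img   = rgb2gray(imread('gauge.jpg'));  % hypothetical gauge image
level = graythresh(img);                % Otsu threshold in [0, 1]
BW    = imbinarize(img, level);         % segmentation, e.g. gauge vs water
imshow(BW);
```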
Instead of using machine learning, Vandaele et al. (2021) used deep transfer learning for
water segmentation and water level prediction from river camera images. Water levels were
estimated from images by using measurements of the heights of particular landmarks or
objects in the field of view of the camera (Fig. 18). Two datasets for semantic segmentation
algorithms were used: COCO-stuff and ADE20k. These were applied on fully convolutional
networks (FCNs); ADE20k was used on an FCN with a ResNet50 encoder and an UPerNet
decoder, while COCO-stuff was used on the DeepLab FCN with a ResNet101 encoder and an
atrous spatial pyramid pooling decoder. After the transfer learning phase, experiments were
performed on LAGO and INTCATCH datasets to assess whether sample selection or
fine-tuning provides better results; the latter provided the best results. Overall, the best
results in segmentation could be achieved by training the networks from scratch on the LAGO
dataset; while the DeepLab FCN performed better river edge detection, it generally
misclassified more segments in the whole image than the other FCN.
Fig. 18: Images from the Farson river camera. Left: landmarks marked with red dots; right: the same image with segmented water surface and annotated green landmarks (Vandaele et
al., 2021).
Chaudhary et al. (2020) created flood maps using smartphone images and a VGG16 network
pretrained on the ImageNet dataset. Previously, they had performed flood height prediction
using deep learning and object instance segmentation; building a large, pixel-accurate
annotated dataset had, however, been a considerable effort (Chaudhary et al., 2019). In their
new approach, flood estimation was defined as a per-image regression problem combined
with ranking loss, which reduced the labelling load, as relative ranking is easier to annotate.
The VGG16 network was trained in two parts: in the first, it received the known absolute flood water level for each image, and in the second, the ordering relation. A new dataset
DEEPFLOOD consisting of 8145 ground-level images with water level annotations was created
and then split into two subsets: DF-OBJ, which contains pixel-accurate object instance labels
and flood level annotations per image, and DF-IMG, which contains a single water level
annotation per image. Multiple experiments were conducted on these datasets; a
combination of ranking (trained on DF-IMG) and regression (trained on DF-OBJ) losses
performed best.
Fig. 19: Images taken by an ASV at different sea state levels (Wang et al., 2013).
In Wang et al. (2013), sea state detection from images was performed by computing a
grey-level co-occurrence matrix (GLCM) from image texture features and then extracting the
features of the GLCM. A GLCM represents the spatial relationship between pixels in an image,
which reveals texture information, e.g. of homogeneity or recurring patterns. Therefore,
GLCMs and their features play a fundamental role in texture analysis [20]. Wang et al. used
the properties of contrast, correlation, energy, and entropy for their research (2013). Contrast
measures the local variations (difference in pixel values of a reference pixel and its
neighbouring pixel) of the GLCM, correlation measures the joint probability occurrence of the
specified pixel pairs, energy provides the sum of squared elements in the GLCM and indicates
the uniformity within an image, and homogeneity measures the closeness of the distribution
of elements in the GLCM to the GLCM diagonal [21]. The images used by Wang et al. (2013)
were taken by an autonomous surface vehicle (ASV) at different sea state levels (Fig. 19); the
levels ranged from 1 (top row) to 4 (bottom row). The GLCM properties were extracted from
these images to detect the sea state. Analysis of the discernibility between sea states
depending on GLCM features showed that only contrast had differentiated well between the
sea states. It was proposed to improve the discernibility further by computing the SSCQ (sea
state characteristic quantity) as a logarithmic transformation of the contrast. Tests proved
that the SSCQ could indeed improve the discernibility and that the SSCQ as a texture-based
feature was only slightly affected by different light conditions in the images.
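In MATLAB, GLCM features of this kind can be computed with the Image Processing Toolbox; the following sketch assumes a hypothetical image file, and the logarithm of the contrast is only an approximation of the SSCQ transformation proposed in the paper:
I = rgb2gray(imread('seastate.jpg'));        % hypothetical ASV image
glcm = graycomatrix(I);                      % grey-level co-occurrence matrix
p = graycoprops(glcm, {'Contrast','Correlation','Energy','Homogeneity'});
sscq = log(1 + p.Contrast);                  % assumed form of the logarithmic transform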
Jörges et al. (2021) used long short-term memory (LSTM) neural networks to reconstruct and
predict nearshore significant wave height (SWH) to assess the effect of Ebb-Tidal Delta (ETD)
sandbanks on the wave climate off the Norderney coast. Wave data was taken from the three
Waverider buoys Coast Ref, Coast I (both north of the ETD sandbanks), and Coast II (located
south of the ETD sandbanks, right in front of the Norderney beach), each with a 30-minute
resolution. Additionally, wind data from the DWD of the Norderney station (10 min
resolution), water level data by the Waterways and Shipping Administration (WSA) for the
gauge level Norderney-Riffgat (1 min resolution), and bathymetries from yearly surveys by
the NLWKN and EasyGSH-DB were used. Two LSTM networks, a standard and a parallel (P-LSTM) model, were compared to other deep learning (SL-FFNN) and machine learning (SVR, RF,
MLR) approaches. Tests were done including and excluding the bathymetric data to assess its
influence on the reconstruction and prediction tasks. Using the bathymetric data improved
both tasks; the P-LSTM structure achieved an RMSE of 0.069 m for the reconstruction when
bathymetric data was included.
3. Materials and Methods
3.1 Webcam Technology
The webcam located inside the Kaiserhof building in the northwest area of Norderney is a
Canon EOS 1100D reflex camera with an EF-S 18-55 mm / 3.5-5.6 IS II lens [22]. Its viewing
direction is towards the West with a slight tilt northwards (Fig. 20). Within the analysed
period, the focal length is fixed at 18 mm while the aperture and exposure time vary depending on the lighting conditions.
Fig. 20: Camera angle of vision with positions of the camera, the wave buoy (Boje), and the
water gauge (Pegel) (Reuter, 2022).
The webcam takes a snapshot image (Fig. 21) every ten minutes and uploads it to the
foto-webcam.eu website. The weather data at the top left of the image is provided by the
Deutscher Wetterdienst (DWD) using half-hourly meteorological data from the Norderney
weather station. The logo at the top right is that of Norderney Zimmerservice, the owner and
operator of the webcam. The webcam images have an aspect ratio of 16:9 with an original
size of 4272 x 2600 px and use the RGB colour space without an alpha channel.
Fig. 21: Image taken by the Norderney webcam on 20th August 2015 at 08:50 UTC+2.
As seen in Fig. 21, the camera faces the beach at an angle instead of head-on. The lower third of the image shows the dike summit and the seafront. Since the image was taken in summer
during sunny weather, the Kaiserhof building casts a strong shadow onto the grass.
The groyne, which is the designated region of interest (ROI), is framed in red with the label
“OuterRectangle”. It is positioned within the centre third, which shows the beach, the ocean,
and two other groynes. The groynes in the middle left could be used as a control section for the water level detection; however, because the angle of view does not show the water cover well, this option was ultimately rejected. Given that the rectangular ROI includes parts of the
surf zone, it can be used for wave height prediction as well. The upper third of the image
shows the sky and the aforementioned image annotation. While the lower and upper third do
not include the groyne as the main research focus, they are needed for preprocessing the
images (see section 3.3.1 Preprocessing the Image Data).
While the whole image dataset spans from the 1st of January 2015 to the 14th of October
2016, only the period from the 1st of January to the 31st of December 2015 was used for
model training purposes. Both the 2015 and 2016 subsets were used for predictions. The
2015 data were used for prediction to analyse how much the validation RMSE from training
the models might differ from the prediction RMSE. The 2016 dataset from the 1st of January
to the 14th of October 2016 was used for prediction to see how well the model performs on
previously unseen data, where it does not know the true response (measured wave height
and water level). The specific time frame from the 1st of January 2015 to the 14th of October
2016 was chosen to allow for comparability with the results from the preliminary work
(Reuter, 2022). Reuter lists multiple reasons for choosing the specific time frame, and they
will be illustrated here for reasons of comprehensibility.
While the webcam has been taking pictures since the 15th of February 2012 at 12:30 UTC+1,
both the camera and its angle of view have been changed multiple times. As seen in Fig. 22
from 2012, the angle of view differs notably from the one in Fig. 21, which was taken in 2015.
While the angle seen in Fig. 22 was changed on the 23rd of February 2012 and stayed the
same until the camera replacement in 2016, images taken within the first week of operation
do not have an angle comparable to the period of interest. Additionally, many of the images
taken before 2015 contained obstructions in the form of raindrops on the camera lens,
covering the groyne and surf zone, making the images unsuitable for analysis (Reuter, 2022). Afterwards, protection against the rain was installed, resulting in only a few images in the
2015-2016 dataset having to be removed due to raindrops covering the view.
Fig. 22: Webcam snapshot from the 18th February 2012 at 12:30 UTC+1.
Following the camera change after construction work in November 2016, the angle of view was changed once again (see Fig. 23). The images after the 14th of October 2016 until the
camera change were not used as the view was obstructed by scaffolds. While the new
viewing angle from 2017 onwards might be even more suitable for the deep and machine
learning approaches using only the groyne for water level and wave height prediction, the
comparability with the preliminary work would not be given. Starting from the 22nd of July
2019 (see 2.1 Study Site), the groyne is covered by sand from a beach nourishment, making the images unsuitable for analysis. The sand from the beach nourishment covered the groyne until early May 2020; afterwards, the groyne is visible enough to be used for analysis
purposes again, and no further long-time obstructions occur.
Next to comparability, availability also played a strong role in the choice of the timeframe.
The foto-webcam.eu server unfortunately only allows images to be downloaded manually
and individually, making it difficult to build a sufficiently large dataset quickly. As the full
dataset from 2015 and 2016 was readily available from Reuter (2022), the preliminary work against which the approaches in this thesis are compared, it was chosen as the period of interest.
Fig. 23: Webcam snapshot from the 15th September 2017 at 12:10 UTC+2.
3.2 Data Sources
3.2.1 Wave Data
The wave data was provided by the Coastal Research Station of the Lower Saxony Water
Management, Coastal Defence and Nature Conservation Agency (NLWKN) taken from the
Coast II Datawell Directional Waverider MkIII wave buoy located in front of the north-western
beach of Norderney (N 53.715653 E 7.141280) (Reuter, 2022; Jörges et al., 2021). The buoy
records data every 30 minutes; this includes the measurement timestamp, peak period TP,
peak direction DirP, significant wave height HS, Godaʼs peakedness parameter QP, and the
significant steepness SS. The buoyʼs measuring radius for the heave is 20 m in both positive
and negative directions, with a resolution of 0.01 m. For the first three years after calibration, the error is less than 0.5% of the measured wave height; afterwards, it increases to less than 1% of the measured values. The range of the direction measurement is 0° to 360° with a resolution of 1.4°. The heading error is, depending on latitude, between 0.4° and 2°, but typically 0.5°. A
period of 1.6 to 30 seconds can be measured [23].
Fig. 24: Measured vs linearly interpolated wave heights from 2015.
In the period under review, the measured wave height data contains gaps. While smaller gaps
(e.g. spanning a few hours only) can be resolved by linearly interpolating the data, larger
gaps (Fig. 24) would introduce errors and are therefore not considered for the analysis. For
the 2015 dataset, there are three major data gaps: from the 5th of May until the 4th of June,
from the 5th until the 14th of October, and from the 24th of November until the 8th of
December. The 2016 dataset (Fig. 25) has major gaps from the 4th to the 8th of January, from
the 18th to the 30th of March, from the 14th to the 20th of September, and from the 25th of
September until the 4th of October.
Fig. 25: Measured vs linearly interpolated wave heights from 2016.
The data was not only linearly interpolated to close gaps in data but also to match the
10-minute resolution of the webcam images.
3.2.2 Water Level
Fig. 26: Tide gauge Norderney-Riffgat.
The water level data was recorded by the
Norderney-Riffgat tide gauge of the
hydrological information system of the
Federal Waterways and Shipping
Administration (WSV). The black and yellow
mooring post (Fig. 26) is located at the
south-western end of Norderney (53°41'47"
N - 07°09'28" E) [24] and records data every
minute.
The gauge data was provided by the NLWKN
as CSV tables consisting of the records for
one year. For the period under review, from
the 1st of January 2015 until the 14th of
October 2016, the gauge data is mostly
complete.
The gauge data, which was recorded every minute, is then normalised to the timestamps of
the images, which were taken at a 10-minute resolution. Gaps that are visible in the
visualisation of the water level data can be caused either by missing images, as the water
levels are normalised to the image timestamps and images taken during twilight or night
were discarded (see 3.3.1 Preprocessing the Image Data), or by missing gauge data. For the
2015 data, no larger gaps are present (Fig. 27). The 2016 data on the other hand is less
consistent than the 2015 dataset; in early January and mid-August, there are larger gaps
spanning multiple days (Fig. 28).
Fig. 27: Water levels from 2015, normalised to the 10-minute resolution of the image data.
Fig. 28: Water levels from 2016, normalised to the 10-minute resolution of the image data.
3.3 Processing and Formatting Data
3.3.1 Preprocessing the Image Data
Before using the images for any task or approach, they need to be preprocessed to ensure
the best possible results. The processing steps described in this section focus on extracting
the images taken during daylight hours, creating a linking table between images and other
data, and cropping the images to the region of interest (ROI). The creation of regression data
from the cropped image is described in 3.3.2 Creating Regression Data, while the further
processing of images for classification with deep learning is described in 3.3.3 Preprocessing Image Data for Classification.
The general procedure (see figure to the left) is as follows:
first, both the folder with the webcam images and the
measured wave height and water level data are loaded into
the workspace. The timestamps of the images are extracted
from the image names. Using the image timestamps, the
measured wave heights and water levels corresponding to
these timestamps are extracted. Gaps in the data are removed
and the data is updated accordingly. Subfolders for the
different lighting conditions are created, and an ROI is defined
from an exemplary image. The images are sorted into
different subfolders depending on the prevalent lighting
conditions; if an image is registered as a Daylight image, it will
be cropped to the bounds of the ROI. Last, a linking table that
joins images, water levels, wave heights, lighting conditions,
and beach visibility is created.
Load Data into Workspace
The first step of the preprocessing workflow is to load the recorded data and images into the
MATLAB workspace to work with them. For this to work correctly, all folders have to be within
the same folder layer, meaning that they have the same parent folder. Here, the folder for the
scripts is called Matlab, while the image folder for the 2015 snapshots is Bilder2015, and the
folder containing the measured data is Daten. A list with all images is created from the image
folder; this list is used again later (see Appendix I: Preprocessing with Interpolation).
Extract Timestamps from Images
Next, the timestamps are extracted from the images and saved into the variable zst. All
images have a naming convention similar to norderney-150808-1600-hu with the JPEG
ending. This name contains the timestamp in the format YYMMDD-hhmm, meaning that the
year, month, day, hour, and minute of the time a snapshot was taken can be extracted. The
script loops through all images in the list piclist until all timestamps have been extracted.
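For a single file name, this extraction could look as follows; the regular expression is an illustrative sketch, not necessarily the pattern used in the actual script:
name = 'norderney-150808-1600-hu.jpg';
ts = regexp(name, '\d{6}-\d{4}', 'match', 'once');    % '150808-1600'
t = datetime(ts, 'InputFormat', 'yyMMdd-HHmm');       % 08-Aug-2015 16:00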
Get Wave Heights and Water Levels
Once the timestamps have been extracted from the image names, they can be used to find
the corresponding measured wave heights and water levels. The image timestamps are used
to normalise all data to the 10-minute resolution of the image; normally, the wave heights
are recorded every 30 minutes, while the water levels are recorded every minute (see 3.2
Data Sources).
The script loads the file allewstand.mat into the workspace; this file contains previously
saved water levels wst and their timestamps dattim. If this file has not been created yet, the
script wstand13bis20.m can be used to create this file and its variables from CSV tables of
the gauge data (see 3.2.2 Water Level). The preprocessing script then creates a vector of
the size of the image list and assigns it to the variable wstand for the water levels. Using a
loop, all entries wst whose timestamp dattim corresponds to the image timestamp zst are
saved into wstand; therefore, gaps in the image data can result in gaps in the water level
data used for regression.
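A minimal sketch of this matching step, assuming zst, dattim, and wst are column vectors as described above, could use ismember:
wstand = NaN(length(piclist), 1);      % one water level per image
[tf, loc] = ismember(zst, dattim);     % which image timestamps occur in the gauge record
wstand(tf) = wst(loc(tf));             % copy the matching gauge readings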
The wave heights HS are extracted from the file allewelle.mat, which contains previously
saved wave parameters. If this file does not exist yet, the wave parameters (the measurement timestamp, the peak period TP, the peak direction DirP, the significant wave height HS, Godaʼs peakedness parameter QP, and the significant steepness SS) can be
extracted from HIS files using the script wellen15bis16.m. Instead of looping through all
data, this part of the preprocessing script finds instances of the image timestamp zst that
are found in the vector of wave measurement timestamps dattimwelle. This extracts the
measured wave heights HSfor any image taken at the same time; however, the resulting
vector welleHs can contain gaps. Given that the resolution of the wave height
measurements is 30 minutes and that of the images 10 minutes, there is generally not as
much measured data available as there are images. Using linear interpolation, these small
gaps can be filled. However, the wave data for 2015 and 2016 was unfortunately not
complete, meaning that there are gaps spanning multiple days and sometimes even an
entire month. Filling these gaps with interpolation would result in errors, which is why they
have to be removed.
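A sketch of such gap filling, assuming missing wave heights are stored as NaN in welleHs, could look as follows; larger gaps still have to be removed beforehand:
gap = isnan(welleHs);                                          % images without a 30-minute record
welleHs(gap) = interp1(zst(~gap), welleHs(~gap), zst(gap));    % linear interpolation by default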
Remove Gaps and Update the Data
Measured data can contain errors such as gaps or phase shifts, which would negatively affect analysis if left untreated. Visualising the data with plots helps to find larger gaps (see figures 24, 25, 27, and 28 in 3.2 Data Sources). Using MATLAB figures, it is possible to zoom in on the
starting and ending points of the gaps and to get their timestamp by clicking on the dots
corresponding to a measurement. These timestamps can then be used to index into the
image list to remove the gaps from it. If larger gaps were present and removed from the
image list, its dimension is now no longer compatible with that of the timestamp zst, water
level wstand, and wave height welleHs. Therefore, it is necessary to update these variables
so the dimensions of the vectors fit each other again; this is simply done by re-running the code that previously extracted the timestamps, water levels, and wave heights on the new image list.
Create Subfolders
The next step consists of creating new subfolders for the images sorted by lighting conditions
within the main image folder. First, the path to the main folder is appended with the names of
the new subfolders. Should subfolders already exist and need to be overwritten, they can be
removed by using rmdir. New directories are then made using the mkdir command. These
newly created subfolders are used later for the sorting step of the main workflow.
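A minimal sketch of this step, assuming the image folder path is stored in a variable imgfolder, could look as follows:
subs = {'Daylight', 'Twilight', 'Night'};
for k = 1:numel(subs)
    d = fullfile(imgfolder, subs{k});
    if isfolder(d), rmdir(d, 's'); end    % overwrite an existing subfolder
    mkdir(d);
end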
Define ROI from Image
For the task of calculating the wave height and water level from an image, an unchanging
element within the image was chosen as the region of interest (ROI). For the webcam images
from the 1st January 2015 to the 14th October 2016, this is the groyne on the right image
side. The query of a low water level (wstand < 350 cm) during calm sea conditions (welleHs
< 10 cm) is used to index into the image list to find an image where the groyne is almost
entirely visible (Fig. 29), allowing the user to select the section of the image that shows the
ROI the most clearly. The position of the here-defined ROI is then saved into the variable pos
to be used again in the next step.
Fig. 29: Webcam snapshot with the groyne framed in red.
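A sketch of this selection, assuming the folder path variable imgfolder and an interactive rectangle selection, could look as follows:
idx = find(wstand < 350 & welleHs < 10, 1);           % first calm, low-water image
imshow(imread(fullfile(imgfolder, piclist(idx).name)));
pos = getrect;                                        % user-drawn ROI: [xmin ymin width height]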
Sort, Crop, and Save Images
This step sorts the images into different folders based on the prevalent lighting conditions
and crops the images taken during daylight hours to show only the previously defined ROI. A
new variable Light is created which will contain information about the lighting conditions
for each image and will be added to the final linking table. The first image in piclist is
displayed for the masks defined in the loop to work properly.
The code then loops through all of the images in piclist and converts them from RGB to
grayscale. Two masks are created for a part of the sea and of the boardwalk; together with a
section of the sky, these are used to detect the lighting conditions (Fig. 30). The dimensions
of the sky ROI were taken from the daylight image detection by Reuter (2022), while those of
the other ROIs were determined from tests with multiple images. These positions are fixed in
the code and need to be altered if a different angle of view or site is used. The mean of the
grey values is then used to sort the images into different lighting categories (Tbl. 1).
Fig. 30: Webcam image with ROIs for brightness detection. The sky ROI is framed in cyan,
that of the ocean in yellow, and that of the ground in green.
Tbl. 1: Thresholds for Daylight Detection

Light Condition | Sky   | Ocean | Ground
Daylight        | ≥ 150 | ≥ 65  | ≥ 30
Twilight        | < 150 | < 65  | < 30
Night           | < 100 | < 50  | < 15
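A sketch of how these thresholds could be applied is shown below; the rectangle variables skyPos, oceanPos, and groundPos are hypothetical names for the fixed ROI positions, and the order of the checks (Daylight first, then Night, else Twilight) is an assumption:
g = rgb2gray(imread(fullfile(imgfolder, piclist(i).name)));
skyMean    = mean2(imcrop(g, skyPos));       % skyPos, oceanPos, groundPos: hypothetical
oceanMean  = mean2(imcrop(g, oceanPos));     % fixed ROI rectangles as described above
groundMean = mean2(imcrop(g, groundPos));
if skyMean >= 150 && oceanMean >= 65 && groundMean >= 30
    Light(i) = "Daylight";
elseif skyMean < 100 && oceanMean < 50 && groundMean < 15
    Light(i) = "Night";
else
    Light(i) = "Twilight";
end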
In addition to the lighting conditions, the detection of “bad weather” was implemented as
well. The goal is to find images taken during daylight but with bad weather conditions and to
prevent these from being saved into the Daylight folder. Two conditions are aimed to be
detected: fog and snow. Similar to the brightness detection, ROIs are defined to detect fog or
snow (Fig. 31). A polygon covering the sky (without the text and logo) and the beach is used
to detect fog by calculating the range of grey values; if this range is below 200, it is assumed
that fog covers the image. The range was used instead of the mean as fog, similar to Gaussian
blur, reduces details and contrast within an image. Therefore, if the range of values is low,
then this means that contrast is low, and the likelihood of fog is high. Snow is detected by
calculating the mean of the grey values within a polygon that covers the promenade; if the
mean is very high (> 125), then the pixels in this region are rather bright, which points to the
presence of snow, as the grass on the promenade has an otherwise low mean (Tbl. 1). The
positions and thresholds for snow and fog detection were determined empirically from
multiple images and will have to be altered if a different angle of view or site is used.
Fig. 31: ROIs for fog (left) and snow (right) detection.
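The two checks can be sketched as follows, assuming hypothetical logical masks fogMask and snowMask (created e.g. with roipoly) for the polygons shown in Fig. 31:
g = rgb2gray(imread(fullfile(imgfolder, piclist(i).name)));
fogVals  = double(g(fogMask));                   % fogMask, snowMask: hypothetical masks
snowVals = double(g(snowMask));
isFog  = (max(fogVals) - min(fogVals)) < 200;    % low contrast across sky and beach
isSnow = mean(snowVals) > 125;                   % unusually bright promenade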
Only images that are bright enough to have been taken during daylight hours and in which no fog or snow was detected are sorted into the Daylight folder. Before being saved into that folder, the images are cropped to the ROI and their size is reduced by half, as sketched below. For each of the three lighting conditions (Daylight, Twilight, and Night), the vector Light receives the condition name as an entry.
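A sketch of the cropping and saving step, reusing the ROI position pos from the ROI definition step:
I = imread(fullfile(imgfolder, piclist(i).name));
Icrop = imcrop(I, pos);                     % crop to the groyne ROI
Ismall = imresize(Icrop, 0.5);              % halve the image dimensions
imwrite(Ismall, fullfile(imgfolder, 'Daylight', piclist(i).name));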
Make Linking Table
The final step of the preprocessing workflow is to create a table that links the images and
their timestamps with measured data and information about the lighting conditions and
beach visibility (Fig. 32). For the beach visibility, a query for the water level is used to find
images where the beach is visible and save this information in the vector Beach. Here, the
beach is considered visible for a water level below 500 cm.
Fig. 32: Screenshot of the linking table.
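A minimal sketch of assembling such a table, assuming the vectors built in the previous steps are column vectors of equal length (the actual script may use different variable names):
Beach = wstand < 500;                       % beach considered visible below 500 cm
T = table(string({piclist.name}'), zst, wstand, welleHs, Light, Beach, ...
    'VariableNames', {'Image', 'Timestamp', 'WaterLevel', 'WaveHeight', 'Light', 'Beach'});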
Motivation for Preprocessing
While the preprocessing of data is time intensive, it has benefits for the tasks that use the
data. Both machine and deep learning work with extracted features either by receiving
them from the user or by generating them directly from the images. The original images have
a size of 4272 by 2600 px while the cropped and resized images only have a size of 446 by 316
px (Fig. 33). Cropping the image to the groyne reduces the influence of other image sections
(e.g. grass or sky) on the image features, which produces more clearly defined features and in
turn more robust results. In addition, the computational load can be reduced by using the
smaller images, as in this case they are usually smaller than 25 KB, while the original images are often larger than 1 MB.
Comparing the grey-level co-occurrence matrices (GLCMs) of the original (Tbl. 2) and the
cropped image (Tbl. 3) with one another, it is noticeable that the GLCM of the original image
has no zero elements off the main diagonal. If the values of a GLCM are concentrated along the diagonal, then it is often the case that two adjacent pixels within the image have
the same value, which means that there are large homogeneous regions present. If the values
of the GLCM are not concentrated along the diagonal, then the image is less homogeneous or
has more contrast [25], as is the case for the original image. The lack of homogeneity of the
original image becomes especially clear when looking at the GLCM represented as a colour
plot (Fig. 34). For the cropped image, only two fields vary strongly from the black
background, while almost the entire diagonal varies from the background for the original
image.
Tbl. 2: GLCM of the full-size webcam image.
Tbl. 3: GLCM of the cropped webcam image.
Fig. 34: Plotted GLCMs for the full-size (left) and the cropped image (right).
Comparing the properties of the two GLCMs to one another (Tbl. 4), differences are
noticeable. Contrast, which measures the intensity contrast between a pixel and its
neighbour over the whole image [26], is higher for the original image than for the cropped
image, which means that the latter is more constant. Homogeneity, which measures the
closeness of the distribution of elements in the GLCM to the GLCM diagonal [26], is slightly
higher for the cropped image, meaning that the GLCM is more diagonal and the image has
less contrast (compare Tbl. 2 + 3 and Fig. 34). The correlation, which measures how
correlated a pixel is to its neighbour over the whole image [26], is higher for the original
image. Looking at the original image, there are fewer sections with a stark change in pixel
values from one pixel to its neighbour, while pixels in the cropped image change value
strongly at the borders between sand or water and the groyne. Energy is the sum of squared
elements in the GLCM and returns 1 for a constant or uniform image [26]. As the original
image has more contrast and is less homogeneous than the cropped image, it is less uniform
and therefore returns a lower energy value.
While mean, range and variance are not part of the GLCM properties that MATLAB computes
(graycoprops, [26]), comparing them for the original and cropped image is still interesting,
especially since these values are all drastically lower for the cropped image. The lower mean
of the GLCM of the cropped image is a result of the values being lower on average than for the
original image. The lower range means that the difference between the minimum and
maximum values of the cropped image is lower than that of the original image. The lower
variance for the GLCM of the cropped image is a result of the values varying less around the
mean than those of the original imageʼs GLCM. These three values being lower for the
cropped image all point to it being the more homogeneous image, which is the same result
that the graycoprops produced.
Tbl. 4: Comparison of the GLCM properties.

Property    | Full-Size Image | Cropped Image
Contrast    | 0.0665          | 0.0564
Correlation | 0.9887          | 0.9689
Energy      | 0.1861          | 0.4715
Homogeneity | 0.9727          | 0.9737
Mean        | 1.7351e+05      | 2.5135e+03
Range       | 3,014,778       | 106,368
Variance    | 1.7974e+23      | 2.4311e+17
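The three additional statistics can be computed directly from the GLCM elements; a minimal sketch:
I = imread(fullfile(imgfolder, piclist(i).name));    % one example image
glcm = graycomatrix(rgb2gray(I));                    % GLCM of the image
m = mean(glcm(:));                                   % mean of all GLCM elements
r = max(glcm(:)) - min(glcm(:));                     % range of the GLCM elements
v = var(glcm(:));                                    % variance of the GLCM elements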
3.3.2 Creating Regression Data
While deep learning (DL) approaches can generate features from images by themselves,
machine learning (ML) approaches need a priori extracted features as an input (see 2.3.1
Background Information). Here, it will be explained how features are extracted (see Appendix
II: Regression with Interpolation) from images for later use with ML approaches.
The main workflow consists of eleven steps (see figure to the left),
which can contain multiple sub-steps. The routine starts with the
data being loaded into the workspace, and the timestamps being
extracted from the image names. Depending on these
timestamps, the corresponding water levels and wave heights are
extracted. Possible gaps in data are removed and empty arrays for
the features are created. With the gaps removed, the timestamps
and wave heights are updated accordingly to reflect the removal.
Then, the features are extracted from the images and the empty
arrays are filled with these features. Afterwards, the features are
used by the regression task to predict the wave heights in the
images. Using these predicted wave heights, the water level
prediction is separated into four subcategories for calm, smooth,
slight, and moderate sea. Last, the results are visualised with
different plots and the RMSEs are calculated both for each
prediction as well as for the entire regression task.
Load Data
The routine starts by loading the data into the workspace. For this purpose, the paths to the
folders with the cropped daylight images (\Bilder2015\Cropped\) and the measured data
(\Daten\) are saved into the variables imgfolder and datafolder, respectively. Both of
these folders, as well as the folder containing the scripts (\Matlab\), are part of the same
parent folder. A list with the paths to all images (piclist) is then created from the images in
the Cropped subfolder; its size is stored in the variable sz for later use.
Extract Timestamps from Images
Next, the timestamps are extracted from the images and saved into the variable zst. All
images have a naming convention similar to norderney-150808-1600-hu with the JPEG
ending. This name contains the timestamp in the format YYMMDD-hhmm, meaning that the
year, month, day, hour, and minute of the time a snapshot was taken can be extracted. The
script loops through all images in the list piclist until all timestamps have been extracted.
Get Wave Heights and Water Levels
Once the timestamps have been extracted from the image names, they can be used to find
the corresponding measured wave heights and water levels. The image timestamps are used
to normalise all data to the 10-minute resolution of the image; normally, the wave heights
are recorded every 30 minutes, while the water levels are recorded every minute (see 3.2
Data Sources). The script loads the file allewstand.mat into the workspace; this file
contains previously saved water levels wst and their timestamps dattim (see 3.3.1
Preprocessing the Image Data). The preprocessing script then creates a vector of the size of
the image list and assigns it to the variable wstand for the water levels. Using a loop, all
entries wst whose timestamp dattim corresponds to the image timestamp zst are saved
into wstand; therefore, gaps in the image data can result in gaps in the water level data used
for the regression task.
The wave heights HS are extracted from the file allewelle.mat, which contains previously
saved wave parameters (see 3.3.1 Preprocessing the Image Data). Instead of looping through
all data, this part of the preprocessing script finds instances of the image timestamp zst that
are found in the vector of wave measurement timestamps dattimwelle. This extracts the
measured wave heights HSfor any image taken at the same time; however, the resulting
vector welleHs can contain gaps. Given that the resolution of the wave height
measurements is 30 minutes and that of the images 10 minutes, there is generally not as
much measured data available as there are images. Using linear interpolation, these small
gaps can be filled. However, the wave data for 2015 and 2016 was unfortunately not
complete, meaning that there are gaps spanning multiple days and sometimes even an
entire month. Filling these gaps with interpolation would result in errors, which is why they
have to be removed.
Remove Gaps
Measured data can contain errors such as gaps or phase shifts, which would negatively affect analysis if left untreated. Visualising the data with plots helps to find larger gaps (see figures
24, 25, 27, and 28 in 3.2 Data Sources). Using MATLAB figures, it is possible to zoom in on the
starting and ending points of the gaps and to get their timestamp by clicking on the dots
corresponding to a measurement. These timestamps can then be used to index into the
image list to remove the gaps from it. If larger gaps were present and removed from the
image list, its dimension is now no longer compatible with that of the timestamp zst, water
level wstand, and wave height welleHs. It is necessary to update these variables to fit the
vector dimensions to one another; here, the sizing variable sz is updated for later use.
Make Arrays for Features
Four empty arrays are created to store the image features later. The features that are meant
to be extracted are the variance, mean, range, and properties (contrast, correlation, energy,
and homogeneity) of the grey-level co-occurrence matrix (GLCM). These features return
information on the spatial dependence between pixels and can be used for texture analysis,
which can in turn provide information on e.g. whether waves are present and whether the sea
surface is rough or smooth.
Update Timestamps and Wave Heights
As described in Remove Gaps, the variables need to be updated to reflect the new image list
size so that the vector dimensions are compatible. Here, the timestamp variable zst and the
wave height variable welleHs are updated by running the same code with which the initial
values were retrieved. Since the regression script allows the user to choose whether to do the
water level prediction with feature data extracted either from measured or from predicted
wave heights, the water levels stored in the variable wstand are not yet updated at this point
in the main workflow.
Fill Feature Arrays
Here, the empty arrays created in Make Arrays for Features are filled with feature data. The
code loops through all of the images and extracts the features from them.
The workflow of this sub-task (see figure
on the le) starts by loading an image
into the workspace, which is then
converted from RGB to grayscale using
the rgb2gray command. Afterwards, the
GLCM of the grayscale image is created
using the graycomatrix command.
Multiple properties are extracted from
the GLCM: its mean, range, variance, and
properties contrast, correlation, energy, and homogeneity. The latter four were extracted
using the graycoprops command. As described in 3.3.1 Preprocessing the Image Data,
these features contain information about the texture of an image. Last, the extracted feature
data is saved to two arrays: one for training (tableProps) and the other for prediction
(predData). The difference is that while the prediction data only contains the feature data,
the training data also contains the measured and interpolated wave heights welleHs, as the
ML regression task is based on supervised learning (see 3.5 Regression with Machine
Learning).
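For one image, the filling of the arrays can be sketched as follows; the exact array layout is an assumption, and the actual script may store these values in tables instead:
g = rgb2gray(imread(fullfile(imgfolder, piclist(i).name)));
glcm = graycomatrix(g);
p = graycoprops(glcm);                      % contrast, correlation, energy, homogeneity
feat = [var(glcm(:)), mean(glcm(:)), max(glcm(:)) - min(glcm(:)), ...
    p.Contrast, p.Correlation, p.Energy, p.Homogeneity];
predData(i, :) = feat;                      % features only, used for prediction
tableProps(i, :) = [feat, welleHs(i)];      % features plus response, used for training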
Predict Wave Height
The first step of the wave height prediction is to load the previously trained model from the
trainedModel.mat file. This model, like the other models used in the script, was trained in the
native MATLAB Regression Learner App (see 3.5.1 Regression Learner App) and uses Gaussian
process regression (GPR) (see 3.5.2 Model Example: Gaussian Process Regression (GPR)).
Once the model has been loaded into the workspace, it is possible to either start the wave
height prediction immediately using the feature data predData and the line
yfit = trainedModel.predictFcn(predData);
or to retrain the model on the previously generated training data tableProps and using the
following code,
[trainedModel, validationRMSE] = trainRegressionModel(tableProps);
which is commented out by default as retraining is only necessary if the angle of view or the
site have changed.
Predict Water Level
The water level is predicted separately from the wave height, as waves on the images can be
interpreted as noise on the water level. In addition, a model that only receives images as
input might not be able to differentiate between two vastly different water levels if the water
surface is similarly rough due to waves, regardless of whether these are tidal or wind-induced
waves. Using the minimum and maximum recorded wave heights, it was determined that five
sea state classes were present: calm (glassy), calm (rippled), smooth, slight, and moderate.
Since the effects of a glassy sea (wave height = 0 cm) and a rippled sea (wave height ≤ 10 cm)
are equally low on coastal protection measures such as revetments, these two sea states
were combined into one, leaving the classes calm, smooth, slight, and moderate.
Therefore, the first step of the
subroutine for the water level
prediction (see figure to the left)
was to set the thresholds for the sea
states. Using the Douglas scale [27],
the following thresholds were set:
wave heights ≤ 10 cm are sorted into the calm category, wave heights ≤ 50 cm are sorted into the smooth category, wave heights ≤ 125 cm are sorted into the slight category, and all wave heights above 125 cm are sorted into the moderate category, which
was possible as no wave heights higher than 250 cm (upper bound for this state) were
measured. If the routine is applied to a case where higher wave heights are present, the
thresholds have to be changed accordingly.
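The sorting into sea state classes can be sketched with discretize; the variable names here are illustrative:
edgesHs = [-inf, 10, 50, 125, inf];         % class boundaries in cm
state = discretize(yfit, edgesHs, 'categorical', {'calm', 'smooth', 'slight', 'moderate'});
idxSmooth = (state == 'smooth');            % e.g. select the smooth-sea subset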
Aerwards, the water levels saved in wstand are updated to match the timestamps of the
image list shortened by the removal of gaps (see section Remove Gaps in this chapter). Next,
data for the regression task is created with the updated water levels. This data is created by
combining the feature data stored in predData, the water levels stored in wstand, and the
wave heights. For the wave heights, either the measured data stored in welleHs or the
predicted values stored in yfit can be used. The wave heights are used to index into the
regression data to sort the water levels by sea state. Using the measured wave heights for the
thresholds is useful when creating training data with the water levels, as these values are not
falsified by mispredictions of the regression models. Since the models used in this script had
already been trained and did not need retraining, the predicted wave heights were used to
index into the water levels.
The prediction task starts by loading the trained models into the workspace. They are stored
in files with the naming convention trainedModel*.mat, where * can be Calm, Smooth,
Slight, or Moderate. Similar to the prediction of the wave height, it is possible to either
start the prediction directly or to retrain the models on new training data first. The results of
the predictions are saved into variables yfit_*, where * is the name of the sea state in
lowercase letters.
Visualise Results
The results of the predictions are visualised in multiple ways, always plotting the measured
data against the predicted values. Histograms are used to show the distribution of the data,
displaying how many observations are put into the same range (bin width) for the measured
and predicted data. The wave height and the water levels for smooth and slight seas use a
bin width of 5 cm, as they contain many observations, and opting for a larger bin width (e.g.
50 cm) would not reflect the actual distribution very well. For the water levels of calm and
moderate seas, there were fewer observations available, so the larger bin width of 50 cm
could be used. Depending on the number of observations in general, and per bin, the bin
width has to be changed by the user to fit their desired visualisation of the distribution.
Another way of visualising the results is to use a scatter plot that plots measured data against
predicted values and draws the diagonal regression line for the perfect prediction, to assess
visually how well the predicted values reflect the measured data. The observations were also
plotted over time with the predicted values, to assess how much a predicted value differs
from a measured data point. Lastly, a plot over time for all water levels combined is created
to compare the resulting curve of the predicted values to the tide curve from the measured
data, to assess how well the prediction reflects the natural values.
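These plots can be sketched in a few lines; bin width and axis limits have to be adapted to the data at hand:
histogram(welleHs, 'BinWidth', 5); hold on
histogram(yfit, 'BinWidth', 5); legend('measured', 'predicted')
figure
scatter(welleHs, yfit, '.'); hold on
plot([0 250], [0 250], 'k--')               % diagonal of a perfect prediction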
Calculate RMSE
The last step of the main workflow is to calculate the RMSE for the predictions, which is used
to evaluate their quality. The RMSE shows how far a prediction deviates from the measured value by computing the Euclidean distance between the two [28]. In MATLAB R2022a, the RMSE can be computed using
the following code:
rmse(i) = round(sqrt(mean((meas(i) - pred(i)).^2)));   % meas: measured, pred: predicted values
Starting from version R2022b, the RMSE can be calculated using the rmse command.
Mathematically, the RMSE can be represented by the following formula, where A is the
measured and F is the predicted value:
$$\mathrm{RMSE} = \sqrt{\frac{1}{n}\sum_{i=1}^{n} \left| A_i - F_i \right|^2}$$
Eq. 1
The script loops through all entries in the image list and calculates the RMSE for each
prediction. Once the RMSEs for all individual predicted and measured values have been
computed, possibly existing NaN values are removed using the rmmissing command.
Aerwards, the mean and the standard deviation of all individual RMSEs are computed.
Computing the mean of the individual RMSEs returns the RMSE for the entire dataset, or rather, for the fraction that has been analysed. The standard deviation measures the dispersion
of a dataset relative to its mean, or in simpler terms, how much the data is spread around the
mean [29]. The individual RMSEs, the mean RMSE, and the standard deviation of the RMSE
are all calculated in cm for this application.
3.3.3 Preprocessing Image Data for Classification
The classification task was done using a deep learning (DL) approach. While DL is capable of
extracting features from images by itself, there is still some preprocessing necessary.
Classification of images with DL needs two inputs: the images themselves and the classes
they belong to. Here, classes were created for sea states and water level groups. Classes can
be created by sorting images into different subfolders and giving these folders names to use
as class names.
Make New Subfolders for Image Classes
Using the linking table created in 3.3.1 Preprocessing the Image Data, the water level
wstand, the wave height welleHs, and the lighting condition Light can be used to index
into the image list piclist to sort images into different categories. For example, using
ind = find(wstand > 400 & wstand < 450 & welleHs <= 10 & Light == "Day");
sortlist = piclist(ind);
the query of an image taken during daylight hours at a water level above 400 and below 450 cm with a wave height of less than or equal to 10 cm can be used to index into the image list
piclist, returning all instances for which this query is true and saving it to the variable
sortlist. These images can then be copied into new folders using the code
for i=1:length(sortlist)
copyfile(fullfile(imgFolder, sortlist(i).name), newFolder)
end
where imgFolder is the original folder where all the images are located, and newFolder is
the location for images belonging to a certain class. These folders can be specified in the
following way, where the name of the new folder can be used as the class name:
imgFolder = [wd(1:end-6) 'Bilder 2015\Cropped\'];
newFolder = [wd(1:end-6) 'Bilder 2015\Cropped\400to450calm\'];
Create Datastore
The classification task was done in the Deep Network Designer, which expects image data to
be stored in an image datastore (see 3.4.1 Deep Network Designer). Datastores are useful
when individual items of a collection fit into memory but the whole collection does not.
Using the datastore, items from the collection can be loaded into the workspace individually,
saving on memory and computational load. If the data is spread over multiple subfolders
within the same main folder, and their names correspond to names that should be used as
class names for the classification task, an image datastore can be created the following way:
imds = imageDatastore(imgFolder, "IncludeSubfolders", true, ...
    "LabelSource", "foldernames");
Here, imds is the image datastore and imgFolder is the main folder; the name-value
arguments define that subfolders will be included in the data store as well and that their
names will be used as labels (for classes). Such a data store can then be imported into the
Deep Network Designer to classify images based on detected features.
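Before importing, the class balance can be checked and a validation split created; a brief sketch:
countEachLabel(imds)                                              % number of images per class
[imdsTrain, imdsVal] = splitEachLabel(imds, 0.8, 'randomized');   % hold out 20% for validation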
Image Types
Deep learning approaches are capable of extracting features from images without the user
defining these features a priori. However, different types of images contain different
information, and depending on this information, features and results can vary. In the scope
of this thesis, three types of images, all based on the cropped images showing the groyne,
were used: normal RGB images, images showing only the extracted edges, and difference
images (Fig. 35).
Fig. 35: Different image types used for the classification task. Normal RGB image (left),
edge image (centre), and difference image (right).
The edge images were created by taking the normal RGB images, converting them to
grayscale using rgb2gray, and then extracting the edges using the edge command with the
approxcanny method, which is an approximate version of the Canny edge detection
algorithm [30]. These images contain information only about the edges present in an image,
such as the borders of the groyne, the waterline, or ripples caused by waves.
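A sketch of creating and saving one edge image, with edgeFolder as a hypothetical output folder:
E = edge(rgb2gray(imread(fullfile(imgfolder, piclist(i).name))), 'approxcanny');
imwrite(E, fullfile(edgeFolder, piclist(i).name));    % edgeFolder: hypothetical output folder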
The difference images contain information on the differences between one image and an average of all images taken during similar conditions (e.g. similar wave height and water level).
These types of images can be useful for the automatic generation of features by a DL model,
as they contain only the information that is needed to detect one specific image, rather than
containing additional information that might lead to misclassification.
First, an average image from all images
taken during similar conditions, which are
ideally stored in one folder as described in
Make New Subfolders for Image Classes, is
created using the function code from [31] as
a base. Essentially, an average image (see
figure to the le) is created by adding all
images to one another, and then dividing
the sum through the number of images.
The difference images (see e.g. figure to the left) are then created by subtracting a
cropped image from the average image,
leaving an image that shows only the
differences between the individual image
and all of the other images taken during
similar conditions.
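Both steps can be sketched as follows, with classFolder as a hypothetical folder holding all images of one class:
files = dir(fullfile(classFolder, '*.jpg'));          % classFolder: hypothetical class folder
acc = 0;
for k = 1:numel(files)
    acc = acc + im2double(imread(fullfile(classFolder, files(k).name)));
end
avgImg = acc / numel(files);                          % average image of the class
I = im2double(imread(fullfile(classFolder, files(1).name)));   % one individual image
diffImg = imabsdiff(I, avgImg);                       % difference of the image to the average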
3.4 Classification and Regression with Deep Learning
3.4.1 Deep Network Designer
Both the regression and the classification tasks were done in the Deep Network Designer,
which is part of MATLABʼs Deep Learning toolbox. At the start, in the “Deep Network Designer
Start Page” (Fig. 36), a model to work with has to be chosen. It is possible to choose between
a blank network, a network from the workspace, pretrained image networks, and sequence
networks. The image networks, such as the GoogLeNet or ResNet18 (see 3.4.2 Network
Example: ResNet18), were pretrained on large image datasets (e.g. ImageNet) and are
suitable for transfer learning with image data. Transfer learning allows the use of a pretrained
model on new data, intending to improve the generalisation of the latter by exploiting the
knowledge of the prior [32]. Sequence models are used for classification and regression
problems on sequence and time series data. By choosing a model from the workspace, a
model that had previously been created and exported can be reused. With the blank network
option, a new model can be created from scratch by selecting different layers. In the scope of
this thesis, the pretrained ResNet18 network as well as custom networks were used for
classification and regression tasks on the webcam images (see 3.1 Webcam Technology).
Fig. 36: Main window and model selection menu of the Deep Network Designer.
Aer selecting a suitable option for the given task and data, the main window (Fig. 37) will
display different layer options, and, if a premade model has been selected, the model
structure. The model in Fig. 37 is a pretrained ResNet18 network, and the different coloured
blocks in its structure correspond to different layer types. For example, the blue block at the
top is an image input layer that loads 2D images into the model, while the red block at the
bottom is a classification layer which assigns a class to an image aer completing various
matrix calculations and detection mechanisms.
Fig. 37: Active main window of the Deep Network Designer.
If the goal is to use transfer learning with new data on the pretrained network, it is necessary
to change the last three layers. The output size of the fully connected layer has to be changed
either according to the number of classes for classification or to 1 for regression. Additionally,
the somax and the classification layer need to be replaced to match the output size. For
regression tasks, the last layer needs to be a regression layer instead of a classification layer,
the somax layer has to be removed, and the input has to contain predictors (e.g. images)
and responses (e.g. measured data).
As replacing layers, or even creating a network from scratch, can introduce errors,
it is useful to use the built-in analyzer (Fig. 38) to determine if the model is plausible,
meaning for example that the output of one layer matches the input of the following layer.
Here, the model structure is not displayed as editable blocks but as a layer graph. Layers are
shown as blue dots with the layer name next to them, while the blue lines show the
connections between layers. Branches symbolise the residuals which are included to tackle
the problem of vanishing gradients in very deep neural networks [33]. The table next to the
layer graph contains information about each layer in the network. The first column includes
the name of the layer (here e.g. data [default name would be imageInputLayer]) and a
description of what this layer does in the network (e.g. loading images and applying data
normalisation), while the second column includes the layer type (e.g. Image Input). The third
column includes the activations of a layer, which consist of the input (size, channels), weight,
and bias. The activations are calculated as shown in Eq. 2.
$$Y = \mathrm{activation}\left( \sum (\mathrm{weight} \cdot \mathrm{input}) + \mathrm{bias} \right)$$
Eq. 2
The output of the activation function will be passed onto the next layer as an input (forward
propagation), and the error from the forward propagation will be used to update the weights
and biases (backpropagation) [34]. These weights and biases are listed in column four as the
learnable properties of each layer.
Fig. 38: Deep Learning Network Analyzer.
Once the model has been edited or created and its plausibility has been ensured, data can be
uploaded for training (Fig. 39). In the case of the pretrained ResNet18, the input needs to be
image data. The image width and height can differ from the 224x224 layer input size as
images will be resized during training. The number of channels (here: 3 for RGB) has to match
that of the layer input size, otherwise, the training session will produce an error, and training
will not be started. The data import offers the option of augmentation, which increases a
dataset artificially by modifying the original data (e.g. through rotations or translations) to
create new data points, which can in turn improve the model performance [35]. Additionally,
a specified percentage of the original dataset can be set aside for the validation of the results.
The aim is to prevent overfitting, where a model achieves near-perfect predictions for the
very specific dataset it has been trained on but fails to reproduce these results when
confronted with new data (lack of generalisation) [36].
Fig. 39: Import Image Data window of the Deep Network Designer.
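Augmentation can also be set up in code; a sketch with illustrative rotation and translation ranges:
aug = imageDataAugmenter('RandRotation', [-10 10], 'RandXTranslation', [-5 5]);
augimds = augmentedImageDatastore([224 224 3], imdsTrain, 'DataAugmentation', aug);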
For more detailed information regarding the use of the Deep Network Designer, please refer
to the official documentation by MATLAB, which is linked in [37].
3.4.2 Network Example: ResNet18
The ResNet18 is a convolutional neural network that is 18 layers deep and includes residual
layers. The version available in MATLABʼs Deep Network Designer (see 3.4.1 Deep Network
Designer) has been trained on the ImageNet database and can classify images into 1000
object categories [38]. As of 2009, ImageNet consists of 12 subtrees with 5247 synsets
and 3.2 million images in total. ImageNet uses the same hierarchical structure as WordNet,
where synsets (synonym sets) are concepts that can be described by multiple words or
phrases (Deng et al., 2009). Subtrees start with a broad definition of a concept at the root and
get more precise along the branches until reaching the leaf, where there are no further
synonyms for a concept and no ambiguity regarding the definition is left (Fig. 40).
Fig. 40: Example root-to-leaf branches for the mammal (top row) and vehicle (bottom row)
subtrees (Deng et al., 2009).
Since the ResNet18 is a convolutional neural network (CNN), it consists of two sections, one
for convolution and one for classification, rather than just one section for classification, as is
the case with MLP models (Multilayer Perceptron). During convolution, features are extracted
from the images by running them over multiple kernels with different filter operations (e.g.
edge detection, Gaussian blur). Afterwards, the features will be passed onto fully connected layers, where the individual features will be combined to classify the images according to the
classes preset by the user [39].
The ResNet18 receives its name from the residual layers which are part of its structures.
These residual layers were included as very deep networks often suffer from vanishing
gradients, where the gradient signal quickly reaches zero, which makes their gradient
descent prohibitively slow [40]. Once vanishing gradients are addressed and deep networks are able to start converging, a degradation problem is revealed: with increasing network depth, the accuracy becomes saturated and then degrades rapidly (He et al., 2016). Residual layers are also
known as skip or shortcut connections, as they allow data to bypass the normal flow of the
CNN by skipping one or multiple layers. There are two types of shortcut connections: identity
blocks and convolutional blocks [40]; both are present in the ResNet18 structure.
Diving deeper into the network structure, the 18 layers are organised into blocks of individual layers. There
are five types of blocks present: an input block, the aforementioned identity and
convolutional blocks, addition and thresholding blocks, and a classification block. The layers
within these blocks will be explained in regards to their purpose; the details of individual
layer properties will not be illustrated here but can be accessed via the MATLAB
documentation for deep network layers, linked in [41].
The input block (figure to the left) consists of five different layer
types. The imageInputLayer inputs a 2D image into the network
and applies data normalisation, which rescales the input values (for example by subtracting the training set mean) so that training is more stable. For the ResNet18, this layer has an input size of
224x224x3, corresponding to width, height, and the number of
channels (3 for RGB images).
The output of the imageInputLayer is then passed to a
convolution2dLayer, which applies sliding convolutional filters to
the input. Here, a convolutional implementation of the sliding
window algorithm that convolves over the spatial dimensions is
being used. The filter moves along these dimensions and
computes the dot product of the weights and input, and then adds
a bias term [43]. The filters form feature maps, which will then be passed onto the next layer,
which is a batchNormalizationLayer.
A batch normalisation layer normalises a mini-batch of data across all observations for each
channel independently [44]. Using this layer type between convolutional layers and
nonlinearities (e.g. ReLU layers) speeds up training and reduces sensitivity to
initialization. As the batch normalisation layers normalise the gradients and activations, the
optimization problem underlying training becomes better conditioned, which increases the
learning speed of the network significantly (Ioffe & Szegedy, 2015). The output of this layer is then passed to a
reluLayer (ReLU: rectified linear unit), which performs a threshold operation on the input
elements. Values that are less than zero will be set to 0, which can be represented by Eq. 3.
While the values of the input are changed by the reluLayer, the input size will remain
unchanged [45].
f(x) = \begin{cases} x, & x \ge 0 \\ 0, & x < 0 \end{cases} \quad \text{(Eq. 3)}
The final layer of the input block is a maxPooling2dLayer, which performs downsampling
over the spatial dimensions by dividing the input into rectangular pooling regions and
computing their maximum [46]. The output is a pooled feature map that highlights the most
prominent feature in each region. Downsampling the feature maps lowers their sensitivity,
making them more robust to changes in the position of features in the image (also referred to as
local translation invariance) [47].
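As an illustration only, the input block described above could be assembled in MATLAB as follows; the filter sizes and strides are those of the standard ResNet18 configuration, not values quoted in this thesis:

% Hedged sketch of the ResNet18 input block (standard configuration assumed).
inputBlock = [
    imageInputLayer([224 224 3], "Normalization", "zerocenter")  % 224x224 RGB input
    convolution2dLayer(7, 64, "Stride", 2, "Padding", 3)         % 7x7 conv, 64 filters
    batchNormalizationLayer                                      % normalise activations
    reluLayer                                                    % threshold at zero (Eq. 3)
    maxPooling2dLayer(3, "Stride", 2, "Padding", 1)              % downsample feature maps
];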
Aer running through the input block, the data is handed to the first
identity block (figure to the le), which consists of two branches. One
is the shortcut connection, which skips over the other layers and
passes the output of the pooling layer straight to the addition layer of
an addition and thresholding block.
The other branch consists of five layers. The first layer of the block is a
2D convolutional layer, which is followed by a batch normalisation
layer, a reluLayer, a second 2D convolutional layer, and a second
batch normalisation layer. The output of the last layer is then
combined with the output of the pooling layer via an addition layer.
Identity blocks are standard blocks in residual networks and are used when the activations of
the input and the output have the same dimension [40].
The addition and thresholding block (figure to the right) consists
of two layers, an addition layer and a ReLU layer. While the ReLU
layer performs a thresholding operation, the addition layer simply
adds the two inputs it receives from the main branch and the
shortcut connection to one another. In the structure of the
ResNet18, this type of block always follows after a residual block,
regardless of whether it is an identity or a convolutional block.
Another type of shortcut connection that
appears in the ResNet18 is the convolutional
block (figure to the left). This block is used
when the dimensions of the input and output
do not match one another [40]. The main
difference, compared to the identity block, is
the presence of a 2D convolutional layer and a
batch normalization layer in the shortcut path.
So, instead of the output from the previous
ReLU layer being passed directly to the next
addition layer, the data in the shortcut path will
be transformed into feature maps and
normalised once before being passed on.
The final block in the ResNet18 structure is the classification block, which assigns classes to
the images the network received as inputs. The first layer of this block is a global average 2D
pooling layer (GAP layer), which performs downsampling by computing the mean of the
spatial dimensions. By reducing the total number of parameters in the model, overfitting can
be minimised [48].
Aer the convolutional and downsampling layers, the fully connected layer multiplies the
input it receives with a weight matrix and adds a bias vector aerwards. Here, all the features
learned by the previous layers from the image are combined to identify larger patterns across
the image. The last fully connected layer combines the features to classify the images,
flattens the output and encodes spatial data in the channel dimension [49].
The somax layer adds a somax function to its input (Eq. 4). This type of layer usually
follows aer the last fully connected layer in a network [50]
Eq. 4
The final layer of the classification block (figure to the left)
and the pretrained ResNet18 is a classification layer. It
computes the cross-entropy loss for classification and
weighted classification tasks with mutually exclusive classes.
The cross-entropy or log loss is used to measure the
performance of a classification model. Given an output with a
probability value between 0 and 1, a perfect prediction would
result in a log loss of zero. For multiclass classification, the
loss can be calculated as

L = -\sum_{c=1}^{M} y_{o,c} \log(p_{o,c}) \quad \text{(Eq. 5)}

where M is the number of classes, y_{o,c} is the binary indicator
for true (1) or false (0), and p_{o,c} is the predicted probability
of observation o belonging to class c.
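For illustration, a small numeric example (the values are invented) ties Eq. 4 and Eq. 5 together for one observation with M = 3 classes:

z = [2.0 1.0 0.1];           % raw scores from the last fully connected layer
p = exp(z) ./ sum(exp(z));   % softmax (Eq. 4): p ≈ [0.659 0.242 0.099]
y = [1 0 0];                 % one-hot indicator of the true class
loss = -sum(y .* log(p));    % cross-entropy (Eq. 5): loss ≈ 0.417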
Finally, the individual layers of the network are connected to one another. For the ResNet18,
the blocks of layers follow the structure as shown in Fig. 41.
Fig. 41: ResNet18 structure, simplified block version.
3.4.3 Classification Task
The classification with deep learning was started by creating multiple subfolders, whose
names were used as classes (see 3.3.3 Preprocessing Image Data for Classification). Classes
were created for different water levels and sea states available in the data, resulting in 43
classes in total. There were ten main water level classes (below and above 350 cm, below
and above 450 cm, above 500 cm, above 550 cm, below and above 625 cm, above 650 cm,
and above 700 cm) as well as five sea state classes for the wave heights: calm glassy, calm
rippled, smooth, slight, and moderate.
Referring to the MATLAB tutorial “Transfer Learning Using Pretrained Network” [51], a
pretrained GoogLeNet was used to be trained on the webcam images. GoogLeNet is a
convolutional neural network (CNN), 22 layers deep, and was trained on the ImageNet
dataset [52]. For the purpose of transfer learning, where a pretrained network is retrained on
new data, the last three layers of the network (fully connected, softmax, and classification
layer) were replaced; the output size of the fully connected layer was set to 43, the total
number of classes. The cropped RGB images of the groyne were used as the image input.
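A minimal sketch of this layer replacement, assuming the layer names of the pretrained GoogLeNet shipped with MATLAB (the thesis performed the same steps interactively in the Deep Network Designer):

net = googlenet;                 % pretrained on ImageNet
lgraph = layerGraph(net);
numClasses = 43;                 % one class per subfolder
lgraph = replaceLayer(lgraph, "loss3-classifier", ...
    fullyConnectedLayer(numClasses, "Name", "fc_webcam"));
lgraph = replaceLayer(lgraph, "prob", softmaxLayer("Name", "softmax_webcam"));
lgraph = replaceLayer(lgraph, "output", classificationLayer("Name", "class_webcam"));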
Later, the number of classes was reduced to 32. The classes calm glassy and calm rippled
were combined, as there is no drastic difference in the effect that no waves (glassy sea) or waves
below 10 cm (rippled sea) have on coastal protection structures. Furthermore, images where
the weather conditions were not ideal (e.g. snow, fog, sun glint, raindrops) or the groyne was
too obstructed (e.g. covered by sand, strong shadows reducing contrast, completely covered
with water) were removed from consideration.
The network was later changed from the GoogLeNet to the ResNet18, as the latter has a
lower relative prediction time (when using a GPU), which means that it is generally faster, and
has higher accuracy than the GoogLeNet (Fig. 42). In Fig. 42, the area of each marker is
proportional to the size of the network on disk [53]. In addition to the RGB images, edge images and
difference images (Fig. 43) were used as inputs for the classification (see 3.3.3
Preprocessing Image Data for Classification). The default training parameters were also
changed at times, to assess how, for example, the choice of solver affects the prediction results.
Fig. 42: Relative prediction speeds and accuracies of different pretrained networks.
Fig. 43: Different image types used for the classification task. RGB image (left), edge image
(centre), and difference image (right).
3.4.4 Regression Task
For the regression task in the Deep Network Designer, different approaches were used. These
approaches consisted of a linear network, a modified ResNet18, and a network with two
inputs. Normally, the input for training regression models is an array or table consisting of at
least one predictor and a response variable. Predictors are features from which the model
can predict a value. During training, these predictors need to have a response assigned to
them as the target variable the model has to predict correctly. The regression with machine
learning was done on 24 feature columns, consisting of the mean, range, and variance of the
grey-level co-occurrence matrix (GLCM).
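A hedged sketch of how such GLCM statistics can be computed per image; the offsets and the file name are illustrative, and the thesis' exact 24-column feature layout is not reproduced here:

I = rgb2gray(imread("groyne_cropped.jpg"));   % hypothetical cropped ROI image
offsets = [0 1; -1 1; -1 0; -1 -1];           % four common GLCM directions
glcms = graycomatrix(I, "Offset", offsets);   % one 8x8 GLCM per offset
feats = zeros(1, 3 * size(glcms, 3));
for k = 1:size(glcms, 3)
    g = double(glcms(:, :, k));
    feats(3*k-2 : 3*k) = [mean(g(:)), range(g(:)), var(g(:))];  % mean, range, variance
end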
The first approach involved recreating a linear model from a tutorial on regression with deep
learning published by MATLAB [54]. The input size of the image input layer was changed from
28 x 28 x 1 for a grayscale image to 1 x 24 x 1 for the feature list with 24 columns per row.
Information about these layers is displayed in Fig. 44 below, and its layer graph is plotted in the
left image of Fig. 47.
Fig. 44: Layer information for the linear model.
The second approach for the regression task was based on recreating the ResNet18 (see
3.4.2 Network Example: ResNet18) to perform regression rather than classification. For this
purpose, while the layer structure remained the same, all layers had to be replaced. The
input size of the image input layer was changed from 224 x 224 x 3 to 1 x 24 x 1 for the
regression training data. Next, all layers where the property NumFilters (number of filters;
defines the number of neurons within a layer and the number of feature maps it generates as
an output) was set to 64 were changed so that NumFilters = 32, which was necessary due to
the changed input size; this change was applied to the convolutional layers. Batch
normalisation and ReLU layers had to be replaced as well, as their parameters had previously
been defined based on the number of filters being 64. The fully connected layer was replaced
and changed to return only one output, which is the numerical value predicted by the model.
Lastly, the softmax layer was removed entirely, and the classification layer was replaced by a
regression layer [55].
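For illustration, the final-layer swap alone (omitting the NumFilters changes described above) could look like this on the pretrained network; the layer names are those of MATLAB's pretrained resnet18 and may differ in a recreated network:

lgraph = layerGraph(resnet18);                % pretrained ResNet18
lgraph = replaceLayer(lgraph, "fc1000", fullyConnectedLayer(1, "Name", "fc_reg"));
lgraph = removeLayers(lgraph, "prob");        % softmax removed entirely
lgraph = removeLayers(lgraph, "ClassificationLayer_predictions");
lgraph = addLayers(lgraph, regressionLayer("Name", "reg_out"));
lgraph = connectLayers(lgraph, "fc_reg", "reg_out");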
The last approach was based on a tutorial by MATLAB about training networks on both image
and feature data [56]. Following the network architecture (Fig. 45), the model receives images
and feature data as inputs. The upper branch derives features from the images and combines
them to derive larger patterns in the image; these patterns can then be used for classification
or regression. The lower branch provides the feature data, which consists of previously
extracted features (predictors) and their matching target variables (responses). These two
inputs are combined, and another fully connected layer derives patterns from them. While
the network architecture below finishes with a softmax and a classification layer, for the
purpose of regression both of these layers were replaced with a regression layer.
Fig. 45: Network architecture for a model with two inputs.
The image input layer has a size of 313 x 412 x 3, matching the size and number of channels
of the cropped RGB images, and the feature input layer has a size of 1 x 24 x 1, the same as the size
of the regression data. Information about these layers is displayed in Fig. 46 below, while its
layer graph is plotted in the right image of Fig. 47.
Fig. 46: Layer information for the model with two inputs.
Fig. 47: Layer graphs for the linear model (left) and the model with two inputs (right).
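A compact sketch of this two-input idea; the input sizes follow the text, while the branch depths and layer names are illustrative rather than the thesis' exact architecture:

imgBranch = [
    imageInputLayer([313 412 3], "Name", "images")
    convolution2dLayer(3, 16, "Padding", "same")
    reluLayer
    globalAveragePooling2dLayer
    fullyConnectedLayer(16, "Name", "fc_img")
];
merge = [
    concatenationLayer(1, 2, "Name", "cat")   % join image and feature branches
    fullyConnectedLayer(1)                    % single numeric output
    regressionLayer
];
lgraph = layerGraph(imgBranch);
lgraph = addLayers(lgraph, featureInputLayer(24, "Name", "features"));
lgraph = addLayers(lgraph, merge);
lgraph = connectLayers(lgraph, "fc_img", "cat/in1");
lgraph = connectLayers(lgraph, "features", "cat/in2");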
3.5 Regression with Machine Learning
3.5.1 Regression Learner App
The regression task was done in the Regression Learner App, which is part of MATLABʼs
Statistics and Machine Learning Toolbox. At the start, in the main “Regression Learner”
window, it is possible to start a new session, either by selecting data from the
workspace or by importing data from a file, or to open a previous session. When opting to use
regression data from the workspace, a new window, “New Session from Workspace” (see Fig.
48), will open; here, the training data can be selected.
Fig. 48: “New Session from Workspace” window of the Regression Learner app.
Under the section “Data set”, subsection “Data Set Variable”, the main dataset for the
regression task can be chosen. It is possible to select between using the columns or the rows
as variables; in the scope of this thesis, the default “Use columns as variables” was used, as
the training data had been created with each array column corresponding to a predictor or
response (see 3.3.2 Creating Regression Data).
In the subsection “Response”, the response variable for the dataset can be chosen. It is
possible to choose the response either from a dataset variable and specify which column or
row should be used as the response variable or to import the response variable from the
workspace. The default option “From data set variable” was used in this thesis, and the 29th
column was chosen, as it contains the measured wave heights or water levels (see section
3.3.2 Creating Regression Data). The predictor variables that are to be used for the regression
can be selected under the subsection “Predictors”. By default, when going with a response
from the data set variable, all other columns or rows are used as predictors. For this
regression task, the default was used; however, individual columns can be deselected if it is
known in advance that they are not suitable as predictors.
Under the section “Validation”, the validation scheme can be chosen. Validation of the
training data is necessary to protect against overfitting. Overfitting describes the process
where a model becomes overly complex by aiming to match the initial training data points as
closely as possible [57]. This is a problem, as the model will only be able to perform well on
the training dataset but will perform much worse with new data; a lack of generalisation is
introduced by the overfitting. In the Regression Learner App, there are three validation
schemes to choose from: cross-validation, holdout validation, and resubstitution
validation. Holdout validation sets aside some of the data for the validation and only runs the
validation once. Cross-validation splits the dataset into k subsets of equal size and
computes the validation loss for each of them. For resubstitution validation, the error is calculated
on the training data itself (true vs predicted values); it is therefore not suitable to prevent or reduce overfitting [58].
The default option is cross-validation, which was used for the regression task of this thesis.
When cross-validation is chosen as the validation scheme, the number of folds can be
specified. The default is 5, which was kept in the scope of this thesis. Finally, it is possible to
choose whether a data set should be set aside for testing under the section “Test”. No data is set
aside by default, and after comparing the performance of models measured by the RMSE (see
4.3.2 Regression with Machine Learning), it was decided against setting aside data for test
purposes.
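Outside the app, the same 5-fold scheme can be reproduced programmatically; a sketch under assumed variable names (X: n-by-28 predictor matrix, y: response vector), not the thesis' own script:

cvMdl = fitrgp(X, y, "KernelFunction", "matern52", ...
    "Standardize", true, "CrossVal", "on", "KFold", 5);
cvRMSE = sqrt(kfoldLoss(cvMdl));   % kfoldLoss returns the cross-validated MSE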
Training a Regression Model
Once the data, response, predictors, validation scheme, and test data have been chosen, the
session can be started by clicking the “Start Session” button. This action opens the data in
the main window, where the Tree model (Fine Tree) is chosen by default to visualise the
target variable with a response plot.
There are multiple options available for trainable models. Under “Get Started”, it is possible
to train either all available models or only the models that are fast to train. With “Use
Parallel”, multiple models can be trained at the same time. The options for linear regression
(LR) models are Linear, Interactions Linear, Robust Linear, and Stepwise Linear. For regression
trees, fine, medium, and coarse trees are available. Support vector machines (SVMs) are
available with different kernel functions: linear, quadratic, cubic, and Gaussian; the Gaussian
kernel functions are available as fine, medium, and coarse. Gaussian process regression
(GPR) is available with a Rational Quadratic, Squared Exponential, Matern 5/2, or an
Exponential kernel. The kernel approximation regression (KAR) models are available as SVM
Kernel or Least Squares Kernel Regression; both of these are Gaussian kernel regression
models for nonlinear regression with a large data volume. The ensembles of trees, which
combine multiple regression trees, are available as Bagged (goal: reduce the variance of a
decision tree) or Boosted (goal: create a collection of predictors) [59]. The neural networks
(NN) are available as narrow, medium, wide, bilayered, and trilayered networks. For all of the
model groups, it is possible to load all models to be trained into the model space at once.
Except for the LR and the KAR models, all model classes have an optimizable model
available, which can be used for hyperparameter optimization.
There are three main optimizer options: Bayesian optimization, grid search, and random
search. The acquisition functions for the Bayesian optimizer are expected improvement per
second plus, expected improvement, expected improvement plus, expected improvement per
second, lower confidence bound, and probability of improvement. For the grid search, the
number of grid divisions can be specified. It is possible to set a training limit or a maximum
number of iterations for each optimizer.
Besides using the optimizer to determine the hyperparameters resulting in the best performance, it
is also possible to do feature selection and enable PCA. Feature selection makes use of
different feature ranking algorithms to determine which features have the greatest importance
for, or effect on, the prediction of the response variable. There are three options for the
ranking algorithm to be used on the features: MRMR, FTest, and RReliefF. After the
importance scores have been computed by the ranking algorithms, it is possible to select
either how many of the highest-ranked features should be kept or to select individual
features for training. In addition, principal component analysis (PCA) can be used to reduce
the dimensionality of a dataset to increase the simplicity and thereby the training speed of
the model. PCA aims to reduce the number of variables while preserving as much
information as possible [60].
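As an illustration of the PCA option, the equivalent command-line steps on the predictor matrix might look as follows (X is an assumed n-by-28 predictor matrix; the 95% variance cut-off is illustrative):

[~, score, ~, ~, explained] = pca(zscore(X));   % PCA on standardised predictors
nKeep = find(cumsum(explained) >= 95, 1);       % components explaining 95% of variance
Xreduced = score(:, 1:nKeep);                   % reduced-dimension predictors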
Once the training has been finished, it is possible to test the models either with data
previously split from the training set or with new data. Testing is useful to see how well the
trained model performs when confronted with new data, as models may overfit to
the training data, resulting in a loss of generalisation. Underfitting, where models are too
general to fit the training data, is possible as well; however, overfitting occurs more
frequently.
Visualising the Results
Aer training, as well as aer testing, the Regression Learner App provides multiple plots to
visualise the results. The response plot (Fig. 49) displays the response value (y-axis) for each
observation (x-axis). In the plot, true values (the responses or target variables) are
represented by round blue markers, while the predicted values are represented by round
yellow markers.
The plot of the predicted vs actual values (Fig. 50) displays the true values on the x-axis and
the predicted values on the y-axis. The round blue markers represent the individual
observations, and the black diagonal line is the regression line, which expresses the perfect
prediction with zero error. Lastly, the residuals plot (Fig. 51) visualises the offset or error each
predicted value has in comparison to the true value at the same observation (here: record
number). All of these plots aim to visualise how well the prediction performs in direct
comparison with the true data.
Fig. 49: Exemplary response plot.
Blue dots represent the true
responses, while yellow dots
represent the predicted responses.
Fig. 50: Exemplary plot for
predicted vs true responses.
Blue dots represent the individual
observations while the black
diagonal equals the regression line
of the perfect prediction.
Fig. 51: Exemplary plot of residuals
for each observation.
The red lines represent the error of
the predicted response for each
observation.
3.5.2 Model Example: Gaussian Process Regression (GPR)
Gaussian process regression (GPR) is a form of supervised machine learning created to solve
prediction or regression problems. The Gaussian process (GP) prediction is, by definition, a
Gaussian distribution [61], and can be fully specified by a mean function m(x) and a
covariance function k(x, xʼ) (Rasmussen & Williams, 2005b). The definition of a GP as a
collection of random variables implies a consistency property (Rasmussen & Williams,
2005b), which means that the same prediction procedure can be applied to all data available,
regardless of the volume.
GPR is probabilistic and non-parametric [62]. Here, probabilistic means that, due to the
uncertainty being reduced close to the observations (Rasmussen & Williams, 2005a),
predictions can be made based on how likely their features reflect properties that the
original observations possessed. Non-parametric models do not compress the training data
into a finite-dimensional parameter vector, which means that they have an infinite number of
parameters (Rasmussen & Williams, 2005e). Realistic observations can be represented as y =
f(x) + ε, where f is the underlying signal that the GPR aims to reconstruct to predict the values,
and ε is the contaminating noise that needs to be removed (Rasmussen & Williams, 2005b).
Given that the removal of the noise is equal to smoothing the observations, the GPR can be
seen as a linear smoother (Rasmussen & Williams, 2005e). Both the hyperparameters and the
covariance function, which will be explained in the following paragraphs, have a strong effect
on the GPR and its results [62].
The kernel or covariance function k(x, xʼ) of the GPR relates observations to one another
(Ebden, 2008). The similarity of two inputs x and xʼ is measured, and by making inferences
about the relationship between inputs and outputs, it is determined what makes x and xʼ
similar. In the scope of this thesis, exponential (Eq. 6) and Matern 5/2 (Eq. 7) kernels were
used, which are stationary and nondegenerate (no finite rank). A stationary CF is a function of
x - xʼ, which means that it is invariant to translations of the input space. According to Mercerʼs
theorem, there exists a (possibly infinite) expansion in terms of basis functions for every
positive definite covariance function. CFs have free parameters, the hyperparameters; for
many CFs, their meaning is relatively simple, which is important for understanding the data
(Rasmussen & Williams, 2005d).
k(x_i, x_j) = \sigma_f^2 \exp\left(-\frac{r}{\sigma_l}\right) \quad \text{(Eq. 6)}

k(x_i, x_j) = \sigma_f^2 \left(1 + \frac{\sqrt{5}\,r}{\sigma_l} + \frac{5 r^2}{3 \sigma_l^2}\right) \exp\left(-\frac{\sqrt{5}\,r}{\sigma_l}\right) \quad \text{(Eq. 7)}

σ_l is the characteristic length scale (l), σ_f is the signal standard deviation, and r is the
Euclidean distance between x_i and x_j [63].
The hyperparameters of the kernel can be defined as θ = {l, σ_f, σ_n} (Ebden, 2008). The
parameter l is the length or horizontal scale, which represents the distance to move in input
space before function values change significantly. The effect of observations depends on the
characteristic length scale l and will be negligible for distant observations (Ebden, 2008). If l is
too short, the result is a quickly varying signal with low noise; however, if l is too long, the
result is a slowly varying signal with high noise (Rasmussen & Williams, 2005b). The inverse of
the length scale determines how relevant an input is; this correlation is used, e.g., in automatic
relevance determination (ARD) (Rasmussen & Williams, 2005d). The output or vertical scale is
given by σ², where σ_n² is the noise variance (also known as σ_ε) and σ_f² is the signal variance.
The signal variance or maximum allowable covariance is high for functions which cover a
broad range on the y-axis (Ebden, 2008). If the noise variance is high, the model will ignore
the data; however, if the noise variance is very low, the data will be taken very literally [63].
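To make the role of these hyperparameters concrete, a hedged sketch of fitting a GPR with an exponential kernel and reading back θ; the variable names X and y are assumed, not the thesis' own:

gpr = fitrgp(X, y, "KernelFunction", "exponential", "Standardize", true);
theta = gpr.KernelInformation.KernelParameters;   % [sigma_l; sigma_f]
sigmaN = gpr.Sigma;                               % noise standard deviation sigma_n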
3.5.3 Making Predictions with New Data
Before making predictions with new data, it is necessary to get the right input format. The
raw image data from the webcam needs to be preprocessed to contain only the ROI (see 3.3.1
Preprocessing the Image Data) and then features need to be extracted from the cropped
images (see 3.3.2 Creating Regression Data). Here, the size of the array with the extracted
feature data is n x 28 x 1, where n is the number of observations (rows), 28 is the number of
feature columns, and 1 is the feature dimension. After the feature data for prediction has
been created, the trained regression models can be loaded into the workspace. Both the
model (e.g. trainedModel.mat) and the script referenced by its prediction function predictFcn
(e.g. trainRegressionModel.m) must be within the same folder as the script
that calls them (here: regression_withInterp.mlx). The prediction can then be executed
using the following code:
yfit = trainedModel.predictFcn(predData);
The variable predData contains the aforementioned prediction data, and the variable yfit
will be filled with the predicted values. However, if the new images vary from the original
images, then the values of the newly extracted features will differ from the original training
features, which can result in erroneous predictions. Such a problem can be prevented by
retraining the model on the new data, using the following code:
[trainedModel, validationRMSE] = trainRegressionModel(tableProps);
The training function trainRegressionModel, which is stored in the same script as the
prediction function predictFcn, retrains the model on the dataset tableProps. This training
dataset has the size n x 29 x 1, where the first 28 columns contain the image features, and the
last column contains the response or true value. However, for vastly different image or
prediction data, it might be more sensible to train a completely different model, e.g. by using
the options All or All Quick-to-Train in the Regression Learner App (see 3.5.1 Regression
Learner App).
4. Results
4.1 Image Preprocessing
The routine from 3.3.1 Preprocessing the Image Data (section Sort, Crop, and Save Images)
was implemented to sort images into three different lighting conditions: daylight,twilight,
and night. In addition, the daylight images were also sorted by weather conditions: if the
conditions were good, the images would be cropped and saved to the folder Cropped, and if
the conditions were bad (e.g. snow or fog present), they would be sorted into the folder
BadWeather without the cropping.
The detection routine worked best for the detection of night images, where the outliers
consisted mainly of images taken during twilight, and sporadically images taken during the
day but with low lighting conditions (e.g. in winter during snowfall) (Fig. 52). Generally, the
number of correctly identified night images far outnumbered the falsely classified ones.
Fig. 52: Exemplary overview of the night images with outliers.
The detection of the twilight images (Fig. 53), on the other hand, performed worse. The folder
for the twilight images contained many images taken either during the night or daylight
hours; roughly estimated, between one-half and two-thirds of the images were
misclassified.
Fig. 53: Exemplary overview of the twilight images with outliers.
Compared to the detection of twilight images, the detection of daylight images classified
most of the images correctly; some outliers from the night or twilight images existed (Fig. 54).
Fig. 54: Exemplary overview of the cropped daylight images with outliers.
The detection of bad weather from daylight images worked well (Fig. 55); hardly any images
taken during good weather conditions and daylight hours were falsely sorted into this folder.
Fig. 55: Exemplary overview of the bad weather images with outliers.
4.2 Classification Task
The classification using deep learning was performed as described in 3.4.3 Classification
Task, where different model approaches and different data were used to solve the problem of
assigning images to classes.
The first approach used the original dataset with 43 classes, consisting of the cropped RGB
images, to train a pretrained GoogLeNet model. This resulted in a validation accuracy
of 38% and a loss greater than 2. A reduction of classes from 43 to 32 resulted in an improved
validation accuracy of 57%. These tests used the default training parameters, except for the
following parameters: the output network was set to return the model with the lowest
validation loss, the mini-batch size was increased from 128 to 256, the validation frequency
was reduced from 50 to 5, and the maximum epochs were reduced from 30 to 10.
Changing the model from the GoogLeNet to a ResNet18 and increasing the momentum from
0.9 to 0.95 resulted in a validation accuracy of 59.9888% and a validation loss of 1.0247,
which was slightly better than the previous result with the GoogLeNet. The same test was
repeated on a PC with a GPU supported by MATLAB (see Appendix III: PC Specifications and
MATLAB® Toolboxes). While the validation accuracy (60.7946%) and loss (0.9981) only improved
marginally, the computation time was reduced from over 17 hours to just under one hour.
Increasing the L2-Regularization to 0.01 resulted in a validation accuracy of 62.28%, while
using the rmsprop solver instead of the SGDM solver returned a validation accuracy of
64.45%. After cleaning up some falsely sorted data and reusing the same parameters as
before, the validation reached an accuracy of 69.91% (Fig. 56). Using only four sea state
classes (calm, smooth, slight, and moderate) and a mini-batch size of 128, the validation
accuracy reached 86.42% with a loss of 0.3792 (Fig. 57).
Fig. 56: Training plot for classification with normal images and 32 classes.
Fig. 57: Training plot for classification with normal images and four classes.
When edge images were used instead of the normal RGB images, the accuracy fell to 72.49%
with a loss of 0.73 for four classes (Fig. 58).
Fig. 58: Training plot for classification with edge images and four classes.
The best performance was achieved using 32 classes containing difference images, the
rmsprop solver, and the output network being the one that produces the lowest validation
loss, resulting in a validation accuracy of 98.42% with a loss of 0.0521 (Fig. 59).
Fig. 59: Training plot for classification with difference images and 32 classes.
4.3 Regression Task
4.3.1 Regression with Deep Learning
Three approaches were used for the regression with deep learning: a linear model, a
modified ResNet18, and a network with two inputs (see 3.4.4 Regression Task). The linear
model was the first attempted approach, however, it failed due to an error with the data
format. During the time the data format error was being resolved, regression was performed
in the Regression Learner App. When the data format error was solved, instead of trying the
linear model again, which had produced an RMSE so high that its value was displayed as N/A (not
available) on a test dataset provided by MATLAB, it was decided against testing this approach any
further. Instead, the other two approaches became the focal point of the regression with deep
learning.
Using the modified ResNet18 with the SGDM solver on normal RGB images, an RMSE of
25.319 was achieved. The rmsprop solver performed slightly worse, with an RMSE of 25.387,
and the adam solver performed slightly better, with an RMSE of 25.171; however, since these
RMSEs were predictions of true values in whole centimetres, they can all be rounded to an
RMSE of 25 cm, meaning that there is no significant difference between these values. While
all of these models were trained for over 1,000 iterations, they all reached RMSE and loss
plateaus after about 500 iterations (Fig. 60).

The first RMSE produced by the two-input network, which used 24 feature columns and
images as inputs, had a value of 240,829,600 cm (ca. 2.4 × 10⁸ cm). It was trained with the
SGDM solver and a mini-batch size of 64. Other attempts with this network performed
similarly. Using the expanded feature set (28 columns) and the Adam solver, the RMSE was
improved to 19,057 cm (ca. 1.9 × 10⁴ cm) for the final training iteration (Fig. 61).
Fig. 60: RMSE and loss for the training of a ResNet18 for regression.
Fig. 61: RMSE and loss for the training of a regression network with two inputs.
4.3.2 Regression with Machine Learning
The initial tests in the Regression Learner App focused on training all available models with
features (24 columns) generated from either difference or normal RGB images, for the
following datasets: wave heights, water levels during calm, smooth, slight, or moderate sea,
and all water levels. The lowest RMSEs that were achieved for training models without
setting aside data for tests are recorded in Tbl. 5 below; the values were rounded to the
nearest integer.
Tbl. 5: RMSE [in cm] for regression models without test data.

Dataset             Difference Images    Normal Images
All Wave Heights    –                    19
All Water Levels    –                    36
calm sea            –                    28
smooth sea          87                   36
slight sea          68                   35
moderate sea        25                   18
There were no runs done on difference images for all wave heights, all water levels, and the
water level during calm seas. For the first two, this is the case because, by the time these datasets
were passed to the models for training, it had already become clear from the results of the other
models that the normal RGB images outperform the difference images. While there is no
RMSE for training on difference images for the calm sea state without test data, there are
results for setting aside test data: the RMSE for the difference images was 38 cm higher than
for the RGB images.
The models that performed best were Gaussian Process Regression (GPR) with a Matern 5/2
kernel and medium neural networks. Using feature ranking algorithms to reduce the number
of features, PCA to reduce the dimensionality, and optimization to tune the
hyperparameters did not improve the results.
Expanding the initial feature data consisting of 24 columns with the properties of the GLCM
(graycoprops: contrast, correlation, energy, homogeneity) improved the performance of the
different regression models slightly (Tbl. 6). Training the models only on the graycoprops
increased the RMSE rather than decreasing it.
Tbl. 6: RMSEs and models for training on 28 feature columns.

Training Data                 RMSE [cm]    Best Model
Wave Heights                  18           Exponential GPR
Water Level, calm sea         26           Matern 5/2 GPR
Water Level, smooth sea       35           Exponential GPR
Water Level, slight sea       32           Exponential GPR
Water Level, moderate sea     18           Exponential GPR
4.4 Predictions with 2015 Dataset
4.4.1 Wave Height Prediction
The wave height prediction reached a mean RMSE of 11 cm for the whole dataset and a
maximum RMSE of 168 cm for an individual prediction. The distribution of the measured vs
predicted wave heights (Fig. 62) shows how accurately the prediction fitted the measured
values. The bin width was set to 5, so one bar equals a wave height range of 5 cm. It is
noticeable that for wave heights below 25 cm and above 100 cm the prediction remained
below the measured values, while the predictions were too high for measured wave heights
between 25 cm and 100 cm.
Fig. 62: Distribution of the measured (blue) vs predicted (orange) wave heights in 2015.
By plotting predicted against measured values and drawing the diagonal of the perfect
prediction through the scatter, it can be assessed how closely the regression predicted the
true values, and if it under- or over-predicted them. In Fig. 63 it is noticeable that for wave
heights below 20 cm, the prediction was higher than the true values, while wave heights
above 100 cm were under-predicted. This result agrees with the findings of the histogram
depicting the distribution of the wave heights (Fig. 62).
Fig. 63: Scatter plot of predicted vs measured wave heights in 2015.
Plotting the measured and predicted values over time (Fig. 64) shows that while the
regression under-predicted high and over-predicted low wave heights, the prediction
generally fitted the chronological progression of the measured wave heights.
Fig. 64: Measured (blue) and predicted (red) wave heights in 2015 plotted over time.
4.4.2 Water Level Prediction
The water level detection reached a mean RMSE of 26 cm for the whole dataset and a
maximum RMSE of 216 cm for a single observation. The distribution of the measured vs
predicted water levels (Fig. 65) shows how accurately the prediction fitted the measured
values. The bin width was set to 5, so one bar equals a water level range of 5 cm. It is
noticeable that water levels below 390 cm and above 600 cm were under-predicted by the
regression, while especially the range from 390 cm to 500 cm was over-predicted. For the
range from 500 to 600 cm, both over- and under-prediction occurred.
Fig. 65: Distribution of the measured (blue) vs predicted (orange) water levels in 2015.
Plotting the predicted and measured values over time (Fig. 66) confirms the findings from the
histogram (Fig. 65) that low water levels were over-predicted while high water levels were
under-predicted by the regression model. Fig. 66 shows the general placement of predicted
values against the measured values, but it does not offer information on how closely the
predicted tide curve follows the measured one. Therefore, a period of two weeks was
randomly chosen to assess the course of the tide curve (Fig. 67). It is noticeable that the
course of the predicted tide curve follows that of the measured curve quite closely, however,
it does not match the peaks and valleys (high- and low-tide) of the measured curve as well as
it does with the general course of the curve (mid-tide).
While this section looked at the results for all water levels, the following sections will take a
closer look at the water levels sorted by sea state, which were used to make the predictions.
Fig. 66: Measured (blue) and predicted (red) water levels in 2015 plotted over time.
Fig. 67: Measured (blue) and predicted (red) water levels for a two-week period in 2015.
Water Level for Calm Sea State
The water level detection for calm seas reached a mean RMSE of zero for the whole dataset
and a maximum RMSE of 127 cm for a single observation. The distribution of the predicted vs
measured water levels for calm sea conditions (Fig. 68) has a bin width of 50, meaning that
one bar has a water level range of 50 cm. While the water level between 500 and 550 cm
seems to have been predicted perfectly, the other ranges were either over- or
under-predicted by the regression model.
Fig. 68: Distribution of predicted (orange) vs measured (blue) water levels in 2015 during
calm sea conditions.
The regression plot (Fig. 69) shows that while there is scatter for the water levels below 500
cm and above 550 cm, most of the predictions for the range of 500 to 550 cm were perfect.
This reflects the distribution of the histogram in Fig. 68. In the plot over time (Fig. 70), where
the predicted and measured values are plotted at each measurement time point, the high
errors of some individual predictions are especially visible. However, it also shows that most
of the predictions for observations in early October during low tide were fairly accurate.
Fig. 69: Scatter plot of predicted vs measured water levels in 2015 during calm sea
conditions.
Fig. 70: Predicted (red) and measured (blue) water levels in 2015 during calm sea conditions
over time.
Water Level for Smooth Sea State
The water level detection for smooth seas reached a mean RMSE of 14 cm for the whole
dataset and a maximum RMSE of 210 cm for a single observation. Similar to the predictions
of all water levels and the water levels during calm sea conditions, the water levels during
smooth seas were also under-predicted for low- and high-tide, while the water levels during
mid-tide were mostly over-predicted by the regression model. This is reflected in their
distribution by water level (Fig. 71), their scatter plot (Fig. 72), as well as by their plot over
time (Fig. 73). What is noticeable is that the over-prediction for the mid-tide water levels was
prevalent mostly for water levels between 365 and 460 cm. For the rest of the mid-tide water
levels, whether the values were over- or under-predicted varied strongly with the range of
water levels they fell into.
Fig. 71: Distribution of predicted (orange) vs measured (blue) water levels in 2015 during
smooth sea conditions.
Fig. 72: Scatter plot of predicted vs measured water levels in 2015 during smooth sea
conditions.
Fig. 73: Predicted (red) and measured (blue) water levels in 2015 during smooth sea
conditions over time.
Water Level for Slight Sea State
The water level detection for slight seas reached a mean RMSE of 12 cm for the whole dataset
and a maximum RMSE of 216 cm for a single observation. The distribution of the predicted
and observed water levels for slight seas (Fig. 74) shows similarities to the previous
distributions, especially to that of smooth seas (Fig. 71). Low- and high-tide water levels were
under-predicted by the regression model, while mid-tide levels ranging from 400 to 500 cm
were over-predicted. For the mid-tide water levels above 500 cm, it depended on their
specific range in centimetres whether they were over- or under-predicted. Both the scatter
plot of the predicted vs actual water levels (Fig. 75), as well as the plot of the predicted and
measured values over time (Fig. 76), reflect this behaviour of data predicted by the regression
model.
Fig. 74: Distribution of predicted (orange) vs measured (blue) water levels in 2015 during
slight sea conditions.
Fig. 75: Scatter plot of predicted vs measured water levels in 2015 during slight sea
conditions.
Fig. 76: Predicted (red) and measured (blue) water levels in 2015 during slight sea conditions
over time.
Water Level for Moderate Sea State
The water level detection for moderate seas reached a mean RMSE of 1 cm for the whole
dataset and a maximum RMSE of 172 cm for a single observation. The dataset for moderate
seas contained only a few observations where the water level was below 500 cm, while most
of the observations were available for the range from 550 to 700 cm. The distribution of the
water levels (Fig. 77) shows that the water levels between 600 and 700 cm were
over-predicted, while the other ranges were under-predicted. The exception is the range from
450 to 500 cm, where the prediction was seemingly perfect.
Fig. 77: Distribution of predicted (orange) vs measured (blue) water levels in 2015 during
moderate sea conditions.
However, looking at the scatter plot of the predicted vs measured values (Fig. 78), the water
levels in the 450 to 500 cm range were not perfectly predicted, but over-predicted instead.
Similar to the other sea state models, water levels at the lower and upper end of the total
range of the given observations are over- or under-predicted, respectively. The plot of
observations over time (Fig. 79) reflects this as well.
Fig. 78: Scatter plot of predicted vs measured water levels in 2015 during moderate sea
conditions.
Fig. 79: Predicted (red) and measured (blue) water levels in 2015 during moderate sea
conditions over time.
4.5 Predictions with 2016 Dataset
4.5.1 Wave Height Prediction
The wave height detection reached a mean RMSE of 17 cm for the whole dataset and a
maximum RMSE of 154 cm for a single observation. The distribution of wave heights in 2016
(Fig. 80) shows that the lower wave heights were over-predicted by the regression model,
which even predicted physically impossible negative wave heights. For the ranges 40 to
45 cm, 55 to 60 cm, and 70 to 75 cm the predictions were seemingly perfect. The wave
heights above 60 cm were all under-predicted by the model, and above a wave height
of 175 cm, the model did not predict any values at all.
Fig. 80: Distribution of the measured (blue) vs predicted (orange) wave heights in 2016.
The scatter plot (Fig. 81) reflects this behaviour; the predictions of the high wave heights all
fall below the regression line, showing that the true values were under-predicted.
Furthermore, the predictions are scattered around the perfect prediction diagonal;
comparing this result to that of 2015, the 2016 prediction performed worse. The plot of the
predicted and measured values over time (Fig. 82) also shows that the predictions are
generally placed at lower wave heights than the measured values.
Fig. 81: Scatter plot of predicted vs measured wave heights in 2016.
Fig. 82: Measured (blue) and predicted (red) wave heights in 2016 plotted over time.
4.5.2 Water Level Prediction
The water level detection reached a mean RMSE of 36 cm for the whole dataset and a
maximum RMSE of 209 cm for a single observation. The distribution of the 2016 water levels
(Fig. 83) shows under-prediction for water levels ranging below 395 cm, while the water
levels above 395 cm and below 505 cm were over-predicted. In the range from 505 to 610 cm,
over- and under-prediction vary, for the range from 610 to 665 cm the values were
under-predicted, and most of the water levels above 665 cm were over-predicted by the
regression model.
Fig. 83: Distribution of the measured (blue) vs predicted (orange) water levels in 2016.
The plot over time for the whole observation period (Fig. 84) reflects this prediction
behaviour as well, since the lower water levels (< 400 cm) were not predicted correctly.
Looking at a randomly selected period of two weeks (Fig. 85), the tide curve of the predicted
values follows the real tide curve well for high- and mid-tide water levels but does not follow
the valleys of the low tides. In comparison, the result from 2015 followed the course of the
tide curve much more closely than the results for 2016 data presented here.
Fig. 84: Measured (blue) and predicted (red) water levels in 2016 plotted over time.
Fig. 85: Measured (blue) and predicted (red) water levels for a two-week period in 2016.
Water Level for Calm Sea State
The water level detection for calm seas reached a mean RMSE of 1 cm for the whole dataset
and a maximum RMSE of 169 cm for a single observation. The range of water levels between
400 and 550 cm recorded during calm conditions was strongly over-predicted by the
regression model, while the water levels below 400 and above 550 cm were under-predicted
(Fig. 86). Water levels below 350 and above 600 cm were not predicted at all.
Fig. 86: Distribution of predicted (orange) vs measured (blue) water levels in 2016 during
calm sea conditions.
In the scatter plot of predicted vs measured water levels (Fig. 87), the predictions are strongly
scattered around the regression line, and the point cloud seems to pass almost horizontally
through the plot. Lower values were over-predicted while higher values were
under-predicted. Interestingly, while the histogram shows that the water levels below 400 cm
were under-predicted, the scatter plot shows that they were over-predicted. Comparing the
plot of observations and predictions over time (Fig. 88) to the histogram and the scatter plot,
the lower water levels were over-predicted, which reflects the results of the scatter plot but
not that of the histogram.
Fig. 87: Scatter plot of predicted vs measured water levels in 2016 during calm sea
conditions.
Fig. 88: Predicted (red) and measured (blue) water levels in 2016 during calm sea conditions
over time.
Water Level for Smooth Sea State
The water level detection for smooth seas reached a mean RMSE of 22 cm for the whole
dataset and a maximum RMSE of 209 cm for a single observation. For the prediction of the
water levels during smooth sea conditions, the histogram (Fig. 89) shows over-prediction for
mid-tide ranges and under-prediction for both low- and high-tide ranges. Similar to the
results for calm seas, the scatter plot (Fig. 90) does not reflect the distribution in the
histogram exactly; while the histogram displays under-prediction for the low-tide water
levels, the scatter plot shows over-prediction. In comparison to the scatter plot of water
levels during calm sea conditions, the scatter plot here is more uniformly distributed around
the diagonal; however, this point cloud is broader and shorter than the one for the prediction
of the 2015 water levels for smooth seas. The plot over time (Fig. 91) reflects the findings of
the scatter plot (Fig. 90).
Fig. 89: Distribution of predicted (orange) vs measured (blue) water levels in 2016 during
smooth sea conditions.
Fig. 90: Scatter plot of predicted vs measured water levels in 2016 during smooth sea
conditions.
Fig. 91: Predicted (red) and measured (blue) water levels in 2016 during smooth sea
conditions over time.
Water Level for Slight Sea State
The water level detection for slight seas reached a mean RMSE of 12 cm for the whole dataset
and a maximum RMSE of 198 cm for a single observation. Similar to the aforementioned
water level predictions, the histogram plotting the distribution of the predicted vs measured
water levels (Fig. 92) shows under-prediction for low and high tide, and the mid-tide range is
subject to over-prediction. Here, water levels below 360 and above 680 cm were seemingly
not predicted by the model at all.
Fig. 92: Distribution of predicted (orange) vs measured (blue) water levels in 2016 during
slight sea conditions.
On the other hand, the scatter plot for the performance of predicted vs measured values (Fig.
93) shows that the lower measured values were over-predicted by the model; the placement
of the predictions in the plot over time (Fig. 94) reflects this behaviour as well. Therefore,
there is again a difference between the results presented by the distribution of ranges and
the results presented by comparing the predicted and measured values of an individual
observation to one another.
Fig. 93: Scatter plot of predicted vs measured water levels in 2016 during slight sea
conditions.
Fig. 94: Predicted (red) and measured (blue) water levels in 2016 during slight sea conditions
over time.
Water Level for Moderate Sea State
The water level detection for moderate seas reached a mean RMSE of 1 cm for the whole
dataset and a maximum RMSE of 146 cm for a single observation. The distribution for
moderate sea state water levels (Fig. 95) shows under-prediction for values below 600 cm and
over-prediction for the values above. Again, the scatter plot (Fig. 96) does not reflect this
distribution; there, the water levels above 650 cm were under-predicted while those below
600 cm were over-predicted. The plot over time (Fig. 97) agrees with the results from the
scatter plots, as the measured values below a water level of 600 cm were all miscalculated by
the regression model as higher than they were. On the other hand, the few observations of
water levels above 700 cm were all under-predicted by the regression model.
Fig. 95: Distribution of predicted (orange) vs measured (blue) water levels in 2016 during
moderate sea conditions.
Fig. 96: Scatter plot of predicted vs measured water levels in 2016 during moderate sea
conditions.
Fig. 97: Predicted (red) and measured (blue) water levels in 2016 during moderate sea
conditions over time.
5. Discussion
5.1 Image Preprocessing
The preprocessing step for the images performed well overall but still had weaknesses.
In particular, the performance for the correct detection of twilight images was low, which
resulted in daylight images being misclassified. In contrast to the misclassification of night
images as twilight images and vice versa, the misclassification of the daylight images is
problematic. The regression routine (see Appendix II: Regression with Interpolation)
automatically fetches the groyne images taken during daylight hours from the Cropped folder
and extracts features from them. However, if the daylight images are falsely sorted into
another folder, they will not be considered by the regression routine, resulting in a
smaller available dataset, which can affect the predictions negatively, as fewer relevant
features can be extracted. For wave heights and water levels that are generally
underrepresented in the dataset, this is a major problem, as they might end up not being
reflected in the feature data at all.
The previous approach from Reuter (2022) for sorting images by their lighting conditions
involved one ROI, the sky, from which the mean of the grey values was extracted and
compared to different thresholds for daylight, twilight, and night (see 2.2 Preliminary Work).
However, since moonlight or artificial light can result in a mean grey value of the sky being
higher than the threshold for daylight images, this approach tended to sort many images into
the wrong categories. Therefore, as described in 3.3.1 Preprocessing the Image Data (section
Sort, Crop, and Save Images), the detection routine was extended with two additional ROIs,
ground and ocean (see 3.3.1, Fig. 30), to increase the likelihood of an image being sorted
into the correct category.
Another previously tested method for the sorting algorithm was to use the times of sunrise
and sunset for Norderney and sort all images that were taken between sunrise and sunset into
the Daylight category. Unfortunately, this approach produced errors frequently, and the
image dataset had to be sorted into monthly, weekly, and even daily categories, as the script
would otherwise fail to complete the routine without errors. Additionally, since the UTC
offset needs to be taken into account, for a site like Norderney, where a time change
between CET (UTC+1) and CEST (UTC+2) is applied, the dataset generally needs to be sorted
into categories “during CET” and “during CEST”. Such a pre-sorting step is feasible; however,
it does introduce additional load to the pre-processing step. Furthermore, some images
produced an error for their correct UTC offset (e.g. UTCoff = 1), and sorting only worked
when its value was changed to that of the “false” offset (e.g. UTCoff = 2). While this
approach did not work as intended for the image dataset from Norderney, it might work
better for sites without a time change. However, since the sorting routine using sunrise and
sunset times was time-intensive and error-prone, it was ultimately disregarded as a
practicable option for sorting the webcam images from Norderney.
Fig. 98: Daylight image falsely sorted into the twilight category, taken on 4th February 2015,
10:30 UTC+1.
Above is an example of a daylight image that has been falsely sorted into the twilight
category (Fig. 98). In the image, the mean grey values of the sky and sea ROIs are high enough
to surpass the thresholds necessary to be sorted into the daylight category. However, there
are shadows cast by the buildings onto the boardwalk, which darken the area and in turn
produce a low mean grey value for the ground ROI. Since this value remains below the
threshold for daylight images, the image is misclassified as a twilight image. In the case of
Fig. 98, the mean of the ground ROI was roughly 24.7, while the minimum mean needed to
enter the daylight category is 30.
Daylight images taken under cloudy skies (Fig. 99) were another type frequently sorted into
the twilight category. For these images, the mean grey value of the sky
often fell below the threshold for daylight images, as the cloud cover (e.g. with rain clouds)
lowered the mean grey value of the sky. The reflection of the dark clouds or the cast of cloud
shadows on the sea surface also lowered the mean grey value of the ocean ROI, though not
enough on its own to cause the misattribution. In the case of Fig. 99, the mean value for the sky was
roughly 148.5, while the threshold for daylight images is 150.
Fig. 99: Daylight image with cloudy skies, falsely sorted into the twilight category, taken on
15th June 2015, 10:00 UTC+2.
A sensible way of dealing with the false sorting of daylight images taken during sunshine
could be the implementation of a second ground ROI which is not covered by the shadows.
However, finding such a specific ROI is difficult, as the shadows move during the day with the
position of the sun. Therefore, another method of improving the sorting results needs to be
applied.
Currently, the sorting algorithm uses an OR combination of the threshold checks on the mean values extracted from the three ROIs. However, it might be more sensible to check whether at least two thresholds for a category are reached (see the condition below, with the thresholds for the twilight class), and then sort the images based on that criterion. In the case of the two misclassified daylight images presented here (Fig. 98 + 99), the algorithm would have detected two of the daylight criteria as true and sorted them correctly. Using an AND combination of all three thresholds would also be a possibility, but this might cause too many images taken during daylight hours to be sorted into the wrong categories.
(meanSky < 150 && meanGround < 30) || (meanSky < 150 && meanOcean < 65) ||
(meanGround < 30 && meanOcean < 65)
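As a minimal sketch, the equivalent two-of-three vote for the daylight check could be wrapped into a small helper, using the thresholds named in the text (sky 150, ground 30, ocean 65); the function name and its integration into the sorting routine are illustrative assumptions.

% Two-of-three vote for the daylight check; thresholds from the text,
% function name and usage are illustrative.
function isDaylight = checkDaylight(meanSky, meanGround, meanOcean)
    votes = (meanSky >= 150) + (meanGround >= 30) + (meanOcean >= 65);
    isDaylight = (votes >= 2);   % at least two ROI criteria must be met
end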
The misclassification of twilight images as night images and vice versa is generally not much
of a problem, as the routine only uses daylight images. However, improvement of the
detection as aforementioned would open the possibility of using a larger dataset for
extracting features. For example, techniques such as contrast or low-light image
enhancement [64, 65] could be applied to the night and twilight images to transform them
into images usable by the feature extraction algorithm. The use of enhancement techniques, perhaps even in combination with deep learning [66], would not only make non-daylight images usable but could also serve as an additional step in the part of the routine that crops and saves the daylight images. With such a technique, the sorting algorithm would not need to detect every instance correctly; misclassified images could instead be enhanced to produce sufficient contrast between the groyne and its surroundings.
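As a minimal sketch, such an enhancement step could rely on the MATLAB low-light enhancement from [65]; the file name is illustrative, and imlocalbrighten requires the Image Processing Toolbox.

% Brighten a misclassified night or twilight image before cropping.
img = imread('night_image.jpg');   % illustrative file name
enhanced = imlocalbrighten(img);   % adaptively brightens dark regions
montage({img, enhanced})           % compare original and enhanced image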
An example of a night or twilight image that was falsely sorted into the daylight category is Fig. 100, which was considered a daylight image by the algorithm because all detection ROIs produced higher means than the thresholds needed for daylight images (meanSky 153.0 > 150, meanOcean 70.1 > 65, and meanGround 55.9 > 30). The ground has a relatively high mean value, likely due to the artificial lights that illuminate the boardwalk and, judging from the light at the bottom left of the image, due to building lights as well. While there are dark clouds close to the camera, there is still a lot of bright sky near the horizon, which will have produced a mean high enough to pass the daylight threshold. Lastly, since the ocean reflects the light from the sky on its surface, its mean is higher than the threshold as well. While the ROI for the ocean mean calculation was positioned outside of the surf zone, it should be noted that a large area of seafoam caused by breaking waves could cause images to be misclassified as well.
Fig. 100: Night image (le) wrongly classified as daylight image and cropped (le), taken
on 27th January 2015 at 18:00 UTC+1.
Applying low-light enhancement [65] to the misclassified image from Fig. 100 increases its brightness and contrast, for example to highlight the groyne. While the result for the original image (Fig. 101) looks promising, the result for the cropped image (Fig. 102) shows over-enhancement, mainly in pixel clusters at the top of the groyne, resulting in a less detailed, more pixelated image.
Fig. 101: Original (le) and low-light enhanced (right) night image.
Fig. 102: Original (le) and low-light enhanced (right) cropped night image.
Therefore, while enhancement as a post-processing step after cropping is possible, the correct sorting of the images by lighting conditions has the highest priority. Any improvement of the pre-processing step should aim to increase the number of correctly identified daylight images, as they provide the naturally occurring lighting conditions and contrast needed for feature extraction. The more daylight images are available for this, the better the results of the prediction should become.
5.2 Classification Problem
Deep learning models can work with unstructured data (e.g. images) and extract features independently; generally, they work better with large datasets. As the results in 4.2 show, the performance of the same model on four classes was 16.51 percentage points better than that for 32 classes (86.42% versus 69.91%), which makes sense, as there are more images available per individual class for four than for 32 total classes. With more images per class, the pool of data available for feature extraction is larger, allowing the model to generate more features for the identification of a class. Since the model can learn from more feature data per class, the classification performance increases.
Switching from a GoogLeNet to a ResNet18 improved the validation accuracy by nearly three percentage points (from 57% to 59.9888%). The GoogLeNet is a convolutional neural network (CNN) based on the Inception architecture [67], which allows the use of multiple filter sizes within a single block instead of being restricted to one filter size [68]. The ResNet18 uses residual blocks, which counteract the degradation problem, where networks converge and their accuracy saturates (compare 3.4.2 Network Example: ResNet18). The GoogLeNet likely performed slightly worse than the ResNet18 because the filter sizes of the pretrained Inception blocks did not match those that would have been ideal for the new input data; when a GoogLeNet is trained, it chooses its filter sizes depending on the input it receives [69].
The highest accuracy for 32 classes was achieved using difference images, which was 28.51 percentage points higher than the accuracy for the normal RGB images (98.42% versus 69.91%). Difference images, as described in 3.3.3 Preprocessing Image Data for Classification, section Image Types, only contain information that differs from other images (Fig. 103). Parts of an image that are the same in other images of the same class (combined in an average image) are blacked out, while the parts that are unique to the image remain brighter. This means that if the model is confronted with difference images from different classes, it can learn features from the differences only, rather than learning features from every part of the image, including those parts that contribute nothing to, or even hinder, the differentiation between the classes.
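As a minimal sketch, a difference image can be computed by subtracting the class-average image from each image so that only the parts unique to the image stay bright; the file names and the grayscale conversion are illustrative assumptions.

% Difference image: image minus the per-class average image.
avgImg  = im2gray(imread('average_calm.png'));   % per-class average image
img     = im2gray(imread('calm_example.jpg'));   % image to transform
diffImg = imabsdiff(img, avgImg);                % absolute pixel difference
imshow(diffImg)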
Fig. 103: Two difference images taken during different conditions.
On the other hand, edge images performed 13.93 percentage points worse than the RGB images for the classification problem with four classes. Looking at the edge images in Fig. 104, which were taken during calm (left) and moderate (right) seas, the detected edges do not reflect these differences well. Since the edges were detected from the RGB images, every edge the edge filter can find is included, even those that are the same in all images regardless of the water level or sea state. The major problem with edge images is that edge filters find the most edges in images with high contrast, even if these edges do not match the borders a human reviewer would consider to be edges.
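As a minimal sketch, such edge images can be produced with the MATLAB edge detector [30]; the detector choice ('Sobel') and the file name are assumptions.

% Edge image: binary map of high-contrast boundaries in the frame.
gray  = im2gray(imread('frame.jpg'));   % illustrative file name
edges = edge(gray, 'Sobel');            % detects only high-contrast edges
imshow(edges)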
Fig. 104: Two edge images taken during different conditions.
In Fig. 103 - 105, the left images were all taken on the 2nd August 2015 at 6:30 UTC+2 during calm seas with a water level below 350 cm, while the right images were taken on the 30th March 2015 at 15:50 UTC+2 during moderate seas with a water level above 700 cm.
Fig. 105: RGB images which the edge images in Fig. 104 were derived from.
The le difference image (Fig. 103) has barely any spots that stand out against the
background, which means that there is little variation in this image class. For images taken
during calm seas and low water levels, this makes sense, as there are barely any dynamic
elements present in the pictures. The right difference image (Fig. 103) on the other hand has
many bright spots, especially at the border between the groyne and its surroundings, and in
the surf zone. Given that the image class contains images taken during moderate seas and
high water levels, there is a high intra-class variation, as the waves introduce a highly
dynamic element to the picture. The edge images (Fig. 104) however do not reflect the
overall class properties as well. It would be expected that few edges are detected for calm
seas and many for rougher seas, however, as the contrast is the deciding factor for an edge
being detected, the waves in the right image were not detected as edges. For the
classification, this is a problem, as there is a high chance that the feature “few edges present”
is interpreted as “no or few waves present”, resulting in misclassification of the images, as
the edges are the only element to derive features from.
The normal RGB images (Fig. 105) offer the largest number of elements for the models to derive features from. However, as the relevant elements are not highlighted the way they are in the difference images, the models can derive redundant features or even features that introduce error to the prediction. The result is that difference images perform best, RGB images second best, and edge images worst, a ranking that reflects how well each image type isolates the relevant information.
5.3 Regression Problem
5.3.1 Regression with Deep Learning
Between the modified ResNet18 and the network with two inputs, the former performed better (see 4.3.1 Regression with Deep Learning). The main reason for this is most likely that the modified ResNet18 retained most of its pretrained layers, meaning that the network did not need to be trained from scratch. The double-input network, by contrast, was created from scratch, which means that it had to learn features first before being able to make good predictions. Furthermore, the double-input network was trained for only 15 epochs with a total of 3,810 iterations and a training time of almost 127 minutes. Given that deep learning networks are usually trained for much longer (weeks or even months [70]), it might be appropriate to increase the number of training epochs. The RMSE of the double-input network had already been reduced from ca. 17×10⁶ cm to 1.9×10⁴ cm by the final iteration, so it is plausible that the result might improve even further with a longer training time.
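As a minimal sketch, a longer training schedule could be configured via MATLAB's trainingOptions; the solver choice and epoch count below are assumptions, not the settings used in this thesis.

% Extend the training schedule for the double-input network.
options = trainingOptions('adam', ...
    'MaxEpochs', 60, ...                % e.g. four times the original 15
    'Shuffle', 'every-epoch', ...
    'Plots', 'training-progress');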
While the ResNet18 did perform significantly better than the network with image and feature inputs, the remaining issue is that its RMSE began to plateau after 500 of almost 3,000 iterations. The open question is what caused the RMSE to saturate. One candidate is the network itself, as its layers perhaps did not detect the optimal features for the predictions. Another candidate is the data itself, in the sense that it might not provide enough information for a better prediction. This, however, seems unlikely, as the machine learning approaches from the Regression Learner App did produce lower RMSEs than the ResNet18. Therefore, it seems more likely that the limitation resulting in the plateau lies with the structure or parameters of the ResNet18 rather than the data itself. However, it is beyond the scope of this study to examine the entire ResNet18 and the way it deals with the given data in order to find the ideal network structure and configuration for this specific dataset.
5.3.2 Regression with Machine Learning
Looking at the results from regression with machine learning approaches and comparing them to the results using deep learning, it is noticeable that the ML approach performed better for wave height detection than the DL approach did. A possible explanation is the format of the input data the models receive and the way it is used for training. While the ML models are provided with feature data that was previously selected and extracted by the user, the DL networks have to find and extract relevant features by themselves. The automatic extraction of features reduces the workload for the user, but it is also a point where errors can be introduced early on. If the DL network extracts features that are redundant or that even negatively affect the prediction, then these self-selected features can result in worse performance than features selected by the user.
Furthermore, the ML models that produced the lowest training RMSEs in the Regression Learner App are all based on Gaussian process regression (GPR). This type of regression is known for its reduced uncertainty at and around the observations (compare 3.5.2 Model Example: Gaussian Process Regression (GPR)), which means that every set of features with a known response improves the prediction function, increasing the likelihood of new data being predicted correctly due to its similarity to data the model had previously been trained on.
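As a minimal sketch, such a GPR model can also be trained outside the Regression Learner App with fitrgp from the Statistics and Machine Learning Toolbox; the dummy data stands in for the 28-column feature array and its responses.

% Exponential-kernel GPR on illustrative feature data.
X = rand(100, 28);                    % stand-in for the 28 feature columns
y = rand(100, 1);                     % stand-in for the measured responses
mdl  = fitrgp(X, y, 'KernelFunction', 'exponential');
pred = predict(mdl, rand(10, 28));    % predictions for new feature rows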
Since the prediction of the water level during calm seas was the only GPR model that used the Matérn 5/2 kernel function of the Matérn class, rather than the exponential kernel used by the other models, it can be inferred that its underlying stochastic process is smoother than that of the others. The rate of decay of the power spectrum S(s) gives important information about the smoothness of the associated stochastic process, while the rate of decay of the eigenvalues reveals important information about kernel smoothness. Since “rougher” processes have more power at high frequencies, their eigenvalue spectrum decays more slowly; this is reflected in the power spectrum of the Matérn class (Rasmussen & Williams, 2005c).
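For reference, the Matérn 5/2 covariance function as given in Rasmussen & Williams (2005c), here written with a signal variance $\sigma_f^2$, a characteristic length scale $\ell$, and $r$ the distance between two inputs:

$$k_{5/2}(r) = \sigma_f^2 \left(1 + \frac{\sqrt{5}\,r}{\ell} + \frac{5 r^2}{3 \ell^2}\right) \exp\!\left(-\frac{\sqrt{5}\,r}{\ell}\right)$$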
The features used for the regression task were inferred from properties of the grey-level co-occurrence matrix (GLCM); here, the mean, range, and variance as well as the so-called “graycoprops” contrast, correlation, energy, and homogeneity were used. For the 8-by-8 GLCMs, the first three features returned n-by-8 arrays, while the latter four returned n-by-1 vectors, resulting in an n-by-28 feature array. The mean returned the average value for each column of the GLCM, the range returned the difference between the minimum and maximum value of each column, and the variance returned how far the values in each column were spread around their mean. The contrast measures the local variations in the GLCM, the correlation measures the joint probability of occurrence of the specified pixel pairs, the energy returns information about the uniformity of the GLCM, and the homogeneity measures the closeness of the distribution of elements in the GLCM to the GLCM diagonal [71].
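As a minimal sketch, this 28-column feature row can be assembled per image as follows; the exact graycomatrix settings (offsets, symmetry) and the file name are assumptions.

% 28 features: column-wise mean, range, and variance of the 8-by-8 GLCM
% plus the four graycoprops.
gray  = im2gray(imread('cropped_groyne.jpg'));   % illustrative file name
glcm  = graycomatrix(gray, 'NumLevels', 8);      % 8-by-8 co-occurrence matrix
props = graycoprops(glcm);                       % contrast, correlation, ...
featureRow = [mean(glcm), range(glcm), var(glcm), ...
              props.Contrast, props.Correlation, ...
              props.Energy, props.Homogeneity];  % 3*8 + 4 = 28 columns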
Other feature options tested within the scope of this thesis were: (1) only the graycoprops, (2) only mean, range, and variance, (3) the mean of the mean values, the range of the ranges, and the variance of the variances, and (4) the mean, range, and variance together with the standard deviation, the RMS, the mode (most frequent values), the minimum, and the maximum. All of these performed significantly worse than the 28 feature columns described above. While no further features or feature combinations were tested, it would be sensible for further research to focus on which features should be added as model inputs, as the image features have a higher influence on the model performance than its parameters. Furthermore, the extraction of features with an algorithm, or perhaps even a deep learning approach, is faster to implement than optimisation, as calculating the ideal hyperparameters can be very time-intensive. Other texture features that could improve the predictions when included are the global and local entropy of a grayscale image, and the local range and standard deviation of an image [72]; a sketch of these follows below.
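As a minimal sketch of those additional texture features [72]; all four functions are part of the Image Processing Toolbox, and the file name is illustrative.

% Additional texture features: global/local entropy, local range, local std.
gray = im2gray(imread('cropped_groyne.jpg'));
globalEntropy = entropy(gray);       % global entropy of the grayscale image
localEntropy  = entropyfilt(gray);   % local entropy map
localRange    = rangefilt(gray);     % local range map
localStd      = stdfilt(gray);       % local standard deviation map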
In the scope of this thesis, the models were trained on the 2015 dataset, and predictions were
done for both the 2015 and 2016 datasets, separately. Comparing the results from 4.4
Predictions with 2015 Dataset and 4.5 Predictions with 2016 Dataset to one another, it is
noticeable that the 2016 dataset was not predicted as well by the models as the 2015 dataset. This was most visible in the scatter plots, where the point clouds were spread rather widely around the regression line for the 2016 predictions, while those for the 2015 predictions deviated less from the regression line. This result is not surprising, as the models
were trained on the 2015 dataset, which means that the features extracted from the 2016
images did not match the features the models had been trained on as closely as those
extracted from the 2015 images. While it might seem surprising that the water levels and
wave heights from the 2015 dataset were not predicted with 100% accuracy and an RMSE of
zero, it does make sense. During training, it is not the goal to find a function for the GPR that
fits the given observations with an accuracy of 100%, as this would mean that any data that
does not match the training data exactly would be miscalculated, which presents an
overfitting problem. If the models were able to predict the water levels and wave heights of
the 2015 dataset with an accuracy of 100%, it would be an indicator that the trained model
has specialised itself to only produce correct results with the exact features it had been
trained on. Therefore, the results from the 2015 and 2016 datasets show that the model is
general enough to work with similar (e.g. same site and angle of view) but not the same (e.g.
weather and wave conditions) new data.
The preliminary work by Reuter (2022) achieved an RMSE of 38.7 cm for the water level
prediction for the time frame from the 1st of January 2015 to the 14th of October 2016. Here,
an RMSE of 26 cm for the 2015 dataset and an RMSE of 36 cm for the 2016 dataset were
achieved. If the mean of these two RMSEs is computed without taking normalisation over the
number of observations (the 2015 dataset had more observations than that of 2016) into
account, then an RMSE of 31 cm for the water level detection was achieved. However, if the
models had been trained on the full dataset, rather than only the 2015 dataset, the RMSE
might have been even lower. It should be noted however that the models for water level
predictions were split by sea state. For the 2015 data, the prediction of water levels for calm seas performed best, with an RMSE of 0 cm, while the prediction for smooth seas, with an RMSE of 14 cm, performed worst. For the 2016 data, the lowest RMSE was 1 cm for calm seas and the highest was 22 cm for smooth seas.
For the wave height prediction in the preliminary work, the coefficient of determination R² = 0.238 was given instead of an RMSE. Since the model for the wave height prediction in this thesis was retrained a few times, the last available value for R² from the Regression Learner App was R² = 0.65 with an RMSE of 18 cm (the current RMSE for wave height prediction is 11 cm) for the 2015 dataset. R-squared indicates how much of the variation of a dependent variable is explained by the independent variable(s) in a regression model, and ranges from 0 (no variation is explained) to 1 (all variation is explained) [73]. This means that for Reuterʼs approach, 23.8% of the predicted wave heights were within the variance of the measured wave heights, while for the approach presented in this thesis at least 65% of the predicted values are found within the variance of the measured values. Comparing the scatter plots of the wave height predictions (Fig. 106 + 107), it is noticeable that the predictions with Reuterʼs approach are spread more widely around the regression line than the predictions achieved with the approach presented in this thesis.
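For reference, a minimal definition of the coefficient of determination [73], with $y_i$ the measured values, $\hat{y}_i$ the predicted values, and $\bar{y}$ the mean of the measurements:

$$R^2 = 1 - \frac{\sum_i (y_i - \hat{y}_i)^2}{\sum_i (y_i - \bar{y})^2}$$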
Fig. 106: Scatter plot for water levels from 2015 and 2016 (Reuter, 2022).
Fig. 107: Scatter plots for water levels from 2015 (le) and 2016 (right).
6. Conclusion
This master's thesis built routines that make image data usable for regression problems. Two main routines were created: one for pre-processing the images and one for performing regression using a machine-learning approach.
The pre-processing routine (Appendix I) sorts the raw images into different categories
depending on the lighting conditions present in the images and crops the images taken
during daylight hours to the ROI, which is the right groyne in the images. In addition, a
linking table that connects the images with the measured data is created. The linking table
contains the paths to the images and their timestamps, the water levels measured at the
same time, measured and interpolated wave heights, lighting conditions, and whether or not
the beach is visible in the images. Furthermore, gaps are removed from the measured data.
Pre-processing is needed as it reduces the image to the ROI only, rather than including more
information than necessary to predict the water levels and wave heights correctly.
The regression routine (Appendix II) utilises the cropped daylight images from the
pre-processing step to extract features for the regression task. The RGB images are converted
to grayscale, then their grey-level co-occurrence matrix (GLCM) is computed, and features are
derived from this matrix. For training purposes, it is possible to combine the features with a
response variable (wave height or water level) to retrain the model on new data. This routine
first predicts the wave heights from the image features and then predicts the water levels with the model for the sea state category assigned to the images based on the predicted wave height. Since waves can be interpreted as noise on the water levels, the regression models were split by sea state to improve the results of the water level prediction.
As described in the previous chapters, both the pre-processing and the regression routines
used in this thesis performed better than the routines from the preliminary work. While there
were outliers in the sorting step of the pre-processing routine, most of the images were
correctly sorted by their lighting conditions. However, further improvement of this step could
produce better results for feature detection, and in turn for the regression task. The wave
height detection from the approach of the preliminary work resulted in a coefficient of determination R² of 0.238, which means that 23.8% of the predicted wave heights were within the variance of the measured wave heights. The last available value for R² of the wave height prediction in this thesis was 0.65 at an RMSE of 18 cm, which means that 65% of the predicted values were found within the variance of the measured values; however, the currently best result for the calculation of the wave height was an RMSE of 11 cm, meaning that the R² value might be even higher for the dataset.
The preliminary work achieved an RMSE of 39 cm for the calculation of the water levels
between the 1st of January 2015 and the 14th of October 2016. The approach from this thesis
achieved an RMSE of 26 cm for the 2015 dataset and an RMSE of 36 cm for the 2016 dataset.
However, these values do not reflect the performance of the models accurately, as there is not only one model that predicts all water levels; rather, there are four models which predict the water level sorted by the predicted wave height. For the 2015 dataset, the models returned the following RMSEs: 0 cm for calm seas, 14 cm for smooth seas, 12 cm for slight seas, and 1 cm for moderate seas. The following RMSEs were achieved on the 2016 dataset: 1 cm for calm seas, 22 cm for smooth seas, 12 cm for slight seas, and 1 cm for moderate seas.
Therefore it can be concluded that the approach for the water level prediction presented in
this thesis produced more accurate results than the approach from the preliminary work.
Gaussian process regression (GPR) is used in machine learning (ML), which is part of the
broader category of artificial intelligence (AI). Therefore, GPR can be considered an AI-based
approach in the broader sense. Features were extracted from the images and combined with
measured parameters to create training data for the models. Using feature extraction, the
models are capable of predicting water levels and wave heights from images. Since the
model trained on the 2015 dataset did not produce drastically higher errors when confronted with the 2016 dataset, it generalises to new images from the same camera view. While outliers produce high individual errors, the global RMSEs of the models are low enough that they are usable for realistic applications. A pre-processing routine for the images was created, and the option to retrain a model is provided in the regression script.
Compared to other state-of-the-art routines, the approach presented here includes the pre-processing step, which is necessary to improve the data selection the feature extraction draws on. Furthermore, many approaches in the coastal engineering and maritime science fields do not rely on webcam images for the parameters they aim to analyse but use satellite or Lidar images instead. While these images contain information the webcam images cannot provide, they are expensive to procure and their temporal resolution is lower than that of the webcam images. In addition, the approach presented here detects both water levels and wave heights; many of the reviewed papers focus on only one of the two parameters. Properties of the GLCM, in combination with its mean, range, and variance, were used as image features for the regression task.
The findings of this thesis are useful for applications where the goal is to predict or detect parameters from images and where measurement data for those parameters is available as well.
Furthermore, while the models of the final regression routine are based on ML and GPR, the
detailed description of the building and training of the deep learning (DL) approaches makes
it possible to recreate these models for regression or classification tasks.
Future research that may build on this thesis should focus on how the presented results
could be improved further, as already discussed in the previous chapter. The pre-processing
routine could be improved to correctly detect more daylight images, as they are the base of
the feature extraction. It should be considered which additional features might improve the
prediction performance, to lower the RMSEs further. While the DL approaches were not used
for the final routines, it should be considered if there are parts in the routine that could make
use of DL, e.g. enhancement of twilight images to increase the available data volume for
feature extraction. Furthermore, it should be tested if the RMSE of the dual-input network
could improve over more iterations, or how the RMSE plateau of the ResNet18 could be
prevented.
It should be noted that it was beyond the scope of this thesis to create a model that can deal
with images from other sites or with a different angle of view without the need for retraining
the model on the new data. Finally, morphological changes were not the subject of this thesis
as webcam images can only detect the presence of changes but are unable to quantify them.
References
Scientific Publications
Behre, K.-E., & van Lengen, H. (1995). Ostfriesland: Geschichte und Gestalt einer Kulturlandschaft (p. 173). Aurich: Ostfries. Landschaft. https://d-nb.info/943717205.
Chaudhary, P., DʼAronco, S., Leitão, J. P., Schindler, K., & Wegner, J. D. (2020). Water level
prediction from social media images with a multi-task ranking approach. ISPRS
Journal of Photogrammetry and Remote Sensing,167, 252–262.
https://doi.org/10.1016/j.isprsjprs.2020.07.003.
Chaudhary, P., DʼAronco, S., Moy de Vitry, M., Leitão, J. P., & Wegner, J. D. (2019). Flood-water
level estimation from social media images. ISPRS Annals of the Photogrammetry,
Remote Sensing and Spatial Information Sciences,IV-2/W5, 5–12.
https://doi.org/10.5194/isprs-annals-iv-2-w5-5-2019.
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., & Li, F.-F. (2009, June). ImageNet: A large-scale
hierarchical image database. 2009 IEEE Conference on Computer Vision and Pattern
Recognition.http://dx.doi.org/10.1109/cvpr.2009.5206848.
Ebden, M. (2008, August). Gaussian processes: A quick introduction. arXiv.Org.
https://arxiv.org/abs/1505.02965.
Goldstein, E. B., Coco, G., & Plant, N. G. (2019). A review of machine learning applications to
coastal sediment transport and morphodynamics. Earth-Science Reviews,194,
97–108. https://doi.org/10.1016/j.earscirev.2019.04.022.
Guo, S., Zhang, Y., & Liu, Y. (2020). A water-level measurement method using sparse
representation. Automatic Control and Computer Sciences,54(4), 302–312.
https://doi.org/10.3103/s0146411620040069.
Gutierrez, B. T., Plant, N. G., Thieler, E. R., & Turecek, A. (2015). Using a Bayesian network to
predict barrier island geomorphologic characteristics. Journal of Geophysical
Research: Earth Surface,120(12), 2452–2475. https://doi.org/10.1002/2015jf003671.
Hies, T., Parasuraman, S. B., Wang, Y., Duester, R., Eikaas, H. S., & Meng, T. K. (2012). Enhanced
water-level detection by image processing. 10th International Conference on
Hydroinformatics.
https://www.researchgate.net/publication/262337135_Enhanced_water-level_detect
ion_by_image_processing.
Hipwell, B., & Alexander, L. (2022, January 18). An introduction to AI. Reddie &amp; Grose LLP.
https://www.lexology.com/library/detail.aspx?g=4fa5406f-4533-4669-97ab-03b81e3f
d71a.
Hoonhout, B. M., Radermacher, M., Baart, F., & van der Maaten, L. J. P. (2015). An automated
method for semantic classification of regions in coastal images. Coastal Engineering,
105, 1–12. https://doi.org/10.1016/j.coastaleng.2015.07.010.
Ioffe, S., & Szegedy, C. (2015). Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. Journal of Machine Learning Research, 37.
https://proceedings.mlr.press/v37/ioffe15.html.
Jäger, W. S., Christie, E. K., Hanea, A. M., den Heijer, C., & Spencer, T. (2018). A Bayesian
network approach for coastal risk analysis and decision making. Coastal Engineering,
134, 48–61. https://doi.org/10.1016/j.coastaleng.2017.05.004.
Jörges, C., Berkenbrink, C., & Stumpe, B. (2021). Prediction and reconstruction of ocean wave
heights based on bathymetric data using LSTM neural networks. Ocean Engineering,
232, 109046. https://doi.org/10.1016/j.oceaneng.2021.109046.
Kunz, H. (1991, May 20). Artificial beach nourishment on Norderney, a case study. Coastal
Engineering 1990.http://dx.doi.org/10.1061/9780872627765.249.
Mittelstaedt, E. (2003). Nord- und Ostsee: Gezeiten, Strömungen, Wasserschichtung. In
Nationalatlas Bundesrepublik Deutschland / Band 2 Natur und Umwelt I: Relief,
Boden und Wasser (pp. 118–119). Elsevier, Spektrum, Akad. Verl.
http://archiv.nationalatlas.de/wp-content/art_pdf/Band2_118-119_archiv.pdf.
Niemeyer, H. D. (1995, August 11). Long-Term morphodynamical development of the East
Frisian islands and coast. Coastal Engineering 1994.
http://dx.doi.org/10.1061/9780784400890.176.
Niemeyer, H. D., Kaiser, R., & Knaack, H. (1997, August 5). Effectiveness of a combined beach
and shoreface nourishment on the Island of Norderney/East Frisia, Germany. Coastal
Engineering 1996.http://dx.doi.org/10.1061/9780784402429.360.
Plomaritis, T. A., Costas, S., & Ferreira, Ó. (2018). Use of a Bayesian Network for coastal
hazards, impact and disaster risk reduction assessment at a coastal barrier (Ria
Formosa, Portugal). Coastal Engineering,134, 134–147.
https://doi.org/10.1016/j.coastaleng.2017.07.003.
Poelhekke, L., Jäger, W. S., van Dongeren, A., Plomaritis, T. A., McCall, R., & Ferreira, Ó. (2016).
Predicting coastal hazards for sandy coasts with a Bayesian Network. Coastal
Engineering,118, 21–34. https://doi.org/10.1016/j.coastaleng.2016.08.011.
Rasmussen, C. E., & Williams, C. K. I. (2005a). Chapter 1: Introduction. In Gaussian Processes
for Machine Learning. MIT Press. http://gaussianprocess.org/gpml/chapters/RW1.pdf.
Rasmussen, C. E., & Williams, C. K. I. (2005b). Chapter 2: Regression. In Gaussian Processes for
Machine Learning. MIT Press. http://gaussianprocess.org/gpml/chapters/RW2.pdf.
Rasmussen, C. E., & Williams, C. K. I. (2005c). Chapter 4: Covariance Functions. In Gaussian
Processes for Machine Learning. MIT Press.
http://gaussianprocess.org/gpml/chapters/RW4.pdf.
Rasmussen, C. E., & Williams, C. K. I. (2005d). Chapter 5: Model Selection and Adaptation of
Hyperparameters. In Gaussian Processes for Machine Learning. MIT Press.
http://gaussianprocess.org/gpml/chapters/RW5.pdf.
Rasmussen, C. E., & Williams, C. K. I. (2005e). Chapter 7: Theoretical Perspectives. In Gaussian
Processes for Machine Learning. MIT Press.
http://gaussianprocess.org/gpml/chapters/RW7.pdf.
Reuter, B. (2022). Analysis of Webcam Footage from Norderney Beach [Bachelorʼs Thesis].
Ludwig-Franzius-Institut Hannover.
Schlütz, F., Enters, D., & Bittmann, F. (2021). From dust till drowned: The Holocene landscape
development at Norderney, East Frisian Islands. Netherlands Journal of Geosciences,
100(7). https://doi.org/10.1017/njg.2021.4.
Thorenz, F., & Blum, H. (2011). Implementing coastal defence strategies for sandy coasts -
reinforcement of the Norderney dune revetment. 5th SCACR 2011 International Short
Conference on Applied Coastal Research, 238–243.
https://www.researchgate.net/publication/358834544_Implementing_coastal_defen
ce_strategies_for_sandy_coasts_-_reinforcement_of_the_Norderney_dune_revetme
nt.
Vandaele, R., Dance, S. L., & Ojha, V. (2021). Automated water segmentation and river level
detection on camera images using transfer learning. In Lecture Notes in Computer
Science (pp. 232–245). Springer International Publishing.
http://dx.doi.org/10.1007/978-3-030-71278-5_17.
Wang, J. H., Li, G., Xiong, Y. Z., & Liu, K. K. (2013). Method of Detecting Sea State from Image
Taken by Autonomous Surface Vehicle. Applied Mechanics and Materials,316–317,
475–478. https://doi.org/10.4028/www.scientific.net/amm.316-317.475.
Weblinks
[1] World Meteorological Organization. (2021, January 14). 2020 was one of three warmest
years on record. World Meteorological Organization. Retrieved January 29, 2023, from
https://public.wmo.int/en/media/press-release/2020-was-one-of-three-warmest-yea
rs-record.
[2] United Nations. (n.d.). What is climate change? United Nations. Retrieved January 29,
2023, from https://www.un.org/en/climatechange/what-is-climate-change/.
[3] Lindsey, R. (2022). Climate change: Global sea level. In NOAA Climate.gov. Retrieved
January 29, 2023, from
https://www.climate.gov/news-features/understanding-climate/climate-change-glob
al-sea-level.
[4] tagesschau. (2022, February 20). Neues Sturmtief im Anmarsch: Auf “Ylenia” und “Zeynep” folgt “Antonia”. Tagesschau.De. Retrieved January 29, 2023, from
https://www.tagesschau.de/inland/gesellscha/sturm-antonia-101.html.
[5] Hannoversche Allgemeine Zeitung. (2022, February 19). Nordsee-Inseln: Sturm “Zeynep”
spült Strände fast komplett weg. Hannoversche Allgemeine Zeitung.
https://www.haz.de/der-norden/nordsee-inseln-sturm-zeynep-spuelt-straende-fast-k
omplett-weg-7NYUVODMNRC4NPXRDA6JPAQYKI.html.
[6] Niedersächsischer Landesbetrieb für Wasserwirtschaft, Küsten- und Naturschutz
(NLWKN). (2011). Küstenschutz für die Insel Baltrum: Umgestaltung des Deckwerkes
am Nordstrand. Retrieved January 17, 2023, from
http://www.baltrum-online.de/pdf/Broschuere_Kuestenschutz_Baltrum.pdf.
[7] NLWKN. (July 12, 2019). Strandaufspülung auf Norderney: Maßnahmenstart in der
nächsten Woche. Nds. Landesbetrieb Für Wasserwirtschaft, Küsten- Und Naturschutz.
Retrieved January 17, 2023, from
https://www.nlwkn.niedersachsen.de/startseite/aktuelles/presse_und_offentlichkeit
sarbeit/pressemitteilungen/strandaufspulung-auf-norderney-massnahmenstart-in-d
er-nachsten-woche-178779.html.
[8] NLWKN. (2019, August 21; last updated on 23-Aug-2019). Strandaufspülung auf
Norderney abgeschlossen. Nds. Landesbetrieb Für Wasserwirtschaft, Küsten- Und
Naturschutz. Retrieved January 17, 2023, from
https://www.nlwkn.niedersachsen.de/startseite/aktuelles/presse_und_offentlichkeit
sarbeit/pressemitteilungen/strandaufspulung-auf-norderney-abgeschlossen-179887.
html.
[9] Aunkofer, B. (2018, May 14). Machine Learning vs Deep Learning - Wo liegt der Unterschied?
Data Science Blog. Retrieved January 21, 2023, from
https://data-science-blog.com/blog/2018/05/14/machine-learning-vs-deep-learning-
wo-liegt-der-unterschied/.
[10] Sharma, R. (2021, January 29). The ultimate data science cheat sheet every data scientists
should have. UpGrad. Retrieved January 21, 2023, from
https://www.upgrad.com/blog/data-science-cheat-sheet/.
[11] Schroer, A. (2022, September 19). What is artificial intelligence (AI)? How does AI work?
Built In. Retrieved January 21, 2023, from https://builtin.com/artificial-intelligence.
[12] Wuttke, L. (2022, April 26). Machine Learning: Algorithmen, Methoden und Beispiele.
Datasolut GmbH. Retrieved January 21, 2023, from
https://datasolut.com/was-ist-machine-learning/.
[13] Wuttke, L. (2021, October 30). Unsupervised Learning: Definition, Arten & Beispiele -
Datasolut Wiki. Datasolut GmbH. Retrieved January 21, 2023, from
https://datasolut.com/wiki/unsupervised-learning/.
[14] Wuttke, L. (2022, April 27). Reinforcement Learning: KI reagiert auf Belohnungen.
Datasolut GmbH. Retrieved January 21, 2023, from
https://datasolut.com/reinforcement-learning/.
[15] Wuttke, L. (2021, October 30). Supervised Learning: Definition, Arten & Beispiele -
Datasolut Wiki. Datasolut GmbH. Retrieved January 21, 2023, from
https://datasolut.com/wiki/supervised-learning/.
[16] IBM. (n.d.). Was sind neuronale Netze? IBM. Retrieved January 21, 2023, from
https://www.ibm.com/de-de/topics/neural-networks.
[17] Libretexts. (2018, June 11). 1.3: Sigmoid neurons. Libretexts. Retrieved January 21, 2023,
from
https://eng.libretexts.org/Bookshelves/Computer_Science/Applied_Programming/B
ook:_Neural_Networks_and_Deep_Learning_(Nielsen)/01:_Using_neural_nets_to_re
cognize_handwritten_digits/1.03:_Sigmoid_neurons.
[18] Wuttke, L. (2022, April 27). Machine Learning vs. Deep Learning: Wo ist der Unterschied?
Datasolut GmbH. Retrieved January 21, 2023, from
https://datasolut.com/machine-learning-vs-deep-learning/.
[19] OpenEarth, & Jäger, W. S. (2017). GitHub - Openearth/coastal-dss: Decision support
framework for coastal risk management: A Bayesian Network approach. GitHub.
Retrieved January 21, 2023, from https://github.com/openearth/coastal-dss.
[20] Chaves, M. (2022, January 31). GLCMs a great tool for your ML arsenal. Towards Data
Science. Retrieved January 22, 2023, from
https://towardsdatascience.com/glcms-a-great-tool-for-your-ml-arsenal-7a59f1e45b
65.
[21] The MathWorks, Inc. (n.d.). Texture analysis using the gray-level co-occurrence matrix
(GLCM) - MATLAB & simulink - mathworks deutschland. MathWorks® Help Center.
Retrieved January 22, 2023, from
https://de.mathworks.com/help/images/texture-analysis-using-the-gray-level-co-occ
urrence-matrix-glcm.html.
[22] Keuschnig, G., & Radlherr, F. (2015, August 15). Kaiserhof - Norderney / Ostfriesland - Blick
Richtung Nordwesten - Foto-Webcam.eu. Foto-Webcam.Eu. Retrieved January 13,
2023, from https://www.foto-webcam.eu/webcam/norderney/2015/08/15/1450.
[23] Datawell BV. (2022, September 4). Directional Waverider MkIII. Datawell. Retrieved
January 25, 2023, from https://datawell.nl/products/directional-waverider-mkiii/.
[24] Hartmann, E. (n.d.). Norderney Pegelmesser. Deutsche Leuchtfeuer. Retrieved January
25, 2023, from
https://www.deutsche-leuchtfeuer.de/nordsee/norderney-pegelmesser.html.
[25] Chaves, M. (2022, January 31). GLCMs a great tool for your ML arsenal. Towards Data
Science. Retrieved January 26, 2023, from
https://towardsdatascience.com/glcms-a-great-tool-for-your-ml-arsenal-7a59f1e45b
65.
[26] The MathWorks, Inc. (n.d.). Properties of gray-level co-occurrence matrix - MATLAB
graycoprops - MathWorks Deutschland. MathWorks® Help Center. Retrieved January
26, 2023, from https://de.mathworks.com/help/images/ref/graycoprops.html.
[27] Jackson Parton Solicitors. (n.d.). The Douglas Sea State Scale. Jackson Parton. Retrieved
January 26, 2023, from https://jacksonparton.com/the-douglas-sea-state-scale.
[28] C3.ai, Inc. (2020, November 10). Root mean square error (RMSE). C3 AI. Retrieved January
26, 2023, from https://c3.ai/glossary/data-science/root-mean-square-error-rmse/.
[29] Hargrave, M. (2022, December 19). Standard deviation formula and uses vs. variance.
Investopedia. Retrieved January 26, 2023, from
https://www.investopedia.com/terms/s/standarddeviation.asp.
[30] The MathWorks, Inc. (n.d.). Find edges in 2-D grayscale image - MATLAB edge - MathWorks
Deutschland. MathWorks® Help Center. Retrieved January 30, 2023, from
https://de.mathworks.com/help/images/ref/edge.html#d124e98982.
[31] Mun, G. C. (2016, January 10). Average image of a folder of images. MATLAB Central.
Retrieved January 26, 2023 from
https://de.mathworks.com/matlabcentral/answers/263278-average-image-of-a-folde
r-of-images.
[32] Donges, N. (2019, June 16). What is transfer learning? Exploring the popular deep learning
approach. Built In. Retrieved January 20, 2023, from
https://builtin.com/data-science/transfer-learning.
[33] Nandepu, R. (2019, November 16). Understanding and implementation of Residual
Networks (ResNets). Analytics Vidhya (Medium). Retrieved January 20, 2023, from
https://medium.com/analytics-vidhya/understanding-and-implementation-of-residu
al-networks-resnets-b80f9a507b9c.
[34] Gupta, D. (2020, January 29). Activation functions. Analytics Vidhya. Retrieved January
20, 2023, from
https://www.analyticsvidhya.com/blog/2020/01/fundamentals-deep-learning-activa
tion-functions-when-to-use-them/.
[35] Shah, D. (2023, January 3). The essential guide to data augmentation in deep learning.
V7labs. Retrieved January 20, 2023, from
https://www.v7labs.com/blog/data-augmentation-guide.
[36] StatSo Europe. (n.d.). Overfitting (Überanpassung). StatSo. Retrieved January 20,
2023, from https://www.statso.de/glossary/O/Overfitting.htm.
[37] The MathWorks, Inc. (n.d.). Deep Network Designer App - MATLAB & Simulink - MathWorks
Deutschland. MathWorks® Help Center. Retrieved January 20, 2023, from
https://de.mathworks.com/help/deeplearning/deep-network-designer-app.html.
[38] The MathWorks, Inc. (n.d.). ResNet-18 convolutional neural network - MATLAB resnet18 -
MathWorks Deutschland. MathWorks® Help Center. Retrieved January 20, 2023, from
https://de.mathworks.com/help/deeplearning/ref/resnet18.html
[39] DataScientest. (2021, December 22). Convolutional Neural Network (CNN). Weiterbildung
Data Science | datascientest.com. Retrieved January 20, 2023, from
https://datascientest.com/de/convolutional-neural-network-2
[40] Nandepu, R. (2019, November 16). Understanding and implementation of Residual
Networks (ResNets). Analytics Vidhya (Medium). Retrieved January 20, 2023, from
https://medium.com/analytics-vidhya/understanding-and-implementation-of-residu
al-networks-resnets-b80f9a507b9c
[41] The MathWorks, Inc. (n.d.). List of Deep Learning Layers - MATLAB & Simulink - MathWorks
Deutschland. MathWorks® Help Center. Retrieved January 20, 2023, from
https://de.mathworks.com/help/deeplearning/ug/list-of-deep-learning-layers.html
[42] Metabase. (n.d.). Data normalization. Metabase | Business Intelligence, Dashboards, and
Data Visualization. Retrieved January 20, 2023, from
https://www.metabase.com/learn/databases/normalization
[43] The MathWorks, Inc. (n.d.). 2-D Convolutional Layer - MATLAB - MathWorks Deutschland.
MathWorks® Help Center. Retrieved January 20, 2023, from
https://de.mathworks.com/help/deeplearning/ref/nnet.cnn.layer.convolution2dlayer
.html
[44] The MathWorks, Inc. (n.d.). Batch Normalization Layer - MATLAB - MathWorks
Deutschland. MathWorks® Help Center. Retrieved January 20, 2023, from
https://de.mathworks.com/help/deeplearning/ref/nnet.cnn.layer.batchnormalizatio
nlayer.html
[45] The MathWorks, Inc. (n.d.). Rectified Linear Unit (ReLU) layer - MATLAB - MathWorks
Deutschland. MathWorks® Help Center. Retrieved January 20, 2023, from
https://de.mathworks.com/help/deeplearning/ref/nnet.cnn.layer.relulayer.html
[46] The MathWorks, Inc. (n.d.). Max pooling layer - MATLAB - MathWorks Deutschland.
MathWorks® Help Center. Retrieved January 20, 2023, from
https://de.mathworks.com/help/deeplearning/ref/nnet.cnn.layer.maxpooling2dlayer
.html
[47] Brownlee, J. (2019, April 21). A gentle introduction to pooling layers for convolutional
neural networks. MachineLearningMastery.Com. Retrieved January 20, 2023, from
https://machinelearningmastery.com/pooling-layers-for-convolutional-neural-netwo
rks/.
[48] Cook, A. (2017, April 9). Global average pooling layers for object localization. Retrieved
January 20, 2023, from
https://alexisbcook.github.io/2017/global-average-pooling-layers-for-object-localizat
ion/.
[49] The MathWorks, Inc. (n.d.). Fully connected layer - MATLAB - MathWorks Deutschland.
MathWorks® Help Center. Retrieved January 21, 2023, from
https://de.mathworks.com/help/deeplearning/ref/nnet.cnn.layer.fullyconnectedlaye
r.html.
[50] The MathWorks, Inc. (n.d.). Somax Layer - MATLAB - MathWorks Deutschland.
MathWorks® Help Center. Retrieved January 21, 2023, from
https://de.mathworks.com/help/deeplearning/ref/nnet.cnn.layer.somaxlayer.html
[51] The MathWorks, Inc. (n.d.). Transfer Learning Using Pretrained Network - MATLAB &
Simulink - MathWorks Deutschland. MathWorks® Help Center. Retrieved January 27,
2023, from
https://de.mathworks.com/help/deeplearning/ug/transfer-learning-using-pretrained
-network.html.
[52] The MathWorks, Inc. (n.d.). GoogLeNet Convolutional Neural Network - MATLAB googlenet
- MathWorks Deutschland. MathWorks® Help Center. Retrieved January 27, 2023, from
https://de.mathworks.com/help/deeplearning/ref/googlenet.html.
[53] The MathWorks, Inc. (n.d.). Pretrained Deep Neural Networks - MATLAB & Simulink -
MathWorks Deutschland. MathWorks® Help Center. Retrieved January 27, 2023, from
https://de.mathworks.com/help/deeplearning/ug/pretrained-convolutional-neural-n
etworks.html.
[54] The MathWorks, Inc. (n.d.). Train Convolutional Neural Network for Regression - MATLAB &
Simulink - MathWorks Deutschland. MathWorks® Help Center. Retrieved January 27,
2023, from
https://de.mathworks.com/help/deeplearning/ug/train-a-convolutional-neural-netw
ork-for-regression.html.
[55] The MathWorks, Inc. (n.d.). List of Deep Learning Layers - MATLAB & Simulink - MathWorks
Deutschland. MathWorks® Help Center. Retrieved January 27, 2023, from
https://de.mathworks.com/help/deeplearning/ug/list-of-deep-learning-layers.html.
[56] The MathWorks, Inc. (n.d.). Train Network on Image and Feature Data - MATLAB & Simulink
- MathWorks Deutschland. MathWorks® Help Center. Retrieved January 27, 2023, from
https://de.mathworks.com/help/deeplearning/ug/train-network-on-image-and-featu
re-data.html.
[57] Twin, A. (2022, November 15). Understanding overfitting and how to prevent it.
Investopedia. Retrieved January 27, 2023, from
https://www.investopedia.com/terms/o/overfitting.asp.
[58] Kumar, A. (2018, February 12). Machine learning: Validation techniques. DZone. Retrieved
January 27, 2023, from
https://dzone.com/articles/machine-learning-validation-techniques.
[59] Nagpal, A. (2017, October 18). Decision tree ensembles- bagging and boosting. Towards
Data Science. Retrieved January 27, 2023, from
https://towardsdatascience.com/decision-tree-ensembles-bagging-and-boosting-26
6a8ba60fd9.
[60] Jaadi, Z. (2021, April 1). A step-by-step explanation of principal component analysis (PCA).
Built In. Retrieved January 27, 2023, from
https://builtin.com/data-science/step-step-explanation-principal-component-analysi
s.
[61] Mutual Information. (2021). Gaussian Processes [Video]. On YouTube. Retrieved January
10, 2023, from https://www.youtube.com/watch?v=UBDgSHPxVME.
[62] Deisenroth, M. (2017). ML Tutorial: Gaussian processes (Richard Turner) [Video]. On
YouTube. Retrieved January 10, 2023, from
https://www.youtube.com/watch?v=92-98SYOdlY.
[63] The MathWorks, Inc. (n.d.). Kernel (covariance) function options - MATLAB & simulink -
MathWorks Deutschland. MathWorks® Help Center. Retrieved January 28, 2023, from
https://de.mathworks.com/help/stats/kernel-covariance-function-options.html.
[64] The MathWorks, Inc. (n.d.). Contrast enhancement techniques - MATLAB & simulink
example - mathworks deutschland. MathWorks® Help Center. Retrieved January 28,
2023, from
https://de.mathworks.com/help/images/contrast-enhancement-techniques.html.
[65] The MathWorks, Inc. (n.d.). Low-Light image enhancement - MATLAB & simulink example -
mathworks deutschland. MathWorks® Help Center. Retrieved January 28, 2023, from
https://de.mathworks.com/help/images/low-light-image-enhancement.html.
[66] The MathWorks, Inc. (n.d.). Image processing using deep learning video. MathWorks®
Videos Und Webinare. Retrieved January 28, 2023, from
https://de.mathworks.com/videos/image-processing-using-deep-learning-15825419
77744.html.
[67] Papers with Code. (n.d.). Papers with code - GoogLeNet explained. Papers With Code.
Retrieved January 28, 2023, from https://paperswithcode.com/method/googlenet.
[68] Papers with Code. (n.d.). Papers with code - Inception module explained. Papers With
Code. Retrieved January 28, 2023, from
https://paperswithcode.com/method/inception-module.
[69] Anwar, A. (2022, January 22). Difference between AlexNet, VGGNet, ResNet, and
Inception. Towards Data Science.
https://towardsdatascience.com/the-w3h-of-alexnet-vggnet-resnet-and-inception-7b
aaaecccc96.
[70] Wuttke, L. (2022, April 27). Machine Learning vs. Deep Learning: Wo ist der Unterschied?
Datasolut GmbH. https://datasolut.com/machine-learning-vs-deep-learning/.
[71] The MathWorks, Inc. (n.d.). Texture Analysis Using the Gray-Level Co-Occurrence Matrix
(GLCM) - MATLAB & Simulink - MathWorks Deutschland. MathWorks® Help Center.
Retrieved January 29, 2023, from
https://de.mathworks.com/help/images/texture-analysis-using-the-gray-level-co-occ
urrence-matrix-glcm.html.
[72] The MathWorks, Inc. (n.d.). Texture Analysis - MATLAB & Simulink - MathWorks
Deutschland. MathWorks® Help Center. Retrieved January 29, 2023, from
https://de.mathworks.com/help/images/texture-analysis-1.html?s_tid=CRUX_lnav.
[73] Fernando, J. (2022, December 19). R-Squared formula, regression, and interpretations.
Investopedia. Retrieved January 29, 2023, from
https://www.investopedia.com/terms/r/r-squared.asp.
Pictorial Sources
Fig. 1: Webcam image from Norderney, taken on 8th August 2015 at 10:10 UTC+2.
Keuschnig, G., & Radlherr, F. (2015, August 8). Kaiserhof - Norderney / Ostfriesland - Blick
Richtung Nordwesten - Foto-Webcam.eu. Foto-Webcam.Eu. Retrieved January 29,
2023, from https://www.foto-webcam.eu/webcam/norderney/2015/08/08/1010.
Fig. 2: Beach on Wangerooge after hurricane Zeynep.
Hannoversche Allgemeine Zeitung. (2022, February 19). Nordsee-Inseln: Sturm “Zeynep”
spült Strände fast komplett weg. Hannoversche Allgemeine Zeitung. Retrieved
January 29, 2023, from
https://www.haz.de/der-norden/nordsee-inseln-sturm-zeynep-spuelt-straende-fast-k
omplett-weg-7NYUVODMNRC4NPXRDA6JPAQYKI.html.
Fig. 3: East Frisian Islands offshore of the German North Sea coast.
Dörrbecker, M. (2018). Ostfriesische Inseln (Karte). Retrieved January 17, 2023, from
https://commons.wikimedia.org/wiki/File:Ostfriesische_Inseln_(Karte).png.
Fig. 4: Groyne field before the nourishment, 18 June 2019 at 9:00 UTC+2.
Keuschnig, G., & Radlherr, F. (2019, June 18). Kaiserhof - Norderney / Ostfriesland - Blick
Richtung Nordwesten - Foto-Webcam.eu. Foto-Webcam.Eu. Retrieved January 17,
2023, from https://www.foto-webcam.eu/webcam/norderney/2019/06/18/0900.
Fig. 5: Groyne field during the nourishment, 23 July 2019 at 9:00 UTC+2.
Keuschnig, G., & Radlherr, F. (2019, July 23). Kaiserhof - Norderney / Ostfriesland - Blick
Richtung Nordwesten - Foto-Webcam.eu. Foto-Webcam.Eu. Retrieved January 17,
2023, from https://www.foto-webcam.eu/webcam/norderney/2019/07/23/0900.
Fig. 6: Groyne field after the nourishment, 27 July 2019 at 9:00 UTC+2.
Keuschnig, G., & Radlherr, F. (2019, July 27). Kaiserhof - Norderney / Ostfriesland - Blick
Richtung Nordwesten - Foto-Webcam.eu. Foto-Webcam.Eu. Retrieved January 17,
2023, from https://www.foto-webcam.eu/webcam/norderney/2019/07/27/0900.
Fig. 7: Groyne field with placements of combined (1992) and conventional (1989) nourishments on Norderney. Niemeyer et al., 1997.
Fig. 8: Webcam position on Norderney.
Keuschnig, G., & Radlherr, F. (n.d.). Kaiserhof - Norderney / Ostfriesland - Blick Richtung
Nordwesten - Foto-Webcam.eu. Umgebungskarte Norderney. Foto-Webcam.Eu.
Retrieved January 10, 2023, from
https://www.foto-webcam.eu/webcam/norderney/2015/08/15/1450.
Fig. 9: View of the groyne field from the webcam, 18 July 2015 at 9:00 UTC+2.
Keuschnig, G., & Radlherr, F. (2015, July 18). Kaiserhof - Norderney / Ostfriesland - Blick
Richtung Nordwesten - Foto-Webcam.eu. Foto-Webcam.Eu. Retrieved January 17,
2023, from https://www.foto-webcam.eu/webcam/norderney/2015/07/18/0900.
Fig. 10: Webcam image with the analysis area of the sky framed in red (Reuter, 2022).
Fig. 11: Transformation stages of the cropped image of the groyne (Reuter, 2022).
Fig. 12: Images from the 4th May 2016, from the 4th June 2016, and their superimposed image (Reuter, 2022).
Fig. 13: Collage of water lines from the 8th to the 14th January 2015 (Reuter, 2022).
Fig. 14: Tide curves of measured and calculated water levels (Reuter, 2022).
Fig. 15: Scatter plots for water levels and wave heights (Reuter, 2022).
Fig. 16: Original and oversegmented image. Hoonhout et al. (2015).
Fig. 17: Original image and transformed image with ROI. Hies et al. (2012).
Fig. 18: Images from Farson river camera. Vandaele et al. (2021).
Fig. 19: Images taken by an ASV at different sea state levels. Wang et al. (2013).
Fig. 20: Camera angle of vision with positions of the camera, the wave buoy (Boje), and the water gauge (Pegel). Reuter, 2022.
Fig. 21: Image taken by the Norderney webcam on 20th August 2015 at 08:50 UTC+2.
Keuschnig, G., & Radlherr, F. (2015, August 20). Kaiserhof - Norderney / Ostfriesland - Blick
Richtung Nordwesten - Foto-Webcam.eu. Foto-Webcam.Eu. Retrieved January 13,
2023, from https://www.foto-webcam.eu/webcam/norderney/2015/08/20/0850.
Fig. 22: Webcam snapshot from the 18th February 2012 at 12:30 UTC+1.
Keuschnig, G., & Radlherr, F. (2012, February 18). Kaiserhof - Norderney / Ostfriesland - Blick
Richtung Nordwesten - Foto-Webcam.eu. Foto-Webcam.Eu. Retrieved January 13,
2023, from https://www.foto-webcam.eu/webcam/norderney/2012/02/18/1230.
Fig. 23: Webcam snapshot from the 15th September 2017 at 12:10 UTC+2.
Keuschnig, G., & Radlherr, F. (2017, September 15). Kaiserhof - Norderney / Ostfriesland - Blick
Richtung Nordwesten - Foto-Webcam.eu. Foto-Webcam.Eu. Retrieved January 13,
2023, from https://www.foto-webcam.eu/webcam/norderney/2017/09/15/1210.
Fig. 24: Tide gauge Norderney-Riffgat.
Hartmann, E. (n.d.). Norderney Pegelmesser. Deutsche Leuchtfeuer. Retrieved January 25,
2023, from
https://www.deutsche-leuchtfeuer.de/nordsee/norderney-pegelmesser.html.
Fig. 29: Webcam snapshot with the groyne framed in red.
Keuschnig, G., & Radlherr, F. (2015, August 2). Kaiserhof - Norderney / Ostfriesland - Blick
Richtung Nordwesten - Foto-Webcam.eu. Foto-Webcam.Eu. Retrieved January 26,
2023, from https://www.foto-webcam.eu/webcam/norderney/2015/08/02/0630.
Fig. 30: Webcam image with ROIs for brightness detection.
Keuschnig, G., & Radlherr, F. (2015, January 1). Kaiserhof - Norderney / Ostfriesland - Blick
Richtung Nordwesten - Foto-Webcam.eu. Foto-Webcam.Eu. Retrieved January 26,
2023, from https://www.foto-webcam.eu/webcam/norderney/2015/01/01/1120.
Fig. 31: ROIs for fog and snow detection.
Keuschnig, G., & Radlherr, F. (2015, January 26). Kaiserhof - Norderney / Ostfriesland - Blick
Richtung Nordwesten - Foto-Webcam.eu. Foto-Webcam.Eu. Retrieved January 26,
2023, from https://www.foto-webcam.eu/webcam/norderney/2015/01/26/1100.
Keuschnig, G., & Radlherr, F. (2015, January 24). Kaiserhof - Norderney / Ostfriesland - Blick
Richtung Nordwesten - Foto-Webcam.eu. Foto-Webcam.Eu. Retrieved January 26,
2023, from https://www.foto-webcam.eu/webcam/norderney/2015/01/24/1120.
Fig. 33: Webcam image from 1st January 2015 at 13:30 UTC+1 and image section cropped to the groyne.
Keuschnig, G., & Radlherr, F. (2015, January 1). Kaiserhof - Norderney / Ostfriesland - Blick
Richtung Nordwesten - Foto-Webcam.eu. Foto-Webcam.Eu. Retrieved January 26,
2023, from https://www.foto-webcam.eu/webcam/norderney/2015/01/01/1330.
Fig. 42: Relative prediction speeds and accuracies of different pretrained networks.
The MathWorks, Inc. (n.d.). Pretrained Deep Neural Networks - MATLAB & Simulink - MathWorks
Deutschland. MathWorks® Help Center. Retrieved January 27, 2023, from
https://de.mathworks.com/help/deeplearning/ug/pretrained-convolutional-neural-n
etworks.html.
Fig. 45: Network architecture for a model with two inputs.
The MathWorks, Inc. (n.d.). Train Network on Image and Feature Data - MATLAB & Simulink -
MathWorks Deutschland. MathWorks® Help Center. Retrieved January 27, 2023, from
https://de.mathworks.com/help/deeplearning/ug/train-network-on-image-and-featu
re-data.html.
Fig. 98: Daylight image falsely sorted into the twilight category, taken on 4th February
2015, 10:30 UTC+1.
Keuschnig, G., & Radlherr, F. (2015, February 4). Kaiserhof - Norderney / Ostfriesland - Blick
Richtung Nordwesten - Foto-Webcam.eu. Foto-Webcam.Eu. Retrieved January 29,
2023, from https://www.foto-webcam.eu/webcam/norderney/2015/02/04/1030.
Fig. 99: Daylight image with cloudy skies, falsely sorted into the twilight category, taken on
15th June 2015, 10:00 UTC+2.
Keuschnig, G., & Radlherr, F. (2015, June 15). Kaiserhof - Norderney / Ostfriesland - Blick
Richtung Nordwesten - Foto-Webcam.eu. Foto-Webcam.Eu. Retrieved January 29,
2023, from https://www.foto-webcam.eu/webcam/norderney/2015/06/15/1000.
Fig. 100: Night image wrongly classified as a daylight image and cropped, taken on 27th
January 2015 at 18:00 UTC+1.
Keuschnig, G., & Radlherr, F. (2015, January 27). Kaiserhof - Norderney / Ostfriesland - Blick
Richtung Nordwesten - Foto-Webcam.eu. Foto-Webcam.Eu. Retrieved January 29,
2023, from https://www.foto-webcam.eu/webcam/norderney/2015/01/27/1800.
Fig. 106: Scatter plot for water levels from 2015 and 2016 (Reuter, 2022).
Appendices
Nr.    Name
I      Preprocessing with Interpolation
II     Regression with Interpolation
III    PC Specifications and MATLAB® Toolboxes
IV     Edits and Additional Notes
Models and scripts were compressed into the file MA_JuliaSteiwer_autoAnalysis.zip
and uploaded to the directory WebcamNorderney/Matlab of the LUH-Cloud Seafile folder
belonging to Dr.-Ing. Jan Visscher.
The zip file contains the following files:
- trained regression models and corresponding scripts
- preprocessing routine, with and without interpolation
- regression routine, with and without interpolation
- prediction and training data
- ResNet18 for classification with difference images
- deep learning regression models
- modified ResNet18
- double-input network
Pre-Processing for Regression Task
The following script pre-processes the image data for a regression task.
By analysing the mean gray-value of the sky, the ocean, and the ground, the images are sorted into
three categories of light conditions, namely "daylight", "twilight", and "night". The images that fall
into the "daylight" category are then checked for the range of the gray-values of the sky and the
mean of the gray-values of the ground to detect whether fog or snow might be present. The images
in which neither was detected are then cropped to the ROI of the groyne.
A linking table with the image names, timestamps, water levels, wave heights, light conditions, and
beach visibility is then created.
The wave data that is being used here contains interpolated values; if you'd prefer using only
measured data, use the script preProcessing_withoutInterp.mlx instead.
Basics
This applies if the working directory is a folder named "Matlab". The parent folder of Matlab needs to
include the folder with the data (e.g. water levels and wave heights) as well as the folder with the
images. Change the names of the image and data folders depending on the names you have used.
clc
clear
close all
wd = pwd;
imgfolder = [wd(1:end-6) 'Bilder2015\']; % folder with image data
datafolder = [wd(1:end-6) 'Daten\']; % folder with data
piclist = dir([imgfolder '*hu.jpg']);
Timestamp
Get the timestamp "zst" from the image names.
zst = NaT(length(piclist), 1);
for i = 1:length(piclist)
zst(i,:) = datetime(piclist(i).name(11:21),'InputFormat','yyMMdd-HHmm');
end
Water Level
Get the water levels for each timestamp.
load('allewstand.mat')
wstand = zeros(size(piclist));
for i=1:length(piclist)
wstand(i,1) = wst(dattim == zst(i)); % might cause issues if there are large data gaps
end
clear wst dattim
Wave Height
Get the wave height Hs and save it in the variable "welleHs". If the steepness Ss is needed instead,
use the same code but replace welleHs with welleSs.
load('allewelle.mat')
[Lia,Locb] = ismember(zst, dattimwelle); % might have gaps if there are data gaps
welleHs = NaN(size(piclist));
welleHs(Lia) = Hs(Locb(Lia));
welleHs = interp1(zst(Lia), welleHs(Lia), zst); % gaps are filled here using interpolation
Visualize Data
Visualize the wave height and water level data to detect potential gaps or phase shifts, which might
produce problems in the analysis.
Wave Height
Plot the measured wave height data welleHs(Lia) against the interpolated wave data welleHs to
determine if data gaps exist.
plot(zst(Lia), welleHs(Lia), '.', "MarkerSize", 20);
hold on
plot(zst, welleHs);
title("Measured vs Interpolated Wave Height (2015)");
xlabel("Measurement Time Point");
ylabel("Wave Height in cm");
legend("Measured Wave Height", "Linearly Interpolated Wave height");
hold off
Water Level
Plot the water level wstand, which was calculated from wst based on the image timestamp zst,
against the timestamp to determine if data gaps exist.
plot(zst, wstand, '.', "MarkerSize", 5);
title("Measured Water Levels (2015)");
xlabel("Measurement Time Point");
ylabel("Water Level in cm");
Remove Data Gaps
Visualize the data using the plots from the previous section. Note down the timestamps for which
there are larger data gaps.
The wave height data from 2015 contains three major data gaps:
5th May 2015, 9:30 UTC+2 until 4th June 2015, 12:00 UTC+2,
5th October 2015, 13:40 UTC+2 until 14th October 2015, 13:30 UTC+2, and
24th November 2015, 21:00 UTC+1 until 8th December 2015, 7:30 UTC+1.
Find the row numbers of these entries in the timestamp vector zst, then use these numbers to
remove the images taken during the data gaps from the image list piclist. For the aforementioned
gaps, this would be:
idx1 = find(zst == datetime('05-May-2015 09:30:00'));
idx2 = find(zst == datetime('04-Jun-2015 12:00:00'));
idx3 = find(zst == datetime('05-Oct-2015 13:40:00'));
idx4 = find(zst == datetime('14-Oct-2015 13:30:00'));
idx5 = find(zst == datetime('24-Nov-2015 21:00:00'));
idx6 = find(zst == datetime('08-Dec-2015 07:30:00'));
These indices can now be used to remove the images taken during the data gaps from the image list
piclist.
piclist = piclist([1:idx1, idx2:idx3, idx4:idx5, idx6:end]);
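The same removal can also be written with logical masks over the timestamps, which avoids the hard-coded index arithmetic; a minimal sketch under the assumption that the boundary images themselves are kept, as in the indexing above:
% Sketch: logical-mask alternative (keeps the gap boundary images, like the indexing above)
inGap = (zst > datetime('05-May-2015 09:30:00') & zst < datetime('04-Jun-2015 12:00:00')) | ...
(zst > datetime('05-Oct-2015 13:40:00') & zst < datetime('14-Oct-2015 13:30:00')) | ...
(zst > datetime('24-Nov-2015 21:00:00') & zst < datetime('08-Dec-2015 07:30:00'));
piclist = piclist(~inGap);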
Update Data
Timestamps
zst = NaT(length(piclist), 1);
for i = 1:length(piclist)
zst(i,:) = datetime(piclist(i).name(11:21),'InputFormat','yyMMdd-HHmm');
end
Water Level
wstand = zeros(size(piclist));
for i=1:length(piclist)
wstand(i,1) = wst(dattim == zst(i)); % might cause issues if there are large data gaps
end
clear wst dattim
Wave Height
[Lia,Locb] = ismember(zst, dattimwelle);
welleHs = NaN(size(piclist));
welleHs(Lia) = Hs(Locb(Lia));
welleHs = interp1(zst(Lia), welleHs(Lia), zst);
Create Sub-Folders
Append the path of the main image folder with paths to the new sub-folders.
Night = strcat(fileparts(imgfolder),'\Night');
Twilight = strcat(fileparts(imgfolder),'\Twilight');
badWeather = strcat(fileparts(imgfolder),'\Bad_Weather');
Cropped = strcat(fileparts(imgfolder),'\Cropped\'); % folder that will include the cropped daylight images taken during good weather
Remove the directories if they already exist. Comment out these commands when running the code
for the first time, as the directories will not exist yet and rmdir would otherwise throw an error.
rmdir (Night,'s')
rmdir (Twilight,'s')
rmdir (badWeather,'s')
rmdir (Cropped,'s')
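Alternatively, the removal can be guarded so the script also runs cleanly on a first run without any editing; a minimal sketch:
% Sketch: remove each output directory only if it already exists
dirs = {Night, Twilight, badWeather, Cropped};
for d = 1:numel(dirs)
if exist(dirs{d}, 'dir')
rmdir(dirs{d}, 's');
end
end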
Make the new directories.
mkdir (Night)
mkdir (Twilight)
mkdir (badWeather)
mkdir (Cropped)
Define ROI
Define the ROI (region of interest, here the groyne) in the image.
The reference image used here was taken at very low tide (water level below 350 cm) and calm sea
(wave height below 10 cm), when the groyne was completely visible; this is necessary for choosing
an ROI that serves as a reference point for the entire dataset.
ind = find(wstand < 350 & welleHs < 10);
list = piclist(ind);
fldr = list(1).folder;
nm = list(1).name;
filename = strcat(fldr, '\', nm);
roi_img = imread(filename);
Display the image, then draw the ROI within the image and save its position in the new variable
"pos".
imshow(roi_img)
r1 = drawrectangle('Label','Groyne','Color',[1 0 0]);
pos = r1.Position;
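Since drawing the rectangle is interactive, the resulting position can optionally be stored once and reloaded in later runs so that the identical crop window is reused; a sketch (the file name roiPosition.mat is an arbitrary choice, not part of the original routine):
% Optional: persist the ROI so later runs reuse the identical crop window
save('roiPosition.mat', 'pos'); % after drawing the rectangle once
% load('roiPosition.mat') % in later runs, instead of drawing again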
Sort, Crop, and Save Daylight Images
Crop the images to the ROI defined above, reduce them to half their size (optional) and save them in
a new sub-folder named "Cropped".
Light = strings([length(piclist), 1]);
dispImg = imread([imgfolder piclist(1).name]);
imshow(dispImg)
for i=1:length(piclist) % starts from 1
IMG2 = imread([imgfolder piclist(i).name]);
GrayIMG = rgb2gray(IMG2);
%% Make Masks for Detection
% (positions have been determined previously and might need to be changed depending on the field of view)
% Rectangle within the Ocean
h = drawrectangle('Position',[4120, 1125, 150, 150],'StripeColor','r');
msk = createMask(h);
% Rectangle within the Ground Area
g = drawrectangle('Position', [3845, 2275, 425, 275], 'StripeColor', 'r');
msk2 = createMask(g);
%% Get the Gray-Value Means for each ROI
meanSky = mean(GrayIMG(200:600)); % image rows below the logo + labels, slightly above the horizon (linear indexing: samples rows 200-600 of the first image column)
meanOcean = mean(GrayIMG(msk)); % ROI within the ocean, slightly to the top-right of the groyne
meanGround = mean(GrayIMG(msk2)); % ROI of the ground, at the bottom-right of the image
% Snow Detection
vertices = [2 1436; 2 2540; 4270 2564; 4270 2132];
p = drawpolygon('Position', vertices, 'Label', 'Ground');
msk3 = createMask(p);
meanGround2 = mean(GrayIMG(msk3));
% Fog Detection
vertices2 = [2 83; 2 1330; 4270 1910; 4270 245];
p2 = drawpolygon('Position', vertices2, 'Label', 'Sky + Groyne');
msk4 = createMask(p2);
rangeSky = range(GrayIMG(msk4));
%% Sort Images into Folders and write names of Light Conditions into variable "Light"
% Limits were determined empirically and might need to be changed
% depending on the observation site.
if meanSky < 100 || meanOcean < 50 || meanGround < 15
copyfile ([imgfolder piclist(i).name], Night)
Light(i,:) = ("Night");
elseif meanSky < 150 || meanOcean < 65 || meanGround < 30
copyfile ([imgfolder piclist(i).name], Twilight)
Light(i,:) = ("Twilight");
else
if meanGround >= 125 || rangeSky < 200
copyfile ([imgfolder piclist(i).name], badWeather)
else
imgCrop = imcrop(IMG2, pos); % crop image using predefined ROI
imgCrop = imresize(imgCrop, 0.5); % decrease image size (optional)
filename = strcat(Cropped, piclist(i).name);
imwrite(imgCrop, filename);
end
Light(i,:) = ("Day");
end
end
Linking Table
Create the linking table from image names, timestamps, water levels, wave heights, and light
conditions.
imgtable = struct2cell(piclist)';
linkingTable = [strcat(imgtable(:,2), '\', imgtable(:,1)), string(zst), wstand, welleHs, Light];
Beach Visibility
Check whether the beach is visible in the images and save the results in the linking table.
Beach = strings([length(piclist),1]);
Beach(:) = "no beach visible";
for beachCounter = 1:length(linkingTable)
if double(linkingTable(beachCounter,3)) < 500 % beach is considered visible for a water level below 500 cm
Beach(beachCounter,:)= ("beach visible");
end
end
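The loop above fills the variable Beach, but the linking table created earlier does not yet include it. A minimal sketch for appending it as an additional column (the resulting six-column layout is an assumption, not taken from the original script):
% Append the beach-visibility column to the linking table (assumed layout)
linkingTable = [linkingTable, Beach];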
Regression of Wave Heights and Water Levels
The following script predicts the wave heights and water levels of the images from their feature
data.
The wave data that is being used here contains interpolated values; if you'd prefer using only
measured data, use the script regression_withoutInterp.mlx instead.
Basics
This applies if the working directory is a folder named "Matlab". The parent folder of Matlab needs to
include the folder with the data (e.g. water levels and wave heights) as well as the folder with the
images. Change the names of the image and data folders depending on the names you have used.
clc
clear
close all
wd = pwd;
imgfolder = [wd(1:end-6) 'Bilder2015\Cropped\'];
datafolder = [wd(1:end-6) 'Daten\'];
piclist = dir([imgfolder '*hu.jpg']);
sz = numel(piclist);
Regression Data for Wave Height
Timestamps
Get the timestamps of all images to assign the measured wave heights accordingly.
zst = NaT(sz, 1);
for i = 1:sz
% Timestamp
zst(i,:) = datetime(piclist(i).name(11:21),'InputFormat','yyMMdd-HHmm');
end
Water Level
Get the water levels for each timestamp.
load('allewstand.mat')
wstand = zeros(size(piclist));
for i=1:length(piclist)
wstand(i,1) = wst(dattim == zst(i)); % might cause issues if there are large data gaps
end
clear wst dattim
Wave Height
Get the wave height Hs and save it in the variable "welleHs". If the steepness Ss is needed instead,
use the same code but replace welleHs with welleSs.
load('allewelle.mat')
[Lia,Locb] = ismember(zst, dattimwelle); % might have gaps if there are data gaps
welleHs = NaN(size(piclist));
welleHs(Lia) = Hs(Locb(Lia));
welleHs = interp1(zst(Lia), welleHs(Lia), zst); % gaps are filled here using interpolation
Visualize Data
Visualize the wave height and water level data to detect potential gaps or phase shifts, which might
produce problems in the analysis.
Wave Height
Plot the measured wave height data welleHs(Lia) against the interpolated wave data welleHs to
determine if data gaps exist.
plot(zst(Lia), welleHs(Lia), '.', "MarkerSize", 20);
hold on
plot(zst, welleHs);
title("Measured vs Interpolated Wave Height (2015)");
xlabel("Measurement Time Point");
ylabel("Wave Height in cm");
legend("Measured Wave Height", "Linearly Interpolated Wave height");
hold off
Water Level
Plot the water level wstand, which was calculated from wst based on the image timestamp zst,
against the timestamp to determine if data gaps exist.
plot(zst, wstand, '.', "MarkerSize", 5);
title("Measured Water Levels (2015)");
xlabel("Measurement Time Point");
ylabel("Water Level in cm");
Find Data Gaps
Using the section "Remove Data Gaps" from preProcessing_withInterp.mlx, the row numbers of
the start and end points of data gaps can be used to index into piclist and remove the images
taken during larger gaps.
First, create indexes based on the row numbers of the gap timestamps.
idx1 = find(zst == datetime('05-May-2015 09:30:00')); % 1st gap start
idx2 = find(zst == datetime('04-Jun-2015 12:00:00')); % 1st gap end
idx3 = find(zst == datetime('05-Oct-2015 13:50:00')); % 2nd gap start
idx4 = find(zst == datetime('14-Oct-2015 13:20:00')); % 2nd gap end
idx5 = find(zst == datetime('24-Nov-2015 16:50:00')); % 3rd gap start
idx6 = find(zst == datetime('08-Dec-2015 08:00:00')); % 3rd gap end
Use the indices to remove the gap images from piclist and save the result back to piclist.
piclist = piclist([1:idx1, idx2:idx3, idx4:idx5, idx6:end]);
Update size indicator sz:
sz = numel(piclist);
Arrays for Image Features
Create empty arrays for the variance, range, and mean of the GLCM (gray-level co-occurrence
matrix) as well as the properties of the GLCM (graycoprops). With the default graycomatrix
settings, the GLCM is an 8 × 8 matrix, hence eight columns per statistic.
% Variance
variance_ss = zeros(sz, 8);
% Mean
mean_ss = zeros(sz, 8);
% Range
range_ss = zeros(sz, 8);
% GLCM Properties
prop_ss = zeros(sz, 4);
Update Timestamps
zst = NaT(sz, 1);
for i = 1:sz
zst(i,:) = datetime(piclist(i).name(11:21),'InputFormat','yyMMdd-HHmm');
end
Wave Height
Get the measured wave heights for each timestamp of an image and interpolate the data if there is
no match.
load('allewelle.mat')
[Lia,Locb] = ismember(zst, dattimwelle);
welleHs = NaN(sz, 1);
welleHs(Lia) = Hs(Locb(Lia));
welleHs = interp1(zst(Lia), welleHs(Lia), zst);
Regression Data Array
Create an array of the regression data that matches the features of each image to the corresponding
measured or interpolated wave height.
for i = 1:sz
img = imread([imgfolder piclist(i).name]); % read image to workspace
img = rgb2gray(img); % convert to grayscale
% GLCM Feature Extraction
GLCM = graycomatrix(img);
props = graycoprops(GLCM);
% Variance
variance_ss(i, 1:8) = var(GLCM);
% Mean
mean_ss(i, 1:8) = mean(GLCM);
% Range
range_ss(i, 1:8) = range(GLCM);
% Properties of the GLCM
prop_ss(i, 1) = props.Contrast;
prop_ss(i, 2) = props.Correlation;
prop_ss(i, 3) = props.Energy;
prop_ss(i, 4) = props.Homogeneity;
end
%% Make Arrays of Regression Data (assembled once after the loop)
% Training Data (Features + Wave Height)
tableProps = [variance_ss, mean_ss, range_ss, prop_ss, welleHs];
% Prediction Data
predData = [variance_ss, mean_ss, range_ss, prop_ss];
Wave Height Prediction
Predict the wave height from the image features.
Load the trained regression model trainedModel.mat.
load('trainedModel.mat')
If you would like to retrain the model on the image features and measured wave heights, uncomment
the code below.
%[trainedModel, validationRMSE] = trainRegressionModel(tableProps);
Use the trained model and the image features (prediction data) to calculate the wave heights.
yfit = trainedModel.predictFcn(predData);
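Gaussian process regression is not constrained to non-negative outputs, so very calm scenes can occasionally yield slightly negative predicted heights. An optional guard, not part of the original routine, could clamp these:
% Optional: clamp physically impossible negative wave heights to zero
yfit = max(yfit, 0);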
Water Level Prediction
Feature Data for Water Level Prediction
Set the limits for the wave heights to distinguish between the different sea states (refer to WMO sea
state code).
s1 = 10; % calm sea, max. 10 cm wave height
s2 = 50; % smooth sea, max. 50 cm wave height
s3 = 125; % slight sea, max. 125 cm wave height
s4 = 250; % moderate sea, max. 250 cm wave height
Get Feature Data Depending on the Measured Wave Height
Get the measured water level for each timestamp.
load('allewstand.mat')
wstand = zeros(size(piclist));
for i=1:length(piclist)
wstand(i,1) = wst(dattim == zst(i)); % might cause issues if there are large data gaps
end
clear wst dattim
Make an array with the image features, water levels, and wave heights.
B = [predData wstand]; % combine image features and measured water level
C = [B welleHs]; % combine B and measured wave height
Find the entries in the feature data whose wave height in welleHs falls into each sea state. Column 30 of C is the wave height, since predData contributes 28 feature columns (three times eight GLCM statistics plus four properties) and wstand the 29th column.
ind_clm = find(C(:, 30) <= s1);
ind_smth = find(C(:, 30) <= s2 & C(:, 30) > s1);
ind_slght = find(C(:, 30) <= s3 & C(:, 30) > s2);
ind_mdrt = find(C(:, 30) <= s4 & C(:, 30) > s3);
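The four find calls above can also be expressed with a single discretize call, which makes the WMO limits explicit in one place and guarantees that each wave height falls into at most one bin; a sketch using the same limits (wave heights above s4 are left unassigned, as in the original):
% Sketch: bin the wave heights in column 30 into sea states in one call
edges = [-Inf s1 s2 s3 s4];
states = ["calm" "smooth" "slight" "moderate"];
seaState = discretize(C(:, 30), edges, 'categorical', states);
ind_clm = find(seaState == "calm"); % analogous for the other three states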
Get the prediction data (image features) for the water levels depending on sea state.
predCalm = predData(ind_clm, :);
predSmooth = predData(ind_smth, :);
predSlight = predData(ind_slght, :);
predModerate = predData(ind_mdrt, :);
Make training data for the water levels depending on sea state (includes the image features and the
measured water levels).
RD_calm = B(ind_clm, :);
RD_smooth = B(ind_smth, :);
RD_slight = B(ind_slght, :);
RD_mod = B(ind_mdrt, :);
Get Feature Data Depending on the Predicted Wave Height
Create an array with the image features, the measured water levels, and the predicted wave heights.
A = [predData wstand yfit];
Find entries in the feature data that match the wave height in yfit according to the sea states.
ind_calm = find(A(:, 30) <= s1);
ind_smooth = find(A(:, 30) <= s2 & A(:, 30) > s1);
ind_slight = find(A(:, 30) <= s3 & A(:, 30) > s2);
ind_mod = find(A(:, 30) <= s4 & A(:, 30) > s3);
Split the feature data into the four sea state categories and save them in new variables.
predData_calm = predData(ind_calm, :);
predData_smooth = predData(ind_smooth, :);
predData_slight = predData(ind_slight, :);
predData_mod = predData(ind_mod, :);
Predict Water Levels
Load the trained regression models.
load('trainedModelCalm.mat')
load('trainedModelSmooth.mat')
load('trainedModelSlight.mat')
load('trainedModelModerate.mat')
If you would like to retrain the models on the image features and measured water levels depending
on sea state, uncomment the code below.
%[trainedModelCalm, validationRMSEcalm] = trainRegressionModelCalm(RD_calm);
%[trainedModelSmooth, validationRMSEsmooth] = trainRegressionModelSmooth(RD_smooth);
%[trainedModelSlight, validationRMSEslight] = trainRegressionModelSlight(RD_slight);
%[trainedModelModerate, validationRMSEmod] = trainRegressionModelModerate(RD_mod);
Use the trained models and image features to predict the water levels. The inputs starting with
predData_ were derived from the predicted wave height yfit. If you would like to use inputs that
were derived from the measured wave height welleHs, replace the predData_ inputs as detailed in
the comments.
yfit_calm = trainedModelCalm.predictFcn(predData_calm); % for welleHs: predCalm
yfit_smooth = trainedModelSmooth.predictFcn(predData_smooth); % for welleHs: predSmooth
yfit_slight = trainedModelSlight.predictFcn(predData_slight); % for welleHs: predSlight
yfit_moderate = trainedModelModerate.predictFcn(predData_mod); % for welleHs: predModerate
Visualize Results with Histograms
Compare the measured and predicted results visually. The bin width can be changed to make
differences or small numbers of observations more visible; however, it must have the same value for
both histograms.
Predicted vs. Measured Wave Height
Plot the histogram of the measured wave heights, then overlay it with the histogram of predicted
wave heights.
h1 = histogram(welleHs, 'BinWidth', 5);
title("Histogram of Predicted vs. Measured Wave Height (Sea State, 2015)");
ylabel("Number of Observations");
xlabel("Wave Height in cm");
hold on
h2 = histogram(yfit, 'BinWidth', 5);
legend("Measured Wave Height", "Predicted Wave Height");
hold off
Predicted vs. Measured Water Level
Calm Sea
Plot the histogram of the measured water levels during calm seas, then overlay it with the histogram
of predicted water levels during calm seas. If you used the features that depend on the measured
wave height, replace ind_calm with ind_clm.
h1_calm = histogram(wstand(ind_calm, :), 'BinWidth', 50);
title("Histogram of Predicted vs. Measured Water Level (Calm Sea, 2015)");
ylabel("Number of Observations");
xlabel("Water Level in cm");
hold on
h2_calm = histogram(yfit_calm, 'BinWidth', 50);
legend("Measured Water Level", "Predicted Water Level");
hold off
Smooth Sea
Plot the histogram of the measured water levels during smooth seas, then overlay it with the
histogram of predicted water levels during smooth seas. If you used the features that depend on the
measured wave height, replace ind_smooth with ind_smth.
h1_smooth = histogram(wstand(ind_smooth, :), 'BinWidth', 5);
title("Histogram of Predicted vs. Measured Water Level (Smooth Sea, 2015)");
ylabel("Number of Observations");
xlabel("Water Level in cm");
hold on
h2_smooth = histogram(yfit_smooth, 'BinWidth', 5);
legend("Measured Water Level", "Predicted Water Level");
hold off
Slight Sea
Plot the histogram of the measured water levels during slight seas, then overlay it with the histogram
of predicted water levels during slight seas. If you used the features that depend on the measured
wave height, replace ind_slight with ind_slght.
h1_slight = histogram(wstand(ind_slight, :), 'BinWidth', 5);
title("Histogram of Predicted vs. Measured Water Level (Slight Sea, 2015)");
ylabel("Number of Observations");
xlabel("Water Level in cm");
hold on
h2_slight = histogram(yfit_slight, 'BinWidth', 5);
legend("Measured Water Level", "Predicted Water Level");
hold off
Moderate Sea
Plot the histogram of the measured water levels during moderate seas, then overlay it with the
histogram of predicted water levels during moderate seas. If you used the features that depend on
the measured wave height, replace ind_mod with ind_mdrt.
h1_mod = histogram(wstand(ind_mod, :), 'BinWidth', 50);
title("Histogram of Predicted vs. Measured Water Level (Moderate Sea, 2015)");
ylabel("Number of Observations");
xlabel("Water Level in cm");
hold on
h2_mod = histogram(yfit_moderate, 'BinWidth', 50);
legend("Measured Water Level", "Predicted Water Level");
hold off
Visualize Results by Plotting Predicted against Measured Values
Plot the predicted against the measured values. The line y = x indicates a perfect prediction of the
measured values.
Predicted vs. Measured Wave Height
Plot the predicted wave height against the measured wave height.
plot(welleHs, yfit, '.')
title("Predicted vs. Measured Wave Height (2015)");
xlabel("Measured Wave Height [cm]");
ylabel("Predicted Wave Height [cm]");
x = [min(welleHs) max(welleHs)];
y = x;
hold on
plot(x, y, "LineWidth", 1, "Color", [0 0 0])
xlim([min(welleHs) max(welleHs)])
ylim([min(welleHs) max(welleHs)])
legend("Observations", "Perfect Prediction", "Location", "southeast");
hold off
Predicted vs. Measured Water Level
Calm Sea
Plot the predicted water levels at calm sea against the measured water levels. If you used the
features that depend on the measured wave height, replace ind_calm with ind_clm.
plot(wstand(ind_calm, :), yfit_calm, '.')
title("Predicted vs. Measured Water Level (Calm Sea, 2015)");
xlabel("Measured Water Level [cm]");
ylabel("Predicted Water Level [cm]");
x = [min(wstand(ind_calm, :)) max(wstand(ind_calm, :))];
y = x;
hold on
plot(x, y, "LineWidth", 1, "Color", [0 0 0])
xlim([min(wstand(ind_calm, :)) max(wstand(ind_calm, :))])
ylim([min(wstand(ind_calm, :)) max(wstand(ind_calm, :))])
legend("Observations", "Perfect Prediction", "Location", "southeast");
hold off
Smooth Sea
Plot the predicted water levels at smooth sea against the measured water levels. If you used the
features that depend on the measured wave height, replace ind_smooth with ind_smth.
plot(wstand(ind_smooth, :), yfit_smooth, '.')
title("Predicted vs. Measured Water Level (Smooth Sea, 2015)");
xlabel("Measured Water Level [cm]");
ylabel("Predicted Water Level [cm]");
x = [min(wstand(ind_smooth, :)) max(wstand(ind_smooth, :))];
y = x;
hold on
plot(x, y, "LineWidth", 1, "Color", [0 0 0])
xlim([min(wstand(ind_smooth, :)) max(wstand(ind_smooth, :))])
ylim([min(wstand(ind_smooth, :)) max(wstand(ind_smooth, :))])
legend("Observations", "Perfect Prediction", "Location", "southeast");
hold off
Slight Sea
Plot the predicted water levels at slight sea against the measured water levels. If you used the
features that depend on the measured wave height, replace ind_slight with ind_slght.
plot(wstand(ind_slight, :), yfit_slight, '.')
title("Predicted vs. Measured Water Level (Slight Sea, 2015)");
xlabel("Measured Water Level [cm]");
ylabel("Predicted Water Level [cm]");
x = [min(wstand(ind_slight, :)) max(wstand(ind_slight, :))];
y = x;
hold on
plot(x, y, "LineWidth", 1, "Color", [0 0 0])
xlim([min(wstand(ind_slight, :)) max(wstand(ind_slight, :))])
ylim([min(wstand(ind_slight, :)) max(wstand(ind_slight, :))])
legend("Observations", "Perfect Prediction", "Location", "southeast");
hold off
Moderate Sea
Plot the predicted water levels at moderate sea against the measured water levels. If you used the
features that depend on the measured wave height, replace ind_mod with ind_mdrt.
plot(wstand(ind_mod, :), yfit_moderate, '.')
title("Predicted vs. Measured Water Level (Moderate Sea, 2015)");
xlabel("Measured Water Level [cm]");
ylabel("Predicted Water Level [cm]");
x = [min(wstand(ind_mod, :)) max(wstand(ind_mod, :))];
y = x;
hold on
plot(x, y, "LineWidth", 1, "Color", [0 0 0])
xlim([min(wstand(ind_mod, :)) max(wstand(ind_mod, :))])
ylim([min(wstand(ind_mod, :)) max(wstand(ind_mod, :))])
legend("Observations", "Perfect Prediction", "Location", "southeast");
hold off
Plot Data over Time
Wave Height
Plot the measured (and interpolated) wave height data over time and overlay the predictions for
comparison.
plot(zst, welleHs, '.', 'MarkerSize', 10);
hold on
plot(zst, yfit, '.', 'MarkerSize', 10);
title("Measured vs Predicted Wave Heights (2015)");
xlabel("Measurement Time Points");
ylabel("Wave Height in cm");
legend("Measured Wave Heights", "Predicted Wave Heights");
hold off
Water Level
Calm Sea
Plot the measured water level data for calm seas over time and overlay the predictions for
comparison. If you used the features that depend on the measured wave height, replace ind_calm
with ind_clm.
plot(zst(ind_calm), wstand(ind_calm), '.', 'MarkerSize', 10);
hold on
plot(zst(ind_calm), yfit_calm, '.', 'MarkerSize', 10);
title("Measured vs Predicted Water Levels (Calm Sea, 2015)");
xlabel("Measurement Time Points");
ylabel("Water Level in cm");
legend("Measured Water Levels", "Predicted Water Levels");
hold off
Smooth Sea
Plot the measured water level data for smooth seas over time and overlay the predictions for
comparison. If you used the features that depend on the measured wave height, replace
ind_smooth with ind_smth.
plot(zst(ind_smooth), wstand(ind_smooth), '.', 'MarkerSize', 10);
hold on
plot(zst(ind_smooth), yfit_smooth, '.', 'MarkerSize', 10);
title("Measured vs Predicted Water Levels (Smooth Sea, 2015)");
xlabel("Measurement Time Points");
ylabel("Water Level in cm");
legend("Measured Water Levels", "Predicted Water Levels");
hold off
Slight Sea
Plot the measured water level data for slight seas over time and overlay the predictions for
comparison. If you used the features that depend on the measured wave height, replace
ind_slight with ind_slght.
plot(zst(ind_slight), wstand(ind_slight), '.', 'MarkerSize', 10);
hold on
plot(zst(ind_slight), yfit_slight, '.', 'MarkerSize', 10);
title("Measured vs Predicted Water Levels (Slight Sea, 2015)");
xlabel("Measurement Time Points");
ylabel("Water Level in cm");
legend("Measured Water Levels", "Predicted Water Levels");
hold off
Moderate Sea
Plot the measured water level data for moderate seas over time and overlay the predictions for
comparison. If you used the features that depend on the measured wave height, replace
ind_mod with ind_mdrt.
plot(zst(ind_mod), wstand(ind_mod), '.', 'MarkerSize', 10);
hold on
plot(zst(ind_mod), yfit_moderate, '.', 'MarkerSize', 10);
title("Measured vs Predicted Water Levels (Moderate Sea, 2015)");
xlabel("Measurement Time Points");
ylabel("Water Level in cm");
legend("Measured Water Levels", "Predicted Water Levels");
hold off
Plot All Water Levels over Time
Make timetables for the water levels depending on different sea states.
% Calm Sea
yf = yfit_calm;
TCalm = timetable(zst(ind_calm), yf);
% Smooth Sea
yf = yfit_smooth;
TSmooth = timetable(zst(ind_smooth), yf);
% Slight Sea
yf = yfit_slight;
TSlight = timetable(zst(ind_slight), yf);
% Moderate Sea
yf = yfit_moderate;
TMod = timetable(zst(ind_mod), yf);
Combine the individual timetables into one large timetable and sort ascending by time.
TComb = vertcat(TCalm, TSmooth, TSlight, TMod);
TComb = sortrows(TComb,'Time','ascend');
Plot all of the predicted and the measured water levels over the whole time frame.
plot(zst, wstand, 'Marker', '.', 'MarkerSize', 10)
hold on
plot(zst, TComb.yf, '.', 'MarkerSize', 10)
title("Measured vs Predicted Water Levels (2015)");
xlabel("Measurement Time Points");
ylabel("Water Level in cm");
legend("Measured Water Levels", "Predicted Water Levels");
hold off
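Plotting TComb.yf against zst assumes that every image was assigned to exactly one sea state, so that both vectors have the same length and order. Plotting the predictions against the timetable's own time vector avoids that assumption:
% Sketch: use the timetable's own timestamps to rule out misalignment
plot(TComb.Time, TComb.yf, '.', 'MarkerSize', 10)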
Plot a histogram with the distribution of the water levels.
h1_all = histogram(wstand, 'BinWidth', 5);
title("Histogram of Predicted vs. Measured Water Levels (2015)");
ylabel("Number of Observations");
xlabel("Water Level in cm");
hold on
h2_all = histogram(TComb.yf, 'BinWidth', 5);
legend("Measured Water Level", "Predicted Water Level");
hold off
Calculate RMSE
Calculate the error for each predicted value and the mean over all predictions. Note that for a single prediction the RMSE reduces to the absolute error, so the mean computed below is effectively a mean absolute error.
Wave Height
sz = numel(yfit);
rmse = zeros(sz, 1);
for i = 1:sz
rmse(i,1) = round(sqrt(mean((welleHs(i) - yfit(i)).^2)));
end
rmse2 = rmmissing(rmse);
rmseTotal = round(mean(rmse2));
stdRMSE = round(std(rmse2));
Water Level
Calm Sea
If you used the features that depend on the measured wave height, replace ind_calm with ind_clm.
sz_calm = numel(yfit_calm);
rmse_calm = zeros(sz_calm, 1);
ws_calm = wstand(ind_calm);
for i = 1:sz_calm
rmse_calm(i,1) = round(sqrt(mean((ws_calm(i) - yfit_calm(i)).^2)));
end
rmse_calm = rmmissing(rmse_calm);
rmseTotal_calm = round(mean(rmse_calm));
stdRMSE_calm = round(std(rmse_calm));
Smooth Sea
If you used the features that depend on the measured wave height, replace ind_smooth with
ind_smth.
sz_smooth = numel(yfit_smooth);
rmse_smooth = zeros(sz_smooth, 1);
ws_smooth = wstand(ind_smooth);
for i = 1:sz_smooth
rmse_smooth(i,1) = round(sqrt(mean((ws_smooth(i) - yfit_smooth(i)).^2)));
end
rmse_smooth = rmmissing(rmse_smooth);
rmseTotal_smooth = round(mean(rmse_smooth));
stdRMSE_smooth = round(std(rmse_smooth));
Slight Sea
If you used the features that depend on the measured wave height, replace ind_slight with
ind_slght.
sz_slight = numel(yfit_slight);
rmse_slight = zeros(sz_slight, 1);
ws_slight = wstand(ind_slight);
for i = 1:sz_slight
rmse_slight(i,1) = round(sqrt(mean((ws_slight(i) - yfit_slight(i)).^2)));
end
rmse_slight = rmmissing(rmse_slight);
rmseTotal_slight = round(mean(rmse_slight));
stdRMSE_slight = round(std(rmse_slight));
Moderate Sea
If you used the features that depend on the measured wave height, replace ind_mod with ind_mdrt.
sz_mod = numel(yfit_moderate);
rmse_moderate = zeros(sz_mod, 1);
ws_mod = wstand(ind_mod);
for i = 1:sz_mod
rmse_moderate(i,1) = round(sqrt(mean((ws_mod(i) - yfit_moderate(i)).^2)));
end
rmse_moderate = rmmissing(rmse_moderate);
rmseTotal_moderate = round(mean(rmse_moderate));
stdRMSE_mod = round(std(rmse_moderate));
Water Levels for All Sea States
sz_all = numel(TComb.yf);
rmse_wstand = zeros(sz_all, 1);
for i = 1:sz_all
rmse_wstand(i,1) = round(sqrt(mean((wstand(i) - TComb.yf(i)).^2)));
end
rmse_wstand = rmmissing(rmse_wstand);
rmseTotal_wstand = round(mean(rmse_wstand));
stdRMSE_wstand = round(std(rmse_wstand));
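Since the square root of a single squared difference is simply the absolute difference, the loops above compute rounded absolute errors and then average them. The same quantity, together with a conventional RMSE over all samples, can be obtained without loops; a sketch for the combined water levels (the per-sea-state variants follow the same pattern, and the subtraction assumes the alignment discussed above):
% Sketch: vectorized error metrics for the combined water levels
err = rmmissing(wstand - TComb.yf); % per-sample errors, NaNs removed
maeTotal = round(mean(abs(err))); % corresponds to the looped mean computed above
rmseAll = round(sqrt(mean(err.^2))); % conventional RMSE over all samples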
Appendix III
PC Specifications and MATLAB® Toolboxes
The image preprocessing and the feature extraction were done on the Home PC, while the deep and
machine learning tasks were done on both the Home and the Remote PC.
The information about the devices, the installed Windows versions, and the CPU and GPU was taken
from the "About this PC" section under "Settings" and the "Performance" tab of the Task Manager
(see Table A1.1 below).
Table A1.1: PC specifications.

Specification           Home PC                                   Remote PC
Operating System        Microsoft Windows 11 Home,                Microsoft Windows 10 Pro,
                        Version 10.0 (Build 22000)                Version 10.0 (Build 19045)
CPU                     Intel® Core™ i3-8100 CPU @ 3.60 GHz       Intel® Core™ i7-6700 CPU @ 3.40 GHz
- Cores                 4                                         4
- Logical Processors    4                                         8
GPU                     Intel® UHD Graphics 630                   NVIDIA GeForce GTX 1060 6GB
- Supported by MATLAB   No                                        Yes [1]
Java Version            Java 1.8.0_202-b08 with Oracle            Java 1.8.0_202-b08 with Oracle
                        Corporation Java HotSpot™ 64-Bit          Corporation Java HotSpot™ 64-Bit
                        Server VM mixed mode                      Server VM mixed mode
MATLAB Version          9.12.0.1975300 (R2022a) Update 3          9.12.0.2039608 (R2022a) Update 5
The Intel® GPU could unfortunately not be used for GPU computing, as MATLAB exclusively
supports NVIDIA® GPU architectures with compute capability 3.5 to 8.x. According to an answer in
the MATLAB Answers support forum by Walter Roberson, MathWorks relies on third-party
high-performance scientific libraries for the target architectures, which are currently provided by
NVIDIA, IBM, and AMD, but not by Intel [2].
Multiple toolboxes needed to be installed for the different tasks (preprocessing, feature extraction,
machine and deep learning) as well as for the parallel and GPU computing (see Table A1.2 below).
Table A1.2: Installed MATLAB® toolboxes.

Toolbox Name                               Version
Computer Vision Toolbox                    10.2 (R2022a)
Deep Learning Toolbox                      14.4 (R2022a)
GPU Coder *                                2.3 (R2022a)
Image Processing Toolbox                   11.5 (R2022a)
MATLAB Coder **                            5.4 (R2022a)
Parallel Computing Toolbox *               7.6 (R2022a)
Statistics and Machine Learning Toolbox    12.3 (R2022a)
System Identification Toolbox              9.16 (R2022a)
The toolboxes in Table A1.2 without an asterisk are needed for the deep and machine learning tasks,
either for using task-specific commands or apps (e.g. Regression Learner App, Deep Network
Designer). A single asterisk (*) indicates that the toolbox is needed for performing parallel and GPU
computing. The double asterisk (**) on the MATLAB Coder indicates that the toolbox was
installed but not necessary for either machine and deep learning or parallel and GPU computing.
[1] The MathWorks, Inc., "GPU Computing Requirements - MATLAB & Simulink - MathWorks
Deutschland," MathWorks® Help Center.
https://de.mathworks.com/help/parallel-computing/gpu-computing-requirements.html (accessed
Jan. 12, 2023).
[2] W. Roberson, "Use Intel graphics card/driver for GPU programming on Matlab?," MATLAB
Central, Jul. 18, 2021.
https://de.mathworks.com/matlabcentral/answers/881123-use-intel-graphics-card-driver-for-gpu-
programming-on-matlab#answer_749203 (accessed Jan. 12, 2023).
Appendix IV
Edits and Additional Notes
The published version of this thesis was corrected regarding false figure numbers and other small
errors (last edited: 8th April 2023). All corrections are listed below; where a figure number was
corrected, the struck-through old number is shown before the arrow and the corrected number after it.
Page 18: "see Fig. 4 - 6 for a before, during and after view, and Fig. 5 → 7 for placements on the coast"
Page 25: "[…] and corresponding measured or interpolated water levels […]"
Page 26: "[…] at a water level of around 6,60 → 6.60 m […]"
Page 30: "Outliers are especially prevalent for high measured water levels (Fig. 6 → 15, left)."
Page 43: "As seen in Fig. 2 → 21, the camera faces the beach at an angle instead of head-on."
Page 44: "[…] the angle of view differs notably from the one in Fig. 2 → 21 […]"
Page 45: "[…], the angle of view was changed once again (see Fig. 23)"
Page 50: "[…] all folders have to be within the same folder layer […]"
Page 51: "This name contains the timestamp in the format YYMMDD-hhmm […]"
Page 61: "This name contains the timestamp in the format YYMMDD-hhmm […]"
Page 62: "[…] (see figures 5 → 24, 6 → 25, 8 → 27, and 9 → 28 in 3.2 Data Sources)."
Page 67: "Using either the linking table created in 3.3.1 Preprocessing the Image Data […]"
Page 80: "In Fig. 1 → 42, […]"
Page 82: "[…] its layer graph is plotted in the left image of Fig. 28 → 47."
Page 114: "The water level detection for calm → moderate seas […]"
Page 130: "In the case of Fig. 93 → 99, […]"
Page 135: "Looking at the edge images in Fig. 98 → 104 […]"
Furthermore, it should be noted that instead of using the raw wave heights for the wave height
detection, the significant wave height Hs was used. However, Hs is the average of the highest
one-third of waves, meaning that Hs does not reflect the wave height that is actually visible in
the webcam image. Therefore, it is strongly advised to retrain the models provided here on the raw
wave heights, as this would provide correct and improved results.