ArticlePDF Available

Automatic Recognition of Long Period Events From Volcano Tectonic Earthquakes at Cotopaxi Volcano

May 2016
IEEE Transactions on Geoscience and Remote Sensing 54(99):1-11

May 2016
54(99):1-11

DOI:10.1109/TGRS.2016.2559440

Authors:

Román Lara-Cueva

Universidad de las Fuerzas Armadas-ESPE

Diego Benitez

Universidad San Francisco de Quito (USFQ)

Enrique V. Carrera

Universidad de las Fuerzas Armadas-ESPE

Mario Ruiz

Escuela Politécnica Nacional

Show all 5 authorsHide

Geophysics experts are interested in understanding the behavior of volcanoes and forecasting possible eruptions by monitoring and detecting the increment on volcano-seismic activity, with the aim of safeguarding human lives and material losses. This paper presents an automatic volcanic event detection and classification system, which considers feature extraction and feature selection stages, to reduce the processing time toward a reliable real-time volcano early warning system (RT-VEWS). We built the proposed approach in terms of the seismicity presented in 2009 and 2010 at the Cotopaxi Volcano located in Ecuador. In the detection stage, the recordings were time segmented by using a nonoverlapping 15-s window, and in the classification stage, the detected seismic signals were 1-min long. For each detected signal conveying seismic events, a comprehensive set of statistical, temporal, spectral, and scale-domain features were compiled and extracted, aiming to separate long-period (LP) events from volcano-tectonic (VT) earthquakes. We benchmarked two commonly used types of feature selection techniques, namely, wrapper (recursive feature extraction) and embedded (cross-validation and pruning). Each technique was used within a suitable and appropriate classification algorithm, either the support vector machine (SVM) or the decision trees. The best result was obtained by using the SVM classifier, yielding up to 99% accuracy in the detection stage and 97% accuracy and sensitivity in the event classification stage. Selected features and their interpretation were consistent among different input spaces in simple terms of the spectral content of the frequency bands at 3.1 and 6.8 Hz. A comparative analysis showed that the most relevant features for automatic discrimination between LP and VT events were one in the time domain, five in the frequency domain, and nine in the scale domain. Our study provides the framework for an event classification system with high - ccuracy and reduced computational requirements, according to the orientation toward a future RT-VEWS.

Seismological stations deployed at Cotopaxi Volcano.

…

Examples for preprocessing and feature extraction. (a) Raw recorded waveform conveying a VT event. (b) The same signal after filtering and normalizing. (c,d) Examples of VT and LP events after detection and segmentation. (e) Welch periodogram for the example events (LP in blue continuous, VT in red discontinuous). (f) Wavelet decomposition at level 6 for the example events (LP in blue continuous, VT in red discontinuous).

…

Figures - uploaded by Mario Ruiz

Content may be subject to copyright.

Content uploaded by Mario Ruiz

Content may be subject to copyright.

IEEE TRANSACTIONS ON GEOSCIENCES AND REMOTE SENSING , VOL. XX, NO. X, NOV 2016 1

Automatic Recognition of Long Period Events from

Volcano Tectonic Earthquakes at Cotopaxi Volcano

Rom´

an Lara-Cueva, Member, IEEE, Diego S. Ben´

ıtez, Senior Member, IEEE, Enrique V. Carrera, Member, IEEE,

Mario Ruiz, and Jos´

e Luis Rojo- ´

Alvarez, Senior Member, IEEE

Abstract

Geophysics experts are interested in understanding the behavior of volcanoes and forecasting possible eruptions, by monitoring

and detecting the increment on volcano-seismic activity, with the aim of safeguarding human lives and material losses. This work

presents an automatic volcanic event detection and classiﬁcation system, which considers feature extraction and feature selection

stages, in order to reduce the processing time towards a reliable real-time volcano early warning system (RT-VEWS). We built

the proposed approach in terms of the seismicity presented during 2009 and 2010 at Cotopaxi Volcano located in Ecuador. In

the detection stage, the recordings were time-segmented by using a non-overlapping 15-second window, and in the classiﬁcation

stage, the detected seismic signals were 1min long. For each detected signal conveying seismic events, a comprehensive set of

statistical, temporal, spectral, and scale-domain features were compiled and extracted, aiming to separate Long Period (LP) events

from Volcano Tectonic (VT) earthquakes. We benchmarked two commonly used types of feature selection techniques, namely,

wrapper (recursive feature extraction) and embedded (cross-validation and pruning). Each technique was used within a suitable

and appropriate classiﬁcation algorithm, either Support Vector Machine (SVM) or Decision Trees. The best result was obtained

by using the SVM classiﬁer, yielding up to 99%accuracy in the detection stage, and 97%accuracy and sensitivity in the event

classiﬁcation stage. Selected features and their interpretation were consistent among different input spaces in simple terms of

spectral content of the frequency bands at 3.1 and 6.8 Hz. A comparative analysis showed that the most relevant features for

automatic discrimination between LP and VT events were 1 in the time domain, 5 in the frequency domain, and 9 in the scale

domain. Our study provides the framework for an event classiﬁcation system with high accuracy and reduced computational

requirements, according to the orientation towards a future RT-VEWS.

Index Terms

Volcanic Events, Seismic Event Detection, Seismic Event Classiﬁcation, Support Vector Machines, Decision Trees, Feature

Extraction, Feature Selection.

I. INT ROD UC TI ON

In the case of natural disasters or anthropogenic hazards, early warning systems are necessary in order to safe human lives

[1], [2]. One of the main natural hazards are volcanic eruptions [3], and in this regard, several monitoring systems have been

Manuscript received April 7, 2015. RALC and EVC are with Departamento de El´

ectrica y Electr´

onica, Universidad de las Fuerzas Armadas ESPE, Quito-

Ecuador, 171-5-231B (see http://wicom.espe.edu.ec/contactos.html). DSB is with Colegio de Ciencias e Ingenier´

ıas El Polit´

ecnico, Universidad San Francisco

de Quito, Quito, Ecuador. MR is with Instituto Geof´

ısico, Escuela Polit´

ecnica Nacional, Quito, Ecuador. JLRA is with Department of Signal Theory and

Communications, Rey Juan Carlos University, Fuenlabrada-Spain.

This is the author's version of an article that has been published in this journal. Changes were made to this version by the publisher prior to publication.

The final version of record is available at http://dx.doi.org/10.1109/TGRS.2016.2559440

IEEE TRANSACTIONS ON GEOSCIENCES AND REMOTE SENSING , VOL. XX, NO. X, NOV 2016 2

previously developed and deployed around the world for trying to better understand this phenomenon [4]. The seismic signature

of volcanic events are registered today by powerful seismometers or geophones, and the main types of seismic signals on active

volcanoes are Volcano Tectonic (VT) earthquakes, Long Period (LP) events, Tremors (TRE), and Hybrid (HYB) events [5], [6].

Figures 2(c) and 2(d) show signal examples of VT and LP events originated at Cotopaxi Volcano. VT events are earthquakes

taking place in a volcanic environment with a variable time duration, usually less than 30s and wide spectral content, typically

above 5 Hz. Whilst, LP events, also known as low frequency events, show a lack of distinguishable phases and a typical time

duration below 90s, with spectral content limited at narrow frequency bands between 2 and 5 Hz. HYB earthquakes share the

features of VT and LP events, and they are characterized by high frequency signals, usually with a wide spectral bandwidth

above 10 Hz. Finally, TRE are characterized by their constant amplitude and long duration, they are the most distinctive signals

generated by volcanoes, and they are widely considered as the most complex types of events, with duration ranging from a

few minutes to several days [7].

Our main interest is to help geophysicists to understand the behavior of volcanoes and to try to predict eruptions when

possible. For this purpose, we designed a volcano monitoring system that includes a high accuracy event detection followed by

a classiﬁcation component. A Volcano Early Warning System (VEWS) based on our proposal may allow authorities to alert the

population, as soon as possible, about the risk of eventual disasters. In this setting, and as a ﬁrst step to the future development

of a real-time VEWS (RT-VEWS), the aim of this work is to build two connected systems, namely, an automatic detection

system for volcanic seismicity, and subsequently followed by a high-accuracy event classiﬁcation system. Our hypothesis is that

the use of carefully designed detector and classiﬁer systems, both of them based on appropriate feature extraction and machine

learning techniques, will reduce the processing time currently required by human experts for examination of the seismic signals.

Speciﬁcally, we addressed the design of an automatic system for ﬁrst separating LP and VT events from Background Noise

(BN), and then identifying LP from VT with the highest possible accuracy. We are mainly interested in LP and VT events,

since these types of events are crucial for helping to forecast eruptions [8]–[10]. The classiﬁcation of other types of events is

beyond the scope of this work, but it could be readily addressed with our proposed approach.

After literature review, a general procedure was deﬁned and designed, in order to process, analyze, and identify volcano-

seismic signals. We used a preprocessing stage, which removes noise and non-volcanic originated signals like thunders, by

using band pass ﬁlters [11]. Then, a stage capable of detecting volcanic events is proposed [12], and it is followed by a

feature extraction [13] and feature selection stage [14]. Following the state of art of preceding works in geophysics, the use

of ﬁlter and embedded methods allowed us to determine the main features in the time, frequency, and scale domains. Finally,

a classiﬁcation stage is used to identify VT and LP events [15]. In this setting, we benchmarked two commonly used feature

selection techniques, namely, wrapper and embedded, each of them in a suitable and appropriate classiﬁcation algorithm,

namely, Support Vector Machines (SVM) and Decision Trees (DT), as later discussed with detail.

There are two main contributions from this paper. First, the whole system (i.e., event detection and classiﬁcation) is based on

machine learning techniques, and our proposed approach applies a two-stage solution, consisting of event detection followed

by event-classiﬁcation. Previous works reported in the literature [16]–[19] did not apply any detection strategy based on

classiﬁcation, but instead they just used sliding windows (with ﬁxed or variable length) and a ratio triggering algorithm for

This is the author's version of an article that has been published in this journal. Changes were made to this version by the publisher prior to publication.

The final version of record is available at http://dx.doi.org/10.1109/TGRS.2016.2559440

IEEE TRANSACTIONS ON GEOSCIENCES AND REMOTE SENSING , VOL. XX, NO. X, NOV 2016 3

event detection. Therefore our proposed speciﬁc event detector based in classiﬁcation techniques is novel for this application. In

this setting, we also employ feature extraction and feature selection techniques in order to reduce the processing time towards

a reliable real-time system. As a second contribution, we focus our analysis to the data obtained from the Cotopaxi volcano.

Although there are several studies about seismological events at Cotopaxi volcano, including lahars studies or very-long-period

and long-period events activity analysis [20], [21], none of them focused in algorithms for automatic real-time detection and

classiﬁcation of seismic events. To our best knowledge, this is the ﬁrst try to create an automatic process for signals from such

volcano.

The rest of the paper is organized as follows. Section II summarizes previous works and results about the automatic

classiﬁcation of seismic events by using machine learning techniques. Section III describes the proposed approach and the

experimental study including event detection, feature extraction, feature selection, and event classiﬁcation. Section IV presents

the results obtained in the automatic classiﬁcation process when considering different classiﬁers. Finally, Section V presents

the discussion and the main conclusions.

II. RE LATE D WOR K

Our study is referred to Cotopaxi, an active volcano which is part of the so-called Ring of Fire, located at latitude 0◦41’05”

S and longitude 78◦25’54.8” W in the Andean mountain region of Ecuador. On this volcano, a monitoring system has been

previously deployed by the Instituto Geof´

ısico de la Escuela Polit´

ecnica Nacional (IGEPN) (see Fig. 1), which currently has

installed: (a) six short period (SP) seismological stations (PITA, NAS2, VC1, REF, CAMI, and TAM), four of them with

vertical-axis sensors and two of them with three-axis sensors, and all of them with a frequency response range of 1–50 Hz; (b)

six broadband (BB) stations (VC2, REF, NAS, TAM, MORU, and VCES), with a frequency response range of 0.1–50 Hz [22].

Currently, expert scientists must analyze seismograms of volcanic signals by visual inspection, in order to label and classify

events, which represents a subjective and hard work demanding process. This task also requires a signiﬁcant amount of time

and noteworthy experience. A reliable automatic classiﬁcation system can signiﬁcantly reduce the effort required to make a

faster and objective classiﬁcation [16], [17].

In this context, a strategy to classify VT from other non-volcanic originated events, such as quarry blasts (QB) and thunders,

was proposed for the Vesuvius Volcano (Italy) [18]. In that work, a feature extraction strategy based on Linear Predictive

Coding was developed, yielding a set of 60 parameters of temporal and spectral characteristics. By using Artiﬁcial Neural

Networks (ANN) as classiﬁer, the system reached a 100%accuracy separating VT from QB. Similarly, in [23] a Hidden

Markov Model (HMM) classiﬁer was applied to data from San Crist´

obal Volcano (Nicaragua), in order to identify LP from

two types of explosions, and BN in raw seismic data, yielding an 80%accuracy. Another system was proposed in [24], in which

a fuzzy algorithm was used for automatic classiﬁcation of local and regional earthquakes, and two non-volcanic originated

events, namely, QB and machinery noise. Six main features were considered in the time and frequency domains, yielding a

96%classiﬁcation accuracy.

A feature selection strategy was developed in [25] for Nevado del Ruiz Volcano (Colombia), considering VT, LP, and Ice

Quake. Two classiﬁers were considered, based on parametric Power Spectral Density (PSD) estimation of events. A Bayesian

This is the author's version of an article that has been published in this journal. Changes were made to this version by the publisher prior to publication.

The final version of record is available at http://dx.doi.org/10.1109/TGRS.2016.2559440

IEEE TRANSACTIONS ON GEOSCIENCES AND REMOTE SENSING , VOL. XX, NO. X, NOV 2016 4

Fig. 1. Seismological stations deployed at Cotopaxi Volcano.

classiﬁer built on dissimilarity representations and a kNearest Neighbor (k-NN) classiﬁer were used, obtaining accuracy rates

of 81%and 84%, respectively. Whilst in [26], authors considered HB, LP, TRE and VT events, and worked on the stochastic

variability of a wide set of time-variant features. With this approach, the classiﬁcation rate improved from 78%to 88%when

k-NN was used instead of HMM. In [27], several feature sets proposed in [28]–[32] were benchmarked in order to discriminate

HYB, LP, TRE, and VT events, yielding the best accuracy of 83%when a k-NN classiﬁer was used.

Meanwhile in [28], a segmentation window of 30s and a combination of ANN with Genetic Algorithms were applied

to data from Villarica Volcano (Chile), obtaining a baseline recognition rate of 93%, by using an input space of temporal

and spectral features when considering LP, TRE, and Energetic Tremor events. In [33], the seismic signals were segmented

with a rectangular window of 1min, and a feature extraction strategy was applied by using circular statistics, obtaining the

instantaneous phase and the mean energy of the events with Hilbert and Wavelet transforms, in order to identify LP events

and VT earthquakes at Llaima Volcano (Chile). A linear discriminant classiﬁer reached a 92.5%accuracy. In this setting, a

SVM classiﬁer obtained a baseline recognition accuracy of 80%, by using a set of features related to amplitude, frequency,

This is the author's version of an article that has been published in this journal. Changes were made to this version by the publisher prior to publication.

The final version of record is available at http://dx.doi.org/10.1109/TGRS.2016.2559440

IEEE TRANSACTIONS ON GEOSCIENCES AND REMOTE SENSING , VOL. XX, NO. X, NOV 2016 5

and phase of the seismic signals [19]. A 10%of the entire record was used as segmentation window with 50%overlapping,

which was distinct in different recordings, due to the variable lengths for the entire seismic events at Villarica Volcano.

According to this review, previous works have demonstrated the possibilities of using machine learning techniques in this

setting. Nevertheless, the literature still lacks supportive evidence about which are the main design parameters to be considered

in each stage, from preprocessing to classiﬁcation, in order to satisfy RT requirements.

III. PROP OS ED ME TH OD OL OG Y

As already mentioned, our main interest in the middle term is to develop a VEWS, which can accomplish RT capabilities

for monitoring and decision making. In this work, we focus on developing an automatic classiﬁer to distinguish LP and VT

events as a supporting tool that can be used by experts to quantify seismic activity and to alert in the case of emergency

situations. The proposed system consist of several stages:

1) Acquired signals are initially treated by a preprocessing subsystem.

2) Next, a subsystem previously proposed by our research group [34] is adapted here to detect the presence of an (any

type) event from BN.

3) Then, an additional subsystem is given by a machine learning approach, and it is speciﬁcally devoted to classify the

previously detected events into one of two target classes, namely VT and LP. For designing this subsystem, attention has

to be placed on the stages of feature extraction and feature selection.

These subsystems, together with their performance evaluation, are described in this section.

A. Preprocessing Subsystem

Data used in this work were provided by the IGEPN. Data consisted of N= 914 independent volcano-seismic recordings

(759 LP, 116 VT, 30 HYB, and 9 TRE) sampled at 100 Hz. These particular recordings have been extracted, identiﬁed and

labelled by experts at the IGEPN from continuous monitoring recordings of seismograms. Each available recording conveyed

a single volcanic event, and it was preprocessed with a 128th order band-pass ﬁnite impulse response ﬁlter. The ﬁlter had a

passband between 1 and 15 Hz, given that undesired sea microseisms generate spectral power content about 0.2 Hz [35]. In

addition, it is known that the main spectral content for LP events is expected to be in the frequency range of 2-to-4 Hz, and

for VT earthquakes in the 5-to-10 Hz bandwidth [36]. The resulting ﬁltered record was then normalized in order to have a

zero-mean and unit-variance signal. Figure 2 shows an example of a recording segment including a VT event, both for the

raw recording as registered by the deployed sensors (a), and for its corresponding ﬁltered and normalized signal (b).

B. Event Detection and Segmentation Subsystem

Volcanic events in the database need to be extracted from BN as a ﬁrst stage. In a previous work, we developed a LP

event detector with high detection capabilities with respect to BN (see [34] for details). Given that our current system aims to

distinguish between LP and VT events, we adapted that method to provide with an accurate event detector in the presence of

LP, VT, HYB, and TRE events available in the database, before addressing the event classiﬁcation.

This is the author's version of an article that has been published in this journal. Changes were made to this version by the publisher prior to publication.

The final version of record is available at http://dx.doi.org/10.1109/TGRS.2016.2559440

IEEE TRANSACTIONS ON GEOSCIENCES AND REMOTE SENSING , VOL. XX, NO. X, NOV 2016 6

(a) (b)

(e) (f)

Fig. 2. Examples for preprocessing and feature extraction. (a) Raw recorded waveform conveying a VT event. (b) The same signal after ﬁltering and

normalizing. (c,d) Examples of VT and LP events after detection and segmentation. (e) Welch periodogram for the example events (LP in blue continuous,

VT in red discontinuous). (f) Wavelet decomposition at level 6 for the example events (LP in blue continuous, VT in red discontinuous).

This is the author's version of an article that has been published in this journal. Changes were made to this version by the publisher prior to publication.

The final version of record is available at http://dx.doi.org/10.1109/TGRS.2016.2559440

IEEE TRANSACTIONS ON GEOSCIENCES AND REMOTE SENSING , VOL. XX, NO. X, NOV 2016 7

For this purpose, a non-overlapping sliding window of length w= 1500 samples (15s) was ﬁrst applied for segmentation of

each signal. This produced a data matrix that was used to train a classiﬁer for distinguishing between BN and the presence of

any event, according to the event labeling provided by the IGEPN experts. We used supervised learning, speciﬁcally with DT

as machine learning technique. The output of the DT classiﬁer predicted a class yfor each segment, which was labeled with

y= +1 in the case of existing events of any kind, and y=−1otherwise. The performance obtained with this event detection

stage is analyzed in the results section.

The sequence of ±1’s generated by the detector and applied to each sliding window in the same recording was post-

processed for event delimitation. The average time position of consecutive detected segments was computed and denoted as td.

We decided to use a time window of 60s since this value is a good trade-off for conveying LP and VT event duration at the

Cotopaxi Volcano [22]. Therefore, the starting (ts) and ending (te) time-stamps of each event were obtained as ts=td−30

and te=td+ 30, respectively. This post-processing stage provided us with a data matrix T, given by:

T= [tT

1,tT

2, . . . , tT

i,tT

i+1, . . . , tT

M]T,(1)

where Mis the number of vectors, and tiare the signal segments in each recording containing some event within a 60s

window. We had available 350 LP and 116 VT events in this matrix, since initial recordings only have a single event per

record. For avoiding bias in the classiﬁer, we used balanced classes for training, including 116 segments of each class (hence

M= 232 signal vectors, and each vector ticonveyed 6000 elements or time samples).

Figures 2(c) and 2(d) show two examples of waveforms conveying VT and LP events after detection and segmentation. In

these panels, black vertical lines denote the segments (or sliding windows) used for the event detector, whereas the red line

indicates the detection (i.e., +1) or not (i.e.,−1) of an event in each window. Since the whole delimiting time window has

60s, the VT event is fully included in the window (4 segments), whereas the LP event although has a larger duration, it is still

mostly included in the window.

C. Design of Feature Extraction Stage

The VT–LP classiﬁcation subsystem was also designed in a machine learning framework. The input vector to the classiﬁer

was given either by the raw signal in vector ti, or by a set of features (measurements on speciﬁc properties of the signal)

which aimed to enhance the relevant information to be fed to the classiﬁer. We scrutinized several feature extraction strategies,

in order to extract a set of relevant features from each row of data matrix T, yielding a feature vector which can be denoted

by xi=g(ti), where g(.)is the feature extraction operator to be deﬁned for each feature extraction strategy. Two different

feature extraction strategies were benchmarked, and they are summarized next.

(a) Features from PSD. The PSD was calculated for each signal by using its Welch periodogram [37], yielding feature

vectors xF

i=gF(ti), where gFdenotes the PSD computing operator. The Welch periodogram used a boxcar window with

50% overlapping, unit amplitude, and length of 512 samples. Each resulting feature vector had a number of features given by

the number of samples in the Fourier representation of the Welch periodogram, yielding therefore 257 features. Figure 2(e)

shows the PSD features for the example events in ﬁgures 2(c) and 2(d).

This is the author's version of an article that has been published in this journal. Changes were made to this version by the publisher prior to publication.

The final version of record is available at http://dx.doi.org/10.1109/TGRS.2016.2559440

IEEE TRANSACTIONS ON GEOSCIENCES AND REMOTE SENSING , VOL. XX, NO. X, NOV 2016 8

(b) Features from Signal Parameters in Several Domains. Not only the spectral representation, but also many other approaches

have been proposed in the literature in order to analyze volcanic events with a number of signal parameters. In this setting, we

can deﬁne a parameter as a measurement of some signal quality, either in the time, frequency or scale domain. In this case,

the number of parameters used as features were 80 (see [38] and references therein for details), and they were compiled as

follows:

•First, a total of 12 features were obtained in the time domain, namely: mean time (µt), standard deviation (σt), signal

entropy (Ent), concentration of the signal around the mean value of the distribution (kurtosist), multi-scale entropy

(MSEt), time to reach the maximum peak (tt,mp), difference between the maximum and minimum peaks (P2Pt), root

mean square value (RMS), difference between the maximum and minimum peaks in the RMS signal (P2PRMS

t), signal

energy in RMS (ERMS

t), zero crossing number (Zt), and peaks density in the RMS signal.

•Second, a total of 19 features were obtained in the frequency domain by applying the Fast Fourier Transform (FFT) to

each signal in the data matrix. Thus, we have: the number of peaks over a threshold of 0.9 (ψf), the maximum detected

frequency (maxf), the mean frequency (µf), the frequency standard deviation (σf), the spectral entropy (Enf), the spectral

energy (Ef), the spectral kurtosis (kurtosisf), the spectral multi-scale entropy (M SEf), the maximum peak value (U)

and maximum frequency in the frequency bands from 10 to 20 Hz (max10−20

f) and from 20 to 30 Hz (max20−30

f), the root

mean square value (RMSf), the difference between the maximum and minimum peaks (P2Pf), the difference between

the maximum and minimum peaks in RMS (P2PRM S

f), the signal energy in RMS (ERMS

f), the peaks density in RMS,

the highest peak value in RMS (Hf), the second (H0

f) and the third (H00

f) highest peaks in RMS. In all these features,

the subscript findicates their frequency domain dependence.

•Finally, a total of 49 features in the scale domain were extracted, where a Wavelet transform was applied to each signal

by using a 10th order symlet mother wavelet. Note that the use of the Wavelet transform aimed to overcome the resolution

limitations exhibited by the Fourier transform [39]. Knowing the main frequency bands where events are usually present

(from 2 to 4 Hz for LP and from 5 to 10 Hz for VT), we decided to work with a decomposition (δ) at level 6 of the

Wavelet transform, obtaining the corresponding approximation (cA) and detailed (cD) Wavelet coefﬁcients. Hence, we

retrieved 7 features: the mean value (µw), the frequency of the maximum value (fmax,w), the energy percentage (Eδ

%w),

the difference between the maximum and minimum peaks (P2Pδ

w), the maximum frequency (Vδ

w), the root mean square

value (RMSδ

w), the difference between the maximum and minimum peaks in RMS (P2Pδ,RMS

w), for each coefﬁcients

cA6,cD6,cD5,cD4,cD3,cD2, and cD1. Figures 2(e) and 2(f) allow to compare the Welch periodogram with the

level-6 Wavelet coefﬁcients, respectively, for the preceding examples of VT and LP events, the subscript windicates their

scale domain dependence.

D. Classiﬁcation Algorithms and Feature Selection Strategies

For the event classiﬁcation stage, we also considered a supervised machine learning approach. Thus, we scrutinized the

suitability of two well-known classiﬁcation algorithms, namely, DT [15], [40] and SVM [41].

DT is a non-parametric supervised learning method used for approximating discrete-valued target functions in classiﬁcation

This is the author's version of an article that has been published in this journal. Changes were made to this version by the publisher prior to publication.

The final version of record is available at http://dx.doi.org/10.1109/TGRS.2016.2559440

IEEE TRANSACTIONS ON GEOSCIENCES AND REMOTE SENSING , VOL. XX, NO. X, NOV 2016 9

and regression. The aim of this method is to create a model which can predict the cpossible values of a target variable by

learning simple decision rules inferred from the data features. Each node in the tree speciﬁes a rule for some attribute of the

instance, and each branch descending from that node corresponds to one of the possible values for that attribute. An instance is

classiﬁed by starting at the root node of the tree, testing the attribute speciﬁed by this node, and then moving down the branch

corresponding to the value of the attribute. This process is then repeated for the subtree rooted at the new node. The algorithm

promotes small trees instead of large ones, which yields classiﬁers with good generalization capabilities. The depth or leaﬁness

of the tree is the free parameter for this machine learning technique, which is measured in terms of the information actually

contained by the child nodes. Here, we used two indices for this purpose, namely, with the average amount of information

contained in each event (as given by the entropy, E(t)), and with the statistical dispersion and inequality (as given by Gini

index, G(t)) [42] as follows:

E(t) = −

c−1

i=0

p(i|t) log2p(i|t),(2)

G(t) = 1 −

c−1

i=0

[p(i|t)]2,(3)

where p(i|t)denotes the fraction of records belonging to output value iat a given node t, and cis the number of possible

values of the target variable.

Conventional machine learning classiﬁers, such as Gaussian maximum likelihood or artiﬁcial neural networks, can be strongly

affected by the high dimensionality of input observation vectors, and they can tend to over-ﬁt to the data in the presence of

noise, or to perform poorly with low number of available training samples [43]. In the last few years, the use of SVMs

[44], [45] for machine learning practical applications has received wide attention because the method integrates in the same

classiﬁcation procedure: (1) a simple way for dealing with nonlinear classiﬁcation boundaries, as samples that are nonlinearly

separable in the input space are mapped to a higher dimensional space where a simpler (linear) classiﬁcation is performed;

(2) an intrinsic regularization procedure, which controls with efﬁciency the model complexity; and (3) the minimization of an

upper bound of the generalization error, thus following the Structural Risk Minimization principle.

The ν-SVM algorithm for classiﬁcation was used in this work, it is deﬁned in summary as follows, and the interested reader

is recommended to see e.g. [45] for details. Given a labeled training data set {xi, yi}n

i=1, where xi∈RNand yi∈ {−1,+1},

and given a nonlinear mapping φ(·), the ν-SVM method solves:

min

w,ξi,b,ρ (1

2kwk2+νρ +1

i=1

ξi)(4)

subject to:

yi(hφ(xi),wi+b)≥ρ−ξi∀i= 1, . . . , n (5)

ρ≥0, ξi≥0∀i= 1, . . . , n (6)

where wand bdeﬁne a linear classiﬁer in the feature space, and ξiare positive slack variables enabling to deal with errors. The

This is the author's version of an article that has been published in this journal. Changes were made to this version by the publisher prior to publication.

The final version of record is available at http://dx.doi.org/10.1109/TGRS.2016.2559440

IEEE TRANSACTIONS ON GEOSCIENCES AND REMOTE SENSING , VOL. XX, NO. X, NOV 2016 10

appropriate choice of nonlinear mapping φguarantees that the transformed samples are more likely to be linearly separable

in the (higher dimensional) feature space. In this formulation, variable ρis controlled with coefﬁcient ν, which adds another

degree of freedom to the margin, the size of the margin increasing linearly with ρ. Therefore, the trade off between the training

error and the generalization error is controlled in the ν-SVM formulation by adjusting νin the range [0,1], which acts as an

upper bound on the fraction of margin errors, and it is also a lower bound on the fraction of support vectors.

Primal problem (4) is solved by using its dual problem counterpart, yielding w=Pn

i=1 yiαiφ(xi)(see [45] for further

theoretical and algorithmic details), and the decision function for any test vector x∗is ﬁnally given by

f(x∗) = sgn n

i=1

yiαiK(xi,x∗) + b!(7)

where αiare Lagrange multipliers corresponding to constraints in (4), being the support vectors (SVs) those training samples

xiwith non-zero Lagrange multipliers αi6= 0; and the bias term bis calculated by using the unbounded Lagrange multipliers

as b= 1/k Pk

i=1(yi− hφ(xi),wi), where kis the number of unbounded Lagrange multipliers (0< αi< C). Note that a

particularity of SVM is that decision function f(x)is a function of a small subset of the training examples, which are the

support vectors. Those are the examples that are the closest to the decision boundary and lie on the margin, as well as those

wrong-class examples. The existence of such support vectors is at the origin of the computational properties of SVM and their

competitive classiﬁcation performance. The interested reader can see [46] for more details about the algorithm with linear and

non-linear SVM.

Another key point is the use of Mercer kernels, K(xi,x∗) = hxi,x∗i, to handle the nonlinear algorithm implementations.

In this work we use the two well-known Mercer kernels given by the linear kernel, K(x,z) = hx,zi, and the Gaussian kernel,

K(x,z) = exp −kx−zk2

2σ2, where σis the width free parameter, to be tuned together with νfree parameter during the training

and validation stage.

We used a feature selection strategy in order to identify the pmost relevant features that improved tpand its overall

performance. We worked with Recursive Feature Extraction (RFE), a well known wrapper method, which adds or removes

features by generating the ranking of them using backward feature elimination [47], [48], in order to ﬁnd the optimal combination

that maximizes model performance. RFE-SVM is a weight-based method, in which at each step, the coefﬁcients of the weight

vector of SVM are used as the feature ranking criterion.

Free parameters were tuned for both classiﬁcation techniques by following the usual cross-validation process. For DT, the

tree depth was determined, meanwhile for SVM, free parameter νwas tuned to control the number of support vectors.

This is the author's version of an article that has been published in this journal. Changes were made to this version by the publisher prior to publication.

The final version of record is available at http://dx.doi.org/10.1109/TGRS.2016.2559440

IEEE TRANSACTIONS ON GEOSCIENCES AND REMOTE SENSING , VOL. XX, NO. X, NOV 2016 11

E. Performance

Both the detection and the classiﬁcation performance were measured in terms of Accuracy (A), Precision (P), Sensitivity

or Recall (R), Speciﬁcity (S) criterion, and Balanced Error Rate (BER), which are deﬁned as follows:

A(%) = NC

×100,(8)

P(%) = NT P

NT P +NF P

×100,(9)

R(%) = NT P

NT P +NF N

×100,(10)

S(%) = NT N

NT N +NF P

×100,(11)

BER = 1 −R+S

2×100,(12)

where NCis the number of correctly classiﬁed events, NTis the total number of events used to feed the classiﬁer, NT P is

the number of true positives, NFN is the number of false negatives, NTN is the number of true negatives, and NFP is the

number of false positives. We calculated these performance measures using training and testing folds. The time consumption

of the process in the system, denoted by tp, was also considered.

IV. RES ULT S

This section presents the results obtained by applying the methodology proposed to our dataset from the Cotopaxi Volcano.

The entire dataset was divided into training and testing sets, and the independence among all the records in each of these sets

was ensured. The experiments were carried out in MatlabT M , on a Core I5 PC with 3.1 GHz and 4GB RAM.

The detector stage showed a correct event detection rate of 99%, for discriminating LP and VT events from BN. Figure 3

shows examples of four detected events. As described, a time duration of 1min was set for each analysis window, wd, since

this kind of events at Cotopaxi Volcano have this mean time duration. Figures 3(c) and 3(d) show that each excerpt comprised

entirely the VT earthquakes, since VT earthquakes have time duration lower than 1min.

A. Results Using DT

In the frequency domain, the DT algorithm obtained the model by using a training set containing N=116 (58 instances for

LP and VT), which made possible to induce a tree shape taking the top-down form, as depicted in Figure 4. The DT algorithm

selected 3 key features Xi, where iis the number of the selected feature corresponding to the amplitude of the PSD in a

determined frequency, beginning from the top node with the rule X16 ≥ −0.42 (corresponding to the amplitude value at 3

Hz), X21 ≥ −0.58 (corresponding to the amplitude value at 4 Hz), and X36 ≥0.47 (corresponding to the amplitude value at

6.8 Hz), which made possible to classify into any 1 of the 6 possible leafs. Figures 4(d) to 4(f) show a mesh representation of

10 segments, where from 1 to 5 (from 6 to 10) correspond to examples of LP (VT) events. We can observe that DT algorithm

retrieved about 3 effective features in each segment in order to predict the outcome.

Cross-validation and pruning methods were performed to control leaﬁness. Cross-validation selected 2 key features, beginning

from the top node, with rule X16 ≥ −0.43, and X36 ≥0.37, for classifying into 1 of the 3 possible leafs, as depicted in

This is the author's version of an article that has been published in this journal. Changes were made to this version by the publisher prior to publication.

The final version of record is available at http://dx.doi.org/10.1109/TGRS.2016.2559440

IEEE TRANSACTIONS ON GEOSCIENCES AND REMOTE SENSING , VOL. XX, NO. X, NOV 2016 12

0 200 400 600 800 1000 1200

−1

−0.5

0.5

Time (s)

Normalized Amplitude

(a)

0 200 400 600 800 1000 1200

−1

−0.5

0.5

Time (s)

Normalized Amplitude

(b)

0 200 400 600 800 1000 1200

−1

−0.5

0.5

Time (s)

Normalized Amplitude

(c)

0 200 400 600 800 1000 1200

−1

−0.5

0.5

Time (s)

Normalized Amplitude

(d)

Fig. 3. Event detection representation with time duration of 1min for each excerpt (red), and with segmentation window w=15s (black), for LP events (a-b),

and VT earthquakes (c-d).

VT LP

LP VT VT LP

x16 >= −0.42

x21 >= −0.58 x36 >= 0.47

x36 >= −0.01 x16 >= 0.51

(a)

LP VT

x16 >= −0.43

x36 >= 0.37

(b)

VT LP VT

LP VT

x16 >= −0.43

x21 >= −0.58 x36 >= 0.47

x36 >= −0.01

(c)

(d) (e) (f)

Fig. 4. Tree representation considering frequency features, for cross validation and pruning feature selection methods in DT. Segments from 1 to 5 (from 6

to 10) correspond to examples of LP (VT) events: (a) main features in the tree, and (d) features veriﬁed in each signal; (b) cross validation and (c) pruning

feature selection methods, with main features in the tree; (d), (e),(f) features retrieved in each signal.

Figure 4(b). Pruning selected 3 key features, the top node was kept with the same rule, and 2 more rules were deﬁned, given

by X21 ≥ −0.58 and X36 ≥0.47, which corresponded to 4 Hz and 6.8 Hz, respectively, and they were classiﬁed into 1 of

the 5 possible leafs, as depicted in Figure 4(c). We observed in Figures 4(d), 4(e) and 4(f), that VT earthquakes have the main

spectral content above 4 Hz, which was retrieved by DT algorithm in 4 Hz (X21) and 6.8 Hz (X36 ), whilst LP events present

This is the author's version of an article that has been published in this journal. Changes were made to this version by the publisher prior to publication.

The final version of record is available at http://dx.doi.org/10.1109/TGRS.2016.2559440

IEEE TRANSACTIONS ON GEOSCIENCES AND REMOTE SENSING , VOL. XX, NO. X, NOV 2016 13

VT LP

VT VT LP LP LP VT LP VT

VT LP LP VT VT LP

LP VT LP VT

x27 >= 0.3

x67 >= −0.6 x25 >= −0.1

x31 >= −0.5 x57 >= −0.1 x57 >= 0.7 x5 >= −0.4

x1 >= −0.04

x51 >= −3.7

x1 >= −0.3 x55 >= −0.8 x66 >= −1.4

x1 >= −0.2

x2 >= −1.2 x10 >= −0.5

x48 >= −0.4

x45 >= 1.8

x27 >= 1

x20 >= −0.01

(a)

LP VT

x27 >= 0.3

(b)

VT LP

VT LP LP VT

x27 >= 0.3

x67 >= −0.6

x31 >= −0.45 x57 >= −0.1

x1 >= −0.04 x1 >= −0.27

(c)

Fig. 5. Tree representation considering parameter features, cross validation and pruning feature selection methods: (a) main features in the tree, cross validation

(b) and pruning (c) feature selection methods.

TABLE I

EXP ERI ME NTAL P ERF ORM AN CE RE SU LTS FO R CLA SS IFIC ATIO N WHE N AP PLYI NG FE ATUR E SEL EC TIO N STAG E WIT H DT C LAS SIFI ER .

Matrix N. Features–Method A(%) P(%) R(%) S(%) BER tp(s)

Frequency / parameters 4/15–Default 90/94 96/90 83/98 96/89 0.1/0.06 1.5/1

Frequency / parameters 2/1–Cross-Val. 83/84 78/78 92/91 74/78 0.17/0.16 0.7/0.6

Frequency / parameters 3/5–Pruning 90/94 96/90 83/98 96/89 0.1/0.06 1/0.7

their main spectral component in 3 Hz (X16) and above 6 Hz has negligible values.

The DT algorithm retrieved 15 key features by considering the parameter features, beginning from the top node, with rule

X27 ≥0.3, and making possible to classify into any 1 of the 20 possible leafs, as depicted in Figure 5(a). By using cross-

validation, the DT algorithm selected one key feature, the top node with rule X27 ≥0.3, classiﬁed into one of the two possible

leafs, as depicted in Figure 5(b), whilst pruning selected 5 key features corresponding to ERMS

f(X27), P2Pδ5

w(X67), Eδ3

(X57), H00

p(X31), and µt(X1). The top node was kept with the same rule and 5 more rules were deﬁned, which allowed to

classify into 1 of the 6 possible leafs, as depicted in Figure 5(c).

Table I shows the results obtained with the testing set containing N=116 independent cases. The pruning method identiﬁed

5 key features, which made possible to reach the best system performance. The main differences with the original features

were in terms of the BER and tp.

B. Results Using Linear and Non-linear SVM Classiﬁers

SVM is often sensitive to class unbalance, and for this reason we ﬁrst determined the training percentage for our database

which maximized the system performance. These values were set to 35% and 50% for linear and non-linear SVM, respectively.

Then we set the free parameter νto 0.05 for linear, 0.1 and 0.15 for non-linear SVM classiﬁers, with frequency and parameter

This is the author's version of an article that has been published in this journal. Changes were made to this version by the publisher prior to publication.

The final version of record is available at http://dx.doi.org/10.1109/TGRS.2016.2559440

IEEE TRANSACTIONS ON GEOSCIENCES AND REMOTE SENSING , VOL. XX, NO. X, NOV 2016 14

TABLE II

EXP ERI ME NTAL R ESU LTS FO R CLA SS IFIC ATIO N BY U SIN G LIN EA R AND N ON -LIN EA R1SVM CL ASS IFI ERS .

Matrix N. Features–Method A(%) P(%) R(%) S(%) BER tp(s)

Frequency / parameters 257/80–Default 92/97 93/99 91/96 93/99 0.08/0.03 0.23/0.08

Frequency / parameters 4/15–RFE 93/97 94/97 92/96 94/98 0.07/0.03 0.06/0.06

Frequency / parameters 257/80–Default184/97 86/96 81/98 87/96 0.16/0.03 7/3.9

Frequency / parameters 15/15–RFE193/93 96/93 91/94 96/93 0.06/0.06 1.9/1.9

TABLE III

EXP ERI ME NTAL R ESU LTS FO R CLA SS IFIC ATIO N BY U SIN G LIN EA R SVM CLA SS IFIE R,B Y CON SI DER IN G IND EP END EN TLY FEAT URE S IN T HE TI ME (T),

FREQUENCY (F), AN D SC ALE (W)D OM AIN S.

Domain N. Features A(%) P(%) R(%) S(%) BER tp(s)

T1 90 94 85 94 0.10 0.04

F5 97 97 96 97 0.03 0.05

W9 93 92 95 92 0.07 0.05

features. Parameter γwas set to 8.86e-5 and 4.00e-3, respectively, by making an iterative process in order to maximize the

system performance. Finally, we selected the main features by using RFE. The results are summarized in Table II. For both

feature sets, linear and non-linear SVM, the use of RFE improved the performance in terms of the tp. RFE selected 4 key

features for frequency features and 15 for parameter features in the case of linear SVM, whilst for non-linear SVM, 15 key

features were selected for both feature sets. We obtained the best results with linear SVM and by using RFE method, which

identiﬁed 15 key features considering parameter features, and the main difference with the other methods was in terms of tp

and BER.

Considering frequency features with linear SVM, the algorithm recognized the relevance of frequency bands corresponding

to 3.11, 3.3, 3.5, and 7 Hz, as depicted in Figure 6(a). Meanwhile non-linear SVM identiﬁed 15 bands, which corresponding

to 1.17, 2.5, 3, 3.11, 3.3, 3.5, 4.5, 6.8, 7, 7.2, 7.4, 22.3, 36.2, 39, and 41.6 Hz. We observed that not only low frequencies

were considered, but also high frequency bands were identiﬁed, as shown in Figure 6(b). Therefore, we observed in Figure

6(c) that non-linear SVM considers frequencies above 35 Hz. In general, linear and non-linear SVM were able to identify LP

from VT in simple terms of amplitude and spectral content.

By using parameter features and RFE feature selection method, linear SVM retrieved 15 key features: Zt(X11), ψf(X13),

MSEf(X20), U10−20 (X21 ), RMSf(X24), ERM S

f(X27), Vδ5

w(X39), µδ5

w(X41), µδ1

w(X53), Eδ5

%w(X56), Eδ2

%w(X59), P2Pδ5

(X68), P2Pδ2

w(X76), P2Pδ2,RM S

w(X77), P2Pδ1

w(X80), and non-linear SVM retrieved also 15 key features: Zt(X11), ψf(X13 ),

ERMS

f(X19), MSEf(X20 ), RMSf(X24), ERM S

f(X27), Vδ5

w(X39), P2Pδ6

w(X42), RMSδ5

w(X51), µδ1

w(X53), P2Pδ5

w(X68), P2Pδ2

(X76), P2Pδ2,RM S

w(X77), Vδ5

w(X78), RMSδ5

w(X79), P2Pδ1

w(X80). We identiﬁed 11 matching features, which corresponded to

Zt,ψf, MSEf, RMSf, ERMS

f, Vδ5

w,µδ1

w, P2Pδ5

w, P2Pδ2

w, P2Pδ2,RMS

w, and P2Pδ1

In the feature selection block, RFE method presented an improvement in all metrics in at least 1%by considering parameter

features, and using 4 of 257 features. Moreover, the pruning method got similar results with DT classiﬁer considering both

matrices, whilst cross-validation got worse results in at least 10%in each metric. This strategy permitted to reduce the tpfrom

1500ms to 700ms. Linear SVM-RFE identiﬁed 15 main features: 1 in the time domain T(Zt), 5 in the frequency domain F

(ψf, MSEf, U10−20, RMSf, ERM S

f), and 9 in the scale domain W( Vδ5

w,µδ5

w,µδ1

w, Eδ5

%w, Eδ2

%w, P2Pδ5

w, P2Pδ2

w, P2Pδ2,RMS

This is the author's version of an article that has been published in this journal. Changes were made to this version by the publisher prior to publication.

The final version of record is available at http://dx.doi.org/10.1109/TGRS.2016.2559440

IEEE TRANSACTIONS ON GEOSCIENCES AND REMOTE SENSING , VOL. XX, NO. X, NOV 2016 15

(a) (b)

(c)

Fig. 6. Tree representation considering frequency features, RFE feature selection method: features retrieved in each gifor linear (a), non-linear (b), and

zoomed non-linear SVM above 35 Hz (c).

P2Pδ1

w). Table III shows the results with each set tested independently, where the best result were obtained with 5 features in

the frequency domain, reaching similar performance values, by reducing the tpto 500ms. Additionally, we identiﬁed 2 bands

corresponding to details levels 2 fs2∈(0, 25) Hz and 5 fs5∈(0, 3.13) Hz in the scale domain, where by just considering

some parameters allowed us to discriminate VT from LP events with acceptable accuracy. Figure 7 shows possible features

combination, by considering the 3 most relevant features in each classiﬁer. We observed in Figure 7(e), for example, two well

deﬁned clusters, making possible by linear separability to recognize LP from VT. This was possible by considering linear

SVM classiﬁer and U10−20, RMSf, and ERM S

f, which are in the frequency domain.

V. DI SC US SI ON A ND CO NC LU SI ON S

LP and VT events have proven to be keys for monitoring any volcano activity, including Cotopaxi, since they provide

important information about the volcano internal status [8]–[10]. With appropriate RT constrains for monitoring, detecting,

and classifying volcanic signals, VEWS may be used as a tool for detecting potencial anormal increments in seismic activity.

Such information, in combination with other parameters such as the external activity of the volcano, may be used by the local

authorities to launch an effective early warning to the population.

Our proposed approach applies a two-stage solution, consisting of event detection followed by event-classiﬁcation. Previous

works did not apply any detection strategy based on classiﬁcation, but instead a rectangular sliding window of ﬁxed or variable

size was used [19], [28], [33]. In our setting, the use of the detection stage has simpliﬁed the learning problem of distinguishing

LP, VT and non-volcanic origin events. In fact, the ﬁrst stage detects seismic events from BN with high accuracy, therefore

reducing the computational requirements of the system. The detection accuracy was near 99%. In our study, the proposed event

detector showed superior performance in relation to previous studies, however a further study with standard datasets should

This is the author's version of an article that has been published in this journal. Changes were made to this version by the publisher prior to publication.

The final version of record is available at http://dx.doi.org/10.1109/TGRS.2016.2559440

IEEE TRANSACTIONS ON GEOSCIENCES AND REMOTE SENSING , VOL. XX, NO. X, NOV 2016 16

−4

−2

−1

X14

X19

X34

(a)

−1

X14

X16

X34

(b)

−4

−2

X12

X16

X35

(c)

−2

−1

X27

X39

X68

(d)

−2

−1

X21

X24

X27

(e)

−2

−1

X24

X27

X76

(f)

Fig. 7. Scatter plots of selected main features by considering: G matrix (a-b-c), and H matrix (d-e-f), using DT classiﬁer (a-d), linear SVM (b-e), and

non-linear SVM (c-f), respectively. Red dots represent VT while blue dots represent LP events.

be conducted to be able to give a quantitative comparison analysis.

In the classiﬁcation stage, the DT algorithm showed an accuracy near to 90%by considering the frequency features, while

a linear SVM has reached 97%considering the parameter features. These values are comparable to the best results obtained

in previous studies by applying different methodologies to the classiﬁcation problem, such as Hilbert and Wavelet transforms

[33], or Bayesian and k-NN classiﬁers [25].

Additionally, our experiments have shown that the best results were obtained with the parameter features, where a set of

80 features in time, frequency, and scale domains was considered. We took into account well known features with some of

them given by experts of IGEPN, which had been obtained by heuristic and observation processes. An error rate of 3% was

obtained using 15 main features retrieved by RFE, and linear SVM classiﬁer. The main difference between linear and non-linear

SVM was tpequal to 60ms and 1900ms, respectively. In the frequency domain, the linear SVM classiﬁer was able to reach a

performance similar to the one obtained when using the optimized features with RFE, but by considering only 5 main features

(ψf, MSEf, U10−20, RMSf, ERM S

f). This proposed strategy has been developed in order to use it in RT analysis, since it

reduces the processing time to values lower than 1min, approximately.

Although the experimental results show the feasibility and reliability of the proposed system, speciﬁc aspects at the detection

and classiﬁcation stages still have to be considered in order to generalize the system to identify all volcano-seismic signals. As

future work, we are interested in classifying LP, VT, HYB, TRE, and non-volcanic origin signals like thunders, by using linear

or non-linear multi class SVM. It should be possible to improve the detection stage by using well-known automatic detection

algorithms applied in speech recognition. These algorithms could determine the starting and ending time stamps of an event,

hence, a better analysis can be performed in the entire duration of the volcanic event, while maintaining or improving the

This is the author's version of an article that has been published in this journal. Changes were made to this version by the publisher prior to publication.

The final version of record is available at http://dx.doi.org/10.1109/TGRS.2016.2559440

IEEE TRANSACTIONS ON GEOSCIENCES AND REMOTE SENSING , VOL. XX, NO. X, NOV 2016 17

accuracy rate.

Regarding feature selection, we have addressed the problem in this paper from the machine learning theory point of view,

and most of the main features found by the algorithms were consistent with common features used in other works for event

classiﬁcation in other volcanoes [24], [28], [30]. Although the volcanologist signiﬁcance of the most important features related

to Cotopaxi volcano is still unclear at this moment, and it requires deeper and further investigation, feature selection techniques

were used in this paper to simplify the problem of classiﬁcation. We are planning to address the problem of volcano-speciﬁc

characteristic signiﬁcance in future studies. Furthermore, analyzing and combining data from multiple stations may help us to

improve the performance of the system, this will also be addressed in future works. Finally, benchmarking our algorithm with

seismological data acquired with similar equipment for other volcanoes is to be addressed among the next steps.

ACK NOW LE DG ME NT

The authors gratefully acknowledge the contribution of Universidad de las Fuerzas Armadas ESPE for the economical support

in the development of this project by Research Grants 2013-PIT-014 and 2015-PIC-004. This work has also been partly supported

by Research Grants PRINCIPIAS (TEC2013-48439-C4-1-R) from Spanish Government and PRICAM (S2013/ICE-2933) from

Comunidad de Madrid. Finally, we want to thank to the IGEPN for providing us the dataset used in this work.

REF ER EN CE S

[1] G. Berz, W. Kron, T. Loster, E. Rauch, J. Schimetschek, J. Schmieder, A. Siebert, A. Smolka, and A. Wirtz, “World map of natural hazards–a global

view of the distribution and intensity of signiﬁcant exposures,” Natural hazards, vol. 23, no. 2-3, pp. 443–465, 2001.

[2] K. Hewitt, Regions of risk: A geographical introduction to disasters. Routledge, 2014.

[3] S. Makowski Giannoni, R. Rollenbeck, K. Trachte, and J. Bendix, “Natural or anthropogenic? On the origin of atmospheric sulfate deposition in the

Andes of southeastern Ecuador,” Atmospheric Chemistry and Physics, vol. 14, no. 20, pp. 11297–11 312, 2014.

[4] F. A. Pfafﬂ and W.-C. Dullo, “The ﬁrst ascent to the Volcano Cotopaxi in Ecuador by Wilhelm Reiss (1838–1908),” International Journal of Earth

Sciences, vol. 103, no. 4, pp. 1175–1179, 2014.

[5] S. R. McNutt, “Volcanic seismology,” Annu. Rev. Earth planet. Sci., vol. 32, pp. 461–491, 2005.

[6] H. Langer, S. Falsaperla, T. Powell, and G. Thompson, “Automatic classiﬁcation and a-posteriori analysis of seismic event identiﬁcation at Soufriere

Hills Volcano, Montserrat,” Journal of Volcanology and Geothermal Research, vol. 153, no. 1, pp. 1–10, 2006.

[7] J. Ib´

a˜

nez and E. Carmona, “Sismicidad volc´

anica,” Curso Internacional de Volcanolog´

ıa y Geof´

ısica Volc´

anica, Serie Casa de los Volcanes, no. 7, pp.

269–282, 2000.

[8] B. A. Chouet, “Long-period volcano seismicity: its source and use in eruption forecasting,” Nature, vol. 380, no. 6572, pp. 309–316, 1996.

[9] C. J. Bean, L. De Barros, I. Lokmer, J.-P. M ´

etaxian, G. O’Brien, and S. Murphy, “Long-period seismicity in the shallow volcanic ediﬁce formed from

slow-rupture earthquakes,” Nature geoscience, vol. 7, no. 1, pp. 71–75, 2014.

[10] P. Cusano, M. Palo, and M. West, “Long-period seismicity at Shishaldin Volcano (Alaska) in 2003–2004: Indications of an upward migration of the

source before a minor eruption,” Journal of Volcanology and Geothermal Research, vol. 291, pp. 14–24, 2015.

[11] A. M. Baig, M. Campillo, and F. Brenguier, “Denoising seismic noise cross correlations,” Journal of Geophysical Research: Solid Earth (1978–2012),

vol. 114, no. B8, 2009.

[12] A. Ruano, G. Madureira, O. Barros, H. Khosravani, M. Ruano, and P. Ferreira, “Seismic detection using support vector machines,” Journal of

Neurocomputing, vol. 135, no. 0, pp. 273 – 283, 2014.

[13] I. Guyon, Feature extraction: foundations and applications. Springer Science & Business Media, 2006.

[14] H. Liu and H. Motoda, Computational methods of feature selection. CRC Press, 2007.

[15] T. M. Mitchell, Machine Learning. Burr Ridge, IL: McGraw Hill, 1997, vol. 45.

This is the author's version of an article that has been published in this journal. Changes were made to this version by the publisher prior to publication.

The final version of record is available at http://dx.doi.org/10.1109/TGRS.2016.2559440

IEEE TRANSACTIONS ON GEOSCIENCES AND REMOTE SENSING , VOL. XX, NO. X, NOV 2016 18

[16] T. S. Newman and A. K. Jain, “A survey of automated visual inspection,” Journal of Computer Vision and Image Understanding, vol. 61, no. 2, pp.

231–262, 1995.

[17] D. Mery and O. Medina, “Automated visual inspection of glass bottles using adapted median ﬁltering,” Journal of Image Analysis and Recognition, pp.

818–825, 2004.

[18] S. Scarpetta, F. Giudicepietro, E. Ezin, S. Petrosino, E. Del Pezzo, M. Martini, and M. Marinaro, “Automatic classiﬁcation of seismic signals at Mt.

Vesuvius Volcano, Italy, using neural networks,” Bulletin of the Seismological Society of America, vol. 95, no. 1, pp. 185–196, 2005.

[19] M. Curilem, J. Vergara, C. San Mart´

ın, G. Fuentealba, C. Cardona, F. Huenupan, M. Chac´

on, M. S. Khan, W. Hussein, and N. B. Yoma, “Pattern

recognition applied to seismic signals of the Llaima Volcano (Chile): An analysis of the events’ features,” Journal of Volcanology and Geothermal

Research, vol. 282, pp. 134–147, 2014.

[20] E. Aguilera, M. Pareschi, M. Rosi, and G. Zanchetta, “Risk from lahars in the northern valleys of Cotopaxi Volcano (Ecuador),” Natural Hazards,

vol. 33, no. 2, pp. 161–189, 2004.

[21] I. Molina, H. Kumagai, A. Garc´

ıa-Aristiz´

abal, M. Nakano, and P. Mothes, “Source process of very-long-period events accompanying long-period signals

at Cotopaxi Volcano, Ecuador,” Journal of Volcanology and Geothermal Research, vol. 176, no. 1, pp. 119–133, 2008.

[22] H. D. Ortiz Erazo, “Estudio de los efectos de sitio para la construcci´

on de un ´

ıdice de actividad s´

ısmica en el volc´

an Cotopaxi,” Master’s thesis, Escuela

Polit´

ecnica Nacional, 2013.

[23] L. Guti´

errez, J. Ib´

aez, G. Cort´

es, J. Ram´

ırez, C. Ben´

ıtez, V. Tenorio, and A. Isaac, “Volcano-seismic signal detection and classiﬁcation processing using

Hidden Markov Models. Application to San Crist´

obal Volcano, Nicaragua,” in Symposium in Geoscience and Remote Sensing, vol. 4, 2009, pp. 522–525.

[24] E. H. A. Laasri, E.-S. Akhouayri, D. Agliz, D. Zonta, and A. Atmani, “A fuzzy expert system for automatic seismic signal classiﬁcation,” Expert Systems

with Applications, vol. 42, no. 3, pp. 1013 – 1027, 2015.

[25] M. Orozco, M. E. Garc´

ıa, R. P. Duin, and C. G. Castellanos, “Dissimilarity-based classiﬁcation of seismic signals at Nevado del Ruiz Volcano,” Earth

Sciences Research Journal, vol. 10, no. 2, pp. 57–66, 2006.

[26] D. C´

ardenas-Pe˜

na, M. Orozco-Alzate, and G. Castellanos-Dom´

ınguez, “Selection of time-variant features for earthquake classiﬁcation at the Nevado del

Ruiz Volcano,” Computers & Geosciences, vol. 51, pp. 293–304, 2013.

[27] P. A. Castro-Cabrera, M. Orozco-Alzate, A. Adami, M. Bicego, J. M. Londo˜

no-Bonilla, and G. Castellanos-Dom´

ınguez, “A comparison between time-

frequency and cepstral feature representations for the classiﬁcation of seismic-volcanic signals,” in Progress in Pattern Recognition, Image Analysis,

Computer Vision, and Applications. Springer, 2014, pp. 440–447.

[28] G. Curilem, J. Vergara, G. Fuentealba, G. Acu˜

na, and M. Chac´

on, “Classiﬁcation of seismic signals at Villarrica Volcano (Chile) using neural networks

and genetic algorithms,” Journal of Volcanology and Geothermal Research, vol. 180, no. 1, pp. 1–8, 2009.

[29] M. Ibs-von Seht, “Detection and identiﬁcation of seismic signals recorded at Krakatau Volcano (Indonesia) using artiﬁcial neural networks,” Journal of

Volcanology and Geothermal Research, vol. 176, no. 4, pp. 448–456, 2008.

[30] I. ´

Alvarez, L. Garc´

ıa, G. Cort´

es, C. Ben´

ıtez, and A. De la Torre, “Discriminative feature selection for automatic classiﬁcation of volcano-seismic signals,”

IEEE Letters on Geoscience and Remote Sensing Letters, vol. 9, no. 2, pp. 151–155, 2012.

[31] R. Avesani, A. Azzoni, M. Bicego, and M. Orozco-Alzate, “Automatic classiﬁcation of volcanic earthquakes in HMM-induced vector spaces,” in Progress

in Pattern Recognition, Image Analysis, Computer Vision, and Applications. Springer, 2012, pp. 640–647.

[32] M. Bicego, C. Acosta-Mu˜

noz, and M. Orozco-Alzate, “Classiﬁcation of seismic volcanic signals using Hidden Markov Model based generative

embeddings,” IEEE Transactions on Geoscience and Remote Sensing, vol. 51, no. 6, pp. 3400–3409, 2013.

[33] C. San-Mart´

ın, C. Melgarejo, C. Gallegos, G. Soto, M. Curilem, and G. Fuentealba, “Feature extraction using circular statistics applied to volcano

monitoring,” in Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. Springer, 2010, pp. 458–466.

[34] R. Lara-Cueva, D. Ben´

ıtez, E. Carrera, M. Ruiz, and J. Rojo- ´

Alvarez, “Feature selection of seismic waveforms for long period event detection at cotopaxi

volcano,” Journal of Volcanology and Geothermal Research, vol. 316, pp. 34–49, 2016.

[35] B. Kenneth, The seismic waveﬁeld, Volume I: Introduction and Theoretical Development. Cambridge Univ. Press, USA, 2001.

[36] J. Lahr, B. Chouet, C. Stephens, J. Power, and R. Page, “Earthquake classiﬁcation, location, and error analysis in a volcanic environment: Implications

for the magmatic system of the 1989–1990 eruptions at Redoubt Volcano, Alaska,” Journal of Volcanology and Geothermal Research, vol. 62, no. 1,

pp. 137–151, 1994.

[37] S. M. Kay and S. L. Marple Jr, “Spectrum analysis—a modern perspective,” Proceedings of the IEEE, vol. 69, no. 11, pp. 1380–1419, 1981.

This is the author's version of an article that has been published in this journal. Changes were made to this version by the publisher prior to publication.

The final version of record is available at http://dx.doi.org/10.1109/TGRS.2016.2559440

IEEE TRANSACTIONS ON GEOSCIENCES AND REMOTE SENSING , VOL. XX, NO. X, NOV 2016 19

[38] R. Lara-Cueva, P. Bernal, G. Saltos, D. Ben´

ıtez, and J. L. Rojo- ´

Alvarez, “Time and frequency feature selection for seismic events from Cotopaxi

Volcano,” in Asia-Paciﬁc Conference on Computer Aided System Engineering (APCASE), 2014, 2015, pp. 1–6.

[39] L. Grafakos, Classical and modern Fourier analysis. Pearson / Prentice-Hall, 2004.

[40] C. M. Bishop et al.,Pattern recognition and machine learning. Springer, 2006, vol. 1.

[41] B. Scholkopf, K.-K. Sung, C. J. Burges, F. Girosi, P. Niyogi, T. Poggio, and V. Vapnik, “Comparing support vector machines with gaussian kernels to

radial basis function classiﬁers,” IEEE Transactions on Signal Processing, vol. 45, no. 11, pp. 2758–2765, 1997.

[42] G. Dougherty, Pattern recognition and classiﬁcation: an introduction. Springer Science & Business Media, 2012.

[43] G. Hughes, “On the mean accuracy of statistical pattern recognizers,” IEEE Transactions on Information Theory, vol. 14, no. 1, pp. 55–63, Jan 1968.

[44] N. Cristianini and J. Shawe-Taylor, An introduction to support Vector Machines: and other kernel-based learning methods. New York, NY, USA:

Cambridge University Press, 2000.

[45] B. Sch¨

olkopf and A. Smola, Learning with Kernels, 1st ed. Cambridge, MA: MIT Press, 2002.

[46] I. Guyon, J. Weston, S. Barnhill, and V. Vapnik, “Gene selection for cancer classiﬁcation using support vector machines,” Machine learning, vol. 46,

no. 1-3, pp. 389–422, 2002.

[47] G. H. John, R. Kohavi, K. Pﬂeger et al., “Irrelevant features and the subset selection problem,” in Proceedings of the Eleventh International Conference

on Machine Learning, 1994, pp. 121–129.

[48] R. Kohavi and D. Sommerﬁeld, “Feature subset selection using the wrapper method: Overﬁtting and dynamic search space topology,” in KDD, 1995,

pp. 192–197.

This is the author's version of an article that has been published in this journal. Changes were made to this version by the publisher prior to publication.

The final version of record is available at http://dx.doi.org/10.1109/TGRS.2016.2559440

An Optimized Learning Model Augment Analyst Decisions for Seismic Source Discrimination

Article

Full-text available

Jan 2022

Efficient handling and planning for the urban regions’ sustainable development require a vast range of up-to-date and thematic information. Besides, obtaining an uncontaminated catalog of seismic activity is desirable to study the earthquake (EQ) clusters spatial allocation, which is a key role in mitigating seismic hazards and alleviating EQ losses by enhancing the assessment of seismic hazards. This paper considers the northeastern part of Egypt where the seismicity catalog is contaminated by quarry blasts (QBs) operated throughout the mapped area. Consequently, it is desirable to discriminate these QBs from the EQs for genuine seismicity and hazard analysis. Accordingly, we provide an efficient machine learning (ML) model for decontaminating the seismicity database so that EQ clusters can be properly delineated by relying on 870 events (EQs and QBs) observed by only one seismic station called “ GLL ”, a member of the Egyptian National Seismic Network (ENSN). The model focuses on magnitudes < 3 that have high uncertainty of being EQs or QBs and take a long time for analysis. The approach examines several linear and non-linear ML models and finally selects the best model with only two features leading to the optimal classification between the EQs and QBs. The optimization process is accomplished throughout two stages. The obtained results prove that the proposed scheme achieves 100% discrimination between the EQs and QBs relying on the extreme gradient boosting (XGB) model.

Feature Extraction and Analysis of Microseismic Signal Based on Convolutional Neural Network

Article

Full-text available

Dec 2021

Seismic detection technology has been widely used in safety detection of engineering construction abroad. Although it has just started in the field of engineering in our country, its role is becoming more and more important. Through computer technology, micro-seismic detection can provide accurate data for the construction safety detection of large-scale projects, which has important practical significance for the rapid and effective identification of micro-seismic signals. Based on this, the purpose of this article is to study the feature extraction and classification of microseismic signals based on neural games. This article first summarizes the development status of microseismic monitoring technology. Using traditional convolutional neural networks for analysis, a multi-scale feature fusion network is proposed on the basis of convolutional neural networks and big data, the multi-scale feature fusion network is used to research and analyze microseismic feature extraction and classification. This article systematically explains The principle of microseismic signal acquisition and the construction of multi-scale feature fusion network. And use big data, comparative analysis method, observation method and other research methods to study the theme of this article. Experimental research shows that the db7 wavelet base has little effect on the Megatron signal.

Seismic Intensity Estimation for Earthquake Early Warning Using Optimized Machine Learning Model

Article

Jan 2023

The need for an earthquake early-warning system (EEWS) is unavoidable in order to save lives. In terms of managing earthquake disasters and achieving effective risk mitigation, the quick identification of the earthquake’s intensity is a valuable factor. In light of this, the on-site intensity measurement can be transmitted over an Internet of Things (IoT) network. In this regard, a machine learning (ML) strategy based on numerous linear and non-linear models is proposed in this study for a quick determination of earthquake intensity after two seconds from the P-wave onset. We call this model an on-site two-second ML model-based earthquake intensity determination (2S-ML-EIOS). The utilized dataset INSTANCE for this model is observed by the number of 386 stations from the Italian national seismic network. Our model has been trained on 50,000 occurrences (150 thousand of 2s-three-component seismic windows). The model has the ability to deal with limited features of the waveform traces leading to reliable estimation of the earthquake intensity. The suggested model has a 98.59% accuracy rate in predicting earthquake intensity. The suggested 2S-ML-EIOS model can be used with a centralized IoT system to promptly send the alarm, and the IoT system will then instruct the affected administration to take the appropriate action. The 2S-ML-EIOS results are contrasted with those from the traditional manual solution approach, which corresponds to the ideal solution mean. Based on the extreme gradient boosting (XGB) model, the 2S-ML-EIOS can achieve the best intensity determination, and this improved performance demonstrates the methodology’s efficacy for EEWS.

Evaluación y comparación de algoritmos para la detección automática de eventos sísmicos

Article

Full-text available

Nov 2021

La detección temprana de eventos sísmicos permite reducir daños materiales, el número de personas afectadas e incluso salvar vidas. En particular, la actividad sísmica en Ecuador es alta, dado que se encuentra en el denominado Cinturón de Fuego del Pacífico. En tal contexto, el presente artículo tiene como objetivo comparar algoritmos para la detección automática de eventos sísmicos. Dicha comparación se realiza con respecto a la funcionalidad y configuración de los parámetros requeridos para cada algoritmo. Además, la implementación se lleva a cabo sobre una plataforma computacional tipo SBC (Single Board Computer) con la finalidad de obtener una herramienta portable, escalable, económica y de bajo costo computacional. Los métodos comparados son: Classic STA/LTA, Recursive STA/LTA, Delayed STA/LTA, Z-detector, Baer and Kradolfer picker y AR-AIC (Autoregressive-Akaike-Information-Criterion-picker). Para la evaluación y comparación se desarrollan múltiples experimentos empleando registros sísmicos reales proporcionados por la Red Sísmica del Austro (RSA), disponibles como fuente de entrada a los algoritmos. Como resultado se obtiene que el algoritmo Classic STA/LTA presenta el mejor rendimiento, ya que del total de eventos reales (58), solo un evento no fue detectado. Además, se consiguen 6 falsos negativos, logrando un 98,2% de precisión. Cabe recalcar que el software utilizado para la comparación de algoritmos de detección de eventos sísmicos está disponible de forma libre.

On Using a Microearthquake Recognition System for an Early Warning System at Cotopaxi Volcano

Chapter

Oct 2023

Volcanic activity has been increasing throughout the world, posing a significant threat to populations in the event of eruptions. Ecuador, which hosts several active volcanoes, requires robust methods for identifying potential eruptions and issuing reliable alerts to protect lives and minimize damages. This paper presents the development of an automatic microseism recognition system integrated with the Early Warning Broadcast System (EWBS). Using models such as k-Nearest Neighbors, Support Vector Machine, and Decision Trees, along with frequency-based features extracted from seismic data provided by the Instituto Geofísico de la Escuela Politécnica Nacional, the recognition system aims to accurately detect and classify microeartquakes associated with the Cotopaxi volcano during 2012. During the detection stage, the system achieves an impressive Balanced Error Rate (BER) of 0.01, indicating its effectiveness in identifying events. In the subsequent classification stage, the system achieves a BER of 0.11, demonstrating its ability to classify events accurately. The classifiers were further evaluated using 82 microearthquakes, comprising 41 LP events and 41 VT events, resulting in an accuracy of 85% and a BER of 0.15. In addition, a larger data set of 563 earthquakes, consisting of 522 LP events and 41 VT events, was used to assess the performance of the classifiers. The results showed a 7% increase in accuracy compared to the previous test, demonstrating improved performance. However, 11 earthquakes were still misclassified. Integration of these classifiers with a voting system improves their performance. The selected set of 50 features plays a crucial role in achieving accurate results. The recognition system seamlessly interfaces with the EWBS, ensuring a 30 s delay before launching an early warning. This delay provides valuable time for preparation and response measures. In conclusion, the developed microearthquake recognition system, combined with the EWBS, demonstrates its effectiveness in detecting and classifying events, thereby enhancing the ability to issue timely and reliable alerts for volcanic activity. The findings contribute to improved volcano monitoring and risk mitigation strategies.

Advanced KNN Approaches for Explainable Seismic-Volcanic Signal Classification

Article

Oct 2022

Acquisition, classification, and analysis of seismic data are crucial tasks in volcano monitoring. The large number of seismic signals that are continuously acquired during the first monitoring stage poses a huge challenge for the human experts that must classify and analyze them. Several automatic classification systems have been proposed in the literature to alleviate such an overwhelming workload, each one characterized by different levels of accuracy, computational complexity, and interpretability. Considering this last perspective, which represents one of the recent key issues in geoscience, it is possible to find many accurate methods (in terms of classification accuracy) which however represent black boxes, not permitting a clear interpretation. On the other hand, there are other approaches, such as those based on support vector machines (SVM), random forests (RF), and K-nearest neighbor (KNN), which permit the interpretation of results, rules, and models at different levels. Among these last techniques, KNN approaches for volcanic signal classification typically do not achieve the satisfactory classification results obtained with RF and SVM. One possible reason is that in this context, the KNN rule has usually been applied in its basic version, not exploiting the different advanced KNN variants that have been introduced in recent years. This paper takes one step along this direction, investigating the suitability of a number of advanced versions of the KNN rule for the problem of classifying seismic-volcanic signals. The usefulness of these rules, in comparison with the original KNN rule as well as other interpretable classifiers, is evaluated within a real-world scenario involving a five-class dataset of seismic signals acquired at the Nevado del Ruiz volcano, Colombia. The results show that the classification accuracy of basic KNN is largely improved by these advanced variants, even surpassing that obtained with other classifiers like RF and SVM.

Earthquake Time Prediction using CatBoost and SVR

Article

Full-text available

Nov 2019

Seismic tremors everywhere throughout the globe have been a noteworthy reason for decimation and death toll and property. The following context expects to recognize earthquakes at a beginning time utilizing AI. This will help individuals and salvage groups to make their errand simpler. The information in this manner comprises of these seismic acoustic signals and the time of failure. The model is then prepared utilizing the CatBoost model and the utilization of Support Vector Machines. This will help foresee the time at which a Seismic tremor may happen. CatBoost Regression Algorithm gives a Mean Absolute Error of about 1.860. The Cross Validation (CV) Score for the Support Vector Machine (SVM) approach is -2.1651. The datasets metrics are not reliable on any outer parameter in this manner the variety of exactness is constrained, and high accuracy is accomplished.

Clusters of long-period (LP) seismic events recorded during 34 days at the Irazú volcano: What are they telling us?

Article

Sep 2022
J S AM EARTH SCI

This work analyses seismic signals at the Irazú volcano (83°51′09″ W, 9°58′45″ N; active stratovolcano in Central America) in a continuous 34-day record beginning April 13, 2013. Since December 2012, the activity of the volcano has been increasing and along with it the frequency occurrence of certain long-period (LP) signals. In this work, 6 families of LP signals were identified, with 83 total events. These events show energy concentrated mainly in the range of frequencies between 1 and 3 Hz, significant variations in amplitude, and some locations on the SW flank of the volcano at 1900 m.a.s.l. (corrected for site effects). The LP signals are interpreted as linked to the interaction between the fluids of the active hydrothermal system and a fault system (NE-SW direction) in the west slope of the Irazú volcano, but without being associated with magma ascent. This contribution provides useful information for the evaluation of the volcanic hazard at Irazú volcano, in terms of the analysis of possible precursors.

Development of Low-Cost IoT based data acquisition system for real-time monitoring of volcanoes seismicity

Article

Full-text available

Jan 2022

This paper describes how to develop a low-cost data acquisition (DAQ) system for real-time remote monitoring of volcano seismicity. The DAQ system is developed based on IoT platform, consists of several module devices both hardware and software, and installed in two places called seismic-stations (SS) and observation-post (OP). In the implementation, the SS is placed on the body of the volcano meanwhile the OP is usually placed at a certain distance from the SS with more adequate electrical and communication facilities. In this design, the SS consists of seismic sensors and a digitizer, whereas the OP consists of a computer server and ADSL modem. Communication between SS and OP conducted by using a 5 GHz Wi-Fi telemetry system. At the OP, the data is processed to produce the form of digital seismograms, hypocentres, arrival times, and signal magnitudes in SAC or mini seed file formats by Earthworm software. Database server for cloud access is created using Winston Wave Server (WWS) and MySQL Data Base Management Systems (DBMS). The seismic data can be accessed in real time remotely by users using the SWARM application via website access.

DeepQuake — An application of CNN for seismo-acoustic event classification in The Netherlands

Article

Nov 2021
COMPUT GEOSCI-UK

Recent developments in instrumentation and methodology are major driving forces in the advances of solid Earth sciences. The deployment of large and dense sensor networks enables data centres to acquire data of increased volume and quality. The analysis of such data provides scientists with a better understanding about natural phenomena in the subsurface. At the same time new challenges arise to exploit the growing information potential. Innovative methods based on Artificial Intelligence offer concrete opportunities to tackle those challenges. In this paper we investigate Convolutional Neural Networks (CNN) for seismo-acoustic event classification in The Netherlands. A supervised deep learning approach is applied to continuous seismic waveforms in order to automatically detect seismo-acoustic events and discriminate their sources. Our region of interest is characterised by high seismic noise conditions and intense anthropogenic activity which deeply influence the recorded signals. Site conditions and our aim for a tool that could support operational earthquake analysis motivated us to devise a novel specialised CNN architecture. Building on existing approaches, we designed, trained and evaluated two CNN models: arch-time and arch-spect. Our results suggest that as CNN inputs, spectrograms are more suitable than continuous waveforms. Therefore, we propose a model (arch-spect) that achieves a performance among the highest in modern source detection systems. Furthermore, we focus on explainability aspects and propose a method to gain insights in the behaviour of a ‘black-box’ deep learning classification system. By generating visualisations of activations we aim to empower domain experts with a usable cross-checking tool and pave the way for a broader operational uptake of Artificial Intelligence based methods.

Time and Frequency Feature Selection for Seismic Events from Cotopaxi Volcano

Conference Paper

Full-text available

Jul 2015

This paper presents a study to select the most relevant features for classification of seismic signals obtained from the Cotopaxi Volcano, in the time and frequency domain. Fourier and Wavelet transform were used in the analysis. A total of 79 different features were used for the study. Feature selection was performed by CART by using Gini, Standard Deviation, Twoing Rule, Gram-Schmidt, and Interaction Information as relevant indices. A comparative analysis of the features obtained indicates that the most relevant features for the identification of seismic events are: Maximum Peak Value in the 10-20 Hz range, High Frequency in WT A6 and the Percentage of Energy in the D2 and D5 WT levels.

A Comparison between Time-Frequency and Cepstral Feature Representations for the Classification of Seismic-Volcanic Signals

Conference Paper

Full-text available

Nov 2014

The analysis and classification of seismic patterns, which are typically registered as digital signals, can be used to monitor and understand the underlying geophysical phenomena beneath the volcanoes. In recent years, there has been an increasing interest in the development of automated systems for labeling those signals according to a number of pre-defined volcanic, tectonic and environmental classes. The first and crucial stage in the design of such systems is the definition or adoption of an appropriate representation of the raw seismic signals, in such a way that the subsequent stage —classification— is made easier or more accurate. This paper describes and discusses the most common representations that have been applied in the literature on classification of seismic-volcanic signals; namely, time-frequency features and cepstral coefficients. A comparative study of them is performed in terms of two criteria: (i) the leave-one-out nearest neighbor error, which provides a parameterless measure of the discriminative representational power and (ii) a visual examination of the representational quality via a scatter plot of the best three selected features.

Natural or anthropogenic? On the origin of atmospheric sulfate deposition in the Andes of southeastern Ecuador

Article

Full-text available

Oct 2014

Atmospheric sulfur deposition above certain limits can represent a threat to tropical forests, causing nutrient imbalances and mobilizing toxic elements that impact biodiversity and forest productivity. Atmospheric sources of sulfur deposited by precipitation have been roughly identified in only a few lowland tropical forests. Even scarcer are studies of this type in tropical mountain forests, many of them megadiversity hotspots and especially vulnerable to acidic deposition. In these places, the topographic complexity and related streamflow conditions affect the origin, type, and intensity of deposition. Furthermore, in regions with a variety of natural and anthropogenic sulfur sources, like active volcanoes and biomass burning, no source emission data has been used for determining the contribution of each source to the deposition. The main goal of the current study is to evaluate sulfate (SO− 4 ) deposition by rain and occult precipitation at two topographic locations in a tropical mountain forest of southern Ecuador, and to trace back the deposition to possible emission sources applying back-trajectory modeling. To link upwind natural (volcanic) and anthropogenic (urban/industrial and biomass-burning) sulfur emissions and observed sulfate deposition, we employed state-of-the-art inventory and satellite data, including volcanic passive degassing as well. We conclude that biomass-burning sources generally dominate sulfate deposition at the evaluated sites. Minor sulfate transport occurs during the shifting of the predominant winds to the north and west. Occult precipitation sulfate deposition and likely rain sulfate deposition are mainly linked to biomass-burning emissions from the Amazon lowlands. Volcanic and anthropogenic emissions from the north and west contribute to occult precipitation sulfate deposition at the mountain crest Cerro del Consuelo meteorological station and to rain-deposited sulfate at the upriver mountain pass El Tiro meteorological station.

Regions of Risk

Book

Sep 2014

Kenneth Hewitt

On the mean accuracy of statistical pattern recognizers

Article

Jan 1968

G.F. Hughes

Feature Selection of Seismic Waveforms for Long Period Event Detection at Cotopaxi Volcano

Article

Mar 2016
J VOLCANOL GEOTH RES

Volcano Early Warning Systems (VEWS) have become a research topic in order to preserve human lives and material losses. In this setting, event detection criteria based on classification using machine learning techniques have proven useful, and a number of systems have been proposed in the literature. However, to the best of our knowledge, no comprehensive and principled study has been conducted to compare the influence of the many different sets of possible features that have been used as input spaces in previous works. This work presents an automatic recognition system of volcano seismicity considering feature extraction, event classification, and subsequent event detection, in order to reduce the processing time as a first step towards a high reliability automatic detection system in real-time. We compiled and extracted a comprehensive set of temporal, moving average, spectral, and scale-domain features, for separating long period seismic events from background noise. We benchmarked two usual kinds of feature selection techniques, namely, filter (mutual information and statistical dependence) and embedded (cross-validation and pruning), each of them by using a suitable and appropriate classification algorithms such as k Nearest Neighbors (k-NN) and Decision Trees (DT). We applied this approach to the seismicity presented at Cotopaxi Volcano in Ecuador during 2009 and 2010. The best results were obtained by using 15s segmentation window, feature matrix in the frequency domain and DT classifier, yielding 99% of detection accuracy and sensitivity. Selected features and their interpretation were consistent among different input spaces, in simple terms of amplitude and spectral content. Our study provides the framework for an event detection system with high accuracy and reduced computational requirements.

Automatic Classification of Volcanic Earthquakes in HMM-Induced Vector Spaces

Conference Paper

Sep 2012

Even though hidden Markov models (HMMs) have been used for the automatic classification of volcanic earthquakes, their usage has been so far limited to the Bayesian scheme. Recently proposed alternatives, proven in other application scenarios, consist in building HMM-induced vector spaces where discriminative classification techniques can be applied. In this paper, a simple vector space is induced by considering log-likelihoods of the HMMs (per-class) as dimensions. Experimental results show that the discriminative classification in such an induced space leads to better performances than those obtained with the standard Bayesian scheme.

Pattern Recognition and Machine Learning

Book

Jan 2006

Long-period seismicity at Shishaldin volcano (Alaska) in 2003-2004: Indications of an upward migration of the source before a minor eruption

Article

Dec 2014
J VOLCANOL GEOTH RES

Pattern Recognition and Classification: An Introduction

Book

Jan 2012

Geoff Dougherty

The use of pattern recognition and classification is fundamental to many of the automated electronic systems in use today. However, despite the existence of a number of notable books in the field, the subject remains very challenging, especially for the beginner. Pattern Recognition and Classification presents a comprehensive introduction to the core concepts involved in automated pattern recognition. It is designed to be accessible to newcomers from varied backgrounds, but it will also be useful to researchers and professionals in image and signal processing and analysis, and in computer vision. Fundamental concepts of supervised and unsupervised classification are presented in an informal, rather than axiomatic, treatment so that the reader can quickly acquire the necessary background for applying the concepts to real problems. More advanced topics, such as semi-supervised classification, combining clustering algorithms and relevance feedback are addressed in the later chapters. This book is suitable for undergraduates and graduates studying pattern recognition and machine learning. © Springer Science+Business Media New York 2013. All rights are reserved.

Automatic Recognition of Long Period Events From Volcano Tectonic Earthquakes at Cotopaxi Volcano

Abstract and Figures

Recommended publications

Feature Selection of Seismic Waveforms for Long Period Event Detection at Cotopaxi Volcano

On finding possible frequencies for recognizing microearthquakes at Cotopaxi volcano: A machine lear...

Comparative analysis of automated classifiers applied to volcano event identification

ESeismic: Towards an Ecuadorian volcano seismic repository