
Forest Fire Susceptibility Modeling Using a Convolutional Neural Network for Yunnan Province of China

September 2019
International Journal of Disaster Risk Science 10(3)


DOI:10.1007/s13753-019-00233-1

License
CC BY

Authors:

Guoli Zhang

Beijing Normal University

Ming Wang

Beijing Normal University

Kai Liu

Beijing Normal University

Forest fires have caused considerable losses to ecologies, societies, and economies worldwide. To minimize these losses and reduce forest fires, modeling and predicting the occurrence of forest fires are meaningful because they can support forest fire prevention and management. In recent years, the convolutional neural network (CNN) has become an important state-of-the-art deep learning algorithm, and its implementation has enriched many fields. Therefore, we proposed a spatial prediction model for forest fire susceptibility using a CNN. Past forest fire locations in Yunnan Province, China, from 2002 to 2010, and a set of 14 forest fire influencing factors were mapped using a geographic information system. Oversampling was applied to eliminate the class imbalance, and proportional stratified sampling was used to construct the training/validation sample libraries. A CNN architecture that is suitable for the prediction of forest fire susceptibility was designed and hyperparameters were optimized to improve the prediction accuracy. Then, the test dataset was fed into the trained model to construct the spatial prediction map of forest fire susceptibility in Yunnan Province. Finally, the prediction performance of the proposed model was assessed using several statistical measures—Wilcoxon signed-rank test, receiver operating characteristic curve, and area under the curve (AUC). The results confirmed the higher accuracy of the proposed CNN model (AUC 0.86) than those of the random forests, support vector machine, multilayer perceptron neural network, and kernel logistic regression benchmark classifiers. The CNN has stronger fitting and classification abilities and can make full use of neighborhood information, which is a promising alternative for the spatial prediction of forest fire susceptibility. This research extends the application of CNN to the prediction of forest fire susceptibility.



Guoli Zhang · Ming Wang (1, 2) · Kai Liu (1, 2)

©The Author(s) 2019


Keywords China · Convolutional neural network · Forest fire susceptibility · Geographic information system · Machine learning

Corresponding author: Ming Wang, wangming@bnu.edu.cn

Affiliations: (1) State Key Laboratory of Earth Surface Processes and Resource Ecology / Academy of Disaster Reduction and Emergency Management, Faculty of Geographical Science, Beijing Normal University, Beijing 100875, China; (2) Key Laboratory of Environmental Change and Natural Disasters, Ministry of Education, Beijing Normal University, Beijing 100875, China

1 Introduction

Forest fires have caused considerable losses in global forest resources and people’s lives and property, seriously impacting the global ecological balance, and have received considerable attention from countries worldwide. In recent years, with global warming, industrialization, and human interventions, the frequency and severity of forest fires have been increasing significantly in many parts of the world (Crimmins 2006; Running 2006; Hantson et al. 2015). Regional forest fire susceptibility is often affected by many factors and has typical nonlinear and complex characteristics; therefore, it is still a difficult task to develop forest fire prediction models with satisfactory accuracy (Ngoc Thach et al. 2018). Various approaches have been developed for modeling forest fire susceptibility, ranging from physics-based methods to statistical and machine learning (ML) methods (Dimuccio et al. 2011; Tien Bui et al. 2017; Leuenberger et al. 2018; Hong et al. 2019; Jaafari et al. 2019). Compared to traditional qualitative and statistical analysis methods, ML approaches have shown the ability to provide better results for the spatial prediction of wildfires (Bar Massada et al. 2013). In the last decade, various ML algorithms have been successfully developed and widely applied for producing wildfire susceptibility maps, including artificial neural networks (Dimuccio et al. 2011; Bisquert et al. 2012; Satir et al. 2016), random forests (RF) (Oliveira et al. 2012; Arpaci et al. 2014; Pourtaghi et al. 2016), support vector machine (SVM) (Hong et al. 2018), multilayer perceptron neural network (MLP) (Vasconcelos et al. 2001; Satir et al. 2016), kernel logistic regression (KLR) (Hong et al. 2015; Bui, Le, et al. 2016), naive Bayes (Elmas and Sönmez 2011; Jaafari et al. 2018), gradient boosted decision trees (Sachdeva et al. 2018), and particle swarm optimized neural fuzzy (Tien Bui et al. 2017). Comparative studies of multiple ML algorithms have also been conducted (Pourtaghi et al. 2016; Cao et al. 2017; Ngoc Thach et al. 2018; Kim et al. 2019). Therefore, advanced ML approaches are very promising for forest fire spatial prediction. However, the ML approaches mentioned above are pixel-based classifiers with shallow architectures, which do not make use of the spatial patterns that are implicit in images (Zhang et al. 2018). In addition, these classifiers directly classify the input data without feature extraction, so representative features cannot be mined from the input data to improve classification accuracy.

Deep learning (DL) methods (Hinton and Salakhutdinov 2006; LeCun et al. 2015) have recently received more attention and achieved remarkable success. Deep learning algorithms attempt to discover multiple representation levels (Schmidhuber 2015) and have been broadly applied in areas such as object recognition and detection, speech recognition, and natural language processing (LeCun et al. 2015). The convolutional neural network (CNN) (LeCun et al. 1998), which has been recognized as one of the most successful and widely used DL algorithms, has produced significant improvements in the latest studies in areas such as disaster damage detection (Muhammad et al. 2018; Vetrivel et al. 2018), remotely sensed image classification (Liu and Abd-Elrahman 2018; Zhang et al. 2018), and landslide susceptibility mapping (Wang et al. 2019). However, none of these studies evaluated the effectiveness of CNN in the prediction of forest fire susceptibility. The first law of geography (Tobler 1970) emphasizes that near things are more closely related than distant things. Determining whether a pixel is an ignition point should therefore consider not only the pixel itself but also the other pixels within a certain range around it. While pixel-based classifiers may overlook certain information in spatial patterns, the contextual-based CNN explores the complex spatial patterns that are implicit in images (Zhang et al. 2018). The CNN can make full use of contextual information (that is, neighborhood information) and can discover multiple levels of representations from input data, which makes it more suitable for capturing the spatial characteristics of fire events. The DL process reveals the deep features and can distinguish the differences between different geographical units. Therefore, there is practical significance in studying the application of the CNN algorithm in forest fire susceptibility analysis.

Forest fire susceptibility, in this article, is defined as the probability estimation of fire occurrence in a region. The main objective of this study is to utilize contextual-based CNN with deep architectures for the spatial prediction of regional forest fire susceptibility in Yunnan Province, China. The forest fire susceptibility model was established based on a CNN and the hyperparameters of the model were optimized to improve the prediction accuracy. The performance of the proposed model was compared with benchmark methods using several statistical measures: the Wilcoxon signed-rank test (WSRT), the receiver operating characteristic (ROC) curve, and the area under the curve (AUC).

2 Study Area and Data Collection

Yunnan Province is located in southwestern China (Fig. 1). It is a mountainous region with over 90% mountain and plateau landscape interspersed with less than 10% small, scattered valley basins. The terrain slopes downward from northwest to southeast, and elevation ranges from 0 to 6135 m. The region has a plateau-type tropical monsoon climate, and average temperatures in the summer and winter are 19–22 °C and 6–8 °C, respectively. The distribution of precipitation throughout the seasons and regions is extremely uneven. The winter and spring seasons, from November to April of the following year, are dry seasons with precipitation accounting for only 20% or less of the 1100 mm annual precipitation. Yunnan has abundant and diverse forest resources, including tropical rainforests, seasonal rainforests, subtropical evergreen broad-leaved forests, and temperate coniferous forests. Compared with other regions in China, the frequency of forest fires in Yunnan is relatively high (Ying et al. 2018). The occurrence of forest fires in Yunnan has a strong seasonal pattern, mainly concentrated in the spring from mid-February to mid-May.

Compiling a forest fire inventory is a mandatory task for forest fire susceptibility modeling (Tien Bui et al. 2017). In this study, a forest fire event map was prepared using multiple resources including historical fire reports and the interpretation of satellite images (see Cao et al. (2017) for details). From 2002 to 2010, a total of 7675 fires occurred, of which 4428 occurred in the spring, accounting for 58% of the total number of fires. The occurrence of forest fires is affected by many factors, and selecting the appropriate influencing factors is important (Pew and Larsen 2001; Guo et al. 2016). Fourteen forest fire influencing factors were selected, and all datasets were prepared in the form of raster/vector data, as described and listed in Table 1.

3 Methodology

This section first provides a preview of the CNN algorithm, and then elaborates the procedure of the susceptibility model development through a series of processes including data preprocessing and sample library generation, model architecture design and parameter adjustment, and performance evaluation. Finally, the detailed methodological flowchart is described.

3.1 Preview of Convolutional Neural Network

The convolutional neural network (CNN) is one of the most notable DL approaches and has exhibited robust performance in feature learning for image classification and recognition. It is a feed-forward neural network whose parameters are trained by using the classic stochastic gradient descent based on the backpropagation algorithm (Hu et al. 2015).

Generally, the CNN consists of several building blocks: convolutional, pooling, and fully connected layers (Yamashita et al. 2018). The different types of processing layers play different roles. The convolutional layers, which perform linear convolution operations between the input tensor and a set of filters, output the feature maps. Typically, each feature map is then followed by a nonlinear activation function. The rectified linear unit (ReLU), which performs the nonlinear transformation of the feature map generated by the convolution layer and introduces nonlinearity into the system, is the most commonly used activation function.

The purpose of the convolution operation is to extract different input layer features and achieve weight sharing. The input and output of each stage are sets of arrays called feature maps. For example, if the input is a 2-dimensional image $x$, the input is first decomposed into a sequential array $x = \{x_1, x_2, \ldots, x_N\}$. The convolutional layer is defined as:

$$y_j = f\Big(b_j + \sum_i k_{ij} * x_i\Big)$$

where $y_j$ denotes the $j$th output for the convolutional layer and $x_i$ denotes each input feature map. $k_{ij}$ denotes the convolutional kernel with the $i$th input map $x_i$, $*$ denotes the discrete convolution operator, $b_j$ denotes a trainable bias, and $f$ is the nonlinear activation.

Fig. 1 Location of Yunnan Province, the study area, in China

The pooling layers perform a subsampling operation to reduce the dimensions of the feature maps. According to the maximum and average functions, the pooling layer can be divided into the max-pooling and average-pooling layers. The fully connected layers, which are the flat feed-forward neural network layers, provide high-level abstraction features. They are often used at the end of the network architecture and create the final nonlinear combinations of features for making the predictions by the network. The activation function for the last fully connected layer needs to be selected reasonably based on given tasks. The softmax or sigmoid function can be used to compute the posterior probability for each grid cell.
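The convolutional layer formula and the max-pooling step described above can be illustrated with a minimal NumPy sketch. This is only an illustration under stated assumptions, not the authors' implementation: the function names `conv2d_same` and `max_pool` are hypothetical, the kernels are applied without flipping (the cross-correlation convention that most DL libraries use), and zero padding keeps the in-plane size.

```python
import numpy as np

def conv2d_same(x, kernels, bias):
    """Sketch of y_j = f(b_j + sum_i k_ij * x_i) with f = ReLU.

    x:       (H, W, C_in) input feature maps x_i
    kernels: (kh, kw, C_in, C_out) kernels k_ij
    bias:    (C_out,) trainable biases b_j
    Assumption: zero padding, stride 1, no kernel flipping.
    """
    kh, kw, _, c_out = kernels.shape
    ph, pw = kh // 2, kw // 2
    padded = np.pad(x, ((ph, ph), (pw, pw), (0, 0)))
    h, w, _ = x.shape
    out = np.zeros((h, w, c_out))
    for r in range(h):
        for c in range(w):
            window = padded[r:r + kh, c:c + kw, :]
            # Sum over kernel rows, kernel columns, and input maps i.
            out[r, c, :] = np.tensordot(window, kernels, axes=3) + bias
    return np.maximum(out, 0.0)  # ReLU activation

def max_pool(x, size=2, stride=2):
    """Max-pooling over size x size neighborhoods with the given stride."""
    h, w, c = x.shape
    oh = (h - size) // stride + 1
    ow = (w - size) // stride + 1
    out = np.empty((oh, ow, c))
    for r in range(oh):
        for col in range(ow):
            block = x[r * stride:r * stride + size,
                      col * stride:col * stride + size, :]
            out[r, col, :] = block.max(axis=(0, 1))
    return out

# Toy input: a 25 x 25 patch with 11 channels and 4 filters of size 3 x 3.
patch = np.ones((25, 25, 11))
feature_maps = conv2d_same(patch, np.ones((3, 3, 11, 4)), np.zeros(4))
pooled = max_pool(feature_maps)
print(feature_maps.shape, pooled.shape)
```

The explicit loops are slow but keep the correspondence with the formula visible; a DL framework would perform the same computation in a vectorized form.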

3.2 Data Preprocessing and Sample Library Generation

First, appropriate forest fire influencing factors were selected, and variables raster datasets (VRDs) and ignition raster datasets (IRDs) were constructed through preprocessing. Then, the sample libraries were collected from the established IRDs and VRDs using an appropriate sampling method.

3.2.1 Forest Fire Influencing Factors

Four categories of forest fire influencing factors were considered, including topography-related, climate-related, vegetation-related, and human activities-related variables. ArcGIS 10.5 was employed for handling geographic data.

The effect of the topography has been considered as a significant feature in forest fire assessment (Renard et al. 2012; Adab et al. 2013). Three topography-related influencing factors (elevation, slope, and aspect) were retrieved from the digital elevation model (DEM). The DEM was produced by the Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER) GDEM version 2 (with a 30 m pixel size). Surface roughness was obtained from the Climate Forecast System Reanalysis (CFSR). The climatic characteristics of an area affect the occurrence and intensity of forest fires (Moritz et al. 2012). Six meteorologically related influencing factors (average temperature, average precipitation, average wind speed, maximum temperature, specific humidity, and precipitation rate) were obtained from the CFSR. The spatial resolution is approximately 0.3 degree (0.312° × 0.312°) with a 6 h temporal resolution. These influencing factors were calculated as the spring means from March through May. The six meteorologically related factors were finally mapped with the inverse distance weighted (IDW) interpolation method.

The Moderate Resolution Imaging Spectroradiometer (MODIS) normalized difference vegetation index (NDVI) has also been identified as an important variable in forest fire modeling (Bajocco et al. 2015). The NDVI values reflect the vegetation’s health and essentially the fuel load distribution (Yi et al. 2013). The MODIS NDVI monthly synthetic products with a 500 m resolution were obtained from the Geospatial Data Cloud website. The clip and raster calculator tools in ArcGIS 10.5 were applied to calculate the annual spring NDVI of Yunnan for the period 2002–2010. The NDVI map for each year was calculated by taking the average NDVI of the spring months (March, April, and May). The forest coverage data for Yunnan that were used in this research were derived from the vegetation map of the People’s Republic of China (1:1,000,000). The forest coverage ratio map was calculated by the ratio of the forest area of each pixel to the total area of the pixel. The distance from the river network and the distance from the road network (highways, main roads, and local roads) were obtained by applying the buffer tool in the proximity toolbox, which calculates the Euclidean distance from the road and river networks. Then, maps of the proximity to roads and rivers were produced. Figure 2 shows 11 of the 14 influencing factor maps for 2010. Not shown in Fig. 2 are maximum temperature, specific humidity, and precipitation rate, which are discussed in more detail in Sect. 4.1.

Table 1 Data description of forest fire influencing factors

| No. | Data | Original scale/resolution | Unit | Original data format | Source |
|---|---|---|---|---|---|
| 1 | Elevation | 30 m | m | Raster | ASTER GDEM |
| 2 | Slope | 30 m | degree | Raster | ASTER GDEM |
| 3 | Aspect | 30 m | degree | Raster | ASTER GDEM |
| 4 | Average temperature | 0.312° (lat/long) | °C | NetCDF | CFSR |
| 5 | Average precipitation | 0.312° (lat/long) | kg/m² | NetCDF | CFSR |
| 6 | Surface roughness | 0.312° (lat/long) | Ratio | NetCDF | CFSR |
| 7 | Average wind speed | 0.312° (lat/long) | m/s | NetCDF | CFSR |
| 8 | Maximum temperature | 0.312° (lat/long) | °C | NetCDF | CFSR |
| 9 | Specific humidity | 0.312° (lat/long) | Ratio | NetCDF | CFSR |
| 10 | Precipitation rate | 0.312° (lat/long) | mm/h | NetCDF | CFSR |
| 11 | Forest coverage ratio | 1:1,000,000 | Ratio | Vector | Vegetation map (Zhang 2007) |
| 12 | NDVI | 500 m | Ratio | Raster | Geospatial Data Cloud |
| 13 | Distance to roads | 1:40,000,000 | km | Vector | Basic geographic data |
| 14 | Distance to rivers | 1:40,000,000 | km | Vector | Basic geographic data |

ASTER GDEM advanced spaceborne thermal emission and reflection radiometer global digital elevation map; CFSR climate forecast system reanalysis; NetCDF network common data form; NDVI normalized difference vegetation index

Data sources: ASTER GDEM, https://search.earthdata.nasa.gov; CFSR, https://rda.ucar.edu; Geospatial Data Cloud, http://www.gscloud.cn; Basic geographic data, http://www.ngcc.cn
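The inverse distance weighted (IDW) interpolation used to map the coarse-resolution meteorological factors can be sketched as follows. The study performed this step in ArcGIS; the NumPy function below, its name `idw_interpolate`, and the power parameter of 2 are illustrative assumptions rather than the settings reported by the authors.

```python
import numpy as np

def idw_interpolate(known_xy, known_values, query_xy, power=2.0, eps=1e-12):
    """Inverse distance weighted interpolation (hypothetical helper).

    known_xy:     (K, 2) coordinates of the source grid points
    known_values: (K,) values at those points, e.g. spring mean temperature
    query_xy:     (Q, 2) coordinates of the target raster cell centers
    power:        distance exponent; 2 is a common default, assumed here
    """
    known_xy = np.asarray(known_xy, dtype=float)
    known_values = np.asarray(known_values, dtype=float)
    query_xy = np.asarray(query_xy, dtype=float)
    # Pairwise distances between every query point and every known point.
    diff = query_xy[:, None, :] - known_xy[None, :, :]
    dist = np.linalg.norm(diff, axis=2)
    # eps avoids division by zero when a query coincides with a known point.
    weights = 1.0 / np.maximum(dist, eps) ** power
    return (weights @ known_values) / weights.sum(axis=1)

# Two source points with values 10 and 20; the midpoint gets their mean.
estimates = idw_interpolate([[0.0, 0.0], [1.0, 0.0]], [10.0, 20.0],
                            [[0.5, 0.0], [0.0, 0.0]])
print(estimates)
```

Closer source points dominate the estimate, so the interpolated surface passes (numerically) through the source values and varies smoothly in between.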

3.2.2 Influencing Factor Evaluators

Variable selection is particularly important in the prediction of forest fire susceptibility. The high dimensionality of the training dataset may complicate the prediction process and decrease the prediction accuracy. In this study, multicollinearity analysis and the information gain ratio (IGR) were selected to evaluate the forest fire influencing factors. Multicollinearity analysis (O’Brien 2007) was applied to estimate the correlation between the forest fire influencing factors. Two measures, the variance inflation factor (VIF) and the tolerance (TOL), were used to identify the degree of multicollinearity among the forest fire influencing factors. A factor is considered to be multicollinear if its tolerance is less than 0.1 or its VIF value is greater than 10 (Colkesen et al. 2016). The IGR is an effective approach for selecting an optimal subset of variables that can represent the whole dataset to improve the prediction performance in forest fire susceptibility mapping (Dash and Liu 1997; Jaafari et al. 2018). The average merit (AM) reveals the importance of forest fire influencing factors in predicting forest fire occurrence (Jaafari et al. 2018).
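The multicollinearity check can be illustrated with a short NumPy sketch. It uses the standard definitions VIF_j = 1 / (1 − R_j²) and TOL_j = 1 / VIF_j, where R_j² comes from regressing factor j on the remaining factors; the function name and the synthetic data are assumptions for illustration, and the paper does not state which software performed this analysis.

```python
import numpy as np

def vif_and_tolerance(X):
    """Return (VIF, TOL) for each column of X (samples x factors).

    Each factor is regressed on all other factors plus an intercept;
    VIF = 1 / (1 - R^2) and TOL = 1 / VIF.
    """
    X = np.asarray(X, dtype=float)
    n, p = X.shape
    vif = np.empty(p)
    for j in range(p):
        y = X[:, j]
        others = np.delete(X, j, axis=1)
        design = np.column_stack([np.ones(n), others])
        coef, *_ = np.linalg.lstsq(design, y, rcond=None)
        residual = y - design @ coef
        r_squared = 1.0 - residual.var() / y.var()
        vif[j] = 1.0 / max(1.0 - r_squared, 1e-12)
    return vif, 1.0 / vif

# Synthetic factors: the third is almost a copy of the first.
rng = np.random.default_rng(0)
a = rng.normal(size=500)
b = rng.normal(size=500)
c = a + 0.01 * rng.normal(size=500)
vif, tol = vif_and_tolerance(np.column_stack([a, b, c]))
flagged = (vif > 10) | (tol < 0.1)  # thresholds quoted in the text
print(flagged)
```

In this toy case the first and third columns are flagged as multicollinear while the independent second column is not.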

3.2.3 Building Variables Raster Datasets and Ignition Raster Datasets

The min–max normalization process was conducted for the influencing factor maps to avoid the potential bias caused by the unbalanced magnitudes of factors. Rasterization was performed by the ArcGIS model builder tool to convert all maps to a raster format with the same pixel size (5000 m × 5000 m), the same data type (8-bit unsigned integer), and the same coordinate system (WGS 1984 Web Mercator), resulting in 18,718 cells in total. Finally, the composite bands tool was used to combine all the factor maps in a year into one raster, which was named the VRD. All the data of influencing factors from 2002 to 2010 were processed in the same way and a total of nine VRDs were finally established.

Class imbalance refers to the number of events in the nonfire class being much greater than that in the fire class (López et al. 2013). Class imbalance has a detrimental impact on the classification performance of the CNN and oversampling has been proved to be a robust solution for solving this problem in DL (Buda et al. 2018). In addition, according to the spatial characteristics of forest fire events, a certain area near the pixel of a fire vector point may be a fire-prone area. Therefore, to solve the class imbalance, buffer analysis was used with the existing fire vector points (Cao et al. 2017) to increase the number of events in the fire class. The pixels in the 5 km buffer zone were resampled to 1 (fire), and the pixels outside the buffer zone were resampled to 0 (nonfire). Then, the generated raster data, the IRD, were obtained. All fire datasets from 2002 to 2010 were processed, and nine IRDs for the corresponding years were obtained. All IRDs use the same pixel size, data type, and coordinate system as the VRDs.
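The two preprocessing steps of this subsection, min–max normalization and buffer-based labeling, can be sketched in NumPy as follows. The study carried them out in ArcGIS, so everything here is an assumption made for illustration: the function names are hypothetical, normalized values are scaled to 0–255 to match the 8-bit data type, and the buffer is approximated by the distance between cell centers.

```python
import numpy as np

def min_max_to_uint8(band):
    """Min-max normalize one factor map and store it as 8-bit integers.

    Assumption: the normalized range [0, 1] is stretched to [0, 255].
    """
    band = np.asarray(band, dtype=float)
    lo, hi = band.min(), band.max()
    if hi == lo:
        return np.zeros(band.shape, dtype=np.uint8)
    scaled = (band - lo) / (hi - lo)
    return np.round(scaled * 255).astype(np.uint8)

def ignition_raster(shape, fire_cells, cell_size_m=5000.0, buffer_m=5000.0):
    """Build an IRD: 1 inside the buffer around any fire cell, else 0.

    fire_cells: iterable of (row, col) indices of recorded fire points.
    Assumption: a cell is inside the buffer when its center lies within
    buffer_m of the center of a fire cell.
    """
    rows, cols = np.indices(shape)
    ird = np.zeros(shape, dtype=np.uint8)
    for r, c in fire_cells:
        dist_m = np.hypot(rows - r, cols - c) * cell_size_m
        ird[dist_m <= buffer_m] = 1
    return ird

# One fire point in the middle of a 5 x 5 grid of 5 km cells.
ird = ignition_raster((5, 5), [(2, 2)])
elevation = min_max_to_uint8([[120.0, 800.0], [2400.0, 6135.0]])
print(ird.sum(), elevation.min(), elevation.max())
```

With the center-to-center rule a single fire point labels five cells; a GIS buffer that counts any cell intersecting the 5 km zone would label more, so the exact number of buffered cells depends on that choice.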

3.2.4 Sample Library Generation

Since the CNN conducts its effective training in a fully supervised manner, the training and validation sample libraries must be constructed first. A binary classification method was adopted for the susceptibility analysis of forest fires, and samples were classified to either the forest fire class or the nonfire class, with 1 representing the fire class and 0 representing the nonfire class.

The CNN has many learnable parameters to estimate; thus, this predictor requires more data to achieve sufficient training. Table 2 shows that there is a large annual variation in the actual number of fire points in 2002–2009 (data from 2010 were used as the test dataset and therefore not included in the training and validation process). A simple random sampling method may lead to sample quantity imbalances, while small samples would lead to overfitting the model. Therefore, to generate more samples and keep the sample quantity balanced, proportional stratified sampling was adopted. The total number of forest fires was multiplied by the sampling rate (0.8) to determine the number of fire samples in every year. The same number of nonfire samples was randomly selected, which constitutes a total of 49,706 training samples.
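The sample counts can be reproduced with a few lines of arithmetic. The sketch below takes the yearly "samples in the fire buffer" values from Table 2 and applies the 0.8 sampling rate; rounding to the nearest integer is an assumption that happens to reproduce the published counts.

```python
# Yearly number of samples in the fire buffer, taken from Table 2.
buffer_samples = {
    2002: 1536, 2003: 2687, 2004: 4234, 2005: 4708,
    2006: 4801, 2007: 6092, 2008: 3014, 2009: 3994,
}
sampling_rate = 0.8

# Proportional stratified sampling: the same rate is applied to each year.
fire_samples = {year: round(n * sampling_rate)
                for year, n in buffer_samples.items()}

total_fire = sum(fire_samples.values())
# An equal number of nonfire samples is drawn, doubling the total.
total_training = 2 * total_fire
# The remaining buffered samples, again matched by nonfire samples.
total_validation = 2 * (sum(buffer_samples.values()) - total_fire)

print(fire_samples[2002], total_fire, total_training, total_validation)
```

Because the rate is applied per year, years with many fires (such as 2007) contribute proportionally more samples than quiet years (such as 2002), while the fire and nonfire classes stay balanced.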

After the number of samples needed for each year is determined (Table 2), a list of XY coordinates for the fire or nonfire class was randomly generated and the locations of the sample points were recorded. The list contains the same number of coordinates as the sample quantity of each year (Table 2), and XY was defined using the row and column number of the IRD. Then the same number of windows were defined, each with a size of 25 × 25 pixels and centered on one of the XY coordinates in the list. The VRD within each 25 × 25 pixel window was then extracted as a 3-dimensional array, n × n × c, where n denotes the row and column size of each input patch and c represents the number of forest fire influencing factors. Each sample consisted of two parts: a 3-dimensional array containing influencing factors; and a corresponding ground truth label from the IRD at the central XY coordinates. The dataset was divided randomly into two parts: (1) 80% as the training samples (49,706 pixels); and (2) the remaining 20% as the validation samples (12,426 pixels).

Fig. 2 Forest fire influencing factor maps in 2010 for Yunnan Province, China. NDVI normalized difference vegetation index
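The window extraction described above can be sketched in NumPy. The helper names are hypothetical, the VRD and IRD are random stand-ins, and zero padding at the raster border is an assumption, since the text does not say how windows that extend beyond the study area were handled.

```python
import numpy as np

def extract_patch(vrd, row, col, window=25):
    """Return the window x window x c patch of the VRD centered on (row, col).

    Assumption: cells outside the raster are filled with zeros.
    """
    half = window // 2
    padded = np.pad(vrd, ((half, half), (half, half), (0, 0)), mode="constant")
    # After padding, the original cell (row, col) sits at (row + half, col + half).
    return padded[row:row + window, col:col + window, :]

def build_samples(vrd, ird, coords, window=25):
    """Pair each patch with the IRD label at its central coordinate."""
    patches = np.stack([extract_patch(vrd, r, c, window) for r, c in coords])
    labels = np.array([ird[r, c] for r, c in coords])
    return patches, labels

# Stand-in rasters: 60 x 80 cells, 11 influencing factors.
rng = np.random.default_rng(42)
vrd = rng.integers(0, 256, size=(60, 80, 11)).astype(np.uint8)
ird = rng.integers(0, 2, size=(60, 80)).astype(np.uint8)
coords = [(0, 0), (30, 40), (59, 79)]

X, y = build_samples(vrd, ird, coords)
print(X.shape, y.shape)
```

Each sample is therefore an n × n × c array (25 × 25 × 11 here) with a single 0/1 label taken from the central pixel, which is the form expected by the first convolution layer.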

3.3 The Proposed Convolutional Neural Network Architectures and Related Parameters

The architecture of the proposed convolutional neural network (CNN) model was completed by referring to the AlexNet model (Krizhevsky et al. 2013) and the architecture and hyperparameters were tuned based on our datasets. As mentioned above, each input patch was a 3-dimensional data representation of size n × n × c, taking 25 × 25 × 11 as an example. Figure 3 shows the main architecture of the prediction model of the CNN for forest fire susceptibility. There were a total of three convolution layers, two pooling layers, and three fully connected layers. The first three consecutive convolution layers had 64, 128, and 256 kernels, with uniform kernel sizes of 3 × 3. Each convolution layer was followed by an activation function (ReLU) and a pooling layer. Zero padding was employed to retain in-plane dimensions. All of the pooling layers perform max-pooling and summarize a 2 × 2 neighborhood with a stride of 2 pixels. At the end, the next three weight layers were fully connected layers with 128, 64, and 32 neurons each. Finally, the output of the last fully connected layer was fed into a 2-way classifier with an activation function named softmax, which computes the probabilities for the two classes’ labels.

Fig. 3 The architectural design of the proposed convolutional neural network (CNN) model (C1–C3 are convolutional layers and FC1–FC3 are fully connected layers. The values on the right and below C1–C3 indicate the number of filters and their sizes. The values below FC1–FC3 indicate their dimensions, that is, the number of neurons in the fully connected layer.). ReLU rectified linear unit

Table 2 Training sample statistics

| Year | Actual fire points | Samples in the fire buffer | Sampling rate | Number of fire samples | Number of nonfire samples |
|---|---|---|---|---|---|
| 2002 | 158 | 1536 | 0.8 | 1229 | 1229 |
| 2003 | 267 | 2687 | 0.8 | 2150 | 2150 |
| 2004 | 522 | 4234 | 0.8 | 3387 | 3387 |
| 2005 | 620 | 4708 | 0.8 | 3766 | 3766 |
| 2006 | 625 | 4801 | 0.8 | 3841 | 3841 |
| 2007 | 835 | 6092 | 0.8 | 4874 | 4874 |
| 2008 | 339 | 3014 | 0.8 | 2411 | 2411 |
| 2009 | 491 | 3994 | 0.8 | 3195 | 3195 |
| Total | 3857 | 31,066 | | 24,853 | 24,853 |
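To make the layer dimensions concrete, the following pure-Python sketch traces the tensor shapes and counts the learnable parameters of the described architecture. It is a bookkeeping aid, not the authors' Keras code. One assumption is labeled explicitly: the text mentions two pooling layers, so pooling is placed after the first two convolution layers only; a different placement would change the size of the flattened feature vector.

```python
def trace_architecture(n=25, c=11, kernels=(64, 128, 256),
                       pool_after=(True, True, False),
                       dense=(128, 64, 32), classes=2):
    """Trace output shapes and parameter counts layer by layer.

    Assumptions: 3 x 3 kernels with zero padding and stride 1 (in-plane
    size unchanged), 2 x 2 max-pooling with stride 2, and pooling only
    after the convolution layers flagged in pool_after.
    """
    layers = []
    size, channels, total = n, c, 0
    for n_kernels, pooled in zip(kernels, pool_after):
        params = 3 * 3 * channels * n_kernels + n_kernels  # weights + biases
        total += params
        channels = n_kernels
        layers.append(("conv 3x3 + ReLU", (size, size, channels), params))
        if pooled:
            size = (size - 2) // 2 + 1
            layers.append(("max-pool 2x2", (size, size, channels), 0))
    units = size * size * channels  # flattened feature vector
    for width in tuple(dense) + (classes,):
        params = units * width + width
        total += params
        layers.append(("fully connected", (width,), params))
        units = width
    return layers, total

layers, total_params = trace_architecture()
for name, shape, params in layers:
    print(f"{name:18s} {str(shape):16s} {params:>10,d}")
print("total learnable parameters:", f"{total_params:,d}")
```

Under these assumptions the 25 × 25 input shrinks to 12 × 12 and then 6 × 6, so the first fully connected layer receives 6 × 6 × 256 = 9216 features and holds most of the learnable parameters.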

The parameters in the CNN, which are automatically learned during the training process, refer to kernels in the convolution layers and weights in the fully connected layers. Training the CNN is a process of finding appropriate parameters to minimize the error between the predicted results and the ground truth labels on a training dataset. The CNN converted each input patch from the original pixel values to the final probability classification results through feed-forward propagation, and the error under the current parameters was calculated by a loss function. The learnable parameters were updated according to the loss value by using the stochastic gradient descent based on the backpropagation algorithm.

The hyperparameters (Table 3) are the variables that need to be set before the training process begins. Dropout is a recently introduced regularization technique, and a dropout rate of 0.5 was applied after the fully connected layers to mitigate overfitting. Bergstra and Bengio (2012) argued that random searches are more efficient for hyperparameter optimization than grid searches and manual searches. A random search was used to optimize the hyperparameters and improve the accuracy and speed of the model. Adam, an efficient stochastic optimization algorithm based on the gradient (Kingma and Ba 2014), was selected as the optimizer.
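A random search over hyperparameters can be sketched in a few lines of standard-library Python. The search space, the function name `random_search`, and the stand-in objective are all hypothetical; the paper does not list the candidate values it explored, and in practice the objective would train the CNN and return its validation accuracy.

```python
import random

# Hypothetical candidate values; the paper does not report its search space.
SEARCH_SPACE = {
    "learning_rate": [1e-4, 1e-3, 1e-2],
    "window_size": [15, 25, 35],
    "epochs": [50, 100, 200],
    "dropout": [0.3, 0.5, 0.7],
}

def random_search(evaluate, space, n_trials=20, seed=0):
    """Sample n_trials random configurations and keep the best-scoring one.

    evaluate: callable mapping a configuration dict to a score to maximize
              (for example, validation accuracy of a trained model).
    """
    rng = random.Random(seed)
    best_config, best_score = None, float("-inf")
    for _ in range(n_trials):
        config = {name: rng.choice(options) for name, options in space.items()}
        score = evaluate(config)
        if score > best_score:
            best_config, best_score = config, score
    return best_config, best_score

def toy_objective(config):
    """Stand-in for model training; peaks at lr = 1e-3 and dropout = 0.5."""
    return -abs(config["learning_rate"] - 1e-3) - abs(config["dropout"] - 0.5)

best_config, best_score = random_search(toy_objective, SEARCH_SPACE)
print(best_config, best_score)
```

Unlike a grid search, the number of trials is fixed in advance and independent of how many hyperparameters are included, which is the efficiency argument made by Bergstra and Bengio (2012).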

3.4 Performance Evaluation

The evaluation criteria are a key factor in assessing the classification performance and guiding the classifier modeling (Sokolova and Lapalme 2009). In this article, a two-class classification method is modeled to predict forest fire susceptibility. Thus, five statistical measures including overall accuracy, specificity, sensitivity, positive predictive value (PPV), and negative predictive value (NPV) are employed to appraise the classification capability (Tien Bui et al. 2017). The five statistical measures are computed in the following manner:

$$\text{Overall accuracy} = \frac{TP + TN}{TP + TN + FP + FN}, \quad \text{Specificity} = \frac{TN}{FP + TN}$$

$$\text{Sensitivity} = \frac{TP}{TP + FN}, \quad \text{PPV} = \frac{TP}{FP + TP}, \quad \text{NPV} = \frac{TN}{FN + TN}$$

where TP (true positive) and TN (true negative) are the number of samples that are correctly classified as positive (fire class) and negative (nonfire class) observations, respectively. FP (false positive) and FN (false negative) are the number of samples that are misclassified. Sensitivity is the percentage of positive (fire class) observations that are correctly classified whereas specificity is the percentage of negative (nonfire class) observations that are correctly identified.
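The five measures follow directly from the four confusion-matrix counts. The sketch below assumes binary labels with 1 for the fire class and 0 for the nonfire class; the function name is hypothetical, and no guard against empty classes is included, so a denominator of zero would raise an error.

```python
import numpy as np

def classification_measures(y_true, y_pred):
    """Compute the five measures from binary labels (1 = fire, 0 = nonfire)."""
    y_true = np.asarray(y_true).astype(bool)
    y_pred = np.asarray(y_pred).astype(bool)
    tp = int(np.sum(y_true & y_pred))    # fire predicted as fire
    tn = int(np.sum(~y_true & ~y_pred))  # nonfire predicted as nonfire
    fp = int(np.sum(~y_true & y_pred))   # nonfire predicted as fire
    fn = int(np.sum(y_true & ~y_pred))   # fire predicted as nonfire
    return {
        "overall_accuracy": (tp + tn) / (tp + tn + fp + fn),
        "specificity": tn / (fp + tn),
        "sensitivity": tp / (tp + fn),
        "ppv": tp / (fp + tp),
        "npv": tn / (fn + tn),
    }

# Toy example: 4 fire and 4 nonfire samples, one error in each class.
truth = [1, 1, 1, 0, 0, 0, 0, 1]
prediction = [1, 1, 0, 0, 0, 1, 0, 1]
print(classification_measures(truth, prediction))
```

In this toy case TP = 3, TN = 3, FP = 1, and FN = 1, so all five measures equal 0.75.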

Table 3 A list of parameters and hyperparameters utilized in the convolutional neural network (CNN) model

| Layer | Parameters | Hyperparameters |
|---|---|---|
| Convolution layer | Kernels | Kernel size: 3 × 3; number of kernels: 64, 128, 256; stride = 1; padding; activation function = ReLU |
| Pooling layer | None | Pooling method: max-pooling; filter size: 2 × 2; stride = 2 |
| Fully connected layer | Weights | Number of weights; activation function: Softmax |
| Others | | Model architecture; initialize weights; optimizer; loss function; window size; epochs; learning rate; dropout |

ReLU rectified linear unit

The ROC curve has been increasingly used to evaluate the global performance of prediction models in ML and data mining research (Pourtaghi et al. 2016; Satir et al. 2016). It depicts the trade-off between the TPs and FPs across thresholds rather than at one arbitrarily selected threshold (Freeman and Moisen 2008). A ROC plot is constructed by plotting the TP rate (TPrate, sensitivity) on the Y-axis against the FP rate (FPrate, 1.0 − specificity) on the X-axis for all possible thresholds from 0 to 1. The ROC plot for a good classifier tends to rise sharply at the origin and then level off near the maximum value of 1. A trivial classifier will result in a ROC graph near the diagonal, where the TP rate is equal to the FP rate for all thresholds. The AUC is generally considered an important index for quantitatively assessing the overall performance of a classifier. An AUC value near 0.5 means that the predictive ability of the model is no better than random, and a value of 1.0 represents a perfect prediction without misclassification. The closer the AUC value is to 1, the better the performance of the forest fire prediction model. The AUC measure is computed as the area under the graph:

$$\mathrm{AUC} = \frac{1 + TP_{rate} - FP_{rate}}{2}$$
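The ROC construction and the AUC described above can be sketched in a few lines. This is a minimal illustration, not the authors' code: the function names are hypothetical, the threshold sweep uses every distinct score as a cut-off, and the single-threshold function assumes the AUC expression has the form (1 + TPrate − FPrate) / 2, which is the form consistent with the stated values of 0.5 for a random model and 1.0 for a perfect one.

```python
def roc_points(labels, scores):
    """Return (FPrate, TPrate) pairs, one per candidate threshold.

    labels: 1 for fire, 0 for nonfire; scores: predicted fire probabilities.
    Every distinct score is used as a threshold (an illustrative choice).
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    points = [(0.0, 0.0)]
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 1)
        fp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 0)
        points.append((fp / negatives, tp / positives))
    return points


def auc_trapezoid(points):
    """Area under a ROC polyline given as (FPrate, TPrate) points."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0
    return area


def auc_single_threshold(tp_rate, fp_rate):
    """AUC of the ROC polyline (0,0) -> (FPrate, TPrate) -> (1,1)."""
    return (1.0 + tp_rate - fp_rate) / 2.0
```

With a single operating point, the trapezoid area and the closed-form expression coincide, which is why the formula needs only one TPrate and one FPrate.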

3.5 Technical Process for Predicting Forest Fire

Susceptibility

The technical process can be summarized in the following five steps. The detailed procedure of this study is depicted in Fig. 4.

Step 1 Construct the fire and nonfire inventory maps and create maps of the factors that can potentially influence ignition susceptibility.

Step 2 Preprocess all datasets and generate the training/validation samples.

Step 3 Design the architecture of the CNN model, optimize the CNN hyperparameters, and train the CNN classifier.

Step 4 Predict forest fire susceptibility using the VRD of 2010.

Step 5 Evaluate the performance of the proposed model.

Fig. 4 Methodological flowchart employed in this study. CNN convolutional neural network; ROC receiver operating characteristic

The CNN prediction model was built in a graphics processing unit (GPU) accelerated environment using Keras, a Python-based DL library, with TensorFlow as the backend. The system configuration used in the lab environment was as follows: Intel Core i7 CPU, 16 GB RAM, Windows 10 OS, and an NVIDIA GeForce GTX 1070 with 12 GB of onboard memory.

4 Results

This section first presents the results of the multicollinearity analysis and the information gain ratio (IGR) analysis used to select the forest fire influencing factors. The loss and the accuracy in the training/validation phases were tracked. Then, the test dataset was fed into the trained model and the prediction map of ignition probabilities was constructed by the CNN model. Finally, the performance of the proposed model was compared with benchmark methods.

4.1 Relative Importance Analysis of Inﬂuencing

Factors

According to the results of the multicollinearity analysis of the 14 forest fire influencing factors in Table 4, three factors (precipitation rate, specific humidity, and maximum temperature) did not satisfy the critical values: their TOL values fall below 0.1 and their VIF values exceed 10. This suggests the existence of multicollinearity, and these three factors were therefore excluded from further analyses.
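The TOL and VIF columns of Table 4 are reciprocals of each other (for example, 1 / 0.288 ≈ 3.47 for elevation). A minimal sketch of how such values can be computed is given below, assuming the usual definition in which each factor is regressed on all the others and TOL = 1 − R². The function name and the NumPy-based least-squares fit are illustrative choices; the source does not say which software produced Table 4.

```python
import numpy as np


def tolerance_and_vif(X):
    """Return a list of (TOL, VIF) pairs, one per column of X.

    X is an (n_samples, n_factors) array. Each factor is regressed on the
    remaining factors (with an intercept); TOL = 1 - R^2 and VIF = 1 / TOL.
    """
    X = np.asarray(X, dtype=float)
    n_samples, n_factors = X.shape
    results = []
    for j in range(n_factors):
        target = X[:, j]
        others = np.delete(X, j, axis=1)
        design = np.column_stack([np.ones(n_samples), others])
        coef, *_ = np.linalg.lstsq(design, target, rcond=None)
        residual = target - design @ coef
        ss_res = float(residual @ residual)
        ss_tot = float(np.sum((target - target.mean()) ** 2))
        tol = ss_res / ss_tot  # equals 1 - R^2
        results.append((tol, 1.0 / tol))
    return results
```

A factor that is nearly a linear combination of the others gets a TOL close to 0 and a very large VIF, which is the pattern seen for maximum temperature and specific humidity in Table 4.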

For the IGR method, a factor with a higher average merit (AM) value contributes more to the predictive ability of the model, whereas a factor with an AM value equal to or less than 0 makes a "null" contribution to the forest fire susceptibility model and should be excluded from further analysis (Bui, Tuan, et al. 2016). The results in Fig. 5 show that the AM values of all 11 remaining influencing factors are greater than 0, indicating that all of them contribute to the model and should be retained in the subsequent prediction process. Temperature is the most important factor for forest fire susceptibility, with the highest AM value (0.139). It is followed by wind speed (0.131), surface roughness (0.112), precipitation (0.107), and elevation (0.102).

Table 4 Multicollinearity analysis of forest fire influencing factors

| No. | Forest fire influencing factor | TOL | VIF |
|---|---|---|---|
| 1 | Elevation | 0.288 | 3.47 |
| 2 | Slope | 0.819 | 1.221 |
| 3 | Aspect | 0.999 | 1.001 |
| 4 | Average temperature | 0.157 | 6.372 |
| 5 | Average precipitation | 0.4 | 2.5 |
| 6 | Surface roughness | 0.199 | 5.016 |
| 7 | Average wind speed | 0.106 | 9.443 |
| 8 | Forest coverage ratio | 0.881 | 1.134 |
| 9 | NDVI | 0.608 | 1.645 |
| 10 | Distance to roads | 0.948 | 1.055 |
| 11 | Distance to rivers | 0.932 | 1.073 |
| 12 | Maximum temperature | 0.014 | 69.653 |
| 13 | Specific humidity | 0.017 | 58.696 |
| 14 | Precipitation rate | 0.062 | 16.078 |

NDVI normalized difference vegetation index; TOL tolerance; VIF variance inflation factor

Fig. 5 Average merit (AM) of each forest fire influencing factor using the information gain ratio (IGR) method. [Bar chart; horizontal axis: AM value, 0 to 0.16; vertical axis: forest fire influencing factors, listed as distance to rivers, distance to roads, forest coverage ratio, aspect, slope, NDVI, elevation, precipitation, surface roughness, wind speed, temperature]
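To make the IGR score concrete, the sketch below computes the information gain ratio of one discretized factor with respect to the fire/nonfire label: the reduction in label entropy divided by the entropy of the factor itself. Several things here are assumptions rather than details from the source: the function names are hypothetical, continuous factors are taken to be binned beforehand, and the AM value is taken to be this score averaged over cross-validation folds, a step the code does not perform.

```python
from collections import Counter
from math import log2


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    total = len(values)
    return -sum((c / total) * log2(c / total) for c in Counter(values).values())


def information_gain_ratio(feature, labels):
    """Information gain of `labels` given `feature`, divided by the
    entropy of `feature` (the split information)."""
    total = len(labels)
    conditional = 0.0
    for value, count in Counter(feature).items():
        subset = [y for x, y in zip(feature, labels) if x == value]
        conditional += (count / total) * entropy(subset)
    gain = entropy(labels) - conditional
    split_info = entropy(feature)
    return gain / split_info if split_info > 0 else 0.0
```

A factor whose classes separate fire from nonfire samples perfectly scores 1, and a factor that carries no information about the label scores 0, which corresponds to the "null" contribution mentioned above.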

4.2 Model Accuracy

The training process was divided into training and validation phases. The validation dataset was used to monitor the model's classification performance at the end of each epoch, and the validation results were used to decide whether training should be terminated early or the hyperparameters fine-tuned. Loss and accuracy were the two main indicators used to evaluate model training. The callback functions "EarlyStopping," "ReduceLROnPlateau," and "ModelCheckpoint" were applied to adjust the training state and record statistics during model training.

Figure 6 shows the loss and accuracy for the training and validation phases, visualized with TensorBoard. By the end of the training phase, the training accuracy was close to 91%; the training loss stopped decreasing after the 100th epoch and leveled off, with a minimum value of 0.3. The validation loss stopped decreasing after the 80th epoch and tended to converge. However, it rose slightly after the 120th epoch, suggesting that overfitting may have begun. At that point, the EarlyStopping function ended the training process to limit overfitting. The validation accuracy of the model reached 82%, and the minimum validation loss was 0.45.

The slow decline of the validation loss curve indicates that the model was well fitted. Substantial overfitting was avoided because the training process was terminated by the "EarlyStopping" function. After the training process, the model saved in Check_pointer at the epoch with the lowest validation loss was selected as the final classification model.
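The interplay between early stopping and checkpointing described above can be illustrated without any DL framework. The sketch below is not the Keras implementation: it is a plain-Python stand-in that takes a list of per-epoch validation losses, remembers the epoch with the lowest loss (the role of the checkpoint), and stops once the loss has failed to improve for `patience` consecutive epochs. The function name and the patience values are assumptions; the source does not report the patience used.

```python
def train_with_early_stopping(validation_losses, patience=10):
    """Return (best_epoch, stop_epoch) for a sequence of validation losses.

    best_epoch is the 1-based epoch with the lowest validation loss, i.e.
    the epoch whose weights a checkpoint would keep. stop_epoch is the epoch
    at which training ends, either by early stopping or by exhausting the list.
    """
    best_loss = float("inf")
    best_epoch = None
    epochs_without_improvement = 0
    for epoch, loss in enumerate(validation_losses, start=1):
        if loss < best_loss:
            # Checkpoint-like step: remember the best epoch so far.
            best_loss, best_epoch = loss, epoch
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                return best_epoch, epoch
    return best_epoch, len(validation_losses)
```

Selecting the checkpointed epoch rather than the last one means that the slight rise in validation loss before stopping does not affect the final model.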

4.3 Susceptibility Map

After the training process, a classification model was obtained. A test dataset, the VRD of 2010, was then used at the end of the project to evaluate the performance of the final model. The VRD of 2010 was fed into the classification model to predict forest fire susceptibility. The model produced probabilities for both the fire and nonfire classes for each pixel in the generated prediction map. Notably, although the CNN was designed to predict a single label from a small input patch of size 25 × 25 × 11, it could predict all pixels in the new VRD because the sliding window overlapped densely and covered the entire VRD during the inference phase. The probabilities of the fire and nonfire classes summed to 1 for each pixel. The probability of the fire class was taken as the final predicted probability, and a map of the probability values for all pixels was produced by converting the array data into an image using the libtiff package in Python. The forest fire susceptibility map was finally produced by dividing the image into five levels using the natural breaks method in ArcGIS 10.5. Five susceptibility classes (very low, low, moderate, high, and very high) were identified for constructing the forest fire susceptibility map (Fig. 7).

Fig. 6 a, b Training accuracy and the convergence graph of the training loss, respectively; c, d validation accuracy and the convergence graph of the validation loss, respectively
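The dense sliding-window inference described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' pipeline: `predict_fn` is a hypothetical stand-in for the trained CNN that maps a batch of patches to two class probabilities (nonfire, fire), and the reflect padding at the raster border is an assumed choice, since the source does not say how edge pixels were handled.

```python
import numpy as np


def predict_probability_map(raster, predict_fn, window=25):
    """Predict a fire probability for every pixel of a factor raster.

    raster: array of shape (H, W, C), one channel per influencing factor.
    predict_fn: callable mapping (N, window, window, C) patches to (N, 2)
        class probabilities ordered as (nonfire, fire).
    Returns an (H, W) array holding the fire-class probability.
    """
    half = window // 2
    # Pad so that a full window can be centered on every original pixel.
    padded = np.pad(raster, ((half, half), (half, half), (0, 0)), mode="reflect")
    # Overlapping windows, shape (H, W, C, window, window).
    windows = np.lib.stride_tricks.sliding_window_view(
        padded, (window, window), axis=(0, 1)
    )
    height, width = raster.shape[:2]
    prob_map = np.empty((height, width), dtype=np.float32)
    for row in range(height):
        # (W, C, window, window) -> (W, window, window, C)
        patches = np.moveaxis(windows[row], 1, -1)
        prob_map[row] = predict_fn(patches)[:, 1]
    return prob_map
```

Processing one raster row at a time keeps memory use bounded, because the full set of overlapping 25 × 25 × 11 patches for a province-scale raster would be far larger than the raster itself.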

From the predicted susceptibility results, we can see that

the highest forest ﬁre ignition susceptibilities are mainly

distributed in the south and northwest of Yunnan Province.

The areas with the lowest forest ﬁre ignition susceptibilities

are in the central and northeastern parts of Yunnan.

4.4 Model Comparison

The performance of the proposed model was compared with four benchmark methods: random forests (RF), support vector machine (SVM), multilayer perceptron neural network (MLP), and kernel logistic regression (KLR). The four benchmark methods were built and implemented using the scikit-learn package, and the grid search method was used to optimize the hyperparameters. The main hyperparameters utilized in the benchmark methods are listed in Table 5. Recursive feature elimination with cross-validation was employed to automatically tune the number of features selected for SVM and KLR.

The performance of the five models was evaluated and compared for both the training and validation datasets of 2009. Table 6 shows that, for the training dataset, the CNN had the highest specificity (93.77%), sensitivity (97.84%), PPV (94.02%), NPV (97.75%), and overall accuracy (95.81%) of the five models. The proposed CNN model also had the highest overall accuracy (87.92%) for the validation dataset, followed by the RF (84.36%), KLR (81.23%), SVM (80.04%), and MLP (78.47%).
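The five measures quoted above follow directly from the TP, TN, FP, and FN counts in Table 6. The short sketch below recomputes them for the CNN using the standard definitions; the function name is illustrative, and the counts are copied from Table 6.

```python
def confusion_metrics(tp, tn, fp, fn):
    """Return sensitivity, specificity, PPV, NPV, and accuracy in percent."""
    return {
        "sensitivity": 100.0 * tp / (tp + fn),
        "specificity": 100.0 * tn / (tn + fp),
        "ppv": 100.0 * tp / (tp + fp),
        "npv": 100.0 * tn / (tn + fn),
        "accuracy": 100.0 * (tp + tn) / (tp + tn + fp + fn),
    }


# CNN counts from Table 6 (training and validation phases).
cnn_training = confusion_metrics(tp=3126, tn=2996, fp=199, fn=69)
cnn_validation = confusion_metrics(tp=739, tn=666, fp=133, fn=60)
```

Rounded to two decimals, these calls reproduce the CNN column of Table 6, for example a training sensitivity of 97.84% and a validation accuracy of 87.92%.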

Figure 8 illustrates the ignition susceptibility maps constructed by the benchmark models using the same test datasets. The values denote the probability of an ignition occurring in each pixel. To make the prediction results of the CNN and benchmark models comparable, each probability map was divided into five classes with the same natural breaks method used for the CNN. In the prediction maps of SVM, MLP, and KLR, the forest fire susceptibility of most areas in the northwest of Yunnan Province was categorized as very low or low. In contrast, CNN and RF identified the ignition probabilities in the northwest well. For the southern region of Yunnan Province, compared with the predicted results of the CNN model, the RF model assigned most of southern Yunnan to the very high susceptibility class, while the SVM, MLP, and KLR models predicted high susceptibility only in the southwest region of Yunnan Province. The high fire occurrence region in the southeast was not clearly identified in the predicted results.

Figure 9 shows that the very high and very low susceptibility classes in the CNN model account for 77.51% of the total area, the highest proportion among all the models, whereas the remaining three classes of high, moderate, and low have the lowest proportions among all models, accounting for 7.15%, 6.33%, and 9.01% of the total area, respectively. The results show that the proposed CNN model can effectively separate the very high and very low susceptibility zones. In contrast, the probability predictions of the benchmark models placed many areas in the medium and high susceptibility zones, with no clear delineation of the zones with high forest fire susceptibility; thus, the threshold segmentation method used to divide the fire warning areas needs to be considered. Most traditional ML methods face this problem once the probability prediction results are obtained. The probability results obtained by the CNN are more distinct in their spatial pattern, and the statistical values show a bipolar distribution of high and low values. This is more advantageous for separating the areas with fire warnings from those without, and it reduces the influence of threshold segmentation on the delineation of fire warning areas. Thus, the result of the CNN model is better suited to forest fire prediction.

To confirm the statistical significance of the difference in prediction performance between the proposed CNN model and the benchmark models, the nonparametric WSRT (Wilcoxon 1945) was employed for paired comparisons. The null hypothesis ($H_0$) was that there was no significant difference between the two prediction models at the 95% confidence level. If the p value was less than the significance level (0.05), $H_0$ was rejected, indicating a significant difference between the models (Tien Bui et al. 2019). The analysis was performed using SPSS (Statistical Package for the Social Sciences), and the Wilcoxon signed-rank test (WSRT) results are reported in Table 7. The p values of all pairwise comparisons were less than 0.05, confirming that the classification performances of the proposed model and benchmark models are significantly different.

Fig. 7 Forest fire susceptibility map derived from the convolutional neural network (CNN) model for Yunnan Province, China, in 2010
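For readers unfamiliar with the test, the sketch below shows the mechanics of a two-tailed Wilcoxon signed-rank test on paired values, such as the outputs of two models on the same samples. It is a simplified illustration and not a reproduction of the SPSS procedure: it uses the large-sample normal approximation without tie or continuity corrections, the function names are hypothetical, and which paired quantities were compared in the study is not stated in this section.

```python
from math import erfc, sqrt


def average_ranks(values):
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def wilcoxon_signed_rank(x, y):
    """Return (W, z, p) for paired samples x and y (two-tailed).

    Zero differences are dropped; at least one nonzero difference is required.
    The p value uses the normal approximation to the distribution of W.
    """
    diffs = [a - b for a, b in zip(x, y) if a != b]
    n = len(diffs)
    ranks = average_ranks([abs(d) for d in diffs])
    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    w_minus = sum(r for r, d in zip(ranks, diffs) if d < 0)
    w = min(w_plus, w_minus)
    mean = n * (n + 1) / 4.0
    sd = sqrt(n * (n + 1) * (2 * n + 1) / 24.0)
    z = (w - mean) / sd
    p = erfc(abs(z) / sqrt(2.0))
    return w, z, p
```

A p value below 0.05 from such a test leads to rejecting the hypothesis of no difference between the two paired sets of values.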

The ROC curves for the five models are depicted in Fig. 10. They were drawn using the 2010 prediction maps of the ignition probabilities and the corresponding IRD. The AUC provides a single measure of a classifier's performance for judging which model is better on average. The AUC of the CNN model is 0.86 (Fig. 10), indicating that the global fit of the model to the testing dataset is 86%, followed by RF (0.82), SVM (0.79), MLP (0.78), and KLR (0.78). The ROC curves clearly show that the CNN model has the highest prediction performance.

Fig. 8 Prediction maps of the ignition probabilities for the benchmark models for Yunnan Province, China. KLR kernel logistic regression; MLP multilayer perceptron neural network; RF random forests; SVM support vector machine

5 Discussion

This section ﬁrst describes the advantages of CNN com-

pared with benchmark methods, then discusses the selec-

tion of the model architecture and window size, and ﬁnally

discusses the differences between the CNN and traditional

ML algorithms.

5.1 Advantage of the Convolutional Neural Network

Method

Compared with the benchmark methods, the CNN has the following advantages. First, because the CNN can consider the correlation of adjacent spatial information, it is well suited to problems with spatial and geographical correlation characteristics. Second, the CNN preserves the spatial relationships between pixels by learning internal feature representations from the factor vectors. The DL process reveals deep features and can distinguish between different geographical units. The CNN applies multiple convolution and pooling operations to extract these characteristics. As the number of convolution and pooling operations increases, the features become more advanced and more abstract. These abstract features depict the degree of forest fire susceptibility and are decisive for determining it. Third, the CNN reduces the number of weights that need to be trained, and hence the computational complexity of the network, through weight sharing.
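The effect of weight sharing can be made concrete with a small parameter count. The sketch below compares a convolution layer with 3 × 3 kernels and 64 filters (values from Table 3) applied to a 25 × 25 × 11 input patch against a hypothetical fully connected layer that maps the same input to an output of the same spatial size. The comparison layer, the assumption that the spatial size is preserved by padding, and the function names are illustrative and not taken from the source.

```python
def conv2d_params(kernel_size, in_channels, out_channels):
    """Trainable weights and biases of a 2D convolution layer.

    The count does not depend on the input width or height, because the
    same kernels are reused at every position (weight sharing).
    """
    return kernel_size * kernel_size * in_channels * out_channels + out_channels


def dense_params(n_inputs, n_outputs):
    """Trainable weights and biases of a fully connected layer."""
    return n_inputs * n_outputs + n_outputs


window, factors, kernels = 25, 11, 64
conv = conv2d_params(3, factors, kernels)
dense = dense_params(window * window * factors, window * window * kernels)
```

Under these assumptions the convolution layer has 6,400 trainable parameters, whereas the equivalent fully connected layer would need 275,040,000, which illustrates why weight sharing reduces both the number of weights and the computational cost.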

5.2 Model Sensitivity

The architecture of the CNN model should be selected in accordance with the quantity of sample data and the complexity of the problem. With a small quantity of sample data and a simple classification problem, a complex structure is prone to overfitting. With a large quantity of sample data and a complex classification problem, a simple structure is prone to nonconvergence. Both of these problems should be avoided as much as possible when training the CNN model.

The window size should be consistent with the maximum geospatial area that affects the forest fire susceptibility of the center pixel of the window. A larger window size means that the pixels of a larger geographical unit influence the center pixel. In practice, the pixels that affect the fire susceptibility of the center pixel lie within a limited geographical range. Therefore, the window size must be chosen appropriately. In our experiments, the 25 × 25 window size was the most reasonable. First, this size corresponds to the maximum geographical range that affects the fire susceptibility of a pixel. Second, this window is smaller than those commonly used in image processing (for example, 224 × 224) because forest fire probability prediction and image classification are completely different applications, and a CNN model designed for image processing cannot be copied directly.

Table 5 A list of the main hyperparameters utilized in the benchmark methods

| Benchmark method | Hyperparameters |
|---|---|
| RF | Number of trees: 160; max_features: "sqrt"; bootstrap: True; max_depth = 20 |
| SVM | Penalty factor (C): 100; kernel function = "RBF"; gamma = 1 |
| MLP | Number of hidden layers: 1; momentum: 0.2; learning rate: 0.001; iteration = 300; alpha = 0.01; solver = "Adam"; activation = "ReLU" |
| KLR | Kernel function = "RBF"; tuning parameter (δ): 0.02; regularize parameter (C): 0.025 |

RF random forests; SVM support vector machine; MLP multilayer perceptron neural network; KLR kernel logistic regression; RBF radial basis function; ReLU rectified linear unit

Fig. 9 Percentages of different forest fire susceptibility classes. CNN convolutional neural network; RF random forests; SVM support vector machine; MLP multilayer perceptron neural network; KLR kernel logistic regression. [Bar chart; horizontal axis: forest fire susceptibility classes (very low, low, moderate, high, very high); vertical axis: percent of susceptibility classes, 0.00% to 70.00%]

It is necessary to discuss the differences between the CNN and traditional ML algorithms. The first is the dependency of the CNN model on data and hardware. The CNN requires a large number of training samples, and its performance increases as the scale of data increases. Because the CNN inherently performs a large number of matrix multiplication operations, it relies heavily on high-end machines, whereas traditional ML algorithms can run on low-end machines. The second is that the CNN can automatically explore high-level features from raw data. This is a very distinctive part of DL and a major step ahead of traditional ML. Moreover, the CNN model shows a strong generalization ability compared with the benchmark methods. In terms of time efficiency, because the CNN has a large number of parameters to learn, its training time is longer than that of traditional ML models. Because a trained CNN can be reused, there is still considerable room for improving the efficiency of later training. The prediction time of the CNN is relatively short when using GPU-accelerated computing technology. Finally, although the CNN has shown excellent performance, its limited interpretability is a deficiency that needs further study.

Table 6 Training and validation performance comparison

| Phase | Performance | CNN | RF | SVM | MLP | KLR |
|---|---|---|---|---|---|---|
| Training | TP | 3126 | 3053 | 2860 | 2800 | 2940 |
| | TN | 2996 | 2796 | 2322 | 2349 | 2372 |
| | FP | 199 | 399 | 873 | 846 | 823 |
| | FN | 69 | 142 | 335 | 395 | 345 |
| | Sensitivity (%) | 97.84 | 95.56 | 89.51 | 87.64 | 89.5 |
| | Specificity (%) | 93.77 | 87.51 | 72.68 | 73.52 | 74.24 |
| | PPV (%) | 94.02 | 88.44 | 76.61 | 76.8 | 78.13 |
| | NPV (%) | 97.75 | 95.17 | 87.39 | 85.6 | 87.3 |
| | Overall accuracy (%) | 95.81 | 91.53 | 81.1 | 80.58 | 81.98 |
| Validation | TP | 739 | 580 | 532 | 491 | 577 |
| | TN | 666 | 768 | 747 | 763 | 721 |
| | FP | 133 | 31 | 52 | 36 | 78 |
| | FN | 60 | 219 | 267 | 308 | 222 |
| | Sensitivity (%) | 92.49 | 72.59 | 66.58 | 61.45 | 72.22 |
| | Specificity (%) | 83.35 | 96.12 | 93.49 | 95.49 | 90.24 |
| | PPV (%) | 84.75 | 94.93 | 91.1 | 93.17 | 88.09 |
| | NPV (%) | 91.74 | 77.81 | 73.67 | 71.24 | 76.46 |
| | Overall accuracy (%) | 87.92 | 84.36 | 80.04 | 78.47 | 81.23 |

CNN convolutional neural network; FP false positive; FN false negative; KLR kernel logistic regression; MLP multilayer perceptron neural network; NPV negative predictive value; PPV positive predictive value; RF random forests; SVM support vector machine; TN true negative; TP true positive

Table 7 Wilcoxon signed-rank test (two-tailed)

| No. | Pairwise comparison | p value | Significance |
|---|---|---|---|
| 1 | CNN versus RF | < 0.0001 | Yes |
| 2 | CNN versus SVM | < 0.0001 | Yes |
| 3 | CNN versus MLP | < 0.0001 | Yes |
| 4 | CNN versus KLR | < 0.0001 | Yes |

CNN convolutional neural network; RF random forests; SVM support vector machine; MLP multilayer perceptron neural network; KLR kernel logistic regression

Fig. 10 Receiver operating characteristic (ROC) curves and areas under the curve (AUCs) of the five models. CNN convolutional neural network; RF random forests; SVM support vector machine; MLP multilayer perceptron neural network; KLR kernel logistic regression

6 Conclusion

In this article, we investigated a CNN with deep architec-

tures for the spatial prediction of forest ﬁre susceptibility in

Yunnan Province, China. Past forest ﬁre locations from

2002 to 2010 were extracted and a set of 14 forest ﬁre

inﬂuencing factors were optimized using multicollinearity

analysis and the IGR technique. We explored the prepro-

cessing methods for forest ﬁre inﬂuencing factors and the

methods for generating effective training/validation sample

libraries. The CNN architecture suitable for the prediction

of forest ﬁre susceptibility in the study area was designed,

and hyperparameters were optimized to improve the pre-

diction accuracy. Several common methods, such as more

training samples, regularization (dropout), batch normal-

ization, and reduced architecture complexity, were used in

the CNN model to mitigate overﬁtting. Then, the test

dataset was fed into the trained model and the prediction

map of ignition probabilities was constructed by the CNN

model. Finally, the performance of the proposed model was

compared with traditional ML methods using several sta-

tistical measures, including WSRT, ROC, and AUC.

Through this research, we found that the CNN model

performs better than the benchmark methods. The CNN

model (AUC=0.86) has higher predictive power than the

benchmark methods according to the ROC–AUC. The

probability result obtained by the CNN can clearly distin-

guish the very high and very low susceptible zones, and the

susceptibility spatial pattern was very distinct. The CNN

model shows a strong generalization ability and the pre-

diction time of the CNN was relatively short when using

GPU-accelerated computing technology. In conclusion, the

CNN has the advantages of considering neighborhood

information, extracting deep features, sharing weights, and

pooling operations, which allow the CNN to obtain better

prediction results. The CNN model will have important

practical application value for forest ﬁre prevention plan-

ning and forest management.

There are still some limitations in the research. For example, the influence of different CNN architectures, such as VGG-net (Visual Geometry Group Network), ResNet (Residual Network), and GoogLeNet, on forest fire prediction results has not been studied in depth. In addition, more actual data are needed for the experimental verification of the method. In recent years, the application of CNNs has become increasingly extensive. Many different variants of the architecture have been derived and many ensemble classifiers have been proposed. Comparing various classifiers and exploring the most suitable models to improve forest fire prediction should be investigated in the future.

Acknowledgements This research was supported by the National

Key Research and Development Plan (2017YFC1502902) and

National Natural Science Foundation of China (41621601). The

ﬁnancial support is highly appreciated. We thank Yinxue Cao for her

help in getting the data from Climate Forecast System Reanalysis. We

are also grateful to the anonymous reviewers and the editors for their

constructive comments.

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

References

Adab, H., K.D. Kanniah, and K. Solaimani. 2013. Modeling forest ﬁre

risk in the northeast of Iran using remote sensing and GIS

techniques. Natural Hazards 65(3): 1723–1743.

Arpaci, A., B. Malowerschnig, O. Sass, and H. Vacik. 2014. Using

multi variate data mining techniques for estimating ﬁre suscep-

tibility of Tyrolean forests. Applied Geography 53: 258–270.

Bajocco, S., E. Dragoz, I. Gitas, D. Smiraglia, L. Salvati, and C.

Ricotta. 2015. Mapping forest fuels through vegetation phenol-

ogy: The role of coarse-resolution satellite time-series. PLoS

ONE 10(3): 1–14.

Bar Massada, A., A.D. Syphard, S.I. Stewart, and V.C. Radeloff. 2013. Wildfire ignition-distribution modelling: A comparative study in the Huron–Manistee National Forest, Michigan, USA. International Journal of Wildland Fire 22(2): 174–183.

Bergstra, J., and Y. Bengio. 2012. Random search for hyper-

parameter optimization. Journal of Machine Learning Research

13(1): 281–305.

Bisquert, M., E. Caselles, J.M. Sánchez, and V. Caselles. 2012. Application of artificial neural networks and logistic regression to the prediction of forest fire danger in Galicia using MODIS data. International Journal of Wildland Fire 21(8): 1025–1029.

Buda, M., A. Maki, and M.A. Mazurowski. 2018. A systematic study

of the class imbalance problem in convolutional neural networks.

Neural Networks 106: 249–259.

Bui, D.T., K.T.T. Le, V.C. Nguyen, H.D. Le, and I. Revhaug. 2016.

Tropical forest ﬁre susceptibility mapping at the Cat Ba National

Park area, Hai Phong City, Vietnam, using GIS-based Kernel

logistic regression. Remote Sensing 8(4): 1–15.

Bui, D.T., T.A. Tuan, H. Klempe, B. Pradhan, and I. Revhaug. 2016.

Spatial prediction models for shallow landslide hazards: A

comparative assessment of the efﬁcacy of support vector

machines, artiﬁcial neural networks, kernel logistic regression,

and logistic model tree. Landslides 13(2): 361–378.

Cao, Y., M. Wang, and K. Liu. 2017. Wildﬁre susceptibility

assessment in Southern China: A comparison of multiple

methods. International Journal of Disaster Risk Science 8(2):

164–181.


Colkesen, I., E.K. Sahin, and T. Kavzoglu. 2016. Susceptibility

mapping of shallow landslides using kernel-based Gaussian

process, support vector machines and logistic regression. Journal

of African Earth Sciences 118: 53–64.

Crimmins, M.A. 2006. Synoptic climatology of extreme ﬁre-weather

conditions across the southwest United States. International

Journal of Climatology 26(8): 1001–1016.

Dash, M., and H. Liu. 1997. Feature selection for classiﬁcation.

Intelligent Data Analysis 1(1–4): 131–156.

Dimuccio, L.A., R. Ferreira, L. Cunha, and A. Campar de Almeida.

2011. Regional forest-ﬁre susceptibility analysis in central

Portugal using a probabilistic ratings procedure and artiﬁcial

neural network weights assignment. International Journal of

Wildland Fire 20(6): 776–791.

Elmas, Ç., and Y. Sönmez. 2011. A data fusion framework with novel hybrid algorithm for multi-agent Decision Support System for Forest Fire. Expert Systems with Applications 38(8): 9225–9236.

Freeman, E.A., and G.G. Moisen. 2008. A comparison of the

performance of threshold criteria for binary classiﬁcation in

terms of predicted prevalence and kappa. Ecological Modelling

217(1): 48–58.

Guo, F., Z. Su, G. Wang, L. Sun, F. Lin, and A. Liu. 2016. Wildﬁre

ignition in the forests of southeast China: Identifying drivers and

spatial distribution to predict wildﬁre likelihood. Applied Geog-

raphy 66: 12–21.

Hantson, S., S. Pueyo, and E. Chuvieco. 2015. Global ﬁre size

distribution is driven by human impact and climate. Global

Ecology and Biogeography 24(1): 77–86.

Hinton, G.E., and R.R. Salakhutdinov. 2006. Reducing the dimen-

sionality of data with neural networks. Science 313(5786): 504–

507.

Hong, H., A. Jaafari, and E.K. Zenner. 2019. Predicting spatial

patterns of wildﬁre susceptibility in the Huichang County,

China: An integrated model to analysis of landscape indicators.

Ecological Indicators 101: 878–891.

Hong, H., B. Pradhan, C. Xu, and D. Tien Bui. 2015. Spatial

prediction of landslide hazard at the Yihuang area (China) using

two-class kernel logistic regression, alternating decision tree and

support vector machines. Catena 133: 266–281.

Hong, H., P. Tsangaratos, I. Ilia, J. Liu, A.X. Zhu, and C. Xu. 2018.

Applying genetic algorithms to set the optimal combination of

forest ﬁre related variables and model forest ﬁre susceptibility

based on data mining models. The case of Dayu County, China.

Science of the Total Environment 630: 1044–1056.

Hu, F., G.S. Xia, J. Hu, and L. Zhang. 2015. Transferring deep

convolutional neural networks for the scene classiﬁcation of

high-resolution remote sensing imagery. Remote Sensing 7(11):

14680–14707.

Jaafari, A., E.K. Zenner, M. Panahi, and H. Shahabi. 2019. Hybrid

artiﬁcial intelligence models based on a neuro-fuzzy system and

metaheuristic optimization algorithms for spatial prediction of

wildﬁre probability. Agricultural and Forest Meteorology 266–

267: 198–207.

Jaafari, A., E.K. Zenner, and B.T. Pham. 2018. Wildﬁre spatial

pattern analysis in the Zagros Mountains, Iran: A comparative

study of decision tree based classiﬁers. Ecological Informatics

43: 200–211.

Kim, S.J., C.H. Lim, G.S. Kim, J. Lee, T. Geiger, O. Rahmati, Y. Son,

and W.K. Lee. 2019. Multi-temporal analysis of forest ﬁre

probability using socio-economic and environmental variables.

Remote Sensing 11(1): Article 86.

Kingma, D.P., and J. Ba. 2014. Adam: A method for stochastic

optimization. Presented as a conference paper at the 3rd

International Conference for Learning Representations, San

Diego, 2015. arXiv preprint abs:1412.6980. Ithaca, NY: Cornell

University.

Krizhevsky, A., I. Sutskever, G.E. Hinton. 2013. ImageNet classiﬁ-

cation with deep convolutional neural networks. In Proceedings

of the 26th Annual Conference on Neural Information Processing

Systems (NIPS), 3–6 December 2012, Lake Tahoe, Nevada,

USA, ed. F. Pereira, C.J.C. Burges, L. Bottou, and K.Q.

Weinberger, Vol. 2, 1097–1105.

Lecun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. Nature

521(7553): 436–444.

Lecun, Y., L. Bottou, Y. Bengio, and P. Haffner. 1998. Gradient-

based learning applied to document recognition. Proceedings of

IEEE 86(11): 2278–2324.

Leuenberger, M., J. Parente, M. Tonini, M.G. Pereira, and M.

Kanevski. 2018. Wildﬁre susceptibility mapping: Deterministic

vs. stochastic approaches. Environmental Modelling and Soft-

ware 101: 194–203.

Liu, T., and A. Abd-Elrahman. 2018. Deep convolutional neural

network training enrichment using multi-view object-based

analysis of Unmanned Aerial systems imagery for wetlands

classiﬁcation. ISPRS Journal of Photogrammetry and Remote

Sensing 139: 154–170.

López, V., A. Fernández, S. García, V. Palade, and F. Herrera. 2013. An insight into classification with imbalanced data: Empirical results and current trends on using data intrinsic characteristics. Information Sciences 250: 113–141.

Moritz, M. A., M.-A. Parisien, E. Batllori, M. A. Krawchuk, J. Van

Dorn, D.J. Ganz, and K. Hayhoe. 2012. Climate change and

disruptions to global ﬁre activity. Ecosphere 3(6): Article 49.

Muhammad, K., J. Ahmad, and S.W. Baik. 2018. Early ﬁre detection

using convolutional neural networks during surveillance for

effective disaster management. Neurocomputing 288(C): 30–42.

Ngoc Thach, N., D. Bao-Toan Ngo, P. Xuan-Canh, N. Hong-Thi, B.

Hang Thi, H. Nhat-Duc, and T.B. Dieu. 2018. Spatial pattern

assessment of tropical forest ﬁre danger at Thuan Chau area

(Vietnam) using GIS-based advanced machine learning algo-

rithms: A comparative study. Ecological Informatics 46: 74–85.

O’Brien, R.M. 2007. A caution regarding rules of thumb for variance

inﬂation factors. Quality and Quantity 41(5): 673–690.

Oliveira, S., F. Oehler, J. San-Miguel-Ayanz, A. Camia, and J.M.C.

Pereira. 2012. Modeling spatial patterns of ﬁre occurrence in

Mediterranean Europe using Multiple Regression and Random

Forest. Forest Ecology and Management 275: 117–129.

Pew, K.L., and C.P.S. Larsen. 2001. GIS analysis of spatial and

temporal patterns of human-caused wildﬁres in the temperate

rain forest of Vancouver Island, Canada. Forest Ecology and

Management 140(1): 1–18.

Pourtaghi, Z.S., H.R. Pourghasemi, R. Aretano, and T. Semeraro.

2016. Investigation of general indicators inﬂuencing on forest

ﬁre and its susceptibility modeling using different data mining

techniques. Ecological Indicators 64: 72–84.

Renard, Q., R. P

´lissier, B.R. Ramesh, and N. Kodandapani. 2012.

Environmental susceptibility model for predicting forest ﬁre

occurrence in the Western Ghats of India. International Journal

of Wildland Fire 21(4): 368–379.

Running, S.W. 2006. Is global warming causing more, larger

wildﬁres? Science 313(5789): 927–928.

Sachdeva, S., T. Bhatia, and A.K. Verma. 2018. GIS-based evolu-

tionary optimized Gradient Boosted Decision Trees for forest ﬁre

susceptibility mapping. Natural Hazards 92(3): 1399–1418.

Satir, O., S. Berberoglu, and C. Donmez. 2016. Mapping regional

forest ﬁre probability using artiﬁcial neural network model in a

Mediterranean forest ecosystem. Geomatics, Natural Hazards

and Risk 7(5): 1645–1658.

Schmidhuber, J. 2015. Deep Learning in neural networks: An

overview. Neural Networks 61: 85–117.

123

Int J Disaster Risk Sci

Sokolova, M., and G. Lapalme. 2009. A systematic analysis of

performance measures for classiﬁcation tasks. Information

Processing and Management 45(4): 427–437.

Tien Bui, D., Q.T. Bui, Q.P. Nguyen, B. Pradhan, H. Nampak, and P.

T. Trinh. 2017. A hybrid artiﬁcial intelligence approach using

GIS-based neural-fuzzy inference system and particle swarm

optimization for forest ﬁre susceptibility modeling at a tropical

area. Agricultural and Forest Meteorology 233: 32–44.

Tien Bui, D., N.D. Hoang, and P. Samui. 2019. Spatial pattern

analysis and prediction of forest ﬁre using new machine learning

approach of Multivariate Adaptive Regression Splines and

Differential Flower Pollination optimization: A case study at

Lao Cai province (Viet Nam). Journal of Environmental

Management 237: 476–487.

Tobler, W.R. 1970. A computer movie simulating urban growth in the

Detroit region. Economic Geography 46(sup1): 234–240.

Vasconcelos, M.J. P. de, S. Silva, M. Tome

´, M. Alvim, and J. M. C.

Perelra. 2001. Spatial Prediction of Fire Ignition Probabilities:

Comparing Logistic Regression and Neural Networks. Pho-

togrammetric Engineering & Remote Sensing 67(1): 73–81.

Vetrivel, A., M. Gerke, N. Kerle, F. Nex, and G. Vosselman. 2018.

Disaster damage detection through synergistic use of deep

learning and 3D point cloud features derived from very high

resolution oblique aerial images, and multiple-kernel-learning.

ISPRS Journal of Photogrammetry and Remote Sensing 140: 45–

59.

Wang, Y., Z. Fang, and H. Hong. 2019. Comparison of convolutional

neural networks for landslide susceptibility mapping in Yanshan

County, China. Science of the Total Environment 666: 975–993.

Wilcoxon, F. 1945. Individual comparisons by ranking methods.

Biometrics Bulletin 1(6):80–83.

Yamashita, R., M. Nishio, R K.G. Do, and K. Togashi. 2018.

Convolutional neural networks: An overview and application in

radiology. Insights into Imaging 9(4): 611–629.

Yi, K., H. Tani, J. Zhang, M. Guo, X. Wang, and G. Zhong. 2013.

Long-term satellite detection of post-ﬁre vegetation trends in

boreal forests of China. Remote Sensing 5(12): 6938–6957.

Ying, L., J. Han, Y. Du, and Z. Shen. 2018. Forest ﬁre characteristics

in China: Spatial patterns and determinants with thresholds.

Forest Ecology and Management 424: 345–354.

Zhang, X. 2007. Vegetation map of the People’s Republic of China

(1:1000000). Beijing: Geology Press (in Chinese).

Zhang, C., X. Pan, H. Li, A. Gardiner, I. Sargent, J. Hare, and P.M.

Atkinson. 2018. A hybrid MLP-CNN classiﬁer for very ﬁne

resolution remotely sensed image classiﬁcation. ISPRS Journal

of Photogrammetry and Remote Sensing 140: 133–144.

123


Wildfire Susceptibility Prediction Based on a CA-Based CCNN with Active Learning Optimization

Article

Full-text available

Jun 2024

Wildfires cause great losses to the ecological environment, economy, and people’s safety and belongings. As a result, it is crucial to establish wildfire susceptibility models and delineate fire risk levels. It has been proven that the use of remote sensing data, such as meteorological and topographical data, can effectively predict and evaluate wildfire susceptibility. Accordingly, this paper converts meteorological and topographical data into fire-influencing factor raster maps for wildfire susceptibility prediction. The continuous convolutional neural network (CCNN for short) based on coordinate attention (CA for short) can aggregate different location information into channels of the network so as to enhance the feature expression ability; moreover, for different patches with different resolutions, the improved CCNN model does not need to change the structural parameters of the network, which improves the flexibility of the network application in different forest areas. In order to reduce the annotation of training samples, we adopt an active learning method to learn positive features by selecting high-confidence samples, which contributes to enhancing the discriminative ability of the network. We use fire probabilities output from the model to evaluate fire risk levels and generate the fire susceptibility map. Taking Chongqing Municipality in China as an example, the experimental results show that the CA-based CCNN model has a better classification performance; the accuracy reaches 91.7%, and AUC reaches 0.9487, which is 5.1% and 2.09% higher than the optimal comparative method, respectively. Furthermore, if an accuracy of about 86% is desired, our method only requires 50% of labeled samples and thus saves about 20% and 40% of the labeling efforts compared to the other two methods, respectively. 
Ultimately, the proposed model achieves the balance of high prediction accuracy and low annotation cost and is more helpful in classifying fire high warning zones and fire-free zones.

Framework to Create Dataset for Disaster Behavior Analysis using Google Earth Engine: A Case Study in Peninsular Malaysia for Historical Forest Fire Behavior Analysis

Preprint

Full-text available

Mar 2024

This research presents a comprehensive framework for efficiently generating forest fire datasets from Google Earth Engine data sources. The primary contribution of this work lies in providing a methodology to swiftly extract forest fire factors without the need for permissions or access to private datasets, rendering the dataset openly accessible and shared without barriers. Furthermore, given that the remote sensing data used is a global dataset, it can be applied in any region without restrictions. In this study, Peninsular Malaysia is chosen as a case study to demonstrate the framework's effectiveness. The generated dataset includes essential variables including the climate and environment, landcover, topography, and anthropogenic factors facilitating the analysis of fire occurrences. The methodology empowers data scientists, enabling them to leverage their analytical skills on the extracted dataset without requiring specialized remote sensing knowledge. Additionally, this study also showcases the adoption of large language models, specifically GPT-4 with the Noteable plugin, as a tool for conducting preliminary analyses on the generated dataset. Sample analyses reveal that several key features, including the KBDI, LST, PDSI, climate water deficit, and precipitation, significantly impact forest fire occurrences in Peninsular Malaysia. Despite the successful application of the GPT-4 with Noteable plugin, certain limitations and challenges are identified, highlighting the necessity for further validation of the tool's applicability and limitations. This study encourages future research to (1) adopt the proposed framework in other regions, (2) explore more detailed analyses encompassing all variables, and (3) leverage machine learning for advanced forecasting.

Modelling Current and Future Forest Fire Susceptibility in north-east Germany

Preprint

Full-text available

May 2024

Preventing and fighting forest fires has been a challenge worldwide in recent decades. Forest fires alter forest structure and composition, threaten people’s livelihoods, and lead to economic losses, as well as soil erosion and desertification. Climate change and related drought events, paired with anthropogenic activities, have magnified the intensity and frequency of forest fires. It is crucial to identify the conditions that cause the emergence and spread of forest fires to improve prevention and management. We applied Random Forest (RF) machine learning (ML) algorithm to model current and future forest fire susceptibility (FFS) in the federal state of Brandenburg (Germany) using topographic, climatic, anthropogenic, soil, and vegetation predictors. FFS was modelled at a spatial resolution of 50 metres for current (2014–2022) and future scenarios (2081–2100) considering different shared socioeconomic pathways (SSP3.70 and SSP5.85). Model accuracy ranged between 69 % (RFtest) and 71 % (LOYO), showing a moderately high model reliability for predicting FFS. The model results underscore the importance of anthropogenic parameters and vegetation parameters in modelling FFS on a regional level. This study will allow forest managers and environmental planners to better identify areas, which are most susceptible to forest fires, enhancing warning systems and prevention measures.

Integration of the AHP Method and GIS Techniques for Mapping Areas Susceptible to Forest Fires in the Southern Amazon Region (Peru)

Article

Full-text available

Jun 2024

In recent decades, the occurrence of forest fires has increased, causing damage to wild flora and fauna. For this reason, it is necessary to determine the areas susceptible to the occurrence of this phenomenon and thus implement policies for its management. In this study, the AHP and GIS method were used to map areas susceptible to forest fires in the province of Rodrí guez de Mendoza located in the southern Amazon region of Peru, using climatic variables (Temperature, Precipitation and Wind Speed), topographic (altitude, slope and aspect), socioeconomic (proximity to roads and distance to populated centers) and biological (NDVI). The results indicate that 23.65% of the area is in the high-risk class and 19.05% in the very high-risk class. These risk levels are directly related to the topographic, meteorological, social and biological variables, and could trigger large-scale fires, generating losses in biological diversity and economic losses. It is concluded that 42.70% of the study area is classified as high and very high-risk areas, which makes it necessary to take relevant measures to reduce the risk of natural disasters; Furthermore, the methodology used in this research can be used in other provinces that have similar conditions.

Framework to Create Inventory Dataset for Disaster Behavior Analysis Using Google Earth Engine: A Case Study in Peninsular Malaysia for Historical Forest Fire Behavior Analysis

Article

Full-text available

May 2024

This study developed a comprehensive framework using Google Earth Engine to efficiently generate a forest fire inventory dataset, which enhanced data accessibility without specialized knowledge or access to private datasets. The framework is applicable globally, and the datasets generated are freely accessible and shareable. By implementing the framework in Peninsular Malaysia, significant forest fire factors were successfully extracted, including the Keetch–Byram Drought Index (KBDI), soil moisture, temperature, windspeed, land surface temperature (LST), Palmer Drought Severity Index (PDSI), Normalized Vegetation Index (NDVI), landcover, and precipitation, among others. Additionally, this study also adopted large language models, specifically GPT-4 with the Noteable plugin, for preliminary data analysis to assess the dataset’s validity. Although the plugin effectively performed basic statistical analyses and visualizations, it demonstrated limitations, such as selectively dropping or choosing only relevant columns for tests and automatically modifying scales. These behaviors underscore the need for users to perform additional checks on the codes generated to ensure that they accurately reflect the intended analyses. The initial findings indicate that factors such as KBDI, LST, climate water deficit, and precipitation significantly impact forest fire occurrences in Peninsular Malaysia. Future research should explore extending the framework’s application to various regions and further refine it to accommodate a broader range of factors. Embracing and rigorously validating large language model technologies, alongside developing new tools and plugins, are essential for advancing the field of data analysis.

Comparing remote sensing-based forest biomass mapping approaches using new forest inventory plots in contrasting forests in northeastern and southwestern China

Preprint

Full-text available

May 2024

Large-scale high spatial resolution aboveground biomass (AGB) maps play a crucial role in determining forest carbon stocks and how they are changing, which is instrumental in understanding the global carbon cycle, and implementing policy to mitigate climate change. The advent of the new space-borne LiDAR sensor, NASA's GEDI instrument, provides unparalleled possibilities for the accurate and unbiased estimation of forest AGB at high resolution, particularly in dense and tall forests, where Synthetic Aperture Radar (SAR) and passive optical data exhibit saturation. However, GEDI is a sampling instrument, collecting dispersed footprints, and its data must be combined with that from other continuous cover satellites to create high-resolution maps, using local machine learning methods. In this study, we developed local models to estimate forest AGB from GEDI L2A data, as the models used to create GEDI L4 AGB data incorporated minimal field data from China. We then applied LightGBM and random forest regression to generate wall-to-wall AGB maps at 25 m resolution, using extensive GEDI footprints as well as Sentinel-1 data, ALOS-2 PALSAR-2 and Sentinel-2 optical data. Through a 5-fold cross-validation, LightGBM demonstrated a slightly better performance than Random Forest across two contrasting regions. However, in both regions, the computation speed of LightGBM is substantially faster than that of the random forest model, requiring roughly one-third of the time to compute on the same hardware. Through the validation against field data, the 25 m resolution AGB maps generated using the local models developed in this study exhibited higher accuracy compared to the GEDI L4B AGB data. We found in both regions an increase in error as slope increased. The trained models were tested on nearby but different regions and exhibited good performance.

A Soil Moisture and Vegetation-Based Susceptibility Mapping Approach to Wildfire Events in Greece

Article

Full-text available

May 2024

Wildfires in Mediterranean areas are becoming more frequent, and the fire season is extending toward the spring and autumn months. These alarming findings indicate an urgent need to develop fire susceptibility methods capable of identifying areas vulnerable to wildfires. The present work aims to uncover possible soil moisture and vegetation condition precursory signals of the largest and most devastating wildfires in Greece that occurred in 2021, 2022, and 2023. Therefore, the time series of two remotely sensed datasets–MAP L4 Soil Moisture (SM) and Landsat 8 NDVI, which represent vegetation and soil moisture conditions—were examined before five destructive wildfires in Greece during the study period. The results of the analysis highlighted specific properties indicative of fire-susceptible areas. NDVI in all fire-affected areas ranged from 0.13 to 0.35, while mean monthly soil moisture showed negative anomalies in the spring periods preceding fires. Accordingly, fire susceptibility maps were developed, verifying the usefulness of remotely sensed information related to soil moisture and NDVI. This information should be used to enhance fire models and identify areas at risk of wildfires in the near future.

Sentinel 2 Based Burn Severity Mapping and Assessing Post-Fire Impacts on Forests and Buildings in the Mizoram, a North-Eastern Himalayan Region

Article

Jun 2024

Predicting wildfire events with calibrated probabilities

Conference Paper

Jun 2024

Wildfire susceptibility mapping by incorporating damage proxy maps, differenced normalized burn Ratio, and deep learning algorithms based on sentinel-1/2 data: a case study on Maui Island, Hawaii

Article

Full-text available

May 2024

Climate change has contributed to the recent increase in wildfire occurrences, vegetation failures, human health risks, physical damage, and economic losses. Wildfire susceptibility mapping is an essential technique for assessing areas prone to wildfires. In this study, we proposed the combination of the damage proxy map (DPM) and differenced normalized burn ratio (dNBR) method to generate a precise wildfire inventory map and used it to predict areas susceptible to wildfire. The wildfire susceptibility maps were produced using frequency ratio (FR), convolutional neural network (CNN), and long short-term memory (LSTM)-based deep learning and their performances were compared. We implemented the proposed method on Maui Island, Hawaii, where wildfires frequently occur. We started the process by generating a wildfire inventory map from 2019 to 2023 based on the DPM method applied to Sentinel-1 synthetic aperture radar (SAR) data combined with a dNBR map retrieved from Sentinel-2 data. The wildfire inventory was randomly divided into a training dataset (70%) and a testing dataset (30%). Fifteen wildfire-related factors, including topographical, meteorological, land use, environmental, and anthropological factors, were selected to predict wildfires. The wildfire-related factors were selected by conducting study literature and considering spatial correlation analysis based on the FR method, information gain ratio analysis (IGR), and multicollinearity assessment using tolerance (TOL) and variance inflation factor (VIF) metrics. The level of susceptibility of an area to wildfire is divided into five, namely very high, high, moderate, low, and very low. The FR, CNN, and LSTM produced wildfire susceptibility maps with similar patterns, significantly influenced by land use and rainfall factors. 
The highly susceptible areas are located on gentle slopes covered by agricultural land and unhealthy vegetation, and these areas have low rainfall intensity but receive high levels of solar radiation. Meanwhile, areas with relatively low susceptibility occur in forests with high levels of wet canopy evaporation. The prediction results were evaluated using the area under the receiver operating characteristic (ROC) curve (AUC), and the CNN performed slightly better than the FR and LSTM, with AUC values of 0.879, 0.877, and 0.870, respectively. Hence, the use of the CNN algorithm in the proposed method is appropriate, specifically for the study area. In addition, the susceptibility map provides information on susceptible areas on Maui Island, Hawaii, to increase public awareness.

Regional forest-fire susceptibility analysis in central Portugal using a probabilistic ratings procedure and artificial neural network weights assignment

Article

Full-text available

May 2011

Geographic information system analysis and artificial neural network modelling were combined to evaluate forest-fire susceptibility in the Central Portugal administrative area. Data on forest fire events, indicated by burnt areas during the years from 1990 to 2007, were identified from official records. Topographic, supporting infrastructures, vegetation cover, climatic, demographic and satellite-image data were collected, processed and integrated into a spatial database using geographic information system techniques. Eight fire-related factors were extracted from the collected data, including topographic slope and aspect, road density, viewsheds from fire watchtowers, land cover, Landsat Normalised Difference Vegetation Index, precipitation and population density. Ratings were calculated for the classes or categories of each factor using a frequency-probabilistic procedure. The thematic layers (burnt areas and fire-related factors) were analysed using an advanced artificial neural network model to calculate the relative weight of each factor in explaining the distribution of burnt areas. A forest-fire susceptibility index was calculated using the trained back-propagation artificial neural network weights and the frequency-probabilistic ratings, and then a general forest-fire susceptibility index map was constructed in geographic information system. Burnt areas were used to evaluate the forest-fire susceptibility index map, and the results showed an agreement of 78%. This forest-fire susceptibility map can be used in strategic and operational forest-fire management planning at the regional scale. Additional keywords: back-propagation-learning algorithm, burnt areas, forest-fire susceptibility index, geographic information system, territorial management.

Predicting spatial patterns of wildfire susceptibility in the Huichang County, China: An integrated model to analysis of landscape indicators

Article

Full-text available

Jan 2019
ECOL INDIC

This study presents an analysis of the influence of general landscape-level indicators on wildfire and its spatial susceptibility across a fire-prone landscape in the southeast of China using an integrated WOE-AHP model that consists of a statistical/probabilistic Weights-of-Evidence (WOE) model and a knowledge-based Analytical Hierarchy Process (AHP). Multi-class landscape indicators (i.e., slope, aspect, altitude, NDVI, annual rainfall, wind speed, land use, and proximity to rivers, roads, and human settlements) are weighted by the WOE model and act as the input data for pairwise analyses, which improved the commonly used AHP procedure that relies on a traditional nine-point pairwise rating scale and expert opinion. The model performance was evaluated using the ROC-AUC method that revealed that the integrated WOE-AHP model performed well both in terms of goodness-of-fit with the training dataset (AUCsuccess rate = 0.94) and the capability to predict future ignitions (AUCprediction rate = 0.91). The efficiency of the proposed model was compared to a logistic regression and single WOE models and comparative analyses using the Wilcoxon signed-rank tests demonstrated a significant improvement of wildfire prediction using the integrated WOE-AHP model over these other models. Overall, given the proven capability of integrated modeling in identifying very influential landscape indicators, excluding the indicators with null predictive utility, and improving the prediction of wildfires, modelers can now use this alternative to the current modeling approach and tailor its use to any research related to disturbances to which landscapes are differentially susceptible.

Multi-Temporal Analysis of Forest Fire Probability Using Socio-Economic and Environmental Variables

Article

Full-text available

Jan 2019

As most of the forest fires in South Korea are related to human activity, socioeconomic factors are critical in estimating their probability. To estimate and analyze how human activity is influencing forest fire probability, this study considered not only environmental factors such as precipitation, elevation, topographic wetness index, and forest type, but also socioeconomic factors such as population density and distance from urban area. The machine learning Maximum Entropy (Maxent) and Random Forest models were used to predict and analyze the spatial distribution of forest fire probability in South Korea. The model performance was evaluated using the receiver operating characteristic (ROC) curve method, and models' outputs were compared based on the area under the ROC curve (AUC). In addition, a multi-temporal analysis was conducted to determine the relationships between forest fire probability and socioeconomic or environmental changes from the 1980s to the 2000s. The analysis revealed that the spatial distribution was concentrated in or around cities, and the probability had a strong correlation with variables related to human activity and accessibility over the decades. The AUC values for validation were higher in the Random Forest result compared to the Maxent result throughout the decades. Our findings can be useful for developing preventive measures for forest fire risk reduction considering socioeconomic development and environmental conditions.

Hybrid artificial intelligence models based on a neuro-fuzzy system and metaheuristic optimization algorithms for spatial prediction of wildfire probability

Article

Full-text available

Dec 2018
AGR FOREST METEOROL

This study provides a new comparative analysis of four hybrid artificial intelligence models for the spatially explicit prediction of wildfire probabilities. Each model consists of an adaptive neuro-fuzzy inference system (ANFIS) combined with a metaheuristic optimization algorithm, i.e., genetic algorithm (GA), particle swarm optimization (PSO), shuffled frog leaping algorithm (SFLA), and imperialist competitive algorithm (ICA). A spatial database was constructed based on 159 fire events from the Hyrcanian ecoregion (Iran) for which a suite of predictor variables was derived. Each predictor variable was discretized into classes. The step-wise weight assessment ratio analysis (SWARA) procedure was used to assign weights to each class of each predictor variable. Weights indicate the strength of the spatial relationship between each class and fire occurrence and were used for training the hybrid models. The hybrid models were validated using several performance metrics and compared to the single ANFIS model. Although the single ANFIS model outperformed the hybrid models in the training phase, its accuracy decreased considerably in the validation phase. All hybrid models performed well for both training and validation datasets, but the ANFIS-ICA hybrid showed superior predictive performance of spatially explicit wildfire prediction and mapping for the dataset. The results clearly demonstrate the ability of the optimization algorithms to overcome the over-fitting problem of the single ANFIS model at the learning stage of the fire pattern. This study contributes to the suite of research that seeks to obtain reliable estimates of relative likelihoods of natural hazards.

Convolutional neural networks: an overview and application in radiology

Article

Full-text available

Jun 2018

Convolutional neural network (CNN), a class of artificial neural networks that has become dominant in various computer vision tasks, is attracting interest across a variety of domains, including radiology. CNN is designed to automatically and adaptively learn spatial hierarchies of features through backpropagation by using multiple building blocks, such as convolution layers, pooling layers, and fully connected layers. This review article offers a perspective on the basic concepts of CNN and its application to various radiological tasks, and discusses its challenges and future directions in the field of radiology. Two challenges in applying CNN to radiological tasks, small dataset and overfitting, will also be covered in this article, as well as techniques to minimize them. Being familiar with the concepts and advantages, as well as limitations, of CNN is essential to leverage its potential in diagnostic radiology, with the goal of augmenting the performance of radiologists and improving patient care. Key points: • Convolutional neural network is a class of deep learning methods which has become dominant in various computer vision tasks and is attracting interest across a variety of domains, including radiology. • Convolutional neural network is composed of multiple building blocks, such as convolution layers, pooling layers, and fully connected layers, and is designed to automatically and adaptively learn spatial hierarchies of features through a backpropagation algorithm. • Familiarity with the concepts and advantages, as well as limitations, of convolutional neural network is essential to leverage its potential to improve radiologist performance and, eventually, patient care.

Feature Selection for Classification

Article

Jul 1997

Comparison of convolutional neural networks for landslide susceptibility mapping in Yanshan County, China

Article

May 2019
SCI TOTAL ENVIRON

Assessments of landslide disasters are becoming increasingly urgent. The aim of this study is to investigate a convolutional neural network (CNN) framework for landslide susceptibility mapping (LSM) in Yanshan County, China. The two primary contributions of this study are summarized as follows. First, to the best of our knowledge, this report describes the first time that the CNN framework is used for LSM. Second, different data representation algorithms are developed to construct three novel CNN architectures. In this work, sixteen influencing factors associated with landslide occurrence were considered and historical landslide locations were randomly divided into training (70% of the total) and validation (30%) sets. Validation of these CNNs was performed using different commonly used measures in comparison to several of the most popular machine learning and deep learning methods. The experimental results demonstrated that the proportions of highly susceptible zones in all of the CNN landslide susceptibility maps are highly similar and lower than 30%, which indicates that these CNNs are more practical for landslide prevention and management than conventional methods. Furthermore, the proposed CNN framework achieved higher or comparable prediction accuracy. Specifically, the proposed CNNs were 3.94%–7.45% and 0.079–0.151 higher than those of the optimized support vector machine (SVM) in terms of overall accuracy (OA) and Matthews correlation coefficient (MCC), respectively.

Spatial Pattern Analysis and Prediction of Forest Fire Using New Machine Learning Approach of Multivariate Adaptive Regression Splines and Differential Flower Pollination Optimization: A case study at Lao Cai province (Vietnam)

Article

Jan 2019

Understanding spatial patterns of forest fire is of key important for fire danger management and ecological implication. This aim of this study was to propose a new machine learning methodology for analyzing and predicting spatial patterns of forest fire danger with a case study of tropical forest fire at Lao Cai province (Vietnam). For this purpose, a Geographical Information System (GIS) database for the study area was established, including ten influencing factors (slope, aspect, elevation, land use, distance to road, normalized difference vegetation index, rainfall, temperature, wind speed, and humidity) and 257 fire locations. The relevance level of these factors with the forest fire was analyzed and assessed using the Mutual Information algorithm. Then, a new hybrid artificial intelligence model named as MARS-DFP, which was Multivariate Adaptive Regression Splines (MARS) optimized by Differential Flower Pollination (DFP), was proposed and used construct forest fire model for generating spatial patterns of forest fire. MARS is employed to build the forest fire model for generalizing a classification boundary that distinguishes fire and non-fire areas, whereas DFP, a metaheuristic approach, was utilized to optimize the model. Finally, global prediction performance of the model was assessed using Area Under the curve (AUC), Classification Accuracy Rate (CAR), Wilcoxon signed-rank test, and various statistical indices. The result demonstrated that the predictive performance of the MARS-DFP model was high (AUC = 0.91 and CAR = 86.57%) and better to those of other benchmark methods, ackpropagation Artificial Neural Network, Adaptive neuro fuzzy inference system, Radial Basis Function Neural Network. This fact confirms that the newly constructed MARS-DFP model is a promising alternative for spatial prediction of forest fire susceptibility.

Spatial Pattern Assessment of Tropical Forest Fire Danger at Thuan Chau area (Vietnam) using GIS-Based Advanced Machine Learning Algorithms: A comparative study

Article

Jun 2018
ECOL INFORM

Thuan Chau is a district seriously affected by forest fire in Vietnam, especially in 2016; however, no forest fire prediction research has been conducted for this region. Thus, knowledge of the spatial patterns of fire danger in the district plays a key role in forest succession and ecological implications. The aim of this study was to analyze the spatial pattern of fire danger for the tropical forest of Thuan Chau district using three advanced machine learning algorithms: Support Vector Machine classifier (SVMC), Random Forests (RF), and Multilayer Perceptron Neural Network (MLP-Net). For this purpose, a GIS database for the study area was established with 564 forest fire locations and ten forest fire variables. Then, the Pearson correlation method was used to assess the correlation of the variables with forest fire. In the next step, three forest fire danger models, SVMC, RF, and MLP-Net, were trained and validated. Finally, the global performance of these models was assessed using classification accuracy (ACC), the Kappa statistic (KS), and area under the curve (AUC). In addition, the Wilcoxon signed-rank test was employed to check the prediction performance of these models. The results show that the three models performed well; however, the MLP-Net model has the highest prediction performance (ACC = 81.7, KS = 0.633, and AUC = 0.894), followed by the RF model (ACC = 81.1, KS = 0.621, and AUC = 0.883) and the SVMC model (ACC = 80.2, KS = 0.604, and AUC = 0.867). The results of this study are useful for local authorities and forest managers in forest management and fire suppression.

Forest fire characteristics in China: Spatial patterns and determinants with thresholds

Article

Sep 2018
FOREST ECOL MANAG
