ArticlePDF Available

基于自筛选深度学习的滑坡易发性预测建模及其可解释性

January 2023
Earth Science-Journal of China University of Geosciences 48(5):1696

January 2023
48(5):1696

DOI:10.3799/dqkx.2022.247

Authors:

Faming Huang

Nanchang University

Show all 6 authorsHide

Content uploaded by Faming Huang

Content may be subject to copyright.

地球科学 Earth Science

http://www.earth‐science.net

第 48 卷第5期

2 0 2 3 年 5 月

Vol. 48 No. 5

May 2 0 2 3

https://doi.org/10.3799/dqkx.2022.247

基于自筛选深度学习的滑坡易发性

预测建模及其可解释性

黄发明1，陈彬 1，毛达雄 2，刘乐开 1，张子荷 1，朱莉 1*

1. 南昌大学信息工程学院，江西南昌 330031

2. 南昌大学工程建设学院，江西南昌 330031

摘要：针对滑坡易发性预测建模中滑坡 -非滑坡样本可能存在误差、环境因子间非线性关系较复杂且机器学习可解

释性未被关注等重要问题，拟提出一种基于自筛选的双向长短时记忆网络与条件随机场的滑坡易发性预测模型（Self-

screening Bi-directional Long Short-Term Memory and Conditional Random Fields， SBiLSTM-CRF）.SBiLSTM-CRF 模型具

有深度学习网络层数深、宽度广及可循环迭代建模的优势，能预测出环境因子间的非线性关系，并通过迭代自动筛选

阈值区间外的错误滑坡样本 .该模型可用于解释各环境因子之间耦合关系的内部作用机制 .将SBiLSTM-CRF 模型用

于陕西延长县滑坡易发性预测，并与cpLSTM-CRF、随机森林、支持向量机、随机梯度下降和逻辑回归模型比较 .结果

表明，SBiLSTM-CRF 克服了传统机器学习中存在的样本误差以及因子间复杂的非线性关系问题，具有更高的预测性

能.通过该模型的可解释性能力揭示了坡度、高程和岩性等因子控制延长县的黄土滑坡发育的机制.

关键词：滑坡易发性预测；深度学习；双向长短时记忆网络；条件随机场；可解释性；工程地质 .

中图分类号：P64 文章编号：1000- 2383(2023)05-1696-15 收稿日期：2022-08-18

Landslide Susceptibility Prediction Modeling and

Interpretability Based on Self-Screening Deep Learning Model

Huang Faming1， Chen Bin1， Mao Daxiong2， Liu Lekai1， Zhang Zihe1， Zhu Li1*

1. School of Information Engineering， Nanchang University， Nanchang 330031， China

2. School of Infrastructure Engineering， Nanchang University， Nanchang 330031， China

Abstract: To address the problems of landslide susceptibility prediction (LSP) modeling including possible errors in landslide and

non-landslide samples, complex non-linear relationships between environmental factors and unaddressed machine learning

interpretability, a deep learning-based Self-screening Bi-directional Long Short-Term Memory and Conditional Random Fields

(SBiLST M-CRF) model is proposed to reduce the impact of these problems on LSP and improve its confidence. The SBiLSTM -

CRF model has the advantages of deep learning network with deep layers, wide width and iterative modeling, which can predict

the non-linear relationship between environmental factors and automatically screen out the wrong landslide samples; it can select

non-landslide samples from the initial low/very low landslide susceptibility zone through iterative modeling, and finally reveal the

基金项目：国家自然科学青年基金项目（No.41807285）.

作者简介：黄发明（ 1988 — ），男，博士，副教授，研究方向为地质灾害风险评价 . ORCID ：0000‐0002‐4428‐7133. E ‐mail ：

faminghuang@ncu. edu. cn

* 通讯作者：朱莉，E ‐mail: lizhu@ ncu. edu. cn

引用格式：黄发明，陈彬，毛达雄，刘乐开，张子荷，朱莉，20 23. 基于自筛选深度学习的滑坡易发性预测建模及其可解释性 .地球科学，48（5）：

1696-1710.

Citation：Huang Faming，Chen Bin，Mao Daxiong，Liu Lekai，Zhang Zihe，Zhu Li，2023.Landslide Susceptibility Prediction M odeling and Interpret‐

ability Based on Self‐Screening Deep Learning Model.Earth Science，48（5）：1696-1710.

第 5 期黄发明等：基于自筛选深度学习的滑坡易发性预测建模及其可解释性

internal mechanism of the coupling of environmental factors to predict landslide susceptibility. The SBiLSTM-CRF model is used

to predict landslide susceptibility in Yanchang County of China, and compared with cpLSTM-CRF, random forest (RF), support

vector machine (SVM), stochastic gradient descent (SGD) and logistic regression (LR) models. The results show that SBiLSTM-

CRF overcomes the problems of sample error and complex nonlinear relationship between factors in traditional machine learning ,

has superior performance in modeling susceptibility than conventional machine learning, and the interpretability of the model

reveals that factors such as slope, elevation and lithology control the development of mounded landslides in Yanchang County.

Key words: landslide susceptibility prediction; deep learning; Bi‐directional long short‐ term memory; conditional random field;

interpretability analysis; engineering geology.

0 引言

滑坡灾害具有分布广、发生频率高、灾害损

失严重等特点，滑坡严重威胁人类生命财产安全

（Khan et al.， 2021）.滑坡易发性预测对滑坡风险

评价和潜在滑坡的准确定位具有重要作用 .

近几十年来，各种基于 GIS 平台的方法被用

于预测滑坡易发性，包括启发式模型（Moragues

et al.， 2020），数理统计模型如判别分析（Eiras et

al.， 2021）、多元线性回归（Jia et al.， 2021）、确定

性系数等（罗路广等，2021）.由于上述模型并不

能较好地揭示出易发性建模中各环境因子之间

的非线性作用关系，并且通常需要大量先验知

识，导致这些模型存在一定的局限性 .近年来，机

器学习由于其不需要过多的先验知识（如不需要

环境因子呈正态分布），且能获得较高预测精度

而被广泛应用于滑坡易发性预测，如人工神经网

络（田乃满等，2020）、支持向量机（Support Vec‐

tor Machine ， SVM）（Yao et al.， 2008）、决策树

（Pradhan ， 2013）、随机森林（Random Forest，

RF）（吴润泽等，2021）、多层感知机（Huang et

al.， 2020a）、逻辑回归（Logistic Regression， LR）

（方然可等，2021）、随机梯度下降（Stochastic Gra‐

dient Descent ， SGD）（Hong et al.， 2020）等 .

然而，传统的机器学习应用于滑坡易发性预测

还存在许多问题：（1）表征学习需要大量的先验知

识和假设空间，且不能充分地提取出各环境因子之

间的非线性关系（Huang et al.， 2020b）；（2）不能较

好地实现滑坡样本中的误差剔除，并实现有效的非

滑坡样本选择（Yao et al.， 2022）；（3）模型对数据敏

感，如缺失值、噪声和错误数据等；（4）被认为是“黑

箱操作”模型，对各类环境因子非线性耦合作用下

预测出滑坡易发性值的内部机制缺乏认识，即模型

可解释性未得到足够关注 .因此研究一种新的用于

滑坡易发性预测的机器学习算法非常有必要 .

深度学习在一定程度上能够克服传统机器学

习的这些缺点，它具有学习能力强、覆盖范围广、适

应性好等优点，且以数据驱动而不需要过多的先验

知识和假设 .目前深度学习已被广泛应用于公安服

务领域（Zhou et al.， 2022）、医疗辅助诊断领域

（Bhattacharya et al.， 2021）和公共安全领域等（Xu

et al.， 2018），当然也被广泛应用于自然灾害易发性

预测等领域（Huang et al.， 2020b；魏志军， 2020；

赵洪宝等，2022）.上述深度学习预测滑坡易发性主

要还是从算法改进的角度出发，而没有从滑坡易发

性建模本身存在的问题出发来提升建模性能（Zhu

et al.， 2020）.单纯的深度学习算法改进有时难以得

到理想的滑坡易发性预测效果，比如建模的滑坡‒

非滑坡样本本身就存在误差比较大的问题（黄发明

等，2020）.这种滑坡‒非滑坡样本误差将会严重制约

深度学习模型的输出变量准确性，从而制约深度学

习预测滑坡易发性的精度提升 .针对该问题，本文

提出新的深度学习算法，即基于自筛选的双向长短

时记忆网络与条件随机场（Self‐screening Bi‐direc‐

tional Long Short ‐ Term Memory and Conditional

Random Fields，S BiLS TM ‐ CRF）滑坡易发性预测

模型 .该算法预期能更准确地分析出各滑坡点之间

的相关性，以便提取出环境因子之间更深层的特征

信息；能自动筛选掉错误的滑坡样本，通过二次迭

代建模选择出准确的非滑坡样本，以便提供比原数

据更准确的滑坡空间数据描述（Zhou et al.， 2022）.

为体现 SBiLSTM‐CRF 模型的优越性，本文同时

选择了 4个传统机器学习模型（即随机森林（Ran‐

dom Forest，RF）、逻辑回归（Logistic Regression，

LR）、随机梯度下降（Stochastic Gradient Descent，

SGD）、支持向量机（Support Vector Machine，

SVM））以及深度学习方法级联并行长短时记忆

网络和条件随机场（Cascade‐parallel Long Short‐

Term Memory and Conditional Random Fields，

cpLSTM‐CRF）进行比较研究（Zhu et al.， 2020）.

1697

第 48 卷

地球科学 http://www.earth‐science.net

对于滑坡易发性建模的另外一个严肃问题是

很难解释影响结果的变量或特征，即深度学习的可

解释性（Linardatos et al.， 2020）.随着深度学习模

型越来越广泛地应用于关键环境下的重要预测，对

机器学习的透明度也日益增高 .其危险在于创建

和使用不合理、不合法的决策，或者根本无法获得

对其行为的详细解释（成科扬等，2020；曾春艳等，

2021）.其实无论是传统的机器学习还是深度学习，

都面临无法充分解释环境因子作用下滑坡易发

性指数预测机制的问题（Gaur et al.， 2020），尤其

大多数以数据驱动的深度学习目前尚处于“黑盒

子 ”状态 .为了解决这一局限性，本文重点分析各

环境因子对预测模型决策的贡献度以及与预测

结果的关联性，以获得更好的可解释性结论 .

1 滑坡易发性预测的 SBiLSTM‐CRF

模型

1.1　研究思路　

滑坡易发性预测是一个复杂的非线性建模问

题，通过常规统计方法可能无法获得令人满意的结

果.传统的处理机器学习和数据挖掘问题的方法都

需要很强的前提假设或者大量的先验知识，而且易

发性建模中也存在滑坡-非滑坡样本误差等多种

问题（Hong et al.， 2020）.本文拟提出了 SBiLSTM‐

CRF 这一新的深度学习模型来尝试克服上述问题，

并首次运用于滑坡预测数据集 .该算法流程（图 1a）

如下：（1）滑坡影响因子的分析和筛选：基于滑坡编

录信息与基础环境因子，采用频率比法（胡涛等，

2020）获得各个基础环境因子的频率比 .（2）建立空

间数据集：将处理好的频率比值作为模型输入变

量，构建滑坡 -非滑坡的空间数据集，并划分出模

型训练集和测试集 .（3）数据自筛选建模：通过构建

一个 Bi‐LSTM 和全连接网络的预分类自筛选网络，

并对全区进行滑坡易发性预评估 .设置阈值 t控制

T1和T0对数据集中因人工标注、数据采集误差等不

定因素导致的数据集标注结果错误数据进行筛选，

以降低易发性评价过程中的不确定性 .再从全区预

评估结果采用自然断点法获得阈值 T′

1和T′

0进行等

量补充 .（4）特征提取建模：构建级联双向 LSTM 网

络对筛选后的数据集进行更深度地提取特征，获取

更深层次的滑坡点前后的上下文信息和因子之间

的关联信息 .（5）序列预测建模：构建全连接二分

类网络对所提取的特征进行滑坡易发性预测，并

引入 CRF 对结果进行综合解码，获得易发性预测

结果 .（6）模型性能评价：通过假阴性、假阳性、真

阴性、真阳性等指标和 ROC 曲线衡量模型性能 .

1.2　滑坡易发性预测模型　

1.2.1　滑坡栅格样本自筛选　该网络的原始输入

是处理后的延长县一维地理数据，即每个样本所

包含的 12 个环境因子 .通过一个 Bi‐LSTM 和全

连接层构成预分类网络 .经过简单的训练之后，

预分类网络对测试集（2 060 个）和全研究区域的

研究点（2 622 482 个）进行预分类，得到相应的滑

坡易发性概率 .通过自然断点法获得全区易发性

区间，并进行分级 .本研究将滑坡易发性分为

K(K= 5 ) 个等级，分别为极低、较低、中等、较

高、极高 5个等级 .对筛选阈值引入偏度以用来

度量随机变量概率分布的不对称 .公式如下：

ïï

ïT1=

sk ×t

1 + 2

T0=1-T1,

，(1)

其中 t为人为设置阈值.s k 为偏度，定义为：

sk =1

n∑

i= 1

[( Xi-μ

σ)3] ，(2)

其中，

μ为均值，

σ为标准差 .当sk < 0 时表明滑坡易

发性预分类整体预测分布为负偏离，即左偏态；当

sk > 0 时表明整体预分类分布为正偏离，即右偏态；

当sk = 0，表明数据集整体预分类分布为正态分布 .

根据阈值的自筛选规则如下：（1）由预分类

网络对延长县全研究区域进行滑坡易发性预评

估；（2）将人工标签为 1而滑坡易发性概率小于

T1的栅格筛除，将人工标签为 0而滑坡易发性

概率大于 T0的栅格筛除；（3）随机从延长县全

研究区域滑坡易发性预评估结果中筛选滑坡易

发性概率大于 T′

1的栅格作为新的滑坡样本，筛

选滑坡易发性概率小于 T′

0的栅格作为新的非

滑坡样本 .其中 T′

1和T′

0分别是预评估中由自然

间断点分级法获得的极高和极低的区间值 .

本研究中筛选的阈值 t设置为 0.6，计算 T1为

0.302 1，筛选正样本 87 个，负样本 130 个，

T′

1和T0'

分别为 0.659 9 和0.139 1. 如此筛选后等量补充，既

减弱了滑坡易发性评价的不确定性，又维持了数据

集数据量一致，也保证了滑坡和非滑坡的类平衡 .

1.2.2　特征提取建模　特征提取部分采用 K(K=

5 ) 级Bi‐LSTM 的级联网络和全连接的前馈神经

网络 .级联 Bi‐LSTM 层能够更深层次地提取滑

1698

第 5 期黄发明等：基于自筛选深度学习的滑坡易发性预测建模及其可解释性

坡环境因子的深度特征和空间特征 .网络输入

层是环境因子的实矩阵 Ri×j，其中行向量

{

i |1 ≤

i≤I，i∈Z

}

为栅格个数，

I为研究区栅格总数，

列向量

{

j |1 ≤ j≤J，j∈Z

}

为环境因子个数，

J为

滑坡因子总数 .滑坡因子向量进入 5个Bi ‐

LSTM 的级联网络，以单条 LSTM 链及 LSTM

单元为例，结构如图 1b 和图 1c 所示 .正向传播

中处理第 k层中第 i个栅格的第 j个因子时，

LSTM 内部状态记忆单元计算过程可表示为：

k,i,j= tan h

(

Wc⋅

[

hk- 1,xij

]

+bC

)

，(3)

Ck,i,j=Fk,i,j×Ck- 1 +Ik,i,j×C

k,i,j ，(4)

hk,i,j=Ok,i,j× tan h(Ck,i,j) ，(5)

其中 hk，i，j表示隐藏层状态，C

k，i，j表示输入门基于

先前隐藏层生成的候选状态，

Ck，i，j是前单元状

态，

tan h为激活函数，

Fk，i，j为遗忘门，

Ik，i，j为输入

门，

Ok，i，j为输出门，计算过程：

Fk,i,j=σ

( )

Wf⋅

[ ]

hk- 1,xij +bf,

Ik,i,j=σ

( )

Wi⋅

[ ]

hk- 1,xij +bi,

Ok,i,j=σ

( )

Wo⋅

[ ]

hk- 1,xij +bo,

(6)

其中 σ表示激活函数；

Wf，

Wi和Wo表示 LSTM

单元每个门的权重矩阵；

hk- 1 ，

xij 分别表示前一

个单元的隐藏状态和第 i个栅格点的第 j环境

因子；

bf，

bi和bo表示相应的偏置项；

Fk，i，j输出

介于 0和1之间的数字 .首先输入门产生候选

状态，然后用遗忘门确定丢弃的信息，与输入

门相加，实现 LSTM 单元状态更新，最后输出

门根据当前单元状态输出和隐变量 .Bi‐LSTM

为两条相反的 LSTM 链，公式可以表示为：

ïï

hfk,i,j=H(Whf ⋅

[ ]

hfk,i,j+xij +bhf )

hbk,i,j=H(Whb ⋅

[ ]

hbk,i,j+xij +bhb ) ，(7)

其中 hfk，i，j∈Ri×j，

hbk，i，j∈Ri×j分别表示前向层和后向

层的输出向量，最终输出 yk，i，j= [ hfk，i，j，hbk，i，j]是这两

部分的拼接，前向层和后向层的组合被定义为单个

图1　算法流程及 Bi-LSTM/LSTM 结构示意

Fig.1　Algorithm flow and Bi-LSTM/LSTM structure

a. 算法流程 . FC. 全连接网络；Bi-LSTM . 双向 LSTM；CRF. 条件随机场；b.Bi-LS TM 链；c.LSTM 单元结构

1699

第 48 卷

地球科学 http://www.earth‐science.net

Bi‐LSTM 层.在完成一次前向传播过程中，从原始

数据输入到 SBiLSTM ‐CRF 变化公式为：

yout=∏

k=1

K∑

i=1

M∑

j=1

ok,i,j

×tan h

( )

fk,i,j

×Ck-1+ik,i,j

×C

k,i,j ，(8)

在模型训练过程中，容易发生过拟合问题，因

此非常必要对网络使用 Dropout 防止过拟合，

来实现正则化效果 .在神经网络前向传播的

过程中，对每级 LSTM 层中的部分神经元按

照预设的概率随机失效 .从而实现提高神经

网络模型泛化性能的效果，最终实现过拟合

问题的改善，本文的 Dropout 设置为 0.25.

1.2.3　序列预测建模　在通过 K级级联的 Bi ‐

LSTM 网络更深层次地提取每个栅格点的每个滑

坡因子之间的多维特征后，使用一个由 32 个神经元

组成的全连接层将所有的特征融合起来，并使用两

个神经元进行滑坡预测 .这一过程使用 Sigmoid 函

数作为激活函数 .由于滑坡数据在总体上具有一定

的空间连续性，例如，在滑坡数据的周围子区域往

往具有较大的概率滑坡 .考虑邻域中标注之间的

相关性是必要的 .因此引入 CRF 对标签序列进行

综合解码，而非独立解码 .CRF 的特征函数分为两

类，第一类是定义在第 i个栅格点上的节点特征函

数，它只和当前栅格点的输出滑坡易发性评估结

果有关，第 i个栅格点 yi的节点特征函数记为：

(

yi,yi

pred,i

)

,l= 1,2,...L ，(9)

其中 L是定义在该栅格点上节点特征函数的总个

数.第二类是定义在 yi栅格点前后的局部特征函

数，它只和当前滑坡点和上一滑坡点有关，第 i个栅

格点 yi的局部特征函数记为：

(

yi- 1,yi,yi

pred,i

)

,k= 1,2,...K ，(10)

其中 K是定义在该栅格点的局部特征函数的总

个数 .无论是节点特征函数还是局部特征函数，

它们的取值只能是 0或1，即满足特征条件或者

不满足特征条件 .同时每个特征函数都被赋予一

个权值，用以表达对这个特征函数的信任度 .线

性链条件随机场的公式可表示为：

(

yi|yi

pred

)

( )

pred

exp

(

∑

i,k

λktk

(

yi- 1,yi,yi

pred,i

)

∑

i,l

μlsl

( )

yi,yi

pred,i

)

，(11)

其中 tk和μl分别是 λk和sl的权重系数，Z

(

pred

)

为规范化函数，表示为：

(

pred

)

∑

exp

( )

∑

i,k

λktk

( )

yi- 1,yi,yi

pred,i+∑

i,l

μlsl

( )

yi,yi

pred,i ，

(12)

为了得到最优的模型，需要训练 CRF，本文采用极

大似然估计法，似然函数如式（13）所示 .本文使用

了Viterbi 算法（Forney，1973）来优化 CRF 计算过

程，得到经 CRF 平滑处理后的最终输出序列标签：

{

y1，y2，⋅ ⋅ ⋅，

}

yn.

(

)

=∑

log p

( )

yi|yi

pred . (13)

1.2.4　损失函数　滑坡环境因子原始数据差异

较小，且不同环境因子对滑坡的影响不同 .因此

在训练过程中引入交叉熵作为损失函数计算损

失以扩大数据之间的差距，减小数值计算误差，

加速模型的收敛 .此外，交叉熵使得优化的概率

更加精确，既滑坡概率更接近 1，非滑坡概率更

接近 0. 交叉熵损失函数可以表示为：

Lcross ‐entropy =

-1

n∑

i=1

(yi

Llog ( yi)+(1 - yi

L) log ( 1 - yi) ) , (14)

其中，

yi表示数据真实标签值，

yi表示预测值，

n表示滑坡样本总数 .

1.3　滑坡易发性预测结果评价　

1.3.1　统计指标精度评价　本研究采用阳性预测

率（PPR）、阴性预测率（NPR）和总准确率（To‐

tal Accuracy， TA）这几种统计指标来评价各模

型的预测性能（Huang et al.， 2020b）.PPR 通过计

算预测正例在所有正例里的占比得到 .NPR 通

过预测负例在所有负例中的占比得到 .PPR 和

NPR 分别用于评估滑坡易发性模型对滑坡和非

滑坡的预测能力 .TA 被用来评估所有测试数据

集的预测准确度 .这3个统计指标的计算如下：

TA = TP + TN

TP + TN + FP + FN , (15)

其中，真阳性（True Positive， TP）表示正确分类的

滑坡栅格数；真阴性（True Negative， TN）表示正确

分类的非滑坡栅格数；假阳性（False Positive， FP）

表示错误分类的滑坡栅格数，即将非滑坡栅格错误

的分类为滑坡栅格；假阴性（False Negative， FN）表

示分类错误的费滑坡栅格数，即将滑坡栅格错误的

分类为非滑坡栅格 .TA 表征模型总的预测精度，

TA 越大表示滑坡易发性预测准确度越高 .

1.3.2　ROC 曲线评价　接受者操作特征曲线（Re‐

1700

第 5 期黄发明等：基于自筛选深度学习的滑坡易发性预测建模及其可解释性

ceiver Operating Characteristic Curve， ROC），能反

映特异性和敏感性的相互关系，被广泛应用于评价

滑坡易发性模型的优劣（罗路广等，2021）.特异性又

称假阳率（False Positive Rate， FPR），敏感性又称

真阳率（True Positive Rate， TPR），计算公式为：

FPR = FP

FP + T N , (16)

TPR = TP

TP + F N , (17)

ROC 曲线以特异性为横坐标，以敏感性为纵坐标 .

通常 ROC 曲线在 y=x之上，因此 ROC 曲线下的面

积（Area Under the Curve，AUC）一般取值范围位于

区间［0.5， 1］.ROC 曲线越接近左上角，即 AUC 越

大表示该滑坡易发性预测模型的性能越好 .

1.4　深度学习预测滑坡易发性的可解释性　

近年来深度学习在研究领域取得了很多

成果，但由于其缺乏可解释性而降低了模型的

可信度 .可解释性是指以可理解的术语向人类

提供解释的能力（Zhang et al.， 2021）.总体可

分为内置可解释性和事后可解释性（Alvarez‐

Melis and Jaakkola ，2018）.

内置可解释性也称事前可解释性，是指模型

本身已内置可解释性，发生在网络训练之前，主要

用于较为简单的网络模型 .通常直接使用自解释

模型或实现网络模型的内置可解释性（Lipton，

2018）.事后可解释性旨在从已训练模型中提取信

息，主要运用于较为复杂的网络模型，可分为全局

可解释性和局部可解释性 .全局可解释性是指基

于整个数据集的特征空间和模型结构来解释和理

解模型决策，对应的方法为全局可解释方法，通常

从模型层面进行解释；局部可解释性是指基于输

入样本的每一维特征变化对输出结果影响来解释

和理解模型决策，对应的方法为局部可解释方法，

通常从数据层面进行解释 .本研究基于预测分布

图和积分梯度对延长县滑坡空间数据集进行局部

和全局解释（Alvarez‐Melis and Jaakkola，2018）.

2 研究区及滑坡编录介绍

2.1　延长县　

延长县位于陕西省东北部，总面积为

2 368.7 km²，海拔高度为 470.6~1 383 m；延长县属

暖温带干旱大陆性季风气候，四季分明，降水量

相对较少 .其地势由西北向东南倾斜，南北高，中

间低，呈谷峰型 .河流弯曲狭窄，植被覆盖率

低，丘陵沟壑交错且河谷深切重；具有黄土高

原地貌，地层依次为三叠系碎屑沉积岩、上新

统三趾马红土、第四纪体系 .马红土分布不连

续，其上层黄土抗剪强度较差 .马兰黄土由于

其质纯、疏松、大孔隙，属于滑坡易发地层 .

该区滑坡概况如图 2所示，现存 82 处滑坡，

主要类型为小型浅层覆盖滑坡 .滑坡体积以

中小型为主，共计 81 处（占比 98.7%），主要

分布在西部，大部分的滑坡距离河流较近、

具有相对较高的地势（郭天颂等，2019）.

2.2　滑坡环境因子分析　

本研究用到的数据源包括：①县国土资源

部门调查的滑坡编录信息和野外调查的相关

资料；②30 m 分辨率的 DEM 数据；③1 ：10 万比

例尺的延长县岩土类型分布图；④30 m 分辨率

的Landsat TM8 遥感影像等 .根据野外实际情

况，综合考虑影响滑坡发育的环境因子并依据

已有文献资料，最终选取高程、坡度、坡向、剖

面曲率、平面曲率、地形起伏度、地表总辐射、

岩性、地形湿度指数（Topographic Wetness In‐

dex ， TWI）、归一化建筑物指数（Normalized

Difference Building Index ， NDBI）、归一化植被

指数（Normalized Difference Vegetable Index ，

NDVI）、修正归一化差异水指数（Modified

Normalized Difference Water Index ， MNDW I）12

个作为易发性模型的输入变量（图 3）.

图 2　延长县滑坡概况

Fig.2　Yanchang County landslide overview

1701

第 48 卷

地球科学 http://www.earth‐science.net

3 滑坡易发性建模结果

3.1　滑坡相关空间数据集　

本文共获取了 12 个环境因子作为深度学习

的输入变量，滑坡数据集中共有 6 864 个栅格单

元，其中包含从滑坡区域中转换的 3 432 个滑坡

栅格单元以及从非滑坡区域中随机选择的

3 432 个非滑坡栅格单元 .这些滑坡‒非滑坡栅格

分别被以 70%/30% 的比例随机划分后合并为训

练集和测试集（黄发明等，2020）.之后 SBiLSTM‐

CRF 模型利用训练 /测试数据集预测延长县的滑

坡易发性，并选用 LR、 RF、 SVM、 SGD 等模型

做对比 .实验所需的硬件配置如表 1所示 .

3.2　延长县滑坡易发性结果　

3.2.1　SB iLSTM ⁃ CRF 模型预测易发性　本文提

出的 SBiLSTM‐CRF 具有对数据的自筛选功能，且

具有较深的特征提取能力 .模型以 Bi‐LSTM 和全

连接层作为预测模型对输入的数据进行筛选，在以

级联 Bi‐LSTM 深度学习模型进行特征提取，利用

CRF 对预测结果做综合解码 .模型采用 Adam 优化

图3　延长县滑坡环境因子

Fig. 3　Environmental factors for landslides

a. 高程；b . 坡度；c. 坡向；d. 平面曲率；e. 剖面曲率；f. 地表起伏度；g. 岩性；h . 地形湿度指数；i.NDVI；j.NDBI；k.MNDWI；l. 地表总辐射

1702

第 5 期黄发明等：基于自筛选深度学习的滑坡易发性预测建模及其可解释性

器，学习率设置为 0.000 1 ；batch size 设置为 200；

LSTM 单元数设置为 32 个（即隐藏层大小设置

为32），训练迭代次数为 10 000 次.将训练好的

SBiLSTM ‐CRF 模型用于预测延长县全研究区

域的滑坡易发性得到滑坡易发性图（Landslide

Susceptibility Mapping， LSM）.此外，根据自然间

断点分级法（Yao et al.， 2008）将延长县划分为

极高、高、中等、低、极低 5个等级并计算出相对

应的面积占比分别为：12.80% ，13.21 % ，

13.67% ，19.53% 和40.77%（图 4a，表 2）.

3.2.2　CPLSTM ⁃ CRF 模型预测易发性　

CPLST M‐C RF 模型采用 Ad am 优化器，学习率设置

为0.001，batch size 设置为 200，unit 设置为 32 个且训

练迭代次数为 8 000 次（Huang et al.，2020b）.将训

练好的 CP LST M ‐C RF 模型用于预测延长滑坡

易发性得到 LSM 图.根据自然间断点法对研究

区域分为极高、高、中等、低、极低 5个等级并计

算出相对应的面积占比分别为：8. 59% ， 9.33% ，

13.92% ， 22.36% 和45.80% （图 4b ，表 2）.

3.2.3　RF 模型预测易发性　 RF 是基于树模型的

分类器，是滑坡预发性预测的常用模型 .RF 精度

主要采用因子特征数量 m和树的数量 t等参数来

调整 .基于袋外误差筛选法确定 RF 模型预测延长

县滑坡易发性时，其 m和t的参数分别设置为 3和

800 （吴润泽等，2021）.同样将 RF 模型用于预测

延长县全研究区域的滑坡易发性得到 LSM 图.

其中 RF 模型预测的极高、高、中等、低、极低滑坡

易发区的面积占比分别为： 10.02% ， 16.86% ，

21.99% ， 24.59% 和26.54%（图 4 c ，表 2）.

3.2.4　LR， SVM 和SG D 模型预测易发性　LR 模

型由线性回归模型方程与 Sigmoid 函数共同组成 .

其本质是假设数据服从某一分布，然后使用极大似

然估计做参数的估计（李文彬等，2021）.LR 具有实

现简单高效易解释、计算速度快、易并行的特点 .

在对 LR 建模时，设置 L2 正则惩罚项以增强抗扰

动能力，惩罚系数 C设置为 0.5 ，停止标准设置为

10- 4 .LR 模型预测的极高、高、中等、低、极低滑

坡易发区的面积占比分别为：10.90% ，16.85% ，

21.52% ，25.53% 和25.19%（图 4 d ，表 2）.

SVM 模型的惩罚系数 C设为 1.0；核函数采

用径向基；核函数系数 γ设为 0.3（Yao et al.，

2008）.SVM 预测的极高、高、中等、低、极低滑坡

易发区的面积占比分别为：14 .87% ，1 6.45 % ，

18 .76% ，22.84% 和27.08%（图 4e）.SGD 模型是

广泛使用的优化算法， SGD 采用 L2 范数作为罚

项类型，罚项系数 α设置为 0.000 1（Hong et al.，

2020）.SGD 模型预测的极高、高、中等、低、极低

滑坡易发区的面积占比分别为：1 5.20 % ，

18 .46% ，20 .16% ，22.15% 和24.03%（图 4f ，表 2）.

3.3　易发性建模结果评价　

3.3.1　统计指标精度　各滑坡预测模型的统计测

量结果如表 3所示 .结果表明，SBiLSTM ‐CRF 在阴

性预测率、阳性预测率和总的预测率上均比其他传

统机器学习模型和深度学习具有更好的预测性能 .

自筛选的方式有效去除了错误样本，为模型学习提

表1　实验平台软硬件环境

Table 1　Software and hardware environment of the experimental platform

实验平台

配置

处理器

(CPU)

显卡

(GPU)

RAM

参数

Intel(R) C ore(TM ) i5-7400@3.00 GHz

Nvidia GeForce GTX1080

8.00 GB DDR4

实验

平台

配置

内存

(ROM)

操作

系统

开发

环境

参数

Western Digital WDC WD10EZEX-08WN4A0

Windows10 + Ubuntu18.04

Python3.6.5 + TensorFlow1.14.0 +

Keras2.1.4 + Matlab 2018

表2　滑坡易发性评估统计结果

Table 2　Statistical results of landslide susceptibility evalua‐

tion

预测模型

SBiLSTM-CRF

cpLSTM-CRF

SVM

SGD

易发性等级 (%)

极高

12.80

8.59

10.02

10.9

14.87

15.2

高

13.21

9.33

16.86

16.85

16.45

18.46

中等

13.67

13.92

21.99

21.52

18.76

20.16

低

19.53

22.36

24.59

25.53

22.84

22.15

极低

40.77

45.80

26.54

25.19

27.08

24.03

1703

第 48 卷

地球科学 http://www.earth‐science.net

供了高质量的数据 .级联的 L STM 可以捕获更深层次

的因子间的交互关系，使其优于其他机器学习模型 .

3.3.2　ROC 曲线　 ROC 曲线用模型命中率和误

报率来评价模型性能，AUC 表示 ROC 曲线下的

面积，主要用于衡量模型的泛化性能 .ROC 曲线

如图 5 所示 .可以看出，SBiLSTM‐CRF 在精确率

和召回率都明显优于其他模型 .揭示了 SBiL ‐

STM‐CRF 克服了其他模型的局限性，通过级联

LSTM 明显增强了模型的非线性表达能力 .

4 讨论

4.1　模型迭代分析　

SB iLST M ‐ CRF 训练损失值和测试准确率随

迭代次数增加的变化曲线如图 6 所示 .可以看出迭

代500 次内，准确率停滞在 0.5 没有上升，但这并不

意味着模型没有更新 .相反，模型在停滞过程中 loss

急速下降 .模型迭代次数在 2 000 次内，损失值从 4

迅速下降到 0.5，而后缓慢降低，并趋于平稳 .同时，

图 4　滑坡易发性图

Fig. 4　Landslide susceptibility maps

a.SBiLSTM-CRF；b.cpLSTM-CRF；c.RF；d. LR ；e. SVM ；f.SGD

1704

第 5 期黄发明等：基于自筛选深度学习的滑坡易发性预测建模及其可解释性

准确率迅速从 0.5 上升至 0. 8 ，而后逐渐达到稳

定，并趋于平稳 .这表明模型具有较快的收敛

速度和稳定的收敛性能 .

4.2　SBiLST M⁃CRF 模型分析　

滑坡易发性建模结果表明，SBiLSTM ‐ CRF

模型性能优于以往的 CP LST M ‐C RF 和传统的

RF 、LR、SVM、SGD 等机器学习模型 .这是因为

SBiLSTM‐CRF 模型在易发性建模过程中表现出

了诸多优点 .比如该模型使用自筛选模块以解决

人工标注错误、数据采集误差等不定因素带来的

偏差；建模时提出使用 Bi‐LSTM 以增加网络宽

度，并以级联的方式增加网络的深度；CRF 能进

一步分析栅格点之间的双能量关系使得滑坡预

测概率更平滑 .SBiLSTM‐CRF 能更充分地拟合

数据以便学习环境因子间的非线性关系，具有自

动筛选错误数据、特征提取能力强的优势 .SBiL‐

STM‐CRF 模型同时还具有自筛选的能力和良好

的可解释性 .然而在网络拥有良好的特征提取能

力和优秀的分类能力的同时，SBiLSTM ‐CRF 和

其他的深度学习和机器学习网络相比更为复杂 .

因此，是否能在尽量不牺牲精度的前提下对模型

进行剪枝和优化，将成为下一步改进的方向 .

4.3　SBiLST M⁃CRF 的易发性建模可解释性　

本文分别从单因子影响滑坡易发性、双因子交

互作用以及因子贡献度等领域出发对深度学习预

测滑坡易发性进行解释（Linardatos et al.， 2020）.

4.3.1　单因子　通过将滑坡样本输入到网络中获

得滑坡易发性概率，统计环境单因子的预测分布图

可解释性，可以从自然成因层面直观揭示出不同单

因子对滑坡易发性预测结果的影响（曾春艳等，

2021）.对测试集中 2 060 个样本进行统计可以得出

各个环境因子在各区间的分布，如图 7 所示 .以坡度

为例，根据坡度的分布，对每个样本进行统计，得到

坡度在各个区间的预测分布如图所示 .上半部分的

箱型图表示各区间滑坡易发性概率的分布情况，箱

型图中的数字表示该区间概率统计的二分位数

（Q2）；下半部分的柱状图表示该区间内样本点的数

量统计情况，柱状图中的数字表示该区间样本个数 .

由图 7a可以看出，坡度在 0°~8.8°和8.8°~12.1°

区间内的样本量较少，分别为 140 例和 221 例，分别

约占总量的 6.8% 和10.7%；21.9°~41°区间内的样

本量最大，为 582例，约占总量的 2 8.3% . 坡度在 0°~

12.1°呈现出非常低的滑坡易发性概率，均小于 0.2，

两个区间内的 Q2 都为 0.085；坡度在 19.1°~21.9°和

41.0°~50.3°区间内的滑坡易发性概率集中在 0.5~

0.9，Q2 为0.671；坡度在 21.9° ~41°时，滑坡易发性

图 6　L oss 和acc uracy 随迭代次数曲线

Fig.6　Loss and accuracy curves with number of iterations

表3　不同模型对延长县滑坡预测的性能对比

Table 3　Comparison of landslide prediction performance of

different models in Yanchang County

模型

PPR (%)

NPR(%)

TA (%)

SBiLSTM-

CRF

849

857

173

181

83.07

82.56

82.82

CPLSTM -

CRF

809

701

329

221

71.09

76.03

73.30

813

771

259

217

75.84

78.04

76.89

SVM

767

729

301

263

71.82

73.49

72.62

729

724

306

301

70.43

70.63

70.53

SGD

813

616

414

217

66.26

73.95

69.37

图 5　各滑坡易发性预测模型的 R OC 曲线

Fig.5　ROC curves for each landslide susceptibility prediction

model

1705

第 48 卷

地球科学 http://www.earth‐science.net

概率集中在 0.6~0.9，Q2 为0.845，表现出了具有

较高的滑坡易发性概率（郭果等，2013）.坡度作

为易发性评价中重要的环境因子，其大小直接影

响了滑坡的发生，在坡度为 0°~41.0°内，研究区域

内的滑坡易发性概率随坡度增大而增大，而在

41.0°~50.3°区间内滑坡易发性概率较21.9°~41.0°

内有所下降 .这是因为坡度较小通常坡体比较稳

定，而坡度大通常滑坡风化层不易累计、人类活动

影响较小、坡体排水良好，反而不易发生滑坡 .

由图 7b 可看出，NDVI 在0~0.099、0.449~1 区

间的数据较少，样本数为 3例，占比约 0.15%；NDVI

在0.113~0.122、0.237~0.243、0.259~0.270、

0.347~ 0.398 之间的数据量同样较少，样本数为 118

例，占比约 5.7% . 在随着 NDVI 的增大，并没有呈现

出较强的规律性，NDVI 在0.113~0.122、0.237~

0.243、0.259~0.270、0.347~0.398 之间时，超过一半

的滑坡易发性概率都落在了 0~0.5，其 Q2 为

0.397；而在 0.157~0.197 区间内，超过一半的滑坡

易发性概率都落在了 0.5~0.9，Q2 为0.599. 植被

覆盖度是生态环境的一个重要参数（Moragues et

al.， 2020），影响着斜坡体表面土体的流失速率

和土壤侵蚀程度 .NDVI 较低时，通常为水系、城

市和裸地，通常没有滑坡发育的条件；NDVI 较高

时，边坡抗剪强度和抗渗透性较好，人类工程活

动减少，不利于滑坡，同时较高的植被也不利于

人类观测，因此有记录的滑坡频次较少 .

4.3.2　双因子交互作用　在单因子的统计上，进

一步统计双因子交互作用，以下选取相关性较高

的6个结果做分析 .以DEM 和Slope 为例，根据

DEM 和Slope 分别的结果进行统计，得到 DEM

和Slope 双因子交互如图 8a 所示，图中展示每个

各因子每个区间的分布情况，圆中的数字分别表

示当前区间下的样本量和平均滑坡易发性概率，

圆的大小表征样本量的多少，圆的颜色深浅表征

滑坡易发性概率的大小（成科扬等，2020）.

如图 8a 可知，当坡度在 0°~12.1°时堆积物抗滑

性能大于下滑性能，形成不了滑坡 .在高程不变的

情况下，滑坡易发性概率随着坡度的增大先增后

减，并在 21°~41.0°达到了最大；在同一坡度下随着

高程越大增大先增后减，并在 866.6~979.4 m 区间

达到了最大 .在高程为 866.6~979.4 m，Slope 在

21°~41.0°区间内的滑坡易发性概率（0.855）达到最

大.说明高程在 866.6~979.4 m 区间内利于滑坡堆

积成物质发育，容易接受一些坡积物和洪积物的沉

积.岩石经过风化以后，容易从高程为 979.4~

1 369.8 m 的区域通过坡积过程搬运到高程为

866.6~979.4 m 的区域，此时再遇到 19.1°~50.3°的

坡度即沉积下来的堆积物就容易发育成滑坡 .如图

8b 所示，在地形起伏度在 18.0~84.6 m 且不变的情

况下，滑坡易发性概率随着地形湿度的增大先增大

图 7　滑坡易发性预测单因子可解释性结果

Fig.7　One-way interpretable results for landslide susceptibility prediction

a. 坡度；b.NDVI

1706

第 5 期黄发明等：基于自筛选深度学习的滑坡易发性预测建模及其可解释性

后减小 .在地形起伏度为 33.1~84.6 m 和地形湿

度为 0~0.123 时，平均滑坡易发性概率（0.757）

达到最大 .在地形湿度较大的区域边坡堆积层

含水量较高且坡体抗剪强度较低，在合适的地

形起伏度下，容易发育成滑坡（冯霄等，2022）.

然而，当地形湿度过大时通常为水系发达地

区，该区域由于水体的冲刷作用，使得坡积物

和洪积物无法堆积反而不利于滑坡发育 .

4.3.3　滑坡环境因子贡献度解释　本文对逐条样

本根据各滑坡环境因子做积分梯度并取期望值，

获得各因子贡献度 .为简化计算基线设置为 0，近

似方法使用的步数设置为 50. 最后基于积分梯度

计算的滑坡环境因子对 SBiLSTM‐CRF 做决策的

贡献度总和结果显示：（1）地形地貌和基础地质

对滑坡贡献度较大，约为 0.31 和0.18.（2）地表覆

盖和水文环境对滑坡的贡献度较小，约为 -0.03

和0.02.（3）坡度、岩性、高程对滑坡影响起主要

作用，分别约为 0.14， 0.12 和0.08，这也与前面的

结果吻合 .（4） NDVI 为负贡献度（约- 0.04），

MNDWI 的贡献度几乎为 0，NDBI 的贡献度不足

0.01. 上述结果表明 NDVI、MNDWI 和NDBI 并

不是造成延长县滑坡发育的主要相关因子 .具

有黄土高原地貌的延长县的滑坡受坡度、岩

性、高程、坡向和地形起伏度的影响较为明显 .

5 结论

本文提出了一种基于 Bi‐LSTM 的新型自筛选

级联双向 LSTM‐CRF 网络模型开展滑坡易发性预

测建模 .模型与 CPLSTM‐CRF 和传统的机器学习

图8　滑坡易发性预测的双因子交互可解释性结果

Fig.8　Two-factor interactive interpretability results

a. 高程与坡度；b. 地形起伏度与 TWI

1707

第 48 卷

地球科学 http://www.earth‐science.net

（RF ， SVM ， SGD ， LR）模型相比表明，模型具

有显著的优越性，建模结果均优于其他模型 .

本文使用滑坡自然原理可解释性和深度学习网

络可解释性，使 S BiL STM ‐ CR F 从以往的黑盒

子开始白盒化 .结果表明就滑坡易发性数据集

而言，坡度、高程、岩性、地表起伏度和坡向等

滑坡因子控制了延长县堆积层滑坡发育 .在海

拔为 866.6~979.4 m ，坡度为 21.9°~41° ，地形起

伏度为 33.1~84.6 m ，岩性为 T3y，地形湿度为

0~0.123 时利于滑坡发育 .总之，SBiLSTM ‐

CRF 模型因为其非线性表征能力和可解释性

而拥有显著的滑坡易发性预测建模实用性 .

References

Alvarez ‐Melis, D., Jaakkola, T. S., 2018. Towards Robust

Interpretability with Self ‐ Explaining Neural Networks.

Advances in Neural Information Processing Systems 31

(NeurIPS 2018). https://arxiv.org/abs/1806.07538

Bhattacharya, P., Tanwar, S., Bodke, U., et al., 2021.

BinDaaS: Blockchain‐Based Deep ‐Learning as‐a‐Service

in Healthcare 4.0 Applications. IEEE Transactions on

Network Science and Engineering, 8: 1242-1255.

Cheng, K. Y., Wang, N., Shi, W. X., et al., 2020. Re‐

search Advances in the Interpretability of Deep Learn‐

ing. Journal of Computer Research and Development, 57

(6): 1208-1217 (in Chinese with English abstract).

Eiras, C., Souza, J., Freitas, R., et al., 2021. Discriminant

Analysis as an Efficient Method for Landslide Suscepti‐

bility Assessment in Cities with the Scarcity of Predispo‐

sition Data. N atural Hazards, 107: 1427- 1442.

Fang, R. K., Liu, Y. H ., Su, Y. C., et al., 2021. A Early

Warning Model of Regional Landslide in Qingchuan

County, Sichuan Province Based on Logistic Regres‐

sion. Hydrogeology & Engineering Geology, 48(1):

181-187 (in Chinese with English abstract).

Feng, X., Wang, Y., Liu, Y., et al., 2022. Susceptibility

Assessment of a Translational Rockslide Considering

the Control M echanism and Spatial U ncertainty of a

Weak Interlayer: Application Study in Tiefeng Town ‐

ship, Wanzhou District. Bulletin of Geological Science

and Technology, 41(2): 254- 266 (in Chinese with

English abstract).

Forney, G.D., 1973. The Viterbi Algorithm. Proceedings of

the IEEE, 61: 268-278.

Gaur, M., Faldu, K., Sheth, A., 2020. Semantics of the

Black ‐ Box: Can Knowledge Graphs Help Make Deep

Learning Systems More Interpretable and Explainable?.

IEEE Internet Computing, 25: 51-59.

Guo, G., Chen, Y., Li, M.H., et al., 2013. Statistic Rela‐

tionship between Slope Gradient and Landslide Proba‐

bility in Soil Slopes around Reservoir. Journal of Engi‐

neering Geology, 21(4): 607-612 (in Chinese with

English abstract).

Guo, T. S., Zhang, J. Q., Han, Y., et al., 2019. Evaluation

of Landslide Susceptibility in Yanchang County Based

on Particle Swarm Optim ization ‐Based Support Vector

Machine. Geological Science and Technology Informa‐

tion, 38(3): 236-243 (in Chinese with English abstract).

Hong, H. Y., Tsangaratos, P., Ilia, I., et al., 2020. In‐

troducing a Novel Multi ‐ Layer Perceptron Network

Based on Stochastic Gradient Descent Optimized by a

Meta‐Heuristic Algorithm for Landslide Susceptibility

Mapping. The Science of the Total Environment,

742: 140549. https://doi. org/10.1016/j. scito‐

tenv.2020.140549

Hu, T., Fan, X., Wang, S., et al., 2020. Landslide Sus‐

ceptibility Evaluation of Sinan County Using Logistics

Regression Model and 3S Technology. Bulletin of Geo‐

logical Science and T echnology, 39(2): 113-121 (in

Chinese with English abstract).

Huang, F., Cao, Z., Jiang, S. H., et al., 2020a. Landslide

Susceptibility Prediction Based on a Semi ‐ Supervised

Multiple ‐ Layer Perceptron Model. L andslides, 17:

2919-2930.

Huang, F., Zhang, J., Zhou, C., et al., 2020b. A Deep

Learning Algorithm Using a Fully Connected Sparse Au‐

toencoder Neural Network for Landslide Susceptibility

Prediction. Landslides, 17: 217- 229.

Huang, F. M., Ye, Z., Yao, C., et al., 2020. Uncertainties

of Landslide Susceptibility Prediction: Different Attri‐

bute Interval Divisions of Environmental Factors and

Different Data ‐ Based Models. Earth Science, 45(12):

4535-4549 (in Chinese with English abstract).

Jia, G., Gariano, S.L., Tang, Q., 2021. Advancing Rainfall‐

Induced Landslide Detection Using Homogeneous Slope

Units and Distributed Rainfall Thresholds. EGU Gener‐

al Assembly Conference Abstracts, EGU21-6145.

Khan, R., Yousaf, S., Haseeb, A., et al., 2021. Exploring

a Design of Landslide Monitoring System. Complexity,

2021: 5552417. https://doi.org/10.1155/2021/5552417

Li, W. B., Fan, X. M ., Huang, F. M., et al., 2021. Uncer‐

tainties of Landslide Susceptibility Modeling under Dif‐

ferent Environm ental Factor Connections and Prediction

Models. Earth Science, 46(10): 3777-3795 (in Chinese

1708

第 5 期黄发明等：基于自筛选深度学习的滑坡易发性预测建模及其可解释性

with English abstract).

Luo, L. G., Pei, X. J., Huang, R. Q., et al. 2021. Evalua‐

tion of Landslide Susceptibility in Jiuzhaigou Scenic Spot

by CF and Logistic Regression Coupled with GIS. Jour‐

nal of Engineering Geology, 29: 526-535 (in Chinese

with English abstract).

Linardatos, P., Papastefanopoulos, V., Kotsiantis, S.,

2020. Explainable AI: A Review of Machine Learning

Interpretability M ethods. Entropy (Basel, Switzerland),

23(1): 18. https://doi.org/10.3390/e23010018

Lipton, Z.C., 2018. In Machine Learning, the Concept of In‐

terpretability is Both Important and Slippery. Queue,

16: 28.

Moragues, S., Lenzano, M. G., Lanfri, M., et al., 2020.

Analytic Hierarchy Process Applied to Landslide Suscep ‐

tibility Mapping of the North Branch of Argentino Lake,

Argentina. Natural Hazards, 105: 915-941.

Pradhan, B., 2013. A Comparative Study on the Predictive

Ability of the Decision Tree, Support Vector Machine

and Neuro ‐ Fuzzy Models in Landslide Susceptibility

Mapping Using GIS ‐Science Direct. Computers & Geo‐

sciences, 51: 350- 365.

Tian, N. M., Lan, H. X., Wu, Y. M., et al., 2020. Perfor‐

mance Comparison of BP Artificial Neural Network and

CART Decision Tree M odel in Landslide Susceptibility

Prediction. Journal of Geo ‐Information Science, 22(12):

2304-2316 (in Chinese with English abstract).

Wei, Z.J., 2020. Multi‐Stage Series Deep Convolutional Neu ‐

ral Network Medical Image Registration Method (Disser‐

tation). Hunan University, Changsha (in Chinese with

English abstract).

Wu, R. Z., Hu, X. D., Mei, H. B., et al., 2021. Spatial

Susceptibility Assessment of Landslides Based on Ran ‐

dom Forest: A Case Study from Hubei Section in the

Three Gorges Reservoir Area. Earth Science, (1): 321-

330 (in Chinese with English abstract).

Xu, M ., Feng, Q., Mei, Q., et al., 2018. Deep Type: On‐

Device Deep Learning for Input Personalization Service

with Minimal Privacy Concern. Proceedings of the ACM

on Interactive Mobile Wearable and Ubiquitous Tech ‐

nologies, 2: 1-26.

Yao, J., Qin, S., Qiao, S., et al., 2022. Application of a

Two ‐ Step Sampling Strategy Based on Deep Neural

Network for Landslide Susceptibility Mapping. Bulletin

of Engineering Geology and the Environment, 81:

1-20.

Yao, X ., Tham, L., Dai, F., 2008. Landslide Susceptibility

Mapping Based on Support Vector Machine: A Case

Study on Natural Slopes of Hong Kong, China. Geomor‐

phology, 101: 572-582.

Zeng, C.Y., Yan, K., Wang, Z.F., et al., 2021. Survey of

Interpretability Research on Deep Learning Models.

Computer Engineering and Applications, 57(8): 1- 9

(in Chinese with English abstract).

Zhang, Y., Tino, P., Leonardis, A., et al., 2021. A Survey

on Neural Network Interpretability. IEEE Transactions

on Emerging Topics in Computational Intelligence, 5:

1-17.

Zhao, H. B., Liu, R., Liu, Y. H., et al., 2022. Research on

Classification and Identification of M ine Microseismic

Signals Based on Deep Learning Method. Journal of

Mining Science and Technology, 7(2): 166-174 (in Chi‐

nese with English abstract).

Zhou, T., Wu, W., Peng, L., et al., 2022. Evaluation of

Urban Bus Service Reliability on Variable Time Hori‐

zons Using a Hybrid Deep Learning Method. Reliability

Engineering and System Safety, 217(3):108090.

Zhu, L., Huang, L. H., Fan, L. Y., et al., 2020. Landslide

Susceptibility Prediction Modeling Based on Remote

Sensing and a Novel Deep Learning Algorithm of a

Cascade ‐ Parallel Recurrent Neural Network. Sensors

(Basel, Sw itzerland), 20(6): 1576. https://doi. org/

10.3390/s20061576

附中文参考文献

成科扬, 王宁, 师文喜, 等, 2020. 深度学习可解释性研究进

展. 计算机研究与发展, 57(6): 1208-1217.

方然可, 刘艳辉 , 苏永超 , 等, 2021. 基于逻辑回归的四川青

川县区域滑坡灾害预警模型 . 水文地质工程地质, 48

(1): 181-187.

冯霄 , 王禹, 刘洋 , 等, 2022. 考虑软弱夹层控滑机制及其空

间不确定性的顺层岩质滑坡易发性评价 : 万州区铁峰

乡应用研究 . 地质科技通报, 41(2): 254-266.

郭果 , 陈筠, 李明惠, 等, 2013. 土质滑坡发育概率与坡度间

关系研究 . 工程地质学报, 21(4): 607-612.

郭天颂, 张菊清, 韩煜 , 等, 2019. 基于粒子群优化支持向量

机的延长县滑坡易发性评价 . 地质科技情报, 38(3):

236-243.

胡涛, 樊鑫, 王硕, 等, 2020. 基于逻辑回归模型和 3S 技术

的思南县滑坡易发性评价 . 地质科技通报, 39(2):

113-121.

黄发明, 叶舟 , 姚池 , 等, 2020. 滑坡易发性预测不确定性:

环境因子不同属性区间划分和不同数据驱动模型的影

响. 地球科学, 45(12): 4535-4549.

李文彬, 范宣梅 , 黄发明 , 等, 2021. 不同环境因子联接和预

测模型的滑坡易发性建模不确定性 . 地球科学, 46

1709

第 48 卷

地球科学 http://www.earth‐science.net

(10): 3777-3795.

罗路广 , 裴向军 , 黄润秋, 等，2021. GIS 支持下 CF 与 Lo‐

gistic 回归模型耦合的九寨沟景区滑坡易发性评价 . 工

程地质学报, 29：526-535.

田乃满, 兰恒星 , 伍宇明 , 等, 2020. 人工神经网络和决策树

模型在滑坡易发性分析中的性能对比 . 地球信息科学

学报 , 22(12): 2304-2316.

魏志军，2020. 多级串联深度卷积神经网络医学图像配准方

法(硕士学位论文 ). 长沙：湖南大学 .

吴润泽, 胡旭东 , 梅红波 , 等, 2021. 基于随机森林的滑坡空

间易发性评价: 以三峡库区湖北段为例 . 地球科学, 46

(1): 321-330.

曾春艳, 严康, 王志锋, 等, 2021. 深度学习模型可解释性研

究综述 . 计算机工程与应用, 57(8): 1-9.

赵洪宝, 刘瑞, 刘一洪, 等, 2022. 基于深度学习方法的矿山

微震信号分类识别研究 . 矿业科学学报, 7(2):

166-174.

1710

Literature review and research progress of landslide susceptibility mapping based on knowledge graph

Article

Full-text available

Jul 2023

Fei Guo

Landslide susceptibility mapping (LSM) is the foundation and critical part of landslide risk assessment. The bibliometric analysis of LSM literature can be applied to quantitatively analyze the research progress and development trend. The result will provide references for geological hazard risk assessment in China. In this study, based on the Web of Science and CNKI databases, the CiteSpace visual knowledge graph analysis tool has been used to carry out bibliometric analysis of LSM literature from 1985 to 2022. Moreover, the LDA analysis has been conducted on the abstract to subdivide the research in this field. The results showed that: (1) LSM is still a research hotspot at present. In China, there are a large number of studies and international cooperation about LSM. (2) Four of the top 10 authors in the number of published papers on LSM are from China. The institution that published the most papers on LSM is the Chinese Academy. The Chinese Journal of Geological Hazard and Control is the most popular Chinese journal and the Natural Hazards is the most popular English journals to publish LSM papers. The research on the subject of LSM has been greatly funded by the National Natural Science Foundation of China and the National Land and Resources Survey Project. (3) In the past five years, machine learning models (including deep learning, etc.) have been widely used as the most popular LSM models. (4) In order to achieve the simplification and intelligence of landslide susceptibility modeling and to improve the accuracy and practicability of the LSM results, the following parts of LSM, including the landslide inventory, conditioning factors, assessment unit, assessment model, connection methods and accuracy verification, need to be deeply explored in further studies.

基于信息量和卷积神经网络的黄土高原滑坡易发性评价

Article

Jan 2023

基于深度卷积神经网络和迁移学习的农村房屋洪涝灾害后受损等级分类

Article

Jan 2023

基于集成学习与贝叶斯优化的岩石抗压强度预测

Article

Jan 2023

Quantitative vulnerability analysis of buildings based on landslide intensify prediction

Article

Full-text available

Apr 2023

In view of the lack of research about landslide intensity prediction in the current quantitative vulnerability analysis of buildings, this paper innovatively proposes a quantitative analysis method of based on the combination of intensity empirical curve based on InSAR technology and spatial displacement prediction of secondary development of ABAQUS.Taking the Shimongmen landslide in the Three Gorges Reservoir Area as an example, the PS-InSAR method was adopted to calculate the cumulative displacement of the landslide in 2017-2020 and obtained the empirical curve of landslide intensity. The ABAQUS software was used to compile the subroutine of load and pore water pressure to calculate the cumulative displacement under extreme scenarios (reservoir water level drop with heavy rainfall) and predicted the vulnerability of buildings. The evaluation system of resistance was constructed by weighting eight indicators of PSO-Fuzzy AHP model, which can be combined with the landslide intensity to quantitatively evaluate the vulnerability of buildings. The results indicate that: (1) The evaluation system of resistance proposed in this paper can well present the structural characteristics of rural buildings in the Three Gorges Reservoir area, and has high evaluation accuracy. (2) The retrieved upper-intensity curve obtained by PS-InSAR is Ipu = 0.065 * Dtot 0.236which has higher prediction accuracy and effectively reduces false-negative errors. (3) The landslide intensity of extreme conditions simulated by ABAQUS increases with the increase of rainfall, the predicted vulnerability level of buildings increases, and the buildings with obvious deformation in the previous investigation are successfully warned. It is concluded that the landslide intensity prediction method and vulnerability analysis method proposed in this paper has high spatial identification and early warning accuracy, and realtime vulnerability mapping of buildings can be obtained through landslide intensity information.

Susceptibility Assessment of Translational Rockslide Considering Control Mechanism and Spatial Uncertainty of Weak Interlayer: Application Study in Tiefeng Township, Wanzhou District

Article

Full-text available

Mar 2022

Translational rockslide is one of the important disasters that endanger the safety of mountain towns. The dip slopes with weak interlayers are the areas prone to a translational rockslide. The regional translational rockslide susceptibility assessment should consider the sliding mechanism and spatial distribution uncertainty analysis of weak interlayers. Taking Tiefeng Town of Wanzhou District as the study area, based on the detailed investigation of the material, structure, and spatial distribution of weak interlayer, this paper analyzed the evolution mechanism of shale and mudstone developing into the sliding surface under the action of primary deposition, tectonic deformation and supergene transformation, and summarized the deformation and failure mechanism of the translational rockslide. Considering the spatial distribution uncertainty of weak interlayer, a calculation model of the vertical distribution of weak interlayer and the contribution of the weak interlayer to slip control in the effective control depth are proposed. The key factors that characterize the sliding structure, including the type of weak interlayer and the contribution degree of the weak interlayer, are selected as susceptibility assessment factors. Besides, four elements of topography, slope structure, hydrogeology, and human activities are considered. The susceptibility of a translational rockslide in the study area was assessed with slope units adopting the analytic hierarchy process. The investigation and assessment results show that the mud interlayers of the Jurassic Zhenzhuchong Formation and the shale layer of the Ziliujing Formation are the main potential slip surface of the translational rockslide in the study area. The extreme high susceptibility area and high susceptibility area account for 9.7 % and 25.8 % respectively. The distribution of the underlying weak interlayer and the excavation of the slope units are the main factors affecting the susceptibility to landslide disasters. Human activities such as house building and road excavation are the main triggering factors of the translational rockslide. Compared with the susceptibility assessment results without considering the factors of the weak layer, the results of the method proposed in this paper are more in accord with the fact.

不同环境因子联接和预测模型的滑坡易发性建模不确定性

Article

Full-text available

Jan 2021

Exploring a Design of Landslide Monitoring System

Article

Full-text available

Mar 2021
COMPLEXITY

Landslide is a critical natural geological hazard that causes severe damage to property, infrastructure, and humans. In general, some location-specific factors trigger a landslide. Wireless sensor network (WSN) is an enabling technology to monitor most of the parameters associated with these factors. A challenge in landslide monitoring through WSN is that each sensed data item might be critical whereas the underlying wireless communication is often unreliable. In case of landslides, the terrains have irregular shapes, providing harsh conditions for wireless communication thereby more data loss may be expected. This study focuses on the effect of lossy communication in WSN on the efficiency and accuracy of landslide monitoring systems. To this end, collaborative local data analysis is used to enable each node to decide locally whether its sensed data corresponds to a potential event-of-interest. Through extensive simulations, the performance of various landslide prediction and detection models has been evaluated. By and large, the study lets a significant insight into landslide monitoring before implementing a parameterized application for real-world deployment.

Discriminant analysis as an efficient method for landslide susceptibility assessment in cities with the scarcity of predisposition data

Article

Full-text available

Mar 2021
NAT HAZARDS

The city of Ouro Preto, which is located in the state of Minas Gerais, Brazil, has a long history of mass movements influenced by the regional geology, geomorphology, and anthropic activities, which have resulted in harmful consequences to the population. However, most of the studies conducted in the region are qualitative and are directly dependent on the experience specialists. The aim of this study was to analyse the landslide susceptibility in the urban region of Ouro Preto quantitatively by using discriminant analysis. The landslide inventory was obtained by using unmanned aerial vehicle images and fieldwork. ArcGIS 10.6 and R 3.5.1 software were used, and the following landslide predisposing factors were considered: slope angle, slope aspect, profile curvature, and topographic wetness index (TWI). As geological and geotechnical data are still scarce in the interior of Brazil, we only used data derived from topography to determine the effectiveness of these factors for analysing landslide susceptibility. The slope angle proved to be the factor that most differentiated unstable from stable terrain, followed by TWI. The other parameters were not as effective in differentiating the stability conditions. The model efficiency was 88.6%, the specificity was 93.3%, and the sensitivity was 85.0%. Also, the prediction and success curve were used to evaluate the accuracy of the proposed landslides model, by using the area under the curve (AUC) criteria. It was shown that the AUC values 0.851 for testing and 0.838 for training indicate that the developed model provides an excellent prediction. The main contribution of this work is the demonstration of the effectiveness of using easily accessible data (derived from topography) for analysing landslide susceptibility with a multivariate statistical method. This method can contribute valuable information to urban planning efforts in cities without the need for robust data.

Application of a two-step sampling strategy based on deep neural network for landslide susceptibility mapping

Article

Apr 2022

The selection of nonlandslide samples is a key issue in landslide susceptibility modeling (LSM). In view of the potential subjectivity and randomness in random sampling, this paper considers LSM as a positive-unlabeled (PU) learning problem and proposes a two-step deep neural network framework (T-DNN). Through the Spy technique and iteratively training binary classifiers, negative samples with high confidence were identified from the random subsamples with unlabeled sets. Based on the framework and traditional random sampling, we used logistic regression (LR), support vector machine (SVM), and deep neural network (DNN) models for testing and validation. Taking the Changbai Mountain Area in Jilin Province, China, as an example, according to the regional landslide list and the metrological, geographical, and human factors of frequent disasters, landslide susceptibility was evaluated. Results show that the proposed T-DNN method can enhance the selection of negative samples and make the results of landslide susceptibility assessment more reliable and accurate; the area under the receiver operating characteristic curve (AUC) reaches 0.953. In addition, compared with traditional random negative sample sampling, the optimized sample set shows more stable and superior prediction performance in different classifiers.

Towards robust interpretability with self-explaining neural networks

Article

Jan 2018

2018 Curran Associates Inc.All rights reserved. Most recent work on interpretability of complex machine learning models has focused on estimating a posteriori explanations for previously trained models around specific predictions. Self-explaining models where interpretability plays a key role already during learning have received much less attention. We propose three desiderata for explanations in general - explicitness, faithfulness, and stability - and show that existing methods do not satisfy them. In response, we design self-explaining models in stages, progressively generalizing linear classifiers to complex yet architecturally explicit models. Faithfulness and stability are enforced via regularization specifically tailored to such models. Experimental results across various benchmark datasets show that our framework offers a promising direction for reconciling model complexity and interpretability.

Evaluation of urban bus service reliability on variable time horizons using a hybrid deep learning method

Article

Jan 2022
RELIAB ENG SYST SAFE

Unreliable transit services can negatively impact transit ridership and discourage passengers from regularly choosing public transport. As the most important content of bus service reliability, accurate bus arrival prediction can improve travel efficiency for enabling a reliable and convenient transportation system. Accordingly, this paper proposes a novel deep learning method, i.e. variational mode decomposition long short-term memory (VMD-LSTM), for bus travel speed prediction in urban traffic networks using a forecast of bus arrival information on variable time horizons. The method uses the temporal and spatial patterns of the average bus speed series. The results show that the VMD-LSTM model outperforms other models on forecasting bus link speed series in future time intervals, whereas the artificial neural network model achieves the worst prediction. In conclusion, the VMD-LSTM method can detect irregular peaks of transit samples from a series of temporal or spatial variations and performs better on major and auxiliary corridors.

A Survey on Neural Network Interpretability

Article

Aug 2021

Along with the great success of deep neural networks, there is also growing concern about their black-box nature. The interpretability issue affects people’s trust on deep learning systems. It is also related to many ethical problems, e.g., algorithmic discrimination. Moreover, interpretability is a desired property for deep networks to become powerful tools in other research fields, e.g., drug discovery and genomics. In this survey, we conduct a comprehensive review of the neural network interpretability research. We first clarify the definition of interpretability as it has been used in many different contexts. Then we elaborate on the importance of interpretability and propose a novel taxonomy organized along three dimensions: type of engagement (passive vs. active interpretation approaches), the type of explanation, and the focus (from local to global interpretability). This taxonomy provides a meaningful 3D view of distribution of papers from the relevant literature as two of the dimensions are not simply categorical but allow ordinal subcategories. Finally, we summarize the existing interpretability evaluation methods and suggest possible research directions inspired by our new taxonomy.

基于随机森林的滑坡空间易发性评价：以三峡库区湖北段为例

Article

Jan 2021

Semantics of the Black-Box: Can Knowledge Graphs Help Make Deep Learning Systems More Interpretable and Explainable?

Article

Jan 2021

The recent series of innovations in deep learning (DL) have shown enormous potential to impact individuals and society, both positively and negatively. DL models utilizing massive computing power and enormous datasets have significantly outperformed prior historical benchmarks on increasingly difficult, well-defined research tasks across technology domains such as computer vision, natural language processing, and human-computer interactions. However, DL's black-box nature and over-reliance on massive amounts of data condensed into labels and dense representations pose challenges for interpretability and explainability. Furthermore, DLs have not proven their ability to effectively utilize relevant domain knowledge critical to human understanding. This aspect was missing in early data-focused approaches and necessitated knowledge-infused learning (K-iL) to incorporate computational knowledge. This article demonstrates how knowledge, provided as a knowledge graph, is incorporated into DL using K-iL. Through examples from natural language processing applications in healthcare and education, we discuss the utility of K-iL towards interpretability and explainability.

基于自筛选深度学习的滑坡易发性预测建模及其可解释性

Recommended publications

Landslide Susceptibility Prediction Modeling Based on Self-Screening Deep Learning Model

Landslide Susceptibility Prediction Based on the Information Value-Logistic Regression Model and Geo...

Landslide Susceptibility Prediction Using Sparse Feature Extraction and Machine Learning Models Base...

Uncertainties in landslide susceptibility prediction: Influence rule of different levels of errors i...