ArticlePDF Available

Deep learning–based autonomous concrete crack evaluation through hybrid image scanning

Authors:

Abstract and Figures

This article proposes a deep learning–based autonomous concrete crack detection technique using hybrid images. The hybrid images combining vision and infrared thermography images are able to improve crack detectability while minimizing false alarms. In particular, large-scale concrete-made infrastructures such as bridge and dam can be effectively inspected by spatially scanning the unmanned vehicle–mounted hybrid imaging system including a vision camera, an infrared camera, and a continuous-wave line laser. However, the expert-dependent decision-making for crack identification which has been widely used in industrial fields is often cumbersome, time-consuming, and unreliable. As a target concrete structure gets larger, automated decision-making becomes more desirable from the practical point of view. The proposed technique is able to achieve automated crack identification and visualization by transfer learning of a well-trained deep convolutional neural network, that is, GoogLeNet, while retaining the advantages of the hybrid images. The proposed technique is experimentally validated using a lab-scale concrete specimen with cracks of various sizes. The test results reveal that macro- and microcracks are automatically visualized while minimizing false alarms.
Content may be subject to copyright.
Original Article
Structural Health Monitoring
1–16
ÓThe Author(s) 2019
Article reuse guidelines:
sagepub.com/journals-permissions
DOI: 10.1177/1475921718821719
journals.sagepub.com/home/shm
Deep learning–based autonomous
concrete crack evaluation through
hybrid image scanning
Keunyoung Jang
1
, Namgyu Kim
2
and Yun-Kyu An
1
Abstract
This article proposes a deep learning–based autonomous concrete crack detection technique using hybrid images. The
hybrid images combining vision and infrared thermography images are able to improve crack detectability while minimiz-
ing false alarms. In particular, large-scale concrete-made infrastructures such as bridge and dam can be effectively
inspected by spatially scanning the unmanned vehicle–mounted hybrid imaging system including a vision camera, an infra-
red camera, and a continuous-wave line laser. However, the expert-dependent decision-making for crack identification
which has been widely used in industrial fields is often cumbersome, time-consuming, and unreliable. As a target con-
crete structure gets larger, automated decision-making becomes more desirable from the practical point of view. The
proposed technique is able to achieve automated crack identification and visualization by transfer learning of a well-
trained deep convolutional neural network, that is, GoogLeNet, while retaining the advantages of the hybrid images. The
proposed technique is experimentally validated using a lab-scale concrete specimen with cracks of various sizes. The test
results reveal that macro- and microcracks are automatically visualized while minimizing false alarms.
Keywords
Concrete crack detection, deep convolutional neural network, hybrid image scanning, vision image, infrared thermogra-
phy, structural health monitoring
Introduction
Crack is one of the critical damage types in concrete
used as a representative construction material. Initial
concrete cracks inevitably produced by shrinkage dur-
ing the curing process are not typically considered as
structural damage. However, severe structural-level
cracks generated by external loads may propagate
along the surface and through-thickness directions by
repeated external loads. The propagated cracks may
lead to serious structural problems such as strength
reduction, corrosion of reinforcing rebar, and even
structural failure. Therefore, concrete cracks need to be
detected and managed from their early stage from the
safety point of view. During the last few decades, the
expert-dependent visual inspection has been widely per-
formed to manage the concrete cracks. However, the
visual inspection is often time-consuming, labor inten-
sive, unreliable, and sometimes not applicable to inac-
cessible areas of a target structure.
1
To tackle the technical issues, a number of non-
destructive evaluation (NDE) techniques have been
proposed. Fiber optic sensors were embedded in a tar-
get structure to detect concrete cracks thanks to the
advantages of being thin, lightweight, and independent
from electromagnetic interference.
2
However, their ser-
vice life is often shorter than the design life of civil
infrastructures, making the embedded sensors difficult
to be replaced. Moreover, its contact sensing mechan-
ism may suffer from several technical problems such as
limited sensing area and imperfect bonding condition.
Contact-type ultrasonic techniques have also been pro-
posed as alternatives.
3–5
Although the ultrasonic tech-
niques have high crack detectability, ultrasonic waves
are highly attenuated in concrete materials. Moreover,
1
Department of Architectural Engineering, Sejong University, Seoul,
South Korea
2
Department of Civil and Environmental Engineering, Sejong University,
Seoul, South Korea
Corresponding author:
Yun-Kyu An, Department of Architectural Engineering, Sejong University,
209, Neungdong-ro, Gwangjin-gu, Seoul 143-747, South Korea.
Email: yunkyuan@sejong.ac.kr
the complex signal interpretation is typically required
due to the inhomogeneous characteristics of concrete
materials. They also require a number of spatial mea-
surement points to cover the large inspection area and
similarly share the contact sensing mechanism limita-
tions. As another contact-type NDE technique, impact-
echo techniques have been proposed.
6,7
They are easy
to use and suitable for single-side inspection, but unex-
pected reflections coming from structures’ boundaries
may interrupt the measured data analysis.
Alternatively, fiber-reinforced concrete techniques have
been proposed as a sensor-less technique.
8,9
By insert-
ing conductive fibers into concrete materials, the con-
crete structure is able to be used as a sensor itself.
However, their crack detectability highly depends on
the manufacturing process of the fiber-reinforced con-
crete, and the performance under environmental varia-
tions such as temperature and humidity changes is not
fully validated yet.
To overcome the limitations of the contact-type
NDE techniques, various non-contact NDE techniques
have been proposed. Digital image correlation (DIC)
technique compares digital photographs at different
deformation stages for crack detection.
10,11
However, it
is more suitable for well-controlled laboratory environ-
ments than field inspection due to the requirement of
precise camera alignment and reference points of a tar-
get surface. Compared to DIC, vision-based crack
detection technique is more practical and widely
accepted thanks to its advantages of simplicity, non-
contact, cost effectiveness, and intuitive interpretation
of data.
12,13
Recently, vision cameras have been com-
bined with robots
14
or unmanned aerial vehicles
(UAVs)
15–17
to detect cracks on inaccessible and exten-
sive areas of a target structure. However, the perfor-
mance of the vision technique highly depends on the
image capturing conditions such as capturing angle,
illuminance, and undesired contaminant in the air or
target surface, causing false alarms. Alternatively, one
of the promising techniques for crack detection is laser
infrared (IR) thermography. The laser IR thermogra-
phy is able to similarly provide intuitive crack images in
a fully non-contact way. The superiority of the laser IR
thermography over the vision-based techniques for
crack detection is that it can detect subsurface damage
as well as surface damage and is robust against sensing
environments by employing laser excitation sources,
which has been already proven by applying it to metal-
lic structures and semiconductor chips.
18,19
However,
the excessive sensitivity to crack detection may inversely
disturb precise crack evaluation in concrete because of
a number of non-structural-level initial concrete cracks.
As the amount of sensing data collected from a large
target structure is getting larger, the expert-dependent
data interpretation becomes more time-consuming and
cumbersome. Thus, there have been a number of trials
to automate the data interpretation in these days. In
particular, deep convolutional neural networks (CNNs)
have been applied to classify vision images for pave-
ment crack detection,
20
nuclear power plant damage
inspection,
21
steel box girder crack identification,
22
and
concrete crack detection.
23,24
Although a number of
effective CNN architectures have been developed and
proven depending on data types and applications, the
sensing data–driven false alarm issues could not be
resolved yet.
In this study, a hybrid image scanning (HIS) system
combining the vision and laser IR thermography tech-
niques is newly developed, and deep CNN–based
autonomous concrete crack evaluation algorithm is
proposed. The proposed technique has the following
superiorities over the existing techniques: (1) fully non-
contact, non-destructive, and fast crack evaluation even
in inaccessible areas of a large concrete structure can be
effectively achieved by mounting the HIS system onto
UAVs; (2) data-driven false alarms can be remarkably
reduced by retaining the advantages of vision and IR
images; (3) the limited field of view (FOV) issues of the
vision and IR cameras, which is one of the technical
hurdles on data analysis, are resolved by developing a
time–spatial-integrated (TSI) coordinate transform;
and (4) autonomous decision-making for crack detec-
tion is accomplished by employing a tailored deep
CNN process. The developed system and algorithm are
experimentally validated using lab-scale concrete speci-
men with various size cracks, as a core technology
before embedding onto UAVs.
This article is organized as follows. Section ‘‘The
HIS system’’ explains the configuration and working
principle of the HIS system. Then, section ‘‘The deep
CNN–based crack evaluation algorithm’’ shows the
overall deep CNN process including signal and image
processing. Subsequently, the HIS system and deep
CNN algorithm are experimentally validated using a
lab-scale concrete specimen with real cracks of various
sizes in section ‘‘Experimental validation.’’ This article
concludes with a brief summary and discussions in sec-
tion ‘‘Conclusion.’’
The HIS system
Figure 1 shows the HIS system composed of excitation,
sensing, and control units. The excitation unit consists
of a continuous-wave (CW) line laser, a line beam gen-
erator, a collimator, and a focusing lens, which gener-
ates thermal waves onto a target concrete structure.
The sensing unit comprising vision and IR cameras
records the surface condition and the corresponding
thermal wave propagation along the concrete structure
2Structural Health Monitoring 00(0)
while spatially scanning. Then, the control computer in
the control unit activates the excitation and sensing
units and analyzes the saved data using the control and
processing programs coded by LabVIEW
Ò
and
MATLAB
Ò
, respectively. The HIS system will be
mounted on UAVs for moving along the predeter-
mined scanning route as shown in Figure 1. Note that
the excitation and sensing units are synchronized with
the control unit and controlled by the control computer
in the control unit.
The detailed working principle of the HIS system is
as follows. Once the control computer in the control
unit sends out control signals to the excitation and sen-
sing units, the laser driver generates a current signal to
activate the CW laser emitting a point laser beam. The
point laser beam is transformed to a line-shaped laser
beam through the line beam generator in the excitation
unit. Once the line-shaped laser beam is focused onto a
target surface through the collimator and focusing lens,
the thermal waves are generated along the target sur-
face. Simultaneously, the vision and IR cameras in the
sensing unit are operated to acquire the surface condi-
tion and thermal wave responses. Here, the thermal
wave responses are measured by only the IR camera
because the invisible range laser source is used. When
the control signal is transmitted to UAVs from the con-
trol computer, the HIS system automatically scans the
target structure along the predetermined scanning
route. Then, the measured vision and IR images which
are time and spatially changed within each FOV are
instantaneously transmitted and saved in the control
computer as raw vision (V
R
) and IR (I
R
) images,
respectively. The V
R
and I
R
images need to be
processed for precise crack evaluation because they are
varied in the time and spatial domains.
The deep CNN–based crack evaluation
algorithm
Since the I
R
and V
R
images obtained in a broad area
become massive, the expert-dependent decision-making
is quite time-consuming and cumbersome. Thus, not
only signal or image processing but also the deep
learning–based autonomous decision-making process is
strongly desirable. The main superiority of the algo-
rithm is that the I
R
and V
R
images are simultaneously
used for the autonomous decision-making process,
making it possible to minimize false alarms. This chap-
ter explains how crack information is automatically
extracted and visualized from the I
R
and V
R
images.
The overall procedure of the proposed deep CNN–
based crack evaluation algorithm is shown in Figure 2.
Since the I
R
and V
R
images acquired by spatially scan-
ning the HIS system are continuously changed in the
time and spatial domains, precise crack evaluation is
difficult. Thus, the spatially scanned I
R
and V
R
images
as a function of time need to be eventually converted to
the spatially integrated images. The details of each step
are explained in the subsequent subsections.
Image distortion calibration
Since the I
R
and V
R
images are often distorted due to
the wide angles of the camera lens, distortion calibra-
tion is needed for precise crack evaluation. In this
Figure 1. Schematics of the proposed hybrid image scanning (HIS) system.
Jang et al. 3
study, the camera calibration algorithm developed by
Zhang
25
is used, because the IR camera can also be
assumed as a pin-hole vision camera model
26
s~
m=AR
jt½
~
Mwith A=
fxskew cfxcx
0fycy
001
2
6
43
7
5,
Rjt½=
r11 r12 r13 t1
r21 r22 r23 t2
r31 r32 r33 t3
2
6
43
7
5
ð1Þ
where sis an arbitrary scale factor; ~
m=½xyz1T
and ~
M=½XYZ1Trepresent the camera and
world coordinates, respectively; Aand ½Rjtare each
camera’s intrinsic and extrinsic parameters, respectively;
in particular, fxand fyare the focal lengths; cxand cyare
the principal points and skew cfxis the skew coefficient
in the camera’s intrinsic parameters; and rij and tkare
the rotation and translation matrices, respectively. The
pin-hole camera model describes the mathematical rela-
tionship between the three-dimensional (3D) real world
coordinate and the projection on the two-dimensional
(2D) image plane. The calibration marker represents the
3D real world coordinate. Without loss of generality,
the calibration marker is assumed on Z=0.Then,a
homography matrix (H)between the calibration marker
and the image is defined as
H=h1h2h3
½=lAr
1r2t½ð2Þ
where lis an arbitrary scalar and r1and r2denote the
elements of R. Given an image of the calibration mar-
ker, Hcan be estimated based on maximum likelihood
criterion.
25
Assuming that Gaussian noise with zero
mean and the covariance matrix Lmi, the maximum
likelihood estimation of Hcan obtained by minimizing
the following objective function
J=X
i
mi^
mi
ðÞ
TL1
mimi^
mi
ðÞwith ^
mi=1
hT
3
~
Mi
hT
1
~
Mi
hT
2
~
Mi

ð3Þ
where
hiis the ith row of H. Zhang et al. assume that
Lmi=s2Ifor all i. This is reasonable if points are
extracted independently with the same procedure.
Equation (3) becomes a non-linear least-squares prob-
lem, and the non-linear minimization is conducted
Figure 2. Overview of the deep CNN–based crack evaluation algorithm. The I
C
and V
C
images are the distortion-calibrated IR and
vision images obtained from the I
R
and V
R
images, respectively. The I
ROI
and V
ROI
images denote the time–spatial-integrated IR and
vision images through the TSI coordinate transformation, respectively. The I
P
image represents the signal-processed IR images, and
the V
D
image is the resultant images obtained by the deep CNN process of the V
ROI
image. Then, the I
M
images are the crack region
images of the I
P
image selected by matching the crack regions of V
D
. Next, the crack existence of the I
M
images is evaluated by the
deep CNN process, and the I
D
images include only crack information. Finally, the final image represents only crack features by
mapping the I
D
images on the V
D
image.
4Structural Health Monitoring 00(0)
using the Levenberg–Marquardt algorithm (LMA).
27
Non-linear optimization like an LMA requires an ini-
tial guess. Assuming initial guess to homogeneous
equations x=½
hT
1
hT
2
hT
3T, equation (3) can be
rewritten as
Lx=
~
MT0Tu~
MT
0T~
MTv~
MT

x=0ð4Þ
When npoints are obtained in one image, L becomes
a2n39 matrix. As xis defined up to a scale factor,
the solution is well known to be the right singular vec-
tor of L associated with the smallest singular value.
Since L is numerically poor, the results can be enhanced
by performing a simple data normalization. Once His
estimated, it gives using the fact that r1and r2are
orthonormal
28
hT
1ATA1h2=0ð5Þ
hT
1ATA1h1=hT
2ATA1h2ð6Þ
Each homography provides the two basic constraints
on the camera intrinsic. Three independent orientations
are sufficient to solve for camera intrinsic linearly. If A
is known by closed-form solution,
29
the extrinsic para-
meters can be readily obtained
r1=lA1h1
r2=lA1h2
t=lA1h3
ð7Þ
Once the intrinsic and extrinsic parameters of the IR
and vision cameras are obtained using the calibration
marker, the I
C
and V
C
images can be, respectively,
obtained from the I
R
and V
R
images using equation (1)
as shown in Figure 3.
TSI coordinate transform
Because the HIS system continuously moves along the
predetermined scanning route, the physical inspection
areas on the I
C
and V
C
images are also continuously
changed as a function of time. Thus, it is difficult to
analyze thermal wave propagation on the entire region
of interest (ROI) using the I
C
images. In this step, the
I
C
and V
C
images are, respectively, transformed to the
spatially integrated IR (I
ROI
) and vision (V
ROI
) images
using the TSI coordinate transform. Here, the I
C
and
V
C
images share the same ROI but different spatial
resolutions. Note that the I
C
images are more complex
to be analyzed than the V
C
images, because they depend
on the laser excitation parameters.
First, the analysis area exposed to laser excitation
needs to be determined within FOV because the line
laser excitation may not cover the entire FOV.
Assuming that the HIS system scans along the horizon-
tal direction (x-axis in Figure 4), the intensity profile of
the line laser beam typically follows a Gaussian distri-
bution
19
along the y-axis as shown in Figure 4. Thus,
the analysis area can be determined by tracing the mid-
points along the x-axis and their affected boundaries
along the y-axis. Here, the mid-points can be selected
using m(x)of the Gaussian distribution, and the
affected boundary is able to be obtained by calculating
95% confidence interval of the Gaussian distribution.
Note that the analysis area physically means where the
Figure 3. Image distortion calibration using a calibration marker.
Jang et al. 5
enough thermal energy is injected by the line laser
beam to induce thermal wave propagation within the
I
C
images.
Next, the determined analysis areas are spatially
integrated as a function of time using the following TSI
coordinate transform, making it possible to reconstruct
the I
ROI
and V
ROI
images as shown in Figure 5
x
y
t
2
43
5=
0
0
1=v
0
1
0
v
0
0
0
mxðÞ
0
2
43
5
x
y
t
1
2
6
43
7
5ð8Þ
where m(x)2s<y<m(x)+2s,vis the scanning speed,
and the superscript means the transformed
coordinate. The y-axis is constant because only hori-
zontal directional scanning is assumed in this study.
The TSI coordinate transform is based on the physical
phenomenon that a specific spatial point is heated and
subsequently cooled by the line laser exposure as time
passed. The x-axis data on the I
C
images can be
regarded as the thermal variation in the time domain at
a specific point of FOV, and the t-axis data on the I
C
images are able to be considered as the thermal change
in the spatial domain at a specific time. Thus, each data
can be converted into the new integrated ROI coordi-
nate, that is, x,y, and taxis, using equation (8).
The I
ROI
images eventually show as if the entire ROI is
simultaneously and uniformly heated and subsequently
cooled in the spatially stationary condition
Figure 4. Determination of the analysis area on the I
C
images.
Figure 5. Overview of the TSI coordinate transform.
6Structural Health Monitoring 00(0)
x
y

=00v
010

x
y
t
2
43
5ð9Þ
Similarly, the V
C
image can be reconstructed using
equation (9). Since there is no laser excitation, the data
are simply integrated in the spatial domain.
Phase mapping and spatial derivative
Since the time-varying I
ROI
images cannot properly
reveal multiple cracks, additional data processing pro-
cedures such as phase mapping and spatial derivative
are necessary for precise multiple crack visualization
(Figure 6). In particular, macrocracks are typically
overwhelmed, while microcracks are hidden due to the
amplitude difference of the crack-induced features. The
phase mapping process enables cracks of various sizes
to be effectively visualized by normalizing the crack-
induced features along all pixels of interest. First, all
pixel values of the I
ROI
images are transformed to the
complex values along the taxis using Hilbert
transform
30
Hx
,y,t
ðÞ=P1
pð
IROI x,y,tðÞ
ttdtð10Þ
where Pis the Cauchy value of the integral and tis the
short time interval. Then, the instantaneous phase val-
ues (u(x,y,t)) of each pixel are simply obtained
ux,y,t
ðÞ=arctan Im H x,y,t
ðÞ½
Re H x,y,t
ðÞ½

ð11Þ
where Re and Im represent the real and imaginary
parts, respectively. Equation (11) physically means that
the responses are normalized between –pand p,mak-
ing it possible to effectively visualize even hidden micro-
cracks. However, not only crack-induced feature but
also undesired noise components might be augmented
by the phase mapping process. Thus, the denoising pro-
cess is subsequently carried out. First, u(x,y,t)is
accumulated along the taxis
[x,yðÞ=X
t
ux,y,t
ðÞ ð12Þ
where [(x,y)is the accumulation data along the
taxis. Then, the spatial derivative is subsequently
applied to [(x,y)along the xdirection which is the
scanning direction assumed in this study
Fx
,yðÞ=[x,yðÞ
xð13Þ
where F(x,y)is the spatial derivative value. The I
P
image can be obtained by reassigning F(x,y)to the
xand y-coordinate. Finally, the I
P
image visualizes
multiple cracks without noise components in the static
conditions covering the entire ROI.
The deep CNN process
Once the V
ROI
image is reconstructed by the TSI coor-
dinate transform, cracks are automatically extracted
through the deep CNN process. In this study, a pre-
trained deep CNN model, that is, GoogLeNet,
31
is
used for transfer learning. The GoogLeNet is one of
the well-known multi-layered CNN models designed
for visual pattern classification. It consists of 22 layers
including 9 inception modules as well as general convo-
lutional layers as shown in Figure 7. Here, each incep-
tion module is designed with 1 31 convolutional
layers at the beginning before the large convolutions
for reducing the dimensionality of feature maps. The
detailed structure of the inception module is shown in
Figure 7. For transplanting the GoogLeNet model into
concrete crack detection, the last two layers, that is, the
softmax and classification layers, are retrained with a
training set, having two classification outputs, that is,
intact and crack.
As for network training and validation, in total
20,000 images including concrete crack and non-crack
(intact) images are prepared by augmenting and seg-
menting 200 raw images. The representative images are
shown in Appendix 1. Among them, 9000 crack images
and 9000 intact images are used for network training,
and the other 1000 crack images and 1000 intact images
Figure 6. Phase mapping and spatial derivative.
Jang et al. 7
are selected as the validation set. All the prepared
images are then resized to 224 3224 33 pixels, main-
taining the aspect ratio in consideration of the
GoogLeNet’s input layer. Here, the training and vali-
dation sets are definitely distinct from each other. The
stochastic gradient descent with momentum is used as
a solver with 20 train epochs, and an initial learning
rate of 0.0001 is used.
23
Note that a high-performance
graphics processing unit which has 12 GB memory and
3840 cores is employed to expedite the network training
and also classification processes.
Once the tailored deep CNN is trained, the V
ROI
image is fed to the network for automated crack detec-
tion. To reduce false alarms, the V
ROI
image is scanned
by 16 different-sized masks without an overlapping
region as shown in Figure 8. The ranges of the mask
sizes are 122–144 pixels and 163–192 pixels in the hori-
zontal and vertical axes, respectively. The
corresponding 16 different probability maps are then
obtained and averaged for establishing a single prob-
ability map as shown in Figure 8. Here, the probability
map has the same resolution as the V
ROI
image. Each
pixel on the probability map has a positive value rang-
ing from 0 to 1. As the pixel value is, for example,
closer to 1, it indicates that there is a higher probability
of crack existence within the pixel.
Next, potential crack regions can be defined in the
V
ROI
image by selecting the probability values exceed-
ing 20% as shown in Figure 9. However, a lot of noise
components are still included in the potential crack
regions. Thus, the statistical denoising process is subse-
quently conducted for precise crack evaluation. A med-
ian filter is applied to the potential crack regions, and
the probability density function of the corresponding
pixel values is estimated by fitting a Weibull distribu-
tion which is one of the extreme value distributions.
32
Figure 7. Overview of the deep CNN architecture established using GoogLeNet.
Figure 8. The deep CNN process using the V
ROI
image.
8Structural Health Monitoring 00(0)
Then, the threshold value corresponding to a one-sided
99% confidence interval is established and applied to
all the pixel values for constructing the V
D
image shown
in Figure 9.
Decision-making by image matching
Although the V
D
image is able to automatically pro-
vide clear crack information, there can be a number of
data-driven false alarms due to rough surface condi-
tions, illuminance, contaminants, and so on. On the
other hand, the I
P
image can minimize the false alarms
thanks to its robustness against the arbitrary distur-
bances.
33
To reduce such false alarms, the image
matching between the V
D
and I
P
images is performed
as shown in Figure 10. First, the pixel resolution of the
V
D
image is reduced to have the same pixel resolution
as the I
P
image, because the IR camera typically has a
much lower pixel resolution than the vision camera
one. Then, the potential crack regions are selected on
the V
D
image using rectangular masks. Subsequently,
the corresponding crack regions are automatically
marked with the same size rectangular masks on the I
P
image. The marked crack regions of the I
P
image are
then extracted and resized as the resolution of
224 3224 33 pixels, defined as the I
M
images, for the
deep CNN process as shown in Figure 10(a). Note that
the rectangular mask location might not be exactly
matched between the V
D
and I
P
images due to their
pixel resolution mismatch. Nevertheless, the subsequent
CNN results using the I
M
images are not significantly
affected, because the I
M
images are used for double-
checking crack existence rather than crack quantifica-
tion. Next, the deep CNN process is repeated on the I
M
images except for mask scanning, as shown in Figure
10(b). After the deep CNN process, only crack images,
coined as the I
D
images, are retained and mapped on
the V
D
image by resizing them to the original rectangu-
lar mask size as depicted in Figure 10(c) and (d).
Finally, the final image shows only crack information
by retaining the advantages of the vision and IR
images, making it possible to reduce the vision data–
driven false alarms and enhance the reliability of crack
evaluation.
Figure 9. The statistical denoising process.
Figure 10. The decision-making procedure: (a) selection of the potential crack regions, (b) deep CNN process, (c) crack
identification, and (d) crack mapping on the V
D
image.
Jang et al. 9
Experimental validation
Test setup
The HIS system is experimentally validated using a lab-
scale concrete specimen with multiple cracks. Figure 11
shows the lab-scale test setup of the HIS system with
the concrete specimen located 700 mm away from the
HIS system. The control computer sends out control
signals to the CW line laser to generate the line laser
beam with an invisible wavelength of 950 nm. The line
laser beam with the size of 5 3200 mm
2
is targeted
through a collimator to the specimen. The peak inten-
sity of the line laser beam is set to 25 mW/mm
2
, which
is chosen by considering the thermal conductivity of
concrete (0.8 W m
21
k
21
) and scanning speed (23 mm/
s). The corresponding thermal waves of the concrete
specimen are recorded for 22 s by the IR camera (A65,
FLIR) with a frame rate of 30 Hz, a spectral range of
3–5 mm, and a resolution of 640 3512 pixels. The sur-
face images of the concrete specimen are also recorded
for 22 s by the vision camera (Hero 4, GoPro) with a
frame rate of 30 Hz and a resolution of 3264 32448
pixels. Here, the IR and vision cameras share the ROI.
In this study, a scanning jig is used to simulate the
UAV-mounted scanning mechanism as the very first
stage. Note that the HIS system will be mounted onto
UAVs or unmanned robots instead of the scanning jig
for practical usages although its miniaturization and
optimization are still underway. In particular, the
weight of the CW line laser can be reduced up to less
than 1.5 kg using the lightweight ceramic cooler and
packaging case.
The specially designed concrete specimen has the
dimensions of 1000 3500 3100 mm
3
and the com-
pressive strength of 103 MPa. The specimen is pre-
pared by mixing cement, silica sand, fly ash, super-
plasticizer, and water. The detailed mixing composition
is summarized in Table 1. During the curing process,
150-mm-width acrylic slots are inserted to make artifi-
cial cracks. The generated artificial cracks are divided
into two types, that is, macrocrack (˜500 mm) and
microcrack (\500 mm), defined in this study for conve-
nience. In addition, a fake crack with 1 mm width is
created using a pencil for the positive false alarm test.
The target ROI with the dimensions of 750 3240 mm
2
is defined so that macro- and microcracks, fake crack,
and non-cracked areas can be included (Figure 12).
Test results
Image distortion calibration. Once the V
R
and I
R
images
are obtained using the HIS system, the V
C
and I
C
images are obtained by conducting the calibration pro-
cess using a calibration marker shown in Figure 13.
Figure 13 shows the representative V
R
and V
C
images,
revealing that the images captured by the wide-angle
vision camera are distorted and successfully calibrated.
TSI coordinate transform. As expected, the V
C
and I
C
images within the ROI are time and spatially changed,
making it difficult to be analyzed as they are. To recon-
struct the V
ROI
and I
ROI
images, the TSI coordinate
transform is performed. First, the m(x)and svalues are
computed on the I
C
images. The determined ROI has a
height of 240 mm equivalent to 533 pixels on the I
C
images. Subsequently, the V
ROI
and I
ROI
images are
constructed using the TSI coordinate transform as
defined in equations (8) and (9). Figure 14(a) and (b)
Table 1. Mixing composition of the concrete specimen (%).
Cement
(type)
Silica
sand
Fly ash Super-plasticizer Water
100 (III) 100 15 0.9 35
Figure 11. Lab-scale test setup of the HIS system.
10 Structural Health Monitoring 00(0)
shows the V
ROI
image and the representative I
ROI
image at 1 s after laser excitation at each spatial point,
respectively. Even though the line laser is sequentially
scanned along the x-axis within the ROI, it looks as if
the entire ROI is simultaneously and uniformly heated
on the I
ROI
images. In reality, laser heating is not per-
fectly uniform on the entire ROI, but thermal wave
generation is enough to analyze the crack existence
within the ROI. Although crack existence can be intui-
tively observed in the I
ROI
images, they are still changed
in the time domain and have a number of unwanted
noise components as displayed in Figure 14(b).
Phase mapping and spatial derivative. To precisely evaluate
multiple cracks, the phase mapping and spatial deriva-
tive processes are subsequently applied to the I
ROI
images. It can be observed from Figure 15 that cracks
of various sizes are clearly visualized in the I
P
image
without undesired noise components. In particular,
microcracks are well visualized regardless of the ampli-
tude difference even when macro- and microcracks
coexist in a single image.
The deep CNN process. To automatically detect cracks
in the V
ROI
image, the V
ROI
image is fed to the pre-
trained deep CNN. The mask resolutions ranging from
122 3163 to 144 3192 pixels are used in this study.
Figure 12. Concrete specimen with various cracks and the target ROI.
Figure 13. Representative V
R
and V
C
images at 10 s.
Figure 14. TSI coordinate transform results: (a) V
ROI
image
and (b) representative I
ROI
image at 1 s.
Jang et al. 11
The representative outputs of the CNN process are dis-
played in Appendix 2. The entire deep CNN process is
repeated three times, and then its probability maps are
averaged to reduce the resultant errors. Based on the
probability map, the potential crack regions in the
V
ROI
image can be identified. Subsequently, the statisti-
cal denoising process is conducted, and the V
D
image is
obtained by mapping the potential crack locations on
the V
ROI
image as shown in Figure 16(a). Here, the rec-
tangular masks indicate the possible crack regions. The
performance of the deep CNN process using the vision
image can be evaluated by calculating the reliability
indices such as precision and recall
Precision =Tp
Tp +Fp ð14Þ
Recall =Tp
Tp +Fn ð15Þ
where Tp, Fp, and Fn represent the true positive, false
positive, and false negative, respectively. The precision
and recall values are computed as 59.84% and 97.26%,
respectively. The precision value physically representing
the false-positive alarm is relatively low, because the
fake crack is recognized by the deep CNN process as a
real crack. On the other hand, the recall value corre-
sponding to the false-negative alarm is relatively high,
meaning that cracks of various sizes are successfully
detected by the vision-based deep CNN process thanks
to the well-controlled laboratory conditions.
Decision-making by image matching. In order to reduce the
vision data–driven false alarms, the image matching
process is performed. The potential crack regions on
the I
P
image are selected by matching the crack loca-
tions on the V
D
image as shown in Figure 16. Then, the
extracted I
M
images are all tested by the deep CNN
process again. Only crack images are saved as the I
D
images and used for mapping on the V
D
image,
resulting in the final image. Otherwise, non-crack
images identified by the deep CNN process are dis-
carded as shown in Figure 10. The image matching
results show that the fake crack outlined by the dash-
single dotted box (yellow in color) is clearly filtered out
on the I
D
image as shown in Figure 16(b). Finally,
Figure 17 shows the final image containing only crack
information. To compare the performances of crack
detectability between the V
D
and final images, Table 2
summarizes the precision and recall indices. The two
indices are clearly enhanced when the hybrid images
including vision and IR images are used for the deep
CNN process. In particular, the precision index is
remarkably increased due to the fake crack. Then, the
recall index is also increased, meaning that the final
Figure 15. The I
P
image.
Figure 16. The deep CNN process results with crack masks:
(a) V
D
image and (b) I
D
images mapped on the I
P
image.
Table 2. Comparison of crack detectability between the vision
and final images (%).
Vision image
(V
D
image)
Final image
(V
D
and I
D
images)
Precision 59.84 98.72
Recall 97.26 99.23
12 Structural Health Monitoring 00(0)
image provides higher reliability for crack detection
than the V
D
image.
Integration of the HIS system with UAVs
Since the HIS system used in the preliminary indoor
tests is relatively large and heavy, its miniaturization
and packaging is now progressing to reduce the physi-
cal size and weight for mounting onto UAVs. The mod-
ified HIS system will be mounted onto the sticking-type
UAV as shown in Figure 18. The sticking-type UAV
can inspect a target structure by sticking the target sur-
face, making it possible to effectively reduce movement
and vibration during data acquisition of the HIS sys-
tem. Also, the effective working distance between the
HIS system and the target surface is able to be retained
for precise crack evaluation. Furthermore, the sticking-
type UAV is robust against unexpected turbulence
around the large civil infrastructures. The correspond-
ing outdoor tests will be performed on cable-stayed
grand bridges in South Korea.
Conclusion
This study presented deep learning–based concrete
crack detection using hybrid images. An HIS system
combining vision and IR thermography images was
newly developed for unmanned vehicle or robot-
mounted autonomous crack inspection of large-scale
concrete structures. Then, a deep CNN–based autono-
mous crack detection algorithm using the hybrid
images was proposed. The proposed system and algo-
rithm were experimentally validated using a lab-scale
concrete specimen with cracks of various sizes as the
very first stage of this concept. The test results revealed
that macro- and microcracks automatically and suc-
cessfully are visualized by minimizing false alarms. In
particular, false-negative and false-positive alarms were
remarkably reduced using hybrid images compared to
using only the vision image, resulting in crack detection
reliability improvement.
As a follow-up study, the proposed HIS system is
now being miniaturized and optimized to mount it onto
UAVs. Then, outdoor tests under varying environmen-
tal and operational conditions will be thoroughly car-
ried out, and it will be applied to real civil
infrastructures such as bridges, dams, and buildings.
Various real data such as shadow, dust on the surface,
rust, etc. will be additionally trained for real applica-
tion. The proposed technique is able to become a pro-
mising crack inspection alternative in large civil
infrastructures by minimizing inspection time, false
alarms, and unreliable experts’ intervention.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with
respect to the research, authorship, and/or publication of this
article.
Funding
The author(s) disclosed receipt of the following financial sup-
port for the research, authorship, and/or publication of this
article: The research described in this article was financially
Figure 17. The final image.
Figure 18. Sticking-type UAV with the HIS system: (a)
schematic design and (b) the prototype model.
Jang et al. 13
supported by a grant (17SCIP-C116873-02) from
Construction Technology Research Program funded by the
Ministry of Land, Infrastructure and Transport of Korean
government and Basic Science Research Program of the
National Research Foundation of Korea (NRF) funded by
the Ministry of Science, ICT and Future Planning
(2015R1C1A1A01052625).
References
1. Chang P, Flatau A and Liu S. Review paper: health mon-
itoring of civil infrastructure. Struct Health Monit 2003;
2: 257–267.
2. Maheshwari M, Annamdas V, Pang J, et al. Crack moni-
toring using multiple smart materials; fiber-optic sensors
& piezo sensors. Int J Smart Nano Mater 2017; 8: 41–55.
3. Dumoulin C and Deraemaeker A. Real-time fast ultraso-
nic monitoring of concrete cracking using embedded
piezoelectric transducers. Smart Mater Struct 2017; 26:
104006.
4. Ham S, Song H, Oelze M, et al. A contactless ultrasonic
surface wave approach to characterize distributed crack-
ing damage in concrete. Ultrasonics 2017; 75: 46–57.
5. Menendez E, Victores J, Montero R, et al. Tunnel struc-
tural inspection and assessment using an autonomous
robotic system. Automat Constr 2018; 87: 117–126.
6. Hlava Z. Detection of crack in a concrete element by
impact-echo method. Ultrasound 2009; 64: 12–16.
7. Li B, Ushiroda K, Yang L, et al. Wall-climbing robot for
non-destructive evaluation using impact-echo and metric
learning SVM. Int J Intell Robot Appl 2017; 1: 255–270.
8. Han B, Zhang K, Yu X, et al. Electrical characteristics
and pressure-sensitive response measurements of carboxyl
MWNT/cement composites. Cement Concrete Compos
2012; 34: 794–800.
9. Chen P and Chung D. Carbon fiber reinforced concrete
for smart structures capable of non-destructive flaw
detection. Smart Mater Struct 1993; 2: 22–33.
10. McCormick N and Lord J. Digital image correlation.
Mater Today 2010; 13: 52–54.
11. Helm J. Digital image correlation for specimens with mul-
tiple growing cracks. Exp Mech 2008; 48: 753–762.
12. Jahanshahi M, Masri S, Padgett C, et al. An innovative
methodology for detection and quantification of cracks
through incorporation of depth perception. Mach Vision
Appl 2013; 24: 227–241.
13. Koch C, Paal S, Rashidi A, et al. Achievements and chal-
lenges in machine vision-based inspection of large con-
crete structures. Adv Struct Eng 2014; 17: 303–318.
14. Ho H, Kim K, Park Y, et al. An efficient image-based
damage detection for cable surface in cable-stayed
bridges. NDT&E Int 2013; 58: 18–23.
15. Kim H, Ahn E, Cho S, et al. Comparative analysis of
image binarization methods for crack identification in
concrete structures. Cement Concrete Res 2017; 99: 53–61.
16. Zhong X, Peng X, Yan S, et al. Assessment of the feasi-
bility of detecting concrete cracks in images acquired by
unmanned aerial vehicles. Automat Constr 2018; 89:
49–57.
17. Ellenberg A, Kontsos A, Moon F, et al. Bridge related
damage quantification using unmanned aerial vehicle ima-
gery. Struct Control Health Monit 2016; 23: 1168–1179.
18. An YK, Yang J, Hwang S, et al. Line laser lock-in ther-
mography for instantaneous imaging of cracks in semi-
conductor chips. Opt Laser Eng 2015; 73: 128–136.
19. Yang J, Hwang S, An YK, et al. Multi-spot laser lock-in
thermography for real-time imaging of cracks in semicon-
ductor chips during a manufacturing process. J Mater
Process Tech 2016; 229: 94–101.
20. ZhangA,WangK,LiB,etal.Automatedpixel-levelpave-
ment crack detection on 3D asphalt surfaces using a deep-
learning network. Comput-Aided Civ Inf 2017; 32: 12297.
21. Chen FC and Jahanshahi R. NB-CNN: deep learning-
based crack detection using convolutional neural network
and Naı
¨ve Bayes data fusion. IEEE T Ind Electron 2017;
65: 17519431.
22. Xu Y, Bao Y, Chen J, et al. Surface fatigue crack identifi-
cation in steel box girder of bridges by a deep fusion con-
volutional neural network based on consumer-grade
camera images. Struct Health Monit. Epub ahead of print
2 April 2018. DOI: 10.1177/1475921718764873.
23. Cha YJ, Choi W and Bu
¨yu
¨ko
¨ztu
¨rk O. Deep learning-
based crack damage detection using convolutional neural
networks. Comput Aided Civil Inf 2017; 32: 361–378.
24. Kim H, Ahn E, Shin M, et al. Crack and noncrack classi-
fication from concrete surface images using machine
learning. Struct Health Monit. Epub ahead of print 23
April 2018. DOI: 10.1177/1475921718768747.
25. Zhang Z. A flexible new technique for camera calibration.
IEEE T Pattern Anal Mach Intell 2010; 22: 1330–1334.
26. Vidas S, Lakemond R, Denman S, et al. A mask-based
approach for the geometric calibration of thermal-infrared
cameras. IEEE T Instrum Meas 2012; 61: 1625–1635.
27. More J. The Levenberg-Marquardt algorithm: implemen-
tation and theory. In: Watson GA (ed.) Lecture notes in
mathematics, vol. 630. New York: Springer, 1977.
28. Kanatani K, Ohta N and Kanazawa Y. Optimal homo-
graphy computation with a reliability measure. IEICE T
Inform Syst 2000; E83-D: 1369–1374.
29. Brown D. Close-range camera calibration. Photogram
Eng 1971; 37: 855–866.
30. Hahn S. Hilbert transforms in signal processing.Nor-
wood, MA: Artech House, 1996.
31. Szegedy C, Liu W, Jia Y, et al. Going deeper with convo-
lutions. In: Proceedings of the IEEE conference on com-
puter vision and pattern recognition (CVPR), Boston,
MA, 7–12 June 2015. New York: IEEE.
32. An YK, Park B and Sohn H. Complete noncontact laser
ultrasonic imaging for automated crack visualization in a
plate. Smart Mater Struct 2013; 22: 025022.
33. An YK, Kim J and Sohn H. Laser lock-in thermography
for detection of surface-breaking fatigue cracks on
uncoated steel structures. NDT&E Int 2014; 65: 54–63.
14 Structural Health Monitoring 00(0)
Appendix 1. Representative training images.
Jang et al. 15
Appendix 2. Representative outputs of the CNN process.
16 Structural Health Monitoring 00(0)
... Sakagami et al.(2016) confirmed that infrared stress measurement of structures helps detect defects in bridges. Jang et al.(2019) constructed a Convolutional Neural Network (CNN) model by transfer learning to predict concrete cracks using hybrid images. They combine vision and infrared thermography images to achieve high accuracy. ...
Article
Full-text available
Carbon Fiber Reinforced Plastic (CFRP) is a composite material consisting of a resin matrix and carbon fiber reinforcement. The material is also commonly used in a prepreg laminate, a unidirectional reinforcing material with high strength and stiffness in one direction. Damage to laminates is highly complex, including delamination, fiber fracture, and base matrix cracking, requiring highly efficient and accurate non-destructive testing. Infrared stress measurement is an example of a non-destructive testing method for CFRP structures. The infrared stress measurement is based on Kelvin’s theory to convert the surface temperature fluctuation under cyclic loading to the distributions of sum of principal stress on surface (DSPSS), so it may result in a stress distribution that differs from the actual distribution due to transient heat conduction. It is necessary to consider structural analysis and transient heat conduction in the numerical analysis to reproduce DSPSS obtained by infrared stress measurement. This study performs a finite element analysis with transient heat conduction on simple shaped CFRP specimens to reproduce the trend of DSPSS obtained by infrared stress measurement. Firstly, DSPSS generated by the forced displacement of a CFRP specimen is converted to a temperature distribution using Kelvin’s thermoelastic theory. Finally, a transient heat conduction analysis is performed, and the distribution trend is discussed using the obtained temperature distribution as the initial value. A sheet of Teflon is inserted into the CFRP specimen as a defect, assuming foreign matter contamination during the manufacturing process. Previous study predicts the internal defect information by a machine learning model using the DSPSS from numerical analysis. There is a potential for a high-accuracy defect prediction using DSPSS obtained by the infrared stress measurement if DSPSS obtained by the infrared stress measurement and the numerical analysis have similar tendencies.
... In the field of infrastructures, other materials traditionally used for construction have been the object of study and application of IRT techniques in combination with AI. One of the topics of interest is the automatic detection of cracks in both steel and concrete (Jang et al., 2019) elements by applying DL models together with thermal images, mainly in cases of bridges (Ali & Cha, 2019) and delamination of their decks (Cheng et al., 2020). The detection of internal defects has also been subject of analysis toward its resolution with DL algorithms: Qiang Fang et al. (2020) have performed an analysis of different DL algorithms for automatic defect detection and precise localization in the case of subsurface defects. ...
... Task Algorithms Kadarla et al. [30] Crack propagation using video footage CNN Vundekode et al. [33] Identifying three veritas of damages ANN Jery Hola et al. [35] Assessment of compressive strength ANN Manish et al. [36] Compressive strength prediction ANN Younq et al. [37] Concrete compressive strength ANN Gustavo H. Beckman et al. [38] Concrete spalling damage detection and quantification CNN Keunyoung Jang et al. [39] Detection of micro and macro concrete cracks CNN FuTao Ni et al. [40] Concrete thin crack identification and width measurement CNN AT Huynh et al. [41] Prediction of compressive strength ANN, DNN Narazaki Y et al. [42] Damage detecting and quantification CNN Asteris P et al. [43] Concrete compressive strength ANN conditions as shown in Table 2. Surface images may be tainted with noise, shadow, dust, or excess brightness, necessitating more robust and sophisticated categorization systems. Several researches have tackled similar practical issues, depending on the uses. ...
Conference Paper
Full-text available
Recent advancements in sensor technology, as well as fast progress in internet-based cloud computation; data-driven approaches in structural health monitoring (SHM) are gaining prominence. The majority of time is utilized for reviewing & analyzing the data received from various sensors deployed in structures. This data analysis helps in understating the structural stability and its current state with certain limitations. Considering this fact, integration with Machine Learning (ML) in SHM has attracted significant attention among researchers. This paper is principally aimed at understanding and reviewing of vast literature available in sensor-based data-driven approaches using ML. The implementation and methodology of vibration-based, vision-based monitoring, along with some of the ML algorithms used for SHM are discussed. Nevertheless, a perspective on the importance of data-driven SHM in the future is also presented. Conclusions are drawn from the review discuss the prospects and potential limitations of ML approaches in data-driven SHM applications.
... A recently widely used pretraining technique for transfer learning based on deep learning is indeed a model-based approach [20]; 2. Feature-based transfer learning approach. The knowledge to be transferred corresponds to the subspace spanned by the damage-sensitive features in the source and target domains. ...
Article
Full-text available
Bridge authorities have been reticent to integrate structural health monitoring into their bridge management systems, as they do not have the financial and technical resources to collect long-term monitoring data from every bridge. As bridge authorities normally own huge amount of similar bridges, like the pedestrian ones, the ability to transfer knowledge from one or a small group of well-known bridges to help make more effective decisions in new bridges and environments has gained relevance. In that sense, transfer learning, a subfield of machine learning, offers a novel solution to periodically evaluate the structural condition of all pedestrian bridges using long-term monitoring data from one or more pedestrian bridges. In this paper, the applicability of unsupervised transfer learning is firstly shown on data from numerical models and then on data from two similar pedestrian prestressed concrete bridges. Two domain adaptation techniques are used for transfer learning, where a classifier has access to unlabeled training data (source domain) from a reference bridge (or a small set of reference bridges) and unlabeled monitoring test data (target domain) from another bridge, assuming that both domains are from similar but statistically different distributions. This type of mapping is expected to improve the classification accuracy for the target domain compared to a procedure that does not implement domain adaptation, as a result of reducing distributions mismatch between source and target domains.
... The experimental results show that the method can successfully distinguish different types of pavement damage. Jang et al. [28] proposed a pre-trained deep convolutional network model based on ImageNet, combined with the transfer learning method, and fine-tuned the model to improve training accuracy and reduce time overhead. Similarly, Gopalakrishnan et al. [29] proposed using a pre-trained deep convolutional neural network model and transfer learning method for automatic pavement crack detection. ...
Article
Full-text available
The deep learning method has been widely used in the engineering field. The availability of the training dataset is one of the most important limitations of the deep learning method. Accurate prediction of pavement performance plays a vital role in road preventive maintenance (PM) and decision-making. Pavement performance prediction based on deep learning has been widely used around the world for its accuracy, robustness, and automation. However, most of the countries in the world have not built their pavement performance historical database, which prevents preventive maintenance using the deep learning method. This study presents an innovative particle swarm optimization (PSO) algorithm-enhanced two-stage TrAdaBoost.R2 transfer learning algorithm, which could significantly increase the pavement performance prediction database. The Long-Term Pavement Performance (LTPP) database is used as the source domain data, and one of the highways in China is chosen as the target domain to predict pavement performance. The results show that the proposed PSO-Two-stage TrAdaBoost.R2 model has the highest accuracy compared with AdaBoost.R2 model and traditional regression decision tree model. The validation case study shows significant consistency between the predicted International Roughness Index (IRI) and the whole-year measurement data with an R2 of 0.7. This study demonstrates the great potential of the innovative instance-based transfer learning method in pavement performance prediction of a region’s lack of data. This study also contributes to other engineering fields that could greatly increase the universality of deep learning.
Article
Purpose This study aims to obtain methods to identify and find the place of damage, which is one of the topics that has always been discussed in structural engineering. The cost of repairing and rehabilitating massive bridges and buildings is very high, highlighting the need to monitor the structures continuously. One way to track the structure's health is to check the cracks in the concrete. Meanwhile, the current methods of concrete crack detection have complex and heavy calculations. Design/methodology/approach This paper presents a new lightweight architecture based on deep learning for crack classification in concrete structures. The proposed architecture was identified and classified in less time and with higher accuracy than other traditional and valid architectures in crack detection. This paper used a standard dataset to detect two-class and multi-class cracks. Findings Results show that two images were recognized with 99.53% accuracy based on the proposed method, and multi-class images were classified with 91% accuracy. The low execution time of the proposed architecture compared to other valid architectures in deep learning on the same hardware platform. The use of Adam's optimizer in this research had better performance than other optimizers. Originality/value This paper presents a framework based on a lightweight convolutional neural network for nondestructive monitoring of structural health to optimize the calculation costs and reduce execution time in processing.
Article
Composite materials are widely used across major industries such as the automotive, aerospace and wind power, due to their excellent mechanical properties. A strong effort is thus put into developing innovative damage detection methodologies, for which Non-Destructive Testing (NDT) techniques can play a vital role as advanced measurement methods. One such technique is Laser Doppler Vibrometry which allows to accurately measure high-frequency vibration behavior with dense grids of points, without mass loading the structure. For data analysis, Machine Learning (ML) techniques have achieved high success on a number of structural applications, and can be leveraged to build automated and reliable damage classifiers. In this work, three methodologies have been developed by combining Laser Doppler Vibrometer (LDV) measurements with ML approaches, for the task of detecting damages on a carbon-fiber reinforced polymer (CFRP) plate. Each damage detection methodology requires pre- and post-processing steps, which were optimized with Bayesian Optimization. Principal Component Analysis (PCA) was also explored to reduce the dimensionality of the data, before classification. Moreover, making use of Finite Element Analysis (FEA), simulation data was generated with the ability of characterizing the high-frequency dynamic behavior of ply-based composite plates. The simulation data enriched the damage detection methodologies in a Transfer Learning (TL) framework. Results are presented for each damage detection methodology, alongside with a comparative overview of the advantages and disadvantages of each method.
Article
Full-text available
In concrete structures, surface cracks are important indicators of structural durability and serviceability. Generally, concrete cracks are visually monitored by inspectors who record crack information such as the existence, location, and width. Manual visual inspection is often considered ineffective in terms of cost, safety, assessment accuracy, and reliability. Digital image processing has been introduced to more accurately obtain crack information from images. A critical challenge is to automatically identify cracks from an image containing actual cracks and crack-like noise patterns (e.g. dark shadows, stains, lumps, and holes), which are often seen in concrete structures. This article presents a methodology for identifying concrete cracks using machine learning. The method helps in determining the existence and location of cracks from surface images. The proposed approach is particularly designed for classifying cracks and noncrack noise patterns that are otherwise difficult to distinguish using existing image processing algorithms. In the training stage of the proposed approach, image binarization is used to extract crack candidate regions; subsequently, classification models are constructed based on speeded-up robust features and convolutional neural network. The obtained crack identification methods are quantitatively and qualitatively compared using new concrete surface images containing cracks and noncracks.
Article
Full-text available
This paper presents the ROBO-SPECT European FP7 project, funded under the ICT-2013.2.2 programme on Robotics use cases & Accompanying measures, a robotized alternative to manual tunnel structural inspection and assessment of cracks and other defects. Physical developments include the design and implementation of a multi-degree-of-freedom robotic system, composed by a mobile vehicle, an extended crane, and a high precision robotic arm. A semi-supervised computer vision system to detect tunnel defects, and a ultrasonic sensor (US) robotic tool to measure width and depth of detected cracks have been also developed. An overview of defect detection methods, as well as the fundamental aspects of the project architecture and design, will be presented. In addition, the developed and implemented arm tip positioning algorithm for the US robotic tool shall be detailed. Finally, experimental evidence and details on validation in a real motorway tunnel with ongoing traffic will be provided.
Article
Full-text available
The impact-echo (IE) acoustic inspection method is a non-destructive evaluation technique, which has been widely applied to detect the defects, structural deterioration level, and thickness of plate-like concrete structures. This paper presents a novel climbing robot, namely Rise-Rover, to perform automated IE signal collection from concrete structures with IE signal analyzing based on machine learning techniques. Rise-Rover is our new generation robot, and it has a novel and enhanced absorption system to support heavy load, and crawler-like suction cups to maintain high mobility performance while crossing small grooves. Moreover, the design enables a seamless transition between ground and wall. This paper applies the fast Fourier transform and wavelet transform for feature detection from collected IE signals. A distance metric learning based support vector machine approach is newly proposed to automatically classify the IE signals. With the visual-inertial odometry of the robot, the detected flaws of inspection area on the concrete plates are visualized in 2D/3D. Field tests on a concrete bridge deck demonstrate the efficiency of the proposed robot system in automatic health condition assessment for concrete structures.
Article
Full-text available
A number of image processing techniques (IPTs) have been implemented for detecting civil infrastructure defects to partially replace human-conducted on-site inspections. These IPTs are primarily used to manipulate images to extract defect features, such as cracks in concrete and steel surfaces. However, the extensively varying real-world situations (e.g., lighting and shadow changes) can lead to challenges to the wide adoption of IPTs. To overcome these challenges, this article proposes a vision-based method using a deep architecture of con-volutional neural networks (CNNs) for detecting concrete cracks without calculating the defect features. As CNNs are capable of learning image features automatically , the proposed method works without the conjugation of IPTs for extracting features. The designed CNN is trained on 40 K images of 256 × 256 pixel resolutions and, consequently, records with about 98% accuracy. The trained CNN is combined with a sliding window technique to scan any image size larger than 256 × 256 pixel resolutions. The robustness and adaptability of the proposed approach are tested on 55 images of 5,888 × 3,584 pixel resolutions taken from a different structure which is not used for training and validation processes under various conditions (e.g., strong light spot, shadows , and very thin cracks). Comparative studies are conducted to examine the performance of the proposed CNN using traditional Canny and Sobel edge detection methods. The results show that the proposed method shows
Article
An 8-rotor unmanned aerial vehicle is used as a working platform. Its motion characteristics in a hovering state are obtained using a non-contact measurement instrument, which, along with the modulation transfer function of its airborne images, indicates the reliability of the airborne images of unmanned aerial vehicles in a hovering state. By installing a laser range finder on the cradle synchronized with the camera shutter to measure the object distance, the pixel resolution of the object distance is obtained. The airborne images are then processed using the MATLAB image processing toolbox, from which the pixels of concrete cracks are extracted. Compared to a static image and direct manual measurements, the airborne image of the unmanned aerial vehicle has higher precision, indicating its wide potential applications as an alternative of the conventional inspection methods of bridge-inspection vehicle and working platforms.
Article
This study conducts crack identification from real-world images containing complicated disturbance information (cracks, handwriting scripts, and background) inside steel box girders of bridges. Considering the multilevel and multi-scale features of the input images, a modified fusion convolutional neural network architecture is proposed. As input, 350 raw images are taken with a consumer-grade camera and divided into sub-images with resolution of 64 × 64 pixels (67,200 in total). A regular convolutional neural network structure is employed as baseline to demonstrate the accuracy benefits from the proposed fusion convolutional neural network structure. The confusion matrix is defined for prediction performance evaluation on the test set. A total of six additional entire raw images are used to investigate the robustness and feasibility of the proposed approach. A binary conversion process based on the optimal entropy threshold method is applied and closely followed to identify the crack pixels in the sub-images. The effect of the super-resolution inputs on accuracy is investigated. Results show that the trained modified fusion convolutional neural network can automatically detect the cracks, handwriting, and background from the raw images. The recognition errors of the fusion convolutional neural network in both the training and validation processes are smaller than those of the regular convolutional neural network. The super-resolution process hurts the general identification accuracy.
Article
Regular inspection of nuclear power plant components is important to guarantee safe operations. However, current practice is time-consuming, tedious, and subjective which involves human technicians review the inspection videos and identify cracks on reactors. A few vision-based crack detection approaches have been developed for metallic surfaces, and they typically perform poorly when used for analyzing nuclear inspection videos. Detecting these cracks is a challenging task since they are tiny, and noisy patterns exist on the components' surfaces. This study proposes a deep learning framework called NB-CNN to analyze individual video frames for crack detection while a novel data fusion scheme is proposed to aggregate the information extracted from each video frame to enhance the overall performance and robustness of the system. To this end, a Convolutional Neural Network (CNN) is proposed to detect crack patches in each video frame while the proposed data fusion scheme maintains the spatiotemporal coherence of cracks in videos, and the Naïve Bayes decision making discards false positives effectively. The proposed framework achieves 98.3% hit rate against 0.1 false positives per frame that is significantly higher than state-of-the-art approaches as presented in this paper.
Article
The CrackNet, an efficient architecture based on the Convolutional Neural Network (CNN), is proposed in this article for automated pavement crack detection on 3D asphalt surfaces with explicit objective of pixel-perfect accuracy. Unlike the commonly used CNN, CrackNet does not have any pooling layers which downsize the outputs of previous layers. CrackNet fundamentally ensures pixel-perfect accuracy using the newly developed technique of invariant image width and height through all layers. CrackNet consists of five layers and includes more than one million parameters that are trained in the learning process. The input data of the CrackNet are feature maps generated by the feature extractor using the proposed line filters with various orientations, widths, and lengths. The output of CrackNet is the set of predicted class scores for all pixels. The hidden layers of CrackNet are convolutional layers and fully connected layers. CrackNet is trained with 1,800 3D pavement images and is then demonstrated to be successful in detecting cracks under various conditions using another set of 200 3D pavement images. The experiment using the 200 testing 3D images showed that CrackNet can achieve high Precision (90.13%), Recall (87.63%) and F-measure (88.86%) simultaneously. Compared with recently developed crack detection methods based on traditional machine learning and imaging algorithms, the CrackNet significantly outperforms the traditional approaches in terms of F-measure. Using parallel computing techniques, CrackNet is programmed to be efficiently used in conjunction with the data collection software.
Article
This paper deals with the use of embedded piezoelectric transducers for ultrasonic monitoring of cracking in concrete. Based on the previous developments of our research team on that topic, we design a new data acquisition system which is able to interrogate the emitter-receiver pair up to 150 times per second. The system is based on low-voltage actuation (up to 20 V) and the signal-to-noise ratio is excellent due to the use of a voltage amplifier at the receiver side and the possibility to perform averages. With such a high measurement rate, we are able to follow brittle failure events such as the failure of a concrete cylinder in compression, which is the application example presented. In this application, we show that, in addition to the ultrasonic active monitoring of cracking, the system is also able to record the passive acoustic emission events which can be used as a complementary indicator of damage in the cylinder. We also demonstrate that because of the high level of stresses in compression, the damage indicator defined in our previous studies is not suited for crack monitoring due to the elastoacoustic effect. The amplitude of the first wave arrival is shown to be a robust alternative damage indicator allowing to follow accurately the three successive phases of cracking leading to the failure of the cylinder.
Article
Surface cracks in concrete structures are critical indicators of structural damage and durability. Manual visual inspection, the most commonly used method in practice, is inefficient from cost, time, accuracy, and safety perspectives. A promising alternative is computer vision-based methods that can automatically extract crack information from images. Image binarization, developed for text detection, is appropriate for crack identification, as texts and cracks are similar, consisting of distinguishable lines and curves. However, standardizing crack identification using image binarization is challenging, because binarization depends on the method and associated parameters. We investigate image binarization for crack identification, focusing on optimal parameter determination and comparative performance evaluation for five common binarization methods. Crack images are prepared to obtain optimal parameters by minimizing errors in estimated crack widths. Subsequently, comparative analysis is conducted using crack images with different conditions based on three performance evaluation criteria: crack width and length measurement accuracy and computation time.