Original Article
Structural Health Monitoring
1–16
© The Author(s) 2019
Article reuse guidelines:
sagepub.com/journals-permissions
DOI: 10.1177/1475921718821719
journals.sagepub.com/home/shm
Deep learning–based autonomous
concrete crack evaluation through
hybrid image scanning
Keunyoung Jang¹, Namgyu Kim² and Yun-Kyu An¹
Abstract
This article proposes a deep learning–based autonomous concrete crack detection technique using hybrid images. The hybrid images, which combine vision and infrared thermography images, are able to improve crack detectability while minimizing false alarms. In particular, large-scale concrete infrastructure such as bridges and dams can be effectively inspected by spatially scanning an unmanned vehicle–mounted hybrid imaging system consisting of a vision camera, an infrared camera, and a continuous-wave line laser. However, the expert-dependent decision-making for crack identification that is widely used in industrial fields is often cumbersome, time-consuming, and unreliable. As a target concrete structure gets larger, automated decision-making becomes more desirable from the practical point of view. The proposed technique achieves automated crack identification and visualization by transfer learning of a well-trained deep convolutional neural network, that is, GoogLeNet, while retaining the advantages of the hybrid images. The proposed technique is experimentally validated using a lab-scale concrete specimen with cracks of various sizes. The test results reveal that macro- and microcracks are automatically visualized while minimizing false alarms.
Keywords
Concrete crack detection, deep convolutional neural network, hybrid image scanning, vision image, infrared thermography, structural health monitoring
Introduction
Cracking is one of the most critical damage types in concrete, a representative construction material. Initial concrete cracks, inevitably produced by shrinkage during the curing process, are not typically considered structural damage. However, severe structural-level cracks generated by external loads may propagate along the surface and through-thickness directions under repeated external loads. The propagated cracks may lead to serious structural problems such as strength reduction, corrosion of reinforcing rebar, and even structural failure. Therefore, concrete cracks need to be detected and managed from their early stage from the safety point of view. During the last few decades, expert-dependent visual inspection has been widely performed to manage concrete cracks. However, visual inspection is often time-consuming, labor-intensive, unreliable, and sometimes not applicable to inaccessible areas of a target structure.1
To tackle these technical issues, a number of non-destructive evaluation (NDE) techniques have been proposed. Fiber optic sensors have been embedded in target structures to detect concrete cracks thanks to the advantages of being thin, lightweight, and immune to electromagnetic interference.2 However, their service life is often shorter than the design life of civil infrastructure, and the embedded sensors are difficult to replace. Moreover, the contact sensing mechanism may suffer from several technical problems such as a limited sensing area and imperfect bonding conditions. Contact-type ultrasonic techniques have also been proposed as alternatives.3–5 Although the ultrasonic techniques have high crack detectability, ultrasonic waves are highly attenuated in concrete materials. Moreover,
¹Department of Architectural Engineering, Sejong University, Seoul, South Korea
²Department of Civil and Environmental Engineering, Sejong University, Seoul, South Korea
Corresponding author: Yun-Kyu An, Department of Architectural Engineering, Sejong University, 209, Neungdong-ro, Gwangjin-gu, Seoul 143-747, South Korea. Email: yunkyuan@sejong.ac.kr
the complex signal interpretation is typically required due to the inhomogeneous characteristics of concrete materials. They also require a number of spatial measurement points to cover a large inspection area and share the same contact-sensing limitations. As another contact-type NDE technique, impact-echo techniques have been proposed.6,7 They are easy to use and suitable for single-side inspection, but unexpected reflections coming from structures' boundaries may complicate the analysis of the measured data. Alternatively, fiber-reinforced concrete techniques have been proposed as a sensor-less approach.8,9 By inserting conductive fibers into concrete materials, the concrete structure itself can be used as a sensor. However, their crack detectability highly depends on the manufacturing process of the fiber-reinforced concrete, and the performance under environmental variations such as temperature and humidity changes has not been fully validated yet.
To overcome the limitations of the contact-type NDE techniques, various non-contact NDE techniques have been proposed. The digital image correlation (DIC) technique compares digital photographs at different deformation stages for crack detection.10,11 However, it is more suitable for well-controlled laboratory environments than field inspection due to its requirements for precise camera alignment and reference points on a target surface. Compared to DIC, vision-based crack detection techniques are more practical and widely accepted thanks to their simplicity, non-contact nature, cost effectiveness, and intuitive data interpretation.12,13 Recently, vision cameras have been combined with robots14 or unmanned aerial vehicles (UAVs)15–17 to detect cracks on inaccessible and extensive areas of a target structure. However, the performance of the vision technique highly depends on the image capturing conditions, such as the capturing angle, illuminance, and undesired contaminants in the air or on the target surface, causing false alarms. Alternatively, one of the promising techniques for crack detection is laser infrared (IR) thermography. Laser IR thermography similarly provides intuitive crack images in a fully non-contact way. Its superiority over the vision-based techniques is that it can detect subsurface as well as surface damage and is robust against sensing environments by employing laser excitation sources, which has already been proven through applications to metallic structures and semiconductor chips.18,19 However, its excessive sensitivity may conversely disturb precise crack evaluation in concrete because of the numerous non-structural-level initial concrete cracks.
As the amount of sensing data collected from a large target structure grows, expert-dependent data interpretation becomes more time-consuming and cumbersome. Thus, there have recently been numerous attempts to automate data interpretation. In particular, deep convolutional neural networks (CNNs) have been applied to classify vision images for pavement crack detection,20 nuclear power plant damage inspection,21 steel box girder crack identification,22 and concrete crack detection.23,24 Although a number of effective CNN architectures have been developed and proven for various data types and applications, the sensing data–driven false alarm issues have not been resolved yet.
In this study, a hybrid image scanning (HIS) system combining the vision and laser IR thermography techniques is newly developed, and a deep CNN–based autonomous concrete crack evaluation algorithm is proposed. The proposed technique has the following superiorities over the existing techniques: (1) fully non-contact, non-destructive, and fast crack evaluation, even in inaccessible areas of a large concrete structure, can be effectively achieved by mounting the HIS system onto UAVs; (2) data-driven false alarms can be remarkably reduced by retaining the advantages of both vision and IR images; (3) the limited field of view (FOV) issues of the vision and IR cameras, which are among the technical hurdles in data analysis, are resolved by developing a time–spatial-integrated (TSI) coordinate transform; and (4) autonomous decision-making for crack detection is accomplished by employing a tailored deep CNN process. The developed system and algorithm are experimentally validated using a lab-scale concrete specimen with cracks of various sizes, as a core technology before being embedded onto UAVs.
This article is organized as follows. Section ‘‘The
HIS system’’ explains the configuration and working
principle of the HIS system. Then, section ‘‘The deep
CNN–based crack evaluation algorithm’’ shows the
overall deep CNN process including signal and image
processing. Subsequently, the HIS system and deep
CNN algorithm are experimentally validated using a
lab-scale concrete specimen with real cracks of various
sizes in section ‘‘Experimental validation.’’ This article
concludes with a brief summary and discussions in sec-
tion ‘‘Conclusion.’’
The HIS system
Figure 1 shows the HIS system composed of excitation,
sensing, and control units. The excitation unit consists
of a continuous-wave (CW) line laser, a line beam gen-
erator, a collimator, and a focusing lens, which gener-
ates thermal waves onto a target concrete structure.
The sensing unit comprising vision and IR cameras
records the surface condition and the corresponding
thermal wave propagation along the concrete structure
while spatially scanning. Then, the control computer in the control unit activates the excitation and sensing units and analyzes the saved data using the control and processing programs coded in LabVIEW® and MATLAB®, respectively. The HIS system will be mounted on UAVs to move along a predetermined scanning route, as shown in Figure 1. Note that the excitation and sensing units are synchronized with, and controlled by, the control computer in the control unit.
The detailed working principle of the HIS system is
as follows. Once the control computer in the control
unit sends out control signals to the excitation and sen-
sing units, the laser driver generates a current signal to
activate the CW laser emitting a point laser beam. The
point laser beam is transformed to a line-shaped laser
beam through the line beam generator in the excitation
unit. Once the line-shaped laser beam is focused onto a
target surface through the collimator and focusing lens,
the thermal waves are generated along the target sur-
face. Simultaneously, the vision and IR cameras in the
sensing unit are operated to acquire the surface condi-
tion and thermal wave responses. Here, the thermal
wave responses are measured by only the IR camera
because the invisible range laser source is used. When
the control signal is transmitted to UAVs from the con-
trol computer, the HIS system automatically scans the
target structure along the predetermined scanning
route. Then, the measured vision and IR images, which vary temporally and spatially within each FOV, are instantaneously transmitted to and saved in the control computer as raw vision (V_R) and IR (I_R) images, respectively. The V_R and I_R images need to be processed for precise crack evaluation because they vary in the time and spatial domains.
The deep CNN–based crack evaluation algorithm
Since the I_R and V_R images obtained over a broad area become massive, expert-dependent decision-making is quite time-consuming and cumbersome. Thus, not only signal or image processing but also a deep learning–based autonomous decision-making process is strongly desirable. The main superiority of the algorithm is that the I_R and V_R images are simultaneously used in the autonomous decision-making process, making it possible to minimize false alarms. This section explains how crack information is automatically extracted and visualized from the I_R and V_R images.
The overall procedure of the proposed deep CNN–based crack evaluation algorithm is shown in Figure 2. Since the I_R and V_R images acquired by spatially scanning the HIS system change continuously in the time and spatial domains, precise crack evaluation is difficult. Thus, the spatially scanned I_R and V_R images, given as a function of time, need to be converted to spatially integrated images. The details of each step are explained in the subsequent subsections.
Image distortion calibration
Since the I_R and V_R images are often distorted due to the wide angles of the camera lenses, distortion calibration is needed for precise crack evaluation. In this
Figure 1. Schematics of the proposed hybrid image scanning (HIS) system.
study, the camera calibration algorithm developed by Zhang25 is used, because the IR camera can also be assumed to follow the pin-hole camera model26

$$ s\tilde{m} = A[R\,|\,t]\tilde{M} \quad \text{with} \quad A = \begin{bmatrix} f_x & \mathrm{skew}_c f_x & c_x \\ 0 & f_y & c_y \\ 0 & 0 & 1 \end{bmatrix}, \quad [R\,|\,t] = \begin{bmatrix} r_{11} & r_{12} & r_{13} & t_1 \\ r_{21} & r_{22} & r_{23} & t_2 \\ r_{31} & r_{32} & r_{33} & t_3 \end{bmatrix} \quad (1) $$
where s is an arbitrary scale factor; m̃ = [x y z 1]^T and M̃ = [X Y Z 1]^T represent the camera and world coordinates, respectively; A and [R|t] are the camera's intrinsic and extrinsic parameters, respectively; in particular, f_x and f_y are the focal lengths, c_x and c_y are the principal points, and skew_c·f_x is the skew coefficient among the camera's intrinsic parameters; and r_ij and t_k are the rotation and translation components, respectively. The pin-hole camera model describes the mathematical relationship between the three-dimensional (3D) real-world coordinates and their projection onto the two-dimensional (2D) image plane. The calibration marker defines the 3D real-world coordinates. Without loss of generality, the calibration marker is assumed to lie on Z = 0. Then, a homography matrix H between the calibration marker and the image is defined as
$$ H = [h_1 \;\; h_2 \;\; h_3] = \lambda A [r_1 \;\; r_2 \;\; t] \quad (2) $$

where λ is an arbitrary scalar and r_1 and r_2 denote the first two columns of R. Given an image of the calibration marker, H can be estimated based on the maximum likelihood criterion.25
Assuming that the image points m_i are corrupted by Gaussian noise with zero mean and covariance matrix Λ_{m_i}, the maximum likelihood estimate of H can be obtained by minimizing the following objective function

$$ J = \sum_i (m_i - \hat{m}_i)^T \Lambda_{m_i}^{-1} (m_i - \hat{m}_i) \quad \text{with} \quad \hat{m}_i = \frac{1}{h_3^T \tilde{M}_i} \begin{bmatrix} h_1^T \tilde{M}_i \\ h_2^T \tilde{M}_i \end{bmatrix} \quad (3) $$
where h_i is the ith row of H. Zhang25 assumes that Λ_{m_i} = σ²I for all i, which is reasonable if points are extracted independently with the same procedure. Equation (3) then becomes a non-linear least-squares problem, and the non-linear minimization is conducted
Figure 2. Overview of the deep CNN–based crack evaluation algorithm. The I_C and V_C images are the distortion-calibrated IR and vision images obtained from the I_R and V_R images, respectively. The I_ROI and V_ROI images denote the time–spatial-integrated IR and vision images obtained through the TSI coordinate transformation, respectively. The I_P image represents the signal-processed IR images, and the V_D image is the resultant image obtained by the deep CNN process of the V_ROI image. The I_M images are the crack region images of the I_P image selected by matching the crack regions of V_D. Next, the crack existence of the I_M images is evaluated by the deep CNN process, and the I_D images include only crack information. Finally, the final image represents only crack features by mapping the I_D images onto the V_D image.
using the Levenberg–Marquardt algorithm (LMA).27 Non-linear optimization such as the LMA requires an initial guess, which can be obtained from the homogeneous equations: letting x = [h_1^T h_2^T h_3^T]^T, equation (3) can be rewritten as

$$ Lx = \begin{bmatrix} \tilde{M}^T & 0^T & -u\tilde{M}^T \\ 0^T & \tilde{M}^T & -v\tilde{M}^T \end{bmatrix} x = 0 \quad (4) $$
When n points are obtained in one image, L becomes a 2n × 9 matrix. As x is defined up to a scale factor, the solution is well known to be the right singular vector of L associated with the smallest singular value. Since L is numerically poorly conditioned, the results can be enhanced by performing a simple data normalization. Once H is estimated, the fact that r_1 and r_2 are orthonormal gives28

$$ h_1^T A^{-T} A^{-1} h_2 = 0 \quad (5) $$

$$ h_1^T A^{-T} A^{-1} h_1 = h_2^T A^{-T} A^{-1} h_2 \quad (6) $$
Each homography provides two basic constraints on the camera intrinsics. Three independent orientations are sufficient to solve for the camera intrinsics linearly. Once A is known from the closed-form solution,29 the extrinsic parameters can be readily obtained as

$$ r_1 = \lambda A^{-1} h_1, \qquad r_2 = \lambda A^{-1} h_2, \qquad t = \lambda A^{-1} h_3 \quad (7) $$
Once the intrinsic and extrinsic parameters of the IR and vision cameras are obtained using the calibration marker, the I_C and V_C images can be respectively obtained from the I_R and V_R images using equation (1), as shown in Figure 3.
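The calibration steps above (DLT estimation of H via equation (4), then the closed-form extrinsics of equation (7)) can be sketched in Python. This is a minimal illustration, not the authors' MATLAB implementation: the function names and interfaces are ours, and Zhang's data normalization and LMA refinement are omitted for brevity.

```python
import numpy as np

def estimate_homography(world_pts, image_pts):
    """DLT: stack the 2n x 9 system of equation (4); the solution is the
    right singular vector of L associated with the smallest singular value."""
    L = []
    for (X, Y), (u, v) in zip(world_pts, image_pts):
        M = [X, Y, 1.0]                          # marker assumed on Z = 0
        L.append(M + [0.0, 0.0, 0.0] + [-u * m for m in M])
        L.append([0.0, 0.0, 0.0] + M + [-v * m for m in M])
    _, _, Vt = np.linalg.svd(np.asarray(L))
    return Vt[-1].reshape(3, 3)                  # x is defined up to scale

def extrinsics_from_homography(H, A):
    """Equation (7): r1 = lam*A^-1 h1, r2 = lam*A^-1 h2, t = lam*A^-1 h3."""
    A_inv = np.linalg.inv(A)
    lam = 1.0 / np.linalg.norm(A_inv @ H[:, 0])  # lam normalizes r1 to unit length
    r1, r2, t = (lam * A_inv @ H[:, i] for i in range(3))
    if t[2] < 0:                                 # resolve the SVD sign ambiguity:
        r1, r2, t = -r1, -r2, -t                 # the marker lies in front of the camera
    return r1, r2, t
```

In practice r_3 = r_1 × r_2 completes the rotation matrix, and all parameters would then be refined by minimizing equation (3).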
TSI coordinate transform
Because the HIS system continuously moves along the predetermined scanning route, the physical inspection areas in the I_C and V_C images also change continuously as a function of time. Thus, it is difficult to analyze thermal wave propagation over the entire region of interest (ROI) using the I_C images. In this step, the I_C and V_C images are respectively transformed into the spatially integrated IR (I_ROI) and vision (V_ROI) images using the TSI coordinate transform. Here, the I_C and V_C images share the same ROI but have different spatial resolutions. Note that the I_C images are more complex to analyze than the V_C images, because they depend on the laser excitation parameters.
First, the analysis area exposed to laser excitation needs to be determined within the FOV because the line laser excitation may not cover the entire FOV. Assuming that the HIS system scans along the horizontal direction (the x-axis in Figure 4), the intensity profile of the line laser beam typically follows a Gaussian distribution19 along the y-axis, as shown in Figure 4. Thus, the analysis area can be determined by tracing the mid-points along the x-axis and their affected boundaries along the y-axis. Here, the mid-points can be selected using the mean m(x) of the Gaussian distribution, and the affected boundary can be obtained by calculating the 95% confidence interval of the Gaussian distribution. Note that the analysis area physically means the region where
Figure 3. Image distortion calibration using a calibration marker.
enough thermal energy is injected by the line laser beam to induce thermal wave propagation within the I_C images.
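The band-selection step can be illustrated with a short sketch. It is a simplified stand-in that uses the sample mean and standard deviation of the intensity profile rather than a formal Gaussian fit, and `analysis_band` is a hypothetical helper name, not from the paper.

```python
import math

def analysis_band(profile):
    """Estimate the laser-heated analysis band from one column of an I_C
    frame: treat the intensity profile along y as (approximately) Gaussian,
    compute its mean m and standard deviation s, and keep the ~95% band
    m - 2s < y < m + 2s."""
    total = sum(profile)
    ys = range(len(profile))
    m = sum(y * p for y, p in zip(ys, profile)) / total
    var = sum(((y - m) ** 2) * p for y, p in zip(ys, profile)) / total
    s = math.sqrt(var)
    return m - 2.0 * s, m + 2.0 * s
```

For a profile centered at pixel row 50 with a spread of 5 pixels, the returned band is roughly rows 40 to 60.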
Next, the determined analysis areas are spatially integrated as a function of time using the following TSI coordinate transform, making it possible to reconstruct the I_ROI and V_ROI images as shown in Figure 5

$$ \begin{bmatrix} x' \\ y' \\ t' \end{bmatrix} = \begin{bmatrix} 0 & 0 & v & 0 \\ 0 & 1 & 0 & -m(x) \\ 1/v & 0 & 0 & 0 \end{bmatrix} \begin{bmatrix} x \\ y \\ t \\ 1 \end{bmatrix} \quad (8) $$

where m(x) − 2σ < y < m(x) + 2σ, v is the scanning speed, and the prime denotes the transformed coordinate. The y'-axis remains essentially the original y-axis because only horizontal scanning is assumed in this study. The TSI coordinate transform is based on the physical phenomenon that a specific spatial point is heated and subsequently cooled by the line laser exposure as time passes. The x-axis data in the I_C images can be regarded as the thermal variation in the time domain at a specific point of the FOV, and the t-axis data in the I_C images can be considered as the thermal change in the spatial domain at a specific time. Thus, each datum can be converted into the new integrated ROI coordinates, that is, the x', y', and t' axes, using equation (8). The I_ROI images eventually appear as if the entire ROI were simultaneously and uniformly heated and subsequently cooled under spatially stationary conditions
Figure 4. Determination of the analysis area on the I_C images.
Figure 5. Overview of the TSI coordinate transform.
$$ \begin{bmatrix} x'' \\ y'' \end{bmatrix} = \begin{bmatrix} 0 & 0 & v \\ 0 & 1 & 0 \end{bmatrix} \begin{bmatrix} x' \\ y' \\ t' \end{bmatrix} \quad (9) $$

Similarly, the V_C images can be reconstructed using equation (9). Since there is no laser excitation, the data are simply integrated in the spatial domain.
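Equations (8) and (9) amount to a per-sample coordinate swap, which can be sketched as below. The helper names are ours, and the re-centering of y on the laser mid-line m(x) reflects our reading of the transform's sign convention.

```python
def tsi_transform(x, y, t, v, m_x):
    """TSI coordinate transform of equation (8): map an (x, y, t) sample
    from a moving-camera I_C frame to integrated ROI coordinates
    (x', y', t'), with v the scanning speed and m_x the laser mid-line."""
    x_p = v * t      # elapsed scan time becomes the integrated spatial position
    y_p = y - m_x    # y re-centered on the laser mid-line m(x) (assumed sign)
    t_p = x / v      # image position becomes a local heating-time offset
    return x_p, y_p, t_p

def tsi_spatial(x_p, y_p, t_p, v):
    """Equation (9): collapse the transformed coordinates to the final
    spatially integrated position (x'', y'')."""
    return v * t_p, y_p
```

Note that composing the two maps returns the original image abscissa (x'' = v · (x / v) = x), which is what lets every scanned frame be stitched into one stationary ROI image.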
Phase mapping and spatial derivative
Since the time-varying I_ROI images cannot properly reveal multiple cracks, additional data processing procedures such as phase mapping and spatial differentiation are necessary for precise multiple-crack visualization (Figure 6). In particular, macrocracks typically overwhelm the response while microcracks are hidden, owing to the amplitude differences of the crack-induced features. The phase mapping process enables cracks of various sizes to be effectively visualized by normalizing the crack-induced features over all pixels of interest. First, all pixel values of the I_ROI images are transformed to complex values along the t' axis using the Hilbert transform30

$$ H(x', y', t') = \frac{P}{\pi} \int_{-\infty}^{\infty} \frac{I_{ROI}(x', y', \tau)}{t' - \tau} \, d\tau \quad (10) $$
where P denotes the Cauchy principal value of the integral and τ is the time variable of integration. Then, the instantaneous phase values φ(x', y', t') of each pixel are simply obtained as

$$ \varphi(x', y', t') = \arctan \frac{\mathrm{Im}[H(x', y', t')]}{\mathrm{Re}[H(x', y', t')]} \quad (11) $$
where Re and Im represent the real and imaginary parts, respectively. Equation (11) physically means that the responses are normalized between −π and π, making it possible to effectively visualize even hidden microcracks. However, not only the crack-induced features but also undesired noise components might be augmented by the phase mapping process. Thus, a denoising process is subsequently carried out. First, φ(x', y', t') is accumulated along the t' axis

$$ \xi(x', y') = \sum_{t'} \varphi(x', y', t') \quad (12) $$

where ξ(x', y') is the accumulated data along the t' axis. Then, the spatial derivative is applied to ξ(x', y') along the x' direction, which is the scanning direction assumed in this study

$$ F(x', y') = \frac{\partial \xi(x', y')}{\partial x'} \quad (13) $$

where F(x', y') is the spatial derivative value. The I_P image can be obtained by reassigning F(x', y') to the x'- and y'-coordinates. Finally, the I_P image visualizes multiple cracks without noise components under static conditions covering the entire ROI.
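The processing chain of equations (10) to (13) can be sketched with NumPy. A standard shortcut is used: the analytic signal (the original signal plus j times its Hilbert transform) yields the instantaneous phase of equation (11) directly via the complex argument. The function names are ours, not the paper's.

```python
import numpy as np

def analytic_signal(sig):
    """Analytic signal (sig + j*Hilbert{sig}) along axis 0 via the FFT,
    for a stack of shape (nt, ny, nx)."""
    n = sig.shape[0]
    spec = np.fft.fft(sig, axis=0)
    h = np.zeros(n)                      # one-sided spectral weighting
    h[0] = 1.0
    if n % 2 == 0:
        h[n // 2] = 1.0
        h[1:n // 2] = 2.0
    else:
        h[1:(n + 1) // 2] = 2.0
    return np.fft.ifft(spec * h[:, None, None], axis=0)

def phase_derivative_image(i_roi):
    """Equations (10)-(13): instantaneous phase per pixel (in [-pi, pi]),
    accumulation along t', then spatial derivative along the scan axis x'."""
    phase = np.angle(analytic_signal(i_roi))   # eqs. (10)-(11)
    accumulated = phase.sum(axis=0)            # eq. (12)
    return np.gradient(accumulated, axis=1)    # eq. (13) -> the I_P image
```

A spatially uniform input produces a zero derivative image, which is the mechanism by which uniform heating cancels out and only crack-induced phase discontinuities survive.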
The deep CNN process
Once the V_ROI image is reconstructed by the TSI coordinate transform, cracks are automatically extracted through the deep CNN process. In this study, a pre-trained deep CNN model, that is, GoogLeNet,31 is used for transfer learning. GoogLeNet is one of the well-known multi-layered CNN models designed for visual pattern classification. It consists of 22 layers including 9 inception modules as well as general convolutional layers, as shown in Figure 7. Here, each inception module places 1 × 1 convolutional layers before the larger convolutions to reduce the dimensionality of the feature maps. The detailed structure of the inception module is shown in Figure 7. To transplant the GoogLeNet model into concrete crack detection, the last two layers, that is, the softmax and classification layers, are retrained with a training set having two classification outputs, that is, intact and crack.
As for network training and validation, in total 20,000 images including concrete crack and non-crack (intact) images are prepared by augmenting and segmenting 200 raw images. Representative images are shown in Appendix 1. Among them, 9000 crack images and 9000 intact images are used for network training, and the remaining 1000 crack images and 1000 intact images
Figure 6. Phase mapping and spatial derivative.
are selected as the validation set. All the prepared images are then resized to 224 × 224 × 3 pixels, maintaining the aspect ratio in consideration of GoogLeNet's input layer. Here, the training and validation sets are strictly distinct from each other. Stochastic gradient descent with momentum is used as the solver, with 20 training epochs and an initial learning rate of 0.0001.23 Note that a high-performance graphics processing unit with 12 GB of memory and 3840 cores is employed to expedite the network training and classification processes.
Once the tailored deep CNN is trained, the V_ROI image is fed to the network for automated crack detection. To reduce false alarms, the V_ROI image is scanned by 16 different-sized masks without overlapping regions, as shown in Figure 8. The mask sizes range over 122–144 pixels and 163–192 pixels in the horizontal and vertical axes, respectively. The corresponding 16 probability maps are then obtained and averaged to establish a single probability map, as shown in Figure 8. Here, the probability map has the same resolution as the V_ROI image. Each pixel of the probability map has a value ranging from 0 to 1; the closer a pixel value is to 1, the higher the probability of crack existence within the pixel.
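The mask-scanning step can be sketched as below, with `classify` standing in for the retrained GoogLeNet; its interface (a tile in, a crack probability in [0, 1] out) is an assumption for illustration.

```python
import numpy as np

def averaged_probability_map(image, mask_sizes, classify):
    """Tile the V_ROI image with non-overlapping masks of each size, fill
    every covered pixel with the classifier's crack probability for its
    tile, and average the per-size maps into a single probability map."""
    h, w = image.shape[:2]
    maps = []
    for mh, mw in mask_sizes:
        prob = np.zeros((h, w))
        for r in range(0, h - mh + 1, mh):        # non-overlapping tiling
            for c in range(0, w - mw + 1, mw):
                prob[r:r + mh, c:c + mw] = classify(image[r:r + mh, c:c + mw])
        maps.append(prob)
    return np.mean(maps, axis=0)                  # single averaged map
```

Averaging over the 16 tilings smooths out the arbitrariness of any single tile boundary, so a crack cut in half by one mask grid is still scored highly by the others.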
Next, potential crack regions can be defined in the V_ROI image by selecting the probability values exceeding 20%, as shown in Figure 9. However, a lot of noise components are still included in the potential crack regions. Thus, a statistical denoising process is subsequently conducted for precise crack evaluation. A median filter is applied to the potential crack regions, and the probability density function of the corresponding pixel values is estimated by fitting a Weibull distribution, which is one of the extreme value distributions.32
Figure 7. Overview of the deep CNN architecture established using GoogLeNet.
Figure 8. The deep CNN process using the V_ROI image.
Then, the threshold value corresponding to a one-sided 99% confidence interval is established and applied to all the pixel values to construct the V_D image shown in Figure 9.
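The statistical threshold could be sketched as follows, assuming SciPy's `weibull_min` for the fit; the preceding median-filter step is omitted, and `weibull_threshold` is a hypothetical helper name.

```python
import numpy as np
from scipy import stats

def weibull_threshold(pixel_values, confidence=0.99):
    """Fit a Weibull distribution to the (median-filtered) pixel values of
    the potential crack regions and return the one-sided 99% confidence
    bound; pixels below it are suppressed when constructing the V_D image."""
    shape, loc, scale = stats.weibull_min.fit(pixel_values, floc=0.0)
    return stats.weibull_min.ppf(confidence, shape, loc=loc, scale=scale)
```

Because the Weibull is an extreme-value distribution, the 99% bound sits well above the bulk of the noise, so only the strongest (crack-like) responses survive the cut.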
Decision-making by image matching
Although the V_D image is able to automatically provide clear crack information, there can be a number of data-driven false alarms due to rough surface conditions, illuminance, contaminants, and so on. On the other hand, the I_P image can minimize the false alarms thanks to its robustness against such arbitrary disturbances.33 To reduce these false alarms, image matching between the V_D and I_P images is performed as shown in Figure 10. First, the pixel resolution of the V_D image is reduced to match that of the I_P image, because the IR camera typically has a much lower pixel resolution than the vision camera. Then, the potential crack regions are selected on the V_D image using rectangular masks. Subsequently, the corresponding crack regions are automatically marked with the same-sized rectangular masks on the I_P image. The marked crack regions of the I_P image are then extracted and resized to a resolution of 224 × 224 × 3 pixels, defined as the I_M images, for the deep CNN process as shown in Figure 10(a). Note that the rectangular mask locations might not be exactly matched between the V_D and I_P images due to their pixel resolution mismatch. Nevertheless, the subsequent CNN results using the I_M images are not significantly affected, because the I_M images are used for double-checking crack existence rather than for crack quantification. Next, the deep CNN process is repeated on the I_M images, except for mask scanning, as shown in Figure 10(b). After the deep CNN process, only crack images, coined the I_D images, are retained and mapped onto the V_D image by resizing them to the original rectangular mask size, as depicted in Figure 10(c) and (d). Finally, the final image shows only crack information by retaining the advantages of the vision and IR images, making it possible to reduce the vision data–driven false alarms and enhance the reliability of crack evaluation.
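The resolution-matched mask transfer amounts to rescaling rectangle coordinates between the two pixel grids; `map_mask_to_ir` is a hypothetical helper sketched under that reading.

```python
def map_mask_to_ir(mask, vision_res, ir_res):
    """Scale a rectangular crack mask (x, y, w, h) selected on the V_D
    image into the I_P image's pixel grid, since the IR camera has a much
    lower resolution than the vision camera."""
    sx = ir_res[0] / vision_res[0]   # horizontal scale factor
    sy = ir_res[1] / vision_res[1]   # vertical scale factor
    x, y, w, h = mask
    return (round(x * sx), round(y * sy), round(w * sx), round(h * sy))
```

With the experimental setup's native resolutions (3264 × 2448 vision, 640 × 512 IR), a vision-image mask shrinks by roughly a factor of five, and the rounding explains the small mask-location mismatch noted above.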
Figure 9. The statistical denoising process.
Figure 10. The decision-making procedure: (a) selection of the potential crack regions, (b) deep CNN process, (c) crack identification, and (d) crack mapping on the V_D image.
Experimental validation
Test setup
The HIS system is experimentally validated using a lab-scale concrete specimen with multiple cracks. Figure 11 shows the lab-scale test setup of the HIS system, with the concrete specimen located 700 mm away from the HIS system. The control computer sends out control signals to the CW line laser to generate the line laser beam with an invisible wavelength of 950 nm. The line laser beam, with a size of 5 × 200 mm², is targeted through a collimator onto the specimen. The peak intensity of the line laser beam is set to 25 mW/mm², which is chosen by considering the thermal conductivity of concrete (0.8 W m⁻¹ K⁻¹) and the scanning speed (23 mm/s). The corresponding thermal waves of the concrete specimen are recorded for 22 s by the IR camera (A65, FLIR) with a frame rate of 30 Hz, a spectral range of 3–5 μm, and a resolution of 640 × 512 pixels. The surface images of the concrete specimen are also recorded for 22 s by the vision camera (Hero 4, GoPro) with a frame rate of 30 Hz and a resolution of 3264 × 2448 pixels. Here, the IR and vision cameras share the same ROI. In this study, a scanning jig is used to simulate the UAV-mounted scanning mechanism as a first stage. Note that the HIS system will be mounted onto UAVs or unmanned robots instead of the scanning jig for practical usage, although its miniaturization and optimization are still underway. In particular, the weight of the CW line laser can be reduced to less than 1.5 kg using a lightweight ceramic cooler and packaging case.
The specially designed concrete specimen has dimensions of 1000 × 500 × 100 mm³ and a compressive strength of 103 MPa. The specimen is prepared by mixing cement, silica sand, fly ash, super-plasticizer, and water. The detailed mixing composition is summarized in Table 1. During the curing process, 150-mm-width acrylic slots are inserted to make artificial cracks. The generated artificial cracks are divided into two types, that is, macrocracks (width ≥ 500 μm) and microcracks (width < 500 μm), as defined in this study for convenience. In addition, a fake crack with 1 mm width is created using a pencil for the false-positive alarm test. The target ROI, with dimensions of 750 × 240 mm², is defined so that macro- and microcracks, the fake crack, and non-cracked areas are all included (Figure 12).
Test results
Image distortion calibration. Once the V_R and I_R images are obtained using the HIS system, the V_C and I_C images are obtained by conducting the calibration process using the calibration marker shown in Figure 13. Figure 13 shows representative V_R and V_C images, revealing that the images captured by the wide-angle vision camera are distorted and successfully calibrated.
TSI coordinate transform. As expected, the V_C and I_C images within the ROI change temporally and spatially, making them difficult to analyze as they are. To reconstruct the V_ROI and I_ROI images, the TSI coordinate transform is performed. First, the m(x) and σ values are computed on the I_C images. The determined ROI has a height of 240 mm, equivalent to 533 pixels in the I_C images. Subsequently, the V_ROI and I_ROI images are constructed using the TSI coordinate transform as defined in equations (8) and (9). Figure 14(a) and (b)
Table 1. Mixing composition of the concrete specimen (%).

Cement (type) | Silica sand | Fly ash | Super-plasticizer | Water
100 (III)     | 100         | 15      | 0.9               | 35
Figure 11. Lab-scale test setup of the HIS system.
shows the V_ROI image and the representative I_ROI image at 1 s after laser excitation at each spatial point, respectively. Even though the line laser is sequentially scanned along the x-axis within the ROI, it looks as if the entire ROI were simultaneously and uniformly heated in the I_ROI images. In reality, the laser heating is not perfectly uniform over the entire ROI, but the thermal wave generation is sufficient to analyze crack existence within the ROI. Although crack existence can be intuitively observed in the I_ROI images, they still change in the time domain and contain a number of unwanted noise components, as displayed in Figure 14(b).
Phase mapping and spatial derivative. To precisely evaluate multiple cracks, the phase mapping and spatial derivative processes are subsequently applied to the I_ROI images. It can be observed from Figure 15 that cracks of various sizes are clearly visualized in the I_P image without undesired noise components. In particular, microcracks are well visualized regardless of the amplitude difference, even when macro- and microcracks coexist in a single image.
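The phase mapping follows the Hilbert-transform route described in ref. 30: each pixel's temporal response is converted into an analytic signal whose angle gives the phase. The numpy sketch below is a minimal illustration of that computation; the (time, height, width) stack shape and the plain FFT-based analytic signal are assumptions for this sketch, not the exact pipeline of the HIS software.

```python
import numpy as np

def analytic_signal(x, axis=0):
    """FFT-based analytic signal (real part is x; imaginary part is its Hilbert transform)."""
    n = x.shape[axis]
    spectrum = np.fft.fft(x, axis=axis)
    # Zero the negative frequencies, double the positive ones
    h = np.zeros(n)
    h[0] = 1.0
    if n % 2 == 0:
        h[n // 2] = 1.0
        h[1:n // 2] = 2.0
    else:
        h[1:(n + 1) // 2] = 2.0
    shape = [1] * x.ndim
    shape[axis] = n
    return np.fft.ifft(spectrum * h.reshape(shape), axis=axis)

def phase_map(stack):
    """Per-pixel instantaneous phase of a (time, height, width) thermal image stack."""
    return np.angle(analytic_signal(stack, axis=0))
```

For a unit cosine sampled over an integer number of cycles, the analytic signal has unit magnitude and a linearly increasing phase, which is why the phase image suppresses amplitude (heating non-uniformity) effects.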
The deep CNN process. To automatically detect cracks in the V_ROI image, the V_ROI image is fed to the pretrained deep CNN. Mask resolutions ranging from 122 × 163 to 144 × 192 pixels are used in this study.
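The patch-wise scanning and probability-map construction behind this step can be sketched as follows. Here `classify` is a stand-in for the trained GoogLeNet (returning a crack probability per patch), and the patch size and stride are illustrative values rather than the exact mask resolutions quoted above.

```python
import numpy as np

def sliding_patches(image, patch_h, patch_w, stride):
    """Enumerate overlapping patch boxes (y, x, h, w) covering the image."""
    H, W = image.shape[:2]
    boxes = []
    for y in range(0, H - patch_h + 1, stride):
        for x in range(0, W - patch_w + 1, stride):
            boxes.append((y, x, patch_h, patch_w))
    return boxes

def probability_map(image, boxes, classify):
    """Accumulate per-pixel crack probability from patch-level classifier scores."""
    prob = np.zeros(image.shape[:2])
    count = np.zeros(image.shape[:2])
    for (y, x, h, w) in boxes:
        p = classify(image[y:y + h, x:x + w])  # patch-level crack probability
        prob[y:y + h, x:x + w] += p
        count[y:y + h, x:x + w] += 1
    return prob / np.maximum(count, 1)  # average over overlapping patches
```

Averaging over overlapping patches is also how repeated CNN passes can be combined into a single smoothed probability map before thresholding.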
Figure 12. Concrete specimen with various cracks and the target ROI.
Figure 13. Representative V_R and V_C images at 10 s.
Figure 14. TSI coordinate transform results: (a) V_ROI image and (b) representative I_ROI image at 1 s.
The representative outputs of the CNN process are displayed in Appendix 2. The entire deep CNN process is repeated three times, and the resulting probability maps are averaged to reduce errors. Based on the probability map, the potential crack regions in the V_ROI image can be identified. Subsequently, the statistical denoising process is conducted, and the V_D image is obtained by mapping the potential crack locations onto the V_ROI image, as shown in Figure 16(a). Here, the rectangular masks indicate the possible crack regions. The performance of the deep CNN process using the vision image can be evaluated by calculating reliability indices such as precision and recall:

Precision = Tp / (Tp + Fp)    (14)

Recall = Tp / (Tp + Fn)    (15)
where Tp, Fp, and Fn represent the true positive, false
positive, and false negative, respectively. The precision
and recall values are computed as 59.84% and 97.26%,
respectively. The precision value, which reflects false-positive alarms, is relatively low because the fake crack is recognized by the deep CNN process as a real crack. On the other hand, the recall value, which reflects false-negative alarms, is relatively high, meaning that cracks of various sizes are successfully detected by the vision-based deep CNN process, thanks to the well-controlled laboratory conditions.
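Equations (14) and (15) can be computed directly from boolean detection masks. The small numpy sketch below illustrates this; the example mask arrays are illustrative, not the experimental data.

```python
import numpy as np

def precision_recall(pred, truth):
    """Precision and recall from boolean predicted/ground-truth crack masks."""
    tp = np.sum(pred & truth)    # true positives: detected real cracks
    fp = np.sum(pred & ~truth)   # false positives: detections with no crack
    fn = np.sum(~pred & truth)   # false negatives: missed cracks
    return tp / (tp + fp), tp / (tp + fn)
```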
Decision-making by image matching. In order to reduce the vision data–driven false alarms, the image matching process is performed. The potential crack regions on the I_P image are selected by matching the crack locations on the V_D image, as shown in Figure 16. Then, the extracted I_M images are all tested by the deep CNN process again. Only crack images are saved as the I_D images and used for mapping onto the V_D image, resulting in the final image; non-crack images identified by the deep CNN process are discarded, as shown in Figure 10. The image matching results show that the fake crack outlined by the dash-single dotted box (yellow in color) is clearly filtered out on the I_D image, as shown in Figure 16(b). Finally, Figure 17 shows the final image containing only crack information. To compare the crack detectability of the V_D and final images, Table 2 summarizes the precision and recall indices. Both indices are clearly enhanced when the hybrid images, including vision and IR images, are used for the deep CNN process. In particular, the precision index is remarkably increased because the fake crack is filtered out. The recall index is also increased, meaning that the final image provides higher reliability for crack detection than the V_D image.

Figure 15. The I_P image.
Figure 16. The deep CNN process results with crack masks: (a) V_D image and (b) I_D images mapped on the I_P image.

Table 2. Comparison of crack detectability between the vision and final images (%).

            Vision image (V_D image)   Final image (V_D and I_D images)
Precision   59.84                      98.72
Recall      97.26                      99.23
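The matching-and-reclassification step above can be sketched as a simple filter over the candidate boxes. In this sketch, `classify_ir` is a stand-in for the second deep CNN pass on the I_P patches, and the 0.5 decision threshold is an illustrative assumption.

```python
import numpy as np

def filter_boxes(boxes, ir_image, classify_ir, threshold=0.5):
    """Keep only candidate crack boxes that the IR-side classifier confirms."""
    kept = []
    for (y, x, h, w) in boxes:
        patch = ir_image[y:y + h, x:x + w]   # candidate region on the IR phase image
        if classify_ir(patch) >= threshold:  # confirmed as a crack on the second pass
            kept.append((y, x, h, w))
    return kept
```

Boxes that the vision CNN flagged but the IR classifier rejects (e.g. the fake crack) are simply dropped, which is what raises the precision of the final image.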
Integration of the HIS system with UAVs
Since the HIS system used in the preliminary indoor tests is relatively large and heavy, its miniaturization and packaging are now in progress to reduce the physical size and weight for mounting onto UAVs. The modified HIS system will be mounted onto the sticking-type UAV shown in Figure 18. The sticking-type UAV can inspect a target structure by sticking to the target surface, which effectively reduces movement and vibration during data acquisition of the HIS system. In addition, the effective working distance between the HIS system and the target surface can be maintained for precise crack evaluation. Furthermore, the sticking-type UAV is robust against unexpected turbulence around large civil infrastructures. The corresponding outdoor tests will be performed on long-span cable-stayed bridges in South Korea.
Conclusion
This study presented deep learning–based concrete
crack detection using hybrid images. An HIS system
combining vision and IR thermography images was
newly developed for unmanned vehicle or robot-
mounted autonomous crack inspection of large-scale
concrete structures. Then, a deep CNN–based autono-
mous crack detection algorithm using the hybrid
images was proposed. The proposed system and algo-
rithm were experimentally validated using a lab-scale
concrete specimen with cracks of various sizes as the
very first stage of this concept. The test results revealed that macro- and microcracks are automatically and successfully visualized while minimizing false alarms. In particular, false-negative and false-positive alarms were remarkably reduced using the hybrid images compared to using only the vision image, resulting in improved crack detection reliability.
As a follow-up study, the proposed HIS system is being miniaturized and optimized for mounting onto UAVs. Outdoor tests under varying environmental and operational conditions will then be thoroughly carried out, and the system will be applied to real civil infrastructures such as bridges, dams, and buildings. Additional real-world data, including shadows, dust on the surface, and rust, will be used for further training before real applications. The proposed technique can become a promising alternative for crack inspection of large civil infrastructures by minimizing inspection time, false alarms, and unreliable expert intervention.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with
respect to the research, authorship, and/or publication of this
article.
Funding
The author(s) disclosed receipt of the following financial sup-
port for the research, authorship, and/or publication of this
article: The research described in this article was financially
supported by a grant (17SCIP-C116873-02) from the Construction Technology Research Program funded by the Ministry of Land, Infrastructure and Transport of the Korean government and by the Basic Science Research Program of the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT and Future Planning (2015R1C1A1A01052625).

Figure 17. The final image.
Figure 18. Sticking-type UAV with the HIS system: (a) schematic design and (b) the prototype model.
References
1. Chang P, Flatau A and Liu S. Review paper: health mon-
itoring of civil infrastructure. Struct Health Monit 2003;
2: 257–267.
2. Maheshwari M, Annamdas V, Pang J, et al. Crack moni-
toring using multiple smart materials; fiber-optic sensors
& piezo sensors. Int J Smart Nano Mater 2017; 8: 41–55.
3. Dumoulin C and Deraemaeker A. Real-time fast ultraso-
nic monitoring of concrete cracking using embedded
piezoelectric transducers. Smart Mater Struct 2017; 26:
104006.
4. Ham S, Song H, Oelze M, et al. A contactless ultrasonic
surface wave approach to characterize distributed crack-
ing damage in concrete. Ultrasonics 2017; 75: 46–57.
5. Menendez E, Victores J, Montero R, et al. Tunnel struc-
tural inspection and assessment using an autonomous
robotic system. Automat Constr 2018; 87: 117–126.
6. Hlava Z. Detection of crack in a concrete element by
impact-echo method. Ultrasound 2009; 64: 12–16.
7. Li B, Ushiroda K, Yang L, et al. Wall-climbing robot for
non-destructive evaluation using impact-echo and metric
learning SVM. Int J Intell Robot Appl 2017; 1: 255–270.
8. Han B, Zhang K, Yu X, et al. Electrical characteristics
and pressure-sensitive response measurements of carboxyl
MWNT/cement composites. Cement Concrete Compos
2012; 34: 794–800.
9. Chen P and Chung D. Carbon fiber reinforced concrete
for smart structures capable of non-destructive flaw
detection. Smart Mater Struct 1993; 2: 22–33.
10. McCormick N and Lord J. Digital image correlation.
Mater Today 2010; 13: 52–54.
11. Helm J. Digital image correlation for specimens with mul-
tiple growing cracks. Exp Mech 2008; 48: 753–762.
12. Jahanshahi M, Masri S, Padgett C, et al. An innovative
methodology for detection and quantification of cracks
through incorporation of depth perception. Mach Vision
Appl 2013; 24: 227–241.
13. Koch C, Paal S, Rashidi A, et al. Achievements and chal-
lenges in machine vision-based inspection of large con-
crete structures. Adv Struct Eng 2014; 17: 303–318.
14. Ho H, Kim K, Park Y, et al. An efficient image-based
damage detection for cable surface in cable-stayed
bridges. NDT&E Int 2013; 58: 18–23.
15. Kim H, Ahn E, Cho S, et al. Comparative analysis of
image binarization methods for crack identification in
concrete structures. Cement Concrete Res 2017; 99: 53–61.
16. Zhong X, Peng X, Yan S, et al. Assessment of the feasi-
bility of detecting concrete cracks in images acquired by
unmanned aerial vehicles. Automat Constr 2018; 89:
49–57.
17. Ellenberg A, Kontsos A, Moon F, et al. Bridge related
damage quantification using unmanned aerial vehicle ima-
gery. Struct Control Health Monit 2016; 23: 1168–1179.
18. An YK, Yang J, Hwang S, et al. Line laser lock-in ther-
mography for instantaneous imaging of cracks in semi-
conductor chips. Opt Laser Eng 2015; 73: 128–136.
19. Yang J, Hwang S, An YK, et al. Multi-spot laser lock-in
thermography for real-time imaging of cracks in semicon-
ductor chips during a manufacturing process. J Mater
Process Tech 2016; 229: 94–101.
20. Zhang A, Wang K, Li B, et al. Automated pixel-level pavement crack detection on 3D asphalt surfaces using a deep-learning network. Comput-Aided Civ Inf 2017; 32: 12297.
21. Chen FC and Jahanshahi R. NB-CNN: deep learning-based crack detection using convolutional neural network and Naïve Bayes data fusion. IEEE T Ind Electron 2017; 65: 17519431.
22. Xu Y, Bao Y, Chen J, et al. Surface fatigue crack identifi-
cation in steel box girder of bridges by a deep fusion con-
volutional neural network based on consumer-grade
camera images. Struct Health Monit. Epub ahead of print
2 April 2018. DOI: 10.1177/1475921718764873.
23. Cha YJ, Choi W and Büyüköztürk O. Deep learning-based crack damage detection using convolutional neural networks. Comput-Aided Civ Inf 2017; 32: 361–378.
24. Kim H, Ahn E, Shin M, et al. Crack and noncrack classi-
fication from concrete surface images using machine
learning. Struct Health Monit. Epub ahead of print 23
April 2018. DOI: 10.1177/1475921718768747.
25. Zhang Z. A flexible new technique for camera calibration. IEEE T Pattern Anal Mach Intell 2000; 22: 1330–1334.
26. Vidas S, Lakemond R, Denman S, et al. A mask-based
approach for the geometric calibration of thermal-infrared
cameras. IEEE T Instrum Meas 2012; 61: 1625–1635.
27. More J. The Levenberg-Marquardt algorithm: implemen-
tation and theory. In: Watson GA (ed.) Lecture notes in
mathematics, vol. 630. New York: Springer, 1977.
28. Kanatani K, Ohta N and Kanazawa Y. Optimal homo-
graphy computation with a reliability measure. IEICE T
Inform Syst 2000; E83-D: 1369–1374.
29. Brown D. Close-range camera calibration. Photogram
Eng 1971; 37: 855–866.
30. Hahn S. Hilbert transforms in signal processing. Norwood, MA: Artech House, 1996.
31. Szegedy C, Liu W, Jia Y, et al. Going deeper with convo-
lutions. In: Proceedings of the IEEE conference on com-
puter vision and pattern recognition (CVPR), Boston,
MA, 7–12 June 2015. New York: IEEE.
32. An YK, Park B and Sohn H. Complete noncontact laser
ultrasonic imaging for automated crack visualization in a
plate. Smart Mater Struct 2013; 22: 025022.
33. An YK, Kim J and Sohn H. Laser lock-in thermography
for detection of surface-breaking fatigue cracks on
uncoated steel structures. NDT&E Int 2014; 65: 54–63.
Appendix 1. Representative training images.
Appendix 2. Representative outputs of the CNN process.