ArticlePDF Available

Acceleration of Hyperspectral Skin Cancer Image Classification through Parallel Machine-Learning Methods

February 2024
Sensors 24(5):1399

February 2024
24(5):1399

DOI:10.3390/s24051399

License
CC BY 4.0

Authors:

Emanuele Torti

University of Pavia

Elisa Marenzi

University of Pavia

Hyperspectral imaging (HSI) has become a very compelling technique in different scientific areas; indeed, many researchers use it in the fields of remote sensing, agriculture, forensics, and medicine. In the latter, HSI plays a crucial role as a diagnostic support and for surgery guidance. However, the computational effort in elaborating hyperspectral data is not trivial. Furthermore, the demand for detecting diseases in a short time is undeniable. In this paper, we take up this challenge by parallelizing three machine-learning methods among those that are the most intensively used: Support Vector Machine (SVM), Random Forest (RF), and eXtreme Gradient Boosting (XGB) algorithms using the Compute Unified Device Architecture (CUDA) to accelerate the classification of hyperspectral skin cancer images. They all showed a good performance in HS image classification, in particular when the size of the dataset is limited, as demonstrated in the literature. We illustrate the parallelization techniques adopted for each approach, highlighting the suitability of Graphical Processing Units (GPUs) to this aim. Experimental results show that parallel SVM and XGB algorithms significantly improve the classification times in comparison with their serial counterparts.

Flow diagram of parallel RF classifier.

…

Example of sequential addressing reduction technique.

…

Average classification times for SVM, RF, and XGB for all the CPU and GPU devices.

…

Comparison between classification times of our work with the state of the art.

…

Figures - available via license: Creative Commons Attribution 4.0 International

Content may be subject to copyright.

Available via license: CC BY 4.0

Content may be subject to copyright.

Citation: Petracchi, B.; Torti, E.;

Marenzi, E.; Leporati, F. Acceleration

of Hyperspectral Skin Cancer Image

Classiﬁcation through Parallel

Machine-Learning Methods. Sensors

2024,24, 1399. https://doi.org/

10.3390/s24051399

Academic Editors: Christos Nikolaos

E. Anagnostopoulos, Stelios Krinidis

and Jan Cornelis

Received: 6 December 2023

Revised: 29 January 2024

Accepted: 16 February 2024

Published: 21 February 2024

Licensee MDPI, Basel, Switzerland.

This article is an open access article

distributed under the terms and

conditions of the Creative Commons

Attribution (CC BY) license (https://

creativecommons.org/licenses/by/

4.0/).

sensors

Article

Acceleration of Hyperspectral Skin Cancer Image Classiﬁcation

through Parallel Machine-Learning Methods

Bernardo Petracchi , Emanuele Torti , Elisa Marenzi and Francesco Leporati *

Department of Electrical, Computer and Biomedical Engineering, University of Pavia, I-27100 Pavia, Italy;

bernardo.petracchi01@universitadipavia.it (B.P.); emanuele.torti@unipv.it (E.T.); elisa.marenzi@unipv.it (E.M.)

*Correspondence: francesco.leporati@unipv.it

Abstract: Hyperspectral imaging (HSI) has become a very compelling technique in different scientiﬁc

areas; indeed, many researchers use it in the ﬁelds of remote sensing, agriculture, forensics, and

medicine. In the latter, HSI plays a crucial role as a diagnostic support and for surgery guidance.

However, the computational effort in elaborating hyperspectral data is not trivial. Furthermore,

the demand for detecting diseases in a short time is undeniable. In this paper, we take up this

challenge by parallelizing three machine-learning methods among those that are the most intensively

used: Support Vector Machine (SVM), Random Forest (RF), and eXtreme Gradient Boosting (XGB)

algorithms using the Compute Uniﬁed Device Architecture (CUDA) to accelerate the classiﬁcation of

hyperspectral skin cancer images. They all showed a good performance in HS image classiﬁcation,

in particular when the size of the dataset is limited, as demonstrated in the literature. We illustrate

the parallelization techniques adopted for each approach, highlighting the suitability of Graphical

Processing Units (GPUs) to this aim. Experimental results show that parallel SVM and XGB algorithms

signiﬁcantly improve the classiﬁcation times in comparison with their serial counterparts.

Keywords: hyperspectral imaging; machine learning; support vector machine; random forest;

eXtreme gradient boosting; GPU

1. Introduction

Skin cancer represents one of the most predominant tumors [

], and in recent years,

its occurrence has progressively increased. Such lesions are typically categorized into

two main

groups: melanoma skin cancer (MSC) and non-melanoma skin cancer (NMSC) [

Typically, this cancer type involves three types of cells: squamous, basal, or

melanocytic cells.

MSC originates from melanocytes, cells located in the epidermis and responsible for

skin color, thanks to melanin production. MSC can be further subdivided into three sub-

types: superﬁcial extension, lentigo maligna, and nodular tumor [

]. This is the rarest type

of skin cancer, with, if not promptly detected, the highest growth speed and, consequently,

is very difﬁcult to treat [

]. Therefore, doctors and surgeons need fast, reliable diagnostic

systems for this kind of pathology.

The traditional diagnosis procedure is biopsy, which consists in the removal of a

sample of tissue from the living body, followed by histopathological inspection [

representing an onerous and time-consuming process [5–7].

To face these problems, minimally intrusive techniques have been investigated, in-

cluding hyperspectral imaging (HSI), acquiring information about a scene both in the

spatial and in the spectral domain [

]. In fact, a hyperspectral image is represented by

a so-called hypercube containing the spectral information of every pixel over a speciﬁc

wavelength range. HSI allows precise material identiﬁcation [

] by measuring the fraction

of the incident electromagnetic radiation reﬂected by the surface (reﬂectance). This is

due to the characteristic variation in the reﬂectance over the wavelength typical of each

material, which is called the spectral signature [

]. In contrast with traditional imaging

Sensors 2024,24, 1399. https://doi.org/10.3390/s24051399 https://www.mdpi.com/journal/sensors

Sensors 2024,24, 1399 2 of 16

techniques, HSI allows the acquisition of images with a large number of spectral bands

both within the visible and non-visible range. This means that the acquired images contain

much more information compared to traditional ones, such as RGB images, and can lead to

better performances [11].

However, although the development of accurate tools in the medical ﬁeld is funda-

mental, timing requirements should also be taken into consideration when providing a

quick diagnosis is necessary. Indeed, the prompt detection of skin lesions facilitates their

treatment and increases the probability of survival of the patients.

To achieve this goal, many researchers [

–

] have exploited different kinds of devices

suitable for parallel elaboration and computation when the data size is high. Among

these, Graphical Processing Units (GPUs), used in different scientiﬁc applications [

represent a suitable technology in the ﬁeld of medical image processing. In addition,

compared with other devices such as Field Programmable Gate Arrays (FPGAs), GPUs

usually offer a bigger parallel factor due to their high memory bandwidth [20].

Existing works in the literature have focused on the classiﬁcation of HSI skin cancer

images by adopting machine-learning (ML) and deep-learning (DL) methods [

–

In [

], a classiﬁcation chain based on K-means, Spectral Angle Mapper (SAM), and

SVM was considered. The authors also implemented several parallel versions of their

classiﬁcation system exploiting multicore and many-core technologies.

The research in [

] implemented SVM, RF, and XGB, obtaining a mean classifi-

cation accuracy of 97%, considering only the model’s optimization and not the algor-

ithms’ parallelization.

Several DL models have been adopted in [

], namely, ResNet-18, ResNet-50,

ResNet-101

a ResNet-50 variant, U-Net, and U-Net++ architectures. Since neural networks are time-

consuming and computationally expensive, a parallel version of the U-Net++, resulting in

the best predictive approach, has been implemented using a low-power NVIDIA Jetson GPU.

This parallel version has achieved adequate classification performance satisfying real-time

constraints with a low power consumption.

Some works related to ML method parallelization can be found in [

], where

parallel versions of SVM and XGB have been developed for HSI image classiﬁcation.

In this paper, we propose the optimization and parallelization of three popular ML

methods to accelerate the HSI skin cancer image classiﬁcation using the Compute Uniﬁed

Device Architecture (CUDA), a framework for parallel elaboration developed by NVIDIA.

More speciﬁcally, the considered approaches are SVM, RF, and XGB, which offer a good per-

formance in classifying HSI images when the dimensions of the dataset are

limited [31,34].

Furthermore, the works in [

] showed a great reduction in the classiﬁcation time

developing parallel versions of SVM and XGB, even achieving real-time processing.

This work presents the parallelization techniques implemented on different NVIDIA

GPU devices including a GeForce RTX 2080 GPU, a GeForce RTX 4090 GPU, and a cluster

composed of ﬁve nodes of three Tesla A16 GPUs. Performance differences between the

devices in the classiﬁcation of HSI skin cancer images have also been highlighted. Indeed,

GeForce RTX 2080 and 4090 GPUs are optimized for graphics applications, while the cluster

is designed for scientiﬁc calculations. In particular, the GeForce RTX 4090 is characterized

by the latest-generation architecture (Ada Lovelace), while the GeForce RTX 2080 features

an older architecture (Turing) and is cheaper than the previous one. Lastly, each Tesla A16

features an Ampere architecture.

Experimental results show a signiﬁcant improvement of the parallel version of SVM

and XGB compared to their serial counterparts, with a speed-up of 130x and 1.4x, re-

spectively, conﬁrming that GPUs represent a valid technology in accelerating the medical

diagnosis process.

This manuscript is organized as follows. Section 2describes the HSI skin cancer

dataset and the adopted ML algorithms. Furthermore, the adopted techniques to perform

the serial and the parallel inference of the algorithms, and the architectures of the adopted

Sensors 2024,24, 1399 3 of 16

devices are shown. The obtained results are illustrated in Section 3, while Section 4presents

the discussions, and Section 5provides conclusions and future developments.

The main contributions of this paper are the following: description of the paralleliza-

tion of the SVM, RF, and XGB methods targeting GPUs; parallelization on different devices,

considering the most recent architectures developed by NVIDIA; and comparison of the

results with the state of the art, highlighting the improvement of skin cancer diagnosis

through parallel image processing.

2. Materials and Methods

2.1. Hyperspectral Sensors and the Skin Cancer Dataset

The evolution of hyperspectral sensors has resulted in the creation of various platforms,

specialized for particular applications and operational needs. The four main sensor types,

namely pushbroom, whiskbroom, stereoscopic, and snapshot are fundamental to the

hyperspectral imaging landscape [

–

]. Pushbroom sensors function through constant

scanning of the scene using a linear or 2D array of detectors. As the platform moves, the

sensor captures spectral information for every pixel in the scene, resulting in a continuous

spectral image. This technique enhances both spatial and spectral resolution, making

pushbroom sensors highly suitable for applications that demand a thorough analysis of

speciﬁc regions [39].

Whiskbroom sensors operate similarly to pushbroom ones, except for their scanning

mechanism. Rather than recording an entire line at once, whiskbroom sensors collect data

one point at a time. The sensor sweeps across the scene, gathering spectral information for

each point sequentially. Whiskbroom sensors are celebrated for their adaptability and are

frequently utilized in airborne and spaceborne reconnaissance [40].

Stereoscopic hyperspectral sensors employ several detectors to capture images from

marginally divergent viewpoints. By leveraging stereoscopic vision, these sensors provide

not only spectral data but also depth information. This facilitates the creation of 3D

models and improves the interpretation of intricate surroundings, such as hilly terrains or

urban landscapes [41].

Snapshot sensors, also referred to as snapshot hyperspectral imaging systems, obtain a

complete spectral image with a single exposure. This is accomplished through cutting-edge

optical designs that record data concurrently for all spectral ranges. Snapshot sensors

enable quick data acquisition and are ideal for dynamic scenarios or situations needing

promptly available spectral information [42].

A thorough knowledge of the peculiar characteristics of each hyperspectral sensor is

crucial to select the most appropriate technology for a particular application. Concerning

skin cancer detection, the snapshot sensor is the best choice since it acquires the whole

images in a single exposure [25,36].

The HSI skin cancer dataset used is the one considered in [

]; it contains

76 images

of skin lesions from 61 subjects, 40 of which are benign and 36 are malignant.

They were acquired with a snapshot camera (Cubert UHD, Cubert GmbH, Ulm, Germany)

able to cover the 450–950 nm range, distributed over 125 spectral channels [

]. The images

were collected in two hospitals of the Canary Islands, Spain: the Hospital Universitario de

Gran Canaria Doctor Negrín and the Complejo Hospitalario Universitario Insular-Materno

Infantil. The image labelling was led by experts such as dermatologists and pathologists

according to the taxonomy described in [32].

The spectral signatures among different patients have been normalized as illustrated

in [

] to mitigate the variations in illumination conditions. At the end of preprocessing,

the spectral signatures contain 116 bands with values in the range [0, 1].

Figure 1shows the percentage distributions of the skin lesions that include four

possible classes: Benign Epithelial (BE), Benign Melanocytic (BM), Malignant Epithelial

(ME), and Malignant Melanocytic (MM).

Sensors 2024,24, 1399 4 of 16

Figure 1. Percentage distribution of each lesion.

Figure 2shows four images taken from the dataset representing one of the considered

lesions, together with the mean spectral signatures of the hyperspectral pixels.

Figure 2. Synthetic RGB images taken from the database to represent each lesion and the mean

spectra of the pixels.

2.2. Machine-Learning Methods

This section gives a general overview of the SVM, RF, and XGB methods adopted to

classify the HSI skin cancer images. Speciﬁcally, theoretical aspects of the three algorithms

will be presented.

2.2.1. Support Vector Machine

SVM is a supervised machine-learning method proposed by Vapnik and extensively

used for classiﬁcation and regression tasks [

–

]. Originally, SVM performs binary

classiﬁcations and aims to ﬁnd the hyperplane which splits the dataset into discrete classes

Sensors 2024,24, 1399 5 of 16

according to the given training samples [

]. The data points with the minimum distance

from the hyperplane are called support vectors (SVs). For multiclass classiﬁcation, SVM

breaks down the multiclass problem into multiple binary classiﬁcation ones, solving the

following equation:

min

w,b,ζ

2wTw+C∑n

i=1ζi

subject to yiwTxi+b≥1−ζi, (1)

ζi≥0with i =1, . . . , n

where

is the support vectors,

is the penalty term,

ζi

is the distance error from the

correct margin,

is the classes,

is the margin,

is the training vectors, and

is the

number of training samples. Intuitively, the goal is to maximize the margin by minimizing

wTw, while incurring a penalty when a sample is misclassiﬁed.

The minimization problem described by Equation (1) can be transformed into a dual

problem given by Equation (2):

min

2αTQα−eTα

subject to yTα=0, (2)

0≤αi≤C with i =1, . . . , n

where

is a vector of all ones, and

is an

positive semideﬁnite matrix whose

elements are deﬁned in Equation (3):

Qij =yiyjKxixj(3)

is the kernel function that maps the data from a low-dimensional space to another space

with high dimensions. Once the optimization problem is solved, the output of decision

function for a given sample xbecomes:

∑i€SV αiK(wi,x)+b(4)

where

αi

is the dual coefﬁcients. The sign of Equation (4) gives the binary classiﬁcation,

while the multiclass classiﬁcation is achieved according to the “one-vs.-one” strategy by

repeatedly applying Equation (4).

2.2.2. Random Forest

RF was ﬁrst introduced by Leo Breiman [

]. It is a popular ensemble learning

algorithm used for both classiﬁcation and regression tasks. It combines the predictions

of multiple decision trees to improve the predictive accuracy and control over-ﬁtting.

Speciﬁcally, each tree performs a “partial” prediction, and the class with the most votes

becomes the ﬁnal prediction. Using a random subset of data and features, each decision

tree in the RF is built recursively by splitting the data according to various criteria (e.g., Gini

impurity or information gain) until a stopping criterion is met. The latter can be a maximum

tree depth, a minimum number of samples required to split a node, or a minimum number

of samples required in a leaf node.

2.2.3. eXtreme Gradient Boosting

XGB is an ensemble learning algorithm similar to RF. It is based on a generalized gra-

dient boosting method, and is used for classiﬁcation, regression, and ranking

tasks [48–50]

It provides highly accurate classiﬁcations by combining the predictions of multiple weak

predictive models, typically decision trees. One of the strong points of XGB is the sequential

addition of new models correcting the mistakes made by previous models. Particularly,

it optimizes a speciﬁc loss function by computing its gradient compared to the predicted

values. XGB builds N trees per class; the outputs of the trees belonging to the same class

Sensors 2024,24, 1399 6 of 16

are summed. The soft-max function is then applied to the outputs to obtain the probability

values of the class. The class with the biggest value is the ﬁnal prediction.

2.3. CPU and GPU Technologies

This section describes the architectures and the main features of the CPU and GPU

devices employed for the inference implementation of the three algorithms. For the serial

inference, we used an Intel Core i9-13900K with a clock frequency of 3 GHz. It is based

on the Raptor Lake architecture developed adopting an Intel 7 processor (10 nm), with

24 cores,

32 threads, and 32 MB and 36 MB of L2 and L3 cache memory, respectively. The

maximum bandwidth achievable is 89.6 GB/s.

The ﬁrst two GPU devices considered for the parallel inference were an NVIDIA

GeForce RTX 2080 and an NVIDIA GeForce RTX 4090, optimized for graphics applications.

The NVIDIA GeForce RTX 2080 is based on the Turing architecture with 2944 CUDA

cores and a clock frequency of 1.5 GHz. Other components of this device include

184 texture

units, 64 Render Output Units (ROPs), 368 tensor cores, 46 ray tracing (RT) cores, and 8 GB

of GDDR6 modules. The maximum bandwidth achievable is 448 GB/s.

The NVIDIA GeForce RTX 4090 is supported by the Ada Lovelace architecture with

16,384 CUDA cores and a clock frequency of 2.2 GHz. It also contains 512 tensor cores,

176 ROPs, and 128 RT cores. The memory dimension is 24 GB (GDDR6X), and the maximum

bandwidth is 1008 GB/s.

The last GPU device considered is a cluster dedicated to the scientiﬁc calculation com-

posed of ﬁve nodes of three NVIDIA Tesla A16s. Each GPU of the cluster is equipped with

four chips and features the Ampere architecture. Every chip of the GPU has

1280 CUDA

cores, 40 tensor cores, 16 GB of GDDR6, and a memory bandwidth of 200 GB/s.

2.4. CPU Inference

The inference of the algorithms described in Section 2.2 has been implemented using

the best parameters obtained after the training phase as detailed in [

]. Visual Studio 2022

Integrated Development Environment (IDE) was used, adopting the C language.

The serial implementation has been used as a basis for the parallel inference described

in Section 2.5.

2.4.1. SVM Inference

The SVM inference consisted in the implementation of Equation (4). The dual coefﬁ-

cients, the margin, the support vectors, and the type of kernel function have been identiﬁed

after both the training and the parameters tuning described in [

]. The Radial Basis

Function (RBF) resulted as the most appropriate kernel function, and it is represented by

the following equation:

K(wi,x)=e−γ||wi−x||2(5)

where γis the kernel parameter, whose best value obtained after the training was 10.

The steps executed to perform the SVM inference can be summarized as follows:

1. Kernel calculation for the sample to classify according to Equation (5);

Multiplication between the obtained kernel and the dual coefﬁcients adding the

bias b;

3. Pixel classiﬁcation through the “one-vs.-one” strategy.

The pseudo-code of the SVM inference is reported in Algorithm 1. Lines 2 to 4 perform

the kernel calculation by evaluating the squared Euclidean distance between the support

vectors and the sample to classify. The second step is executed in

lines 6 to 10

, where the

distance of the sample from the hyperplane is calculated according to Equation (4). Due to

the nested loops, the distance is calculated

nclass ∗(nclass −1)/

2 times. With

nclass =

10 values of the distance are obtained. Lines 12 to 21 show the last step that aims to perform

the ﬁnal prediction by observing the sign of the 10 values of the distance: if

dij

is positive

(negative), then class

wins (loses) over class

, and the array

scorei

(

scorej

is incremented

Sensors 2024,24, 1399 7 of 16

by one. Finally, line 21 ﬁnds the index of the maximum value in the array

scorei

, or rather,

the class obtaining the greatest number of scores.

Algorithm 1 Serial implementation of Support Vector Machine

Input:γ→Kernel parameter

DCij →Dual coefﬁcients matrix

wi→Support vectors matrix

x→Pixel to classify

b→Bias

1: Ste p 1 : Kernel calcul ation

2: for i=0to nsv −1

3: K(wi,x)=exp−γ∗∥wi−x∥2;

4: end

5: Ste p 2 : Distance o f the sample f rom the hy perplane

6: for i=0to nclass −1

7: for j=i+1to nclass −1

8: dij =∑

i€SV

DCij ∗K(wi,x) + b;

9: end

10: end

11: Step 3: “One vs. one” strategy

12: for i=0to nclass −1

13: scorei=0

14: for j=i+1to nclass −1

15: if dij >0

16: scorei+ +;

17: else

18: scorej+ +;

19: end

20: end

21: Find imax, index of the scoreimaximum

Output:imax

2.4.2. RF Inference

The core of serial RF inference is a recursive function representing the tree structure.

According to the obtained trained values of the features, the thresholds, as well as the

left and right children’s nodes of each parent node, the execution follows a speciﬁc path

in the tree. If the execution ends in a non-leaf node, the function is repeated and drives

the execution to the next node depending on the left and right children’s values. The

recursion stops when the execution ends in a leaf containing the output. The output of

this function is an array of 5 elements containing the probability values of the pixel of

belonging to each class. Then, a second function was realized with the goal to execute the

tree structure N times, where N is the number of decision trees. Therefore, each tree makes

its prediction on the pixel, and the class having the greatest number of votes is the ﬁnal

prediction. The number of decision trees used in this work is 425, obtained after the training

phase. The pseudo-code of RF inference is shown in Algorithm 2. Line 2 corresponds to

the

tree_structure

function that outputs the probability array (

prob_array

) exploiting the

features, thresholds, and left and right children’s node (

input_data)

. Lines 4 to 8 perform

the forest in which, at each iteration, the

tree_structure

function runs and the index of

prob_array

maximum is obtained. At the end of the iterations, the array

class

contains the

number of votes per each class. The ﬁnal prediction is the most voted class and is obtained

in line 9.

Sensors 2024,24, 1399 8 of 16

Algorithm 2 Serial implementation of Random Forest

Input:input_data →Features, thresholds, left and right

children’s nodes

1: Step 1: Development of the tree_structure f u nction

2: The single tree outputs prob_array

3: Step 2: Building of the forest

4: for i=0to ntrees −1

5: tree_structure(input_data,prob_array,i);

6: Find max, index of prob_array maximum

7: classmax + +;

8: end

9: Find imax, index of the class maximum

Output:imax

2.4.3. XGB Inference

XGB is based on the same

tree_structure

function of the RF, but in this case, the output

is a single value. The forest structure function builds N decision trees for each class; each

tree improves the output of the previous tree (belonging to the same class) by considering

its prediction mistakes. The optimal number of decision trees obtained after the training

was 400, so the forest structure function builds 2000 decision trees overall.

The outputs of the decision trees belonging to the same class are summed. In

Algorithm 3

, the pseudo-code of the XGB inference is shown. Line 2 is related to the

tree_structure

function that outputs the probability value of the single tree. Then, the forest

function is described in lines 4 to 8, where the sums of the outputs of the trees belonging to

the same class are stored in the

array of 5 elements. Lines 10 to 18 determine the ﬁnal

probability array

according to the soft-max function reported in Equation (6). The index

of Pimaximum is the ﬁnal prediction according to line 19.

P[i] = ZE[i]

∑nclass

j=0ZE[j](6)

Algorithm 3 Serial implementation of eXtreme Gradient Boosting

Input:input_data →Features, thresholds, left and right

children’s nodes

1: Step 1: Development of the tree_structure f u nction

2: The single tree outputs the probability value of its class

3: Step 2: Building of the forest

4: for i=0to nclass −1

5: for e=0to ntrees −1

6: Zi+ = tree_structure(input_data,e∗nclass +i);

7: end

8: end

9: Ste p 3 : Final probability array through so f t −max f un ction

10: for i=0to nclass −1

11: ZEi=exp(Zi);

12: end

13: for i=0to nclass −1

14: z=∑i€nclass ZEi;

15: end

16: for i=0to nclass −1

17: Pi=ZEi/z;

18: end

19: Find imax, index of the Pimaximum

Output:imax

Sensors 2024,24, 1399 9 of 16

2.5. GPU Inference

This section describes the parallel inference for the SVM, RF, and XGB algorithms.

We adopted the GPU devices described in Section 2.3 and Visual Studio 2022 with CUDA

C language.

In the following sections, we will explain some essential terms to deﬁne the basic

components of the CUDA language. First, we must deﬁne the kernel (a CUDA function)

that, when called, is executed in parallel by N different CUDA threads. Another important

component is the thread block containing a group of threads executed concurrently. The

threads belonging to the same block can cooperate through synchronization barriers. A

thread block uses the shared memory for inter-thread communication and the data sharing.

Finally, a grid is an array of thread blocks executing the same kernel; it reads and writes in

the global memory of the GPU. Each thread and block can be identiﬁed through the threa-

dIdx = (threadIdx.x,threadIdx.y,threadIdx.z) and blockIdx = (blockIdx.x,blockIdx.y,blockIdx.z)

coordinates, respectively. The dimension of the thread block is deﬁned by the blockDim =

(blockDim.x,blockDim.y,blockDim.z) array.

2.5.1. Parallel SVM

The most computationally expensive operations in SVM are Step 1 and Step 2 of

Algorithm 1 in Section 2.4.1.Step 1 involves the SV matrix (116

47,220) and the image

to classify (2500

116), while Step 2 performs the product between the obtained kernel

(2500 ×47,220) and the dual coefﬁcients matrix (47,220 ×4).

Step 2 was performed through a CUDA kernel using a number of blocks equal to

(N+nthreads −1

nthreads

with

nthreads =

32 and

being the number of SVs. The choice to

use 32 as the number of threads is because the basic unit of execution in an NVIDIA GPU is

the warp, a collection of 32 threads executed simultaneously by a Streaming Multiprocessor

(SM) of the GPU. Therefore, the resulting number of blocks was 1476. The pseudo-code of

Algorithm 4 below represents the kernel calculation through the CUDA syntax.

Algorithm 4 Kernel calculation

Input:γ→Kernel parameter

wi→Support vector matrix

x→Pixel to classify

1: i= blockIdx.x * blockDim.x + threadIdx.x

2: if i<nsv

3: for i=0to nbands −1

4: di=∥wi−x∥2

5: end

6: K(wi,x)=exp(−γ∗di)

Output:K(wi,x)

In line 1, the variables blockIdx.x and threadIdx.x indicate the current block and thread

identifier, while blockDim.x is the block dimension along the x-axis as described in

Section 2.5.

In line 4, the squared Euclidean distance

is shown; each thread performs the difference

between an element of the SV matrix

and an element of the sample to classify

in parallel.

Finally, in line 6, the kernel K(wi,x)is obtained.

Then, Step 2 was implemented by adopting the cublasSgemm and the cublasSaxpy

functions (from the cuBLAS library) explicitly designed for matrix operations: the ﬁrst has

been used to perform the multiplication between the kernel and the dual coefﬁcients

matrix, the second to sum the obtained result and

. The result of this step was a vector

of 10 elements containing the outputs of the decision function (see Equation (4)). Step 3

was performed employing 1 block of 5 threads (1 per class), whose task was to apply the

“one-vs.-one”

strategy. Finally, the cublasIsamax function has been used to determine the

ﬁnal prediction.

Sensors 2024,24, 1399 10 of 16

2.5.2. Parallel RF

For the parallel version of RF, the intrinsic nature of decision trees that is based on

sequences of if–else statements causes threads divergence, representing a challenge that did

not allow the parallelization of the

tree_structure

function. Therefore, such function has

been declared as a device function using the CUDA keyword

__device__

, meaning that

the function is called by the GPU.

The forest structure was realized with a CUDA kernel composed of 425 blocks of

1 thread, with one block for each decision tree and every block having only one thread in

order to avoid the potential thread divergence in the tree_structure function.

The pseudo-code in Algorithm 5 represents the parallel RF inference. Line 2 refers to

the serial RF

tree_structure

with the addition of the

__device__

declaration, as mentioned

above. Lines 4 to 6 perform the forest where each block builds a decision tree and outputs

the prediction (

max)

for that same tree. Furthermore, to prevent race conditions in ﬁlling

the

class

array, line 6 performs the atomicAdd operation to add the value 1 to all the elements

of the array. In line 7, the ﬁnal prediction

imax

is obtained through the cublasIsamax function.

Figure 3shows the ﬂow diagram of the RF classiﬁer and how it is divided between

host and device. The input data, stored in the host, are transferred in the device memory

through the cudaMemcpy function, thus representing the input to the forest structure

device function, where each block implements a decision tree by calling the

tree_structure

function. After that, the cublasIsamax function has been used to make the prediction for

each speciﬁc pixel. Since the device output vector contains the predictions of every pixel

of the image, its dimension is 2500. At last, the device output vector is transferred to the

host memory.

Figure 3. Flow diagram of parallel RF classiﬁer.

2.5.3. Parallel XGB

To perform the parallelized version of the XGB, the forest structure function has been

designed similarly to the parallelized RF: 2000 blocks have been adopted, each including

1 thread, and launching the tree structure function. The values obtained for each block

have been stored in the vector

. Then, the reduction technique has been used to sum

the elements of

related to the same class. To perform this task, the “sequential address-

ing” strategy has been implemented. The code below shows the sequential addressing

reduction technique.

In Code 1, for each class, 400 elements (n_estimators) of

are transferred to the GPU

shared memory through the array

. Then, the for loop reduces the entire upper portion

of the array

to the entire lower portion of

. With 512 values, the upper

256 values

are

reduced into the lower 256 values. Then, the upper 128 values of the lower 256 values

from before are reduced with the lower 128 values. The loop ends when the sum of all the

elements of the array is obtained and stored in the ﬁrst element of S.

The reduction was executed using a 2D grid composed of 1 block of 512 (512 be-

ing the ﬁrst power of 2 greater than 400) threads for the x-axis, and 5 blocks of 1 thread

for the y-axis. Each thread of the x-axis transfers one element of

to the shared mem-

Sensors 2024,24, 1399 11 of 16

ory and sums

two elements

, while the 5 blocks of the y-axis iterate over the classes.

Algorithms 4 and 5

, related to SVM and RF, respectively, involve a single index in perform-

ing their kernels; therefore, the use of a 1D grid was considered sufﬁcient. In the reduction

process, XGB involves two independent indexes,

and

, related to the elements of the

array and to the classes, respectively; as a consequence, a 2D grid has been identiﬁed as

more suitable compared to a 1D grid.

Code 1 Sequential Addressing Reduction

Input:tid,e,b→indexes of the threads and blocks

ncl →number of classes

1: int tid =threadIdx.x;

2: __shared__ f l oat S[512];

3: int e=blockIdx.x∗blockDim.x+threadIdx.x;

4: int b=bl ockIdx.y;

5: if (tid <n_estimators)

6: S[tid]=Z[e∗ncl +b];

7: __syncthreads();

8: for (s=blockDim.x/2; s>0; s≫=1){

9: if (tid < s)

10: S[tid]+ = S[tid +s];

11: __syncthreads();

12:}

Output:S

Algorithm 5 Parallel Random Forest

Input:input_data →Features, thresholds, left and right

children’s nodes

1: Step 1: Development of the device tree_structure f unc tion

2: The single tree outputs max, the prob_array maximum index

3: Step 2: Building of the forest

4: i=blockIdx.x;

5: max =tree_structure(input_data,prob_array,i);

6: atomicAdd(&classmax, 1.0);

7: Find imax, index of the class maximum

Output:imax

The sequential addressing approach solves the warp’s divergence and shared memory

bank conﬂict problems of the interleaved addressing reduction. Figure 4exempliﬁes the

concept of sequential addressing reduction.

Figure 4. Example of sequential addressing reduction technique.

Sensors 2024,24, 1399 12 of 16

To conclude, the ﬁnal probability array

of Equation (6) was obtained using a CUDA

kernel composed by 5 blocks of 1 thread.

3. Results

The inference part of SVM, RF, and XGB methods has been implemented in a serial

and a parallelized version using C and CUDA languages, respectively. The programs have

been developed with the Microsoft Visual Studio 2022 IDE and the CUDA 11.7 toolkit for

the NVIDIA GeForce RTX 2080 GPU and the CUDA 12.0 toolkit for the NVIDIA Tesla A16

and NVIDIA GeForce RTX 4090 GPUs. The serial version was compiled with the v143

compiler of Visual Studio, while the parallel code was compiled with the NVCC compiler

included in the toolkit. The compiler conﬁguration has been set to release mode, meaning

that the optimizations are enabled, and that the full debugging information is not included.

Furthermore, we have set the code generation option of the CUDA compiler to 7.5, 8.6,

and 8.9 values corresponding to the compute capability of the NVIDIA GeForce RTX 2080,

NVIDIA Tesla A16, and NVIDIA GeForce RTX 4090 GPUs. This option allowed us to fully

exploit the architectures of the respective GPUs.

The SVM, RF, and XGB inference has been tested using 10 HSI skin cancer images, all

having dimensions of 50

50 pixels and 116 bands; this dataset contains all the possible

skin lesions.

Speciﬁcally, the average classiﬁcation time of such images has been measured for each

algorithm and for all the adopted technologies. All the average classiﬁcation times with the

standard deviations and the speed-up (in brackets) are reported in Table 1.

Table 1. Average classiﬁcation times for SVM, RF, and XGB for all the CPU and GPU devices.

SVM [s] RF [s] XGB [s]

i9-13900K 445.90 ±105.72 0.51 ±0.01 1.17 ±0.02

RTX 2080 14.10 ±0.09 (32x) 0.77 ±0.00 (0.66x) 0.98 ±0.00 (1.19x)

Tesla A16 40.80 ±0.00 (11x) 1.07 ±0.00 (0.48x) 1.43 ±0.00 (0.82x)

RTX 4090 3.44 ±0.00 (130x) 0.76 ±0.00 (0.67x) 0.84 ±0.00 (1.39x)

It is worth noting that the parallel SVM features the greatest speed-up. In fact, all

GPU devices have obtained valid results for this algorithm: a speed-up of 32x, 11x, and

130x turned out for the GeForce RTX 2080, Tesla A16, and GeForce RTX 4090, respectively.

This conﬁrms that parallelizing SVM is an appropriate solution for the acceleration of skin

lesions’ detection.

Parallel XGB has outperformed its serial counterpart when using both the GeForce

RTX 2080 and GeForce RTX 4090 GPUs, achieving a speed-up of 1.19x for the ﬁrst and

1.39x for the second device conversely. The cluster has not accelerated the serial version,

its average execution time being 1.17 s, whereas 1.43 s is the average execution time of the

parallelized version.

Finally, RF is the only algorithm that has not shown improvements; however, some

observations should be made: the intrinsic nature of RF did not allow the tree structure

to be parallelized since it is based on if–else sequences. Hence, this algorithm is not fully

parallelizable. Moreover, the number of decision trees used in this work was 425, which is

not as big as it should be to adequately exploit the beneﬁts of parallel computing.

NVIDIA GeForce RTX 4090 GPU resulted as the most performant among the GPUs,

due to its high number of CUDA cores (16,384) and to its latest-generation architecture, the

Ada Lovelace.

As already said, the university cluster achieved the worst performance for all algo-

rithms, probably because the code developed for the parallel inference has not exploited

the full computational power of the cluster. Indeed, the cluster is composed of ﬁve nodes of

three Tesla A16 GPUs, while our code employed the use of one out of four chips equipped

on each single GPU.

Sensors 2024,24, 1399 13 of 16

4. Discussion

To compare the results of our methods with the state of the art, the works proposed

in [

] can be considered. The authors of [

] have developed a hybrid classiﬁcation

system based on K-means, SAM, and SVM using the same dataset here described. In partic-

ular, they implemented several parallel versions of their system using an NVIDIA GeForce

RTX 2080 GPU (the same employed in this work) and an NVIDIA Tesla K40 GPU. The best

performance was achieved through the version performing the K-means in CUDA using

the NVIDIA GeForce RTX 2080 GPU and the SVM in OpenMP. To evaluate the performance,

the authors considered nine images and measured the classiﬁcation times of each image

as the mean of ﬁve executions. They reported a diagram showing that the classiﬁcation

times of their system were approximately 1 s. However, the SVM implementation in [

]

had to classify only a limited number of pixels of the images; namely, the pixels clustered

as pigmented skin lesions from the K-means stage. In contrast, this work’s SVM classiﬁed

all the 2500 pixels of the images, discriminating between

ﬁve different

classes. Indeed, the

computational complexity of the SVM adopted in [

] is lower than the one described in this

work. Not only the number of elements to classify is lower, but also the hyperparameters

are different, since a higher number of support vectors is needed by the SVM adopted in

this paper.

In [

], a parallel XGB version was developed using an NVIDIA Quadro P4000 to

classify the Pavia University (PU), GRSS-DFC2013 Houston (GH13), and GRSS-DFC2018

Houston (GH18) datasets. All three datasets are based on a single HSI image. The PU image

features a dimension of 610

340 pixels and 103 channels, while the GH13 image is a cube

of dimensions 349

1905

144. Finally, the GH18 Houston image has

4172 ×1202 pixels

and 48 bands. The times taken to classify these images were 6.67 s, 31.05 s, and 347.30 s

for the PU, GH13, and GH18 datasets, respectively. Given the big difference between

the number of samples and features considered in the datasets of [

] and the one of

this work, a quasi-linear relation between the images size and the processing times is

observed. Indeed, the structure of XGB is poorly parallelizable, and the performances

are strictly related to the number of features and trees. In the proposed work, since the

data dimensionality is lower than that of [

], the number of features and trees is small.

Moreover, as described in Section 2.5.3, the parallelization is based on assigning each tree

to a block, whilst instead, [33] uses a standard approach.

To the best of the authors’ knowledge, no prior parallel version of RF has been devel-

oped in the HSI ﬁeld.

Table 2summarizes the prediction times of this work and the results obtained in

the literature.

Table 2. Comparison between classiﬁcation times of our work with the state of the art.

K-Means +

SAM + SVM [16]

SVM

(This Work)

XGB PU

[33]

XGB GH13

[33]

XGB GH18

[33]

XGB

(This Work)

Time [s] ~1 3.44 6.67 31.05 347.30 0.84

# pixels From 300 to

1700 2500 207,400 664,845 5,014,744 2500

# channels 116 116 103 144 48 116

5. Conclusions

In this work, a serial and a parallel inference of the SVM, RF, and XGB algorithms to

classify a dataset of HS skin cancer images have been proposed. The serial inference has

been implemented employing the CPU Intel Core i9-13900K, and to accelerate the serial

classiﬁcation, three different GPUs have been employed: the NVIDIA GeForce RTX 2080,

the NVIDIA Tesla A16, and the NVIDIA GeForce RTX 4090.

The results show that our work can signiﬁcantly accelerate medical diagnosis through

image processing techniques. In fact, the parallel versions of both SVM and XGB lead

to an acceleration very signiﬁcant in the case of the most complex SVM and minor but

Sensors 2024,24, 1399 14 of 16

not neglectable in the case of the less challenging XGB. In any case, this experimentation

conﬁrms the validity of the approach used in [

] and in [

] even in case of a problem

featuring a low parallelizable algorithm applied to a small dataset with a low number of

trees. Again, it is possible to say that hyperspectral image processing can support doctors

in timely detecting skin lesions, planning an opportune therapy, and helping surgeons

during interventions.

Future works will focus on multi-GPU programming to exploit the full computational

power of the cluster, since we only used one out of four GPUs of one NVIDIA Tesla

A16. Furthermore, integrated GPU solutions will be explored, such as the NVIDIA Jetson,

that is a System on Module (SoM) that features small dimensions, high performance, and

embedded CPU, GPU, and memory in a single board. Lastly, datasets with a higher number

of patients will be considered to better validate the proposed approach.

Author Contributions: Conceptualization, B.P. and E.T.; methodology, B.P. and E.M.; software,

B.P.; validation, B.P., E.T. and E.M.; investigation, B.P., E.T., E.M. and F.L.; writing—original draft

preparation, B.P.; writing—review and editing, E.T., E.M. and F.L.; supervision, F.L. All authors have

read and agreed to the published version of the manuscript.

Funding: This research received no external funding.

Institutional Review Board Statement: Not applicable.

Informed Consent Statement: Not applicable.

Data Availability Statement: Data available upon request to the corresponding author.

Conﬂicts of Interest: The authors declare no conﬂict of interest.

References

Ferlay, J.; Colombet, M.; Soerjomataram, I.; Parkin, D.M.; Piñeros, M.; Znaor, A.; Bray, F. Cancer Statistics for the Year 2020: An

Overview. Int. J. Cancer 2021,149, 778–789. [CrossRef] [PubMed]

Abdlaty, R.; Doerwald-Munoz, L.; Farrell, T.J.; Hayward, J.E.; Fang, Q. Hyperspectral Imaging Assessment for Radiotherapy

Induced Skin-Erythema: Pilot Study. Photodiagn. Photodyn. Ther. 2021,33, 102195. [CrossRef] [PubMed]

Scolyer, R.A.; Long, G.V.; Thompson, J.F. Evolving Concepts in Melanoma Classiﬁcation and Their Relevance to Multidisciplinary

Melanoma Patient Care. Mol. Oncol. 2011,5, 124–136. [CrossRef]

Krensel, M.; Petersen, J.; Stephan, B.; Katalinic, A.; Augustin, J. Comparison of Patient Pathways in the Early Detection of Skin

Cancer—A Claims Data Analysis. JDDG J. Der Dtsch. Dermatol. Ges. 2021,19, 389–398. [CrossRef] [PubMed]

Rey-Barroso, L.; Peña-Gutiérrez, S.; Yáñez, C.; Burgos-Fernández, F.J.; Vilaseca, M.; Royo, S. Optical Technologies for the

Improvement of Skin Cancer Diagnosis: A Review. Sensors 2021,21, 252. [CrossRef] [PubMed]

Jiang, S.; Li, H.; Jin, Z. A Visually Interpretable Deep Learning Framework for Histopathological Image-Based Skin Cancer

Diagnosis. IEEE J. Biomed. Health Inform. 2021,25, 1483–1494. [CrossRef] [PubMed]

Dildar, M.; Akram, S.; Irfan, M.; Khan, H.U.; Ramzan, M.; Mahmood, A.R.; Alsaiari, S.A.; Saeed, A.H.M.; Alraddadi, M.O.;

Mahnashi, M.H. Skin Cancer Detection: A Review Using Deep Learning Techniques. Int. J. Environ. Res. Public. Health 2021,

18, 5479. [CrossRef]

8. Abdlaty, R.; Fang, Q. Skin Erythema Assessment Techniques. Clin. Dermatol. 2021,39, 591–604. [CrossRef]

Kamruzzaman, M.; Sun, D.-W. Introduction to Hyperspectral Imaging Technology. In Computer Vision Technology for Food Quality

Evaluation; Elsevier: Amsterdam, The Netherlands, 2016; pp. 111–139.

10.

Meyer, J.M.; Kokaly, R.F.; Holley, E. Hyperspectral Remote Sensing of White Mica: A Review of Imaging and Point-Based

Spectrometer Studies for Mineral Resources, with Spectrometer Design Considerations. Remote Sens. Environ. 2022,275, 113000.

[CrossRef]

11.

Johansen, T.H.; Møllersen, K.; Ortega, S.; Fabelo, H.; Garcia, A.; Callico, G.M.; Godtliebsen, F. Recent Advances in Hyperspectral

Imaging for Melanoma Detection. WIREs Comput. Stat. 2020,12, e1456. [CrossRef]

12.

Zhang, Q.; Bai, C.; Liu, Z.; Yang, L.T.; Yu, H.; Zhao, J.; Yuan, H. A GPU-Based Residual Network for Medical Image Classiﬁcation

in Smart Medicine. Inf. Sci. 2020,536, 91–100. [CrossRef]

13.

Pandey, M.; Fernandez, M.; Gentile, F.; Isayev, O.; Tropsha, A.; Stern, A.C.; Cherkasov, A. The Transformational Role of GPU

Computing and Deep Learning in Drug Discovery. Nat. Mach. Intell. 2022,4, 211–221. [CrossRef]

14.

Wang, H.; Peng, H.; Chang, Y.; Liang, D. A Survey of GPU-Based Acceleration Techniques in MRI Reconstructions. Quant.

Imaging Med. Surg. 2018,8, 196–208. [CrossRef] [PubMed]

15.

Kalaiselvi, T.; Sriramakrishnan, P.; Somasundaram, K. Survey of Using GPU CUDA Programming Model in Medical Image

Analysis. Inform. Med. Unlocked 2017,9, 133–144. [CrossRef]

Sensors 2024,24, 1399 15 of 16

16.

Torti, E.; Leon, R.; La Salvia, M.; Florimbi, G.; Martinez-Vega, B.; Fabelo, H.; Ortega, S.; Callicó, G.M.; Leporati, F. Parallel

Classiﬁcation Pipelines for Skin Cancer Detection Exploiting Hyperspectral Imaging on Hybrid Systems. Electronics 2020,9, 1503.

[CrossRef]

17.

Shi, L.; Liu, W.; Zhang, H.; Xie, Y.; Wang, D. A Survey of GPU-Based Medical Image Computing Techniques. Quant. Imaging Med.

Surg. 2012,2, 188–206. [CrossRef]

18.

Jimenez, L.I.; Sanchez, S.; Martan, G.; Plaza, J.; Plaza, A.J. Parallel Implementation of Spatial–Spectral Endmember Extraction on

Graphic Processing Units. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2017,10, 1247–1255. [CrossRef]

19.

Marenzi, E.; Torti, E.; Leporati, F.; Quevedo, E.; Callicò, G.M. Block Matching Super-Resolution Parallel GPU Implementation for

Computational Imaging. IEEE Trans. Consum. Electron. 2017,63, 368–376. [CrossRef]

20.

Cong, J.; Fang, Z.; Lo, M.; Wang, H.; Xu, J.; Zhang, S. Understanding Performance Differences of FPGAs and GPUs. In Proceedings

of the 2018 IEEE 26th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM), Boulder,

CO, UAS, 29 April–1 May 2018; IEEE: New York, NY, USA, 2018; pp. 93–96.

21.

Leon, R.; Martinez-Vega, B.; Fabelo, H.; Ortega, S.; Melian, V.; Castaño, I.; Carretero, G.; Almeida, P.; Garcia, A.; Quevedo, E.;

et al. Non-Invasive Skin Cancer Diagnosis Using Hyperspectral Imaging for In-Situ Clinical Support. J. Clin. Med. 2020,9, 1662.

[CrossRef]

22.

Tian, C.; Xu, Y.; Zhang, Y.; Zhang, Z.; An, H.; Liu, Y.; Chen, Y.; Zhao, H.; Zhang, Z.; Zhao, Q.; et al. Combining Hyperspectral

Imaging Techniques with Deep Learning to Aid in Early Pathological Diagnosis of Melanoma. Photodiagn. Photodyn. Ther. 2023,

43, 103708. [CrossRef]

23.

Kazianka, H.; Leitner, R.; Pilz, J. Segmentation and Classiﬁcation of Hyper-Spectral Skin Data. In Data Analysis, Machine Learning

and Applications; Springer: Berlin/Heidelberg, Germany, 2008; pp. 245–252.

24.

Vinokurov, V.; Khristoforova, Y.; Myakinin, O.; Bratchenko, I.; Moryatov, A.; Machikhin, A.; Zakharov, V. Neural Network

Classiﬁer for Hyperspectral Images of Skin Pathologies. J. Phys. Conf. Ser. 2021,2127, 012026. [CrossRef]

25.

Pardo, A.; Gutiérrez-Gutiérrez, J.A.; Lihacova, I.; López-Higuera, J.M.; Conde, O.M. On the Spectral Signature of Melanoma: A

Non-Parametric Classiﬁcation Framework for Cancer Detection in Hyperspectral Imaging of Melanocytic Lesions. Biomed. Opt.

Express 2018,9, 6283. [CrossRef] [PubMed]

26.

Räsänen, J.; Salmivuori, M.; Pölönen, I.; Grönroos, M.; Neittaanmäki, N. Hyperspectral Imaging Reveals Spectral Differences and

Can Distinguish Malignant Melanoma from Pigmented Basal Cell Carcinomas: A Pilot Study. Acta Derm. Venereol. 2021,101,

adv00405. [CrossRef] [PubMed]

27.

Liu, L.; Qi, M.; Li, Y.; Liu, Y.; Liu, X.; Zhang, Z.; Qu, J. Staging of Skin Cancer Based on Hyperspectral Microscopic Imaging and

Machine Learning. Biosensors 2022,12, 790. [CrossRef] [PubMed]

28.

Qi, M.; Liu, Y.; Li, R.; Liu, L.; Zhang, Z. Classiﬁcation of Skin Cancer Based on Hyperspectral Microscopic Imaging and Machine

Learning. In Proceedings of the SPIE-CLP Conference on Advanced Photonics 2022, Virtual, 28 March 2023; Liu, X., Yuan, X.,

Zayats, A., Eds.; SPIE: Washington, DC, USA, 2023; p. 16.

29.

Huang, H.-Y.; Hsiao, Y.-P.; Mukundan, A.; Tsao, Y.-M.; Chang, W.-Y.; Wang, H.-C. Classiﬁcation of Skin Cancer Using Novel

Hyperspectral Imaging Engineering via YOLOv5. J. Clin. Med. 2023,12, 1134. [CrossRef] [PubMed]

30.

Fabelo, H.; Melian, V.; Martinez, B.; Beltran, P.; Ortega, S.; Marrero, M.; Callico, G.M.; Sarmiento, R.; Castano, I.; Carretero, G.;

et al. Dermatologic Hyperspectral Imaging System for Skin Cancer Diagnosis Assistance. In Proceedings of the 2019 XXXIV

Conference on Design of Circuits and Integrated Systems (DCIS), Bilbao, Spain, 20–22 November 2019; IEEE: New York, NY,

USA, 2019; pp. 1–6.

31.

Petracchi, B.; Gazzoni, M.; Torti, E.; Marenzi, E.; Leporati, F. Machine Learning-Based Classiﬁcation of Skin Cancer Hyperspectral

Images. Procedia Comput. Sci. 2023,225, 2856–2865. [CrossRef]

32.

La Salvia, M.; Torti, E.; Leon, R.; Fabelo, H.; Ortega, S.; Balea-Fernandez, F.; Martinez-Vega, B.; Castaño, I.; Almeida, P.; Carretero,

G.; et al. Neural Networks-Based On-Site Dermatologic Diagnosis through Hyperspectral Epidermal Images. Sensors 2022,

mboxemph22, 7139. [CrossRef] [PubMed]

33.

Samat, A.; Li, E.; Du, P.; Liu, S.; Xia, J. GPU-Accelerated CatBoost-Forest for Hyperspectral Image Classiﬁcation Via Parallelized

MRMR Ensemble Subspace Feature Selection. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021,14, 3200–3214. [CrossRef]

34.

Camps-Valls, G.; Bruzzone, L. Kernel-Based Methods for Hyperspectral Image Classiﬁcation. IEEE Trans. Geosci. Remote Sens.

2005,43, 1351–1362. [CrossRef]

35.

Florimbi, G.; Fabelo, H.; Torti, E.; Ortega, S.; Marrero-Martin, M.; Callico, G.M.; Danese, G.; Leporati, F. Towards Real-Time

Computing of Intraoperative Hyperspectral Imaging for Brain Cancer Detection Using Multi-GPU Platforms. IEEE Access 2020,8,

8485–8501. [CrossRef]

36.

Wu, D.; Sun, D.-W. Advanced Applications of Hyperspectral Imaging Technology for Food Quality and Safety Analysis and

Assessment: A Review—Part I: Fundamentals. Innov. Food Sci. Emerg. Technol. 2013,19, 1–14. [CrossRef]

37.

Adão, T.; Hruška, J.; Pádua, L.; Bessa, J.; Peres, E.; Morais, R.; Sousa, J. Hyperspectral Imaging: A Review on UAV-Based Sensors,

Data Processing and Applications for Agriculture and Forestry. Remote Sens. 2017,9, 1110. [CrossRef]

38.

Sousa, J.J.; Toscano, P.; Matese, A.; Di Gennaro, S.F.; Berton, A.; Gatti, M.; Poni, S.; Pádua, L.; Hruška, J.; Morais, R.; et al.

UAV-Based Hyperspectral Monitoring Using Push-Broom and Snapshot Sensors: A Multisite Assessment for Precision Viticulture

Applications. Sensors 2022,22, 6574. [CrossRef] [PubMed]

Sensors 2024,24, 1399 16 of 16

39.

Abdlaty, R.; Abbass, M.A.; Awadallah, A.M. High Precision Monitoring of Radiofrequency Ablation for Liver Using Hyperspectral

Imaging. Ann. Biomed. Eng. 2021,49, 2430–2440. [CrossRef] [PubMed]

40.

Bassler, M.C.; Stefanakis, M.; Sequeira, I.; Ostertag, E.; Wagner, A.; Bartsch, J.W.; Roeßler, M.; Mandic, R.; Reddmann, E.F.; Lorenz,

A.; et al. Comparison of Whiskbroom and Pushbroom Darkﬁeld Elastic Light Scattering Spectroscopic Imaging for Head and

Neck Cancer Identiﬁcation in a Mouse Model. Anal. Bioanal. Chem. 2021,413, 7363–7383. [CrossRef]

41.

Wahabzada, M.; Besser, M.; Khosravani, M.; Kuska, M.T.; Kersting, K.; Mahlein, A.-K.; Stürmer, E. Monitoring Wound Healing in

a 3D Wound Model by Hyperspectral Imaging and Efﬁcient Clustering. PLoS ONE 2017,12, e0186425. [CrossRef] [PubMed]

42.

He, Q.; Wang, R.K. Analysis of Skin Morphological Features and Real-Time Monitoring Using Snapshot Hyperspectral Imaging.

Biomed. Opt. Express 2019,10, 5625. [CrossRef] [PubMed]

43.

La Salvia, M.; Torti, E.; Gazzoni, M.; Marenzi, E.; Leon, R.; Ortega, S.; Fabelo, H.; Callico, G.M.; Leporati, F. Attention-Based Skin

Cancer Classiﬁcation Through Hyperspectral Imaging. In Proceedings of the 2022 25th Euromicro Conference on Digital System

Design (DSD), Maspalomas, Spain, 31 August–2 September 2022; IEEE: New York, NY, USA, 2022; pp. 871–876.

44.

Chandra, M.A.; Bedi, S.S. Survey on SVM and Their Application in Image Classiﬁcation. Int. J. Inf. Technol. 2021,13, 1–11.

[CrossRef]

45.

Brown, M.; Lewis, H.G.; Gunn, S.R. Linear Spectral Mixture Models and Support Vector Machines for Remote Sensing. IEEE

Trans. Geosci. Remote Sens. 2000,38, 2346–2360. [CrossRef]

46.

Mountrakis, G.; Im, J.; Ogole, C. Support Vector Machines in Remote Sensing: A Review. ISPRS J. Photogramm. Remote Sens. 2011,

66, 247–259. [CrossRef]

47. Breiman, L. Random Forests. Mach. Learn. 2001,45, 5–32. [CrossRef]

48. Zhang, H.; Si, S.; Hsieh, C.-J. GPU-Acceleration for Large-Scale Tree Boosting. arXiv 2017, arXiv:1706.08359.

49. Mitchell, R.; Frank, E. Accelerating the XGBoost Algorithm Using GPU Computing. PeerJ Comput. Sci. 2017,3, e127. [CrossRef]

50.

Chen, T.; Guestrin, C. XGBoost. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery

and Data Mining, San Francisco, CA, USA, 13–17 August 2016; ACM: New York, NY, USA, 2016; pp. 785–794.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual

author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to

people or property resulting from any ideas, methods, instructions or products referred to in the content.

Development of a Model to Classify Skin Diseases using Stacking Ensemble Machine Learning Techniques

Article

Full-text available

May 2024

Skin diseases are highly prevalent and transmissible. It has been one of the major health problems that most people face. The diseases are dangerous to the skin and tend to spread over time. A patient can be cured of these skin diseases if they are detected on time and treated early. However, it is difficult to identify these diseases and provide the right medications. This study's research objectives involve developing an ensemble machine learning based model for classifying Erythemato-Squamous Diseases (ESD). The ensemble techniques combine five different classifiers, Naïve Bayes, Support Vector Classifier, Decision Tree, Random Forest, and Gradient Boosting, by merging their predictions and utilizing them as input features for a meta-classifier during training. We tested and validated the ensemble model using the dataset from the University of California, Irvine (UCI) repository to assess its effectiveness. The Individual classifiers achieved different accuracies: Naïve Bayes (85.41%), Support Vector Machine (98.61%), Random Forest (97.91%), Decision Tree (95.13%), Gradient Boosting (95.83%). The stacking method yielded a higher accuracy of 99.30% and a precision of 1.00, recall of 0.96, F1 score of 0.97, and specificity of 1.00 compared to the base models. The study confirms the effectiveness of ensemble learning techniques in classifying ESD.

Classification of skin cancer based on hyperspectral microscopic imaging and machine learning

Conference Paper

Full-text available

Mar 2023

Classification of Skin Cancer Using Novel Hyperspectral Imaging Engineering via YOLOv5

Article

Full-text available

Feb 2023

Many studies have recently used several deep learning methods for detecting skin cancer. However, hyperspectral imaging (HSI) is a noninvasive optics system that can obtain wavelength information on the location of skin cancer lesions and requires further investigation. Hyperspectral technology can capture hundreds of narrow bands of the electromagnetic spectrum both within and outside the visible wavelength range as well as bands that enhance the distinction of image features. The dataset from the ISIC library was used in this study to detect and classify skin cancer on the basis of basal cell carcinoma (BCC), squamous cell carcinoma (SCC), and seborrheic keratosis (SK). The dataset was divided into training and test sets, and you only look once (YOLO) version 5 was applied to train the model. The model performance was judged according to the generated confusion matrix and five indicating parameters, including precision, recall, specificity, accuracy, and the F1-score of the trained model. Two models, namely, hyperspectral narrowband image (HSI-NBI) and RGB classification, were built and then compared in this study to understand the performance of HSI with the RGB model. Experimental results showed that the HSI model can learn the SCC feature better than the original RGB image because the feature is more prominent or the model is not captured in other categories. The recall rate of the RGB and HSI models were 0.722 to 0.794, respectively, thereby indicating an overall increase of 7.5% when using the HSI model.

Neural Networks-Based On-Site Dermatologic Diagnosis through Hyperspectral Epidermal Images

Article

Full-text available

Sep 2022
SENSORS-BASEL

Cancer originates from the uncontrolled growth of healthy cells into a mass. Chromophores, such as hemoglobin and melanin, characterize skin spectral properties, allowing the classification of lesions into different etiologies. Hyperspectral imaging systems gather skin-reflected and transmitted light into several wavelength ranges of the electromagnetic spectrum, enabling potential skin-lesion differentiation through machine learning algorithms. Challenged by data availability and tiny inter and intra-tumoral variability, here we introduce a pipeline based on deep neural networks to diagnose hyperspectral skin cancer images, targeting a handheld device equipped with a low-power graphical processing unit for routine clinical testing. Enhanced by data augmentation, transfer learning, and hyperparameter tuning, the proposed architectures aim to meet and improve the well-known dermatologist-level detection performances concerning both benign-malignant and multiclass classification tasks, being able to diagnose hyperspectral data considering real-time constraints. Experiments show 87% sensitivity and 88% specificity for benign-malignant classification and specificity above 80% for the multiclass scenario. AUC measurements suggest classification performance improvement above 90% with adequate thresholding. Concerning binary segmentation, we measured skin DICE and IOU higher than 90%. We estimated 1.21 s, at most, consuming 5 Watts to segment the epidermal lesions with the U-Net++ architecture, meeting the imposed time limit. Hence, we can diagnose hyperspectral epidermal data assuming real-time constraints.

Staging of Skin Cancer Based on Hyperspectral Microscopic Imaging and Machine Learning

Article

Full-text available

Sep 2022

Skin cancer, a common type of cancer, is generally divided into basal cell carcinoma (BCC), squamous cell carcinoma (SCC) and malignant melanoma (MM). The incidence of skin cancer has continued to increase worldwide in recent years. Early detection can greatly reduce its morbidity and mortality. Hyperspectral microscopic imaging (HMI) technology can be used as a powerful tool for skin cancer diagnosis by reflecting the changes in the physical structure and microenvironment of the sample through the differences in the HMI data cube. Based on spectral data, this work studied the staging identification of SCC and the influence of the selected region of interest (ROI) on the staging results. In the SCC staging identification process, the optimal result corresponded to the standard normal variate transformation (SNV) for spectra preprocessing, the partial least squares (PLS) for dimensionality reduction, the hold-out method for dataset partition and the random forest (RF) model for staging identification, with the highest staging accuracy of 0.952 ± 0.014, and a kappa value of 0.928 ± 0.022. By comparing the staging results based on spectral characteristics from the nuclear compartments and peripheral regions, the spectral data of the nuclear compartments were found to contribute more to the accurate staging of SCC.

UAV-Based Hyperspectral Monitoring Using Push-Broom and Snapshot Sensors: A Multisite Assessment for Precision Viticulture Applications

Technical Report

Full-text available

Aug 2022
SENSORS-BASEL

Citation: Sousa, J.J.; Toscano, P.; Matese, A.; Di Gennaro, S.F.; Berton, A.; Gatti, M.; Poni, S.; Pádua, L.; Hruška, J.; Morais, R.; et al. UAV-Based Hyperspectral Monitoring Using Push-Broom and Snapshot Sensors: A Multisite Assessment for Precision Viticulture Applications. Sensors 2022, 22, 6574.

Hyperspectral remote sensing of white mica: A review of imaging and point-based spectrometer studies for mineral resources, with spectrometer design considerations

Article

Full-text available

Jun 2022
REMOTE SENS ENVIRON

Over the past ~30 years, hyperspectral remote sensing of chemical variations in white mica have proven to be useful for ore deposit studies in a range of deposit types. To better understand mineral deposits and to guide spectrometer design, this contrib ution reviews relevant papers from the fields of remote sensing, spectroscopy, and geology that have utilized spectral changes caused by chemical variation in white micas. This contribution reviews spectral studies conducted at the following types of mineral deposits: base metal sulfide, epithermal, porphyry, sedimentary rock hosted gold deposits, orogenic gold, iron oxide copper gold, and unconformity-related uranium. The structure, chemical composition, and spectral features of white micas, in this contribution defined as muscovite, paragonite, celadonite, phengite, illite, and sericite, are given. Reviewed laboratory spectral studies determined that shifts in the position of the white mica 2200 nm combination feature of 1 nm correspond to a change in Aloct content of approximately ±1.05%. Many of the reviewed spectral studies indicated that a shift in the position of the white mica 2200 nm combination feature of 1 nm was geologically significant. A sensitivity analysis of spectrometer characteristics; bandpass, sampling interval, and channel position, is conducted using spectra of 19 white micas with deep absorption features to determine minimum characteristics required to accurately measure a shift in the position of the white mica 2200 nm combination feature. It was determined that a sampling interval < 16.3 nm and bandpass <17.5 nm are needed to achieve a root mean square error (RMSE) of 2 nm, whereas a sampling interval < 8.8 nm and bandpass <9.8 nm are needed to achieve a RMSE of 1 nm. For comparison, commonly used imaging spectrometers HyMap, AVIRIS-Classic, SpecTIR®'s AisaFENIX 1K, and HySpextm SWIR 384 have 2.1, 1.2, 0.96, and 0.95 nm RMSE in determining the position of the 2200 nm white mica combination feature, respectively. An additional sensitivity analysis is conducted to determine the effect of signal to noise ratio (SNR) on the RMSE of the position of the white mica 2200 nm combination feature, using spectra of 18 white micas with deep absorption features. For a spectrometer with sampling interval and bandpass of 1 nm, we estimate that RMSEs of 1 and 1.5 nm are achievable with spectra having a minimum SNR of approximately 246 and 64, respectively. For a spectrometer with sampling interval and bandpass of 5 nm, we estimate that RMSEs of 1 and 1.5 nm are attainable with spectra having a minimum SNR of approximately 431 and 84, respectively. When using a spectrometer with a sampling interval 8.8 nm and a bandpass of 9.8 nm, a RMSE of 1 is only achievable with convolved, noiseless reference spectra. For the 8.8_9.8 nm spectrometer, spectra with SNR of 250 and 100 result in RMSE of 1.1 and 1.3, respectively. Therefore, fine spectral resolution characteristics achieve RMSEs better than 1 nm for high SNR spectra while spectrometers with coarse spectral resolution have larger RMSE, perform well with noisy data, and are useful for white mica studies if RMSE of 1.1 to 1.5 nm is acceptable.

The transformational role of GPU computing and deep learning in drug discovery

Article

Full-text available

Mar 2022

Deep learning has disrupted nearly every field of research, including those of direct importance to drug discovery, such as medicinal chemistry and pharmacology. This revolution has largely been attributed to the unprecedented advances in highly parallelizable graphics processing units (GPUs) and the development of GPU-enabled algorithms. In this Review, we present a comprehensive overview of historical trends and recent advances in GPU algorithms and discuss their immediate impact on the discovery of new drugs and drug targets. We also cover the state-of-the-art of deep learning architectures that have found practical applications in both early drug discovery and consequent hit-to-lead optimization stages, including the acceleration of molecular docking, the evaluation of off-target effects and the prediction of pharmacological properties. We conclude by discussing the impacts of GPU acceleration and deep learning models on the global democratization of the field of drug discovery that may lead to efficient exploration of the ever-expanding chemical universe to accelerate the discovery of novel medicines. GPUs, which are highly parallel computer processing units, were originally designed for graphics applications, but they have played an important role in accelerating the development of deep learning methods. In this Review, Pandey and colleagues summarize how GPUs have advanced machine learning in the field of drug discovery.

Machine Learning-Based Classification of Skin Cancer Hyperspectral Images

Article

Jan 2023

Combining hyperspectral imaging techniques with deep learning to aid in early pathological diagnosis of melanoma

Article

Jul 2023

Background: Cutaneous melanoma, an exceedingly aggressive form of skin cancer, holds the top rank in both malignancy and mortality among skin cancers. In early stages, distinguishing malignant melanomas from benign pigmented nevi pathologically becomes a significant challenge due to their indistinguishable traits. Traditional skin histological examination techniques, largely reliant on light microscopic imagery, offer constrained information and yield low-contrast results, underscoring the necessity for swift and effective early diagnostic methodologies. As a non-contact, non-ionizing, and label-free imaging tool, hyperspectral imaging offers potential in assisting pathologists with identification procedures sans contrast agents. Methods: This investigation leverages hyperspectral cameras to ascertain the optical properties and to capture the spectral features of malignant melanoma and pigmented nevus tissues, intending to facilitate early pathological diagnostic applications. We further enhance the diagnostic process by integrating transfer learning with deep convolutional networks to classify melanomas and pigmented nevi in hyperspectral pathology images. The study encompasses pathological sections from 50 melanoma and 50 pigmented nevus patients. To accurately represent the spectral variances between different tissues, we employed reflectance calibration, highlighting that the most distinctive spectral differences emerged within the 500-675 nm band range. Results: The classification accuracy of pigmented tumors and pigmented nevi was 89% for one-dimensional sample data and 98% for two-dimensional sample data. Conclusions: Our findings have the potential to expedite pathological diagnoses, enhance diagnostic precision, and offer novel research perspectives in differentiating melanoma and nevus.

Attention-based Skin Cancer Classification Through Hyperspectral Imaging

Conference Paper

Aug 2022

Acceleration of Hyperspectral Skin Cancer Image Classification through Parallel Machine-Learning Methods

Abstract and Figures

Recommended publications

Machine Learning-Based Classification of Skin Cancer Hyperspectral Images

An Attention-Based Parallel Algorithm for Hyperspectral Skin Cancer Classification on Low-Power GPUs

Edge and cloud computing approaches in the early diagnosis of skin cancer with attention-based visio...

Attention-based Skin Cancer Classification Through Hyperspectral Imaging