A Deep Learning-Based Classification Framework for Annotated Histopathology Lung Cancer Images



Esraa A.-R. Hamed1, Mohammed A.-M. Salem2, Nagwa L. Badr1, and Mohamed F. Tolba1
1Faculty of Computer and Information Sciences, Ain Shams University, Cairo, Egypt
2Media Engineering and Technology, GUC, Cairo, Egypt
1 Introduction
Cancer is a leading cause of mortality worldwide, according to the World Health
Organization (WHO). Lung cancer is among the most frequently diagnosed cancers (11.4%
of all cases) and is the leading cause of cancer mortality (18.0% of all cancer
deaths) [1]. Globally, the incidence of malignant tumors is rising, which may be linked
to population growth. Malignancy can affect any age group depending on the histological
type; however, it is typically found in older adults between the ages of 50 and 60 [2].
The lungs, part of the respiratory system, are essential for supplying oxygen to the
body. Abnormal cell proliferation in the lung can lead to cancer, or pulmonary
carcinoma. Poor environmental conditions and an unhealthy lifestyle are contributing
causes [3]. Symptoms of lung cancer often become apparent only after the tumor is
sufficiently large or has spread to other areas. The success rate of therapy increases
with earlier diagnosis [4], whereas the likelihood of recovery decreases once the
cancer has spread to other organs. Non-small cell lung cancer (NSCLC) accounts for
81-85% of lung cancer cases. Squamous cell carcinoma, adenocarcinoma, and large cell
carcinoma are the major NSCLC subtypes; they arise from different types of lung cells
but are grouped together as NSCLC [5].
The clinicopathological and genetic features that distinguish squamous cell carcinoma
(SCC) of the lung have changed significantly over time. Once the most common subtype of
non-small-cell lung malignancy, these neoplasms were thought to be central tumors with
great molecular complexity and no targetable genetic alterations [6]. SCC often starts
to develop in the cells lining the bronchi. Over time it can spread by infiltrating
neighboring lymph nodes and organs and by metastasizing (moving to other regions of the
body through the blood). It is closely associated with a history of smoking and carries
a considerable risk of death. Other risk factors for SCC include age, family history,
and exposure to secondhand smoke [7]. Because the histological type, molecular profile,
and stage of the disease all affect how it is treated, identifying lung cancer
histology is essential, and analyzing histopathological images of the disease is a key
part of this. Manual analysis of histopathology images, however, is time-consuming and
subjective [8].
Artificial intelligence (AI) is the simulation of human intelligence in computer
software, allowing people to interact with machines in a way that resembles human
communication. AI is used across many computer vision domains and has recently emerged
as one of the most significant sciences of the twenty-first century. In several
disciplines, machine learning [18,19,20] and deep learning [21,22] achieve the highest
levels of accuracy. Deep learning techniques, especially Convolutional Neural Networks
(CNNs), are increasingly used in healthcare and have a significant influence on all
parts of primary care. In medical imaging, CNNs can be used to identify and classify
disease at an early stage, allowing for timely treatment and easier recovery for the
patient [7].
This paper aims to evaluate and test a proposed Convolutional Neural Network (CNN)
architecture for the classification of lung cancer. The remainder of the paper is
organized as follows. Section 2 reviews previous research on the identification and
classification of lung cancer. Section 3 briefly introduces the dataset used. Section 4
describes the proposed deep learning approach and its CNN architecture. Section 5
presents the experimental results. Section 6 concludes the paper and makes
recommendations for further research.
2 Related Work
Among machine learning techniques, deep neural networks have shown improved results in
the analysis of medical images. Several CNN-based approaches have been applied to the
classification of lung cancer images to increase the accuracy of detection and
classification.
Mumtaz et al. [9] proposed a multi-input capsule network as a diagnostic model for
abnormal cells in lung and colon cancer. The capsule network employed two blocks: a
convolutional layer block (CLB) and a separable convolutional layer block (Separable
CLB). The CLB takes pathological images as input, whereas the Separable CLB takes
histopathological images. On histopathological scans, the model achieved 99.58%
accuracy in identifying abnormalities in the colon and lungs.
Gessert et al. [10] classified microscopic images of colon cancer using transfer
learning-based CNN models, including Inception, VGG, and DenseNet. The DenseNet model
performed best, with a classification accuracy of 91.2%. Kwabena et al. [11] proposed
DHSCapsNet to assess histological images of lung and colon cancer. The network combines
encoder features with DHSCaps. The encoder features consist of information-rich
convolutional layer features, and HSquash extracts data from several backgrounds. The
network achieved 99.23%, outperforming a standard CapsNet (85.55%).
Vuong et al. [12] proposed a multi-purpose learning strategy to evaluate digital
pathology images. They used a pathology image dataset divided into four classes,
trained a DenseNet-121 model on it, and set the model input to 800×800 pixels. They
reported a classification accuracy of 85.91%. Sanidhya et al. [13] proposed a
pre-trained CNN diagnostic network for lung and colon cancer. Histological slides were
analyzed using a shallow CNN architecture. The network obtained 96% and 97% accuracy
for the diagnosis of colon and lung malignancies, respectively.
Mesut et al. [14] trained the DarkNet-19 model from scratch on the lung and colon
malignancy dataset. The Equilibrium method was used to identify the inefficient
features, which were then separated from the efficient ones. The efficient features
were passed to an SVM for classification. The overall accuracy was 99.69%. Shahid et
al. [15] presented an approach for precisely identifying lung and colon cancer cells.
By altering the four fundamental layers of AlexNet and then training it on a dataset,
they attained an accuracy of 89%.
The authors in [16] classified histological images of colon cancer using a deep
learning methodology. The dataset contains four classes. A cell identification
technique was applied to each image to obtain cell patches, and segmentation was used
to separate the images into discrete sizes. Classification was then carried out by a
CNN model on the cell patches. Their accuracies ranged from 90% to 96.9%.
By comparison, the model proposed in this paper achieved an accuracy of 99.9% in
distinguishing malignant from benign lung lesions, the highest accuracy among the
state-of-the-art models discussed above.
3 Dataset Used
The LC25000 Lung and Colon Histopathological Image collection contains 5000 images for
each of its lung and colon tissue classes. The dataset has been validated and complies
with HIPAA [17]. Only 750 original lung images were gathered in total, 250 for each
lung category, with dimensions of 1024 x 768 pixels. These images were reduced to
768x768 pixels using Python and then augmented using the Augmentor software package
[17]. As a result, the expanded dataset has 5000 images for each class.
Augmentation is accomplished by rotating left and right and by flipping horizontally
and vertically [17]. Table 1 lists the class names of the LC25000 dataset with a sample
image for each category.
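The flip operations described above can be illustrated on a toy image. The following is a minimal sketch in plain Python, assuming an image is stored as a list of pixel rows. The function names are hypothetical, and this is not the Augmentor code used to build LC25000. Rotation by an arbitrary angle needs interpolation and is left out; a 90-degree rotation is included only as a simple stand-in.

```python
def flip_horizontal(image):
    """Mirror each row left-to-right."""
    return [row[::-1] for row in image]


def flip_vertical(image):
    """Reverse the order of the rows (top-to-bottom mirror)."""
    return [list(row) for row in image[::-1]]


def rotate_90_clockwise(image):
    """Rotate by 90 degrees clockwise (stand-in for a general rotation)."""
    return [list(col) for col in zip(*image[::-1])]


# Toy 2x3 "image"; real inputs would be 768x768 RGB arrays.
toy = [[1, 2, 3],
       [4, 5, 6]]

# Growth factor implied by the text: 250 originals become 5000 images per class.
augmentation_factor = 5000 // 250
```

Each original image therefore has to yield 20 augmented variants on average to reach 5000 images per class.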
Table 1. Description of the LC25000 dataset.
Class name              Number of Images   Sample Image
Lung Benign             5000               [image]
Lung Adenocarcinoma     5000               [image]
Lung Squamous Cell      5000               [image]
Colon Benign            5000               [image]
Colon Adenocarcinoma    5000               [image]
Lung squamous cell carcinoma (SCC), which progresses through keratinization, is
distinguished by the presence of polygonal cells. In its early stages the disease has
no symptoms, so it is frequently discovered only after it has spread to other organs.
Early identification is therefore crucial to improving treatment outcomes: if the
diagnosis is delayed, the patient's likelihood of surviving for five years is less than
20%. For these reasons, 10,000 histopathological images representing two types of lung
tissue (lung squamous cell carcinoma and benign lung tissue) were chosen from the
LC25000 collection.
4 Proposed Deep Learning Approach
This section presents the proposed method for classifying lung cancer histology images
from the LC25000 dataset. The dataset was randomly split into training and testing
sets, and the proposed CNN model classified the images as benign or malignant (squamous
cell carcinoma) with high accuracy. Fig. 1 illustrates the proposed system, which
consists of two main steps: feature extraction and classification. As a preprocessing
step, each image is resized to 224x224 pixels in the RGB color space before training.
During the training phase, the proposed CNN model is fitted to the training data, and
the resulting model parameters are then used to classify the testing data. The output
of the model is a benign or malignant label for each lung histology image.
Fig. 1. Proposed Architecture System.
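The random split into training and testing sets can be sketched as follows. This is a minimal illustration in plain Python. The helper `split_dataset`, the fixed seed, and the file names are assumptions made for the example, not details given in the paper.

```python
import random


def split_dataset(items, train_fraction, seed=0):
    """Shuffle a copy of items and split it into (train, test) lists."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    n_train = round(len(shuffled) * train_fraction)
    return shuffled[:n_train], shuffled[n_train:]


# Hypothetical file names standing in for the 10,000 selected lung images.
image_ids = [f"lung_{i:05d}.jpeg" for i in range(10_000)]

train_80, test_20 = split_dataset(image_ids, 0.8)  # 80% train, 20% test
train_40, test_60 = split_dataset(image_ids, 0.4)  # 40% train, 60% test
```

Fixing the seed makes a split repeatable, which helps when several models are compared on the same partition.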
The structure of the CNN used in the study is depicted in Fig. 2. The model includes
four convolutional layers (CL) with 32, 64, 128, and 256 filters, respectively. The
first CL uses an 11x11 kernel, while the second, third, and fourth use 3x3 kernels.
Each convolutional layer is followed by a max-pooling layer with a 2x2 kernel.
[Fig. 1 block labels: Training Images; Testing Images; Image Pre-processing; Convolutional Neural Network (CNN); Classification: Benign or Malignant]
After the final CL, a Flatten layer generates the feature vector for the Fully
Connected (FC) layers. There are three FC layers: the first two have 1024 and 512
neurons, respectively, and the last has two neurons, one for each of the two classes.
ReLU is applied as the activation function for each convolutional layer, and Softmax is
applied at the output layer. Additionally, a Dropout layer with a rate of 0.4 is used.
Fig. 2. The Proposed CNN Architecture.
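The layer sizes described above determine the feature-map shapes and the number of trainable parameters. The sketch below works through that arithmetic in plain Python. The stride of 1, the absence of padding, and the pooling stride of 2 are assumptions, since the text does not state them, and the helper names are hypothetical.

```python
def conv_out(size, kernel, stride=1, padding=0):
    """Spatial output size of a convolution or pooling window."""
    return (size + 2 * padding - kernel) // stride + 1


def conv_params(kernel, c_in, c_out):
    """Weights plus biases of a square-kernel convolution."""
    return (kernel * kernel * c_in + 1) * c_out


def dense_params(n_in, n_out):
    """Weights plus biases of a fully connected layer."""
    return (n_in + 1) * n_out


# Layer sizes from the text; stride 1, no padding, and 2x2 pooling with
# stride 2 are ASSUMPTIONS, because the text does not specify them.
size, channels = 224, 3
conv_layers = [(11, 32), (3, 64), (3, 128), (3, 256)]  # (kernel, filters)

total = 0
shapes = []
for kernel, filters in conv_layers:
    total += conv_params(kernel, channels, filters)
    size = conv_out(size, kernel)            # convolution
    size = conv_out(size, 2, stride=2)       # 2x2 max pooling
    channels = filters
    shapes.append((size, size, channels))

flat = size * size * channels                # length of the flattened vector
for n_in, n_out in [(flat, 1024), (1024, 512), (512, 2)]:
    total += dense_params(n_in, n_out)

print(shapes, flat, total)
```

Under these assumed settings the first fully connected layer dominates the count, so the total depends strongly on the unstated stride and padding choices and need not match the parameter count reported for the proposed model in Section 5.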
5 The Experimental Work and Results
This research employed the proposed CNN model to classify the LC25000 lung
histopathology images. Two experiments were conducted with different proportions of
data for the training and testing sets. Each model was trained for a maximum of 50
epochs on Google Colab.
In the first experiment, the dataset was split into a training set of 80% and a testing
set of 20%, with a batch size of 150. The proposed model was compared with other deep
learning models, including VGG16, VGG19, AlexNet, Inception ResNet v2, ResNet50,
Inception v3, GoogleNet, and MobileNet. The proposed model had the fewest training
parameters (1 million) and achieved the highest accuracy (100%), with zero test loss,
as indicated in Table 2.
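The two reported metrics, accuracy and test loss, can be computed from softmax outputs as sketched below. The paper does not name its loss function; categorical cross-entropy is assumed here because it is the usual companion of a softmax output layer. The toy probabilities and the label encoding are hypothetical.

```python
import math


def accuracy(probs, labels):
    """Fraction of samples whose highest-probability class matches the label."""
    correct = sum(1 for p, y in zip(probs, labels) if p.index(max(p)) == y)
    return correct / len(labels)


def cross_entropy(probs, labels, eps=1e-12):
    """Mean negative log-probability assigned to the true class."""
    total = sum(math.log(max(p[y], eps)) for p, y in zip(probs, labels))
    return -total / len(labels)


# Toy softmax outputs for four images: [P(benign), P(malignant)].
probs = [[0.9, 0.1], [0.2, 0.8], [0.6, 0.4], [0.3, 0.7]]
labels = [0, 1, 1, 1]  # 0 = benign, 1 = malignant (assumed encoding)

print(accuracy(probs, labels), cross_entropy(probs, labels))
```

A loss of exactly zero requires the model to assign probability 1 to the correct class of every test image, which is a stricter condition than 100% accuracy.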
Table 2. The total training parameters (millions), the accuracy (%), and test loss (%) in the first experiment.
Model                 Total parameters (M)   CNN Accuracy (%)   Test Loss (%)
Proposed CNN model    1                      100                0
In the second experiment, the dataset was divided into 40% for the training set and 60%
for the testing set, again with a batch size of 150. The proposed model was compared
with the same deep learning models: VGG16, VGG19, AlexNet, Inception ResNet v2,
ResNet50, Inception v3, GoogleNet, and MobileNet. The proposed model had the fewest
training parameters (1 million) and achieved the highest accuracy (99.9%), with a low
test loss of 0.3%, as illustrated in Table 3.
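The training workload implied by the two splits can be worked out from the figures given in the text: 10,000 images, a batch size of 150, and at most 50 epochs. The sketch below assumes that the final partial batch of each epoch is kept rather than dropped, which the paper does not specify.

```python
import math


def steps_per_epoch(n_train, batch_size):
    """Number of batches needed to see every training image once."""
    return math.ceil(n_train / batch_size)


total_images = 10_000  # images selected from LC25000 (Section 3)
batch_size = 150
max_epochs = 50

for train_fraction in (0.8, 0.4):
    n_train = round(total_images * train_fraction)
    steps = steps_per_epoch(n_train, batch_size)
    # Columns: training fraction, training images, steps per epoch, maximum steps
    print(train_fraction, n_train, steps, steps * max_epochs)
```

With these numbers the 80% split needs 54 steps per epoch and the 40% split needs 27, so the second experiment performs half as many weight updates within the same epoch budget.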
Table 3. The total training parameters (millions), the accuracy (%), and test loss (%) in the second experiment.
Model                 Total parameters (M)   CNN Accuracy (%)   Test Loss (%)
Proposed CNN model    1                      99.9               0.3
The comparison between the proposed CNN model and the other deep learning models,
including VGG16, VGG19, AlexNet, Inception ResNet v2, ResNet50, Inception v3,
GoogleNet, and MobileNet, is illustrated in Fig. 3. The proposed model achieved the
highest accuracy (99.9%) with the fewest training parameters (one million).
Fig. 3. The comparison between the proposed CNN model and other existing deep
learning models in the second experiment.
6 Conclusions
This study proposes a CNN model for the detection and classification of lung cancer
using the LC25000 lung histopathology image dataset. The model categorizes each image
as either benign or malignant. Two experiments were conducted to validate the model on
the lung histopathology images.
In the first experiment, 80% of the dataset was used for training and 20% for testing.
In the second experiment, the dataset was split into 40% for training and 60% for
testing. Each model was trained for a maximum of 50 epochs on Google Colab. The
proposed model achieved an accuracy of 99.9% to 100% and outperformed the other deep
learning models while using only four convolutional layers, four max-pooling layers,
two hidden fully connected layers, and one million parameters overall.
The experimental results showed that the proposed model achieved the highest accuracy
with the fewest parameters and that reducing the number of training images did not
significantly affect accuracy. The proposed approach was shown to be effective compared
with existing state-of-the-art deep learning models. In future work, the model can be
improved to reduce computation time and applied to other datasets, with further tuning
of its hyperparameters.
[Fig. 3 chart: CNN Accuracy (%) and Total Parameters (Million) for each model]
